Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O
2015-03-01
Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G´ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
COMPUTER SIMULATION STUDY OF AMYLOID FIBRIL FORMATION BY PALINDROMIC SEQUENCES IN PRION PEPTIDES
Wagoner, Victoria; Cheon, Mookyung; Chang, Iksoo; Hall, Carol
2011-01-01
We simulate the aggregation of large systems containing palindromic peptides from the Syrian hamster prion protein SHaPrP 113–120 (AGAAAAGA) and the mouse prion protein MoPrP 111–120 (VAGAAAAGAV) and eight sequence variations: GAAAAAAG, (AG)4, A8, GAAAGAAA, A10, V10, GAVAAAAVAG, and VAVAAAAVAV The first two peptides are thought to act as the Velcro that holds the parent prion proteins together in amyloid structures and can form fibrils themselves. Kinetic events along the fibrillization pathway influence the types of structures that occur and variations in the sequence affect aggregation kinetics and fibrillar structure. Discontinuous molecular dynamics simulations using the PRIME20 force field are performed on systems containing 48 peptides starting from a random coil configuration. Depending on the sequence, fibrillar structures form spontaneously over a range of temperatures, below which amorphous aggregates form and above which no aggregation occurs. AGAAAAGA forms well organized fibrillar structures whereas VAGAAAAGAV forms less well organized structures that are partially fibrillar and partially amorphous. The degree of order in the fibrillar structure stems in part from the types of kinetic events leading up to its formation, with AGAAAAGA forming less amorphous structures early in the simulation than VAGAAAAGAV. The ability to form fibrils increases as the chain length and the length of the stretch of hydrophobic residues increase. However as the hydrophobicity of the sequence increases, the ability to form well-ordered structures decreases. Thus, longer hydrophobic sequences form slightly disordered aggregates that are partially fibrillar and partially amorphous. Subtle changes in sequence result in slightly different fibril structures. PMID:21557317
Single-molecule Protein Unfolding in Solid State Nanopores
Talaga, David S.; Li, Jiali
2009-01-01
We use single silicon nitride nanopores to study folded, partially folded and unfolded single proteins by measuring their excluded volumes. The DNA-calibrated translocation signals of β-lactoglobulin and histidine-containing phosphocarrier protein match quantitatively with that predicted by a simple sum of the partial volumes of the amino acids in the polypeptide segment inside the pore when translocation stalls due to the primary charge sequence. Our analysis suggests that the majority of the protein molecules were linear or looped during translocation and that the electrical forces present under physiologically relevant potentials can unfold proteins. Our results show that the nanopore translocation signals are sensitive enough to distinguish the folding state of a protein and distinguish between proteins based on the excluded volume of a local segment of the polypeptide chain that transiently stalls in the nanopore due to the primary sequence of charges. PMID:19530678
Klaassen, V A; Boeshore, M; Dolja, V V; Falk, B W
1994-07-01
Purified virions of lettuce infectious yellows virus (LIYV), a tentative member of the closterovirus group, contained two RNAs of approximately 8500 and 7300 nucleotides (RNAs 1 and 2 respectively) and a single coat protein species with M(r) of approximately 28,000. LIYV-infected plants contained multiple dsRNAs. The two largest were the correct size for the replicative forms of LIYV virion RNAs 1 and 2. To assess the relationships between LIYV RNAs 1 and 2, cDNAs corresponding to the virion RNAs were cloned. Northern blot hybridization analysis showed no detectable sequence homology between these RNAs. A partial amino acid sequence obtained from purified LIYV coat protein was found to align in the most upstream of four complete open reading frames (ORFs) identified in a LIYV RNA 2 cDNA clone. The identity of this ORF was confirmed as the LIYV coat protein gene by immunological analysis of the gene product expressed in vitro and in Escherichia coli. Computer analysis of the LIYV coat protein amino acid sequence indicated that it belongs to a large family of proteins forming filamentous capsids of RNA plant viruses. The LIYV coat protein appears to be most closely related to the coat proteins of two closteroviruses, beet yellows virus and citrus tristeza virus.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).
Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai
2014-12-01
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Cell density signal protein suitable for treatment of connective tissue injuries and defects
Schwarz, Richard I.
2002-08-13
Identification, isolation and partial sequencing of a cell density protein produced by fibroblastic cells. The cell density signal protein comprising a 14 amino acid peptide or a fragment, variant, mutant or analog thereof, the deduced cDNA sequence from the 14 amino acid peptide, a recombinant protein, protein and peptide-specific antibodies, and the use of the peptide and peptide-specific antibodies as therapeutic agents for regulation of cell differentiation and proliferation. A method for treatment and repair of connective tissue and tendon injuries, collagen deficiency, and connective tissue defects.
Xu, Qifang; Dunbrack, Roland L
2012-11-01
Automating the assignment of existing domain and protein family classifications to new sets of sequences is an important task. Current methods often miss assignments because remote relationships fail to achieve statistical significance. Some assignments are not as long as the actual domain definitions because local alignment methods often cut alignments short. Long insertions in query sequences often erroneously result in two copies of the domain assigned to the query. Divergent repeat sequences in proteins are often missed. We have developed a multilevel procedure to produce nearly complete assignments of protein families of an existing classification system to a large set of sequences. We apply this to the task of assigning Pfam domains to sequences and structures in the Protein Data Bank (PDB). We found that HHsearch alignments frequently scored more remotely related Pfams in Pfam clans higher than closely related Pfams, thus, leading to erroneous assignment at the Pfam family level. A greedy algorithm allowing for partial overlaps was, thus, applied first to sequence/HMM alignments, then HMM-HMM alignments and then structure alignments, taking care to join partial alignments split by large insertions into single-domain assignments. Additional assignment of repeat Pfams with weaker E-values was allowed after stronger assignments of the repeat HMM. Our database of assignments, presented in a database called PDBfam, contains Pfams for 99.4% of chains >50 residues. The Pfam assignment data in PDBfam are available at http://dunbrack2.fccc.edu/ProtCid/PDBfam, which can be searched by PDB codes and Pfam identifiers. They will be updated regularly.
RECOVIR Software for Identifying Viruses
NASA Technical Reports Server (NTRS)
Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui
2013-01-01
Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to identify automatically strains of partial or complete capsid sequences of picorna and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.
Partial DNA sequencing of Douglas-fir cDNAs used in RFLP mapping
K.D. Jermstad; D.L. Bassoni; C.S. Kinlaw; D.B. Neale
1998-01-01
DNA sequences from 87 Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco) cDNA RFLP probes were determined. Sequences were submitted to the GenBank dbEST database and searched for similarity against nucleotide and protein databases using the BLASTn and BLASTx programs. Twenty-one sequences (24%) were assigned putative functions; 18 of which...
2010-01-01
Background Comparative genomics methods such as phylogenetic profiling can mine powerful inferences from inherently noisy biological data sets. We introduce Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL), a method that applies the Partial Phylogenetic Profiling (PPP) approach locally within a protein sequence to discover short sequence signatures associated with functional sites. The approach is based on the basic scoring mechanism employed by PPP, namely the use of binomial distribution statistics to optimize sequence similarity cutoffs during searches of partitioned training sets. Results Here we illustrate and validate the ability of the SIMBAL method to find functionally relevant short sequence signatures by application to two well-characterized protein families. In the first example, we partitioned a family of ABC permeases using a metabolic background property (urea utilization). Thus, the TRUE set for this family comprised members whose genome of origin encoded a urea utilization system. By moving a sliding window across the sequence of a permease, and searching each subsequence in turn against the full set of partitioned proteins, the method found which local sequence signatures best correlated with the urea utilization trait. Mapping of SIMBAL "hot spots" onto crystal structures of homologous permeases reveals that the significant sites are gating determinants on the cytosolic face rather than, say, docking sites for the substrate-binding protein on the extracellular face. In the second example, we partitioned a protein methyltransferase family using gene proximity as a criterion. In this case, the TRUE set comprised those methyltransferases encoded near the gene for the substrate RF-1. SIMBAL identifies sequence regions that map onto the substrate-binding interface while ignoring regions involved in the methyltransferase reaction mechanism in general. Neither method for training set construction requires any prior experimental characterization. Conclusions SIMBAL shows that, in functionally divergent protein families, selected short sequences often significantly outperform their full-length parent sequence for making functional predictions by sequence similarity, suggesting avenues for improved functional classifiers. When combined with structural data, SIMBAL affords the ability to localize and model functional sites. PMID:20102603
Domain atrophy creates rare cases of functional partial protein domains.
Prakash, Ananth; Bateman, Alex
2015-04-30
Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain atrophy, where protein domains lose a significant number of core structural elements. Here, we implement a new pipeline to systematically identify new cases of domain atrophy across all known protein sequences. The output of this pipeline was carefully checked by hand, which filtered out partial domain instances that were unlikely to represent true domain atrophy due to misannotations or un-annotated sequence fragments. We identify 75 cases of domain atrophy, of which eight cases are found in a three-dimensional protein structure and 67 cases have been inferred based on mapping to a known homologous structure. Domains with structural variations include ancient folds such as the TIM-barrel and Rossmann folds. Most of these domains are observed to show structural loss that does not affect their functional sites. Our analysis has significantly increased the known cases of domain atrophy. We discuss specific instances of domain atrophy and see that there has often been a compensatory mechanism that helps to maintain the stability of the partial domain. Our study indicates that although domain atrophy is an extremely rare phenomenon, protein domains under certain circumstances can tolerate extreme mutations giving rise to partial, but functional, domains.
Dunbrack, Roland L.
2012-01-01
Motivation: Automating the assignment of existing domain and protein family classifications to new sets of sequences is an important task. Current methods often miss assignments because remote relationships fail to achieve statistical significance. Some assignments are not as long as the actual domain definitions because local alignment methods often cut alignments short. Long insertions in query sequences often erroneously result in two copies of the domain assigned to the query. Divergent repeat sequences in proteins are often missed. Results: We have developed a multilevel procedure to produce nearly complete assignments of protein families of an existing classification system to a large set of sequences. We apply this to the task of assigning Pfam domains to sequences and structures in the Protein Data Bank (PDB). We found that HHsearch alignments frequently scored more remotely related Pfams in Pfam clans higher than closely related Pfams, thus, leading to erroneous assignment at the Pfam family level. A greedy algorithm allowing for partial overlaps was, thus, applied first to sequence/HMM alignments, then HMM–HMM alignments and then structure alignments, taking care to join partial alignments split by large insertions into single-domain assignments. Additional assignment of repeat Pfams with weaker E-values was allowed after stronger assignments of the repeat HMM. Our database of assignments, presented in a database called PDBfam, contains Pfams for 99.4% of chains >50 residues. Availability: The Pfam assignment data in PDBfam are available at http://dunbrack2.fccc.edu/ProtCid/PDBfam, which can be searched by PDB codes and Pfam identifiers. They will be updated regularly. Contact: Roland.Dunbracks@fccc.edu PMID:22942020
Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen
2011-12-01
The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nicholas, R.A.; Suzuki, H.; Hirota, Y.
This paper reports the sequence of the active site peptide of penicillin-binding protein 1b from Escherichia coli. Purified penicillin-binding protein 1b was labeled with (/sup 14/C)penicillin G, digested with trypsin, and partially purified by gel filtration. Upon further purification by high-pressure liquid chromatography, two radioactive peaks were observed, and the major peak, representing over 75% of the applied radioactivity, was submitted to amino acid analysis and sequencing. The sequence Ser-Ile-Gly-Ser-Leu-Ala-Lys was obtained. The active site nucleophile was identified by digesting the purified peptide with aminopeptidase M and separating the radioactive products on high-pressure liquid chromatography. Amino acid analysis confirmed thatmore » the serine residue in the middle of the sequence was covalently bonded to the (/sup 14/C)penicilloyl moiety. A comparison of this sequence to active site sequences of other penicillin-binding proteins and beta-lactamases is presented.« less
Walker, J; Tait, A
1997-11-01
A reverse-transcriptase polymerase chain reaction (PCR) procedure was used to isolate an Ostertagia circumcincta partial cDNA encoding a protein with general primary sequence features characteristic of members of the mitochondrial processing peptidase (MPP) subfamily of M16 metallopeptidases. The structural relationships of the predicted protein (Oc MPPX) with MPP subfamily proteins from other species (including the model free-living nematode Caenorhabditis elegans) were examined, and Northern analysis confirmed the expression of the Oc mppx gene in adult nematodes.
Morea, Edna G O; Viviescas, Maria Alejandra; Fernandes, Carlos A H; Matioli, Fabio F; Lira, Cristina B B; Fernandez, Maribel F; Moraes, Barbara S; da Silva, Marcelo S; Storti, Camila B; Fontes, Marcos R M; Cano, Maria Isabel N
2017-11-01
Leishmania spp. telomeres are composed of 5'-TTAGGG-3' repeats associated with proteins. We have previously identified LaRbp38 and LaRPA-1 as proteins that bind the G-rich telomeric strand. At that time, we had also partially characterized a protein: DNA complex, named LaGT1, but we could not identify its protein component. Using protein-DNA interaction and competition assays, we confirmed that LaGT1 is highly specific to the G-rich telomeric single-stranded DNA. Three protein bands, with LaGT1 activity, were isolated from affinity-purified protein extracts in-gel digested, and sequenced de novo using mass spectrometry analysis. In silico analysis of the digested peptide identified them as a putative calmodulin with sequences identical to the T. cruzi calmodulin. In the Leishmania genome, the calmodulin ortholog is present in three identical copies. We cloned and sequenced one of the gene copies, named it LCalA, and obtained the recombinant protein. Multiple sequence alignment and molecular modeling showed that LCalA shares homology to most eukaryotes calmodulin. In addition, we demonstrated that LCalA is nuclear, partially co-localizes with telomeres and binds in vivo the G-rich telomeric strand. Recombinant LCalA can bind specifically and with relative affinity to the G-rich telomeric single-strand and to a 3'G-overhang, and DNA binding is calcium dependent. We have described a novel candidate component of Leishmania telomeres, LCalA, a nuclear calmodulin that binds the G-rich telomeric strand with high specificity and relative affinity, in a calcium-dependent manner. LCalA is the first reported calmodulin that binds in vivo telomeric DNA. Copyright © 2017 Elsevier B.V. All rights reserved.
The mitochondrial genome of the Arizona Snowfly Mesocapnia arizonensis (Plecoptera, Capniidae).
Elbrecht, Vasco; Leese, Florian
2016-09-01
We assembled the mitochondrial genome of the capniid stonefly Mesocapnia arizonensis (Baumann & Gaufin, 1969) using Illumina HiSeq sequence data. The recovered mitogenome is 14,921 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes and 22 transfer RNA genes. The control region could only be assembled partially. Gene order resembles that of basal arthropods. This is the first partial mitogenome sequence for the stonefly superfamily group Euholognatha and will be useful in future phylogenetic analyses.
Liu, Zhong-Yuan; Wang, Yun; Lü, Guo-Dong; Wang, Xian-Lei; Zhang, Fu-Chun; Ma, Ji
2006-12-01
The partial cDNA sequence coding for the antifreeze proteins in the Tenebrio molitor was obtained by RT-PCR. Sequence analysis revealed nine putative cDNAs with a high degree of homology to Tenebrio molitor antifreeze proteins. The recombinant pGEX-4T-1-tmafp-XJ430 was introduced into E. coli BL21 to induce a GST fusion protein by IPTG. SDS-PAGE of the fusion protein demonstrated that the antifreeze protein migrated at a size of 38 kDa. The immunization was performed by intra-muscular injection of pCDNA3-tmafp-XJ430, and then antiserum was detected by ELISA. The titer of the antibody was 1:2,000. Western blotting analysis showed the antiserum was specific against the antifreeze protein. This finding could lead to further investigation of the properties and function of antifreeze proteins.
NASA Astrophysics Data System (ADS)
Goubin, Gerard; Goldman, Debra S.; Luce, Judith; Neiman, Paul E.; Cooper, Geoffrey M.
1983-03-01
A transforming gene detected by transfection of chicken B-cell lymphoma DNA has been isolated by molecular cloning. It is homologous to a conserved family of sequences present in normal chicken and human DNAs but is not related to transforming genes of acutely transforming retroviruses. The nucleotide sequence of the cloned transforming gene suggests that it encodes a protein that is partially homologous to the amino terminus of transferrin and related proteins although only about one tenth the size of transferrin.
Häckel, M; Hinz, H J; Hedwig, G R
1999-11-15
The partial molar volumes of tripeptides of sequence glycyl-X-glycine, where X is one of the amino acids alanine, leucine, threonine, glutamine, phenylalanine, histidine, cysteine, proline, glutamic acid, and arginine, have been determined in aqueous solution over the temperature range 10-90 degrees C using differential scanning densitometry . These data, together with those reported previously, have been used to derive the partial molar volumes of the side-chains of all 20 amino acids. The side-chain volumes are critically compared with literature values derived using partial molar volumes for alternative model compounds. The new amino acid side-chain volumes, along with that for the backbone glycyl group, were used to calculate the partial specific volumes of several proteins in aqueous solution. The results obtained are compared with those observed experimentally. The new side-chain volumes have also been used to re-determine residue volume changes upon protein folding.
Wang, Jichao; Zhang, Tongchuan; Liu, Ruicun; Song, Meilin; Wang, Juncheng; Hong, Jiong; Chen, Quan; Liu, Haiyan
2017-02-01
An interesting way of generating novel artificial proteins is to combine sequence motifs from natural proteins, mimicking the evolutionary path suggested by natural proteins comprising recurring motifs. We analyzed the βα and αβ modules of TIM barrel proteins by structure alignment-based sequence clustering. A number of preferred motifs were identified. A chimeric TIM was designed by using recurring elements as mutually compatible interfaces. The foldability of the designed TIM protein was then significantly improved by six rounds of directed evolution. The melting temperature has been improved by more than 20°C. A variety of characteristics suggested that the resulting protein is well-folded. Our analysis provided a library of peptide motifs that is potentially useful for different protein engineering studies. The protein engineering strategy of using recurring motifs as interfaces to connect partial natural proteins may be applied to other protein folds. Copyright © 2016 Elsevier B.V. All rights reserved.
Harper, J R; Prince, J T; Healy, P A; Stuart, J K; Nauman, S J; Stallcup, W B
1991-03-01
We have isolated cDNA clones coding for the human homologue of the neuronal cell adhesion molecule L1. The nucleotide sequence of the cDNA clones and the deduced primary amino acid sequence of the carboxy terminal portion of the human L1 are homologous to the corresponding sequences of mouse L1 and rat NILE glycoprotein, with an especially high sequences identity in the cytoplasmic regions of the proteins. There is also protein sequence homology with the cytoplasmic region of the Drosophila cell adhesion molecule, neuroglian. The conservation of the cytoplasmic domain argues for an important functional role for this portion of the molecule.
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.
Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S
1985-07-01
The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
Aspergillus section Versicolores: nine new species and multilocus DNA sequence based phylogeny
USDA-ARS?s Scientific Manuscript database
ß-tubulin, calmodulin, internal transcribed spacer and partial lsu-rDNA, RNA polymerase, DNA replication licensing factor Mcm7, and pre-rRNA processing protein Tsr1 were amplified and sequenced from 62 A. versicolor clade isolates and analyzed phylogenetically using the concordance model to establis...
Aspergillus section Versicolores, nine new species and multilocus DNA sequence based phylogeny
USDA-ARS?s Scientific Manuscript database
ß-tubulin, calmodulin, internal transcribed spacer and partial lsu-rDNA, RNA polymerase, DNA replication licensing factor Mcm7, and pre-rRNA processing protein Tsr1 were amplified and sequenced from 62 A. versicolor clade isolates and analyzed phylogenetically using the concordance model to establis...
Nature of the protein universe
Levitt, Michael
2009-01-01
The protein universe is the set of all proteins of all organisms. Here, all currently known sequences are analyzed in terms of families that have single-domain or multidomain architectures and whether they have a known three-dimensional structure. Growth of new single-domain families is very slow: Almost all growth comes from new multidomain architectures that are combinations of domains characterized by ≈15,000 sequence profiles. Single-domain families are mostly shared by the major groups of organisms, whereas multidomain architectures are specific and account for species diversity. There are known structures for a quarter of the single-domain families, and >70% of all sequences can be partially modeled thanks to their membership in these families. PMID:19541617
Wang, Penghao; Wilson, Susan R
2013-01-01
Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
Shark complement: an assessment.
Smith, S L
1998-12-01
The classical (CCP) and alternative (ACP) pathways of complement activation have been established for the nurse shark (Ginglymostoma cirratum). The isolation of a cDNA clone encoding a mannan-binding protein-associated serine protease (MASP)-1-like protein from the Japanese dogfish (Triakis scyllia) suggests the presence of a lectin pathway. The CCP consists of six functionally distinct components: C1n, C2n, C3n, C4n, C8n and C9n, and is activated by immune complexes in the presence of Ca++ and Mg++ ions. The ACP is antibody independent, requiring Mg++ ions and a heat-labile 90 kDa factor B-like protein for activity. Proteins considered homologues of C1q, C3 and C4 (C2n) of the mammalian complement system have been isolated from nurse shark serum. Shark C1q is composed of at least two chain types each showing 50% identity to human C1q chains A and B. Partial sequence of the globular domain of one of the chains shows it to be C1q-like rather than like mannan-binding protein. N-terminal amino acid sequences of the alpha and beta chain of shark C3 and C4 molecules show significant identity with corresponding human C3 and C4 chains. A sequence representing shark C4 gamma chain, shows little similarity to human C4 gamma chain. The terminal shark components C8n and C9n are functional analogues of mammalian C8 and C9. Anaphylatoxin activity has been demonstrated in activated shark serum, and porcine C5a desArg induces shark leucocyte chemotaxis. The deduced amino acid sequence of a partial C3 cDNA clone from the nurse shark shows 50%, 30% and 24% homology with the corresponding region of mammalian C3, C4 and alpha 2-macroglobulin. Deduced amino acid sequence data from partial Bf/C2 cDNA clones, two from the nurse shark and one from the Japanese dogfish, suggest that at least one species of elasmobranch has two distinct Bf/C2 genes.
Corfield, M. C.; Fletcher, J. C.; Robson, A.
1967-01-01
1. A tryptic digest of the protein fraction U.S.3 from oxidized wool has been separated into 32 peptide fractions by cation-exchange resin chromatography. 2. Most of these fractions have been resolved into their component peptides by a combination of the techniques of cation-exchange resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid compositions of 58 of the peptides in the digest present in the largest amounts have been determined. 4. The amino acid sequences of 38 of these have been completely elucidated and those of six others partially derived. 5. These findings indicate that the parent protein in wool from which the protein fraction U.S.3 is derived has a minimum molecular weight of 74000. 6. The structures of wool proteins are discussed in the light of the peptide sequences determined, and, in particular, of those sequences in fraction U.S.3 that could not be elucidated. PMID:16742497
Distribution, genetic diversity and recombination analysis of Citrus tristeza virus of India
USDA-ARS?s Scientific Manuscript database
Citrus tristeza virus (CTV) isolates representing all the citrus growing geographical zones of India were analyzed for sequence of the 5'ORF1a fragments of the partial LProI domain and for the coat protein (CP) gene. The sequences were compared with previously reported Indian and CTV genotypes from...
Genetic characterization of L-Zagreb mumps vaccine strain.
Ivancic, Jelena; Gulija, Tanja Kosutic; Forcic, Dubravko; Baricevic, Marijana; Jug, Renata; Mesko-Prejac, Majda; Mazuran, Renata
2005-04-01
Eleven mumps vaccine strains, all containing live attenuated virus, have been used throughout the world. Although L-Zagreb mumps vaccine has been licensed since 1972, only its partial nucleotide sequence was previously determined (accession numbers , and ). Therefore, we sequenced the entire genome of L-Zagreb vaccine strain (Institute of Immunology Inc., Zagreb, Croatia). In order to investigate the genetic stability of the vaccine, sequences of both L-Zagreb master seed and currently produced vaccine batch were determined and no difference between them was observed. A phylogenetic analysis based on SH gene sequence has shown that L-Zagreb strain does not belong to any of established mumps genotypes and that it is most similar to old, laboratory preserved European strains (1950s-1970s). L-Zagreb nucleotide and deduced protein sequences were compared with other mumps virus sequences obtained from the GenBank. Emphasis was put on functionally important protein regions and known antigenic epitopes. The extensive comparisons of nucleotide and deduced protein sequences between L-Zagreb vaccine strain and other previously determined mumps virus sequences have shown that while the functional regions of HN, V, and L proteins are well conserved among various mumps strains, there can be a substantial amino acid difference in antigenic epitopes of all proteins and in functional regions of F protein. No molecular pattern was identified that can be used as a distinction marker between virulent and attenuated strains.
Kim, Young-Ha; slam, Mohammad Saiful; You, Myung-Jo
2015-01-01
Proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. For detection of antigens from Haemaphysalis longicornis, 1-dimensional electrophoresis (1-DE) quantitative immunoblotting technique combined with 2-dimensional electrophoresis (2-DE) immunoblotting was used for whole body proteins from unfed and partially fed female ticks. Reactivity bands and 2-DE immunoblotting were performed following 2-DE electrophoresis to identify protein spots. The proteome of the partially fed female had a larger number of lower molecular weight proteins than that of the unfed female tick. The total number of detected spots was 818 for unfed and 670 for partially fed female ticks. The 2-DE immunoblotting identified 10 antigenic spots from unfed females and 8 antigenic spots from partially fed females. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry (MALDI-TOF) of relevant spots identified calreticulin, putative secreted WC salivary protein, and a conserved hypothetical protein from the National Center for Biotechnology Information and Swiss Prot protein sequence databases. These findings indicate that most of the whole body components of these ticks are non-immunogenic. The data reported here will provide guidance in the identification of antigenic proteins to prevent infestation and diseases transmitted by H. longicornis. PMID:25748713
Poliovirus replication proteins: RNA sequence encoding P3-1b and the sites of proteolytic processing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Semler, B.L.; Anderson, C.W.; Kitamura, N.
1981-06-01
A partial amino-terminal amino acid sequence of each of the major proteins encoded by the replicase region of the poliovirus genome has been determined. A comparison of this sequence information with the amino acid sequence predicted from the RNA sequence that has been determined for the 3' region of the poliovirus genome has allowed us to locate precisely the proteolytic cleavage sites at which the initial polyprotein is processed to create the poliovirus products P3-1b (NCVP1b), P3-2 (NCVP2), P3-4b (NCVP4b), and P3-7c (NCVP7c). For each of these products, as well as for the small genome-linked protein VPg, proteolytic cleavage occursmore » between a glutamine and a glycine residue to create the amino terminus of each protein. This result suggests that a single proteinase may be responsible for all of these cleavages. The sequence data also allow the precise positioning of the genome-linked protein VPg within the precursor P3-1b just proximal to the amino terminus of polypeptide P3-2.« less
Bozzoni, I; Beccari, E; Luo, Z X; Amaldi, F
1981-01-01
Poly-A+ mRNA from Xenopus laevis oocytes, partially enriched for r-protein coding capacity has been used as starting material for preparing a cDNA bank in plasmid pBR322. The clones containing sequences specific for r-proteins have been selected by translation of the complementary mRNAs. Clones for six different r-proteins have been identified and utilized as probes for studying their genomic organization. Two gene copies per haploid genome were found for r-proteins L1, L14, S19, and four-five for protein S1, S8 and L32. Moreover a population polymorphism has been observed for the genomic regions containing sequences for r-protein S1, S8 and L14. Images PMID:6112733
The FOXP2 forkhead domain binds to a variety of DNA sequences with different rates and affinities.
Webb, Helen; Steeb, Olga; Blane, Ashleigh; Rotherham, Lia; Aron, Shaun; Machanick, Philip; Dirr, Heini; Fanucchi, Sylvia
2017-07-01
FOXP2 is a member of the P subfamily of FOX transcription factors, the DNA-binding domain of which is the winged helix forkhead domain (FHD). In this work we show that the FOXP2 FHD is able to bind to various DNA sequences, including a novel sequence identified in this work, with different affinities and rates as detected using surface plasmon resonance. Combining the experimental work with molecular docking, we show that high-affinity sequences remain bound to the protein for longer, form a greater number of interactions with the protein and induce a greater structural change in the protein than low-affinity sequences. We propose a binding model for the FOXP2 FHD that involves three types of binding sequence: low affinity sites which allow for rapid scanning of the genome by the protein in a partially unstructured state; moderate affinity sites which serve to locate the protein near target sites and high-affinity sites which secure the protein to the DNA and induce a conformational change necessary for functional binding and the possible initiation of downstream transcriptional events. © The Authors 2017. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Molecular cloning, sequencing, and expression of Eimeria tenella HSP70 partial gene.
Bogado, A L G; Martins, G F; Sasse, J P; Guimarães, J da S; Garcia, J L
2017-03-15
Members of the Eimeria genus are protozoan parasites of the subphylum Apicomplexa (Eimeriidae family), and belong to the coccidia group. Eimeria tenella is one of the most pathogenic species owing to its ability to penetrate the mucosa, and cause inflammation and damage. It is an obligate intracellular parasite that causes disease by destroying the host cells during multiplication. Heat shock protein 70 (HSP70) is a molecular chaperone that prevents cellular stress. The objective of this study was to clone, sequence, and express E. tenella HSP70 protein. After selecting the region of highest hydrophilicity in the hsp70 gene, we cloned complementary DNA (cDNA) into a pTrcHis2-TOPO vector and transformed it into TOP10 Escherichia coli cells; after induction, the bacteria expressed a 23-kDa protein with insoluble expression levels of approximately 5 mg/L. In summary, the partial hsp70 gene was successfully expressed in E. coli, producing a 23-kDa protein under insoluble conditions, and the antigen characteristics predicted by hydrophilicity analysis suggest the development of a vaccine for use in avian coccidiosis.
Zhou, Jinlin; Gong, Haiyan; Zhou, Yongzhi; Xuan, Xuenan; Fujisaki, Kozo
2006-12-01
A cDNA coding a glycine-rich protein was identified from the Rhipicephalus haemaphysaloides tick. The cDNA named here as RH50 was 1,823 bp, including a single open reading frame (ORF) of 1,518 nucleotides. The ORF encodes a polypeptide of 506 amino acid residues with a size of 50 kDa, as calculated by a computer. The predicted amino acid sequence of RH50 showed a low homology to sequences of some known extracellular matrix-like proteins. The native protein was identified in both the fed tick salivary gland lysates and extracts of cement material using the serum against the recombinant protein. Reverse transcription polymerase chain reaction results showed that RH50 mRNA was only transcribed in partially fed tick salivary glands, not in unfed tick salivary glands or partially fed tick midgut, fat body, or ovary. The differential expression of RH50 protein in fed tick salivary glands was confirmed by immunofluorescence. The low attachment rate both in the adult and nymphal tick, and the high mortality of immature ticks (nymph) feeding on recombinant RH50-immunized rabbits were found. These results show that the RH50 protein could be a useful candidate for anti-tick vaccine development.
Cleavage sites within the poliovirus capsid protein precursors
DOE Office of Scientific and Technical Information (OSTI.GOV)
Larsen, G.R.; Anderson, C.W.; Dorner, A.J.
1982-01-01
Partial amino-terminal sequence analysis was performed on radiolabeled poliovirus capsid proteins VP1, VP2, and VP3. A computer-assisted comparison of the amino acid sequences obtained with that predicted by the nucleotide sequence of the poliovirus genome allows assignment of the amino terminus of each capsid protein to a unique position within the virus polyprotein. Sequence analysis of trypsin-digested VP4, which has a blocked amino terminus, demonstrates that VP4 is encoded at or very near to the amino terminus of the polyprotein. The gene order of the capsid proteins is VP4-VP2-VP3-VP1. Cleavage of VP0 to VP4 and VP2 is shown to occurmore » between asparagine and serine, whereas the cleavages that separate VP2/VP3 and VP3/VP1 occur between glutamine and glycine residues. This finding supports the hypothesis that the cleavage of VP0, which occurs during virion morphogenesis, is distinct from the cleavages that separate functional regions of the polyprotein.« less
Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH.
Volk, Jochen; Herrmann, Torsten; Wüthrich, Kurt
2008-07-01
MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness.
Mapping the miRNA interactome by crosslinking ligation and sequencing of hybrids (CLASH)
Helwak, Aleksandra; Tollervey, David
2014-01-01
RNA-RNA interactions play critical roles in many cellular processes but studying them is difficult and laborious. Here, we describe an experimental procedure, termed crosslinking ligation and sequencing of hybrids (CLASH), which allows high-throughput identification of sites of RNA-RNA interaction. During CLASH, a tagged bait protein is UV crosslinked in vivo to stabilise RNA interactions and purified under denaturing conditions. RNAs associated with the bait protein are partially truncated, and the ends of RNA-duplexes are ligated together. Following linker addition, cDNA library preparation and high-throughput sequencing, the ligated duplexes give rise to chimeric cDNAs, which unambiguously identify RNA-RNA interaction sites independent of bioinformatic predictions. This protocol is optimized for studying miRNA targets bound by Argonaute proteins, but should be easily adapted for other RNA-binding proteins and classes of RNA. The protocol requires around 5 days to complete, excluding the time required for high-throughput sequencing and bioinformatic analyses. PMID:24577361
Calcium-binding protein from mouse Ehrlich ascites-tumour cells is homologous to human calcyclin.
Kuźnicki, J; Filipek, A; Hunziker, P E; Huber, S; Heizmann, C W
1989-01-01
A Ca2+-binding protein was purified from mouse Ehrlich ascites-tumour cells. The protein forms monomers and disulphide-linked dimers, which can be separated by reverse-phase h.p.l.c. A partial amino acid sequence analysis demonstrated that the protein has an EF-hand structure. A striking homology was found to rat and human calcyclin (a member of the S-100 protein family), which is possibly involved in cell-cycle regulation. Images Fig. 1. Fig. 2. PMID:2597136
Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang
2011-06-01
Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.
Cloud, Joann L; Conville, Patricia S; Croft, Ann; Harmsen, Dag; Witebsky, Frank G; Carroll, Karen C
2004-02-01
Identification of clinically significant nocardiae to the species level is important in patient diagnosis and treatment. A study was performed to evaluate Nocardia species identification obtained by partial 16S ribosomal DNA (rDNA) sequencing by the MicroSeq 500 system with an expanded database. The expanded portion of the database was developed from partial 5' 16S rDNA sequences derived from 28 reference strains (from the American Type Culture Collection and the Japanese Collection of Microorganisms). The expanded MicroSeq 500 system was compared to (i). conventional identification obtained from a combination of growth characteristics with biochemical and drug susceptibility tests; (ii). molecular techniques involving restriction enzyme analysis (REA) of portions of the 16S rRNA and 65-kDa heat shock protein genes; and (iii). when necessary, sequencing of a 999-bp fragment of the 16S rRNA gene. An unknown isolate was identified as a particular species if the sequence obtained by partial 16S rDNA sequencing by the expanded MicroSeq 500 system was 99.0% similar to that of the reference strain. Ninety-four nocardiae representing 10 separate species were isolated from patient specimens and examined by using the three different methods. Sequencing of partial 16S rDNA by the expanded MicroSeq 500 system resulted in only 72% agreement with conventional methods for species identification and 90% agreement with the alternative molecular methods. Molecular methods for identification of Nocardia species provide more accurate and rapid results than the conventional methods using biochemical and susceptibility testing. With an expanded database, the MicroSeq 500 system for partial 16S rDNA was able to correctly identify the human pathogens N. brasiliensis, N. cyriacigeorgica, N. farcinica, N. nova, N. otitidiscaviarum, and N. veterana.
Retroviral DNA Integration Directed by HIV Integration Protein in Vitro
NASA Astrophysics Data System (ADS)
Bushman, Frederic D.; Fujiwara, Tamio; Craigie, Robert
1990-09-01
Efficient retroviral growth requires integration of a DNA copy of the viral RNA genome into a chromosome of the host. As a first step in analyzing the mechanism of integration of human immunodeficiency virus (HIV) DNA, a cell-free system was established that models the integration reaction. The in vitro system depends on the HIV integration (IN) protein, which was partially purified from insect cells engineered to express IN protein in large quantities. Integration was detected in a biological assay that scores the insertion of a linear DNA containing HIV terminal sequences into a λ DNA target. Some integration products generated in this assay contained five-base pair duplications of the target DNA at the recombination junctions, a characteristic of HIV integration in vivo; the remaining products contained aberrant junctional sequences that may have been produced in a variation of the normal reaction. These results indicate that HIV IN protein is the only viral protein required to insert model HIV DNA sequences into a target DNA in vitro.
Sequence analysis and expression of the M1 and M2 matrix protein genes of hirame rhabdovirus (HIRRV)
Nishizawa, T.; Kurath, G.; Winton, J.R.
1997-01-01
We have cloned and sequenced a 2318 nucleotide region of the genomic RNA of hirame rhabdovirus (HIRRV), an important viral pathogen of Japanese flounder Paralichthys olivaceus. This region comprises approximately two-thirds of the 3' end of the nucleocapsid protein (N) gene and the complete matrix protein (M1 and M2) genes with the associated intergenic regions. The partial N gene sequence was 812 nucleotides in length with an open reading frame (ORF) that encoded the carboxyl-terminal 250 amino acids of the N protein. The M1 and M2 genes were 771 and 700 nucleotides in length, respectively, with ORFs encoding proteins of 227 and 193 amino acids. The M1 gene sequence contained an additional small ORF that could encode a highly basic, arginine-rich protein of 25 amino acids. Comparisons of the N, M1, and M2 gene sequences of HIRRV with the corresponding sequences of the fish rhabdoviruses, infectious hematopoietic necrosis virus (IHNV) or viral hemorrhagic septicemia virus (VHSV) indicated that HIRRV was more closely related to IHNV than to VHSV, but was clearly distinct from either. The putative consensus gene termination sequence for IHNV and VHSV, AGAYAG(A)(7), was present in the N-M1, M1-M2, and M2-G intergenic regions of HIRRV as were the putative transcription initiation sequences YGGCAC and AACA. An Escherichia coli expression system was used to produce recombinant proteins from the M1 and M2 genes of HIRRV. These were the same size as the authentic M1 and M2 proteins and reacted with anti-HIRRV rabbit serum in western blots. These reagents can be used for further study of the fish immune response and to test novel control methods.
Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R
1992-01-01
The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. Images PMID:1569011
Koloniuk, Igor; Fránová, Jana; Sarkisova, Tatiana; Přibylová, Jaroslava
2018-05-04
Strawberry crinkle disease is one of the major diseases that threatens strawberry production. Although the biological properties of the agent, strawberry crinkle virus (SCV), have been thoroughly investigated, its complete genome sequence has never been published. Existing RT-PCR-based detection relies on a partial sequence of the L protein gene, presumably the least expressed viral gene. Here, we present complete sequences of two divergent SCV isolates co-infecting a single plant, Fragaria x ananassa cv. Čačanská raná.
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data
Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa
2017-01-01
Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546
Lieutaud, Philippe; Uversky, Alexey V.; Uversky, Vladimir N.; Longhi, Sonia
2016-01-01
ABSTRACT In the last 2 decades it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins lack a stable 3D structure, are ubiquitous and fulfill essential biological functions. Their conformational heterogeneity is encoded in their amino acid sequences, thereby allowing intrinsically disordered proteins or regions to be recognized based on properties of these sequences. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to structural determination with X-ray crystallization. This article discusses a comprehensive selection of databases and methods currently employed to disseminate experimental and putative annotations of disorder, predict disorder and identify regions involved in induced folding. It also provides a set of detailed instructions that should be followed to perform computational analysis of disorder. PMID:28232901
Partial Gene Cloning and Enzyme Structure Modeling of Exolevanase Fragment from Bacillus subtilis
NASA Astrophysics Data System (ADS)
Azhar, M.; Natalia, D.; Syukur, S.; Andriani, N.; Jamsari, J.
2018-04-01
Inulin hydrolysis thermophilic and thermotolerant bacteria are potential sources of inulin hydrolysis enzymes. Partial gene that encodes inulin hydrolysis enzymes had been isolated from Bacillus subtilis using polymerase chain reaction (PCR) method with the DPE.slFandDPE.eR degenerative primers. The partial gene was cloned into pGEM-T Easy vector with E. coli as host cells and analyzed using BLASTx, CrustalW2, and Phyre2 programs. Size of thepartial gene had been found539 bp that encoded 179aminoacid residues of protein fragment. The sequences of protein fragment was more similar to exolevanase than exoinulinase. The protein fragment had conserved motif FSGS, and specific hits GH32 β-fructosidase. It had three residues of active site and five residues of substrate binding. The active site on the protein fragment were D (1-WLNDP-5), D (125-FRDPK-129) and E (177-WEC-179). Substrate binding on the protein fragment were ND (1-WLNDP-5), Q (18-FYQY-21), FS (60-FSGS-63) RD (125-FRDPK-129) and E (177-WEC-179).
Enterocin T, a novel class IIa bacteriocin produced by Enterococcus sp. 812.
Chen, Yi-Sheng; Yu, Chi-Rong; Ji, Si-Hua; Liou, Min-Shiuan; Leong, Kun-Hon; Pan, Shwu-Fen; Wu, Hui-Chung; Lin, Yu-Hsuan; Yu, Bi; Yanagida, Fujitoshi
2013-09-01
Enterococcus sp. 812, isolated from fresh broccoli, was previously found to produce a bacteriocin active against a number of Gram-positive bacteria, including Listeria monocytogenes. Bacteriocin activity decreased slightly after autoclaving (121 °C for 15 min), but was inactivated by protease K. Mass spectrometry analysis revealed the bacteriocin mass to be approximately 4,521.34 Da. N-terminal amino acid sequencing yielded a partial sequence, NH2-ATYYGNGVYXDKKKXWVEWGQA, by Edman degradation, which contained the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The obtained partial sequence showed high homology with some enterococcal bacteriocins; however, no identical peptide or protein was found. This peptide was therefore considered to be a novel bacteriocin produced by Enterococcus sp. 812 and was termed enterocin T.
Merchant, Mark; Kinney, Clint; Sanders, Paige
2009-12-01
Blood was collected from three juvenile alligators (Alligator mississippiensis) before, and again 24h after, injection with bacterial lipopolysaccharide (LPS). The leukocytes were collected from both samples, and the proteins were extracted. Each group of proteins was labeled with a different fluorescent dye and the differences in protein expression were analyzed by two dimensional differential in-gel expressions (2D-DIGE). The proteins which appeared to be increased or decreased by treatment with LPS were selected and analyzed by MALDI-TOF to determine mass and LC-MS/MS to acquire the partial protein sequences. The peptide sequences were compared to the NCBI protein sequence database to determine homology with other sequences from other species. Several proteins of interest appeared to be increased upon LPS stimulation. Proteins with homology to human transgelin-2, fish glucose-6-phosphate dehydrogenase, amphibian α-enolase, alligator lactate dehydrogenase, fish ubiquitin-activating enzyme, and fungal β-tubulin were also increased after LPS injection. Proteins with homology to fish vimentin 4, murine heterogeneous nuclear ribonucleoprotein A3, and avian calreticulin were found to be decreased in response to LPS. In addition, five proteins, four of which were up-regulated (827, 560, 512, and 650%) and one that exhibited repressed expression (307%), did not show homology to any protein in the database, and thus may represent newly discovered proteins. We are using this biochemical approach to isolate and characterize alligator proteins with potential relevant immune function.
Amino acid sequence of the smaller basic protein from rat brain myelin
Dunkley, Peter R.; Carnegie, Patrick R.
1974-01-01
1. The complete amino acid sequence of the smaller basic protein from rat brain myelin was determined. This protein differs from myelin basic proteins of other species in having a deletion of a polypeptide of 40 amino acid residues from the centre of the molecule. 2. A detailed comparison is made of the constant and variable regions in a group of myelin basic proteins from six species. 3. An arginine residue in the rat protein was found to be partially methylated. The ratio of methylated to unmethylated arginine at this position differed from that found for the human basic protein. 4. Three tryptic peptides were isolated in more than one form. The differences between the two forms of each peptide are discussed in relation to the electrophoretic heterogeneity of myelin basic proteins, which is known to occur at alkaline pH values. 5. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50029 at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1973) 131, 5. PMID:4141893
Modahl, Cassandra M.; Mackessy, Stephen P.
2016-01-01
Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
Wei, Dong; Zhang, Keyi; Ngo, Huu Hao; Guo, Wenshan; Wang, Siyu; Li, Jibin; Han, Fei; Du, Bin; Wei, Qin
2017-04-01
In present study, the feasibility of partial nitrification (PN) process achievement and its greenhouse gas emission were evaluated in a sequencing batch biofilm reactor (SBBR). After 90days' operation, the average effluent NH 4 + -N removal efficiency and nitrite accumulation rate of PN-SBBR were high of 98.2% and 87.6%, respectively. Both polysaccharide and protein contents were reduced in loosely bound extracellular polymeric substances (LB-EPS) and tightly bound EPS (TB-EPS) during the achievement of PN-biofilm. Excitation-emission matrix spectra implied that aromatic protein-like, tryptophan protein-like and humic acid-like substances were the main compositions of both kinds of EPS in seed sludge and PN-biofilm. According to typical cycle, the emission rate of CO 2 had a much higher value than that of N 2 O, and their total amounts per cycle were 67.7 and 16.5mg, respectively. Free ammonia (FA) played a significant role on the inhibition activity of nitrite-oxidizing bacteria and the occurrence of nitrite accumulation. Copyright © 2017 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mishra, N.C.
1996-10-01
Neurospora has the capability to solubilize coal and the protein fraction accounting for this ability has been isolated. During this period the cola solubilizing activity (CSA) was fractionated and partially sequenced. The activity has been determined to be a tyrosinase and/or a phenol oxidase. The amino acid sequence of the protein was used to prepare oligonucleotides to identify the clone carrying Neurospora CSA. It is intended to clone the Neurospora gene into yeast, since yeast cannot solubilize coal, to further characterize the CSA.
GWFASTA: server for FASTA search in eukaryotic and microbial genomes.
Issac, Biju; Raghava, G P S
2002-09-01
Similarity searches are a powerful method for solving important biological problems such as database scanning, evolutionary studies, gene prediction, and protein structure prediction. FASTA is a widely used sequence comparison tool for rapid database scanning. Here we describe the GWFASTA server that was developed to assist the FASTA user in similarity searches against partially and/or completely sequenced genomes. GWFASTA consists of more than 60 microbial genomes, eight eukaryote genomes, and proteomes of annotatedgenomes. Infact, it provides the maximum number of databases for similarity searching from a single platform. GWFASTA allows the submission of more than one sequence as a single query for a FASTA search. It also provides integrated post-processing of FASTA output, including compositional analysis of proteins, multiple sequences alignment, and phylogenetic analysis. Furthermore, it summarizes the search results organism-wise for prokaryotes and chromosome-wise for eukaryotes. Thus, the integration of different tools for sequence analyses makes GWFASTA a powerful toolfor biologists.
Analysis of Ribosome Inactivating Protein (RIP): A Bioinformatics Approach
NASA Astrophysics Data System (ADS)
Jothi, G. Edward Gnana; Majilla, G. Sahaya Jose; Subhashini, D.; Deivasigamani, B.
2012-10-01
In spite of the medical advances in recent years, the world is in need of different sources to encounter certain health issues.Ribosome Inactivating Proteins (RIPs) were found to be one among them. In order to get easy access about RIPs, there is a need to analyse RIPs towards constructing a database on RIPs. Also, multiple sequence alignment was done towards screening for homologues of significant RIPs from rare sources against RIPs from easily available sources in terms of similarity. Protein sequences were retrieved from SWISS-PROT and are further analysed using pair wise and multiple sequence alignment.Analysis shows that, 151 RIPs have been characterized to date. Amongst them, there are 87 type I, 37 type II, 1 type III and 25 unknown RIPs. The sequence length information of various RIPs about the availability of full or partial sequence was also found. The multiple sequence alignment of 37 type I RIP using the online server Multalin, indicates the presence of 20 conserved residues. Pairwise alignment and multiple sequence alignment of certain selected RIPs in two groups namely Group I and Group II were carried out and the consensus level was found to be 98%, 98% and 90% respectively.
Fourment, Mathieu; Gibbs, Mark J
2008-02-05
Viruses of the Bunyaviridae have segmented negative-stranded RNA genomes and several of them cause significant disease. Many partial sequences have been obtained from the segments so that GenBank searches give complex results. Sequence databases usually use HTML pages to mediate remote sorting, but this approach can be limiting and may discourage a user from exploring a database. The VirusBanker database contains Bunyaviridae sequences and alignments and is presented as two spreadsheets generated by a Java program that interacts with a MySQL database on a server. Sequences are displayed in rows and may be sorted using information that is displayed in columns and includes data relating to the segment, gene, protein, species, strain, sequence length, terminal sequence and date and country of isolation. Bunyaviridae sequences and alignments may be downloaded from the second spreadsheet with titles defined by the user from the columns, or viewed when passed directly to the sequence editor, Jalview. VirusBanker allows large datasets of aligned nucleotide and protein sequences from the Bunyaviridae to be compiled and winnowed rapidly using criteria that are formulated heuristically.
Marciniak, R A; Garcia-Blanco, M A; Sharp, P A
1990-01-01
Human immunodeficiency virus type 1 RNAs contain a sequence, trans-activation-response (TAR) element, which is required for tat protein-mediated trans-activation of viral gene expression. We have identified a nuclear protein from extracts of HeLa cells that binds to the TAR element RNA in a sequence-specific manner. The binding of this 68-kDa polypeptide was detected by UV cross-linking proteins to TAR element RNA transcribed in vitro. Competition experiments were performed by using a partially purified preparation of the protein to quantify the relative binding affinities of TAR element RNA mutants. The binding affinity of the TAR mutants paralleled the reported ability of those mutants to support tat trans-activation in vivo. We propose that this cellular protein moderates TAR activity in vivo. Images PMID:2333305
Investigation of SnSPR1, a novel and abundant surface protein of Sarcocystis neurona merozoites.
Zhang, Deqing; Howe, Daniel K
2008-04-15
An expressed sequence tag (EST) sequencing project has produced over 15,000 partial cDNA sequences from the equine pathogen Sarcocystis neurona. While many of the sequences are clear homologues of previously characterized genes, a significant number of the S. neurona ESTs do not exhibit similarity to anything in the extensive sequence databases that have been generated. In an effort to characterize parasite proteins that are novel to S. neurona, a seemingly unique gene was selected for further investigation based on its abundant representation in the collection of ESTs and the predicted presence of a signal peptide and glycolipid anchor addition on the encoded protein. The gene was expressed in E. coli, and monospecific polyclonal antiserum against the recombinant protein was produced by immunization of a rabbit. Characterization of the native protein in S. neurona merozoites and schizonts revealed that it is a low molecular weight surface protein that is expressed throughout intracellular development of the parasite. The protein was designated Surface Protein 1 (SPR1) to reflect its display on the outer surface of merozoites and to distinguish it from the ubiquitous SAG/SRS surface antigens of the heteroxenous Coccidia. Interestingly, infection assays in the presence of the polyclonal antiserum suggested that SnSPR1 plays some role in attachment and/or invasion of host cells by S. neurona merozoites. The work described herein represents a general template for selecting and characterizing the various unidentified gene sequences that are plentiful in the EST databases for S. neurona and other apicomplexans. Furthermore, this study illustrates the value of investigating these novel sequences since it can offer new candidates for diagnostic or vaccine development while also providing greater insight into the biology of these parasites.
Steiner, G; Hartmuth, K; Skriner, K; Maurer-Fogy, I; Sinski, A; Thalmann, E; Hassfeld, W; Barta, A; Smolen, J S
1992-01-01
RA33 is a nuclear autoantigen with an apparent molecular mass of 33 kD. Autoantibodies against RA33 are found in about 30% of sera from RA patients, but only occasionally in sera from patients with other connective tissue diseases. To characterize RA33, the antigen was purified from HeLa cell nuclear extracts to more than 90% homogeneity by affinity chromatography on heparin-Sepharose and by chromatofocusing. Sequence analysis of five tryptic peptides revealed that their sequences matched corresponding sequences of the A2 protein of the heterogeneous nuclear ribonucleoprotein (hnRNP) complex. Furthermore, RA33 was shown to be present in the 40S hnRNP complex and to behave indistinguishably from A2 in binding to single stranded DNA. In summary, these data strongly indicate that RA33 and A2 are the same protein, and thus identify on a molecular level a new autoantigen. Images PMID:1522214
The leukocyte common antigen (CD45): a putative receptor-linked protein tyrosine phosphatase.
Charbonneau, H; Tonks, N K; Walsh, K A; Fischer, E H
1988-01-01
A major protein tyrosine phosphatase (PTPase 1B) has been isolated in essentially homogeneous form from the soluble and particulate fractions of human placenta. Unexpectedly, partial amino acid sequences displayed no homology with the primary structures of the protein Ser/Thr phosphatases deduced from cDNA clones. However, the sequence is strikingly similar to the tandem C-terminal homologous domains of the leukocyte common antigen (CD45). A 157-residue segment of PTPase 1B displayed 40% and 33% sequence identity with corresponding regions from cytoplasmic domains I and II of human CD45. Similar degrees of identity have been observed among the catalytic domains of families of regulatory proteins such as protein kinases and cyclic nucleotide phosphodiesterases. On this basis, it is proposed that the CD45 family has protein tyrosine phosphatase activity and may represent a set of cell-surface receptors involved in signal transduction. This suggests that the repertoire of signal transduction mechanisms may include the direct control of an intracellular protein tyrosine phosphatase, offering the possibility of a regulatory balance with those protein tyrosine kinases that act at the internal surface of the membrane. Images PMID:2845400
Kohda, Daisuke
2018-04-01
Promiscuous recognition of ligands by proteins is as important as strict recognition in numerous biological processes. In living cells, many short, linear amino acid motifs function as targeting signals in proteins to specify the final destination of the protein transport. In general, the target signal is defined by a consensus sequence containing wild-characters, and hence represented by diverse amino acid sequences. The classical lock-and-key or induced-fit/conformational selection mechanism may not cover all aspects of the promiscuous recognition. On the basis of our crystallographic and NMR studies on the mitochondrial Tom20 protein-presequence interaction, we proposed a new hypothetical mechanism based on "a rapid equilibrium of multiple states with partial recognitions". This dynamic, multiple recognition mode enables the Tom20 receptor to recognize diverse mitochondrial presequences with nearly equal affinities. The plant Tom20 is evolutionally unrelated to the animal Tom20 in our study, but is a functional homolog of the animal/fungal Tom20. NMR studies by another research group revealed that the presequence binding by the plant Tom20 was not fully explained by simple interaction modes, suggesting the presence of a similar dynamic, multiple recognition mode. Circumstantial evidence also suggested that similar dynamic mechanisms may be applicable to other promiscuous recognitions of signal peptides by the SRP54/Ffh and SecA proteins.
A Novel Locomotion-based Validation Assay for Candidate Drugs Using Drosophila DYT1 Disease Model
2013-11-01
the genome using the same parental fly line, minimizing the effect of surrounding sequences and genetic variations on the ...locomotion and GTPC cyclrohydolase protein levels; (3) supplementation of dopamine can partially rescue the locomotion defects of Drosophila larvae...8217- GCGAACAACCAAAAAATCATTGAGATAATAAACTCCTCCATTAG-3’) to make dtorsin cDNA that lacks GAC (D307) (Fig. 1) respectively. After confirming mutated sequences , the insert was again
Janarthanan, Sundaram; Sakthivelkumar, Shanmugavel; Veeramani, Velayutham; Radhika, Dixit; Muthukrishanan, Subbaratnam
2012-12-15
The anti-metabolic or insecticidal gene, arcelin (Arl) was isolated, cloned and sequenced using sequence specific degenerate primers from the seeds of Lablab purpureus collected from the Western Ghats, Tamil Nadu, India. The L. purpureus arcelin nucleotide sequence was homologous to Arl-3 and Arl-4 alleles from Phaseolus spp. The protein it encodes has 70% amino acid identity with the amino acid sequences of Arl-3I, Arl-3III, Arl-4 precursor, Arl-4 and Arl-4I. The partially purified arcelin from the seeds of L. purpureus using an artificial diet confirmed the complete retardation of development of the stored product pest Callosobruchus maculatus at 0.2% w/w arcelin-incorporated artificial seeds. Copyright © 2012 Elsevier Ltd. All rights reserved.
Vipsita, Swati; Rath, Santanu Kumar
2015-01-01
Protein superfamily classification deals with the problem of predicting the family membership of newly discovered amino acid sequence. Although many trivial alignment methods are already developed by previous researchers, but the present trend demands the application of computational intelligent techniques. As there is an exponential growth in size of biological database, retrieval and inference of essential knowledge in the biological domain become a very cumbersome task. This problem can be easily handled using intelligent techniques due to their ability of tolerance for imprecision, uncertainty, approximate reasoning, and partial truth. This paper discusses the various global and local features extracted from full length protein sequence which are used for the approximation and generalisation of the classifier. The various parameters used for evaluating the performance of the classifiers are also discussed. Therefore, this review article can show right directions to the present researchers to make an improvement over the existing methods.
Hierarchy and extremes in selections from pools of randomized proteins
Boyer, Sébastien; Biswas, Dipanwita; Kumar Soshee, Ananda; Scaramozzino, Natale; Nizak, Clément; Rivoire, Olivier
2016-01-01
Variation and selection are the core principles of Darwinian evolution, but quantitatively relating the diversity of a population to its capacity to respond to selection is challenging. Here, we examine this problem at a molecular level in the context of populations of partially randomized proteins selected for binding to well-defined targets. We built several minimal protein libraries, screened them in vitro by phage display, and analyzed their response to selection by high-throughput sequencing. A statistical analysis of the results reveals two main findings. First, libraries with the same sequence diversity but built around different “frameworks” typically have vastly different responses; second, the distribution of responses of the best binders in a library follows a simple scaling law. We show how an elementary probabilistic model based on extreme value theory rationalizes the latter finding. Our results have implications for designing synthetic protein libraries, estimating the density of functional biomolecules in sequence space, characterizing diversity in natural populations, and experimentally investigating evolvability (i.e., the potential for future evolution). PMID:26969726
Hierarchy and extremes in selections from pools of randomized proteins.
Boyer, Sébastien; Biswas, Dipanwita; Kumar Soshee, Ananda; Scaramozzino, Natale; Nizak, Clément; Rivoire, Olivier
2016-03-29
Variation and selection are the core principles of Darwinian evolution, but quantitatively relating the diversity of a population to its capacity to respond to selection is challenging. Here, we examine this problem at a molecular level in the context of populations of partially randomized proteins selected for binding to well-defined targets. We built several minimal protein libraries, screened them in vitro by phage display, and analyzed their response to selection by high-throughput sequencing. A statistical analysis of the results reveals two main findings. First, libraries with the same sequence diversity but built around different "frameworks" typically have vastly different responses; second, the distribution of responses of the best binders in a library follows a simple scaling law. We show how an elementary probabilistic model based on extreme value theory rationalizes the latter finding. Our results have implications for designing synthetic protein libraries, estimating the density of functional biomolecules in sequence space, characterizing diversity in natural populations, and experimentally investigating evolvability (i.e., the potential for future evolution).
The Complete Sequence of a Human Parainfluenzavirus 4 Genome
Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond
2009-01-01
Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536
Gabe, Jeffrey D.; Dragon, Elizabeth; Chang, Ray-Jen; McCaman, Michael T.
1998-01-01
A tandem pair of nearly identical genes from Serpulina hyodysenteriae (B204) were cloned and sequenced. The full open reading frame of one gene and the partial open reading frame of the neighboring gene appear to encode secreted proteins which are homologous to, yet distinct from, the 39-kDa extracytoplasmic protein purified from the membrane fraction of S. hyodysenteriae. We have designated these newly identified genes vspA and vspB (for variable surface protein). PMID:9440540
A New Zamilon-like Virophage Partial Genome Assembled from a Bioreactor Metagenome
Bekliz, Meriem; Verneau, Jonathan; Benamar, Samia; Raoult, Didier; La Scola, Bernard; Colson, Philippe
2015-01-01
Virophages replicate within viral factories inside the Acanthamoeba cytoplasm, and decrease the infectivity and replication of their associated giant viruses. Culture isolation and metagenome analyses have suggested that they are common in our environment. By screening metagenomic databases in search of amoebal viruses, we detected virophage-related sequences among sequences generated from the same non-aerated bioreactor metagenome as recently screened by another team for virophage capsid-encoding genes. We describe here the assembled partial genome of a virophage closely related to Zamilon, which infects Acanthamoeba with mimiviruses of lineages B and C but not A. Searches for sequences related to amoebal giant viruses, other Megavirales representatives and virophages were conducted using BLAST against this bioreactor metagenome (PRJNA73603). Comparative genomic and phylogenetic analyses were performed using sequences from previously identified virophages. A total of 72 metagenome contigs generated from the bioreactor were identified as best matching with sequences from Megavirales representatives, mostly Pithovirus sibericum, pandoraviruses and amoebal mimiviruses from three lineages A–C, as well as from virophages. In addition, a partial genome from a Zamilon-like virophage, we named Zamilon 2, was assembled. This genome has a size of 6716 base pairs, corresponding to 39% of the Zamilon genome, and comprises partial or full-length homologs for 15 Zamilon predicted open reading frames (ORFs). Mean nucleotide and amino acid identities for these 15 Zamilon 2 ORFs with their Zamilon counterparts were 89% (range, 81–96%) and 91% (range, 78–99%), respectively. Notably, these ORFs included two encoding a capsid protein and a packaging ATPase. Comparative genomics and phylogenetic analyses indicated that the partial genome was that of a new Zamilon-like virophage. Further studies are needed to gain better knowledge of the tropism and prevalence of virophages in our biosphere and in humans. PMID:26640459
Nishida, I; Sugiura, M; Enju, A; Nakamura, M
2000-12-01
A new isogene for acyl-(acyl-carrier-protein):glycerol-3-phosphate acyltransferase (GPAT; EC 2.3.1.15) in squash has been cloned and the gene product was identified as oleate-selective GPAT. Using PCR primers that could hybridise with exons for a previously cloned squash GPAT, we obtained two PCR products of different size: one coded for a previously cloned squash GPAT corresponding to non-selective isoforms AT2 and AT3, and the other for a new isozyme, probably the oleate-selective isoform AT1. Full-length amino acid sequences of respective isozymes were deduced from the nucleotide sequences of genomic genes and cDNAs, which were cloned by a series of PCR-based methods. Thus, we designated the new gene CmATS1;1 and the other one CmATS1;2. Genome blot analysis revealed that the squash genome contained the two isogenes at non-allelic loci. AT1-active fractions were partially purified, and three polypeptide bands were identified as being AT1 polypeptides, which exhibited relative molecular masses of 39.5-40.5 kDa, pI values of 6.75-7.15, and oleate selectivity over palmitate. Partial amino-terminal sequences obtained from two of these bands verified that the new isogene codes for AT1 polypeptides.
Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo
2003-01-01
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
Xiong, Wangdan; Wei, Qian; Wu, Pingzhi; Zhang, Sheng; Li, Jun; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang
2017-07-01
The β-ketoacyl-acyl carrier protein synthase I (KASI) is involved in de novo fatty acid biosynthesis in many organisms. Two putative KASI genes, JcKASI-1 and JcKASI-2, were isolated from Jatropha curcas. The deduced amino acid sequences of JcKASI-1 and JcKASI-2 exhibit around 83.8% and 72.5% sequence identities with AtKASI, respectively, and both contain conserved Cys-His-Lys-His-Phe catalytic active sites. Phylogenetic analysis indicated that JcKASI-2 belongs to a clade with several KASI proteins from dicotyledonous plants. Both JcKASI genes were expressed in multiple tissues, most strongly in filling stage seeds of J. curcas. Additionally, the JcKASI-1 and JcKASI-2 proteins were both localized to the plastids. Expressing JcKASI-1 in the Arabidopsis kasI mutant rescued the mutant's phenotype and restored the fatty acid composition and oil content in seeds to wild-type, but expressing JcKASI-2 in the Arabidopsis kasI mutant resulted in only partial rescue. This implies that JcKASI-1 and JcKASI-2 exhibit partial functional redundancy and KASI genes play a universal role in regulating fatty acid biosynthesis, growth, and development in plants. Copyright © 2017 Elsevier GmbH. All rights reserved.
Kathuria, Sagar V; Chan, Yvonne H; Nobrega, R Paul; Özen, Ayşegül; Matthews, C Robert
2016-03-01
Measurements of protection against exchange of main chain amide hydrogens (NH) with solvent hydrogens in globular proteins have provided remarkable insights into the structures of rare high-energy states that populate their folding free-energy surfaces. Lacking, however, has been a unifying theory that rationalizes these high-energy states in terms of the structures and sequences of their resident proteins. The Branched Aliphatic Side Chain (BASiC) hypothesis has been developed to explain the observed patterns of protection in a pair of TIM barrel proteins. This hypothesis supposes that the side chains of isoleucine, leucine, and valine (ILV) residues often form large hydrophobic clusters that very effectively impede the penetration of water to their underlying hydrogen bond networks and, thereby, enhance the protection against solvent exchange. The linkage between the secondary and tertiary structures enables these ILV clusters to serve as cores of stability in high-energy partially folded states. Statistically significant correlations between the locations of large ILV clusters in native conformations and strong protection against exchange for a variety of motifs reported in the literature support the generality of the BASiC hypothesis. The results also illustrate the necessity to elaborate this simple hypothesis to account for the roles of adjacent hydrocarbon moieties in defining stability cores of partially folded states along folding reaction coordinates. © 2015 The Protein Society.
The proteolytic processing site of the precursor of lysyl oxidase.
Cronshaw, A D; Fothergill-Gilmore, L A; Hulmes, D J
1995-01-01
The precise cleavage site of the N-terminal propeptide region of the precursor of lysyl oxidase has not yet been established, due to N-terminal blocking of the mature protein. Using a combination of peptide fragmentation, amino acid sequencing, time-of-flight m.s. and partial chemical unblocking procedures, it is shown that the mature form of lysyl oxidase begins at residue Asp-169 of the precursor protein (numbered according to the human sequence). The cleavage site is 28 residues to the C-terminal side of the site previously suggested on the basis of apparant molecular mass by SDS/PAGE, with the consequence that the two putative, N-linked glycosylation sites and the position of the Arg/Gln sequence polymorphism are now all in the precursor region. PMID:7864821
Wilson, D J; Kim, D S; Clarke, G A; Marshall-Clarke, S; Moss, D J
1996-12-01
We have previously identified a glycosylphosphatidylinositol-linked glycoprotein of 55 kDa (GP55) which inhibits neurite outgrowth. We now provide evidence that GP55, isolated from adult chick brain, consists of at least two bands, both of which are active, i.e., block outgrowth of neurites from chick dorsal root ganglion neurons. An antiserum raised against the adult proteins reverses the inhibition and preliminary experiments suggest that GP55 is restricted to the nervous system, increases during development from very low levels at embryonic day 10 and is most abundant after hatching. Immunofluorescence reveals that GP55 is expressed on neurons cultured from an embryonic day 14 chick brain but is barely detectable on embryonic day 10 dorsal root ganglion neurons or embryonic day 8 forebrain neurons; the neurons which respond to substrate-bound GP55. Peptide sequencing revealed considerable homology with OBCAM, a protein previously identified on the basis of binding opiates. Nested polymerase chain reaction using primers to the OBCAM sequence and internal primers to GP55 peptides produced two different polymerase chain reaction fragments with homology to OBCAM. A full length clone (E19S) corresponding to one polymerase chain reaction product and a partial length clone (E14S) corresponding to the second have been isolated from an embryonic chick brain library. Both are members of the immunoglobulin superfamily and have (or are expected to have) three C2 domains. E19S has 90% homology with LAMP at the amino acid level. This sequence only partially matches the peptides from the adult protein and hence is probably not a major component of the adult proteins. E14S (GP55-A) has 83% homology to OBCAM at the amino acid level over the region sequenced. The sequence matches several of the peptides from the adult protein and is hence likely to correspond to a major component of the adult proteins. Thus members of the GP55 family are related to OBCAM, neurotrimin, LAMP and a recently discovered chick protein CEPU-1. Our results suggest molecules within this family are capable of acting as cell adhesion molecules and inhibitors of neurite outgrowth.
The arbuscular mycorrhizal fungal protein glomalin is a putative homolog of heat shock protein 60.
Gadkar, Vijay; Rillig, Matthias C
2006-10-01
Work on glomalin-related soil protein produced by arbuscular mycorrhizal (AM) fungi (AMF) has been limited because of the unknown identity of the protein. A protein band cross-reactive with the glomalin-specific antibody MAb32B11 from the AM fungus Glomus intraradices was partially sequenced using tandem liquid chromatography-mass spectrometry. A 17 amino acid sequence showing similarity to heat shock protein 60 (hsp 60) was obtained. Based on degenerate PCR, a full-length cDNA of 1773 bp length encoding the hsp 60 gene was isolated from a G. intraradices cDNA library. The ORF was predicted to encode a protein of 590 amino acids. The protein sequence had three N-terminal glycosylation sites and a string of GGM motifs at the C-terminal end. The GiHsp 60 ORF had three introns of 67, 76 and 131 bp length. The GiHsp 60 was expressed using an in vitro translation system, and the protein was purified using the 6xHis-tag system. A dot-blot assay on the purified protein showed that it was highly cross-reactive with the glomalin-specific antibody MAb32B11. The present work provides the first evidence for the identity of the glomalin protein in the model AMF G. intraradices, thus facilitating further characterization of this protein, which is of great interest in soil ecology.
RNA regulators responding to ribosomal protein S15 are frequent in sequence space
Slinger, Betty L.; Meyer, Michelle M.
2016-01-01
There are several natural examples of distinct RNA structures that interact with the same ligand to regulate the expression of homologous genes in different organisms. One essential question regarding this phenomenon is whether such RNA regulators are the result of convergent or divergent evolution. Are the RNAs derived from some common ancestor and diverged to the point where we cannot identify the similarity, or have multiple solutions to the same biological problem arisen independently? A key variable in assessing these alternatives is how frequently such regulators arise within sequence space. Ribosomal protein S15 is autogenously regulated via an RNA regulator in many bacterial species; four apparently distinct regulators have been functionally validated in different bacterial phyla. Here, we explore how frequently such regulators arise within a partially randomized sequence population. We find many RNAs that interact specifically with ribosomal protein S15 from Geobacillus kaustophilus with biologically relevant dissociation constants. Furthermore, of the six sequences we characterize, four show regulatory activity in an Escherichia coli reporter assay. Subsequent footprinting and mutagenesis analysis indicates that protein binding proximal to regulatory features such as the Shine–Dalgarno sequence is sufficient to enable regulation, suggesting that regulation in response to S15 is relatively easily acquired. PMID:27580716
Zuotin, a putative Z-DNA binding protein in Saccharomyces cerevisiae
NASA Technical Reports Server (NTRS)
Zhang, S.; Lockshin, C.; Herbert, A.; Winter, E.; Rich, A.
1992-01-01
A putative Z-DNA binding protein, named zuotin, was purified from a yeast nuclear extract by means of a Z-DNA binding assay using [32P]poly(dG-m5dC) and [32P]oligo(dG-Br5dC)22 in the presence of B-DNA competitor. Poly(dG-Br5dC) in the Z-form competed well for the binding of a zuotin containing fraction, but salmon sperm DNA, poly(dG-dC) and poly(dA-dT) were not effective. Negatively supercoiled plasmid pUC19 did not compete, whereas an otherwise identical plasmid pUC19(CG), which contained a (dG-dC)7 segment in the Z-form was an excellent competitor. A Southwestern blot using [32P]poly(dG-m5dC) as a probe in the presence of MgCl2 identified a protein having a molecular weight of 51 kDa. The 51 kDa zuotin was partially sequenced at the N-terminal and the gene, ZUO1, was cloned, sequenced and expressed in Escherichia coli; the expressed zuotin showed similar Z-DNA binding activity, but with lower affinity than zuotin that had been partially purified from yeast. Zuotin was deduced to have a number of potential phosphorylation sites including two CDC28 (homologous to the human and Schizosaccharomyces pombe cdc2) phosphorylation sites. The hexapeptide motif KYHPDK was found in zuotin as well as in several yeast proteins, DnaJ of E.coli, csp29 and csp32 proteins of Drosophila and the small t and large T antigens of the polyoma virus. A 60 amino acid segment of zuotin has similarity to several histone H1 sequences. Disruption of ZUO1 in yeast resulted in a slow growth phenotype.
Wilske, B; Preac-Mursic, V; Jauris, S; Hofmann, A; Pradel, I; Soutschek, E; Schwab, E; Will, G; Wanner, G
1993-01-01
The gene of the immunodominant major protein pC of Borrelia burgdorferi was previously cloned and sequenced (R. Fuchs, S. Jauris, F. Lottspeich, V. Preac-Mursic, B. Wilske, and E. Soutschek, Mol. Microbiol. 6:503-509, 1992). pC is abundantly expressed on the outer surface of B. burgdorferi, as demonstrated by immunoelectron microscopy with monoclonal antibody L22 1F8. Accordingly, pC is renamed OspC, by analogy to the outer surface proteins OspA and OspB. Western immunoblot analysis of 45 B. burgdorferi isolates with monoclonal antibodies revealed that OspC is immunologically heterogeneous. Partial sequence analysis of the ospC gene confirmed the protein heterogeneity at the genetic level. We found that the degree of identity between the ospC partial sequences of five strains representing different OspA serotypes was only 63.3 to 85.4%. Immunological heterogeneity was also observed among representatives of the three newly designated genospecies of B. burgdorferi sensu lato, B. burgdorferi sensu stricto, B. garinii, and group VS461. Heterogeneity was confirmed for B. garinii at the genetic level. The ospC gene was also cloned from strains that did not express OspC, and antibody-reactive OspC was expressed in Escherichia coli. In addition, OspC-expressing variants were obtained from a nonexpressing strain by plating single colonies on solid medium. These findings confirm that the ospC gene is also present in nonexpressing strains. Because OspC is an immunodominant protein for the early immune response in Lyme borreliosis and was effective as a vaccine in an animal model, the immunological and molecular polymorphisms of ospC and OspC have important implications for the development of diagnostic reagents and vaccines. Images PMID:8478108
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ris-Stalpers, C.; Verleun-Mooijman, M.C.T.; Blaeij, T.J.P. de
1994-04-01
The analysis of the androgen receptor (AR) gene, mRNA, and protein in a subject with X-linked Reifenstein syndrome (partial androgen insensitivity) is reported. The presence of two mature AR transcripts in genital skin fibroblasts of the patient is established, and, by reverse transcriptase-PCR and RNase transcription analysis, the wild-type transcript and a transcript in which exon 3 sequences are absent without disruption of the translational reading frame are identified. Sequencing and hybridization analysis show a deletion of >6 kb in intron 2 of the human AR gene, starting 18 bp upstream of exon 3. The deletion includes the putative branch-pointmore » sequence (BPS) but not the acceptor splice site on the intron 2/exon 3 boundary. The deletion of the putative intron 2 BPS results in 90% inhibition of wild-type splicing. The mutant transcript encodes an AR protein lacking the second zinc finger of the DNA-binding domain. Western/immunoblotting analysis is used to show that the mutant AR protein is expressed in genital skin fibroblasts of the patient. The residual 10% wild-type transcript can be the result of the use of a cryptic BPS located 63 bp upstream of the intron 2/exon 3 boundary of the mutant AR gene. The mutated AR protein has no transcription-activating potential and does not influence the transactivating properties of the wild-type AR, as tested in cotransfection studies. It is concluded that the partial androgen-insensitivity syndrome of this patient is the consequence of the limited amount of wild-type AR protein expressed in androgen target cells, resulting from the deletion of the intron 2 putative BPS. 42 refs., 6 figs., 1 tab.« less
Genetics Home Reference: PURA syndrome
... lead to a reduced amount of functional Purα protein. Although it is not understood how a partial loss of Purα function leads to the ... KJ, Gawne-Cain M; DDD study, Magee AC, Turnpenny PD, Baralle D. Whole exome sequencing in family trios reveals de novo mutations in ...
Fourment, Mathieu; Gibbs, Mark J
2008-01-01
Background Viruses of the Bunyaviridae have segmented negative-stranded RNA genomes and several of them cause significant disease. Many partial sequences have been obtained from the segments so that GenBank searches give complex results. Sequence databases usually use HTML pages to mediate remote sorting, but this approach can be limiting and may discourage a user from exploring a database. Results The VirusBanker database contains Bunyaviridae sequences and alignments and is presented as two spreadsheets generated by a Java program that interacts with a MySQL database on a server. Sequences are displayed in rows and may be sorted using information that is displayed in columns and includes data relating to the segment, gene, protein, species, strain, sequence length, terminal sequence and date and country of isolation. Bunyaviridae sequences and alignments may be downloaded from the second spreadsheet with titles defined by the user from the columns, or viewed when passed directly to the sequence editor, Jalview. Conclusion VirusBanker allows large datasets of aligned nucleotide and protein sequences from the Bunyaviridae to be compiled and winnowed rapidly using criteria that are formulated heuristically. PMID:18251994
Haseloff, J; Goelet, P; Zimmern, D; Ahlquist, P; Dasgupta, R; Kaesberg, P
1984-01-01
The plant viruses alfalfa mosaic virus (AMV) and brome mosaic virus (BMV) each divide their genetic information among three RNAs while tobacco mosaic virus (TMV) contains a single genomic RNA. Amino acid sequence comparisons suggest that the single proteins encoded by AMV RNA 1 and BMV RNA 1 and by AMV RNA 2 and BMV RNA 2 are related to the NH2-terminal two-thirds and the COOH-terminal one-third, respectively, of the largest protein encoded by TMV. Separating these two domains in the TMV RNA sequence is an amber termination codon, whose partial suppression allows translation of the downstream domain. Many of the residues that the TMV read-through domain and the segmented plant viruses have in common are also conserved in a read-through domain found in the nonstructural polyprotein of the animal alphaviruses Sindbis and Middelburg. We suggest that, despite substantial differences in gene organization and expression, all of these viruses use related proteins for common functions in RNA replication. Reassortment of functional modules of coding and regulatory sequence from preexisting viral or cellular sources, perhaps via RNA recombination, may be an important mechanism in RNA virus evolution. PMID:6611550
Liberti, D; Marais, A; Svanella-Dumas, L; Dulucq, M J; Alioto, D; Ragozzino, A; Rodoni, B; Candresse, T
2005-04-01
ABSTRACT A trichovirus closely related to Apple chlorotic leaf spot virus (ACLSV) was detected in symptomatic apricot and Japanese plum from Italy. The Sus2 isolate of this agent cross-reacted with anti-ACLSV polyclonal reagents but was not detected by broad-specificity anti- ACLSV monoclonal antibodies. It had particles with typical trichovirus morphology but, contrary to ACLSV, was unable to infect Chenopodium quinoa and C. amaranticolor. The sequence of its genome (7,494 nucleotides [nt], missing only approximately 30 to 40 nt of the 5' terminal sequence) and the partial sequence of another isolate were determined. The new virus has a genomic organization similar to that of ACLSV, with three open reading frames coding for a replication-associated protein (RNA-dependent RNA polymerase), a movement protein, and a capsid protein, respectively. However, it had only approximately 65 to 67% nucleotide identity with sequenced isolates of ACLSV. The differences in serology, host range, genome sequence, and phylogenetic reconstructions for all viral proteins support the idea that this agent should be considered a new virus, for which the name Apricot pseudo-chlorotic leaf spot virus (APCLSV) is proposed. APCLSV shows substantial sequence variability and has been recovered from various Prunus sources coming from seven countries, an indication that it is likely to have a wide geographical distribution.
Timmermans, M J T N; Dodsworth, S; Culverwell, C L; Bocak, L; Ahrens, D; Littlewood, D T J; Pons, J; Vogler, A P
2010-11-01
Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
POOL server: machine learning application for functional site prediction in proteins.
Somarowthu, Srinivas; Ondrechen, Mary Jo
2012-08-01
We present an automated web server for partial order optimum likelihood (POOL), a machine learning application that combines computed electrostatic and geometric information for high-performance prediction of catalytic residues from 3D structures. Input features consist of THEMATICS electrostatics data and pocket information from ConCavity. THEMATICS measures deviation from typical, sigmoidal titration behavior to identify functionally important residues and ConCavity identifies binding pockets by analyzing the surface geometry of protein structures. Both THEMATICS and ConCavity (structure only) do not require the query protein to have any sequence or structure similarity to other proteins. Hence, POOL is applicable to proteins with novel folds and engineered proteins. As an additional option for cases where sequence homologues are available, users can include evolutionary information from INTREPID for enhanced accuracy in site prediction. The web site is free and open to all users with no login requirements at http://www.pool.neu.edu. m.ondrechen@neu.edu Supplementary data are available at Bioinformatics online.
Huang, Liping; Van Renne, Nicolaas; Liu, Changming; Nauwynck, Hans J
2015-12-01
Porcine circovirus type 2 (PCV2) encodes two major proteins: the replication protein (Rep) and the capsid protein (Cap). Cap displays a conserved stretch of basic residues situated on the inside of the capsid, whose role is so far unknown. We used a reverse-genetics approach to investigate its function and found that mutations in these amino acids hindered Cap mRNA translation and hampered Cap/Rep co-localization, yielding unfit viruses. Intriguingly, co-transfection with a WT PCV2 of a different genotype partially rescued mutant Cap expression, showing the importance of this basic pattern for efficient translation of Cap mRNA into protein. Our results show that Cap and Rep are expressed independently of each other, and that this amino acid sequence of Cap is vital for virus propagation. This study provides a method for studying unfit PCV2 virions and offers new insights into the intracellular modus vivendi of PCV2.
Ma, Zhengqiang
2013-01-01
Rht-B1c, allelic to the DELLA protein-encoding gene Rht-B1a, is a natural mutation documented in common wheat (Triticum aestivum). It confers variation to a number of traits related to cell and plant morphology, seed dormancy, and photosynthesis. The present study was conducted to examine the sequence variations of Rht-B1c and their functional impacts. The results showed that Rht-B1c was partially dominant or co-dominant for plant height, and exhibited an increased dwarfing effect. At the sequence level, Rht-B1c differed from Rht-B1a by one 2kb Veju retrotransposon insertion, three coding region single nucleotide polymorphisms (SNPs), one 197bp insertion, and four SNPs in the 1kb upstream sequence. Haplotype investigations, association analyses, transient expression assays, and expression profiling showed that the Veju insertion was primarily responsible for the extreme dwarfing effect. It was found that the Veju insertion changed processing of the Rht-B1c transcripts and resulted in DELLA motif primary structure disruption. Expression assays showed that Rht-B1c caused reduction of total Rht-1 transcript levels, and up-regulation of GATA-like transcription factors and genes positively regulated by these factors, suggesting that one way in which Rht-1 proteins affect plant growth and development is through GATA-like transcription factor regulation. PMID:23918966
Patnaik, Bharat Bhusan; Kim, Dong Hyun; Oh, Seung Han; Song, Yong-Su; Chanh, Nguyen Dang Minh; Kim, Jong Sun; Jung, Woo-jin; Saha, Atul Kumar; Bindroo, Bharat Bhushan; Han, Yeon Soo
2012-01-01
Background Silkworm fecal matter is considered one of the richest sources of antimicrobial and antiviral protein (substances) and such economically feasible and eco-friendly proteins acting as secondary metabolites from the insect system can be explored for their practical utility in conferring broad spectrum disease resistance against pathogenic microbial specimens. Methodology/Principal Findings Silkworm fecal matter extracts prepared in 0.02 M phosphate buffer saline (pH 7.4), at a temperature of 60°C was subjected to 40% saturated ammonium sulphate precipitation and purified by gel-filtration chromatography (GFC). SDS-PAGE under denaturing conditions showed a single band at about 21.5 kDa. The peak fraction, thus obtained by GFC wastested for homogeneityusing C18reverse-phase high performance liquid chromatography (HPLC). The activity of the purified protein was tested against selected Gram +/− bacteria and phytopathogenic Fusarium species with concentration-dependent inhibitionrelationship. The purified bioactive protein was subjected to matrix-assisted laser desorption and ionization-time of flight mass spectrometry (MALDI-TOF-MS) and N-terminal sequencing by Edman degradation towards its identification. The N-terminal first 18 amino acid sequence following the predicted signal peptide showed homology to plant germin-like proteins (Glp). In order to characterize the full-length gene sequence in detail, the partial cDNA was cloned and sequenced using degenerate primers, followed by 5′- and 3′-rapid amplification of cDNA ends (RACE-PCR). The full-length cDNA sequence composed of 630 bp encoding 209 amino acids and corresponded to germin-like proteins (Glps) involved in plant development and defense. Conclusions/Significance The study reports, characterization of novel Glpbelonging to subfamily 3 from M. alba by the purification of mature active protein from silkworm fecal matter. The N-terminal amino acid sequence of the purified protein was found similar to the deduced amino acid sequence (without the transit peptide sequence) of the full length cDNA from M. alba. PMID:23284650
Negative regulation of early polyomavirus expression in mouse embryonal carcinoma cells.
Cremisi, C; Babinet, C
1986-01-01
Embryonal carcinoma cells are resistant to infection by polyomavirus (Py). We showed that this block was partially removed by inhibiting protein synthesis temporarily. The block was also partially removed when Py was coinfected with simian virus 40. Cycloheximide treatment of cells infected with Py mutants able to grow on PCC4 embryonal carcinoma cells led to 3- to 10-fold increases in the production of T-antigen-positive cells. At 31 degrees C, Py T-antigen expression was enhanced when the cells were treated with cycloheximide. We suggest that a negative labile regulatory protein(s) is synthesized in PCC4 cells, preventing the initiation of early Py transcription by binding to the noncoding sequence, especially the enhancer element B and perhaps also element A, and that the Py mutants retained a binding site(s). PMID:3016339
Pru du 2S albumin or Pru du vicilin?
Garino, Cristiano; De Paolis, Angelo; Coïsson, Jean Daniel; Arlorio, Marco
2015-06-01
A short partial sequence of 28 amino acids is all the information we have so far about the putative allergen 2S albumin from almond. The aim of this work was to analyze this information using mainly bioinformatics tools, in order to verify its rightness. Based on the results reported in the paper describing this allergen from almond, we analyzed the original data of amino acids sequencing through available software. The degree of homology of the almond 12kDa protein with any other known 2S albumin appears to be much lower than the one reported in the paper that firstly described it. In a publicly available cDNA library we discovered an expressed sequence tag which translation generates a protein that perfectly matches both of the sequencing outputs described in the same paper. A further analysis indicated that the latter protein seems to belong to the vicilin superfamily rather than to the prolamin one. The fact that also vicilins are seed storage proteins known to be highly allergenic would explain the IgE reactivity originally observed. Based on our observations we suggest that the IgE reactive 12kDa protein from almond currently known as Pru du 2S albumin is in reality the cleaved N-terminal region of a 7S vicilin like protein. Copyright © 2015 Elsevier Ltd. All rights reserved.
Purification, amino acid sequence and characterisation of kangaroo IGF-I.
Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z
1998-01-01
Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.
Ohto, C; Ishida, C; Koike-Takeshita, A; Yokoyama, K; Muramatsu, M; Nishino, T; Obata, S
1999-02-01
A geranylgeranyl diphosphate (GGPP) synthase gene of an extremely thermophilic bacterium, Thermus thermophilus, was cloned and sequenced. T. thermophilus GGPP synthase, overexpressed in Escherichia coli cells as a glutathione S-transferase fusion protein, was purified and characterized. The fusion protein, retaining thermostability, formed a homodimer, and showed higher specific activity than did a partially purified thermostable enzyme previously reported. Optimal reaction conditions and kinetic parameters were also examined. The deduced amino acid sequence indicated that T. thermophilus GGPP synthase was excluded from the group of bacterial type GGPP synthases and lacked the insertion amino acid residues in the first aspartate-rich motif as do archaeal and eukaryotic short-chain prenyltransferases.
Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng
2012-01-01
For the purpose of screening putative anthracnose resistance-related genes of ramie ( Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides , were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis . These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.
Jonniaux, J L; Coster, F; Purnelle, B; Goffeau, A
1994-12-01
We report the amino acid sequence of 13 open reading frames (ORF > 299 bp) located on a 21.7 kb DNA segment from the left arm of chromosome XIV of Saccharomyces cerevisiae. Five open reading frames had been entirely or partially sequenced previously: WHI3, GCR2, SPX19, SPX18 and a heat shock gene similar to SSB1. The products of 8 other ORFs are new putative proteins among which N1394 is probably a membrane protein. N1346 contains a leucine zipper pattern and the corresponding ORF presents an HAP (global regulator of respiratory genes) upstream activating sequence in the promoting region. N1386 shares homologies with the DNA structure-specific recognition protein family SSRPs and the corresponding ORF is preceded by an MCB (MluI cell cycle box) upstream activating factor.
UniDrug-target: a computational tool to identify unique drug targets in pathogenic bacteria.
Chanumolu, Sree Krishna; Rout, Chittaranjan; Chauhan, Rajinder S
2012-01-01
Targeting conserved proteins of bacteria through antibacterial medications has resulted in both the development of resistant strains and changes to human health by destroying beneficial microbes which eventually become breeding grounds for the evolution of resistances. Despite the availability of more than 800 genomes sequences, 430 pathways, 4743 enzymes, 9257 metabolic reactions and protein (three-dimensional) 3D structures in bacteria, no pathogen-specific computational drug target identification tool has been developed. A web server, UniDrug-Target, which combines bacterial biological information and computational methods to stringently identify pathogen-specific proteins as drug targets, has been designed. Besides predicting pathogen-specific proteins essentiality, chokepoint property, etc., three new algorithms were developed and implemented by using protein sequences, domains, structures, and metabolic reactions for construction of partial metabolic networks (PMNs), determination of conservation in critical residues, and variation analysis of residues forming similar cavities in proteins sequences. First, PMNs are constructed to determine the extent of disturbances in metabolite production by targeting a protein as drug target. Conservation of pathogen-specific protein's critical residues involved in cavity formation and biological function determined at domain-level with low-matching sequences. Last, variation analysis of residues forming similar cavities in proteins sequences from pathogenic versus non-pathogenic bacteria and humans is performed. The server is capable of predicting drug targets for any sequenced pathogenic bacteria having fasta sequences and annotated information. The utility of UniDrug-Target server was demonstrated for Mycobacterium tuberculosis (H37Rv). The UniDrug-Target identified 265 mycobacteria pathogen-specific proteins, including 17 essential proteins which can be potential drug targets. UniDrug-Target is expected to accelerate pathogen-specific drug targets identification which will increase their success and durability as drugs developed against them have less chance to develop resistances and adverse impact on environment. The server is freely available at http://117.211.115.67/UDT/main.html. The standalone application (source codes) is available at http://www.bioinformatics.org/ftp/pub/bioinfojuit/UDT.rar.
Corfield, M. C.; Fletcher, J. C.
1969-01-01
1. A chymotryptic digest of the protein fraction U.S.3. from oxidized wool was separated into 51 peptide fractions by chromatography on a column of cation-exchange resin. 2. The less acidic fractions were separated into their component peptides by a combination of cation-exchange-resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid sequences of 34 of these peptides were elucidated, and those of 14 others partially determined. 4. Overlaps between the tryptic and chymotryptic peptides from fraction U.S.3 have enabled ten extended amino acid sequences to be deduced, the longest containing 20 amino acid residues. 5. The relevance of the results to the structures of the helical and non-helical regions of wool is discussed. PMID:5395876
Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A
2010-04-01
The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain
2008-05-01
PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
dos Reis, Sávio Pinho; Tavares, Liliane de Souza Conceição; Costa, Carinne de Nazaré Monteiro; Brígida, Aílton Borges Santa; de Souza, Cláudia Regina Batista
2012-06-01
Cassava (Manihot esculenta Crantz) is one of the world's most important food crops. It is cultivated mainly in developing countries of tropics, since its root is a major source of calories for low-income people due to its high productivity and resistance to many abiotic and biotic factors. A previous study has identified a partial cDNA sequence coding for a putative RING zinc finger in cassava storage root. The RING zinc finger protein is a specialized type of zinc finger protein found in many organisms. Here, we isolated the full-length cDNA sequence coding for M. esculenta RZF (MeRZF) protein by a combination of 5' and 3' RACE assays. BLAST analysis showed that its deduced amino acid sequence has a high level of similarity to plant proteins of RZF family. MeRZF protein contains a signature sequence motif for a RING zinc finger at its C-terminal region. In addition, this protein showed a histidine residue at the fifth coordination site, likely belonging to the RING-H2 subgroup, as confirmed by our phylogenetic analysis. There is also a transmembrane domain in its N-terminal region. Finally, semi-quantitative RT-PCR assays showed that MeRZF expression is increased in detached leaves treated with sodium chloride. Here, we report the first evidence of a RING zinc finger gene of cassava showing potential role in response to salt stress.
Defining the Nature of Thermal Intermediate in 3 State Folding Proteins: Apoflavodoxin, a Study Case
García-Fandiño, Rebeca; Bernadó, Pau; Ayuso-Tejedor, Sara; Sancho, Javier; Orozco, Modesto
2012-01-01
The early stages of the thermal unfolding of apoflavodoxin have been determined by using atomistic multi microsecond-scale molecular dynamics (MD) simulations complemented with a variety of experimental techniques. Results strongly suggest that the intermediate is reached very early in the thermal unfolding process and that it has the properties of an “activated” form of the native state, where thermal fluctuations in the loops break loop-loop contacts. The unrestrained loops gain then kinetic energy corrupting short secondary structure elements without corrupting the core of the protein. The MD-derived ensembles agree with experimental observables and draw a picture of the intermediate state inconsistent with a well-defined structure and characteristic of a typical partially disordered protein. Our results allow us to speculate that proteins with a well packed core connected by long loops might behave as partially disordered proteins under native conditions, or alternatively behave as three state folders. Small details in the sequence, easily tunable by evolution, can yield to one or the other type of proteins. PMID:22927805
Self-regulation of 70-kilodalton heat shock proteins in Saccharomyces cerevisiae.
Stone, D E; Craig, E A
1990-01-01
To determine whether the 70-kilodalton heat shock proteins of Saccharomyces cerevisiae play a role in regulating their own synthesis, we studied the effect of overexpressing the SSA1 protein on the activity of the SSA1 5'-regulatory region. The constitutive level of Ssa1p was increased by fusing the SSA1 structural gene to the GAL1 promoter. A reporter vector consisting of an SSA1-lacZ translational fusion was used to assess SSA1 promoter activity. In a strain producing approximately 10-fold the normal heat shock level of Ssa1p, induction of beta-galactosidase activity by heat shock was almost entirely blocked. Expression of a transcriptional fusion vector in which the CYC1 upstream activating sequence of a CYC1-lacZ chimera was replaced by a sequence containing a heat shock upstream activating sequence (heat shock element 2) from the 5'-regulatory region of SSA1 was inhibited by excess Ssa1p. The repression of an SSA1 upstream activating sequence by the SSA1 protein indicates that SSA1 self-regulation is at least partially mediated at the transcriptional level. The expression of another transcriptional fusion vector, containing heat shock element 2 and a lesser amount of flanking sequence, is not inhibited when Ssa1p is overexpressed. This suggests the existence of an element, proximal to or overlapping heat shock element 2, that confers sensitivity to the SSA1 protein. Images PMID:2181281
Smith, Geoff M.; Hutson, Jarod M.; Kindler, Lutz; Garcia-Moreno, Alejandro; Villaluenga, Aritza; Turner, Elaine
2017-01-01
Background Ancient protein sequences are increasingly used to elucidate the phylogenetic relationships between extinct and extant mammalian taxa. Here, we apply these recent developments to Middle Pleistocene bone specimens of the rhinoceros genus Stephanorhinus. No biomolecular sequence data is currently available for this genus, leaving phylogenetic hypotheses on its evolutionary relationships to extant and extinct rhinoceroses untested. Furthermore, recent phylogenies based on Rhinocerotidae (partial or complete) mitochondrial DNA sequences differ in the placement of the Sumatran rhinoceros (Dicerorhinus sumatrensis). Therefore, studies utilising ancient protein sequences from Middle Pleistocene contexts have the potential to provide further insights into the phylogenetic relationships between extant and extinct species, including Stephanorhinus and Dicerorhinus. Methods ZooMS screening (zooarchaeology by mass spectrometry) was performed on several Late and Middle Pleistocene specimens from the genus Stephanorhinus, subsequently followed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) to obtain ancient protein sequences from a Middle Pleistocene Stephanorhinus specimen. We performed parallel analysis on a Late Pleistocene woolly rhinoceros specimen and extant species of rhinoceroses, resulting in the availability of protein sequence data for five extant species and two extinct genera. Phylogenetic analysis additionally included all extant Perissodactyla genera (Equus, Tapirus), and was conducted using Bayesian (MrBayes) and maximum-likelihood (RAxML) methods. Results Various ancient proteins were identified in both the Middle and Late Pleistocene rhinoceros samples. Protein degradation and proteome complexity are consistent with an endogenous origin of the identified proteins. Phylogenetic analysis of informative proteins resolved the Perissodactyla phylogeny in agreement with previous studies in regards to the placement of the families Equidae, Tapiridae, and Rhinocerotidae. Stephanorhinus is shown to be most closely related to the genera Coelodonta and Dicerorhinus. The protein sequence data further places the Sumatran rhino in a clade together with the genus Rhinoceros, opposed to forming a clade with the black and white rhinoceros species. Discussion The first biomolecular dataset available for Stephanorhinus places this genus together with the extinct genus Coelodonta and the extant genus Dicerorhinus. This is in agreement with morphological studies, although we are unable to resolve the order of divergence between these genera based on the protein sequences available. Our data supports the placement of the genus Dicerorhinus in a clade together with extant Rhinoceros species. Finally, the availability of protein sequence data for both extinct European rhinoceros genera allows future investigations into their geographic distribution and extinction chronologies. PMID:28316883
Welker, Frido; Smith, Geoff M; Hutson, Jarod M; Kindler, Lutz; Garcia-Moreno, Alejandro; Villaluenga, Aritza; Turner, Elaine; Gaudzinski-Windheuser, Sabine
2017-01-01
Ancient protein sequences are increasingly used to elucidate the phylogenetic relationships between extinct and extant mammalian taxa. Here, we apply these recent developments to Middle Pleistocene bone specimens of the rhinoceros genus Stephanorhinus . No biomolecular sequence data is currently available for this genus, leaving phylogenetic hypotheses on its evolutionary relationships to extant and extinct rhinoceroses untested. Furthermore, recent phylogenies based on Rhinocerotidae (partial or complete) mitochondrial DNA sequences differ in the placement of the Sumatran rhinoceros ( Dicerorhinus sumatrensis ). Therefore, studies utilising ancient protein sequences from Middle Pleistocene contexts have the potential to provide further insights into the phylogenetic relationships between extant and extinct species, including Stephanorhinus and Dicerorhinus . ZooMS screening (zooarchaeology by mass spectrometry) was performed on several Late and Middle Pleistocene specimens from the genus Stephanorhinus , subsequently followed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) to obtain ancient protein sequences from a Middle Pleistocene Stephanorhinus specimen. We performed parallel analysis on a Late Pleistocene woolly rhinoceros specimen and extant species of rhinoceroses, resulting in the availability of protein sequence data for five extant species and two extinct genera. Phylogenetic analysis additionally included all extant Perissodactyla genera ( Equus , Tapirus ), and was conducted using Bayesian (MrBayes) and maximum-likelihood (RAxML) methods. Various ancient proteins were identified in both the Middle and Late Pleistocene rhinoceros samples. Protein degradation and proteome complexity are consistent with an endogenous origin of the identified proteins. Phylogenetic analysis of informative proteins resolved the Perissodactyla phylogeny in agreement with previous studies in regards to the placement of the families Equidae, Tapiridae, and Rhinocerotidae. Stephanorhinus is shown to be most closely related to the genera Coelodonta and Dicerorhinus . The protein sequence data further places the Sumatran rhino in a clade together with the genus Rhinoceros , opposed to forming a clade with the black and white rhinoceros species. The first biomolecular dataset available for Stephanorhinus places this genus together with the extinct genus Coelodonta and the extant genus Dicerorhinus . This is in agreement with morphological studies, although we are unable to resolve the order of divergence between these genera based on the protein sequences available. Our data supports the placement of the genus Dicerorhinus in a clade together with extant Rhinoceros species. Finally, the availability of protein sequence data for both extinct European rhinoceros genera allows future investigations into their geographic distribution and extinction chronologies.
Bombyx mori Nucleopolyhedrovirus Encodes a DNA-Binding Protein Capable of Destabilizing Duplex DNA
Mikhailov, Victor S.; Mikhailova, Alla L.; Iwanaga, Masashi; Gomi, Sumiko; Maeda, Susumu
1998-01-01
A DNA-binding protein (designated DBP) with an apparent molecular mass of 38 kDa was purified to homogeneity from BmN cells (derived from Bombyx mori) infected with the B. mori nucleopolyhedrovirus (BmNPV). Six peptides obtained after digestion of the isolated protein with Achromobacter protease I were partially or completely sequenced. The determined amino acid sequences indicated that DBP was encoded by an open reading frame (ORF16) located at nucleotides (nt) 16189 to 17139 in the BmNPV genome (GenBank accession no. L33180). This ORF (designated dbp) is a homolog of Autographa californica multicapsid NPV ORF25, whose product has not been identified. BmNPV DBP is predicted to contain 317 amino acids (calculated molecular mass of 36.7 kDa) and to have an isoelectric point of 7.8. DBP showed a tendency to multimerization in the course of purification and was found to bind preferentially to single-stranded DNA. When bound to oligonucleotides, DBP protected them from hydrolysis by phage T4 DNA polymerase-associated 3′→5′ exonuclease. The sizes of the protected fragments indicated that a binding site size for DBP is about 30 nt per protein monomer. DBP, but not BmNPV LEF-3, was capable of unwinding partial DNA duplexes in an in vitro system. This helix-destabilizing ability is consistent with the prediction that DBP functions as a single-stranded DNA binding protein in virus replication. PMID:9525636
Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W
1997-04-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.
1997-01-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
ScaffoldSeq: Software for characterization of directed evolution populations.
Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J
2016-07-01
ScaffoldSeq is software designed for the numerous applications-including directed evolution analysis-in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Craig, R K; Hall, L; Parker, D; Campbell, P N
1981-01-01
A complementary DNA (cDNA) plasmid library has been constructed in the plasmid pAT153, using poly(A)-containing RNA isolated from the lactating guinea-pig mammary gland as the starting material. Double stranded cDNA was inserted into the EcoRI site of the plasmid using poly(dA . dT) tails, then transformed into Escherichia coli HB101. From the resulting colonies we have selected and partially characterized plasmids containing cDNA copies of the mRNAs for casein A, casein B, casein C and alpha-lactalbumin. However, the proportion containing casein C cDNA was exceptionally low, and these contained at best 60% of the mRNA sequence. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:7306038
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hou, Xiaomin; Meehan, Edward J.; Xie, Jieming
2008-10-27
A novel type 1 ribosome-inactivating protein (RIP) designated cucurmosin was isolated from the sarcocarp of Cucurbita moschata (pumpkin). Besides rRNA N-glycosidase activity, cucurmosin exhibits strong cytotoxicities to three cancer cell lines of both human and murine origins, but low toxicity to normal cells. Plant genomic DNA extracted from the tender leaves was amplified by PCR between primers based on the N-terminal sequence and X-ray sequence of the C-terminal. The complete mature protein sequence was obtained from N-terminal protein sequencing and partial DNA sequencing, confirmed by high resolution crystal structure analysis. The crystal structure of cucurmosin has been determined at 1.04more » {angstrom}, a resolution that has never been achieved before for any RIP. The structure contains two domains: a large N-terminal domain composed of seven {alpha}-helices and eight {beta}-strands, and a smaller C-terminal domain consisting of three {alpha}-helices and two {beta}-strands. The high resolution structure established a glycosylation pattern of GlcNAc{sub 2}Man3Xyl. Asn225 was identified as a glycosylation site. Residues Tyr70, Tyr109, Glu158 and Arg161 define the active site of cucurmosin as an RNA N-glycosidase. The structural basis of cytotoxicity difference between cucurmosin and trichosanthin is discussed.« less
Liu, Ye; Li, Nan; Zhang, Shoufeng; Zhang, Fei; Lian, Hai; Wang, Ying; Zhang, Jinxia; Hu, Rongliang
2013-12-01
The genome of Irkut virus, isolate IRKV-THChina12, the first non-rabies lyssavirus from China (of bat origin), has been completely sequenced. In general, coding and non-coding regions of this viral genome are similar to those of other lyssaviruses. However, alignment of the deduced amino acid sequences of the structural proteins of IRKV-THChina12 with those of other lyssavirus representatives revealed significant variability between viral species. The nucleoprotein and matrix protein were found to be the most conserved, followed by the large protein, glycoprotein and phosphoprotein. Differences in the antigenic sites in glycoprotein may result in only partial protection of the available rabies biologics against Irkut virus, which is of particular concern for pre- and post-exposure rabies prophylaxis. Copyright © 2013 Elsevier Inc. All rights reserved.
Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta
2012-11-07
Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found.
2012-01-01
Background Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found. PMID:23134664
Nisin-Producing Lactococcus lactis Strains Isolated from Human Milk
Beasley, Shea S.; Saris, Per E. J.
2004-01-01
Characterization by partial 16S rRNA gene sequencing, ribotyping, and green fluorescent protein-based nisin bioassay revealed that 6 of 20 human milk samples contained nisin-producing Lactococcus lactis bacteria. This suggests that the history of humans consuming nisin is older than the tradition of consuming fermented milk products. PMID:15294850
Primary structure of Lep d I, the main Lepidoglyphus destructor allergen.
Varela, J; Ventas, P; Carreira, J; Barbas, J A; Gimenez-Gallego, G; Polo, F
1994-10-01
The most relevant allergen of the storage mite Lepidoglyphus destructor (Lep d I) has been characterized. Lep d I is a monomer protein of 13273 Da. The primary structure of Lep d I was determined by N-terminal Edman degradation and partially confirmed by cDNA sequencing. Sequence polymorphism was observed at six positions, with non-conservative substitutions in three of them. No potential N-glycosylation site was revealed by peptide sequencing. The 125-residue sequence of Lep d I shows approximately 40% identity (including the six cysteines) with the overlapping regions of group II allergens from the genus Dermatophagoides, which, however, do not share common allergenic epitopes with Lep d I.
Guerriero, Gea; Spadiut, Oliver; Kerschbamer, Christine; Giorno, Filomena; Baric, Sanja; Ezcurra, Inés
2016-01-01
Cellulose synthase (CesA) genes constitute a complex multigene family with six major phylogenetic clades in angiosperms. The recently sequenced genome of domestic apple, Malus×domestica, was mined for CesA genes, by blasting full-length cellulose synthase protein (CESA) sequences annotated in the apple genome against protein databases from the plant models Arabidopsis thaliana and Populus trichocarpa. Thirteen genes belonging to the six angiosperm CesA clades and coding for proteins with conserved residues typical of processive glycosyltransferases from family 2 were detected. Based on their phylogenetic relationship to Arabidopsis CESAs, as well as expression patterns, a nomenclature is proposed to facilitate further studies. Examination of their genomic organization revealed that MdCesA8-A is closely linked and co-oriented with WDR53, a gene coding for a WD40 repeat protein. The WDR53 and CesA8 genes display conserved collinearity in dicots and are partially co-expressed in the apple xylem. Interestingly, the presence of a bicistronic WDR53–CesA8A transcript was detected in phytoplasma-infected phloem tissues of apple. The bicistronic transcript contains a spliced intergenic sequence that is predicted to fold into hairpin structures typical of internal ribosome entry sites, suggesting its potential cap-independent translation. Surprisingly, the CesA8A cistron is alternatively spliced and lacks the zinc-binding domain. The possible roles of WDR53 and the alternatively spliced CESA8 variant during cellulose biosynthesis in M.×domestica are discussed. PMID:23048131
1996-01-01
Mutations in the Caenorhabditis elegans gene unc-89 result in nematodes having disorganized muscle structure in which thick filaments are not organized into A-bands, and there are no M-lines. Beginning with a partial cDNA from the C. elegans sequencing project, we have cloned and sequenced the unc-89 gene. An unc-89 allele, st515, was found to contain an 84-bp deletion and a 10-bp duplication, resulting in an in- frame stop codon within predicted unc-89 coding sequence. Analysis of the complete coding sequence for unc-89 predicts a novel 6,632 amino acid polypeptide consisting of sequence motifs which have been implicated in protein-protein interactions. UNC-89 begins with 67 residues of unique sequences, SH3, dbl/CDC24, and PH domains, 7 immunoglobulins (Ig) domains, a putative KSP-containing multiphosphorylation domain, and ends with 46 Ig domains. A polyclonal antiserum raised to a portion of unc-89 encoded sequence reacts to a twitchin-sized polypeptide from wild type, but truncated polypeptides from st515 and from the amber allele e2338. By immunofluorescent microscopy, this antiserum localizes to the middle of A-bands, consistent with UNC-89 being a structural component of the M-line. Previous studies indicate that myofilament lattice assembly begins with positional cues laid down in the basement membrane and muscle cell membrane. We propose that the intracellular protein UNC-89 responds to these signals, localizes, and then participates in assembling an M-line. PMID:8603916
Zhu, Ruo-Lin; Lei, Xiao-Ying; Ke, Fei; Yuan, Xiu-Ping; Zhang, Qi-Ya
2011-02-01
Genomic sequence of Scophthalmus maximus rhabdovirus (SMRV) isolated from diseased turbot has been characterized. The complete genome of SMRV comprises 11,492 nucleotides and encodes five typical rhabdovirus genes N, P, M, G and L. In addition, two open reading frames (ORF) are predicted overlapping with P gene, one upstream of P and smaller than P (temporarily called Ps), and another in P gene which may encodes a protein similar to the vesicular stomatitis virus C protein. The C ORF is contained within the P ORF. The five typical proteins share the highest sequence identities (48.9%) with the corresponding proteins of rhabdoviruses in genus Vesiculovirus. Phylogenetic analysis of partial L protein sequence indicates that SMRV is close to genus Vesiculovirus. The first 13 nucleotides at the ends of the SMRV genome are absolutely inverse complementarity. The gene junctions between the five genes show conserved polyadenylation signal (CATGA(7)) and intergenic dinucleotide (CT) followed by putative transcription initiation sequence A(A/G)(C/G)A(A/G/T), which are different from known rhabdoviruses. The entire Ps ORF was cloned and expressed, and used to generate polyclonal antibody in mice. One obvious band could be detected in SMRV-infected carp leucocyte cells (CLCs) by anti-Ps/C serum via Western blot, and the subcellular localization of Ps-GFP fusion protein exhibited cytoplasm distribution as multiple punctuate or doughnut shaped foci of uneven size. Copyright © 2010 Elsevier B.V. All rights reserved.
Identification of phosphorylation sites in the nucleocapsid protein (N protein) of SARS-coronavirus
NASA Astrophysics Data System (ADS)
Lin, Liang; Shao, Jianmin; Sun, Maomao; Liu, Jinxiu; Xu, Gongjin; Zhang, Xumin; Xu, Ningzhi; Wang, Rong; Liu, Siqi
2007-12-01
After decoding the genome of SARS-coronavirus (SARS-CoV), next challenge is to understand how this virus causes the illness at molecular bases. Of the viral structural proteins, the N protein plays a pivot role in assembly process of viral particles as well as viral replication and transcription. The SARS-CoV N proteins expressed in the eukaryotes, such as yeast and HEK293 cells, appeared in the multiple spots on two-dimensional electrophoresis (2DE), whereas the proteins expressed in E. coli showed a single 2DE spotE These 2DE spots were further examined by Western blot and MALDI-TOF/TOF MS, and identified as the N proteins with differently apparent pI values and similar molecular mass of 50 kDa. In the light of the observations and other evidences, a hypothesis was postulated that the SARS-CoV N protein could be phosphorylated in eukaryotes. To locate the plausible regions of phosphorylation in the N protein, two truncated N proteins were generated in E. coli and treated with PKC[alpha]. The two truncated N proteins after incubation of PKC[alpha] exhibited the differently electrophoretic behaviors on 2DE, suggesting that the region of 1-256 aa in the N protein was the possible target for PKC[alpha] phosphorylation. Moreover, the SARS-CoV N protein expressed in yeast were partially digested with trypsin and carefully analyzed by MALDI-TOF/TOF MS. In contrast to the completely tryptic digestion, these partially digested fragments generated two new peptide mass signals with neutral loss, and MS/MS analysis revealed two phosphorylated peptides located at the "dense serine" island in the N protein with amino acid sequences, GFYAEGSRGGSQASSRSSSR and GNSGNSTPGSSRGNSPARMASGGGK. With the PKC[alpha] phosphorylation treatment and the partially tryptic digestion, the N protein expressed in E. coli released the same peptides as observed in yeast cells. Thus, this investigation provided the preliminary data to determine the phosphorylation sites in the SARS-CoV N protein, and partially clarified the argument regarding the phosphorylation possibility of the N protein during the infection process of SARS-CoV to human host.
Amino acid sequence of the Amur tiger prion protein.
Wu, Changde; Pang, Wanyong; Zhao, Deming
2006-10-01
Prion diseases are fatal neurodegenerative disorders in human and animal associated with conformational conversion of a cellular prion protein (PrP(C)) into the pathologic isoform (PrP(Sc)). Various data indicate that the polymorphisms within the open reading frame (ORF) of PrP are associated with the susceptibility and control the species barrier in prion diseases. In the present study, partial Prnp from 25 Amur tigers (tPrnp) were cloned and screened for polymorphisms. Four single nucleotide polymorphisms (T423C, A501G, C511A, A610G) were found; the C511A and A610G nucleotide substitutions resulted in the amino acid changes Lysine171Glutamine and Alanine204Threoine, respectively. The tPrnp amino acid sequence is similar to house cat (Felis catus ) and sheep, but differs significantly from other two cat Prnp sequences that were previously deposited in GenBank.
Identification of a new phospholipase D in Carica papaya latex.
Abdelkafi, Slim; Abousalham, Abdelkarim; Fendri, Imen; Ogata, Hiroyuki; Barouh, Nathalie; Fouquet, Benjamin; Scheirlinckx, Frantz; Villeneuve, Pierre; Carrière, Frédéric
2012-05-15
Phospholipase D (PLD) is a lipolytic enzyme involved in signal transduction, vesicle trafficking and membrane metabolism. It catalyzes the hydrolysis and transphosphatidylation of glycerophospholipids at the terminal phosphodiester bond. The presence of a PLD in the latex of Carica papaya (CpPLD1) was demonstrated by transphosphatidylation of phosphatidylcholine (PtdCho) in the presence of 2% ethanol. Although the protein could not be purified to homogeneity due to its presence in high molecular mass aggregates, a protein band was separated by SDS-PAGE after SDS/chloroform-methanol/TCA-acetone extraction of the latex insoluble fraction. This material was digested with trypsin and the amino acid sequences of the tryptic peptides were determined by micro-LC/ESI/MS/MS. These sequences were used to identify a partial cDNA (723 bp) from expressed sequence tags (ESTs) of C. papaya. Based upon EST sequences, a full-length gene was identified in the genome of C. papaya, with an open reading frame of 2424 bp encoding a protein of 808 amino acid residues, with a theoretical molecular mass of 92.05 kDa. From sequence analysis, CpPLD1 was identified as a PLD belonging to the plant phosphatidylcholine phosphatidohydrolase family. Copyright © 2012 Elsevier B.V. All rights reserved.
Sun, W; O'Connell, M; Speck, N A
1993-01-01
Mammalian type C retrovirus enhancer factor 1 (MCREF-1) is a nuclear protein that binds several directly repeated sequences (CNGGN6CNGG) in the Moloney and Friend murine leukemia virus (MLV) enhancers (N. R. Manley, M. O'Connell, W. Sun, N. A. Speck, and N. Hopkins, J. Virol. 67:1967-1975, 1993). In this paper, we describe the partial purification of MCREF-1 from calf thymus nuclei and further characterize the binding properties of MCREF-1. MCREF-1 binds four sites in the Moloney MLV enhancer and three sites in the Friend MLV enhancer. Ethylation interference analysis suggests that the MCREF-1 binding site spans two adjacent minor grooves of DNA. Images PMID:8445719
Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin
2016-07-01
The mitochondrial genome sequence of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length with partial A + T-rich region of this mitochondrial genome is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the pupative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of tnnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven, one, and another one protein-coding genes stop with termination codon TAA, TA, and T, respectively. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
Shimamoto, I; Sonoda, S; Vazquez, P; Minaka, N; Nishiguchi, M
1998-01-01
The 3' terminal 2378 nucleotides of a wasabi strain of crucifer tobamovirus (CTMV-W) infectious to crucifer plants was determined. This includes the 3' non-coding region of 235 nucleotides, coat protein (CP) gene (468 nucleotides), movement protein (MP) gene (798 nucleotides) and C-terminal partial readthrough portion of 180 K protein gene (940 nucleotides). Comparison of the sequence with homologous regions of thirteen other tobamovirus genomes showed that it had much higher identity to those of four other crucifer tobamoviruses, 85.2% to cr-TMV and turnip vein-clearing virus (TVCV), 87.4% to oilseed rape mosaic virus (ORMV) and 87.1% to TMV-Cg, than to those of other tobamoviruses. Thus CTMV-W was most similar to ORMV and TMV-Cg in sequence, but only marginally so, whereas the location and size of its MP gene was the same as cr-TMV amd TVCV. These results, together with other analyses, show that CTMV-W is a new crucifer tobamovirus, that the five crucifer tobamoviruses can be classified into two subgroups based on MP gene organization, and that the rate of sequence change is not the same in all lineages.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geraghty, M.T.; Stetten, G.; Kearns, W.
1994-09-01
X-linked adrenoleukodystrophy (ALD) is a disorder of peroxisomal {beta}-oxidation of very long chain fatty acids. It presents either as progressive dementia in childhood or as progressive paraparesis in later years. Adrenal insufficiency occurs in both phenotypes. The gene of the ALD protein has been mapped to Xq28 and has recently been cloned and characterized. The ALD protein has significant homology to the peroxisomal membrane protein, PMP70 and belongs to the ATP binding cassette superfamily of transporters. We screened a human genomic library with an ALDP cDNA and isolated 5 different but highly similar clones containing sequences corresponding to the 3{prime}more » end of the ALDP gene. Comparison of the sequences over the region corresponding to exon 9 through the 3{prime} end of the ALDP gene reveals {approximately}96% nucleotide identity in both exonic and intronic regions. Splice sites and open reading frames are maintained. Using both FISH and human-rodent DNA mapping panels, we positively assign these ALDP-related sequences to chromosomes 2, 16 and 22, and provisionally to 1 and 20. Southern blot of primate DNA probed with a partial ALDP cDNA (exon 2-10) shows that expansion of ALDP-related sequences occurred in higher primates (chimp, gorilla and human). Although Northern blots show multiple ALDP-hybridizing transcripts in certain tissues, we have no evidence to date for expression of these ALDP-related sequences. In conclusion, our data show there has been an unusual and recent dispersal to multiple chromosomes of structural gene sequences related to the ALDP gene. The functional significance of these sequences remains to be determined but their existence complicates PCR and mutation analysis of the ALDP gene.« less
Ahmed, Nisar; Riaz, Adeel; Zubair, Zahra; Saqib, Muhammad; Ijaz, Sehrish; Nawaz-Ul-Rehman, Muhammad Shah; Al-Qahtani, Ahmed; Mubin, Muhammad
2018-03-15
The infection in dogs due to canine parvovirus (CPV), is a highly contagious one with high mortality rate. The present study was undertaken for a detailed genetic analysis of partial VP2 gene i.e., 630 bp isolated from rectal swab samples of infected domestic and stray dogs from all areas of district Faisalabad. Monitoring of viruses is important, as continuous prevalence of viral infection might be associated with emergence of new virulent strains. In the present study, 40 rectal swab samples were collected from diarrheic dogs from different areas of district Faisalabad, Pakistan, in 2014-15 and screened for the presence of CPV by immunochromatography. Most of these dogs were stray dogs showing symptoms of diarrhea. Viral DNA was isolated and partial VP2 gene was amplified using gene specific primer pair Hfor/Hrev through PCR. Amplified fragments were cloned in pTZ57R/T (Fermentas) and completely sequenced. Sequences were analyzed and assembled by the Lasergene DNA analysis package (v8; DNAStar Inc., Madison, WI, USA). The results with immunochromatography showed that 33/40 (82%) of dogs were positive for CPV. We were able to amplify a fragment of 630 bp from 25 samples. In 25 samples the sequences of CPV-2a were detected showing the amino acid substitution Ser297Ala and presence of amino acid (426-Asn) in partial VP2 protein. Interestingly the BLAST analysis showed the of feline panleukopenia virus (FPV) sequences in 3 samples which were already positive for new CPV-2a, with 99% sequence homology to other FPV sequences present in GenBank. Phylogenetic analysis showed clustering of partial CPV-VP-2 gene with viruses from China, India, Japan and Uruguay identifying a new variant, whereas the 3 FPV sequences showed immediate ancestral relationship with viruses from Portugal, South Africa and USA. Interesting observation was that CPV are clustering away from the commercial vaccine strains. In this work we provide a better understanding of CPV prevailing in Pakistan at molecular level. The detection of FPV could be a case of real co-infection or a case of dual presence, due to ingestion of contaminated food.
Berben, Tom; Sorokin, Dimitry Y.; Ivanova, Natalia; ...
2015-10-26
Thioalkalivibrio thiocyanodenitrificans strain ARhD 1 T is a motile, Gram-negative bacterium isolated from soda lakes that belongs to the Gammaproteobacteria. It derives energy for growth and carbon fixation from the oxidation of sulfur compounds, most notably thiocyanate, and so is a chemolithoautotroph. It is capable of complete denitrification under anaerobic conditions. In addition, the draft genome sequence consists of 3,746,647 bp in 3 scaffolds, containing 3558 protein-coding and 121 RNA genes. T. thiocyanodenitrificans ARhD 1 T was sequenced as part of the DOE Joint Genome Institute Community Science Program.
Berben, Tom; Sorokin, Dimitry Y.; Ivanova, Natalia; ...
2015-10-26
Thioalkalivibrio thiocyanoxidans strain ARh 2 T is a sulfur-oxidizing bacterium isolated from haloalkaline soda lakes. It is a motile, Gram-negative member of the Gammaproteobacteria. Remarkable properties include the ability to grow on thiocyanate as the sole energy, sulfur and nitrogen source, and the capability of growth at salinities of up to 4.3 M total Na +. This draft genome sequence consists of 61 scaffolds comprising 2,765,337 bp, and contains 2616 protein-coding and 61 RNA-coding genes. In conclusion, this organism was sequenced as part of the Community Science Program of the DOE Joint Genome Institute.
Plaga, W; Lottspeich, F; Oesterhelt, D
1992-04-01
An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.
Definition of IgG- and albumin-binding regions of streptococcal protein G.
Akerström, B; Nielsen, E; Björck, L
1987-10-05
Protein G, the immunoglobin G-binding surface protein of group C and G streptococci, also binds serum albumin. The albumin-binding site on protein G is distinct from the immunoglobulin G-binding site. By mild acid hydrolysis of the papain-liberated protein G fragment (35 kDa), a 28-kDa fragment was produced which retained full immunoglobulin G-binding activity (determined by Scatchard plotting) but had lost all albumin-binding capacity. A protein G (65 kDa), isolated after cloning and expression of the protein G gene in Escherichia coli, had comparable affinity to immunoglobulin G (5-10 X 10(10)M-1), but much higher affinity to albumin than the 35- and 28-kDa protein G fragments (31, 2.6, and 0 X 10(9)M-1, respectively). The amino-terminal amino acid sequences of the 65-, 35-, and 28-kDa fragments allowed us to exactly locate the three fragments in an overall sequence map of protein G, based on the partial gene sequences published by Guss et al. (Guss, B., Eliasson, M., Olsson, A., Uhlen, M., Frej, A.-K., Jörnvall, H., Flock, J.-I., and Lindberg, M. (1986) EMBO J. 5, 1567-1575) and Fahnestock et al. (Fahnestock, S. R., Alexander, P., Nagle, J., and Filpula, D. (1986) J. Bacteriol. 167, 870-880). In this map could then be deduced the location of three homologous albumin-binding regions and three homologous immunoglobulin G-binding regions.
Cloning and sequencing of the cDNA species for mammalian dimeric dihydrodiol dehydrogenases.
Arimitsu, E; Aoki, S; Ishikura, S; Nakanishi, K; Matsuura, K; Hara, A
1999-01-01
Cynomolgus and Japanese monkey kidneys, dog and pig livers and rabbit lens contain dimeric dihydrodiol dehydrogenase (EC 1.3.1.20) associated with high carbonyl reductase activity. Here we have isolated cDNA species for the dimeric enzymes by reverse transcriptase-PCR from human intestine in addition to the above five animal tissues. The amino acid sequences deduced from the monkey, pig and dog cDNA species perfectly matched the partial sequences of peptides digested from the respective enzymes of these animal tissues, and active recombinant proteins were expressed in a bacterial system from the monkey and human cDNA species. Northern blot analysis revealed the existence of a single 1.3 kb mRNA species for the enzyme in these animal tissues. The human enzyme shared 94%, 85%, 84% and 82% amino acid identity with the enzymes of the two monkey strains (their sequences were identical), the dog, the pig and the rabbit respectively. The sequences of the primate enzymes consisted of 335 amino acid residues and lacked one amino acid compared with the other animal enzymes. In contrast with previous reports that other types of dihydrodiol dehydrogenase, carbonyl reductases and enzymes with either activity belong to the aldo-keto reductase family or the short-chain dehydrogenase/reductase family, dimeric dihydrodiol dehydrogenase showed no sequence similarity with the members of the two protein families. The dimeric enzyme aligned with low degrees of identity (14-25%) with several prokaryotic proteins, in which 47 residues are strictly or highly conserved. Thus dimeric dihydrodiol dehydrogenase has a primary structure distinct from the previously known mammalian enzymes and is suggested to constitute a novel protein family with the prokaryotic proteins. PMID:10477285
Genetic variation in potential Giardia vaccine candidates cyst wall protein 2 and α1-giardin.
Radunovic, Matej; Klotz, Christian; Saghaug, Christina Skår; Brattbakk, Hans-Richard; Aebischer, Toni; Langeland, Nina; Hanevik, Kurt
2017-08-01
Giardia is a prevalent intestinal parasitic infection. The trophozoite structural protein a1-giardin (a1-g) and the cyst protein cyst wall protein 2 (CWP2) have shown promise as Giardia vaccine antigen candidates in murine models. The present study assesses the genetic diversity of a1-g and CWP2 between and within assemblages A and B in human clinical isolates. a1-g and CWP2 sequences were acquired from 15 Norwegian isolates by PCR amplification and 20 sequences from German cultured isolates by whole genome sequencing. Sequences were aligned to reference genomes from assemblage A2 and B to identify genetic variance. Genetic diversity was found between assemblage A and B reference sequences for both a1-g (90.8% nucleotide identity) and CWP2 (82.5% nucleotide identity). However, for a1-g, this translated into only 3 amino acid (aa) substitutions, while for CWP2 there were 41 aa substitutions, and also one aa deletion. Genetic diversity within assemblage B was larger; nucleotide identity 92.0% for a1-g and 94.3% for CWP2, than within assemblage A (nucleotide identity 99.0% for a1-g and 99.7% for CWP2). For CWP2, the diversity on both nucleotide and protein level was higher in the C-terminal end. Predicted antigenic epitopes were not affected for a1-g, but partially for CWP2. Despite genetic diversity in a1-g, we found aa sequence, characteristics, and antigenicity to be well preserved. CWP2 showed more aa variance and potential antigenic differences. Several CWP2 antigens might be necessary in a future Giardia vaccine to provide cross protection against both Giardia assemblages infecting humans.
Evolutionary trend toward kinetic stability in the folding trajectory of RNases H
Lim, Shion A.; Hart, Kathryn M.; Marqusee, Susan
2016-01-01
Proper folding of proteins is critical to producing the biological machinery essential for cellular function. The rates and energetics of a protein’s folding process, which is described by its energy landscape, are encoded in the amino acid sequence. Over the course of evolution, this landscape must be maintained such that the protein folds and remains folded over a biologically relevant time scale. How exactly a protein’s energy landscape is maintained or altered throughout evolution is unclear. To study how a protein’s energy landscape changed over time, we characterized the folding trajectories of ancestral proteins of the ribonuclease H (RNase H) family using ancestral sequence reconstruction to access the evolutionary history between RNases H from mesophilic and thermophilic bacteria. We found that despite large sequence divergence, the overall folding pathway is conserved over billions of years of evolution. There are robust trends in the rates of protein folding and unfolding; both modern RNases H evolved to be more kinetically stable than their most recent common ancestor. Finally, our study demonstrates how a partially folded intermediate provides a readily adaptable folding landscape by allowing the independent tuning of kinetics and thermodynamics. PMID:27799545
Complete genome analysis of jasmine virus T from Jasminum sambac in China.
Tang, Yajun; Gao, Fangluan; Yang, Zhen; Wu, Zujian; Yang, Liang
2016-07-01
The genome of a potyvirus (isolate JaVT_FZ) recovered from jasmine (Jasminum sambac L.) showing yellow ringspot symptoms in Fuzhou, China, was sequenced. JaVT_FZ is closely related to seven other potyviruses with completely sequenced genomes, with which it shares 66-70 % nucleotide and 52-56 % amino acid sequence identity. However, the coat protein (CP) gene shares 82-92 % nucleotide and 90-97 % amino acid sequence identity with those of two partially sequenced potyviruses, named jasmine potyvirus T (JaVT-jasmine) and jasmine yellow mosaic potyvirus (JaYMV-India), respectively. This suggests that JaVT_FZ, JaVT-jasmine and JaYMV-India should be regarded as members of a single potyvirus species, for which the name "Jasmine virus T" has priority.
Syed, Muhammad Ibrahim; Moorthy, Balaji T; Jenner, Andreas; Fetka, Ingrid; Jansen, Ralf-Peter
2018-05-17
Localization of mRNAs depends on specific RNA-binding proteins (RBPs) and critically contributes not only to cell polarization but also to basal cell function. The yeast RBP Khd1p binds to several hundred mRNAs, the majority of which encodes secreted or membrane proteins. We demonstrate that a subfraction of Khd1p associates with artificial liposomes and endoplasmic reticulum (ER), and that Khd1p endomembrane association is partially dependent on its binding to RNA. ER targeting of at least two mRNAs, MID2 and SLG1/WSC1, requires KHD1 but is independent of their translation. Together, our results suggest interdependence of Khd1p and mRNA for their targeting to the ER and presents additional evidence for signal sequence-independent, RBP-mediated mRNA targeting. © 2018 Federation of European Biochemical Societies.
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Conformational Entropy of Intrinsically Disordered Proteins from Amino Acid Triads
Baruah, Anupaul; Rani, Pooja; Biswas, Parbati
2015-01-01
This work quantitatively characterizes intrinsic disorder in proteins in terms of sequence composition and backbone conformational entropy. Analysis of the normalized relative composition of the amino acid triads highlights a distinct boundary between globular and disordered proteins. The conformational entropy is calculated from the dihedral angles of the middle amino acid in the amino acid triad for the conformational ensemble of the globular, partially and completely disordered proteins relative to the non-redundant database. Both Monte Carlo (MC) and Molecular Dynamics (MD) simulations are used to characterize the conformational ensemble of the representative proteins of each group. The results show that the globular proteins span approximately half of the allowed conformational states in the Ramachandran space, while the amino acid triads in disordered proteins sample the entire range of the allowed dihedral angle space following Flory’s isolated-pair hypothesis. Therefore, only the sequence information in terms of the relative amino acid triad composition may be sufficient to predict protein disorder and the backbone conformational entropy, even in the absence of well-defined structure. The predicted entropies are found to agree with those calculated using mutual information expansion and the histogram method. PMID:26138206
Delineating slowly and rapidly evolving fractions of the Drosophila genome.
Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S
2008-05-01
Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.
NASA Astrophysics Data System (ADS)
Sun, S. M.; Slightom, J. L.; Hall, T. C.
1981-01-01
A plant gene coding for the major storage protein (phaseolin, G1-globulin) of the French bean was isolated from a genomic library constructed in the phage vector Charon 24A. Comparison of the nucleotide sequence of part of the gene with that of the cloned messenger RNA (cDNA) revealed the presence of three intervening sequences, all beginning with GTand ending with AG. The 5' and 3' boundaries of intervening sequences TVS-A (88 base pairs) and IVS-B (124 base pairs) are similar to those described for animal and viral genes, but the 3' boundary of IVS-C (129 base pairs) shows some differences. A sequence of 185 amino acids deduced from the cloned DMAs represents about 40% of a phaseolin polypeptide.
Costa, Caroline B; Monteiro, Karina M; Teichmann, Aline; da Silva, Edileuza D; Lorenzatto, Karina R; Cancela, Martín; Paes, Jéssica A; Benitz, André de N D; Castillo, Estela; Margis, Rogério; Zaha, Arnaldo; Ferreira, Henrique B
2015-08-01
The histone chaperone SET/TAF-Iβ is implicated in processes of chromatin remodelling and gene expression regulation. It has been associated with the control of developmental processes, but little is known about its function in helminth parasites. In Mesocestoides corti, a partial cDNA sequence related to SET/TAF-Iβ was isolated in a screening for genes differentially expressed in larvae (tetrathyridia) and adult worms. Here, the full-length coding sequence of the M. corti SET/TAF-Iβ gene was analysed and the encoded protein (McSET/TAF) was compared with orthologous sequences, showing that McSET/TAF can be regarded as a SET/TAF-Iβ family member, with a typical nucleosome-assembly protein (NAP) domain and an acidic tail. The expression patterns of the McSET/TAF gene and protein were investigated during the strobilation process by RT-qPCR, using a set of five reference genes, and by immunoblot and immunofluorescence, using monospecific polyclonal antibodies. A gradual increase in McSET/TAF transcripts and McSET/TAF protein was observed upon development induction by trypsin, demonstrating McSET/TAF differential expression during strobilation. These results provided the first evidence for the involvement of a protein from the NAP family of epigenetic effectors in the regulation of cestode development.
Yoshida, Naoto; Shimura, Hanako; Masuta, Chikara
2018-06-01
Allexiviruses are economically important garlic viruses that are involved in garlic mosaic diseases. In this study, we characterized the allexivirus cysteine-rich protein (CRP) gene located just downstream of the coat protein (CP) gene in the viral genome. We determined the nucleotide sequences of the CP and CRP genes from numerous allexivirus isolates and performed a phylogenetic analysis. According to the resulting phylogenetic tree, we found that allexiviruses were clearly divided into two major groups (group I and group II) based on the sequences of the CP and CRP genes. In addition, the allexiviruses in group II had distinct sequences just before the CRP gene, while group I isolates did not. The inserted sequence between the CP and CRP genes was partially complementary to garlic 18S rRNA. Using a potato virus X vector, we showed that the CRPs affected viral accumulation and symptom induction in Nicotiana benthamiana, suggesting that the allexivirus CRP is a pathogenicity determinant. We assume that the inserted sequences before the CRP gene may have been generated during viral evolution to alter the termination-reinitiation mechanism for coupled translation of CP and CRP.
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.
Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J
1999-01-01
Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
DOE R&D Accomplishments Database
Chandonia, John-Marc; Hon, Gary; Walker, Nigel S.; Lo Conte, Loredana; Koehl, Patrice; Levitt, Michael; Brenner, Steven E.
2003-09-15
The ASTRAL compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences. Partially derived from the SCOP database of protein structure domains, it includes sequences for each domain and other resources useful for studying these sequences and domain structures. The current release of ASTRAL contains 54,745 domains, more than three times as many as the initial release four years ago. ASTRAL has undergone major transformations in the past two years. In addition to several complete updates each year, ASTRAL is now updated on a weekly basis with preliminary classifications of domains from newly released PDB structures. These classifications are available as a stand-alone database, as well as available integrated into other ASTRAL databases such as representative subsets. To enhance the utility of ASTRAL to structural biologists, all SCOP domains are now made available as PDB-style coordinate files as well as sequences. In addition to sequences and representative subsets based on SCOP domains, sequences and subsets based on PDB chains are newly included in ASTRAL. Several search tools have been added to ASTRAL to facilitate retrieval of data by individual users and automated methods.
Yu, Y X; Béarzotti, M; Vende, P; Ahne, W; Brémont, M
1999-09-01
Iridovirus-like pathogens have been recognized as a cause of serious systemic diseases among feral, cultured and ornamental fish in the recent years. Mortalities of fish due to systemic iridovirus infection reaching 30-100% were observed in Europe, Australia, Japan and Thailand. Up to now, the molecular biology of these important pathogens has been poorly documented. To get better insights on the genomic organization of these piscine iridoviruses, we have constructed a cosmid viral DNA library from the epizootic hematopoietic necrosis virus (EHNV). Two recombinant cosmids (Cos7 and Cos12) have been selected for systematic sequencing. Cos7 and 12 are localized side by side along the genome and cover the 2/3 part of the total EHNV genome which has been estimated to be approximately 101.47 kb in length. Thirty five kilobase pairs (kbps) from Cos7 and 10 kbps from Cos12 have been determined. Sequence analysis revealed open reading frames (ORF) sharing homologies with sequences from the Frog virus 3 such as the p31 and p40 proteins. Among the others identified ORFs, some of them presented homologies with known protein sequences, such as the human eIF2alpha protein, and some did not show any significant homologies with sequences available in the databases. But, none were related to Lymphocystis virus, a member of the Iridoviridae family, for which the full genome nucleotide sequence has been determined.
Yadav, Sushila; Tomar, Anil Kumar; Jithesh, O; Khan, Meraj Alam; Yadav, R N; Srinivasan, A; Singh, Tej P; Yadav, Savita
2011-12-01
The watermelon (Citrullus lanatus) seeds are highly nutritive and contain large amount of proteins and many beneficial minerals such as magnesium, calcium, potassium, iron, phosphorous, zinc etc. In various parts of the world, C. lanatus seed extracts are used to cure cancer, cardiovascular diseases, hypertension, and blood pressure. C. lanatus seed extracts are also used as home remedy for edema and urinary tract problems. In this study, we isolated protein fraction of C. lanatus seeds using various protein separation methods. We successfully purified a low molecular weight vicilin-like glycoprotein using chromatographic methods followed by SDS-PAGE and MALDI-TOF/MS identification. This is the first report of purification of a vicilin like polypeptide from C. lanatus seeds. In next step, we extracted mRNA from immature seeds and reverse transcribed it using suitable forward and reverse primers for purified glycoprotein. The PCR product was analysed on 1% agarose gel and was subsequently sequenced by Dideoxy DNA sequencing method. An amino acid translation of the gene is in agreement with amino acid sequences of the identified peptides.
Impact of extracellularity on the evolutionary rate of mammalian proteins.
Liao, Ben-Yang; Weng, Meng-Pin; Zhang, Jianzhi
2010-01-06
It is of fundamental importance to understand the determinants of the rate of protein evolution. Eukaryotic extracellular proteins are known to evolve faster than intracellular proteins. Although this rate difference appears to be due to the lower essentiality of extracellular proteins than intracellular proteins in yeast, we here show that, in mammals, the impact of extracellularity is independent from the impact of gene essentiality. Our partial correlation analysis indicated that the impact of extracellularity on mammalian protein evolutionary rate is also independent from those of tissue-specificity, expression level, gene compactness, and the number of protein-protein interactions and, surprisingly, is the strongest among all the factors we examined. Similar results were also found from principal component regression analysis. Our findings suggest that different rules govern the pace of protein sequence evolution in mammals and yeasts.
Abdelkafi, Slim; Ogata, Hiroyuki; Barouh, Nathalie; Fouquet, Benjamin; Lebrun, Régine; Pina, Michel; Scheirlinckx, Frantz; Villeneuve, Pierre; Carrière, Frédéric
2009-11-01
An esterase (CpEst) showing high specific activities on tributyrin and short chain vinyl esters was obtained from Carica papaya latex after an extraction step with zwitterionic detergent and sonication, followed by gel filtration chromatography. Although the protein could not be purified to complete homogeneity due to its presence in high molecular mass aggregates, a major protein band with an apparent molecular mass of 41 kDa was obtained by SDS-PAGE. This material was digested with trypsin and the amino acid sequences of the tryptic peptides were determined by LC/ESI/MS/MS. These sequences were used to identify a partial cDNA (679 bp) from expressed sequence tags (ESTs) of C. papaya. Based upon EST sequences, a full-length gene was identified in the genome of C. papaya, with an open reading frame of 1029 bp encoding a protein of 343 amino acid residues, with a theoretical molecular mass of 38 kDa. From sequence analysis, CpEst was identified as a GDSL-motif carboxylester hydrolase belonging to the SGNH protein family and four potential N-glycosylation sites were identified. The putative catalytic triad was localised (Ser(35)-Asp(307)-His(310)) with the nucleophile serine being part of the GDSL-motif. A 3D-model of CpEst was built from known X-ray structures and sequence alignments and the catalytic triad was found to be exposed at the surface of the molecule, thus confirming the results of CpEst inhibition by tetrahydrolipstatin suggesting a direct accessibility of the inhibitor to the active site.
Homology of aspartyl- and lysyl-tRNA synthetases.
Gampel, A; Tzagoloff, A
1989-01-01
The yeast nuclear gene MSD1 coding for mitochondrial aspartyl-tRNA synthetase has been cloned and sequenced. The identity of the gene is confirmed by the following evidence. (i) The primary structure of the protein derived from the gene sequence is similar to that of the yeast cytoplasmic aspartyl-tRNA synthetase. (ii) In situ disruption of MSD1 in a respiratory-competent haploid strain of yeast induces a pleiotropic phenotype consistent with a lesion in mitochondrial protein synthesis. (iii) Mitochondria from a mutant with a disrupted chromosomal copy of MSD1 are unable to acylate mitochondrial aspartyl-tRNA. The primary structures of the cytoplasmic and mitochondrial aspartyl-tRNA synthetases are similar to the yeast cytoplasmic lysyl-tRNA synthetase, suggesting that the two types of synthetases may have a common evolutionary origin. Searches of the current protein banks also have revealed a high degree of sequence similarity of the lysyl-tRNA synthetase to the product of the Escherichia coli herC gene and to the partial sequence of a protein encoded by an unidentified reading frame located adjacent to the E. coli frdA gene. Based on the sequence similarities and the map positions of the herC and frdA loci, we propose herC to be the structural gene of the constitutively expressed lysyl-tRNA synthetase of E. coli and the unidentified reading frame to be the structural gene of the heat-inducible lysyl-tRNA synthetase. Images PMID:2668951
Elkins, Christopher A.; Savage, Dwayne C.
1998-01-01
Cytosolic extracts of Lactobacillus johnsonii 100-100 (previously reported as Lactobacillus sp. strain 100-100) contain four heterotrimeric isozymes composed of two peptides, α and β, with conjugated bile salt hydrolase (BSH) activity. We now report cloning, from the genome of strain 100-100, a 2,977-bp DNA segment that expresses BSH activity in Escherichia coli. The sequencing of this segment showed that it contained one complete and two partial open reading frames (ORFs). The 3′ partial ORF (927 nucleotides) was predicted by BLAST and confirmed with 5′ and 3′ deletions to be a BSH gene. Thermal asymmetric interlaced PCR was used to extend and complete the 948-nucleotide sequence of the BSH gene 3′ of the cloned segment. The predicted amino acid sequence of the 5′ partial ORF (651 nucleotides) was about 80% similar to the C-terminal half of the largest, complete ORF (1,353 nucleotides), and these two putative proteins were similar to several amine, multidrug resistance, and sugar transport proteins of the major facilitator superfamily. E. coli DH5α cells transformed with a construct containing these ORFs, in concert with an extracellular factor produced by strain 100-100, demonstrated levels of uptake of [14C]taurocholic acid that were increased as much as threefold over control levels. [14C]Cholic acid was taken up in similar amounts by strain DH5α pSportI (control) and DH5α p2000 (transport clones). These findings support a hypothesis that the ORFs are conjugated bile salt transport genes which may be arranged in an operon with BSH genes. PMID:9721268
Drosophila Nnf1 paralogs are partially redundant for somatic and germ line kinetochore function.
Blattner, Ariane C; Aguilar-Rodríguez, José; Kränzlin, Marcella; Wagner, Andreas; Lehner, Christian F
2017-02-01
Kinetochores allow attachment of chromosomes to spindle microtubules. Moreover, they host proteins that permit correction of erroneous attachments and prevent premature anaphase onset before bi-orientation of all chromosomes in metaphase has been achieved. Kinetochores are assembled from subcomplexes. Kinetochore proteins as well as the underlying centromere proteins and the centromeric DNA sequences evolve rapidly despite their fundamental importance for faithful chromosome segregation during mitotic and meiotic divisions. During evolution of Drosophila melanogaster, several centromere proteins were lost and a recent gene duplication has resulted in two Nnf1 paralogs, Nnf1a and Nnf1b, which code for alternative forms of a Mis12 kinetochore complex component. The rapid evolutionary divergence of centromere/kinetochore constituents in animals and plants has been proposed to be driven by an intragenome conflict resulting from centromere drive during female meiosis. Thus, a female meiosis-specific paralog might be expected to evolve rapidly under positive selection. While our characterization of the D. melanogaster Nnf1 paralogs hints at some partial functional specialization of Nnf1b for meiosis, we have failed to detect evidence for positive selection in our analysis of Nnf1 sequence evolution in the Drosophilid lineage. Neither paralog is essential, even though we find some clear differences in subcellular localization and expression during development. Loss of both paralogs results in developmental lethality. We therefore conclude that the two paralogs are still in early stages of differentiation.
Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling
2014-01-01
Background Sweet potato chlorotic stunt virus (family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected from China in 2011, only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926
Goldbaum, F A; Leoni, J; Wallach, J C; Fossati, C A
1993-01-01
Some anticytoplasmic protein monoclonal antibodies (MAbs) from mice immunized by infection with Brucella ovis cells have been obtained. One of these MAbs, BI24, was used to purify by immunoaffinity a protein with a pI of 5.6 and a molecular mass of 18 kDa. This protein was present in all of the rough and smooth Brucella species studied, but it could not be detected in Yersinia enterocolitica 09. Three internal peptides of this protein were partially sequenced; no homology with other bacterial proteins was found. The immunogenicity of the 18-kDa protein was studied with both human and bovine sera by a capture enzyme-linked immunosorbent assay system with MAb BI24. Images PMID:8370742
Goldbaum, F A; Leoni, J; Wallach, J C; Fossati, C A
1993-08-01
Some anticytoplasmic protein monoclonal antibodies (MAbs) from mice immunized by infection with Brucella ovis cells have been obtained. One of these MAbs, BI24, was used to purify by immunoaffinity a protein with a pI of 5.6 and a molecular mass of 18 kDa. This protein was present in all of the rough and smooth Brucella species studied, but it could not be detected in Yersinia enterocolitica 09. Three internal peptides of this protein were partially sequenced; no homology with other bacterial proteins was found. The immunogenicity of the 18-kDa protein was studied with both human and bovine sera by a capture enzyme-linked immunosorbent assay system with MAb BI24.
Conserved Sequences at the Origin of Adenovirus DNA Replication
Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.
1982-01-01
The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. Images PMID:7143575
Reicher, S; Seroussi, E; Weller, J I; Rosov, A; Gootwine, E
2012-07-01
Polymorphisms in mitochondrial DNA (mtDNA) protein- and tRNA-coding genes were shown to be associated with various diseases in humans as well as with production and reproduction traits in livestock. Alignment of full length mitochondria sequences from the 5 known ovine haplogroups: HA (n = 3), HB (n = 5), HC (n = 3), HD (n = 2), and HE (n = 2; GenBank accession nos. HE577847-50 and 11 published complete ovine mitochondria sequences) revealed sequence variation in 10 out of the 13 protein coding mtDNA sequences. Twenty-six of the 245 variable sites found in the protein coding sequences represent non-synonymous mutations. Sequence variation was observed also in 8 out of the 22 tRNA mtDNA sequences. On the basis of the mtDNA control region and cytochrome b partial sequences along with information on maternal lineages within an Afec-Assaf flock, 1,126 Afec-Assaf ewes were assigned to mitochondrial haplogroups HA, HB, and HC, with frequencies of 0.43, 0.43, and 0.14, respectively. Analysis of birth weight and growth rate records of lamb (n = 1286) and productivity from 4,993 lambing records revealed no association between mitochondrial haplogroup affiliation and female longevity, lambs perinatal survival rate, birth weight, and daily growth rate of lambs up to 150 d that averaged 1,664 d, 88.3%, 4.5 kg, and 320 g/d, respectively. However, significant (P < 0.0001) differences among the haplogroups were found for prolificacy of ewes, with prolificacies (mean ± SE) of 2.14 ± 0.04, 2.25 ± 0.04, and 2.30 ± 0.06 lamb born/ewe lambing for the HA, HB, and the HC haplogroups, respectively. Our results highlight the ovine mitogenome genetic variation in protein- and tRNA coding genes and suggest that sequence variation in ovine mtDNA is associated with variation in ewe prolificacy.
Valiadi, Martha; Iglesias-Rodriguez, Maria Debora
2014-01-01
Dinoflagellate bioluminescence systems operate with or without a luciferin binding protein, representing two distinct modes of light production. However, the distribution, diversity, and evolution of the luciferin binding protein gene within bioluminescent dinoflagellates are not well known. We used PCR to detect and partially sequence this gene from the heterotrophic dinoflagellate Noctiluca scintillans and a group of ecologically important gonyaulacoid species. We report an additional luciferin binding protein gene in N. scintillans which is not attached to luciferase, further to its typical combined bioluminescence gene. This supports the hypothesis that a profound re-organization of the bioluminescence system has taken place in this organism. We also show that the luciferin binding protein gene is present in the genera Ceratocorys, Gonyaulax, and Protoceratium, and is prevalent in bioluminescent species of Alexandrium. Therefore, this gene is an integral component of the standard molecular bioluminescence machinery in dinoflagellates. Nucleotide sequences showed high within-strain variation among gene copies, revealing a highly diverse gene family comprising multiple gene types in some organisms. Phylogenetic analyses showed that, in some species, the evolution of the luciferin binding protein gene was different from the organism's general phylogenies, highlighting the complex evolutionary history of dinoflagellate bioluminescence systems. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.
Freudl, R; Schwarz, H; Klose, M; Movva, N R; Henning, U
1985-12-16
Information, in addition to that provided by signal sequences, for translocation across the plasma membrane is thought to be present in exported proteins of Escherichia coli. Such information must also exist for the localization of such proteins. To determine the nature of this information, overlapping inframe deletions have been constructed in the ompA gene which codes for a 325-residue major outer membrane protein. In addition, one deletion, encoding only the NH2-terminal part of the protein up to residue 160, was prepared. The location of each product was determined by immunoelectron microscopy. Proteins missing residues 4-45, 43-84, 46-227, 86-227 or 160-325 of the mature protein were all efficiently translocated across the plasma membrane. The first two proteins were found in the outer membrane, the others in the periplasmic space. It has been proposed that export and sorting signals consist of relatively small amino acid sequences near the NH2 terminus of an outer membrane protein. On the basis of sequence homologies it has also been suggested that such proteins possess a common sorting signal. The locations of the partially deleted proteins described here show that a unique export signal does not exist in the OmpA protein. The proposed common sorting signal spans residues 1-14 of OmpA. Since this region is not essential for routing the protein, the existence of a common sorting signal is doubtful. It is suggested that information both for export (if existent) and localization lies within protein conformation which for the former process should be present repeatedly in the polypeptide.
Wong, Diane K.; Lee, Bai-Yu; Horwitz, Marcus A.; Gibson, Bradford W.
1999-01-01
Iron plays a critical role in the pathophysiology of Mycobacterium tuberculosis. To gain a better understanding of iron regulation by this organism, we have used two-dimensional (2-D) gel electrophoresis, mass spectrometry, and database searching to study protein expression in M. tuberculosis under conditions of high and low iron concentration. Proteins in cellular extracts from M. tuberculosis Erdman strain grown under low-iron (1 μM) and high-iron (70 μM) conditions were separated by 2-D polyacrylamide gel electrophoresis, which allowed high-resolution separation of several hundred proteins, as visualized by Coomassie staining. The expression of at least 15 proteins was induced, and the expression of at least 12 proteins was decreased under low-iron conditions. In-gel trypsin digestion was performed on these differentially expressed proteins, and the digestion mixtures were analyzed by matrix-assisted laser desorption ionization time-of-flight mass spectrometry to determine the molecular masses of the resulting tryptic peptides. Partial sequence data on some of the peptides were obtained by using after source decay and/or collision-induced dissociation. The fragmentation data were used to search computerized peptide mass and protein sequence databases for known proteins. Ten iron-regulated proteins were identified, including Fur and aconitase proteins, both of which are known to be regulated by iron in other bacterial systems. Our study shows that, where large protein sequence databases are available from genomic studies, the combined use of 2-D gel electrophoresis, mass spectrometry, and database searching to analyze proteins expressed under defined environmental conditions is a powerful tool for identifying expressed proteins and their physiologic relevance. PMID:9864233
Li, Peng; Bai, Juan; Li, Jun-xing; Zhang, Guo-long; Song, Yan-hua; Li, Yu-feng; Wang, Xian-wei; Jiang, Ping
2012-10-01
Haemophilus parasuis is the etiological agent of Glässer's disease characterized by fibrinous polyserositis, polyarthritis, and meningitis in young pigs. But it is difficult to develop universal serological diagnostic tools and effective vaccines against this disease because of the serovar diversity of the isolates. In this study, enterobacterial repetitive intergenic consensus-polymerase chain reaction, were performed to investigate the gene profile of 111 isolates of H. parasuis from China. And a specific common gene of H. parasuis was cloned and identified as the outer-membrane protein (OMP) P2 gene. Sequencing results of OMP P2 genes of 22 isolates showed that they had high homology and could be divided into 2 genetic types. Moreover, the OMPP2 protein was expressed in Escherichia coli expressing system. And the purified recombinant protein provided partial protection against H. parasuis infection in mice. It suggested the OMP P2 was an immunogenic protein and had great potential to serve as a vaccine and diagnostic antigen. Copyright © 2011 Elsevier Ltd. All rights reserved.
Dong, Hongjuan; Marchetti-Deschmann, Martina; Allmaier, Günter
2014-01-01
Traditionally characterization of microbial proteins is performed by a complex sequence of steps with the final step to be either Edman sequencing or mass spectrometry, which generally takes several weeks or months to be complete. In this work, we proposed a strategy for the characterization of tryptic peptides derived from Giberella zeae (anamorph: Fusarium graminearum) proteins in parallel to intact cell mass spectrometry (ICMS) in which no complicated and time-consuming steps were needed. Experimentally, after a simple washing treatment of the spores, the aliquots of the intact G. zeae macro conidia spores solution, were deposited two times onto one MALDI (matrix-assisted laser desorption ionization) mass spectrometry (MS) target (two spots). One spot was used for ICMS and the second spot was subject to a brief on-target digestion with bead-immobilized or non-immobilized trypsin. Subsequently, one spot was analyzed immediately by MALDI MS in the linear mode (ICMS) whereas the second spot containing the digested material was investigated by MALDI MS in the reflectron mode ("peptide mass fingerprint") followed by protonated peptide selection for MS/MS (post source decay (PSD) fragment ion) analysis. Based on the formed fragment ions of selected tryptic peptides a complete or partial amino acid sequence was generated by manual de novo sequencing. These sequence data were used for homology search for protein identification. Finally four different peptides of varying abundances have been identified successfully allowing the verification that our desorbed/ionized surface compounds were indeed derived from proteins. The presence of three different proteins could be found unambiguously. Interestingly, one of these proteins is belonging to the ribosomal superfamily which indicates that not only surface-associated proteins were digested. This strategy minimized the amount of time and labor required for obtaining deeper information on spore preparations within the nowadays widely used ICMS approach. Copyright © 2013 Elsevier Ltd. All rights reserved.
Molecular and biochemical characterization of tomato farnesyl-protein transferase.
Schmitt, D; Callan, K; Gruissem, W
1996-10-01
The prenylation of membrane-associated proteins involved in the regulation of eukaryotic cell growth and signal transduction is critically important for their subcellular localization and biological activity. In contrast to mammalian cells and yeast, however, the function of protein prenylation in plants is not well understood and only a few prenylated proteins have been identified. We partially purified and characterized farnesyl-protein transferase from tomato (Lycopersicon esculentum, LeFTase) to analyze its biochemical and molecular properties. Using Ras- and G gamma-specific peptide substrates and competition assays we showed that tomato protein extracts have both farnesyl-protein transferase and geranylgeranyl-protein transferase 1 activities. Compared with the heterologous synthetic peptide substrates, the plant-specific CaaX sequence of the ANJ1 protein is a less efficient substrate for LeFTase in vitro. LeFTase activity profiles and LeFTase beta-subunit protein (LeFTB) levels differ significantly in various tissues and are regulated during fruit development. Partially purified LeFTase requires Zn2+ and Mg2+ for enzymatic activity and has an apparent molecular mass of 100 kD Immunoprecipitation experiments using anti-alpha LeFTB antibodies confirmed that LeFTB is a component of LeFTase but not of tomato geranylgeranyl-protein transferase 1. Based on their conserved bio-chemical activities, we expect that prenyltransferases are likely integrated with the sterol biosynthesis pathway in the control of plant cell growth.
Origin and implications of zero degeneracy in networks spectra.
Yadav, Alok; Jalan, Sarika
2015-04-01
The spectra of many real world networks exhibit properties which are different from those of random networks generated using various models. One such property is the existence of a very high degeneracy at the zero eigenvalue. In this work, we provide all the possible reasons behind the occurrence of the zero degeneracy in the network spectra, namely, the complete and partial duplications, as well as their implications. The power-law degree sequence and the preferential attachment are the properties which enhances the occurrence of such duplications and hence leading to the zero degeneracy. A comparison of the zero degeneracy in protein-protein interaction networks of six different species and in their corresponding model networks indicates importance of the degree sequences and the power-law exponent for the occurrence of zero degeneracy.
Isolation of stress responsive Psb A gene from rice (Oryza sativa l.) using differential display.
Tyagi, Aruna; Chandra, Arti
2006-08-01
Differential display (DD) experiments were performed on drought-tolerant rice (Oryza sativa L.) genotype N22 to identify both upregulated and downregulated partial cDNAs with respect to moisture stress. DNA polymorphism was detected between drought-stressed and control leaf tissues on the DD gels. A partial cDNA showing differential expression, with respect to moisture stress was isolated from the gel. Northern blotting analysis was performed using this cDNA as a probe and it was observed that mRNA corresponding to this transcript was accumulated to high level in rice leaves under water deficit stress. At the DNA sequence level, the partial cDNA showed homology with psb A gene encoding for Dl protein.
The NPC2 protein: A novel dog allergen.
Khurana, Taruna; Newman-Lindsay, Shoshana; Young, Philip R; Slater, Jay E
2016-05-01
Dogs are an important source of indoor allergens that cause rhinoconjunctivitis, urticaria, and asthma in sensitized individuals. Can f 1 is reported as a major dog allergen, but other allergens have also been identified. Identification of immunologically important allergens is important for both the diagnosis and treatment of dog allergy. To identify and characterize the canine NPC2 protein, a novel dog allergen. We screened commercial and laboratory-generated aqueous dog extracts by 2-dimensional polyacrylamide gel electrophoresis with IgE immunoblotting using human serum samples from 71 dog-allergic individuals. A target of interest was excised from the gel and sequenced. Canine NPC2 sequence was generated, and recombinant proteins expressed in yeast and bacteria were used to determine allergenicity. An IgE enzyme-linked immunosorbent assay was used for screening 71 dog-positive and 30 dog-negative serum samples. A 16-kDa protein (pK = 8.5) in dog allergen extracts was recognized by specific IgE. The protein was identified by sequencing as a CE1 protein or NPC2 protein. Human IgE bound to recombinant protein was expressed in both yeast and bacteria. Ten (14%) of 71 individuals had specific IgE to NPC2 protein from bacteria, and 12 (17%) had IgE to NPC2 protein from yeast. Binding of pooled dog-allergic serum IgE to the dust mite protein Der p 2 was partially inhibited by recombinant NPC2 protein. NPC2 protein, a member of the MD-2-related lipid recognition family, is identified as a dog allergen (Can f 7), with an apparent seroprevalence of 10% to 20%. Published by Elsevier Inc.
Sequencing of cDNA Clones from the Genetic Map of Tomato (Lycopersicon esculentum)
Ganal, Martin W.; Czihal, Rosemarie; Hannappel, Ulrich; Kloos, Dorothee-U.; Polley, Andreas; Ling, Hong-Qing
1998-01-01
The dense RFLP linkage map of tomato (Lycopersicon esculentum) contains >300 anonymous cDNA clones. Of those clones, 272 were partially or completely sequenced. The sequences were compared at the DNA and protein level to known genes in databases. For 57% of the clones, a significant match to previously described genes was found. The information will permit the conversion of those markers to STS markers and allow their use in PCR-based mapping experiments. Furthermore, it will facilitate the comparative mapping of genes across distantly related plant species by direct comparison of DNA sequences and map positions. [cDNA sequence data reported in this paper have been submitted to the EMBL database under accession nos. AA824695–AA825005 and the dbEST_Id database under accession nos. 1546519–1546862.] PMID:9724330
Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick
2003-07-01
We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC rich genomes, the gene model used in FrameD also allows to predict genes in the presence of frameshifts and partially undetermined sequences which makes it also very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output. Its performances are evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation and the ability to learn new models for new organisms.
Tetteh, Kevin K A; Conway, David J
2011-10-13
Merozoite surface protein 1 (MSP1) of Plasmodium falciparum has been implicated as an important target of acquired immunity, and candidate components for a vaccine include polymorphic epitopes in the N-terminal polymorphic block 2 region. We designed a polyvalent hybrid recombinant protein incorporating sequences of the three major allelic types of block 2 together with a composite repeat sequence of one of the types and N-terminal flanking T cell epitopes, and compared this with a series of recombinant proteins containing modular sub-components and similarly expressed in Escherichia coli. Immunogenicity of the full polyvalent hybrid protein was tested in both mice and rabbits, and comparative immunogenicity studies of the sub-component modules were performed in mice. The full hybrid protein induced high titre antibodies against each of the major block 2 allelic types expressed as separate recombinant proteins and against a wide range of allelic types naturally expressed by a panel of diverse P. falciparum isolates, while the sub-component modules had partial antigenic coverage as expected. This encourages further development and evaluation of the full MSP1 block 2 polyvalent hybrid protein as a candidate blood-stage component of a malaria vaccine. Copyright © 2011 Elsevier Ltd. All rights reserved.
Chen, Zhi-Teng; Wu, Hai-Yan; Du, Yu-Zhou
2016-07-01
We report the nearly complete mitochondrial genome of a stonefly species, Styloperla sp. (Plecoptera: Styloperlidae), which is a circular molecule of 15,416 bp in length and consists of 13 protein-coding genes, 2 ribosomal RNAs, 20 transfer RNAs and a partial control region (645 bp). Using the 13 protein-coding genes of 8 stoneflies and 3 other related species, we constructed a phylogenetic tree to verify the accuracy of the new determined mitogenome sequences. Our results provide basic data for further study of phylogeny in Plecoptera.
Avian sarcoma virus 17 carries the jun oncogene.
Maki, Y; Bos, T J; Davis, C; Starbuck, M; Vogt, P K
1987-01-01
Biologically active molecular clones of avian sarcoma virus 17 (ASV 17) contain a replication-defective proviral genome of 3.5 kilobases (kb). The genome retains partial gag and env sequences, which flank a cell-derived putative oncogene of 0.93 kb, termed jun. The jun gene lacks preserved coding domains of tyrosine-specific protein kinases. It also shows no significant nucleic acid homology with other known oncogenes. The probable transformation-specific protein in ASV 17-transformed cells is a 55-kDa gag-jun fusion product. Images PMID:3033666
Quissell, D O; Deisher, L M
1992-04-01
Rat submandibular and parotid gland exocytosis is primarily controlled by beta-adrenergic receptor stimulation. Although its precise role in the regulation of salivary gland exocytosis is not fully understood, protein phosphorylation, mediated by the activation of cAMP-dependent protein kinase, may be directly involved. Previous studies suggest that analogous 26-kDa integral membrane phosphoproteins may play a direct role in regulating exocytosis. Studies were here undertaken to purify and partially characterize both phosphoproteins. After endogenous phosphorylation with 32P, subcellular fraction and solubilization of the microsomal fraction in n-octyl beta-glucopyranoside, the 26-kDa integral membrane phosphoproteins were purified by high performance liquid chromatography (HPLC), followed by sodium dodecyl sulphate-polyacrylamide gel electrophoresis and electroelution of the proteins. Amino acid analysis indicated a significant number of serine amino acids: N-terminal sequence data demonstrated a high level of homology; and trypsin digestion followed by reversed-phase HPLC indicated the possibility of multiple phosphorylation sites.
Blaiotta, Giuseppe; Fusco, Vincenzina; Ercolini, Danilo; Aponte, Maria; Pepe, Olimpia; Villani, Francesco
2008-01-01
A phylogenetic tree showing diversities among 116 partial (499-bp) Lactobacillus hsp60 (groEL, encoding a 60-kDa heat shock protein) nucleotide sequences was obtained and compared to those previously described for 16S rRNA and tuf gene sequences. The topology of the tree produced in this study showed a Lactobacillus species distribution similar, but not identical, to those previously reported. However, according to the most recent systematic studies, a clear differentiation of 43 single-species clusters was detected/identified among the sequences analyzed. The slightly higher variability of the hsp60 nucleotide sequences than of the 16S rRNA sequences offers better opportunities to design or develop molecular assays allowing identification and differentiation of either distant or very closely related Lactobacillus species. Therefore, our results suggest that hsp60 can be considered an excellent molecular marker for inferring the taxonomy and phylogeny of members of the genus Lactobacillus and that the chosen primers can be used in a simple PCR procedure allowing the direct sequencing of the hsp60 fragments. Moreover, in this study we performed a computer-aided restriction endonuclease analysis of all 499-bp hsp60 partial sequences and we showed that the PCR-restriction fragment length polymorphism (RFLP) patterns obtainable by using both endonucleases AluI and TacI (in separate reactions) can allow identification and differentiation of all 43 Lactobacillus species considered, with the exception of the pair L. plantarum/L. pentosus. However, the latter species can be differentiated by further analysis with Sau3AI or MseI. The hsp60 PCR-RFLP approach was efficiently applied to identify and to differentiate a total of 110 wild Lactobacillus strains (including closely related species, such as L. casei and L. rhamnosus or L. plantarum and L. pentosus) isolated from cheese and dry-fermented sausages.
Blaiotta, Giuseppe; Fusco, Vincenzina; Ercolini, Danilo; Aponte, Maria; Pepe, Olimpia; Villani, Francesco
2008-01-01
A phylogenetic tree showing diversities among 116 partial (499-bp) Lactobacillus hsp60 (groEL, encoding a 60-kDa heat shock protein) nucleotide sequences was obtained and compared to those previously described for 16S rRNA and tuf gene sequences. The topology of the tree produced in this study showed a Lactobacillus species distribution similar, but not identical, to those previously reported. However, according to the most recent systematic studies, a clear differentiation of 43 single-species clusters was detected/identified among the sequences analyzed. The slightly higher variability of the hsp60 nucleotide sequences than of the 16S rRNA sequences offers better opportunities to design or develop molecular assays allowing identification and differentiation of either distant or very closely related Lactobacillus species. Therefore, our results suggest that hsp60 can be considered an excellent molecular marker for inferring the taxonomy and phylogeny of members of the genus Lactobacillus and that the chosen primers can be used in a simple PCR procedure allowing the direct sequencing of the hsp60 fragments. Moreover, in this study we performed a computer-aided restriction endonuclease analysis of all 499-bp hsp60 partial sequences and we showed that the PCR-restriction fragment length polymorphism (RFLP) patterns obtainable by using both endonucleases AluI and TacI (in separate reactions) can allow identification and differentiation of all 43 Lactobacillus species considered, with the exception of the pair L. plantarum/L. pentosus. However, the latter species can be differentiated by further analysis with Sau3AI or MseI. The hsp60 PCR-RFLP approach was efficiently applied to identify and to differentiate a total of 110 wild Lactobacillus strains (including closely related species, such as L. casei and L. rhamnosus or L. plantarum and L. pentosus) isolated from cheese and dry-fermented sausages. PMID:17993558
Yu, Wilson; Chiaw, Patrick Kim; Bear, Christine E.
2011-01-01
Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene that cause loss of function of the CFTR channel on the apical surface of epithelial cells. The major CF-causing mutation, F508del-CFTR, is misfolded, retained in the endoplasmic reticulum, and degraded. Small molecule corrector compounds have been identified using high throughput screens, which partially rescue the trafficking defect of F508del-CFTR, allowing a fraction of the mutant protein to escape endoplasmic reticulum retention and traffic to the plasma membrane, where it exhibits partial function as a cAMP-regulated chloride channel. A subset of such corrector compounds binds directly to the mutant protein, prompting the hypothesis that they rescue the biosynthetic defect by inducing improved protein conformation. We tested this hypothesis directly by evaluating the consequences of a corrector compound on the conformation of each nucleotide binding domain (NBD) in the context of the full-length mutant protein in limited proteolytic digest studies. Interestingly, we found that VRT-325 was capable of partially restoring compactness in NBD1. However, VRT-325 had no detectable effect on the conformation of the second half of the molecule. In comparison, ablation of the di-arginine sequence, R553XR555 (F508del-KXK-CFTR), modified protease susceptibility of NBD1, NBD2, and the full-length protein. Singly, each intervention led to a partial correction of the processing defect. Together, these interventions restored processing of F508del-CFTR to near wild type. Importantly, however, a defect in NBD1 conformation persisted, as did a defect in channel activation after the combined interventions. Importantly, this defect in channel activation can be fully corrected by the addition of the potentiator, VX-770. PMID:21602569
Isolation of a cDNA Encoding a Granule-Bound 152-Kilodalton Starch-Branching Enzyme in Wheat1
Båga, Monica; Nair, Ramesh B.; Repellin, Anne; Scoles, Graham J.; Chibbar, Ravindra N.
2000-01-01
Screening of a wheat (Triticum aestivum) cDNA library for starch-branching enzyme I (SBEI) genes combined with 5′-rapid amplification of cDNA ends resulted in isolation of a 4,563-bp composite cDNA, Sbe1c. Based on sequence alignment to characterized SBEI cDNA clones isolated from plants, the SBEIc predicted from the cDNA sequence was produced with a transit peptide directing the polypeptide into plastids. Furthermore, the predicted mature form of SBEIc was much larger (152 kD) than previously characterized plant SBEI (80–100 kD) and contained a partial duplication of SBEI sequences. The first SBEI domain showed high amino acid similarity to a 74-kD wheat SBEI-like protein that is inactive as a branching enzyme when expressed in Escherichia coli. The second SBEI domain on SBEIc was identical in sequence to a functional 87-kD SBEI produced in the wheat endosperm. Immunoblot analysis of proteins produced in developing wheat kernels demonstrated that the 152-kD SBEIc was, in contrast to the 87- to 88-kD SBEI, preferentially associated with the starch granules. Proteins similar in size and recognized by wheat SBEI antibodies were also present in Triticum monococcum, Triticum tauschii, and Triticum turgidum subsp. durum. PMID:10982440
Human endomembrane H+ pump strongly resembles the ATP-synthetase of Archaebacteria.
Südhof, T C; Fried, V A; Stone, D K; Johnston, P A; Xie, X S
1989-01-01
Preparations of mammalian H+ pumps that acidify intracellular vesicles contain eight or nine polypeptides, ranging in size from 116 to 17 kDa. Biochemical analysis indicates that the 70- and 58-kDa polypeptides are subunits critical for ATP hydrolysis. The amino acid sequences of the major catalytic subunits (58 and 70 kDa) of the endomembrane H+ pump are unknown from animal cells. We report here the complete sequence of the 58-kDa subunit derived from a human kidney cDNA clone and partial sequences of the 70- and 58-kDa subunits purified from clathrin-coated vesicles of bovine brain. The amino acid sequences of both proteins strongly resemble the sequences of the corresponding subunits of the vacuolar H+ pumps of Archaebacteria, plants, and fungi. The archaebacterial enzyme is believed to use a H+ gradient to synthesize ATP. Thus, a common ancestral protein has given rise to a H+ pump that synthesizes ATP in one organism and hydrolyzes it in another and is highly conserved from prokaryotes to humans. The same pump appears to mediate the acidification of intracellular organelles, including coated vesicles, lysosomes, and secretory granules, as well as extracellular fluids such as urine. PMID:2527371
Translocation of an 89-kDa periplasmic protein is associated with Holospora infection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Iwatani, Koichi; Dohra, Hideo; Lang, B. Franz
2005-12-02
The symbiotic bacterium Holospora obtusa infects the macronucleus of the ciliate Paramecium caudatum. After ingestion by its host, an infectious form of Holospora with an electron-translucent tip passes through the host digestive vacuole and penetrates the macronuclear envelope with this tip. To investigate the underlying molecular mechanism of this process, we raised a monoclonal antibody against the tip-specific 89-kDa protein, sequenced this partially, and identified the corresponding complete gene. The deduced protein sequence carries two actin-binding motifs. Indirect immunofluorescence microscopy shows that during escape from the host digestive vacuole, the 89-kDa proteins translocates from the inside to the outside ofmore » the tip. When the bacterium invades the macronucleus, the 89-kDa protein is left behind at the entry point of the nuclear envelope. Transmission electron microscopy shows the formation of fine fibrous structures that co-localize with the antibody-labeled regions of the bacterium. Our findings suggest that the 89-kDa protein plays a role in Holospora's escape from the host digestive vacuole, the migration through the host cytoplasm, and the invasion into the macronucleus.« less
Silverstone, Aron L.; Jung, Hou-Sung; Dill, Alyssa; Kawaide, Hiroshi; Kamiya, Yuji; Sun, Tai-ping
2001-01-01
RGA (for repressor of ga1-3) and SPINDLY (SPY) are likely repressors of gibberellin (GA) signaling in Arabidopsis because the recessive rga and spy mutations partially suppressed the phenotype of the GA-deficient mutant ga1-3. We found that neither rga nor spy altered the GA levels in the wild-type or the ga1-3 background. However, expression of the GA biosynthetic gene GA4 was reduced 26% by the rga mutation, suggesting that partial derepression of the GA response pathway by rga resulted in the feedback inhibition of GA4 expression. The green fluorescent protein (GFP)–RGA fusion protein was localized to nuclei in transgenic Arabidopsis. This result supports the predicted function of RGA as a transcriptional regulator based on sequence analysis. Confocal microscopy and immunoblot analyses demonstrated that the levels of both the GFP-RGA fusion protein and endogenous RGA were reduced rapidly by GA treatment. Therefore, the GA signal appears to derepress the GA signaling pathway by degrading the repressor protein RGA. The effect of rga on GA4 gene expression and the effect of GA on RGA protein level allow us to identify part of the mechanism by which GA homeostasis is achieved. PMID:11449051
Tapia, Lorena I; Shaw, Chad A; Aideyan, Letisha O; Jewell, Alan M; Dawson, Brian C; Haq, Taha R; Piedra, Pedro A
2014-01-01
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004-2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community.
Tapia, Lorena I.; Shaw, Chad A.; Aideyan, Letisha O.; Jewell, Alan M.; Dawson, Brian C.; Haq, Taha R.; Piedra, Pedro A.
2014-01-01
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004–2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community. PMID:24625544
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data
Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo
2018-01-01
RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
2010-01-01
Background The extended light-harvesting complex (LHC) protein superfamily is a centerpiece of eukaryotic photosynthesis, comprising the LHC family and several families involved in photoprotection, like the LHC-like and the photosystem II subunit S (PSBS). The evolution of this complex superfamily has long remained elusive, partially due to previously missing families. Results In this study we present a meticulous search for LHC-like sequences in public genome and expressed sequence tag databases covering twelve representative photosynthetic eukaryotes from the three primary lineages of plants (Plantae): glaucophytes, red algae and green plants (Viridiplantae). By introducing a coherent classification of the different protein families based on both, hidden Markov model analyses and structural predictions, numerous new LHC-like sequences were identified and several new families were described, including the red lineage chlorophyll a/b-binding-like protein (RedCAP) family from red algae and diatoms. The test of alternative topologies of sequences of the highly conserved chlorophyll-binding core structure of LHC and PSBS proteins significantly supports the independent origins of LHC and PSBS families via two unrelated internal gene duplication events. This result was confirmed by the application of cluster likelihood mapping. Conclusions The independent evolution of LHC and PSBS families is supported by strong phylogenetic evidence. In addition, a possible origin of LHC and PSBS families from different homologous members of the stress-enhanced protein subfamily, a diverse and anciently paralogous group of two-helix proteins, seems likely. The new hypothesis for the evolution of the extended LHC protein superfamily proposed here is in agreement with the character evolution analysis that incorporates the distribution of families and subfamilies across taxonomic lineages. Intriguingly, stress-enhanced proteins, which are universally found in the genomes of green plants, red algae, glaucophytes and in diatoms with complex plastids, could represent an important and previously missing link in the evolution of the extended LHC protein superfamily. PMID:20673336
Martínez-Quintana, José A; Peregrino-Uriarte, Alma B; Gollas-Galván, Teresa; Gómez-Jiménez, Silvia; Yepiz-Plascencia, Gloria
2014-12-01
During hypoxia the shrimp Litopenaeus vannamei accelerates anaerobic glycolysis to obtain energy; therefore, a correct supply of glucose to the cells is needed. Facilitated glucose transport across the cells is mediated by a group of membrane embedded integral proteins called GLUT; being GLUT1 the most ubiquitous form. In this work, we report the first cDNA nucleotide and deduced amino acid sequences of a glucose transporter 1 from L. vannamei. A 1619 bp sequence was obtained by RT-PCR and RACE approaches. The 5´ UTR is 161 bp and the poly A tail is exactly after the stop codon in the mRNA. The ORF is 1485 bp and codes for 485 amino acids. The deduced protein sequence has high identity to GLUT1 proteins from several species and contains all the main features of glucose transporter proteins, including twelve transmembrane domains, the conserved motives and amino acids involved in transport activity, ligands binding and membrane anchor. Therefore, we decided to name this sequence, glucose transporter 1 of L. vannamei (LvGLUT1). A partial gene sequence of 8.87 Kbp was also obtained; it contains the complete coding sequence divided in 10 exons. LvGlut1 expression was detected in hemocytes, hepatopancreas, intestine gills, muscle and pleopods. The higher relative expression was found in gills and the lower in hemocytes. This indicates that LvGlut1 is ubiquitously expressed but its levels are tissue-specific and upon short-term hypoxia, the GLUT1 transcripts increase 3.7-fold in hepatopancreas and gills. To our knowledge, this is the first evidence of expression of GLUT1 in crustaceans.
Comprehensive Analysis of Protein Modifications by Top-down Mass Spectrometry
Zhang, Han; Ge, Ying
2012-01-01
Mass spectrometry (MS)-based proteomics is playing an increasingly important role in cardiovascular research. Proteomics includes not only identification and quantification of proteins, but also the characterization of protein modifications such as post-translational modifications and sequence variants. The conventional bottom-up approach, involving proteolytic digestion of proteins into small peptides prior to MS analysis, is routinely used for protein identification and quantification with high throughput and automation. Nevertheless, it has limitations in the analysis of protein modifications mainly due to the partial sequence coverage and loss of connections among modifications on disparate portions of a protein. An alternative approach, top-down MS, has emerged as a powerful tool for the analysis of protein modifications. The top-down approach analyzes whole proteins directly, providing a “bird’s eye” view of all existing modifications. Subsequently, each modified protein form can be isolated and fragmented in the mass spectrometer to locate the modification site. The incorporation of the non-ergodic dissociation methods such as electron capture dissociation (ECD) greatly enhances the top-down capabilities. ECD is especially useful for mapping labile post-translational modifications which are well-preserved during the ECD fragmentation process. Top-down MS with ECD has been successfully applied to cardiovascular research with the unique advantages in unraveling the molecular complexity, quantifying modified protein forms, complete mapping of modifications with full sequence coverage, discovering unexpected modifications, and identifying and quantifying positional isomers and determining the order of multiple modifications. Nevertheless, top-down MS still needs to overcome some technical challenges to realize its full potential. Herein, we reviewed the advantages and challenges of top-down methodology with a focus on its application in cardiovascular research. PMID:22187450
The bean. alpha. -amylase inhibitor is encoded by a lectin gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moreno, J.; Altabella, T.; Chrispeels, M.J.
The common bean, Phaseolus vulgaris, contains an inhibitor of insect and mammalian {alpha}-amylases that does not inhibit plant {alpha}-amylase. This inhibitor functions as an anti-feedant or seed-defense protein. We purified this inhibitor by affinity chromatography and found that it consists of a series of glycoforms of two polypeptides (Mr 14,000-19,000). Partial amino acid sequencing was carried out, and the sequences obtained are identical with portions of the derived amino acid sequence of a lectin-like gene. This lectin gene encodes a polypeptide of MW 28,000, and the primary in vitro translation product identified by antibodies to the {alpha}-amylase inhibitor has themore » same size. Co- and posttranslational processing of this polypeptide results in glycosylated polypeptides of 14-19 kDa. Our interpretation of these results is that the bean lectins constitute a gene family that encodes diverse plant defense proteins, including phytohemagglutinin, arcelin and {alpha}-amylase inhibitor.« less
Churchill, Mair E.A.; Klass, Janet; Zoetewey, David L.
2010-01-01
The ubiquitous eukaryotic High-Mobility-Group-Box (HMGB) chromosomal proteins promote many chromatin-mediated cellular activities through their non-sequence-specific binding and bending of DNA. Minor groove DNA binding by the HMG box results in substantial DNA bending toward the major groove owing to electrostatic interactions, shape complementarity and DNA intercalation that occurs at two sites. Here, the structures of the complexes formed with DNA by a partially DNA intercalation-deficient mutant of Drosophila melanogaster HMGD have been determined by X-ray crystallography at a resolution of 2.85 Å. The six proteins and fifty base pairs of DNA in the crystal structure revealed a variety of bound conformations. All of the proteins bound in the minor groove, bridging DNA molecules, presumably because these DNA regions are easily deformed. The loss of the primary site of DNA intercalation decreased overall DNA bending and shape complementarity. However, DNA bending at the secondary site of intercalation was retained and most protein-DNA contacts were preserved. The mode of binding resembles the HMGB1-boxA-cisplatin-DNA complex, which also lacks a primary intercalating residue. This study provides new insights into the binding mechanisms used by HMG boxes to recognize varied DNA structures and sequences as well as modulate DNA structure and DNA bending. PMID:20800069
Bhattacharya, D; Steinkötter, J; Melkonian, M
1993-12-01
Centrin (= caltractin) is a ubiquitous, cytoskeletal protein which is a member of the EF-hand superfamily of calcium-binding proteins. A centrin-coding cDNA was isolated and characterized from the prasinophyte green alga Scherffelia dubia. Centrin PCR amplification primers were used to isolate partial, homologous cDNA sequences from the green algae Tetraselmis striata and Spermatozopsis similis. Annealing analyses suggested that centrin is a single-copy-coding region in T. striata and S. similis and other green algae studied. Centrin-coding regions from S. dubia, S. similis and T. striata encode four colinear EF-hand domains which putatively bind calcium. Phylogenetic analyses, including homologous sequences from Chlamydomonas reinhardtii and the land plant Atriplex nummularia, demonstrate that the domains of centrins are congruent and arose from the two-fold duplication of an ancestral EF hand with Domains 1+3 and Domains 2+4 clustering. The domains of centrins are also congruent with those of calmodulins demonstrating that, like calmodulin, centrin is an ancient protein which arose within the ancestor of all eukaryotes via gene duplication. Phylogenetic relationships inferred from centrin-coding region comparisons mirror results of small subunit ribosomal RNA sequence analyses suggesting that centrin-coding regions are useful evolutionary markers within the green algae.
A novel class of small RNAs bind to MILI protein in mouse testes.
Aravin, Alexei; Gaidatzis, Dimos; Pfeffer, Sébastien; Lagos-Quintana, Mariana; Landgraf, Pablo; Iovino, Nicola; Morris, Patricia; Brownstein, Michael J; Kuramochi-Miyagawa, Satomi; Nakano, Toru; Chien, Minchen; Russo, James J; Ju, Jingyue; Sheridan, Robert; Sander, Chris; Zavolan, Mihaela; Tuschl, Thomas
2006-07-13
Small RNAs bound to Argonaute proteins recognize partially or fully complementary nucleic acid targets in diverse gene-silencing processes. A subgroup of the Argonaute proteins--known as the 'Piwi family'--is required for germ- and stem-cell development in invertebrates, and two Piwi members--MILI and MIWI--are essential for spermatogenesis in mouse. Here we describe a new class of small RNAs that bind to MILI in mouse male germ cells, where they accumulate at the onset of meiosis. The sequences of the over 1,000 identified unique molecules share a strong preference for a 5' uridine, but otherwise cannot be readily classified into sequence families. Genomic mapping of these small RNAs reveals a limited number of clusters, suggesting that these RNAs are processed from long primary transcripts. The small RNAs are 26-31 nucleotides (nt) in length--clearly distinct from the 21-23 nt of microRNAs (miRNAs) or short interfering RNAs (siRNAs)--and we refer to them as 'Piwi-interacting RNAs' or piRNAs. Orthologous human chromosomal regions also give rise to small RNAs with the characteristics of piRNAs, but the cloned sequences are distinct. The identification of this new class of small RNAs provides an important starting point to determine the molecular function of Piwi proteins in mammalian spermatogenesis.
Austin, Christopher M; Tan, Mun Hua; Lee, Yin Peng; Croft, Laurence J; Meekan, Mark G; Pierce, Simon J; Gan, Han Ming
2016-01-01
The complete mitochondrial genome of the parasitic copepod Pandarus rhincodonicus was obtained from a partial genome scan using the HiSeq sequencing system. The Pandarus rhincodonicus mitogenome has 14,480 base pairs (62% A+T content) made up of 12 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 384 bp non-coding AT-rich region. This Pandarus mitogenome sequence is the first for the family Pandaridae, the second for the order Siphonostomatoida and the sixth for the Copepoda.
Luo, Lilan; Ando, Sayuri; Sasabe, Michiko; Machida, Chiyoko; Kurihara, Daisuke; Higashiyama, Tetsuya; Machida, Yasunori
2012-09-01
Leaf primordia with high division and developmental competencies are generated around the periphery of stem cells at the shoot apex. Arabidopsis ASYMMETRIC-LEAVES2 (AS2) protein plays a key role in the regulation of many genes responsible for flat symmetric leaf formation. The AS2 gene, expressed in leaf primordia, encodes a plant-specific nuclear protein containing an AS2/LOB domain with cysteine repeats (C-motif). AS2 proteins are present in speckles in and around the nucleoli, and in the nucleoplasm of some leaf epidermal cells. We used the tobacco cultured cell line BY-2 expressing the AS2-fused yellow fluorescent protein to examine subnuclear localization of AS2 in dividing cells. AS2 mainly localized to speckles (designated AS2 bodies) in cells undergoing mitosis and distributed in a pairwise manner during the separation of sets of daughter chromosomes. Few interphase cells contained AS2 bodies. Deletion analyses showed that a short stretch of the AS2 amino-terminal sequence and the C-motif play negative and positive roles, respectively, in localizing AS2 to the bodies. These results suggest that AS2 bodies function to properly distribute AS2 to daughter cells during cell division in leaf primordia; and this process is controlled at least partially by signals encoded by the AS2 sequence itself.
Identification of the ubiquinone-binding domain in the disulfide catalyst disulfide bond protein B.
Xie, Tong; Yu, Linda; Bader, Martin W; Bardwell, James C A; Yu, Chang-An
2002-01-18
Disulfide bond (Dsb) formation is catalyzed in the periplasm of prokaryotes by the Dsb proteins. DsbB, a key enzyme in this process, generates disulfides de novo by using the oxidizing power of quinones. To explore the mechanism of this newly described enzymatic activity, we decided to study the ubiquinone-protein interaction and identify the ubiquinone-binding domain in DsbB by cross-linking to photoactivatable quinone analogues. When purified Escherichia coli DsbB was incubated with an azidoubiquinone derivative, 3-azido-2-methyl-5-[(3)H]methoxy-6-decyl-1,4-benzoquinone ([(3)H]azido-Q), and illuminated with long wavelength UV light, the decrease in enzymatic activity correlated with the amount of 3-azido-2-methyl-5-methoxy-6-decyl-1,4-benzoquinone (azido-Q) incorporated into the protein. One azido-Q-linked peptide with a retention time of 33.5 min was obtained by high performance liquid chromatography of the V8 digest of [(3)H]azido-Q-labeled DsbB. This peptide has a partial NH(2)-terminal amino acid sequence of NH(2)-HTMLQLY corresponding to residues 91-97. This sequence occurs in the second periplasmic domain of the inner membrane protein DsbB in a loop connecting transmembrane helices 3 and 4. We propose that the quinone-binding site is within or very near to this sequence.
Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro
2011-09-01
Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.
Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro
2011-01-01
Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341
Covaleda, Giovanni; Trejo, Sebastian A; Salas-Sarduy, Emir; Del Rivero, Maday Alonso; Chavez, Maria Angeles; Aviles, Francesc X
2017-08-08
Proteases and their inhibitors have become molecules of increasing fundamental and applicative value. Here we report an integrated strategy to identify and analyze such inhibitors from Caribbean marine invertebrates extracts by a fast and sensitive functional proteomics-like approach. The strategy works in three steps: i) multiplexed enzymatic inhibition kinetic assays, ii) Intensity Fading MALDI-TOF MS to establish a link between inhibitory molecules and the related MALDI signal(s) detected in the extract(s), and iii) ISD-CID-T 3 MS fragmentation on the parent MALDI signals selected in the previous step, enabling the partial or total top-down sequencing of the molecules. The present study has allowed validation of the whole approach, identification of a substantial number of novel protein protease inhibitors, as well as full or partial sequencing of reference molecular species and of many unknown ones, respectively. Such inhibitors correspond to six protease subfamilies (metallocarboxypeptidases-A and -B, pepsin, papain, trypsin and subtilisin), are small (1-10KDa) disulfide-rich proteins, and have been found at diverse frequencies among the invertebrates (13 to 41%). The overall procedure could be tailored to other enzyme-inhibitor and protein interacting systems, analyzing samples at medium-throughput level and leading to the functional and structural characterization of proteinaceous ligands from complex biological extracts. Invertebrate animals, and marine ones among, display a remarkable diversity of species and contained biomolecules. Many of their proteins-peptides have high biological, biotechnological and biomedical potential interest but, because of the lack of sequenced genomes behind, their structural and functional characterization constitutes a great challenge. Here, looking at the small, disulfide-rich, proteinaceous inhibitors of proteases found in them, it is shown that such problem can be significatively facilitated by integrative multiplexed enzymatic assays, affinity-based Intensity-Fading (IF-) MALDI-TOF mass spectrometry (MS), and on-line MS fragmentation, in a fast and easy approach. Copyright © 2017. Published by Elsevier B.V.
Unraveling the sequence and structure of the protein osteocalcin from a 42 ka fossil horse
NASA Astrophysics Data System (ADS)
Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Andrews, Philip C.; Leykam, Joseph; Stafford, Thomas W.; Kelly, Robert L.; Walker, Danny N.; Buckley, Mike; Humpula, James
2006-04-01
We report the first complete amino acid sequence and evidence of secondary structure for osteocalcin from a temperate fossil. The osteocalcin derives from a 42 ka equid bone excavated from Juniper Cave, Wyoming. Results were determined by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-MS) and Edman sequencing with independent confirmation of the sequence in two laboratories. The ancient sequence was compared to that of three modern taxa: horse ( Equus caballus), zebra ( Equus grevyi), and donkey ( Equus asinus). Although there was no difference in sequence among modern taxa, MALDI-MS and Edman sequencing show that residues 48 and 49 of our modern horse are Thr, Ala rather than Pro, Val as previously reported (Carstanjen B., Wattiez, R., Armory, H., Lepage, O.M., Remy, B., 2002. Isolation and characterization of equine osteocalcin. Ann. Med. Vet.146(1), 31-38). MALDI-MS and Edman sequencing data indicate that the osteocalcin sequence of the 42 ka fossil is similar to that of modern horse. Previously inaccessible structural attributes for ancient osteocalcin were observed. Glu 39 rather than Gln 39 is consistent with deamidation, a process known to occur during fossilization and aging. Two post-translational modifications were documented: Hyp 9 and a disulfide bridge. The latter suggests at least partial retention of secondary structure. As has been done for ancient DNA research, we recommend standards for preparation and criteria for authenticating results of ancient protein sequencing.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adámik, Matej; Bažantová, Pavla; Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 701 03 Ostrava
Highlights: • DNA binding of p53 family core domains is inhibited by cadmium, cobalt and nickel. • Binding to DNA protects p53 family core domains from metal induced inhibition. • Cadmium, cobalt and nickel induced inhibition was reverted by EDTA in vitro. - Abstract: Site-specific DNA recognition and binding activity belong to common attributes of all three members of tumor suppressor p53 family proteins: p53, p63 and p73. It was previously shown that heavy metals can affect p53 conformation, sequence-specific binding and suppress p53 response to DNA damage. Here we report for the first time that cadmium, nickel and cobalt,more » which have already been shown to disturb various DNA repair mechanisms, can also influence p63 and p73 sequence-specific DNA binding activity and transactivation of p53 family target genes. Based on results of electrophoretic mobility shift assay and luciferase reporter assay, we conclude that cadmium inhibits sequence-specific binding of all three core domains to p53 consensus sequences and abolishes transactivation of several promoters (e.g. BAX and MDM2) by 50 μM concentrations. In the presence of specific DNA, all p53 family core domains were partially protected against loss of DNA binding activity due to cadmium treatment. Effective cadmium concentration to abolish DNA–protein interactions was about two times higher for p63 and p73 proteins than for p53. Furthermore, we detected partial reversibility of cadmium inhibition for all p53 family members by EDTA. DTT was able to reverse cadmium inhibition only for p53 and p73. Nickel and cobalt abolished DNA–p53 interaction at sub-millimolar concentrations while inhibition of p63 and p73 DNA binding was observed at millimolar concentrations. In summary, cadmium strongly inhibits p53, p63 and p73 DNA binding in vitro and in cells in comparison to nickel and cobalt. The role of cadmium inhibition of p53 tumor suppressor family in carcinogenesis is discussed.« less
Li, Ye Long; Dai, Xin Ren; Yue, Xun; Gao, Xin-Qi; Zhang, Xian Sheng
2014-10-01
Maize 1,491 small secreted peptides were identified, which were classified according to the character of peptide sequences. Partial SSP gene expressions in reproductive tissues were determined by qRT-PCR. Small secreted peptides (SSPs) are important cell-cell communication messengers in plants. Most information on plant SSPs come from Arabidopsis thaliana and Oryza sativa, while little is known about the SSPs of other grass species such as maize (Zea mays). In this study, we identified 1,491 SSP genes from maize genomic sequences. These putative SSP genes were distributed throughout the ten maize chromosomes. Among them, 611 SSPs were classified into 198 superfamilies according to their conserved domains, and 725 SSPs with four or more cysteines at their C-termini shared similar cysteine arrangements with their counterparts in other plant species. Moreover, the SSPs requiring post-translational modification, as well as defensin-like (DEFL) proteins, were identified. Further, the expression levels of 110 SSP genes were analyzed in reproductive tissues, including male flower, pollen, silk, and ovary. Most of the genes encoding basal-layer antifungal peptide-like, small coat proteins-like, thioredoxin-like proteins, γ-thionins-like, and DEFL proteins showed high expression levels in the ovary and male flower compared with their levels in silk and mature pollen. The rapid alkalinization factor-like genes were highly expressed only in the mature ovary and mature pollen, and pollen Ole e 1-like genes showed low expression in silk. The results of this study provide basic information for further analysis of SSP functions in the reproductive process of maize.
Rokyta, Darin R; Wray, Kenneth P; Lemmon, Alan R; Lemmon, Emily Moriarty; Caudle, S Brian
2011-04-01
Despite causing considerable human mortality and morbidity, animal toxins represent a valuable source of pharmacologically active macromolecules, a unique system for studying molecular adaptation, and a powerful framework for examining structure-function relationships in proteins. Snake venoms are particularly useful in the latter regard as they consist primarily of a moderate number of proteins and peptides that have been found to belong to just a handful of protein families. As these proteins and peptides are produced in dedicated glands, transcriptome sequencing has proven to be an effective approach to identifying the expressed toxin genes. We generated a venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) using Roche 454 sequencing technology. In the current work, we focus on transcripts encoding toxins. We identified 40 unique toxin transcripts, 30 of which have full-length coding sequences, and 10 have only partial coding sequences. These toxins account for 24% of the total sequencing reads. We found toxins from 11 previously described families of snake-venom toxins and have discovered two putative, previously undescribed toxin classes. The most diverse and highly expressed toxin classes in the C. adamanteus venom-gland transcriptome are the serine proteinases, metalloproteinases, and C-type lectins. The serine proteinases are the most abundant class, accounting for 35% of the toxin sequencing reads. Metalloproteinases are the most diverse; 11 different forms have been identified. Using our sequences and those available in public databases, we detected positive selection in seven of the eight toxin families for which sufficient sequences were available for the analysis. We find that the vast majority of the genes that contribute directly to this vertebrate trait show evidence for a role for positive selection in their evolutionary history. Copyright © 2011 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goldmuntz, E.; Budarf, M.L.; Wang, Zhili
1996-04-15
DiGeorge syndrome (DGS) and velocardiofacial syndrome have been shown to be associated with microdeletions of chromosomal region 22q11. More recently, patients with conotruncal anomaly face syndrome and some nonsyndromic patients with isolated forms of conotruncal cardiac defects have been found to have 22q11 microdeletions as well. The commonly deleted region, called the DiGeorge chromosomal region (DGCR), spans approximately 1.2 mb and is estimated to contain at least 30 genes. We report a computational approach for gene identification that makes use of large-scale sequencing of cosmids from a contig spanning the DGCR. Using this methodology, we have mapped the human homologmore » of a rodent citrate transport protein to the DGCR. We have isolated a partial cDNA containing the complete open reading frame and have determined the genomic structure by comparing the genomic sequence from the cosmid to the sequence of the cDNA clone. Whether the citrate transport protein can be implicated in the biological etiology of DGS or other 22q11 microdeletion syndromes remains to be defined. 36 refs., 3 figs., 1 tab.« less
Jia, P; Zhang, C; Huang, X P; Poda, M; Akbas, F; Lemanski, S L; Erginel-Unaltuna, N; Lemanski, L F
2008-11-01
The discovery of the naturally occurring cardiac non-function (c) animal strain in Ambystoma mexicanum (axolotl) provides a valuable animal model to study cardiomyocyte differentiation. In homozygous mutant animals (c/c), rhythmic contractions of the embryonic heart are absent due to a lack of organized myofibrils. We have previously cloned a partial sequence of a peptide cDNA (N1) from an anterior-endoderm-conditioned-medium RNA library that had been shown to be able to rescue the mutant phenotype. In the current studies we have fully cloned the N1 full length cDNA sequence from the library. N1 protein has been detected in both adult heart and skeletal muscle but not in any other adult tissues. GFP-tagged expression of the N1 protein has revealed localization of the N1 protein in the endoplasmic reticulum (ER). Results from in situ hybridization experiments have confirmed the dramatic decrease of expression of N1 mRNA in mutant (c/c) embryos indicating that the N1 gene is involved in heart development.
Molecular cloning and expression of the CRISP family of proteins in the boar.
Vadnais, Melissa L; Foster, Douglas N; Roberts, Kenneth P
2008-12-01
The family of mammalian cysteine-rich secretory proteins (CRISP) have been well characterized in the rat, mouse, and human. Here we report the molecular cloning and expression analysis of CRISP1, CRISP2, and CRISP3 in the boar. A partial sequence published in the National Center for Biotechnology Information (NCBI) database was used to derive the full-length sequences for CRISP1 and CRISP2 using rapid amplification of cDNA ends. RT-PCR confirmed the expression of these mRNAs in the boar reproductive tract, and real time RT-PCR showed CRISP1 to be highly expressed throughout the epididymis, with CRISP2 highly expressed in the testis. A search of the porcine genomic sequence in the NCBI database identified a BAC (CH242-199E6) encoding the CRISP1 gene. This BAC is derived from porcine Chromosome 7 and is syntenic with the regions of the mouse, rat, and human genomes encoding the CRISP gene family. This BAC was found to encode a third CRISP protein with a predicted amino acid sequence of high similarity to human CRISP3. Using RT-PCR we show that CRISP3 expression in the boar reproductive tract is confined to the prostate. Recombinant porcine (rp) CRISP2 protein was produced and purified. When incubated with capacitated boar sperm, rpCRISP2 induced an acrosome reaction, consistent with its demonstrated ability to alter the activity of calcium channels.
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-11-11
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF.
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-01-01
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF. Images PMID:7501455
The intracellular region of ClC-3 chloride channel is in a partially folded state and a monomer.
Li, Shu Jie; Kawazaki, Masanobu; Ogasahara, Kyoko; Nakagawa, Atsushi
2006-05-01
In contrast to bacterial ClC chloride channels, all eukaryotic ClC chloride channels have a conserved long intracellular region that makes up of the carboxyl terminus of the protein and is necessary for channel functions as a channel gate. Little is known, however, about the molecular structure of the intracellular region of ClC chloride channels so far. Here, for the first time, we have expressed and purified the intracellular region of the rat ClC-3 chloride channel (C-ClC-3) as a water-soluble protein under physiological conditions, and investigated its structural characteristics and assembly behavior by means of circular dichroism (CD) spectroscopy, differential scanning calorimetry (DSC), size exclusion chromatography and analytical ultracentrifugation. The far-UV CD spectra of C-ClC-3 in the native state and in the presence of urea clearly show that the protein has a significantly folded secondary structure consisting of alpha-helices and beta-sheets, while the near-UV CD spectra and DSC experiments indicate the protein is deficient in well-defined tertiary packing. Its Stokes radius is larger than its expected size as a folded globular protein, as determined on size exclusion chromatography. Furthermore, the DisEMBL program, a useful computational tool for the prediction of disordered/unstructured regions within a protein sequence, predicts that the protein is in a partially folded state. Based on these results, we conclude that C-ClC-3 is partially folded. On the other hand, both size exclusion chromatography and sedimentation equilibrium analysis show that C-ClC-3 exists as a monomer in solution, not a dimer like the whole ClC-3 molecule.
Sumi, S; Tsuneyoshi, T; Furutani, H
1993-09-01
Rod-shaped flexuous viruses were partially purified from garlic plants (Allium sativum) showing typical mosaic symptoms. The genome was shown to be composed of RNA with a poly(A) tail of an estimated size of 10 kb as shown by denaturing agarose gel electrophoresis. We constructed cDNA libraries and screened four independent clones, which were designated GV-A, GV-B, GV-C and GV-D, using Northern and Southern blot hybridization. Nucleotide sequence determination of the cDNAs, two of which correspond to nearly one-third of the virus genomic RNA, shows that all of these viruses possess an identical genomic structure and that also at least four proteins are encoded in the viral cDNA, their M(r)s being estimated to be 15K, 27K, 40K and 11K. The 15K open reading frame (ORF) encodes the core-like sequence of a zinc finger protein preceded by a cluster of basic amino acid residues. The 27K ORF probably encodes the viral coat protein (CP), based on both the existence of some conserved sequences observed in many other rod-shaped or flexuous virus CPs and an overall amino acid sequence similarity to potexvirus and carlavirus CPs. The 11K ORF shows significant amino acid sequence similarities to the corresponding 12K proteins of the potexviruses and carlaviruses. On the other hand, the 40K ORF product does not resemble any other plant virus gene products reported so far. The genomic organization in the 3' region of the garlic viruses resembles, but clearly differs from, that of carlaviruses. Phylogenetic analysis based upon the amino acid sequence of the viral capsid protein also indicates that the garlic viruses have a unique and distinct domain different from those of the potexvirus and carlavirus groups. The results suggest that the garlic viruses described here belong to an unclassified and new virus group closely related to the carlaviruses.
Gao, J; Naglich, J G; Laidlaw, J; Whaley, J M; Seizinger, B R; Kley, N
1995-02-15
The human von Hippel-Lindau disease (VHL) gene has recently been identified and, based on the nucleotide sequence of a partial cDNA clone, has been predicted to encode a novel protein with as yet unknown functions [F. Latif et al., Science (Washington DC), 260: 1317-1320, 1993]. The length of the encoded protein and the characteristics of the cellular expressed protein are as yet unclear. Here we report the cloning and characterization of a mouse gene (mVHLh1) that is widely expressed in different mouse tissues and shares high homology with the human VHL gene. It predicts a protein 181 residues long (and/or 162 amino acids, considering a potential alternative start codon), which across a core region of approximately 140 residues displays a high degree of sequence identity (98%) to the predicted human VHL protein. High stringency DNA and RNA hybridization experiments and protein expression analyses indicate that this gene is the most highly VHL-related mouse gene, suggesting that it represents the mouse VHL gene homologue rather than a related gene sharing a conserved functional domain. These findings provide new insights into the potential organization of the VHL gene and nature of its encoded protein.
Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza
2015-01-01
Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002
Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza
2015-06-01
Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus . Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research ( FSHR and LHR ) as well as reproduction-linked polymorphisms and breeding programs.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes
NASA Astrophysics Data System (ADS)
Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat
2016-11-01
In this study, bioinformatic analysis towards genome sequence of E. aerogenes was done to determine gene encoded for gelatinase. Enterobacter aerogenes was isolated from hot spring water and gelatinase species-specific bacterium to porcine and fish gelatin. This bacterium offers the possibility of enzymes production which is specific to both species gelatine, respectively. Enterobacter aerogenes was partially genome sequenced resulting in 5.0 mega basepair (Mbp) total size of sequence. From pre-process pipeline, 87.6 Mbp of total reads, 68.8 Mbp of total high quality reads and 78.58 percent of high quality percentage was determined. Genome assembly produced 120 contigs with 67.5% of contigs over 1 kilo base pair (kbp), 124856 bp of N50 contig length and 55.17 % of GC base content percentage. About 4705 protein gene was identified from protein prediction analysis. Two candidate genes selected have highest similarity identity percentage against gelatinase enzyme available in Swiss-Prot and NCBI online database. They were NODE_9_length_26866_cov_148.013245_12 containing 1029 base pair (bp) sequence with 342 amino acid sequence and NODE_24_length_155103_cov_177.082458_62 which containing 717 bp sequence with 238 amino acid sequence, respectively. Thus, two paired of primers (forward and reverse) were designed, based on the open reading frame (ORF) of selected genes. Genome analysis of E. aerogenes resulting genes encoded gelatinase were identified.
Antioxidant Activity of Oxygen Evolving Enhancer Protein 1 Purified from Capsosiphon fulvescens.
Kim, Eun-Young; Choi, Youn Hee; Lee, Jung Im; Kim, In-Hye; Nam, Taek-Jeong
2015-06-01
This study was conducted to determine the antioxidant activity of a protein purified from Capsosiphon fulvescens. The purification steps included sodium acetate (pH 6) extraction and diethylaminoethyl-cellulose, reversed phase Shodex C4P-50 column chromatography. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis indicated that the molecular weight of the purified protein was 33 kDa. The N-terminus and partial peptide amino acid sequence of this protein was identical to the sequence of oxygen evolving enhancer (OEE) 1 protein. The antioxidant activity of the OEE 1 was determined in vitro using a scavenging test with 4 types of reactive oxygen species (ROS), including the 2,2-diphenyl-1-picrylhydrazyl radical, hydroxyl radical, superoxide anion, and hydrogen peroxide (H2 O2 ). OEE 1 had higher H2 O2 scavenging activity, which proved to be the result of enzymatic antioxidants rather than nonenzymatic antioxidants. In addition, OEE 1 showed less H2 O2 -mediated ROS formation in HepG2 cells. In conclusion, this study demonstrates that OEE 1 purified from C. fulvescens is an excellent antioxidant. © 2015 Institute of Food Technologists®
Exaptation of Bornavirus-Like Nucleoprotein Elements in Afrotherians
Kobayashi, Yuki; Horie, Masayuki; Nakano, Ayumi; Murata, Koichi; Itou, Takuya; Suzuki, Yoshiyuki
2016-01-01
Endogenous bornavirus-like nucleoprotein elements (EBLNs), the nucleotide sequence elements derived from the nucleoprotein gene of ancient bornavirus-like viruses, have been identified in many animal genomes. Here we show evidence that EBLNs encode functional proteins in their host. Some afrotherian EBLNs were observed to have been maintained for more than 83.3 million years under negative selection. Splice variants were expressed from the genomic loci of EBLNs in elephant, and some were translated into proteins. The EBLN proteins appeared to be localized to the rough endoplasmic reticulum in African elephant cells, in contrast to the nuclear localization of bornavirus N. These observations suggest that afrotherian EBLNs have acquired a novel function in their host. Interestingly, genomic sequences of the first exon and its flanking regions in these EBLN loci were homologous to those of transmembrane protein 106B (TMEM106B). The upstream region of the first exon in the EBLN loci exhibited a promoter activity, suggesting that the ability of these EBLNs to be transcribed in the host cell was gained through capturing a partial duplicate of TMEM106B. In conclusion, our results strongly support for exaptation of EBLNs to encode host proteins in afrotherians. PMID:27518265
Jaw1/LRMP has a role in maintaining nuclear shape via interaction with SUN proteins.
Kozono, Takuma; Tadahira, Kazuko; Okumura, Wataru; Itai, Nao; Tamura-Nakano, Miwa; Dohi, Taeko; Tonozuka, Takashi; Nishikawa, Atsushi
2018-06-06
Jaw1/LRMP is characterized as a type II integral membrane protein that is localized to endoplasmic reticulum (ER), however, its physiological functions have been poorly understood. An alignment of amino acid sequence of Jaw1 with KASH proteins, outer nuclear membrane proteins, revealed that Jaw1 has a partial homology to the KASH domain. Here, we show that the function of Jaw1 is to maintain nuclear shape in mouse melanoma cell line. The siRNA-mediated knockdown of Jaw1 caused a severe defect in nuclear shape, and the defect was rescued by ectopic expression of siRNA-resistant Jaw1. Since co-immunoprecipitation assay indicates that Jaw1 interacts with SUN proteins that are inner nuclear proteins and microtubules, this study suggests that Jaw1 has a role in maintaining nuclear shape via interactions with SUN proteins and microtubules.
Liu, Min; Zhang, Zhongqi; Zang, Tianzhu; Spahr, Chris; Cheetham, Janet; Ren, Da; Sunny Zhou, Zhaohui
2013-01-01
Characterization of protein crosslinking, particularly without prior knowledge of the chemical nature and site of crosslinking, poses a significant challenge due to their intrinsic structural complexity and the lack of a comprehensive analytical approach. Towards this end, we have developed a generally applicable workflow—XChem-Finder that involves four stages. (1) Detection of crosslinked peptides via 18O-labeling at C-termini. (2) Determination of the putative partial sequences of each crosslinked peptide pair using a fragment ion mass database search against known protein sequences coupled with a de novo sequence tag search. (3) Extension to full sequences based on protease specificity, the unique combination of mass, and other constraints. (4) Deduction of crosslinking chemistry and site. The mass difference between the sum of two putative full-length peptides and the crosslinked peptide provides the formulas (elemental composition analysis) for the functional groups involved in each cross- linking. Combined with sequence restraint from MS/MS data, plausible crosslinking chemistry and site were inferred, and ultimately, confirmed by matching with all data. Applying our approach to a stressed IgG2 antibody, ten cross-linked peptides were discovered and found to be connected via thioether originating from disulfides at locations that had not been previously recognized. Furthermore, once the crosslink chemistry was revealed, a targeted crosslink search yielded four additional crosslinked peptides that all contain the C-terminus of the light chain. PMID:23634697
Characterization of a novel ADAM protease expressed by Pneumocystis carinii.
Kennedy, Cassie C; Kottom, Theodore J; Limper, Andrew H
2009-08-01
Pneumocystis species are opportunistic fungal pathogens that cause severe pneumonia in immunocompromised hosts. Recent evidence has suggested that unidentified proteases are involved in Pneumocystis life cycle regulation. Proteolytically active ADAM (named for "a disintegrin and metalloprotease") family molecules have been identified in some fungal organisms, such as Aspergillus fumigatus and Schizosaccharomyces pombe, and some have been shown to participate in life cycle regulation. Accordingly, we sought to characterize ADAM-like molecules in the fungal opportunistic pathogen, Pneumocystis carinii (PcADAM). After an in silico search of the P. carinii genomic sequencing project identified a 329-bp partial sequence with homology to known ADAM proteins, the full-length PcADAM sequence was obtained by PCR extension cloning, yielding a final coding sequence of 1,650 bp. Sequence analysis detected the presence of a typical ADAM catalytic active site (HEXXHXXGXXHD). Expression of PcADAM over the Pneumocystis life cycle was analyzed by Northern blot. Southern and contour-clamped homogenous electronic field blot analysis demonstrated its presence in the P. carinii genome. Expression of PcADAM was observed to be increased in Pneumocystis cysts compared to trophic forms. The full-length gene was subsequently cloned and heterologously expressed in Saccharomyces cerevisiae. Purified PcADAMp protein was proteolytically active in casein zymography, requiring divalent zinc. Furthermore, native PcADAMp extracted directly from freshly isolated Pneumocystis organisms also exhibited protease activity. This is the first report of protease activity attributable to a specific, characterized protein in the clinically important opportunistic fungal pathogen Pneumocystis.
Games, Patrícia Dias; daSilva, Elói Quintas Gonçalves; Barbosa, Meire de Oliveira; Almeida-Souza, Hebréia Oliveira; Fontes, Patrícia Pereira; deMagalhães, Marcos Jorge; Pereira, Paulo Roberto Gomes; Prates, Maura Vianna; Franco, Gloria Regina; Faria-Campos, Alessandra; Campos, Sérgio Vale Aguiar; Baracat-Pereira, Maria Cristina
2016-12-15
Antimicrobial peptides from plants present mechanisms of action that are different from those of conventional defense agents. They are under-explored but have a potential as commercial antimicrobials. Bell pepper leaves ('Magali R') are discarded after harvesting the fruit and are sources of bioactive peptides. This work reports the isolation by peptidomics tools, and the identification and partially characterization by computational tools of an antimicrobial peptide from bell pepper leaves, and evidences the usefulness of records and the in silico analysis for the study of plant peptides aiming biotechnological uses. Aqueous extracts from leaves were enriched in peptide by salt fractionation and ultrafiltration. An antimicrobial peptide was isolated by tandem chromatographic procedures. Mass spectrometry, automated peptide sequencing and bioinformatics tools were used alternately for identification and partial characterization of the Hevein-like peptide, named HEV-CANN. The computational tools that assisted to the identification of the peptide included BlastP, PSI-Blast, ClustalOmega, PeptideCutter, and ProtParam; conventional protein databases (DB) as Mascot, Protein-DB, GenBank-DB, RefSeq, Swiss-Prot, and UniProtKB; specific for peptides DB as Amper, APD2, CAMP, LAMPs, and PhytAMP; other tools included in ExPASy for Proteomics; The Bioactive Peptide Databases, and The Pepper Genome Database. The HEV-CANN sequence presented 40 amino acid residues, 4258.8 Da, theoretical pI-value of 8.78, and four disulfide bonds. It was stable, and it has inhibited the growth of phytopathogenic bacteria and a fungus. HEV-CANN presented a chitin-binding domain in their sequence. There was a high identity and a positive alignment of HEV-CANN sequence in various databases, but there was not a complete identity, suggesting that HEV-CANN may be produced by ribosomal synthesis, which is in accordance with its constitutive nature. Computational tools for proteomics and databases are not adjusted for short sequences, which hampered HEV-CANN identification. The adjustment of statistical tests in large databases for proteins is an alternative to promote the significant identification of peptides. The development of specific DB for plant antimicrobial peptides, with information about peptide sequences, functional genomic data, structural motifs and domains of molecules, functional domains, and peptide-biomolecule interactions are valuable and necessary.
Lindeberg, M; Collmer, A
1992-01-01
Many extracellular proteins produced by Erwinia chrysanthemi require the out gene products for transport across the outer membrane. In a previous report (S. Y. He, M. Lindeberg, A. K. Chatterjee, and A. Collmer, Proc. Natl. Acad. Sci. USA 88:1079-1083, 1991) cosmid pCPP2006, sufficient for secretion of Erwinia chrysanthemi extracellular proteins by Escherichia coli, was partially sequenced, revealing four out genes sharing high homology with pulH through pulK from Klebsiella oxytoca. The nucleotide sequence of eight additional out genes reveals homology with pulC through pulG, pulL, pulM, pulO, and other genes involved in secretion by various gram-negative bacteria. Although signal sequences and hydrophobic regions are generally conserved between Pul and Out proteins, four out genes contain unique inserts, a pulN homolog is not present, and outO appears to be transcribed separately from outC through outM. The sequenced region was subcloned, and an additional 7.6-kb region upstream was identified as being required for secretion in E. coli. out gene homologs were found on Erwinia carotovora cosmid clone pAKC651 but were not detected in E. coli. The outC-through-outM operon is weakly induced by polygalacturonic acid and strongly expressed in the early stationary phase. The out and pul genes are highly similar in sequence, hydropathic properties, and overall arrangement but differ in both transcriptional organization and the nature of their induction. Images PMID:1429461
Rezaei Far, A; Peighambari, S M; Pourbakhsh, S A; Ashtari, A; Soltani, M
2017-02-01
Pigeons are considered as one of the major natural reservoirs in the epidemiology of Newcastle disease (ND). In this study, the partial sequence of fusion protein gene of 17 pigeon-origin ND viruses (NDVs) isolated during 2012-2013 in Iran was analysed. Since the studied isolates showed F0 protein cleavage sites compatible with velogenic NDVs, all were considered as virulent NDVs. Two isolates carried 112RRQKRF117 as the cleavage site motif, whereas the rest demonstrated 112KRQKRF117 motif which just recently has been reported among Iranian virulent NDVs. Phylogenetic analysis divided all these diverse isolates in two distinct clusters within class II genotype VI. Based on the partial fusion protein gene sequence, 15 out of 17 isolates showed the highest genetic identity to subgenotype VIb/2 and the other two isolates were placed in a distinct genetic group of genotype VI. Based on recent findings, at least two different sublineages of genotype VI are causing the ND outbreaks in the pigeon population and are circulating simultaneously along with virulent NDVs of genotype VII in various species in Iran. The continuing circulation of a diverse group of virulent NDVs as an enzootic in widespread species such as pigeon can cause outbreaks in commercial poultry flocks and also failure in controlling programmes. Therefore, the constant monitoring and awareness of the virus characteristics should be considered in controlling programmes against ND in Iran.
Lee, Ra Mi; Ryu, Rae Hyung; Jeong, Seong Won; Oh, Soo Jin; Huang, Hue; Han, Jin Soo; Lee, Chi Ho; Lee, C. Justin; Jan, Lily Yeh
2011-01-01
To clone the first anion channel from Xenopus laevis (X. laevis), we isolated a calcium-activated chloride channel (CLCA)-like membrane protein 6 gene (CMP6) in X. laevis. As a first step in gene isolation, an expressed sequence tags database was screened to find the partial cDNA fragment. A putative partial cDNA sequence was obtained by comparison with rat CLCAs identified in our laboratory. First stranded cDNA was synthesized by reverse transcription polymerase-chain reaction (RT-PCR) using a specific primer designed for the target cDNA. Repeating the 5' and 3' rapid amplification of cDNA ends, full-length cDNA was constructed from the cDNA pool. The full-length CMP6 cDNA completed via 5'- and 3'-RACE was 2,940 bp long and had an open reading frame (ORF) of 940 amino acids. The predicted 940 polypeptides have four major transmembrane domains and showed about 50% identity with that of rat brain CLCAs in our previously published data. Semi-quantification analysis revealed that CMP6 was most abundantly expressed in small intestine, colon and liver. However, all tissues except small intestine, colon and liver had undetectable levels. This result became more credible after we did real-time PCR quantification for the target gene. In view of all CLCA studies focused on human or murine channels, this finding suggests a hypothetical protein as an ion channel, an X. laevis CLCA. PMID:21826170
Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning
2007-10-18
Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at http://www.ebi.ac.uk/Tools/picr.
Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning
2007-01-01
Background Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. Results We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. Conclusion We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at . PMID:17945017
Investigations on the ORF 167L of lymphocystis disease virus (Iridoviridae).
Essbauer, Sandra; Fischer, Uwe; Bergmann, Sven; Ahne, Winfried
2004-01-01
The predicted open reading frame 167L (ORF 167L) of lymphocystis disease virus (LCDV, Iridoviridae ) isolated from plaice, dab and flounder was investigated. The ORF 167L corresponding genes of the three LCDV isolates were amplified, cloned and sequenced. A comparison of the LCDV strains showed that the nucleotide sequence of ORF 167L and its deduced amino acid sequence were highly conserved in the genus lymphocystivirus (a homology of 80% in dab and flounder/plaice, 97% in plaice and flounder). The N-terminus protein predicted from the ORF 167L suggests similarities to the tumor necrosis factor receptor (TNFR)-family, and to TNFR-like proteins, which play an important role in various poxvirus species. Further, homology to the CUB-domain was shown at the C-terminus of the LCDV protein. Phylogenetic analyses of partial LCDV protein sequences identified two clusters: one cluster containing the flounder and plaice LCDV isolate (LCDV-1), and another cluster, containing the dab LCDV isolate (LCDV-2). The ORF 167L of plaice LCDV was expressed in Escherichia coli, and in fish cells. The expressed ORF resulted in a 30-kDa cytoplasmic protein lacking a signal peptide. An established monoclonal antibody (mAb 18) was used to detect LCDV proteins in skin explants of flounders and cryosections of dab skin. Specific fluorescence was found in the cytoplasm of intact epitheloid cells of the lymphocystis capsule and in the epidermis skin covering the lymphocystic nodules. LCDV-specific labelling of mAb 18 was also shown in spleen and liver tissue of LCDV-positive flounders. The ORF 167L protein seemed not to have the extracellular receptor function predicted from the usual cellular TNFR. The myxomavirus M-T2 protein, a poxviral TNFR homologue, was also shown not to have TNFR-like functions but to be involved in the apoptosis signal cascade.
Gross deletions in TCOF1 are a cause of Treacher–Collins–Franceschetti syndrome
Bowman, Michael; Oldridge, Michael; Archer, Caroline; O'Rourke, Anthony; McParland, Joanna; Brekelmans, Roel; Seller, Anneke; Lester, Tracy
2012-01-01
Treacher–Collins–Franceschetti syndrome (TCS) is an autosomal dominant craniofacial disorder characterised by midface hypoplasia, micrognathia, downslanting palpebral fissures, eyelid colobomata, and ear deformities that often lead to conductive deafness. A total of 182 patients with signs consistent with a diagnosis of TCS were screened by DNA sequence and dosage analysis of the TCOF1 gene. In all, 92 cases were found to have a pathogenic mutation by sequencing and 5 to have a partial gene deletion. A further case had a novel in-frame deletion in the alternatively spliced exon 6A of uncertain pathogenicity. The majority of the pathogenic sequence changes were found to predict premature protein termination, however, four novel missense changes in the LIS1 homology motif at the 5′ end of the gene were identified. The partial gene deletions of different sizes represent ∼5.2% of all the pathogenic TCOF1 mutations identified, indicating that gene rearrangements account for a significant proportion of TCS cases. This is the first report of gene rearrangements resulting in TCS. These findings expand the TCOF1 mutation spectrum indicating that dosage analysis should be performed together with sequence analysis, a strategy that is predicted to have a sensitivity of 71% for patients in whom TCS is strongly suspected. PMID:22317976
Gross deletions in TCOF1 are a cause of Treacher-Collins-Franceschetti syndrome.
Bowman, Michael; Oldridge, Michael; Archer, Caroline; O'Rourke, Anthony; McParland, Joanna; Brekelmans, Roel; Seller, Anneke; Lester, Tracy
2012-07-01
Treacher-Collins-Franceschetti syndrome (TCS) is an autosomal dominant craniofacial disorder characterised by midface hypoplasia, micrognathia, downslanting palpebral fissures, eyelid colobomata, and ear deformities that often lead to conductive deafness. A total of 182 patients with signs consistent with a diagnosis of TCS were screened by DNA sequence and dosage analysis of the TCOF1 gene. In all, 92 cases were found to have a pathogenic mutation by sequencing and 5 to have a partial gene deletion. A further case had a novel in-frame deletion in the alternatively spliced exon 6A of uncertain pathogenicity. The majority of the pathogenic sequence changes were found to predict premature protein termination, however, four novel missense changes in the LIS1 homology motif at the 5' end of the gene were identified. The partial gene deletions of different sizes represent ~5.2% of all the pathogenic TCOF1 mutations identified, indicating that gene rearrangements account for a significant proportion of TCS cases. This is the first report of gene rearrangements resulting in TCS. These findings expand the TCOF1 mutation spectrum indicating that dosage analysis should be performed together with sequence analysis, a strategy that is predicted to have a sensitivity of 71% for patients in whom TCS is strongly suspected.
Li, Ming; Wang, Rui; Zhao, Dahe; Xiang, Hua
2014-01-01
The clustered regularly interspaced short palindromic repeat (CRISPR)-Cas system mediates adaptive immunity against foreign nucleic acids in prokaryotes. However, efficient adaptation of a native CRISPR to purified viruses has only been observed for the type II-A system from a Streptococcus thermophilus industry strain, and rarely reported for laboratory strains. Here, we provide a second native system showing efficient adaptation. Infected by a newly isolated virus HHPV-2, Haloarcula hispanica type I-B CRISPR system acquired spacers discriminatively from viral sequences. Unexpectedly, in addition to Cas1, Cas2 and Cas4, this process also requires Cas3 and at least partial Cascade proteins, which are involved in interference and/or CRISPR RNA maturation. Intriguingly, a preexisting spacer partially matching a viral sequence is also required, and spacer acquisition from upstream and downstream sequences of its target sequence (i.e. priming protospacer) shows different strand bias. These evidences strongly indicate that adaptation in this system strictly requires a priming process. This requirement, if validated also true for other CRISPR systems as implied by our bioinformatic analysis, may help to explain failures to observe efficient adaptation to purified viruses in many laboratory strains, and the discrimination mechanism at the adaptation level that has confused scientists for years. PMID:24265226
A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.
Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C
2008-12-01
A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerated primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BbPA (Genbank Accession No. AY423440, named as podC). The full length cDNA is 1167bp long and contains a 1077bp open reading frame (ORF) encoding a 358 deduced amino acid peroxidase polypeptide. The putative peroxidase (BnPA) showed a calculated Mr of 34kDa, and isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA namely AtP29a (84%), and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feild, M.J.; Armstrong, F.B.
1987-05-01
E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (/sup 3/H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealedmore » limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.« less
He, Kui-Fang; Liu, Jian-Guo; Liu, Tian-Jia; Yang, De-Qin; Zhuang, Heng; Li, Song
2006-08-01
To analysis the homology among the extended-V region of the surface proteins in different serotype Streptococcus mutans (c, f, d, g) and to find out it's significance in anti-caries vaccine. The DNA of the bacteria (standarded serotype c, d, f, g and partial serotype c clinicals) was extracted and the extended-V region (SrV+, 1 384-2 514 bp) was amplified using polymerase chain reaction (PCR). Then the products were assessed using restriction fragment length polymorphism (RFLP) by endonuclease Dde I. The genotypings were sequenced and analysised using the program of BLAST on NCBI Gene Bank database. About 1.13 kb fragments were produced both in serotype c and f, the serotype d and g were failed. The RFLP results showed that five different patterns(A, B, C, D, E) among the 117 PCR products were reveled by Dde I. The ration of the genotypings A and B were the most among the strains, the C was lower, the D and E respectively was 1 and 3 strains per genotype. OMZ175 (serotype f) was belong to B genotype. Selected one of the A, B, C genotypings to sequenced and blasted. Then the results of the blastn showed that the identities of the gene sequence were 92%-98% between the serotype c and serotype f, part sequence of the serotype g was homology with the SrV+ of the serotype c, the protein sequence among serotype c, d, f, g were 77%-82%. It is reasonable to use some putative pipetides to study the anti-caries vaccine among the extended-V regions of the surface proteins in different serotype (c, d, f, g) in S. mutans.
The pig X and Y Chromosomes: structure, sequence, and evolution
Skinner, Benjamin M.; Sargent, Carole A.; Churcher, Carol; Hunt, Toby; Herrero, Javier; Loveland, Jane E.; Dunn, Matt; Louzada, Sandra; Fu, Beiyuan; Chow, William; Gilbert, James; Austin-Guest, Siobhan; Beal, Kathryn; Carvalho-Silva, Denise; Cheng, William; Gordon, Daria; Grafham, Darren; Hardy, Matt; Harley, Jo; Hauser, Heidi; Howden, Philip; Howe, Kerstin; Lachani, Kim; Ellis, Peter J.I.; Kelly, Daniel; Kerry, Giselle; Kerwin, James; Ng, Bee Ling; Threadgold, Glen; Wileman, Thomas; Wood, Jonathan M.D.; Yang, Fengtang; Harrow, Jen; Affara, Nabeel A.; Tyler-Smith, Chris
2016-01-01
We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes—both single copy and amplified—on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution. PMID:26560630
Tabachnick, M; Perret, V
1987-08-01
[125I] Thyroxine has been covalently bound to the thyroxine binding site in thyroxine-binding globulin by reaction with the bifunctional reagent, 1,5-difluoro-2,4-dinitrobenzene. An average of 0.47 mol of [125I] thyroxine was incorporated per mol protein; nonspecific binding amounted to 8%. A labeled peptide fragment was isolated from a proteolytic digest of the derivatized protein by HPLC and its amino acid composition was determined. Comparison with the amino acid sequence of thyroxine-binding globulin indicated partial correspondence of the labeled peptide with two possible regions in the protein. These regions also coincide with part of the barrel structure present in the closely homologous protein, alpha 1-antitrypsin.
Yadav, Pragya D; Vincent, Martin J; Khristova, Marina; Kale, Charuta; Nichol, Stuart T; Mishra, Akhilesh C; Mourya, Devendra T
2011-07-01
Nairobi sheep disease (NSD) virus, the prototype tick-borne virus of the genus Nairovirus, family Bunyaviridae is associated with acute hemorrhagic gastroenteritis in sheep and goats in East and Central Africa. The closely related Ganjam virus found in India is associated with febrile illness in humans and disease in livestock. The complete S, M and L segment sequences of Ganjam and NSD virus and partial sequence analysis of Ganjam viral RNA genome S, M and L segments encoding regions (396 bp, 701 bp and 425 bp) of the viral nucleocapsid (N), glycoprotein precursor (GPC) and L polymerase (L) proteins, respectively, was carried out for multiple Ganjam virus isolates obtained from 1954 to 2002 and from various regions of India. M segments of NSD and Ganjam virus encode a large ORF for the glycoprotein precursor (GPC), (1627 and 1624 amino acids in length, respectively) and their L segments encode a very large L polymerase (3991 amino acids). The complete S, M and L segments of NSD and Ganjam viruses were more closely related to one another than to other characterized nairoviruses, and no evidence of reassortment was found. However, the NSD and Ganjam virus complete M segment differed by 22.90% and 14.70%, for nucleotide and amino acid respectively, and the complete L segment nucleotide and protein differing by 9.90% and 2.70%, respectively among themselves. Ganjam and NSD virus, complete S segment differed by 9.40-10.40% and 3.2-4.10 for nucleotide and proteins while among Ganjam viruses 0.0-6.20% and 0.0-1.4%, variation was found for nucleotide and amino acids. Ganjam virus isolates differed by up to 17% and 11% at the nucleotide level for the partial S and L gene fragments, respectively, with less variation observed at the deduced amino acid level (10.5 and 2%, S and L, respectively). However, the virus partial M gene fragment (which encodes the hypervariable mucin-like domain) of these viruses differed by as much as 56% at the nucleotide level. Phylogenetic analysis of partial sequence differences suggests considerable mixing and movement of Ganjam virus strains within India, with no clear relationship between genetic lineages and virus geographic origin or year of isolation. Surprisingly, NSD virus does not represent a distinct lineage, but appears as a variant with other Ganjam virus among NSD virus group. Copyright © 2011 Elsevier B.V. All rights reserved.
Kottom, Theodore J; Limper, Andrew H
2011-10-01
Pneumocystis carinii (Pc) undergoes morphological transitions between cysts and trophic forms. We have previously described two Pc serine/threonine kinases, termed PcCbk1 and PcSte20, with PcSte20 belonging to a family of kinases involved in yeast mating, while PcCbk1 is a member of a group of protein kinases involved in regulation of cell cycle, shape, and proliferation. As Pc remains genetically intractable, knowledge on specific substrates phosphorylated by these kinases remains limited. Utilizing the phylogenetic relatedness of Pc to Saccharomyces cerevisiae, we interrogated a yeast proteome microarray containing >4000 purified protein based peptides, leading to the identification of 18 potential PcCbk1 and 15 PcSte20 substrates (Z-score > 3.0). A number of these potential protein substrates are involved in bud site selection, polarized growth, and response to mating α factor and pseudohyphal and invasive growth. Full-length open reading frames suggested by the PcCbk1 and PcSte20 protoarrays were amplified and expressed. These five proteins were used as substrates for PcCbk1 or PcSte20, with each being highly phosphorylated by the respective kinase. Finally, to demonstrate the utility of this method to identify novel PcCbk1 and PcSte20 substrates, we analysed DNA sequence data from the partially complete Pc genome database and detected partial sequence information of potential PcCbk1 kinase substrates PcPxl1 and PcInt1. We additionally identified the potential PcSte20 kinase substrate PcBdf2. Full-length Pc substrates were cloned and expressed in yeast, and shown to be phosphorylated by the respective Pc kinases. In conclusion, the yeast protein microarray represents a novel crossover technique for identifying unique potential Pc kinase substrates. Copyright © 2011 John Wiley & Sons, Ltd.
Russo, Roberta; Chiaramonte, Marco; Matranga, Valeria; Arizza, Vincenzo
2015-08-01
The innate immune response involves proteins such as the membrane receptors of the Toll-like family (TLRs), which trigger different intracellular signalling pathways that are dependent on specific stimulating molecules. In sea urchins, TLR proteins are encoded by members of a large multigenic family composed of 60-250 genes in different species. Here, we report a newly identified mRNA sequence encoding a TLR protein (referred to as Pl-Tlr) isolated from Paracentrotus lividus immune cells. The partial protein sequence contained the conserved Toll/IL-1 receptor (TIR) domain, the transmembrane domain and part of the leucine repeats. Phylogenetic analysis of the Pl-Tlr protein was accomplished by comparing its sequence with those of TLRs from different classes of vertebrates and invertebrates. This analysis was suggestive of an evolutionary path that most likely represented the course of millions of years, starting from simple organisms and extending to humans. Challenge of the sea urchin immune system with poly-I:C, a chemical compound that mimics dsRNA, caused time-dependent Pl-Tlr mRNA up-regulation that was detected by QPCR. In contrast, bacterial LPS injury did not affect Pl-Tlr transcription. The study of the Tlr genes in the sea urchin model system may provide new perspectives on the role of Tlrs in the invertebrate immune response and clues concerning their evolution in a changing world. Copyright © 2015 Elsevier Ltd. All rights reserved.
Can the HIV-1 splicing machinery be targeted for drug discovery?
Dlamini, Zodwa; Hull, Rodney
2017-01-01
HIV-1 is able to express multiple protein types and isoforms from a single 9 kb mRNA transcript. These proteins are also expressed at particular stages of viral development, and this is achieved through the control of alternative splicing and the export of these transcripts from the nucleus. The nuclear export is controlled by the HIV protein Rev being required to transport incompletely spliced and partially spliced mRNA from the nucleus where they are normally retained. This implies a close relationship between the control of alternate splicing and the nuclear export of mRNA in the control of HIV-1 viral proliferation. This review discusses both the processes. The specificity and regulation of splicing in HIV-1 is controlled by the use of specific splice sites as well as exonic splicing enhancer and exonic splicing silencer sequences. The use of these silencer and enhancer sequences is dependent on the serine arginine family of proteins as well as the heterogeneous nuclear ribonucleoprotein family of proteins that bind to these sequences and increase or decrease splicing. Since alternative splicing is such a critical factor in viral development, it presents itself as a promising drug target. This review aims to discuss the inhibition of splicing, which would stall viral development, as an anti-HIV therapeutic strategy. In this review, the most recent knowledge of splicing in human immunodeficiency viral development and the latest therapeutic strategies targeting human immunodeficiency viral splicing are discussed. PMID:28331370
Hambly, Emma; Tétart, Francoise; Desplats, Carine; Wilson, William H.; Krisch, Henry M.; Mann, Nicholas H.
2001-01-01
Sequence analysis of a 10-kb region of the genome of the marine cyanomyovirus S-PM2 reveals a homology to coliphage T4 that extends as a contiguous block from gene (g)18 to g23. The order of the S-PM2 genes in this region is similar to that of T4, but there are insertions and deletions of small ORFs of unknown function. In T4, g18 codes for the tail sheath, g19, the tail tube, g20, the head portal protein, g21, the prohead core protein, g22, a scaffolding protein, and g23, the major capsid protein. Thus, the entire module that determines the structural components of the phage head and contractile tail is conserved between T4 and this cyanophage. The significant differences in the morphology of these phages must reflect the considerable divergence of the amino acid sequence of their homologous virion proteins, which uniformly exceeds 50%. We suggest that their enormous diversity in the sea could be a result of genetic shuffling between disparate phages mediated by such commonly shared modules. These conserved sequences could facilitate genetic exchange by providing partially homologous substrates for recombination between otherwise divergent phage genomes. Such a mechanism would thus expand the pool of phage genes accessible by recombination to all those phages that share common modules. PMID:11553768
Dollerup, Pia; Thomsen, Troels Møller; Nejsum, Lene N; Færch, Mia; Österbrand, Martin; Gregersen, Niels; Rittig, Søren; Christensen, Jane H; Corydon, Thomas J
2015-12-29
Autosomal dominant inheritance of congenital nephrogenic diabetes insipidus (CNDI) is rare and usually caused by variations in the AQP2 gene. We have investigated the genetic and molecular background underlying symptoms of diabetes insipidus (DI) in a Swedish family with autosomal dominant inheritance of the condition. The proband and her father were subjected to water deprivation testing and direct DNA sequencing of the coding regions of the AQP2 and AVP genes. Madin-Darby canine kidney (MDCK) cells stably expressing AQP2 variant proteins were generated by lentiviral gene delivery. Localization of AQP2 variant proteins in the cells under stimulated and unstimulated conditions was analyzed by means of immunostaining and confocal laser scanning microscopy. Intracellular trafficking of AQP2 variant proteins was studied using transient expression of mutant dynamin2-K44A-GFP protein and AQP2 variant protein phosphorylation levels were assessed by Western blotting analysis. Clinical and genetic data suggest that the proband and her father suffer from partial nephrogenic DI due to a variation (g.4807C > T) in the AQP2 gene. The variation results in substitution of arginine-254 to tryptophan (p.R254W) in AQP2. Analysis of MDCK cells stably expressing AQP2 variant proteins revealed disabled phosphorylation, impaired trafficking and intracellular accumulation of AQP2-R254W protein. Notably, blocking of the endocytic pathway demonstrated impairment of AQP2-R254W to reach the cell surface. Partial CNDI in the Swedish family is caused by an AQP2 variation that seems to disable the encoded AQP2-R254W protein to reach the subapical vesicle population as well as impairing its phosphorylation at S256. The AQP2-R254W protein is thus unable to reach the plasma membrane to facilitate AVP mediated urine concentration.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jensen, B.A.; Hahn, M.E.
1995-12-31
The aryl hydrocarbon receptor (AhR) mediates the effects of many common and potentially toxic organic hydrocarbons, including some polychlorinated biphenyls and dioxins. Since small cetaceans often inhabit industrially polluted coastal waters, comparison of the molecular structure and function of this protein in cetaeans with other marine and mammalian species is important for evaluating the sensitivity of cetaceans to these pollutants. An AhR protein has been identified in beluga liver by photoaffinity labeling. In the present study, the authors sought to clone and sequence an AhR cDNA from beluga as a prelude to studying its structure and function, using reverse-transcription polymerasemore » chain reaction (RT-PCR) and degenerate primers, a 515 base pair fragment was amplified, cloned and sequenced, revealing homology to the PAS domain (ligand binding and dimerization region) of AhRs from terrestrial mammals. This portion of the putative beluga AhR has 82% amino acid and 81% nucleotide sequence identity to the mouse AhR, and 63% amino acid and 64% nucleotide sequence identity to an AhR from the marine fish Fundulus heteroclitus. A beluga cDNA library was synthesized and is currently being screened with the PCR-generated fragment to obtain the complete coding sequence. This is the first molecular evidence of AhR presence in cetaceans.« less
Predicting human protein function with multi-task deep neural networks.
Fa, Rui; Cozzetto, Domenico; Wan, Cen; Jones, David T
2018-01-01
Machine learning methods for protein function prediction are urgently needed, especially now that a substantial fraction of known sequences remains unannotated despite the extensive use of functional assignments based on sequence similarity. One major bottleneck supervised learning faces in protein function prediction is the structured, multi-label nature of the problem, because biological roles are represented by lists of terms from hierarchically organised controlled vocabularies such as the Gene Ontology. In this work, we build on recent developments in the area of deep learning and investigate the usefulness of multi-task deep neural networks (MTDNN), which consist of upstream shared layers upon which are stacked in parallel as many independent modules (additional hidden layers with their own output units) as the number of output GO terms (the tasks). MTDNN learns individual tasks partially using shared representations and partially from task-specific characteristics. When no close homologues with experimentally validated functions can be identified, MTDNN gives more accurate predictions than baseline methods based on annotation frequencies in public databases or homology transfers. More importantly, the results show that MTDNN binary classification accuracy is higher than alternative machine learning-based methods that do not exploit commonalities and differences among prediction tasks. Interestingly, compared with a single-task predictor, the performance improvement is not linearly correlated with the number of tasks in MTDNN, but medium size models provide more improvement in our case. One of advantages of MTDNN is that given a set of features, there is no requirement for MTDNN to have a bootstrap feature selection procedure as what traditional machine learning algorithms do. Overall, the results indicate that the proposed MTDNN algorithm improves the performance of protein function prediction. On the other hand, there is still large room for deep learning techniques to further enhance prediction ability.
Moon, Jaewoong; Liu, Z Lewis
2015-04-01
The aldehyde reductase gene ARI1 is a recently characterized member of an intermediate subfamily within the short-chain dehydrogenase/reductase (SDR) superfamily that clarified mechanisms of in situ detoxification of 2-furaldehyde and 5-hydroxymethyl-2-furaldehyde by Saccharomyces cerevisiae. Uncharacterized open reading frames (ORFs) are common among tolerant candidate genes identified for lignocellulose-to-advanced biofuels conversion. This study presents partially purified proteins of two ORFs, YDR541C and YGL039W, and direct enzyme assay evidence against aldehyde-inhibitory compounds commonly encountered during lignocellulosic biomass fermentation processes. Each of the partially purified proteins encoded by these ORFs showed a molecular mass of approximately 38 kDa, similar to Ari1p, a protein encoded by aldehyde reductase gene. Both proteins demonstrated strong aldehyde reduction activities toward 14 aldehyde substrates, with high levels of reduction activity for Ydr541cp toward both aromatic and aliphatic aldehydes. While Ydr541cp was observed to have a significantly higher specific enzyme activity at 20 U/mg using co-factor NADPH, Ygl039wp displayed a NADH preference at 25 U/mg in reduction of butylaldehyde. Amino acid sequence analysis identified a characteristic catalytic triad, Ser, Tyr and Lys; a conserved catalytic motif of Tyr-X-X-X-Lys; and a cofactor-binding sequence motif, Gly-X-X-Gly-X-X-Ala, near the N-terminus that are shared by Ydr541cp, Ygl039wp, Yol151wp/GRE2 and Ari1p. Findings of aldehyde reductase genes contribute to the yeast gene annotation and aids development of the next-generation biocatalyst for advanced biofuels production. Copyright © 2015 John Wiley & Sons, Ltd.
Andreani, Julien; Verneau, Jonathan; Raoult, Didier; Levasseur, Anthony; La Scola, Bernard
2018-04-10
Nucleo-cytoplasmic large DNA viruses are doubled stranded DNA viruses capable of infecting eukaryotic cells. Since the discovery of Mimivirus and Pandoravirus, there has been no doubt about their extraordinary features compared to "classic" viruses. Recently, we reported the expansion of the proposed family Pithoviridae, with the description of Cedratvirus and Orpheovirus, two new viruses related to Pithoviruses. Studying the major capsid protein of Orpheovirus, we detected a homologous sequence in a mine drainage metagenome. The in-depth exploration of this metagenome, using the MG-Digger program, enabled us to retrieve up to 10 contigs with clear evidence of viral sequences. Moreover, phylogenetic analyses further extended our screening with the discovery in another marine metagenome of a second virus closely related to Orpheovirus IHUMI-LCC2. This virus is a misidentified virus confused with and annotated as a Rickettsiales bacterium. It presents a partial genome size of about 170 kbp.
Petersen, M; Sander, L; Child, R; van Onckelen, H; Ulvskov, P; Borkhardt, B
1996-06-01
Seven distinct partial cDNAs, similar in sequence to previously described polygalacturonases (PGs), were amplified from cDNA derived from rape pod wall, dehiscence zone and leaves by the polymerase chain reaction. Northern analysis showed that one clone, PG35-8, was expressed at low levels in the dehiscence zone during the first five weeks after anthesis but was very abundantly expressed at week 6. In contrast, no PG35-8-related RNA was detected in the pod wall. Our data suggest that there are temporal and spatial correlations between the breakdown of the middle lamella, of the dehiscence zone cells and the pattern of synthesis of PG35-8 transcripts which may indicate a role for this particular PG in rape pod dehiscence. PG35-8 was used to isolate five cDNA clones from a rape dehiscence zone cDNA library. Restriction enzyme analysis and partial sequencing revealed that they were derived from four highly homologous transcripts which are probably allelic forms of a single gene. One full-length clone, RDPG1, was completely sequenced. The predicted protein of RDPG1 showed its highest identity with PG from apple fruit with an identity of 52%.
Brambila-Tapia, Aniel Jessica Leticia; Perez-Rueda, Ernesto; Barrios, Humberto; Dávalos-Rodríguez, Nory Omayra; Dávalos-Rodríguez, Ingrid Patricia; Cardona-Muñoz, Ernesto Germán; Salazar-Páramo, Mario
2017-08-01
A systematic analysis of beta-lactamases based on comparative proteomics has not been performed thus far. In this report, we searched for the presence of beta-lactam-related proteins in 591 bacterial proteomes belonging to 52 species that are pathogenic to humans. The amino acid sequences for 19 different types of beta-lactamases (ACT, CARB, CifA, CMY, CTX, FOX, GES, GOB, IMP, IND, KPC, LEN, OKP, OXA, OXY, SHV, TEM, NDM, and VIM) were obtained from the ARG-ANNOT database and were used to construct 19 HMM profiles, which were used to identify potential beta-lactamases in the completely sequenced bacterial proteomes. A total of 2877 matches that included the word "beta-lactamase" and/or "penicillin" in the functional annotation and/or in any of its regions were obtained. These enzymes were mainly described as "penicillin-binding proteins," "beta-lactamases," and "metallo-beta-lactamases" and were observed in 47 of the 52 species studied. In addition, proteins classified as "beta-lactamases" were observed in 39 of the species included. A positive correlation between the number of beta-lactam-related proteins per species and the proteome size was observed (R 0.78, P < 0.00001). This correlation partially explains the high presence of beta-lactam-related proteins in large proteomes, such as Nocardia brasiliensis, Bacillus anthracis, and Mycobacterium tuberculosis, along with their absence in small proteomes, such as Chlamydia spp. and Mycoplasma spp. We detected only five types of beta-lactamases (TEM, SHV, CTX, IMP, and OXA) and other related proteins in particular species that corresponded with those reported in the literature. We additionally detected other potential species-specific beta-lactamases that have not yet been reported. In the future, better results will be achieved due to more accurate sequence annotations and a greater number of sequenced genomes.
Genome-Scale Phylogeny of the Alphavirus Genus Suggests a Marine Origin
Palacios, G.; Tesh, R. B.; Savji, N.; Guzman, H.; Sherman, M.; Weaver, S. C.; Lipkin, W. I.
2012-01-01
The genus Alphavirus comprises a diverse group of viruses, including some that cause severe disease. Using full-length sequences of all known alphaviruses, we produced a robust and comprehensive phylogeny of the Alphavirus genus, presenting a more complete evolutionary history of these viruses compared to previous studies based on partial sequences. Our phylogeny suggests the origin of the alphaviruses occurred in the southern oceans and spread equally through the Old and New World. Since lice appear to be involved in aquatic alphavirus transmission, it is possible that we are missing a louse-borne branch of the alphaviruses. Complete genome sequencing of all members of the genus also revealed conserved residues forming the structural basis of the E1 and E2 protein dimers. PMID:22190718
Jiménez, Diego Javier; Dini-Andreote, Francisco; Ottoni, Júlia Ronzella; de Oliveira, Valéria Maia; van Elsas, Jan Dirk; Andreote, Fernando Dini
2015-01-01
The occurrence of genes encoding biotechnologically relevant α/β-hydrolases in mangrove soil microbial communities was assessed using data obtained by whole-metagenome sequencing of four mangroves areas, denoted BrMgv01 to BrMgv04, in São Paulo, Brazil. The sequences (215 Mb in total) were filtered based on local amino acid alignments against the Lipase Engineering Database. In total, 5923 unassembled sequences were affiliated with 30 different α/β-hydrolase fold superfamilies. The most abundant predicted proteins encompassed cytosolic hydrolases (abH08; ∼ 23%), microsomal hydrolases (abH09; ∼ 12%) and Moraxella lipase-like proteins (abH04 and abH01; < 5%). Detailed analysis of the genes predicted to encode proteins of the abH08 superfamily revealed a high proportion related to epoxide hydrolases and haloalkane dehalogenases in polluted mangroves BrMgv01-02-03. This suggested selection and putative involvement in local degradation/detoxification of the pollutants. Seven sequences that were annotated as genes for putative epoxide hydrolases and five for putative haloalkane dehalogenases were found in a fosmid library generated from BrMgv02 DNA. The latter enzymes were predicted to belong to Actinobacteria, Deinococcus-Thermus, Planctomycetes and Proteobacteria. Our integrated approach thus identified 12 genes (complete and/or partial) that may encode hitherto undescribed enzymes. The low amino acid identity (< 60%) with already-described genes opens perspectives for both production in an expression host and genetic screening of metagenomes. PMID:25171437
Homoeologous cloning of omega-secalin gene family in a wheat 1BL/1RS translocation.
Chai, Jian Fang; Liu, Xu; Jia, Ji Zeng
2005-08-01
Wheat 1BL/1RS translocations are widely planted in China as well as in most of the wheat producing area in the world for their good qualities of disease resistance and high yield. 1BL/1RS translocations are however poor in bread making, partially caused by a family of small monomeric proteins, omega-secalins, which are encoded by genes on 1RS. Based on published sequence of a rye omega-secalin gene we designed a pair of primers to cover the whole mature protein coding sequence. A major band could be amplified from 1BL/1RS translocations but not from euploid wheat. Using this primer set we conducted PCR amplification by using high fidelity Pfu polymerase on the genomic DNAs and cDNAs purified from a 1BL/1RS translocation Lankao 906. Sequencing analysis indicated that this gene family contains several members of 1150 bp, 1076 bp, 1075 bp, 1052 bp and 1004 bp genes, including two pseudogenes and three active genes. The gene transcripts were differentially expressed in developing seeds.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sugaya, K.; Fukagawa, T.; Matsumoto, K.
Cosmid walking of about 250 kb from MHC class III gene CYP21 to class II was conducted. The gene for receptor of advanced glycosylation end products of proteins (RAGE, a member of immunoglobulin super-family molecules), the PBX2 homeobox gene designated HOX12, and the human counterpart of the mouse mammary tumor gene int-3 were found. The contiguous RAGE and HOX12 genes were completely sequenced, and the human int-3 counterpart was partially sequenced and assigned to a Notch homolog. This human Notch homolog, designated NOTCH3, showed both the intracellular portion present in the mouse int-3 sequence and the extracellular portion absent inmore » the int-3. It thus corresponds to the intact form of a Notch-type transmembrane protein. About 20 kb of dense Alu clustering was found just centromeric to the NOTCH3. 48 refs., 9 figs., 2 tabs.« less
A conserved 19-kDa Eimeria tenella antigen is a profilin-like protein.
Fetterer, R H; Miska, K B; Jenkins, M C; Barfield, R C
2004-12-01
A wide range of recombinant proteins from Eimeria species have been reported to offer some degree of protection against infection and disease, but the specific biological function of these proteins is largely unknown. Previous studies have demonstrated a 19-kDa protein of unknown function designated SZ-1 in sporozoites and merozoites of Eimeria acervulina that can be used to confer partial protection against coccidiosis. Reverse transcriptase-polymerase chain reaction indicated that the gene for SZ-1 is expressed by all the asexual stages of Eimeria tenella. Rabbit antisera to recombinant SZ-1 recognized an approximately 19-kDa protein from extracts of E. tenella sporozoites, merozoites, sporulated oocysts, and oocysts in various stages of sporulation. Immunofluorescence antibody staining indicated specific staining of E. tenella sporozoites and merozoites. Staining was most intense in the cytoplasm of the posterior end of the parasite. The primary amino acid sequence of the gene for E. tenella SZ-1 deduced from the E. tenella genome indicated a conserved domain for the actin-regulatory protein profilin. A conserved binding site for poly-L-proline (PLP), characteristic of profilin was also observed. SZ-1 was separated from soluble extract of E. tenella proteins by affinity chromatography using a PLP ligand, confirming the ability of SZ-1 to bind PLP. SZ-1 also partially inhibited the polymerization of actin. The current results are consistent with the classification of SZ-1 as a profilin-related protein.
A Single Rainbow Trout Cobalamin-binding Protein Stands in for Three Human Binders
Greibe, Eva; Fedosov, Sergey; Sorensen, Boe S.; Højrup, Peter; Poulsen, Steen S.; Nexo, Ebba
2012-01-01
Cobalamin uptake and transport in mammals are mediated by three cobalamin-binding proteins: haptocorrin, intrinsic factor, and transcobalamin. The nature of cobalamin-binding proteins in lower vertebrates remains to be elucidated. The aim of this study was to characterize the cobalamin-binding proteins of the rainbow trout (Oncorhynchus mykiss) and to compare their properties with those of the three human cobalamin-binding proteins. High cobalamin-binding capacity was found in trout stomach (210 pmol/g), roe (400 pmol/g), roe fluid (390 nmol/liter), and plasma (2500 nmol/liter). In all cases, it appeared to be the same protein based on analysis of partial sequences and immunological responses. The trout cobalamin-binding protein was purified from roe fluid, sequenced, and further characterized. Like haptocorrin, the trout cobalamin-binding protein was stable at low pH and had a high binding affinity for the cobalamin analog cobinamide. Like haptocorrin and transcobalamin, the trout cobalamin-binding protein was present in plasma and recognized ligands with altered nucleotide moiety. Like intrinsic factors, the trout cobalamin-binding protein was present in the stomach and resisted degradation by trypsin and chymotrypsin. It also resembled intrinsic factor in the composition of conserved residues in the primary cobalamin-binding site in the C terminus. The trout cobalamin-binding protein was glycosylated and displayed spectral properties comparable with those of haptocorrin and intrinsic factor. In conclusion, only one soluble cobalamin-binding protein was identified in the rainbow trout, a protein that structurally behaves like an intermediate between the three human cobalamin-binding proteins. PMID:22872637
Biosynthesis of small proteoglycan II (decorin) by chondrocytes and evidence for a procore protein.
Sawhney, R S; Hering, T M; Sandell, L J
1991-05-15
We have studied the biosynthesis of cartilage dermatan sulfate proteoglycan II (DS-PGII) (decorin) using in vitro translation of mRNA to determine the size of the primary gene product and by radiolabeling the protein in the presence of tunicamycin to inhibit the addition of Asn-linked oligosaccharides. Pulse-chase experiments were performed to examine post-translational processing and secretion. Inhibitors of oligosaccharide processing were used to determine whether DS-PGII molecules containing partially processed oligosaccharides could become proteoglycans and be secreted. Cell-free translation of sucrose gradient-fractionated RNA and subsequent immunoprecipitation of the core protein confirmed that the functional translated mRNA is in the size range of the two mRNA species observed by hybridization of chondrocyte RNA with a bone PGII cloned probe and that the translation product is a single protein with an apparent molecular mass of 42 kDa. Digestion of the intact proteoglycan (average molecular mass = 103 kDa) with chondroitinase ABC or AC results in an approximately 48-49-kDa product. Chondrocytes treated with tunicamycin to inhibit Asn-linked oligosaccharide addition synthesize and secrete a glycosaminoglycan (GAG)-substituted proteoglycan (average molecular mass = 86 kDa), yielding a 42-kDa core protein after chondroitinase ABC digestion, showing that Asn-linked oligosaccharides are not required for the addition of GAG chains or secretion. Following a short pulse (10 min) of [3H]leucine, three glycosylated forms of the DS-PGII core protein were observed, one of which is likely to be the precursor form of PGII predicted by the implied protein sequence of both bovine and human cDNA clones. Following the apparent cleavage of the propeptide, GAG-substituted intracellular core protein is detectable. Susceptibility to endoglycosidase H indicates that approximately one-third of the secreted core protein contains exclusively complex-type Asn-linked oligosaccharides and approximately two-thirds contain high mannose as well as complex-type oligosaccharides. Secreted DS-PGII appears to be fully substituted with three Asn-linked oligosaccharide chains. Inhibitors of oligosaccharide processing, however, permitted secretion of GAG-substituted DS-PGII that was fully (three chains) or incompletely (one or two chains) substituted with partially processed Asn-linked carbohydrate chains. By comparison of chondrocyte DS-PGII with fibroblast DS-PGII, we conclude that the addition and processing of Asn-linked carbohydrate chains are directed by the amino acid sequence of the core protein. The results reported here also suggest that the addition of xylose, the initial step in GAG chain synthesis, occurs early in biosynthesis and is determined by the primary amino acid sequence of the core protein.(ABSTRACT TRUNCATED AT 400 WORDS)
Jimenez-Lopez, Jose C.; Zafra, Adoración; Palanco, Lucía; Florido, José Fernando
2016-01-01
Olive seeds, which are a raw material of interest, have been reported to contain 11S seed storage proteins (SSPs). However, the presence of SSPs such as 7S vicilins has not been studied. In this study, following a search in the olive seed transcriptome, 58 sequences corresponding to 7S vicilins were retrieved. A partial sequence was amplified by PCR from olive seed cDNA and subjected to phylogenetic analysis with other sequences. Structural analysis showed that olive 7S vicilin contains 9 α-helixes and 22 β-sheets. Additionally, 3D structural analysis displayed good superimposition with vicilin models generated from Pistacia and Sesamum. In order to assess potential allergenicity, T and B epitopes present in these proteins were identified by bioinformatic approaches. Different motifs were observed among the species, as well as some species-specific motifs. Finally, expression analysis of vicilins was carried out in protein extracts obtained from seeds of different species, including the olive. Noticeable bands were observed for all species in the 15–75 kDa MW interval, which were compatible with vicilins. The reactivity of the extracts to sera from patients allergic to nuts was also analysed. The findings with regard to the potential use of olive seed as food are discussed. PMID:27034939
Angsuthanasombat, C; Chungjatupornchai, W; Kertbundit, S; Luxananil, P; Settasatian, C; Wilairat, P; Panyim, S
1987-07-01
Five recombinant E. coli clones exhibiting toxicity to Aedes aegypti larvae were obtained from a library of 800 clones containing XbaI DNA fragments of 110 kb plasmid from B. thuringiensis var. israelensis. All the five clones (pMU 14/258/303/388/679) had the same 3.8-kb insert and encoded a major protein of 130 kDa which was highly toxic to A. aegypti larvae. Three clones (pMU 258/303/388) transcribed the 130 kD a gene in the same direction as that of lac Z promoter of pUC12 vector whereas the transcription of the other two (pMU 14/679) was in the opposite direction. A 1.9-kb fragment of the 3.8 kb insert coded for a protein of 65 kDa. Partial DNA sequence of the 3.8 kb insert, corresponding to the 5'-terminal of the 130 kDa gene, revealed a continuous reading frame, a Shine-Dalgarno sequence and a tentative 5'-regulatory region. These results demonstrated that the 3.8 kb insert is a minimal DNA fragment containing a regulatory region plus the coding sequence of the 130 kDa protein that is highly toxic to mosquito larvae.
de Almeida Dias, Felipe; Souza dos Santos, Andre Luis; Santos Lery, Letícia Miranda; Alves e Silva, Thiago Luiz; Oliveira, Mauricio Martins; Bisch, Paulo Mascarello; Saraiva, Elvira Maria; Souto-Padrón, Thaïs Cristina; Lopes, Angela Hampshire
2012-01-01
Phytomonas species are plant parasites of the family Trypanosomatidae, which are transmitted by phytophagous insects. Some Phytomonas species cause major agricultural damages. The hemipteran Oncopeltus fasciatus is natural and experimental host for several species of trypanosomatids, including Phytomonas spp. The invasion of the insect vectors' salivary glands is one of the most important events for the life cycle of Phytomonas species. In the present study, we show the binding of Phytomonas serpens at the external face of O. fasciatus salivary glands by means of scanning electron microscopy and the in vitro interaction of living parasites with total proteins from the salivary glands in ligand blotting assays. This binding occurs primarily through an interaction with a 130 kDa salivary gland protein. The mass spectrometry of the trypsin-digest of this protein matched 23% of human laminin-5 β3 chain precursor sequence by 16 digested peptides. A protein sequence search through the transcriptome of O. fasciatus embryo showed a partial sequence with 51% similarity to human laminin β3 subunit. Anti-human laminin-5 β3 chain polyclonal antibodies recognized the 130 kDa protein by immunoblotting. The association of parasites with the salivary glands was strongly inhibited by human laminin-5, by the purified 130 kDa insect protein, and by polyclonal antibodies raised against the human laminin-5 β3 chain. This is the first report demonstrating that a laminin-like molecule from the salivary gland of O. fasciatus acts as a receptor for Phytomonas binding. The results presented in this investigation are important findings that will support further studies that aim at developing new approaches to prevent the transmission of Phytomonas species from insects to plants and vice-versa.
Dias, Felipe de Almeida; dos Santos, Andre Luis Souza; Lery, Letícia Miranda Santos; Alves e Silva, Thiago Luiz; Oliveira, Mauricio Martins; Bisch, Paulo Mascarello; Saraiva, Elvira Maria; Souto-Padrón, Thaïs Cristina; Lopes, Angela Hampshire
2012-01-01
Phytomonas species are plant parasites of the family Trypanosomatidae, which are transmitted by phytophagous insects. Some Phytomonas species cause major agricultural damages. The hemipteran Oncopeltus fasciatus is natural and experimental host for several species of trypanosomatids, including Phytomonas spp. The invasion of the insect vectors' salivary glands is one of the most important events for the life cycle of Phytomonas species. In the present study, we show the binding of Phytomonas serpens at the external face of O. fasciatus salivary glands by means of scanning electron microscopy and the in vitro interaction of living parasites with total proteins from the salivary glands in ligand blotting assays. This binding occurs primarily through an interaction with a 130 kDa salivary gland protein. The mass spectrometry of the trypsin-digest of this protein matched 23% of human laminin-5 β3 chain precursor sequence by 16 digested peptides. A protein sequence search through the transcriptome of O. fasciatus embryo showed a partial sequence with 51% similarity to human laminin β3 subunit. Anti-human laminin-5 β3 chain polyclonal antibodies recognized the 130 kDa protein by immunoblotting. The association of parasites with the salivary glands was strongly inhibited by human laminin-5, by the purified 130 kDa insect protein, and by polyclonal antibodies raised against the human laminin-5 β3 chain. This is the first report demonstrating that a laminin-like molecule from the salivary gland of O. fasciatus acts as a receptor for Phytomonas binding. The results presented in this investigation are important findings that will support further studies that aim at developing new approaches to prevent the transmission of Phytomonas species from insects to plants and vice-versa. PMID:23118944
Tsigelny, Igor; Sharikov, Yuriy; Ten Eyck, Lynn F
2002-05-01
HMMSPECTR is a tool for finding putative structural homologs for proteins with known primary sequences. HMMSPECTR contains four major components: a data warehouse with the hidden Markov models (HMM) and alignment libraries; a search program which compares the initial protein sequences with the libraries of HMMs; a secondary structure prediction and comparison program; and a dominant protein selection program that prepares the set of 10-15 "best" proteins from the chosen HMMs. The data warehouse contains four libraries of HMMs. The first two libraries were constructed using different HHM preparation options of the HAMMER program. The third library contains parts ("partial HMM") of initial alignments. The fourth library contains trained HMMs. We tested our program against all of the protein targets proposed in the CASP4 competition. The data warehouse included libraries of structural alignments and HMMs constructed on the basis of proteins publicly available in the Protein Data Bank before the CASP4 meeting. The newest fully automated versions of HMMSPECTR 1.02 and 1.02ss produced better results than the best result reported at CASP4 either by r.m.s.d. or by length (or both) in 64% (HMMSPECTR 1.02) and 79% (HMMSPECTR 1.02ss) of the cases. The improvement is most notable for the targets with complexity 4 (difficult fold recognition cases).
Donkey Orchid Symptomless Virus: A Viral ‘Platypus’ from Australian Terrestrial Orchids
Wylie, Stephen J.; Li, Hua; Jones, Michael G. K.
2013-01-01
Complete and partial genome sequences of two isolates of an unusual new plant virus, designated Donkey orchid symptomless virus (DOSV) were identified using a high-throughput sequencing approach. The virus was identified from asymptomatic plants of Australian terrestrial orchid Diuris longifolia (Common donkey orchid) growing in a remnant forest patch near Perth, western Australia. DOSV was identified from two D. longifolia plants of 264 tested, and from at least one plant of 129 Caladenia latifolia (pink fairy orchid) plants tested. Phylogenetic analysis of the genome revealed open reading frames (ORF) encoding seven putative proteins of apparently disparate origins. A 69-kDa protein (ORF1) that overlapped the replicase shared low identity with MPs of plant tymoviruses (Tymoviridae). A 157-kDa replicase (ORF2) and 22-kDa coat protein (ORF4) shared 32% and 40% amino acid identity, respectively, with homologous proteins encoded by members of the plant virus family Alphaflexiviridae. A 44-kDa protein (ORF3) shared low identity with myosin and an autophagy protein from Squirrelpox virus. A 27-kDa protein (ORF5) shared no identity with described proteins. A 14-kDa protein (ORF6) shared limited sequence identity (26%) over a limited region of the envelope glycoprotein precursor of mammal-infecting Crimea-Congo hemorrhagic fever virus (Bunyaviridae). The putative 25-kDa movement protein (MP) (ORF7) shared limited (27%) identity with 3A-like MPs of members of the plant-infecting Tombusviridae and Virgaviridae. Transmissibility was shown when DOSV systemically infected Nicotiana benthamiana plants. Structure and organization of the domains within the putative replicase of DOSV suggests a common evolutionary origin with ‘potexvirus-like’ replicases of viruses within the Alphaflexiviridae and Tymoviridae, and the CP appears to be ancestral to CPs of allexiviruses (Alphaflexiviridae). The MP shares an evolutionary history with MPs of dianthoviruses, but the other putative proteins are distant from plant viruses. DOSV is not readily classified in current lower order virus taxa. PMID:24223974
Botero, Adriana; Kapeller, Irit; Cooper, Crystal; Clode, Peta L; Shlomai, Joseph; Thompson, R C Andrew
2018-05-17
Kinetoplast DNA (kDNA) is the mitochondrial genome of trypanosomatids. It consists of a few dozen maxicircles and several thousand minicircles, all catenated topologically to form a two-dimensional DNA network. Minicircles are heterogeneous in size and sequence among species. They present one or several conserved regions that contain three highly conserved sequence blocks. CSB-1 (10 bp sequence) and CSB-2 (8 bp sequence) present lower interspecies homology, while CSB-3 (12 bp sequence) or the Universal Minicircle Sequence is conserved within most trypanosomatids. The Universal Minicircle Sequence is located at the replication origin of the minicircles, and is the binding site for the UMS binding protein, a protein involved in trypanosomatid survival and virulence. Here, we describe the structure and organisation of the kDNA of Trypanosoma copemani, a parasite that has been shown to infect mammalian cells and has been associated with the drastic decline of the endangered Australian marsupial, the woylie (Bettongia penicillata). Deep genomic sequencing showed that T. copemani presents two classes of minicircles that share sequence identity and organisation in the conserved sequence blocks with those of Trypanosoma cruzi and Trypanosoma lewisi. A 19,257 bp partial region of the maxicircle of T. copemani that contained the entire coding region was obtained. Comparative analysis of the T. copemani entire maxicircle coding region with the coding regions of T. cruzi and T. lewisi showed they share 71.05% and 71.28% identity, respectively. The shared features in the maxicircle/minicircle organisation and sequence between T. copemani and T. cruzi/T. lewisi suggest similarities in their process of kDNA replication, and are of significance in understanding the evolution of Australian trypanosomes. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219
Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.
NASA Technical Reports Server (NTRS)
Reddy, A. S.; Reddy, V. S.; Golovkin, M.
2000-01-01
Calmodulin (CaM), a key calcium sensor in all eukaryotes, regulates diverse cellular processes by interacting with other proteins. To isolate CaM binding proteins involved in ethylene signal transduction, we screened an expression library prepared from ethylene-treated Arabidopsis seedlings with 35S-labeled CaM. A cDNA clone, EICBP (Ethylene-Induced CaM Binding Protein), encoding a protein that interacts with activated CaM was isolated in this screening. The CaM binding domain in EICBP was mapped to the C-terminus of the protein. These results indicate that calcium, through CaM, could regulate the activity of EICBP. The EICBP is expressed in different tissues and its expression in seedlings is induced by ethylene. The EICBP contains, in addition to a CaM binding domain, several features that are typical of transcription factors. These include a DNA-binding domain at the N terminus, an acidic region at the C terminus, and nuclear localization signals. In database searches a partial cDNA (CG-1) encoding a DNA-binding motif from parsley and an ethylene up-regulated partial cDNA from tomato (ER66) showed significant similarity to EICBP. In addition, five hypothetical proteins in the Arabidopsis genome also showed a very high sequence similarity with EICBP, indicating that there are several EICBP-related proteins in Arabidopsis. The structural features of EICBP are conserved in all EICBP-related proteins in Arabidopsis, suggesting that they may constitute a new family of DNA binding proteins and are likely to be involved in modulating gene expression in the presence of ethylene.
Revisiting and re-engineering the classical zinc finger peptide: consensus peptide-1 (CP-1).
Besold, Angelique N; Widger, Leland R; Namuswe, Frances; Michalek, Jamie L; Michel, Sarah L J; Goldberg, David P
2016-04-01
Zinc plays key structural and catalytic roles in biology. Structural zinc sites are often referred to as zinc finger (ZF) sites, and the classical ZF contains a Cys2His2 motif that is involved in coordinating Zn(II). An optimized Cys2His2 ZF, named consensus peptide 1 (CP-1), was identified more than 20 years ago using a limited set of sequenced proteins. We have reexamined the CP-1 sequence, using our current, much larger database of sequenced proteins that have been identified from high-throughput sequencing methods, and found the sequence to be largely unchanged. The CCHH ligand set of CP-1 was then altered to a CAHH motif to impart hydrolytic activity. This ligand set mimics the His2Cys ligand set of peptide deformylase (PDF), a hydrolytically active M(II)-centered (M = Zn or Fe) protein. The resultant peptide [CP-1(CAHH)] was evaluated for its ability to coordinate Zn(II) and Co(II) ions, adopt secondary structure, and promote hydrolysis. CP-1(CAHH) was found to coordinate Co(II) and Zn(II) and a pentacoordinate geometry for Co(II)-CP-1(CAHH) was implicated from UV-vis data. This suggests a His2Cys(H2O)2 environment at the metal center. The Zn(II)-bound CP-1(CAHH) was shown to adopt partial secondary structure by 1-D (1)H NMR spectroscopy. Both Zn(II)-CP-1(CAHH) and Co(II)-CP-1(CAHH) show good hydrolytic activity toward the test substrate 4-nitrophenyl acetate, exhibiting faster rates than most active synthetic Zn(II) complexes.
Bes, M T; Hernández, J A; Peleato, M L; Fillat, M F
2001-01-15
A gene coding for a Fur (ferric uptake regulation) protein from the cyanobacterium Anabaena PCC 7119 has been cloned and overexpressed in Escherichia coli. DNA sequence analysis confirmed the presence of a 151-amino-acid open reading frame that showed homology with the Fur proteins reported for the unicellular cyanobacteria Synechococcus 7942 and Synechocystis PCC 6803. Two putative Fur-binding sites were detected in the promoter regions of the fur gene from Anabaena. Partially purified recombinant Fur binds to the flavodoxin promoter as well as its own promoter. This suggests that the Fur gene is autoregulated in Anabaena.
Talbot, Darren A; Hanuise, Nicolas; Rey, Benjamin; Rouanet, Jean-Louis; Duchamp, Claude; Brand, Martin D
2003-12-26
We present the partial nucleotide sequence of the avian uncoupling protein (avUCP) gene from king penguin (Aptenodytes patagonicus), showing that the protein is 88-92% identical to chicken (Gallus gallus), turkey (Meleagris gallopavo), and hummingbird (Eupetomena macroura). We show that superoxide activates the proton conductance of mitochondria isolated from king penguin skeletal muscle. GDP abolishes the superoxide-activated proton conductance, indicating that it is mediated via avUCP. In the absence of superoxide there is no GDP-sensitive component of the proton conductance from penguin muscle mitochondria demonstrating that avUCP plays no role in the basal proton leak.
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1986-01-01
The archaebacteria occupy a unique place in phylogenetic trees constructed from analyses of sequences from key informational macromolecules, and their study continues to yield interesting ideas on the early evolution and divergence of biological forms. It is now known that the halobacteria among these species contain various retinal-proteins, resembling eukaryotic rhodopsins, but with different functions. Two of these pigments, located in the cytoplasmic membranes of the bacteria, are bacteriorhodopsin (a light-driven proton pump) and halorhodopsin (a light-driven chloride pump). Comparison of these systems is expected to reveal structure/function relationships in these simple (primitive?) energy transducing membrane components and evolutionary relationships which had produced the structural features which allow the divergent functions. Findings indicate that very different primary structures are needed for these proteins to accomplish their different functions. Indeed, analysis of partial amino acid sequences from halo-opsin shows already that few if any long segments exist which are homologous to bacterio-opsin. Either these proteins diverged a very long time ago to allow for the observed differences, or the evolutionary clock in the halobacteria runs faster than usual.
Calcium-Dependent Protein Kinase Genes in Corn Roots
NASA Technical Reports Server (NTRS)
Takezawa, D.; Patil, S.; Bhatia, A.; Poovaiah, B. W.
1996-01-01
Two cDNAs encoding Ca-2(+) - Dependent Protein Kinases (CDPKs), Corn Root Protein Kinase 1 and 2 (CRPK 1, CRPK 2) were isolated from the root tip library of corn (Zea mays L., cv. Merit) and their nucleotide sequences were determined. Deduced amino acid sequences of both the clones have features characteristic of plant CDPKS, including all 11 conserved serine/threonine kinase subdomains, a junction domain and a calmodulin-like domain with four Ca-2(+), -binding sites. Northern analysis revealed that CRPKI mRNA is preferentially expressed in roots, especially in the root tip; whereas, the expression of CRPK2 mRNA was very low in all the tissues tested. In situ hybridization experiments revealed that CRPKI mRNA is highly expressed in the root apex, as compared to other parts of the root. Partially purified CDPK from the root tip phosphorylates syntide-2, a common peptide substrate for plant CDPKs, and the phosphorylation was stimulated 7-fold by the addition of Ca-2(+). Our results show that two CDPK isoforms are expressed in corn roots and they may be involved in the Ca-2(+)-dependent signal transduction process.
Yama, Tomonari; Ochi, Arisa; Suto, Takuro; Hirasaka, Katsuya; Teshima-Kondo, Shigetada; Okumura, Yuushi; Oarada, Motoko; Choi, Inho; Mukai, Rie; Terao, Junji
2013-01-01
Background. Unloading stress induces skeletal muscle atrophy. We have reported that Cbl-b ubiquitin ligase is a master regulator of unloading-associated muscle atrophy. The present study was designed to elucidate whether dietary soy glycinin protein prevents denervation-mediated muscle atrophy, based on the presence of inhibitory peptides against Cbl-b ubiquitin ligase in soy glycinin protein. Methods. Mice were fed either 20% casein diet, 20% soy protein isolate diet, 10% glycinin diet containing 10% casein, or 20% glycinin diet. One week later, the right sciatic nerve was cut. The wet weight, cross sectional area (CSA), IGF-1 signaling, and atrogene expression in hindlimb muscles were examined at 1, 3, 3.5, or 4 days after denervation. Results. 20% soy glycinin diet significantly prevented denervation-induced decreases in muscle wet weight and myofiber CSA. Furthermore, dietary soy protein inhibited denervation-induced ubiquitination and degradation of IRS-1 in tibialis anterior muscle. Dietary soy glycinin partially suppressed the denervation-mediated expression of atrogenes, such as MAFbx/atrogin-1 and MuRF-1, through the protection of IGF-1 signaling estimated by phosphorylation of Akt-1. Conclusions. Soy glycinin contains a functional inhibitory sequence against muscle-atrophy-associated ubiquitin ligase Cbl-b. Dietary soy glycinin protein significantly prevented muscle atrophy after denervation in mice. PMID:23762056
Time Course of Pathogenic and Adaptation Mechanisms in Cystinotic Mouse Kidneys
Gaide Chevronnay, Héloïse P.; Janssens, Virginie; Van Der Smissen, Patrick; N’Kuli, Francisca; Nevo, Nathalie; Guiot, Yves; Levtchenko, Elena; Marbaix, Etienne; Pierreux, Christophe E.; Cherqui, Stéphanie; Antignac, Corinne; Courtoy, Pierre J.
2014-01-01
Cystinosis, a main cause of Fanconi syndrome, is reproduced in congenic C57BL/6 cystinosin knockout (KO) mice. To identify the sequence of pathogenic and adaptation mechanisms of nephropathic cystinosis, we defined the onset of Fanconi syndrome in KO mice between 3 and 6 months of age and analyzed the correlation with structural and functional changes in proximal tubular cells (PTCs), with focus on endocytosis of ultrafiltrated disulfide-rich proteins as a key source of cystine. Despite considerable variation between mice at the same age, typical event sequences were delineated. At the cellular level, amorphous lysosomal inclusions preceded cystine crystals and eventual atrophy without crystals. At the nephron level, lesions started at the glomerulotubular junction and then extended distally. In situ hybridization and immunofluorescence revealed progressive loss of expression of megalin, cubilin, sodium-glucose cotransporter 2, and type IIa sodium-dependent phosphate cotransporter, suggesting apical dedifferentiation accounting for Fanconi syndrome before atrophy. Injection of labeled proteins revealed that defective endocytosis in S1 PTCs led to partial compensatory uptake by S3 PTCs, suggesting displacement of endocytic load and injury by disulfide-rich cargo. Increased PTC apoptosis allowed luminal shedding of cystine crystals and was partially compensated for by tubular proliferation. We conclude that lysosomal storage triggered by soluble cystine accumulation induces apical PTC dedifferentiation, which causes transfer of the harmful load of disulfide-rich proteins to more distal cells, possibly explaining longitudinal progression of swan-neck lesions. Furthermore, our results suggest that subsequent adaptation mechanisms include lysosomal clearance of free and crystalline cystine into urine and ongoing tissue repair. PMID:24525030
Cristancho, Marco A.; Botero-Rozo, David Octavio; Giraldo, William; Tabima, Javier; Riaño-Pachón, Diego Mauricio; Escobar, Carolina; Rozo, Yomara; Rivera, Luis F.; Durán, Andrés; Restrepo, Silvia; Eilam, Tamar; Anikster, Yehoshua; Gaitán, Alvaro L.
2014-01-01
Coffee leaf rust caused by the fungus Hemileia vastatrix is the most damaging disease to coffee worldwide. The pathogen has recently appeared in multiple outbreaks in coffee producing countries resulting in significant yield losses and increases in costs related to its control. New races/isolates are constantly emerging as evidenced by the presence of the fungus in plants that were previously resistant. Genomic studies are opening new avenues for the study of the evolution of pathogens, the detailed description of plant-pathogen interactions and the development of molecular techniques for the identification of individual isolates. For this purpose we sequenced 8 different H. vastatrix isolates using NGS technologies and gathered partial genome assemblies due to the large repetitive content in the coffee rust hybrid genome; 74.4% of the assembled contigs harbor repetitive sequences. A hybrid assembly of 333 Mb was built based on the 8 isolates; this assembly was used for subsequent analyses. Analysis of the conserved gene space showed that the hybrid H. vastatrix genome, though highly fragmented, had a satisfactory level of completion with 91.94% of core protein-coding orthologous genes present. RNA-Seq from urediniospores was used to guide the de novo annotation of the H. vastatrix gene complement. In total, 14,445 genes organized in 3921 families were uncovered; a considerable proportion of the predicted proteins (73.8%) were homologous to other Pucciniales species genomes. Several gene families related to the fungal lifestyle were identified, particularly 483 predicted secreted proteins that represent candidate effector genes and will provide interesting hints to decipher virulence in the coffee rust fungus. The genome sequence of Hva will serve as a template to understand the molecular mechanisms used by this fungus to attack the coffee plant, to study the diversity of this species and for the development of molecular markers to distinguish races/isolates. PMID:25400655
Production of recombinant proteins from Plasmodium falciparum in Escherichia coli.
Guerra, Ángela Patricia; Calvo, Eliana Patricia; Wasserman, Moisés; Chaparro-Olaya, Jacqueline
2016-02-23
The production of recombinant proteins is essential for the characterization and functional study of proteins from Plasmodium falciparum. However, the proteins of P. falciparum are among the most challenging to express, and when expression is achieved, the recombinant proteins usually fold incorrectly and lead to the formation of inclusion bodies. To obtain and purify four recombinant proteins and to use them as antigens to produce polyclonal antibodies. The production efficiency and solubility were evaluated as the proteins were expressed in two genetically modified strains of Escherichia coli to favor the production of heterologous proteins (BL21-CodonPlus (DE3)-RIL and BL21-pG-KJE8). The four recombinant P. falciparum proteins corresponding to partial sequences of PfMyoA (Myosin A) and PfGAP50 (gliding associated protein 50), and the complete sequences of PfMTIP (myosin tail interacting protein) and PfGAP45 (gliding associated protein 45), were produced as glutathione S-transferase-fusion proteins, purified and used for immunizing mice. The protein expression was much more efficient in BL21-CodonPlus, the strain that contains tRNAs that are rare in wild-type E. coli, compared to the expression in BL21-pG-KJE8. In spite of the fact that BL21-pG-KJE8 overexpresses chaperones, this strain did not minimize the formation of inclusion bodies. The use of genetically modified strains of E. coli was essential to achieve high expression levels of the four evaluated P. falciparum proteins and lead to improved solubility of two of them. The approach used here allowed us to obtain and purify four P. falciparum proteins in enough quantity to produce polyclonal antibodies in mice, and a fair amount of two pure and soluble recombinant proteins for future assays.
Koumenis, C; Nunez-Regueiro, M; Raju, U; Cook, R; Eskin, A
1995-06-16
Previous results using translation inhibitors in the ocular circadian system of Aplysia suggest that protein synthesis may be involved in the light and serotonin (5-HT) entrainment pathways or perhaps in the circadian oscillator. Proteins have been previously identified whose synthesis was altered by treatments of light capable of perturbing the phase of the circadian rhythm in the eye of Aplysia. We extended these studies by investigating the effects of other treatments that perturb the ocular circadian rhythm on protein synthesis. 5-HT altered the synthesis of nine proteins. Interestingly, five of the proteins affected by treatments with 5-HT were previously shown to be affected by treatments with light. Four of the proteins affected by treatments with 5-HT were also affected by treatments with analogs of cAMP, a treatment which mimics the effects of 5-HT on the ocular circadian rhythm. To identify the cellular function of some of these proteins, we obtained their partial amino acid sequences. Based on these sequences and additional characterizations, a 78-kDa, pI 5.6 Aplysia protein appears to be glucose-regulated protein 78/binding protein, and a 36-kDa, pI 5.7 Aplysia protein appears to be porin/voltage-dependent anion channel. Heat shock experiments on Aplysia eyes revealed that yet another one of the Aplysia proteins (70 kDa) affected by 5-HT appears to be a heat-inducible member (heat shock protein 70) of the family of heat shock proteins. These findings suggest that these three identified proteins, together or individually, may be involved in some way in the regulation of the timing of the circadian oscillator in the eye of Aplysia.
NASA Astrophysics Data System (ADS)
Rusyaman, E.; Parmikanti, K.; Chaerani, D.; Asefan; Irianingsih, I.
2018-03-01
One of the application of fractional ordinary differential equation is related to the viscoelasticity, i.e., a correlation between the viscosity of fluids and the elasticity of solids. If the solution function develops into function with two or more variables, then its differential equation must be changed into fractional partial differential equation. As the preliminary study for two variables viscoelasticity problem, this paper discusses about convergence analysis of function sequence which is the solution of the homogenous fractional partial differential equation. The method used to solve the problem is Homotopy Analysis Method. The results show that if given two real number sequences (αn) and (βn) which converge to α and β respectively, then the solution function sequences of fractional partial differential equation with order (αn, βn) will also converge to the solution function of fractional partial differential equation with order (α, β).
USDA-ARS?s Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Lätzer, Joachim; Shen, Tongye; Wolynes, Peter G
2008-02-19
We investigate how post-translational phosphorylation modifies the global conformation of a protein by changing its free energy landscape using two test proteins, cystatin and NtrC. We first examine the changes in a free energy landscape caused by phosphorylation using a model containing information about both structural forms. For cystatin the free energy cost is fairly large indicating a low probability of sampling the phosphorylated conformation in a perfectly funneled landscape. The predicted barrier for NtrC conformational transition is several times larger than the barrier for cystatin, indicating that the switch protein NtrC most probably follows a partial unfolding mechanism to move from one basin to the other. Principal component analysis and linear response theory show how the naturally occurring conformational changes in unmodified proteins are captured and stabilized by the change of interaction potential. We also develop a partially guided structure prediction Hamiltonian which is capable of predicting the global structure of a phosphorylated protein using only knowledge of the structure of the unphosphorylated protein or vice versa. This algorithm makes use of a generic transferable long-range residue contact potential along with details of structure short range in sequence. By comparing the results obtained with this guided transferable potential to those from the native-only, perfectly funneled Hamiltonians, we show that the transferable Hamiltonian correctly captures the nature of the global conformational changes induced by phosphorylation and can sample substantially correct structures for the modified protein with high probability.
The nucleoplasmin homolog NLP mediates centromere clustering and anchoring to the nucleolus.
Padeken, Jan; Mendiburo, María José; Chlamydas, Sarantis; Schwarz, Hans-Jürgen; Kremmer, Elisabeth; Heun, Patrick
2013-04-25
Centromere clustering during interphase is a phenomenon known to occur in many different organisms and cell types, yet neither the factors involved nor their physiological relevance is well understood. Using Drosophila tissue culture cells and flies, we identified a network of proteins, including the nucleoplasmin-like protein (NLP), the insulator protein CTCF, and the nucleolus protein Modulo, to be essential for the positioning of centromeres. Artificial targeting further demonstrated that NLP and CTCF are sufficient for clustering, while Modulo serves as the anchor to the nucleolus. Centromere clustering was found to depend on centric chromatin rather than specific DNA sequences. Moreover, unclustering of centromeres results in the spatial destabilization of pericentric heterochromatin organization, leading to partial defects in the silencing of repetitive elements, defects during chromosome segregation, and genome instability. Copyright © 2013 Elsevier Inc. All rights reserved.
Coulter-Mackie, M B; Lian, Q
2008-07-01
Alanine:glyoxylate aminotransferase (AGT) is a liver peroxisomal enzyme whose deficiency results in primary hyperoxaluria type 1 (PH1). More than 75 PH1 mutations are now documented in the AGT gene (AGXT), of which about 50% are missense. We have previously demonstrated that many such mutants expressed by transcription/translation are subject to generalized degradation by the proteasome and a specific limited trimming by an endogenous ATP-independent protease activity. Here, we report the results of partial digestion using trypsin as a mimic for the endogenous non-proteasomal protease and the use of N-terminal protein sequencing to determine the sensitive site. Partial trypsin digestion also provided an indicator of proper folding of the mutant enzyme. For selected mutations the sensitivity to trypsin could be ameliorated by addition of pyridoxal phosphate or aminooxy acetic acid as specific pharmacological chaperones.
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-02-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-01-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Macedo, Maria Lígia R; Diz Filho, Eduardo B S; Freire, Mariadas Graças M; Oliva, Maria Luiza V; Sumikawa, Joana T; Toyama, Marcos H; Marangoni, Sérgio
2011-01-01
The present paper describes the purification, characterization and determination of the partial primary structure of the first trypsin inhibitor isolated from the family Sapindaceae. A highly stable, potent trypsin inhibitor (SSTI) was purified to homogeneity. SDS-PAGE analysis revealed that the protein consists of a two-polypeptide chain with molecular masses of approximately 15 and 3 kDa. The purified inhibitor inhibited bovine trypsin at a 1:1 M ratio. Kinetic analysis revealed that the protein is a competitive inhibitor with an equilibrium dissociation constant of 10⁻⁹ M for trypsin. The partial NH₂- terminal sequence of 36 amino acids in SSTI indicates homology with other members of the trypsin-inhibitor family from different sources. This inhibitor is highly stable in the presence of denaturing agents. SSTI showed significant inhibitory activity against trypsin-like proteases present in the larval midgut on Anagasta kuehniella, Corcyra cephalonica, Diatreae saccharalis and Anticarsia gemmatalis.
Gao, Yan; Su, Qiudong; Yi, Yao; Jia, Zhiyuan; Wang, Hao; Lu, Xuexin; Qiu, Feng; Bi, Shengli
2015-01-01
Hepatitis A virus (HAV) and Hepatitis E virus (HEV) are the most common causes of infectious hepatitis. These viruses are spread largely by the fecal-oral route and lead to clinically important disease in developing countries. To evaluate the potential of targeting hepatitis A and E infection simultaneously, a combined mucosal candidate vaccine was developed with the partial open reading frame 2 (ORF2) sequence (aa 368-607) of HEV (HE-ORF2) and partial virus protein 1 (VP1) sequence (aa 1-198) of HAV (HA-VP1), which included the viral neutralization epitopes. Tuftsin is an immunostimulatory peptide which can enhance the immunogenicity of a protein by targeting it to macrophages and dendritic cells. Here, we developed a novel combined protein vaccine by conjugating tuftsin to HE-ORF2 and HA-VP1 and used synthetic CpG oligodeoxynucleotides (ODNs) as the adjuvant. Subsequent experiments in BALB/c mice demonstrated that tuftsin enhanced the serum-specific IgG and IgA antibodies against HEV and HAV at the intestinal, vaginal and pulmonary interface when delivered intranasally. Moreover, mice from the intranasally immunized tuftsin group (HE-ORF2-tuftsin + HA-VP1-tuftsin + CpG) showed higher levels of IFN-γ-secreting splenocytes (Th1 response) and ratio of CD4+/CD8+ T cells than those of the no-tuftsin group (HE-ORF2 + HA-VP1 + CpG). Thus, the tuftsin group generated stronger humoral and cellular immune responses compared with the no-tuftsin group. Moreover, enhanced responses to the combined protein vaccine were obtained by intranasal immunization compared with intramuscular injection. By integrating HE-ORF2, HA-VP1 and tuftsin in a vaccine, this study validated an important concept for further development of a combined mucosal vaccine against hepatitis A and E infection.
Proteomic Profiling of Cereal Aphid Saliva Reveals Both Ubiquitous and Adaptive Secreted Proteins
Wilkinson, Tom L.
2013-01-01
The secreted salivary proteins from two cereal aphid species, Sitobion avenae and Metopolophium dirhodum, were collected from artificial diets and analysed by tandem mass spectrometry. Protein identification was performed by searching MS data against the official protein set from the current pea aphid (Acyrthosiphon pisum) genome assembly and revealed 12 and 7 proteins in the saliva of S. avenae and M. dirhodum, respectively. When combined with a comparable dataset from A. pisum, only three individual proteins were common to all the aphid species; two paralogues of the GMC oxidoreductase family (glucose dehydrogenase; GLD) and ACYPI009881, an aphid specific protein previously identified as a putative component of the salivary sheath. Antibodies were designed from translated protein sequences obtained from partial cDNA sequences for ACYPI009881 and both saliva associated GLDs. The antibodies detected all parent proteins in secreted saliva from the three aphid species, but could only detect ACYPI009881, and not saliva associated GLDs, in protein extractions from the salivary glands. This result was confirmed by immunohistochemistry using whole and sectioned salivary glands, and in addition, localised ACYPI009881 to specific cell types within the principal salivary gland. The implications of these findings for the origin of salivary components and the putative role of the proteins identified are discussed in the context of our limited understanding of the functional relationship between aphid saliva and the plants they feed on. The mass spectrometry data have been deposited to the ProteomeXchange and can be accessed under the identifier PXD000113. PMID:23460852
Proteomic profiling of cereal aphid saliva reveals both ubiquitous and adaptive secreted proteins.
Rao, Sohail A K; Carolan, James C; Wilkinson, Tom L
2013-01-01
The secreted salivary proteins from two cereal aphid species, Sitobion avenae and Metopolophium dirhodum, were collected from artificial diets and analysed by tandem mass spectrometry. Protein identification was performed by searching MS data against the official protein set from the current pea aphid (Acyrthosiphon pisum) genome assembly and revealed 12 and 7 proteins in the saliva of S. avenae and M. dirhodum, respectively. When combined with a comparable dataset from A. pisum, only three individual proteins were common to all the aphid species; two paralogues of the GMC oxidoreductase family (glucose dehydrogenase; GLD) and ACYPI009881, an aphid specific protein previously identified as a putative component of the salivary sheath. Antibodies were designed from translated protein sequences obtained from partial cDNA sequences for ACYPI009881 and both saliva associated GLDs. The antibodies detected all parent proteins in secreted saliva from the three aphid species, but could only detect ACYPI009881, and not saliva associated GLDs, in protein extractions from the salivary glands. This result was confirmed by immunohistochemistry using whole and sectioned salivary glands, and in addition, localised ACYPI009881 to specific cell types within the principal salivary gland. The implications of these findings for the origin of salivary components and the putative role of the proteins identified are discussed in the context of our limited understanding of the functional relationship between aphid saliva and the plants they feed on. The mass spectrometry data have been deposited to the ProteomeXchange and can be accessed under the identifier PXD000113.
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin
Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K.; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga
2015-01-01
Insulators are multiprotein–DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. PMID:25342723
Genome-Wide Profiling of RNA–Protein Interactions Using CLIP-Seq
Stork, Cheryl; Zheng, Sika
2017-01-01
UV crosslinking immunoprecipitation (CLIP) is an increasingly popular technique to study protein–RNA interactions in tissues and cells. Whole cells or tissues are ultraviolet irradiated to generate a covalent bond between RNA and proteins that are in close contact. After partial RNase digestion, antibodies specific to an RNA binding protein (RBP) or a protein–epitope tag is then used to immunoprecipitate the protein–RNA complexes. After stringent washing and gel separation the RBP–RNA complex is excised. The RBP is protease digested to allow purification of the bound RNA. Reverse transcription of the RNA followed by high-throughput sequencing of the cDNA library is now often used to identify protein bound RNA on a genome-wide scale. UV irradiation can result in cDNA truncations and/or mutations at the crosslink sites, which complicates the alignment of the sequencing library to the reference genome and the identification of the crosslinking sites. Meanwhile, one or more amino acids of a crosslinked RBP can remain attached to its bound RNA due to incomplete digestion of the protein. As a result, reverse transcriptase may not read through the crosslink sites, and produce cDNA ending at the crosslinked nucleotide. This is harnessed by one variant of CLIP methods to identify crosslinking sites at a nucleotide resolution. This method, individual nucleotide resolution CLIP (iCLIP) circularizes cDNA to capture the truncated cDNA and also increases the efficiency of ligating sequencing adapters to the library. Here, we describe the detailed procedure of iCLIP. PMID:26965263
Baxa, Michael C.; Yu, Wookyung; Adhikari, Aashish N.; Ge, Liang; Xia, Zhen; Zhou, Ruhong; Freed, Karl F.; Sosnick, Tobin R.
2015-01-01
Experimental and computational folding studies of Proteins L & G and NuG2 typically find that sequence differences determine which of the two hairpins is formed in the transition state ensemble (TSE). However, our recent work on Protein L finds that its TSE contains both hairpins, compelling a reassessment of the influence of sequence on the folding behavior of the other two homologs. We characterize the TSEs for Protein G and NuG2b, a triple mutant of NuG2, using ψ analysis, a method for identifying contacts in the TSE. All three homologs are found to share a common and near-native TSE topology with interactions between all four strands. However, the helical content varies in the TSE, being largely absent in Proteins G & L but partially present in NuG2b. The variability likely arises from competing propensities for the formation of nonnative β turns in the naturally occurring proteins, as observed in our TerItFix folding algorithm. All-atom folding simulations of NuG2b recapitulate the observed TSEs with four strands for 5 of 27 transition paths [Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) Science 334(6055):517–520]. Our data support the view that homologous proteins have similar folding mechanisms, even when nonnative interactions are present in the transition state. These findings emphasize the ongoing challenge of accurately characterizing and predicting TSEs, even for relatively simple proteins. PMID:26100906
Stewart, J M; Blakely, J A; Karpowicz, P A; Kalanxhi, E; Thatcher, B J; Martin, B M
2004-03-01
We purified myoglobin from beluga whale (Delphinapterus leucas) muscle (longissimus dorsi) with size exclusion and cation exchange chromatographies. The molecular mass was determined by mass spectrometry (17,081 Da) and the isoelectric pH (9.4) by capillary isoelectric focusing. The near-complete amino acid sequence was determined and a phylogeny indicated that beluga was in the same clad as Dall's and harbor porpoises. There were consensus motifs for a phosphorylation site on the protein surface with the most likely site at serine-117. This motif was common to all cetacean myoglobins examined. Two oxygen-binding studies at 37 degrees C indicated dissociation constants (20.5 and 23.6 microM) 5.7-6.6 times larger than horse myoglobin (3.6 microM). The autoxidation rate of beluga myoglobin at 37 degrees C, pH 7.2 was 0.218+/-0.028 h(-1), 1/3 larger than reported for myoglobin of terrestrial mammals. There was no clear sequence change to explain the difference in oxygen binding or autoxidation although substitutions (N66 and T67) in an invariant rich sequence (HGNTV) distal to the heme may play a role. Structural models based on the protein sequence and constructed on topologies of known templates (horse and sperm whale crystal structures) were not adequate to assess perturbation of the heme pocket.
Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.
Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K
2018-03-01
Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 ( fla9 ) and IFT121 ( ift121-2 ), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81 , and a frameshift mutant of FRA10 , which suppresses the 5' splice site mutation in IFT121 Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1 , which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.
Tripathi, Prabhanshu; Nair, Smitha; Singh, B P; Arora, Naveen
2011-03-01
Serine protease from numerous sources have been identified and characterized as major allergens. The present study aimed to clone, express and characterize a serine protease from Curvularia lunata. cDNA library screening identified partial protease clones. A clone showed significant homology to subtilisin like serine proteases from Aspergillus and Penicillium species. Full length sequence was generated by RACE PCR, subcloned in pET vector, protein expressed in Escherichia coli and purified from inclusion bodies yielding 0.5 mg/L of culture. Bioinformatic analysis identified serine protease motifs of subtilase family, catalytic triad and N-glycosylation sites on the primary sequence. The protein resolved at 54-kDa on SDS-PAGE and was recognized as a major allergen on immunoblot with 13/16 C. lunata sensitive patients' sera in ELISA and immunoblot. Recombinant protein reacted with rabbit polyclonal antibodies against alkaline serine proteases from C. lunata. Recombinant protein required 50-56 ng of same protein for 50% inhibition of IgE binding in competitive ELISA. In addition, 13 of 16 patients' samples showed significant basophil histamine release upon stimulation with purified recombinant protein. In conclusion, a 54 kDa major allergen of C. lunata was cloned, expressed, characterized and showed biological activity. It has potential to be used in molecule based approach for allergy diagnosis and therapy. Copyright © 2010 Elsevier GmbH. All rights reserved.
Rocha, Antônio J; Sousa, Bruno L; Girão, Matheus S; Barroso-Neto, Ito L; Monteiro-Júnior, José E; Oliveira, José T A; Nagano, Celso S; Carneiro, Rômulo F; Monteiro-Moreira, Ana C O; Rocha, Bruno A M; Freire, Valder N; Grangeiro, Thalles B
2018-05-27
Vicilins are 7S globulins which constitute the major seed storage proteins in leguminous species. Variant vicilins showing differential binding affinities for chitin have been implicated in the resistance and susceptibility of cowpea to the bruchid Callosobruchus maculatus. These proteins are members of the cupin superfamily, which includes a wide variety of enzymes and non-catalytic seed storage proteins. The cupin fold does not share similarity with any known chitin-biding domain. Therefore, it is poorly understood how these storage proteins bind to chitin. In this work, partial cDNA sequences encoding β-vignin, the major component of cowpea vicilins, were obtained from developing seeds. Three-dimensional molecular models of β-vignin showed the characteristic cupin fold and computational simulations revealed that each vicilin trimer contained 3 chitin-binding sites. Interaction models showed that chito-oligosaccharides bound to β-vignin were stabilized mainly by hydrogen bonds, a common structural feature of typical carbohydrate-binding proteins. Furthermore, many of the residues involved in the chitin-binding sites of β-vignin are conserved in other 7S globulins. These results support previous experimental evidences on the ability of vicilin-like proteins from cowpea and other leguminous species to bind in vitro to chitin as well as in vivo to chitinous structures of larval C. maculatus midgut. Copyright © 2018. Published by Elsevier B.V.
Phylogenetic placement of the enigmatic parasite, Polypodium hydriforme, within the Phylum Cnidaria
2008-01-01
Background Polypodium hydriforme is a parasite with an unusual life cycle and peculiar morphology, both of which have made its systematic position uncertain. Polypodium has traditionally been considered a cnidarian because it possesses nematocysts, the stinging structures characteristic of this phylum. However, recent molecular phylogenetic studies using 18S rDNA sequence data have challenged this interpretation, and have shown that Polypodium is a close relative to myxozoans and together they share a closer affinity to bilaterians than cnidarians. Due to the variable rates of 18S rDNA sequences, these results have been suggested to be an artifact of long-branch attraction (LBA). A recent study, using multiple protein coding markers, shows that the myxozoan Buddenbrockia, is nested within cnidarians. Polypodium was not included in this study. To further investigate the phylogenetic placement of Polypodium, we have performed phylogenetic analyses of metazoans with 18S and partial 28S rDNA sequences in a large dataset that includes Polypodium and a comprehensive sampling of cnidarian taxa. Results Analyses of a combined dataset of 18S and partial 28S sequences, and partial 28S alone, support the placement of Polypodium within Cnidaria. Removal of the long-branched myxozoans from the 18S dataset also results in Polypodium being nested within Cnidaria. These results suggest that previous reports showing that Polypodium and Myxozoa form a sister group to Bilateria were an artifact of long-branch attraction. Conclusion By including 28S rDNA sequences and a comprehensive sampling of cnidarian taxa, we demonstrate that previously conflicting hypotheses concerning the phylogenetic placement of Polypodium can be reconciled. Specifically, the data presented provide evidence that Polypodium is indeed a cnidarian and is either the sister taxon to Hydrozoa, or part of the hydrozoan clade, Leptothecata. The former hypothesis is consistent with the traditional view that Polypodium should be placed in its own cnidarian class, Polypodiozoa. PMID:18471296
Phylogenetic placement of the enigmatic parasite, Polypodium hydriforme, within the Phylum Cnidaria.
Evans, Nathaniel M; Lindner, Alberto; Raikova, Ekaterina V; Collins, Allen G; Cartwright, Paulyn
2008-05-09
Polypodium hydriforme is a parasite with an unusual life cycle and peculiar morphology, both of which have made its systematic position uncertain. Polypodium has traditionally been considered a cnidarian because it possesses nematocysts, the stinging structures characteristic of this phylum. However, recent molecular phylogenetic studies using 18S rDNA sequence data have challenged this interpretation, and have shown that Polypodium is a close relative to myxozoans and together they share a closer affinity to bilaterians than cnidarians. Due to the variable rates of 18S rDNA sequences, these results have been suggested to be an artifact of long-branch attraction (LBA). A recent study, using multiple protein coding markers, shows that the myxozoan Buddenbrockia, is nested within cnidarians. Polypodium was not included in this study. To further investigate the phylogenetic placement of Polypodium, we have performed phylogenetic analyses of metazoans with 18S and partial 28S rDNA sequences in a large dataset that includes Polypodium and a comprehensive sampling of cnidarian taxa. Analyses of a combined dataset of 18S and partial 28S sequences, and partial 28S alone, support the placement of Polypodium within Cnidaria. Removal of the long-branched myxozoans from the 18S dataset also results in Polypodium being nested within Cnidaria. These results suggest that previous reports showing that Polypodium and Myxozoa form a sister group to Bilateria were an artifact of long-branch attraction. By including 28S rDNA sequences and a comprehensive sampling of cnidarian taxa, we demonstrate that previously conflicting hypotheses concerning the phylogenetic placement of Polypodium can be reconciled. Specifically, the data presented provide evidence that Polypodium is indeed a cnidarian and is either the sister taxon to Hydrozoa, or part of the hydrozoan clade, Leptothecata. The former hypothesis is consistent with the traditional view that Polypodium should be placed in its own cnidarian class, Polypodiozoa.
MHC class I loci of the Bar-Headed goose (Anser indicus)
2010-01-01
MHC class I proteins mediate functions in anti-pathogen defense. MHC diversity has already been investigated by many studies in model avian species, but here we chose the bar-headed goose, a worldwide migrant bird, as a non-model avian species. Sequences from exons encoding the peptide-binding region (PBR) of MHC class I molecules were isolated from liver genomic DNA, to investigate variation in these genes. These are the first MHC class I partial sequences of the bar-headed goose to be reported. A preliminary analysis suggests the presence of at least four MHC class I genes, which share great similarity with those of the goose and duck. A phylogenetic analysis of bar-headed goose, goose and duck MHC class I sequences using the NJ method supports the idea that they all cluster within the anseriforms clade. PMID:21637434
Somaraju Chalasani, Madhavi Latha; Muppirala, Madhavi; G Ponnam, Surya Prakash; Kannabiran, Chitra; Swarup, Ghanshyam
2013-01-01
Mutations in the eye lens gap junction protein connexin 50 cause cataract. Earlier we identified a frameshift mutant of connexin 50 (c.670insA; p.Thr203AsnfsX47) in a family with autosomal recessive cataract. The mutant protein is smaller and contains 46 aberrant amino acids at the C-terminus after amino acid 202. Here, we have analysed this frameshift mutant and observed that it localized to the endoplasmic reticulum (ER) but not in the plasma membrane. Moreover, overexpression of the mutant resulted in disintegration of the ER-Golgi intermediate compartment (ERGIC), reduction in the level of ERGIC-53 protein and breakdown of the Golgi in many cells. Overexpression of the frameshift mutant partially inhibited the transport of wild type connexin 50 to the plasma membrane. A deletion mutant lacking the aberrant sequence showed predominant localization in the ER and inhibited anterograde protein transport suggesting, therefore, that the aberrant sequence is not responsible for improper localization of the frameshift mutant. Further deletion analysis showed that the fourth transmembrane domain and a membrane proximal region (231-294 amino acids) of the cytoplasmic domain are needed for transport from the ER and localization to the plasma membrane. Our results show that a frameshift mutant of connexin 50 mislocalizes to the ER and causes disintegration of the ERGIC and Golgi. We have also identified a sequence of connexin 50 crucial for transport from the ER and localization to the plasma membrane.
Characterization of a cis-Golgi matrix protein, GM130
1995-01-01
Antisera raised to a detergent- and salt-resistant matrix fraction from rat liver Golgi stacks were used to screen an expression library from rat liver cDNA. A full-length clone was obtained encoding a protein of 130 kD (termed GM130), the COOH-terminal domain of which was highly homologous to a Golgi human auto-antigen, golgin-95 (Fritzler et al., 1993). Biochemical data showed that GM130 is a peripheral cytoplasmic protein that is tightly bound to Golgi membranes and part of a larger oligomeric complex. Predictions from the protein sequence suggest that GM130 is an extended rod-like protein with coiled-coil domains. Immunofluorescence microscopy showed partial overlap with medial- and trans-Golgi markers but almost complete overlap with the cis-Golgi network (CGN) marker, syntaxin5. Immunoelectron microscopy confirmed this location showing that most of the GM130 was located in the CGN and in one or two cisternae on the cis-side of the Golgi stack. GM130 was not re-distributed to the ER in the presence of brefeldin A but maintained its overlap with syntaxin5 and a partial overlap with the ER- Golgi intermediate compartment marker, p53. Together these results suggest that GM130 is part of a cis-Golgi matrix and has a role in maintaining cis-Golgi structure. PMID:8557739
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zelinka, L.; McCann, S.; Budde, J.
2011-08-05
Highlights: {yields} Affinity purification of the autoimmune rippling muscle disease immunogenic domain of titin. {yields} Partial sequence analysis confirms that the peptides is in the I band region of titin. {yields} This region of the human titin shows high degree of homology to mouse titin N2-A. -- Abstract: Autoimmune rippling muscle disease (ARMD) is an autoimmune neuromuscular disease associated with myasthenia gravis (MG). Past studies in our laboratory recognized a very high molecular weight skeletal muscle protein antigen identified by ARMD patient antisera as the titin isoform. These past studies used antisera from ARMD and MG patients as probes tomore » screen a human skeletal muscle cDNA library and several pBluescript clones revealed supporting expression of immunoreactive peptides. This study characterizes the products of subcloning the titin immunoreactive domain into pGEX-3X and the subsequent fusion protein. Sequence analysis of the fusion gene indicates the cloned titin domain (GenBank ID: (EU428784)) is in frame and is derived from a sequence of N2-A spanning the exons 248-250 an area that encodes the fibronectin III domain. PCR and EcoR1 restriction mapping studies have demonstrated that the inserted cDNA is of a size that is predicted by bioinformatics analysis of the subclone. Expression of the fusion protein result in the isolation of a polypeptide of 52 kDa consistent with the predicted inferred amino acid sequence. Immunoblot experiments of the fusion protein, using rippling muscle/myasthenia gravis antisera, demonstrate that only the titin domain is immunoreactive.« less
Structural elucidation and molecular characterization of Marinobacter sp. α-amylase.
Kumar, Sumit; Khan, Rizwan Hasan; Khare, S K
2016-01-01
Halophiles have been perceived as potential source of novel enzymes in recent years. The interest emanates from their ability to catalyze efficiently under high salt and organic solvents. Marinobacter sp. EMB8 α-amylase was found to be active and stable in salt and organic solvents. A study was carried out using circular dichroism (CD), fluorescence spectroscopy, and bioinformatics analysis of similar protein sequence to ascertain molecular basis of salt and solvent adaptability of α-amylase. Structural changes recorded in the presence of varying amounts of NaCl exhibited an increase in negative ellipticity as a function of salt, confirming that salt stabilizes the protein and increases the secondary structure, making it catalytically functional. The data of intrinsic and extrinsic fluorescence (using 1-anilinonaphthalene 8-sulfonate [ANS] as probe) further confirmed the role of salt. The α-amylase was active in the presence of nonpolar solvents, namely, hexane and decane, but inactivated by ethanol. The decrease in the activity was correlated with the loss of tertiary structure in the presence of ethanol. Guanidine hydrochloride and pH denaturation indicated the molten globule state at pH 4.0. Partial N-terminal amino acid sequence of the purified α-amylase revealed the relatedness to Pseudoalteromonas sp. α-amylase. "FVHLFEW" was found as the N-terminal signature sequence. Bioinformatics analysis was done using M. algicola α-amylase protein having the same N-terminal signature sequence. The three-dimensional structure of Marinobacter α-amylase was deduced using the I-TASSER server, which reflected the enrichment of acidic amino acids on the surface, imparting the stability in the presence of salt. Our study clearly indicate that salt is necessary for maintaining the secondary and tertiary structure of halophilic protein, which is a necessary prerequisite for catalysis.
Epigenetic Telomere Protection by Drosophila DNA Damage Response Pathways
Oikemus, Sarah R; Queiroz-Machado, Joana; Lai, KuanJu; McGinnis, Nadine; Sunkel, Claudio; Brodsky, Michael H
2006-01-01
Analysis of terminal deletion chromosomes indicates that a sequence-independent mechanism regulates protection of Drosophila telomeres. Mutations in Drosophila DNA damage response genes such as atm/tefu, mre11, or rad50 disrupt telomere protection and localization of the telomere-associated proteins HP1 and HOAP, suggesting that recognition of chromosome ends contributes to telomere protection. However, the partial telomere protection phenotype of these mutations limits the ability to test if they act in the epigenetic telomere protection mechanism. We examined the roles of the Drosophila atm and atr-atrip DNA damage response pathways and the nbs homolog in DNA damage responses and telomere protection. As in other organisms, the atm and atr-atrip pathways act in parallel to promote telomere protection. Cells lacking both pathways exhibit severe defects in telomere protection and fail to localize the protection protein HOAP to telomeres. Drosophila nbs is required for both atm- and atr-dependent DNA damage responses and acts in these pathways during DNA repair. The telomere fusion phenotype of nbs is consistent with defects in each of these activities. Cells defective in both the atm and atr pathways were used to examine if DNA damage response pathways regulate telomere protection without affecting telomere specific sequences. In these cells, chromosome fusion sites retain telomere-specific sequences, demonstrating that loss of these sequences is not responsible for loss of protection. Furthermore, terminally deleted chromosomes also fuse in these cells, directly implicating DNA damage response pathways in the epigenetic protection of telomeres. We propose that recognition of chromosome ends and recruitment of HP1 and HOAP by DNA damage response proteins is essential for the epigenetic protection of Drosophila telomeres. Given the conserved roles of DNA damage response proteins in telomere function, related mechanisms may act at the telomeres of other organisms. PMID:16710445
Epigenetic telomere protection by Drosophila DNA damage response pathways.
Oikemus, Sarah R; Queiroz-Machado, Joana; Lai, KuanJu; McGinnis, Nadine; Sunkel, Claudio; Brodsky, Michael H
2006-05-01
Analysis of terminal deletion chromosomes indicates that a sequence-independent mechanism regulates protection of Drosophila telomeres. Mutations in Drosophila DNA damage response genes such as atm/tefu, mre11, or rad50 disrupt telomere protection and localization of the telomere-associated proteins HP1 and HOAP, suggesting that recognition of chromosome ends contributes to telomere protection. However, the partial telomere protection phenotype of these mutations limits the ability to test if they act in the epigenetic telomere protection mechanism. We examined the roles of the Drosophila atm and atr-atrip DNA damage response pathways and the nbs homolog in DNA damage responses and telomere protection. As in other organisms, the atm and atr-atrip pathways act in parallel to promote telomere protection. Cells lacking both pathways exhibit severe defects in telomere protection and fail to localize the protection protein HOAP to telomeres. Drosophila nbs is required for both atm- and atr-dependent DNA damage responses and acts in these pathways during DNA repair. The telomere fusion phenotype of nbs is consistent with defects in each of these activities. Cells defective in both the atm and atr pathways were used to examine if DNA damage response pathways regulate telomere protection without affecting telomere specific sequences. In these cells, chromosome fusion sites retain telomere-specific sequences, demonstrating that loss of these sequences is not responsible for loss of protection. Furthermore, terminally deleted chromosomes also fuse in these cells, directly implicating DNA damage response pathways in the epigenetic protection of telomeres. We propose that recognition of chromosome ends and recruitment of HP1 and HOAP by DNA damage response proteins is essential for the epigenetic protection of Drosophila telomeres. Given the conserved roles of DNA damage response proteins in telomere function, related mechanisms may act at the telomeres of other organisms.
Jiménez, Diego Javier; Dini-Andreote, Francisco; Ottoni, Júlia Ronzella; de Oliveira, Valéria Maia; van Elsas, Jan Dirk; Andreote, Fernando Dini
2015-05-01
The occurrence of genes encoding biotechnologically relevant α/β-hydrolases in mangrove soil microbial communities was assessed using data obtained by whole-metagenome sequencing of four mangroves areas, denoted BrMgv01 to BrMgv04, in São Paulo, Brazil. The sequences (215 Mb in total) were filtered based on local amino acid alignments against the Lipase Engineering Database. In total, 5923 unassembled sequences were affiliated with 30 different α/β-hydrolase fold superfamilies. The most abundant predicted proteins encompassed cytosolic hydrolases (abH08; ∼ 23%), microsomal hydrolases (abH09; ∼ 12%) and Moraxella lipase-like proteins (abH04 and abH01; < 5%). Detailed analysis of the genes predicted to encode proteins of the abH08 superfamily revealed a high proportion related to epoxide hydrolases and haloalkane dehalogenases in polluted mangroves BrMgv01-02-03. This suggested selection and putative involvement in local degradation/detoxification of the pollutants. Seven sequences that were annotated as genes for putative epoxide hydrolases and five for putative haloalkane dehalogenases were found in a fosmid library generated from BrMgv02 DNA. The latter enzymes were predicted to belong to Actinobacteria, Deinococcus-Thermus, Planctomycetes and Proteobacteria. Our integrated approach thus identified 12 genes (complete and/or partial) that may encode hitherto undescribed enzymes. The low amino acid identity (< 60%) with already-described genes opens perspectives for both production in an expression host and genetic screening of metagenomes. © 2014 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Purification and partial characterization of canine S100A12.
Heilmann, Romy M; Suchodolski, Jan S; Steiner, Jörg M
2010-12-01
Canine S100A12 (cS100A12) is a calcium-binding protein of the S100 superfamily of EF-hand proteins, and its expression is restricted to neutrophils and monocytes. Interaction of S100A12 with the receptor for advanced glycation end products (RAGE) has been suggested to play a central role in inflammation. Moreover, S100A12 has been shown to represent a sensitive and specific marker for gastrointestinal inflammation in humans. Only human, porcine, bovine, and rabbit S100A12 have been purified to date, and an immunoassay for the quantification of S100A12 is available only for humans. Therefore, the aim of this study was to develop a protocol for the purification of S100A12 and to partially characterize this protein in the dog (Canis lupus familiaris) as a prelude to the development of an immunologic method for its detection and quantification in canine serum and fecal specimens. Leukocytes were isolated from canine whole blood by dextran sedimentation, and canine S100A12 was extracted from the cytosol fraction of these cells. Further purification of cS100A12 comprised of ammonium sulfate precipitation, hydrophobic interaction chromatography, and strong cation- and anion-exchange column chromatography. Canine S100A12 was successfully purified from canine whole blood. The relative molecular mass of the protein was estimated at 10,379.5 and isoelectric focusing revealed an isoelectric point of 6.0. The approximate specific absorbance of cS100A12 at 280 nm was determined to be 1.78 for a 1 mg/ml solution. The N-terminal AA sequence of the first 15 residues of cS100A12 was Thr-Lys-Leu-Glu-Asp-His-X-Glu-Gly-Ile-Val-Asp-Val-Phe-His, and revealed 100% identity with the predicted protein sequence available through the canine genome project. Sequence homology for the 14 N-terminal residues identified for cS100A12 with those of feline, bovine, porcine, and human S100A12 was 78.6%. We conclude that canine S100A12 can be successfully purified from canine whole blood using the described methods. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Protein synthesis and secretion by the epididymis of the brushtail possum, Trichosurus vulpecula.
Lamont, A E; Clarke, H; Cooper, N J; Holland, M K; Breed, W G
1998-09-01
In this study of brushtail possums, proteins present in epididymal fluid and not present in blood plasma and those that become associated with spermatozoa as they pass along the tract were investigated. At least 19 proteins were present in epididymal fluid in the various regions of the tract that were not detected in serum. Some of these may be present on the sperm plasmalemma. Six proteins were extracted from caput spermatozoa (M(r) 117,000, 103,000, 87,500, 85,000, 62,000 and 33,000) that did not appear in the caudal sperm extracts. Eight proteins (M(r) 50,000, 49,000, 32,000, 27,000, 26,500, 25,000, 24,500 and 18,000) were localized to caudal sperm extracts. These findings suggest that some sperm proteins are lost or modified, whereas others are added to the sperm plasma membrane during epididymal transit. In vitro incorporation of [35S]methionine by the epididymal tissue showed that the distal caput and corpus are the most active regions in the synthesis and secretion of proteins. Four caudal epididymal proteins (M(r) 72,000, 31,000, 26,500 and 21,000) were partially sequenced. Those of M(r) 31,000 and 26,500 had the same N-terminal amino acid sequence suggesting post-translational modification of the same protein; they showed homology to a retinoic acid-binding protein. The protein of M(r) 72,000 was found to be homologous to alpha-fetoprotein, whereas the protein of M(r) 21,000 showed no significant homology to any protein in the database. These results show that the lumen of the epididymis has many proteins that are not present in the blood, some of which appear to become associated with spermatozoa during epididymal transit.
NASA Technical Reports Server (NTRS)
Ni, Jianjun (David)
2012-01-01
This presentation discusses an analysis approach to evaluate the interuser interference for Direct-Sequence Spread-Spectrum (DSSS) Systems for Space Network (SN) Users. Part I of this analysis shows that the correlation property of pseudo noise (PN) sequences is the critical factor which determines the interuser interference performance of the DSSS system. For non-standard DSSS systems in which PN sequence s period is much larger than one data symbol duration, it is the partial-period cross-correlation that determines the system performance. This study reveals through an example that a well-designed PN sequence set (e.g. Gold Sequence, in which the cross-correlation for a whole-period is well controlled) may have non-controlled partial-period cross-correlation which could cause severe interuser interference for a DSSS system. Since the analytical derivation of performance metric (bit error rate or signal-to-noise ratio) based on partial-period cross-correlation is prohibitive, the performance degradation due to partial-period cross-correlation will be evaluated using simulation in Part II of this analysis in the future.
Erb, Laurie; Liu, Jun; Ockerhausen, Jonathan; Kong, Qiongman; Garrad, Richard C.; Griffin, Korey; Neal, Chris; Krugh, Brent; Santiago-Pérez, Laura I.; González, Fernando A.; Gresham, Hattie D.; Turner, John T.; Weisman, Gary A.
2001-01-01
The P2Y2 nucleotide receptor (P2Y2R) contains the integrin-binding domain arginine-glycine-aspartic acid (RGD) in its first extracellular loop, raising the possibility that this G protein–coupled receptor interacts directly with an integrin. Binding of a peptide corresponding to the first extracellular loop of the P2Y2R to K562 erythroleukemia cells was inhibited by antibodies against αVβ3/β5 integrins and the integrin-associated thrombospondin receptor, CD47. Immunofluorescence of cells transfected with epitope-tagged P2Y2Rs indicated that αV integrins colocalized 10-fold better with the wild-type P2Y2R than with a mutant P2Y2R in which the RGD sequence was replaced with RGE. Compared with the wild-type P2Y2R, the RGE mutant required 1,000-fold higher agonist concentrations to phosphorylate focal adhesion kinase, activate extracellular signal–regulated kinases, and initiate the PLC-dependent mobilization of intracellular Ca2+. Furthermore, an anti-αV integrin antibody partially inhibited these signaling events mediated by the wild-type P2Y2R. Pertussis toxin, an inhibitor of Gi/o proteins, partially inhibited Ca2+ mobilization mediated by the wild-type P2Y2R, but not by the RGE mutant, suggesting that the RGD sequence is required for P2Y2R-mediated activation of Go, but not Gq. Since CD47 has been shown to associate directly with Gi/o family proteins, these results suggest that interactions between P2Y2Rs, integrins, and CD47 may be important for coupling the P2Y2R to Go. PMID:11331301
Byun, Hyunjong; Park, Jiyeon; Kim, Sun Chang; Ahn, Jung Hoon
2017-12-01
Efficient protein production for industrial and academic purposes often involves engineering microorganisms to produce and secrete target proteins into the culture. Pseudomonas fluorescens has a TliDEF ATP-binding cassette transporter, a type I secretion system, which recognizes C-terminal LARD3 signal sequence of thermostable lipase TliA. Many proteins are secreted by TliDEF in vivo when recombined with LARD3, but there are still others that cannot be secreted by TliDEF even when LARD3 is attached. However, the factors that determine whether or not a recombinant protein can be secreted through TliDEF are still unknown. Here, we recombined LARD3 with several proteins and examined their secretion through TliDEF. We found that the proteins secreted via LARD3 are highly negatively charged with highly-acidic isoelectric points (pI) lower than 5.5. Attaching oligo-aspartate to lower the pI of negatively-charged recombinant proteins improved their secretion, and attaching oligo-arginine to negatively-charged proteins blocked their secretion by LARD3. In addition, negatively supercharged green fluorescent protein (GFP) showed improved secretion, whereas positively supercharged GFP did not secrete. These results disclosed that proteins' acidic pI and net negative charge are major factors that determine their secretion through TliDEF. Homology modeling for TliDEF revealed that TliD dimer forms evolutionarily-conserved positively-charged clusters in its pore and substrate entrance site, which also partially explains the pI dependence of the TliDEF-dependent secretions. In conclusion, lowering the isoelectric point improved LARD3-mediated protein secretion, both widening the range of protein targets for efficient production via secretion and signifying an important aspect of ABC transporter-mediated secretions. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Partial Purification of a Megadalton DNA Replication Complex by Free Flow Electrophoresis.
Li, Caroline M; Miao, Yunan; Lingeman, Robert G; Hickey, Robert J; Malkas, Linda H
2016-01-01
We describe a gentle and rapid method to purify the intact multiprotein DNA replication complex using free flow electrophoresis (FFE). In particular, we applied FFE to purify the human cell DNA synthesome, which is a multiprotein complex that is fully competent to carry-out all phases of the DNA replication process in vitro using a plasmid containing the simian virus 40 (SV40) origin of DNA replication and the viral large tumor antigen (T-antigen) protein. The isolated native DNA synthesome can be of use in studying the mechanism by which mammalian DNA replication is carried-out and how anti-cancer drugs disrupt the DNA replication or repair process. Partially purified extracts from HeLa cells were fractionated in a native, liquid based separation by FFE. Dot blot analysis showed co-elution of many proteins identified as part of the DNA synthesome, including proliferating cell nuclear antigen (PCNA), DNA topoisomerase I (topo I), DNA polymerase δ (Pol δ), DNA polymerase ɛ (Pol ɛ), replication protein A (RPA) and replication factor C (RFC). Previously identified DNA synthesome proteins co-eluted with T-antigen dependent and SV40 origin-specific DNA polymerase activity at the same FFE fractions. Native gels show a multiprotein PCNA containing complex migrating with an apparent relative mobility in the megadalton range. When PCNA containing bands were excised from the native gel, mass spectrometric sequencing analysis identified 23 known DNA synthesome associated proteins or protein subunits.
A Novel Partial Sequence Alignment Tool for Finding Large Deletions
Aruk, Taner; Ustek, Duran; Kursun, Olcay
2012-01-01
Finding large deletions in genome sequences has become increasingly more useful in bioinformatics, such as in clinical research and diagnosis. Although there are a number of publically available next generation sequencing mapping and sequence alignment programs, these software packages do not correctly align fragments containing deletions larger than one kb. We present a fast alignment software package, BinaryPartialAlign, that can be used by wet lab scientists to find long structural variations in their experiments. For BinaryPartialAlign, we make use of the Smith-Waterman (SW) algorithm with a binary-search-based approach for alignment with large gaps that we called partial alignment. BinaryPartialAlign implementation is compared with other straight-forward applications of SW. Simulation results on mtDNA fragments demonstrate the effectiveness (runtime and accuracy) of the proposed method. PMID:22566777
Foldability of a Natural De Novo Evolved Protein.
Bungard, Dixie; Copple, Jacob S; Yan, Jing; Chhun, Jimmy J; Kumirov, Vlad K; Foy, Scott G; Masel, Joanna; Wysocki, Vicki H; Cordes, Matthew H J
2017-11-07
The de novo evolution of protein-coding genes from noncoding DNA is emerging as a source of molecular innovation in biology. Studies of random sequence libraries, however, suggest that young de novo proteins will not fold into compact, specific structures typical of native globular proteins. Here we show that Bsc4, a functional, natural de novo protein encoded by a gene that evolved recently from noncoding DNA in the yeast S. cerevisiae, folds to a partially specific three-dimensional structure. Bsc4 forms soluble, compact oligomers with high β sheet content and a hydrophobic core, and undergoes cooperative, reversible denaturation. Bsc4 lacks a specific quaternary state, however, existing instead as a continuous distribution of oligomer sizes, and binds dyes indicative of amyloid oligomers or molten globules. The combination of native-like and non-native-like properties suggests a rudimentary fold that could potentially act as a functional intermediate in the emergence of new folded proteins de novo. Copyright © 2017 Elsevier Ltd. All rights reserved.
Shrimp miRNAs regulate innate immune response against white spot syndrome virus infection.
Kaewkascholkul, Napol; Somboonviwat, Kulwadee; Asakawa, Shuichi; Hirono, Ikuo; Tassanakajon, Anchalee; Somboonwiwat, Kunlaya
2016-07-01
MicroRNAs are short noncoding RNAs of RNA interference pathways that regulate gene expression through partial complementary base-pairing to target mRNAs. In this study, miRNAs that are expressed in white spot syndrome virus (WSSV)-infected Penaeus monodon, were identified using next generation sequencing. Forty-six miRNA homologs were identified from WSSV-infected shrimp hemocyte. Stem-loop real-time RT-PCR analysis showed that 11 out of 16 selected miRNAs were differentially expressed upon WSSV infection. Of those, pmo-miR-315 and pmo-miR-750 were highly responsive miRNAs. miRNA target prediction revealed that the miRNAs were targeted at 5'UTR, ORF, and 3'UTR of several immune-related genes such as genes encoding antimicrobial peptides, signaling transduction proteins, heat shock proteins, oxidative stress proteins, proteinases or proteinase inhibitors, proteins in blood clotting system, apoptosis-related proteins, proteins in prophenoloxidase system, pattern recognition proteins and other immune molecules. The highly conserved miRNA homolog, pmo-bantam, was characterized for its function in shrimp. The pmo-bantam was predicted to target the 3'UTR of Kunitz-type serine protease inhibitor (KuSPI). Binding of pmo-bantam to the target sequence of KuSPI gene was analyzed by luciferase reporter assay. Correlation of pmo-bantam and KuSPI expression was observed in lymphoid organ of WSSV-infected shrimp. These results implied that miRNAs might play roles as immune gene regulators in shrimp antiviral response. Copyright © 2016. Published by Elsevier Ltd.
G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.
Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran
2016-08-26
Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.
Zhu, Zhixuan; Gui, Songtao; Jin, Jing; Yi, Rong; Wu, Zhihua; Qian, Qian; Ding, Yi
2016-09-01
Centromeres on eukaryotic chromosomes consist of large arrays of DNA repeats that undergo very rapid evolution. Nelumbo nucifera Gaertn. (sacred lotus) is a phylogenetic relict and an aquatic perennial basal eudicot. Studies concerning the centromeres of this basal eudicot species could provide ancient evolutionary perspectives. In this study, we characterized the centromeric marker protein NnCenH3 (sacred lotus centromere-specific histone H3 variant), and used a chromatin immunoprecipitation (ChIP)-based technique to recover the NnCenH3 nucleosome-associated sequences of sacred lotus. The properties of the centromere-binding protein and DNA sequences revealed notable divergence between sacred lotus and other flowering plants, including the following factors: (i) an NnCenH3 alternative splicing variant comprising only a partial centromere-targeting domain, (ii) active genes with low transcription levels in the NnCenH3 nucleosomal regions, and (iii) the prevalence of the Ty1/copia class of long terminal repeat (LTR) retrotransposons in the centromeres of sacred lotus chromosomes. In addition, the dynamic natures of the centromeric region showed that some of the centromeric repeat DNA sequences originated from telomeric repeats, and a pair of centromeres on the dicentric chromosome 1 was inactive in the metaphase cells of sacred lotus. Our characterization of the properties of centromeric DNA structure within the sacred lotus genome describes a centromeric profile in ancient basal eudicots and might provide evidence of the origins and evolution of centromeres. Furthermore, the identification of centromeric DNA sequences is of great significance for the assembly of the sacred lotus genome. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Structure of the membrane channel porin from Rhodopseudomonas blastica at 2.0 A resolution.
Kreusch, A.; Neubüser, A.; Schiltz, E.; Weckesser, J.; Schulz, G. E.
1994-01-01
The crystal structure of a membrane channel, homotrimeric porin from Rhodopseudomonas blastica has been determined at 2.0 A resolution by multiple isomorphous replacement and structural refinement. The current model has an R-factor of 16.5% and consists of 289 amino acids, 238 water molecules, and 3 detergent molecules per subunit. The partial protein sequence and subsequently the complete DNA sequence were determined. The general architecture is similar to those of the structurally known porins. As a particular feature there are 3 adjacent binding sites for n-alkyl chains at the molecular 3-fold axis. The side chain arrangement in the channel indicates a transverse electric field across each of the 3 pore eyelets, which may explain the discrimination against nonpolar solutes. Moreover, there are 2 significantly ordered girdles of aromatic residues at the nonpolar/polar borderlines of the interface between protein and membrane. Possibly, these residues shield the polypeptide conformation against adverse membrane fluctuations. PMID:8142898
Morrone, A; Tylee, K.L.; Al-Sayed, M; Brusius-Facchin, A.C.; Caciotti, A.; Church, H.J.; Coll, M.J.; Davidson, K.; Fietz, M.J.; Gort, L.; Hegde, M.; Kubaski, F.; Lacerda, L.; Laranjeira, F.; Leistner-Segal, S.; Mooney, S.; Pajares, S.; Pollard, L.; Riberio, I.; Wang, R.Y.; Miller, N.
2014-01-01
Morquio A (Mucopolysaccharidosis IVA; MPS IVA) is an autosomal recessive lysosomal storage disorder caused by partial or total deficiency of the enzyme galactosamine-6-sulfate sulfatase (GALNS; also known as N-acetylgalactosamine-6-sulfate sulfatase) encoded by the GALNS gene. Patients who inherit two mutated GALNS gene alleles produce protein with decreased ability to degrade the glycosaminoglycans (GAGs) keratan sulfate and chondroitin 6-sulfate, thereby causing GAG accumulation within lysosomes and consequently pleiotropic disease. GALNS mutations occur throughout the gene and many mutations are identified only in single patients or families, causing difficulties both in mutation detection and interpretation. In this study, molecular analysis of 163 patients with Morquio A identified 99 unique mutations in the GALNS gene believed to negatively impact GALNS protein function, of which 39 are previously unpublished, together with 26 single-nucleotide polymorphisms. Recommendations for the molecular testing of patients, clear reporting of sequence findings, and interpretation of sequencing data are provided. PMID:24726177
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lalime, Erin N.; Pekosz, Andrew, E-mail: apekosz@jhsph.edu
The influenza A virus NS1 protein has a nuclear localization sequence (NLS) in the amino terminal region. This NLS overlaps sequences that are important for RNA binding as well as protein dimerization. To assess the significance of the NS1 NLS on influenza virus replication, the NLS amino acids were individually mutated to alanines and recombinant viruses encoding these mutations were rescued. Viruses containing NS1 proteins with mutations at R37, R38 and K41 displayed minimal changes in replication or NS1 protein nuclear localization. Recombinant viruses encoding NS1 R35A were not recovered but viruses containing second site mutations at position D39 inmore » addition to the R35A mutation were isolated. The mutations at position 39 were shown to partially restore NS1 protein dimerization but had minimal effects on nuclear localization. These data indicate that the amino acids in the NS1 NLS region play a more important role in protein dimerization compared to nuclear localization. - Highlights: • Mutations were introduced into influenza NS1 NLS1. • NS1 R37A, R38A, K41A viruses had minimal changes in replication and NS1 localization. • Viruses from NS1 R35A rescue all contained additional mutations at D39. • NS1 R35A D39X mutations recover dimerization lost in NS1 R35A mutations. • These results reaffirm the importance of dimerization for NS1 protein function.« less
Genetic engineering and chemical conjugation of potato virus X.
Lee, Karin L; Uhde-Holzem, Kerstin; Fischer, Rainer; Commandeur, Ulrich; Steinmetz, Nicole F
2014-01-01
Here we report the genetic engineering and chemical modification of potato virus X (PVX) for the presentation of various peptides, proteins, and fluorescent dyes, or other chemical modifiers. Three different ways of genetic engineering are described and by these means, peptides are successfully expressed not only when the foot and mouth disease virus (FMDV) 2A sequence or a flexible glycine-serine linker is included, but also when the peptide is fused directly to the PVX coat protein. When larger proteins or unfavorable peptide sequences are presented, a partial fusion via the FMDV 2A sequence is preferable. When these PVX chimeras retain the ability to assemble into viral particles and are thus able to infect plants systemically, they can be utilized to inoculate susceptible plants for isolation of sufficient amounts of virus particles for subsequent chemical modification. Chemical modification is required for the display of nonbiological ligands such as fluorophores, polymers, and small drug compounds. We present three methods of chemical bioconjugation. For direct conjugation of small chemical modifiers to solvent exposed lysines, N-hydroxysuccinimide chemistry can be applied. Bio-orthogonal reactions such as copper-catalyzed azide-alkyne cycloaddition or hydrazone ligation are alternatives to achieve more efficient conjugation (e.g., when working with high molecular weight or insoluble ligands). Furthermore, hydrazone ligation offers an attractive route for the introduction of pH-cleavable cargos (e.g., therapeutic molecules).
Packing Regularities in Biological Structures Relate to Their Dynamics
Jernigan, Robert L.; Kloczkowski, Andrzej
2007-01-01
The high packing density inside proteins leads to certain geometric regularities and also is one of the most important contributors to the high extent of cooperativity manifested by proteins in their cohesive domain motions. The orientations between neighboring non-bonded residues in proteins substantially follow the similar geometric regularities, regardless of whether the residues are on the surface or buried - a direct result of hydrophobicity forces. These orientations are relatively fixed and correspond closely to small deformations from those of the face-centered cubic lattice, which is the way in which identical spheres pack at the highest density. Packing density also is related to the extent of conservation of residues, and we show this relationship for residue packing densities by averaging over a large sample or residue packings. There are three regimes: 1) over a broad range of packing densities the relationship between sequence entropy and inverse packing density is nearly linear, 2) over a limited range of low packing densities the sequence entropy is nearly constant, and 3) at extremely low packing densities the sequence entropy is highly variable. These packing results provide important justification for the simple elastic network models that have been shown for a large number of proteins to represent protein dynamics so successfully, even when the models are extremely coarse-grained. Elastic network models for polymeric chains are simple and could be combined with these protein elastic networks to represent partially denatured parts of proteins. Finally, we show results of applications of the elastic network model to study the functional motions of the ribosome, based on its known structure. These results indicate expected correlations among its components for the step-wise processing steps in protein synthesis, and suggest ways to use these elastic network models to develop more detailed mechanisms - an important possibility, since most experiments yield only static structures. PMID:16957327
Functional amyloid in Pseudomonas.
Dueholm, Morten S; Petersen, Steen V; Sønderkær, Mads; Larsen, Poul; Christiansen, Gunna; Hein, Kim L; Enghild, Jan J; Nielsen, Jeppe L; Nielsen, Kåre L; Nielsen, Per H; Otzen, Daniel E
2010-08-01
Amyloids are highly abundant in many microbial biofilms and may play an important role in their architecture. Nevertheless, little is known of the amyloid proteins. We report the discovery of a novel functional amyloid expressed by a Pseudomonas strain of the P. fluorescens group. The amyloid protein was purified and the amyloid-like structure verified. Partial sequencing by MS/MS combined with full genomic sequencing of the Pseudomonas strain identified the gene coding for the major subunit of the amyloid fibril, termed fapC. FapC contains a thrice repeated motif that differs from those previously found in curli fimbrins and prion proteins. The lack of aromatic residues in the repeat shows that aromatic side chains are not needed for efficient amyloid formation. In contrast, glutamine and asparagine residues seem to play a major role in amyloid formation as these are highly conserved in curli, prion proteins and FapC. fapC is conserved in many Pseudomonas strains including the opportunistic pathogen P. aeruginosa and is situated in a conserved operon containing six genes, of which one encodes a fapC homologue. Heterologous expression of the fapA-F operon in Escherichia coli BL21(DE3) resulted in a highly aggregative phenotype, showing that the operon is involved in biofilm formation. © 2010 Blackwell Publishing Ltd.
Brandon, Nicholas J.; Moss, Stephen J.
2017-01-01
The RNA-binding and -processing protein TAR DNA-binding protein 43 (TDP-43) is heavily linked to the underlying causes and pathology of neurodegenerative diseases such as amyotrophic lateral sclerosis and frontotemporal lobar degeneration. In these diseases, TDP-43 is mislocalized, hyperphosphorylated, ubiquitinated, aggregated and cleaved. The importance of TDP-43 cleavage in the disease pathogenesis is still poorly understood. Here we detail the use of D-sorbitol as an exogenous stressor that causes TDP-43 cleavage in HeLa cells, resulting in a 35 kDa truncated product that accumulates in the cytoplasm within one hour of treatment. We confirm that the formation of this 35 kDa cleavage product is mediated by the activation of caspases. Inhibition of caspases blocks the cleavage of TDP-43, but does not prevent the accumulation of full-length protein in the cytoplasm. Using D-sorbitol as a stressor and caspase activator, we also demonstrate that the A90V variant of TDP-43, which lies adjacent to the caspase cleavage site within the nuclear localization sequence of TDP-43, confers partial resistance against caspase-mediated generation of the 35 kDa cleavage product. PMID:28510586
Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita
2007-01-01
The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria, responsible for extreme acid sensitivity especially in Escherichia coli and Shigella flexneri. Here we found the molecular evidence that proved the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC coded acid sensitive antiporter showed many conserved residue patterns at regular intervals at the N-terminal region. It was observed that as the alignment approaches towards the C-terminal, the number of conserved residues decreases, indicating that the N-terminal region of this protein has much active role when compared to the carboxyl terminal. The motif, FHLVFFLLLGG, is well conserved within the entire gadC coded protein at the amino terminal. The motif is also partially conserved among other antiporters (which are not coded by gadC) but involved in acid sensitive/resistance mechanism. Phylogenetic cluster analysis proves the relationship of Escherichia coli and Shigella flexneri. The gadC coded proteins are converged as a clade and diverged from other antiporters belongs to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792
Goller, Katja V; Gabriel, Claudia; Dimna, Mireille Le; Le Potier, Marie-Frédérique; Rossi, Sophie; Staubach, Christoph; Merboth, Matthias; Beer, Martin; Blome, Sandra
2016-03-01
Classical swine fever is a viral disease of pigs that carries tremendous socio-economic impact. In outbreak situations, genetic typing is carried out for the purpose of molecular epidemiology in both domestic pigs and wild boar. These analyses are usually based on harmonized partial sequences. However, for high-resolution analyses towards the understanding of genetic variability and virus evolution, full-genome sequences are more appropriate. In this study, a unique set of representative virus strains was investigated that was collected during an outbreak in French free-ranging wild boar in the Vosges-du-Nord mountains between 2003 and 2007. Comparative sequence and evolutionary analyses of the nearly full-length sequences showed only slow evolution of classical swine fever virus strains over the years and no impact of vaccination on mutation rates. However, substitution rates varied amongst protein genes; furthermore, a spatial and temporal pattern could be observed whereby two separate clusters were formed that coincided with physical barriers.
Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity.
Johnson, P R; Fomsgaard, A; Allan, J; Gravell, M; London, W T; Olmsted, R A; Hirsch, V M
1990-01-01
African green monkeys are asymptomatic carriers of simian immunodeficiency viruses (SIV), commonly called SIVagm. As many as 50% of African green monkeys in the wild may be SIV seropositive. This high seroprevalence rate and the potential for genetic variation of lentiviruses suggested to us that African green monkeys may harbor widely differing genotypes of SIVagm. To investigate this hypothesis, we determined the entire nucleotide sequence of an infectious proviral molecular clone of SIVagm (155-4) and partial sequences (long terminal repeat and Gag) of three other distinct SIVagm isolates (90, gri-1, and ver-1). Comparisons among the SIVagm isolates revealed extreme diversity at the nucleotide and amino acid levels. Long terminal repeat nucleotide sequences varied up to 35% and Gag protein sequences varied up to 30%. The variability among SIVagm isolates exceeded the variability among any other group of primate lentiviruses. Our data suggest that SIVagm has been in the African green monkey population for a long time and may be the oldest primate lentivirus group in existence. PMID:2304139
No evidence for a role of modified live virus vaccines in the emergence of canine parvovirus.
Truyen, U; Geissler, K; Parrish, C R; Hermanns, W; Siegl, G
1998-05-01
In this study the early evolution and potential origins of canine parvovirus (CPV) were examined. We cloned and sequenced the VP2 capsid protein genes of three German CPV strains isolated in 1979-1980, as well as two feline panleukopenia virus (FPV) vaccine viruses that were previously shown to have some restriction enzyme cleavage sites in common with CPV. Other partial VP2 gene sequences were obtained by amplifying CPV DNA from paraffin-embedded tissues of dogs which were early parvovirus disease cases in Germany in 1978-1979. Sequences were analysed with respect to their evolutionary relationships to other CPV and FPV isolates. Those analyses did not support the hypothesis that CPV emerged as a variant of an FPV vaccine virus. Neither did they reveal ancestral sequences among the very early CPV isolates examined. Other possible sources for the origin of CPV are examined, including the involvement of viruses from wild carnivores.
Time course of pathogenic and adaptation mechanisms in cystinotic mouse kidneys.
Gaide Chevronnay, Héloïse P; Janssens, Virginie; Van Der Smissen, Patrick; N'Kuli, Francisca; Nevo, Nathalie; Guiot, Yves; Levtchenko, Elena; Marbaix, Etienne; Pierreux, Christophe E; Cherqui, Stéphanie; Antignac, Corinne; Courtoy, Pierre J
2014-06-01
Cystinosis, a main cause of Fanconi syndrome, is reproduced in congenic C57BL/6 cystinosin knockout (KO) mice. To identify the sequence of pathogenic and adaptation mechanisms of nephropathic cystinosis, we defined the onset of Fanconi syndrome in KO mice between 3 and 6 months of age and analyzed the correlation with structural and functional changes in proximal tubular cells (PTCs), with focus on endocytosis of ultrafiltrated disulfide-rich proteins as a key source of cystine. Despite considerable variation between mice at the same age, typical event sequences were delineated. At the cellular level, amorphous lysosomal inclusions preceded cystine crystals and eventual atrophy without crystals. At the nephron level, lesions started at the glomerulotubular junction and then extended distally. In situ hybridization and immunofluorescence revealed progressive loss of expression of megalin, cubilin, sodium-glucose cotransporter 2, and type IIa sodium-dependent phosphate cotransporter, suggesting apical dedifferentiation accounting for Fanconi syndrome before atrophy. Injection of labeled proteins revealed that defective endocytosis in S1 PTCs led to partial compensatory uptake by S3 PTCs, suggesting displacement of endocytic load and injury by disulfide-rich cargo. Increased PTC apoptosis allowed luminal shedding of cystine crystals and was partially compensated for by tubular proliferation. We conclude that lysosomal storage triggered by soluble cystine accumulation induces apical PTC dedifferentiation, which causes transfer of the harmful load of disulfide-rich proteins to more distal cells, possibly explaining longitudinal progression of swan-neck lesions. Furthermore, our results suggest that subsequent adaptation mechanisms include lysosomal clearance of free and crystalline cystine into urine and ongoing tissue repair. Copyright © 2014 by the American Society of Nephrology.
Sequence diversity of wheat mosaic virus isolates.
Stewart, Lucy R
2016-02-02
Wheat mosaic virus (WMoV), transmitted by eriophyid wheat curl mites (Aceria tosichella) is the causal agent of High Plains disease in wheat and maize. WMoV and other members of the genus Emaravirus evaded thorough molecular characterization for many years due to the experimental challenges of mite transmission and manipulating multisegmented negative sense RNA genomes. Recently, the complete genome sequence of a Nebraska isolate of WMoV revealed eight segments, plus a variant sequence of the nucleocapsid protein-encoding segment. Here, near-complete and partial consensus sequences of five more WMoV isolates are reported and compared to the Nebraska isolate: an Ohio maize isolate (GG1), a Kansas barley isolate (KS7), and three Ohio wheat isolates (H1, K1, W1). Results show two distinct groups of WMoV isolates: Ohio wheat isolate RNA segments had 84% or lower nucleotide sequence identity to the NE isolate, whereas GG1 and KS7 had 98% or higher nucleotide sequence identity to the NE isolate. Knowledge of the sequence variability of WMoV isolates is a step toward understanding virus biology, and potentially explaining observed biological variation. Published by Elsevier B.V.
Boguś, M I; Wieloch, W; Ligęza-Żuber, M
2017-02-01
Coronatin-2, a 14.5 kDa protein, was isolated from culture filtrates of the entomopathogenic fungus Conidiobolus coronatus (Costantin) Batko (Entomophthoramycota: Entomophthorales). After LC-MS/MS (liquid chromatography tandem mass spectrometry) analysis of the tryptic peptide digest of coronatin-2 and a mass spectra database search no orthologs of this protein could be found in fungi. The highest homology was observed to the partial translation elongation factor 1a from Sphaerosporium equinum (protein sequence coverage, 21%), with only one peptide sequence, suggesting that coronatin-2 is a novel fungal protein that has not yet been described. In contrast to coronatin-1, an insecticidal 36 kDa protein, which shows both elastolytic and chitinolytic activity, coronatin-2 showed no enzymatic activity. Addition of coronatin-2 into cultures of hemocytes taken from larvae of Galleria mellonella Linnaeus (Lepidoptera: Pyralidae), resulted in progressive disintegration of nets formed by granulocytes and plasmatocytes due to rapid degranulation of granulocytes, extensive vacuolization of plasmatocytes accompanied by cytoplasm expulsion, and cell disintegration. Spherulocytes remained intact, while oenocytes rapidly disintegrated. Coronatin-2 produced 80% mortality when injected into G. mellonella at 5 µg larva-1. Further study is warranted to determine the relevance of the acute toxicity of coronatin-2 and its effects on hemocytes in vitro to virulence of C. coronatus against its hosts.
Fustiñana, Maria Sol; Ariel, Pablo; Federman, Noel; Freudenthal, Ramiro; Romano, Arturo
2010-09-01
Human β-amyloid, the main component in the neuritic plaques found in patients with Alzheimer's disease, is generated by cleavage of the β-amyloid precursor protein. Beyond the role in pathology, members of this protein family are synaptic proteins and have been associated with synaptogenesis, neuronal plasticity and memory, both in vertebrates and in invertebrates. Consolidation is necessary to convert a short-term labile memory to a long-term and stable form. During consolidation, gene expression and de novo protein synthesis are regulated in order to produce key proteins for the maintenance of plastic changes produced during the acquisition of new information. Here we partially cloned and sequenced the beta-amyloid precursor protein like gene homologue in the crab Chasmagnathus (cappl), showing a 37% of identity with the fruit fly Drosophila melanogaster homologue and 23% with Homo sapiens but with much higher degree of sequence similarity in certain regions. We observed a wide distribution of cappl mRNA in the nervous system as well as in muscle and gills. The protein localized in all tissues analyzed with the exception of muscle. Immunofluorescence revealed localization of cAPPL in associative and sensory brain areas. We studied gene and protein expression during long-term memory consolidation using a well characterized memory model: the context-signal associative memory in this crab species. mRNA levels varied at different time points during long-term memory consolidation and correlated with cAPPL protein levels cAPPL mRNA and protein is widely distributed in the central nervous system of the crab and the time course of expression suggests a role of cAPPL during long-term memory formation.
Multiple TPR motifs characterize the Fanconi anemia FANCG protein.
Blom, Eric; van de Vrugt, Henri J; de Vries, Yne; de Winter, Johan P; Arwert, Fré; Joenje, Hans
2004-01-05
The genome protection pathway that is defective in patients with Fanconi anemia (FA) is controlled by at least eight genes, including BRCA2. A key step in the pathway involves the monoubiquitylation of FANCD2, which critically depends on a multi-subunit nuclear 'core complex' of at least six FANC proteins (FANCA, -C, -E, -F, -G, and -L). Except for FANCL, which has WD40 repeats and a RING finger domain, no significant domain structure has so far been recognized in any of the core complex proteins. By using a homology search strategy comparing the human FANCG protein sequence with its ortholog sequences in Oryzias latipes (Japanese rice fish) and Danio rerio (zebrafish) we identified at least seven tetratricopeptide repeat motifs (TPRs) covering a major part of this protein. TPRs are degenerate 34-amino acid repeat motifs which function as scaffolds mediating protein-protein interactions, often found in multiprotein complexes. In four out of five TPR motifs tested (TPR1, -2, -5, and -6), targeted missense mutagenesis disrupting the motifs at the critical position 8 of each TPR caused complete or partial loss of FANCG function. Loss of function was evident from failure of the mutant proteins to complement the cellular FA phenotype in FA-G lymphoblasts, which was correlated with loss of binding to FANCA. Although the TPR4 mutant fully complemented the cells, it showed a reduced interaction with FANCA, suggesting that this TPR may also be of functional importance. The recognition of FANCG as a typical TPR protein predicts this protein to play a key role in the assembly and/or stabilization of the nuclear FA protein core complex.
Iwanowicz, L; Densmore, C; Hahn, C; McAllister, P; Odenkirk, J
2013-09-01
The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Iwanowicz, Luke R.; Densmore, Christine L.; Hahn, Cassidy M.; McAllister, Phillip; Odenkirk, John
2013-01-01
The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Lerner, D R; Raikhel, N V
1992-06-05
Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.
Xiong, H; Campelo, D; Pollack, R J; Raoult, D; Shao, R; Alem, M; Ali, J; Bilcha, K; Barker, S C
2014-08-01
The Illumina Hiseq platform was used to sequence the entire mitochondrial coding-regions of 20 body lice, Pediculus humanus Linnaeus, and head lice, P. capitis De Geer (Phthiraptera: Pediculidae), from eight towns and cities in five countries: Ethiopia, France, China, Australia and the U.S.A. These data (∼310 kb) were used to see how much more informative entire mitochondrial coding-region sequences were than partial mitochondrial coding-region sequences, and thus to guide the design of future studies of the phylogeny, origin, evolution and taxonomy of body lice and head lice. Phylogenies were compared from entire coding-region sequences (∼15.4 kb), entire cox1 (∼1.5 kb), partial cox1 (∼700 bp) and partial cytb (∼600 bp) sequences. On the one hand, phylogenies from entire mitochondrial coding-region sequences (∼15.4 kb) were much more informative than phylogenies from entire cox1 sequences (∼1.5 kb) and partial gene sequences (∼600 to ∼700 bp). For example, 19 branches had > 95% bootstrap support in our maximum likelihood tree from the entire mitochondrial coding-regions (∼15.4 kb) whereas the tree from 700 bp cox1 had only two branches with bootstrap support > 95%. Yet, by contrast, partial cytb (∼600 bp) and partial cox1 (∼486 bp) sequences were sufficient to genotype lice to Clade A, B or C. The sequences of the mitochondrial genomes of the P. humanus, P. capitis and P. schaeffi Fahrenholz studied are in NCBI GenBank under the accession numbers KC660761-800, KC685631-6330, KC241882-97, EU219988-95, HM241895-8 and JX080388-407. © 2014 The Royal Entomological Society.
Isolation and characterization of Scophthalmus maximus rhabdovirus.
Zhang, Qi-Ya; Tao, Jian-Jun; Gui, Lang; Zhou, Guang-Zhou; Ruan, Hong-Mei; Li, Zhen-Qiu; Gui, Jian-Fang
2007-02-28
A rhabdovirus associated with a lethal hemorrhagic disease in cultured turbot Scophthalmus maximus Linnaeus was isolated. The virus induced typical cytopathogenic effects (CPE) in 9 of 15 fish cell lines examined and was then propagated and isolated from infected carp leucocyte cells (CLC). Electron microscopy observations revealed that the negatively stained virions had a typical bullet-shaped morphology with one rounded end and one flat base end. The bullet-shaped morphology was more obvious and clear in ultrathin sections of infected cells. Experimental infections also indicated that the S. maximus rhabdovirus (SMRV) was not only a viral pathogen for cultured turbot, but also had the ability to infect other fish species, such as freshwater grass carp. A partial nucleotide sequence of the SMRV polymerase gene was determined by RT-PCR using 2 pairs of degenerate primers designed according to the conserved sequences of rhabdovirus polymerase genes. Homology analysis, amino acid sequence alignment, and phylogenetic relationship analysis of the partial SMRV polymerase sequence indicated that SMRV was genetically distinct from other rhabdoviruses. Sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) analysis of the purified SMRV revealed 5 major structural proteins, and their molecular masses were estimated to be about 250, 58, 47, 42, and 28 kDa. Significant serological reactivity differences were also observed between SMRV and its nearest neighbor, spring viremia of carp virus (SVCV). The data suggest that SMRV is likely a novel fish rhabdovirus, although it is closely related to rhabdoviruses in the genus Vesiculovirus.
Pagès, Sandrine; Bélaïch, Anne; Fierobe, Henri-Pierre; Tardif, Chantal; Gaudin, Christian; Bélaïch, Jean-Pierre
1999-01-01
The gene encoding the scaffolding protein of the cellulosome from Clostridium cellulolyticum, whose partial sequence was published earlier (S. Pagès, A. Bélaïch, C. Tardif, C. Reverbel-Leroy, C. Gaudin, and J.-P. Bélaïch, J. Bacteriol. 178:2279–2286, 1996; C. Reverbel-Leroy, A. Bélaïch, A. Bernadac, C. Gaudin, J. P. Bélaïch, and C. Tardif, Microbiology 142:1013–1023, 1996), was completely sequenced. The corresponding protein, CipC, is composed of a cellulose binding domain at the N terminus followed by one hydrophilic domain (HD1), seven highly homologous cohesin domains (cohesin domains 1 to 7), a second hydrophilic domain, and a final cohesin domain (cohesin domain 8) which is only 57 to 60% identical to the seven other cohesin domains. In addition, a second gene located 8.89 kb downstream of cipC was found to encode a three-domain protein, called ORFXp, which includes a cohesin domain. By using antiserum raised against the latter, it was observed that ORFXp is associated with the membrane of C. cellulolyticum and is not detected in the cellulosome fraction. Western blot and BIAcore experiments indicate that cohesin domains 1 and 8 from CipC recognize the same dockerins and have similar affinity for CelA (Ka = 4.8 × 109 M−1) whereas the cohesin from ORFXp, although it is also able to bind all cellulosome components containing a dockerin, has a 19-fold lower Ka for CelA (2.6 × 108 M−1). Taken together, these data suggest that ORFXp may play a role in cellulosome assembly. PMID:10074072
Pagès, S; Bélaïch, A; Fierobe, H P; Tardif, C; Gaudin, C; Bélaïch, J P
1999-03-01
The gene encoding the scaffolding protein of the cellulosome from Clostridium cellulolyticum, whose partial sequence was published earlier (S. Pagès, A. Bélaïch, C. Tardif, C. Reverbel-Leroy, C. Gaudin, and J.-P. Bélaïch, J. Bacteriol. 178:2279-2286, 1996; C. Reverbel-Leroy, A. Bélaïch, A. Bernadac, C. Gaudin, J. P. Bélaïch, and C. Tardif, Microbiology 142:1013-1023, 1996), was completely sequenced. The corresponding protein, CipC, is composed of a cellulose binding domain at the N terminus followed by one hydrophilic domain (HD1), seven highly homologous cohesin domains (cohesin domains 1 to 7), a second hydrophilic domain, and a final cohesin domain (cohesin domain 8) which is only 57 to 60% identical to the seven other cohesin domains. In addition, a second gene located 8.89 kb downstream of cipC was found to encode a three-domain protein, called ORFXp, which includes a cohesin domain. By using antiserum raised against the latter, it was observed that ORFXp is associated with the membrane of C. cellulolyticum and is not detected in the cellulosome fraction. Western blot and BIAcore experiments indicate that cohesin domains 1 and 8 from CipC recognize the same dockerins and have similar affinity for CelA (Ka = 4.8 x 10(9) M-1) whereas the cohesin from ORFXp, although it is also able to bind all cellulosome components containing a dockerin, has a 19-fold lower Ka for CelA (2.6 x 10(8) M-1). Taken together, these data suggest that ORFXp may play a role in cellulosome assembly.
Biddle, Jennifer F.; Siebert, Jason R.; Staunton, Eric; Hegg, Eric L.; Matthysse, Ann G.; Teske, Andreas
2013-01-01
Orange, white, and yellow vacuolated Beggiatoaceae filaments are visually dominant members of microbial mats found near sea floor hydrothermal vents and cold seeps, with orange filaments typically concentrated toward the mat centers. No marine vacuolate Beggiatoaceae are yet in pure culture, but evidence to date suggests they are nitrate-reducing, sulfide-oxidizing bacteria. The nearly complete genome sequence of a single orange Beggiatoa (“Candidatus Maribeggiatoa”) filament from a microbial mat sample collected in 2008 at a hydrothermal site in Guaymas Basin (Gulf of California, Mexico) was recently obtained. From this sequence, the gene encoding an abundant soluble orange-pigmented protein in Guaymas Basin mat samples (collected in 2009) was identified by microcapillary reverse-phase high-performance liquid chromatography (HPLC) nano-electrospray tandem mass spectrometry (μLC–MS-MS) of a pigmented band excised from a denaturing polyacrylamide gel. The predicted protein sequence is related to a large group of octaheme cytochromes whose few characterized representatives are hydroxylamine or hydrazine oxidases. The protein was partially purified and shown by in vitro assays to have hydroxylamine oxidase, hydrazine oxidase, and nitrite reductase activities. From what is known of Beggiatoaceae physiology, nitrite reduction is the most likely in vivo role of the octaheme protein, but future experiments are required to confirm this tentative conclusion. Thus, while present-day genomic and proteomic techniques have allowed precise identification of an abundant mat protein, and its potential activities could be assayed, proof of its physiological role remains elusive in the absence of a pure culture that can be genetically manipulated. PMID:23220958
Cui, Hongguang
2016-01-01
ABSTRACT The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. IMPORTANCE Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature proteins, with the majority of them having been at least partially functionally characterized. However, the functional role of a small protein named 6K1 remains obscure. In this study, we showed that deletion of 6K1 or a short motif/region of 6K1 in the full-length cDNA clones of plum pox virus abolishes viral replication and that mutation of the N- or C-terminal cleavage sites of 6K1 to prevent its release from the polyprotein greatly attenuates or completely inhibits viral replication, suggesting its important role in potyviral infection. We report that 6K1 forms punctate structures and targets the replication vesicles in PPV-infected plant leaf cells at the early infection stage. Our data reveal that 6K1 is an important viral protein of the potyviral replication complex. PMID:26962227
Zou, Zhengting; Zhang, Jianzhi
2017-07-01
Several authors reported lower frequencies of protein sequence convergence between more distantly related evolutionary lineages and attributed this trend to epistasis, which renders the acceptable amino acids at a site more different and convergence less likely in more divergent lineages. A recent primate study, however, suggested that this trend is at least partially and potentially entirely an artifact of gene tree discordance (GTD). Here, we demonstrate in a genome-wide data set from 17 mammals that the temporal trend remains (1) upon the control of the GTD level, (2) in genes whose genealogies are concordant with the species tree, and (3) for convergent changes, which are extremely unlikely to be caused by GTD. Similar results are observed in a comparable data set of 12 fruit flies in some but not all of these tests. We conclude that, at least in some cases, the temporal decline of convergence is genuine, reflecting an impact of epistasis on protein evolution. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Reyes Jara, Andrea Milagros; Liggieri, Constanza Silvina; Bruno, Mariela Anahí
2018-10-30
A partially purified proteolytic extract prepared from Maclura pomifera latex was employed in hydrolyzing a soybean-protein isolate (4.2 mg/mL). The hydrolysis-product formation, monitored by tricine-sodium-dodecyl-sulfate-polyacrylamyde-gel electrophoresis and reverse-phase high-performance liquid chromatography, indicated that after 10 min of reaction the main soybean proteins disappeared. The maximum degree of hydrolysis was 36.2% after a 180-min digestion. The 90-min hydrolysate presented an IC 50 of 31.6 ± 0.2 µg/mL, and a trolox equivalent antioxidant capacity of 157.6 and 176.9 µmoles TE per g of peptide determined by two different methods. Analysis by matrix-assisted-laser-desorption-ionization-time-of-flight mass spectrometry (MALDI-TOF MS), followed by the application of bioinformatics tools, enabled the deduction of fourteen theoretical peptide sequences containing antioxidant amino acids at >60%, none of which sequences had been previously reported as antioxidants. Finally, we consider that this 90-min hydrolysate would constitute a promising ingredient in the manufacture of functional foods. Copyright © 2018 Elsevier Ltd. All rights reserved.
Xie, Qian; Cao, Yujuan; Su, Juan; Wu, Jie; Wu, Xianbo; Wan, Chengsong; He, Mingliang; Ke, Changwen; Zhang, Bao; Zhao, Wei
2017-08-01
Significant sequence variation of Middle East respiratory syndrome coronavirus (MERS CoV) has never been detected since it was first reported in 2012. A MERS patient came from Korea to China in late May 2015. The patient was 44 years old and had symptoms including high fever, dry cough with a little phlegm, and shortness of breath, which are roughly consistent with those associated with MERS, and had had close contact with individuals with confirmed cases of MERS.After one month of therapy with antiviral, anti-infection, and immune-enhancing agents, the patient recovered in the hospital and was discharged. A nasopharyngeal swab sample was collected for direct sequencing, which revealed two deletion variants of MERS CoV. Deletions of 414 and 419 nt occurred between ORF5 and the E protein, resulting in a partial protein fusion or truncation of ORF5 and the E protein. Functional analysis by bioinformatics and comparison to previous studies implied that the two variants might be defective in their ability to package MERS CoV. However, the mechanism of how these deletions occurred and what effects they have need to be further investigated.
Prolonged fasting increases purine recycling in post-weaned northern elephant seals.
Soñanez-Organis, José Guadalupe; Vázquez-Medina, José Pablo; Zenteno-Savín, Tania; Aguilar, Andres; Crocker, Daniel E; Ortiz, Rudy M
2012-05-01
Northern elephant seals are naturally adapted to prolonged periods (1-2 months) of absolute food and water deprivation (fasting). In terrestrial mammals, food deprivation stimulates ATP degradation and decreases ATP synthesis, resulting in the accumulation of purines (ATP degradation byproducts). Hypoxanthine-guanine phosphoribosyl transferase (HGPRT) salvages ATP by recycling the purine degradation products derived from xanthine oxidase (XO) metabolism, which also promotes oxidant production. The contributions of HGPRT to purine recycling during prolonged food deprivation in marine mammals are not well defined. In the present study we cloned and characterized the complete and partial cDNA sequences that encode for HGPRT and xanthine oxidoreductase (XOR) in northern elephant seals. We also measured XO protein expression and circulating activity, along with xanthine and hypoxanthine plasma content in fasting northern elephant seal pups. Blood, adipose and muscle tissue samples were collected from animals after 1, 3, 5 and 7 weeks of their natural post-weaning fast. The complete HGPRT and partial XOR cDNA sequences are 771 and 345 bp long and encode proteins of 218 and 115 amino acids, respectively, with conserved domains important for their function and regulation. XOR mRNA and XO protein expression increased 3-fold and 1.7-fold with fasting, respectively, whereas HGPRT mRNA (4-fold) and protein (2-fold) expression increased after 7 weeks in adipose tissue and muscle. Plasma xanthine (3-fold) and hypoxanthine (2.5-fold) levels, and XO (1.7- to 20-fold) and HGPRT (1.5- to 1.7-fold) activities increased during the last 2 weeks of fasting. Results suggest that prolonged fasting in elephant seal pups is associated with increased capacity to recycle purines, which may contribute to ameliorating oxidant production and enhancing the supply of ATP, both of which would be beneficial during prolonged food deprivation and appear to be adaptive in this species.
Prolonged fasting increases purine recycling in post-weaned northern elephant seals
Soñanez-Organis, José Guadalupe; Vázquez-Medina, José Pablo; Zenteno-Savín, Tania; Aguilar, Andres; Crocker, Daniel E.; Ortiz, Rudy M.
2012-01-01
SUMMARY Northern elephant seals are naturally adapted to prolonged periods (1–2 months) of absolute food and water deprivation (fasting). In terrestrial mammals, food deprivation stimulates ATP degradation and decreases ATP synthesis, resulting in the accumulation of purines (ATP degradation byproducts). Hypoxanthine-guanine phosphoribosyl transferase (HGPRT) salvages ATP by recycling the purine degradation products derived from xanthine oxidase (XO) metabolism, which also promotes oxidant production. The contributions of HGPRT to purine recycling during prolonged food deprivation in marine mammals are not well defined. In the present study we cloned and characterized the complete and partial cDNA sequences that encode for HGPRT and xanthine oxidoreductase (XOR) in northern elephant seals. We also measured XO protein expression and circulating activity, along with xanthine and hypoxanthine plasma content in fasting northern elephant seal pups. Blood, adipose and muscle tissue samples were collected from animals after 1, 3, 5 and 7 weeks of their natural post-weaning fast. The complete HGPRT and partial XOR cDNA sequences are 771 and 345 bp long and encode proteins of 218 and 115 amino acids, respectively, with conserved domains important for their function and regulation. XOR mRNA and XO protein expression increased 3-fold and 1.7-fold with fasting, respectively, whereas HGPRT mRNA (4-fold) and protein (2-fold) expression increased after 7 weeks in adipose tissue and muscle. Plasma xanthine (3-fold) and hypoxanthine (2.5-fold) levels, and XO (1.7- to 20-fold) and HGPRT (1.5- to 1.7-fold) activities increased during the last 2 weeks of fasting. Results suggest that prolonged fasting in elephant seal pups is associated with increased capacity to recycle purines, which may contribute to ameliorating oxidant production and enhancing the supply of ATP, both of which would be beneficial during prolonged food deprivation and appear to be adaptive in this species. PMID:22496280
Gene Discovery through Genomic Sequencing of Brucella abortus
Sánchez, Daniel O.; Zandomeni, Ruben O.; Cravero, Silvio; Verdún, Ramiro E.; Pierrou, Ester; Faccio, Paula; Diaz, Gabriela; Lanzavecchia, Silvia; Agüero, Fernán; Frasch, Alberto C. C.; Andersson, Siv G. E.; Rossetti, Osvaldo L.; Grau, Oscar; Ugalde, Rodolfo A.
2001-01-01
Brucella abortus is the etiological agent of brucellosis, a disease that affects bovines and human. We generated DNA random sequences from the genome of B. abortus strain 2308 in order to characterize molecular targets that might be useful for developing immunological or chemotherapeutic strategies against this pathogen. The partial sequencing of 1,899 clones allowed the identification of 1,199 genomic sequence surveys (GSSs) with high homology (BLAST expect value < 10−5) to sequences deposited in the GenBank databases. Among them, 925 represent putative novel genes for the Brucella genus. Out of 925 nonredundant GSSs, 470 were classified in 15 categories based on cellular function. Seven hundred GSSs showed no significant database matches and remain available for further studies in order to identify their function. A high number of GSSs with homology to Agrobacterium tumefaciens and Rhizobium meliloti proteins were observed, thus confirming their close phylogenetic relationship. Among them, several GSSs showed high similarity with genes related to nodule nitrogen fixation, synthesis of nod factors, nodulation protein symbiotic plasmid, and nodule bacteroid differentiation. We have also identified several B. abortus homologs of virulence and pathogenesis genes from other pathogens, including a homolog to both the Shda gene from Salmonella enterica serovar Typhimurium and the AidA-1 gene from Escherichia coli. Other GSSs displayed significant homologies to genes encoding components of the type III and type IV secretion machineries, suggesting that Brucella might also have an active type III secretion machinery. PMID:11159979
Muries, Beatriz; Faize, Mohamed; Carvajal, Micaela; Martínez-Ballesta, María Del Carmen
2011-04-01
Plant aquaporins belong to a large superfamily of conserved proteins called the major intrinsic proteins (MIPs). There is limited information about the diversity of MIPs and their water transport capacity in broccoli (Brassica oleracea) plants. In this study, the cDNAs of isoforms of Plasma Membrane Intrinsic Proteins (PIPs), a class of aquaporins, from broccoli roots have been partially sequenced. Thus, sequencing experiments led to the identification of eight PIP1 and three PIP2 genes encoding PIPs in B. oleracea plants. The occurrence of different gene products encoding PIPs suggests that they may play different roles in plants. The screening of their expression as well as the expression of two specific PIP2 isoforms (BoPIP2;2 and BoPIP2;3), in different organs and under different salt-stress conditions in two varieties, has helped to unravel the function and the regulation of PIPs in plants. Thus, a high degree of BoPIP2;3 expression in mature leaves suggests that this BoPIP2;3 isoform plays important roles, not only in root water relations but also in the physiology and development of leaves. In addition, differences between gene and protein patterns led us to consider that mRNA synthesis is inhibited by the accumulation of the corresponding encoded protein. Therefore, transcript levels, protein abundance determination and the integrated hydraulic architecture of the roots must be considered in order to interpret the response of broccoli to salinity.
Liu, Feizhou; Bauer, Christopher C.; Ortiz, Irving; Cook, Richard G.; Schmid, Michael F.; Epstein, Henry F.
1998-01-01
Muscle thick filaments are stable assemblies of myosin and associated proteins whose dimensions are precisely regulated. The mechanisms underlying the stability and regulation of the assembly are not understood. As an approach to these problems, we have studied the core proteins that, together with paramyosin, form the core structure of the thick filament backbone in the nematode Caenorhabditis elegans. We obtained partial peptide sequences from one of the core proteins, β-filagenin, and then identified a gene that encodes a novel protein of 201–amino acid residues from databases using these sequences. β-Filagenin has a calculated isoelectric point at 10.61 and a high percentage of aromatic amino acids. Secondary structure algorithms predict that it consists of four β-strands but no α-helices. Western blotting using an affinity-purified antibody showed that β-filagenin was associated with the cores. β-Filagenin was localized by immunofluorescence microscopy to the A bands of body–wall muscles, but not the pharynx. β-filagenin assembled with the myosin homologue paramyosin into the tubular cores of wild-type nematodes at a periodicity matching the 72-nm repeats of paramyosin, as revealed by immunoelectron microscopy. In CB1214 mutants where paramyosin is absent, β-filagenin assembled with myosin to form abnormal tubular filaments with a periodicity identical to wild type. These results verify that β-filagenin is a core protein that coassembles with either myosin or paramyosin in C. elegans to form tubular filaments. PMID:9442110
FBI-1 functions as a novel AR co-repressor in prostate cancer cells.
Cui, Jiajun; Yang, Yutao; Zhang, Chuanfu; Hu, Pinliang; Kan, Wei; Bai, Xianhong; Liu, Xuelin; Song, Hongbin
2011-03-01
The pro-oncogene FBI-1, encoded by Zbtb7a, is a transcriptional repressor that belongs to the POK (POZ/BTB and Krüppel) protein family. In this study, we investigated a potential interaction between androgen receptor (AR) signaling and FBI-1 and demonstrated that overexpression of FBI-1 inhibited ligand-dependent AR activation. A protein-protein interaction was identified between FBI-1 and AR in a ligand-dependent manner. Furthermore, FBI-1, AR and SMRT formed a ternary complex and FBI-1 enhanced the recruitment of NCoR and SMRT to endogenous PSA upstream sequences. Our data also indicated that the FBI-1-mediated inhibition of AR transcriptional activity is partially dependent on HDAC. Interestingly, FBI-1 plays distinct roles in regulating LNCaP (androgen-dependent) and PC-3 cell (androgen-independent) proliferation.
Structure and biochemical functions of four simian virus 40 truncated large-T antigens.
Chaudry, F; Harvey, R; Smith, A E
1982-01-01
The structure of four abnormal T antigens which are present in different simian virus 40 (SV40)-transformed mouse cell lines was studied by tryptic peptide mapping, partial proteolysis fingerprinting, immunoprecipitation with monoclonal antibodies, and in vitro translation. The results obtained allowed us to deduce that these proteins, which have apparent molecular weights of 15,000, 22,000, 33,000 and 45,000, are truncated forms of large-T antigen extending to different amounts into the amino acid sequences unique to large-T. The proteins are all phosphorylated, probably at a site between amino acids 106 and 123. The mRNAs coding for the proteins probably contain the normal large-T splice but are shorter than the normal transcripts of the SV40 early region. The truncated large-Ts were tested for the ability to bind to double-stranded DNA-cellulose. This showed that the 33,000- and 45,000-molecular-weight polypeptides contained sequences sufficient for binding under the conditions used, whereas the 15,000- and 22,000-molecular-weight forms did not. Together with published data, this allows the tentative mapping of a region of SV40 large-T between amino acids 109 and 272 that is necessary and may be sufficient for the binding to double-stranded DNA-cellulose in vitro. None of the truncated large-T species formed a stable complex with the host cell protein referred to as nonviral T-antigen or p53, suggesting that the carboxy-terminal sequences of large-T are necessary for complex formation. Images PMID:6292504
Branger, Christine G; Fetherston, Jacqueline D; Perry, Robert D; Curtiss, Roy
2007-01-01
The use of live recombinant Salmonella attenuated vaccine (RASV) encoding Yersinia proteins is a promising new approach for the vaccination against Yersinia pestis. We have tested the efficacy of 2 proteins, Psn and a portion of LcrV in protecting mice against virulent Yersinia pestis challenge. To remove the immunosuppressive properties of LcrV protein, the lcrV gene, without the TLR2 receptor sequence, was cloned into a beta-lactamase secretion vector. Immunizations were performed with RSAV expressing LcrV or Psn. Challenge with a virulent Y. pestis strain was performed 4 weeks after the last immunization. Our results show that the truncated LcrV protein delivered by RASV is sufficient to afford a full protective immune response in a mouse model of bubonic plague and the Psn protein afforded partial protection in a non-optimized system. This finding should facilitate the design and development of a new generation of vaccines against Y. pestis.
NASA Technical Reports Server (NTRS)
Nematollahi, W. P.; Roux, S. J.
1999-01-01
Plants have a variety of glycosidic conjugates of hormones, defense compounds, and other molecules that are hydrolyzed by beta-glucosidases (beta-D-glucoside glucohydrolases, E.C. 3.2.1.21). Workers have reported several beta-glucosidases from maize (Zea mays L.; Poaceae), but have localized them mostly by indirect means. We have purified and partly characterized a 58-Ku beta-glucosidase from maize, which we conclude from a partial sequence analysis, from kinetic data, and from its localization is not identical to any of those already reported. A monoclonal antibody, mWP 19, binds this enzyme, and localizes it in the cell walls of maize coleoptiles. An earlier report showed that mWP19 inhibits peroxidase activity in crude cell wall extracts and can immunoprecipitate peroxidase activity from these extracts, yet purified preparations of the 58 Ku protein had little or no peroxidase activity. The level of sequence similarity between beta-glucosidases and peroxidases makes it unlikely that these enzymes share epitopes in common. Contrary to a previous conclusion, these results suggest that the enzyme recognized by mWP19 is not a peroxidase, but there is a wall peroxidase closely associated with the 58 Ku beta-glucosidase in crude preparations. Other workers also have co-purified distinct proteins with beta-glucosidases. We found no significant charge in the level of immunodetectable beta-glucosidase in mesocotyls or coleoptiles that precedes the red light-induced changes in the growth rate of these tissues.
Thakur, Jitendra; Sanyal, Kaustuv
2012-01-01
Unlike most eukaryotes, a kinetochore is fully assembled early in the cell cycle in budding yeasts Saccharomyces cerevisiae and Candida albicans. These kinetochores are clustered together throughout the cell cycle. Kinetochore assembly on point centromeres of S. cerevisiae is considered to be a step-wise process that initiates with binding of inner kinetochore proteins on specific centromere DNA sequence motifs. In contrast, kinetochore formation in C. albicans, that carries regional centromeres of 3–5 kb long, has been shown to be a sequence independent but an epigenetically regulated event. In this study, we investigated the process of kinetochore assembly/disassembly in C. albicans. Localization dependence of various kinetochore proteins studied by confocal microscopy and chromatin immunoprecipitation (ChIP) assays revealed that assembly of a kinetochore is a highly coordinated and interdependent event. Partial depletion of an essential kinetochore protein affects integrity of the kinetochore cluster. Further protein depletion results in complete collapse of the kinetochore architecture. In addition, GFP-tagged kinetochore proteins confirmed similar time-dependent disintegration upon gradual depletion of an outer kinetochore protein (Dam1). The loss of integrity of a kinetochore formed on centromeric chromatin was demonstrated by reduced binding of CENP-A and CENP-C at the centromeres. Most strikingly, Western blot analysis revealed that gradual depletion of any of these essential kinetochore proteins results in concomitant reduction in cellular protein levels of CENP-A. We further demonstrated that centromere bound CENP-A is protected from the proteosomal mediated degradation. Based on these results, we propose that a coordinated interdependent circuitry of several evolutionarily conserved essential kinetochore proteins ensures integrity of a kinetochore formed on the foundation of CENP-A containing centromeric chromatin. PMID:22536162
Hoffman, Brett; Li, Zhubing; Liu, Qiang
2015-08-01
Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
Protein purification and crystallization artifacts: The tale usually not told.
Niedzialkowska, Ewa; Gasiorowska, Olga; Handing, Katarzyna B; Majorek, Karolina A; Porebski, Przemyslaw J; Shabalin, Ivan G; Zasadzinska, Ewelina; Cymborowski, Marcin; Minor, Wladek
2016-03-01
The misidentification of a protein sample, or contamination of a sample with the wrong protein, may be a potential reason for the non-reproducibility of experiments. This problem may occur in the process of heterologous overexpression and purification of recombinant proteins, as well as purification of proteins from natural sources. If the contaminated or misidentified sample is used for crystallization, in many cases the problem may not be detected until structures are determined. In the case of functional studies, the problem may not be detected for years. Here several procedures that can be successfully used for the identification of crystallized protein contaminants, including: (i) a lattice parameter search against known structures, (ii) sequence or fold identification from partially built models, and (iii) molecular replacement with common contaminants as search templates have been presented. A list of common contaminant structures to be used as alternative search models was provided. These methods were used to identify four cases of purification and crystallization artifacts. This report provides troubleshooting pointers for researchers facing difficulties in phasing or model building. © 2016 The Protein Society.
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin.
Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga; Renkawitz, Rainer; Georgiev, Pavel
2015-01-01
Insulators are multiprotein-DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. © 2015 Maksimenko et al.; Published by Cold Spring Harbor Laboratory Press.
Johnson, Lucas B; Gintner, Lucas P; Park, Sehoo; Snow, Christopher D
2015-08-01
Accuracy of current computational protein design (CPD) methods is limited by inherent approximations in energy potentials and sampling. These limitations are often used to qualitatively explain design failures; however, relatively few studies provide specific examples or quantitative details that can be used to improve future CPD methods. Expanding the design method to include a library of sequences provides data that is well suited for discriminating between stabilizing and destabilizing design elements. Using thermophilic endoglucanase E1 from Acidothermus cellulolyticus as a model enzyme, we computationally designed a sequence with 60 mutations. The design sequence was rationally divided into structural blocks and recombined with the wild-type sequence. Resulting chimeras were assessed for activity and thermostability. Surprisingly, unlike previous chimera libraries, regression analysis based on one- and two-body effects was not sufficient for predicting chimera stability. Analysis of molecular dynamics simulations proved helpful in distinguishing stabilizing and destabilizing mutations. Reverting to the wild-type amino acid at destabilized sites partially regained design stability, and introducing predicted stabilizing mutations in wild-type E1 significantly enhanced thermostability. The ability to isolate stabilizing and destabilizing elements in computational design offers an opportunity to interpret previous design failures and improve future CPD methods. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Del Galdo, Sara; Amadei, Andrea
2016-10-12
In this paper we apply the computational analysis recently proposed by our group to characterize the solvation properties of a native protein in aqueous solution, and to four model aqueous solutions of globular proteins in their unfolded states thus characterizing the protein unfolded state hydration shell and quantitatively evaluating the protein unfolded state partial molar volumes. Moreover, by using both the native and unfolded protein partial molar volumes, we obtain the corresponding variations (unfolding partial molar volumes) to be compared with the available experimental estimates. We also reconstruct the temperature and pressure dependence of the unfolding partial molar volume of Myoglobin dissecting the structural and hydration effects involved in the process.
NASA Astrophysics Data System (ADS)
Basu, Sankar; Söderquist, Fredrik; Wallner, Björn
2017-05-01
The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents `Proteus', a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible `disorder-to-order' transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus.
Expanding the proteome: disordered and alternatively-folded proteins
Dyson, H. Jane
2011-01-01
Proteins provide much of the scaffolding for life, as well as undertaking a variety of essential catalytic reactions. These characteristic functions have led us to presuppose that proteins are in general functional only when well-structured and correctly folded. As we begin to explore the repertoire of possible protein sequences inherent in the human and other genomes, two stark facts that belie this supposition become clear: firstly, the number of apparent open reading frames in the human genome is significantly smaller than appears to be necessary to code for all of the diverse proteins in higher organisms, and secondly that a significant proportion of the protein sequences that would be coded by the genome would not be expected to form stable three-dimensional structures. Clearly the genome must include coding for a multitude of alternative forms of proteins, some of which may be partly or fully disordered or incompletely structured in their functional states. At the same time as this likelihood was recognized, experimental studies also began to uncover examples of important protein molecules and domains that were incompletely structured or completely disordered in solution, yet remained perfectly functional. In the ensuing years, we have seen an explosion of experimental and genome-annotation studies that have mapped the extent of the intrinsic disorder phenomenon and explored the possible biological rationales for its widespread occurrence. Answers to the question “why would a particular domain need to be unstructured?” are as varied as the systems where such domains are found. This review provides a survey of recent new directions in this field, and includes an evaluation of the role not only of intrinsically disordered proteins but of partially structured and highly dynamic members of the disorder-order continuum. PMID:21729349
Gluchowski, Nina L.; Chitraju, Chandramohan; Picoraro, Joseph A.; Mejhert, Niklas; Pinto, Shirly; Xin, Winnie; Kamin, Daniel S.; Winter, Harland S.; Chung, Wendy K.; Walther, Tobias C.; Farese, Robert V.
2017-01-01
Acyl-CoA:diacylglycerol acyltransferase (DGAT)1 and DGAT2 catalyze triglyceride (TG) biosynthesis in humans. Biallelic loss-of-function mutations in human DGAT1 result in severe congenital diarrhea and protein-losing enteropathy. Additionally, pharmacologic inhibition of DGAT1 led to dose-related diarrhea in human clinical trials. Here we identify a previously unknown DGAT1 mutation in identical twins of South Asian descent. These male patients developed watery diarrhea shortly after birth, with protein-losing enteropathy and failure to thrive. Exome sequencing revealed a homozygous recessive mutation in DGAT1, c.314T>C, p.L105P. We show here that the p.L105P DGAT1 enzyme produced from the mutant allele is less abundant, resulting in partial loss of TG synthesis activity and decreased formation of lipid droplets in patient-derived primary dermal fibroblasts. Thus, in contrast with complete loss-of-function alleles of DGAT1, the p.L105P missense allele partially reduces TG synthesis activity and causes a less severe clinical phenotype. Our findings add to the growing recognition of DGAT1 deficiency as a cause of congenital diarrhea with protein-losing enteropathy and indicate that DGAT1 mutations result in a spectrum of diseases. PMID:28373485
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephens, R.L.; Haygood, M.G.; Lidstrom, M.E.
An open-reading-frame fragment of a Methylobacterium sp. strain AM1 gene (moxF) encoding a portion of the methanol dehydrogenase structural protein has been used as a hybridization probe to detect similar sequences in a variety of methylotrophic bacteria. This hybridization was used to isolate clones containing putative moxF genes from two obligate methanotrophic bacteria, Methylococcus capsulatus Bath and Methylomonas albus BG8. The identity of these genes was confirmed in two ways. A T7 expression vector was used to produce methanol dehydrogenase protein in Escherichia coli from the cloned genes,a and in each case the protein was identified by immunoblotting with antiserummore » against the Methylomonas albus methanol dehydrogenase. In addition, a moxF mutant of Methylobacterium strain AM1 was complemented to a methanol-positive phenotype that partially restored methanol dehydrogenase activity, using broad-host-range plasmids containing the moxF genes from each methanotroph. The partial complementation of a moxF mutant in a facultative serine pathway methanol utilizer by moxF genes from type I and type X obligate methane utilizers suggests broad functional conservation of the methanol oxidation system among gram-negative methylotrophs.« less
Huang, Yu-Feng; Midha, Mohit; Chen, Tzu-Han; Wang, Yu-Tai; Smith, David Glenn; Pei, Kurtis Jai-Chyi; Chiu, Kuo Ping
2015-01-01
The Taiwanese (Formosan) macaque (Macaca cyclopis) is the only nonhuman primate endemic to Taiwan. This primate species is valuable for evolutionary studies and as subjects in medical research. However, only partial fragments of the mitochondrial genome (mitogenome) of this primate species have been sequenced, not mentioning its nuclear genome. We employed next-generation sequencing to generate 2 x 90 bp paired-end reads, followed by reference-assisted de novo assembly with multiple k-mer strategy to characterize the M. cyclopis mitogenome. We compared the assembled mitogenome with that of other macaque species for phylogenetic analysis. Our results show that, the M. cyclopis mitogenome consists of 16,563 nucleotides encoding for 13 protein-coding genes, 2 ribosomal RNAs and 22 transfer RNAs. Phylogenetic analysis indicates that M. cyclopis is most closely related to M. mulatta lasiota (Chinese rhesus macaque), supporting the notion of Asia-continental origin of M. cyclopis proposed in previous studies based on partial mitochondrial sequences. Our work presents a novel approach for assembling a mitogenome that utilizes the capabilities of de novo genome assembly with assistance of a reference genome. The availability of the complete Taiwanese macaque mitogenome will facilitate the study of primate evolution and the characterization of genetic variations for the potential usage of this species as a non-human primate model for medical research.
The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding.
Shirasawa, Kenta; Isuzugawa, Kanji; Ikenaga, Mitsunobu; Saito, Yutaro; Yamamoto, Toshiya; Hirakawa, Hideki; Isobe, Sachiko
2017-10-01
We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site-associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)). © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Rea, Anita M; Simpson, Emma R; Meldrum, Jill K; Williams, Huw E L; Searle, Mark S
2008-12-02
The fast folding of small proteins is likely to be the product of evolutionary pressures that balance the search for native-like contacts in the transition state with the minimum number of stable non-native interactions that could lead to partially folded states prone to aggregation and amyloid formation. We have investigated the effects of non-native interactions on the folding landscape of yeast ubiquitin by introducing aromatic substitutions into the beta-turn region of the N-terminal beta-hairpin, using both the native G-bulged type I turn sequence (TXTGK) as well as an engineered 2:2 XNGK type I' turn sequence. The N-terminal beta-hairpin is a recognized folding nucleation site in ubiquitin. The folding kinetics for wt-Ub (TLTGK) and the type I' turn mutant (TNGK) reveal only a weakly populated intermediate, however, substitution with X = Phe or Trp in either context results in a high propensity to form a stable compact intermediate where the initial U-->I collapse is visible as a distinct kinetic phase. The introduction of Trp into either of the two host turn sequences results in either complex multiphase kinetics with the possibility of parallel folding pathways, or formation of a highly compact I-state stabilized by non-native interactions that must unfold before refolding. Sequence substitutions with aromatic residues within a localized beta-turn capable of forming non-native hydrophobic contacts in both the native state and partially folded states has the undesirable consequence that folding is frustrated by the formation of stable compact intermediates that evolutionary pressures at the sequence level may have largely eliminated.
Dupé, Aurélien; Dumas, Carole; Papadopoulou, Barbara
2015-01-01
Alba-domain proteins are RNA-binding proteins found in archaea and eukaryotes and recently studied in protozoan parasites where they play a role in the regulation of virulence factors and stage-specific proteins. This work describes in silico structural characterization, cellular localization and biochemical analyses of Alba-domain proteins in Leishmania infantum. We show that in contrast to other protozoa, Leishmania have two Alba-domain proteins, LiAlba1 and LiAlba3, representative of the Rpp20- and the Rpp25-like eukaryotic subfamilies, respectively, which share several sequence and structural similarities but also important differences with orthologs in other protozoa, especially in sequences targeted for post-translational modifications. LiAlba1 and LiAlba3 proteins form a complex interacting with other RNA-binding proteins, ribosomal subunits, and translation factors as supported by co-immunoprecipitation and sucrose gradient sedimentation analysis. A higher co-sedimentation of Alba proteins with ribosomal subunits was seen upon conditions of decreased translation, suggesting a role of these proteins in translational repression. The Leishmania Alba-domain proteins display differential cellular localization throughout the parasite development. In the insect promastigote stage, Alba proteins co-localize predominantly to the cytoplasm but they translocate to the nucleolus and the flagellum upon amastigote differentiation in the mammalian host and are found back to the cytoplasm once amastigote differentiation is completed. Heat-shock, a major signal of amastigote differentiation, triggers Alba translocation to the nucleolus and the flagellum. Purification of the Leishmania flagellum confirmed LiAlba3 enrichment in this organelle during amastigote differentiation. Moreover, partial characterization of the Leishmania flagellum proteome of promastigotes and differentiating amastigotes revealed the presence of other RNA-binding proteins, as well as differences in the flagellum composition between these two parasite lifestages. Shuttling of Alba-domain proteins between the cytoplasm and the nucleolus or the flagellum throughout the parasite life cycle suggests that these RNA-binding proteins participate in several distinct regulatory pathways controlling developmental gene expression in Leishmania.
Dupé, Aurélien; Dumas, Carole; Papadopoulou, Barbara
2015-01-01
Alba-domain proteins are RNA-binding proteins found in archaea and eukaryotes and recently studied in protozoan parasites where they play a role in the regulation of virulence factors and stage-specific proteins. This work describes in silico structural characterization, cellular localization and biochemical analyses of Alba-domain proteins in Leishmania infantum. We show that in contrast to other protozoa, Leishmania have two Alba-domain proteins, LiAlba1 and LiAlba3, representative of the Rpp20- and the Rpp25-like eukaryotic subfamilies, respectively, which share several sequence and structural similarities but also important differences with orthologs in other protozoa, especially in sequences targeted for post-translational modifications. LiAlba1 and LiAlba3 proteins form a complex interacting with other RNA-binding proteins, ribosomal subunits, and translation factors as supported by co-immunoprecipitation and sucrose gradient sedimentation analysis. A higher co-sedimentation of Alba proteins with ribosomal subunits was seen upon conditions of decreased translation, suggesting a role of these proteins in translational repression. The Leishmania Alba-domain proteins display differential cellular localization throughout the parasite development. In the insect promastigote stage, Alba proteins co-localize predominantly to the cytoplasm but they translocate to the nucleolus and the flagellum upon amastigote differentiation in the mammalian host and are found back to the cytoplasm once amastigote differentiation is completed. Heat-shock, a major signal of amastigote differentiation, triggers Alba translocation to the nucleolus and the flagellum. Purification of the Leishmania flagellum confirmed LiAlba3 enrichment in this organelle during amastigote differentiation. Moreover, partial characterization of the Leishmania flagellum proteome of promastigotes and differentiating amastigotes revealed the presence of other RNA-binding proteins, as well as differences in the flagellum composition between these two parasite lifestages. Shuttling of Alba-domain proteins between the cytoplasm and the nucleolus or the flagellum throughout the parasite life cycle suggests that these RNA-binding proteins participate in several distinct regulatory pathways controlling developmental gene expression in Leishmania. PMID:26334886
Speck, Stephanie; Kern, Tanja; Aistleitner, Karin; Dilcher, Meik; Dobler, Gerhard; Essbauer, Sandra
2018-02-01
Rickettsia (R.) helvetica is the most prevalent rickettsia found in Ixodes ricinus ticks in Germany. Several studies reported antibodies against R. helvetica up to 12.5% in humans investigated, however, fulminant clinical cases are rare indicating a rather low pathogenicity compared to other rickettsiae. We investigated growth characteristics of R. helvetica isolate AS819 in two different eukaryotic cell lines with focus on ultra-structural changes of host cells during infection determined by confocal laser scanning microscopy. Further investigations included partially sequencing of rickA, sca4 and sca2 genes, which have been reported to encode proteins involved in cell-to-cell spread and virulence in some rickettsiae. R. helvetica grew constantly but slowly in both cell lines used. Confocal laser scanning microscopy revealed that the dissemination of R. helvetica AS819 in both cell lines was rather mediated by cell break-down and bacterial release than cell-to-cell spread. The cytoskeleton of both investigated eukaryotic cell lines was not altered. R. helvetica possesses rickA, but its expression is not sufficient to promote actin-based motility as demonstrated by confocal laser scanning microscopy. Hypothetical Sca2 and Sca4 proteins were deduced from nucleotide gene sequences but the predicted amino acid sequences were disrupted or truncated compared to other rickettsiae most likely resulting in non-functional proteins. Taken together, these results might give a first hint to the underlying causes of the reduced virulence and pathogenicity of R. helvetica.
Ho, Michelle L; Adler, Benjamin A; Torre, Michael L; Silberg, Jonathan J; Suh, Junghae
2013-12-20
Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions.
Slocinska, Malgorzata; Antos-Krzeminska, Nina; Rosinski, Grzegorz; Jarmuszkiewicz, Wieslawa
2012-08-01
Uncoupling protein 4 (UCP4) is a member of the UCP subfamily that mediates mitochondrial uncoupling, and sequence alignment predicts the existence of UCP4 in several insects. The present study demonstrates the first molecular identification of a partial Zophobas atratus UCP4-coding sequence and the functional characterisation of ZaUCP4 in the mitochondria of larval and pupal fat bodies of the beetle. ZaUCP4 shows a high similarity to predicted insect UCP4 isoforms and known mammalian UCP4s, both at the nucleotide and amino acid sequence levels. Bioenergetic studies clearly demonstrate UCP function in mitochondria from larval and pupal fat bodies. In non-phosphorylating mitochondria, ZaUCP activity was stimulated by palmitic acid and inhibited by the purine nucleotide GTP. In phosphorylating mitochondria, ZaUCP4 activity decreased the yield of oxidative phosphorylation. ZaUCP4 was immunodetected with antibodies raised against human UCP4 as a single 36-kDa band. A lower expression of ZaUCP4 at the level of mRNA and protein and a decreased ZaUCP4 activity were observed in the Z. atratus pupal fat body compared with the larval fat body. The different expression patterns and activity of ZaUCP4 during the larval-pupal transformation indicates an important physiological role for UCP4 in insect fat body development and function during insect metamorphosis. Copyright © 2012 Elsevier Inc. All rights reserved.
Ho, Michelle L.; Adler, Benjamin A.; Torre, Michael L.; Silberg, Jonathan J.; Suh, Junghae
2013-01-01
Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications, but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions. PMID:23899192
Positive selection in octopus haemocyanin indicates functional links to temperature adaptation.
Oellermann, Michael; Strugnell, Jan M; Lieb, Bernhard; Mark, Felix C
2015-07-05
Octopods have successfully colonised the world's oceans from the tropics to the poles. Yet, successful persistence in these habitats has required adaptations of their advanced physiological apparatus to compensate impaired oxygen supply. Their oxygen transporter haemocyanin plays a major role in cold tolerance and accordingly has undergone functional modifications to sustain oxygen release at sub-zero temperatures. However, it remains unknown how molecular properties evolved to explain the observed functional adaptations. We thus aimed to assess whether natural selection affected molecular and structural properties of haemocyanin that explains temperature adaptation in octopods. Analysis of 239 partial sequences of the haemocyanin functional units (FU) f and g of 28 octopod species of polar, temperate, subtropical and tropical origin revealed natural selection was acting primarily on charge properties of surface residues. Polar octopods contained haemocyanins with higher net surface charge due to decreased glutamic acid content and higher numbers of basic amino acids. Within the analysed partial sequences, positive selection was present at site 2545, positioned between the active copper binding centre and the FU g surface. At this site, methionine was the dominant amino acid in polar octopods and leucine was dominant in tropical octopods. Sites directly involved in oxygen binding or quaternary interactions were highly conserved within the analysed sequence. This study has provided the first insight into molecular and structural mechanisms that have enabled octopods to sustain oxygen supply from polar to tropical conditions. Our findings imply modulation of oxygen binding via charge-charge interaction at the protein surface, which stabilize quaternary interactions among functional units to reduce detrimental effects of high pH on venous oxygen release. Of the observed partial haemocyanin sequence, residue 2545 formed a close link between the FU g surface and the active centre, suggesting a role as allosteric binding site. The prevalence of methionine at this site in polar octopods, implies regulation of oxygen affinity via increased sensitivity to allosteric metal binding. High sequence conservation of sites directly involved in oxygen binding indicates that functional modifications of octopod haemocyanin rather occur via more subtle mechanisms, as observed in this study.
Prosdocimi, Francisco; Souto, Helena Magarinos; Ruschi, Piero Angeli; Furtado, Carolina; Jennings, W Bryan
2016-09-01
The genome of the versicoloured emerald hummingbird (Amazilia versicolor) was partially sequenced in one-sixth of an Illumina HiSeq lane. The mitochondrial genome was assembled using MIRA and MITObim software, yielding a circular molecule of 16,861 bp in length and deposited in GenBank under the accession number KF624601. The mitogenome contained 13 protein-coding genes, 22 transfer tRNAs, 2 ribosomal RNAs and 1 non-coding control region. The molecule was assembled using 21,927 sequencing reads of 100 bp each, resulting in ∼130 × coverage of uniformly distributed reads along the genome. This is the forth mitochondrial genome described for this highly diverse family of birds and may benefit further phylogenetic, phylogeographic, population genetic and species delimitation studies of hummingbirds.
Shibata, T; Akamine, T; Nikki, T; Yamashita, H; Nobukuni, K
2003-02-01
We examined thyroidectomized chickens in terms of plasma lipid concentration and protein expression within the liver. Although the body weight of thyroidectomized chickens was remarkably low due to growth retardation, the livers were enlarged and fatty compared to those of sham-operated chickens. An increase in phospholipid, triglyceride, and total cholesterol levels within the blood plasma of thyroidectomized chickens was observed, clearly reflecting increased lipid synthesis within the liver. Overexpression of some proteins, for example, 29- and 45-kDa proteins, was observed in thyroidectomized chicken livers by means of electrophoresis. A peptide map was made for the protein that exhibited the greatest degree of overexpression. One of them demonstrated a molecular mass of 45 kDa and an isoelectric point (pI) between 7.5 and 8.0, depending on its form. Partial N-terminal amino acid sequences were determined from three random peptides of this protein. The amino acid sequence of this protein showed a high degree of homology with the betaine-homocysteine S-methyltransferase (BHMT, EC 2.1.1.5) of some mammalian species. We identified this protein as chicken BHMT because, in addition to its sequence homology with mammalian BHMT, there were similarities were also observed between this 45-kDa protein and mammalian BHMT with respect to molecular mass and isoelectric behavior. In the liver, 10 d after thyroidectomy, the synthesis of hepatic BHMT had already been enhanced, and the high expression was maintained at 50 d of age. Generally, BHMT catalyzes the transfer of a methyl group from betaine to L-homocysteine. In addition, it seems that this enzyme is also closely related to lipid metabolism in the liver; in this study expression of BHMT in the liver corresponded to plasma lipid levels. Moreover, hypothyroidism may be directly or indirectly related to overexpression of BHMT. Due to similarities between the BHMT of chickens and mammalian species, the chicken model might provide a useful means by which to study BHMT, its role in lipid metabolism, and methods of targeting the expression of BHMT. Another 29-kDa protein was unidentified in the homology search.
HOXBES2: a novel epididymal HOXB2 homeoprotein and its domain-specific association with spermatozoa.
Prabagaran, E; Bandivdekar, A H; Dighe, V; Raghavan, V P
2007-02-01
The sperm from the testis acquires complete fertilizing ability and forward progressive motility following its transit through the epididymis. Acquisition of these characteristics results from the modification of the sperm proteome following interactions with epididymal secretions. In our attempts to identify epididymis-specific sperm plasma membrane proteins, a partial 2.83-kb clone was identified by immunoscreening a monkey epididymal cDNA library with an agglutinating monoclonal antibody raised against washed human spermatozoa. The sequence of the 2.83-kb clone exhibited homology to the region between 1 and 1097 bp of the homeobox gene, Hoxb2. This sequence was found to be species conserved, as revealed by RT-PCR analysis. To obtain a full-length clone of the sequence, 5' RACE-PCR (rapid amplification of cDNA ends PCR) was carried out using rat epididymal RNA as the template. It resulted in a full-length 1.657-kb cDNA encoding a 32.9-kDa putative protein. The protein designated HOXBES2 exhibited homology to the conserved 61-amino acid homeodomain region of the HOXB2 homeoprotein. However, characteristic differences were noted in its amino and carboxyl termini compared with HOXB2. A putative 30-kDa protein was detected in the tissue extracts from adult rat epididymis and caudal spermatozoa, and a 37-kDa protein was detected in the rat embryo when probed with a polyclonal antibody against HOXB2 protein. Multiple tissue Western blot and immunohistochemical analysis further indicated its expression in the cytoplasm of the principal and basal epithelial cells, with maximal expression in the distal epididymal segments. Northern blot analysis detected a single approximately 2.5-kb transcript from the adult epididymis. Indirect immunofluorescence localized the protein to the acrosome, midpiece, and equatorial segments of rat caudal and ejaculated human and monkey spermatozoa, respectively. In conclusion, we have identified and characterized a novel epididymal homeoprotein different from HOXB2 protein and hereafter referred to as HOXBES2, (HOXB2 homeodomain containing epididymis-specific sperm protein) with a probable role in fertilization.
Atago, Yuki; Shimodaira, Jun; Araki, Naoto; Bin Othman, Nor'azizi; Zakaria, Zuriati; Fukuda, Masao; Futami, Junichiro; Hara, Hirofumi
2016-05-01
Rhodococcus jostii RHA1 (RHA1) degrades polychlorinated biphenyl (PCB) via co-metabolism with biphenyl. To identify the novel open reading frames (ORFs) that contribute to PCB/biphenyl metabolism in RHA1, we compared chromatin immunoprecipitation chip and transcriptomic data. Six novel ORFs involved in PCB/biphenyl metabolism were identified. Gene deletion mutants of these 6 ORFs were made and were tested for their ability to grow on biphenyl. Interestingly, only the ro10225 deletion mutant showed deficient growth on biphenyl. Analysis of Ro10225 protein function showed that growth of the ro10225 deletion mutant on biphenyl was recovered when exogenous recombinant Ro10225 protein was added to the culture medium. Although Ro10225 protein has no putative secretion signal sequence, partially degraded Ro10225 protein was detected in conditioned medium from wild-type RHA1 grown on biphenyl. This Ro10225 fragment appeared to form a complex with another PCB/biphenyl oxidation enzyme. These results indicated that Ro10225 protein is essential for the formation of the PCB/biphenyl dioxygenase complex in RHA1.
Hamel, Louis-Philippe; Benchabane, Meriem; Nicole, Marie-Claude; Major, Ian T.; Morency, Marie-Josée; Pelletier, Gervais; Beaudoin, Nathalie; Sheen, Jen; Séguin, Armand
2011-01-01
Mitogen-activated protein kinases (MAPKs) contribute to the establishment of plant disease resistance by regulating downstream signaling components, including transcription factors. In this study, we identified MAPK-interacting proteins, and among the newly discovered candidates was a Cys-2/His-2-type zinc finger protein named PtiZFP1. This putative transcription factor belongs to a family of transcriptional repressors that rely on an ERF-associated amphiphilic repression (EAR) motif for their repression activity. Amino acids located within this repression motif were also found to be essential for MAPK binding. Close examination of the primary protein sequence revealed a functional bipartite MAPK docking site that partially overlaps with the EAR motif. Transient expression assays in Arabidopsis (Arabidopsis thaliana) protoplasts suggest that MAPKs promote PtiZFP1 degradation through the 26S proteasome. Since features of the MAPK docking site are conserved among other EAR repressors, our study suggests a novel mode of defense mechanism regulation involving stress-responsive MAPKs and EAR repressors. PMID:21873571
Characterisation of a collagen gene subfamily from the potato cyst nematode Globodera pallida.
Gray, L J; Curtis, R H; Jones, J T
2001-01-24
We have isolated two full-length genomic DNA sequences, which encode the cuticle collagen proteins GP-COL-1 and GP-COL-2, from the potato cyst nematode Globodera pallida. A third, partial collagen gene ORF termed gp-col-t(t=truncated) has also been isolated and appears to represent an unexpressed pseudogene. The gp-col-1 and gp-col-2 genes both contain three short (<97 bp) introns which disrupt coding regions predicted to specify proteins with molecular weights of 33 and 32.7 kDa respectively. All three sequences show high similarity to each other and to the previously isolated G. pallida cDNA clone gp-col-8. The conserved pattern of cysteine residues and non-(Gly-X-Y)(n) region sequence similarity observed in all four G. pallida genes suggests that these molecules form part of the same subfamily of collagens. Southern analysis indicates that this subfamily is likely to contain further members. The G. pallida collagen sequences show striking similarity to twelve genes from Caenorhabditis elegans which collectively represent the recently classified Group 1a collagen subfamily. No data exists on the function of this subfamily in C. elegans. gp-col-1 and gp-col-2 are developmentally regulated with transcripts of both genes detected in adult virgin and gravid females but not in pre-parasitic second stage juveniles. A similar expression pattern is observed for the Group 1a collagen lemmi 5 from Meloidogyne incognita perhaps indicating a generic link between subfamily and function during the various changes in cuticular structure which accompany nematode growth and reproduction. Immunochemical studies indicate that the GP-COL-1 protein is specifically located in the hypodermis of G. pallida adult females.
Unprecedented genomic diversity of AhR1 and AhR2 genes in Atlantic salmon (Salmo salar L.).
Hansson, Maria C; Wittzell, Håkan; Persson, Kerstin; von Schantz, Torbjörn
2004-06-24
Aryl hydrocarbon receptor (AhR) genes encode proteins involved in mediating the toxic responses induced by several environmental pollutants. Here, we describe the identification of the first two AhR1 (alpha and beta) genes and two additional AhR2 (alpha and beta) genes in the tetraploid species Atlantic salmon (Salmo salar L.) from a cosmid library screening. Cosmid clones containing genomic salmon AhR sequences were isolated using a cDNA clone containing the coding region of the Atlantic salmon AhR2gamma as a probe. Screening revealed 14 positive clones, from which four were chosen for further analyses. One of the cosmids contained genomic AhR sequences that were highly similar to the rainbow trout (Oncorhynchus mykiss) AhR2alpha and beta genes. SMART RACE amplified two complete, highly similar but not identical AhR type 2 sequences from salmon cDNA, which from phylogenetic analyses were determined as the rainbow trout AhR2alpha and beta orthologs. The salmon AhR2alpha and beta encode proteins of 1071 and 1058 residues, respectively, and encompass characteristic AhR sequence elements like a basic-helix-loop-helix (bHLH) and two PER-ARNT-SIM (PAS) domains. Both genes are transcribed in liver, spleen and muscle tissues of adult salmon. A second cosmid contained partial sequences, which were identical to the previously characterized AhR2gamma gene. The last two cosmids contained partial genomic AhR sequences, which were more similar to other AhR type 1 fish genes than the four characterized salmon AhR2 genes. However, attempts to amplify the corresponding complete cDNA sequences of the inserts proved very difficult, suggesting that these genes are non-functional or very weakly transcribed in the examined tissues. Phylogenetic analyses of the conserved regions did, however, clearly indicate that these two AhRs belong to the AhR type 1 clade and have been assigned as the Atlantic salmon AhR1alpha and AhR1beta genes. Taken together, these findings demonstrate that multiple AhR genes are present in Atlantic salmon genome, which likely is a consequence of previous genome duplications in the evolutionary past of salmonids. Plausible explanations for the high incidence of AhR genes in fish and more specifically in salmonids, like rapid divergences in specialized functions, are discussed.
NASA Astrophysics Data System (ADS)
Tyuryaeva, Irina I.; Lyublinskaya, Olga G.; Podkorytov, Ivan S.; Skrynnikov, Nikolai R.
2017-01-01
Antitumor GO peptides have been designed as dimerization inhibitors of prominent oncoprotein mucin 1. In this study we demonstrate that activity of GO peptides is independent of the level of cellular expression of mucin 1. Furthermore, these peptides prove to be broadly cytotoxic, causing cell death also in normal cells such as dermal fibroblasts and endometrial mesenchymal stem cells. To explore molecular mechanism of their cytotoxicity, we have designed and tested a number of new peptide sequences containing the key CxC or CxxC motifs. Of note, these sequences bear no similarity to mucin 1 except that they also contain a pair of proximal cysteines. Several of the new peptides turned out to be significantly more potent than their GO prototypes. The results suggest that cytotoxicity of these peptides stems from their (moderate) activity as disulfide oxidoreductases. It is expected that such peptides, which we have termed DO peptides, are involved in disulfide-dithiol exchange reaction, resulting in formation of adventitious disulfide bridges in cell proteins. In turn, this leads to a partial loss of protein function and rapid onset of apoptosis. We anticipate that coupling DO sequences with tumor-homing transduction domains can create a potentially valuable new class of tumoricidal peptides.
Tyuryaeva, Irina I.; Lyublinskaya, Olga G.; Podkorytov, Ivan S.; Skrynnikov, Nikolai R.
2017-01-01
Antitumor GO peptides have been designed as dimerization inhibitors of prominent oncoprotein mucin 1. In this study we demonstrate that activity of GO peptides is independent of the level of cellular expression of mucin 1. Furthermore, these peptides prove to be broadly cytotoxic, causing cell death also in normal cells such as dermal fibroblasts and endometrial mesenchymal stem cells. To explore molecular mechanism of their cytotoxicity, we have designed and tested a number of new peptide sequences containing the key CxC or CxxC motifs. Of note, these sequences bear no similarity to mucin 1 except that they also contain a pair of proximal cysteines. Several of the new peptides turned out to be significantly more potent than their GO prototypes. The results suggest that cytotoxicity of these peptides stems from their (moderate) activity as disulfide oxidoreductases. It is expected that such peptides, which we have termed DO peptides, are involved in disulfide-dithiol exchange reaction, resulting in formation of adventitious disulfide bridges in cell proteins. In turn, this leads to a partial loss of protein function and rapid onset of apoptosis. We anticipate that coupling DO sequences with tumor-homing transduction domains can create a potentially valuable new class of tumoricidal peptides. PMID:28091523
Expanding the cerebrospinal fluid endopeptidome.
Hansson, Karl T; Skillbäck, Tobias; Pernevik, Elin; Kern, Silke; Portelius, Erik; Höglund, Kina; Brinkmalm, Gunnar; Holmén-Larsson, Jessica; Blennow, Kaj; Zetterberg, Henrik; Gobom, Johan
2017-03-01
Biomarkers of neurodegenerative disorders are needed to assist in diagnosis, to monitor disease progression and therapeutic interventions, and to provide insight into disease mechanisms. One route to identify such biomarkers is by proteomic and peptidomic analysis of cerebrospinal fluid (CSF). In the current study, we performed an in-depth analysis of the human CSF endopeptidome to establish an inventory that may serve as a basis for future targeted biomarker studies. High-pH RP HPLC was employed for off-line sample prefractionation followed by low-pH nano-LC-MS analysis. Different software programs and scoring algorithms for peptide identification were employed and compared. A total of 18 031 endogenous peptides were identified at a FDR of 1%, increasing the number of known endogenous CSF peptides 10-fold compared to previous studies. The peptides were derived from 2 053 proteins of which more than 60 have been linked to neurodegeneration. Notably, among the findings were six peptides derived from microtubule-associated protein tau, three of which span the diagnostically interesting threonine-181 (Tau-F isoform). Also, 213 peptides from amyloid precursor protein were identified, 58 of which were partially or completely within the sequence of amyloid β 1-40/42, as well as 109 peptides from apolipoprotein E, spanning sequences that discriminate between the E2/E3/E4 isoforms of the protein. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Sezutsu, Hideki; Uchino, Keiro; Kobayashi, Isao; Tamura, Toshiki; Yamamoto, Kimiko; Mita, Kazuei; Shimada, Toru; Kadono-Okuda, Keiko
2018-05-09
Bombyx mori densovirus type 1 (BmDV) is a pathogen that causes flacherie disease in the silkworm. The absolute nonsusceptibility to BmDV among certain silkworm strains is determined independently by two genes, nsd-1 and Nid-1. However, neither of these genes has been molecularly identified to date. Here, we isolated the nsd-1 gene by positional cloning and characterized the properties of its product, NSD-1. Sequence and biochemical analyses revealed that this gene encodes a Bombyx-specific mucin-like glycoprotein with a single transmembrane domain. The NSD-1 protein was specifically expressed in the larval midgut epithelium, the known infection site of BmDV. Sequence analysis of the nsd-1 gene from 13 resistant and 12 susceptible strains suggested that a specific arginine residue in the extracellular tail of the NSD-1 protein was common among susceptible strains. Germline transformation of the susceptible-type nsd-1 (with a single nucleotide substitution) conferred partial susceptibility to resistant larvae, indicating that the + nsd-1 gene is required for the susceptibility of B. mori larvae to BmDV and the susceptibility is solely a result of the substitution of a single amino acid with arginine. Taken together, our results provide striking evidence that a novel membrane-bound mucin-like protein functions as a cell-surface receptor for a densovirus.
Intragenome Diversity of Gene Families Encoding Toxin-like Proteins in Venomous Animals.
Rodríguez de la Vega, Ricardo C; Giraud, Tatiana
2016-11-01
The evolution of venoms is the story of how toxins arise and of the processes that generate and maintain their diversity. For animal venoms these processes include recruitment for expression in the venom gland, neofunctionalization, paralogous expansions, and functional divergence. The systematic study of these processes requires the reliable identification of the venom components involved in antagonistic interactions. High-throughput sequencing has the potential of uncovering the entire set of toxins in a given organism, yet the existence of non-venom toxin paralogs and the misleading effects of partial census of the molecular diversity of toxins make necessary to collect complementary evidence to distinguish true toxins from their non-venom paralogs. Here, we analyzed the whole genomes of two scorpions, one spider and one snake, aiming at the identification of the full repertoires of genes encoding toxin-like proteins. We classified the entire set of protein-coding genes into paralogous groups and monotypic genes, identified genes encoding toxin-like proteins based on known toxin families, and quantified their expression in both venom-glands and pooled tissues. Our results confirm that genes encoding toxin-like proteins are part of multigene families, and that these families arise by recruitment events from non-toxin genes followed by limited expansions of the toxin-like protein coding genes. We also show that failing to account for sequence similarity with non-toxin proteins has a considerable misleading effect that can be greatly reduced by comparative transcriptomics. Our study overall contributes to the understanding of the evolutionary dynamics of proteins involved in antagonistic interactions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.
Takahashi, Kaori; Takabayashi, Atsushi; Tanaka, Ayumi; Tanaka, Ryouichi
2014-01-01
The light-harvesting complex (LHC) constitutes the major light-harvesting antenna of photosynthetic eukaryotes. LHC contains a characteristic sequence motif, termed LHC motif, consisting of 25–30 mostly hydrophobic amino acids. This motif is shared by a number of transmembrane proteins from oxygenic photoautotrophs that are termed light-harvesting-like (LIL) proteins. To gain insights into the functions of LIL proteins and their LHC motifs, we functionally characterized a plant LIL protein, LIL3. This protein has been shown previously to stabilize geranylgeranyl reductase (GGR), a key enzyme in phytol biosynthesis. It is hypothesized that LIL3 functions to anchor GGR to membranes. First, we conjugated the transmembrane domain of LIL3 or that of ascorbate peroxidase to GGR and expressed these chimeric proteins in an Arabidopsis mutant lacking LIL3 protein. As a result, the transgenic plants restored phytol-synthesizing activity. These results indicate that GGR is active as long as it is anchored to membranes, even in the absence of LIL3. Subsequently, we addressed the question why the LHC motif is conserved in the LIL3 sequences. We modified the transmembrane domain of LIL3, which contains the LHC motif, by substituting its conserved amino acids (Glu-171, Asn-174, and Asp-189) with alanine. As a result, the Arabidopsis transgenic plants partly recovered the phytol-biosynthesizing activity. However, in these transgenic plants, the LIL3-GGR complexes were partially dissociated. Collectively, these results indicate that the LHC motif of LIL3 is involved in the complex formation of LIL3 and GGR, which might contribute to the GGR reaction. PMID:24275650
Sequence Analysis of the Genome of Carnation (Dianthus caryophyllus L.)
Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi
2014-01-01
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. ‘Francesco’ was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568 887 315 bp, consisting of 45 088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16 644 bp and 60 737 bp, respectively, and the longest scaffold was 1 287 144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. PMID:24344172
Mammalian splicing factor SF1 interacts with SURP domains of U2 snRNP-associated proteins
Crisci, Angela; Raleff, Flore; Bagdiul, Ivona; Raabe, Monika; Urlaub, Henning; Rain, Jean-Christophe; Krämer, Angela
2015-01-01
Splicing factor 1 (SF1) recognizes the branch point sequence (BPS) at the 3′ splice site during the formation of early complex E, thereby pre-bulging the BPS adenosine, thought to facilitate subsequent base-pairing of the U2 snRNA with the BPS. The 65-kDa subunit of U2 snRNP auxiliary factor (U2AF65) interacts with SF1 and was shown to recruit the U2 snRNP to the spliceosome. Co-immunoprecipitation experiments of SF1-interacting proteins from HeLa cell extracts shown here are consistent with the presence of SF1 in early splicing complexes. Surprisingly almost all U2 snRNP proteins were found associated with SF1. Yeast two-hybrid screens identified two SURP domain-containing U2 snRNP proteins as partners of SF1. A short, evolutionarily conserved region of SF1 interacts with the SURP domains, stressing their role in protein–protein interactions. A reduction of A complex formation in SF1-depleted extracts could be rescued with recombinant SF1 containing the SURP-interaction domain, but only partial rescue was observed with SF1 lacking this sequence. Thus, SF1 can initially recruit the U2 snRNP to the spliceosome during E complex formation, whereas U2AF65 may stabilize the association of the U2 snRNP with the spliceosome at later times. In addition, these findings may have implications for alternative splicing decisions. PMID:26420826
Kulshrestha, Saurabh; Hallan, Vipin; Sharma, Anshul; Seth, Chandrika Attri; Chauhan, Anjali; Zaidi, Aijaz Asghar
2013-09-01
Coat protein (CP) and RNA3 from Prunus necrotic ringspot virus (PNRSV-rose), the most prevalent virus infecting rose in India, were characterized and regions in the coat protein important for self-interaction, during dimer formation were identified. The sequence analysis of CP and partial RNA 3 revealed that the rose isolate of PNRSV in India belongs to PV-32 group of PNRSV isolates. Apart from the already established group specific features of PV-32 group member's additional group-specific and host specific features were also identified. Presence of methionine at position 90 in the amino acid sequence alignment of PNRSV CP gene (belonging to PV-32 group) was identified as the specific conserved feature for the rose isolates of PNRSV. As protein-protein interaction plays a vital role in the infection process, an attempt was made to identify the portions of PNRSV CP responsible for self-interaction using yeast two-hybrid system. It was found (after analysis of the deletion clones) that the C-terminal region of PNRSV CP (amino acids 153-226) plays a vital role in this interaction during dimer formation. N-terminal of PNRSV CP is previously known to be involved in CP-RNA interactions, but our results also suggested that N-terminal of PNRSV CP represented by amino acids 1-77 also interacts with C-terminal (amino acids 153-226) in yeast two-hybrid system, suggesting its probable involvement in the CP-CP interaction.
Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A
2016-01-01
The aim of the present study was to detect canine parvovirus (CPV) from faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR) followed by VP2 gene partial sequencing and molecular characterization of circulating strains in Lithuania. Eleven clinically and antigen-tested positive dog faecal samples, collected during the period of 2014-2015, were investigated by using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of investigated Lithuanian CPV VP2 sequences were closely associated with CPV 2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. The four of eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distantly associated with CPV vaccine strains VP2 sequences (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the close geographically associated samples, but five of them were rather divergent (1.0% less sequence similarity). The one Lithuanian CPV VP2 sequence was closely related with CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conservative and phylogenetically low associated with most commonly used CPV vaccine strains.
Wang, Jimin
2017-06-01
Partial charges of atoms in a molecule and electrostatic potential (ESP) density for that molecule are known to bear a strong correlation. In order to generate a set of point-field force field parameters for molecular dynamics, Kollman and coworkers have extracted atomic partial charges for each of all 20 amino acids using restrained partial charge-fitting procedures from theoretical ESP density obtained from condensed-state quantum mechanics. The magnitude of atomic partial charges for neutral peptide backbone they have obtained is similar to that of partial atomic charges for ionized carboxylate side chain atoms. In this study, the effect of these known atomic partial charges on ESP is examined using computer simulations and compared with the experimental ESP density recently obtained for proteins using electron microscopy. It is found that the observed ESP density maps are most consistent with the simulations that include atomic partial charges of protein backbone. Therefore, atomic partial charges are integral part of atomic properties in protein molecules and should be included in model refinement. © 2017 The Protein Society.
Haut, Larissa H; Gill, Amanda L; Kurupati, Raj K; Bian, Ang; Li, Yan; Giles-Davis, Wynetta; Xiang, Zhiquan; Zhou, Xiang Yang; Ertl, Hildegund C J
2016-10-01
Adenovirus (Ad) is used extensively for construction of viral vectors, most commonly with deletion in its E1 and/or E3 genomic regions. Previously, our attempts to insert envelope proteins (Env) of HIV-1 into such vectors based on chimpanzee-derived Ad (AdC) viruses were thwarted. Here, we describe that genetic instability of an E1- and E3-deleted AdC vector of serotype C6 expressing Env of HIV-1 can be overcome by reinsertion of E3 sequences with anti-apoptotic activities. This partial E3 deletion presumably delays premature death of HEK-293 packaging cell lines due to Env-induced cell apoptosis. The same partial E3 deletion also allows for the generation of stable glycoprotein 140 (gp140)- and gp160-expressing Ad vectors based on AdC7, a distinct AdC serotype. Env-expressing AdC vectors containing the partial E3 deletion are genetically stable upon serial cell culture passaging, produce yields comparable to those of other AdC vectors, and induce transgene product-specific antibody responses in mice. A partial E3 deletion thereby allows expansion of the repertoire of transgenes that can be expressed by Ad vectors.
Ge, Jingping; Sun, Yanyang; Xin, Xing; Wang, Ying; Ping, Wenxiang
2016-01-14
Bacteriocins have antimicrobial activities against food-spoiling bacteria and food-borne pathogens. Paracin 1.7, a bacteriocin synthesized by Lactobacillus paracasei HD1-7 isolated from Chinese sauerkraut juice, was studied. Following partial purification with ammonium sulfate precipitation, CM Sepharose Fast Flow, and Sephadex G-10 chromatography, the molecular weight of Paracin 1.7 was about 10 kDa based on Tricine-SDS-PAGE results. A 2.87 fold purified bacteriocin was produced, reaching a final yield of 39.93% and the specific activity of 1.56 × 10(3) AU/mg. The N-terminal amino acid sequence of Paracin 1.7 was VSNTFFA, and the LC/LTQ results revealed that the N-terminal amino acid sequence was similar to that of ABC-type oligopeptide transport system protein and N-acetylmuramoyl-L-alanine amidase. Paracin 1.7 was sensitive to protease K, had antimicrobial activities at a broad pH range (3.0-8.0), and was heat resistant (121 °C for 20 min). Paracin 1.7 from Lactobacillus paracasei HD1-7 is a novel bacteriocin that has potential applications in food preservation.
Ge, Jingping; Sun, Yanyang; Xin, Xing; Wang, Ying; Ping, Wenxiang
2016-01-01
Bacteriocins have antimicrobial activities against food-spoiling bacteria and food-borne pathogens. Paracin 1.7, a bacteriocin synthesized by Lactobacillus paracasei HD1-7 isolated from Chinese sauerkraut juice, was studied. Following partial purification with ammonium sulfate precipitation, CM Sepharose Fast Flow, and Sephadex G-10 chromatography, the molecular weight of Paracin 1.7 was about 10 kDa based on Tricine-SDS-PAGE results. A 2.87 fold purified bacteriocin was produced, reaching a final yield of 39.93% and the specific activity of 1.56 × 103 AU/mg. The N-terminal amino acid sequence of Paracin 1.7 was VSNTFFA, and the LC/LTQ results revealed that the N-terminal amino acid sequence was similar to that of ABC-type oligopeptide transport system protein and N-acetylmuramoyl-L-alanine amidase. Paracin 1.7 was sensitive to protease K, had antimicrobial activities at a broad pH range (3.0–8.0), and was heat resistant (121 °C for 20 min). Paracin 1.7 from Lactobacillus paracasei HD1-7 is a novel bacteriocin that has potential applications in food preservation. PMID:26763314
Koenig, R; Loss, S; Specht, J; Varrelmann, M; Lüddecke, P; Deml, G
2009-03-01
Beet necrotic yellow vein virus (BNYVV) A type isolates E12 and S8, originating from areas where resistance-breaking had or had not been observed, respectively, served as starting material for studying the influence of sequence variations in BNYVV RNA 3 on virus accumulation in partially resistant sugar beet varieties. Sub-isolates containing only RNAs 1 and 2 were obtained by serial local lesion passages; biologically active cDNA clones were prepared for RNAs 3 which differed in their coding sequences for P25 aa 67, 68 and 129. Sugar beet seedlings were mechanically inoculated with RNA 1+2/RNA 3 pseudorecombinants. The origin of RNAs 1+2 had little influence on virus accumulation in rootlets. E12 RNA 3 coding for V(67)C(68)Y(129) P25, however, enabled a much higher virus accumulation than S8 RNA 3 coding for A(67)H(68)H(129) P25. Mutants revealed that this was due only to the V(67) 'GUU' codon as opposed to the A(67) 'GCU' codon.
NASA Astrophysics Data System (ADS)
Sethaphong, Latsavongsakda
This work examines smart material properties of rational self-assembly and molecular recognition found in nano-biosystems. Exploiting the sequence and structural information encoded within nucleic acids and proteins will permit programmed synthesis of nanomaterials and help create molecular machines that may carry out new roles involving chemical catalysis and bioenergy. Responsive to different ionic environments thru self-reorgnization, nucleic acids (NA) are nature's signature smart material; organisms such as viruses and bacteria use features of NAs to react to their environment and orchestrate their lifecycle. Furthermore, nucleic acid systems (both RNA and DNA) are currently exploited as scaffolds; recent applications have been showcased to build bioelectronics and biotemplated nanostructures via directed assembly of multidimensional nanoelectronic devices 1. Since the most stable and rudimentary structure of nucleic acids is the helical duplex, these were modeled in order to examine the influence of the microenvironment, sequence, and cation-dependent perturbations of their canonical forms. Due to their negatively charged phosphate backbone, NA's rely on counterions to overcome the inherent repulsive forces that arise from the assembly of two complementary strands. As a realistic model system, we chose the HIV-TAR helix (PDB ID: 397D) to study specific sequence motifs on cation sequestration. At physiologically relevant concentrations of sodium and potassium ions, we observed sequence based effects where purine stretches were adept in retaining high residency cations. The transitional space between adenine and guanosine nucleotides (ApG step) in a sequence proved the most favorable. This work was the first to directly show these subtle interactions of sequence based cationic sequestration and may be useful for controlling metallization of nucleic acids in conductive nanowires. Extending the study further, we explored the degree to which the structure of NA duplexes alone interacted with cations distinct from a specific sequence. Under physiologically relevant conditions, a duplex of RNA polyguanine-polycitidine was highly responsive and able to sequester cations to the middle of the purine stretches. The least responsive structure was a DNA polyadenine-polythymine duplex. A random sequence DNA duplex contorted into an RNA-like helix resulted in cationic dynamics similar to RNA systems. These studies showed that cation diffusive binding events in nucleic acid duplex structures are sequence specific and heavily influenced by structural aspects helical forms to account for much of the differences observed. Although structural information in nucleic acids is encoded within their sequence, linking amino acid sequence to protein structure is murkier; the structural information within proteins is encoded by the folding process itself: a complex phenomenon driven toward the equilibrium state of the active conformation. Upwards of two thirds of a protein's sequence can be substituted with similar amino acids without significantly perturbing its function; conserved residues of about 10% seem to be vital; since evolutionary selection pressure in proteins operates 3-dimenionally, a linear sequence is partially informative. We explored this problem by folding de-novo the cytosolic portion of the membrane protein, cellulose synthase, CESA1 from upland cotton, Gossypium hirsutum (Ghcesa1). The cytoplasmic region was generated by homology modeling and refined with molecular dynamics. These mutations impair local structural flexibility which likely results in cellulose that is produced at a lower rate and is less crystalline. Additional modeling of fragments of cellulose synthases from the model plant, Arabidopsis thaliana, offered novel insights into the function of conserved cytosolic domains within plant cellulose synthases. Transport mechanisms related to the transmembrane region revealed significant differences between plants and a bacterial complex. These studies generated possible mutations that may allow for the creation of new synthases and identified other avenues of research in order to develop technologies that may alter the crystallinity and other useful properties of cellulose. 1. Karplus, K., SAM-T08, HMM-based protein structure prediction. Nucleic Acids Research, 2009. 37: p. W492-W497.
Pollock, Steve V; Colombo, Sergio L; Prout, Davey L; Godfrey, Ashley C; Moroney, James V
2003-12-01
This report describes a Chlamydomonas reinhardtii mutant that lacks Rubisco activase (Rca). Using the BleR (bleomycin resistance) gene as a positive selectable marker for nuclear transformation, an insertional mutagenesis screen was performed to select for cells that required a high-CO2 atmosphere for optimal growth. The DNA flanking the BleR insert of one of the high-CO2-requiring strains was cloned using thermal asymmetric interlaced-polymerase chain reaction and inverse polymerase chain reaction and sequenced. The flanking sequence matched the C. reinhardtii Rca cDNA sequence previously deposited in the National Center for Biotechnology Information database. The loss of a functional Rca in the strain was confirmed by the absence of Rca mRNA and protein. The open reading frame for Rca was cloned and expressed in pSL18, a C. reinhardtii expression vector conferring paromomycin resistance. This construct partially complemented the mutant phenotype, supporting the hypothesis that the loss of Rca was the reason the mutant grew poorly in a low-CO2 atmosphere. Sequencing of the C. reinhardtii Rca gene revealed that it contains 10 exons ranging in size from 18 to 470 bp. Low-CO2-grown rca1 cultures had a growth rate and maximum rate of photosynthesis 60% of wild-type cells. Results obtained from experiments on a cia5 rca1 double mutant also suggest that the CO2-concentrating mechanism partially compensates for the absence of an active Rca in the green alga C. reinhardtii.
Luisi-DeLuca, C; Clark, A J; Kolodner, R D
1988-01-01
Exonuclease VIII (exoVIII) of Escherichia coli has been purified from a strain carrying a plasmid-encoded recE gene by using a new procedure. This procedure yielded 30 times more protein per gram of cells, and the protein had a twofold higher specific activity than the enzyme purified by the previously published procedure (J. W. Joseph and R. Kolodner, J. Biol. Chem. 258:10411-10417, 1983). The sequence of the 12 N-terminal amino acids was also obtained and found to correspond to one of the open reading frames predicted from the nucleic acid sequence of the recE region of Rac (C. Chu, A. Templin, and A. J. Clark, manuscript in preparation). Polyclonal antibodies directed against purified exoVIII were also prepared. Cell-free extracts prepared from strains containing a wide range of chromosomal- or plasmid-encoded point, insertion, and deletion mutations which result in expression of exoVIII were examined by Western blot (immunoblot) analysis. This analysis showed that two point sbcA mutations (sbcA5 and sbcA23) and the sbc insertion mutations led to the synthesis of the 140-kilodalton (kDa) polypeptide of wild-type exoVIII. Plasmid-encoded partial deletion mutations of recE reduced the size of the cross-reacting protein(s) in direct proportion to the size of the deletion, even though exonuclease activity was still present. The analysis suggests that 39 kDa of the 140-kDa exoVIII subunit is all that is essential for exonuclease activity. One of the truncated but functional exonucleases (the pRAC3 exonuclease) has been purified and confirmed to be a 41-kDa polypeptide. The first 18 amino acids from the N terminus of the 41-kDa pRAC3 exonuclease were sequenced and fond to correspond to one of the translational start signals predicted from the nucleotide sequence of radC (Chu et al., in preparation). Images PMID:3056915
Bradford, P A; Urban, C; Mariano, N; Projan, S J; Rahal, J J; Bush, K
1997-01-01
Six Escherichia coli and 12 Klebsiella pneumoniae isolates from a single hospital expressed a common beta-lactamase with a pI of approximately 9.0 and were resistant to cefoxitin and cefotetan (MIC ranges, 64 to > 128 and 16 to > 128 micrograms/ml, respectively). Seventeen of the 18 strains produced multiple beta-lactamases. Most significantly, three K. pneumoniae strains were also resistant to imipenem (MICs, 8 to 32 micrograms/ml). Spectrophotometric beta-lactamase assays with purified enzyme indicated hydrolysis of cephamycins, in addition to cephaloridine and benzylpenicillin. The 4ene encoding the pI 9.0 beta-lactamase (designated ACT-1 for AmpC type) was cloned and sequenced, which revealed an ampC-type beta-lactamase gene that originated from Enterobacter cloacae and that had 86% sequence homology to the P99 beta-lactamase and 94% homology to the partial sequence of MIR-1. Southern blotting revealed that the gene encoding ACT-1 was on a large plasmid in some of the K. pneumoniae strains as well as on the chromosomes of all of the strains, suggesting that the gene is located on an easily mobilized element. Outer membrane protein profiles of the K. pneumoniae strains revealed that the three imipenem-resistant strains were lacking a major outer membrane protein of approximately 42 kDa which was present in the imipenem-susceptible strains. ACT-1 is the first plasmid-mediated AmpC-type beta-lactamase derived from Enterobacter which has been completely sequenced. This work demonstrates that in addition to resistance to cephamycins, imipenem resistance can occur in K. pneumoniae when a high level of the ACT-1 beta-lactamase is produced in combination with the loss of a major outer membrane protein. PMID:9055993
Suckau, Detlev; Resemann, Anja
2009-12-01
The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. Resulting from these analyses were the protein identity, the simultaneous assignment of the N- and C-termini and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS allowed to efficiently match protein sequences quickly and to validate recombinant protein structures-in particular, protein termini-on the level of undigested proteins.
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.
Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T
1996-10-31
Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1.), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked in the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.
1997-01-01
A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Reassortant group A rotavirus from straw-colored fruit bat (Eidolon helvum).
Esona, Mathew D; Mijatovic-Rustempasic, Slavica; Conrardy, Christina; Tong, Suxiang; Kuzmin, Ivan V; Agwanda, Bernard; Breiman, Robert F; Banyai, Krisztian; Niezgoda, Michael; Rupprecht, Charles E; Gentsch, Jon R; Bowen, Michael D
2010-12-01
Bats are known reservoirs of viral zoonoses. We report genetic characterization of a bat rotavirus (Bat/KE4852/07) detected in the feces of a straw-colored fruit bat (Eidolon helvum). Six bat rotavirus genes (viral protein [VP] 2, VP6, VP7, nonstructural protein [NSP] 2, NSP3, and NSP5) shared ancestry with other mammalian rotaviruses but were distantly related. The VP4 gene was nearly identical to that of human P[6] rotavirus strains, and the NSP4 gene was closely related to those of previously described mammalian rotaviruses, including human strains. Analysis of partial sequence of the VP1 gene indicated that it was distinct from cognate genes of other rotaviruses. No sequences were obtained for the VP3 and NSP1 genes of the bat rotavirus. This rotavirus was designated G25-P[6]-I15-R8(provisional)-C8-Mx-Ax-N8-T11-E2-H10. Results suggest that several reassortment events have occurred between human, animal, and bat rotaviruses. Several additional rotavirus strains were detected in bats.
Reassortant Group A Rotavirus from Straw-colored Fruit Bat (Eidolon helvum)
Esona, Mathew D.; Mijatovic-Rustempasic, Slavica; Conrardy, Christina; Tong, Suxiang; Kuzmin, Ivan V.; Agwanda, Bernard; Breiman, Robert F.; Banyai, Krisztian; Niezgoda, Michael; Rupprecht, Charles E.; Gentsch, Jon R.
2010-01-01
Bats are known reservoirs of viral zoonoses. We report genetic characterization of a bat rotavirus (Bat/KE4852/07) detected in the feces of a straw-colored fruit bat (Eidolon helvum). Six bat rotavirus genes (viral protein [VP] 2, VP6, VP7, nonstructural protein [NSP] 2, NSP3, and NSP5) shared ancestry with other mammalian rotaviruses but were distantly related. The VP4 gene was nearly identical to that of human P[6] rotavirus strains, and the NSP4 gene was closely related to those of previously described mammalian rotaviruses, including human strains. Analysis of partial sequence of the VP1 gene indicated that it was distinct from cognate genes of other rotaviruses. No sequences were obtained for the VP3 and NSP1 genes of the bat rotavirus. This rotavirus was designated G25-P[6]-I15-R8(provisional)-C8-Mx-Ax-N8-T11-E2-H10. Results suggest that several reassortment events have occurred between human, animal, and bat rotaviruses. Several additional rotavirus strains were detected in bats. PMID:21122212
Occurrence of hemocyanin in ostracod crustaceans.
Marxen, Julia C; Pick, Christian; Oakley, Todd H; Burmester, Thorsten
2014-08-01
Hemocyanin is a copper-containing protein that transports O2 in the hemolymph of many arthropod species. Within the crustaceans, hemocyanin appeared to be restricted to Malacostraca but has recently been identified in Remipedia. Here, we report the occurrence of hemocyanin in ostracods, indicating that this respiratory protein is more widespread within crustaceans than previously thought. By analyses of expressed sequence tags and by RT-PCR, we obtained four full length and nine partial hemocyanin sequences from six of ten investigated ostracod species. Hemocyanin was identified in Myodocopida (Actinoseta jonesi, Cypridininae sp., Euphilomedes morini, Skogsbergia lerneri, Vargula tsujii) and Platycopida (Cytherelloidea californica) but not in Podocopida. We found no evidence for the presence of hemoglobin in any of these ostracod species. Like in other arthropods, we identified multiple hemocyanin subunits (up to six) to occur in a single ostracod species. Bayesian phylogenetic analyses showed that ostracod hemocyanin subunit diversity evolved independently from that of other crustaceans. Ostracod hemocyanin subunits were found paraphyletic, with myodocopid and platycopid subunits forming distinct clades within those of the crustaceans. This pattern suggests that ostracod hemocyanins originated from distinct subunits in the pancrustacean stemline.
Gardères, Johan; Domart-Coulon, Isabelle; Marie, Arul; Hamer, Bojan; Batel, Renato; Müller, Werner E G; Bourguet-Kondracki, Marie-Lise
2016-10-01
Carbohydrate-binding proteins were purified from the marine calcareous sponge Clathrina clathrus via affinity chromatography on lactose and N-acetyl glucosamine-agarose resins. Proteomic analysis of acrylamide gel separated protein subunits obtained in reducing conditions pointed out several candidates for lectins. Based on amino-acid sequence similarity, two peptides displayed homology with the jack bean lectin Concanavalin A, including a conserved domain shared by proteins in the L-type lectin superfamily. An N-acetyl glucosamine - binding protein complex, named clathrilectin, was further purified via gel filtration chromatography, bioguided with a diagnostic rabbit erythrocyte haemagglutination assay, and its activity was found to be calcium dependent. Clathrilectin, a protein complex of 3200kDa estimated by gel filtration, is composed of monomers with apparent molecular masses of 208 and 180kDa estimated on 10% SDS-PAGE. Nine internal peptides were identified using proteomic analyses, and compared to protein libraries from the demosponge Amphimedon queenslandica and a calcareous sponge Sycon sp. from the Adriatic Sea. The clathrilectin is the first lectin isolated from a calcareous sponge and displays homologies with predicted sponge proteins potentially involved in cell aggregation and interaction with bacteria. Copyright © 2016 Elsevier Inc. All rights reserved.
Pucheta-Martinez, Encarna; D'Amelio, Nicola; Lelli, Moreno; Martinez-Torrecuadrada, Jorge L; Sudol, Marius; Saladino, Giorgio; Gervasio, Francesco Luigi
2016-07-26
WW domains are small domains present in many human proteins with a wide array of functions and acting through the recognition of proline-rich sequences. The WW domain belonging to polyglutamine tract-binding protein 1 (PQBP1) is of particular interest due to its direct involvement in several X chromosome-linked intellectual disabilities, including Golabi-Ito-Hall (GIH) syndrome, where a single point mutation (Y65C) correlates with the development of the disease. The mutant cannot bind to its natural ligand WBP11, which regulates mRNA processing. In this work we use high-field high-resolution NMR and enhanced sampling molecular dynamics simulations to gain insight into the molecular causes the disease. We find that the wild type protein is partially unfolded exchanging among multiple beta-strand-like conformations in solution. The Y65C mutation further destabilizes the residual fold and primes the protein for the formation of a disulphide bridge, which could be at the origin of the loss of function.
NASA Astrophysics Data System (ADS)
Pucheta-Martinez, Encarna; D'Amelio, Nicola; Lelli, Moreno; Martinez-Torrecuadrada, Jorge L.; Sudol, Marius; Saladino, Giorgio; Gervasio, Francesco Luigi
2016-07-01
WW domains are small domains present in many human proteins with a wide array of functions and acting through the recognition of proline-rich sequences. The WW domain belonging to polyglutamine tract-binding protein 1 (PQBP1) is of particular interest due to its direct involvement in several X chromosome-linked intellectual disabilities, including Golabi-Ito-Hall (GIH) syndrome, where a single point mutation (Y65C) correlates with the development of the disease. The mutant cannot bind to its natural ligand WBP11, which regulates mRNA processing. In this work we use high-field high-resolution NMR and enhanced sampling molecular dynamics simulations to gain insight into the molecular causes the disease. We find that the wild type protein is partially unfolded exchanging among multiple beta-strand-like conformations in solution. The Y65C mutation further destabilizes the residual fold and primes the protein for the formation of a disulphide bridge, which could be at the origin of the loss of function.
The unfolding mechanism of monomeric mutant SOD1 by simulated force spectroscopy.
Habibi, Mona; Rottler, Jörg; Plotkin, Steven S
2017-11-01
Mechanical unfolding of mutated apo, disulfide-reduced, monomeric superoxide dismutase 1 protein (SOD1) has been simulated via force spectroscopy techniques, using both an all-atom (AA), explicit solvent model and a coarse-grained heavy-atom Gō (HA-Gō) model. The HA-Gō model was implemented at two different pulling speeds for comparison. The most-common sequence of unfolding in the AA model agrees well with the most-common unfolding sequence of the HA-Gō model, when the same normalized pulling rate was used. Clustering of partially-native structures as the protein unfolds shows that the AA and HA-Gō models both exhibit a dominant pathway for early unfolding, which eventually bifurcates repeatedly to multiple branches after the protein is about half-unfolded. The force-extension curve exhibits multiple force drops, which are concomitant with jumps in the local interaction potential energy between specific β-strands in the protein. These sudden jumps in the potential energy coincide with the dissociation of specific pairs of β-strands, and thus intermediate unfolding events. The most common sequence of β-strand dissociation in the unfolding pathway of the AA model is β-strands 5, 4, 8, 7, 1, 2, then finally β-strands 3 and 6. The observation that β-strand 5 is among the first to unfold here, but the last to unfold in simulations of loop-truncated SOD1, could imply the existence of an evolutionary compensation mechanism, which would stabilize β-strands flanking long loops against their entropic penalty by strengthening intramolecular interactions. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman. Copyright © 2017 Elsevier B.V. All rights reserved.
The disorderly conduct of Hsc70 and its interaction with the Alzheimer's related Tau protein.
Taylor, Isabelle R; Ahmad, Atta; Wu, Taia; Nordhues, Bryce A; Bhullar, Anup; Gestwicki, Jason E; Zuiderweg, Erik R P
2018-05-15
Hsp70 chaperones bind to various protein substrates for folding, trafficking, and degradation. Considerable structural information is available about how prokaryotic Hsp70 (DnaK) binds substrates, but less is known about mammalian Hsp70s, of which there are 13 isoforms encoded in the human genome. Here, we report the interaction between the human Hsp70 isoform heat shock cognate 71 KDa protein (Hsc70 or HSPA8) and peptides derived from the microtubule-associated protein tau, which is linked to Alzheimer's disease. For structural studies, we used an Hsc70 construct (called BETA) comprising the substrate-binding domain, but lacking the lid. Importantly, we found that truncating the lid does not significantly impair Hsc70's chaperone activity or allostery in vitro. Using NMR, we show that BETA is partially dynamically disordered in the absence of substrate and that binding of the tau sequence GKVQIINKKG (with a KD = 500 nM) causes dramatic rigidification of BETA. Nuclear Overhauser effect distance measurements revealed that tau binds to the canonical substrate-binding cleft, similar to the binding observed with DnaK. To further develop BETA as a tool for studying Hsc70 interactions, we also measured BETA binding in NMR and fluorescent competition assays to peptides derived from huntingtin, insulin, a second tau-recognition sequence, and a KFERQ-like sequence linked to chaperone-mediated autophagy. We found that the insulin C-peptide binds BETA with high affinity (KD < 100 nM), whereas the others do not (KD > 100 μM). Together, our findings reveal several similarities and differences in how prokaryotic and mammalian Hsp70 isoforms interact with different substrate peptides. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Diallinas, G; Gorfinkiel, L; Arst, H N; Cecchetto, G; Scazzocchio, C
1995-04-14
In Aspergillus nidulans, loss-of-function mutations in the uapA and azgA genes, encoding the major uric acid-xanthine and hypoxanthine-adenine-guanine permeases, respectively, result in impaired utilization of these purines as sole nitrogen sources. The residual growth of the mutant strains is due to the activity of a broad specificity purine permease. We have identified uapC, the gene coding for this third permease through the isolation of both gain-of-function and loss-of-function mutations. Uptake studies with wild-type and mutant strains confirmed the genetic analysis and showed that the UapC protein contributes 30% and 8-10% to uric acid and hypoxanthine transport rates, respectively. The uapC gene was cloned, its expression studied, its sequence and transcript map established, and the sequence of its putative product analyzed. uapC message accumulation is: (i) weakly induced by 2-thiouric acid; (ii) repressed by ammonium; (iii) dependent on functional uaY and areA regulatory gene products (mediating uric acid induction and nitrogen metabolite repression, respectively); (iv) increased by uapC gain-of-function mutations which specifically, but partially, suppress a leucine to valine mutation in the zinc finger of the protein coded by the areA gene. The putative uapC gene product is a highly hydrophobic protein of 580 amino acids (M(r) = 61,251) including 12-14 putative transmembrane segments. The UapC protein is highly similar (58% identity) to the UapA permease and significantly similar (23-34% identity) to a number of bacterial transporters. Comparisons of the sequences and hydropathy profiles of members of this novel family of transporters yield insights into their structure, functionally important residues, and possible evolutionary relationships.
Detection of a new bat gammaherpesvirus in the Philippines.
Watanabe, Shumpei; Ueda, Naoya; Iha, Koichiro; Masangkay, Joseph S; Fujii, Hikaru; Alviola, Phillip; Mizutani, Tetsuya; Maeda, Ken; Yamane, Daisuke; Walid, Azab; Kato, Kentaro; Kyuwa, Shigeru; Tohya, Yukinobu; Yoshikawa, Yasuhiro; Akashi, Hiroomi
2009-08-01
A new bat herpesvirus was detected in the spleen of an insectivorous bat (Hipposideros diadema, family Hipposideridae) collected on Panay Island, the Philippines. PCR analyses were performed using COnsensus-DEgenerate Hybrid Oligonucleotide Primers (CODEHOPs) targeting the herpesvirus DNA polymerase (DPOL) gene. Although we obtained PCR products with CODEHOPs, direct sequencing using the primers was not possible because of high degree of degeneracy. Direct sequencing technology developed in our rapid determination system of viral RNA sequences (RDV) was applied in this study, and a partial DPOL nucleotide sequence was determined. In addition, a partial gB gene nucleotide sequence was also determined using the same strategy. We connected the partial gB and DPOL sequences with long-distance PCR, and a 3741-bp nucleotide fragment, including the 3' part of the gB gene and the 5' part of the DPOL gene, was finally determined. Phylogenetic analysis showed that the sequence was novel and most similar to those of the subfamily Gammaherpesvirinae.
Desbiez, C; Lecoq, H
2004-08-01
Watermelon mosaic virus (WMV, Potyvirus) is a potyvirus with a worldwide distribution, mostly in temperate and mediterranean regions. According to the partial sequences that were available, WMV appeared to share high sequence similarity with Soybean mosaic virus (SMV), and it was almost considered as a strain of SMV in spite of its different and much broader host range. Like SMV, it was also related to legume-infecting potyviruses belonging to the " Bean common mosaic virus (BCMV) subgroup". In this paper we obtained the full-length sequence of WMV, and we confirmed that this virus is very closely related to SMV in most of its genome; however, there is evidence for an interspecific recombination in the P1 protein, as the P1 of WMV was 135 amino-acids longer than that of SMV, and the N-terminal half of the P1 showed no relation to SMV but was 85% identical to BCMV. This suggests that WMV has emerged through an ancestral recombination event, and supports the distinction of WMV and SMV as separate taxonomic units.
Salinas, Alejandro; Vega, Marcela; Lienqueo, María Elena; Garcia, Alejandro; Carmona, Rene; Salazar, Oriana
2011-12-10
Total cDNA isolated from cellulolytic fungi cultured in cellulose was examined for the presence of sequences encoding for endoglucanases. Novel sequences encoding for glycoside hydrolases (GHs) were identified in Fusarium oxysporum, Ganoderma applanatum and Trametes versicolor. The cDNA encoding for partial sequences of GH family 61 cellulases from F. oxysporum and G. applanatum shares 58 and 68% identity with endoglucanases from Glomerella graminicola and Laccaria bicolor, respectively. A new GH family 5 endoglucanase from T. versicolor was also identified. The cDNA encoding for the mature protein was completely sequenced. This enzyme shares 96% identity with Trametes hirsuta endoglucanase and 22% with Trichoderma reesei endoglucanase II (EGII). The enzyme, named TvEG, has N-terminal family 1 carbohydrate binding module (CBM1). The full length cDNA was cloned into the pPICZαB vector and expressed as an active, extracellular enzyme in the methylotrophic yeast Pichia pastoris. Preliminary studies suggest that T. versicolor could be useful for lignocellulose degradation. Copyright © 2011 Elsevier Inc. All rights reserved.
High-speed optical phase-shifting apparatus
Zortman, William A.
2016-11-08
An optical phase shifter includes an optical waveguide, a plurality of partial phase shifting elements arranged sequentially, and control circuitry electrically coupled to the partial phase shifting elements. The control circuitry is adapted to provide an activating signal to each of the N partial phase shifting elements such that the signal is delayed by a clock cycle between adjacent partial phase shifting elements in the sequence. The transit time for a guided optical pulse train between the input edges of consecutive partial phase shifting elements in the sequence is arranged to be equal to a clock cycle, thereby enabling pipelined processing of the optical pulses.
Cui, Hongguang; Wang, Aiming
2016-05-15
The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature proteins, with the majority of them having been at least partially functionally characterized. However, the functional role of a small protein named 6K1 remains obscure. In this study, we showed that deletion of 6K1 or a short motif/region of 6K1 in the full-length cDNA clones of plum pox virus abolishes viral replication and that mutation of the N- or C-terminal cleavage sites of 6K1 to prevent its release from the polyprotein greatly attenuates or completely inhibits viral replication, suggesting its important role in potyviral infection. We report that 6K1 forms punctate structures and targets the replication vesicles in PPV-infected plant leaf cells at the early infection stage. Our data reveal that 6K1 is an important viral protein of the potyviral replication complex. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Characterization of parasporin gene harboring Indian isolates of Bacillus thuringiensis.
Lenina, N K; Naveenkumar, A; Sozhavendan, A E; Balakrishnan, N; Balasubramani, V; Udayasuriyan, V
2014-10-01
Bacillus thuringiensis (Bt) is popularly known as insecticidal bacterium. However, non-insecticidal Bt strains are more extensively available in natural environment than the insecticidal ones. Parasporin (PS) is a collection of genealogically heterogeneous Cry proteins synthesized in non-insecticidal isolates of Bt. An important character generally related with PS proteins is their strong cytocidal activity preferentially on human cancer cells of various origins. Identification and characterization of novel parasporin protein which are non-hemolytic and non-insecticidal but having selective anticancer activity raise the possibility of a novel application of Bt in medical field. In the present study, seven new indigenous isolates (T6, T37, T68, T98, T165, T186, and T461) of Bt showed variation in colony morphology, crystal characters and protein profiles with each other. Out of the seven new isolates screened for parasporin (ps) and cry genes, two of the new indigenous isolates (T98 and T186) of Bt showed the presence of ps4 gene. Partial ps4 gene was cloned from the two new isolates and the sequence of partial ps4 gene showed high homology with its holotype ps4Aa1. These two isolates were characterized based on the proteolytic processing of the inclusion proteins and the proteolytic products were found to be comparable to the PS4 reference strain A1470. The two isolates of Bt did not show toxicity toward Spodoptera litura and Helicoverpa armigera. Based on the results of this study, it can be concluded that the isolates T98 and T186 are parasporin producers.
Kumar, Avishek; Campitelli, Paul; Thorpe, M F; Ozkan, S Banu
2015-12-01
The most successful protein structure prediction methods to date have been template-based modeling (TBM) or homology modeling, which predicts protein structure based on experimental structures. These high accuracy predictions sometimes retain structural errors due to incorrect templates or a lack of accurate templates in the case of low sequence similarity, making these structures inadequate in drug-design studies or molecular dynamics simulations. We have developed a new physics based approach to the protein refinement problem by mimicking the mechanism of chaperons that rehabilitate misfolded proteins. The template structure is unfolded by selectively (targeted) pulling on different portions of the protein using the geometric based technique FRODA, and then refolded using hierarchically restrained replica exchange molecular dynamics simulations (hr-REMD). FRODA unfolding is used to create a diverse set of topologies for surveying near native-like structures from a template and to provide a set of persistent contacts to be employed during re-folding. We have tested our approach on 13 previous CASP targets and observed that this method of folding an ensemble of partially unfolded structures, through the hierarchical addition of contact restraints (that is, first local and then nonlocal interactions), leads to a refolding of the structure along with refinement in most cases (12/13). Although this approach yields refined models through advancement in sampling, the task of blind selection of the best refined models still needs to be solved. Overall, the method can be useful for improved sampling for low resolution models where certain of the portions of the structure are incorrectly modeled. © 2015 Wiley Periodicals, Inc.
Madison, Jon M.; Dudley, Aimée M.; Winston, Fred
1998-01-01
Spt3 and Mot1 are two transcription factors of Saccharomyces cerevisiae that are thought to act in a related fashion to control the function of TATA-binding protein (TBP). Current models suggest that while Spt3 and Mot1 do not directly interact, they do function in a related fashion to stabilize the TBP-TATA interaction at particular promoters. Consistent with this model, certain combinations of spt3 and mot1 mutations are inviable. To identify additional proteins related to Spt3 and Mot1 functions, we screened for high-copy-number suppressors of the mot1 spt3 inviability. This screen identified a previously unstudied gene, MOT3, that encodes a zinc finger protein. We show that Mot3 binds in vitro to three sites within the retrotransposon Ty long terminal repeat (δ) sequence. One of these sites is immediately 5′ of the δ TATA region. Although a mot3 null mutation causes no strong phenotypes, it does cause some mild phenotypes, including a very modest increase in Ty mRNA levels, partial suppression of transcriptional defects caused by a mot1 mutation, and partial suppression of an spt3 mutation. These results, in conjunction with those of an independent study of Mot3 (A. Grishin, M. Rothenberg, M. A. Downs, and K. J. Blumer, Genetics, in press), suggest that this protein plays a varied role in gene expression that may be largely redundant with other factors. PMID:9528759
Watanabe, K P; Kawai, Y K; Nakayama, S M M; Ikenaka, Y; Mizukawa, H; Takaesu, N; Ito, M; Ikushiro, S-I; Sakaki, T; Ishizuka, M
2015-04-01
Large interspecies differences in avian xenobiotic metabolism have been revealed by microsome-based studies, but specific enzyme isoforms in different bird species have not yet been compared. We have previously shown that CYP2C23 genes are the most induced CYP isoforms in chicken liver. In this study, we collected partial CYP2C23a gene sequences from eight avian species (ostrich, blue-eared pheasant, snowy owl, great-horned owl, Chilean flamingo, peregrin falcon, Humboldt penguin, and black-crowned night heron) selected to cover the whole avian lineage: Paleognathae, Galloanserae, and Neoaves. Genetic analysis showed that CYP2C23 genes of Galloanserae species (chicken and blue-eared pheasant) had unique characteristics. We found some duplicated genes (CYP2C23a and CYP2C23b) and two missing amino acid residues in Galloanserae compared to the other two lineages. The genes have lower homology than in other avian lineages, which suggests Galloanserae-specific rapid evolutionary changes. These genetic features suggested that the Galloanserae are not the most representative avian species, considering that the Neoaves comprise more than 95% of birds. Moreover, we succeeded in synthesizing an antipeptide polyclonal antibody against the region of CYP2C23 protein conserved in avians. However, comparative quantitation of CYP2C23 proteins in livers from six species showed that expression levels of these proteins differed no more than fourfold. Further study is needed to clarify the function of avian CYP2C23 proteins. © 2014 John Wiley & Sons Ltd.
Stokman, Marijn F; Oud, Machteld M; van Binsbergen, Ellen; Slaats, Gisela G; Nicolaou, Nayia; Renkema, Kirsten Y; Nijman, Isaac J; Roepman, Ronald; Giles, Rachel H; Arts, Heleen H; Knoers, Nine V A M; van Haelst, Mieke M
2016-06-01
We report an 11-year-old girl with mild intellectual disability, skeletal anomalies, congenital heart defect, myopia, and facial dysmorphisms including an extra incisor, cup-shaped ears, and a preauricular skin tag. Array comparative genomic hybridization analysis identified a de novo 4.5-Mb microdeletion on chromosome 14q24.2q24.3. The deleted region and phenotype partially overlap with previously reported patients. Here, we provide an overview of the literature on 14q24 microdeletions and further delineate the associated phenotype. We performed exome sequencing to examine other causes for the phenotype and queried genes present in the 14q24.2q24.3 microdeletion that are associated with recessive disease for variants in the non-deleted allele. The deleted region contains 65 protein-coding genes, including the ciliary gene IFT43. Although Sanger and exome sequencing did not identify variants in the second IFT43 allele or in other IFT complex A-protein-encoding genes, immunocytochemistry showed increased accumulation of IFT-B proteins at the ciliary tip in patient-derived fibroblasts compared to control cells, demonstrating defective retrograde ciliary transport. This could suggest a ciliary defect in the pathogenesis of this disorder. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai
2017-01-01
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
NASA Technical Reports Server (NTRS)
Lee, W. Y.; Brune, D. C.; LoBrutto, R.; Blankenship, R. E.
1995-01-01
Rubredoxin is a small nonheme iron protein that serves as an electron carrier in bacterial systems. Rubredoxin has now been isolated and characterized from the strictly anaerobic phototroph, Heliobacillus mobilis. THe molecular mass (5671.3 Da from the amino acid sequence) was confirmed and partial formylation of the N-terminal methionyl residue was established by matrix-assisted laser desorption mass spectroscopy. The complete 52-amino-acid sequence was determined by a combination of N-terminal sequencing by Edman degradation and C-terminal sequencing by a novel method using carboxypeptidase treatment in conjunction with amino acid analysis and laser desorption time of flight mass spectrometry. The molar absorption coefficient of Hc. mobilis rubredoxin at 490 nm is 6.9 mM-1 cm-1 and the midpoint redox potential at pH 8.0 is -46 mV. The EPR spectrum of the oxidized form shows resonances at g = 9.66 and 4.30 due to a high-spin ferric iron. The amino acid sequence is homologous to those of rubredoxins from other species, in particular, the gram-positive bacteria, and the phototrophic green sulfur bacteria, and the evolutionary implications of this are discussed.
Gomes, S L; Gober, J W; Shapiro, L
1990-01-01
Caulobacter crescentus has a single dnaK gene that is highly homologous to the hsp70 family of heat shock genes. Analysis of the cloned and sequenced dnaK gene has shown that the deduced amino acid sequence could encode a protein of 67.6 kilodaltons that is 68% identical to the DnaK protein of Escherichia coli and 49% identical to the Drosophila and human hsp70 protein family. A partial open reading frame 165 base pairs 3' to the end of dnaK encodes a peptide of 190 amino acids that is 59% identical to DnaJ of E. coli. Northern blot analysis revealed a single 4.0-kilobase mRNA homologous to the cloned fragment. Since the dnaK coding region is 1.89 kilobases, dnaK and dnaJ may be transcribed as a polycistronic message. S1 mapping and primer extension experiments showed that transcription initiated at two sites 5' to the dnaK coding sequence. A single start site of transcription was identified during heat shock at 42 degrees C, and the predicted promoter sequence conformed to the consensus heat shock promoters of E. coli. At normal growth temperature (30 degrees C), a different start site was identified 3' to the heat shock start site that conformed to the E. coli sigma 70 promoter consensus sequence. S1 protection assays and analysis of expression of the dnaK gene fused to the lux transcription reporter gene showed that expression of dnaK is temporally controlled under normal physiological conditions and that transcription occurs just before the initiation of DNA replication. Thus, in both human cells (I. K. L. Milarski and R. I. Morimoto, Proc. Natl. Acad. Sci. USA 83:9517-9521, 1986) and in a simple bacterium, the transcription of a hsp70 gene is temporally controlled as a function of the cell cycle under normal growth conditions. Images PMID:2345134
Biscola, V; Choiset, Y; Rabesona, H; Chobert, J-M; Haertlé, T; Franco, B D G M
2018-04-13
The objective was to obtain lactic acid bacteria (LAB) capable of hydrolysing immunoreactive proteins in milk, to optimize the hydrolysis, to determine the proteolysis kinetics and to test the safety of the best hydrolytic strain. Brazilian cheese was used as source of LAB capable of hydrolysing main milk allergens. Proteolytic isolates were submitted to RAPD-PCR for the characterization of clonal diversity. Optimized hydrolysis was strain and protein fraction dependent. 16S rDNA sequencing identified three proteolytic strains: Enterococcus faecalis VB43, that hydrolysed α S1 -, α S2 - and β-caseins, α-lactalbumin and β-lactoglobulin (partial hydrolysis), and Pediococcus acidilactici VB90 and Weissella viridescens VB111, that caused partial hydrolysis of α S1 - and α S2 -caseins. Enterococcus faecalis VB43 tested negative for virulence genes asa1, agg, efaA, hyl, esp, cylL L and cylL S but positive for genes ace and gelE. Ethylenediamine tetra-acetic acid inhibited the proteolysis, indicating that the main proteases of E. faecalis VB43 are metalloproteases. Brazilian artisanal cheese is a good source of LAB capable of hydrolysing allergenic proteins in milk. One isolate (E. faecalis VB43) presented outstanding activity against these proteins and lacked most of the tested virulence genes. Enterococcus faecalis VB43 presents good potential for the manufacture of hypoallergenic dairy products. © 2018 The Society for Applied Microbiology.
Rhodes, Ryan G.; Samarasam, Mudiarasan Napoleon; Shrivastava, Abhishek; van Baaren, Jessica M.; Pochiraju, Soumya; Bollampalli, Sreelekha; McBride, Mark J.
2010-01-01
Cells of the gliding bacterium Flavobacterium johnsoniae move rapidly over surfaces. Mutations in gldN cause a partial defect in gliding. A novel bacteriophage selection strategy was used to aid construction of a strain with a deletion spanning gldN and the closely related gene gldO in an otherwise wild-type F. johnsoniae UW101 background. Bacteriophage transduction was used to move a gldN mutation into F. johnsoniae UW101 to allow phenotypic comparison with the gldNO deletion mutant. Cells of the gldN mutant formed nonspreading colonies on agar but retained some ability to glide in wet mounts. In contrast, cells of the gldNO deletion mutant were completely nonmotile, indicating that cells require GldN, or the GldN-like protein GldO, to glide. Recent results suggest that Porphyromonas gingivalis PorN, which is similar in sequence to GldN, has a role in protein secretion across the outer membrane. Cells of the F. johnsoniae gldNO deletion mutant were defective in localization of the motility protein SprB to the cell surface, suggesting that GldN may be involved in secretion of components of the motility machinery. Cells of the gldNO deletion mutant were also deficient in chitin utilization and were resistant to infection by bacteriophages, phenotypes that may also be related to defects in protein secretion. PMID:20038590
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes
López-Garriga, Juan; Cadilla, Carmen L.
2016-01-01
The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
DOE Office of Scientific and Technical Information (OSTI.GOV)
Codina, J.; Olate, J.; Abramowitz, J.
1988-05-15
cDNA cloning has identified the presence in the human genome of three genes encoding ..cap alpha.. subunits of pertussis toxin substrates, generically called G/sub i/. They are named ..cap alpha../sub i/-1, ..cap alpha../sub i/-2 and ..cap alpha../sub i/-3. However, none of these genes has been functionally identified with any of the ..cap alpha.. subunits of several possible G proteins, including pertussis toxin-sensitive G/sub p/'s, stimulatory to phospholipase C or A/sub 2/, G/sub i/, inhibitory to adenylyl cyclase, or G/sub k/, stimulatory to a type of K/sup +/ channels. The authors now report the nucleotide sequence and the complete predicted aminomore » acid sequence of human liver ..cap alpha../sub i/-3 and the partial amino acid sequence of proteolytic fragments of the ..cap alpha.. subunit of human erythrocyte G/sub k/. The amino acid sequence of the proteolytic fragment is uniquely encoded by the cDNA of ..cap alpha../sub i/-3, thus identifying it as ..cap alpha../sub k/. The probable identity of ..cap alpha../sub i/-1 with ..cap alpha../sub p/ and possible roles for ..cap alpha../sub i/-2, as well as additional roles for ..cap alpha../sub i/-1 and ..cap alpha../sub i/-3 (..cap alpha../sub k/) are discussed.« less
RNA editing: trypanosomes rewrite the genetic code.
Stuart, K
1998-01-01
The understanding of how genetic information is stored and expressed has advanced considerably since the "central dogma" asserted that genetic information flows from the nucleotide sequence of DNA to that of messenger RNA (mRNA) which in turn specifies the amino acid sequence of a protein. It was found that genetic information can be stored as RNA (e.g. in RNA viruses) and can flow from RNA to DNA by reverse transcriptase enzyme activity. In addition, some genes contain introns, nucleotide sequences that are removed from their RNA (by RNA splicing) and thus are not represented in the resultant protein. Furthermore, alternative splicing was found to produce variant proteins from a single gene. More recently, the study of trypanosome parasites revealed an unexpected and indeed counter-intuitive genetic complexity. Genetic information for a single protein can be dispersed among several (DNA) genes in these organisms. One of these genes specifies an encrypted precursor mRNA that is converted to a functional mRNA by a process called RNA editing that inserts and deletes uridylate nucleotides. The sequence of the edited mRNA is specified by multiple small RNAs, named guide RNAs, (gRNAs) each of which is encoded in a separate gene. Thus, edited mRNA sequences are assembled from multiple genes by the transfer of information from one type of RNA to another. The existence of editing was surprising but has stimulated the discovery of other types of RNA editing. The Stuart laboratory has been exploring RNA editing in trypanosomes from the time of its discovery. They found dramatic differences between the mitochondrial gene sequences and those of the corresponding mRNAs, which indicated editing by the insertion and deletion of uridylates. Some editing was modest; simply eliminating shifts in sequence register of minimally extending the protein coding sequence. However, editing of many mRNAs was startingly extensive. The RNA sequence was essentially entirely remodeled with its sequence more the result of editing than the gene sequence. The identities of genes for such extensively edited RNA were not recognizable from the DNA sequence but they were readily identifiable from the edited mRNA sequence. Thus, despite the complex and extensive editing the resultant mRNA sequence is precise. Characterization of partially edited RNAs indicated that editing proceeds in the direction opposite to that used to specify the protein which reflects the use of the gRNAs. The numerous gRNAs that are used for editing are encoded in the DNA molecules whose role was previously a mystery. Using information gained in our earlier studies, the Stuart group developed an in vitro system that reproduces the fundamental process of editing in order to resolve the mechanism by which it occurs. They determined that editing entails a series of enzymatic steps rather than the mechanism used in RNA splicing. They also showed that chimeric gRNA-mRNA molecules are aberrant by-products of editing rather than intermediates in the process as had been proposed. Additional studies are exploring precisely how the number of added and deleted uridylates is specified by the gRNA. The Stuart laboratory showed that editing is performed by an aggregation of enzymes that catalyze the separate steps of editing. It also developed a method to purify this multimolecule complex that contains several, perhaps tens of, proteins. This will allow the study of its composition and the functions of its component parts. Indeed, the gene for one component has been identified and its detailed characterization begun. These studies are developing tools to explore related processes. An early finding in the lab was that the various mRNAs are differentially edited during the life cycle of the parasite. The pattern of this editing indicates that editing serves to regulate the alternation between two modes of energy generation. This regulation is coordinated with other events that are occurring during the life c
Molecular characterization of canine parvovirus in Vientiane, Laos.
Vannamahaxay, Soulasack; Vongkhamchanh, Souliya; Intanon, Montira; Tangtrongsup, Sahatchai; Tiwananthagorn, Saruda; Pringproa, Kidsadagon; Chuammitri, Phongsakorn
2017-05-01
The global emergence of canine parvovirus type 2c (CPV-2c) has been well documented. In the present study, 139 rectal swab samples collected from diarrheic dogs living in Vientiane, Laos, in 2016 were tested for the presence of the canine parvovirus (CPV) VP2 gene by PCR. The results showed that 82.73% (115/139) of dogs were CPV positive by PCR. The partial VP2 gene was sequenced in 94 of the positive samples; 91 samples belonged to CPV-2c (426Glu) subtype, while 3 samples belonged to the CPV-2a (426Asn) subtype. Notably, phylogenetic analysis of amino acid sequences revealed a close relationship between Laotian isolates and novel Chinese CPV-2c isolates. In Laotian CPV isolates, aligned protein sequences indicated a high rate of residue substitutions at positions 305, 324, 345, 370, 375, and 426 in the GH loop. The mutation at residue 370 (Q370R), a single mutation, was characterized as a unique mutant residue specific to the Laotian CPV-2c variant.
Monitoring of canine parvovirus (CPV) strains detected in vaccinated puppies in Brazil.
Castro, T X; Costa, E M; Leite, J P; Labarthe, N V; Cubel Garcia, R C N
2011-04-01
The objective of this study was to investigate, by partial sequencing of VP2 protein, the variability of CPV detected in 37 fecal samples collected from vaccinated puppies with enteritis. Laboratorial diagnosis of CPV was confirmed by HA/HI and PCR and, for sequencing analyses, two different regions of the VP2 gene were amplified by PCR. From 1995 to 2004, all strains were characterized as CPV-2a. After that, both CPV-2a and CPV-2b were detected. All CPV-2a showed a non-synonymous mutation in the residue 297 (Ser→Ala). A synonymous substitution at the AA 574 was also observed in 15/37 samples. Our findings indicate that the cases of vaccine failure are most likely not associated to the mutations detected in the sequenced regions. However, the monitoring of genotyping mutations that led to new CPV strains is essential to determinate if current vaccines will keep providing protection against all new future variants. Copyright © 2010 Elsevier Ltd. All rights reserved.
Characterizing partial AZFc deletions of the Y chromosome with amplicon-specific sequence markers
Navarro-Costa, Paulo; Pereira, Luísa; Alves, Cíntia; Gusmão, Leonor; Proença, Carmen; Marques-Vidal, Pedro; Rocha, Tiago; Correia, Sónia C; Jorge, Sónia; Neves, António; Soares, Ana P; Nunes, Joaquim; Calhaz-Jorge, Carlos; Amorim, António; Plancha, Carlos E; Gonçalves, João
2007-01-01
Background The AZFc region of the human Y chromosome is a highly recombinogenic locus containing multi-copy male fertility genes located in repeated DNA blocks (amplicons). These AZFc gene families exhibit slight sequence variations between copies which are considered to have functional relevance. Yet, partial AZFc deletions yield phenotypes ranging from normospermia to azoospermia, thwarting definite conclusions on their real impact on fertility. Results The amplicon content of partial AZFc deletion products was characterized with novel amplicon-specific sequence markers. Data indicate that partial AZFc deletions are a male infertility risk [odds ratio: 5.6 (95% CI: 1.6–30.1)] and although high diversity of partial deletion products and sequence conversion profiles were recorded, the AZFc marker profiles detected in fertile men were also observed in infertile men. Additionally, the assessment of rearrangement recurrence by Y-lineage analysis indicated that while partial AZFc deletions occurred in highly diverse samples, haplotype diversity was minimal in fertile men sharing identical marker profiles. Conclusion Although partial AZFc deletion products are highly heterogeneous in terms of amplicon content, this plasticity is not sufficient to account for the observed phenotypical variance. The lack of causative association between the deletion of specific gene copies and infertility suggests that AZFc gene content might be part of a multifactorial network, with Y-lineage evolution emerging as a possible phenotype modulator. PMID:17903263
Structure of the virulence-associated protein VapD from the intracellular pathogen Rhodococcus equi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whittingham, Jean L.; Blagova, Elena V.; Finn, Ciaran E.
2014-08-01
VapD is one of a set of highly homologous virulence-associated proteins from the multi-host pathogen Rhodococcus equi. The crystal structure reveals an eight-stranded β-barrel with a novel fold and a glycine rich ‘bald’ surface. Rhodococcus equi is a multi-host pathogen that infects a range of animals as well as immune-compromised humans. Equine and porcine isolates harbour a virulence plasmid encoding a homologous family of virulence-associated proteins associated with the capacity of R. equi to divert the normal processes of endosomal maturation, enabling bacterial survival and proliferation in alveolar macrophages. To provide a basis for probing the function of the Vapmore » proteins in virulence, the crystal structure of VapD was determined. VapD is a monomer as determined by multi-angle laser light scattering. The structure reveals an elliptical, compact eight-stranded β-barrel with a novel strand topology and pseudo-twofold symmetry, suggesting evolution from an ancestral dimer. Surface-associated octyl-β-d-glucoside molecules may provide clues to function. Circular-dichroism spectroscopic analysis suggests that the β-barrel structure is preceded by a natively disordered region at the N-terminus. Sequence comparisons indicate that the core folds of the other plasmid-encoded virulence-associated proteins from R. equi strains are similar to that of VapD. It is further shown that sequences encoding putative R. equi Vap-like proteins occur in diverse bacterial species. Finally, the functional implications of the structure are discussed in the light of the unique structural features of VapD and its partial structural similarity to other β-barrel proteins.« less
Meereis, Florian; Kaufmann, Michael
2004-10-15
The rapidly increasing number of completely sequenced genomes led to the establishment of the COG-database which, based on sequence homologies, assigns similar proteins from different organisms to clusters of orthologous groups (COGs). There are several bioinformatic studies that made use of this database to determine (hyper)thermophile-specific proteins by searching for COGs containing (almost) exclusively proteins from (hyper)thermophilic genomes. However, public software to perform individually definable group-specific searches is not available. The tool described here exactly fills this gap. The software is accessible at http://www.uni-wh.de/pcogr and is linked to the COG-database. The user can freely define two groups of organisms by selecting for each of the (current) 66 organisms to belong either to groupA, to the reference groupB or to be ignored by the algorithm. Then, for all COGs a specificity index is calculated with respect to the specificity to groupA, i. e. high scoring COGs contain proteins from the most of groupA organisms while proteins from the most organisms assigned to groupB are absent. In addition to ranking all COGs according to the user defined specificity criteria, a graphical visualization shows the distribution of all COGs by displaying their abundance as a function of their specificity indexes. This software allows detecting COGs specific to a predefined group of organisms. All COGs are ranked in the order of their specificity and a graphical visualization allows recognizing (i) the presence and abundance of such COGs and (ii) the phylogenetic relationship between groupA- and groupB-organisms. The software also allows detecting putative protein-protein interactions, novel enzymes involved in only partially known biochemical pathways, and alternate enzymes originated by convergent evolution.
Evidence for Horizontal Gene Transfer in Evolution of Elongation Factor Tu in Enterococci
Ke, Danbing; Boissinot, Maurice; Huletsky, Ann; Picard, François J.; Frenette, Johanne; Ouellette, Marc; Roy, Paul H.; Bergeron, Michel G.
2000-01-01
The elongation factor Tu, encoded by tuf genes, is a GTP binding protein that plays a central role in protein synthesis. One to three tuf genes per genome are present, depending on the bacterial species. Most low-G+C-content gram-positive bacteria carry only one tuf gene. We have designed degenerate PCR primers derived from consensus sequences of the tuf gene to amplify partial tuf sequences from 17 enterococcal species and other phylogenetically related species. The amplified DNA fragments were sequenced either by direct sequencing or by sequencing cloned inserts containing putative amplicons. Two different tuf genes (tufA and tufB) were found in 11 enterococcal species, including Enterococcus avium, Enterococcus casseliflavus, Enterococcus dispar, Enterococcus durans, Enterococcus faecium, Enterococcus gallinarum, Enterococcus hirae, Enterococcus malodoratus, Enterococcus mundtii, Enterococcus pseudoavium, and Enterococcus raffinosus. For the other six enterococcal species (Enterococcus cecorum, Enterococcus columbae, Enterococcus faecalis, Enterococcus sulfureus, Enterococcus saccharolyticus, and Enterococcus solitarius), only the tufA gene was present. Based on 16S rRNA gene sequence analysis, the 11 species having two tuf genes all have a common ancestor, while the six species having only one copy diverged from the enterococcal lineage before that common ancestor. The presence of one or two copies of the tuf gene in enterococci was confirmed by Southern hybridization. Phylogenetic analysis of tuf sequences demonstrated that the enterococcal tufA gene branches with the Bacillus, Listeria, and Staphylococcus genera, while the enterococcal tufB gene clusters with the genera Streptococcus and Lactococcus. Primary structure analysis showed that four amino acid residues encoded within the sequenced regions are conserved and unique to the enterococcal tufB genes and the tuf genes of streptococci and Lactococcus lactis. The data suggest that an ancestral streptococcus or a streptococcus-related species may have horizontally transferred a tuf gene to the common ancestor of the 11 enterococcal species which now carry two tuf genes. PMID:11092850
Hallmarks of Hepatitis C Virus in Equine Hepacivirus
Tanaka, Tomohisa; Kasai, Hirotake; Yamashita, Atsuya; Okuyama-Dobashi, Kaori; Yasumoto, Jun; Maekawa, Shinya; Enomoto, Nobuyuki; Okamoto, Toru; Matsuura, Yoshiharu; Morimatsu, Masami; Manabe, Noboru; Ochiai, Kazuhiko; Yamashita, Kazuto
2014-01-01
ABSTRACT Equine hepacivirus (EHcV) has been identified as a closely related homologue of hepatitis C virus (HCV) in the United States, the United Kingdom, and Germany, but not in Asian countries. In this study, we genetically and serologically screened 31 serum samples obtained from Japanese-born domestic horses for EHcV infection and subsequently identified 11 PCR-positive and 7 seropositive serum samples. We determined the full sequence of the EHcV genome, including the 3′ untranslated region (UTR), which had previously not been completely revealed. The polyprotein of a Japanese EHcV strain showed approximately 95% homology to those of the reported strains. HCV-like cis-acting RNA elements, including the stem-loop structures of the 3′ UTR and kissing-loop interaction were deduced from regions around both UTRs of the EHcV genome. A comparison of the EHcV and HCV core proteins revealed that Ile190 and Phe191 of the EHcV core protein could be important for cleavage of the core protein by signal peptide peptidase (SPP) and were replaced with Ala and Leu, respectively, which inhibited intramembrane cleavage of the EHcV core protein. The loss-of-function mutant of SPP abrogated intramembrane cleavage of the EHcV core protein and bound EHcV core protein, suggesting that the EHcV core protein may be cleaved by SPP to become a mature form. The wild-type EHcV core protein, but not the SPP-resistant mutant, was localized on lipid droplets and partially on the lipid raft-like membrane in a manner similar to that of the HCV core protein. These results suggest that EHcV may conserve the genetic and biological properties of HCV. IMPORTANCE EHcV, which shows the highest amino acid or nucleotide homology to HCV among hepaciviruses, was previously reported to infect horses from Western, but not Asian, countries. We herein report EHcV infection in Japanese-born horses. In this study, HCV-like RNA secondary structures around both UTRs were predicted by determining the whole-genome sequence of EHcV. Our results also suggest that the EHcV core protein is cleaved by SPP to become a mature form and then is localized on lipid droplets and partially on lipid raft-like membranes in a manner similar to that of the HCV core protein. Hence, EHcV was identified as a closely related homologue of HCV based on its genetic structure as well as its biological properties. A clearer understanding of the epidemiology, genetic structure, and infection mechanism of EHcV will assist in elucidating the evolution of hepaciviruses as well as the development of surrogate models for the study of HCV. PMID:25210167
Hallmarks of hepatitis C virus in equine hepacivirus.
Tanaka, Tomohisa; Kasai, Hirotake; Yamashita, Atsuya; Okuyama-Dobashi, Kaori; Yasumoto, Jun; Maekawa, Shinya; Enomoto, Nobuyuki; Okamoto, Toru; Matsuura, Yoshiharu; Morimatsu, Masami; Manabe, Noboru; Ochiai, Kazuhiko; Yamashita, Kazuto; Moriishi, Kohji
2014-11-01
Equine hepacivirus (EHcV) has been identified as a closely related homologue of hepatitis C virus (HCV) in the United States, the United Kingdom, and Germany, but not in Asian countries. In this study, we genetically and serologically screened 31 serum samples obtained from Japanese-born domestic horses for EHcV infection and subsequently identified 11 PCR-positive and 7 seropositive serum samples. We determined the full sequence of the EHcV genome, including the 3' untranslated region (UTR), which had previously not been completely revealed. The polyprotein of a Japanese EHcV strain showed approximately 95% homology to those of the reported strains. HCV-like cis-acting RNA elements, including the stem-loop structures of the 3' UTR and kissing-loop interaction were deduced from regions around both UTRs of the EHcV genome. A comparison of the EHcV and HCV core proteins revealed that Ile(190) and Phe(191) of the EHcV core protein could be important for cleavage of the core protein by signal peptide peptidase (SPP) and were replaced with Ala and Leu, respectively, which inhibited intramembrane cleavage of the EHcV core protein. The loss-of-function mutant of SPP abrogated intramembrane cleavage of the EHcV core protein and bound EHcV core protein, suggesting that the EHcV core protein may be cleaved by SPP to become a mature form. The wild-type EHcV core protein, but not the SPP-resistant mutant, was localized on lipid droplets and partially on the lipid raft-like membrane in a manner similar to that of the HCV core protein. These results suggest that EHcV may conserve the genetic and biological properties of HCV. EHcV, which shows the highest amino acid or nucleotide homology to HCV among hepaciviruses, was previously reported to infect horses from Western, but not Asian, countries. We herein report EHcV infection in Japanese-born horses. In this study, HCV-like RNA secondary structures around both UTRs were predicted by determining the whole-genome sequence of EHcV. Our results also suggest that the EHcV core protein is cleaved by SPP to become a mature form and then is localized on lipid droplets and partially on lipid raft-like membranes in a manner similar to that of the HCV core protein. Hence, EHcV was identified as a closely related homologue of HCV based on its genetic structure as well as its biological properties. A clearer understanding of the epidemiology, genetic structure, and infection mechanism of EHcV will assist in elucidating the evolution of hepaciviruses as well as the development of surrogate models for the study of HCV. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Novel primers for complete mitochondrial cytochrome b genesequencing in mammals
Naidu, Ashwin; Fitak, Robert R.; Munguia-Vega, Adrian; Culver, Melanie
2011-01-01
Sequence-based species identification relies on the extent and integrity of sequence data available in online databases such as GenBank. When identifying species from a sample of unknown origin, partial DNA sequences obtained from the sample are aligned against existing sequences in databases. When the sequence from the matching species is not present in the database, high-scoring alignments with closely related sequences might produce unreliable results on species identity. For species identification in mammals, the cytochrome b (cyt b) gene has been identified to be highly informative; thus, large amounts of reference sequence data from the cyt b gene are much needed. To enhance availability of cyt b gene sequence data on a large number of mammalian species in GenBank and other such publicly accessible online databases, we identified a primer pair for complete cyt b gene sequencing in mammals. Using this primer pair, we successfully PCR amplified and sequenced the complete cyt b gene from 40 of 44 mammalian species representing 10 orders of mammals. We submitted 40 complete, correctly annotated, cyt b protein coding sequences to GenBank. To our knowledge, this is the first single primer pair to amplify the complete cyt b gene in a broad range of mammalian species. This primer pair can be used for the addition of new cyt b gene sequences and to enhance data available on species represented in GenBank. The availability of novel and complete gene sequences as high-quality reference data can improve the reliability of sequence-based species identification.
Khan, Sehroon; Nadir, Sadia; Lihua, Guo; Xu, Jianchu; Holmes, Keith A; Dewen, Qiu
2016-01-01
An insect-toxic protein, Bb70p, was purified from Beauveria bassiana 70 using ammonium sulfate precipitation, ion exchange chromatography, and gel filtration. Bb70p has a high affinity for anion exchangers and 2D electrophoresis results revealed a single spot with a molecular weight of 35.5 kDa and an iso-electric point of ∼4.5. Bb70p remains active from 4 to 60°C, within a pH range of 4-10, but is more active in slightly acidic pH. A pure protein, Bb70p does not have any carbohydrate side chains. The protein caused high mortality by intra-haemocelic injection into Galleria mellonella with LD50 of 334.4 μg/g body weight and activates the phenol oxidase cascade. With a partial amino acid sequence comparison using the NCBI database, we showed no homology to known toxin proteins of entomopathogenic fungi. Thus, Bb70p appears to be an insect toxin protein, demonstrating novelty. Identification of this insect-toxic protein presents potential to enhance the virulence of B. bassiana through genetic manipulation. Copyright © 2015 Elsevier Inc. All rights reserved.
Structural alphabets derived from attractors in conformational space
2010-01-01
Background The hierarchical and partially redundant nature of protein structures justifies the definition of frequently occurring conformations of short fragments as 'states'. Collections of selected representatives for these states define Structural Alphabets, describing the most typical local conformations within protein structures. These alphabets form a bridge between the string-oriented methods of sequence analysis and the coordinate-oriented methods of protein structure analysis. Results A Structural Alphabet has been derived by clustering all four-residue fragments of a high-resolution subset of the protein data bank and extracting the high-density states as representative conformational states. Each fragment is uniquely defined by a set of three independent angles corresponding to its degrees of freedom, capturing in simple and intuitive terms the properties of the conformational space. The fragments of the Structural Alphabet are equivalent to the conformational attractors and therefore yield a most informative encoding of proteins. Proteins can be reconstructed within the experimental uncertainty in structure determination and ensembles of structures can be encoded with accuracy and robustness. Conclusions The density-based Structural Alphabet provides a novel tool to describe local conformations and it is specifically suitable for application in studies of protein dynamics. PMID:20170534
Genetic Diversity of Crimean Congo Hemorrhagic Fever Virus Strains from Iran
Chinikar, Sadegh; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Nowotny, Norbert; Fooks, Anthony R.; Shah-Hosseini, Nariman
2016-01-01
Background: Crimean Congo hemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family and Nairovirus genus. It has a negative-sense, single stranded RNA genome approximately 19.2 kb, containing the Small, Medium, and Large segments. CCHFVs are relatively divergent in their genome sequence and grouped in seven distinct clades based on S-segment sequence analysis and six clades based on M-segment sequences. Our aim was to obtain new insights into the molecular epidemiology of CCHFV in Iran. Methods: We analyzed partial and complete nucleotide sequences of the S and M segments derived from 50 Iranian patients. The extracted RNA was amplified using one-step RT-PCR and then sequenced. The sequences were analyzed using Mega5 software. Results: Phylogenetic analysis of partial S segment sequences demonstrated that clade IV-(Asia 1), clade IV-(Asia 2) and clade V-(Europe) accounted for 80 %, 4 % and 14 % of the circulating genomic variants of CCHFV in Iran respectively. However, one of the Iranian strains (Iran-Kerman/22) was associated with none of other sequences and formed a new clade (VII). The phylogenetic analysis of complete S-segment nucleotide sequences from selected Iranian CCHFV strains complemented with representative strains from GenBank revealed similar topology as partial sequences with eight major clusters. A partial M segment phylogeny positioned the Iranian strains in either association with clade III (Asia-Africa) or clade V (Europe). Conclusion: The phylogenetic analysis revealed subtle links between distant geographic locations, which we propose might originate either from international livestock trade or from long-distance carriage of CCHFV by infected ticks via bird migration. PMID:27308271
Molecular confirmation of Hepatozoon canis in Mauritius.
Daskalaki, Aikaterini Alexandra; Ionică, Angela Monica; Jeetah, Keshav; Gherman, Călin Mircea; Mihalca, Andrei Daniel
2018-01-01
In this study, Hepatozoon species was molecularly identified and characterized for the first time on the Indian Ocean island of Mauritius. Partial sequences of the 18S rRNA gene of the Hepatozoon isolates were analysed from three naturally infected dogs. The sequences of H. canis were similar to the 18S rRNA partial sequences (JX112783, AB365071 99%) from dog blood samples from West Indies and Nigeria. Our sequences were deposited in the GenBank database. Copyright © 2017 Elsevier B.V. All rights reserved.
Jimenez-Guardeño, Jose M; Regla-Nava, Jose A; Nieto-Torres, Jose L; DeDiego, Marta L; Castaño-Rodriguez, Carlos; Fernandez-Delgado, Raul; Perlman, Stanley; Enjuanes, Luis
2015-10-01
A SARS-CoV lacking the full-length E gene (SARS-CoV-∆E) was attenuated and an effective vaccine. Here, we show that this mutant virus regained fitness after serial passages in cell culture or in vivo, resulting in the partial duplication of the membrane gene or in the insertion of a new sequence in gene 8a, respectively. The chimeric proteins generated in cell culture increased virus fitness in vitro but remained attenuated in mice. In contrast, during SARS-CoV-∆E passage in mice, the virus incorporated a mutated variant of 8a protein, resulting in reversion to a virulent phenotype. When the full-length E protein was deleted or its PDZ-binding motif (PBM) was mutated, the revertant viruses either incorporated a novel chimeric protein with a PBM or restored the sequence of the PBM on the E protein, respectively. Similarly, after passage in mice, SARS-CoV-∆E protein 8a mutated, to now encode a PBM, and also regained virulence. These data indicated that the virus requires a PBM on a transmembrane protein to compensate for removal of this motif from the E protein. To increase the genetic stability of the vaccine candidate, we introduced small attenuating deletions in E gene that did not affect the endogenous PBM, preventing the incorporation of novel chimeric proteins in the virus genome. In addition, to increase vaccine biosafety, we introduced additional attenuating mutations into the nsp1 protein. Deletions in the carboxy-terminal region of nsp1 protein led to higher host interferon responses and virus attenuation. Recombinant viruses including attenuating mutations in E and nsp1 genes maintained their attenuation after passage in vitro and in vivo. Further, these viruses fully protected mice against challenge with the lethal parental virus, and are therefore safe and stable vaccine candidates for protection against SARS-CoV.
Nieto-Torres, Jose L.; DeDiego, Marta L.; Castaño-Rodriguez, Carlos; Fernandez-Delgado, Raul; Perlman, Stanley; Enjuanes, Luis
2015-01-01
A SARS-CoV lacking the full-length E gene (SARS-CoV-∆E) was attenuated and an effective vaccine. Here, we show that this mutant virus regained fitness after serial passages in cell culture or in vivo, resulting in the partial duplication of the membrane gene or in the insertion of a new sequence in gene 8a, respectively. The chimeric proteins generated in cell culture increased virus fitness in vitro but remained attenuated in mice. In contrast, during SARS-CoV-∆E passage in mice, the virus incorporated a mutated variant of 8a protein, resulting in reversion to a virulent phenotype. When the full-length E protein was deleted or its PDZ-binding motif (PBM) was mutated, the revertant viruses either incorporated a novel chimeric protein with a PBM or restored the sequence of the PBM on the E protein, respectively. Similarly, after passage in mice, SARS-CoV-∆E protein 8a mutated, to now encode a PBM, and also regained virulence. These data indicated that the virus requires a PBM on a transmembrane protein to compensate for removal of this motif from the E protein. To increase the genetic stability of the vaccine candidate, we introduced small attenuating deletions in E gene that did not affect the endogenous PBM, preventing the incorporation of novel chimeric proteins in the virus genome. In addition, to increase vaccine biosafety, we introduced additional attenuating mutations into the nsp1 protein. Deletions in the carboxy-terminal region of nsp1 protein led to higher host interferon responses and virus attenuation. Recombinant viruses including attenuating mutations in E and nsp1 genes maintained their attenuation after passage in vitro and in vivo. Further, these viruses fully protected mice against challenge with the lethal parental virus, and are therefore safe and stable vaccine candidates for protection against SARS-CoV. PMID:26513244
MP3: a software tool for the prediction of pathogenic proteins in genomic and metagenomic data.
Gupta, Ankit; Kapil, Rohan; Dhakan, Darshan B; Sharma, Vineet K
2014-01-01
The identification of virulent proteins in any de-novo sequenced genome is useful in estimating its pathogenic ability and understanding the mechanism of pathogenesis. Similarly, the identification of such proteins could be valuable in comparing the metagenome of healthy and diseased individuals and estimating the proportion of pathogenic species. However, the common challenge in both the above tasks is the identification of virulent proteins since a significant proportion of genomic and metagenomic proteins are novel and yet unannotated. The currently available tools which carry out the identification of virulent proteins provide limited accuracy and cannot be used on large datasets. Therefore, we have developed an MP3 standalone tool and web server for the prediction of pathogenic proteins in both genomic and metagenomic datasets. MP3 is developed using an integrated Support Vector Machine (SVM) and Hidden Markov Model (HMM) approach to carry out highly fast, sensitive and accurate prediction of pathogenic proteins. It displayed Sensitivity, Specificity, MCC and accuracy values of 92%, 100%, 0.92 and 96%, respectively, on blind dataset constructed using complete proteins. On the two metagenomic blind datasets (Blind A: 51-100 amino acids and Blind B: 30-50 amino acids), it displayed Sensitivity, Specificity, MCC and accuracy values of 82.39%, 97.86%, 0.80 and 89.32% for Blind A and 71.60%, 94.48%, 0.67 and 81.86% for Blind B, respectively. In addition, the performance of MP3 was validated on selected bacterial genomic and real metagenomic datasets. To our knowledge, MP3 is the only program that specializes in fast and accurate identification of partial pathogenic proteins predicted from short (100-150 bp) metagenomic reads and also performs exceptionally well on complete protein sequences. MP3 is publicly available at http://metagenomics.iiserb.ac.in/mp3/index.php.
MP3: A Software Tool for the Prediction of Pathogenic Proteins in Genomic and Metagenomic Data
Gupta, Ankit; Kapil, Rohan; Dhakan, Darshan B.; Sharma, Vineet K.
2014-01-01
The identification of virulent proteins in any de-novo sequenced genome is useful in estimating its pathogenic ability and understanding the mechanism of pathogenesis. Similarly, the identification of such proteins could be valuable in comparing the metagenome of healthy and diseased individuals and estimating the proportion of pathogenic species. However, the common challenge in both the above tasks is the identification of virulent proteins since a significant proportion of genomic and metagenomic proteins are novel and yet unannotated. The currently available tools which carry out the identification of virulent proteins provide limited accuracy and cannot be used on large datasets. Therefore, we have developed an MP3 standalone tool and web server for the prediction of pathogenic proteins in both genomic and metagenomic datasets. MP3 is developed using an integrated Support Vector Machine (SVM) and Hidden Markov Model (HMM) approach to carry out highly fast, sensitive and accurate prediction of pathogenic proteins. It displayed Sensitivity, Specificity, MCC and accuracy values of 92%, 100%, 0.92 and 96%, respectively, on blind dataset constructed using complete proteins. On the two metagenomic blind datasets (Blind A: 51–100 amino acids and Blind B: 30–50 amino acids), it displayed Sensitivity, Specificity, MCC and accuracy values of 82.39%, 97.86%, 0.80 and 89.32% for Blind A and 71.60%, 94.48%, 0.67 and 81.86% for Blind B, respectively. In addition, the performance of MP3 was validated on selected bacterial genomic and real metagenomic datasets. To our knowledge, MP3 is the only program that specializes in fast and accurate identification of partial pathogenic proteins predicted from short (100–150 bp) metagenomic reads and also performs exceptionally well on complete protein sequences. MP3 is publicly available at http://metagenomics.iiserb.ac.in/mp3/index.php. PMID:24736651
Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.
Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier
2014-01-01
Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.
Emergence of Vaccine-derived Polioviruses, Democratic Republic of Congo, 2004–2011
Lentsoane, Olivia; Burns, Cara C.; Pallansch, Mark; de Gourville, Esther; Yogolelo, Riziki; Muyembe-Tamfum, Jean Jacques; Puren, Adrian; Schoub, Barry D.; Venter, Marietjie
2013-01-01
Polioviruses isolated from 70 acute flaccid paralysis patients from the Democratic Republic of Congo (DRC) during 2004–2011 were characterized and found to be vaccine-derived type 2 polioviruses (VDPV2s). Partial genomic sequencing of the isolates revealed nucleotide sequence divergence of up to 3.5% in the viral protein 1 capsid region of the viral genome relative to the Sabin vaccine strain. Genetic analysis identified at least 7 circulating lineages localized to specific geographic regions. Multiple independent events of VDPV2 emergence occurred throughout DRC during this 7-year period. During 2010–2011, VDPV2 circulation in eastern DRC occurred in an area distinct from that of wild poliovirus circulation, whereas VDPV2 circulation in the southwestern part of DRC (in Kasai Occidental) occurred within the larger region of wild poliovirus circulation. PMID:24047933
Emergence of vaccine-derived polioviruses, Democratic Republic of Congo, 2004-2011.
Gumede, Nicksy; Lentsoane, Olivia; Burns, Cara C; Pallansch, Mark; de Gourville, Esther; Yogolelo, Riziki; Muyembe-Tamfum, Jean Jacques; Puren, Adrian; Schoub, Barry D; Venter, Marietjie
2013-10-01
Polioviruses isolated from 70 acute flaccid paralysis patients from the Democratic Republic of Congo (DRC) during 2004-2011 were characterized and found to be vaccine-derived type 2 polioviruses (VDPV2s). Partial genomic sequencing of the isolates revealed nucleotide sequence divergence of up to 3.5% in the viral protein 1 capsid region of the viral genome relative to the Sabin vaccine strain. Genetic analysis identified at least 7 circulating lineages localized to specific geographic regions. Multiple independent events of VDPV2 emergence occurred throughout DRC during this 7-year period. During 2010-2011, VDPV2 circulation in eastern DRC occurred in an area distinct from that of wild poliovirus circulation, whereas VDPV2 circulation in the southwestern part of DRC (in Kasai Occidental) occurred within the larger region of wild poliovirus circulation.
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.
Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai
2015-12-01
The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W
2011-01-01
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.
2011-01-01
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Wopereis, Harm; Sim, Kathleen; Shaw, Alexander; Warner, John O; Knol, Jan; Kroll, J Simon
2018-04-01
Development of the gut microbiota in infancy is important in maturation of the immune system. Deviations in colonization patterns have been associated with allergic manifestations such as eczema, but exact microbiome dysfunctions underlying allergies remain unclear. We studied the gut microbiota of 138 infants at increased risk of allergy, participating in a clinical trial investigating the effectiveness of a partially hydrolyzed protein formula supplemented with nondigestible oligosaccharides on the prevention of eczema. The effects of interventions and breast-feeding on fecal microbiota were investigated. Additionally, we aimed to identify microbial patterns associated with the onset of eczema. Bacterial taxonomic compositions in the first 26 weeks of life were analyzed by using 16S rRNA gene sequencing. Additionally, fecal pH and microbial metabolite levels were measured. Fecal microbial composition, metabolites, and pH of infants receiving partially hydrolyzed protein formula supplemented with nondigestible oligosaccharides was closer to that of breast-fed infants than that of infants receiving standard cow's milk formula. Infants with eczema by 18 months showed discordant development of bacterial genera of Enterobacteriaceae and Parabacteroides species in the first 26 weeks, as well as decreased acquisition of lactate-utilizing bacteria producing butyrate, namely Eubacterium and Anaerostipes species, supported by increased lactate and decreased butyrate levels. We showed that a partially hydrolyzed protein infant formula with specific prebiotics modulated the gut microbiota closer to that of breast-fed infants. Additionally, we identified a potential link between microbial activity and onset of eczema, which might reflect a suboptimal implementation of gut microbiota at specific developmental stages in infants at high risk for allergy. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Characterization of an outbreak of astroviral diarrhea in a group of cheetahs (Acinonyx jubatus).
Atkins, Adrienne; Wellehan, James F X; Childress, April L; Archer, Linda L; Fraser, William A; Citino, Scott B
2009-04-14
A Mamastrovirus was identified in an outbreak of diarrhea in cheetahs (Acinonyx jubatus). Five young adult and two adult cheetahs presented with lethargy, anorexia, watery diarrhea and regurgitation over an 11-day period. Fecal samples were submitted for electron microscopy and culture. Electron microscopy results revealed particles morphologically consistent with an astrovirus, and no other viral pathogens or significant bacterial pathogens were identified. The astrovirus was confirmed and sequenced using consensus astroviral PCR, resulting in a 367 base pair partial RNA-dependent-RNA polymerase (RdRp) product and a 628 base pair partial capsid product. Bayesian and maximum likelihood phylogenetic analyses were performed on both the RdRp and the capsid protein segments. All animals were monitored and treated with bismuth subsalicylate tablets (524mg PO BID for 5 days), and recovered without additional intervention. This is the first report we are aware of documenting an astrovirus outbreak in cheetah.
Molecular analysis of the NDP gene in two families with Norrie disease.
Rivera-Vega, M Refugio; Chiñas-Lopez, Silvet; Vaca, Ana Luisa Jimenez; Arenas-Sordo, M Luz; Kofman-Alfaro, Susana; Messina-Baas, Olga; Cuevas-Covarrubias, Sergio Alberto
2005-04-01
To describe the molecular defects in the Norrie disease protein (NDP) gene in two families with Norrie disease (ND). We analysed two families with ND at molecular level through polymerase chain reaction, DNA sequence analysis and GeneScan. Two molecular defects found in the NDP gene were: a missense mutation (265C > G) within codon 97 that resulted in the interchange of arginine by proline, and a partial deletion in the untranslated 3' region of exon 3 of the NDP gene. Clinical findings were more severe in the family that presented the partial deletion. We also diagnosed the carrier status of one daughter through GeneScan; this method proved to be a useful tool for establishing female carriers of ND. Here we report two novel mutations in the NDP gene in Mexican patients and propose that GeneScan is a viable mean of establishing ND carrier status.
Plantinga, Matthew J; Korennykh, Alexei V; Piccirilli, Joseph A; Correll, Carl C
2008-08-26
Restrictocin, a member of the alpha-sarcin family of site-specific endoribonucleases, uses electrostatic interactions to bind to the ribosome and to RNA oligonucleotides, including the minimal specific substrate, the sarcin/ricin loop (SRL) of 23S-28S rRNA. Restrictocin binds to the SRL by forming a ground-state E:S complex that is stabilized predominantly by Coulomb interactions and depends on neither the sequence nor structure of the RNA, suggesting a nonspecific complex. The 22 cationic residues of restrictocin are dispersed throughout this protein surface, complicating a priori identification of a Coulomb interacting surface. Structural studies have identified an enzyme-substrate interface, which is expected to overlap with the electrostatic E:S interface. Here, we identified restrictocin residues that contribute to binding in the E:S complex by determining the salt dependence [partial differential log(k 2/ K 1/2)/ partial differential log[KCl
Folding and Stabilization of Native-Sequence-Reversed Proteins
Zhang, Yuanzhao; Weber, Jeffrey K; Zhou, Ruhong
2016-01-01
Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols. PMID:27113844
Folding and Stabilization of Native-Sequence-Reversed Proteins
NASA Astrophysics Data System (ADS)
Zhang, Yuanzhao; Weber, Jeffrey K.; Zhou, Ruhong
2016-04-01
Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols.
Yong, Hoi-Sen; Lim, Phaik-Eem; Eamsobhana, Praphathip
2017-01-01
The tephritid fruit fly Zeugodacus tau (Walker) is a polyphagous fruit pest of economic importance in Asia. Studies based on genetic markers indicate that it forms a species complex. We report here (1) the complete mitogenome of Z. tau from Malaysia and comparison with that of China as well as the mitogenome of other congeners, and (2) the relationship of Z. tau taxa from different geographical regions based on sequences of cytochrome c oxidase subunit I gene. The complete mitogenome of Z. tau had a total length of 15631 bp for the Malaysian specimen (ZT3) and 15835 bp for the China specimen (ZT1), with similar gene order comprising 37 genes (13 protein-coding genes—PCGs, 2 rRNA genes, and 22 tRNA genes) and a non-coding A + T-rich control region (D-loop). Based on 13 PCGs and 15 mt-genes, Z. tau NC_027290 (China) and Z. tau ZT1 (China) formed a sister group in the lineage containing also Z. tau ZT3 (Malaysia). Phylogenetic analysis based on partial sequences of cox1 gene indicates that the taxa from China, Japan, Laos, Malaysia, Bangladesh, India, Sri Lanka, and Z. tau sp. A from Thailand belong to Z. tau sensu stricto. A complete cox1 gene (or 13 PCGs or 15 mt-genes) instead of partial sequence is more appropriate for determining phylogenetic relationship. PMID:29216281
Bergmame, Laura; Huffman, Jane; Cole, Rebecca; Dayanandan, Selvadurai; Tkach, Vasyl; McLaughlin, J. Daniel
2011-01-01
Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota.
Bergmame, L.; Huffman, J.; Cole, R.; Dayanandan, S.; Tkach, V.; McLaughlin, J.D.
2011-01-01
Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota. ?? 2011 American Society of Parasitologists.
Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai
2017-06-01
Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
2017-01-01
Abstract Partial charges of atoms in a molecule and electrostatic potential (ESP) density for that molecule are known to bear a strong correlation. In order to generate a set of point‐field force field parameters for molecular dynamics, Kollman and coworkers have extracted atomic partial charges for each of all 20 amino acids using restrained partial charge‐fitting procedures from theoretical ESP density obtained from condensed‐state quantum mechanics. The magnitude of atomic partial charges for neutral peptide backbone they have obtained is similar to that of partial atomic charges for ionized carboxylate side chain atoms. In this study, the effect of these known atomic partial charges on ESP is examined using computer simulations and compared with the experimental ESP density recently obtained for proteins using electron microscopy. It is found that the observed ESP density maps are most consistent with the simulations that include atomic partial charges of protein backbone. Therefore, atomic partial charges are integral part of atomic properties in protein molecules and should be included in model refinement. PMID:28370507
Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F
2007-03-01
In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
Quatrini, Raquel; Jedlicki, Eugenia; Holmes, David S
2005-12-01
Commercial bioleaching of copper and the biooxidation of gold is a cost-effective and environmentally friendly process for metal recovery. A partial genome sequence of the acidophilic, bioleaching bacterium Acidithiobacillus ferrooxidans is available from two public sources. This information has been used to build preliminary models that describe how this microorganism confronts unusually high iron loads in the extremely acidic conditions (pH 2) found in natural environments and in bioleaching operations. A. ferrooxidans contains candidate genes for iron uptake, sensing, storage, and regulation of iron homeostasis. Predicted proteins exhibit significant amino acid similarity with known proteins from neutrophilic organisms, including conservation of functional motifs, permitting their identification by bioinformatics tools and allowing the recognition of common themes in iron transport across distantly related species. However, significant differences in amino acid sequence were detected in pertinent domains that suggest ways in which the periplasmic and outer membrane proteins of A. ferrooxidans maintain structural integrity and relevant protein-protein contacts at low pH. Unexpectedly, the microorganism also contains candidate genes, organized in operon-like structures that potentially encode at least 11 siderophore systems for the uptake of Fe(III), although it does not exhibit genes that could encode the biosynthesis of the siderophores themselves. The presence of multiple Fe(III) uptake systems suggests that A. ferrooxidans can inhabit aerobic environments where iron is scarce and where siderophore producers are present. It may also help to explain why it cannot tolerate high Fe(III) concentrations in bioleaching operations where it is out-competed by Leptospirillum species.
Fuzzy regions in an intrinsically disordered protein impair protein-protein interactions.
Gruet, Antoine; Dosnon, Marion; Blocquel, David; Brunel, Joanna; Gerlier, Denis; Das, Rahul K; Bonetti, Daniela; Gianni, Stefano; Fuxreiter, Monika; Longhi, Sonia; Bignon, Christophe
2016-02-01
Despite the partial disorder-to-order transition that intrinsically disordered proteins often undergo upon binding to their partners, a considerable amount of residual disorder may be retained in the bound form, resulting in a fuzzy complex. Fuzzy regions flanking molecular recognition elements may enable partner fishing through non-specific, transient contacts, thereby facilitating binding, but may also disfavor binding through various mechanisms. So far, few computational or experimental studies have addressed the effect of fuzzy appendages on partner recognition by intrinsically disordered proteins. In order to shed light onto this issue, we used the interaction between the intrinsically disordered C-terminal domain of the measles virus (MeV) nucleoprotein (NTAIL ) and the X domain (XD) of the viral phosphoprotein as model system. After binding to XD, the N-terminal region of NTAIL remains conspicuously disordered, with α-helical folding taking place only within a short molecular recognition element. To study the effect of the N-terminal fuzzy region on NTAIL /XD binding, we generated N-terminal truncation variants of NTAIL , and assessed their binding abilities towards XD. The results revealed that binding increases with shortening of the N-terminal fuzzy region, with this also being observed with hsp70 (another MeV NTAIL binding partner), and for the homologous NTAIL /XD pairs from the Nipah and Hendra viruses. Finally, similar results were obtained when the MeV NTAIL fuzzy region was replaced with a highly dissimilar artificial disordered sequence, supporting a sequence-independent inhibitory effect of the fuzzy region. © 2015 Federation of European Biochemical Societies.
Wang, Jianye; Huang, Yu; Zhou, Mingxu; Hardwidge, Philip R; Zhu, Guoqiang
2016-06-21
Muscovy duck parvovirus (MDPV) is the etiological agent of Muscovy duckling parvoviral disease, which is characterized by diarrhea, locomotive dysfunction, stunting, and death in young ducklings, and causes substantial economic losses in the Muscovy duck industry worldwide. FZ91-30 is an attenuated vaccine strain that is safe and immunogenic to ducklings, but the genomic information and molecular mechanism underlining the attenuation are not understood. The FZ91-30 strain was propagated in 11-day-old embryonated goose eggs, and viral particles were purified from the pooled allantoic fluid by differential centrifugation and ultracentrifugation. Single-stranded genomic DNA was extracted and annealed to form double-stranded DNA. The dsDNA digested with NcoI resulted two sub-genomic fragments, which were then cloned into the modified plasmid pBluescript II SK, respectively, generating plasmid pBSKNL and pBSKNR. The sub-genomic plasmid clones were sequenced and further combined to construct the plasmid pFZ that contained the entire genome of strain FZ91-30. The complete genome sequences of strain FM and YY and partial genome sequences of other strains were retrieved from GenBank for sequence comparison. The plasmid pFZ containing the entire genome of FZ91-30 was transfected in 11-day-old embryonated goose eggs via the chorioallantoic membranes route to rescue infectious virus. A genetic marker was introduced into the rescued virus to discriminate from its parental virus. The genome of FZ91-30 consists of 5,131 nucleotides and has 98.9 % similarity to the FM strain. The inverted terminal repeats (ITR) are 456 nucleotides in length, 14 nucleotides longer than that of Goose parvovirus (GPV). The exterior 415 nucleotides of the ITR form a hairpin structure, and the interior 41 nucleotides constitute the D sequence, a reverse complement of the D' sequence at the 3' ITR. Amino acid sequence alignment of the VP1 proteins between FZ91-30 and five pathogenic MDPV strains revealed that FZ91-30 had five mutations; two in the unique region of the VP1 protein (VP1u) and three in VP3. Sequence alignment of the Rep1 proteins revealed two amino acid alterations for FZ91-30, both of which were conserved for two pathogenic strains YY and P. Transfection of the plasmid pFZ in 11-day-old embryonated goose eggs resulted in generation of infectious virus with similar biological properties as compared with the parental strain. The amino acid mutations identified in the VP1 and Rep1 protein may contribute to the attenuation of FZ91-30 in Muscovy ducklings. Plasmid transfection in embryonated goose eggs was suitable for rescue of infectious MDPV.
Sequence Complexity of Amyloidogenic Regions in Intrinsically Disordered Human Proteins
Das, Swagata; Pal, Uttam; Das, Supriya; Bagga, Khyati; Roy, Anupam; Mrigwani, Arpita; Maiti, Nakul C.
2014-01-01
An amyloidogenic region (AR) in a protein sequence plays a significant role in protein aggregation and amyloid formation. We have investigated the sequence complexity of AR that is present in intrinsically disordered human proteins. More than 80% human proteins in the disordered protein databases (DisProt+IDEAL) contained one or more ARs. With decrease of protein disorder, AR content in the protein sequence was decreased. A probability density distribution analysis and discrete analysis of AR sequences showed that ∼8% residue in a protein sequence was in AR and the region was in average 8 residues long. The residues in the AR were high in sequence complexity and it seldom overlapped with low complexity regions (LCR), which was largely abundant in disorder proteins. The sequences in the AR showed mixed conformational adaptability towards α-helix, β-sheet/strand and coil conformations. PMID:24594841
Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).
Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi
2014-06-01
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome
Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.
2001-01-01
Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.
Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M
2001-10-09
Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.
Tian, Pengfei; Best, Robert B
2017-10-17
Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Amyloidogenesis of Natively Unfolded Proteins
Uversky, Vladimir N.
2009-01-01
Aggregation and subsequent development of protein deposition diseases originate from conformational changes in corresponding amyloidogenic proteins. The accumulated data support the model where protein fibrillogenesis proceeds via the formation of a relatively unfolded amyloidogenic conformation, which shares many structural properties with the pre-molten globule state, a partially folded intermediate first found during the equilibrium and kinetic (un)folding studies of several globular proteins and later described as one of the structural forms of natively unfolded proteins. The flexibility of this structural form is essential for the conformational rearrangements driving the formation of the core cross-beta structure of the amyloid fibril. Obviously, molecular mechanisms describing amyloidogenesis of ordered and natively unfolded proteins are different. For ordered protein to fibrillate, its unique and rigid structure has to be destabilized and partially unfolded. On the other hand, fibrillogenesis of a natively unfolded protein involves the formation of partially folded conformation; i.e., partial folding rather than unfolding. In this review recent findings are surveyed to illustrate some unique features of the natively unfolded proteins amyloidogenesis. PMID:18537543
Chen, Yiyang; Liu, Baoyuan; Sun, Yani; Li, Huixia; Du, Taofeng; Nan, Yuchen; Hiscox, Julian A; Zhou, En-Min; Zhao, Qin
2018-07-01
Hepatitis E virus (HEV) causes liver disease in humans and is thought to be a zoonotic infection, with domestic animals, including swine and rabbits, being a reservoir. One of the proteins encoded by the virus is the capsid protein. This is likely the major immune-dominant protein and a target for vaccination. Four monoclonal antibodies (MAbs), three novel, 1E4, 2C7, and 2G9, and one previously characterized, 1B5, were evaluated for binding to the capsid protein from genotype 4 swine HEV. The results indicated that 625 DFCP 628 , 458 PSRPF 462 , and 407 EPTV 410 peptides on the capsid protein comprised minimal amino acid sequence motifs recognized by 1E4, 2C7, and 2G9, respectively. The data suggested that 2C7 and 2G9 epitopes were partially exposed on the surface of the capsid protein. Truncated genotype 4 swine HEV capsid protein (sp239, amino acids 368 to 606) can exist in multimeric forms. Preincubation of swine HEV with 2C7, 2G9, or 1B5 before addition to HepG2 cells partially blocked sp239 cell binding and inhibited swine HEV infection. The study indicated that 2C7, 2G9, and 1B5 partially blocked swine HEV infection of rabbits better than 1E4 or normal mouse IgG. The cross-reactivity of antibodies suggested that capsid epitopes recognized by 2C7 and 2G9 are common to HEV strains infecting most host species. Collectively, MAbs 2C7, 2G9, and 1B5 were shown to recognize three novel linear neutralizing B-cell epitopes of genotype 4 HEV capsid protein. These results enhance understanding of HEV capsid protein structure to guide vaccine and antiviral design. IMPORTANCE Genotype 3 and 4 HEVs are zoonotic viruses. Here, genotype 4 HEV was studied due to its prevalence in human populations and pig herds in China. To improve HEV disease diagnosis and prevention, a better understanding of the antigenic structure and neutralizing epitopes of HEV capsid protein are needed. In this study, the locations of three novel linear B-cell recognition epitopes within genotype 4 swine HEV capsid protein were characterized. Moreover, the neutralizing abilities of three MAbs specific for this protein, 2C7, 2G9, and 1B5, were studied in vitro and in vivo Collectively, these findings reveal structural details of genotype 4 HEV capsid protein and should facilitate development of applications for the design of vaccines and antiviral drugs for broader prevention, detection, and treatment of HEV infection of diverse human and animal hosts. Copyright © 2018 American Society for Microbiology.
Characterization of Durham virus, a novel rhabdovirus that encodes both a C and SH protein.
Allison, A B; Palacios, G; Travassos da Rosa, A; Popov, V L; Lu, L; Xiao, S Y; DeToy, K; Briese, T; Lipkin, W I; Keel, M K; Stallknecht, D E; Bishop, G R; Tesh, R B
2011-01-01
The family Rhabdoviridae is a diverse group of non-segmented, negative-sense RNA viruses that are distributed worldwide and infect a wide range of hosts including vertebrates, invertebrates, and plants. Of the 114 currently recognized vertebrate rhabdoviruses, relatively few have been well characterized at both the antigenic and genetic level; hence, the phylogenetic relationships between many of the vertebrate rhabdoviruses remain unknown. The present report describes a novel rhabdovirus isolated from the brain of a moribund American coot (Fulica americana) that exhibited neurological signs when found in Durham County, North Carolina, in 2005. Antigenic characterization of the virus revealed that it was serologically unrelated to 68 other known vertebrate rhabdoviruses. Genomic sequencing of the virus indicated that it shared the highest identity to Tupaia rhabdovirus (TUPV), and as only previously observed in TUPV, the genome encoded a putative C protein in an overlapping open reading frame (ORF) of the phosphoprotein gene and a small hydrophobic (SH) protein located in a novel ORF between the matrix and glycoprotein genes. Phylogenetic analysis of partial amino acid sequences of the nucleoprotein and polymerase protein indicated that, in addition to TUPV, the virus was most closely related to avian and small mammal rhabdoviruses from Africa and North America. In this report, we present the morphological, pathological, antigenic, and genetic characterization of the new virus, tentatively named Durham virus (DURV), and discuss its potential evolutionary relationship to other vertebrate rhabdoviruses. Copyright © 2010 Elsevier B.V. All rights reserved.
Characterization of Durham virus, a novel rhabdovirus that encodes both a C and SH protein
Allison, A. B.; Palacios, G.; Rosa, A. Travassos da; Popov, V. L.; Lu, L.; Xiao, S. Y.; DeToy, K.; Briese, T.; Lipkin, W. Ian; Keel, M. K.; Stallknecht, D. E.; Bishop, G. R.; Tesh, R. B.
2010-01-01
The family Rhabdoviridae is a diverse group of non-segmented, negative-sense RNA viruses that are distributed worldwide and infect a wide range of hosts including vertebrates, invertebrates, and plants. Of the 114 currently recognized vertebrate rhabdoviruses, relatively few have been well characterized at both the antigenic and genetic level; hence, the phylogenetic relationships between many of the vertebrate rhabdoviruses remain unknown. The present report describes a novel rhabdovirus isolated from the brain of a moribund American coot (Fulica americana) that exhibited neurological signs when found in Durham County, North Carolina, in 2005. Antigenic characterization of the virus revealed that it was serologically unrelated to 68 other known vertebrate rhabdoviruses. Genomic sequencing of the virus indicated that it shared the highest identity to Tupaia rhabdovirus (TUPV), and as only previously observed in TUPV, the genome encoded a putative C protein in an overlapping open reading frame (ORF) of the phosphoprotein gene and a small hydrophobic protein located in a novel ORF between the matrix and glycoprotein genes. Phylogenetic analysis of partial amino acid sequences of the nucleoprotein and polymerase proteins indicated that, in addition to TUPV, the virus was most closely related to avian and small mammal rhabdoviruses from Africa and North America. In this report, we present the morphological, pathological, antigenic, and genetic characterization of the new virus, tentatively named Durham virus (DURV), and discuss its potential evolutionary relationship to other vertebrate rhabdoviruses. PMID:20863863
Jailani, A Abdul Kader; Solanki, Vikas; Roy, Anirban; Sivasudha, T; Mandal, Bikash
2017-04-02
A highly infectious clone of Cucumber green mottle mosaic virus (CGMMV), a cucurbit-infecting tobamovirus was utilized for designing of gene expression vectors. Two versions of vector were examined for their efficacy in expressing the green fluorescent protein (GFP) in Nicotiana benthamiana. When the GFP gene was inserted at the stop codon of coat protein (CP) gene of the CGMMV genome without any read-through codon, systemic expression of GFP, as well as virion formation and systemic symptoms expression were obtained in N. benthamiana. The qRT-PCR analysis showed 23 fold increase of GFP over actin at 10days post inoculation (dpi), which increased to 45 fold at 14dpi and thereafter the GFP expression was significantly declined. Further, we show that when the most of the CP sequence is deleted retaining only the first 105 nucleotides, the shortened vector containing GFP in frame of original CP open reading frame (ORF) resulted in 234 fold increase of GFP expression over actin at 5dpi in N. benthamiana without the formation of virions and disease symptoms. Our study demonstrated that a simple manipulation of CP gene in the CGMMV genome while preserving the translational frame of CP resulted in developing a virus-free, rapid and efficient foreign protein expression system in the plant. The CGMMV based vectors developed in this study may be potentially useful for the production of edible vaccines in cucurbits. Copyright © 2017 Elsevier B.V. All rights reserved.
Tachibana, K; Marquardt, H; Yokoya, S; Friesen, H G
1988-10-01
We have reported that the secretion of at least 17 distinct peptides [including rat (rGH)] GH by cultured rat pituitary cells was stimulated by GH-releasing hormone and inhibited by somatostatin, when analyzed by two-dimensional polyacrylamide gel electrophoresis. Three of these peptides (no. 23, 24, and 25) were not rGH immunoreactive. In order to determine whether these three peptides are fragments, degradation products or posttranscriptionally modified forms of rGH, rGH and peptide no. 23 were characterized structurally. From partial peptide maps of rGH and peptide no. 23 by V8 protease or chymotrypsin, it appeared that these peptides were not related to each other. By N-terminal microsequencing of two-dimensional polyacrylamide gel electrophoresis purified peptide, we have obtained the sequence of 24 N-terminal amino acid residues of peptide no. 23. This sequence has no significant homology with rGH or any other reported protein sequence. Antiserum was generated against a synthetic oligopeptide corresponding to amino acid residues 3-24 of peptide no. 23. The antiserum cross-reacted with peptides no. 23, 24, and 25 upon Western blot analysis. These results indicate that peptide no. 23 has a novel structure unrelated to other pituitary hormones. Since its secretion is influenced by GH-releasing hormone and somatostatin, peptide no. 23 may represent a previously unrecognized structurally unique growth factor.
Diversity in VP3, NSP3, and NSP4 of rotavirus B detected from Japanese cattle.
Hayashi-Miyamoto, Michiko; Murakami, Toshiaki; Minami-Fukuda, Fujiko; Tsuchiaka, Shinobu; Kishimoto, Mai; Sano, Kaori; Naoi, Yuki; Asano, Keigo; Ichimaru, Toru; Haga, Kei; Omatsu, Tsutomu; Katayama, Yukie; Oba, Mami; Aoki, Hiroshi; Shirai, Junsuke; Ishida, Motohiko; Katayama, Kazuhiko; Mizutani, Tetsuya; Nagai, Makoto
2017-04-01
Bovine rotavirus B (RVB) is an etiological agent of diarrhea mostly in adult cattle. Currently, a few sequences of viral protein (VP)1, 2, 4, 6, and 7 and nonstructural protein (NSP)1, 2, and 5 of bovine RVB are available in the DDBJ/EMBL/GenBank databases, and none have been reported for VP3, NSP3, and NSP4. In order to fill this gap in the genetic characterization of bovine RVB strains, we used a metagenomics approach and sequenced and analyzed the complete coding sequences (CDS) of VP3, NSP3, and NSP4 genes, as well as the partial or complete CDS of other genes of RVBs detected from Japanese cattle. VP3, NSP3, and NSP4 of bovine RVBs shared low nucleotide sequence identities (63.3-64.9% for VP3, 65.9-68.2% for NSP3, and 52.6-56.2% for NSP4) with those of murine, human, and porcine RVBs, suggesting that bovine RVBs belong to a novel genotype. Furthermore, significantly low amino acid sequence identities were observed for NSP4 (36.1-39.3%) between bovine RVBs and the RVBs of other species. In contrast, hydrophobic plot analysis of NSP4 revealed profiles similar to those of RVBs of other species and rotavirus A (RVA) strains. Phylogenetic analyses of all gene segments revealed that bovine RVB strains formed a cluster that branched distantly from other RVBs. These results suggest that bovine RVBs have evolved independently from other RVBs but in a similar manner to other rotaviruses. These findings provide insights into the evolution and diversity of RVB strains. Copyright © 2017 Elsevier B.V. All rights reserved.
Möbius, Petra; Hölzer, Martin; Felder, Marius; Nordsiek, Gabriele; Groth, Marco; Köhler, Heike; Reichwald, Kathrin; Platzer, Matthias; Marz, Manja
2015-01-01
Mycobacterium avium (M. a.) subsp. paratuberculosis (MAP)—the etiologic agent of Johne’s disease—affects cattle, sheep, and other ruminants worldwide. To decipher phenotypic differences among sheep and cattle strains (belonging to MAP-S [Type-I/III], respectively, MAP-C [Type-II]), comparative genome analysis needs data from diverse isolates originating from different geographic regions of the world. This study presents the so far best assembled genome of a MAP-S-strain: Sheep isolate JIII-386 from Germany. One newly sequenced cattle isolate (JII-1961, Germany), four published MAP strains of MAP-C and MAP-S from the United States and Australia, and M. a. subsp. hominissuis (MAH) strain 104 were used for assembly improvement and comparisons. All genomes were annotated by BacProt and results compared with NCBI (National Center for Biotechnology Information) annotation. Corresponding protein-coding sequences (CDSs) were detected, but also CDSs that were exclusively determined by either NCBI or BacProt. A new Shine–Dalgarno sequence motif (5′-AGCTGG-3′) was extracted. Novel CDSs including PE-PGRS family protein genes and about 80 noncoding RNAs exhibiting high sequence conservation are presented. Previously found genetic differences between MAP-types are partially revised. Four of ten assumed MAP-S-specific large sequence polymorphism regions (LSPSs) are still present in MAP-C strains; new LSPSs were identified. Independently of the regional origin of the strains, the number of individual CDSs and single nucleotide variants confirms the strong similarity of MAP-C strains and shows higher diversity among MAP-S strains. This study gives ambiguous results regarding the hypothesis that MAP-S is the evolutionary intermediate between MAH and MAP-C, but it clearly shows a higher similarity of MAP to MAH than to Mycobacterium intracellulare. PMID:26384038
Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P
2000-04-01
We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.
The Drosophila Tis11 protein and its effects on mRNA expression in flies.
Choi, Youn-Jeong; Lai, Wi S; Fedic, Robert; Stumpo, Deborah J; Huang, Weichun; Li, Leping; Perera, Lalith; Brewer, Brandy Y; Wilson, Gerald M; Mason, James M; Blackshear, Perry J
2014-12-19
Members of the mammalian tristetraprolin family of CCCH tandem zinc finger proteins can bind to certain AU-rich elements (AREs) in mRNAs, leading to their deadenylation and destabilization. Mammals express three or four members of this family, but Drosophila melanogaster and other insects appear to contain a single gene, Tis11. We found that recombinant Drosophila Tis11 protein could bind to ARE-containing RNA oligonucleotides with low nanomolar affinity. Remarkably, co-expression in mammalian cells with "target" RNAs demonstrated that Tis11 could promote destabilization of ARE-containing mRNAs and that this was partially dependent on a conserved C-terminal sequence resembling the mammalian NOT1 binding domain. Drosophila Tis11 promoted both deadenylation and decay of a target transcript in this heterologous cell system. We used chromosome deletion/duplication and P element insertion to produce two types of Tis11 deficiency in adult flies, both of which were viable and fertile. To address the hypothesis that Tis11 deficiency would lead to the abnormal accumulation of potential target transcripts, we analyzed gene expression in adult flies by deep mRNA sequencing. We identified 69 transcripts from 56 genes that were significantly up-regulated more than 1.5-fold in both types of Tis11-deficient flies. Ten of the up-regulated transcripts encoded probable proteases, but many other functional classes of proteins were represented. Many of the up-regulated transcripts contained potential binding sites for tristetraprolin family member proteins that were conserved in other Drosophila species. Tis11 is thus an ARE-binding, mRNA-destabilizing protein that may play a role in post-transcriptional gene expression in Drosophila and other insects. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
The Drosophila Tis11 Protein and Its Effects on mRNA Expression in Flies*
Choi, Youn-Jeong; Lai, Wi S.; Fedic, Robert; Stumpo, Deborah J.; Huang, Weichun; Li, Leping; Perera, Lalith; Brewer, Brandy Y.; Wilson, Gerald M.; Mason, James M.; Blackshear, Perry J.
2014-01-01
Members of the mammalian tristetraprolin family of CCCH tandem zinc finger proteins can bind to certain AU-rich elements (AREs) in mRNAs, leading to their deadenylation and destabilization. Mammals express three or four members of this family, but Drosophila melanogaster and other insects appear to contain a single gene, Tis11. We found that recombinant Drosophila Tis11 protein could bind to ARE-containing RNA oligonucleotides with low nanomolar affinity. Remarkably, co-expression in mammalian cells with “target” RNAs demonstrated that Tis11 could promote destabilization of ARE-containing mRNAs and that this was partially dependent on a conserved C-terminal sequence resembling the mammalian NOT1 binding domain. Drosophila Tis11 promoted both deadenylation and decay of a target transcript in this heterologous cell system. We used chromosome deletion/duplication and P element insertion to produce two types of Tis11 deficiency in adult flies, both of which were viable and fertile. To address the hypothesis that Tis11 deficiency would lead to the abnormal accumulation of potential target transcripts, we analyzed gene expression in adult flies by deep mRNA sequencing. We identified 69 transcripts from 56 genes that were significantly up-regulated more than 1.5-fold in both types of Tis11-deficient flies. Ten of the up-regulated transcripts encoded probable proteases, but many other functional classes of proteins were represented. Many of the up-regulated transcripts contained potential binding sites for tristetraprolin family member proteins that were conserved in other Drosophila species. Tis11 is thus an ARE-binding, mRNA-destabilizing protein that may play a role in post-transcriptional gene expression in Drosophila and other insects. PMID:25342740
SAXS and other spectroscopic analysis of 12S cruciferin isolated from the seeds of Brassica nigra
NASA Astrophysics Data System (ADS)
Khaliq, Binish; Falke, Sven; Negm, Amr; Buck, Friedrich; Munawar, Aisha; Saqib, Maria; Mahmood, Seema; Ahmad, Malik Shoaib; Betzel, Christian; Akrem, Ahmed
2017-06-01
Oilseeds of the plant family Brassicaceae are important for providing both lipid and protein contents to human nutrition. Cruciferins (12S globulins) are seed storage proteins, which are getting attention due to their allergenic and pathogenicity related nature. This study describes the purification and characterization of a trimeric (∼190 kDa) cruciferin protein from the seeds of Brassica nigra (L.). Cruciferin was first partially purified by ammonium sulfate precipitation (30% saturation constant) and further purified by size exclusion chromatography. The N-terminal amino-acid sequence analysis showed 82% sequence homology with cruciferin from Arabidopsis thaliana. The 50-55 kDa monomeric cruciferin produced multiple bands of two major molecular weight ranges (α-polypeptides of 28-32 kDa and β-polypeptides of 17-20 kDa) under reduced conditions of SDS-PAGE. The 2D gel electrophoretic analysis showed the further separation of the bands into their isoforms with major pI ranges between 5.7 and 8.0 (α-polypeptides) and 5.5-8.5 (β-polypeptides). The Dynamic Light Scattering (DLS) showed the monodisperse nature of the cruciferin with hydrodynamic radius of 5.8 ± 0.1 nm confirming the trimeric nature of the protein. The Circular Dichroism (CD) spectra showed both α-helices and β-sheets in the native conformation of the trimeric protein. The pure cruciferin protein (40 mg/ml) was successfully crystallized; however, the crystals diffracted only to low resolution data (8 Å). Small-angle x-ray scattering (SAXS) was applied to gain insights into the three-dimensional structure in solution. SAXS showed that the radius of gyration is 4.24 ± 0.25 nm and confirmed the nearly globular shape. The SAXS based ab initio dummy model of B. nigra cruciferin was compared with 11S globulins.
Murasugi, Akira
2013-01-01
Midkine is a heparin-binding growth factor that promotes cell growth, survival, and migration. Externally added midkine prevents ventricular remodeling and improves long-term survival after myocardial infarction in the mouse. Preclinical testing of this protein is in progress. Externally added pleiotrophin, a member of the midkine protein family, promotes functional recovery after neural transplantation in rats. Thus, pleiotrophin is also a candidate therapeutic protein. Large amounts of these proteins were obtained by using the heterologous protein expression system of Pichia pastoris, and the recombinant P. pastoris clones were cultured in a controlled fermentor. Intracellular expression yielded about 300 mg/L recombinant human (rh)-midkine, which was extracted, renatured, and purified. From 1 L of the culture, 64 mg of rh-midkine was purified. Secretory expression induced by the midkine secretion signal resulted in about 100 mg of rhmidkine in 1 L of the culture supernatant, but over 70% of the rh-midkine had yeast-specific glycosylation. Three threonyl residues that are targets for glycosylation were substituted with alanyl residues, and nonglycosylated, active rh-midkine was obtained. In secretory expression using α-mating factor prepro-sequence, about 640 mg/L rh-midkine was obtained, but it was partially truncated. Therefore, a protease-deficient host was used, and about 360 mg/L intact rh-midkine was then obtained. The rh-midkine was recovered and purified, with 70% final yield. All purified rh-midkine, regardless of expression method, was able to promote mammalian cell proliferation. In secretory expression of rh-pleiotrophin using α- mating factor prepro-sequence, 260 mg/L rh-pleiotrophin could be secreted. The rh-pleiotrophin was recovered and efficiently purified with 72% final yield.
Sun, D; Leung, C L; Liem, R K
2001-01-01
MACF (microtubule actin cross-linking factor) is a large, 608-kDa protein that can associate with both actin microfilaments and microtubules (MTs). Structurally, MACF can be divided into 3 domains: an N-terminal domain that contains both a calponin type actin-binding domain and a plakin domain; a rod domain that is composed of 23 dystrophin-like spectrin repeats; and a C-terminal domain that includes two EF-hand calcium-binding motifs, as well as a region that is homologous to two related proteins, GAR22 and Gas2. We have previously demonstrated that the C-terminal domain of MACF binds to MTs, although no homology was observed between this domain and other known microtubule-binding proteins. In this report, we describe the characterization of this microtubule-binding domain of MACF by transient transfection studies and in vitro binding assays. We found that the C-terminus of MACF contains at least two microtubule-binding regions, a GAR domain and a domain containing glycine-serine-arginine (GSR) repeats. In transfected cells, the GAR domain bound to and partially stabilized MTs to depolymerization by nocodazole. The GSR-containing domain caused MTs to form bundles that are still sensitive to nocodazole-induced depolymerization. When present together, these two domains acted in concert to bundle MTs and render them stable to nocodazole treatment. Recently, a study has shown that the N-terminal half of the plakin domain (called the M1 domain) of MACF also binds MTs. We therefore examined the microtubule binding ability of the M1 domain in the context of the entire plakin domain with and without the remaining N-terminal regions of two different MACF isoforms. Interestingly, in the presence of the surrounding sequences, the M1 domain did not bind MTs. In addition to MACF, cDNA sequences encoding the GAR and GSR-containing domains are also found in the partial human EST clone KIAA0728, which has high sequence homology to the 3' end of the MACF cDNA; hence, we refer to it as MACF2. The C-terminal domain of mouse MACF2 was cloned and characterized. The microtubule-binding properties of MACF2 C-terminal domain are similar to that of MACF. The GAR domain was originally found in Gas 2 protein and here we show that it can associate with MTs in transfected cells. Plectin and desmoplakin have GSR-containing domains at their C-termini and we further demonstrate that the GSR-containing domain of plectin, but not desmoplakin, can bind to MTs in vivo.
Nabuurs, Sanne M; Westphal, Adrie H; aan den Toorn, Marije; Lindhoud, Simon; van Mierlo, Carlo P M
2009-06-17
Partially folded protein species transiently exist during folding of most proteins. Often these species are molten globules, which may be on- or off-pathway to native protein. Molten globules have a substantial amount of secondary structure but lack virtually all the tertiary side-chain packing characteristic of natively folded proteins. These ensembles of interconverting conformers are prone to aggregation and potentially play a role in numerous devastating pathologies, and thus attract considerable attention. The molten globule that is observed during folding of apoflavodoxin from Azotobacter vinelandii is off-pathway, as it has to unfold before native protein can be formed. Here we report that this species can be trapped under nativelike conditions by substituting amino acid residue F44 by Y44, allowing spectroscopic characterization of its conformation. Whereas native apoflavodoxin contains a parallel beta-sheet surrounded by alpha-helices (i.e., the flavodoxin-like or alpha-beta parallel topology), it is shown that the molten globule has a totally different topology: it is helical and contains no beta-sheet. The presence of this remarkably nonnative species shows that single polypeptide sequences can code for distinct folds that swap upon changing conditions. Topological switching between unrelated protein structures is likely a general phenomenon in the protein structure universe.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tanabe, Yuko; Fujita, Eriko; Department of Pediatrics, Jichi Medical University, 3311-1 Yakushiji, Shimotsuke, Tochigi 329-0498
Highlights: {yields} We isolated protection of telomeres 1 (POT1) as a FOXP2-associated protein by a yeast two-hybrid. {yields} FOXP2 associated and co-localized with POT1 in the nuclei. {yields} FOXP2(R553H) also co-localized with POT1 in both the cytoplasm and nuclei. {yields} FOXP2(R553H) partially prevented the nuclear translocation of POT1. {yields} FOXP2(R553H) mutation may be associated with the pathogenesis of speech-language disorder. -- Abstract: FOXP2 is a forkhead box-containing transcription factor with several recognizable sequence motifs. However, little is known about the FOXP2-associated proteins except for C-terminal binding protein (CtBP). In the present study, we attempted to isolate the FOXP2-associated protein withmore » a yeast two-hybrid system using the C-terminal region, including the forkhead domain, as a bait probe, and identified protection of telomeres 1 (POT1) as a FOXP2-associated protein. Immunoprecipitation assay confirmed the association with FOXP2 and POT1. POT1 alone localized in the cytoplasm but co-localized with FOXP2 and the forkhead domain of FOXP2 in nuclei. However, both FOXP2 with mutated nuclear localization signals and (R553H) mutated forkhead, which is associated with speech-language disorder, prevented the nuclear translocation of POT1. These results suggest that FOXP2 is a binding partner for the nuclear translocation of POT1. As loss of POT1 function induces the cell arrest, the impaired nuclear translocation of POT1 in the developing neuronal cells may be associated with the pathogenesis of speech-language disorder with FOXP2(R553H) mutation.« less
Bhattachary, R; Bukkapatnam, R; Prawoko, I; Soto, J; Morgan, M; Salup, R R
2002-05-01
Despite early diagnosis and improved therapy, 31,500 men will die from prostate cancer (PC) this year. The HER2/neu oncoprotein is an important effector of cell growth found in the majority of high-grade prostatic tumors and is capable of rendering immunogenicity. The antigenicity of this oncoprotein might prove useful in the development of PC vaccines. Our goal is to prove the principle that a single DNA vaccine can provide reliable immunity against PC in the MatLyLu (MLL) translational tumor model. The parental rat MatLyLu PC cell line expresses low to moderate levels of the rat neu protein. To simulate in vivo human PC, MatLyLu cells were transfected with a truncated sequence of human HER2/neu cDNA cloned into the pCI-neo vector. This HER2/neu cDNA sequence encodes the first 433 amino acids of the extracellular domain (ECD). MatLyLu cells were also transfected with the same HER2/neu cDNA sequence cloned into the N1-terminal sequence of EGFP reporter gene to produce a fusion protein. The partial ECD sequence of HER2/neu includes five rat major histocompatibility (MHC)-II-restricted peptides with complete human-to-rat cross-species homology. The HER2/neu protein overexpression was documented by Western Blot analysis, and the expression of fusion protein was monitored by confocal microscopy and fluorimetry. Vaccination with a single injection of HER2/neu cDNA protected 50% of animals against HER2/neu-MatLyLu tumors (P < 0.01). When the tumor cells were engineered to express HER2/neu-EGFP fusion protein, the antitumor immunity was enhanced, as following vaccination with HER2/neu-EGFP cDNA, 80% of these rats rejected HER2/neu-EGFP-MatLyLu (P<0.001). Both vaccines induced HER2/neu-specific antibody titers. Rats vaccinated with EGFP-cDNA rejected 80% of EGFP-MatLyLu tumors and, interestingly, 40% of HER2/neu-MatLyLu tumors. None of the cDNA vaccines induced immunity against parental MatLyLu cells. Our data clearly demonstrate that a single injection of HER2/neu-EGFP cDNA is a very effective vaccine against PC tumors expressing the cognate tumor-associated antigen (TA). The antitumor immunity is significantly more pronounced if the tumors express xenogeneic HER2/neu-EGFP fusion protein as opposed to only the syngeneic HER2/neu oncoprotein. Our data suggests that the HER2/neu-EGFP-MatLyLu tumor is a potential animal tumor model for investigating therapeutic vaccine strategies against PC in vivo and demonstrates the limitations of a cDNA vaccine only encoding for MHC-II-restricted HER2/neu-ECD sequence peptides.
Analysis of sequence repeats of proteins in the PDB.
Mary Rajathei, David; Selvaraj, Samuel
2013-12-01
Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that identity of the sequence repeats falls in the range of 20-40% and, the superimposed structures of the most of the sequence repeats maintain similar overall folding. Analysis sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Eichmann, Cédric; Orts, Julien; Tzitzilonis, Christos; Vögeli, Beat; Smrt, Sean; Lorieau, Justin; Riek, Roland
2014-12-11
The interaction between membrane proteins and lipids or lipid mimetics such as detergents is key for the three-dimensional structure and dynamics of membrane proteins. In NMR-based structural studies of membrane proteins, qualitative analysis of intermolecular nuclear Overhauser enhancements (NOEs) or paramagnetic resonance enhancement are used in general to identify the transmembrane segments of a membrane protein. Here, we employed a quantitative characterization of intermolecular NOEs between (1)H of the detergent and (1)H(N) of (2)H-perdeuterated, (15)N-labeled α-helical membrane protein-detergent complexes following the exact NOE (eNOE) approach. Structural considerations suggest that these intermolecular NOEs should show a helical-wheel-type behavior along a transmembrane helix or a membrane-attached helix within a membrane protein as experimentally demonstrated for the complete influenza hemagglutinin fusion domain HAfp23. The partial absence of such a NOE pattern along the amino acid sequence as shown for a truncated variant of HAfp23 and for the Escherichia coli inner membrane protein YidH indicates the presence of large tertiary structure fluctuations such as an opening between helices or the presence of large rotational dynamics of the helices. Detergent-protein NOEs thus appear to be a straightforward probe for a qualitative characterization of structural and dynamical properties of membrane proteins embedded in detergent micelles.
Polysaccharide intercellular adhesin in biofilm: structural and regulatory aspects
Arciola, Carla Renata; Campoccia, Davide; Ravaioli, Stefano; Montanaro, Lucio
2015-01-01
Staphylococcus aureus and Staphylococcus epidermidis are the leading etiologic agents of implant-related infections. Biofilm formation is the main pathogenetic mechanism leading to the chronicity and irreducibility of infections. The extracellular polymeric substances of staphylococcal biofilms are the polysaccharide intercellular adhesin (PIA), extracellular-DNA, proteins, and amyloid fibrils. PIA is a poly-β(1-6)-N-acetylglucosamine (PNAG), partially deacetylated, positively charged, whose synthesis is mediated by the icaADBC locus. DNA sequences homologous to ica locus are present in many coagulase-negative staphylococcal species, among which S. lugdunensis, however, produces a biofilm prevalently consisting of proteins. The product of icaA is an N-acetylglucosaminyltransferase that synthetizes PIA oligomers from UDP-N-acetylglucosamine. The product of icaD gives optimal efficiency to IcaA. The product of icaC is involved in the externalization of the nascent polysaccharide. The product of icaB is an N-deacetylase responsible for the partial deacetylation of PIA. The expression of ica locus is affected by environmental conditions. In S. aureus and S. epidermidis ica-independent alternative mechanisms of biofilm production have been described. S. epidermidis and S. aureus undergo to a phase variation for the biofilm production that has been ascribed, in turn, to the transposition of an insertion sequence in the icaC gene or to the expansion/contraction of a tandem repeat naturally harbored within icaC. A role is played by the quorum sensing system, which negatively regulates biofilm formation, favoring the dispersal phase that disseminates bacteria to new infection sites. Interfering with the QS system is a much debated strategy to combat biofilm-related infections. In the search of vaccines against staphylococcal infections deacetylated PNAG retained on the surface of S. aureus favors opsonophagocytosis and is a potential candidate for immune-protection. PMID:25713785
Arai, Kazune; Kashiwazaki, Aki; Fujiwara, Yoko; Tsuchiya, Hiroyoshi; Sakai, Nobuya; Shibata, Katsushi; Koshimizu, Taka-aki
2015-02-15
A group of synthetic substance P (SP) antagonists, such as [Arg(6),D-Trp(7,9),N(Me)Phe(8)]-substance P(6-11) and [D-Arg(1),D-Phe(5),D-Trp(7,9),Leu(11)]-substance P, bind to a range of distinct G-protein-coupled receptor (GPCR) family members, including V1a vasopressin receptors, and they competitively inhibit agonist binding. This extended accessibility enabled us to identify a GPCR subset with a partially conserved binding site structure. By combining pharmacological data and amino acid sequence homology matrices, a pharmacological lineage of GPCRs that are sensitive to these two SP antagonists was constructed. We found that sensitivity to the SP antagonists was not limited to the Gq-protein-coupled V1a and V1b receptors; Gs-coupled V2 receptors and oxytocin receptors, which couple with both Gq and Gi, also demonstrated sensitivity. Unexpectedly, a dendrogram based on the amino acid sequences of 222 known GPCRs showed that a group of receptors sensitive to the SP antagonists are located in close proximity to vasopressin/oxytocin receptors. Gonadotropin-releasing peptide receptors, located near the vasopressin receptors in the dendrogram, were also sensitive to the SP analogs, whereas α1B adrenergic receptors, located more distantly from the vasopressin receptors, were not sensitive. Our finding suggests that pharmacological lineage analysis is useful in selecting subsets of candidate receptors that contain a conserved binding site for a ligand with broad-spectrum binding abilities. The knowledge that the binding site of the two broad-spectrum SP analogs partially overlaps with that of distinct peptide agonists is valuable for understanding the specificity/broadness of peptide ligands. Copyright © 2015 Elsevier B.V. All rights reserved.
Tanhaeian, Abbas; Shahriari Ahmadi, Farajollah; Sekhavati, Mohammad Hadi; Mamarabadi, Mojtaba
2018-04-04
Lactoferrin is the most dominant protein in milk after casein. This protein plays a crucial role in many biological processes including the regulation of iron metabolism, induction and modulation of the immune system, the primary defense against microorganisms, inhibiting lipid peroxidation and presenting antimicrobial activity against various pathogens such as parasites, fungi, bacteria, and viruses. The major antimicrobial effect of lactoferrin is related to its N-terminal tail where different peptides for instance lactoferricin and lactoferrampin which are important for their antimicrobial abilities are present. The growth rate of bacterial cells in camel milk is lower than that of the cow milk due to having more antimicrobial compounds. In this study, we have fused a codon-optimized partial camel lactoferrcin and lactoferrampin DNA sequences in order to construct a fused peptide via a lysine. This chimeric 42-mer peptide consists of complete and partial amino acid sequence of camel lactoferrampin and lactoferricin, respectively. Human embryonic kidney 293 (HEK-293) cells were used for synthesizing this recombinant peptide. Finally, the antibacterial activities of this constructed peptide were investigated under in vitro condition. The result showed that, all construction, cloning and expression processes were successfully performed in HEK-293. One His-tag tail was added to the chimera in order to optimize the isolation and purification processes and also reduce the cost of production. Additionally, His-tag retained the antimicrobial activity of the chimera. The antimicrobial tests showed that the growth rate in the majority of bacterial plant pathogens, including gram negative and positive bacteria, was inhibited by recombinant chimera as the level of MIC values were evaluated between 0.39 and 25.07 μg/ml for different bacterial isolates.
Modular protein domains: an engineering approach toward functional biomaterials.
Lin, Charng-Yu; Liu, Julie C
2016-08-01
Protein domains and peptide sequences are a powerful tool for conferring specific functions to engineered biomaterials. Protein sequences with a wide variety of functionalities, including structure, bioactivity, protein-protein interactions, and stimuli responsiveness, have been identified, and advances in molecular biology continue to pinpoint new sequences. Protein domains can be combined to make recombinant proteins with multiple functionalities. The high fidelity of the protein translation machinery results in exquisite control over the sequence of recombinant proteins and the resulting properties of protein-based materials. In this review, we discuss protein domains and peptide sequences in the context of functional protein-based materials, composite materials, and their biological applications. Copyright © 2016 Elsevier Ltd. All rights reserved.
Oleas, Gabriela; Callegari, Eduardo; Sepúlveda, Romina; Eyzaguirre, Jaime
2017-04-18
The lignocellulolytic fungus, Penicillium purpurogenum, grows on a variety of natural carbon sources, among them sugar beet pulp. Culture supernatants of P. purpurogenum grown on sugar beet pulp were partially purified and the fractions obtained analyzed for esterase activity by zymograms. The bands with activity on methyl umbelliferyl acetate were subjected to mass spectrometry to identify peptides. The peptides obtained were probed against the proteins deduced from the genome sequence of P. purpurogenum. Eight putative esterases thus identified were chosen for future work. Their cDNAs were expressed in Pichia pastoris. The supernatants of the recombinant clones were assayed for esterase activity, and five of the proteins were active against one or more substrates: methyl umbelliferyl acetate, indoxyl acetate, methyl esterified pectin and fluorescein diacetate. Three of those enzymes were purified, further characterized and subjected to a BLAST search. Based on their amino acid sequence and properties, they were identified as follows: RAE1, pectin acetyl esterase (CAZy family CE 12); FAEA, feruloyl esterase (could not be assigned to a CAZy family) and EAN, acetyl esterase (former CAZy family CE 10). Copyright © 2017 Elsevier Ltd. All rights reserved.
Properties of a Bacteriocin Produced by Bacillus subtilis EMD4 Isolated from Ganjang (Soy Sauce).
Liu, Xiaoming; Lee, Jae Yong; Jeong, Seon-Ju; Cho, Kye Man; Kim, Gyoung Min; Shin, Jung-Hye; Kim, Jong-Sang; Kim, Jeong Hwan
2015-09-01
A Bacillus species, EMD4, with strong antibacterial activity was isolated from ganjang (soy sauce) and identified as B. subtilis. B. subtilis EMD4 strongly inhibited the growth of B. cereus ATCC14579 and B. thuringiensis ATCC33679. The antibacterial activity was stable at pH 3-9 but inactive at pH 10 and above. The activity was fully retained after 15 min at 80°C but reduced by 50% after 15 min at 90°C. The activity was completely destroyed by proteinase K and protease treatment, indicating its proteinaceous nature. The bacteriocin (BacEMD4) was partially purified from culture supernatant by ammonium sulfate precipitation, and QSepharose and Sephadex G-50 column chromatographies. The specific activity was increased from 769.2 AU/mg protein to 8,347.8 AU/mg protein and the final yield was 12.6%. The size of BacEMD4 was determined to be 3.5 kDa by Tricine SDS-PAGE. The N-terminal amino acid sequence was similar with that of Subtilosin A. Nucleotide sequencing of the cloned gene confirmed that BacEMD4 was Subtilosin A. BacEMD4 showed bactericidal activity against B. cereus ATCC14579.
Gbadegesin, M A; Beeching, J R
2011-06-07
Cassava can be cultivated on impoverished soils with minimum inputs, and its storage roots are a staple food for millions in Africa. However, these roots are low in bioavailable nutrients and in protein content, contain cyanogenic glycosides, and suffer from a very short post-harvest shelf-life, and the plant is susceptible to viral and bacterial diseases prevalent in Africa. The demand for improvement of cassava with respect to these traits comes from both farmers and national agricultural institutions. Genetic improvement of cassava cultivars by molecular biology techniques requires the availability of appropriate genes, a system to introduce these genes into cassava, and the use of suitable gene promoters. Cassava root-specific promoter for auxin-repressed protein was isolated using the gene walking approach, starting with a cDNA sequence. In silico analysis of promoter sequences revealed putative cis-acting regulatory elements, including root-specific elements, which may be required for gene expression in vascular tissues. Research on the activities of this promoter is continuing, with the development of plant expression cassettes for transformation into major African elite lines and farmers' preferred cassava cultivars to enable testing of tissue-specific expression patterns in the field.
Oleas, Gabriela; Callegari, Eduardo; Sepúlveda, Romina; Eyzaguirre, Jaime
2017-01-01
The lignocellulolytic fungus, Penicillium purpurogenum, grows on a variety of natural carbon sources, among them sugar beet pulp. Culture supernatants of P. purpurogenum grown on sugar beet pulp were partially purified and the fractions obtained analyzed for esterase activity by zymograms. The bands with activity on methyl umbelliferyl acetate were subjected to mass spectrometry to identify peptides. The peptides obtained were probed against the proteins deduced from the genome sequence of P. purpurogenum. Eight putative esterases thus identified were chosen for future work. Their cDNAs were expressed in Pichia pastoris. The supernatants of the recombinant clones were assayed for esterase activity, and five of the proteins were active against one or more substrates: methyl umbelliferyl acetate, indoxyl acetate, methyl esterified pectin and fluorescein diacetate. Three of those enzymes were purified, further characterized and subjected to a BLAST search. Based on their amino acid sequence and properties, they were identified as follows: RAE1, pectin acetyl esterase (CAZy family CE 12); FAEA, feruloyl esterase (could not be assigned to a CAZy family) and EAN, acetyl esterase (former CAZy family CE 10). PMID:28342968
Ullers, Ronald S.; Houben, Edith N.G.; Raine, Amanda; ten Hagen-Jongman, Corinne M.; Ehrenberg, Måns; Brunner, Joseph; Oudega, Bauke; Harms, Nellie; Luirink, Joen
2003-01-01
As newly synthesized polypeptides emerge from the ribosome, they interact with chaperones and targeting factors that assist in folding and targeting to the proper location in the cell. In Escherichia coli, the chaperone trigger factor (TF) binds to nascent polypeptides early in biosynthesis facilitated by its affinity for the ribosomal proteins L23 and L29 that are situated around the nascent chain exit site on the ribosome. The targeting factor signal recognition particle (SRP) interacts specifically with the signal anchor (SA) sequence in nascent inner membrane proteins (IMPs). Here, we have used photocross-linking to map interactions of the SA sequence in a short, in vitro–synthesized, nascent IMP. Both TF and SRP were found to interact with the SA with partially overlapping binding specificity. In addition, extensive contacts with L23 and L29 were detected. Both purified TF and SRP could be cross-linked to L23 on nontranslating ribosomes with a competitive advantage for SRP. The results suggest a role for L23 in the targeting of IMPs as an attachment site for TF and SRP that is close to the emerging nascent chain. PMID:12756233
The Status of Exon Skipping as a Therapeutic Approach to Duchenne Muscular Dystrophy
Lu, Qi-Long; Yokota, Toshifumi; Takeda, Shin'ichi; Garcia, Luis; Muntoni, Francesco; Partridge, Terence
2011-01-01
Duchenne muscular dystrophy (DMD) is associated with mutations in the dystrophin gene that disrupt the open reading frame whereas the milder Becker's form is associated with mutations which leave an in-frame mRNA transcript that can be translated into a protein that includes the N- and C- terminal functional domains. It has been shown that by excluding specific exons at, or adjacent to, frame-shifting mutations, open reading frame can be restored to an out-of-frame mRNA, leading to the production of a partially functional Becker-like dystrophin protein. Such targeted exclusion can be achieved by administration of oligonucleotides that are complementary to sequences that are crucial to normal splicing of the exon into the transcript. This principle has been validated in mouse and canine models of DMD with a number of variants of oligonucleotide analogue chemistries and by transduction with adeno-associated virus (AAV)-small nuclear RNA (snRNA) reagents encoding the antisense sequence. Two different oligonucleotide agents are now being investigated in human trials for splicing out of exon 51 with some early indications of success at the biochemical level. PMID:20978473
Benmansour, A.; Bascuro, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.
1997-01-01
To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequence of the glycoprotein genes (G) of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haemorrhagic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein, showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid position 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 x 10(-3) per nucleotide site, attesting the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera, nor monoclonal antibodies, were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Pankovics, Péter; Simmonds, Peter
2011-01-01
A novel positive-sense, single-stranded RNA (+ssRNA) virus (Halastavi árva RNA virus, HalV; JN000306) with di-cistronic genome organization was serendipitously identified in intestinal contents of freshwater carps (Cyprinus carpio) fished by line-fishing from fishpond “Lőrinte halastó” located in Veszprém County, Hungary. The complete nucleotide (nt) sequence of the genomic RNA is 9565 nt in length and contains two long - non-in-frame - open reading frames (ORFs), which are separated by an intergenic region. The ORF1 (replicase) is preceded by an untranslated sequence of 827 nt, while an untranslated region of 139 nt follows the ORF2 (capsid proteins). The deduced amino acid (aa) sequences of the ORFs showed only low (less than 32%) and partial similarity to the non-structural (2C-like helicase, 3C-like cystein protease and 3D-like RNA dependent RNA polymerase) and structural proteins (VP2/VP4/VP3) of virus families in Picornavirales especially to members of the viruses with dicistronic genome. Halastavi árva RNA virus is present in intestinal contents of omnivorous freshwater carps but the origin and the host species of this virus remains unknown. The unique viral sequence and the actual position indicate that Halastavi árva RNA virus seems to be the first member of a new di-cistronic ssRNA virus. Further studies are required to investigate the specific host species (and spectrum), ecology and role of Halastavi árva RNA virus in the nature. PMID:22195010
Tsakou, Eugenia; Agathagelidis, Andreas; Boudjoghra, Myriam; Raff, Thorsten; Dagklis, Antonis; Chatzouli, Maria; Smilevska, Tatjana; Bourikas, George; Merle-Beral, Helene; Manioudaki-Kavallieratou, Eleni; Anagnostopoulos, Achilles; Brüggemann, Monika; Davi, Frederic; Stamatopoulos, Kostas; Belessi, Chrysoula
2012-01-01
The frequent occurrence of stereotyped heavy complementarity-determining region 3 (VH CDR3) sequences among unrelated cases with chronic lymphocytic leukemia (CLL) is widely taken as evidence for antigen selection. Stereotyped VH CDR3 sequences are often defined by the selective association of certain immunoglobulin heavy diversity (IGHD) genes in specific reading frames with certain immunoglobulin heavy joining (IGHJ ) genes. To gain insight into the mechanisms underlying VH CDR3 restrictions and also determine the developmental stage when restrictions in VH CDR3 are imposed, we analyzed partial IGHD-IGHJ rearrangements (D-J) in 829 CLL cases and compared the productively rearranged D-J joints (that is, in-frame junctions without junctional stop codons) to (a) the productive immunoglobulin heavy variable (IGHV )-IGHD-IGHJ rearrangements (V-D-J) from the same cases and (b) 174 D-J rearrangements from 160 precursor B-cell acute lymphoblastic leukemia cases (pre-B acute lymphoblastic leukemia [ALL]). Partial D-J rearrangements were detected in 272/829 CLL cases (32.8%). Sequence analysis was feasible in 238 of 272 D-J rearrangements; 198 of 238 (83.2%) were productively rearranged. The D-J joints in CLL did not differ significantly from those in pre-B ALL, except for higher frequency of the IGHD7-27 and IGHJ6 genes in the latter. Among CLL carrying productively rearranged D-J, comparison of the IGHD gene repertoire in productive V-D-J versus D-J revealed the following: (a) overuse of IGHD reading frames encoding hydrophilic peptides among V-D-J and (b) selection of the IGHD3-3 and IGHD6-19 genes in V-D-J junctions. These results document that the IGHD and IGHJ gene biases in the CLL expressed VH CDR3 repertoire are not stochastic but are directed by selection operating at the immunoglobulin protein level. PMID:21968789
Cloning, sequencing, and expression of cDNA for human. beta. -glucuronidase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oshima, A.; Kyle, J.W.; Miller, R.D.
1987-02-01
The authors report here the cDNA sequence for human placental ..beta..-glucuronidase (..beta..-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH/sub 2/-terminal amino acid sequence determined for human spleen ..beta..-glucuronidase agreed with that inferred from the DNAmore » sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human ..beta..-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human ..beta..-glucuronidase, demonstrate the existence of two populations of mRNA for ..beta..-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.« less
Horibata, Y; Okino, N; Ichinose, S; Omori, A; Ito, M
2000-10-06
Endoglycoceramidase (EC ) is an enzyme capable of cleaving the glycosidic linkage between oligosaccharides and ceramides in various glycosphingolipids. We report here the purification, characterization, and cDNA cloning of a novel endoglycoceramidase from the jellyfish, Cyanea nozakii. The purified enzyme showed a single protein band estimated to be 51 kDa on SDS-polyacrylamide gel electrophoresis. The enzyme showed a pH optimum of 3.0 and was activated by Triton X-100 and Lubrol PX but not by sodium taurodeoxycholate. This enzyme preferentially hydrolyzed gangliosides, especially GT1b and GQ1b, whereas neutral glycosphingolipids were somewhat resistant to hydrolysis by the enzyme. A full-length cDNA encoding the enzyme was cloned by 5'- and 3'-rapid amplification of cDNA ends using a partial amino acid sequence of the purified enzyme. The open reading frame of 1509 nucleotides encoded a polypeptide of 503 amino acids including a signal sequence of 25 residues and six potential N-glycosylation sites. Interestingly, the Asn-Glu-Pro sequence, which is the putative active site of Rhodococcus endoglycoceramidase, was conserved in the deduced amino acid sequences. This is the first report of the cloning of an endoglycoceramidase from a eukaryote.
Preparation of highly multiplexed small RNA sequencing libraries.
Persson, Helena; Søkilde, Rolf; Pirona, Anna Chiara; Rovira, Carlos
2017-08-01
MicroRNAs (miRNAs) are ~22-nucleotide-long small non-coding RNAs that regulate the expression of protein-coding genes by base pairing to partially complementary target sites, preferentially located in the 3´ untranslated region (UTR) of target mRNAs. The expression and function of miRNAs have been extensively studied in human disease, as well as the possibility of using these molecules as biomarkers for prognostication and treatment guidance. To identify and validate miRNAs as biomarkers, their expression must be screened in large collections of patient samples. Here, we develop a scalable protocol for the rapid and economical preparation of a large number of small RNA sequencing libraries using dual indexing for multiplexing. Combined with the use of off-the-shelf reagents, more samples can be sequenced simultaneously on large-scale sequencing platforms at a considerably lower cost per sample. Sample preparation is simplified by pooling libraries prior to gel purification, which allows for the selection of a narrow size range while minimizing sample variation. A comparison with publicly available data from benchmarking of miRNA analysis platforms showed that this method captures absolute and differential expression as effectively as commercially available alternatives.