Methods and apparatuses using filter banks for multi-carrier spread-spectrum signals
Moradi, Hussein; Farhang, Behrouz; Kutsche, Carl A
2014-10-14
A transmitter includes a synthesis filter bank to spread a data symbol to a plurality of frequencies by encoding the data symbol on each frequency, apply a common pulse-shaping filter, and apply gains to the frequencies such that a power level of each frequency is less than a noise level of other communication signals within the spectrum. Each frequency is modulated onto a different evenly spaced subcarrier. A demodulator in a receiver converts a radio frequency input to a spread-spectrum signal in a baseband. A matched filter filters the spread-spectrum signal with a common filter having characteristics matched to the synthesis filter bank in the transmitter by filtering each frequency to generate a sequence of narrow pulses. A carrier recovery unit generates control signals responsive to the sequence of narrow pulses suitable for generating a phase-locked loop between the demodulator, the matched filter, and the carrier recovery unit.
Methods and apparatuses using filter banks for multi-carrier spread-spectrum signals
Moradi, Hussein; Farhang, Behrouz; Kutsche, Carl A
2014-05-20
A transmitter includes a synthesis filter bank to spread a data symbol to a plurality of frequencies by encoding the data symbol on each frequency, apply a common pulse-shaping filter, and apply gains to the frequencies such that a power level of each frequency is less than a noise level of other communication signals within the spectrum. Each frequency is modulated onto a different evenly spaced subcarrier. A demodulator in a receiver converts a radio frequency input to a spread-spectrum signal in a baseband. A matched filter filters the spread-spectrum signal with a common filter having characteristics matched to the synthesis filter bank in the transmitter by filtering each frequency to generate a sequence of narrow pulses. A carrier recovery unit generates control signals responsive to the sequence of narrow pulses suitable for generating a phase-locked loop between the demodulator, the matched filter, and the carrier recovery unit.
Neutral details associated with emotional events are encoded: evidence from a cued recall paradigm.
Mickley Steinmetz, Katherine R; Knight, Aubrey G; Kensinger, Elizabeth A
2016-11-01
Enhanced emotional memory often comes at the cost of memory for surrounding background information. Narrowed-encoding theories suggest that this is due to narrowed attention for emotional information at encoding, leading to impaired encoding of background information. Recent work has suggested that an encoding-based theory may be insufficient. Here, we examined whether cued recall-instead of previously used recognition memory tasks-would reveal evidence that non-emotional information associated with emotional information was effectively encoded. Participants encoded positive, negative, or neutral objects on neutral backgrounds. At retrieval, they were given either the item or the background as a memory cue and were asked to recall the associated scene element. Counter to narrowed-encoding theories, emotional items were more likely than neutral items to trigger recall of the associated background. This finding suggests that there is a memory trace of this contextual information and that emotional cues may facilitate retrieval of this information.
Narrow-Host-Range Bacteriophages That Infect Rhizobium etli Associate with Distinct Genomic Types
Santamaría, Rosa Isela; Bustos, Patricia; Sepúlveda-Robles, Omar; Lozano, Luis; Rodríguez, César; Fernández, José Luis; Juárez, Soledad; Kameyama, Luis; Guarneros, Gabriel; Dávila, Guillermo
2014-01-01
In this work, we isolated and characterized 14 bacteriophages that infect Rhizobium etli. They were obtained from rhizosphere soil of bean plants from agricultural lands in Mexico using an enrichment method. The host range of these phages was narrow but variable within a collection of 48 R. etli strains. We obtained the complete genome sequence of nine phages. Four phages were resistant to several restriction enzymes and in vivo cloning, probably due to nucleotide modifications. The genome size of the sequenced phages varied from 43 kb to 115 kb, with a median size of ∼45 to 50 kb. A large proportion of open reading frames of these phage genomes (65 to 70%) consisted of hypothetical and orphan genes. The remainder encoded proteins needed for phage morphogenesis and DNA synthesis and processing, among other functions, and a minor percentage represented genes of bacterial origin. We classified these phages into four genomic types on the basis of their genomic similarity, gene content, and host range. Since there are no reports of similar sequences, we propose that these bacteriophages correspond to novel species. PMID:24185856
Chromosome-encoded narrow-spectrum Ambler class A beta-lactamase GIL-1 from Citrobacter gillenii.
Naas, Thierry; Aubert, Daniel; Ozcan, Ayla; Nordmann, Patrice
2007-04-01
A novel beta-lactamase gene was cloned from the whole-cell DNA of an enterobacterial Citrobacter gillenii reference strain that displayed a weak narrow-spectrum beta-lactam-resistant phenotype and was expressed in Escherichia coli. It encoded a clavulanic acid-inhibited Ambler class A beta-lactamase, GIL-1, with a pI value of 7.5 and a molecular mass of ca. 29 kDa. GIL-1 had the highest percent amino acid sequence identity with TEM-1 and SHV-1, 77%, and 67%, respectively, and only 46%, 31%, and 32% amino acid sequence identity with CKO-1 (C. koseri), CdiA1 (C. diversus), and SED-1 (C. sedlaki), respectively. The substrate profile of the purified GIL-1 was similar to that of beta-lactamases TEM-1 and SHV-1. The blaGIL-1 gene was chromosomally located, as revealed by I-CeuI experiments, and was constitutively expressed at a low level in C. gillenii. No gene homologous to the regulatory ampR genes of chromosomal class C beta-lactamases was found upstream of the blaGIL-1 gene, which fits the noninducibility of beta-lactamase expression in C. gillenii. Rapid amplification of DNA 5' ends analysis of the promoter region revealed putative promoter sequences that diverge from what has been identified as the consensus sequence in E. coli. The blaGIL-1 gene was part of a 5.5-kb DNA fragment bracketed by a 9-bp duplication and inserted between the d-lactate dehydrogenase gene and the ydbH genes; this DNA fragment was absent in other Citrobacter species. This work further illustrates the heterogeneity of beta-lactamases in Citrobacter spp., which may indicate that the variability of Citrobacter species is greater than expected.
Neutral Details Associated with Emotional Events are Encoded: Evidence from a Cued Recall Paradigm
Steinmetz, Katherine R. Mickley; Knight, Aubrey G.; Kensinger, Elizabeth A.
2015-01-01
Enhanced emotional memory often comes at the cost of memory for surrounding background information. Narrowed-encoding theories suggest that this is due to narrowed attention for emotional information at encoding, leading to impaired encoding of background information. Recent work has suggested that an encoding-based theory may be insufficient. Here, we examined whether cued recall – instead of previously used recognition memory tasks - would reveal evidence that non-emotional information associated with emotional information was effectively encoded. Participants encoded positive, negative, or neutral objects on neutral backgrounds. At retrieval, they were given either the item or the background as a memory cue and were asked to recall the associated scene element. Counter to narrowed-encoding theories, emotional items were more likely than neutral items to trigger recall of the associated background. This finding suggests that there is a memory trace of this contextual information and that emotional cues may facilitate retrieval of this information. PMID:26220708
Methods and apparatuses using filter banks for multi-carrier spread spectrum signals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moradi, Hussein; Farhang, Behrouz; Kutsche, Carl A
2017-01-31
A transmitter includes a synthesis filter bank to spread a data symbol to a plurality of frequencies by encoding the data symbol on each frequency, apply a common pulse-shaping filter, and apply gains to the frequencies such that a power level of each frequency is less than a noise level of other communication signals within the spectrum. Each frequency is modulated onto a different evenly spaced subcarrier. A demodulator in a receiver converts a radio frequency input to a spread-spectrum signal in a baseband. A matched filter filters the spread-spectrum signal with a common filter having characteristics matched to themore » synthesis filter bank in the transmitter by filtering each frequency to generate a sequence of narrow pulses. A carrier recovery unit generates control signals responsive to the sequence of narrow pulses suitable for generating a phase-locked loop between the demodulator, the matched filter, and the carrier recovery unit.« less
Methods and apparatuses using filter banks for multi-carrier spread spectrum signals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moradi, Hussein; Farhang, Behrouz; Kutsche, Carl A.
2016-06-14
A transmitter includes a synthesis filter bank to spread a data symbol to a plurality of frequencies by encoding the data symbol on each frequency, apply a common pulse-shaping filter, and apply gains to the frequencies such that a power level of each frequency is less than a noise level of other communication signals within the spectrum. Each frequency is modulated onto a different evenly spaced subcarrier. A demodulator in a receiver converts a radio frequency input to a spread-spectrum signal in a baseband. A matched filter filters the spread-spectrum signal with a common filter having characteristics matched to themore » synthesis filter bank in the transmitter by filtering each frequency to generate a sequence of narrow pulses. A carrier recovery unit generates control signals responsive to the sequence of narrow pulses suitable for generating a phase-locked loop between the demodulator, the matched filter, and the carrier recovery unit.« less
Taroncher-Oldenburg, Gaspar; Anderson, Donald M.
2000-01-01
Genes showing differential expression related to the early G1 phase of the cell cycle during synchronized circadian growth of the toxic dinoflagellate Alexandrium fundyense were identified and characterized by differential display (DD). The determination in our previous work that toxin production in Alexandrium is relegated to a narrow time frame in early G1 led to the hypothesis that transcriptionally up- or downregulated genes during this subphase of the cell cycle might be related to toxin biosynthesis. Three genes, encoding S-adenosylhomocysteine hydrolase (Sahh), methionine aminopeptidase (Map), and a histone-like protein (HAf), were isolated. Sahh was downregulated, while Map and HAf were upregulated, during the early G1 phase of the cell cycle. Sahh and Map encoded amino acid sequences with about 90 and 70% similarity to those encoded by several eukaryotic and prokaryotic Sahh and Map genes, respectively. The partial Map sequence also contained three cobalt binding motifs characteristic of all Map genes. HAf encoded an amino acid sequence with 60% similarity to those of two histone-like proteins from the dinoflagellate Crypthecodinium cohnii Biecheler. This study documents the potential of applying DD to the identification of genes that are related to physiological processes or cell cycle events in phytoplankton under conditions where small sample volumes represent an experimental constraint. The identification of an additional 21 genes with various cell cycle-related DD patterns also provides evidence for the importance of pretranslational or transcriptional regulation in dinoflagellates, contrary to previous reports suggesting the possibility that translational mechanisms are the primary means of circadian regulation in this group of organisms. PMID:10788388
2011-01-01
Background Lupinus angustifolius L, also known as narrow-leafed lupin (NLL), is becoming an important grain legume crop that is valuable for sustainable farming and is becoming recognised as a potential human health food. Recent interest is being directed at NLL to improve grain production, disease and pest management and health benefits of the grain. However, studies have been hindered by a lack of extensive genomic resources for the species. Results A NLL BAC library was constructed consisting of 111,360 clones with an average insert size of 99.7 Kbp from cv Tanjil. The library has approximately 12 × genome coverage. Both ends of 9600 randomly selected BAC clones were sequenced to generate 13985 BAC end-sequences (BESs), covering approximately 1% of the NLL genome. These BESs permitted a preliminary characterisation of the NLL genome such as organisation and composition, with the BESs having approximately 39% G:C content, 16.6% repetitive DNA and 5.4% putative gene-encoding regions. From the BESs 9966 simple sequence repeat (SSR) motifs were identified and some of these are shown to be potential markers. Conclusions The NLL BAC library and BAC-end sequences are powerful resources for genetic and genomic research on lupin. These resources will provide a robust platform for future high-resolution mapping, map-based cloning, comparative genomics and assembly of whole-genome sequencing data for the species. PMID:22014081
NASA Astrophysics Data System (ADS)
Demberg, Kerstin; Laun, Frederik Bernd; Windschuh, Johannes; Umathum, Reiner; Bachert, Peter; Kuder, Tristan Anselm
2017-02-01
Diffusion pore imaging is an extension of diffusion-weighted nuclear magnetic resonance imaging enabling the direct measurement of the shape of arbitrarily formed, closed pores by probing diffusion restrictions using the motion of spin-bearing particles. Examples of such pores comprise cells in biological tissue or oil containing cavities in porous rocks. All pores contained in the measurement volume contribute to one reconstructed image, which reduces the problem of vanishing signal at increasing resolution present in conventional magnetic resonance imaging. It has been previously experimentally demonstrated that pore imaging using a combination of a long and a narrow magnetic field gradient pulse is feasible. In this work, an experimental verification is presented showing that pores can be imaged using short gradient pulses only. Experiments were carried out using hyperpolarized xenon gas in well-defined pores. The phase required for pore image reconstruction was retrieved from double diffusion encoded (DDE) measurements, while the magnitude could either be obtained from DDE signals or classical diffusion measurements with single encoding. The occurring image artifacts caused by restrictions of the gradient system, insufficient diffusion time, and by the phase reconstruction approach were investigated. Employing short gradient pulses only is advantageous compared to the initial long-narrow approach due to a more flexible sequence design when omitting the long gradient and due to faster convergence to the diffusion long-time limit, which may enable application to larger pores.
The Effect of Divided Attention on Emotion-Induced Memory Narrowing
Steinmetz, Katherine R. Mickley; Waring, Jill D.; Kensinger, Elizabeth A.
2014-01-01
Individuals are more likely to remember emotional than neutral information, but this benefit does not always extend to the surrounding background information. This memory narrowing is theorized to be linked to the availability of attentional resources at encoding. In contrast to the predictions of this theoretical account, altering participants’ attentional resources at encoding, by dividing attention, did not affect the emotion-induced memory narrowing. Attention was divided using three separate manipulations: a digit ordering task (Experiment 1), an arithmetic task (Experiment 2), and an auditory discrimination task (Experiment 3). Across all three experiments, divided attention decreased memory across-the-board but did not affect the degree of memory narrowing. These findings suggest that theories to explain memory narrowing must be expanded to include other potential mechanisms beyond limitations of attentional resources. PMID:24295041
The effect of divided attention on emotion-induced memory narrowing.
Mickley Steinmetz, Katherine R; Waring, Jill D; Kensinger, Elizabeth A
2014-01-01
Individuals are more likely to remember emotional than neutral information, but this benefit does not always extend to the surrounding background information. This memory narrowing is theorised to be linked to the availability of attentional resources at encoding. In contrast to the predictions of this theoretical account, altering participants' attentional resources at encoding by dividing attention did not affect emotion-induced memory narrowing. Attention was divided using three separate manipulations: a digit ordering task (Experiment 1), an arithmetic task (Experiment 2) and an auditory discrimination task (Experiment 3). Across all three experiments, divided attention decreased memory across the board but did not affect the degree of memory narrowing. These findings suggest that theories to explain memory narrowing must be expanded to include other potential mechanisms beyond the limitations of attentional resources.
Poly A tail length analysis of in vitro transcribed mRNA by LC-MS.
Beverly, Michael; Hagen, Caitlin; Slack, Olga
2018-02-01
The 3'-polyadenosine (poly A) tail of in vitro transcribed (IVT) mRNA was studied using liquid chromatography coupled to mass spectrometry (LC-MS). Poly A tails were cleaved from the mRNA using ribonuclease T1 followed by isolation with dT magnetic beads. Extracted tails were then analyzed by LC-MS which provided tail length information at single-nucleotide resolution. A 2100-nt mRNA with plasmid-encoded poly A tail lengths of either 27, 64, 100, or 117 nucleotides was used for these studies as enzymatically added poly A tails showed significant length heterogeneity. The number of As observed in the tails closely matched Sanger sequencing results of the DNA template, and even minor plasmid populations with sequence variations were detected. When the plasmid sequence contained a discreet number of poly As in the tail, analysis revealed a distribution that included tails longer than the encoded tail lengths. These observations were consistent with transcriptional slippage of T7 RNAP taking place within a poly A sequence. The type of RNAP did not alter the observed tail distribution, and comparison of T3, T7, and SP6 showed all three RNAPs produced equivalent tail length distributions. The addition of a sequence at the 3' end of the poly A tail did, however, produce narrower tail length distributions which supports a previously described model of slippage where the 3' end can be locked in place by having a G or C after the poly nucleotide region. Graphical abstract Determination of mRNA poly A tail length using magnetic beads and LC-MS.
Vidgren, Virve; Ruohonen, Laura; Londesborough, John
2005-12-01
Maltose and maltotriose are the major sugars in brewer's wort. Brewer's yeasts contain multiple genes for maltose transporters. It is not known which of these express functional transporters. We correlated maltose transport kinetics with the genotypes of some ale and lager yeasts. Maltose transport by two ale strains was strongly inhibited by other alpha-glucosides, suggesting the use of broad substrate specificity transporters, such as Agt1p. Maltose transport by three lager strains was weakly inhibited by other alpha-glucosides, suggesting the use of narrow substrate specificity transporters. Hybridization studies showed that all five strains contained complete MAL1, MAL2, MAL3, and MAL4 loci, except for one ale strain, which lacked a MAL2 locus. All five strains also contained both AGT1 (coding a broad specificity alpha-glucoside transporter) and MAL11 alleles. MPH genes (maltose permease homologues) were present in the lager but not in the ale strains. During growth on maltose, the lager strains expressed AGT1 at low levels and MALx1 genes at high levels, whereas the ale strains expressed AGT1 at high levels and MALx1 genes at low levels. MPHx expression was negligible in all strains. The AGT1 sequences from the ale strains encoded full-length (616 amino acid) polypeptides, but those from both sequenced lager strains encoded truncated (394 amino acid) polypeptides that are unlikely to be functional transporters. Thus, despite the apparently similar genotypes of these ale and lager strains revealed by hybridization, maltose is predominantly carried by AGT1-encoded transporters in the ale strains and by MALx1-encoded transporters in the lager strains.
Haigler, B E; Suen, W C; Spain, J C
1996-01-01
4-Methyl-5-nitrocatechol (MNC) is an intermediate in the degradation of 2,4-dinitrotoluene by Burkholderia sp. strain DNT. In the presence of NADPH and oxygen, MNC monooxygenase catalyzes the removal of the nitro group from MNC to form 2-hydroxy-5-methylquinone. The gene (dntB) encoding MNC monooxygenase has been previously cloned and characterized. In order to examine the properties of MNC monooxygenase and to compare it with other enzymes, we sequenced the gene encoding the MNC monooxygenase and purified the enzyme from strain DNT. dntB was localized within a 2.2-kb ApaI DNA fragment. Sequence analysis of this fragment revealed an open reading frame of 1,644 bp with an N-terminal amino acid sequence identical to that of purified MNC monooxygenase from strain DNT. Comparison of the derived amino acid sequences with those of other genes showed that DntB contains the highly conserved ADP and flavin adenine dinucleotide (FAD) binding motifs characteristic of flavoprotein hydroxylases. MNC monooxygenase was purified to homogeneity from strain DNT by anion exchange and gel filtration chromatography. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a single protein with a molecular weight of 60,200, which is consistent with the size determined from the gene sequence. The native molecular weight determined by gel filtration was 65,000, which indicates that the native enzyme is a monomer. It used either NADH or NADPH as electron donors, and NADPH was the preferred cofactor. The purified enzyme contained 1 mol of FAD per mol of protein, which is also consistent with the detection of an FAD binding motif in the amino acid sequence of DntB. MNC monooxygenase has a narrow substrate specificity. MNC and 4-nitrocatechol are good substrates whereas 3-methyl-4-nitrophenol, 3-methyl-4-nitrocatechol, 4-nitrophenol, 3-nitrophenol, and 4-chlorocatechol were not. These studies suggest that MNC monooxygenase is a flavoprotein that shares some properties with previously studied nitrophenol oxygenases. PMID:8830701
Image domain propeller fast spin echo☆
Skare, Stefan; Holdsworth, Samantha J.; Lilja, Anders; Bammer, Roland
2013-01-01
A new pulse sequence for high-resolution T2-weighted (T2-w) imaging is proposed –image domain propeller fast spin echo (iProp-FSE). Similar to the T2-w PROPELLER sequence, iProp-FSE acquires data in a segmented fashion, as blades that are acquired in multiple TRs. However, the iProp-FSE blades are formed in the image domain instead of in the k-space domain. Each iProp-FSE blade resembles a single-shot fast spin echo (SSFSE) sequence with a very narrow phase-encoding field of view (FOV), after which N rotated blade replicas yield the final full circular FOV. Our method of combining the image domain blade data to a full FOV image is detailed, and optimal choices of phase-encoding FOVs and receiver bandwidths were evaluated on phantom and volunteers. The results suggest that a phase FOV of 15–20%, a receiver bandwidth of ±32–63 kHz and a subsequent readout time of about 300 ms provide a good tradeoff between signal-to-noise ratio (SNR) efficiency and T2 blurring. Comparisons between iProp-FSE, Cartesian FSE and PROPELLER were made on single-slice axial brain data, showing similar T2-w tissue contrast and SNR with great anatomical conspicuity at similar scan times –without colored noise or streaks from motion. A new slice interleaving order is also proposed to improve the multislice capabilities of iProp-FSE. PMID:23200683
Image domain propeller fast spin echo.
Skare, Stefan; Holdsworth, Samantha J; Lilja, Anders; Bammer, Roland
2013-04-01
A new pulse sequence for high-resolution T2-weighted (T2-w) imaging is proposed - image domain propeller fast spin echo (iProp-FSE). Similar to the T2-w PROPELLER sequence, iProp-FSE acquires data in a segmented fashion, as blades that are acquired in multiple TRs. However, the iProp-FSE blades are formed in the image domain instead of in the k-space domain. Each iProp-FSE blade resembles a single-shot fast spin echo (SSFSE) sequence with a very narrow phase-encoding field of view (FOV), after which N rotated blade replicas yield the final full circular FOV. Our method of combining the image domain blade data to a full FOV image is detailed, and optimal choices of phase-encoding FOVs and receiver bandwidths were evaluated on phantom and volunteers. The results suggest that a phase FOV of 15-20%, a receiver bandwidth of ±32-63 kHz and a subsequent readout time of about 300 ms provide a good tradeoff between signal-to-noise ratio (SNR) efficiency and T2 blurring. Comparisons between iProp-FSE, Cartesian FSE and PROPELLER were made on single-slice axial brain data, showing similar T2-w tissue contrast and SNR with great anatomical conspicuity at similar scan times - without colored noise or streaks from motion. A new slice interleaving order is also proposed to improve the multislice capabilities of iProp-FSE. Copyright © 2013 Elsevier Inc. All rights reserved.
Compressing DNA sequence databases with coil.
White, W Timothy J; Hendy, Michael D
2008-05-20
Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression - an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression - the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.
Compressing DNA sequence databases with coil
White, W Timothy J; Hendy, Michael D
2008-01-01
Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work. PMID:18489794
Suppression of a NAC-like transcription factor gene improves boron-toxicity tolerance in rice.
Ochiai, Kumiko; Shimizu, Akifumi; Okumoto, Yutaka; Fujiwara, Toru; Matoh, Toru
2011-07-01
We identified a gene responsible for tolerance to boron (B) toxicity in rice (Oryza sativa), named BORON EXCESS TOLERANT1. Using recombinant inbred lines derived from the B-toxicity-sensitive indica-ecotype cultivar IR36 and the tolerant japonica-ecotype cultivar Nekken 1, the region responsible for tolerance to B toxicity was narrowed to 49 kb on chromosome 4. Eight genes are annotated in this region. The DNA sequence in this region was compared between the B-toxicity-sensitive japonica cultivar Wataribune and the B-toxicity-tolerant japonica cultivar Nipponbare by eco-TILLING analysis and revealed a one-base insertion mutation in the open reading frame sequence of the gene Os04g0477300. The gene encodes a NAC (NAM, ATAF, and CUC)-like transcription factor and the function of the transcript is abolished in B-toxicity-tolerant cultivars. Transgenic plants in which the expression of Os04g0477300 is abolished by RNA interference gain tolerance to B toxicity.
Suppression of a NAC-Like Transcription Factor Gene Improves Boron-Toxicity Tolerance in Rice1
Ochiai, Kumiko; Shimizu, Akifumi; Okumoto, Yutaka; Fujiwara, Toru; Matoh, Toru
2011-01-01
We identified a gene responsible for tolerance to boron (B) toxicity in rice (Oryza sativa), named BORON EXCESS TOLERANT1. Using recombinant inbred lines derived from the B-toxicity-sensitive indica-ecotype cultivar IR36 and the tolerant japonica-ecotype cultivar Nekken 1, the region responsible for tolerance to B toxicity was narrowed to 49 kb on chromosome 4. Eight genes are annotated in this region. The DNA sequence in this region was compared between the B-toxicity-sensitive japonica cultivar Wataribune and the B-toxicity-tolerant japonica cultivar Nipponbare by eco-TILLING analysis and revealed a one-base insertion mutation in the open reading frame sequence of the gene Os04g0477300. The gene encodes a NAC (NAM, ATAF, and CUC)-like transcription factor and the function of the transcript is abolished in B-toxicity-tolerant cultivars. Transgenic plants in which the expression of Os04g0477300 is abolished by RNA interference gain tolerance to B toxicity. PMID:21543724
Liu, Zhanliang; Ma, Ping; Holtsmark, Ingrid; Skaugen, Morten; Eijsink, Vincent G. H.
2013-01-01
It has previously been shown that the tomato pathogen Clavibacter michiganensis subsp. michiganensis secretes a 14-kDa protein, C. michiganensis subsp. michiganensis AMP-I (CmmAMP-I), that inhibits growth of Clavibacter michiganensis subsp. sepedonicus, the causal agent of bacterial ring rot of potato. Using sequences obtained from tryptic fragments, we have identified the gene encoding CmmAMP-I and we have recombinantly produced the protein with an N-terminal intein tag. The gene sequence showed that CmmAMP-I contains a typical N-terminal signal peptide for Sec-dependent secretion. The recombinant protein was highly active, with 50% growth inhibition (IC50) of approximately 10 pmol, but was not toxic to potato leaves or tubers. CmmAMP-I does not resemble any known protein and thus represents a completely new type of bacteriocin. Due to its high antimicrobial activity and its very narrow inhibitory spectrum, CmmAMP-1 may be of interest in combating potato ring rot disease. PMID:23851100
Local alignment of two-base encoded DNA sequence
Homer, Nils; Merriman, Barry; Nelson, Stanley F
2009-01-01
Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732
Neural Encoding and Integration of Learned Probabilistic Sequences in Avian Sensory-Motor Circuitry
Brainard, Michael S.
2013-01-01
Many complex behaviors, such as human speech and birdsong, reflect a set of categorical actions that can be flexibly organized into variable sequences. However, little is known about how the brain encodes the probabilities of such sequences. Behavioral sequences are typically characterized by the probability of transitioning from a given action to any subsequent action (which we term “divergence probability”). In contrast, we hypothesized that neural circuits might encode the probability of transitioning to a given action from any preceding action (which we term “convergence probability”). The convergence probability of repeatedly experienced sequences could naturally become encoded by Hebbian plasticity operating on the patterns of neural activity associated with those sequences. To determine whether convergence probability is encoded in the nervous system, we investigated how auditory-motor neurons in vocal premotor nucleus HVC of songbirds encode different probabilistic characterizations of produced syllable sequences. We recorded responses to auditory playback of pseudorandomly sequenced syllables from the bird's repertoire, and found that variations in responses to a given syllable could be explained by a positive linear dependence on the convergence probability of preceding sequences. Furthermore, convergence probability accounted for more response variation than other probabilistic characterizations, including divergence probability. Finally, we found that responses integrated over >7–10 syllables (∼700–1000 ms) with the sign, gain, and temporal extent of integration depending on convergence probability. Our results demonstrate that convergence probability is encoded in sensory-motor circuitry of the song-system, and suggest that encoding of convergence probability is a general feature of sensory-motor circuits. PMID:24198363
Ferry, Alissa L; Fló, Ana; Brusini, Perrine; Cattarossi, Luigi; Macagno, Francesco; Nespor, Marina; Mehler, Jacques
2016-05-01
To understand language, humans must encode information from rapid, sequential streams of syllables - tracking their order and organizing them into words, phrases, and sentences. We used Near-Infrared Spectroscopy (NIRS) to determine whether human neonates are born with the capacity to track the positions of syllables in multisyllabic sequences. After familiarization with a six-syllable sequence, the neonate brain responded to the change (as shown by an increase in oxy-hemoglobin) when the two edge syllables switched positions but not when two middle syllables switched positions (Experiment 1), indicating that they encoded the syllables at the edges of sequences better than those in the middle. Moreover, when a 25 ms pause was inserted between the middle syllables as a segmentation cue, neonates' brains were sensitive to the change (Experiment 2), indicating that subtle cues in speech can signal a boundary, with enhanced encoding of the syllables located at the edges of that boundary. These findings suggest that neonates' brains can encode information from multisyllabic sequences and that this encoding is constrained. Moreover, subtle segmentation cues in a sequence of syllables provide a mechanism with which to accurately encode positional information from longer sequences. Tracking the order of syllables is necessary to understand language and our results suggest that the foundations for this encoding are present at birth. © 2015 John Wiley & Sons Ltd.
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian
2014-03-18
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
Deciphering the Hidden Informational Content of Protein Sequences
Liu, Ming; Hua, Qing-xin; Hu, Shi-Quan; Jia, Wenhua; Yang, Yanwu; Saith, Sunil Evan; Whittaker, Jonathan; Arvan, Peter; Weiss, Michael A.
2010-01-01
Protein sequences encode both structure and foldability. Whereas the interrelationship of sequence and structure has been extensively investigated, the origins of folding efficiency are enigmatic. We demonstrate that the folding of proinsulin requires a flexible N-terminal hydrophobic residue that is dispensable for the structure, activity, and stability of the mature hormone. This residue (PheB1 in placental mammals) is variably positioned within crystal structures and exhibits 1H NMR motional narrowing in solution. Despite such flexibility, its deletion impaired insulin chain combination and led in cell culture to formation of non-native disulfide isomers with impaired secretion of the variant proinsulin. Cellular folding and secretion were maintained by hydrophobic substitutions at B1 but markedly perturbed by polar or charged side chains. We propose that, during folding, a hydrophobic side chain at B1 anchors transient long-range interactions by a flexible N-terminal arm (residues B1–B8) to mediate kinetic or thermodynamic partitioning among disulfide intermediates. Evidence for the overall contribution of the arm to folding was obtained by alanine scanning mutagenesis. Together, our findings demonstrate that efficient folding of proinsulin requires N-terminal sequences that are dispensable in the native state. Such arm-dependent folding can be abrogated by mutations associated with β-cell dysfunction and neonatal diabetes mellitus. PMID:20663888
Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids
2011-01-01
Background Orchids are one of the most diversified angiosperms, but few genomic resources are available for these non-model plants. In addition to the ecological significance, Phalaenopsis has been considered as an economically important floriculture industry worldwide. We aimed to use massively parallel 454 pyrosequencing for a global characterization of the Phalaenopsis transcriptome. Results To maximize sequence diversity, we pooled RNA from 10 samples of different tissues, various developmental stages, and biotic- or abiotic-stressed plants. We obtained 206,960 expressed sequence tags (ESTs) with an average read length of 228 bp. These reads were assembled into 8,233 contigs and 34,630 singletons. The unigenes were searched against the NCBI non-redundant (NR) protein database. Based on sequence similarity with known proteins, these analyses identified 22,234 different genes (E-value cutoff, e-7). Assembled sequences were annotated with Gene Ontology, Gene Family and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Among these annotations, over 780 unigenes encoding putative transcription factors were identified. Conclusion Pyrosequencing was effective in identifying a large set of unigenes from Phalaenopsis. The informative EST dataset we developed constitutes a much-needed resource for discovery of genes involved in various biological processes in Phalaenopsis and other orchid species. These transcribed sequences will narrow the gap between study of model organisms with many genomic resources and species that are important for ecological and evolutionary studies. PMID:21749684
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.
Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M
1991-02-15
The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
Metabolism of β-valine via a CoA-dependent ammonia lyase pathway.
Otzen, Marleen; Crismaru, Ciprian G; Postema, Christiaan P; Wijma, Hein J; Heberling, Matthew M; Szymanski, Wiktor; de Wildeman, Stefaan; Janssen, Dick B
2015-11-01
Pseudomonas species strain SBV1 can rapidly grow on medium containing β-valine as a sole nitrogen source. The tertiary amine feature of β-valine prevents direct deamination reactions catalyzed by aminotransferases, amino acid dehydrogenases, and amino acid oxidases. However, lyase- or aminomutase-mediated conversions would be possible. To identify enzymes involved in the degradation of β-valine, a PsSBV1 gene library was prepared and used to complement the β-valine growth deficiency of a closely related Pseudomonas strain. This resulted in the identification of a gene encoding β-valinyl-coenzyme A ligase (BvaA) and two genes encoding β-valinyl-CoA ammonia lyases (BvaB1 and BvaB2). The BvaA protein demonstrated high sequence identity to several known phenylacetate CoA ligases. Purified BvaA enzyme did not convert phenyl acetic acid but was able to activate β-valine in an adenosine triphosphate (ATP)- and CoA-dependent manner. The substrate range of the enzyme appears to be narrow, converting only β-valine and to a lesser extent, 3-aminobutyrate and β-alanine. Characterization of BvaB1 and BvaB2 revealed that both enzymes were able to deaminate β-valinyl-CoA to produce 3-methylcrotonyl-CoA, a common intermediate in the leucine degradation pathway. Interestingly, BvaB1 and BvaB2 demonstrated no significant sequence identity to known CoA-dependent ammonia lyases, suggesting they belong to a new family of enzymes. BLAST searches revealed that BvaB1 and BvaB2 show high sequence identity to each other and to several enoyl-CoA hydratases, a class of enzymes that catalyze a similar reaction with water instead of amine as the leaving group.
Staphylococcus aureus innate immune evasion is lineage-specific: a bioinfomatics study.
McCarthy, Alex J; Lindsay, Jodi A
2013-10-01
Staphylococcus aureus is a major human pathogen, and is targeted by the host innate immune system. In response, S. aureus genomes encode dozens of secreted proteins that inhibit complement, chemotaxis and neutrophil activation resulting in successful evasion of innate immune responses. These proteins include immune evasion cluster proteins (IEC; Chp, Sak, Scn), staphylococcal superantigen-like proteins (SSLs), phenol soluble modulins (PSMs) and several leukocidins. Biochemical studies have indicated that genetic variants of these proteins can have unique functions. To ascertain the scale of genetic variation in secreted immune evasion proteins, whole genome sequences of 88 S. aureus isolates, representing 25 clonal complex (CC) lineages, in the public domain were analysed across 43 genes encoding 38 secreted innate immune evasion protein complexes. Twenty-three genes were variable, with between 2 and 15 variants, and the variants had lineage-specific distributions. They include genes encoding Eap, Ecb, Efb, Flipr/Flipr-like, Hla, Hld, Hlg, Sbi, Scin-B/C and 13 SSLs. Most of these protein complexes inhibit complement, chemotaxis and neutrophil activation suggesting that isolates from each S. aureus lineage respond to the innate immune system differently. In contrast, protein complexes that lyse neutrophils (LukSF-PVL, LukMF, LukED and PSMs) were highly conserved, but can be carried on mobile genetic elements (MGEs). MGEs also encode proteins with narrow host-specificities arguing that their acquisition has important roles in host/environmental adaptation. In conclusion, this data suggests that each lineage of S. aureus evades host immune responses differently, and that isolates can adapt to new host environments by acquiring MGEs and the immune evasion protein complexes that they encode. Cocktail therapeutics that targets multiple variant proteins may be the most appropriate strategy for controlling S. aureus infections. Copyright © 2013 Elsevier B.V. All rights reserved.
Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech
2015-01-01
Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
Towards predicting the encoding capability of MR fingerprinting sequences.
Sommer, K; Amthor, T; Doneva, M; Koken, P; Meineke, J; Börnert, P
2017-09-01
Sequence optimization and appropriate sequence selection is still an unmet need in magnetic resonance fingerprinting (MRF). The main challenge in MRF sequence design is the lack of an appropriate measure of the sequence's encoding capability. To find such a measure, three different candidates for judging the encoding capability have been investigated: local and global dot-product-based measures judging dictionary entry similarity as well as a Monte Carlo method that evaluates the noise propagation properties of an MRF sequence. Consistency of these measures for different sequence lengths as well as the capability to predict actual sequence performance in both phantom and in vivo measurements was analyzed. While the dot-product-based measures yielded inconsistent results for different sequence lengths, the Monte Carlo method was in a good agreement with phantom experiments. In particular, the Monte Carlo method could accurately predict the performance of different flip angle patterns in actual measurements. The proposed Monte Carlo method provides an appropriate measure of MRF sequence encoding capability and may be used for sequence optimization. Copyright © 2017 Elsevier Inc. All rights reserved.
Working Memory Replay Prioritizes Weakly Attended Events.
Jafarpour, Anna; Penny, Will; Barnes, Gareth; Knight, Robert T; Duzel, Emrah
2017-01-01
One view of working memory posits that maintaining a series of events requires their sequential and equal mnemonic replay. Another view is that the content of working memory maintenance is prioritized by attention. We decoded the dynamics for retaining a sequence of items using magnetoencephalography, wherein participants encoded sequences of three stimuli depicting a face, a manufactured object, or a natural item and maintained them in working memory for 5000 ms. Memory for sequence position and stimulus details were probed at the end of the maintenance period. Decoding of brain activity revealed that one of the three stimuli dominated maintenance independent of its sequence position or category; and memory was enhanced for the selectively replayed stimulus. Analysis of event-related responses during the encoding of the sequence showed that the selectively replayed stimuli were determined by the degree of attention at encoding. The selectively replayed stimuli had the weakest initial encoding indexed by weaker visual attention signals at encoding. These findings do not rule out sequential mnemonic replay but reveal that attention influences the content of working memory maintenance by prioritizing replay of weakly encoded events. We propose that the prioritization of weakly encoded stimuli protects them from interference during the maintenance period, whereas the more strongly encoded stimuli can be retrieved from long-term memory at the end of the delay period.
Nucleotide sequences encoding a thermostable alkaline protease
Wilson, David B.; Lao, Guifang
1998-01-01
Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium.
Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R
1992-01-01
The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. Images PMID:1569011
Nucleotide sequences encoding a thermostable alkaline protease
Wilson, D.B.; Lao, G.
1998-01-06
Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium. 3 figs.
ERIC Educational Resources Information Center
Hasselmo, Michael E.
2007-01-01
Many memory models focus on encoding of sequences by excitatory recurrent synapses in region CA3 of the hippocampus. However, data and modeling suggest an alternate mechanism for encoding of sequences in which interference between theta frequency oscillations encodes the position within a sequence based on spatial arc length or time. Arc length…
Characterization and gene cloning of the rice (Oryza sativa L.) dwarf and narrow-leaf mutant dnl3.
Shi, L; Wei, X J; Adedze, Y M N; Sheng, Z H; Tang, S Q; Hu, P S; Wang, J L
2016-09-16
The dwarf and narrow-leaf rice (Oryza sativa L.) mutant dnl3 was isolated from the Japonica cultivar Zhonghua 11 (wild-type). dnl3 exhibited pleiotropic developmental defects. The narrow-leaf phenotype resulted from a marked reduction in the number of vascular bundles, while the dwarf stature was caused by the formation of foreshortened internodes and a reduced number of parenchyma cells. The suggestion that cell division is impaired in the mutant was consistent with the transcriptional behavior of various genes associated with cell division. The mutant was less responsive to exogenously supplied gibberellic acid than the wild-type, and profiling the transcription of genes involved in gibberellin synthesis and response revealed that a lesion in the mutant affected gibberellin signal transduction. The dnl3 phenotype was inherited as a single-dominant gene, mapping within a 19.1-kb region of chromosome 12, which was found to harbor three open reading frames. Resequencing the open reading frames revealed that the mutant carried an allele at one of the three genes that differed from the wild-type sequence by 2-bp deletions; this gene encoded a cellulose synthase-like D4 (CSLD4) protein. Therefore, OsCSLD4 is a candidate gene for DNL3. DNL3 was expressed in all of the rice organs tested at the heading stage, particularly in the leaves, roots, and culms. These results suggest that DNL3 plays important roles in rice leaf morphogenesis and vegetative development.
Working Memory Replay Prioritizes Weakly Attended Events
Penny, Will; Knight, Robert T.; Duzel, Emrah
2017-01-01
Abstract One view of working memory posits that maintaining a series of events requires their sequential and equal mnemonic replay. Another view is that the content of working memory maintenance is prioritized by attention. We decoded the dynamics for retaining a sequence of items using magnetoencephalography, wherein participants encoded sequences of three stimuli depicting a face, a manufactured object, or a natural item and maintained them in working memory for 5000 ms. Memory for sequence position and stimulus details were probed at the end of the maintenance period. Decoding of brain activity revealed that one of the three stimuli dominated maintenance independent of its sequence position or category; and memory was enhanced for the selectively replayed stimulus. Analysis of event-related responses during the encoding of the sequence showed that the selectively replayed stimuli were determined by the degree of attention at encoding. The selectively replayed stimuli had the weakest initial encoding indexed by weaker visual attention signals at encoding. These findings do not rule out sequential mnemonic replay but reveal that attention influences the content of working memory maintenance by prioritizing replay of weakly encoded events. We propose that the prioritization of weakly encoded stimuli protects them from interference during the maintenance period, whereas the more strongly encoded stimuli can be retrieved from long-term memory at the end of the delay period. PMID:28824955
Toward a Better Compression for DNA Sequences Using Huffman Encoding
Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi
2017-01-01
Abstract Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016). PMID:27960065
Toward a Better Compression for DNA Sequences Using Huffman Encoding.
Al-Okaily, Anas; Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi
2017-04-01
Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016 ).
Clute, Shalyn C.; Naumov, Yuri N.; Watkin, Levi B.; Aslan, Nuray; Sullivan, John L.; Thorley-Lawson, David A.; Luzuriaga, Katherine; Welsh, Raymond M.; Puzone, Roberto; Celada, Franco; Selin, Liisa K.
2013-01-01
Memory T cells cross-reactive with epitopes encoded by related or even unrelated viruses may alter the immune response and pathogenesis of infection by a process known as heterologous immunity. Because a challenge virus epitope may react with only a subset of the T cell repertoire in a cross-reactive epitope-specific memory pool, the vigorous cross-reactive response may be narrowly focused, or oligoclonal. We show here, by examining human T cell cross-reactivity between the HLA-A2-restricted influenza A virus-encoded M158-66 epitope (GILGFVFTL) and the dissimilar Epstein-Barr virus-encoded BMLF1280-288 epitope (GLCTLVAML), that under some conditions heterologous immunity can lead to a significant broadening rather than a narrowing of the T cell receptor repertoire. We suggest that dissimilar cross-reactive epitopes might generate a broad rather than narrow T cell repertoire if there is a lack of dominant high affinity clones, and this hypothesis is supported by computer simulation. PMID:21048112
Muller, Ryan Y; Hammond, Ming C; Rio, Donald C; Lee, Yeon J
2015-12-01
The Encyclopedia of DNA Elements (ENCODE) Project aims to identify all functional sequence elements in the human genome sequence by use of high-throughput DNA/cDNA sequencing approaches. To aid the standardization, comparison, and integration of data sets produced from different technologies and platforms, the ENCODE Consortium selected several standard human cell lines to be used by the ENCODE Projects. The Tier 1 ENCODE cell lines include GM12878, K562, and H1 human embryonic stem cell lines. GM12878 is a lymphoblastoid cell line, transformed with the Epstein-Barr virus, that was selected by the International HapMap Project for whole genome and transcriptome sequencing by use of the Illumina platform. K562 is an immortalized myelogenous leukemia cell line. The GM12878 cell line is attractive for the ENCODE Projects, as it offers potential synergy with the International HapMap Project. Despite the vast amount of sequencing data available on the GM12878 cell line through the ENCODE Project, including transcriptome, chromatin immunoprecipitation-sequencing for histone marks, and transcription factors, no small interfering siRNA-mediated knockdown studies have been performed in the GM12878 cell line, as cationic lipid-mediated transfection methods are inefficient for lymphoid cell lines. Here, we present an efficient and reproducible method for transfection of a variety of siRNAs into the GM12878 and K562 cell lines, which subsequently results in targeted protein depletion.
Cozens, A L; Walker, J E
1986-01-01
The nucleotide sequence has been determined of a segment of 4680 bases of the pea chloroplast genome. It adjoins a sequence described elsewhere that encodes subunits of the F0 membrane domain of the ATP-synthase complex. The sequence contains a potential gene encoding a protein which is strongly related to the S2 polypeptide of Escherichia coli ribosomes. It also encodes an incomplete protein which contains segments that are homologous to the beta'-subunit of E. coli RNA polymerase and to yeast RNA polymerases II and III. PMID:3530249
Disruption of Boundary Encoding During Sensorimotor Sequence Learning: An MEG Study.
Michail, Georgios; Nikulin, Vadim V; Curio, Gabriel; Maess, Burkhard; Herrojo Ruiz, María
2018-01-01
Music performance relies on the ability to learn and execute actions and their associated sounds. The process of learning these auditory-motor contingencies depends on the proper encoding of the serial order of the actions and sounds. Among the different serial positions of a behavioral sequence, the first and last (boundary) elements are particularly relevant. Animal and patient studies have demonstrated a specific neural representation for boundary elements in prefrontal cortical regions and in the basal ganglia, highlighting the relevance of their proper encoding. The neural mechanisms underlying the encoding of sequence boundaries in the general human population remain, however, largely unknown. In this study, we examined how alterations of auditory feedback, introduced at different ordinal positions (boundary or within-sequence element), affect the neural and behavioral responses during sensorimotor sequence learning. Analysing the neuromagnetic signals from 20 participants while they performed short piano sequences under the occasional effect of altered feedback (AF), we found that at around 150-200 ms post-keystroke, the neural activities in the dorsolateral prefrontal cortex (DLPFC) and supplementary motor area (SMA) were dissociated for boundary and within-sequence elements. Furthermore, the behavioral data demonstrated that feedback alterations on boundaries led to greater performance costs, such as more errors in the subsequent keystrokes. These findings jointly support the idea that the proper encoding of boundaries is critical in acquiring sensorimotor sequences. They also provide evidence for the involvement of a distinct neural circuitry in humans including prefrontal and higher-order motor areas during the encoding of the different classes of serial order.
Differences in reward processing between putative cell types in primate prefrontal cortex
Fan, Hongwei; Wang, Rubin; Sakagami, Masamichi
2017-01-01
Single-unit studies in monkeys have demonstrated that neurons in the prefrontal cortex predict the reward type, reward amount or reward availability associated with a stimulus. To examine contributions of pyramidal cells and interneurons in reward processing, single-unit activity was extracellularly recorded in prefrontal cortices of four monkeys performing a reward prediction task. Based on their shapes of spike waveforms, prefrontal neurons were classified into broad-spike and narrow-spike units that represented putative pyramidal cells and interneurons, respectively. We mainly observed that narrow-spike neurons showed higher firing rates but less bursty discharges than did broad-spike neurons. Both narrow-spike and broad-spike cells selectively responded to the stimulus, reward and their interaction, and the proportions of each type of selective neurons were similar between the two cell classes. Moreover, the two types of cells displayed equal reliability of reward or stimulus discrimination. Furthermore, we found that broad-spike and narrow-spike cells showed distinct mechanisms for encoding reward or stimulus information. Broad-spike neurons raised their firing rate relative to the baseline rate to represent the preferred reward or stimulus information, whereas narrow-spike neurons inhibited their firing rate lower than the baseline rate to encode the non-preferred reward or stimulus information. Our results suggest that narrow-spike and broad-spike cells were equally involved in reward and stimulus processing in the prefrontal cortex. They utilized a binary strategy to complementarily represent reward or stimulus information, which was consistent with the task structure in which the monkeys were required to remember two reward conditions and two visual stimuli. PMID:29261734
Differences in reward processing between putative cell types in primate prefrontal cortex.
Fan, Hongwei; Pan, Xiaochuan; Wang, Rubin; Sakagami, Masamichi
2017-01-01
Single-unit studies in monkeys have demonstrated that neurons in the prefrontal cortex predict the reward type, reward amount or reward availability associated with a stimulus. To examine contributions of pyramidal cells and interneurons in reward processing, single-unit activity was extracellularly recorded in prefrontal cortices of four monkeys performing a reward prediction task. Based on their shapes of spike waveforms, prefrontal neurons were classified into broad-spike and narrow-spike units that represented putative pyramidal cells and interneurons, respectively. We mainly observed that narrow-spike neurons showed higher firing rates but less bursty discharges than did broad-spike neurons. Both narrow-spike and broad-spike cells selectively responded to the stimulus, reward and their interaction, and the proportions of each type of selective neurons were similar between the two cell classes. Moreover, the two types of cells displayed equal reliability of reward or stimulus discrimination. Furthermore, we found that broad-spike and narrow-spike cells showed distinct mechanisms for encoding reward or stimulus information. Broad-spike neurons raised their firing rate relative to the baseline rate to represent the preferred reward or stimulus information, whereas narrow-spike neurons inhibited their firing rate lower than the baseline rate to encode the non-preferred reward or stimulus information. Our results suggest that narrow-spike and broad-spike cells were equally involved in reward and stimulus processing in the prefrontal cortex. They utilized a binary strategy to complementarily represent reward or stimulus information, which was consistent with the task structure in which the monkeys were required to remember two reward conditions and two visual stimuli.
Mollusk genes encoding lysine tRNA (UUU) contain introns.
Matsuo, M; Abe, Y; Saruta, Y; Okada, N
1995-11-20
New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Chi, E-mail: chizheung@gmail.com; Xu, Yiqing; Wei, Xiaoming
2014-07-28
Time-stretch microscopy has emerged as an ultrafast optical imaging concept offering the unprecedented combination of the imaging speed and sensitivity. However, dedicated wideband and coherence optical pulse source with high shot-to-shot stability has been mandated for time-wavelength mapping—the enabling process for ultrahigh speed wavelength-encoded image retrieval. From the practical point of view, exploiting methods to relax the stringent requirements (e.g., temporal stability and coherence) for the source of time-stretch microscopy is thus of great value. In this paper, we demonstrated time-stretch microscopy by reconstructing the time-wavelength mapping sequence from a wideband incoherent source. Utilizing the time-lens focusing mechanism mediated bymore » a narrow-band pulse source, this approach allows generation of a wideband incoherent source, with the spectral efficiency enhanced by a factor of 18. As a proof-of-principle demonstration, time-stretch imaging with the scan rate as high as MHz and diffraction-limited resolution is achieved based on the wideband incoherent source. We note that the concept of time-wavelength sequence reconstruction from wideband incoherent source can also be generalized to any high-speed optical real-time measurements, where wavelength is acted as the information carrier.« less
Grohmann, L; Brennicke, A; Schuster, W
1992-01-01
The Oenothera mitochondrial genome contains only a gene fragment for ribosomal protein S12 (rps12), while other plants encode a functional gene in the mitochondrion. The complete Oenothera rps12 gene is located in the nucleus. The transit sequence necessary to target this protein to the mitochondrion is encoded by a 5'-extension of the open reading frame. Comparison of the amino acid sequence encoded by the nuclear gene with the polypeptides encoded by edited mitochondrial cDNA and genomic sequences of other plants suggests that gene transfer between mitochondrion and nucleus started from edited mitochondrial RNA molecules. Mechanisms and requirements of gene transfer and activation are discussed. Images PMID:1454526
Gawryluk, Ryan M R; Chisholm, Kenneth A; Pinto, Devanand M; Gray, Michael W
2014-09-23
We present a combined proteomic and bioinformatic investigation of mitochondrial proteins from the amoeboid protist Acanthamoeba castellanii, the first such comprehensive investigation in a free-living member of the supergroup Amoebozoa. This protist was chosen both for its phylogenetic position (as a sister to animals and fungi) and its ecological ubiquity and physiological flexibility. We report 1033 A. castellanii mitochondrial protein sequences, 709 supported by mass spectrometry data (676 nucleus-encoded and 33 mitochondrion-encoded), including two previously unannotated mtDNA-encoded proteins, which we identify as highly divergent mitochondrial ribosomal proteins. Other notable findings include duplicate proteins for all of the enzymes of the tricarboxylic acid (TCA) cycle-which, along with the identification of a mitochondrial malate synthase-isocitrate lyase fusion protein, suggests the interesting possibility that the glyoxylate cycle operates in A. castellanii mitochondria. Additionally, the A. castellanii genome encodes an unusually high number (at least 29) of mitochondrion-targeted pentatricopeptide repeat (PPR) proteins, organellar RNA metabolism factors in other organisms. We discuss several key mitochondrial pathways, including DNA replication, transcription and translation, protein degradation, protein import and Fe-S cluster biosynthesis, highlighting similarities and differences in these pathways in other eukaryotes. In compositional and functional complexity, the mitochondrial proteome of A. castellanii rivals that of multicellular eukaryotes. Comprehensive proteomic surveys of mitochondria have been undertaken in a limited number of predominantly multicellular eukaryotes. This phylogenetically narrow perspective constrains and biases our insights into mitochondrial function and evolution, as it neglects protists, which account for most of the evolutionary and functional diversity within eukaryotes. We report here the first comprehensive investigation of the mitochondrial proteome in a member (A. castellanii) of the eukaryotic supergroup Amoebozoa. Through a combination of tandem mass spectrometry (MS/MS) and in silico data mining, we have retrieved 1033 candidate mitochondrial protein sequences, 709 having MS support. These data were used to reconstruct the metabolic pathways and protein complexes of A. castellanii mitochondria, and were integrated with data from other characterized mitochondrial proteomes to augment our understanding of mitochondrial proteome evolution. Our results demonstrate the power of combining direct proteomic and bioinformatic approaches in the discovery of novel mitochondrial proteins, both nucleus-encoded and mitochondrion-encoded, and highlight the compositional complexity of the A. castellanii mitochondrial proteome, which rivals that of animals, fungi and plants. Copyright © 2014 Elsevier B.V. All rights reserved.
2012-01-01
Background Hawthorn is the common name of all plant species in the genus Crataegus, which belongs to the Rosaceae family. Crataegus are considered useful medicinal plants because of their high content of proanthocyanidins (PAs) and other related compounds. To improve PAs production in Crataegus tissues, the sequences of genes encoding PAs biosynthetic enzymes are required. Findings Different bioinformatics tools, including BLAST, multiple sequence alignment and alignment PCR analysis were used to design primers suitable for the amplification of DNA fragments from 10 candidate genes encoding enzymes involved in PAs biosynthesis in C. aronia. DNA sequencing results proved the utility of the designed primers. The primers were used successfully to amplify DNA fragments of different PAs biosynthesis genes in different Rosaceae plants. Conclusion To the best of our knowledge, this is the first use of the alignment PCR approach to isolate DNA sequences encoding PAs biosynthetic enzymes in Rosaceae plants. PMID:22883984
Zuiter, Afnan Saeid; Sawwan, Jammal; Al Abdallat, Ayed
2012-08-10
Hawthorn is the common name of all plant species in the genus Crataegus, which belongs to the Rosaceae family. Crataegus are considered useful medicinal plants because of their high content of proanthocyanidins (PAs) and other related compounds. To improve PAs production in Crataegus tissues, the sequences of genes encoding PAs biosynthetic enzymes are required. Different bioinformatics tools, including BLAST, multiple sequence alignment and alignment PCR analysis were used to design primers suitable for the amplification of DNA fragments from 10 candidate genes encoding enzymes involved in PAs biosynthesis in C. aronia. DNA sequencing results proved the utility of the designed primers. The primers were used successfully to amplify DNA fragments of different PAs biosynthesis genes in different Rosaceae plants. To the best of our knowledge, this is the first use of the alignment PCR approach to isolate DNA sequences encoding PAs biosynthetic enzymes in Rosaceae plants.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2014-02-25
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2008-04-01
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-12
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-23
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-05
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-06-06
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2009-05-05
The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2013-07-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2012-02-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2015-04-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening
2017-01-01
DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing. PMID:28199790
An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening.
MacConnell, Andrew B; Price, Alexander K; Paegel, Brian M
2017-03-13
DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing.
Foreman, Pamela [Los Altos, CA; Goedegebuur, Frits [Vlaardingen, NL; Van Solingen, Pieter [Naaldwijk, NL; Ward, Michael [San Francisco, CA
2012-06-19
Described herein are novel gene sequences isolated from Trichoderma reesei. Two genes encoding proteins comprising a cellulose binding domain, one encoding an arabionfuranosidase and one encoding an acetylxylanesterase are described. The sequences, CIP1 and CIP2, contain a cellulose binding domain. These proteins are especially useful in the textile and detergent industry and in pulp and paper industry.
Thermal and acid tolerant beta-xylosidases, genes encoding, related organisms, and methods
Thompson, David N [Idaho Falls, ID; Thompson, Vicki S [Idaho Falls, ID; Schaller, Kastli D [Ammon, ID; Apel, William A [Jackson, WY; Lacey, Jeffrey A [Idaho Falls, ID; Reed, David W [Idaho Falls, ID
2011-04-12
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof are provided. Further provided are methods of at least partially degrading xylotriose and/or xylobiose using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof.
van der Ley, P
1988-11-01
Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada
2002-07-01
Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2013-01-29
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 .beta.-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2012-10-02
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-02-28
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2008-03-18
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunn-Coleman, Nigel; Ward, Michael
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-04
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-04-14
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-25
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-08-11
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2007-09-25
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2008-04-01
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2011-12-06
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 .beta.-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2011-06-14
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Ward, Michael [San Francisco, CA
2009-09-01
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2012-10-30
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA
2008-01-22
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
The Purine-Utilizing Bacterium Clostridium acidurici 9a: A Genome-Guided Metabolic Reconsideration
Hartwich, Katrin; Poehlein, Anja; Daniel, Rolf
2012-01-01
Clostridium acidurici is an anaerobic, homoacetogenic bacterium, which is able to use purines such as uric acid as sole carbon, nitrogen, and energy source. Together with the two other known purinolytic clostridia C. cylindrosporum and C. purinilyticum, C. acidurici serves as a model organism for investigation of purine fermentation. Here, we present the first complete sequence and analysis of a genome derived from a purinolytic Clostridium. The genome of C. acidurici 9a consists of one chromosome (3,105,335 bp) and one small circular plasmid (2,913 bp). The lack of candidate genes encoding glycine reductase indicates that C. acidurici 9a uses the energetically less favorable glycine-serine-pyruvate pathway for glycine degradation. In accordance with the specialized lifestyle and the corresponding narrow substrate spectrum of C. acidurici 9a, the number of genes involved in carbohydrate transport and metabolism is significantly lower than in other clostridia such as C. acetobutylicum, C. saccharolyticum, and C. beijerinckii. The only amino acid that can be degraded by C. acidurici is glycine but growth on glycine only occurs in the presence of a fermentable purine. Nevertheless, the addition of glycine resulted in increased transcription levels of genes encoding enzymes involved in the glycine-serine-pyruvate pathway such as serine hydroxymethyltransferase and acetate kinase, whereas the transcription levels of formate dehydrogenase-encoding genes decreased. Sugars could not be utilized by C. acidurici but the full genetic repertoire for glycolysis was detected. In addition, genes encoding enzymes that mediate resistance against several antimicrobials and metals were identified. High resistance of C. acidurici towards bacitracin, acriflavine and azaleucine was experimentally confirmed. PMID:23240052
The purine-utilizing bacterium Clostridium acidurici 9a: a genome-guided metabolic reconsideration.
Hartwich, Katrin; Poehlein, Anja; Daniel, Rolf
2012-01-01
Clostridium acidurici is an anaerobic, homoacetogenic bacterium, which is able to use purines such as uric acid as sole carbon, nitrogen, and energy source. Together with the two other known purinolytic clostridia C. cylindrosporum and C. purinilyticum, C. acidurici serves as a model organism for investigation of purine fermentation. Here, we present the first complete sequence and analysis of a genome derived from a purinolytic Clostridium. The genome of C. acidurici 9a consists of one chromosome (3,105,335 bp) and one small circular plasmid (2,913 bp). The lack of candidate genes encoding glycine reductase indicates that C. acidurici 9a uses the energetically less favorable glycine-serine-pyruvate pathway for glycine degradation. In accordance with the specialized lifestyle and the corresponding narrow substrate spectrum of C. acidurici 9a, the number of genes involved in carbohydrate transport and metabolism is significantly lower than in other clostridia such as C. acetobutylicum, C. saccharolyticum, and C. beijerinckii. The only amino acid that can be degraded by C. acidurici is glycine but growth on glycine only occurs in the presence of a fermentable purine. Nevertheless, the addition of glycine resulted in increased transcription levels of genes encoding enzymes involved in the glycine-serine-pyruvate pathway such as serine hydroxymethyltransferase and acetate kinase, whereas the transcription levels of formate dehydrogenase-encoding genes decreased. Sugars could not be utilized by C. acidurici but the full genetic repertoire for glycolysis was detected. In addition, genes encoding enzymes that mediate resistance against several antimicrobials and metals were identified. High resistance of C. acidurici towards bacitracin, acriflavine and azaleucine was experimentally confirmed.
CIP1 polypeptides and their uses
Foreman, Pamela [Los Altos, CA; Van Solingen, Pieter [Naaldwijk, NL; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA
2011-04-12
Described herein are novel gene sequences isolated from Trichoderma reesei. Two genes encoding proteins comprising a cellulose binding domain, one encoding an arabionfuranosidase and one encoding an acetylxylanesterase are described. The sequences, CIP1 and CIP2, contain a cellulose binding domain. These proteins are especially useful in the textile and detergent industry and in pulp and paper industry.
Thompson, David N; Thompson, Vicki S; Schaller, Kastli D; Apel, William A; Reed, David W; Lacey, Jeffrey A
2013-04-30
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof are provided. Further provided are methods of at least partially degrading xylotriose, xylobiose, and/or arabinofuranose-substituted xylan using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof.
Concerted formation of macromolecular Suppressor–mutator transposition complexes
Raina, Ramesh; Schläppi, Michael; Karunanandaa, Balasulojini; Elhofy, Adam; Fedoroff, Nina
1998-01-01
Transposition of the maize Suppressor–mutator (Spm) transposon requires two element-encoded proteins, TnpA and TnpD. Although there are multiple TnpA binding sites near each element end, binding of TnpA to DNA is not cooperative, and the binding affinity is not markedly affected by the number of binding sites per DNA fragment. However, intermolecular complexes form cooperatively between DNA fragments with three or more TnpA binding sites. TnpD, itself not a sequence-specific DNA-binding protein, binds to TnpA and stabilizes the TnpA–DNA complex. The high redundancy of TnpA binding sites at both element ends and the protein–protein interactions between DNA-bound TnpA complexes and between these and TnpD imply a concerted transition of the element from a linear to a protein crosslinked transposition complex within a very narrow protein concentration range. PMID:9671711
[The ENCODE project and functional genomics studies].
Ding, Nan; Qu, Hongzhu; Fang, Xiangdong
2014-03-01
Upon the completion of the Human Genome Project, scientists have been trying to interpret the underlying genomic code for human biology. Since 2003, National Human Genome Research Institute (NHGRI) has invested nearly $0.3 billion and gathered over 440 scientists from more than 32 institutions in the United States, China, United Kingdom, Japan, Spain and Singapore to initiate the Encyclopedia of DNA Elements (ENCODE) project, aiming to identify and analyze all regulatory elements in the human genome. Taking advantage of the development of next-generation sequencing technologies and continuous improvement of experimental methods, ENCODE had made remarkable achievements: identified methylation and histone modification of DNA sequences and their regulatory effects on gene expression through altering chromatin structures, categorized binding sites of various transcription factors and constructed their regulatory networks, further revised and updated database for pseudogenes and non-coding RNA, and identified SNPs in regulatory sequences associated with diseases. These findings help to comprehensively understand information embedded in gene and genome sequences, the function of regulatory elements as well as the molecular mechanism underlying the transcriptional regulation by noncoding regions, and provide extensive data resource for life sciences, particularly for translational medicine. We re-viewed the contributions of high-throughput sequencing platform development and bioinformatical technology improve-ment to the ENCODE project, the association between epigenetics studies and the ENCODE project, and the major achievement of the ENCODE project. We also provided our prospective on the role of the ENCODE project in promoting the development of basic and clinical medicine.
Langner, Robert; Sternkopf, Melanie A; Kellermann, Tanja S; Grefkes, Christian; Kurth, Florian; Schneider, Frank; Zilles, Karl; Eickhoff, Simon B
2014-07-01
The neurobiological organization of action-oriented working memory is not well understood. To elucidate the neural correlates of translating visuo-spatial stimulus sequences into delayed (memory-guided) sequential actions, we measured brain activity using functional magnetic resonance imaging while participants encoded sequences of four to seven dots appearing on fingers of a left or right schematic hand. After variable delays, sequences were to be reproduced with the corresponding fingers. Recall became less accurate with longer sequences and was initiated faster after long delays. Across both hands, encoding and recall activated bilateral prefrontal, premotor, superior and inferior parietal regions as well as the basal ganglia, whereas hand-specific activity was found (albeit to a lesser degree during encoding) in contralateral premotor, sensorimotor, and superior parietal cortex. Activation differences after long versus short delays were restricted to motor-related regions, indicating that rehearsal during long delays might have facilitated the conversion of the memorandum into concrete motor programs at recall. Furthermore, basal ganglia activity during encoding selectively predicted correct recall. Taken together, the results suggest that to-be-reproduced visuo-spatial sequences are encoded as prospective action representations (motor intentions), possibly in addition to retrospective sensory codes. Overall, our study supports and extends multi-component models of working memory, highlighting the notion that sensory input can be coded in multiple ways depending on what the memorandum is to be used for. Copyright © 2013 Wiley Periodicals, Inc.
Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L
1986-01-01
Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. Images PMID:3458221
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.
MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M
2015-09-14
The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Isolated nucleic acids encoding antipathogenic polypeptides and uses thereof
Altier, Daniel J.; Crane, Virginia C.; Ellanskaya, Irina; Ellanskaya, Natalia; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K.; Schepers, Eric J.; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-04-20
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from fungal fermentation broths. Nucleic acids that encode the antipathogenic polypeptides are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Dziewit, Lukasz; Grzesiak, Jakub; Ciok, Anna; Nieckarz, Marta; Zdanowski, Marek K; Bartosik, Dariusz
2013-09-01
Pseudomonas sp. GLE121 (a psychrophilic Antarctic strain) carries three plasmids: pGLE121P1 (6899 bp), pGLE121P2 (8330 bp) and pGLE121P3 (39,583 bp). Plasmids pGLE121P1 and pGLE121P2 show significant sequence similarity to members of the IncP-9 and IncP-7 incompatibility groups, respectively, while the largest replicon, pGLE121P3, is highly related to plasmid pNCPPB880-40 of Pseudomonas syringae pathovar tomato NCPPB880. All three plasmids have a narrow host range, limited to members of the genus Pseudomonas. Plasmid pGLE121P3 encodes a conjugal transfer system, while pGLE121P1 carries only a putative MOB module, conserved in many mobilizable plasmids. Plasmid pGLE121P3 contains an additional load of genetic information, including a pair of genes with homology to the rulAB operon, responsible for ultraviolet radiation (UVR) tolerance. Given the increasing UV exposure in Antarctic regions, the expression of these genes is likely to be an important adaptive response. Copyright © 2013 Elsevier Inc. All rights reserved.
Rosa, Rafael Diego; Stoco, Patricia Hermes; Barracco, Margherita Anna
2008-11-01
Anti-lipopolysaccharide factors (ALFs) are antimicrobial peptides found in limulids and crustaceans that have a potent and broad range of antimicrobial activity. We report here the identification and molecular characterisation of new sequences encoding for ALFs in the haemocytes of the freshwater prawn Macrobrachium olfersi and also in two Brazilian penaeid species, Farfantepenaeus paulensis and Litopenaeus schmitti. All obtained sequences encoded for highly cationic peptides containing two conserved cysteine residues flanking a putative LPS-binding domain. They exhibited a significant amino acid similarity with crustacean and limulid ALF sequences, especially with those of penaeid shrimps. This is the first identification of ALF in a freshwater prawn.
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.
Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T
1996-10-31
Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1.), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked in the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Multiplexed Sequence Encoding: A Framework for DNA Communication.
Zakeri, Bijan; Carr, Peter A; Lu, Timothy K
2016-01-01
Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication-data encoding, data transfer & data extraction-and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system-Multiplexed Sequence Encoding (MuSE)-that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
Nishibuchi, M; Murakami, A; Arita, M; Jikuya, H; Takano, J; Honda, T; Miwatani, T
1989-01-01
We examined variations in the genes encoding heat-stable enterotoxin (ST) and heat-labile enterotoxin (LT) in 88 strains of Escherichia coli isolated from individuals with traveler's diarrhea to find suitable sequences for use as oligonucleotide probes. Four oligonucleotide probes of the gene encoding ST of human origin (STIb or STh), one oligonucleotide probe of the gene encoding ST of porcine origin (STIa or STp), and three oligonucleotide probes of the gene encoding LT of human origin (LTIh) were used in DNA colony hybridization tests. In 15 of 22 strains possessing the STh gene and 28 of 42 strains producing LT, the sequences of all regions tested were identical to the published sequences. One region in the STh gene examined with a 18-mer probe was relatively well conserved and was shown to be closely associated with the enterotoxicity of the E. coli strains in suckling mice. This oligonucleotide, however, hybridized with strains of Vibrio cholerae O1, V. parahaemolyticus, and Yersinia enterocolitica that gave negative results in the suckling mouse assay. PMID:2685027
Sequence heuristics to encode phase behaviour in intrinsically disordered protein polymers
Quiroz, Felipe García; Chilkoti, Ashutosh
2015-01-01
Proteins and synthetic polymers that undergo aqueous phase transitions mediate self-assembly in nature and in man-made material systems. Yet little is known about how the phase behaviour of a protein is encoded in its amino acid sequence. Here, by synthesizing intrinsically disordered, repeat proteins to test motifs that we hypothesized would encode phase behaviour, we show that the proteins can be designed to exhibit tunable lower or upper critical solution temperature (LCST and UCST, respectively) transitions in physiological solutions. We also show that mutation of key residues at the repeat level abolishes phase behaviour or encodes an orthogonal transition. Furthermore, we provide heuristics to identify, at the proteome level, proteins that might exhibit phase behaviour and to design novel protein polymers consisting of biologically active peptide repeats that exhibit LCST or UCST transitions. These findings set the foundation for the prediction and encoding of phase behaviour at the sequence level. PMID:26390327
Experimental demonstration of a flexible time-domain quantum channel.
Xing, Xingxing; Feizpour, Amir; Hayat, Alex; Steinberg, Aephraim M
2014-10-20
We present an experimental realization of a flexible quantum channel where the Hilbert space dimensionality can be controlled electronically. Using electro-optical modulators (EOM) and narrow-band optical filters, quantum information is encoded and decoded in the temporal degrees of freedom of photons from a long-coherence-time single-photon source. Our results demonstrate the feasibility of a generic scheme for encoding and transmitting multidimensional quantum information over the existing fiber-optical telecommunications infrastructure.
Human jagged polypeptide, encoding nucleic acids and methods of use
Li, Linheng; Hood, Leroy
2000-01-01
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Encoding and choice in the task span paradigm.
Reiman, Kaitlin M; Weaver, Starla M; Arrington, Catherine M
2015-03-01
Cognitive control during sequences of planned behaviors requires both plan-level processes such as generating, maintaining, and monitoring the plan, as well as task-level processes such as selecting, establishing and implementing specific task sets. The task span paradigm (Logan in J Exp Psychol Gen 133:218-236, 2004) combines two common cognitive control paradigms, task switching and working memory span, to investigate the integration of plan-level and task-level processes during control of sequential behavior. The current study expands past task span research to include measures of encoding processes and choice behavior with volitional sequence generation, using the standard task span as well as a novel voluntary task span paradigm. In two experiments, we consider how sequence complexity, defined separately for plan-level and task-level complexity, influences sequence encoding (Experiment 1), sequence choice (Experiment 2), sequence memory, and task performance of planned sequences of action. Results indicate that participants were sensitive to sequence complexity, but that different aspects of behavior are most strongly influenced by different types of complexity. Hierarchical complexity at the plan level best predicts voluntary sequence generation and memory; while switch frequency at the task level best predicts encoding of externally defined sequences and task performance. Furthermore, performance RTs were similar for externally and internally defined plans, whereas memory was improved for internally defined sequences. Finally, participants demonstrated a significant sequence choice bias in the voluntary task span. Consistent with past research on choice behavior, volitional selection of plans was markedly influenced by both the ease of memory and performance.
The bglA Gene of Aspergillus kawachii Encodes Both Extracellular and Cell Wall-Bound β-Glucosidases
Iwashita, Kazuhiro; Nagahara, Tatsuya; Kimura, Hitoshi; Takano, Makoto; Shimoi, Hitoshi; Ito, Kiyoshi
1999-01-01
We cloned the genomic DNA and cDNA of bglA, which encodes β-glucosidase in Aspergillus kawachii, based on a partial amino acid sequence of purified cell wall-bound β-glucosidase CB-1. The nucleotide sequence of the cloned bglA gene revealed a 2,933-bp open reading frame with six introns that encodes an 860-amino-acid protein. Based on the deduced amino acid sequence, we concluded that the bglA gene encodes cell wall-bound β-glucosidase CB-1. The amino acid sequence exhibited high levels of homology with the amino acid sequences of fungal β-glucosidases classified in subfamily B. We expressed the bglA cDNA in Saccharomyces cerevisiae and detected the recombinant β-glucosidase in the periplasm fraction of the recombinant yeast. A. kawachii can produce two extracellular β-glucosidases (EX-1 and EX-2) in addition to the cell wall-bound β-glucosidase. A. kawachii in which the bglA gene was disrupted produced none of the three β-glucosidases, as determined by enzyme assays and a Western blot analysis. Thus, we concluded that the bglA gene encodes both extracellular and cell wall-bound β-glucosidases in A. kawachii. PMID:10584016
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.
Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J
1999-01-01
Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M
1994-01-01
A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two introns encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggest that ORF-1a and ORF-1 are two introns of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence. The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.
Wei, Chunhua; Chen, Xiner; Wang, Zhongyuan; Liu, Qiyan; Li, Hao; Zhang, Yong; Ma, Jianxiang; Yang, Jianqiang
2017-01-01
The lobed leaf character is a unique morphologic trait in crops, featuring many potential advantages for agricultural productivity. Although the majority of watermelon varieties feature lobed leaves, the genetic factors responsible for lobed leaf formation remain elusive. The F2:3 leaf shape segregating population offers the opportunity to study the underlying mechanism of lobed leaf formation in watermelon. Genetic analysis revealed that a single dominant allele (designated ClLL1) controlled the lobed leaf trait. A large-sized F3:4 population derived from F2:3 individuals was used to map ClLL1. A total of 5,966 reliable SNPs and indels were identified genome-wide via a combination of BSA and RNA-seq. Using the validated SNP and indel markers, the location of ClLL1 was narrowed down to a 127.6-kb region between markers W08314 and W07061, containing 23 putative ORFs. Expression analysis via qRT-PCR revealed differential expression patterns (fold-changes above 2-fold or below 0.5-fold) of three ORFs (ORF3, ORF11, and ORF18) between lobed and non-lobed leaf plants. Based on gene annotation and expression analysis, ORF18 (encoding an uncharacterized protein) and ORF22 (encoding a homeobox-leucine zipper-like protein) were considered as most likely candidate genes. Furthermore, sequence analysis revealed no polymorphisms in cDNA sequences of ORF18; however, two notable deletions were identified in ORF22. This study is the first report to map a leaf shape gene in watermelon and will facilitate cloning and functional characterization of ClLL1 in future studies. PMID:28704497
Wei, Chunhua; Chen, Xiner; Wang, Zhongyuan; Liu, Qiyan; Li, Hao; Zhang, Yong; Ma, Jianxiang; Yang, Jianqiang; Zhang, Xian
2017-01-01
The lobed leaf character is a unique morphologic trait in crops, featuring many potential advantages for agricultural productivity. Although the majority of watermelon varieties feature lobed leaves, the genetic factors responsible for lobed leaf formation remain elusive. The F2:3 leaf shape segregating population offers the opportunity to study the underlying mechanism of lobed leaf formation in watermelon. Genetic analysis revealed that a single dominant allele (designated ClLL1) controlled the lobed leaf trait. A large-sized F3:4 population derived from F2:3 individuals was used to map ClLL1. A total of 5,966 reliable SNPs and indels were identified genome-wide via a combination of BSA and RNA-seq. Using the validated SNP and indel markers, the location of ClLL1 was narrowed down to a 127.6-kb region between markers W08314 and W07061, containing 23 putative ORFs. Expression analysis via qRT-PCR revealed differential expression patterns (fold-changes above 2-fold or below 0.5-fold) of three ORFs (ORF3, ORF11, and ORF18) between lobed and non-lobed leaf plants. Based on gene annotation and expression analysis, ORF18 (encoding an uncharacterized protein) and ORF22 (encoding a homeobox-leucine zipper-like protein) were considered as most likely candidate genes. Furthermore, sequence analysis revealed no polymorphisms in cDNA sequences of ORF18; however, two notable deletions were identified in ORF22. This study is the first report to map a leaf shape gene in watermelon and will facilitate cloning and functional characterization of ClLL1 in future studies.
Doublet, Benoît; Robin, Frédéric; Casin, Isabelle; Fabre, Laëtitia; Le Fleche, Anne; Bonnet, Richard; Weill, François-Xavier
2010-01-01
Pseudomonas luteola (formerly classified as CDC group Ve-1 and named Chryseomonas luteola) is an unusual pathogen implicated in rare but serious infections in humans. A novel β-lactamase gene, blaLUT-1, was cloned from the whole-cell DNA of the P. luteola clinical isolate LAM, which had a weak narrow-spectrum β-lactam-resistant phenotype, and expressed in Escherichia coli. This gene encoded LUT-1, a 296-amino-acid Ambler class A β-lactamase with a pI of 6 and a theoretical molecular mass of 28.9 kDa. The catalytic efficiency of this enzyme was higher for cephalothin, cefuroxime, and cefotaxime than for penicillins. It was found to be 49% to 59% identical to other Ambler class A β-lactamases from Burkholderia sp. (PenA to PenL), Ralstonia eutropha (REUT), Citrobacter sedlakii (SED-1), Serratia fonticola (FONA and SFC-1), Klebsiella sp. (KPC and OXY), and CTX-M extended-spectrum β-lactamases. No gene homologous to the regulatory ampR genes of class A β-lactamases was found in the vicinity of the blaLUT-1 gene. The entire blaLUT-1 coding region was amplified by PCR and sequenced in five other genetically unrelated P. luteola strains (including the P. luteola type strain). A new variant of blaLUT-1 was found for each strain. These genes (named blaLUT-2 to blaLUT-6) had nucleotide sequences 98.1 to 99.5% identical to that of blaLUT-1 and differing from this gene by two to four nonsynonymous single nucleotide polymorphisms. The blaLUT gene was located on a 700- to 800-kb chromosomal I-CeuI fragment, the precise size of this fragment depending on the P. luteola strain. PMID:19884377
The effects of alcohol intoxication on attention and memory for visual scenes.
Harvey, Alistair J; Kneller, Wendy; Campbell, Alison C
2013-01-01
This study tests the claim that alcohol intoxication narrows the focus of visual attention on to the more salient features of a visual scene. A group of alcohol intoxicated and sober participants had their eye movements recorded as they encoded a photographic image featuring a central event of either high or low salience. All participants then recalled the details of the image the following day when sober. We sought to determine whether the alcohol group would pay less attention to the peripheral features of the encoded scene than their sober counterparts, whether this effect of attentional narrowing was stronger for the high-salience event than for the low-salience event, and whether it would lead to a corresponding deficit in peripheral recall. Alcohol was found to narrow the focus of foveal attention to the central features of both images but did not facilitate recall from this region. It also reduced the overall amount of information accurately recalled from each scene. These findings demonstrate that the concept of alcohol myopia originally posited to explain the social consequences of intoxication (Steele & Josephs, 1990) may be extended to explain the relative neglect of peripheral information during the processing of visual scenes.
Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T
1993-01-01
Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043
Specific minor groove solvation is a crucial determinant of DNA binding site recognition
Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.
2014-01-01
The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
The cDNA sequence of a neutral horseradish peroxidase.
Bartonek-Roxå, E; Eriksson, H; Mattiasson, B
1991-02-16
A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids which includes a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. Compared to the earlier described HRP-C this is three glycosylation sites less. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight of several thousands less than the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.
Nucleic acids encoding antifungal polypeptides and uses thereof
Altier, Daniel J.; Ellanskaya, I. A.; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-11-02
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include an amino acid sequence, and variants and fragments thereof, for an antipathogenic polypeptide that was isolated from a fungal fermentation broth. Nucleic acid molecules that encode the antipathogenic polypeptides of the invention, and antipathogenic domains thereof, are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Nucleic Acid Encoding A Lectin-Derived Progenitor Cell Preservation Factor
Colucci, M. Gabriella; Chrispeels, Maarten J.; Moore, Jeffrey G.
2001-10-30
The invention relates to an isolated nucleic acid molecule that encodes a protein that is effective to preserve progenitor cells, such as hematopoietic progenitor cells. The nucleic acid comprises a sequence defined by SEQ ID NO:1, a homolog thereof, or a fragment thereof. The encoded protein has an amino acid sequence that comprises a sequence defined by SEQ ID NO:2, a homolog thereof, or a fragment thereof that contains an amino acid sequence TNNVLQVT. Methods of using the encoded protein for preserving progenitor cells in vitro, ex vivo, and in vivo are also described. The invention, therefore, include methods such as myeloablation therapies for cancer treatment wherein myeloid reconstitution is facilitated by means of the specified protein. Other therapeutic utilities are also enabled through the invention, for example, expanding progenitor cell populations ex vivo to increase chances of engraftation, improving conditions for transporting and storing progenitor cells, and facilitating gene therapy to treat and cure a broad range of life-threatening hematologic diseases.
Multiplexed Sequence Encoding: A Framework for DNA Communication
Zakeri, Bijan; Carr, Peter A.; Lu, Timothy K.
2016-01-01
Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication—data encoding, data transfer & data extraction—and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system—Multiplexed Sequence Encoding (MuSE)—that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA. PMID:27050646
Methods and materials relating to IMPDH and GMP production
Collart, Frank R.; Huberman, Eliezer
1997-01-01
Disclosed are purified and isolated DNA sequences encoding eukaryotic proteins possessing biological properties of inosine 5'-monophosphate dehydrogenase ("IMPDH"). Illustratively, mammalian (e.g., human) IMPDH-encoding DNA sequences are useful in transformation or transfection of host cells for the large scale recombinant production of the enzymatically active expression products and/or products (e.g., GMP) resulting from IMPDH catalyzed synthesis in cells. Vectors including IMPDH-encoding DNA sequences are useful in gene amplification procedures. Recombinant proteins and synthetic peptides provided by the invention are useful as immunological reagents and in the preparation of antibodies (including polyclonal and monoclonal antibodies) for quantitative detection of IMPDH.
Altier, Daniel J.; Dahlbacka, Glen; Ellanskaya, legal representative, Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser; Ellanskaya, deceased, Irina
2007-12-11
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J.; Dahlbacka, Glen; Elleskaya, Irina; Ellanskaya, legal representative; Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-08-10
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Waukee, IA; Dahlbacka, Glen [Oakland, CA; Elleskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, IA; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2011-04-12
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Granger, IA; Dahlbacka, Glen [Oakland, CA; Ellanskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, TX; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2012-04-03
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng
2017-10-01
The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.
Plasmids encoding therapeutic agents
Keener, William K [Idaho Falls, ID
2007-08-07
Plasmids encoding anti-HIV and anti-anthrax therapeutic agents are disclosed. Plasmid pWKK-500 encodes a fusion protein containing DP178 as a targeting moiety, the ricin A chain, an HIV protease cleavable linker, and a truncated ricin B chain. N-terminal extensions of the fusion protein include the maltose binding protein and a Factor Xa protease site. C-terminal extensions include a hydrophobic linker, an L domain motif peptide, a KDEL ER retention signal, another Factor Xa protease site, an out-of-frame buforin II coding sequence, the lacZ.alpha. peptide, and a polyhistidine tag. More than twenty derivatives of plasmid pWKK-500 are described. Plasmids pWKK-700 and pWKK-800 are similar to pWKK-500 wherein the DP178-encoding sequence is substituted by RANTES- and SDF-1-encoding sequences, respectively. Plasmid pWKK-900 is similar to pWKK-500 wherein the HIV protease cleavable linker is substituted by a lethal factor (LF) peptide-cleavable linker.
Walker, M D; Park, C W; Rosen, A; Aronheim, A
1990-01-01
Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Motivation Matters: Differing Effects of Pre-Goal and Post-Goal Emotions on Attention and Memory
Kaplan, Robin L.; Van Damme, Ilse; Levine, Linda J.
2012-01-01
People often show enhanced memory for information that is central to emotional events and impaired memory for peripheral details. The intensity of arousal elicited by an emotional event is commonly held to be the mechanism underlying memory narrowing, with the implication that all sources of emotional arousal should have comparable effects. Discrete emotions differ in their effects on memory, however, with some emotions broadening rather than narrowing the range of information attended to and remembered. Thus, features of emotion other than arousal appear to play a critical role in memory narrowing. We review theory and research on emotional memory narrowing and argue that motivation matters. Recent evidence suggests that emotions experienced prior to goal attainment or loss lead to memory narrowing whereas emotions experienced after goal attainment or loss broaden the range of information encoded in memory. The motivational component of emotion is an important but understudied feature that can help to clarify the conditions under which emotions enhance and impair attention and memory. PMID:23162490
Motivation matters: differing effects of pre-goal and post-goal emotions on attention and memory.
Kaplan, Robin L; Van Damme, Ilse; Levine, Linda J
2012-01-01
People often show enhanced memory for information that is central to emotional events and impaired memory for peripheral details. The intensity of arousal elicited by an emotional event is commonly held to be the mechanism underlying memory narrowing, with the implication that all sources of emotional arousal should have comparable effects. Discrete emotions differ in their effects on memory, however, with some emotions broadening rather than narrowing the range of information attended to and remembered. Thus, features of emotion other than arousal appear to play a critical role in memory narrowing. We review theory and research on emotional memory narrowing and argue that motivation matters. Recent evidence suggests that emotions experienced prior to goal attainment or loss lead to memory narrowing whereas emotions experienced after goal attainment or loss broaden the range of information encoded in memory. The motivational component of emotion is an important but understudied feature that can help to clarify the conditions under which emotions enhance and impair attention and memory.
Analysis of a MULE-cyanide hydratase gene fusion in Verticillium dahliae
USDA-ARS?s Scientific Manuscript database
The genome of the phytopathogenic fungus Verticillium dahliae encodes numerous Class II “cut-and-paste” transposable elements, including those of a small group of MULE transposons. We have previously identified a fusion event between a MULE transposon sequence and sequence encoding a cyanide hydrata...
Genomes: At the edge of chaos with maximum information capacity
NASA Astrophysics Data System (ADS)
Kong, Sing-Guan; Chen, Hong-Da; Torda, Andrew; Lee, H. C.
2016-12-01
We propose an order index, ϕ, which quantifies the notion of “life at the edge of chaos” when applied to genome sequences. It maps genomes to a number from 0 (random and of infinite length) to 1 (fully ordered) and applies regardless of sequence length and base composition. The 786 complete genomic sequences in GenBank were found to have ϕ values in a very narrow range, 0.037 ± 0.027. We show this implies that genomes are halfway towards being completely random, namely, at the edge of chaos. We argue that this narrow range represents the neighborhood of a fixed-point in the space of sequences, and genomes are driven there by the dynamics of a robust, predominantly neutral evolution process.
Cloning of an avilamycin biosynthetic gene cluster from Streptomyces viridochromogenes Tü57.
Gaisser, S; Trefzer, A; Stockert, S; Kirschning, A; Bechthold, A
1997-01-01
A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins. PMID:9335272
Not all order memory is equal: Test demands reveal dissociations in memory for sequence information.
Jonker, Tanya R; MacLeod, Colin M
2017-02-01
Remembering the order of a sequence of events is a fundamental feature of episodic memory. Indeed, a number of formal models represent temporal context as part of the memory system, and memory for order has been researched extensively. Yet, the nature of the code(s) underlying sequence memory is still relatively unknown. Across 4 experiments that manipulated encoding task, we found evidence for 3 dissociable facets of order memory. Experiment 1 introduced a test requiring a judgment of which of 2 alternatives had immediately followed a word during encoding. This measure revealed better retention of interitem associations following relational encoding (silent reading) than relatively item-specific encoding (judging referent size), a pattern consistent with that observed in previous research using order reconstruction tests. In sharp contrast, Experiment 2 demonstrated the reverse pattern: Memory for the studied order of 2 sequentially presented items was actually better following item-specific encoding than following relational encoding. Experiment 3 reproduced this dissociation in a single experiment using both tests. Experiment 4 extended these findings by further dissociating the roles of relational encoding and item strength in the 2 tests. Taken together, these results indicate that memory for event sequence is influenced by (a) interitem associations, (b) the emphasized directionality of an association, and (c) an item's strength independent of other items. Memory for order is more complicated than has been portrayed in theories of memory and its nuances should be carefully considered when designing tests and models of temporal and relational memory. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
DNA-encoded chemistry: enabling the deeper sampling of chemical space.
Goodnow, Robert A; Dumelin, Christoph E; Keefe, Anthony D
2017-02-01
DNA-encoded chemical library technologies are increasingly being adopted in drug discovery for hit and lead generation. DNA-encoded chemistry enables the exploration of chemical spaces four to five orders of magnitude more deeply than is achievable by traditional high-throughput screening methods. Operation of this technology requires developing a range of capabilities including aqueous synthetic chemistry, building block acquisition, oligonucleotide conjugation, large-scale molecular biological transformations, selection methodologies, PCR, sequencing, sequence data analysis and the analysis of large chemistry spaces. This Review provides an overview of the development and applications of DNA-encoded chemistry, highlighting the challenges and future directions for the use of this technology.
The ENCODE Project at UC Santa Cruz.
Thomas, Daryl J; Rosenbloom, Kate R; Clawson, Hiram; Hinrichs, Angie S; Trumbower, Heather; Raney, Brian J; Karolchik, Donna; Barber, Galt P; Harte, Rachel A; Hillman-Jackson, Jennifer; Kuhn, Robert M; Rhead, Brooke L; Smith, Kayla E; Thakkapallayil, Archana; Zweig, Ann S; Haussler, David; Kent, W James
2007-01-01
The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.
Aliotta, Eric; Moulin, Kévin; Ennis, Daniel B
2018-02-01
To design and evaluate eddy current-nulled convex optimized diffusion encoding (EN-CODE) gradient waveforms for efficient diffusion tensor imaging (DTI) that is free of eddy current-induced image distortions. The EN-CODE framework was used to generate diffusion-encoding waveforms that are eddy current-compensated. The EN-CODE DTI waveform was compared with the existing eddy current-nulled twice refocused spin echo (TRSE) sequence as well as monopolar (MONO) and non-eddy current-compensated CODE in terms of echo time (TE) and image distortions. Comparisons were made in simulations, phantom experiments, and neuro imaging in 10 healthy volunteers. The EN-CODE sequence achieved eddy current compensation with a significantly shorter TE than TRSE (78 versus 96 ms) and a slightly shorter TE than MONO (78 versus 80 ms). Intravoxel signal variance was lower in phantoms with EN-CODE than with MONO (13.6 ± 11.6 versus 37.4 ± 25.8) and not different from TRSE (15.1 ± 11.6), indicating good robustness to eddy current-induced image distortions. Mean fractional anisotropy values in brain edges were also significantly lower with EN-CODE than with MONO (0.16 ± 0.01 versus 0.24 ± 0.02, P < 1 x 10 -5 ) and not different from TRSE (0.16 ± 0.01 versus 0.16 ± 0.01, P = nonsignificant). The EN-CODE sequence eliminated eddy current-induced image distortions in DTI with a TE comparable to MONO and substantially shorter than TRSE. Magn Reson Med 79:663-672, 2018. © 2017 International Society for Magnetic Resonance in Medicine. © 2017 International Society for Magnetic Resonance in Medicine.
NASA Astrophysics Data System (ADS)
Bhooplapur, Sharad; Akbulut, Mehmetkan; Quinlan, Franklyn; Delfyett, Peter J.
2010-04-01
A novel scheme for recognition of electronic bit-sequences is demonstrated. Two electronic bit-sequences that are to be compared are each mapped to a unique code from a set of Walsh-Hadamard codes. The codes are then encoded in parallel on the spectral phase of the frequency comb lines from a frequency-stabilized mode-locked semiconductor laser. Phase encoding is achieved by using two independent spatial light modulators based on liquid crystal arrays. Encoded pulses are compared using interferometric pulse detection and differential balanced photodetection. Orthogonal codes eight bits long are compared, and matched codes are successfully distinguished from mismatched codes with very low error rates, of around 10-18. This technique has potential for high-speed, high accuracy recognition of bit-sequences, with applications in keyword searches and internet protocol packet routing.
Koike-Takeshita, A; Koyama, T; Obata, S; Ogura, K
1995-08-04
The genes encoding two dissociable components essential for Bacillus stearothermophilus heptaprenyl diphosphate synthase (all-trans-hexparenyl-diphosphate:isopentenyl-diphosphate hexaprenyl-trans-transferase, EC 2.5.1.30) were cloned, and their nucleotide sequences were determined. Sequence analyses revealed the presence of three open reading frames within 2,350 base pairs, designated as ORF-1, ORF-2, and ORF-3 in order of nucleotide sequence, which encode proteins of 220, 234, and 323 amino acids, respectively. Deletion experiments have shown that expression of the enzymatic activity requires the presence of ORF-1 and ORF-3, but ORF-2 is not essential. As a result, this enzyme was proved genetically to consist of two different protein compounds with molecular masses of 25 kDa (Component I) and 36 kDa (Component II), encoded by two of the three tandem genes. The protein encoded by ORF-1 has no similarity to any protein so far registered. However, the protein encoded by ORF-3 shows a 32% similarity to the farnesyl diphosphate synthase of the same bacterium and has seven highly conserved regions that have been shown typical in prenyltransferases (Koyama, T., Obata, S., Osabe, M., Takeshita, A., Yokoyama, K., Uchida, M., Nishino, T., and Ogura, K. (1993) J. Biochem. (Tokyo) 113, 355-363).
ADS genes for reducing saturated fatty acid levels in seed oils
Heilmann, Ingo H; Shanklin, John
2014-03-18
The present invention relates to enzymes involved in lipid metabolism. In particular, the present invention provides coding sequences for Arabidopsis Desaturases (ADS), the encoded ADS polypeptides, and methods for using the sequences and encoded polypeptides, where such methods include decreasing and increasing saturated fatty acid content in plant seed oils.
The DNA region encoding biphenyl dioxygenase, the first enzyme in the biphenyl-polychlorinated biphenyl degradation pathway of Pseudomonas species strain LB400, was sequenced. ix open reading frames were identified, four of which are, homologous to the components of toluene dioxy...
ADS genes for reducing saturated fatty acid levels in seed oils
Heilmann, Ingo H.; Shanklin, John
2010-02-02
The present invention relates to enzymes involved in lipid metabolism. In particular, the present invention provides coding sequences for Arabidopsis Desaturases (ADS), the encoded ADS polypeptides, and methods for using the sequences and encoded polypeptides, where such methods include decreasing and increasing saturated fatty acid content in plant seed oils.
Salinas, Alejandro; Vega, Marcela; Lienqueo, María Elena; Garcia, Alejandro; Carmona, Rene; Salazar, Oriana
2011-12-10
Total cDNA isolated from cellulolytic fungi cultured in cellulose was examined for the presence of sequences encoding for endoglucanases. Novel sequences encoding for glycoside hydrolases (GHs) were identified in Fusarium oxysporum, Ganoderma applanatum and Trametes versicolor. The cDNA encoding for partial sequences of GH family 61 cellulases from F. oxysporum and G. applanatum shares 58 and 68% identity with endoglucanases from Glomerella graminicola and Laccaria bicolor, respectively. A new GH family 5 endoglucanase from T. versicolor was also identified. The cDNA encoding for the mature protein was completely sequenced. This enzyme shares 96% identity with Trametes hirsuta endoglucanase and 22% with Trichoderma reesei endoglucanase II (EGII). The enzyme, named TvEG, has N-terminal family 1 carbohydrate binding module (CBM1). The full length cDNA was cloned into the pPICZαB vector and expressed as an active, extracellular enzyme in the methylotrophic yeast Pichia pastoris. Preliminary studies suggest that T. versicolor could be useful for lignocellulose degradation. Copyright © 2011 Elsevier Inc. All rights reserved.
Rademaker, Jan L. W.; Herbet, Hélène; Starrenburg, Marjo J. C.; Naser, Sabri M.; Gevers, Dirk; Kelly, William J.; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E. T.
2007-01-01
The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene. PMID:17890345
Rademaker, Jan L W; Herbet, Hélène; Starrenburg, Marjo J C; Naser, Sabri M; Gevers, Dirk; Kelly, William J; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E T
2007-11-01
The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)(5)-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene.
Cecconi, Massimiliano; Parodi, Maria I.; Formisano, Francesco; Spirito, Paolo; Autore, Camillo; Musumeci, Maria B.; Favale, Stefano; Forleo, Cinzia; Rapezzi, Claudio; Biagini, Elena; Davì, Sabrina; Canepa, Elisabetta; Pennese, Loredana; Castagnetta, Mauro; Degiorgio, Dario; Coviello, Domenico A.
2016-01-01
Hypertrophic cardiomyopathy (HCM) is mainly associated with myosin, heavy chain 7 (MYH7) and myosin binding protein C, cardiac (MYBPC3) mutations. In order to better explain the clinical and genetic heterogeneity in HCM patients, in this study, we implemented a target-next generation sequencing (NGS) assay. An Ion AmpliSeq™ Custom Panel for the enrichment of 19 genes, of which 9 of these did not encode thick/intermediate and thin myofilament (TTm) proteins and, among them, 3 responsible of HCM phenocopy, was created. Ninety-two DNA samples were analyzed by the Ion Personal Genome Machine: 73 DNA samples (training set), previously genotyped in some of the genes by Sanger sequencing, were used to optimize the NGS strategy, whereas 19 DNA samples (discovery set) allowed the evaluation of NGS performance. In the training set, we identified 72 out of 73 expected mutations and 15 additional mutations: the molecular diagnosis was achieved in one patient with a previously wild-type status and the pre-excitation syndrome was explained in another. In the discovery set, we identified 20 mutations, 5 of which were in genes encoding non-TTm proteins, increasing the diagnostic yield by approximately 20%: a single mutation in genes encoding non-TTm proteins was identified in 2 out of 3 borderline HCM patients, whereas co-occuring mutations in genes encoding TTm and galactosidase alpha (GLA) altered proteins were characterized in a male with HCM and multiorgan dysfunction. Our combined targeted NGS-Sanger sequencing-based strategy allowed the molecular diagnosis of HCM with greater efficiency than using the conventional (Sanger) sequencing alone. Mutant alleles encoding non-TTm proteins may aid in the complete understanding of the genetic and phenotypic heterogeneity of HCM: co-occuring mutations of genes encoding TTm and non-TTm proteins could explain the wide variability of the HCM phenotype, whereas mutations in genes encoding only the non-TTm proteins are identifiable in patients with a milder HCM status. PMID:27600940
Murphy, James; Klumpp, Jochen; Mahony, Jennifer; O'Connell-Motherway, Mary; Nauta, Arjen; van Sinderen, Douwe
2014-10-01
So-called 936-type phages are among the most frequently isolated phages in dairy facilities utilising Lactococcus lactis starter cultures. Despite extensive efforts to control phage proliferation and decades of research, these phages continue to negatively impact cheese production in terms of the final product quality and consequently, monetary return. Whole genome sequencing and in silico analysis of three 936-type phage genomes identified several putative (orphan) methyltransferase (MTase)-encoding genes located within the packaging and replication regions of the genome. Utilising SMRT sequencing, methylome analysis was performed on all three phages, allowing the identification of adenine modifications consistent with N-6 methyladenine sequence methylation, which in some cases could be attributed to these phage-encoded MTases. Heterologous gene expression revealed that M.Phi145I/M.Phi93I and M.Phi93DAM, encoded by genes located within the packaging module, provide protection against the restriction enzymes HphI and DpnII, respectively, representing the first functional MTases identified in members of 936-type phages. SMRT sequencing technology enabled the identification of the target motifs of MTases encoded by the genomes of three lytic 936-type phages and these MTases represent the first functional MTases identified in this species of phage. The presence of these MTase-encoding genes on 936-type phage genomes is assumed to represent an adaptive response to circumvent host encoded restriction-modification systems thereby increasing the fitness of the phages in a dynamic dairy environment.
Human AZU-1 gene, variants thereof and expressed gene products
Chen, Huei-Mei; Bissell, Mina
2004-06-22
A human AZU-1 gene, mutants, variants and fragments thereof. Protein products encoded by the AZU-1 gene and homologs encoded by the variants of AZU-1 gene acting as tumor suppressors or markers of malignancy progression and tumorigenicity reversion. Identification, isolation and characterization of AZU-1 and AZU-2 genes localized to a tumor suppressive locus at chromosome 10q26, highly expressed in nonmalignant and premalignant cells derived from a human breast tumor progression model. A recombinant full length protein sequences encoded by the AZU-1 gene and nucleotide sequences of AZU-1 and AZU-2 genes and variant and fragments thereof. Monoclonal or polyclonal antibodies specific to AZU-1, AZU-2 encoded protein and to AZU-1, or AZU-2 encoded protein homologs.
Yassin, Atteyet F; Langenberg, Stefan; Huntemann, Marcel; Clum, Alicia; Pillay, Manoj; Palaniappan, Krishnaveni; Varghese, Neha; Mikhailova, Natalia; Mukherjee, Supratim; Reddy, T B K; Daum, Chris; Shapiro, Nicole; Ivanova, Natalia; Woyke, Tanja; Kyrpides, Nikos C
2017-01-01
The permanent draft genome sequence of Actinotignum schaalii DSM 15541T is presented. The annotated genome includes 2,130,987 bp, with 1777 protein-coding and 58 rRNA-coding genes. Genome sequence analysis revealed absence of genes encoding for: components of the PTS systems, enzymes of the TCA cycle, glyoxylate shunt and gluconeogensis. Genomic data revealed that A. schaalii is able to oxidize carbohydrates via glycolysis, the nonoxidative pentose phosphate and the Entner-Doudoroff pathways. Besides, the genome harbors genes encoding for enzymes involved in the conversion of pyruvate to lactate, acetate and ethanol, which are found to be the end products of carbohydrate fermentation. The genome contained the gene encoding Type I fatty acid synthase required for de novo FAS biosynthesis. The plsY and plsX genes encoding the acyltransferases necessary for phosphatidic acid biosynthesis were absent from the genome. The genome harbors genes encoding enzymes responsible for isoprene biosynthesis via the mevalonate (MVA) pathway. Genes encoding enzymes that confer resistance to reactive oxygen species (ROS) were identified. In addition, A. schaalii harbors genes that protect the genome against viral infections. These include restriction-modification (RM) systems, type II toxin-antitoxin (TA), CRISPR-Cas and abortive infection system. A. schaalii genome also encodes several virulence factors that contribute to adhesion and internalization of this pathogen such as the tad genes encoding proteins required for pili assembly, the nanI gene encoding exo-alpha-sialidase, genes encoding heat shock proteins and genes encoding type VII secretion system. These features are consistent with anaerobic and pathogenic lifestyles. Finally, resistance to ciprofloxacin occurs by mutation in chromosomal genes that encode the subunits of DNA-gyrase (GyrA) and topisomerase IV (ParC) enzymes, while resistant to metronidazole was due to the frxA gene, which encodes NADPH-flavin oxidoreductase.
Li, Ruichao; Xie, Miaomiao; Dong, Ning; Lin, Dachuan; Yang, Xuemei; Wong, Marcus Ho Yin; Chan, Edward Wai-Chi; Chen, Sheng
2018-03-01
Multidrug resistance (MDR)-encoding plasmids are considered major molecular vehicles responsible for transmission of antibiotic resistance genes among bacteria of the same or different species. Delineating the complete sequences of such plasmids could provide valuable insight into the evolution and transmission mechanisms underlying bacterial antibiotic resistance development. However, due to the presence of multiple repeats of mobile elements, complete sequencing of MDR plasmids remains technically complicated, expensive, and time-consuming. Here, we demonstrate a rapid and efficient approach to obtaining multiple MDR plasmid sequences through the use of the MinION nanopore sequencing platform, which is incorporated in a portable device. By assembling the long sequencing reads generated by a single MinION run according to a rapid barcoding sequencing protocol, we obtained the complete sequences of 20 plasmids harbored by multiple bacterial strains. Importantly, single long reads covering a plasmid end-to-end were recorded, indicating that de novo assembly may be unnecessary if the single reads exhibit high accuracy. This workflow represents a convenient and cost-effective approach for systematic assessment of MDR plasmids responsible for treatment failure of bacterial infections, offering the opportunity to perform detailed molecular epidemiological studies to probe the evolutionary and transmission mechanisms of MDR-encoding elements.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peters, J.; Peters, M.; Lottspeich, F.
1987-11-01
The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less
Multicore-based 3D-DWT video encoder
NASA Astrophysics Data System (ADS)
Galiano, Vicente; López-Granado, Otoniel; Malumbres, Manuel P.; Migallón, Hector
2013-12-01
Three-dimensional wavelet transform (3D-DWT) encoders are good candidates for applications like professional video editing, video surveillance, multi-spectral satellite imaging, etc. where a frame must be reconstructed as quickly as possible. In this paper, we present a new 3D-DWT video encoder based on a fast run-length coding engine. Furthermore, we present several multicore optimizations to speed-up the 3D-DWT computation. An exhaustive evaluation of the proposed encoder (3D-GOP-RL) has been performed, and we have compared the evaluation results with other video encoders in terms of rate/distortion (R/D), coding/decoding delay, and memory consumption. Results show that the proposed encoder obtains good R/D results for high-resolution video sequences with nearly in-place computation using only the memory needed to store a group of pictures. After applying the multicore optimization strategies over the 3D DWT, the proposed encoder is able to compress a full high-definition video sequence in real-time.
Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido
2018-05-23
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.
Ilk, Nicola; Völlenkle, Christine; Egelseer, Eva M.; Breitwieser, Andreas; Sleytr, Uwe B.; Sára, Margit
2002-01-01
The nucleotide sequence encoding the crystalline bacterial cell surface (S-layer) protein SbpA of Bacillus sphaericus CCM 2177 was determined by a PCR-based technique using four overlapping fragments. The entire sbpA sequence indicated one open reading frame of 3,804 bp encoding a protein of 1,268 amino acids with a theoretical molecular mass of 132,062 Da and a calculated isoelectric point of 4.69. The N-terminal part of SbpA, which is involved in anchoring the S-layer subunits via a distinct type of secondary cell wall polymer to the rigid cell wall layer, comprises three S-layer-homologous motifs. For screening of amino acid positions located on the outer surface of the square S-layer lattice, the sequence encoding Strep-tag I, showing affinity to streptavidin, was linked to the 5′ end of the sequence encoding the recombinant S-layer protein (rSbpA) or a C-terminally truncated form (rSbpA31-1068). The deletion of 200 C-terminal amino acids did not interfere with the self-assembly properties of the S-layer protein but significantly increased the accessibility of Strep-tag I. Thus, the sequence encoding the major birch pollen allergen (Bet v1) was fused via a short linker to the sequence encoding the C-terminally truncated form rSpbA31-1068. Labeling of the square S-layer lattice formed by recrystallization of rSbpA31-1068/Bet v1 on peptidoglycan-containing sacculi with a Bet v1-specific monoclonal mouse antibody demonstrated the functionality of the fused protein sequence and its location on the outer surface of the S-layer lattice. The specific interactions between the N-terminal part of SbpA and the secondary cell wall polymer will be exploited for an oriented binding of the S-layer fusion protein on solid supports to generate regularly structured functional protein lattices. PMID:12089001
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2016-02-16
The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof
Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius
2015-11-04
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius
2015-09-01
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-09-15
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof
Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius
2015-08-18
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Guo, Y C; Wang, H; Wu, H P; Zhang, M Q
2015-12-21
Aimed to address the defects of the large mean square error (MSE), and the slow convergence speed in equalizing the multi-modulus signals of the constant modulus algorithm (CMA), a multi-modulus algorithm (MMA) based on global artificial fish swarm (GAFS) intelligent optimization of DNA encoding sequences (GAFS-DNA-MMA) was proposed. To improve the convergence rate and reduce the MSE, this proposed algorithm adopted an encoding method based on DNA nucleotide chains to provide a possible solution to the problem. Furthermore, the GAFS algorithm, with its fast convergence and global search ability, was used to find the best sequence. The real and imaginary parts of the initial optimal weight vector of MMA were obtained through DNA coding of the best sequence. The simulation results show that the proposed algorithm has a faster convergence speed and smaller MSE in comparison with the CMA, the MMA, and the AFS-DNA-MMA.
Klarhöfer, Markus; Dilharreguy, Bixente; van Gelderen, Peter; Moonen, Chrit T W
2003-10-01
A 3D sequence for dynamic susceptibility imaging is proposed which combines echo-shifting principles (such as PRESTO), sensitivity encoding (SENSE), and partial-Fourier acquisition. The method uses a moderate SENSE factor of 2 and takes advantage of an alternating partial k-space acquisition in the "slow" phase encode direction allowing an iterative reconstruction using high-resolution phase estimates. Offering an isotropic spatial resolution of 4 x 4 x 4 mm(3), the novel sequence covers the whole brain including parts of the cerebellum in 0.5 sec. Its temporal signal stability is comparable to that of a full-Fourier, full-FOV EPI sequence having the same dynamic scan time but much less brain coverage. Initial functional MRI experiments showed consistent activation in the motor cortex with an average signal change slightly less than that of EPI. Copyright 2003 Wiley-Liss, Inc.
Lafuente, M J; Gamo, F J; Gancedo, C
1996-09-01
We have determined the sequence of a 10624 bp DNA segment located in the left arm of chromosome XV of Saccharomyces cerevisiae. The sequence contains eight open reading frames (ORFs) longer than 100 amino acids. Two of them do not present significant homology with sequences found in the databases. The product of ORF o0553 is identical to the protein encoded by the gene SMF1. Internal to it there is another ORF, o0555 that is apparently expressed. The proteins encoded by ORFs o0559 and o0565 are identical to ribosomal proteins S19.e and L18 respectively. ORF o0550 encodes a protein with an RNA binding signature including RNP motifs and stretches rich in asparagine, glutamine and arginine.
Genome sequence of the model medicinal mushroom Ganoderma lucidum
Chen, Shilin; Xu, Jiang; Liu, Chang; Zhu, Yingjie; Nelson, David R.; Zhou, Shiguo; Li, Chunfang; Wang, Lizhi; Guo, Xu; Sun, Yongzhen; Luo, Hongmei; Li, Ying; Song, Jingyuan; Henrissat, Bernard; Levasseur, Anthony; Qian, Jun; Li, Jianqin; Luo, Xiang; Shi, Linchun; He, Liu; Xiang, Li; Xu, Xiaolan; Niu, Yunyun; Li, Qiushi; Han, Mira V.; Yan, Haixia; Zhang, Jin; Chen, Haimei; Lv, Aiping; Wang, Zhen; Liu, Mingzhu; Schwartz, David C.; Sun, Chao
2012-01-01
Ganoderma lucidum is a widely used medicinal macrofungus in traditional Chinese medicine that creates a diverse set of bioactive compounds. Here we report its 43.3-Mb genome, encoding 16,113 predicted genes, obtained using next-generation sequencing and optical mapping approaches. The sequence analysis reveals an impressive array of genes encoding cytochrome P450s (CYPs), transporters and regulatory proteins that cooperate in secondary metabolism. The genome also encodes one of the richest sets of wood degradation enzymes among all of the sequenced basidiomycetes. In all, 24 physical CYP gene clusters are identified. Moreover, 78 CYP genes are coexpressed with lanosterol synthase, and 16 of these show high similarity to fungal CYPs that specifically hydroxylate testosterone, suggesting their possible roles in triterpenoid biosynthesis. The elucidation of the G. lucidum genome makes this organism a potential model system for the study of secondary metabolic pathways and their regulation in medicinal fungi. PMID:22735441
The DNA region encoding biphenyl dioxygenase, the first enzyme in the biphenyl-polychlorinated biphenyl degradation pathway of Pseudomonas species strain LB400, was sequenced. Six open reading frames were identified, four of which are homologous to the components of toluene dioxy...
ERIC Educational Resources Information Center
Ferry, Alissa L.; Fló, Ana; Brusini, Perrine; Cattarossi, Luigi; Macagno, Francesco; Nespor, Marina; Mehler, Jacques
2016-01-01
To understand language, humans must encode information from rapid, sequential streams of syllables--tracking their order and organizing them into words, phrases, and sentences. We used Near-Infrared Spectroscopy (NIRS) to determine whether human neonates are born with the capacity to track the positions of syllables in multisyllabic sequences.…
de Bellocq, J Goüy; Leirs, H
2009-09-01
Sequences of the complete open reading frame (ORF) for rodents major histocompatibility complex (MHC) class II genes are rare. Multimammate rat (Mastomys natalensis) complementary DNA (cDNA) encoding the alpha and beta chains of MHC class II DQ gene was cloned from a rapid amplifications of cDNA Emds (RACE) cDNA library. The ORFs consist of 801 and 771 bp encoding 266 and 256 amino acid residues for DQB and DQA, respectively. The genomic structure of Mana-DQ genes is globally analogous to that described for other rodents except for the insertion of a serine residue in the signal peptide of Mana-DQB, which is unique among known rodents.
Nucleic acids encoding phloem small RNA-binding proteins and transgenic plants comprising them
Lucas, William J.; Yoo, Byung-Chun; Lough, Tony J.; Varkonyi-Gasic, Erika
2007-03-13
The present invention provides a polynucleotide sequence encoding a component of the protein machinery involved in small RNA trafficking, Cucurbita maxima phloem small RNA-binding protein (CmPSRB 1), and the corresponding polypeptide sequence. The invention also provides genetic constructs and transgenic plants comprising the polynucleotide sequence encoding a phloem small RNA-binding protein to alter (e.g., prevent, reduce or elevate) non-cell autonomous signaling events in the plants involving small RNA metabolism. These signaling events are involved in a broad spectrum of plant physiological and biochemical processes, including, for example, systemic resistance to pathogens, responses to environmental stresses, e.g., heat, drought, salinity, and systemic gene silencing (e.g., viral infections).
Hussain, Shahid M; De Becker, Jan; Hop, Wim C J; Dwarkasing, Soendersing; Wielopolski, Piotr A
2005-03-01
To optimize and assess the feasibility of a single-shot black-blood T2-weighted spin-echo echo-planar imaging (SSBB-EPI) sequence for MRI of the liver using sensitivity encoding (SENSE), and compare the results with those obtained with a T2-weighted turbo spin-echo (TSE) sequence. Six volunteers and 16 patients were scanned at 1.5T (Philips Intera). In the volunteer study, we optimized the SSBB-EPI sequence by interactively changing the parameters (i.e., the resolution, echo time (TE), diffusion weighting with low b-values, and polarity of the phase-encoding gradient) with regard to distortion, suppression of the blood signal, and sensitivity to motion. The influence of each change was assessed. The optimized SSBB-EPI sequence was applied in patients (N = 16). A number of items, including the overall image quality (on a scale of 1-5), were used for graded evaluation. In addition, the signal-to-noise ratio (SNR) of the liver was calculated. Statistical analysis was carried out with the use of Wilcoxon's signed rank test for comparison of the SSBB-EPI and TSE sequences, with P = 0.05 considered the limit for significance. The SSBB-EPI sequence was improved by the following steps: 1) less frequency points than phase-encoding steps, 2) a b-factor of 20, and 3) a reversed polarity of the phase-encoding gradient. In patients, the mean overall image quality score for the optimized SSBB-EPI (3.5 (range: 1-4)) and TSE (3.6 (range: 3-4)), and the SNR of the liver on SSBB-EPI (mean +/- SD = 7.6 +/- 4.0) and TSE (8.9 +/- 4.6) were not significantly different (P > .05). Optimized SSBB-EPI with SENSE proved to be feasible in patients, and the overall image quality and SNR of the liver were comparable to those achieved with the standard respiratory-triggered T2-weighted TSE sequence. (c) 2005 Wiley-Liss, Inc.
Hinaut, Xavier; Dominey, Peter Ford
2011-01-01
Categorical encoding is crucial for mastering large bodies of related sensory-motor experiences, but what is its neural substrate? In an effort to respond to this question, recent single-unit recording studies in the macaque lateral prefrontal cortex (LPFC) have demonstrated two characteristic forms of neural encoding of the sequential structure of the animal's sensory-motor experience. One population of neurons encodes the specific behavioral sequences. A second population of neurons encodes the sequence category (e.g. ABAB, AABB or AAAA) and does not differentiate sequences within the category (Shima, K., Isoda, M., Mushiake, H., Tanji, J., 2007. Categorization of behavioural sequences in the prefrontal cortex. Nature 445, 315-318.). Interestingly these neurons are intermingled in the lateral prefrontal cortex, and not topographically segregated. Thus, LPFC may provide a neurophysiological basis for sensorimotor categorization. Here we report on a neural network simulation study that reproduces and explains these results. We model a cortical circuit composed of three layers (infragranular, granular, and supragranular) of 5*5 leaky integrator neurons with a sigmoidal output function, and we examine 1000 such circuits running in parallel. Crucially the three layers are interconnected with recurrent connections, thus producing a dynamical system that is inherently sensitive to the spatiotemporal structure of the sequential inputs. The model is presented with 11 four-element sequences following Shima et al. We isolated one subpopulation of neurons each of whose activity predicts individual sequences, and a second population that predicts category independent of the specific sequence. We argue that a richly interconnected cortical circuit is capable of internally generating a neural representation of category membership, thus significantly extending the scope of recurrent network computation. In order to demonstrate that these representations can be used to create an explicit categorization capability, we introduced an additional neural structure corresponding to the striatum. We showed that via cortico-striatal plasticity, neurons in the striatum could produce an explicit representation both of the identity of each sequence, and its category membership. Copyright © 2011 Elsevier Ltd. All rights reserved.
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.
Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J
1990-01-01
The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. Images FIG. 2 FIG. 7 PMID:2105291
Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B
1986-01-01
A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461
Utro, Filippo; Di Benedetto, Valeria; Corona, Davide F V; Giancarlo, Raffaele
2016-03-15
Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions of the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. We contribute to close this important methodological gap between the two models by providing three very simple formulas for the sequence specific one. They are all based on well-known formulas in Computer Science and Bioinformatics, and they give different quantifications of how complex a sequence is. In view of how remarkably well they perform, it is very surprising that measures of sequence complexity have not even been considered as candidates to close the mentioned gap. We provide experimental evidence that the intrinsic level of combinatorial organization and information-theoretic content of subsequences within a genome are strongly correlated to the level of DNA encoded nucleosome organization discovered by Kaplan et al Our results establish an important connection between the intrinsic complexity of subsequences in a genome and the intrinsic, i.e. DNA encoded, nucleosome organization of eukaryotic genomes. It is a first step towards a mathematical characterization of this latter 'encoding'. Supplementary data are available at Bioinformatics online. futro@us.ibm.com. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Yamamoto, S; Mutoh, N; Tsuzuki, D; Ikai, H; Nakao, H; Shinoda, S; Narimatsu, S; Miyoshi, S I
2000-05-01
L-2,4-diaminobutyrate decarboxylase (DABA DC) catalyzes the formation of 1,3-diaminopropane (DAP) from DABA. In the present study, the ddc gene encoding DABA DC from Enterobacter aerogenes ATCC 13048 was cloned and characterized. Determination of the nucleotide sequence revealed an open reading frame of 1470 bp encoding a 53659-Da protein of 490 amino acids, whose deduced NH2-terminal sequence was identical to that of purified DABA DC from E. aerogenes. The deduced amino acid sequence was highly similar to those of Acinetobacter baumannii and Haemophilus influenzae DABA DCs encoded by the ddc genes. The lysine-307 of the E. aerogenes DABA DC was identified as the pyridoxal 5'-phosphate binding residue by site-directed mutagenesis. Furthermore, PCR analysis revealed the distribution of E. aerogenes ddc homologs in some other species of Enterobacteriaceae. Such a relatively wide occurrence of the ddc homologs implies biological significance of DABA DC and its product DAP.
Auerbach, Raymond K; Chen, Bin; Butte, Atul J
2013-08-01
Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. abutte@stanford.edu Supplementary material is available at Bioinformatics online.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus
NASA Astrophysics Data System (ADS)
Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat
2016-11-01
In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao
2005-01-01
We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165(80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.
Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2004-02-01
To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
ChIP-chip versus ChIP-seq: Lessons for experimental design and data analysis
2011-01-01
Background Chromatin immunoprecipitation (ChIP) followed by microarray hybridization (ChIP-chip) or high-throughput sequencing (ChIP-seq) allows genome-wide discovery of protein-DNA interactions such as transcription factor bindings and histone modifications. Previous reports only compared a small number of profiles, and little has been done to compare histone modification profiles generated by the two technologies or to assess the impact of input DNA libraries in ChIP-seq analysis. Here, we performed a systematic analysis of a modENCODE dataset consisting of 31 pairs of ChIP-chip/ChIP-seq profiles of the coactivator CBP, RNA polymerase II (RNA PolII), and six histone modifications across four developmental stages of Drosophila melanogaster. Results Both technologies produce highly reproducible profiles within each platform, ChIP-seq generally produces profiles with a better signal-to-noise ratio, and allows detection of more peaks and narrower peaks. The set of peaks identified by the two technologies can be significantly different, but the extent to which they differ varies depending on the factor and the analysis algorithm. Importantly, we found that there is a significant variation among multiple sequencing profiles of input DNA libraries and that this variation most likely arises from both differences in experimental condition and sequencing depth. We further show that using an inappropriate input DNA profile can impact the average signal profiles around genomic features and peak calling results, highlighting the importance of having high quality input DNA data for normalization in ChIP-seq analysis. Conclusions Our findings highlight the biases present in each of the platforms, show the variability that can arise from both technology and analysis methods, and emphasize the importance of obtaining high quality and deeply sequenced input DNA libraries for ChIP-seq analysis. PMID:21356108
Dopamine modulates episodic memory persistence in old age
Chowdhury, Rumana; Guitart-Masip, Marc; Bunzeck, Nico; Dolan, Raymond J; Düzel, Emrah
2013-01-01
Activation of the hippocampus is required in order to encode memories for new events (or episodes). Observations from animal studies suggest that for these memories to persist beyond 4 to 6 hours, a release of dopamine generated by strong hippocampal activation is needed. This predicts that dopaminergic enhancement should improve human episodic memory persistence also for events encoded with weak hippocampal activation. Here, using pharmacological fMRI in an elderly population where there is a loss of dopamine neurons as part of normal aging, we show this very effect. The dopamine precursor levodopa led to a dose-dependent (inverted U-shape) persistent episodic memory benefit for images of scenes when tested after 6 hours, independent of whether encoding-related hippocampal fMRI activity was weak or strong (U-shaped dose-response relationship). This lasting improvement even for weakly encoded events supports a role for dopamine in human episodic memory consolidation albeit operating within a narrow dose range. PMID:23055489
Ferriol, I; Silva Junior, D M; Nigg, J C; Zamora-Macorra, E J; Falk, B W
2016-11-01
Torradoviruses, family Secoviridae, are emergent bipartite RNA plant viruses. RNA1 is ca. 7kb and has one open reading frame (ORF) encoding for the protease, helicase and RNA-dependent RNA polymerase (RdRp). RNA2 is ca. 5kb and has two ORFs. RNA2-ORF1 encodes for a putative protein with unknown function(s). RNA2-ORF2 encodes for a putative movement protein and three capsid proteins. Little is known about the replication and polyprotein processing strategies of torradoviruses. Here, the cleavage sites in the RNA2-ORF2-encoded polyproteins of two torradoviruses, Tomato marchitez virus isolate M (ToMarV-M) and tomato chocolate spot virus, were determined by N-terminal sequencing, revealing that the amino acid (aa) at the -1 position of the cleavage sites is a glutamine. Multiple aa sequence comparison confirmed that this glutamine is conserved among other torradoviruses. Finally, site-directed mutagenesis of conserved aas in the ToMarV-M RdRp and protease prevented substantial accumulation of viral coat proteins or RNAs. Copyright © 2016 Elsevier Inc. All rights reserved.
Hiding message into DNA sequence through DNA coding and chaotic maps.
Liu, Guoyan; Liu, Hongjun; Kadir, Abdurahman
2014-09-01
The paper proposes an improved reversible substitution method to hide data into deoxyribonucleic acid (DNA) sequence, and four measures have been taken to enhance the robustness and enlarge the hiding capacity, such as encode the secret message by DNA coding, encrypt it by pseudo-random sequence, generate the relative hiding locations by piecewise linear chaotic map, and embed the encoded and encrypted message into a randomly selected DNA sequence using the complementary rule. The key space and the hiding capacity are analyzed. Experimental results indicate that the proposed method has a better performance compared with the competing methods with respect to robustness and capacity.
ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.
Landt, Stephen G; Marinov, Georgi K; Kundaje, Anshul; Kheradpour, Pouya; Pauli, Florencia; Batzoglou, Serafim; Bernstein, Bradley E; Bickel, Peter; Brown, James B; Cayting, Philip; Chen, Yiwen; DeSalvo, Gilberto; Epstein, Charles; Fisher-Aylor, Katherine I; Euskirchen, Ghia; Gerstein, Mark; Gertz, Jason; Hartemink, Alexander J; Hoffman, Michael M; Iyer, Vishwanath R; Jung, Youngsook L; Karmakar, Subhradip; Kellis, Manolis; Kharchenko, Peter V; Li, Qunhua; Liu, Tao; Liu, X Shirley; Ma, Lijia; Milosavljevic, Aleksandar; Myers, Richard M; Park, Peter J; Pazin, Michael J; Perry, Marc D; Raha, Debasish; Reddy, Timothy E; Rozowsky, Joel; Shoresh, Noam; Sidow, Arend; Slattery, Matthew; Stamatoyannopoulos, John A; Tolstorukov, Michael Y; White, Kevin P; Xi, Simon; Farnham, Peggy J; Lieb, Jason D; Wold, Barbara J; Snyder, Michael
2012-09-01
Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals.
ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia
Landt, Stephen G.; Marinov, Georgi K.; Kundaje, Anshul; Kheradpour, Pouya; Pauli, Florencia; Batzoglou, Serafim; Bernstein, Bradley E.; Bickel, Peter; Brown, James B.; Cayting, Philip; Chen, Yiwen; DeSalvo, Gilberto; Epstein, Charles; Fisher-Aylor, Katherine I.; Euskirchen, Ghia; Gerstein, Mark; Gertz, Jason; Hartemink, Alexander J.; Hoffman, Michael M.; Iyer, Vishwanath R.; Jung, Youngsook L.; Karmakar, Subhradip; Kellis, Manolis; Kharchenko, Peter V.; Li, Qunhua; Liu, Tao; Liu, X. Shirley; Ma, Lijia; Milosavljevic, Aleksandar; Myers, Richard M.; Park, Peter J.; Pazin, Michael J.; Perry, Marc D.; Raha, Debasish; Reddy, Timothy E.; Rozowsky, Joel; Shoresh, Noam; Sidow, Arend; Slattery, Matthew; Stamatoyannopoulos, John A.; Tolstorukov, Michael Y.; White, Kevin P.; Xi, Simon; Farnham, Peggy J.; Lieb, Jason D.; Wold, Barbara J.; Snyder, Michael
2012-01-01
Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals. PMID:22955991
Xu, Ting; Xie, Jiasong; Yang, Shoubao; Ye, Shigen; Luo, Ming; Wu, Xinzhong
2016-08-01
Cyclophilins (CyPs) are a family of proteins that bind the immunosuppressive agent cyclosporin A (CsA) with high-affinity and belong to one of the three superfamilies of peptidyl-prolyl cis-trans isomerases (PPIase). In this report, three cyclophilin genes (Ca-CyPs), including Ca-CyPA, Ca-CyPB and Ca-PPIL3, were identified from oyster, Crassostrea ariakensis Gould in which Ca-CyPA encodes a protein with 165 amino acid sequences, Ca-CyPB encodes a protein with 217 amino acid sequences and Ca-PPIL3 encodes a protein with 162 amino acid sequences. All of the three Ca-CyPs genes contain a typical CyP-PPIase domain with its signature sequences and Ca-CyPB contains an N-signal peptide sequences. Tissue distribution study revealed that Ca-CyPs were ubiquitously expressed in all examined tissues and the highest levels were observed in hemocytes. RLO incubation upregulated the mRNA expression levels of Ca-CyPs, indicating that three Ca-CyPs might be involved in oyster immune response against RLO infection. Copyright © 2016 Elsevier Ltd. All rights reserved.
Isolation and characterization of the chicken trypsinogen gene family.
Wang, K; Gan, L; Lee, I; Hood, L
1995-01-01
Based on genomic Southern hybridizations and cDNA sequence analyses, the chicken trypsinogen gene family can be divided into two multi-member subfamilies, a six-member trypsinogen I subfamily which encodes the cationic trypsin isoenzymes and a three-member trypsinogen II subfamily which encodes the anionic trypsin isoenzymes. The chicken cDNA and genomic clones containing these two subfamilies were isolated and characterized by DNA sequence analysis. The results indicated that the chicken trypsinogen genes encoded a signal peptide of 15 to 16 amino acid residues, an activation peptide of 9 to 10 residues and a trypsin of 223 amino acid residues. The chicken trypsinogens contain all the common catalytic and structural features for trypsins, including the catalytic triad His, Asp and Ser and the six disulphide bonds. The trypsinogen I and II subfamilies share approximately 70% sequence identity at the nucleotide and amino acid level. The sequence comparison among chicken trypsinogen subfamily members and trypsin sequences from other species suggested that the chicken trypsinogen genes may have evolved in coincidental or concerted fashion. Images Figure 6 Figure 7 PMID:7733885
Cloning, sequencing and expression in MEL cells of a cDNA encoding the mouse ribosomal protein S5.
Vanegas, N; Castañeda, V; Santamaría, D; Hernández, P; Schvartzman, J B; Krimer, D B
1997-06-05
We describe the isolation and characterization of a cDNA encoding the mouse S5 ribosomal protein. It was isolated from a MEL (murine erythroleukemia) cell cDNA library by differential hybridization as a down regulated sequence during HMBA-induced differentiation. Northern series analysis showed that S5 mRNA expression is reduced 5-fold throughout the differentiation process. The mouse S5 mRNA is 760 bp long and encodes for a 204 amino acid protein with 94% homology with the human and rat S5.
Structure of adenovirus bound to cellular receptor car
Freimuth, Paul I.
2007-01-02
Disclosed is a mutant CAR-DI-binding adenovirus which has a genome comprising one or more mutations in sequences which encode the fiber protein knob domain wherein the mutation causes the encoded viral particle to have a significantly weakened binding affinity for CAR-DI relative to wild-type adenovirus. Such mutations may be in sequences which encode either the AB loop, or the HI loop of the fiber protein knob domain. Specific residues and mutations are described. Also disclosed is a method for generating a mutant adenovirus which is characterized by a receptor binding affinity or specificity which differs substantially from wild type.
Anzures, Gizelle; Wheeler, Andrea; Quinn, Paul C.; Pascalis, Olivier; Slater, Alan M.; Heron-Delaney, Michelle; Tanaka, James W.; Lee, Kang
2012-01-01
Perceptual narrowing in the visual, auditory, and multisensory domains has its developmental origins in infancy. The present study shows that experimentally induced experience can reverse the effects of perceptual narrowing on infants’ visual recognition memory of other-race faces. Caucasian 8- to 10-month-olds who could not discriminate between novel and familiarized Asian faces at the beginning of testing were given brief daily experience with Asian female faces in the experimental condition and Caucasian female faces in the control condition. At the end of three weeks, only infants who received daily experience with Asian females showed above-chance recognition of novel Asian female and male faces. Further, infants in the experimental condition showed greater efficiency in learning novel Asian females compared to infants in the control condition. Thus, visual experience with a novel stimulus category can reverse the effects of perceptual narrowing in infancy via improved stimulus recognition and encoding. PMID:22625845
Molecular mechanisms for protein-encoded inheritance
Wiltzius, Jed J. W.; Landau, Meytal; Nelson, Rebecca; Sawaya, Michael R.; Apostol, Marcin I.; Goldschmidt, Lukasz; Soriaga, Angela B.; Cascio, Duilio; Rajashankar, Kanagalaghatta; Eisenberg, David
2013-01-01
Strains are phenotypic variants, encoded by nucleic acid sequences in chromosomal inheritance and by protein “conformations” in prion inheritance and transmission. But how is a protein “conformation” stable enough to endure transmission between cells or organisms? Here new polymorphic crystal structures of segments of prion and other amyloid proteins offer structural mechanisms for prion strains. In packing polymorphism, prion strains are encoded by alternative packings (polymorphs) of β-sheets formed by the same segment of a protein; in a second mechanism, segmental polymorphism, prion strains are encoded by distinct β-sheets built from different segments of a protein. Both forms of polymorphism can produce enduring “conformations,” capable of encoding strains. These molecular mechanisms for transfer of information into prion strains share features with the familiar mechanism for transfer of information by nucleic acid inheritance, including sequence specificity and recognition by non-covalent bonds. PMID:19684598
Li, Ningzhi; An, Li; Johnson, Christopher; Shen, Jun
2017-01-01
Due to imperfect slice profiles, unwanted signals from outside the selected voxel may significantly contaminate metabolite signals acquired using in vivo magnetic resonance spectroscopy (MRS). The use of outer volume suppression may exceed the SAR threshold, especially at high field. We propose using phase-encoding gradients after radiofrequency (RF) excitation to spatially encode unwanted signals originating from outside of the selected single voxel. Phase-encoding gradients were added to a standard single voxel point-resolved spectroscopy (PRESS) sequence which selects a 2 × 2 × 2 cm 3 voxel. Subsequent spatial Fourier transform was used to encode outer volume signals. Phantom and in vivo experiments were performed using both phase-encoded PRESS and standard PRESS at 7 Tesla. Quantification was performed using fitting software developed in-house. Both phantom and in vivo studies showed that spectra from the phase-encoded PRESS sequence were relatively immune from contamination by oil signals and have more accurate quantification results than spectra from standard PRESS spectra of the same voxel. The proposed phase-encoded single-voxel PRESS method can significantly suppress outer volume signals that may appear in the spectra of standard PRESS without increasing RF power deposition.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thompson, David N.; Apel, William A.; Thompson, Vicki S.
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.; Henriksen, Emily D.
2015-06-02
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2013-10-15
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N [Idaho Falls, ID; Apel, William A [Jackson, WY; Thompson, Vicki S [Idaho Falls, ID; Reed, David W [Idaho Falls, ID; Lacey, Jeffrey A [Idaho Falls, ID; Henriksen, Emily D [Idaho Falls, ID
2012-06-19
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A; Henriksen, Emily D
2013-04-23
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.; Henriksen, Emily D.
2010-12-28
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A; Henriksen, Emily D
2013-07-30
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thompson, David N; Apel, William A; Thompson, Vicki S
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.
USDA-ARS?s Scientific Manuscript database
One-hundred-thirty-six expressed sequence tags (ESTs) encoding alpha gliadins from Triticum aestivum cv Butte 86 were identified in public databases and assembled into 19 contigs. Consensus sequences for 12 of the contigs encoded complete alpha gliadin proteins, but only two were identical to protei...
USDA-ARS?s Scientific Manuscript database
The cattle tick, Rhipicephalus (Boophilus) microplus, has a genome over 2.4 times the size of the human genome, and with over 70% of repetitive DNA, this genome would prove very costly to sequence at today's prices and difficult to assemble and analyze. BAC clones give insight into the genome struct...
Cloning and sequence analysis of the invertase gene INV 1 from the yeast Pichia anomala.
Pérez, J A; Rodríguez, J; Rodríguez, L; Ruiz, T
1996-02-01
A genomic library from the yeast Pichia anomala has been constructed and employed to clone the gene encoding the sucrose-hydrolysing enzyme invertase by complementation of a sucrose non-fermenting mutant of Saccharomyces cerevisiae. The cloned gene, INV1, was sequenced and found to encode a polypeptide of 550 amino acids which contained a 22 amino-acid signal sequence and ten potential glycosylation sites. The amino-acid sequence shows significant identity with other yeast invertases and also with Kluyveromyces marxianus inulinase, a yeast beta-fructofuranosidase which has a different substrate specificity. The nucleotide sequences of the 5' and 3' non-coding regions were found to contain several consensus motifs probably involved in the initiation and termination of gene transcription.
Meher, J K; Meher, P K; Dash, G N; Raval, M K
2012-01-01
The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Carbohydrate degrading polypeptide and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Sugimura; Sawabe; Ezura
2000-01-01
The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.
Conlan, Sean; Thomas, Pamela J.; Deming, Clayton; Park, Morgan; Lau, Anna F.; Dekker, John P.; Snitkin, Evan S.; Clark, Tyson A.; Luong, Khai; Song, Yi; Tsai, Yu-Chih; Boitano, Matthew; Gupta, Jyoti; Brooks, Shelise Y.; Schmidt, Brian; Young, Alice C.; Thomas, James W.; Bouffard, Gerard G.; Blakesley, Robert W.; Mullikin, James C.; Korlach, Jonas; Henderson, David K.; Frank, Karen M.; Palmore, Tara N.; Segre, Julia A.
2014-01-01
Public health officials have raised concerns that plasmid transfer between Enterobacteriaceae species may spread resistance to carbapenems, an antibiotic class of last resort, thereby rendering common healthcare-associated infections nearly impossible to treat. We performed comprehensive surveillance and genomic sequencing to identify carbapenem-resistant Enterobacteriaceae in the NIH Clinical Center patient population and hospital environment in order to to articulate the diversity of carbapenemase-encoding plasmids and survey the mobility of and assess the mobility of these plasmids between bacterial species. We isolated a repertoire of carbapenemase-encoding Enterobacteriaceae, including multiple strains of Klebsiella pneumoniae, Klebsiella oxytoca, Escherichia coli, Enterobacter cloacae, Citrobacter freundii, and Pantoea species. Long-read genome sequencing with full end-to-end assembly revealed that these organisms carry the carbapenem-resistance genes on a wide array of plasmids. Klebsiella pneumoniae and Enterobacter cloacae isolated simultaneously from a single patient harbored two different carbapenemase-encoding plasmids, overriding the epidemiological scenario of plasmid transfer between organisms within this patient. We did, however, find evidence supporting horizontal transfer of carbapenemase-encoding plasmids between Klebsiella pneumoniae, Enterobacter cloacae and Citrobacter freundii in the hospital environment. Our comprehensive sequence data, with full plasmid identification, challenges assumptions about horizontal gene transfer events within patients and identified wider possible connections between patients and the hospital environment. In addition, we identified a new carbapenemase-encoding plasmid of potentially high clinical impact carried by Klebsiella pneumoniae, Escherichia coli, Enterobacter cloacae and Pantoea species, from unrelated patients and the hospital environment. PMID:25232178
Identification of a novel circular DNA virus in pig feces
USDA-ARS?s Scientific Manuscript database
Metagenomic analysis of fecal samples collected from a swine with diarrhea detected sequences encoding a replicase (Rep) protein typically found in small circular Rep-encoding ssDNA (CRESS-DNA) viruses. The complete 3,062 nucleotide genome was generated and found to encode two bi-directionally trans...
Anticipatory activity in primary motor cortex codes memorized movement sequences.
Lu, Xiaofeng; Ashe, James
2005-03-24
Movement sequences, defined both by the component movements and by the serial order in which they are produced, are fundamental building blocks of motor behavior. The serial order of sequence production is strongly encoded in medial motor areas. It is not known to what extent sequences are further elaborated or encoded in primary motor cortex. Here, we describe cells in the primary motor cortex of the monkey that show anticipatory activity exclusively related to a specific memorized sequence of upcoming movements. In addition, the injection of muscimol, a GABA agonist, into motor cortex resulted in an increase in the error rate during sequence production, without concomitant effects on nonsequenced motor performance. Our results challenge the role of medial motor areas in the control of well-practiced movement sequences and suggest that motor cortex contains a complete apparatus for the planning and production of this complex behavior.
Novel encoding methods for DNA-templated chemical libraries.
Li, Gang; Zheng, Wenlu; Liu, Ying; Li, Xiaoyu
2015-06-01
Among various types of DNA-encoded chemical libraries, DNA-templated library takes advantage of the sequence-specificity of DNA hybridization, enabling not only highly effective DNA-templated chemical reactions, but also high fidelity in library encoding. This brief review summarizes recent advances that have been made on the encoding strategies for DNA-templated libraries, and it also highlights their respective advantages and limitations for the preparation of DNA-encoded libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.
Exome sequencing identifies complex I NDUFV2 mutations as a novel cause of Leigh syndrome.
Cameron, Jessie M; MacKay, Nevena; Feigenbaum, Annette; Tarnopolsky, Mark; Blaser, Susan; Robinson, Brian H; Schulze, Andreas
2015-09-01
Two siblings with hypertrophic cardiomyopathy and brain atrophy were diagnosed with Complex I deficiency based on low enzyme activity in muscle and high lactate/pyruvate ratio in fibroblasts. Whole exome sequencing results of fibroblast gDNA from one sibling was narrowed down to 190 SNPs or In/Dels in 185 candidate genes by selecting non-synonymous coding sequence base pair changes that were not present in the SNP database. Two compound heterozygous mutations were identified in both siblings in NDUFV2, encoding the 24 kDa subunit of Complex I. The intronic mutation (c.IVS2 + 1delGTAA) is disease causing and has been reported before. The other mutation is novel (c.669_670insG, p.Ser224Valfs*3) and predicted to cause a pathogenic frameshift in the protein. Subsequent investigation of 10 probands with complex I deficiency from different families revealed homozygosity for the intronic c.IVS2 + 1delGTAA mutation in a second, consanguineous family. In this family three of five siblings were affected. Interestingly, they presented with Leigh syndrome but no cardiac involvement. The same genotype had been reported previously in a two families but presenting with hypertrophic cardiomyopathy, trunk hypotonia and encephalopathy. We have identified NDUFV2 mutations in two families with Complex I deficiency, including a novel mutation. The diagnosis of Leigh syndrome expands the clinical phenotypes associated with the c.IVS2 + 1delGTAA mutation in this gene. Copyright © 2015 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
Beccari, T; Hoade, J; Orlacchio, A; Stirling, J L
1992-01-01
cDNAs encoding the mouse beta-N-acetylhexosaminidase alpha-subunit were isolated from a mouse testis library. The longest of these (1.7 kb) was sequenced and showed 83% similarity with the human alpha-subunit cDNA sequence. The 5' end of the coding sequence was obtained from a genomic DNA clone. Alignment of the human and mouse sequences showed that all three putative N-glycosylation sites are conserved, but that the mouse alpha-subunit has an additional site towards the C-terminus. All eight cysteines in the human sequence are conserved in the mouse. There are an additional two cysteines in the mouse alpha-subunit signal peptide. All amino acids affected in Tay-Sachs-disease mutations are conserved in the mouse. Images Fig. 1. PMID:1379046
Livingston, B T; Shaw, R; Bailey, A; Wilt, F
1991-12-01
In order to investigate the role of proteins in the formation of mineralized tissues during development, we have isolated a cDNA that encodes a protein that is a component of the organic matrix of the skeletal spicule of the sea urchin, Lytechinus pictus. The expression of the RNA encoding this protein is regulated over development and is localized to the descendents of the micromere lineage. Comparison of the sequence of this cDNA to homologous cDNAs from other species of urchin reveal that the protein is basic and contains three conserved structural motifs: a signal peptide, a proline-rich region, and an unusual region composed of a series of direct repeats. Studies on the protein encoded by this cDNA confirm the predicted reading frame deduced from the nucleotide sequence and show that the protein is secreted and not glycosylated. Comparison of the amino acid sequence to databases reveal that the repeat domain is similar to proteins that form a unique beta-spiral supersecondary structure.
Neurologic 3D MR Spectroscopic Imaging with Low-Power Adiabatic Pulses and Fast Spiral Acquisition
Gagoski, Borjan A.; Sorensen, A. Gregory
2012-01-01
Purpose: To improve clinical three-dimensional (3D) MR spectroscopic imaging with more accurate localization and faster acquisition schemes. Materials and Methods: Institutional review board approval and patient informed consent were obtained. Data were acquired with a 3-T MR imager and a 32-channel head coil in phantoms, five healthy volunteers, and five patients with glioblastoma. Excitation was performed with localized adiabatic spin-echo refocusing (LASER) by using adiabatic gradient-offset independent adiabaticity wideband uniform rate and smooth truncation (GOIA-W[16,4]) pulses with 3.5-msec duration, 20-kHz bandwidth, 0.81-kHz amplitude, and 45-msec echo time. Interleaved constant-density spirals simultaneously encoded one frequency and two spatial dimensions. Conventional phase encoding (PE) (1-cm3 voxels) was performed after LASER excitation and was the reference standard. Spectra acquired with spiral encoding at similar and higher spatial resolution and with shorter imaging time were compared with those acquired with PE. Metabolite levels were fitted with software, and Bland-Altman analysis was performed. Results: Clinical 3D MR spectroscopic images were acquired four times faster with spiral protocols than with the elliptical PE protocol at low spatial resolution (1 cm3). Higher-spatial-resolution images (0.39 cm3) were acquired twice as fast with spiral protocols compared with the low-spatial-resolution elliptical PE protocol. A minimum signal-to-noise ratio (SNR) of 5 was obtained with spiral protocols under these conditions and was considered clinically adequate to reliably distinguish metabolites from noise. The apparent SNR loss was not linear with decreasing voxel sizes because of longer local T2* times. Improvement of spectral line width from 4.8 Hz to 3.5 Hz was observed at high spatial resolution. The Bland-Altman agreement between spiral and PE data is characterized by narrow 95% confidence intervals for their differences (0.12, 0.18 of their means). GOIA-W(16,4) pulses minimize chemical-shift displacement error to 2.1%, reduce nonuniformity of excitation to 5%, and eliminate the need for outer volume suppression. Conclusion: The proposed adiabatic spiral 3D MR spectroscopic imaging sequence can be performed in a standard clinical MR environment. Improvements in image quality and imaging time could enable more routine acquisition of spectroscopic data than is possible with current pulse sequences. © RSNA, 2011 PMID:22187628
Molecular cloning and characterization of alpha - galactosidase gene from Glaciozyma antarctica
NASA Astrophysics Data System (ADS)
Moheer, Reyad Qaed Al; Bakar, Farah Diba Abu; Murad, Abdul Munir Abdul
2015-09-01
Psychrophilic enzymes are proteins produced by psychrophilic organisms which recently are the limelight for industrial applications. A gene encoding α-galactosidase from a psychrophilic yeast, Glaciozyma antarctica PI12 which belongs to glycoside hydrolase family 27, was isolated and analyzed using several bioinformatic tools. The cDNA of the gene with the size of 1,404-bp encodes a protein with 467 amino acid residues. Predicted molecular weight of protein was 48.59 kDa and hence we name the gene encoding α-galactosidase as GAL48. We found that the predicted protein sequences possessed signal peptide sequence and are highly conserved among other fungal α-galactosidase.
File compression and encryption based on LLS and arithmetic coding
NASA Astrophysics Data System (ADS)
Yu, Changzhi; Li, Hengjian; Wang, Xiyu
2018-03-01
e propose a file compression model based on arithmetic coding. Firstly, the original symbols, to be encoded, are input to the encoder one by one, we produce a set of chaotic sequences by using the Logistic and sine chaos system(LLS), and the values of this chaotic sequences are randomly modified the Upper and lower limits of current symbols probability. In order to achieve the purpose of encryption, we modify the upper and lower limits of all character probabilities when encoding each symbols. Experimental results show that the proposed model can achieve the purpose of data encryption while achieving almost the same compression efficiency as the arithmetic coding.
Horse cDNA clones encoding two MHC class I genes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barbis, D.P.; Maher, J.K.; Stanek, J.
1994-12-31
Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding in human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.
Appearances Aren't Everything: Shape Classifiers and Referential Processing in Cantonese
ERIC Educational Resources Information Center
Tsang, Cara; Chambers, Craig G.
2011-01-01
Cantonese shape classifiers encode perceptual information that is characteristic of their associated nouns, although certain nouns are exceptional. For example, the classifier "tiu" occurs primarily with nouns for long-narrow-flexible objects (e.g., scarves, snakes, and ropes) and also occurs with the noun for a (short, rigid) key. In 3…
nuID: a universal naming scheme of oligonucleotides for Illumina, Affymetrix, and other microarrays
Du, Pan; Kibbe, Warren A; Lin, Simon M
2007-01-01
Background Oligonucleotide probes that are sequence identical may have different identifiers between manufacturers and even between different versions of the same company's microarray; and sometimes the same identifier is reused and represents a completely different oligonucleotide, resulting in ambiguity and potentially mis-identification of the genes hybridizing to that probe. Results We have devised a unique, non-degenerate encoding scheme that can be used as a universal representation to identify an oligonucleotide across manufacturers. We have named the encoded representation 'nuID', for nucleotide universal identifier. Inspired by the fact that the raw sequence of the oligonucleotide is the true definition of identity for a probe, the encoding algorithm uniquely and non-degenerately transforms the sequence itself into a compact identifier (a lossless compression). In addition, we added a redundancy check (checksum) to validate the integrity of the identifier. These two steps, encoding plus checksum, result in an nuID, which is a unique, non-degenerate, permanent, robust and efficient representation of the probe sequence. For commercial applications that require the sequence identity to be confidential, we have an encryption schema for nuID. We demonstrate the utility of nuIDs for the annotation of Illumina microarrays, and we believe it has universal applicability as a source-independent naming convention for oligomers. Reviewers This article was reviewed by Itai Yanai, Rong Chen (nominated by Mark Gerstein), and Gregory Schuler (nominated by David Lipman). PMID:17540033
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, P.J.; Walthers, E.A.; Richmond, K.L.
1997-04-01
PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
A deep learning method for lincRNA detection using auto-encoder algorithm.
Yu, Ning; Yu, Zeng; Pan, Yi
2017-12-06
RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
Irie, S; Doi, S; Yorifuji, T; Takagi, M; Yano, K
1987-01-01
The nucleotide sequence of the genes from Pseudomonas putida encoding oxidation of benzene to catechol was determined. Five open reading frames were found in the sequence. Four corresponding protein molecules were detected by a DNA-directed in vitro translation system. Escherichia coli cells containing the fragment with the four open reading frames transformed benzene to cis-benzene glycol, which is an intermediate of the oxidation of benzene to catechol. The relation between the product of each cistron and the components of the benzene oxidation enzyme system is discussed. Images PMID:3667527
Continuous in vitro evolution of bacteriophage RNA polymerase promoters
NASA Technical Reports Server (NTRS)
Breaker, R. R.; Banerji, A.; Joyce, G. F.
1994-01-01
Rapid in vitro evolution of bacteriophage T7, T3, and SP6 RNA polymerase promoters was achieved by a method that allows continuous enrichment of DNAs that contain functional promoter elements. This method exploits the ability of a special class of nucleic acid molecules to replicate continuously in the presence of both a reverse transcriptase and a DNA-dependent RNA polymerase. Replication involves the synthesis of both RNA and cDNA intermediates. The cDNA strand contains an embedded promoter sequence, which becomes converted to a functional double-stranded promoter element, leading to the production of RNA transcripts. Synthetic cDNAs, including those that contain randomized promoter sequences, can be used to initiate the amplification cycle. However, only those cDNAs that contain functional promoter sequences are able to produce RNA transcripts. Furthermore, each RNA transcript encodes the RNA polymerase promoter sequence that was responsible for initiation of its own transcription. Thus, the population of amplifying molecules quickly becomes enriched for those templates that encode functional promoters. Optimal promoter sequences for phage T7, T3, and SP6 RNA polymerase were identified after a 2-h amplification reaction, initiated in each case with a pool of synthetic cDNAs encoding greater than 10(10) promoter sequence variants.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes
NASA Astrophysics Data System (ADS)
Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat
2016-11-01
In this study, bioinformatic analysis towards genome sequence of E. aerogenes was done to determine gene encoded for gelatinase. Enterobacter aerogenes was isolated from hot spring water and gelatinase species-specific bacterium to porcine and fish gelatin. This bacterium offers the possibility of enzymes production which is specific to both species gelatine, respectively. Enterobacter aerogenes was partially genome sequenced resulting in 5.0 mega basepair (Mbp) total size of sequence. From pre-process pipeline, 87.6 Mbp of total reads, 68.8 Mbp of total high quality reads and 78.58 percent of high quality percentage was determined. Genome assembly produced 120 contigs with 67.5% of contigs over 1 kilo base pair (kbp), 124856 bp of N50 contig length and 55.17 % of GC base content percentage. About 4705 protein gene was identified from protein prediction analysis. Two candidate genes selected have highest similarity identity percentage against gelatinase enzyme available in Swiss-Prot and NCBI online database. They were NODE_9_length_26866_cov_148.013245_12 containing 1029 base pair (bp) sequence with 342 amino acid sequence and NODE_24_length_155103_cov_177.082458_62 which containing 717 bp sequence with 238 amino acid sequence, respectively. Thus, two paired of primers (forward and reverse) were designed, based on the open reading frame (ORF) of selected genes. Genome analysis of E. aerogenes resulting genes encoded gelatinase were identified.
Inhibitor of striate conditionally suppresses cell proliferation in variegated maize
Park, Sung Han; Park, Su Hyun; Chin, Hang Gyeong; Cho, Moo Je; Martienssen, Robert A.; Han, Chang-deok
2000-01-01
Since the work done by R.A. Emerson in the 1930s, Inhibitor of striate (Isr) has been recognized as a dose-dependent genetic modifier of variegation in chlorotic leaf striping mutants of maize such as striate2 (sr2). We have shown that Isr specifically inhibits proliferation and differentiation of plastid defective cells in sr2 mutants. Leaf narrowing is due to loss of intermediate veins and ground tissue located at leaf margins, and the few remaining plastid defective cells are of irregular size and aberrant organization. The Isr gene has been cloned by targeted transposon tagging. Isr mRNA is expressed throughout young leaves, but Isr chimeras indicate that the expression of Isr at leaf margins is sufficient to suppress both the lateral expansion of sr2 leaves and the extent of striping. Isr protein appears to encode a chloroplast protein with sequence similarity to a family of bacterial phosphatases involved in carbon catabolite repression or in carbon metabolism. We propose that the action of Isr in nuclear and plastid communication could be triggered by carbon stress. PMID:10783171
Structure-Function Analysis of Chloroplast Proteins via Random Mutagenesis Using Error-Prone PCR.
Dumas, Louis; Zito, Francesca; Auroy, Pascaline; Johnson, Xenie; Peltier, Gilles; Alric, Jean
2018-06-01
Site-directed mutagenesis of chloroplast genes was developed three decades ago and has greatly advanced the field of photosynthesis research. Here, we describe a new approach for generating random chloroplast gene mutants that combines error-prone polymerase chain reaction of a gene of interest with chloroplast complementation of the knockout Chlamydomonas reinhardtii mutant. As a proof of concept, we targeted a 300-bp sequence of the petD gene that encodes subunit IV of the thylakoid membrane-bound cytochrome b 6 f complex. By sequencing chloroplast transformants, we revealed 149 mutations in the 300-bp target petD sequence that resulted in 92 amino acid substitutions in the 100-residue target subunit IV sequence. Our results show that this method is suited to the study of highly hydrophobic, multisubunit, and chloroplast-encoded proteins containing cofactors such as hemes, iron-sulfur clusters, and chlorophyll pigments. Moreover, we show that mutant screening and sequencing can be used to study photosynthetic mechanisms or to probe the mutational robustness of chloroplast-encoded proteins, and we propose that this method is a valuable tool for the directed evolution of enzymes in the chloroplast. © 2018 American Society of Plant Biologists. All rights reserved.
Hobbs, A A; Rosen, J M
1982-01-01
The complete sequences of rat alpha- and gamma-casein mRNAs have been determined. The 1402-nucleotide alpha- and 864-nucleotide gamma-casein mRNAs both encode 15 amino acid signal peptides and mature proteins of 269 and 164 residues, respectively. Considerable homology between the 5' non-coding regions, and the regions encoding the signal peptides and the phosphorylation sites, in these mRNAs as compared to several other rodent casein mRNAs, was observed. Significant homology was also detected between rat alpha- and bovine alpha s1-casein. Comparison of the rodent and bovine sequences suggests that the caseins evolved at about the time of the appearance of the primitive mammals. This may have occurred by intragenic duplication of a nucleotide sequence encoding a primitive phosphorylation site, -(Ser)n-Glu-Glu-, and intergenic duplication resulting in the small casein multigene family. A unique feature of the rat alpha-casein sequence is an insertion in the coding region containing 10 repeated elements of 18 nucleotides each. This insertion appears to have occurred 7-12 million years ago, just prior to the divergence of rat and mouse. Images PMID:6298707
2013-01-01
identity to acetylcholinesterase mRNA sequences of Culex tritaeniorhynchus and Lutzomyia longipalpis, respectively. The P. papatasi cDNA ORF encoded a...tritaeniorhynchus and Lutzomyia longipalpis, respectively. The P. papatasi cDNA ORF encoded a 710-amino acid protein [GenBank: AFP20868] exhibiting 85...improve effectiveness of pesticide application for control of the new world sand fly Lutzomyia longipalpis in chicken sheds [13]. Attempts to control
Auffret, Pauline; Segura, Audrey; Klopp, Christophe; Bouchez, Olivier; Kérourédan, Monique; Bibbal, Delphine; Brugère, Hubert; Forano, Evelyne
2017-01-01
ABSTRACT Enterohemorrhagic Escherichia coli (EHEC) with serotype O157:H7 is a major foodborne pathogen. Here, we report the draft genome sequence of EHEC O157:H7 strain MC2 isolated from cattle in France. The assembly contains 5,400,376 bp that encoded 5,914 predicted genes (5,805 protein-encoding genes and 109 RNA genes). PMID:28983004
Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz
2016-11-28
Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas .
Stayton, M M; Black, M; Bedbrook, J; Dunsmuir, P
1986-12-22
The 16 petunia Cab genes which have been characterized are all closely related at the nucleotide sequence level and they encode Cab precursor polypeptides which are similar in sequence and length. Here we describe a novel petunia Cab gene which encodes a unique Cab precursor protein. This protein is a member of the smallest class of Cab precursor proteins for which no gene has previously been assigned in petunia or any other species. The features of this Cab precursor protein are that it is shorter by 2-3 amino acids than the formerly characterized Cab precursors, its transit peptide sequence is unrelated, and the mature polypeptide is significantly diverged at the functionally important N terminus from other petunia Cab proteins. Gene structure also discriminates this gene which is the only intron containing Cab gene in petunia genomic DNA.
Yiheng Hu; Meng Dang; Xiaojia Feng; Keith Woeste; Peng Zhao
2017-01-01
The conservation of narrow endemic species relies on accurate information regarding their population structure. Juglans hopeiensis Hu (Ma walnut), found only in Hebei province, Beijing, and Tianjin, China, is a threatened tree species valued commercially for its nut and wood. Sequences of two maternally inherited mitochondrial markers and two...
Kobayashi, Michie; Hiraka, Yukie; Abe, Akira; Yaegashi, Hiroki; Natsume, Satoshi; Kikuchi, Hideko; Takagi, Hiroki; Saitoh, Hiromasa; Win, Joe; Kamoun, Sophien; Terauchi, Ryohei
2017-11-22
Downy mildew, caused by the oomycete pathogen Sclerospora graminicola, is an economically important disease of Gramineae crops including foxtail millet (Setaria italica). Plants infected with S. graminicola are generally stunted and often undergo a transformation of flower organs into leaves (phyllody or witches' broom), resulting in serious yield loss. To establish the molecular basis of downy mildew disease in foxtail millet, we carried out whole-genome sequencing and an RNA-seq analysis of S. graminicola. Sequence reads were generated from S. graminicola using an Illumina sequencing platform and assembled de novo into a draft genome sequence comprising approximately 360 Mbp. Of this sequence, 73% comprised repetitive elements, and a total of 16,736 genes were predicted from the RNA-seq data. The predicted genes included those encoding effector-like proteins with high sequence similarity to those previously identified in other oomycete pathogens. Genes encoding jacalin-like lectin-domain-containing secreted proteins were enriched in S. graminicola compared to other oomycetes. Of a total of 1220 genes encoding putative secreted proteins, 91 significantly changed their expression levels during the infection of plant tissues compared to the sporangia and zoospore stages of the S. graminicola lifecycle. We established the draft genome sequence of a downy mildew pathogen that infects Gramineae plants. Based on this sequence and our transcriptome analysis, we generated a catalog of in planta-induced candidate effector genes, providing a solid foundation from which to identify the effectors causing phyllody.
Tomazetto, Geizecler; Wibberg, Daniel; Schlüter, Andreas; Oliveira, Valéria M
2015-01-01
A fosmid metagenomic library was constructed with total community DNA obtained from a municipal wastewater treatment plant (MWWTP), with the aim of identifying new FeFe-hydrogenase genes encoding the enzymes most important for hydrogen metabolism. The dataset generated by pyrosequencing of a fosmid library was mined to identify environmental gene tags (EGTs) assigned to FeFe-hydrogenase. The majority of EGTs representing FeFe-hydrogenase genes were affiliated with the class Clostridia, suggesting that this group is the main hydrogen producer in the MWWTP analyzed. Based on assembled sequences, three FeFe-hydrogenase genes were predicted based on detection of the L2 motif (MPCxxKxxE) in the encoded gene product, confirming true FeFe-hydrogenase sequences. These sequences were used to design specific primers to detect fosmids encoding FeFe-hydrogenase genes predicted from the dataset. Three identified fosmids were completely sequenced. The cloned genomic fragments within these fosmids are closely related to members of the Spirochaetaceae, Bacteroidales and Firmicutes, and their FeFe-hydrogenase sequences are characterized by the structure type M3, which is common to clostridial enzymes. FeFe-hydrogenase sequences found in this study represent hitherto undetected sequences, indicating the high genetic diversity regarding these enzymes in MWWTP. Results suggest that MWWTP have to be considered as reservoirs for new FeFe-hydrogenase genes. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Spectral analysis of variable-length coded digital signals
NASA Astrophysics Data System (ADS)
Cariolaro, G. L.; Pierobon, G. L.; Pupolin, S. G.
1982-05-01
A spectral analysis is conducted for a variable-length word sequence by an encoder driven by a stationary memoryless source. A finite-state sequential machine is considered as a model of the line encoder, and the spectral analysis of the encoded message is performed under the assumption that the sourceword sequence is composed of independent identically distributed words. Closed form expressions for both the continuous and discrete parts of the spectral density are derived in terms of the encoder law and sourceword statistics. The jump part exhibits jumps at multiple integers of per lambda(sub 0)T, where lambda(sub 0) is the greatest common divisor of the possible codeword lengths, and T is the symbol period. The derivation of the continuous part can be conveniently factorized, and the theory is applied to the spectral analysis of BnZS and HDBn codes.
Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y
2001-07-01
The complete nucleotide sequence of M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.
Skilled memory in expert figure skaters.
Deakin, J M; Allard, F
1991-01-01
The present studies extend skilled-memory theory to a domain involving the performance of motor sequences. Skilled figure skaters were better able than their less skilled counterparts to perform short skating sequences that were choreographed, rather than randomly constructed. Expert skaters encoded sequences for performance very differently from the way in which they encoded sequences that were verbally presented for verbal recall. Tasks interpolated between sequence and recall showed no significant influence on recall accuracy, implicating long-term memory in skating memory. There was little evidence for the use of retrieval structures when skaters learned the brief sequences used throughout these studies. Finally, expert skaters were able to judge the similarity of two skating elements faster than less skilled skaters, indicating a faster access to semantic memory for experts. The data indicate that skaters show many of the same skilled-memory characteristics as have been described in other skill domains involving memorization, such as digit span and memory for dinner orders.
Bricheux, G; Brugerolle, G
1997-08-01
The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Methods of diagnosing alagille syndrome
Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.
2004-03-09
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.
Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G
2002-11-01
The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Evaluating the protein coding potential of exonized transposable element sequences
Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King
2007-01-01
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
Lab-on-a-chip platform for high throughput drug discovery with DNA-encoded chemical libraries
NASA Astrophysics Data System (ADS)
Grünzner, S.; Reddavide, F. V.; Steinfelder, C.; Cui, M.; Busek, M.; Klotzbach, U.; Zhang, Y.; Sonntag, F.
2017-02-01
The fast development of DNA-encoded chemical libraries (DECL) in the past 10 years has received great attention from pharmaceutical industries. It applies the selection approach for small molecular drug discovery. Because of the limited choices of DNA-compatible chemical reactions, most DNA-encoded chemical libraries have a narrow structural diversity and low synthetic yield. There is also a poor correlation between the ranking of compounds resulted from analyzing the sequencing data and the affinity measured through biochemical assays. By combining DECL with dynamical chemical library, the resulting DNA-encoded dynamic library (EDCCL) explores the thermodynamic equilibrium of reversible reactions as well as the advantages of DNA encoded compounds for manipulation/detection, thus leads to enhanced signal-to-noise ratio of the selection process and higher library quality. However, the library dynamics are caused by the weak interactions between the DNA strands, which also result in relatively low affinity of the bidentate interaction, as compared to a stable DNA duplex. To take advantage of both stably assembled dual-pharmacophore libraries and EDCCLs, we extended the concept of EDCCLs to heat-induced EDCCLs (hi-EDCCLs), in which the heat-induced recombination process of stable DNA duplexes and affinity capture are carried out separately. To replace the extremely laborious and repetitive manual process, a fully automated device will facilitate the use of DECL in drug discovery. Herein we describe a novel lab-on-a-chip platform for high throughput drug discovery with hi-EDCCL. A microfluidic system with integrated actuation was designed which is able to provide a continuous sample circulation by reducing the volume to a minimum. It consists of a cooled and a heated chamber for constant circulation. The system is capable to generate stable temperatures above 75 °C in the heated chamber to melt the double strands of the DNA and less than 15 °C in the cooled chamber, to reanneal the reshuffled library. In the binding chamber (the cooled chamber) specific retaining structures are integrated. These hold back beads functionalized with the target protein, while the chamber is continuously flushed with library molecules. Afterwards the whole system can be flushed with buffer to wash out unspecific bound molecules. Finally the protein-loaded beads with attached molecules can be eluted for further investigation.
NASA Astrophysics Data System (ADS)
Garrido, Marta Isabel; Teng, Chee Leong James; Taylor, Jeremy Alexander; Rowe, Elise Genevieve; Mattingley, Jason Brett
2016-06-01
The ability to learn about regularities in the environment and to make predictions about future events is fundamental for adaptive behaviour. We have previously shown that people can implicitly encode statistical regularities and detect violations therein, as reflected in neuronal responses to unpredictable events that carry a unique prediction error signature. In the real world, however, learning about regularities will often occur in the context of competing cognitive demands. Here we asked whether learning of statistical regularities is modulated by concurrent cognitive load. We compared electroencephalographic metrics associated with responses to pure-tone sounds with frequencies sampled from narrow or wide Gaussian distributions. We showed that outliers evoked a larger response than those in the centre of the stimulus distribution (i.e., an effect of surprise) and that this difference was greater for physically identical outliers in the narrow than in the broad distribution. These results demonstrate an early neurophysiological marker of the brain's ability to implicitly encode complex statistical structure in the environment. Moreover, we manipulated concurrent cognitive load by having participants perform a visual working memory task while listening to these streams of sounds. We again observed greater prediction error responses in the narrower distribution under both low and high cognitive load. Furthermore, there was no reliable reduction in prediction error magnitude under high-relative to low-cognitive load. Our findings suggest that statistical learning is not a capacity limited process, and that it proceeds automatically even when cognitive resources are taxed by concurrent demands.
NASA Technical Reports Server (NTRS)
Gladden, Roy E.; Khanampornpan, Teerapat; Fisher, Forest W.
2010-01-01
Version 5.0 of the AutoGen software has been released. Previous versions, variously denoted Autogen and autogen, were reported in two articles: Automated Sequence Generation Process and Software (NPO-30746), Software Tech Briefs (Special Supplement to NASA Tech Briefs), September 2007, page 30, and Autogen Version 2.0 (NPO- 41501), NASA Tech Briefs, Vol. 31, No. 10 (October 2007), page 58. To recapitulate: AutoGen (now signifying automatic sequence generation ) automates the generation of sequences of commands in a standard format for uplink to spacecraft. AutoGen requires fewer workers than are needed for older manual sequence-generation processes, and greatly reduces sequence-generation times. The sequences are embodied in spacecraft activity sequence files (SASFs). AutoGen automates generation of SASFs by use of another previously reported program called APGEN. AutoGen encodes knowledge of different mission phases and of how the resultant commands must differ among the phases. AutoGen also provides means for customizing sequences through use of configuration files. The approach followed in developing AutoGen has involved encoding the behaviors of a system into a model and encoding algorithms for context-sensitive customizations of the modeled behaviors. This version of AutoGen addressed the MRO (Mars Reconnaissance Orbiter) primary science phase (PSP) mission phase. On previous Mars missions this phase has more commonly been referred to as mapping phase. This version addressed the unique aspects of sequencing orbital operations and specifically the mission specific adaptation of orbital operations for MRO. This version also includes capabilities for MRO s role in Mars relay support for UHF relay communications with the MER rovers and the Phoenix lander.
Teichmann, A Lina; Nieuwenstein, Mark R; Rich, Anina N
2017-08-01
For digit-color synaesthetes, digits elicit vivid experiences of color that are highly consistent for each individual. The conscious experience of synaesthesia is typically unidirectional: Digits evoke colors but not vice versa. There is an ongoing debate about whether synaesthetes have a memory advantage over non-synaesthetes. One key question in this debate is whether synaesthetes have a general superiority or whether any benefit is specific to a certain type of material. Here, we focus on immediate serial recall and ask digit-color synaesthetes and controls to memorize digit and color sequences. We developed a sensitive staircase method manipulating presentation duration to measure participants' serial recall of both overlearned and novel sequences. Our results show that synaesthetes can activate digit information to enhance serial memory for color sequences. When color sequences corresponded to ascending or descending digit sequences, synaesthetes encoded these sequences at a faster rate than their non-synaesthetes counterparts and faster than non-structured color sequences. However, encoding color sequences is approximately 200 ms slower than encoding digit sequences directly, independent of group and condition, which shows that the translation process is time consuming. These results suggest memory advantages in synaesthesia require a modified dual-coding account, in which secondary (synaesthetically linked) information is useful only if it is more memorable than the primary information to be recalled. Our study further shows that duration thresholds are a sensitive method to measure subtle differences in serial recall performance. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Peng, Jing; Peng, Futian; Zhu, Chunfu; Wei, Shaochong
2008-06-01
A putative isopentenyltransferase (IPT) encoding gene was identified from a pingyitiancha (Malus hupehensis Rehd.) expressed sequence tag database, and the full-length gene was cloned by RACE. Based on expression profile and sequence alignment, the nucleotide sequence of the clone, named MhIPT3, was most similar to AtIPT3, an IPT gene in Arabidopsis. The full-length cDNA contained a 963-bp open reading frame encoding a protein of 321 amino acids with a molecular mass of 37.3 kDa. Sequence analysis of genomic DNA revealed the absence of introns in the frame. Quantitative real-time PCR analysis demonstrated that the gene was expressed in roots, stems and leaves. Application of nitrate to roots of nitrogen-deprived seedlings strongly induced expression of MhIPT3 and was accompanied by the accumulation of cytokinins, whereas MhIPT3 expression was little affected by ammonium application to roots of nitrogen-deprived seedlings. Application of nitrate to leaves also up-regulated the expression of MhIPT3 and corresponded closely with the accumulation of isopentyladenine and isopentyladenosine in leaves.
A Survey of Protein Structures from Archaeal Viruses
Dellas, Nikki; Lawrence, C. Martin; Young, Mark J.
2013-01-01
Viruses that infect the third domain of life, Archaea, are a newly emerging field of interest. To date, all characterized archaeal viruses infect archaea that thrive in extreme conditions, such as halophilic, hyperthermophilic, and methanogenic environments. Viruses in general, especially those replicating in extreme environments, contain highly mosaic genomes with open reading frames (ORFs) whose sequences are often dissimilar to all other known ORFs. It has been estimated that approximately 85% of virally encoded ORFs do not match known sequences in the nucleic acid databases, and this percentage is even higher for archaeal viruses (typically 90%–100%). This statistic suggests that either virus genomes represent a larger segment of sequence space and/or that viruses encode genes of novel fold and/or function. Because the overall three-dimensional fold of a protein evolves more slowly than its sequence, efforts have been geared toward structural characterization of proteins encoded by archaeal viruses in order to gain insight into their potential functions. In this short review, we provide multiple examples where structural characterization of archaeal viral proteins has indeed provided significant functional and evolutionary insight. PMID:25371334
Cloning and characterization of the gene encoding IMP dehydrogenase from Arabidopsis thaliana.
Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E
1996-10-03
We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Arabidopsis thaliana (At). The transcription unit of the At gene spans approximately 1900 bp and specifies a protein of 503 amino acids with a calculated relative molecular mass (M(r)) of 54,190. The gene is comprised of a minimum of four introns and five exons with all donor and acceptor splice sequences conforming to previously proposed consensus sequences. The deduced IMPDH amino-acid sequence from At shows a remarkable similarity to other eukaryotic IMPDH sequences, with a 48% identity to human Type II enzyme. Allowing for conservative substitutions, the enzyme is 69% similar to human Type II IMPDH. The putative active-site sequence of At IMPDH conforms to the IMP dehydrogenase/guanosine monophosphate reductase motif and contains an essential active-site cysteine residue.
Molecular characterization of two prunus necrotic ringspot virus isolates from Canada.
Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming
2012-05-01
We determined the entire RNA1, 2 and 3 sequences of two prunus necrotic ringspot virus (PNRSV) isolates, Chr3 from cherry and Pch12 from peach, obtained from an orchard in the Niagara Fruit Belt, Canada. The RNA1, 2 and 3 of the two isolates share nucleotide sequence identities of 98.6%, 98.4% and 94.5%, respectively. Their RNA1- and 2-encoded amino acid sequences are about 98% identical to the corresponding sequences of a cherry isolate, CH57, the only other PNRSV isolate with complete RNA1 and 2 sequences available. Phylogenetic analysis of the coat protein and movement protein encoded by RNA3 of Pch12 and Chr3 and published PNRSV isolates indicated that Chr3 belongs to the PV96 group and Pch12 belongs to the PV32 group.
Root-Bernstein, Robert; Root-Bernstein, Meredith
2016-05-21
We have proposed that the ribosome may represent a missing link between prebiotic chemistries and the first cells. One of the predictions that follows from this hypothesis, which we test here, is that ribosomal RNA (rRNA) must have encoded the proteins necessary for ribosomal function. In other words, the rRNA also functioned pre-biotically as mRNA. Since these ribosome-binding proteins (rb-proteins) must bind to the rRNA, but the rRNA also functioned as mRNA, it follows that rb-proteins should bind to their own mRNA as well. This hypothesis can be contrasted to a "null" hypothesis in which rb-proteins evolved independently of the rRNA sequences and therefore there should be no necessary similarity between the rRNA to which rb-proteins bind and the mRNA that encodes the rb-protein. Five types of evidence reported here support the plausibility of the hypothesis that the mRNA encoding rb-proteins evolved from rRNA: (1) the ubiquity of rb-protein binding to their own mRNAs and autogenous control of their own translation; (2) the higher-than-expected incidence of Arginine-rich modules associated with RNA binding that occurs in rRNA-encoded proteins; (3) the fact that rRNA-binding regions of rb-proteins are homologous to their mRNA binding regions; (4) the higher than expected incidence of rb-protein sequences encoded in rRNA that are of a high degree of homology to their mRNA as compared with a random selection of other proteins; and (5) rRNA in modern prokaryotes and eukaryotes encodes functional proteins. None of these results can be explained by the null hypothesis that assumes independent evolution of rRNA and the mRNAs encoding ribosomal proteins. Also noteworthy is that very few proteins bind their own mRNAs that are not associated with ribosome function. Further tests of the hypothesis are suggested: (1) experimental testing of whether rRNA-encoded proteins bind to rRNA at their coding sites; (2) whether tRNA synthetases, which are also known to bind to their own mRNAs, are encoded by the tRNA sequences themselves; (3) and the prediction that archaeal and prokaryotic (DNA-based) genomes were built around rRNA "genes" so that rRNA-related sequences will be found to make up an unexpectedly high proportion of these genomes. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Zhao, Huanqiang; Hu, Fupin; Jin, Shu; Xu, Xiaogang; Zou, Yuhan; Ding, Baixing; He, Chunyan; Gong, Fang; Liu, Qingzhong
2016-01-01
Panton-Valentine leukocidin (PVL, encoded by lukSF-PV genes), a bi-component and pore-forming toxin, is carried by different staphylococcal bacteriophages. The prevalence of PVL in Staphylococcus aureus has been reported around the globe. However, the data on PVL-encoding phage types, lukSF-PV gene variation and chromosomal phage insertion sites for PVL-positive S. aureus are limited, especially in China. In order to obtain a more complete understanding of the molecular epidemiology of PVL-positive S. aureus, an integrated and modified PCR-based scheme was applied to detect the PVL-encoding phage types. Phage insertion locus and the lukSF-PV variant were determined by PCR and sequencing. Meanwhile, the genetic background was characterized by staphylococcal cassette chromosome mec (SCCmec) typing, staphylococcal protein A (spa) gene polymorphisms typing, pulsed-field gel electrophoresis (PFGE) typing, accessory gene regulator (agr) locus typing and multilocus sequence typing (MLST). Seventy eight (78/1175, 6.6%) isolates possessed the lukSF-PV genes and 59.0% (46/78) of PVL-positive strains belonged to CC59 lineage. Eight known different PVL-encoding phage types were detected, and Φ7247PVL/ΦST5967PVL (n = 13) and ΦPVL (n = 12) were the most prevalent among them. While 25 (25/78, 32.1%) isolates, belonging to ST30, and ST59 clones, were unable to be typed by the modified PCR-based scheme. Single nucleotide polymorphisms (SNPs) were identified at five locations in the lukSF-PV genes, two of which were non-synonymous. Maximum-likelihood tree analysis of attachment sites sequences detected six SNP profiles for attR and eight for attL, respectively. In conclusion, the PVL-positive S. aureus mainly harbored Φ7247PVL/ΦST5967PVL and ΦPVL in the regions studied. lukSF-PV gene sequences, PVL-encoding phages, and phage insertion locus generally varied with lineages. Moreover, PVL-positive clones that have emerged worldwide likely carry distinct phages.
Cloning and sequencing the genes encoding goldfish and carp ependymin.
Adams, D S; Shashoua, V E
1994-04-20
Ependymins (EPNs) are brain glycoproteins thought to function in optic nerve regeneration and long-term memory consolidation. To date, epn genes have been characterized in two orders of teleost fish. In this study, polymerase chain reactions (PCR) were used to amplify the complete 1.6-kb epn genes, gf-I and cc-I, from genomic DNA of Cypriniformes, goldfish and carp, respectively. Amplified bands were cloned and sequenced. Each gene consists of six exons and five introns. The exon portion of gf-I encodes a predicted 215-amino-acid (aa) protein previously characterized as GF-I, while cc-I encodes a predicted 215-aa protein 95% homologous to GF-I.
The complete mitochondrial genome sequence of Eimeria innocua (Eimeriidae, Coccidia, Apicomplexa).
Hafeez, Mian Abdul; Vrba, Vladimir; Barta, John Robert
2016-07-01
The complete mitochondrial genome of Eimeria innocua KR strain (Eimeriidae, Coccidia, Apicomplexa) was sequenced. This coccidium infects turkeys (Meleagris gallopavo), Bobwhite quails (Colinus virginianus), and Grey partridges (Perdix perdix). Genome organization and gene contents were comparable with other Eimeria spp. infecting galliform birds. The circular-mapping mt genome of E. innocua is 6247 bp in length with three protein-coding genes (cox1, cox3, and cytb), 19 gene fragments encoding large subunit (LSU) rRNA and 14 gene fragments encoding small subunit (SSU) rRNA. Like other Apicomplexa, no tRNA was encoded. The mitochondrial genome of E. innocua confirms its close phylogenetic affinities to Eimeria dispersa.
Burton, Rachel A.; Johnson, Philip E.; Beckles, Diane M.; Fincher, Geoffrey B.; Jenner, Helen L.; Naldrett, Mike J.; Denyer, Kay
2002-01-01
In most species, the synthesis of ADP-glucose (Glc) by the enzyme ADP-Glc pyrophosphorylase (AGPase) occurs entirely within the plastids in all tissues so far examined. However, in the endosperm of many, if not all grasses, a second form of AGPase synthesizes ADP-Glc outside the plastid, presumably in the cytosol. In this paper, we show that in the endosperm of wheat (Triticum aestivum), the cytosolic form accounts for most of the AGPase activity. Using a combination of molecular and biochemical approaches to identify the cytosolic and plastidial protein components of wheat endosperm AGPase we show that the large and small subunits of the cytosolic enzyme are encoded by genes previously thought to encode plastidial subunits, and that a gene, Ta.AGP.S.1, which encodes the small subunit of the cytosolic form of AGPase, also gives rise to a second transcript by the use of an alternate first exon. This second transcript encodes an AGPase small subunit with a transit peptide. However, we could not find a plastidial small subunit protein corresponding to this transcript. The protein sequence of the purified plastidial small subunit does not match precisely to that encoded by Ta.AGP.S.1 or to the predicted sequences of any other known gene from wheat or barley (Hordeum vulgare). Instead, the protein sequence is most similar to those of the plastidial small subunits from chickpea (Cicer arietinum) and maize (Zea mays) and rice (Oryza sativa) seeds. These data suggest that the gene encoding the major plastidial small subunit of AGPase in wheat endosperm has yet to be identified. PMID:12428011
2017-07-01
that IL6 is elevated under these in vitro conditions using an ELISA -based system (Fig 1). We are now investigating the potential functional role of...narrowed our focus on DNMT1 which encodes for a DNA methyltransferase that is key in regulating global epigenetic methylation Figure 1. ELISA
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.
Sakai, Ryo; Aerts, Jan
2014-01-01
The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Fine-pitched microgratings encoded by interference of UV femtosecond laser pulses.
Kamioka, Hayato; Miura, Taisuke; Kawamura, Ken-ichi; Hirano, Masahiro; Hosono, Hideo
2002-01-01
Fine-pitched microgratings are encoded on fused silica surfaces by a two-beam laser interference technique employing UV femtosecond pulses from the third harmonics of a Ti:sapphire laser. A pump and prove method utilizing a laser-induced optical Kerr effect or transient optical absorption change has been developed to achieve the time coincidence of the two pulses. Use of the UV pulses makes it possible to narrow the grating pitches to an opening as small as 290 nm, and the groove width of the gratings is of nanoscale size. The present technique provides a novel opportunity for the fabrication of periodic nanoscale structures in various materials.
Improved Protocols for Illumina Sequencing
Bronner, Iraad F.; Quail, Michael A.; Turner, Daniel J.; Swerdlow, Harold
2013-01-01
In this unit, we describe a set of improvements we have made to the standard Illumina protocols to make the sequencing process more reliable in a high-throughput environment, reduce amplification bias, narrow the distribution of insert sizes, and reliably obtain high yields of data. PMID:19582764
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-02-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-01-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Hasinoff, Samuel W; Kutulakos, Kiriakos N
2011-11-01
In this paper, we consider the problem of imaging a scene with a given depth of field at a given exposure level in the shortest amount of time possible. We show that by 1) collecting a sequence of photos and 2) controlling the aperture, focus, and exposure time of each photo individually, we can span the given depth of field in less total time than it takes to expose a single narrower-aperture photo. Using this as a starting point, we obtain two key results. First, for lenses with continuously variable apertures, we derive a closed-form solution for the globally optimal capture sequence, i.e., that collects light from the specified depth of field in the most efficient way possible. Second, for lenses with discrete apertures, we derive an integer programming problem whose solution is the optimal sequence. Our results are applicable to off-the-shelf cameras and typical photography conditions, and advocate the use of dense, wide-aperture photo sequences as a light-efficient alternative to single-shot, narrow-aperture photography.
NASA Astrophysics Data System (ADS)
Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.
1984-08-01
A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
Myamoto, D T; Pidde-Queiroz, G; Pedroso, A; Gonçalves-de-Andrade, R M; van den Berg, C W; Tambourgi, D V
2016-09-01
A transcriptome analysis of the venom glands of the spider Loxosceles laeta, performed by our group, in a previous study (Fernandes-Pedrosa et al., 2008), revealed a transcript with a sequence similar to the human complement component C3. Here we present the analysis of this transcript. cDNA fragments encoding the C3 homologue (Lox-C3) were amplified from total RNA isolated from the venom glands of L. laeta by RACE-PCR. Lox-C3 is a 5178 bps cDNA sequence encoding a 190kDa protein, with a domain configuration similar to human C3. Multiple alignments of C3-like proteins revealed two processing sites, suggesting that Lox-C3 is composed of three chains. Furthermore, the amino acids consensus sequences for the thioester was found, in addition to putative sequences responsible for FB binding. The phylogenetic analysis showed that Lox-C3 belongs to the same group as two C3 isoforms from the spider Hasarius adansoni (Family Salcitidae), showing 53% homology with these. This is the first characterization of a Loxosceles cDNA sequence encoding a human C3 homologue, and this finding, together with our previous finding of the expression of a FB-like molecule, suggests that this spider species also has a complement system. This work will help to improve our understanding of the innate immune system in these spiders and the ancestral structure of C3. Copyright © 2016 Elsevier GmbH. All rights reserved.
Qiu, Gui-Hua; Weng, Zi-Hua; Hu, Pei-Pei; Duan, Wen-Jun; Xie, Bao-Ping; Sun, Bin; Tang, Xiao-Yan; Chen, Jin-Xiang
2018-04-01
From a three-dimensional (3D) metal-organic framework (MOF) of {[Cu(Cmdcp)(phen)(H 2 O)] 2 ·9H 2 O} n (1, H 3 CmdcpBr = N-carboxymethyl-(3,5-dicarboxyl)pyridinium bromide, phen = phenanthroline), a sensitive and selective fluorescence sensor has been developed for the simultaneous detection of ebolavirus conserved RNA sequences and ebolavirus-encoded microRNA-like (miRNA-like) fragment. The results from molecular dynamics simulation confirmed that MOF 1 absorbs carboxyfluorescein (FAM)-tagged and 5(6)-carboxyrhodamine, triethylammonium salt (ROX)-tagged probe ss-DNA (probe DNA, P-DNA) by π … π stacking and hydrogen bonding, as well as additional electrostatic interactions to form a sensing platform of P-DNAs@1 with quenched FAM and ROX fluorescence. In the presence of targeted ebolavirus conserved RNA sequences or ebolavirus-encoded miRNA-like fragment, the fluorophore-labeled P-DNA hybridizes with the analyte to give a P-DNA@RNA duplex and released from MOF 1, triggering a fluorescence recovery. Simultaneous detection of two target RNAs has also been realized by single and synchronous fluorescence analysis. The formed sensing platform shows high sensitivity for ebolavirus conserved RNA sequences and ebolavirus-encoded miRNA-like fragment with detection limits at the picomolar level and high selectivity without cross-reaction between the two probes. MOF 1 thus shows the potential as an effective fluorescent sensing platform for the synchronous detection of two ebolavirus-related sequences, and offer improved diagnostic accuracy of Ebola virus disease. Copyright © 2017 Elsevier B.V. All rights reserved.
The impact of path crossing on visuo-spatial serial memory: encoding or rehearsal effect?
Parmentier, Fabrice B R; Andrés, Pilar
2006-11-01
The determinants of visuo-spatial serial memory have been the object of little research, despite early evidence that not all sequences are equally remembered. Recently, empirical evidence was reported indicating that the complexity of the path formed by the to-be-remembered locations impacted on recall performance, defined for example by the presence of crossings in the path formed by successive locations (Parmentier, Elford, & Maybery, 2005). In this study, we examined whether this effect reflects rehearsal or encoding processes. We examined the effect of a retention interval and spatial interference on the ordered recall of spatial sequences with and without path crossings. Path crossings decreased recall performance, as did a retention interval. In line with the encoding hypothesis, but in contrast with the rehearsal hypothesis, the effect of crossing was not affected by the retention interval nor by tapping. The possible nature of the impact of path crossing on encoding mechanisms is discussed.
ERPs and oscillations during encoding predict retrieval of digit memory in superior mnemonists.
Pan, Yafeng; Li, Xianchun; Chen, Xi; Ku, Yixuan; Dong, Yujie; Dou, Zheng; He, Lin; Hu, Yi; Li, Weidong; Zhou, Xiaolin
2017-10-01
Previous studies have consistently demonstrated that superior mnemonists (SMs) outperform normal individuals in domain-specific memory tasks. However, the neural correlates of memory-related processes remain unclear. In the current EEG study, SMs and control participants performed a digit memory task during which their brain activity was recorded. Chinese SMs used a digit-image mnemonic for encoding digits, in which they associated 2-digit groups with images immediately after the presentation of each even-position digit in sequences. Behaviorally, SMs' memory of digit sequences was better than the controls'. During encoding in the study phase, SMs showed an increased right central P2 (150-250ms post onset) and a larger right posterior high-alpha (10-14Hz, 500-1720ms) oscillation on digits at even-positions compared with digits at odd-positions. Both P2 and high-alpha oscillations in the study phase co-varied with performance in the recall phase, but only in SMs, indicating that neural dynamics during encoding could predict successful retrieval of digit memory in SMs. Our findings suggest that representation of a digit sequence in SMs using mnemonics may recruit both the early-stage attention allocation process and the sustained information preservation process. This study provides evidence for the role of dynamic and efficient neural encoding processes in mnemonists. Copyright © 2017. Published by Elsevier Inc.
Identifying metabolic enzymes with multiple types of association evidence
Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M
2006-01-01
Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
Khajanchi, Bijay K; Hasan, Nur A; Choi, Seon Young; Han, Jing; Zhao, Shaohua; Colwell, Rita R; Cerniglia, Carl E; Foley, Steven L
2017-08-02
The degree to which the chromosomal mediated iron acquisition system contributes to virulence of many bacterial pathogens is well defined. However, the functional roles of plasmid encoded iron acquisition systems, specifically Sit and aerobactin, have yet to be determined for Salmonella spp. In a recent study, Salmonella enterica strains isolated from different food sources were sequenced on the Illumina MiSeq platform and found to harbor the incompatibility group (Inc) FIB plasmid. In this study, we examined sequence diversity and the contribution of factors encoded on the IncFIB plasmid to the virulence of S. enterica. Whole genome sequences of seven S. enterica isolates were compared to genomes of serovars of S. enterica isolated from food, animal, and human sources. SeqSero analysis predicted that six strains were serovar Typhimurium and one was Heidelberg. Among the S. Typhimurium strains, single nucleotide polymorphism (SNP)-based phylogenetic analyses revealed that five of the isolates clustered as a single monophyletic S. Typhimurium subclade, while one of the other strains branched with S. Typhimurium from a bovine source. DNA sequence based phylogenetic diversity analyses showed that the IncFIB plasmid-encoded Sit and aerobactin iron acquisition systems are conserved among bacterial species including S. enterica. The IncFIB plasmid was transferred to an IncFIB plasmid deficient strain of S. enterica by conjugation. The transconjugant SE819::IncFIB persisted in human intestinal epithelial (Caco-2) cells at a higher rate than the recipient SE819. Genes of the Sit and aerobactin operons in the IncFIB plasmid were differentially expressed in iron-rich and iron-depleted growth media. Minimal sequence diversity was detected in the Sit and aerobactin operons in the IncFIB plasmids present among different bacterial species, including foodborne Salmonella strains. IncFIB plasmid encoded factors play a role during infection under low-iron conditions in host cells.
Quark enables semi-reference-based compression of RNA-seq data.
Sarkar, Hirak; Patro, Rob
2017-11-01
The past decade has seen an exponential increase in biological sequencing capacity, and there has been a simultaneous effort to help organize and archive some of the vast quantities of sequencing data that are being generated. Although these developments are tremendous from the perspective of maximizing the scientific utility of available data, they come with heavy costs. The storage and transmission of such vast amounts of sequencing data is expensive. We present Quark, a semi-reference-based compression tool designed for RNA-seq data. Quark makes use of a reference sequence when encoding reads, but produces a representation that can be decoded independently, without the need for a reference. This allows Quark to achieve markedly better compression rates than existing reference-free schemes, while still relieving the burden of assuming a specific, shared reference sequence between the encoder and decoder. We demonstrate that Quark achieves state-of-the-art compression rates, and that, typically, only a small fraction of the reference sequence must be encoded along with the reads to allow reference-free decompression. Quark is implemented in C ++11, and is available under a GPLv3 license at www.github.com/COMBINE-lab/quark. rob.patro@cs.stonybrook.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P
1994-07-08
The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.
Gene sequences present in Citrullus sp. having been lost during domestication of watermelon
USDA-ARS?s Scientific Manuscript database
A wide genetic diversity exists among Citrullus species, while watermelon cultivars (Citrullus lanatus var. lanatus) share a narrow genetic base as a result of many years of domestication and selection for desirable fruit qualities. The recent international watermelon genome sequencing project reve...
Mao, Guangzhi; Ma, Qiang; Wei, Hengling; Su, Junji; Wang, Hantao; Ma, Qifeng; Fan, Shuli; Song, Meizhen; Zhang, Xianlong; Yu, Shuxun
2018-02-01
The young leaves of virescent mutants are yellowish and gradually turn green as the plants reach maturity. Understanding the genetic basis of virescent mutants can aid research of the regulatory mechanisms underlying chloroplast development and chlorophyll biosynthesis, as well as contribute to the application of virescent traits in crop breeding. In this study, fine mapping was employed, and a recessive gene (v 1 ) from a virescent mutant of Upland cotton was narrowed to an 84.1-Kb region containing ten candidate genes. The GhChlI gene encodes the cotton Mg-chelatase I subunit (CHLI) and was identified as the candidate gene for the virescent mutation using gene annotation. BLAST analysis showed that the GhChlI gene has two copies, Gh_A10G0282 and Gh_D10G0283. Sequence analysis indicated that the coding region (CDS) of GhChlI is 1269 bp in length, with three predicted exons and one non-synonymous nucleotide mutation (G1082A) in the third exon of Gh_D10G0283, with an amino acid (AA) substitution of arginine (R) to lysine (K). GhChlI-silenced TM-1 plants exhibited a lower GhChlI expression level, a lower chlorophyll content, and the virescent phenotype. Analysis of upstream regulatory elements and expression levels of GhChlI showed that the expression quantity of GhChlI may be normal, and with the development of the true leaf, the increase in the Gh_A10G0282 dosage may partially make up for the deficiency of Gh_D10G0283 in the v 1 mutant. Phylogenetic analysis and sequence alignment revealed that the protein sequence encoded by the third exon of GhChlI is highly conserved across diverse plant species, in which AA substitutions among the completely conserved residues frequently result in changes in leaf color in various species. These results suggest that the mutation (G1082A) within the GhChlI gene may cause a functional defect of the GhCHLI subunit and thus the virescent phenotype in the v 1 mutant. The GhChlI mutation not only provides a tool for understanding the associations of CHLI protein function and the chlorophyll biosynthesis pathway but also has implications for cotton breeding.
Diop, Awa; Diop, Khoudia; Tomei, Enora; Raoult, Didier; Fenollar, Florence; Fournier, Pierre-Edouard
2018-03-01
We report here the draft genome sequence of Ezakiella peruensis strain M6.X2 T The draft genome is 1,672,788 bp long and harbors 1,589 predicted protein-encoding genes, including 26 antibiotic resistance genes with 1 gene encoding vancomycin resistance. The genome also exhibits 1 clustered regularly interspaced short palindromic repeat region and 333 genes acquired by horizontal gene transfer. Copyright © 2018 Diop et al.
Phenolic acid esterases, coding sequences and methods
Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.
2002-01-01
Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.
Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H
2005-01-01
Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Characterization of interleukin-8 receptors in non-human primates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alvarez, V.; Coto, E.; Gonzalez-Roces, S.
Interleukin-8 is a chemokine with a potent neutrophil chemoatractant activity. In humans, two different cDNAs encoding human IL8 receptors designated IL8RA and IL8RB have been cloned. IL8RA binds IL8, while IL8RB binds IL8 as well as other {alpha}-chemokines. Both human IL8Rs are encoded by two genes physically linked on chromosome 2. The IL8RA and IL8RB genes have open reading frames (ORF) lacking introns. By direct sequencing of the polymerase chain reaction products, we sequenced the IL8R genes of cell lines from four non-human primates: chimpanzee, gorilla, orangutan, and macaca. The IL8RB encodes an ORF in the four non-human primates, showingmore » 95%-99% similarity to the human IL8RB sequence. The IL8RA homologue in gorilla and chimpanzee consisted of two ORF 98%-99% identical to the human sequence. The macaca and orangutan IL8RA homologues are pseudogenes: a 2 base pair insertion generated a sequence with several stop codons. In addition, we describe the physical linkage of these genes in the four non-human primates and discuss the evolutionary implications of these findings. 25 refs., 5 figs., 3 tabs.« less
Nucleic acid molecules encoding isopentenyl monophosphate kinase, and methods of use
Croteau, Rodney B.; Lange, Bernd M.
2001-01-01
A cDNA encoding isopentenyl monophosphate kinase (IPK) from peppermint (Mentha x piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of isopentenyl monophosphate kinase (SEQ ID NO:2), from peppermint (Mentha x piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for isopentenyl monophosphate kinase, or for a base sequence sufficiently complementary to at least a portion of isopentenyl monophosphate kinase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding isopentenyl monophosphate kinase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant isopentenyl monophosphate kinase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant isopentenyl monophosphate kinase may be used to obtain expression or enhanced expression of isopentenyl monophosphate kinase in plants in order to enhance the production of isopentenyl monophosphate kinase, or isoprenoids derived therefrom, or may be otherwise employed for the regulation or expression of isopentenyl monophosphate kinase, or the production of its products.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nikolau, Basil J; Wurtele, Eve S; Oliver, David J
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method ofmore » producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1.alpha. subunit of pPDH, the E1.beta. subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyruvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1.alpha. pPDH, E1.beta. pPDH, E2 pPDH, mtPDH or ALDH.« less
Improving transmission efficiency of large sequence alignment/map (SAM) files.
Sakib, Muhammad Nazmus; Tang, Jijun; Zheng, W Jim; Huang, Chin-Tser
2011-01-01
Research in bioinformatics primarily involves collection and analysis of a large volume of genomic data. Naturally, it demands efficient storage and transfer of this huge amount of data. In recent years, some research has been done to find efficient compression algorithms to reduce the size of various sequencing data. One way to improve the transmission time of large files is to apply a maximum lossless compression on them. In this paper, we present SAMZIP, a specialized encoding scheme, for sequence alignment data in SAM (Sequence Alignment/Map) format, which improves the compression ratio of existing compression tools available. In order to achieve this, we exploit the prior knowledge of the file format and specifications. Our experimental results show that our encoding scheme improves compression ratio, thereby reducing overall transmission time significantly.
Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C
1996-06-15
We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.
Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P
1998-10-20
Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
Gentry-Weeks, C R; Hultsch, A L; Kelly, S M; Keith, J M; Curtiss, R
1992-01-01
Three gene libraries of Bordetella avium 197 DNA were prepared in Escherichia coli LE392 by using the cosmid vectors pCP13 and pYA2329, a derivative of pCP13 specifying spectinomycin resistance. The cosmid libraries were screened with convalescent-phase anti-B. avium turkey sera and polyclonal rabbit antisera against B. avium 197 outer membrane proteins. One E. coli recombinant clone produced a 56-kDa protein which reacted with convalescent-phase serum from a turkey infected with B. avium 197. In addition, five E. coli recombinant clones were identified which produced B. avium outer membrane proteins with molecular masses of 21, 38, 40, 43, and 48 kDa. At least one of these E. coli clones, which encoded the 21-kDa protein, reacted with both convalescent-phase turkey sera and antibody against B. avium 197 outer membrane proteins. The gene for the 21-kDa outer membrane protein was localized by Tn5seq1 mutagenesis, and the nucleotide sequence was determined by dideoxy sequencing. DNA sequence analysis of the 21-kDa protein revealed an open reading frame of 582 bases that resulted in a predicted protein of 194 amino acids. Comparison of the predicted amino acid sequence of the gene encoding the 21-kDa outer membrane protein with protein sequences in the National Biomedical Research Foundation protein sequence data base indicated significant homology to the OmpA proteins of Shigella dysenteriae, Enterobacter aerogenes, E. coli, and Salmonella typhimurium and to Neisseria gonorrhoeae outer membrane protein III, Haemophilus influenzae protein P6, and Pseudomonas aeruginosa porin protein F. The gene (ompA) encoding the B. avium 21-kDa protein hybridized with 4.1-kb DNA fragments from EcoRI-digested, chromosomal DNA of Bordetella pertussis and Bordetella bronchiseptica and with 6.0- and 3.2-kb DNA fragments from EcoRI-digested, chromosomal DNA of B. avium and B. avium-like DNA, respectively. A 6.75-kb DNA fragment encoding the B. avium 21-kDa protein was subcloned into the Asd+ vector pYA292, and the construct was introduced into the avirulent delta cya delta crp delta asd S. typhimurium chi 3987 for oral immunization of birds. The gene encoding the 21-kDa protein was expressed equivalently in B. avium 197, delta asd E. coli chi 6097, and S. typhimurium chi 3987 and was localized primarily in the cytoplasmic membrane and outer membrane. In preliminary studies on oral inoculation of turkey poults with S. typhimurium chi 3987 expressing the gene encoding the B. avium 21-kDa protein, it was determined that a single dose of the recombinant Salmonella vaccine failed to elicit serum antibodies against the 21-kDa protein and challenge with wild-type B. avium 197 resulted in colonization of the trachea and thymus with B. avium 197. Images PMID:1447140
Ventura, Marco; Turroni, Francesca; Zomer, Aldert; Foroni, Elena; Giubellini, Vanessa; Bottacini, Francesca; Canchaya, Carlos; Claesson, Marcus J.; He, Fei; Mantzourani, Maria; Mulas, Laura; Ferrarini, Alberto; Gao, Beile; Delledonne, Massimo; Henrissat, Bernard; Coutinho, Pedro; Oggioni, Marco; Gupta, Radhey S.; Zhang, Ziding; Beighton, David; Fitzgerald, Gerald F.; O'Toole, Paul W.; van Sinderen, Douwe
2009-01-01
Bifidobacteria, one of the relatively dominant components of the human intestinal microbiota, are considered one of the key groups of beneficial intestinal bacteria (probiotic bacteria). However, in addition to health-promoting taxa, the genus Bifidobacterium also includes Bifidobacterium dentium, an opportunistic cariogenic pathogen. The genetic basis for the ability of B. dentium to survive in the oral cavity and contribute to caries development is not understood. The genome of B. dentium Bd1, a strain isolated from dental caries, was sequenced to completion to uncover a single circular 2,636,368 base pair chromosome with 2,143 predicted open reading frames. Annotation of the genome sequence revealed multiple ways in which B. dentium has adapted to the oral environment through specialized nutrient acquisition, defences against antimicrobials, and gene products that increase fitness and competitiveness within the oral niche. B. dentium Bd1 was shown to metabolize a wide variety of carbohydrates, consistent with genome-based predictions, while colonization and persistence factors implicated in tissue adhesion, acid tolerance, and the metabolism of human saliva-derived compounds were also identified. Global transcriptome analysis demonstrated that many of the genes encoding these predicted traits are highly expressed under relevant physiological conditions. This is the first report to identify, through various genomic approaches, specific genetic adaptations of a Bifidobacterium taxon, Bifidobacterium dentium Bd1, to a lifestyle as a cariogenic microorganism in the oral cavity. In silico analysis and comparative genomic hybridization experiments clearly reveal a high level of genome conservation among various B. dentium strains. The data indicate that the genome of this opportunistic cariogen has evolved through a very limited number of horizontal gene acquisition events, highlighting the narrow boundaries that separate commensals from opportunistic pathogens. PMID:20041198
Outlier Responses Reflect Sensitivity to Statistical Structure in the Human Brain
Garrido, Marta I.
2013-01-01
We constantly look for patterns in the environment that allow us to learn its key regularities. These regularities are fundamental in enabling us to make predictions about what is likely to happen next. The physiological study of regularity extraction has focused primarily on repetitive sequence-based rules within the sensory environment, or on stimulus-outcome associations in the context of reward-based decision-making. Here we ask whether we implicitly encode non-sequential stochastic regularities, and detect violations therein. We addressed this question using a novel experimental design and both behavioural and magnetoencephalographic (MEG) metrics associated with responses to pure-tone sounds with frequencies sampled from a Gaussian distribution. We observed that sounds in the tail of the distribution evoked a larger response than those that fell at the centre. This response resembled the mismatch negativity (MMN) evoked by surprising or unlikely events in traditional oddball paradigms. Crucially, responses to physically identical outliers were greater when the distribution was narrower. These results show that humans implicitly keep track of the uncertainty induced by apparently random distributions of sensory events. Source reconstruction suggested that the statistical-context-sensitive responses arose in a temporo-parietal network, areas that have been associated with attention orientation to unexpected events. Our results demonstrate a very early neurophysiological marker of the brain's ability to implicitly encode complex statistical structure in the environment. We suggest that this sensitivity provides a computational basis for our ability to make perceptual inferences in noisy environments and to make decisions in an uncertain world. PMID:23555230
Nucleic acid compositions and the encoding proteins
Preston, III, James F.; Chow, Virginia; Nong, Guang; Rice, John D.; St. John, Franz J.
2014-09-02
The subject invention provides at least one nucleic acid sequence encoding an aldouronate-utilization regulon isolated from Paenibacillus sp. strain JDR-2, a bacterium which efficiently utilizes xylan and metabolizes aldouronates (methylglucuronoxylosaccharides). The subject invention also provides a means for providing a coordinately regulated process in which xylan depolymerization and product assimilation are coupled in Paenibacillus sp. strain JDR-2 to provide a favorable system for the conversion of lignocellulosic biomass to biobased products. Additionally, the nucleic acid sequences encoding the aldouronate-utilization regulon can be used to transform other bacteria to form organisms capable of producing a desired product (e.g., ethanol, 1-butanol, acetoin, 2,3-butanediol, 1,3-propanediol, succinate, lactate, acetate, malate or alanine) from lignocellulosic biomass.
Compensating for Language Deficits in Amnesia II: H.M.’s Spared versus Impaired Encoding Categories
MacKay, Donald G.; Johnson, Laura W.; Hadley, Chris
2013-01-01
Although amnesic H.M. typically could not recall where or when he met someone, he could recall their topics of conversation after long interference-filled delays, suggesting impaired encoding for some categories of novel events but not others. Similarly, H.M. successfully encoded into internal representations (sentence plans) some novel linguistic structures but not others in the present language production studies. For example, on the Test of Language Competence (TLC), H.M. produced uncorrected errors when encoding a wide range of novel linguistic structures, e.g., violating reliably more gender constraints than memory-normal controls when encoding referent-noun, pronoun-antecedent, and referent-pronoun anaphora, as when he erroneously and without correction used the gender-inappropriate pronoun “her” to refer to a man. In contrast, H.M. never violated corresponding referent-gender constraints for proper names, suggesting that his mechanisms for encoding proper name gender-agreement were intact. However, H.M. produced no more dysfluencies, off-topic comments, false starts, neologisms, or word and phonological sequencing errors than controls on the TLC. Present results suggest that: (a) frontal mechanisms for retrieving and sequencing word, phrase, and phonological categories are intact in H.M., unlike in category-specific aphasia; (b) encoding mechanisms in the hippocampal region are category-specific rather than item-specific, applying to, e.g., proper names rather than words; (c) H.M.’s category-specific mechanisms for encoding referents into words, phrases, and propositions are impaired, with the exception of referent gender, person, and number for encoding proper names; and (d) H.M. overuses his intact proper name encoding mechanisms to compensate for his impaired mechanisms for encoding other functionally equivalent linguistic information. PMID:24961410
Compensating for Language Deficits in Amnesia II: H.M.'s Spared versus Impaired Encoding Categories.
MacKay, Donald G; Johnson, Laura W; Hadley, Chris
2013-03-27
Although amnesic H.M. typically could not recall where or when he met someone, he could recall their topics of conversation after long interference-filled delays, suggesting impaired encoding for some categories of novel events but not others. Similarly, H.M. successfully encoded into internal representations (sentence plans) some novel linguistic structures but not others in the present language production studies. For example, on the Test of Language Competence (TLC), H.M. produced uncorrected errors when encoding a wide range of novel linguistic structures, e.g., violating reliably more gender constraints than memory-normal controls when encoding referent-noun, pronoun-antecedent, and referent-pronoun anaphora, as when he erroneously and without correction used the gender-inappropriate pronoun "her" to refer to a man. In contrast, H.M. never violated corresponding referent-gender constraints for proper names, suggesting that his mechanisms for encoding proper name gender-agreement were intact. However, H.M. produced no more dysfluencies, off-topic comments, false starts, neologisms, or word and phonological sequencing errors than controls on the TLC. Present results suggest that: (a) frontal mechanisms for retrieving and sequencing word, phrase, and phonological categories are intact in H.M., unlike in category-specific aphasia; (b) encoding mechanisms in the hippocampal region are category-specific rather than item-specific, applying to, e.g., proper names rather than words; (c) H.M.'s category-specific mechanisms for encoding referents into words, phrases, and propositions are impaired, with the exception of referent gender, person, and number for encoding proper names; and (d) H.M. overuses his intact proper name encoding mechanisms to compensate for his impaired mechanisms for encoding other functionally equivalent linguistic information.
2008-10-13
Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG , and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
NASA Astrophysics Data System (ADS)
Lin, Liangjie; Wei, Zhiliang; Yang, Jian; Lin, Yanqin; Chen, Zhong
2014-11-01
The spatial encoding technique can be used to accelerate the acquisition of multi-dimensional nuclear magnetic resonance spectra. However, with this technique, we have to make trade-offs between the spectral width and the resolution in the spatial encoding dimension (F1 dimension), resulting in the difficulty of covering large spectral widths while preserving acceptable resolutions for spatial encoding spectra. In this study, a selective shifting method is proposed to overcome the aforementioned drawback. This method is capable of narrowing spectral widths and improving spectral resolutions in spatial encoding dimensions by selectively shifting certain peaks in spectra of the ultrafast version of spin echo correlated spectroscopy (UFSECSY). This method can also serve as a powerful tool to obtain high-resolution correlated spectra in inhomogeneous magnetic fields for its resistance to any inhomogeneity in the F1 dimension inherited from UFSECSY. Theoretical derivations and experiments have been carried out to demonstrate performances of the proposed method. Results show that the spectral width in spatial encoding dimension can be reduced by shortening distances between cross peaks and axial peaks with the proposed method and the expected resolution improvement can be achieved. Finally, the shifting-absent spectrum can be recovered readily by post-processing.
Lin, Chentao; Thomashow, Michael F.
1992-01-01
Previous studies have indicated that changes in gene expression occur in Arabidopsis thaliana L. (Heyn) during cold acclimation and that certain of the cor (cold-regulated) genes encode polypeptides that share the unusual property of remaining soluble upon boiling in aqueous solution. Here, we identify a cDNA clone for a cold-regulated gene encoding one of the “boiling-stable” polypeptides, COR15. DNA sequence analysis indicated that the gene, designated cor15, encodes a 14.7-kilodalton hydrophilic polypeptide having an N-terminal amino acid sequence that closely resembles transit peptides that target proteins to the stromal compartment of chloroplasts. Immunological studies indicated that COR15 is processed in vivo and that the mature polypeptide, COR 15m, is present in the soluble fraction of chloroplasts. Possible functions of COR 15m are discussed. ImagesFigure 1Figure 4Figure 5Figure 6Figure 7 PMID:16668917
The bean. alpha. -amylase inhibitor is encoded by a lectin gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moreno, J.; Altabella, T.; Chrispeels, M.J.
The common bean, Phaseolus vulgaris, contains an inhibitor of insect and mammalian {alpha}-amylases that does not inhibit plant {alpha}-amylase. This inhibitor functions as an anti-feedant or seed-defense protein. We purified this inhibitor by affinity chromatography and found that it consists of a series of glycoforms of two polypeptides (Mr 14,000-19,000). Partial amino acid sequencing was carried out, and the sequences obtained are identical with portions of the derived amino acid sequence of a lectin-like gene. This lectin gene encodes a polypeptide of MW 28,000, and the primary in vitro translation product identified by antibodies to the {alpha}-amylase inhibitor has themore » same size. Co- and posttranslational processing of this polypeptide results in glycosylated polypeptides of 14-19 kDa. Our interpretation of these results is that the bean lectins constitute a gene family that encodes diverse plant defense proteins, including phytohemagglutinin, arcelin and {alpha}-amylase inhibitor.« less
Kim, Sunhwa; Matsuo, Ichiro; Ajisaka, Katsumi; Nakajima, Harushi; Kitamoto, Katsuhiko
2002-10-01
We isolated a beta-N-acetylglucosaminidase encoding gene and its cDNA from the filamentous fungus Aspergillus nidulans, and designated it nagA. The nagA gene contained no intron and encoded a polypeptide of 603 amino acids with a putative 19-amino acid signal sequence. The deduced amino acid sequence was very similar to the sequence of Candida albicans Hex1 and Trichoderma harzianum Nag1. Yeast cells containing the nagA cDNA under the control of the GAL1 promoter expressed beta-N-acetylglucosaminidase activity. The chromosomal nagA gene of A. nidulans was disrupted by replacement with the argB marker gene. The disruptant strains expressed low levels of beta-N-acetylglucosaminidase activity and showed poor growth on a medium containing chitobiose as a carbon source. Aspergillus oryzae strain carrying the nagA gene under the control of the improved glaA promoter produced large amounts of beta-N-acetylglucosaminidase in a wheat bran solid culture.
Construct Validity of Fluency and Implications for the Factorial Structure of Memory
ERIC Educational Resources Information Center
Jewsbury, Paul A.; Bowden, Stephen C.
2017-01-01
Fluency is an important construct in clinical assessment and in cognitive taxonomies. In the Cattell-Horn-Carroll (CHC) model, Fluency is represented by several narrow factors that form a subset of the long-term memory encoding and retrieval (Glr) broad factor. The CHC broad classification of Fluency was evaluated in five data sets, and the CHC…
Trichoderma .beta.-glucosidase
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-01-03
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
Vandenbol, M; Jauniaux, J C; Grenson, M
1989-11-15
The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.
Wyrwa, Katarzyna; Książkiewicz, Michał; Szczepaniak, Anna; Susek, Karolina; Podkowiński, Jan; Naganowska, Barbara
2016-09-01
Narrow-leafed lupin (Lupinus angustifolius L.) has recently been considered a reference genome for the Lupinus genus. In the present work, genetic and cytogenetic maps of L. angustifolius were supplemented with 30 new molecular markers representing lupin genome regions, harboring genes involved in nitrogen fixation during the symbiotic interaction of legumes and soil bacteria (Rhizobiaceae). Our studies resulted in the precise localization of bacterial artificial chromosomes (BACs) carrying sequence variants for early nodulin 40, nodulin 26, nodulin 45, aspartate aminotransferase P2, asparagine synthetase, cytosolic glutamine synthetase, and phosphoenolpyruvate carboxylase. Together with previously mapped chromosomes, the integrated L. angustifolius map encompasses 73 chromosome markers, including 5S ribosomal DNA (rDNA) and 45S rDNA, and anchors 20 L. angustifolius linkage groups to corresponding chromosomes. Chromosomal identification using BAC fluorescence in situ hybridization identified two BAC clones as narrow-leafed lupin centromere-specific markers, which served as templates for preliminary studies of centromere composition within the genus. Bioinformatic analysis of these two BACs revealed that centromeric/pericentromeric regions of narrow-leafed lupin chromosomes consisted of simple sequence repeats ordered into tandem repeats containing the trinucleotide and pentanucleotide simple sequence repeats AGG and GATAC, structured into long arrays. Moreover, cross-genus microsynteny analysis revealed syntenic patterns of 31 single-locus BAC clones among several legume species. The gene and chromosome level findings provide evidence of ancient duplication events that must have occurred very early in the divergence of papilionoid lineages. This work provides a strong foundation for future comparative mapping among legumes and may facilitate understanding of mechanisms involved in shaping legume chromosomes.
Outlier Resistant Predictive Source Encoding for a Gaussian Stationary Nominal Source.
1987-09-18
breakdown point and influence function . The proposed sequence of predictive encoders attains strictly positive breakdown point and uniformly bounded... influence function , at the expense of increased mean difference-squared distortion and differential entropy, at the Gaussian nominal source.
Complete genome sequence of keunjorong mosaic virus, a potyvirus from Cynanchum wilfordii.
Nam, Moon; Lee, Joo-Hee; Choi, Hong Soo; Lim, Hyoun-Sub; Moon, Jae Sun; Lee, Su-Heon
2013-08-01
We have determined the complete genome sequence of keunjorong mosaic virus (KjMV). The KjMV genome is composed of 9,611 nucleotides, excluding the 3'-terminal poly(A) tail. It contains two open reading frames (ORFs), with the large one encoding a polyprotein of 3,070 amino acids and the small overlapping ORF encoding a PIPO protein of 81 amino acids. The KjMV genome shared the highest nucleotide sequence identity (57.5 %) with pepper mottle virus and freesia mosaic virus, two members of the genus Potyvirus. Based on the phylogenetic relatedness to known potyviruses, KjMV appears to be a member of a new species in the genus Potyvirus.
Krunic, Aleksandar L; Stone, Kristina L; Simpson, Michael A; McGrath, John A
2013-01-01
Acral peeling skin syndrome (APSS) is a clinically and genetically heterogeneous disorder. We used whole-exome sequencing to identify the molecular basis of APSS in a consanguineous Jordanian-American pedigree. We identified a homozygous nonsense mutation (p.Lys22X) in the CSTA gene, encoding cystatin A, that was confirmed using Sanger sequencing. Cystatin A is a protease inhibitor found in the cornified cell envelope, and loss-of-function mutations have previously been reported in two cases of exfoliative ichthyosis. Our study expands the molecular pathology of APSS and demonstrates the value of next-generation sequencing in the genetic characterization of inherited skin diseases. © 2013 Wiley Periodicals, Inc.
Metal resistant plants and phytoremediation of environmental contamination
Meagher, Richard B.; Li, Yujing; Dhankher, Om P.
2010-04-20
The present disclosure provides a method of producing transgenic plants which are resistant to at least one metal ion by transforming the plant with a recombinant DNA comprising a nucleic acid encoding a bacterial arsenic reductase under the control of a plant expressible promoter, and a nucleic acid encoding a nucleotide sequence encoding a phytochelatin biosynthetic enzyme under the control of a plant expressible promoter. The invention also relates a method of phytoremediation of a contaminated site by growing in the site a transgenic plant expressing a nucleic acid encoding a bacterial arsenate reductase and a nucleic acid encoding a phytochelatin biosynthetic enzyme.
Tappaz, M; Bitoun, M; Reymond, I; Sergeant, A
1999-09-01
Cysteine sulfinate decarboxylase (CSD) is considered as the rate-limiting enzyme in the biosynthesis of taurine, a possible osmoregulator in brain. Through cloning and sequencing of RT-PCR and RACE-PCR products of rat brain mRNAs, a 2,396-bp cDNA sequence was obtained encoding a protein of 493 amino acids (calculated molecular mass, 55.2 kDa). The corresponding fusion protein showed a substrate specificity similar to that of the endogenous enzyme. The sequence of the encoded protein is identical to that encoded by liver CSD cDNA. Among other characterized amino acid decarboxylases, CSD shows the highest homology (54%) with either isoform of glutamic acid decarboxylase (GAD65 and GAD67). A single mRNA band, approximately 2.5 kb, was detected by northern blot in RNA extracts of brain, liver, and kidney. However, brain and liver CSD cDNA sequences differed in the 5' untranslated region. This indicates two forms of CSD mRNA. Analysis of PCR-amplified products of genomic DNA suggests that the brain form results from the use of a 3' alternative internal splicing site within an exon specifically found in liver CSD mRNA. Through selective RT-PCR the brain form was detected in brain only, whereas the liver form was found in liver and kidney. These results indicate a tissue-specific regulation of CSD genomic expression.
Heatley, Susan L.; Pietra, Gabriella; Lin, Jie; Widjaja, Jacqueline M. L.; Harpur, Christopher M.; Lester, Sue; Rossjohn, Jamie; Szer, Jeff; Schwarer, Anthony; Bradstock, Kenneth; Bardy, Peter G.; Mingari, Maria Cristina; Moretta, Lorenzo; Sullivan, Lucy C.; Brooks, Andrew G.
2013-01-01
Natural killer (NK) cell recognition of the nonclassical human leukocyte antigen (HLA) molecule HLA-E is dependent on the presentation of a nonamer peptide derived from the leader sequence of other HLA molecules to CD94-NKG2 receptors. However, human cytomegalovirus can manipulate this central innate interaction through the provision of a “mimic” of the HLA-encoded peptide derived from the immunomodulatory glycoprotein UL40. Here, we analyzed UL40 sequences isolated from 32 hematopoietic stem cell transplantation recipients experiencing cytomegalovirus reactivation. The UL40 protein showed a “polymorphic hot spot” within the region that encodes the HLA leader sequence mimic. Although all sequences that were identical to those encoded within HLA-I genes permitted the interaction between HLA-E and CD94-NKG2 receptors, other UL40 polymorphisms reduced the affinity of the interaction between HLA-E and CD94-NKG2 receptors. Furthermore, functional studies using NK cell clones expressing either the inhibitory receptor CD94-NKG2A or the activating receptor CD94-NKG2C identified UL40-encoded peptides that were capable of inhibiting target cell lysis via interaction with CD94-NKG2A, yet had little capacity to activate NK cells through CD94-NKG2C. The data suggest that UL40 polymorphisms may aid evasion of NK cell immunosurveillance by modulating the affinity of the interaction with CD94-NKG2 receptors. PMID:23335510
Heatley, Susan L; Pietra, Gabriella; Lin, Jie; Widjaja, Jacqueline M L; Harpur, Christopher M; Lester, Sue; Rossjohn, Jamie; Szer, Jeff; Schwarer, Anthony; Bradstock, Kenneth; Bardy, Peter G; Mingari, Maria Cristina; Moretta, Lorenzo; Sullivan, Lucy C; Brooks, Andrew G
2013-03-22
Natural killer (NK) cell recognition of the nonclassical human leukocyte antigen (HLA) molecule HLA-E is dependent on the presentation of a nonamer peptide derived from the leader sequence of other HLA molecules to CD94-NKG2 receptors. However, human cytomegalovirus can manipulate this central innate interaction through the provision of a "mimic" of the HLA-encoded peptide derived from the immunomodulatory glycoprotein UL40. Here, we analyzed UL40 sequences isolated from 32 hematopoietic stem cell transplantation recipients experiencing cytomegalovirus reactivation. The UL40 protein showed a "polymorphic hot spot" within the region that encodes the HLA leader sequence mimic. Although all sequences that were identical to those encoded within HLA-I genes permitted the interaction between HLA-E and CD94-NKG2 receptors, other UL40 polymorphisms reduced the affinity of the interaction between HLA-E and CD94-NKG2 receptors. Furthermore, functional studies using NK cell clones expressing either the inhibitory receptor CD94-NKG2A or the activating receptor CD94-NKG2C identified UL40-encoded peptides that were capable of inhibiting target cell lysis via interaction with CD94-NKG2A, yet had little capacity to activate NK cells through CD94-NKG2C. The data suggest that UL40 polymorphisms may aid evasion of NK cell immunosurveillance by modulating the affinity of the interaction with CD94-NKG2 receptors.
DOE Office of Scientific and Technical Information (OSTI.GOV)
John C. Meeks
2001-12-31
Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9more » Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.« less
Zymomonas pentose-sugar fermenting strains and uses thereof
Zhang, Min [Lakewood, CO; Chou, Yat-Chen [Golden, CO; Howe, William [Golden, CO; Eddy, Christine [Golden, CO; Evans, Kent [Littleton, CO; Mohagheghi, Ali [Northglenn, CO
2007-05-29
Disclosed in the present invention is a Zymomonas integrant and derivatives of these integrants that posses the ability to ferment pentose into ethanol. The genetic sequences encoding for the pentose-fermenting enzymes are integrated into the Zymomonas in a two-integration event of homologous recombination and transposition. Each operon includes more than one pentose-reducing enzyme encoding sequence. The integrant in some embodiments includes enzyme sequences encoding xylose isomerase, xylulokinase, transketolase and transketolase. The Zymomonas integrants are highly stable, and retain activity for producing the pentose-fermenting enzyme for between 80 to 160 generations. The integrants are also resistant to acetate inhibition, as the integrants demonstrate efficient ethanol production even in the presence of 8 up to 16 grams acetate per liter media. These stably integrated sequences provide a unique Zymomonas that may then be used for the efficient conversion of pentose sugars (xylose, arabinose) to ethanol. Method of using the Zymomonas integrants and derivatives thereof in production of ethanol from cellulosic feedstock is also disclosed. The invention also provides a method for preparing a Zymomonas integrant as part of the present invention. The host Zymomonas strain found particularly useful in the creation of these compositions and methods is Zymomonas mobilis 31821.
Gleave, A P; Taylor, R K; Morris, B A; Greenwood, D R
1995-09-15
Janthinobacterium lividum secretes a major 56-kDa chitinase and a minor 69-kDa chitinase. A chitinase gene was defined on a 3-kb fragment of clone pRKT10, by virtue of fluorescent colonies in the presence of 4-methylumbelliferyl-beta-D-N,N',N"-chitotrioside. Nucleotide sequencing revealed an 1998-bp open reading frame with the potential to encode a 69,716-Da protein with amino acid sequences similar to those in other chitinases, suggesting it encodes the minor chitinase (Chi69). Chitinase activity of Escherichia coli (pRKT10) lysates was detected mainly in the periplasmic fraction and immunoblotting detected a 70-kDa protein in this fraction. Chi69 has an N-terminal secretory leader peptide preceding two probable chitin-binding domains and a catalytic domain. These functional domains are separated by linker regions of proline-threonine repeats. Amino acid sequencing of cyanogen bromide cleavage-derived peptides from the major 56-kDa chitinase suggested that Chi69 may be a precursor of Chi56. In addition, an N-terminally truncated version of Chi69 retained chitinase activity as expected if in vivo processing of Chi69 generates Chi56.
Haseloff, J; Goelet, P; Zimmern, D; Ahlquist, P; Dasgupta, R; Kaesberg, P
1984-01-01
The plant viruses alfalfa mosaic virus (AMV) and brome mosaic virus (BMV) each divide their genetic information among three RNAs while tobacco mosaic virus (TMV) contains a single genomic RNA. Amino acid sequence comparisons suggest that the single proteins encoded by AMV RNA 1 and BMV RNA 1 and by AMV RNA 2 and BMV RNA 2 are related to the NH2-terminal two-thirds and the COOH-terminal one-third, respectively, of the largest protein encoded by TMV. Separating these two domains in the TMV RNA sequence is an amber termination codon, whose partial suppression allows translation of the downstream domain. Many of the residues that the TMV read-through domain and the segmented plant viruses have in common are also conserved in a read-through domain found in the nonstructural polyprotein of the animal alphaviruses Sindbis and Middelburg. We suggest that, despite substantial differences in gene organization and expression, all of these viruses use related proteins for common functions in RNA replication. Reassortment of functional modules of coding and regulatory sequence from preexisting viral or cellular sources, perhaps via RNA recombination, may be an important mechanism in RNA virus evolution. PMID:6611550
A Spiking Neural Network System for Robust Sequence Recognition.
Yu, Qiang; Yan, Rui; Tang, Huajin; Tan, Kay Chen; Li, Haizhou
2016-03-01
This paper proposes a biologically plausible network architecture with spiking neurons for sequence recognition. This architecture is a unified and consistent system with functional parts of sensory encoding, learning, and decoding. This is the first systematic model attempting to reveal the neural mechanisms considering both the upstream and the downstream neurons together. The whole system is a consistent temporal framework, where the precise timing of spikes is employed for information processing and cognitive computing. Experimental results show that the system is competent to perform the sequence recognition, being robust to noisy sensory inputs and invariant to changes in the intervals between input stimuli within a certain range. The classification ability of the temporal learning rule used in the system is investigated through two benchmark tasks that outperform the other two widely used learning rules for classification. The results also demonstrate the computational power of spiking neurons over perceptrons for processing spatiotemporal patterns. In summary, the system provides a general way with spiking neurons to encode external stimuli into spatiotemporal spikes, to learn the encoded spike patterns with temporal learning rules, and to decode the sequence order with downstream neurons. The system structure would be beneficial for developments in both hardware and software.
Multi-Temporal Land Cover Classification with Sequential Recurrent Encoders
NASA Astrophysics Data System (ADS)
Rußwurm, Marc; Körner, Marco
2018-03-01
Earth observation (EO) sensors deliver data with daily or weekly temporal resolution. Most land use and land cover (LULC) approaches, however, expect cloud-free and mono-temporal observations. The increasing temporal capabilities of today's sensors enables the use of temporal, along with spectral and spatial features. Domains, such as speech recognition or neural machine translation, work with inherently temporal data and, today, achieve impressive results using sequential encoder-decoder structures. Inspired by these sequence-to-sequence models, we adapt an encoder structure with convolutional recurrent layers in order to approximate a phenological model for vegetation classes based on a temporal sequence of Sentinel 2 (S2) images. In our experiments, we visualize internal activations over a sequence of cloudy and non-cloudy images and find several recurrent cells, which reduce the input activity for cloudy observations. Hence, we assume that our network has learned cloud-filtering schemes solely from input data, which could alleviate the need for tedious cloud-filtering as a preprocessing step for many EO approaches. Moreover, using unfiltered temporal series of top-of-atmosphere (TOA) reflectance data, we achieved in our experiments state-of-the-art classification accuracies on a large number of crop classes with minimal preprocessing compared to other classification approaches.
Hodel, Jérôme; Silvera, Jonathan; Bekaert, Olivier; Rahmouni, Alain; Bastuji-Garin, Sylvie; Vignaud, Alexandre; Petit, Eric; Durning, Bruno; Decq, Philippe
2011-02-01
To assess the three-dimensional turbo spin echo with variable flip-angle distribution magnetic resonance sequence (SPACE: Sampling Perfection with Application optimised Contrast using different flip-angle Evolution) for the imaging of intracranial cerebrospinal fluid (CSF) spaces. We prospectively investigated 18 healthy volunteers and 25 patients, 20 with communicating hydrocephalus (CH), five with non-communicating hydrocephalus (NCH), using the SPACE sequence at 1.5T. Volume rendering views of both intracranial and ventricular CSF were obtained for all patients and volunteers. The subarachnoid CSF distribution was qualitatively evaluated on volume rendering views using a four-point scale. The CSF volumes within total, ventricular and subarachnoid spaces were calculated as well as the ratio between ventricular and subarachnoid CSF volumes. Three different patterns of subarachnoid CSF distribution were observed. In healthy volunteers we found narrowed CSF spaces within the occipital aera. A diffuse narrowing of the subarachnoid CSF spaces was observed in patients with NCH whereas patients with CH exhibited narrowed CSF spaces within the high midline convexity. The ratios between ventricular and subarachnoid CSF volumes were significantly different among the volunteers, patients with CH and patients with NCH. The assessment of CSF spaces volume and distribution may help to characterise hydrocephalus.
Park, Dan M.; Akhtar, Md. Sohail; Ansari, Aseem Z.; Landick, Robert; Kiley, Patricia J.
2013-01-01
Despite the importance of maintaining redox homeostasis for cellular viability, how cells control redox balance globally is poorly understood. Here we provide new mechanistic insight into how the balance between reduced and oxidized electron carriers is regulated at the level of gene expression by mapping the regulon of the response regulator ArcA from Escherichia coli, which responds to the quinone/quinol redox couple via its membrane-bound sensor kinase, ArcB. Our genome-wide analysis reveals that ArcA reprograms metabolism under anaerobic conditions such that carbon oxidation pathways that recycle redox carriers via respiration are transcriptionally repressed by ArcA. We propose that this strategy favors use of catabolic pathways that recycle redox carriers via fermentation akin to lactate production in mammalian cells. Unexpectedly, bioinformatic analysis of the sequences bound by ArcA in ChIP-seq revealed that most ArcA binding sites contain additional direct repeat elements beyond the two required for binding an ArcA dimer. DNase I footprinting assays suggest that non-canonical arrangements of cis-regulatory modules dictate both the length and concentration-sensitive occupancy of DNA sites. We propose that this plasticity in ArcA binding site architecture provides both an efficient means of encoding binding sites for ArcA, σ70-RNAP and perhaps other transcription factors within the same narrow sequence space and an effective mechanism for global control of carbon metabolism to maintain redox homeostasis. PMID:24146625
Cloning and Expression of the Benzoate Dioxygenase Genes from Rhodococcus sp. Strain 19070
Haddad, Sandra; Eby, D. Matthew; Neidle, Ellen L.
2001-01-01
The bopXYZ genes from the gram-positive bacterium Rhodococcus sp. strain 19070 encode a broad-substrate-specific benzoate dioxygenase. Expression of the BopXY terminal oxygenase enabled Escherichia coli to convert benzoate or anthranilate (2-aminobenzoate) to a nonaromatic cis-diol or catechol, respectively. This expression system also rapidly transformed m-toluate (3-methylbenzoate) to an unidentified product. In contrast, 2-chlorobenzoate was not a good substrate. The BopXYZ dioxygenase was homologous to the chromosomally encoded benzoate dioxygenase (BenABC) and the plasmid-encoded toluate dioxygenase (XylXYZ) of gram-negative acinetobacters and pseudomonads. Pulsed-field gel electrophoresis failed to identify any plasmid in Rhodococcus sp. strain 19070. Catechol 1,2- and 2,3-dioxygenase activity indicated that strain 19070 possesses both meta- and ortho-cleavage degradative pathways, which are associated in pseudomonads with the xyl and ben genes, respectively. Open reading frames downstream of bopXYZ, designated bopL and bopK, resembled genes encoding cis-diol dehydrogenases and benzoate transporters, respectively. The bop genes were in the same order as the chromosomal ben genes of P. putida PRS2000. The deduced sequences of BopXY were 50 to 60% identical to the corresponding proteins of benzoate and toluate dioxygenases. The reductase components of these latter dioxygenases, BenC and XylZ, are 201 residues shorter than the deduced BopZ sequence. As predicted from the sequence, expression of BopZ in E. coli yielded an approximately 60-kDa protein whose presence corresponded to increased cytochrome c reductase activity. While the N-terminal region of BopZ was approximately 50% identical in sequence to the entire BenC or XylZ reductases, the C terminus was unlike other known protein sequences. PMID:11375157
ERIC Educational Resources Information Center
Carvalho, Paulo F.; Goldstone, Robert L.
2017-01-01
The sequence of study influences how we learn. Previous research has identified different sequences as potentially beneficial for learning in different contexts and with different materials. Here we investigate the mechanisms involved in inductive category learning that give rise to these sequencing effects. Across 3 experiments we show evidence…
Organization and sequence of four flagellin-encoding genes of Edwardsiella icataluri
USDA-ARS?s Scientific Manuscript database
Edwardsiella ictaluri, the cause of enteric septicemia in channel catfish (Ictalurus punctatus), is motile by means of peritrichous flagella. We determined the complete flagellin gene sequences and their organization in E. ictaluri by sequencing genomic segments selected from a lambda-ZAP phage gen...
.beta.-glucosidase 5 (BGL5) compositions
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-06-01
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
CoSMoS: Conserved Sequence Motif Search in the proteome
Liu, Xiao I; Korde, Neeraj; Jakob, Ursula; Leichert, Lars I
2006-01-01
Background With the ever-increasing number of gene sequences in the public databases, generating and analyzing multiple sequence alignments becomes increasingly time consuming. Nevertheless it is a task performed on a regular basis by researchers in many labs. Results We have now created a database called CoSMoS to find the occurrences and at the same time evaluate the significance of sequence motifs and amino acids encoded in the whole genome of the model organism Escherichia coli K12. We provide a precomputed set of multiple sequence alignments for each individual E. coli protein with all of its homologues in the RefSeq database. The alignments themselves, information about the occurrence of sequence motifs together with information on the conservation of each of the more than 1.3 million amino acids encoded in the E. coli genome can be accessed via the web interface of CoSMoS. Conclusion CoSMoS is a valuable tool to identify highly conserved sequence motifs, to find regions suitable for mutational studies in functional analyses and to predict important structural features in E. coli proteins. PMID:16433915
NASA Astrophysics Data System (ADS)
Cao, H.; Kalashnikov, M.; Osvay, K.; Khodakovskiy, N.; Nagymihaly, R. S.; Chvykov, V.
2018-04-01
A combination of a polarization-encoded (PE) and a conventional multi-pass amplifier was studied to overcome gain narrowing in the Ti:sapphire active medium. The seed spectrum was pre-shaped and blue-shifted during PE amplification and was then further broadened in a conventional, saturated multi-pass amplifier, resulting in an overall increase of the amplified bandwidth. Using this technique, seed pulses of 44 nm were amplified and simultaneously spectrally broadened to 57 nm without the use of passive spectral corrections. The amplified pulse after the PE amplifier was recompressed to 19 fs. The supported simulations confirm all aspects of experimental operation.
Beta.-glucosidase coding sequences and protein from orpinomyces PC-2
Li, Xin-Liang; Ljungdahl, Lars G.; Chen, Huizhong; Ximenes, Eduardo A.
2001-02-06
Provided is a novel .beta.-glucosidase from Orpinomyces sp. PC2, nucleotide sequences encoding the mature protein and the precursor protein, and methods for recombinant production of this .beta.-glucosidase.
Hughes, Robert W; Vachon, François; Jones, Dylan M
2005-07-01
A novel attentional capture effect is reported in which visual-verbal serial recall was disrupted if a single deviation in the interstimulus interval occurred within otherwise regularly presented task-irrelevant spoken items. The degree of disruption was the same whether the temporal deviant was embedded in a sequence made up of a repeating item or a sequence of changing items. Moreover, the effect was evident during the presentation of the to-be-remembered sequence but not during rehearsal just prior to recall, suggesting that the encoding of sequences is particularly susceptible. The results suggest that attentional capture is due to a violation of an algorithm rather than an aggregate-based neural model and further undermine an attentional capture-based account of the classical changing-state irrelevant sound effect. ((c) 2005 APA, all rights reserved).
In silico analysis of subtilisin from Glaciozyma antarctica PI12
NASA Astrophysics Data System (ADS)
Mustafha, Siti Mardhiah; Murad, Abdul Munir Abdul; Mahadi, Nor Muhammad; Kamaruddin, Shazilah; Bakar, Farah Diba Abu
2015-09-01
Subtilisin constitute as a major player in industrial enzymes that has a wide range of application especially in the detergent industry. In this study, a cDNA encoding for subtilisin (GaSUBT) was extracted from the psychrophilic yeast, Glaciozyma antarctica PI12, PCR amplified and sequenced. Various bioinformatics tools were used to characterize the GaSUBT. GaSUBT contains 1587 bp nucleotides encoding for 529 amino acids. The predicted molecular weight of the deduced protein is 55.34 kDa with an isoelectric point of 6.25. GaSUBT was predicted to possess a signal peptide and pro-peptide consisting of a peptidase inhibitor I9 sequence. From the sequence alignment analysis of deduced amino acids with other subtilisins in the NCBI database showed that the sequences surrounding the catalytic triad that forms the catalytic domain are well conserved.
NASA Technical Reports Server (NTRS)
Ingels, F. M.; Schoggen, W. O.
1982-01-01
The design to achieve the required bit transition density for the Space Shuttle high rate multiplexes (HRM) data stream of the Space Laboratory Vehicle is reviewed. It contained a recommended circuit approach, specified the pseudo random (PN) sequence to be used and detailed the properties of the sequence. Calculations showing the probability of failing to meet the required transition density were included. A computer simulation of the data stream and PN cover sequence was provided. All worst case situations were simulated and the bit transition density exceeded that required. The Preliminary Design Review and the critical Design Review are documented. The Cover Sequence Generator (CSG) Encoder/Decoder design was constructed and demonstrated. The demonstrations were successful. All HRM and HRDM units incorporate the CSG encoder or CSG decoder as appropriate.
Ehrmann, M A; Vogel, R E
2001-11-01
An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out of phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orf A and orf B that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but except of their highly AT rich character not any sequence specificity was identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. Resulting hybridization patterns were shown to differentiate between organisms at strain level rather than a probe targeted against the 16S rDNA. With a PCR based approach IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides L. farciminis L. sakei and L. salivarius, L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.
Sequence analysis and expression of the M1 and M2 matrix protein genes of hirame rhabdovirus (HIRRV)
Nishizawa, T.; Kurath, G.; Winton, J.R.
1997-01-01
We have cloned and sequenced a 2318 nucleotide region of the genomic RNA of hirame rhabdovirus (HIRRV), an important viral pathogen of Japanese flounder Paralichthys olivaceus. This region comprises approximately two-thirds of the 3' end of the nucleocapsid protein (N) gene and the complete matrix protein (M1 and M2) genes with the associated intergenic regions. The partial N gene sequence was 812 nucleotides in length with an open reading frame (ORF) that encoded the carboxyl-terminal 250 amino acids of the N protein. The M1 and M2 genes were 771 and 700 nucleotides in length, respectively, with ORFs encoding proteins of 227 and 193 amino acids. The M1 gene sequence contained an additional small ORF that could encode a highly basic, arginine-rich protein of 25 amino acids. Comparisons of the N, M1, and M2 gene sequences of HIRRV with the corresponding sequences of the fish rhabdoviruses, infectious hematopoietic necrosis virus (IHNV) or viral hemorrhagic septicemia virus (VHSV) indicated that HIRRV was more closely related to IHNV than to VHSV, but was clearly distinct from either. The putative consensus gene termination sequence for IHNV and VHSV, AGAYAG(A)(7), was present in the N-M1, M1-M2, and M2-G intergenic regions of HIRRV as were the putative transcription initiation sequences YGGCAC and AACA. An Escherichia coli expression system was used to produce recombinant proteins from the M1 and M2 genes of HIRRV. These were the same size as the authentic M1 and M2 proteins and reacted with anti-HIRRV rabbit serum in western blots. These reagents can be used for further study of the fish immune response and to test novel control methods.
Dumonceaux, Tim J.; Green, Margaret; Hammond, Christine; Perez, Edel; Olivier, Chrystel
2014-01-01
Phytoplasmas (‘Candidatus Phytoplasma’ spp.) are insect-vectored bacteria that infect a wide variety of plants, including many agriculturally important species. The infections can cause devastating yield losses by inducing morphological changes that dramatically alter inflorescence development. Detection of phytoplasma infection typically utilizes sequences located within the 16S–23S rRNA-encoding locus, and these sequences are necessary for strain identification by currently accepted standards for phytoplasma classification. However, these methods can generate PCR products >1400 bp that are less divergent in sequence than protein-encoding genes, limiting strain resolution in certain cases. We describe a method for accessing the chaperonin-60 (cpn60) gene sequence from a diverse array of ‘Ca.Phytoplasma’ spp. Two degenerate primer sets were designed based on the known sequence diversity of cpn60 from ‘Ca.Phytoplasma’ spp. and used to amplify cpn60 gene fragments from various reference samples and infected plant tissues. Forty three cpn60 sequences were thereby determined. The cpn60 PCR-gel electrophoresis method was highly sensitive compared to 16S-23S-targeted PCR-gel electrophoresis. The topology of a phylogenetic tree generated using cpn60 sequences was congruent with that reported for 16S rRNA-encoding genes. The cpn60 sequences were used to design a hybridization array using oligonucleotide-coupled fluorescent microspheres, providing rapid diagnosis and typing of phytoplasma infections. The oligonucleotide-coupled fluorescent microsphere assay revealed samples that were infected simultaneously with two subtypes of phytoplasma. These tools were applied to show that two host plants, Brassica napus and Camelina sativa, displayed different phytoplasma infection patterns. PMID:25551224
Ensemble codes involving hippocampal neurons are at risk during delayed performance tests.
Hampson, R E; Deadwyler, S A
1996-11-26
Multielectrode recording techniques were used to record ensemble activity from 10 to 16 simultaneously active CA1 and CA3 neurons in the rat hippocampus during performance of a spatial delayed-nonmatch-to-sample task. Extracted sources of variance were used to assess the nature of two different types of errors that accounted for 30% of total trials. The two types of errors included ensemble "miscodes" of sample phase information and errors associated with delay-dependent corruption or disappearance of sample information at the time of the nonmatch response. Statistical assessment of trial sequences and associated "strength" of hippocampal ensemble codes revealed that miscoded error trials always followed delay-dependent error trials in which encoding was "weak," indicating that the two types of errors were "linked." It was determined that the occurrence of weakly encoded, delay-dependent error trials initiated an ensemble encoding "strategy" that increased the chances of being correct on the next trial and avoided the occurrence of further delay-dependent errors. Unexpectedly, the strategy involved "strongly" encoding response position information from the prior (delay-dependent) error trial and carrying it forward to the sample phase of the next trial. This produced a miscode type error on trials in which the "carried over" information obliterated encoding of the sample phase response on the next trial. Application of this strategy, irrespective of outcome, was sufficient to reorient the animal to the proper between trial sequence of response contingencies (nonmatch-to-sample) and boost performance to 73% correct on subsequent trials. The capacity for ensemble analyses of strength of information encoding combined with statistical assessment of trial sequences therefore provided unique insight into the "dynamic" nature of the role hippocampus plays in delay type memory tasks.
Farajzadeh-Sheikh, Ahmad; Jolodar, Abbas; Ghaemmaghami, Shamsedin
2013-01-01
Scorpion venom glands produce some antimicrobial peptides (AMP) that can rapidly kill a broad range of microbes and have additional activities that impact on the quality and effectiveness of innate responses and inflammation. In this study, we reported the identification of a cDNA sequence encoding cysteine-free antimicrobial peptides isolated from venomous glands of this species. Total RNA was extracted from the Iranian mesobuthus eupeus venom glands, and cDNA was synthesized by using the modified oligo (dT). The cDNA was used as the template for applying Semi-nested RT- PCR technique. PCR Products were used for direct nucleotide sequencing and the results were compared with Gen Bank database. A 213 BP cDNA fragment encoding the entire coding region of an antimicrobial toxin from the Iranian scorpion M. Eupeus venom glands were isolated. The full-length sequence of the coding region was 210 BP contained an open reading frame of 70 amino with a predicted molecular mass of 7970.48 Da and theoretical Pi of 9.10. The open reading frame consists of 210 BP encoding a precursor of 70 amino acid residues, including a signal peptide of 23 residues a propertied of 7 residues, and a mature peptide of 34 residues with no disulfide bridge. The peptide has detectable sequence identity to the Lesser Asian mesobuthus eupeus MeVAMP-2 (98%), MeVAMP-9 (60%) and several previously described AMPs from other scorpion venoms including mesobuthus martensii (94%) and buthus occitanus Israelis (82%). The secondary structure of the peptide mainly consisted of α-helical structure which was generally conserved by previously reported scorpion counterparts. The phylogenetic analysis showed that the Iranian MeAMP-like toxin was similar but not identical with that of venom antimicrobial peptides from lesser Asian scorpion mesobuthus eupeus.
Liu, X; Gorovsky, M A
1996-01-01
A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889
cDNA encoding a polypeptide including a hevein sequence
Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil
1993-02-16
A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a pu GOVERNMENT RIGHTS This application was funded under Department of Energy Contract DE-AC02-76ER01338. The U.S. Government has certain rights under this application and any patent issuing thereon.
Maneu, V; Cervera, A M; Martinez, J P; Gozalbo, D
1997-06-15
We have cloned and sequenced a Candida albicans gene (SSB1) encoding a potential member of the heat-shock protein seventy (hsp70) family. The protein encoded by this gene contains 613 amino acids and shows a high degree (85%) of sequence identity to the ssb subfamily (ssb1 and ssb2) of the Saccharomyces cerevisiae hsp70 family. The transcribed mRNA (2.1 kb) is present in similar amounts both in yeast and germ tube cells of C. albicans.
Channel recovery from recent large floods in north coastal California: rates and processes
Thomas E. Lisle
1981-01-01
Abstract - Stream channel recovery from recent large floods in northern California involves a sequence of processes, including degradation of streambeds to stable levels, narrowing of channels, and accentuation of riffle-pool sequences. Most channels have degraded but remain widened because hillslope encroachment and establishment of riparian groves conducive to...
USDA-ARS?s Scientific Manuscript database
The application of genotyping by sequencing (GBS) approaches, combined with data imputation methodologies, is narrowing the genetic knowledge gap between major and understudied, minor crops. GBS is an excellent tool to characterize the genomic structure of recently domesticated (~200 years) and unde...
Materials and methods for the alteration of enzyme and acetyl CoA levels in plants
Nikolau, Basil J.; Wurtele, Eve S.; Oliver, David J.; Behal, Robert; Schnable, Patrick S.; Ke, Jinshan; Johnson, Jerry L.; Allred, Carolyn C.; Fatland, Beth; Lutziger, Isabelle; Wen, Tsui-Jung
2005-09-13
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method of producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1.alpha. subunit of pPDH, the E1.beta. subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyruvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1.alpha. pPDH, E1.beta. pPDH, E2 pPDH, mtPDH or ALDH.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nikolau, Basil J.; Wurtele, Eve S.; Oliver, David J.
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method ofmore » producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1.sub..alpha. subunit of pPDH, the E1.sub..beta. subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyurvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1.sub..alpha. pPDH, E1.sub..beta. pPDH, E2 pPDH, mtPDH or ALDH.« less
Nagarajan, G; Swami, Shelesh Kumar; Dahiya, Shyam Singh; Narnaware, S D; Mehta, S C; Singh, P K; Singh, Raghvendar; Tuteja, F C; Patil, N V
2015-06-01
The present study describes the PCR amplification of GM-CSF-inhibitory factor (GIF) and Uracil DNA glycosylase (UDG) encoding genes of pseudocowpoxvirus (PCPV) from the Indian Dromedaries (Camelus dromedarius) infected with contagious ecthyma using the primers based on the corresponding gene sequences of human PCPV and reindeer PCPV, respectively. The length of GIF gene of PCPV obtained from camel is 795 bp and due to the addition of one cytosine residue at position 374 and one adenine residue at position 516, the open reading frame (ORF) got altered, resulting in the production of truncated polypeptide. The ORF of UDG encoding gene of camel PCPV is 696 bp encoding a polypeptide of 26.0 kDa. Comparison of amino acid sequence homologies of GIF and UDG of camel PCPV revealed that the camel PCPV is closer to ORFV and PCPV (reference stains of both human and reindeer), respectively. Copyright © 2015 Elsevier Ltd. All rights reserved.
Tan, Wui Siew; Lewis, Christina L; Horelik, Nicholas E; Pregibon, Daniel C; Doyle, Patrick S; Yi, Hyunmin
2008-11-04
We demonstrate hierarchical assembly of tobacco mosaic virus (TMV)-based nanotemplates with hydrogel-based encoded microparticles via nucleic acid hybridization. TMV nanotemplates possess a highly defined structure and a genetically engineered high density thiol functionality. The encoded microparticles are produced in a high throughput microfluidic device via stop-flow lithography (SFL) and consist of spatially discrete regions containing encoded identity information, an internal control, and capture DNAs. For the hybridization-based assembly, partially disassembled TMVs were programmed with linker DNAs that contain sequences complementary to both the virus 5' end and a selected capture DNA. Fluorescence microscopy, atomic force microscopy (AFM), and confocal microscopy results clearly indicate facile assembly of TMV nanotemplates onto microparticles with high spatial and sequence selectivity. We anticipate that our hybridization-based assembly strategy could be employed to create multifunctional viral-synthetic hybrid materials in a rapid and high-throughput manner. Additionally, we believe that these viral-synthetic hybrid microparticles may find broad applications in high capacity, multiplexed target sensing.
Pasion, S G; Hines, J C; Aebersold, R; Ray, D S
1992-01-01
A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.
Horizontal gene transfer of chromosomal Type II toxin-antitoxin systems of Escherichia coli.
Ramisetty, Bhaskar Chandra Mohan; Santhosh, Ramachandran Sarojini
2016-02-01
Type II toxin-antitoxin systems (TAs) are small autoregulated bicistronic operons that encode a toxin protein with the potential to inhibit metabolic processes and an antitoxin protein to neutralize the toxin. Most of the bacterial genomes encode multiple TAs. However, the diversity and accumulation of TAs on bacterial genomes and its physiological implications are highly debated. Here we provide evidence that Escherichia coli chromosomal TAs (encoding RNase toxins) are 'acquired' DNA likely originated from heterologous DNA and are the smallest known autoregulated operons with the potential for horizontal propagation. Sequence analyses revealed that integration of TAs into the bacterial genome is unique and contributes to variations in the coding and/or regulatory regions of flanking host genome sequences. Plasmids and genomes encoding identical TAs of natural isolates are mutually exclusive. Chromosomal TAs might play significant roles in the evolution and ecology of bacteria by contributing to host genome variation and by moderation of plasmid maintenance. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Identification of a New Class of Antifungals Targeting the Synthesis of Fungal Sphingolipids.
Mor, Visesato; Rella, Antonella; Farnoud, Amir M; Singh, Ashutosh; Munshi, Mansa; Bryan, Arielle; Naseem, Shamoon; Konopka, James B; Ojima, Iwao; Bullesbach, Erika; Ashbaugh, Alan; Linke, Michael J; Cushion, Melanie; Collins, Margaret; Ananthula, Hari Krishna; Sallans, Larry; Desai, Pankaj B; Wiederhold, Nathan P; Fothergill, Annette W; Kirkpatrick, William R; Patterson, Thomas; Wong, Lai Hong; Sinha, Sunita; Giaever, Guri; Nislow, Corey; Flaherty, Patrick; Pan, Xuewen; Cesar, Gabriele Vargas; de Melo Tavares, Patricia; Frases, Susana; Miranda, Kildare; Rodrigues, Marcio L; Luberto, Chiara; Nimrichter, Leonardo; Del Poeta, Maurizio
2015-06-23
Recent estimates suggest that >300 million people are afflicted by serious fungal infections worldwide. Current antifungal drugs are static and toxic and/or have a narrow spectrum of activity. Thus, there is an urgent need for the development of new antifungal drugs. The fungal sphingolipid glucosylceramide (GlcCer) is critical in promoting virulence of a variety of human-pathogenic fungi. In this study, we screened a synthetic drug library for compounds that target the synthesis of fungal, but not mammalian, GlcCer and found two compounds [N'-(3-bromo-4-hydroxybenzylidene)-2-methylbenzohydrazide (BHBM) and its derivative, 3-bromo-N'-(3-bromo-4-hydroxybenzylidene) benzohydrazide (D0)] that were highly effective in vitro and in vivo against several pathogenic fungi. BHBM and D0 were well tolerated in animals and are highly synergistic or additive to current antifungals. BHBM and D0 significantly affected fungal cell morphology and resulted in the accumulation of intracellular vesicles. Deep-sequencing analysis of drug-resistant mutants revealed that four protein products, encoded by genes APL5, COS111, MKK1, and STE2, which are involved in vesicular transport and cell cycle progression, are targeted by BHBM. Fungal infections are a significant cause of morbidity and mortality worldwide. Current antifungal drugs suffer from various drawbacks, including toxicity, drug resistance, and narrow spectrum of activity. In this study, we have demonstrated that pharmaceutical inhibition of fungal glucosylceramide presents a new opportunity to treat cryptococcosis and various other fungal infections. In addition to being effective against pathogenic fungi, the compounds discovered in this study were well tolerated by animals and additive to current antifungals. These findings suggest that these drugs might pave the way for the development of a new class of antifungals. Copyright © 2015 Mor et al.
Melendrez, Melanie C.; Lange, Rachel K.; Cohan, Frederick M.; Ward, David M.
2011-01-01
Previous research has shown that sequences of 16S rRNA genes and 16S-23S rRNA internal transcribed spacer regions may not have enough genetic resolution to define all ecologically distinct Synechococcus populations (ecotypes) inhabiting alkaline, siliceous hot spring microbial mats. To achieve higher molecular resolution, we studied sequence variation in three protein-encoding loci sampled by PCR from 60°C and 65°C sites in the Mushroom Spring mat (Yellowstone National Park, WY). Sequences were analyzed using the ecotype simulation (ES) and AdaptML algorithms to identify putative ecotypes. Between 4 and 14 times more putative ecotypes were predicted from variation in protein-encoding locus sequences than from variation in 16S rRNA and 16S-23S rRNA internal transcribed spacer sequences. The number of putative ecotypes predicted depended on the number of sequences sampled and the molecular resolution of the locus. Chao estimates of diversity indicated that few rare ecotypes were missed. Many ecotypes hypothesized by sequence analyses were different in their habitat specificities, suggesting different adaptations to temperature or other parameters that vary along the flow channel. PMID:21169433
Geranyl diphosphate synthase large subunit, and methods of use
Croteau, Rodney B.; Burke, Charles C.; Wildung, Mark R.
2001-10-16
A cDNA encoding geranyl diphosphate synthase large subunit from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase large subunit). In another aspect, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase large subunit. In yet another aspect, the present invention provides isolated, recombinant geranyl diphosphate synthase protein comprising an isolated, recombinant geranyl diphosphate synthase large subunit protein and an isolated, recombinant geranyl diphosphate synthase small subunit protein. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase.
Molecular cloning and nucleotide sequence of CYP6BF1 from the diamondback moth, Plutella xylostella
Li, Hongshan; Dai, Huaguo; Wei, Hui
2005-01-01
A novel cDNA clong encoding a cytochrome P450 was screened from the insecticide-susceptible strain of Plutella xylostella (L.) (Lepidoptera:Yponomeutidae). The nucleotide sequence of the clone, designated CYP6BF1, was determined. This is the first full-length sequence of the CYP6 family from Plutella xylostella (L.). The cDNA is 1661bp in length and contains an open reading frame from base pairs 26 to 1570, encoding a protein of 514 amino acid residues. It is similar to the other insect P450s in gene family 6, including CYP6AE1 from Depressaria pastinacella, (46%). The GenBank accession number is AY971374. PMID:17119627
Mutations in a novel gene with transmembrane domains underlie Usher syndrome type 3.
Joensuu, T; Hämäläinen, R; Yuan, B; Johnson, C; Tegelberg, S; Gasparini, P; Zelante, L; Pirvola, U; Pakarinen, L; Lehesjoki, A E; de la Chapelle, A; Sankila, E M
2001-10-01
Usher syndrome type 3 (USH3) is an autosomal recessive disorder characterized by progressive hearing loss, severe retinal degeneration, and variably present vestibular dysfunction, assigned to 3q21-q25. Here, we report on the positional cloning of the USH3 gene. By haplotype and linkage-disequilibrium analyses in Finnish carriers of a putative founder mutation, the critical region was narrowed to 250 kb, of which we sequenced, assembled, and annotated 207 kb. Two novel genes-NOPAR and UCRP-and one previously identified gene-H963-were excluded as USH3, on the basis of mutational analysis. USH3, the candidate gene that we identified, encodes a 120-amino-acid protein. Fifty-two Finnish patients were homozygous for a termination mutation, Y100X; patients in two Finnish families were compound heterozygous for Y100X and for a missense mutation, M44K, whereas patients in an Italian family were homozygous for a 3-bp deletion leading to an amino acid deletion and substitution. USH3 has two predicted transmembrane domains, and it shows no homology to known genes. As revealed by northern blotting and reverse-transcriptase PCR, it is expressed in many tissues, including the retina.
Identification of a New Class of Antifungals Targeting the Synthesis of Fungal Sphingolipids
Mor, Visesato; Rella, Antonella; Farnoud, Amir M.; Singh, Ashutosh; Munshi, Mansa; Bryan, Arielle; Naseem, Shamoon; Konopka, James B.; Ojima, Iwao; Bullesbach, Erika; Ashbaugh, Alan; Linke, Michael J.; Cushion, Melanie; Collins, Margaret; Ananthula, Hari Krishna; Sallans, Larry; Desai, Pankaj B.; Wiederhold, Nathan P.; Fothergill, Annette W.; Kirkpatrick, William R.; Patterson, Thomas; Wong, Lai Hong; Sinha, Sunita; Giaever, Guri; Nislow, Corey; Flaherty, Patrick; Pan, Xuewen; Cesar, Gabriele Vargas; de Melo Tavares, Patricia; Frases, Susana; Miranda, Kildare; Rodrigues, Marcio L.; Luberto, Chiara; Nimrichter, Leonardo
2015-01-01
ABSTRACT Recent estimates suggest that >300 million people are afflicted by serious fungal infections worldwide. Current antifungal drugs are static and toxic and/or have a narrow spectrum of activity. Thus, there is an urgent need for the development of new antifungal drugs. The fungal sphingolipid glucosylceramide (GlcCer) is critical in promoting virulence of a variety of human-pathogenic fungi. In this study, we screened a synthetic drug library for compounds that target the synthesis of fungal, but not mammalian, GlcCer and found two compounds [N′-(3-bromo-4-hydroxybenzylidene)-2-methylbenzohydrazide (BHBM) and its derivative, 3-bromo-N′-(3-bromo-4-hydroxybenzylidene) benzohydrazide (D0)] that were highly effective in vitro and in vivo against several pathogenic fungi. BHBM and D0 were well tolerated in animals and are highly synergistic or additive to current antifungals. BHBM and D0 significantly affected fungal cell morphology and resulted in the accumulation of intracellular vesicles. Deep-sequencing analysis of drug-resistant mutants revealed that four protein products, encoded by genes APL5, COS111, MKK1, and STE2, which are involved in vesicular transport and cell cycle progression, are targeted by BHBM. PMID:26106079
Plasmids foster diversification and adaptation of bacterial populations in soil.
Heuer, Holger; Smalla, Kornelia
2012-11-01
It is increasingly being recognized that the transfer of conjugative plasmids across species boundaries plays a vital role in the adaptability of bacterial populations in soil. There are specific driving forces and constraints of plasmid transfer within bacterial communities in soils. Plasmid-mediated genetic variation allows bacteria to respond rapidly with adaptive responses to challenges such as irregular antibiotic or metal concentrations, or opportunities such as the utilization of xenobiotic compounds. Cultivation-independent detection and capture of plasmids from soil bacteria, and complete sequencing have provided new insights into the role and ecology of plasmids. Broad host range plasmids such as those belonging to IncP-1 transfer a wealth of accessory functions which are carried by similar plasmid backbones. Plasmids with a narrower host range can be more specifically adapted to particular species and often transfer genes which complement chromosomally encoded functions. Plasmids seem to be an ancient and successful strategy to ensure survival of a soil population in spatial and temporal heterogeneous conditions with various environmental stresses or opportunities that occur irregularly or as a novel challenge in soil. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
ChIPWig: a random access-enabling lossless and lossy compression method for ChIP-seq data.
Ravanmehr, Vida; Kim, Minji; Wang, Zhiying; Milenkovic, Olgica
2018-03-15
Chromatin immunoprecipitation sequencing (ChIP-seq) experiments are inexpensive and time-efficient, and result in massive datasets that introduce significant storage and maintenance challenges. To address the resulting Big Data problems, we propose a lossless and lossy compression framework specifically designed for ChIP-seq Wig data, termed ChIPWig. ChIPWig enables random access, summary statistics lookups and it is based on the asymptotic theory of optimal point density design for nonuniform quantizers. We tested the ChIPWig compressor on 10 ChIP-seq datasets generated by the ENCODE consortium. On average, lossless ChIPWig reduced the file sizes to merely 6% of the original, and offered 6-fold compression rate improvement compared to bigWig. The lossy feature further reduced file sizes 2-fold compared to the lossless mode, with little or no effects on peak calling and motif discovery using specialized NarrowPeaks methods. The compression and decompression speed rates are of the order of 0.2 sec/MB using general purpose computers. The source code and binaries are freely available for download at https://github.com/vidarmehr/ChIPWig-v2, implemented in C ++. milenkov@illinois.edu. Supplementary data are available at Bioinformatics online.
2012-01-01
Background Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199
Peptide library synthesis on spectrally encoded beads for multiplexed protein/peptide bioassays
NASA Astrophysics Data System (ADS)
Nguyen, Huy Q.; Brower, Kara; Harink, Björn; Baxter, Brian; Thorn, Kurt S.; Fordyce, Polly M.
2017-02-01
Protein-peptide interactions are essential for cellular responses. Despite their importance, these interactions remain largely uncharacterized due to experimental challenges associated with their measurement. Current techniques (e.g. surface plasmon resonance, fluorescence polarization, and isothermal calorimetry) either require large amounts of purified material or direct fluorescent labeling, making high-throughput measurements laborious and expensive. In this report, we present a new technology for measuring antibody-peptide interactions in vitro that leverages spectrally encoded beads for biological multiplexing. Specific peptide sequences are synthesized directly on encoded beads with a 1:1 relationship between peptide sequence and embedded code, thereby making it possible to track many peptide sequences throughout the course of an experiment within a single small volume. We demonstrate the potential of these bead-bound peptide libraries by: (1) creating a set of 46 peptides composed of 3 commonly used epitope tags (myc, FLAG, and HA) and single amino-acid scanning mutants; (2) incubating with a mixture of fluorescently-labeled antimyc, anti-FLAG, and anti-HA antibodies; and (3) imaging these bead-bound libraries to simultaneously identify the embedded spectral code (and thus the sequence of the associated peptide) and quantify the amount of each antibody bound. To our knowledge, these data demonstrate the first customized peptide library synthesized directly on spectrally encoded beads. While the implementation of the technology provided here is a high-affinity antibody/protein interaction with a small code space, we believe this platform can be broadly applicable to any range of peptide screening applications, with the capability to multiplex into libraries of hundreds to thousands of peptides in a single assay.
Christie, Andrew E.; Fontanilla, Tiana M.; Nesbit, Katherine T.; Lenz, Petra H.
2013-01-01
Diel vertical migration and seasonal diapause are critical life history events for the copepod Calanus finmarchicus. While much is known about these behaviors phenomenologically, little is known about their molecular underpinnings. Recent studies in insects suggest that some circadian genes/proteins also contribute to the establishment of seasonal diapause. Thus, it is possible that in Calanus these distinct timing regimes share some genetic components. To begin to address this possibility, we used the well-established Drosophila melanogaster circadian system as a reference for mining clock transcripts from a 200,000+ sequence Calanus transcriptome; the proteins encoded by the identified transcripts were also deduced and characterized. Sequences encoding homologs of the Drosophila core clock proteins CLOCK, CYCLE, PERIOD and TIMELESS were identified, as was one encoding CRYPTOCHROME 2, a core clock protein in ancestral insect systems, but absent in Drosophila. Calanus transcripts encoding proteins known to modulate the Drosophila core clock were also identified and characterized, e.g. CLOCKWORK ORANGE, DOUBLETIME, SHAGGY and VRILLE. Alignment and structural analyses of the deduced Calanus proteins with their Drosophila counterparts revealed extensive sequence conservation, particularly in functional domains. Interestingly, reverse BLAST analyses of these sequences against all arthropod proteins typically revealed non-Drosophila isoforms to be most similar to the Calanus queries. This, in combination with the presence of both CRYPTOCHROME 1 (a clock input pathway protein) and CRYPTOCHROME 2 in Calanus, suggests that the organization of the copepod circadian system is an ancestral one, more similar to that of insects like Danaus plexippus than to that of Drosophila. PMID:23727418
Dynamic Encoding of Speech Sequence Probability in Human Temporal Cortex
Leonard, Matthew K.; Bouchard, Kristofer E.; Tang, Claire
2015-01-01
Sensory processing involves identification of stimulus features, but also integration with the surrounding sensory and cognitive context. Previous work in animals and humans has shown fine-scale sensitivity to context in the form of learned knowledge about the statistics of the sensory environment, including relative probabilities of discrete units in a stream of sequential auditory input. These statistics are a defining characteristic of one of the most important sequential signals humans encounter: speech. For speech, extensive exposure to a language tunes listeners to the statistics of sound sequences. To address how speech sequence statistics are neurally encoded, we used high-resolution direct cortical recordings from human lateral superior temporal cortex as subjects listened to words and nonwords with varying transition probabilities between sound segments. In addition to their sensitivity to acoustic features (including contextual features, such as coarticulation), we found that neural responses dynamically encoded the language-level probability of both preceding and upcoming speech sounds. Transition probability first negatively modulated neural responses, followed by positive modulation of neural responses, consistent with coordinated predictive and retrospective recognition processes, respectively. Furthermore, transition probability encoding was different for real English words compared with nonwords, providing evidence for online interactions with high-order linguistic knowledge. These results demonstrate that sensory processing of deeply learned stimuli involves integrating physical stimulus features with their contextual sequential structure. Despite not being consciously aware of phoneme sequence statistics, listeners use this information to process spoken input and to link low-level acoustic representations with linguistic information about word identity and meaning. PMID:25948269
High-Molecular-Mass Multi-c-Heme Cytochromes from Methylococcus capsulatus Bath†
Bergmann, David J.; Zahn, James A.; DiSpirito, Alan A.
1999-01-01
The polypeptide and structural gene for a high-molecular-mass c-type cytochrome, cytochrome c553O, was isolated from the methanotroph Methylococcus capsulatus Bath. Cytochrome c553O is a homodimer with a subunit molecular mass of 124,350 Da and an isoelectric point of 6.0. The heme c concentration was estimated to be 8.2 ± 0.4 mol of heme c per subunit. The electron paramagnetic resonance spectrum showed the presence of multiple low spin, S = 1/2, hemes. A degenerate oligonucleotide probe synthesized based on the N-terminal amino acid sequence of cytochrome c553O was used to identify a DNA fragment from M. capsulatus Bath that contains occ, the gene encoding cytochrome c553O. occ is part of a gene cluster which contains three other open reading frames (ORFs). ORF1 encodes a putative periplasmic c-type cytochrome with a molecular mass of 118,620 Da that shows approximately 40% amino acid sequence identity with occ and contains nine c-heme-binding motifs. ORF3 encodes a putative periplasmic c-type cytochrome with a molecular mass of 94,000 Da and contains seven c-heme-binding motifs but shows no sequence homology to occ or ORF1. ORF4 encodes a putative 11,100-Da protein. The four ORFs have no apparent similarity to any proteins in the GenBank database. The subunit molecular masses, arrangement and number of hemes, and amino acid sequences demonstrate that cytochrome c553O and the gene products of ORF1 and ORF3 constitute a new class of c-type cytochrome. PMID:9922265
Encoding color information for visual tracking: Algorithms and benchmark.
Liang, Pengpeng; Blasch, Erik; Ling, Haibin
2015-12-01
While color information is known to provide rich discriminative clues for visual inference, most modern visual trackers limit themselves to the grayscale realm. Despite recent efforts to integrate color in tracking, there is a lack of comprehensive understanding of the role color information can play. In this paper, we attack this problem by conducting a systematic study from both the algorithm and benchmark perspectives. On the algorithm side, we comprehensively encode 10 chromatic models into 16 carefully selected state-of-the-art visual trackers. On the benchmark side, we compile a large set of 128 color sequences with ground truth and challenge factor annotations (e.g., occlusion). A thorough evaluation is conducted by running all the color-encoded trackers, together with two recently proposed color trackers. A further validation is conducted on an RGBD tracking benchmark. The results clearly show the benefit of encoding color information for tracking. We also perform detailed analysis on several issues, including the behavior of various combinations between color model and visual tracker, the degree of difficulty of each sequence for tracking, and how different challenge factors affect the tracking performance. We expect the study to provide the guidance, motivation, and benchmark for future work on encoding color in visual tracking.
Grove, J R; Deutsch, P J; Price, D J; Habener, J F; Avruch, J
1989-11-25
Plasmids that encode a bioactive amino-terminal fragment of the heat-stable inhibitor of the cAMP-dependent protein kinase, PKI(1-31), were employed to characterize the role of this protein kinase in the control of transcriptional activity mediated by three DNA regulatory elements in the JEG-3 human placental cell line. The 5'-flanking sequence of the human collagenase gene contains the heptameric sequence, 5'-TGAGTCA-3', previously identified as a "phorbol ester" response element. Reporter genes containing either the intact 1.2-kilobase 5'-flanking sequence from the human collagenase gene or just the 7-base pair (bp) response element, when coupled to an enhancerless promoter, each exhibit both cAMP and phorbol ester-stimulated expression in JEG-3 cells. Cotransfection of either construct with plasmids encoding PKI(1-31) inhibits cAMP-stimulated but not basal- or phorbol ester-stimulated expression. Pretreatment of cells with phorbol ester for 1 or 2 days abrogates completely the response to rechallenge with phorbol ester but does not alter the basal expression of either construct; cAMP-stimulated expression, while modestly inhibited, remains vigorous. The 5'-flanking sequence of the human chorionic gonadotropin-alpha subunit (HCG alpha) gene has two copies of the sequence, 5'-TGACGTCA-3', contained in directly adjacent identical 18-bp segments, previously identified as a cAMP-response element. Reporter genes containing either the intact 1.5 kilobase of 5'-flanking sequence from the HCG alpha gene, or just the 36-bp tandem repeat cAMP response element, when coupled to an enhancerless promoter, both exhibit a vigorous cAMP stimulation of expression but no response to phorbol ester in JEG-3 cells. Cotransfection with plasmids encoding PKI(1-31) inhibits both basal and cAMP-stimulated expression in a parallel fashion. The 5'-flanking sequence of the human enkephalin gene mediates cAMP-stimulated expression of reporter genes in both JEG-3 and CV-1 cells. Plasmids encoding PKI(1-31) inhibit the expression that is stimulated by the addition of cAMP analogs in both cell lines; basal expression, however, is inhibited by PKI(1-31) only in the JEG-3 cell line and not in the CV-1 cells. These observations indicate that, in JEG-3 cells, PKI(1-31) is a specific inhibitor of kinase A-mediated gene transcription, but it does not modify kinase C-directed transcription.(ABSTRACT TRUNCATED AT 400 WORDS)
Molecular cloning of crustins from the hemocytes of Brazilian penaeid shrimps.
Rosa, Rafael Diego; Bandeira, Paula Terra; Barracco, Margherita Anna
2007-09-01
Crustins are antimicrobial peptides initially identified in the hemocytes of the crab Carcinus maenas (11.5-kDa peptide or carcinin) and recently also recognized in penaeid shrimps and other crustacean species. The aim of this study was to identify sequences encoding for crustins from the hemocytes of four Brazilian penaeid species: Farfantepenaeus paulensis, Farfantepenaeus subtilis, Farfantepenaeus brasiliensis and Litopenaeus schmitti. Using primers based on consensus nucleotide alignment of crustins from different crustaceans, cDNA sequences coding for crustins in all indigenous penaeid species were amplified. The obtained four crustin sequences encoded for peptides containing a hydrophobic N-terminal region rich in glycine repeats and a C-terminal part with 12 cysteine residues and a conserved whey acidic protein domain. All obtained crustin sequences showed high amino acidic similarity among each other and with crustins from litopenaeid shrimps (76-98%). This is the first report of crustins in native Brazilian penaeid shrimps.
Isolation of Onchocerca lupi in Dogs and Black Flies, California, USA
Hassan, Hassan K.; Bolcen, Shanna; Kubofcik, Joseph; Nutman, Thomas B.; Eberhard, Mark L.; Middleton, Kelly; Wekesa, Joseph Wakoli; Ruedas, Gimena; Nelson, Kimberly J.; Dubielzig, Richard; De Lombaert, Melissa; Silverman, Bruce; Schorling, Jamie J.; Adler, Peter H.; Beeler, Emily S.
2015-01-01
In southern California, ocular infections caused by Onchocerca lupi were diagnosed in 3 dogs (1 in 2006, 2 in 2012). The infectious agent was confirmed through morphologic analysis of fixed parasites in tissues and by PCR and sequencing of amplicons derived from 2 mitochondrially encoded genes and 1 nuclear-encoded gene. A nested PCR based on the sequence of the cytochrome oxidase subunit 1 gene of the parasite was developed and used to screen Simulium black flies collected from southern California for O. lupi DNA. Six (2.8%; 95% CI 0.6%–5.0%) of 213 black flies contained O. lupi DNA. Partial mitochondrial16S rRNA gene sequences from the infected flies matched sequences derived from black fly larvae cytotaxonomically identified as Simulium tribulatum. These data implicate S. tribulatum flies as a putative vector for O. lupi in southern California. PMID:25897954
Complete genome sequence of a Watermelon silver mottle virus isolate from China.
Rao, Xueqin; Wu, Zhuyan; Li, Yuan
2013-06-01
The complete genome of a Watermelon silver mottle virus (WSMoV) (genus Tospovirus, family Bunyaviridae) isolate (WSMoV-GZ) from Guangdong province, China was sequenced. The genomes of WSMoV-GZ contained 3,603, 4,909, and 8,914 nt of small (S), medium (M), and large (L) RNA segments, respectively, and had a genomic organization characteristic of members of the genus Tospovirus. The amino acid sequence of the nucleocapsid (N) protein, S RNA-encoded nonstructural (NSs) protein, M RNA-encoded nonstructural (NSm) protein, Gn/Gc glycoprotein precursor, and RNA-dependent RNA polymerase (RdRp) protein showed 94.3-97.5 % identity with those of other WSMoV isolates. Phylogenetic analysis showed that the N protein of WSMoV-GZ was clustered together with those of the WSMoV isolates. The full sequence of WSMoV-GZ provides a reference genome for comparison with other tospoviruses.
Episodic sequence memory is supported by a theta-gamma phase code.
Heusser, Andrew C; Poeppel, David; Ezzyat, Youssef; Davachi, Lila
2016-10-01
The meaning we derive from our experiences is not a simple static extraction of the elements but is largely based on the order in which those elements occur. Models propose that sequence encoding is supported by interactions between high- and low-frequency oscillations, such that elements within an experience are represented by neural cell assemblies firing at higher frequencies (gamma) and sequential order is encoded by the specific timing of firing with respect to a lower frequency oscillation (theta). During episodic sequence memory formation in humans, we provide evidence that items in different sequence positions exhibit greater gamma power along distinct phases of a theta oscillation. Furthermore, this segregation is related to successful temporal order memory. Our results provide compelling evidence that memory for order, a core component of an episodic memory, capitalizes on the ubiquitous physiological mechanism of theta-gamma phase-amplitude coupling.
Cloning and expression of cDNA coding for bouganin.
den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo
2002-03-01
Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.
USDA-ARS?s Scientific Manuscript database
The P. ultimum DAOM BR144 (=CBS 805.95 = ATCC200006) genome (42.8 Mb) encodes 15,290 genes, and has extensive sequence similarity and synteny with related Phytophthora spp., including the potato late blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86 % o...
Genome Sequences for Five Strains of the Emerging Pathogen Haemophilus haemolyticus
Jordan, I. King; Conley, Andrew B.; Antonov, Ivan V.; Arthur, Robert A.; Cook, Erin D.; Cooper, Guy P.; Jones, Bernard L.; Knipe, Kristen M.; Lee, Kevin J.; Liu, Xing; Mitchell, Gabriel J.; Pande, Pushkar R.; Petit, Robert A.; Qin, Shaopu; Rajan, Vani N.; Sarda, Shruti; Sebastian, Aswathy; Tang, Shiyuyun; Thapliyal, Racchit; Varghese, Neha J.; Ye, Tianjun; Katz, Lee S.; Wang, Xin; Rowe, Lori; Frace, Michael; Mayer, Leonard W.
2011-01-01
We report the first whole-genome sequences for five strains, two carried and three pathogenic, of the emerging pathogen Haemophilus haemolyticus. Preliminary analyses indicate that these genome sequences encode markers that distinguish H. haemolyticus from its closest Haemophilus relatives and provide clues to the identity of its virulence factors. PMID:21952546
Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China
USDA-ARS?s Scientific Manuscript database
A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...
Sequences show rapid motor transfer and spatial translation in the oculomotor system.
Stainer, Matthew J; Carpenter, R H S; Brotchie, Peter; Anderson, Andrew J
2016-07-01
Every day we perform learnt sequences of actions that seem to happen almost without awareness. It has been argued that for learning such sequences parallel learning networks exist - one using spatial coordinates and one using motor coordinates - with sequence acquisition involving a progressive shift from the former to the latter as a sequence is rehearsed. When sequences are interrupted by an out-of-sequence target, there is a delay in the response to the target, and so here we transiently interrupt oculomotor sequences to probe the influence of oculomotor rehearsal and spatial coordinates in sequence acquisition. For our main experiments, we used a repeating sequences of eight targets in length that was first learnt either using saccadic eye movements (left/right), manual responses (left/right or up/down) or as a sequence of colour (blue/red) requiring no motor response. The sequence was immediately repeated for saccadic eye movements, during which the influence of on out-of-sequence target (an interruption) was assessed. When a sequence is learnt beforehand in an abstract way (for example, as a sequence of colours or of orthogonally mapped manual responses), interruptions are immediately disruptive to latency, suggesting neither motor rehearsal nor specific spatial coordinates are essential for encoding sequences of actions and that sequences - no matter how they are encoded - can be rapidly translated into oculomotor coordinates. The magnitude of a disruption does, however, correspond to how well a sequence is learnt: introducing an interruption to an extended sequence before it was reliably learnt reduces the magnitude of the latency disruption. Copyright © 2016 Elsevier Ltd. All rights reserved.
A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast.
Jaschke, Paul R; Lieberman, Erica K; Rodriguez, Jon; Sierra, Adrian; Endy, Drew
2012-12-20
The 5386 nucleotide bacteriophage øX174 genome has a complicated architecture that encodes 11 gene products via overlapping protein coding sequences spanning multiple reading frames. We designed a 6302 nucleotide synthetic surrogate, øX174.1, that fully separates all primary phage protein coding sequences along with cognate translation control elements. To specify øX174.1f, a decompressed genome the same length as wild type, we truncated the gene F coding sequence. We synthesized DNA encoding fragments of øX174.1f and used a combination of in vitro- and yeast-based assembly to produce yeast vectors encoding natural or designer bacteriophage genomes. We isolated clonal preparations of yeast plasmid DNA and transfected E. coli C strains. We recovered viable øX174 particles containing the øX174.1f genome from E. coli C strains that independently express full-length gene F. We expect that yeast can serve as a genomic 'drydock' within which to maintain and manipulate clonal lineages of other obligate lytic phage. Copyright © 2012 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mavromatis, K; Doyle, C Kuyler; Lykidis, A
2006-01-01
Ehrlichia canis, a small obligately intracellular, tick-transmitted, gram-negative, {alpha}-proteobacterium, is the primary etiologic agent of globally distributed canine monocytic ehrlichiosis. Complete genome sequencing revealed that the E. canis genome consists of a single circular chromosome of 1,315,030 bp predicted to encode 925 proteins, 40 stable RNA species, 17 putative pseudogenes, and a substantial proportion of noncoding sequence (27%). Interesting genome features include a large set of proteins with transmembrane helices and/or signal sequences and a unique serine-threonine bias associated with the potential for O glycosylation that was prominent in proteins associated with pathogen-host interactions. Furthermore, two paralogous protein families associatedmore » with immune evasion were identified, one of which contains poly(G-C) tracts, suggesting that they may play a role in phase variation and facilitation of persistent infections. Genes associated with pathogen-host interactions were identified, including a small group encoding proteins (n = 12) with tandem repeats and another group encoding proteins with eukaryote-like ankyrin domains (n = 7).« less
Le, Yuan; Stein, Ashley; Berry, Colin; Kellman, Peter; Bennett, Eric E.; Taylor, Joni; Lucas, Katherine; Kopace, Rael; Chefd’Hotel, Christophe; Lorenz, Christine H.; Croisille, Pierre; Wen, Han
2010-01-01
The purpose of this study is to develop and evaluate a displacement-encoded pulse sequence for simultaneous perfusion and strain imaging. Displacement-encoded images in 2–3 myocardial slices were repeatedly acquired using a single shot pulse sequence for 3 to 4 minutes, which covers a bolus infusion of Gd. The magnitudes of the images were T1 weighted and provided quantitative measures of perfusion, while the phase maps yielded strain measurements. In an acute coronary occlusion swine protocol (n=9), segmental perfusion measurements were validated against microsphere reference standard with a linear regression (slope 0.986, R2 = 0.765, Bland-Altman standard deviation = 0.15 ml/min/g). In a group of ST-elevation myocardial infarction(STEMI) patients (n=11), the scan success rate was 76%. Short-term contrast washout rate and perfusion are highly correlated (R2=0.72), and the pixel-wise relationship between circumferential strain and perfusion was better described with a sigmoidal Hill curve than linear functions. This study demonstrates the feasibility of measuring strain and perfusion from a single set of images. PMID:20544714
Bouchard, Kristofer E.; Ganguli, Surya; Brainard, Michael S.
2015-01-01
The majority of distinct sensory and motor events occur as temporally ordered sequences with rich probabilistic structure. Sequences can be characterized by the probability of transitioning from the current state to upcoming states (forward probability), as well as the probability of having transitioned to the current state from previous states (backward probability). Despite the prevalence of probabilistic sequencing of both sensory and motor events, the Hebbian mechanisms that mold synapses to reflect the statistics of experienced probabilistic sequences are not well understood. Here, we show through analytic calculations and numerical simulations that Hebbian plasticity (correlation, covariance, and STDP) with pre-synaptic competition can develop synaptic weights equal to the conditional forward transition probabilities present in the input sequence. In contrast, post-synaptic competition can develop synaptic weights proportional to the conditional backward probabilities of the same input sequence. We demonstrate that to stably reflect the conditional probability of a neuron's inputs and outputs, local Hebbian plasticity requires balance between competitive learning forces that promote synaptic differentiation and homogenizing learning forces that promote synaptic stabilization. The balance between these forces dictates a prior over the distribution of learned synaptic weights, strongly influencing both the rate at which structure emerges and the entropy of the final distribution of synaptic weights. Together, these results demonstrate a simple correspondence between the biophysical organization of neurons, the site of synaptic competition, and the temporal flow of information encoded in synaptic weights by Hebbian plasticity while highlighting the utility of balancing learning forces to accurately encode probability distributions, and prior expectations over such probability distributions. PMID:26257637
Capturing novel mouse genes encoding chromosomal and other nuclear proteins.
Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A
1998-09-01
The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogeneous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with splicesome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.
Minimum variance optimal rate allocation for multiplexed H.264/AVC bitstreams.
Tagliasacchi, Marco; Valenzise, Giuseppe; Tubaro, Stefano
2008-07-01
Consider the problem of transmitting multiple video streams to fulfill a constant bandwidth constraint. The available bit budget needs to be distributed across the sequences in order to meet some optimality criteria. For example, one might want to minimize the average distortion or, alternatively, minimize the distortion variance, in order to keep almost constant quality among the encoded sequences. By working in the rho-domain, we propose a low-delay rate allocation scheme that, at each time instant, provides a closed form solution for either the aforementioned problems. We show that minimizing the distortion variance instead of the average distortion leads, for each of the multiplexed sequences, to a coding penalty less than 0.5 dB, in terms of average PSNR. In addition, our analysis provides an explicit relationship between model parameters and this loss. In order to smooth the distortion also along time, we accommodate a shared encoder buffer to compensate for rate fluctuations. Although the proposed scheme is general, and it can be adopted for any video and image coding standard, we provide experimental evidence by transcoding bitstreams encoded using the state-of-the-art H.264/AVC standard. The results of our simulations reveal that is it possible to achieve distortion smoothing both in time and across the sequences, without sacrificing coding efficiency.
Molecular cloning of a cDNA encoding the glycoprotein of hen oviduct microsomal signal peptidase.
Newsome, A L; McLean, J W; Lively, M O
1992-01-01
Detergent-solubilized hen oviduct signal peptidase has been characterized previously as an apparent complex of a 19 kDa protein and a 23 kDa glycoprotein (GP23) [Baker & Lively (1987) Biochemistry 26, 8561-8567]. A cDNA clone encoding GP23 from a chicken oviduct lambda gt11 cDNA library has now been characterized. The cDNA encodes a protein of 180 amino acid residues with a single site for asparagine-linked glycosylation that has been directly identified by amino acid sequence analysis of a tryptic-digest peptide containing the glycosylated site. Immunoblot analysis reveals cross-reactivity with a dog pancreas protein. Comparison of the deduced amino acid sequence of GP23 with the 22/23 kDa glycoprotein of dog microsomal signal peptidase [Shelness, Kanwar & Blobel (1988) J. Biol. Chem. 263, 17063-17070], one of five proteins associated with this enzyme, reveals that the amino acid sequences are 90% identical. Thus the signal peptidase glycoprotein is as highly conserved as the sequences of cytochromes c and b from these same species and is likely to be found in a similar form in many, if not all, vertebrate species. The data also show conclusively that the dog and avian signal peptidases have at least one protein subunit in common. Images Fig. 1. PMID:1546959
Identification and characterisation of seed storage protein transcripts from Lupinus angustifolius
2011-01-01
Background In legumes, seed storage proteins are important for the developing seedling and are an important source of protein for humans and animals. Lupinus angustifolius (L.), also known as narrow-leaf lupin (NLL) is a grain legume crop that is gaining recognition as a potential human health food as the grain is high in protein and dietary fibre, gluten-free and low in fat and starch. Results Genes encoding the seed storage proteins of NLL were characterised by sequencing cDNA clones derived from developing seeds. Four families of seed storage proteins were identified and comprised three unique α, seven β, two γ and four δ conglutins. This study added eleven new expressed storage protein genes for the species. A comparison of the deduced amino acid sequences of NLL conglutins with those available for the storage proteins of Lupinus albus (L.), Pisum sativum (L.), Medicago truncatula (L.), Arachis hypogaea (L.) and Glycine max (L.) permitted the analysis of a phylogenetic relationships between proteins and demonstrated, in general, that the strongest conservation occurred within species. In the case of 7S globulin (β conglutins) and 2S sulphur-rich albumin (δ conglutins), the analysis suggests that gene duplication occurred after legume speciation. This contrasted with 11S globulin (α conglutin) and basic 7S (γ conglutin) sequences where some of these sequences appear to have diverged prior to speciation. The most abundant NLL conglutin family was β (56%), followed by α (24%), δ (15%) and γ (6%) and the transcript levels of these genes increased 103 to 106 fold during seed development. We used the 16 NLL conglutin sequences identified here to determine that for individuals specifically allergic to lupin, all seven members of the β conglutin family were potential allergens. Conclusion This study has characterised 16 seed storage protein genes in NLL including 11 newly-identified members. It has helped lay the foundation for efforts to use molecular breeding approaches to improve lupins, for example by reducing allergens or increasing the expression of specific seed storage protein(s) with desirable nutritional properties. PMID:21457583
Identification and characterisation of seed storage protein transcripts from Lupinus angustifolius.
Foley, Rhonda C; Gao, Ling-Ling; Spriggs, Andrew; Soo, Lena Y C; Goggin, Danica E; Smith, Penelope M C; Atkins, Craig A; Singh, Karam B
2011-04-04
In legumes, seed storage proteins are important for the developing seedling and are an important source of protein for humans and animals. Lupinus angustifolius (L.), also known as narrow-leaf lupin (NLL) is a grain legume crop that is gaining recognition as a potential human health food as the grain is high in protein and dietary fibre, gluten-free and low in fat and starch. Genes encoding the seed storage proteins of NLL were characterised by sequencing cDNA clones derived from developing seeds. Four families of seed storage proteins were identified and comprised three unique α, seven β, two γ and four δ conglutins. This study added eleven new expressed storage protein genes for the species. A comparison of the deduced amino acid sequences of NLL conglutins with those available for the storage proteins of Lupinus albus (L.), Pisum sativum (L.), Medicago truncatula (L.), Arachis hypogaea (L.) and Glycine max (L.) permitted the analysis of a phylogenetic relationships between proteins and demonstrated, in general, that the strongest conservation occurred within species. In the case of 7S globulin (β conglutins) and 2S sulphur-rich albumin (δ conglutins), the analysis suggests that gene duplication occurred after legume speciation. This contrasted with 11S globulin (α conglutin) and basic 7S (γ conglutin) sequences where some of these sequences appear to have diverged prior to speciation. The most abundant NLL conglutin family was β (56%), followed by α (24%), δ (15%) and γ (6%) and the transcript levels of these genes increased 103 to 106 fold during seed development. We used the 16 NLL conglutin sequences identified here to determine that for individuals specifically allergic to lupin, all seven members of the β conglutin family were potential allergens. This study has characterised 16 seed storage protein genes in NLL including 11 newly-identified members. It has helped lay the foundation for efforts to use molecular breeding approaches to improve lupins, for example by reducing allergens or increasing the expression of specific seed storage protein(s) with desirable nutritional properties.
Wang, Guiqin; Yin, Renfu; Zhou, Paul; Ding, Zhuang
2017-01-01
Hemagglutinin (HA) head has long been considered to be able to elicit only a narrow, strain-specific antibody response as it undergoes rapid antigenic drift. However, we previously showed that a heterologous prime-boost strategy, in which mice were primed twice with DNA encoding HA and boosted once with virus-like particles (VLP) from an H5N1 strain A/Thailand/1(KAN)-1/2004 (noted as TH DDV), induced anti-head broad cross-H5 neutralizing antibody response. To explain why TH DDV immunization could generate such breadth, we systemically compared the neutralization breadth and potency between TH DDV sera and immune sera elicited by TH DDD (three times of DNA immunizations), TH VVV (three times of VLP immunizations), TH DV (one DNA prime plus one VLP boost) and TK DDV (plasmid DNA and VLP derived from another H5N1 strain, A/Turkey/65596/2006). Then we determined the antigenic sites (AS) on TH HA head and the key residues of the main antigenic site. Through the comparison of different regiments, we found that the combination of the immunization with the sequence close to the consensus sequence and two DNA prime plus one VLP boost caused that TH DDV immunization generate broad neutralizing antibodies. Antigenic analysis showed that TH DDV, TH DV, TH DDD and TH VVV sera recognize the common antigenic site AS1. Antibodies directed to AS1 contribute to the largest proportion of the neutralizing activity of these immune sera. Residues 188 and 193 in AS1 are the key residues which are responsible for neutralization breadth of the immune sera. Interestingly, residues 188 and 193 locate in classical antigen sites but are relatively conserved among the 16 tested strains and 1,663 HA sequences from NCBI database. Thus, our results strongly indicate that it is feasible to develop broad cross-H5 influenza vaccines against HA head. PMID:28542275
BLIPPED (BLIpped Pure Phase EncoDing) high resolution MRI with low amplitude gradients
NASA Astrophysics Data System (ADS)
Xiao, Dan; Balcom, Bruce J.
2017-12-01
MRI image resolution is proportional to the maximum k-space value, i.e. the temporal integral of the magnetic field gradient. High resolution imaging usually requires high gradient amplitudes and/or long spatial encoding times. Special gradient hardware is often required for high amplitudes and fast switching. We propose a high resolution imaging sequence that employs low amplitude gradients. This method was inspired by the previously proposed PEPI (π Echo Planar Imaging) sequence, which replaced EPI gradient reversals with multiple RF refocusing pulses. It has been shown that when the refocusing RF pulse is of high quality, i.e. sufficiently close to 180°, the magnetization phase introduced by the spatial encoding magnetic field gradient can be preserved and transferred to the following echo signal without phase rewinding. This phase encoding scheme requires blipped gradients that are identical for each echo, with low and constant amplitude, providing opportunities for high resolution imaging. We now extend the sequence to 3D pure phase encoding with low amplitude gradients. The method is compared with the Hybrid-SESPI (Spin Echo Single Point Imaging) technique to demonstrate the advantages in terms of low gradient duty cycle, compensation of concomitant magnetic field effects and minimal echo spacing, which lead to superior image quality and high resolution. The 3D imaging method was then applied with a parallel plate resonator RF probe, achieving a nominal spatial resolution of 17 μm in one dimension in the 3D image, requiring a maximum gradient amplitude of only 5.8 Gauss/cm.
Ovule development: identification of stage-specific and tissue-specific cDNAs.
Nadeau, J A; Zhang, X S; Li, J; O'Neill, S D
1996-01-01
A differential screening approach was used to identify seven ovule-specific cDNAs representing genes that are expressed in a stage-specific manner during ovule development. The Phalaenopsis orchid takes 80 days to complete the sequence of ovule developmental events, making it a good system to isolate stage-specific ovule genes. We constructed cDNA libraries from orchid ovule tissue during archesporial cell differentiation, megasporocyte formation, and the transition to meiosis, as well as during the final mitotic divisions of female gametophyte development. RNA gel blot hybridization analysis revealed that four clones were stage specific and expressed solely in ovule tissue, whereas one clone was specific to pollen tubes. Two other clones were not ovule specific. Sequence analysis and in situ hybridization revealed the identities and domain of expression of several of the cDNAs. O39 encodes a putative homeobox transcription factor that is expressed early in the differentiation of the ovule primordium; O40 encodes a cytochrome P450 monooxygenase (CYP78A2) that is pollen tube specific. O108 encodes a protein of unknown function that is expressed exclusively in the outer layer of the outer integument and in the female gametophyte of mature ovules. O126 encodes a glycine-rich protein that is expressed in mature ovules, and O141 encodes a cysteine proteinase that is expressed in the outer integument of ovules during seed formation. Sequences homologous to these ovule clones can now be isolated from other organisms, and this should facilitate their functional characterization. PMID:8742709
Recombinant constructs of Borrelia burgdorferi
Dattwyler, Raymond J.; Gomes-Solecki, Maria J. C.; Luft, Benjamin J.; Dunn, John J.
2007-02-20
Novel chimeric nucleic acids, encoding chimeric Borrelia proteins comprising OspC or an antigenic fragment thereof and OspA or an antigenic fragment thereof, are disclosed. Chimeric proteins encoded by the nucleic acid sequences are also disclosed. The chimeric proteins are useful as vaccine immunogens against Lyme borreliosis, as well as for immunodiagnostic reagents.
Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F
2016-01-01
The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.
Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K.; Fryszczyn, Bartlomiej G.; Fox, George E.; Tirumalai, Madhan R.; Liu, Yamei; Kim, Sun
2015-01-01
Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. PMID:25953173
Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A
1997-07-01
Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lactin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lactin, further suggesting an interaction between these phloem-specific proteins.
Yocum, R R; Perkins, J B; Howitt, C L; Pero, J
1996-01-01
The metE gene, encoding S-adenosylmethionine synthetase (EC 2.5.1.6) from Bacillus subtilis, was cloned in two steps by normal and inverse PCR. The DNA sequence of the metE gene contains an open reading frame which encodes a 400-amino-acid sequence that is homologous to other known S-adenosylmethionine synthetases. The cloned gene complements the metE1 mutation and integrates at or near the chromosomal site of metE1. Expression of S-adenosylmethionine synthetase is reduced by only a factor of about 2 by exogenous methioinine. Overproduction of S-adenosylmethionine synthetase from a strong constitutive promoter leads to methionine auxotrophy in B. subtilis, suggesting that S-adenosylmethionine is a corepressor of methionine biosynthesis in B. subtilis, as others have already shown for Escherichia coli. PMID:8755891
Yocum, R R; Perkins, J B; Howitt, C L; Pero, J
1996-08-01
The metE gene, encoding S-adenosylmethionine synthetase (EC 2.5.1.6) from Bacillus subtilis, was cloned in two steps by normal and inverse PCR. The DNA sequence of the metE gene contains an open reading frame which encodes a 400-amino-acid sequence that is homologous to other known S-adenosylmethionine synthetases. The cloned gene complements the metE1 mutation and integrates at or near the chromosomal site of metE1. Expression of S-adenosylmethionine synthetase is reduced by only a factor of about 2 by exogenous methioinine. Overproduction of S-adenosylmethionine synthetase from a strong constitutive promoter leads to methionine auxotrophy in B. subtilis, suggesting that S-adenosylmethionine is a corepressor of methionine biosynthesis in B. subtilis, as others have already shown for Escherichia coli.
Dong, J G; Kim, W T; Yip, W K; Thompson, G A; Li, L; Bennett, A B; Yang, S F
1991-08-01
1-Aminocyclopropane-1-carboxylate (ACC) synthase (EC 4.4.1.14) purified from apple (Malus sylvestris Mill.) fruit was subjected to trypsin digestion. Following separation by reversed-phase high-pressure liquid chromatography, ten tryptic peptides were sequenced. Based on the sequences of three tryptic peptides, three sets of mixed oligonucleotide probes were synthesized and used to screen a plasmid cDNA library prepared from poly(A)(+) RNA of ripe apple fruit. A 1.5-kb (kilobase) cDNA clone which hybridized to all three probes were isolated. The clone contained an open reading frame of 1214 base pairs (bp) encoding a sequence of 404 amino acids. While the polyadenine tail at the 3'-end was intact, it lacked a portion of sequence at the 5'-end. Using the RNA-based polymerase chain reaction, an additional sequence of 148 bp was obtained at the 5'-end. Thus, 1362 bp were sequenced and they encode 454 amino acids. The deduced amino-acid sequence contained peptide sequences corresponding to all ten tryptic fragments, confirming the identity of the cDNA clone. Comparison of the deduced amino-acid sequence between ACC synthase from apple fruit and those from tomato (Lycopersicon esculentum Mill.) and winter squash (Cucurbita maxima Duch.) fruits demonstrated the presence of seven highly conserved regions, including the previously identified region for the active site. The size of the translation product of ACC-synthase mRNA was similar to that of the mature protein on sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), indicating that apple ACC-synthase undergoes only minor, if any, post-translational proteolytic processing. Analysis of ACC-synthase mRNA by in-vitro translation-immunoprecipitation, and by Northern blotting indicates that the ACC-synthase mRNA was undetectable in unripe fruit, but was accumulated massively during the ripening proccess. These data demonstrate that the expression of the ACC-synthase gene is developmentally regulated.
Complete genomic sequence of a Tobacco rattle virus isolate from Michigan-grown potatoes.
Crosslin, James M; Hamm, Philip B; Kirk, William W; Hammond, Rosemarie W
2010-04-01
Tobacco rattle virus (TRV) causes stem mottle on potato leaves and necrotic arcs and rings in potato tubers, known as corky ringspot disease. Recently, TRV was reported in Michigan potato tubers cv. FL1879 exhibiting corky ringspot disease. Sequence analysis of the RNA-1-encoded 16-kDa gene of the Michigan isolate, designated MI-1, revealed homology to TRV isolates from Florida and Washington. Here, we report the complete genomic sequence of RNA-1 (6,791 nt) and RNA-2 (3,685 nt) of TRV MI-1. RNA-1 is predicted to contain four open reading frames, and the genome structure and phylogenetic analyses of the RNA-1 nucleotide sequence revealed significant homologies to the known sequences of other TRV-1 isolates. The relationships based on the full-length nucleotide sequence were different from than those based on the 16-kDa gene encoded on genomic RNA-1 and reflect sequence variation within a 20-25-aa residue region of the 16-kDa protein. MI-1 RNA-2 is predicted to contain three ORFs, encoding the coat protein (CP), a 37.6-kDa protein (ORF 2b), and a 33.6-kDa protein (ORF 2c). In addition, it contains a region of similarity to the 3' terminus of RNA-1, including a truncated portion of the 16-kDa cistron. Phylogenetic analysis of RNA-2, based on a comparison of nucleotide sequences with other members of the genus Tobravirus, indicates that TRV MI-1 and other North American isolates cluster as a distinct group. TRV M1-1 is only the second North American isolate for which there is a complete sequence of the genome, and it is distinct from the North American isolate TRV ORY. The relationship of the TRV MI-1 isolate to other tobravirus isolates is discussed.
Conceptual issues in Bayesian divergence time estimation
2016-01-01
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325831
Conceptual issues in Bayesian divergence time estimation.
Rannala, Bruce
2016-07-19
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized.This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Author(s).
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.
Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E
1982-01-01
We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
JVM: Java Visual Mapping tool for next generation sequencing read.
Yang, Ye; Liu, Juan
2015-01-01
We developed a program JVM (Java Visual Mapping) for mapping next generation sequencing read to reference sequence. The program is implemented in Java and is designed to deal with millions of short read generated by sequence alignment using the Illumina sequencing technology. It employs seed index strategy and octal encoding operations for sequence alignments. JVM is useful for DNA-Seq, RNA-Seq when dealing with single-end resequencing. JVM is a desktop application, which supports reads capacity from 1 MB to 10 GB.
Blackburn, Michael B; Sparks, Michael E; Gundersen-Rindal, Dawn E
2016-12-01
The genome of Chromobacterium subtsugae strain PRAA4-1, a betaproteobacterium producing insecticidal compounds, was sequenced and compared with the genome of C. violaceum ATCC 12472. The genome of C. subtsugae displayed a reduction in genes devoted to capsular and extracellular polysaccharide, possessed no genes encoding nitrate reductases, and exhibited many more phage-related sequences than were observed for C. violaceum. The genomes of both species possess a number of gene clusters predicted to encode biosynthetic complexes for secondary metabolites; these clusters suggest they produce overlapping, but distinct assortments of metabolites.
Jonas, V; Lin, C R; Kawashima, E; Semon, D; Swanson, L W; Mermod, J J; Evans, R M; Rosenfeld, M G
1985-01-01
Two mRNAs generated as a consequence of alternative RNA processing events in expression of the human calcitonin gene encode the protein precursors of either calcitonin or calcitonin gene-related peptide (CGRP). Both calcitonin and CGRP RNAs and their encoded peptide products are expressed in the human pituitary and in medullary thyroid tumors. On the basis of sequence comparison, it is suggested that both the calcitonin and CGRP exons arose from a common primordial sequence, suggesting that duplication and rearrangement events are responsible for the generation of this complex transcription unit. Images PMID:3872459
Identification and cloning of a gamma 3 subunit splice variant of the human GABA(A) receptor.
Poulsen, C F; Christjansen, K N; Hastrup, S; Hartvig, L
2000-05-31
cDNA sequences encoding two forms of the GABA(A) gamma 3 receptor subunit were cloned from human hippocampus. The nucleotide sequences differ by the absence (gamma 3S) or presence (gamma 3L) of 18 bp located in the presumed intracellular loop between transmembrane region (TM) III and IV. The extra 18 bp in the gamma 3L subunit generates a consensus site for phosphorylation by protein kinase C (PKC). Analysis of human genomic DNA encoding the gamma 3 subunit reveals that the 18 bp insert is contiguous with the upstream proximal exon.
Cloning and sequence analysis of the LEU2 homologue gene from Pichia anomala.
De la Rosa, J M; Pérez, J A; Gutiérrez, F; González, J M; Ruiz, T; Rodríguez, L
2001-11-01
The Pichia anomala LEU2 gene (PaLEU2) was isolated by complementation of a leu2 Saccharomyces cerevisiae mutant. The cloned gene also allowed growth of a Escherichia coli leuB mutant in leucine-lacking medium, indicating that it encodes a product able to complement the beta-isopropylmalate dehydrogenase deficiency of the mutants. The sequenced DNA fragment contains a complete ORF of 1092 bp, and the deduced polypeptide shares significant homologies with the products of the LEU2 genes from S. cerevisiae (84% identity) and other yeast species. A sequence resembling the GC-rich palindrome motif identified in the 5' region of S. cerevisiae LEU2 gene as the binding site for the transcription activating factor encoded by the LEU3 gene was found at the promoter region. In addition, upstream of the PaLEU2 the 3'-terminal half of a gene of the same orientation, encoding a homologue of the S. cerevisiae NFS1/SPL1 gene that encodes a mitochondrial cysteine desulphurase involved in both tRNA processing and mitochondrial metabolism, was found. The genomic organization of the PaNFS1-PaLEU2 gene pair is similar to that found in several other yeast species, including S. cerevisiae and Candida albicans, except that in some of them the LEU2 gene appears in the reverse orientation. Copyright 2001 John Wiley & Sons, Ltd.
de-Couet, H. G.; Fong, KSK.; Weeds, A. G.; McLaughlin, P. J.; Miklos, GLG.
1995-01-01
The flightless locus of Drosophila melanogaster has been analyzed at the genetic, molecular, ultrastructural and comparative crystallographic levels. The gene encodes a single transcript encoding a protein consisting of a leucine-rich amino terminal half and a carboxyterminal half with high sequence similarity to gelsolin. We determined the genomic sequence of the flightless landscape, the breakpoints of four chromosomal rearrangements, and the molecular lesions in two lethal and two viable alleles of the gene. The two alleles that lead to flight muscle abnormalities encode mutant proteins exhibiting amino acid replacements within the S1-like domain of their gelsolin-like region. Furthermore, the deduced intronexon structure of the D. melanogaster gene has been compared with that of the Caenorhabditis elegans homologue. Furthermore, the sequence similarities of the flightless protein with gelsolin allow it to be evaluated in the context of the published crystallographic structure of the S1 domain of gelsolin. Amino acids considered essential for the structural integrity of the core are found to be highly conserved in the predicted flightless protein. Some of the residues considered essential for actin and calcium binding in gelsolin S1 and villin V1 are also well conserved. These data are discussed in light of the phenotypic characteristics of the mutants and the putative functions of the protein. PMID:8582612
Regional outbreak of CTX-M-2 β-lactamase-producing Proteus mirabilis in Japan.
Nakano, Ryuichi; Nakano, Akiyo; Abe, Michiko; Inoue, Matsuhisa; Okamoto, Ryoichi
2012-12-01
Proteus mirabilis is a common cause of urinary tract infection. Wild-type P. mirabilis strains are usually susceptible to penicillins and cephalosporins, but occurrences of P. mirabilis producing extended-spectrum β-lactamases (ESBLs) have been recently reported. Here, we surveyed the prevalence of cefotaxime resistance among P. mirabilis strains at seven different hospitals in Kanagawa Prefecture, Japan, and investigated their molecular epidemiology to explain the mechanism of their spread. The prevalence of cefotaxime resistance among P. mirabilis increased annually, from 10.1 % in 1998 to 23.1 % in 2003, and increased drastically in 2004, exceeding 40 %. We collected 105 consecutive and non-duplicate cefotaxime-resistant P. mirabilis isolates (MIC 16 to >256 µg ml(-1)) from these hospitals from June 2004 to May 2005 and characterized their profile. PCR and sequence analysis revealed that all resistant strains produced exclusively CTX-M-2 β-lactamase. PFGE analysis identified 47 banding patterns with 83 % or greater similarity. These results indicated that a regional outbreak of P. mirabilis producing CTX-M-2 β-lactamase has occurred in Japan and suggest that the epidemic spread occurred within and across hospitals and communities by extended clonal strains. Plasmid analysis revealed that 44.8 % of plasmids harboured by bla(CTX-M-2) isolates had common profiles, encoding ISEcp1, IS26 and Int1, and belonged to incompatibility group T. Spread of the resistant isolates in Japan resulted from dissemination of narrow-host-range plasmids of the IncT group encoding bla(CTX-M-2). These findings indicate the rapidly developing problem of treating the species to prevent dissemination of ESBL producers.
Amicosante, G; Oratore, A; Joris, B; Galleni, M; Frère, J M; Van Beeumen, J
1988-01-01
Both forms of the chromosome-encoded beta-lactamase of Citrobacter diversus react with beta-iodopenicillanate at a rate characteristic of class A beta-lactamases. The active site of form I was labelled with the same reagent. The sequence of the peptide obtained after trypsin hydrolysis is identical with that of a peptide obtained in a similar manner from the chromosome-encoded beta-lactamase of Klebsiella pneumoniae. PMID:2848500
Lloyd-Jones, G; Lau, P C
1997-01-01
Homologs of the glutathione S-transferase (GST)-encoding gene were identified in a collection of aromatic hydrocarbon-degrading Sphingomonas spp. isolated from New Zealand, Antarctica, and the United States by using PCR primers designed from the GST-encoding gene of Sphingomonas paucimobilis EPA505. Sequence analysis of PCR fragments generated from these isolates and of the GST gene amplified from DNA extracted from polycyclic aromatic hydrocarbon (PAH)-contaminated soil revealed a high degree of conservation, which may make the GST-encoding gene a potentially useful marker for PAH-degrading bacteria. PMID:9251217
Meeuwissen, Esther B; Takashima, Atsuko; Fernández, Guillén; Jensen, Ole
2011-12-01
It is becoming increasingly clear that demanding cognitive tasks rely on an extended network engaging task-relevant areas and, importantly, disengaging task-irrelevant areas. Given that alpha activity (8-12 Hz) has been shown to reflect the disengagement of task-irrelevant regions in attention and working memory tasks, we here ask if alpha activity plays a related role for long-term memory formation. Subjects were instructed to encode and maintain the order of word sequences while the ongoing brain activity was recorded using magnetoencephalography (MEG). In each trial, three words were presented followed by a 3.4 s rehearsal interval. Considering the good temporal resolution of MEG this allowed us to investigate the word presentation and rehearsal interval separately. The sequences were grouped in trials where word order either could be tested immediately (working memory trials; WM) or later (LTM trials) according to instructions. Subjects were tested on their ability to retrieve the order of the three words. The data revealed that alpha power in parieto-occipital regions was lower during word presentation compared to rehearsal. Our key finding was that parieto-occipital alpha power during the rehearsal period was markedly stronger for successfully than unsuccessfully encoded LTM sequences. This subsequent memory effect demonstrates that high posterior alpha activity creates an optimal brain state for successful LTM formation possibly by actively reducing parieto-occipital activity that might interfere with sequence encoding. Copyright © 2010 Wiley Periodicals, Inc.
Yasukawa, Hiro; Sato, Aya; Kita, Ayaka; Kodaira, Ken-Ichi; Iseki, Mineo; Takahashi, Tetsuo; Shibusawa, Mami; Watanabe, Masakatsu; Yagita, Kenji
2013-01-01
Complete genome sequencing of Naegleria gruberi has revealed that the organism encodes polypeptides similar to photoactivated adenylyl cyclases (PACs). Screening in the N. australiensis genome showed that the organism also encodes polypeptides similar to PACs. Each of the Naegleria proteins consists of a "sensors of blue-light using FAD" domain (BLUF domain) and an adenylyl cyclase domain (AC domain). PAC activity of the Naegleria proteins was assayed by comparing sensitivities of Escherichia coli cells heterologously expressing the proteins to antibiotics in a dark condition and a blue light-irradiated condition. Antibiotics used in the assays were fosfomycin and fosmidomycin. E. coli cells expressing the Naegleria proteins showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light, indicating that the proteins functioned as PACs in the bacterial cells. Analysis of the N. fowleri genome revealed that the organism encodes a protein bearing an amino acid sequence similar to that of BLUF. A plasmid expressing a chimeric protein consisting of the BLUF-like sequence found in N. fowleri and the adenylyl cyclase domain of N. gruberi PAC was constructed to determine whether the BLUF-like sequence functioned as a sensor of blue light. E. coli cells expressing a chimeric protein showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light. These experimental results indicated that the sequence similar to the BLUF domain found in N. fowleri functioned as a sensor of blue light.
The spectrum and clinical impact of epigenetic modifier mutations in myeloma
Pawlyn, Charlotte; Kaiser, Martin F; Heuck, Christoph; Melchor, Lorenzo; Wardell, Christopher P; Murison, Alex; Chavan, Shweta; Johnson, David C; Begum, Dil; Dahir, Nasrin; Proszek, Paula; Cairns, David A; Boyle, Eileen M; Jones, John R; Cook, Gordon; Drayson, Mark T; Owen, Roger G; Gregory, Walter M; Jackson, Graham H; Barlogie, Bart; Davies, Faith E; Walker, Brian A; Morgan, Gareth J
2016-01-01
Purpose Epigenetic dysregulation is known to be an important contributor to myeloma pathogenesis but, unlike in other B cell malignancies, the full spectrum of somatic mutations in epigenetic modifiers has not been previously reported. We sought to address this using results from whole-exome sequencing in the context of a large prospective clinical trial of newly diagnosed patients and targeted sequencing in a cohort of previously treated patients for comparison. Experimental Design Whole-exome sequencing analysis of 463 presenting myeloma cases entered in the UK NCRI Myeloma XI study and targeted sequencing analysis of 156 previously treated cases from the University of Arkansas for Medical Sciences. We correlated the presence of mutations with clinical outcome from diagnosis and compared the mutations found at diagnosis with later stages of disease. Results In diagnostic myeloma patient samples we identify significant mutations in genes encoding the histone 1 linker protein, previously identified in other B-cell malignancies. Our data suggest an adverse prognostic impact from the presence of lesions in genes encoding DNA methylation modifiers and the histone demethylase KDM6A/UTX. The frequency of mutations in epigenetic modifiers appears to increase following treatment most notably in genes encoding histone methyltransferases and DNA methylation modifiers. Conclusions Numerous mutations identified raise the possibility of targeted treatment strategies for patients either at diagnosis or relapse supporting the use of sequencing-based diagnostics in myeloma to help guide therapy as more epigenetic targeted agents become available. PMID:27235425
Okamoto, Masaaki; Naito, Mariko; Miyanohara, Mayu; Imai, Susumu; Nomura, Yoshiaki; Saito, Wataru; Momoi, Yasuko; Takada, Kazuko; Miyabe-Nishiwaki, Takako; Tomonaga, Masaki; Hanada, Nobuhiro
2016-12-01
Streptococcus troglodytae TKU31 was isolated from the oral cavity of a chimpanzee (Pan troglodytes) and was found to be the most closely related species of the mutans group streptococci to Streptococcus mutans. The complete sequence of TKU31 genome consists of a single circular chromosome that is 2,097,874 base pairs long and has a G + C content of 37.18%. It possesses 2082 coding sequences (CDSs), 65 tRNAs and five rRNA operons (15 rRNAs). Two clustered regularly interspaced short palindromic repeats, six insertion sequences and two predicted prophage elements were identified. The genome of TKU31 harbors some putative virulence associated genes, including gtfB, gtfC and gtfD genes encoding glucosyltransferase and gbpA, gbpB, gbpC and gbpD genes encoding glucan-binding cell wall-anchored protein. The deduced amino acid identity of the rhamnose-glucose polysaccharide F gene (rgpF), which is one of the serotype determinants, is 91% identical with that of S. mutans LJ23 (serotype k) strain. However, two other virulence-associated genes cnm and cbm, which encode the collagen-binding proteins, were not found in the TKU31 genome. The complete genome sequence of S. troglodytae TKU31 has been deposited at DDBJ/European Nucleotide Archive/GenBank under the accession no. AP014612. © 2016 The Societies and John Wiley & Sons Australia, Ltd.
Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi
2014-01-03
Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after split from A. thaliana. Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa species have undergone stronger negative selection than those in B .oleracea species. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome triplication analysis in B. oleracea, B. rapa and A. thaliana genomes, our study provides insight into the evolutionary history of NBS-encoding genes after divergence of A. thaliana and the Brassica lineage. These results together with expression pattern analysis of NBS-encoding orthologous genes provide useful resource for functional characterization of these genes and genetic improvement of relevant crops.
Polymeric peptide pigments with sequence-encoded properties
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lampel, Ayala; McPhee, Scott A.; Park, Hang-Ah
Melanins are a family of heterogeneous polymeric pigments that provide ultraviolet (UV) light protection, structural support, coloration, and free radical scavenging. Formed by oxidative oligomerization of catecholic small molecules, the physical properties of melanins are influenced by covalent and noncovalent disorder. We report the use of tyrosine-containing tripeptides as tunable precursors for polymeric pigments. In these structures, phenols are presented in a (supra-)molecular context dictated by the positions of the amino acids in the peptide sequence. Oxidative polymerization can be tuned in a sequence-dependent manner, resulting in peptide sequence–encoded properties such as UV absorbance, morphology, coloration, and electrochemical properties overmore » a considerable range. Short peptides have low barriers to application and can be easily scaled, suggesting near-term applications in cosmetics and biomedicine.« less
Conserved noncoding sequences (CNSs) in higher plants.
Freeling, Michael; Subramaniam, Shabarinath
2009-04-01
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
USDA-ARS?s Scientific Manuscript database
Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...
Genetic engineering of syringyl-enriched lignin in plants
Chiang, Vincent Lee; Li, Laigeng
2004-11-02
The present invention relates to a novel DNA sequence, which encodes a previously unidentified lignin biosynthetic pathway enzyme, sinapyl alcohol dehydrogenase (SAD) that regulates the biosynthesis of syringyl lignin in plants. Also provided are methods for incorporating this novel SAD gene sequence or substantially similar sequences into a plant genome for genetic engineering of syringyl-enriched lignin in plants.
USDA-ARS?s Scientific Manuscript database
We recently described the complete genome of enterohemorrhagic Escherichia coli (EHEC) O157:H7 strain NADC 6564, an isolate of strain 86-24 linked to the 1986 disease outbreak. In the current study, we compared the chromosomal sequence of NADC 6564 to the well-characterized chromosomal sequences of ...
Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K; Fryszczyn, Bartlomiej G; Fox, George E; Tirumalai, Madhan R; Liu, Yamei; Kim, Sun; Kehoe, David M; Weinstock, George M
2015-05-07
Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. Copyright © 2015 Yerrapragada et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reiser, Steven E.; Somerville, Chris R.
The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
Taboo words: the effect of emotion on memory for peripheral information.
Guillet, Rebecca; Arndt, Jason
2009-09-01
In three experiments, we examined memory for peripheral information that occurred in the same context as emotion-inducing information. In the first two experiments, participants studied either a sentence (Experiment 1) or a pair of words (Experiments 2A-2C) containing a neutral peripheral word, as well as a neutral, negative-valence, or taboo word, to induce an emotional response. At retrieval, the participants were asked to recall the neutral peripheral word from a sentence fragment or emotion-inducing word cue. In Experiment 3, we presented word pairs at encoding and tested memory with associative recognition. In all three experiments, memory for peripheral words was enhanced when it was encoded in the presence of emotionally arousing taboo words but not when it was encoded in the presence of words that were only negative in valence. These data are consistent with priority-binding theory (MacKay et al., 2004) and inconsistent with the attention-narrowing hypothesis (Easterbrook, 1959), as well as with object-based binding theory (Mather, 2007).
Bowring, Janine; Neamah, Maan M; Donderis, Jorge; Mir-Sanchis, Ignacio; Alite, Christian; Ciges-Tomas, J Rafael; Maiques, Elisa; Medmedov, Iltyar; Marina, Alberto; Penadés, José R
2017-08-08
Targeting conserved and essential processes is a successful strategy to combat enemies. Remarkably, the clinically important Staphylococcus aureus pathogenicity islands (SaPIs) use this tactic to spread in nature. SaPIs reside passively in the host chromosome, under the control of the SaPI-encoded master repressor, Stl. It has been assumed that SaPI de-repression is effected by specific phage proteins that bind to Stl, initiating the SaPI cycle. Different SaPIs encode different Stl repressors, so each targets a specific phage protein for its de-repression. Broadening this narrow vision, we report here that SaPIs ensure their promiscuous transfer by targeting conserved phage mechanisms. This is accomplished because the SaPI Stl repressors have acquired different domains to interact with unrelated proteins, encoded by different phages, but in all cases performing the same conserved function. This elegant strategy allows intra- and inter-generic SaPI transfer, highlighting these elements as one of nature's most fascinating subcellular parasites.
Lowe, J.B.; Lennon, G.; Rouquier, S.; Giorgi, D.; Kelly, R.J.
1998-09-15
The gene encoding GDP-L-fucose: {beta}-D-Galactoside 2-{alpha}-Lfucosyltransferase has been cloned, and a mutation in this gene has been found to be responsible for an individual being a non-secretor. 30 figs.
Opposite Effects of Cortisol on Consolidation of Temporal Sequence Memory during Waking and Sleep
ERIC Educational Resources Information Center
Wilhelm, Ines; Wagner, Ullrich; Born, Jan
2011-01-01
Memory functions involve three stages: encoding, consolidation, and retrieval. Modulating effects of glucocorticoids (GCs) have been consistently observed for declarative memory with GCs enhancing encoding and impairing retrieval, but surprisingly, little is known on how GCs affect memory consolidation. Studies in rats suggest a beneficial effect…
cDNA encoding a polypeptide including a hevein sequence
Raikhel, N.V.; Broekaert, W.F.; Namhai Chua; Kush, A.
1993-02-16
A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids.
Lowe, John B.; Lennon, Gregory; Rouquier, Sylvie; Giorgi, Dominique; Kelly, Robert J.
1998-01-01
The gene encoding GDP-L-fucose: .beta.-D-Galactoside 2-.alpha.-L-fucosyltransferase has been cloned, and a mutation in this gene has been found to be responsible for an individual being a non-secretor.
Identification in Marinomonas mediterranea of a novel quinoprotein with glycine oxidase activity.
Campillo-Brocal, Jonatan Cristian; Lucas-Elio, Patricia; Sanchez-Amat, Antonio
2013-08-01
A novel enzyme with lysine-epsilon oxidase activity was previously described in the marine bacterium Marinomonas mediterranea. This enzyme differs from other l-amino acid oxidases in not being a flavoprotein but containing a quinone cofactor. It is encoded by an operon with two genes lodA and lodB. The first one codes for the oxidase, while the second one encodes a protein required for the expression of the former. Genome sequencing of M. mediterranea has revealed that it contains two additional operons encoding proteins with sequence similarity to LodA. In this study, it is shown that the product of one of such genes, Marme_1655, encodes a protein with glycine oxidase activity. This activity shows important differences in terms of substrate range and sensitivity to inhibitors to other glycine oxidases previously described which are flavoproteins synthesized by Bacillus. The results presented in this study indicate that the products of the genes with different degrees of similarity to lodA detected in bacterial genomes could constitute a reservoir of different oxidases. © 2013 The Authors. Microbiology Open published by John Wiley & Sons Ltd.
Inventory of high-abundance mRNAs in skeletal muscle of normal men.
Welle, S; Bhatt, K; Thornton, C A
1999-05-01
G42875rial analysis of gene expression (SAGE) method was used to generate a catalog of 53,875 short (14 base) expressed sequence tags from polyadenylated RNA obtained from vastus lateralis muscle of healthy young men. Over 12,000 unique tags were detected. The frequency of occurrence of each tag reflects the relative abundance of the corresponding mRNA. The mRNA species that were detected 10 or more times, each comprising >/=0.02% of the mRNA population, accounted for 64% of the mRNA mass but <10% of the total number of mRNA species detected. Almost all of the abundant tags matched mRNA or EST sequences cataloged in GenBank. Mitochondrial transcripts accounted for approximately 20% of the polyadenylated RNA. Transcripts encoding proteins of the myofibrils were the most abundant nuclear-encoded mRNAs. Transcripts encoding ribosomal proteins, and those encoding proteins involved in energy metabolism, also were very abundant. The database can be used as a reference for investigations of alterations in gene expression associated with conditions that influence muscle function, such as muscular dystrophies, aging, and exercise.
Absolute Position Encoders With Vertical Image Binning
NASA Technical Reports Server (NTRS)
Leviton, Douglas B.
2005-01-01
Improved optoelectronic patternrecognition encoders that measure rotary and linear 1-dimensional positions at conversion rates (numbers of readings per unit time) exceeding 20 kHz have been invented. Heretofore, optoelectronic pattern-recognition absoluteposition encoders have been limited to conversion rates <15 Hz -- too low for emerging industrial applications in which conversion rates ranging from 1 kHz to as much as 100 kHz are required. The high conversion rates of the improved encoders are made possible, in part, by use of vertically compressible or binnable (as described below) scale patterns in combination with modified readout sequences of the image sensors [charge-coupled devices (CCDs)] used to read the scale patterns. The modified readout sequences and the processing of the images thus read out are amenable to implementation by use of modern, high-speed, ultra-compact microprocessors and digital signal processors or field-programmable gate arrays. This combination of improvements makes it possible to greatly increase conversion rates through substantial reductions in all three components of conversion time: exposure time, image-readout time, and image-processing time.
Identification of an opd (organophosphate degradation) gene in an Agrobacterium isolate.
Horne, Irene; Sutherland, Tara D; Harcourt, Rebecca L; Russell, Robyn J; Oakeshott, John G
2002-07-01
We isolated a bacterial strain, Agrobacterium radiobacter P230, which can hydrolyze a wide range of organophosphate (OP) insecticides. A gene encoding a protein involved in OP hydrolysis was cloned from A. radiobacter P230 and sequenced. This gene (called opdA) had sequence similarity to opd, a gene previously shown to encode an OP-hydrolyzing enzyme in Flavobacterium sp. strain ATCC 27551 and Brevundimonas diminuta MG. Insertional mutation of the opdA gene produced a strain lacking the ability to hydrolyze OPs, suggesting that this is the only gene encoding an OP-hydrolyzing enzyme in A. radiobacter P230. The OPH and OpdA proteins, encoded by opd and opdA, respectively, were overexpressed and purified as maltose-binding proteins, and the maltose-binding protein moiety was cleaved and removed. Neither protein was able to hydrolyze the aliphatic OP malathion. The kinetics of the two proteins for diethyl OPs were comparable. For dimethyl OPs, OpdA had a higher k(cat) than OPH. It was also capable of hydrolyzing the dimethyl OPs phosmet and fenthion, which were not hydrolyzed at detectable levels by OPH.
Banks, David J; Porcella, Stephen F; Barbian, Kent D; Beres, Stephen B; Philips, Lauren E; Voyich, Jovanka M; DeLeo, Frank R; Martin, Judith M; Somerville, Greg A; Musser, James M
2004-08-15
We describe the genome sequence of a macrolide-resistant strain (MGAS10394) of serotype M6 group A Streptococcus (GAS). The genome is 1,900,156 bp in length, and 8 prophage-like elements or remnants compose 12.4% of the chromosome. A 8.3-kb prophage remnant encodes the SpeA4 variant of streptococcal pyrogenic exotoxin A. The genome of strain MGAS10394 contains a chimeric genetic element composed of prophage genes and a transposon encoding the mefA gene conferring macrolide resistance. This chimeric element also has a gene encoding a novel surface-exposed protein (designated "R6 protein"), with an LPKTG cell-anchor motif located at the carboxyterminus. Surface expression of this protein was confirmed by flow cytometry. Humans with GAS pharyngitis caused by serotype M6 strains had antibody against the R6 protein present in convalescent, but not acute, serum samples. Our studies add to the theme that GAS prophage-encoded extracellular proteins contribute to host-pathogen interactions in a strain-specific fashion.
Wu, Jia Qian; Du, Jiang; Rozowsky, Joel; Zhang, Zhengdong; Urban, Alexander E; Euskirchen, Ghia; Weissman, Sherman; Gerstein, Mark; Snyder, Michael
2008-01-03
Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Alcántara, Cristina; Sarmiento-Rubiano, Luz Adriana; Monedero, Vicente; Deutscher, Josef; Pérez-Martínez, Gaspar; Yebra, María J.
2008-01-01
Sequence analysis of the five genes (gutRMCBA) downstream from the previously described sorbitol-6-phosphate dehydrogenase-encoding Lactobacillus casei gutF gene revealed that they constitute a sorbitol (glucitol) utilization operon. The gutRM genes encode putative regulators, while the gutCBA genes encode the EIIC, EIIBC, and EIIA proteins of a phosphoenolpyruvate-dependent sorbitol phosphotransferase system (PTSGut). The gut operon is transcribed as a polycistronic gutFRMCBA messenger, the expression of which is induced by sorbitol and repressed by glucose. gutR encodes a transcriptional regulator with two PTS-regulated domains, a galactitol-specific EIIB-like domain (EIIBGat domain) and a mannitol/fructose-specific EIIA-like domain (EIIAMtl domain). Its inactivation abolished gut operon transcription and sorbitol uptake, indicating that it acts as a transcriptional activator. In contrast, cells carrying a gutB mutation expressed the gut operon constitutively, but they failed to transport sorbitol, indicating that EIIBCGut negatively regulates GutR. A footprint analysis showed that GutR binds to a 35-bp sequence upstream from the gut promoter. A sequence comparison with the presumed promoter region of gut operons from various firmicutes revealed a GutR consensus motif that includes an inverted repeat. The regulation mechanism of the L. casei gut operon is therefore likely to be operative in other firmicutes. Finally, gutM codes for a conserved protein of unknown function present in all sequenced gut operons. A gutM mutant, the first constructed in a firmicute, showed drastically reduced gut operon expression and sorbitol uptake, indicating a regulatory role also for GutM. PMID:18676710
Sols, Ignasi; DuBrow, Sarah; Davachi, Lila; Fuentemilla, Lluís
2017-11-20
Although everyday experiences unfold continuously over time, shifts in context, or event boundaries, can influence how those events come to be represented in memory [1-4]. Specifically, mnemonic binding across sequential representations is more challenging at context shifts, such that successful temporal associations are more likely to be formed within than across contexts [1, 2, 5-9]. However, in order to preserve a subjective sense of continuity, it is important that the memory system bridge temporally adjacent events, even if they occur in seemingly distinct contexts. Here, we used pattern similarity analysis to scalp electroencephalographic (EEG) recordings during a sequential learning task [2, 3] in humans and showed that the detection of event boundaries triggered a rapid memory reinstatement of the just-encoded sequence episode. Memory reactivation was detected rapidly (∼200-800 ms from the onset of the event boundary) and was specific to context shifts that were preceded by an event sequence with episodic content. Memory reinstatement was not observed during the sequential encoding of events within an episode, indicating that memory reactivation was induced specifically upon context shifts. Finally, the degree of neural similarity between neural responses elicited during sequence encoding and at event boundaries correlated positively with participants' ability to later link across sequences of events, suggesting a critical role in binding temporally adjacent events in long-term memory. Current results shed light onto the neural mechanisms that promote episodic encoding not only for information within the event, but also, importantly, in the ability to link across events to create a memory representation of continuous experience. Copyright © 2017 Elsevier Ltd. All rights reserved.
Liu, G Y; Gao, S Z
2009-01-01
The complete coding sequences of three sheep genes- BCKDHA, NAGA and HEXA were amplified using the reverse transcriptase polymerase chain reaction (RT-PCR), based on the conserved sequence information of the mouse or other mammals. The nucleotide sequences of these three genes revealed that the sheep BCKDHA gene encodes a protein of 313 amino acids which has high homology with the BCKDHA gene that encodes a protein of 447 amino acids that has high homology with the Branched chain keto acid dehydrogenase El, alpha polypeptide (BCKDHA) of five species chimpanzee (93%), human (96%), crab-eating macaque (93%), bovine (98%) and mouse (91%). The sheep NAGA gene encodes a protein of 411 amino acids that has high homology with the alpha-N-acetylgalactosaminidase (NAGA) of five species human (85%), bovine (94%), mouse (91%), rat (83%) and chicken (74%). The sheep HEXA gene encodes a protein of 529 amino acids that has high homology with the hexosaminidase A(HEXA) of five species bovine (98%), human (84%), Bornean orangután (84%), rat (80%) and mouse (81%). Finally these three novel sheep genes were assigned to GenelDs: 100145857, 100145858 and 100145856. The phylogenetic tree analysis revealed that the sheep BCKDHA, NAGA, and HEXA all have closer genetic relationships to the BCKDHA, NAGA, and HEXA of bovine. Tissue expression profile analysis was also carried out and results revealed that sheep BCKDHA, NAGA and HEXA genes were differentially expressed in tissues including muscle, heart, liver, fat, kidney, lung, small and large intestine. Our experiment is the first to establish the primary foundation for further research on these three sheep genes.
Baek, Ji Hyeong; Lee, Si Hyeock
2010-06-01
To search for novel transcripts encoding biologically active venom components, a subtractive cDNA library specific to the venom gland and sac (gland/sac) of a solitary hunting wasp species, Eumenes pomiformis Fabricius (1781), was constructed by suppression subtractive hybridization. A total of 541 expressed sequence tags (ESTs) were clustered and assembled into 102 contigs (31 multiple sequences and 71 singletons). In total, 37 cDNAs were found in the library via BLASTx searching and manual annotation. Eight contigs (337 ESTs) encoding short venom peptides (10 to 16 amino acids) occupied 62% of the library. The deduced amino acid sequence (78 amino acids) of a novel venom peptide transcript shared sequence similarity with trypsin inhibitors and dendrotoxin-like venom peptides known to be K(+) channel blockers, implying that this novel peptide may play a role in the paralysis of prey. In addition to phospholipase A2 and hyaluronidase, which are known to be the main components of wasp venoms, several transcripts encoding enzymes, including three metallopeptidases and a decarboxylase likely involved in the processing and activation of venomous proteins, peptides, amines, and neurotransmitters, were also isolated from the library. The presence of a transcript encoding a putative insulin/insulin-like peptide binding protein suggests that solitary hunting wasps use their venom to control their prey, leading to larval growth cessation. The abundance of these venom components in the venom gland/sac and in the alimentary canal was confirmed by quantitative real-time PCR. Discovery of venom gland/sac-specific transcripts should promote further studies on biologically active components in the venom of solitary hunting wasps. Copyright 2010 Elsevier Ltd. All rights reserved.
Radiofrequency artefacts in echoplanar imaging induced by two 1.5 T MR scanners in close proximity.
Li, X; Cui, J; Christopasak, S P; Kumar, A; Peng, Z-G
2014-06-01
The purpose of this study was to assess radio frequency (RF) artefacts in echoplanar imaging (EPI) induced by two 1.5 T MR scanners in close proximity and to find an effective method to correct them. Based on the intact shielding of rooms, experiments were performed by two MR scanners with similar centre frequencies. Phantom A (PA) was scanned in one scanner by EPI at different bandwidths (BWs). Simultaneously, phantom B was scanned in a fixed sequence for scanning with the other scanner. RF artefact gaps of PA, scanning time and the image signal-noise ratio (SNR) were measured and recorded. Statistical analysis was performed with the repeated-measures analysis of variance test. Based on findings obtained from PA, three healthy volunteers were studied at a conventional BW and a lower BW to observe the artefact variance. EPI RF artefacts were symmetrically situated in both sides of the image following the phase-encoding direction. The gap size of the artefact became larger and the SNR was significantly improved with a narrower BW. RF artefacts with a lower BW in volunteers presented the same characteristic as PA. For EPI RF artefacts produced by two 1.5 T MR scanners with approximately similar centre frequencies, we can reduce BWs in a suitable range to minimize the effect on MRI. MR scanners with the same field strength installed in the same vicinity might produce RF artefacts in the sequence at larger BWs. Reducing BWs properly is effective to control the position of artefacts and improve the image quality.
Santos, Leonardo N; Silva, Eduardo S; Santos, André S; De Sá, Pablo H; Ramos, Rommel T; Silva, Artur; Cooper, Philip J; Barreto, Maurício L; Loureiro, Sebastião; Pinheiro, Carina S; Alcantara-Neves, Neuza M; Pacheco, Luis G C
2016-07-01
Infection with helminthic parasites, including the soil-transmitted helminth Trichuris trichiura (human whipworm), has been shown to modulate host immune responses and, consequently, to have an impact on the development and manifestation of chronic human inflammatory diseases. De novo derivation of helminth proteomes from sequencing of transcriptomes will provide valuable data to aid identification of parasite proteins that could be evaluated as potential immunotherapeutic molecules in near future. Herein, we characterized the transcriptome of the adult stage of the human whipworm T. trichiura, using next-generation sequencing technology and a de novo assembly strategy. Nearly 17.6 million high-quality clean reads were assembled into 6414 contiguous sequences, with an N50 of 1606bp. In total, 5673 protein-encoding sequences were confidentially identified in the T. trichiura adult worm transcriptome; of these, 1013 sequences represent potential newly discovered proteins for the species, most of which presenting orthologs already annotated in the related species T. suis. A number of transcripts representing probable novel non-coding transcripts for the species T. trichiura were also identified. Among the most abundant transcripts, we found sequences that code for proteins involved in lipid transport, such as vitellogenins, and several chitin-binding proteins. Through a cross-species expression analysis of gene orthologs shared by T. trichiura and the closely related parasites T. suis and T. muris it was possible to find twenty-six protein-encoding genes that are consistently highly expressed in the adult stages of the three helminth species. Additionally, twenty transcripts could be identified that code for proteins previously detected by mass spectrometry analysis of protein fractions of the whipworm somatic extract that present immunomodulatory activities. Five of these transcripts were amongst the most highly expressed protein-encoding sequences in the T. trichiura adult worm. Besides, orthologs of proteins demonstrated to have potent immunomodulatory properties in related parasitic helminths were also predicted from the T. trichiura de novo assembled transcriptome. Copyright © 2016. Published by Elsevier B.V.
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-01-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-12-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.
Barbi, Florian; Bragalini, Claudia; Vallon, Laurent; Prudent, Elsa; Dubost, Audrey; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia
2014-01-01
Plant biomass degradation in soil is one of the key steps of carbon cycling in terrestrial ecosystems. Fungal saprotrophic communities play an essential role in this process by producing hydrolytic enzymes active on the main components of plant organic matter. Open questions in this field regard the diversity of the species involved, the major biochemical pathways implicated and how these are affected by external factors such as litter quality or climate changes. This can be tackled by environmental genomic approaches involving the systematic sequencing of key enzyme-coding gene families using soil-extracted RNA as material. Such an approach necessitates the design and evaluation of gene family-specific PCR primers producing sequence fragments compatible with high-throughput sequencing approaches. In the present study, we developed and evaluated PCR primers for the specific amplification of fungal CAZy Glycoside Hydrolase gene families GH5 (subfamily 5) and GH11 encoding endo-β-1,4-glucanases and endo-β-1,4-xylanases respectively as well as Basidiomycota class II peroxidases, corresponding to the CAZy Auxiliary Activity family 2 (AA2), active on lignin. These primers were experimentally validated using DNA extracted from a wide range of Ascomycota and Basidiomycota species including 27 with sequenced genomes. Along with the published primers for Glycoside Hydrolase GH7 encoding enzymes active on cellulose, the newly design primers were shown to be compatible with the Illumina MiSeq sequencing technology. Sequences obtained from RNA extracted from beech or spruce forest soils showed a high diversity and were uniformly distributed in gene trees featuring the global diversity of these gene families. This high-throughput sequencing approach using several degenerate primers constitutes a robust method, which allows the simultaneous characterization of the diversity of different fungal transcripts involved in plant organic matter degradation and may lead to the discovery of complex patterns in gene expression of soil fungal communities. PMID:25545363
Storing data encoded DNA in living organisms
Wong,; Pak C. , Wong; Kwong K. , Foote; Harlan, P [Richland, WA
2006-06-06
Current technologies allow the generation of artificial DNA molecules and/or the ability to alter the DNA sequences of existing DNA molecules. With a careful coding scheme and arrangement, it is possible to encode important information as an artificial DNA strand and store it in a living host safely and permanently. This inventive technology can be used to identify origins and protect R&D investments. It can also be used in environmental research to track generations of organisms and observe the ecological impact of pollutants. Today, there are microorganisms that can survive under extreme conditions. As well, it is advantageous to consider multicellular organisms as hosts for stored information. These living organisms can provide as memory housing and protection for stored data or information. The present invention provides well for data storage in a living organism wherein at least one DNA sequence is encoded to represent data and incorporated into a living organism.
Piscopo, Sara-Pier; Drouin, Guy
2014-05-01
Gene conversions are nonreciprocal sequence exchanges between genes. They are relatively common in Saccharomyces cerevisiae, but few studies have investigated the evolutionary fate of gene conversions or their functional impacts. Here, we analyze the evolution and impact of gene conversions between the two genes encoding 2-deoxyglucose-6-phosphate phosphatase in S. cerevisiae, Saccharomyces paradoxus and Saccharomyces mikatae. Our results demonstrate that the last half of these genes are subject to gene conversions among these three species. The greater similarity and the greater percentage of GC nucleotides in the converted regions, as well as the absence of long regions of adjacent common converted sites, suggest that these gene conversions are frequent and occur independently in all three species. The high frequency of these conversions probably result from the fact that they have little impact on the protein sequences encoded by these genes.
Small scale sequence automation pays big dividends
NASA Technical Reports Server (NTRS)
Nelson, Bill
1994-01-01
Galileo sequence design and integration are supported by a suite of formal software tools. Sequence review, however, is largely a manual process with reviewers scanning hundreds of pages of cryptic computer printouts to verify sequence correctness. Beginning in 1990, a series of small, PC based sequence review tools evolved. Each tool performs a specific task but all have a common 'look and feel'. The narrow focus of each tool means simpler operation, and easier creation, testing, and maintenance. Benefits from these tools are (1) decreased review time by factors of 5 to 20 or more with a concomitant reduction in staffing, (2) increased review accuracy, and (3) excellent returns on time invested.
Error control techniques for satellite and space communications
NASA Technical Reports Server (NTRS)
Costello, Daniel J., Jr.
1994-01-01
The unequal error protection capabilities of convolutional and trellis codes are studied. In certain environments, a discrepancy in the amount of error protection placed on different information bits is desirable. Examples of environments which have data of varying importance are a number of speech coding algorithms, packet switched networks, multi-user systems, embedded coding systems, and high definition television. Encoders which provide more than one level of error protection to information bits are called unequal error protection (UEP) codes. In this work, the effective free distance vector, d, is defined as an alternative to the free distance as a primary performance parameter for UEP convolutional and trellis encoders. For a given (n, k), convolutional encoder, G, the effective free distance vector is defined as the k-dimensional vector d = (d(sub 0), d(sub 1), ..., d(sub k-1)), where d(sub j), the j(exp th) effective free distance, is the lowest Hamming weight among all code sequences that are generated by input sequences with at least one '1' in the j(exp th) position. It is shown that, although the free distance for a code is unique to the code and independent of the encoder realization, the effective distance vector is dependent on the encoder realization.
Napping to renew learning capacity: enhanced encoding after stimulation of sleep slow oscillations.
Antonenko, Daria; Diekelmann, Susanne; Olsen, Cathrin; Born, Jan; Mölle, Matthias
2013-04-01
As well as consolidating memory, sleep has been proposed to serve a second important function for memory, i.e. to free capacities for the learning of new information during succeeding wakefulness. The slow wave activity (SWA) that is a hallmark of slow wave sleep could be involved in both functions. Here, we aimed to demonstrate a causative role for SWA in enhancing the capacity for encoding of information during subsequent wakefulness, using transcranial slow oscillation stimulation (tSOS) oscillating at 0.75 Hz to induce SWA in healthy humans during an afternoon nap. Encoding following the nap was tested for hippocampus-dependent declarative materials (pictures, word pairs, and word lists) and procedural skills (finger sequence tapping). As compared with a sham stimulation control condition, tSOS during the nap enhanced SWA and significantly improved subsequent encoding on all three declarative tasks (picture recognition, cued recall of word pairs, and free recall of word lists), whereas procedural finger sequence tapping skill was not affected. Our results indicate that sleep SWA enhances the capacity for encoding of declarative materials, possibly by down-scaling hippocampal synaptic networks that were potentiated towards saturation during the preceding period of wakefulness. © 2013 Federation of European Neuroscience Societies and Blackwell Publishing Ltd.
How to kill the honey bee larva: genomic potential and virulence mechanisms of Paenibacillus larvae.
Djukic, Marvin; Brzuszkiewicz, Elzbieta; Fünfhaus, Anne; Voss, Jörn; Gollnow, Kathleen; Poppinga, Lena; Liesegang, Heiko; Garcia-Gonzalez, Eva; Genersch, Elke; Daniel, Rolf
2014-01-01
Paenibacillus larvae, a Gram positive bacterial pathogen, causes American Foulbrood (AFB), which is the most serious infectious disease of honey bees. In order to investigate the genomic potential of P. larvae, two strains belonging to two different genotypes were sequenced and used for comparative genome analysis. The complete genome sequence of P. larvae strain DSM 25430 (genotype ERIC II) consisted of 4,056,006 bp and harbored 3,928 predicted protein-encoding genes. The draft genome sequence of P. larvae strain DSM 25719 (genotype ERIC I) comprised 4,579,589 bp and contained 4,868 protein-encoding genes. Both strains harbored a 9.7 kb plasmid and encoded a large number of virulence-associated proteins such as toxins and collagenases. In addition, genes encoding large multimodular enzymes producing nonribosomally peptides or polyketides were identified. In the genome of strain DSM 25719 seven toxin associated loci were identified and analyzed. Five of them encoded putatively functional toxins. The genome of strain DSM 25430 harbored several toxin loci that showed similarity to corresponding loci in the genome of strain DSM 25719, but were non-functional due to point mutations or disruption by transposases. Although both strains cause AFB, significant differences between the genomes were observed including genome size, number and composition of transposases, insertion elements, predicted phage regions, and strain-specific island-like regions. Transposases, integrases and recombinases are important drivers for genome plasticity. A total of 390 and 273 mobile elements were found in strain DSM 25430 and strain DSM 25719, respectively. Comparative genomics of both strains revealed acquisition of virulence factors by horizontal gene transfer and provided insights into evolution and pathogenicity.
Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.
2011-01-01
Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel) domesticated under semi-desert environments, is well adapted to tolerate and survive against severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of full length cDNA of encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in pET-b expression vector. The sequence analysis of HSPA6 gene showed 1932 bp-long open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GeneBank (accession number HQ214118.1). The BLAST analysis indicated that C. dromedaries HSPA6 gene nucleotides shared high similarity (77–91%) with heat shock gene nucleotide of other mammals. The deduced 643 amino acid sequences (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. The comparative analyses of camel HSPA6 protein sequences with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). Predicted camel HSPA6 protein structure using Protein 3D structural analysis high similarities with human and mouse HSPs. Taken together, this study indicates that the cDNA sequences of HSPA6 gene and its amino acid and protein structure from the Arabian camel are highly conserved and have similarities with other mammalian species. PMID:21845074
Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses
Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.
2004-01-01
The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3??? N-U-P-M-F-HN-L 5???, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown), encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.
Drosophila Nora virus capsid proteins differ from those of other picorna-like viruses.
Ekström, Jens-Ola; Habayeb, Mazen S; Srivastava, Vaibhav; Kieselbach, Thomas; Wingsle, Gunnar; Hultmark, Dan
2011-09-01
The recently discovered Nora virus from Drosophila melanogaster is a single-stranded RNA virus. Its published genomic sequence encodes a typical picorna-like cassette of replicative enzymes, but no capsid proteins similar to those in other picorna-like viruses. We have now done additional sequencing at the termini of the viral genome, extending it by 455 nucleotides at the 5' end, but no more coding sequence was found. The completeness of the final 12,333-nucleotide sequence was verified by the production of infectious virus from the cloned genome. To identify the capsid proteins, we purified Nora virus particles and analyzed their proteins by mass spectrometry. Our results show that the capsid is built from three major proteins, VP4A, B and C, encoded in the fourth open reading frame of the viral genome. The viral particles also contain traces of a protein from the third open reading frame, VP3. VP4A and B are not closely related to other picorna-like virus capsid proteins in sequence, but may form similar jelly roll folds. VP4C differs from the others and is predicted to have an essentially α-helical conformation. In a related virus, identified from EST database sequences from Nasonia parasitoid wasps, VP4C is encoded in a separate open reading frame, separated from VP4A and B by a frame-shift. This opens a possibility that VP4C is produced in non-equimolar quantities. Altogether, our results suggest that the Nora virus capsid has a different protein organization compared to the order Picornavirales. Copyright © 2011 Elsevier B.V. All rights reserved.
The organisation and interviral homologies of genes at the 3' end of tobacco rattle virus RNA1
Boccara, Martine; Hamilton, William D. O.; Baulcombe, David C.
1986-01-01
The RNA1 of tobacco rattle virus (TRV) has been cloned as cDNA and the nucleotide sequence determined of 2 kb from the 3'-terminal region. The sequence contains three long open reading frames. One of these starts 5' of the cDNA and probably corresponds to the carboxy-terminal sequence of a 170-K protein encoded on RNA1. The deduced protein sequence from this reading frame shows homology with the putative replicases of tobacco mosaic virus (TMV) and tricornaviruses. The location of the second open reading frame, which encodes a 29-K polypeptide, was shown by Northern blot analysis to coincide with a 1.6-kb subgenomic RNA. The validity of this reading frame was confirmed by showing that the cDNA extending over this region could be transcribed and translated in vitro to produce a polypeptide of the predicted size which co-migrates in electrophoresis with a translation product of authentic viral RNA. The sequence of this 29-K polypeptide showed homology with two regions in the 30-K protein of TMV. This homology includes positions in the TMV 30-K protein where mutations have been identified which affect the transport of virus between cells. The third open reading frame encodes a potential 16-K protein and was shown by Northern blot hybridisation to be contained within the region of a 0.7-kb subgenomic RNA which is found in cellular RNA of infected cells but not virus particles. The many similarities between TRV and TMV in viral morphology, gene organisation and sequence suggest that these two viral groups may share a common viral ancestor. ImagesFig. 2.Fig. 3. PMID:16453668
Wong, Gerard; Leckie, Christopher; Gorringe, Kylie L; Haviv, Izhak; Campbell, Ian G; Kowalczyk, Adam
2010-04-15
High-density single nucleotide polymorphism (SNP) genotyping arrays are efficient and cost effective platforms for the detection of copy number variation (CNV). To ensure accuracy in probe synthesis and to minimize production costs, short oligonucleotide probe sequences are used. The use of short probe sequences limits the specificity of binding targets in the human genome. The specificity of these short probeset sequences has yet to be fully analysed against a normal reference human genome. Sequence similarity can artificially elevate or suppress copy number measurements, and hence reduce the reliability of affected probe readings. For the purpose of detecting narrow CNVs reliably down to the width of a single probeset, sequence similarity is an important issue that needs to be addressed. We surveyed the Affymetrix Human Mapping SNP arrays for probeset sequence similarity against the reference human genome. Utilizing sequence similarity results, we identified a collection of fine-scaled putative CNVs between gender from autosomal probesets whose sequence matches various loci on the sex chromosomes. To detect these variations, we utilized our statistical approach, Detecting REcurrent Copy number change using rank-order Statistics (DRECS), and showed that its performance was superior and more stable than the t-test in detecting CNVs. Through the application of DRECS on the HapMap population datasets with multi-matching probesets filtered, we identified biologically relevant SNPs in aberrant regions across populations with known association to physical traits, such as height, covered by the span of a single probe. This provided empirical confirmation of the existence of naturally occurring narrow CNVs as well as the sensitivity of the Affymetrix SNP array technology in detecting them. The MATLAB implementation of DRECS is available at http://ww2.cs.mu.oz.au/ approximately gwong/DRECS/index.html.
Swanson, D S; Pan, X; Musser, J M
1996-01-01
Mycobacterium scrofulaceum is most commonly recovered from children with cervical lymphadenitis, although it also accounts for approximately 2% of the mycobacterial infections in AIDS patients. Species assignment of M. scrofulaceum isolated by conventional techniques can be difficult and time-consuming. To develop a strategy for rapid species assignment of these organisms, a 360-bp region of the gene (hsp65) encoding a 65-kDa heat shock protein in 37 isolates from diverse sources was sequenced. Eight hsp65 alleles were identified, and these sequences formed phylogenetic clusters and lineages largely distinct from other Mycobacterium species. There was incomplete correlation between serovar designation and hsp65 allele assignment. The hsp65 data correlated strongly with the results of sequence analysis of the gene coding for 16S rRNA. Automated DNA sequencing of a 360-bp region of the hsp65 gene provides a rapid and unambiguous method for species assignment of these acid-fast organisms for diagnostic purposes. PMID:8940463
Amalric, Marie; Wang, Liping; Pica, Pierre; Figueira, Santiago; Sigman, Mariano; Dehaene, Stanislas
2017-01-01
During language processing, humans form complex embedded representations from sequential inputs. Here, we ask whether a "geometrical language" with recursive embedding also underlies the human ability to encode sequences of spatial locations. We introduce a novel paradigm in which subjects are exposed to a sequence of spatial locations on an octagon, and are asked to predict future locations. The sequences vary in complexity according to a well-defined language comprising elementary primitives and recursive rules. A detailed analysis of error patterns indicates that primitives of symmetry and rotation are spontaneously detected and used by adults, preschoolers, and adult members of an indigene group in the Amazon, the Munduruku, who have a restricted numerical and geometrical lexicon and limited access to schooling. Furthermore, subjects readily combine these geometrical primitives into hierarchically organized expressions. By evaluating a large set of such combinations, we obtained a first view of the language needed to account for the representation of visuospatial sequences in humans, and conclude that they encode visuospatial sequences by minimizing the complexity of the structured expressions that capture them.
Amalric, Marie; Wang, Liping; Figueira, Santiago; Sigman, Mariano; Dehaene, Stanislas
2017-01-01
During language processing, humans form complex embedded representations from sequential inputs. Here, we ask whether a “geometrical language” with recursive embedding also underlies the human ability to encode sequences of spatial locations. We introduce a novel paradigm in which subjects are exposed to a sequence of spatial locations on an octagon, and are asked to predict future locations. The sequences vary in complexity according to a well-defined language comprising elementary primitives and recursive rules. A detailed analysis of error patterns indicates that primitives of symmetry and rotation are spontaneously detected and used by adults, preschoolers, and adult members of an indigene group in the Amazon, the Munduruku, who have a restricted numerical and geometrical lexicon and limited access to schooling. Furthermore, subjects readily combine these geometrical primitives into hierarchically organized expressions. By evaluating a large set of such combinations, we obtained a first view of the language needed to account for the representation of visuospatial sequences in humans, and conclude that they encode visuospatial sequences by minimizing the complexity of the structured expressions that capture them. PMID:28125595
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design
Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven
2003-01-01
We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
Heinke, Florian; Bittrich, Sebastian; Kaiser, Florian; Labudde, Dirk
2016-01-01
To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represent three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can aid to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial phase of structure-based studies. In this respect, the web server can be a valuable tool, as users are allowed to process multiple structures, quickly switch between results, and interact with generated visualizations in an intuitive manner. The SequenceCEROSENE web server is available at https://biosciences.hs-mittweida.de/seqcerosene.
USDA-ARS?s Scientific Manuscript database
Molecular epidemiology and evolution of foot-and-mouth disease virus (FMDV) are widely studied using genomic sequences encoding VP1, the capsid protein containing the most relevant antigenic domains. Although sequencing of the full viral genome is not used as a routine diagnostic or surveillance too...
ERIC Educational Resources Information Center
Gagnon, Sylvain; Bedard, Marie-Josee; Turcotte, Josee
2005-01-01
Recent findings [Turcotte, Gagnon, & Poirier, 2005. The effect of old age on the learning of supra-span sequences. "Psychology and Aging," 20, 251-260.] indicate that incidental learning of visuo-spatial supra-span sequences through immediate serial recall declines with old age (Hebb's paradigm). In this study, we examined whether…
Sweet Taste Receptor Gene Variation and Aspartame Taste in Primates and Other Species
Li, Xia; Bachmanov, Alexander A.; Maehashi, Kenji; Li, Weihua; Lim, Raymond; Brand, Joseph G.; Beauchamp, Gary K.; Reed, Danielle R.; Thai, Chloe
2011-01-01
Aspartame is a sweetener added to foods and beverages as a low-calorie sugar replacement. Unlike sugars, which are apparently perceived as sweet and desirable by a range of mammals, the ability to taste aspartame varies, with humans, apes, and Old World monkeys perceiving aspartame as sweet but not other primate species. To investigate whether the ability to perceive the sweetness of aspartame correlates with variations in the DNA sequence of the genes encoding sweet taste receptor proteins, T1R2 and T1R3, we sequenced these genes in 9 aspartame taster and nontaster primate species. We then compared these sequences with sequences of their orthologs in 4 other nontasters species. We identified 9 variant sites in the gene encoding T1R2 and 32 variant sites in the gene encoding T1R3 that distinguish aspartame tasters and nontasters. Molecular docking of aspartame to computer-generated models of the T1R2 + T1R3 receptor dimer suggests that species variation at a secondary, allosteric binding site in the T1R2 protein is the most likely origin of differences in perception of the sweetness of aspartame. These results identified a previously unknown site of aspartame interaction with the sweet receptor and suggest that the ability to taste aspartame might have developed during evolution to exploit a specialized food niche. PMID:21414996
Sweet taste receptor gene variation and aspartame taste in primates and other species.
Li, Xia; Bachmanov, Alexander A; Maehashi, Kenji; Li, Weihua; Lim, Raymond; Brand, Joseph G; Beauchamp, Gary K; Reed, Danielle R; Thai, Chloe; Floriano, Wely B
2011-06-01
Aspartame is a sweetener added to foods and beverages as a low-calorie sugar replacement. Unlike sugars, which are apparently perceived as sweet and desirable by a range of mammals, the ability to taste aspartame varies, with humans, apes, and Old World monkeys perceiving aspartame as sweet but not other primate species. To investigate whether the ability to perceive the sweetness of aspartame correlates with variations in the DNA sequence of the genes encoding sweet taste receptor proteins, T1R2 and T1R3, we sequenced these genes in 9 aspartame taster and nontaster primate species. We then compared these sequences with sequences of their orthologs in 4 other nontasters species. We identified 9 variant sites in the gene encoding T1R2 and 32 variant sites in the gene encoding T1R3 that distinguish aspartame tasters and nontasters. Molecular docking of aspartame to computer-generated models of the T1R2 + T1R3 receptor dimer suggests that species variation at a secondary, allosteric binding site in the T1R2 protein is the most likely origin of differences in perception of the sweetness of aspartame. These results identified a previously unknown site of aspartame interaction with the sweet receptor and suggest that the ability to taste aspartame might have developed during evolution to exploit a specialized food niche.
Characterization of AFLAV, a Tf1/Sushi retrotransposon from Aspergillus flavus.
Hua, Sui-Sheng T; Tarun, Alice S; Pandey, Sonal N; Chang, Leo; Chang, Perng-Kuang
2007-02-01
The plasmid, pAF28, a genomic clone from Aspergillus flavus NRRL 6541, has been used as a hybridization probe to fingerprint A. flavus strains isolated in corn and peanut fields. The insert of pAF28 contains a 4.5 kb region which encodes a truncated retrotransposon (AfRTL-1). In search for a full-length and intact copy of retrotransposon, we exploited a novel PCR cloning strategy by amplifying a 3.4 kb region from the genomic DNA of A. flavus NRRL 6541. The fragment was cloned into pCR 4-TOPO. Sequence analysis confirmed that this region encoded putative domains of partial reverse transcriptase, RNase H, and integrase of the predicted retrotransposon. The two flanking long terminal repeats (LTRs) and the sequence between them comprise a putative full-length LTR retrotransposon of 7799 bp in length. This intact retrotransposon sequence is named AFLAV (A. flavus Retrotransposon). The order of the predicted catalytic domains in the polyprotein (Pol) placed AFLAV in the Tf1/sushi subgroup of the Ty3/gypsy retrotransposon family. Primers derived from AFLAV sequence were used to screen this retrotransposon in other strains of A. flavus. More than fifty strains of A. flavus isolated from different geological origins were surveyed and the results show that many strains have extensive deletions in the regions encoding the capsid (Gag) and Pol.
Kohli, Gurjeet S; Campbell, Katrina; John, Uwe; Smith, Kirsty F; Fraga, Santiago; Rhodes, Lesley L; Murray, Shauna A
2017-09-01
Gambierdiscus, a benthic dinoflagellate, produces ciguatoxins that cause the human illness Ciguatera. Ciguatoxins are polyether ladder compounds that have a polyketide origin, indicating that polyketide synthases (PKS) are involved in their production. We sequenced transcriptomes of Gambierdiscus excentricus and Gambierdiscus polynesiensis and found 264 contigs encoding single domain ketoacyl synthases (KS; G. excentricus: 106, G. polynesiensis: 143) and ketoreductases (KR; G. excentricus: 7, G. polynesiensis: 8) with sequence similarity to type I PKSs, as reported in other dinoflagellates. In addition, 24 contigs (G. excentricus: 3, G. polynesiensis: 21) encoding multiple PKS domains (forming typical type I PKSs modules) were found. The proposed structure produced by one of these megasynthases resembles a partial carbon backbone of a polyether ladder compound. Seventeen contigs encoding single domain KS, KR, s-malonyltransacylase, dehydratase and enoyl reductase with sequence similarity to type II fatty acid synthases (FAS) in plants were found. Type I PKS and type II FAS genes were distinguished based on the arrangement of domains on the contigs and their sequence similarity and phylogenetic clustering with known PKS/FAS genes in other organisms. This differentiation of PKS and FAS pathways in Gambierdiscus is important, as it will facilitate approaches to investigating toxin biosynthesis pathways in dinoflagellates. © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Mauchline, Tim H.; Knox, Rachel; Mohan, Sharad; Powers, Stephen J.; Kerry, Brian R.; Davies, Keith G.; Hirsch, Penny R.
2011-01-01
Protein-encoding and 16S rRNA genes of Pasteuria penetrans populations from a wide range of geographic locations were examined. Most interpopulation single nucleotide polymorphisms (SNPs) were detected in the 16S rRNA gene. However, in order to fully resolve all populations, these were supplemented with SNPs from protein-encoding genes in a multilocus SNP typing approach. Examination of individual 16S rRNA gene sequences revealed the occurrence of “cryptic” SNPs which were not present in the consensus sequences of any P. penetrans population. Additionally, hierarchical cluster analysis separated P. penetrans 16S rRNA gene clones into four groups, and one of which contained sequences from the most highly passaged population, demonstrating that it is possible to manipulate the population structure of this fastidious bacterium. The other groups were made from representatives of the other populations in various proportions. Comparison of sequences among three Pasteuria species, namely, P. penetrans, P. hartismeri, and P. ramosa, showed that the protein-encoding genes provided greater discrimination than the 16S rRNA gene. From these findings, we have developed a toolbox for the discrimination of Pasteuria at both the inter- and intraspecies levels. We also provide a model to monitor genetic variation in other obligate hyperparasites and difficult-to-culture microorganisms. PMID:21803895
Mauchline, Tim H; Knox, Rachel; Mohan, Sharad; Powers, Stephen J; Kerry, Brian R; Davies, Keith G; Hirsch, Penny R
2011-09-01
Protein-encoding and 16S rRNA genes of Pasteuria penetrans populations from a wide range of geographic locations were examined. Most interpopulation single nucleotide polymorphisms (SNPs) were detected in the 16S rRNA gene. However, in order to fully resolve all populations, these were supplemented with SNPs from protein-encoding genes in a multilocus SNP typing approach. Examination of individual 16S rRNA gene sequences revealed the occurrence of "cryptic" SNPs which were not present in the consensus sequences of any P. penetrans population. Additionally, hierarchical cluster analysis separated P. penetrans 16S rRNA gene clones into four groups, and one of which contained sequences from the most highly passaged population, demonstrating that it is possible to manipulate the population structure of this fastidious bacterium. The other groups were made from representatives of the other populations in various proportions. Comparison of sequences among three Pasteuria species, namely, P. penetrans, P. hartismeri, and P. ramosa, showed that the protein-encoding genes provided greater discrimination than the 16S rRNA gene. From these findings, we have developed a toolbox for the discrimination of Pasteuria at both the inter- and intraspecies levels. We also provide a model to monitor genetic variation in other obligate hyperparasites and difficult-to-culture microorganisms.
Cloning and sequence analysis of the Antheraea pernyi nucleopolyhedrovirus gp64 gene.
Wang, Wenbing; Zhu, Shanying; Wang, Liqun; Yu, Feng; Shen, Weide
2005-12-01
Frequent outbreaks of the purulence disease of Chinese oak silkworm are reported in Middle and Northeast China. The disease is produced by the pathogen Antheraea pernyi nucleopolyhedrovirus (AnpeNPV). To obtain molecular information of the virus, the polyhedra of AnpeNPV were purified and characterized. The genomic DNA of AnpeNPV was extracted and digested with HindIII. The genome size of AnpeNPV is estimated at 128 kb. Based on the analysis of DNA fragments digested with HindIII, 23 fragments were bigger than 564 bp. A genomic library was generated using HindIII and the positive clones were sequenced and analysed. The gp64 gene, encoding the baculovirus envelope protein GP64, was found in an insert. The nucleotide sequence analysis indicated that the AnpeNPV gp64 gene consists of a 1,530 nucleotide open reading frame (ORF), encoding a protein of 509 amino acids. Of the eight gp64 homologues, the AnpeNPV gp64 ORF shared the most sequence similarity with the gp64 gene of Anticarsia gemmatalis NPV, but not Bombyx mori NPV. The upstream region of the AnpeNPV gp64 ORF encoded the conserved transcriptional elements for early and late stage of the viral infection cycle. These results indicated that AnpeNPV belongs to group I NPV and was far removed in molecular phylogeny from the BmNPV.
Poliovirus replication proteins: RNA sequence encoding P3-1b and the sites of proteolytic processing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Semler, B.L.; Anderson, C.W.; Kitamura, N.
1981-06-01
A partial amino-terminal amino acid sequence of each of the major proteins encoded by the replicase region of the poliovirus genome has been determined. A comparison of this sequence information with the amino acid sequence predicted from the RNA sequence that has been determined for the 3' region of the poliovirus genome has allowed us to locate precisely the proteolytic cleavage sites at which the initial polyprotein is processed to create the poliovirus products P3-1b (NCVP1b), P3-2 (NCVP2), P3-4b (NCVP4b), and P3-7c (NCVP7c). For each of these products, as well as for the small genome-linked protein VPg, proteolytic cleavage occursmore » between a glutamine and a glycine residue to create the amino terminus of each protein. This result suggests that a single proteinase may be responsible for all of these cleavages. The sequence data also allow the precise positioning of the genome-linked protein VPg within the precursor P3-1b just proximal to the amino terminus of polypeptide P3-2.« less
Lammers, P J; McLaughlin, S; Papin, S; Trujillo-Provencio, C; Ryncarz, A J
1990-01-01
An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria. Images PMID:2123860
Davis, John K.; Paoli, George C.; He, Zhongqi; Nadeau, Lloyd J.; Somerville, Charles C.; Spain, Jim C.
2000-01-01
Pseudomonas pseudoalcaligenes JS45 grows on nitrobenzene by a partially reductive pathway in which the intermediate hydroxylaminobenzene is enzymatically rearranged to 2-aminophenol by hydroxylaminobenzene mutase (HAB mutase). The properties of the enzyme, the reaction mechanism, and the evolutionary origin of the gene(s) encoding the enzyme are unknown. In this study, two open reading frames (habA and habB), each encoding an HAB mutase enzyme, were cloned from a P. pseudoalcaligenes JS45 genomic library and sequenced. The open reading frames encoding HabA and HabB are separated by 2.5 kb and are divergently transcribed. The deduced amino acid sequences of HabA and HabB are 44% identical. The HAB mutase specific activities in crude extracts of Escherichia coli clones synthesizing either HabA or HabB were similar to the specific activities of extracts of strain JS45 grown on nitrobenzene. HAB mutase activity in E. coli extracts containing HabB withstood heating at 85°C for 10 min, but extracts containing HabA were inactivated when they were heated at temperatures above 60°C. HAB mutase activity in extracts of P. pseudoalcaligenes JS45 grown on nitrobenzene exhibited intermediate temperature stability. Although both the habA gene and the habB gene conferred HAB mutase activity when they were separately cloned and expressed in E. coli, reverse transcriptase PCR analysis indicated that only habA is transcribed in P. pseudoalcaligenes JS45. A mutant strain derived from strain JS45 in which the habA gene was disrupted was unable to grow on nitrobenzene, which provided physiological evidence that HabA is involved in the degradation of nitrobenzene. A strain in which habB was disrupted grew on nitrobenzene. Gene Rv3078 of Mycobacterium tuberculosis H37Rv encodes a protein whose deduced amino acid sequence is 52% identical to the HabB amino acid sequence. E. coli containing M. tuberculosis gene Rv3078 cloned into pUC18 exhibited low levels of HAB mutase activity. Sequences that exhibit similarity to transposable element sequences are present between habA and habB, as well as downstream of habB, which suggests that horizontal gene transfer resulted in acquisition of one or both of the hab genes. PMID:10877793
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.
Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo
2016-05-01
The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
Takeda, Shuntaro; Furusawa, Akira
2017-09-22
We propose a scalable scheme for optical quantum computing using measurement-induced continuous-variable quantum gates in a loop-based architecture. Here, time-bin-encoded quantum information in a single spatial mode is deterministically processed in a nested loop by an electrically programmable gate sequence. This architecture can process any input state and an arbitrary number of modes with almost minimum resources, and offers a universal gate set for both qubits and continuous variables. Furthermore, quantum computing can be performed fault tolerantly by a known scheme for encoding a qubit in an infinite-dimensional Hilbert space of a single light mode.
NASA Astrophysics Data System (ADS)
Takeda, Shuntaro; Furusawa, Akira
2017-09-01
We propose a scalable scheme for optical quantum computing using measurement-induced continuous-variable quantum gates in a loop-based architecture. Here, time-bin-encoded quantum information in a single spatial mode is deterministically processed in a nested loop by an electrically programmable gate sequence. This architecture can process any input state and an arbitrary number of modes with almost minimum resources, and offers a universal gate set for both qubits and continuous variables. Furthermore, quantum computing can be performed fault tolerantly by a known scheme for encoding a qubit in an infinite-dimensional Hilbert space of a single light mode.
Mating-Type Genes and MAT Switching in Saccharomyces cerevisiae
Haber, James E.
2012-01-01
Mating type in Saccharomyces cerevisiae is determined by two nonhomologous alleles, MATa and MATα. These sequences encode regulators of the two different haploid mating types and of the diploids formed by their conjugation. Analysis of the MATa1, MATα1, and MATα2 alleles provided one of the earliest models of cell-type specification by transcriptional activators and repressors. Remarkably, homothallic yeast cells can switch their mating type as often as every generation by a highly choreographed, site-specific homologous recombination event that replaces one MAT allele with different DNA sequences encoding the opposite MAT allele. This replacement process involves the participation of two intact but unexpressed copies of mating-type information at the heterochromatic loci, HMLα and HMRa, which are located at opposite ends of the same chromosome-encoding MAT. The study of MAT switching has yielded important insights into the control of cell lineage, the silencing of gene expression, the formation of heterochromatin, and the regulation of accessibility of the donor sequences. Real-time analysis of MAT switching has provided the most detailed description of the molecular events that occur during the homologous recombinational repair of a programmed double-strand chromosome break. PMID:22555442
Dong, G; Vieille, C; Zeikus, J G
1997-01-01
The gene encoding the Pyrococcus furiosus hyperthermophilic amylopullulanase (APU) was cloned, sequenced, and expressed in Escherichia coli. The gene encoded a single 827-residue polypeptide with a 26-residue signal peptide. The protein sequence had very low homology (17 to 21% identity) with other APUs and enzymes of the alpha-amylase family. In particular, none of the consensus regions present in the alpha-amylase family could be identified. P. furiosus APU showed similarity to three proteins, including the P. furiosus intracellular alpha-amylase and Dictyoglomus thermophilum alpha-amylase A. The mature protein had a molecular weight of 89,000. The recombinant P. furiosus APU remained folded after denaturation at temperatures of < or = 70 degrees C and showed an apparent molecular weight of 50,000 in sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Denaturating temperatures of above 100 degrees C were required for complete unfolding. The enzyme was extremely thermostable, with an optimal activity at 105 degrees C and pH 5.5. Ca2+ increased the enzyme activity, thermostability, and substrate affinity. The enzyme was highly resistant to chemical denaturing reagents, and its activity increased up to twofold in the presence of surfactants. PMID:9293009
ssrA (tmRNA) Plays a Role in Salmonella enterica Serovar Typhimurium Pathogenesis
Julio, Steven M.; Heithoff, Douglas M.; Mahan, Michael J.
2000-01-01
Escherichia coli ssrA encodes a small stable RNA molecule, tmRNA, that has many diverse functions, including tagging abnormal proteins for degradation, supporting phage growth, and modulating the activity of DNA binding proteins. Here we show that ssrA plays a role in Salmonella enterica serovar Typhimurium pathogenesis and in the expression of several genes known to be induced during infection. Moreover, the phage-like attachment site, attL, encoded within ssrA, serves as the site of integration of a region of Salmonella-specific sequence; adjacent to the 5′ end of ssrA is another region of Salmonella-specific sequence with extensive homology to predicted proteins encoded within the unlinked Salmonella pathogenicity island SPI4. S. enterica serovar Typhimurium ssrA mutants fail to support the growth of phage P22 and are delayed in their ability to form viable phage particles following induction of a phage P22 lysogen. These data indicate that ssrA plays a role in the pathogenesis of Salmonella, serves as an attachment site for Salmonella-specific sequences, and is required for the growth of phage P22. PMID:10692360
NASA Astrophysics Data System (ADS)
Yee, Chai Sin; Murad, Abdul Munir Abdul; Bakar, Farah Diba Abu
2013-11-01
A gene encoding an endo-β-1,4-mannanase from Trichoderma virens UKM1 (manTV) and Aspergillus flavus UKM1 (manAF) was analysed with bioinformatic tools. In addition, A. flavus NRRL 3357 genome database was screened for a β-mannosidase gene and analysed (mndA-AF). These three genes were analysed to understand their gene properties. manTV and manAF both consists of 1,332-bp and 1,386-bp nucleotides encoding 443 and 461 amino acid residues, respectively. Both the endo-β-1,4-mannanases belong to the glycosyl hydrolase family 5 and contain a carbohydrate-binding module family 1 (CBM1). On the other hand, mndA-AF which is a 2,745-bp gene encodes a protein sequence of 914 amino acid residues. This β-mannosidase belongs to the glycosyl hydrolase family 2. Predicted molecular weight of manTV, manAF and mndA-AF are 47.74 kDa, 49.71 kDa and 103 kDa, respectively. All three predicted protein sequences possessed signal peptide sequence and are highly conserved among other fungal β-mannanases and β-mannosidases.
Borst, Gregoire; Niven, Elaine; Logie, Robert H
2012-04-01
Visual mental imagery and working memory are often assumed to play similar roles in high-order functions, but little is known of their functional relationship. In this study, we investigated whether similar cognitive processes are involved in the generation of visual mental images, in short-term retention of those mental images, and in short-term retention of visual information. Participants encoded and recalled visually or aurally presented sequences of letters under two interference conditions: spatial tapping or irrelevant visual input (IVI). In Experiment 1, spatial tapping selectively interfered with the retention of sequences of letters when participants generated visual mental images from aural presentation of the letter names and when the letters were presented visually. In Experiment 2, encoding of the sequences was disrupted by both interference tasks. However, in Experiment 3, IVI interfered with the generation of the mental images, but not with their retention, whereas spatial tapping was more disruptive during retention than during encoding. Results suggest that the temporary retention of visual mental images and of visual information may be supported by the same visual short-term memory store but that this store is not involved in image generation.
A reciprocal HLA-Disease Association in Rheumatoid Arthritis and Pemphigus Vulgaris
van Drongelen, Vincent; Holoshitz, Joseph
2017-01-01
Human leukocyte antigens (HLA) have been extensively studied as being antigen presenting receptors, but many aspects of their function remain elusive, especially their association with various autoimmune diseases. Here we discuss an illustrative case of the reciprocal relationship between certain HLA-DRB1 alleles and two diseases, rheumatoid arthritis (RA) and pemphigus vulgaris (PV). RA is strongly associated with HLA-DRB1 alleles that encode a five amino acid sequence motif in the 70-74 region of the DRβ chain, called the shared epitope (SE), while PV is associated with the HLA-DRB1*04:02 allele that encodes a different sequence motif in the same region. Interestingly, while HLA-DRB1*04:02 confers susceptibility to PV, this and other alleles that encode the same sequence motif in the 70-74 region of the DRβ chain are protective against RA. Currently, no convincing explanation for this antagonistic effect is present. Here we briefly review the immunology and immunogenetics of both diseases, identify remaining gaps in our understanding of their association with HLA, and propose the possibility that the 70-74 DRβ epitope may contribute to disease risk by mechanisms other than antigen presentation. PMID:27814654
Sequence-encoded colloidal origami and microbot assemblies from patchy magnetic cubes
Han, Koohee; Shields, C. Wyatt; Diwakar, Nidhi M.; Bharti, Bhuvnesh; López, Gabriel P.; Velev, Orlin D.
2017-01-01
Colloidal-scale assemblies that reconfigure on demand may serve as the next generation of soft “microbots,” artificial muscles, and other biomimetic devices. This requires the precise arrangement of particles into structures that are preprogrammed to reversibly change shape when actuated by external fields. The design and making of colloidal-scale assemblies with encoded directional particle-particle interactions remain a major challenge. We show how assemblies of metallodielectric patchy microcubes can be engineered to store energy through magnetic polarization and release it on demand by microscale reconfiguration. The dynamic pattern of folding and reconfiguration of the chain-like assemblies can be encoded in the sequence of the cube orientation. The residual polarization of the metallic facets on the microcubes leads to local interactions between the neighboring particles, which is directed by the conformational restrictions of their shape after harvesting energy from external magnetic fields. These structures can also be directionally moved, steered, and maneuvered by global forces from external magnetic fields. We illustrate these capabilities by examples of assemblies of specific sequences that can be actuated, reoriented, and spatially maneuvered to perform microscale operations such as capturing and transporting live cells, acting as prototypes of microbots, micromixers, and other active microstructures. PMID:28798960
Recombinant pinoresinol/lariciresinol reductase, recombinant dirigent protein, and methods of use
Lewis, Norman G.; Davin, Laurence B.; Dinkova-Kostova, Albena T.; Fujita, Masayuki; Gang, David R.; Sarkanen, Simo; Ford, Joshua D.
2001-04-03
Dirigent proteins and pinoresinol/lariciresinol reductases have been isolated, together with cDNAs encoding dirigent proteins and pinoresinol/lariciresinol reductases. Accordingly, isolated DNA sequences are provided which code for the expression of dirigent proteins and pinoresinol/lariciresinol reductases. In other aspects, replicable recombinant cloning vehicles are provided which code for dirigent proteins or pinoresinol/lariciresinol reductases or for a base sequence sufficiently complementary to at least a portion of dirigent protein or pinoresinol/lariciresinol reductase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding dirigent protein or pinoresinol/lariciresinol reductase. Thus, systems and methods are provided for the recombinant expression of dirigent proteins and/or pinoresinol/lariciresinol reductases.
Van Damme, Els J.M.; Charels, Diana; Roy, Soma; Tierens, Koenraad; Barre, Annick; Martins, José C.; Rougé, Pierre; Van Leuven, Fred; Does, Mirjam; Peumans, Willy J.
1999-01-01
We isolated SN-HLPf (Sambucus nigra hevein-like fruit protein), a hevein-like chitin-binding protein, from mature elderberry fruits. Cloning of the corresponding gene demonstrated that SN-HLPf is synthesized as a chimeric precursor consisting of an N-terminal chitin-binding domain corresponding to the mature elderberry protein and an unrelated C-terminal domain. Sequence comparisons indicated that the N-terminal domain of this precursor has high sequence similarity with the N-terminal domain of class I PR-4 (pathogenesis-related) proteins, whereas the C terminus is most closely related to that of class V chitinases. On the basis of these sequence homologies the gene encoding SN-HLPf can be considered a hybrid between a PR-4 and a class V chitinase gene. PMID:10198114
Guo, Deyin; Spetz, Carl; Saarma, Mart; Valkonen, Jari P T
2003-05-01
Potyviral helper-component proteinase (HCpro) is a multifunctional protein exerting its cellular functions in interaction with putative host proteins. In this study, cellular protein partners of the HCpro encoded by Potato virus A (PVA) (genus Potyvirus) were screened in a potato leaf cDNA library using a yeast two-hybrid system. Two cellular proteins were obtained that interact specifically with PVA HCpro in yeast and in the two in vitro binding assays used. Both proteins are encoded by single-copy genes in the potato genome. Analysis of the deduced amino acid sequences revealed that one (HIP1) of the two HCpro interactors is a novel RING finger protein. The sequence of the other protein (HIP2) showed no resemblance to the protein sequences available from databanks and has known biological functions.
Cloning and baculovirus expression of a desiccation stress gene from the beetle, Tenebrio molitor.
Graham, L A; Bendena, W G; Walker, V K
1996-02-01
The cDNA sequence encoding a novel desiccation stress protein (dsp28) found in the hemolymph of the common yellow mealworm beetle, Tenebrio molitor, has been determined. The sequence encodes a 225 amino acid protein containing a 20 amino acid signal peptide. Dsp28 shows no significant similarity to any known nucleic acid or protein sequence. Levels of dsp28 mRNA were found to increase approx 5-fold following desiccation. Dsp28 cDNA has been cloned into a baculovirus expression vector and the expressed protein was compared to native dsp28. Both dsp28 expressed by recombinant baculovirus and native dsp28 are glycosylated and N-terminally processed. Although dsp28 is induced by cold in addition to desiccation stress, it does not contribute to the freezing point depression (thermal hysteresis) observed in Tenebrio hemolymph.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Brady D.; Thompson, David N.; Apel, William A.
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady Deneys; Thompson, David N; Apel, William A.; Thompson, Vicki Slavchev; Reed, David W; Lacey, Jeffrey A
2014-05-06
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady D.; Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2015-11-17
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady D; Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A
2016-11-22
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dame, J.B.; Williams, J.L.; McCutchan, T.F.
An antimalarial immunogenic stimulant is described comprising an immunogenic carrier and a peptide sequence of between 2 and 1000 consecutive repeats of a sequence Asn-X-Y-Pro, wherein X is Ala or Val and Y is Asn or Asp.
Method to transform algae, materials therefor, and products produced thereby
Dunahay, T.G.; Roessler, P.G.; Jarvis, E.E.
1997-08-26
Disclosed is a method to transform chlorophyll C-containing algae. The method includes introducing a recombinant molecule comprising a nucleic acid molecule encoding a dominant selectable marker operatively linked to an algal regulatory control sequence into a chlorophyll C-containing alga in such a manner that the marker is produced by the alga. In a preferred embodiment the algal regulatory control sequence is derived from a diatom and preferably Cyclotella cryptica. Also disclosed is a chimeric molecule having one or more regulatory control sequences derived from one or more chlorophyll C-containing algae operatively linked to a nucleic acid molecule encoding a selectable marker, an RNA molecule and/or a protein, wherein the nucleic acid molecule does not normally occur with one or more of the regulatory control sequences. Further, specifically disclosed are molecules pACCNPT10, pACCNPT4.8 and pACCNPT5.1. The methods and materials of the present invention provide the ability to accomplish stable genetic transformation of chlorophyll C-containing algae. 2 figs.
Method to transform algae, materials therefor, and products produced thereby
Dunahay, Terri Goodman; Roessler, Paul G.; Jarvis, Eric E.
1997-01-01
Disclosed is a method to transform chlorophyll C-containing algae which includes introducing a recombinant molecule comprising a nucleic acid molecule encoding a dominant selectable marker operatively linked to an algal regulatory control sequence into a chlorophyll C-containing alga in such a manner that the marker is produced by the alga. In a preferred embodiment the algal regulatory control sequence is derived from a diatom and preferably Cyclotella cryptica. Also disclosed is a chimeric molecule having one or more regulatory control sequences derived from one or more chlorophyll C-containing algae operatively linked to a nucleic acid molecule encoding a selectable marker, an RNA molecule and/or a protein, wherein the nucleic acid molecule does not normally occur with one or more of the regulatory control sequences. Further specifically disclosed are molecules pACCNPT10, pACCNPT4.8 and pACCNPT5.1. The methods and materials of the present invention provide the ability to accomplish stable genetic transformation of chlorophyll C-containing algae.
The COG database: a tool for genome-scale analysis of protein functions and evolution
Tatusov, Roman L.; Galperin, Michael Y.; Natale, Darren A.; Koonin, Eugene V.
2000-01-01
Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www.ncbi.nlm.nih.gov/COG ). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56–83% of the gene products from each of the complete bacterial and archaeal genomes and ~35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes. PMID:10592175
Design and synthesis of digitally encoded polymers that can be decoded and erased
NASA Astrophysics Data System (ADS)
Roy, Raj Kumar; Meszynska, Anna; Laure, Chloé; Charles, Laurence; Verchin, Claire; Lutz, Jean-François
2015-05-01
Biopolymers such as DNA store information in their chains using controlled sequences of monomers. Here we describe a non-natural information-containing macromolecule that can store and retrieve digital information. Monodisperse sequence-encoded poly(alkoxyamine amide)s were synthesized using an iterative strategy employing two chemoselective steps: the reaction of a primary amine with an acid anhydride and the radical coupling of a carbon-centred radical with a nitroxide. A binary code was implemented in the polymer chains using three monomers: one nitroxide spacer and two interchangeable anhydrides defined as 0-bit and 1-bit. This methodology allows encryption of any desired sequence in the chains. Moreover, the formed sequences are easy to decode using tandem mass spectrometry. Indeed, these polymers follow predictable fragmentation pathways that can be easily deciphered. Moreover, poly(alkoxyamine amide)s are thermolabile. Thus, the digital information encrypted in the chains can be erased by heating the polymers in the solid state or in solution.
Design and synthesis of digitally encoded polymers that can be decoded and erased.
Roy, Raj Kumar; Meszynska, Anna; Laure, Chloé; Charles, Laurence; Verchin, Claire; Lutz, Jean-François
2015-05-26
Biopolymers such as DNA store information in their chains using controlled sequences of monomers. Here we describe a non-natural information-containing macromolecule that can store and retrieve digital information. Monodisperse sequence-encoded poly(alkoxyamine amide)s were synthesized using an iterative strategy employing two chemoselective steps: the reaction of a primary amine with an acid anhydride and the radical coupling of a carbon-centred radical with a nitroxide. A binary code was implemented in the polymer chains using three monomers: one nitroxide spacer and two interchangeable anhydrides defined as 0-bit and 1-bit. This methodology allows encryption of any desired sequence in the chains. Moreover, the formed sequences are easy to decode using tandem mass spectrometry. Indeed, these polymers follow predictable fragmentation pathways that can be easily deciphered. Moreover, poly(alkoxyamine amide)s are thermolabile. Thus, the digital information encrypted in the chains can be erased by heating the polymers in the solid state or in solution.
Ebolavirus comparative genomics
Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...
2015-07-14
The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less
Recombinant soluble adenovirus receptor
Freimuth, Paul I.
2002-01-01
Disclosed are isolated polypeptides from human CAR (coxsackievirus and adenovirus receptor) protein which bind adenovirus. Specifically disclosed are amino acid sequences which corresponds to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2. In other aspects, the disclosure relates to nucleic acid sequences encoding these domains as well as expression vectors which encode the domains and bacterial cells containing such vectors. Also disclosed is an isolated fusion protein comprised of the D1 polypeptide sequence fused to a polypeptide sequence which facilitates folding of D1 into a functional, soluble domain when expressed in bacteria. The functional D1 domain finds application for example in a therapeutic method for treating a patient infected with a virus which binds to D1, and also in a method for identifying an antiviral compound which interferes with viral attachment. Also included is a method for specifically targeting a cell for infection by a virus which binds to D1.
Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).
Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M
2001-12-01
A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits a-chymotrypsin, but less inhibits elastase or subtilisin. SETI was expressed by E. coli containing a PelB vector carrying the SETI encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using most likelihood inference methods.
The ENCODE project: implications for psychiatric genetics.
Kavanagh, D H; Dwyer, S; O'Donovan, M C; Owen, M J
2013-05-01
The ENCyclopedia Of DNA Elements (ENCODE) project is a public research consortium that aims to identify all functional elements of the human genome sequence. The project comprised 1640 data sets, from 147 different cell type and the findings were released in a coordinated set of 34 publications across several journals. The ENCODE publications report that 80.4% of the human genome displays some functionality. These data have important implications for interpreting results from large-scale genetics studies. We reviewed some of the key findings from the ENCODE publications and discuss how they can influence or inform further investigations into the genetic factors contributing to neuropsychiatric disorders.
MCR-1 and OXA-48 In Vivo Acquisition in KPC-Producing Escherichia coli after Colistin Treatment.
Beyrouthy, Racha; Robin, Frederic; Lessene, Aude; Lacombat, Igor; Dortet, Laurent; Naas, Thierry; Ponties, Valérie; Bonnet, Richard
2017-08-01
The spread of mcr-1 -encoding plasmids into carbapenem-resistant Enterobacteriaceae raises concerns about the emergence of untreatable bacteria. We report the acquisition of mcr-1 in a carbapenem-resistant Escherichia coli strain after a 3-week course of colistin in a patient repatriated to France from Portugal. Whole-genome sequencing revealed that the Klebsiella pneumoniae carbapenemase-producing E. coli strain acquired two plasmids, an IncL OXA-48-encoding plasmid and an IncX4 mcr-1 -encoding plasmid. This is the first report of mcr-1 in carbapenemase-encoding bacteria in France. Copyright © 2017 American Society for Microbiology.
Multi-Level Sequential Pattern Mining Based on Prime Encoding
NASA Astrophysics Data System (ADS)
Lianglei, Sun; Yun, Li; Jiang, Yin
Encoding is not only to express the hierarchical relationship, but also to facilitate the identification of the relationship between different levels, which will directly affect the efficiency of the algorithm in the area of mining the multi-level sequential pattern. In this paper, we prove that one step of division operation can decide the parent-child relationship between different levels by using prime encoding and present PMSM algorithm and CROSS-PMSM algorithm which are based on prime encoding for mining multi-level sequential pattern and cross-level sequential pattern respectively. Experimental results show that the algorithm can effectively extract multi-level and cross-level sequential pattern from the sequence database.
The Role of Manual Encoding in Learning by the Prelingually Deaf: An Initial Investigation.
ERIC Educational Resources Information Center
Stall, C. Harmon; Marshall, Philip H.
1984-01-01
A study tested the hypothesis that manual encoding aids learning in the prelingually deaf. Twenty-four adults who used fingerspelling as their primary means of communication participated in two groups of a paired-associate learning paradigm, using eight study-test trial sequences. Those using fingerspelling showed more recall and a faster learning…
DNA encoding for plant digalactosyldiacylglycerol galactosyltransferase and methods of use
Benning, Christoph; Doermann, Peter
2003-11-04
The cDNA encoding digalactosyldiacylglycerol galactosyltransferase (DGD1) is provided. The deduced amino acid sequence is also provided. Methods of making and using DGD1 to screen for new herbicides and alter a plant's leaf lipid composition are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors.
USDA-ARS?s Scientific Manuscript database
Serine proteases, such as trypsin and chymotrypsin, are the primary digestive enzymes in lepidopteran larvae, and are also involved in Bacillus thuringiensis (Bt) protoxin activation and protoxin/toxin degradation. We isolated and sequenced 34 cDNAs putatively encoding trypsins, chymotrypsins and th...
Regulating the ethylene response of a plant by modulation of F-box proteins
Guo, Hongwei; Ecker, Joseph R.
2010-02-02
The invention relates to transgenic plants having reduced sensitivity to ethylene as a result of having a recombinant nucleic acid encoding a F-box protein, and a method of producing a transgenic plant with reduced ethylene sensitivity by transforming the plant with a nucleic acid sequence encoding a F-box protein.
Characterization and chromosomal mapping of the human TFG gene involved in thyroid carcinoma
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mencinger, M.; Panagopoulos, I.; Andreasson, P.
1997-05-01
Homology searches in the Expressed Sequence Tag Database were performed using SPYGQ-rich regions as query sequences to find genes encoding protein regions similar to the N-terminal parts of the sarcoma-associated EWS and FUS proteins. Clone 22911 (T74973), encoding a SPYGQ-rich region in its 5{prime} end, and several other clones that overlapped 22911 were selected. The combined data made it possible to assemble a full-length cDNA sequence. This cDNA sequence is 1677 bp, containing an initiation codon ATG, an open reading frame of 400 amino acids, a poly(A) signal, and a poly(A) tail. We found 100% identity between the 5{prime} partmore » of the consensus sequence and the 598-bp-long sequence named TFG. The TFG sequence is fused to the 3{prime} end of NTRK1, generating the TRK-T3 fusion transcript found in papillary thyroid carcinoma. The cDNA therefore represents the full-length transcript of the TFG gene. TFG was localized to 3q11-q12 by fluorescence in situ hybridization. The 3{prime} and the 5{prime} ends of the TFG cDNA probe hybridized to a 2.2-kb band on Northern blot filters in all tissues examined. 28 refs., 5 figs., 1 tab.« less
Berends Sexton, T; Jones, J T; Mullet, J E
1990-05-01
A 6.25 kbp barley plastid DNA region located between psbA and psbD-psbC were sequenced and RNAs produced from this DNA were analyzed. TrnK(UUU), rps16 and trnQ(UUG) were located upstream of psbA. These genes were transcribed from the same DNA strand as psbA and multiple RNAs hybridized to them. TrnK and rsp16 contained introns; a 504 amino acid open reading frame (ORF504) was located within the trnK intron. Between trnQ and psbD-psbC was a 2.24 kbp region encoding psbK, psbI and trnS(GCU). PsbK and psbI are encoded on the same DNA strand as psbD-psbC whereas trnS(GCU) is transcribed from the opposite strand. Two large RNAs accumulate in barley etioplasts which contain psbK, psbI, anti-sense trnS(GCU) and psbD-psbC sequences. Other RNAs encode psbK and psbI only, or psbK only. The divergent trnS(GCU) located upstream of psbD-psbC and a second divergent trnS(UGA) located downstream of psbD-psbC were both expressed. Furthermore, RNA complementary to psbK and psbI mRNA was detected, suggesting that transcription from divergent overlapping transcription units may modulate expression from this DNA region.
Functional metagenomics reveals novel β-galactosidases not predictable from gene sequences.
Cheng, Jiujun; Romantsov, Tatyana; Engel, Katja; Doxey, Andrew C; Rose, David R; Neufeld, Josh D; Charles, Trevor C
2017-01-01
The techniques of metagenomics have allowed researchers to access the genomic potential of uncultivated microbes, but there remain significant barriers to determination of gene function based on DNA sequence alone. Functional metagenomics, in which DNA is cloned and expressed in surrogate hosts, can overcome these barriers, and make important contributions to the discovery of novel enzymes. In this study, a soil metagenomic library carried in an IncP cosmid was used for functional complementation for β-galactosidase activity in both Sinorhizobium meliloti (α-Proteobacteria) and Escherichia coli (γ-Proteobacteria) backgrounds. One β-galactosidase, encoded by six overlapping clones that were selected in both hosts, was identified as a member of glycoside hydrolase family 2. We could not identify ORFs obviously encoding possible β-galactosidases in 19 other sequenced clones that were only able to complement S. meliloti. Based on low sequence identity to other known glycoside hydrolases, yet not β-galactosidases, three of these ORFs were examined further. Biochemical analysis confirmed that all three encoded β-galactosidase activity. Lac36W_ORF11 and Lac161_ORF7 had conserved domains, but lacked similarities to known glycoside hydrolases. Lac161_ORF10 had neither conserved domains nor similarity to known glycoside hydrolases. Bioinformatic and structural modeling implied that Lac161_ORF10 protein represented a novel enzyme family with a five-bladed propeller glycoside hydrolase domain. By discovering founding members of three novel β-galactosidase families, we have reinforced the value of functional metagenomics for isolating novel genes that could not have been predicted from DNA sequence analysis alone.
NASA Technical Reports Server (NTRS)
Hsieh, H. L.; Tong, C. G.; Thomas, C.; Roux, S. J.
1996-01-01
A CDNA encoding a 47 kDa nucleoside triphosphatase (NTPase) that is associated with the chromatin of pea nuclei has been cloned and sequenced. The translated sequence of the cDNA includes several domains predicted by known biochemical properties of the enzyme, including five motifs characteristic of the ATP-binding domain of many proteins, several potential casein kinase II phosphorylation sites, a helix-turn-helix region characteristic of DNA-binding proteins, and a potential calmodulin-binding domain. The deduced primary structure also includes an N-terminal sequence that is a predicted signal peptide and an internal sequence that could serve as a bipartite-type nuclear localization signal. Both in situ immunocytochemistry of pea plumules and immunoblots of purified cell fractions indicate that most of the immunodetectable NTPase is within the nucleus, a compartment proteins typically reach through nuclear pores rather than through the endoplasmic reticulum pathway. The translated sequence has some similarity to that of human lamin C, but not high enough to account for the earlier observation that IgG against human lamin C binds to the NTPase in immunoblots. Northern blot analysis shows that the NTPase MRNA is strongly expressed in etiolated plumules, but only poorly or not at all in the leaf and stem tissues of light-grown plants. Accumulation of NTPase mRNA in etiolated seedlings is stimulated by brief treatments with both red and far-red light, as is characteristic of very low-fluence phytochrome responses. Southern blotting with pea genomic DNA indicates the NTPase is likely to be encoded by a single gene.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
2012-02-01
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
2012-01-01
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martinez, Antonio D.; Berka, Randy; Henrissat, Bernard
2008-05-01
A major thrust of the white biotechnology movement involves the development of enzyme systems which depolymerize biomass to simple sugars which are subsequently converted to sustainable biofuels (e.g., ethanol) and chemical intermediates. The fungus Trichoderma reesei (syn. Hypocrea jecorina) represents a paradigm for the industrial production of highly efficient cellulases and hemicellulases needed for hydrolysis of biomass polysaccharides. Herein we describe intriguing attributes of the T. reeseigenome in relation to the future of fuel biotechnology. The T. reesei genome sequence was derived using a whole genome shotgun approach combined with finishing work to generate an assembly comprising 89 scaffolds totalingmore » 34 Mbp with few gaps. In total, 9,130 gene models were predicted using a combination of ab initio and sequence similarity-based methods and EST data. Considering the industrial utility and effectiveness of its enzymes, the T. reesei genome surprisingly encodes the fewest cellulases and hemicellulases of any fungus having the ability to hydrolyze plant cell wall polysaccharides and whose genome has been sequenced. Many genes encoding carbohydrate active enzymes are distributed non-randomly in groups or clusters that interestingly lie between regions of synteny with other Sordariomycetes. Additionally, the T. reesei genome contains a multitude of genes encoding biosynthetic pathways for secondary metabolites (possible antibacterial and antifungal compounds) which may promote successful competition and survival in the crowded and competitive soil habitat occupied by T. reesei. Our analysis coupled with the availability of genome sequence data provides a roadmap for construction of enhanced T. reesei strains for industrial applications.« less
Type II restriction modification system methylation subunit of Alicyclobacillus acidocaldarius
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Brady D.; Newby, Deborah T.; Lacey, Jeffrey A.
2018-02-13
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
Type II restriction-modification system methylation subunit of Alicyclobacillus acidocaldarius
Lee, Brady D; Newby, Deborah T; Lacey, Jeffrey A; Thompson, David N; Thompson, Vicki S; Apel, William A; Roberto, Francisco F; Reed, David W
2013-10-29
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
2014-10-01
4 APPENDICES 4 INTRODUCTION: Despite tremendous advances in mutation detection with gene panels...population frequency and overlap with ENCODE regions. 2a. Align reads to the reference sequence (months 4-10) 2b. Identify SNPs, indels, CNVs and
Type II restriction-modification system methylation subunit of Alicyclobacillus acidocaldarius
Lee, Brady D; Newby, Deborah T; Lacey, Jeffrey A; Thompson, David N; Thompson, Vicki S; Apel, William A; Roberto, Francisco F; Reed, David W
2015-05-12
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.