Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
Simultaneous master-slave Omega pairs. [navigation system featuring low cost receiver
NASA Technical Reports Server (NTRS)
Burhans, R. W.
1974-01-01
Master-slave sequence ordering of the Omega system is suggested as a method of improving the pair geometry for low-cost receiver user benefit. The sequence change will not affect present sophisticated processor users other than require new labels for some pair combinations, but may require worldwide transmitter operators to slightly alter their long-range synchronizing techniques.
Equally parsimonious pathways through an RNA sequence space are not equally likely
NASA Technical Reports Server (NTRS)
Lee, Y. H.; DSouza, L. M.; Fox, G. E.
1997-01-01
An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
Teaching Crystallography to Noncrystallographers.
ERIC Educational Resources Information Center
Glusker, Jenny P.
1988-01-01
Addresses the requirements of high school students and noncrystallographers in lectures on crystals, diffraction, and structure analysis. Discusses basic understanding and a sequence that addresses these requirements. Suggests visual and descriptive teaching methods used in this effort. (CW)
Sequence for the Training of Eye-Hand Coordination Needed for the Organization of Handwriting Tasks
ERIC Educational Resources Information Center
Trester, Mary Fran
1971-01-01
Suggested is a sequence of 11 class activities, progressing from gross to fine motor skills, to assist the development of skills required to perform handwriting tasks successfully, for use particularly with children who lack fine motor control and eye-hand coordination. (KW)
Diatom centromeres suggest a mechanism for nuclear DNA acquisition
Diner, Rachel E.; Noddings, Chari M.; Lian, Nathan C.; ...
2017-07-18
Centromeres are essential for cell division and growth in all eukaryotes, and knowledge of their sequence and structure guides the development of artificial chromosomes for functional cellular biology studies. Centromeric proteins are conserved among eukaryotes; however, centromeric DNA sequences are highly variable. We combined forward and reverse genetic approaches with chromatin immunoprecipitation to identify centromeres of the model diatom Phaeodactylum tricornutum. We observed 25 unique centromere sequences typically occurring once per chromosome, a finding that helps to resolve nuclear genome organization and indicates monocentric regional centromeres. Diatom centromere sequences contain low-GC content regions but lack repeats or other conserved sequencemore » features. Native and foreign sequences with similar GC content to P. tricornutum centromeres can maintain episomes and recruit the diatom centromeric histone protein CENH3, suggesting nonnative sequences can also function as diatom centromeres. Thus, simple sequence requirements may enable DNA from foreign sources to persist in the nucleus as extrachromosomal episomes, revealing a potential mechanism for organellar and foreign DNA acquisition.« less
Diatom centromeres suggest a mechanism for nuclear DNA acquisition
DOE Office of Scientific and Technical Information (OSTI.GOV)
Diner, Rachel E.; Noddings, Chari M.; Lian, Nathan C.
Centromeres are essential for cell division and growth in all eukaryotes, and knowledge of their sequence and structure guides the development of artificial chromosomes for functional cellular biology studies. Centromeric proteins are conserved among eukaryotes; however, centromeric DNA sequences are highly variable. We combined forward and reverse genetic approaches with chromatin immunoprecipitation to identify centromeres of the model diatom Phaeodactylum tricornutum. We observed 25 unique centromere sequences typically occurring once per chromosome, a finding that helps to resolve nuclear genome organization and indicates monocentric regional centromeres. Diatom centromere sequences contain low-GC content regions but lack repeats or other conserved sequencemore » features. Native and foreign sequences with similar GC content to P. tricornutum centromeres can maintain episomes and recruit the diatom centromeric histone protein CENH3, suggesting nonnative sequences can also function as diatom centromeres. Thus, simple sequence requirements may enable DNA from foreign sources to persist in the nucleus as extrachromosomal episomes, revealing a potential mechanism for organellar and foreign DNA acquisition.« less
ERIC Educational Resources Information Center
Barron, Kenneth E.; Apple, Kevin J.
2014-01-01
Coursework in statistics and research methods is a core requirement in most undergraduate psychology programs. However, is there an optimal way to structure and sequence methodology courses to facilitate student learning? For example, should statistics be required before research methods, should research methods be required before statistics, or…
Hess, M A; Duncan, R F
1996-01-01
Preferential translation of Drosophila heat shock protein 70 (Hsp70) mRNA requires only the 5'-untranslated region (5'-UTR). The sequence of this region suggests that it has relatively little secondary structure, which may facilitate efficient protein synthesis initiation. To determine whether minimal 5'-UTR secondary structure is required for preferential translation during heat shock, the effect of introducing stem-loops into the Hsp70 mRNA 5'-UTR was measured. Stem-loops of -11 kcal/mol abolished translation during heat shock, but did not reduce translation in non-heat shocked cells. A -22 kcal/mol stem-loop was required to comparably inhibit translation during growth at normal temperatures. To investigate whether specific sequence elements are also required for efficient preferential translation, deletion and mutation analyses were conducted in a truncated Hsp70 5'-UTR containing only the cap-proximal and AUG-proximal segments. Linker-scanner mutations in the cap-proximal segment (+1 to +37) did not impair translation. Re-ordering the segments reduced mRNA translational efficiency by 50%. Deleting the AUG-proximal segment severely inhibited translation. A 5-extension of the full-length leader specifically impaired heat shock translation. These results indicate that heat shock reduces the capacity to unwind 5-UTR secondary structure, allowing only mRNAs with minimal 5'-UTR secondary structure to be efficiently translated. A function for specific sequences is also suggested. PMID:8710519
Hardwicke, Joseph T; Richards, Helen; Cafferky, Louise; Underwood, Imogen; ter Horst, Britt; Slator, Rona
2016-03-01
Pierre Robin sequence results from a cascade of events that occur during embryologic development and frequently presents with cleft palate. Some studies have shown speech outcomes to be worse in patients with Pierre Robin sequence after cleft palate repair. A cohort of Pierre Robin sequence patients who all required an airway intervention and nasogastric feeding in the neonatal period were identified and speech outcomes assessed at 5 years of age. A cleft- and sex-matched non-Pierre Robin sequence, cleft palate-only comparison group was also identified from the same institution and study period. A total of 24 patients with Pierre Robin sequence that required airway and nutritional support in the neonatal period were matched for age, sex, and cleft type to a group of 24 non-Pierre Robin sequence cleft patients. There was no significant difference in the incidence of oronasal fistula between the groups. Secondary surgery for velopharyngeal incompetence was significantly more (p = 0.017) in the Pierre Robin sequence group, who also had significantly greater nasality (p = 0.031) and cleft speech characteristic (p = 0.023) scores. The authors hypothesize that other factors may exist in Pierre Robin sequence that may lead to poor speech outcomes. The authors would suggest counseling parents of children with Pierre Robin sequence that have required a neonatal airway intervention, that speech development may be poorer than in other children with cleft palate, and that these children will have a significantly higher incidence of secondary speech surgery. Risk, II.
Impact of sequencing depth in ChIP-seq experiments
Jung, Youngsook L.; Luquette, Lovelace J.; Ho, Joshua W.K.; Ferrari, Francesco; Tolstorukov, Michael; Minoda, Aki; Issner, Robbyn; Epstein, Charles B.; Karpen, Gary H.; Kuroda, Mitzi I.; Park, Peter J.
2014-01-01
In a chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) experiment, an important consideration in experimental design is the minimum number of sequenced reads required to obtain statistically significant results. We present an extensive evaluation of the impact of sequencing depth on identification of enriched regions for key histone modifications (H3K4me3, H3K36me3, H3K27me3 and H3K9me2/me3) using deep-sequenced datasets in human and fly. We propose to define sufficient sequencing depth as the number of reads at which detected enrichment regions increase <1% for an additional million reads. Although the required depth depends on the nature of the mark and the state of the cell in each experiment, we observe that sufficient depth is often reached at <20 million reads for fly. For human, there are no clear saturation points for the examined datasets, but our analysis suggests 40–50 million reads as a practical minimum for most marks. We also devise a mathematical model to estimate the sufficient depth and total genomic coverage of a mark. Lastly, we find that the five algorithms tested do not agree well for broad enrichment profiles, especially at lower depths. Our findings suggest that sufficient sequencing depth and an appropriate peak-calling algorithm are essential for ensuring robustness of conclusions derived from ChIP-seq data. PMID:24598259
Target Site Recognition by a Diversity-Generating Retroelement
Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.
2011-01-01
Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Integration of Temporal and Ordinal Information During Serial Interception Sequence Learning
Gobel, Eric W.; Sanchez, Daniel J.; Reber, Paul J.
2011-01-01
The expression of expert motor skills typically involves learning to perform a precisely timed sequence of movements (e.g., language production, music performance, athletic skills). Research examining incidental sequence learning has previously relied on a perceptually-cued task that gives participants exposure to repeating motor sequences but does not require timing of responses for accuracy. Using a novel perceptual-motor sequence learning task, learning a precisely timed cued sequence of motor actions is shown to occur without explicit instruction. Participants learned a repeating sequence through practice and showed sequence-specific knowledge via a performance decrement when switched to an unfamiliar sequence. In a second experiment, the integration of representation of action order and timing sequence knowledge was examined. When either action order or timing sequence information was selectively disrupted, performance was reduced to levels similar to completely novel sequences. Unlike prior sequence-learning research that has found timing information to be secondary to learning action sequences, when the task demands require accurate action and timing information, an integrated representation of these types of information is acquired. These results provide the first evidence for incidental learning of fully integrated action and timing sequence information in the absence of an independent representation of action order, and suggest that this integrative mechanism may play a material role in the acquisition of complex motor skills. PMID:21417511
Vandenbol, M; Jauniaux, J C; Grenson, M
1989-11-15
The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.
Kemény, Ferenc; Meier, Beat
2016-02-01
While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.
Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology.
Otto, Thomas D; Sanders, Mandy; Berriman, Matthew; Newbold, Chris
2010-07-15
The accuracy of reference genomes is important for downstream analysis but a low error rate requires expensive manual interrogation of the sequence. Here, we describe a novel algorithm (Iterative Correction of Reference Nucleotides) that iteratively aligns deep coverage of short sequencing reads to correct errors in reference genome sequences and evaluate their accuracy. Using Plasmodium falciparum (81% A + T content) as an extreme example, we show that the algorithm is highly accurate and corrects over 2000 errors in the reference sequence. We give examples of its application to numerous other eukaryotic and prokaryotic genomes and suggest additional applications. The software is available at http://icorn.sourceforge.net
Sun, Zhizeng; Mehta, Shrenik C; Adamski, Carolyn J; Gibbs, Richard A; Palzkill, Timothy
2016-09-12
CphA is a Zn(2+)-dependent metallo-β-lactamase that efficiently hydrolyzes only carbapenem antibiotics. To understand the sequence requirements for CphA function, single codon random mutant libraries were constructed for residues in and near the active site and mutants were selected for E. coli growth on increasing concentrations of imipenem, a carbapenem antibiotic. At high concentrations of imipenem that select for phenotypically wild-type mutants, the active-site residues exhibit stringent sequence requirements in that nearly all residues in positions that contact zinc, the substrate, or the catalytic water do not tolerate amino acid substitutions. In addition, at high imipenem concentrations a number of residues that do not directly contact zinc or substrate are also essential and do not tolerate substitutions. Biochemical analysis confirmed that amino acid substitutions at essential positions decreased the stability or catalytic activity of the CphA enzyme. Therefore, the CphA active - site is fragile to substitutions, suggesting active-site residues are optimized for imipenem hydrolysis. These results also suggest that resistance to inhibitors targeted to the CphA active site would be slow to develop because of the strong sequence constraints on function.
Effects of learning duration on implicit transfer.
Tanaka, Kanji; Watanabe, Katsumi
2015-10-01
Implicit learning and transfer in sequence acquisition play important roles in daily life. Several previous studies have found that even when participants are not aware that a transfer sequence has been transformed from the learning sequence, they are able to perform the transfer sequence faster and more accurately; this suggests implicit transfer of visuomotor sequences. Here, we investigated whether implicit transfer could be modulated by the number of trials completed in a learning session. Participants learned a sequence through trial and error, known as the m × n task (Hikosaka et al. in J Neurophysiol 74:1652-1661, 1995). In the learning session, participants were required to successfully perform the same sequence 4, 12, 16, or 20 times. In the transfer session, participants then learned one of two other sequences: one where the button configuration Vertically Mirrored the learning sequence, or a randomly generated sequence. Our results show that even when participants did not notice the alternation rule (i.e., vertical mirroring), their total working time was less and their total number of errors was lower in the transfer session compared with those who performed a Random sequence, irrespective of the number of trials completed in the learning session. This result suggests that implicit transfer likely occurs even over a shorter learning duration.
Benfey, PN; Takatsuji, H; Ren, L; Shah, DM; Chua, NH
1990-01-01
We have analyzed expression from deletion derivatives of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) 5[prime]-upstream region in transgenic petunia flowers and seedlings. In seedlings, expression was strongest in root cortex cells and in trichomes. High-level expression in petals and in seedling roots was conferred by large (>500 base-pair) stretches of sequence, but was lost when smaller fragments were analyzed individually. This apparent requirement for extensive sequence suggests that combinations of cis-elements that are widely separated control tissue-specific expression from the EPSPS promoter. We have also used the high-level, petal-specific expression of the EPSPS promoter to change petal color in two mutant petunia lines. PMID:12354968
Ni, Xiangyang; Westpheling, Janet
1997-01-01
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression. PMID:9371809
UV-Visible Spectroscopy-Based Quantification of Unlabeled DNA Bound to Gold Nanoparticles.
Baldock, Brandi L; Hutchison, James E
2016-12-20
DNA-functionalized gold nanoparticles have been increasingly applied as sensitive and selective analytical probes and biosensors. The DNA ligands bound to a nanoparticle dictate its reactivity, making it essential to know the type and number of DNA strands bound to the nanoparticle surface. Existing methods used to determine the number of DNA strands per gold nanoparticle (AuNP) require that the sequences be fluorophore-labeled, which may affect the DNA surface coverage and reactivity of the nanoparticle and/or require specialized equipment and other fluorophore-containing reagents. We report a UV-visible-based method to conveniently and inexpensively determine the number of DNA strands attached to AuNPs of different core sizes. When this method is used in tandem with a fluorescence dye assay, it is possible to determine the ratio of two unlabeled sequences of different lengths bound to AuNPs. Two sizes of citrate-stabilized AuNPs (5 and 12 nm) were functionalized with mixtures of short (5 base) and long (32 base) disulfide-terminated DNA sequences, and the ratios of sequences bound to the AuNPs were determined using the new method. The long DNA sequence was present as a lower proportion of the ligand shell than in the ligand exchange mixture, suggesting it had a lower propensity to bind the AuNPs than the short DNA sequence. The ratio of DNA sequences bound to the AuNPs was not the same for the large and small AuNPs, which suggests that the radius of curvature had a significant influence on the assembly of DNA strands onto the AuNPs.
Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.
2013-01-01
The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011
ERIC Educational Resources Information Center
Thackray, Richard I.; And Others
The ability to resist distraction is an important requirement for air traffic controllers. The study examined the relationship between performance on the Stroop color-word interference test (a suggested measure of distraction susceptibility) and impairment under auditory distraction on a task requiring the subject to generate random sequences of…
Design of lunar base observatories
NASA Technical Reports Server (NTRS)
Johnson, Stewart W.
1988-01-01
Several recently suggested concepts for conducting astronomy from a lunar base are cited. Then, the process and sequence of events that will be required to design an observatory to be emplaced on the Moon are examined.
Curriculum Guidelines for Periodontics.
ERIC Educational Resources Information Center
Journal of Dental Education, 1985
1985-01-01
Guidelines describe the interrelationships of this and other dental fields, give an overview of the curriculum and its primary educational objectives, and outline the suggested prerequisites, core content, specific behavioral objectives, sequencing, and faculty requirements. (MSE)
Chromatin and RNAi factors protect the C. elegans germline against repetitive sequences
Robert, Valérie J.P.; Sijen, Titia; van Wolfswinkel, Josien; Plasterk, Ronald H.A.
2005-01-01
Protection of genomes against invasion by repetitive sequences, such as transposons, viruses, and repetitive transgenes, involves strong and selective silencing of these sequences. During silencing of repetitive transgenes, a trans effect (“cosuppression”) occurs that results in silencing of cognate endogenous genes. Here we report RNA interference (RNAi) screens performed to catalog genes required for cosuppression in the Caenorhabditis elegans germline. We find factors with a putative role in chromatin remodeling and factors involved in RNAi. Together with molecular data also presented in this study, these results suggest that in C. elegans repetitive sequences trigger transcriptional gene silencing using RNAi and chromatin factors. PMID:15774721
MRPrimer: a MapReduce-based method for the thorough design of valid and ranked primers for PCR
Kim, Hyerin; Kang, NaNa; Chon, Kang-Wook; Kim, Seonho; Lee, NaHye; Koo, JaeHyung; Kim, Min-Soo
2015-01-01
Primer design is a fundamental technique that is widely used for polymerase chain reaction (PCR). Although many methods have been proposed for primer design, they require a great deal of manual effort to generate feasible and valid primers, including homology tests on off-target sequences using BLAST-like tools. That approach is inconvenient for many target sequences of quantitative PCR (qPCR) due to considering the same stringent and allele-invariant constraints. To address this issue, we propose an entirely new method called MRPrimer that can design all feasible and valid primer pairs existing in a DNA database at once, while simultaneously checking a multitude of filtering constraints and validating primer specificity. Furthermore, MRPrimer suggests the best primer pair for each target sequence, based on a ranking method. Through qPCR analysis using 343 primer pairs and the corresponding sequencing and comparative analyses, we showed that the primer pairs designed by MRPrimer are very stable and effective for qPCR. In addition, MRPrimer is computationally efficient and scalable and therefore useful for quickly constructing an entire collection of feasible and valid primers for frequently updated databases like RefSeq. Furthermore, we suggest that MRPrimer can be utilized conveniently for experiments requiring primer design, especially real-time qPCR. PMID:26109350
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence
Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E.
2016-01-01
It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task. PMID:26973502
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence.
Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E
2016-01-01
It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task.
NASA Technical Reports Server (NTRS)
Breault, D. T.; Lichtler, A. C.; Rowe, D. W.
1997-01-01
Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.
Ward, T R; Hoang, M L; Prusty, R; Lau, C K; Keil, R L; Fangman, W L; Brewer, B J
2000-07-01
In the ribosomal DNA of Saccharomyces cerevisiae, sequences in the nontranscribed spacer 3' of the 35S ribosomal RNA gene are important to the polar arrest of replication forks at a site called the replication fork barrier (RFB) and also to the cis-acting, mitotic hyperrecombination site called HOT1. We have found that the RFB and HOT1 activity share some but not all of their essential sequences. Many of the mutations that reduce HOT1 recombination also decrease or eliminate fork arrest at one of two closely spaced RFB sites, RFB1 and RFB2. A simple model for the juxtaposition of RFB and HOT1 sequences is that the breakage of strands in replication forks arrested at RFB stimulates recombination. Contrary to this model, we show here that HOT1-stimulated recombination does not require the arrest of forks at the RFB. Therefore, while HOT1 activity is independent of replication fork arrest, HOT1 and RFB require some common sequences, suggesting the existence of a common trans-acting factor(s).
Mechanism of foreign DNA selection in a bacterial adaptive immune system
Sashital, Dipali G.; Wiedenheft, Blake; Doudna, Jennifer A.
2012-01-01
Summary In bacterial and archaeal CRISPR immune pathways, DNA sequences from invading bacteriophage or plasmids are integrated into CRISPR loci within the host genome, conferring immunity against subsequent infections. The ribonucleoprotein complex Cascade utilizes RNAs generated from these loci to target complementary “non-self” DNA sequences for destruction, while avoiding binding to “self” sequences within the CRISPR locus. Here we show that CasA, the largest protein subunit of Cascade, is required for non-self target recognition and binding. Combining a 2.3 Å crystal structure of CasA with cryo-EM structures of Cascade, we have identified a loop that is required for viral defense. This loop contacts a conserved 3-base pair motif that is required for non-self target selection. Our data suggest a model in which the CasA loop scans DNA for this short motif prior to target destabilization and binding, maximizing the efficiency of DNA surveillance by Cascade. PMID:22521690
Chung, Jongsuk; Son, Dae-Soon; Jeon, Hyo-Jeong; Kim, Kyoung-Mee; Park, Gahee; Ryu, Gyu Ha; Park, Woong-Yang; Park, Donghyun
2016-01-01
Targeted capture massively parallel sequencing is increasingly being used in clinical settings, and as costs continue to decline, use of this technology may become routine in health care. However, a limited amount of tissue has often been a challenge in meeting quality requirements. To offer a practical guideline for the minimum amount of input DNA for targeted sequencing, we optimized and evaluated the performance of targeted sequencing depending on the input DNA amount. First, using various amounts of input DNA, we compared commercially available library construction kits and selected Agilent’s SureSelect-XT and KAPA Biosystems’ Hyper Prep kits as the kits most compatible with targeted deep sequencing using Agilent’s SureSelect custom capture. Then, we optimized the adapter ligation conditions of the Hyper Prep kit to improve library construction efficiency and adapted multiplexed hybrid selection to reduce the cost of sequencing. In this study, we systematically evaluated the performance of the optimized protocol depending on the amount of input DNA, ranging from 6.25 to 200 ng, suggesting the minimal input DNA amounts based on coverage depths required for specific applications. PMID:27220682
Discriminative Prediction of A-To-I RNA Editing Events from DNA Sequence
Sun, Jiangming; Singh, Pratibha; Bagge, Annika; Valtat, Bérengère; Vikman, Petter; Spégel, Peter; Mulder, Hindrik
2016-01-01
RNA editing is a post-transcriptional alteration of RNA sequences that, via insertions, deletions or base substitutions, can affect protein structure as well as RNA and protein expression. Recently, it has been suggested that RNA editing may be more frequent than previously thought. A great impediment, however, to a deeper understanding of this process is the paramount sequencing effort that needs to be undertaken to identify RNA editing events. Here, we describe an in silico approach, based on machine learning, that ameliorates this problem. Using 41 nucleotide long DNA sequences, we show that novel A-to-I RNA editing events can be predicted from known A-to-I RNA editing events intra- and interspecies. The validity of the proposed method was verified in an independent experimental dataset. Using our approach, 203 202 putative A-to-I RNA editing events were predicted in the whole human genome. Out of these, 9% were previously reported. The remaining sites require further validation, e.g., by targeted deep sequencing. In conclusion, the approach described here is a useful tool to identify potential A-to-I RNA editing events without the requirement of extensive RNA sequencing. PMID:27764195
Miras, Manuel; Rodríguez-Hernández, Ana M; Romero-López, Cristina; Berzal-Herranz, Alfredo; Colchero, Jaime; Aranda, Miguel A; Truniger, Verónica
2018-01-01
In eukaryotes, the formation of a 5'-cap and 3'-poly(A) dependent protein-protein bridge is required for translation of its mRNAs. In contrast, several plant virus RNA genomes lack both of these mRNA features, but instead have a 3'-CITE (for cap-independent translation enhancer), a RNA element present in their 3'-untranslated region that recruits translation initiation factors and is able to control its cap-independent translation. For several 3'-CITEs, direct RNA-RNA long-distance interactions based on sequence complementarity between the 5'- and 3'-ends are required for efficient translation, as they bring the translation initiation factors bound to the 3'-CITE to the 5'-end. For the carmovirus melon necrotic spot virus (MNSV), a 3'-CITE has been identified, and the presence of its 5'-end in cis has been shown to be required for its activity. Here, we analyze the secondary structure of the 5'-end of the MNSV RNA genome and identify two highly conserved nucleotide sequence stretches that are complementary to the apical loop of its 3'-CITE. In in vivo cap-independent translation assays with mutant constructs, by disrupting and restoring sequence complementarity, we show that the interaction between the 3'-CITE and at least one complementary sequence in the 5'-end is essential for virus RNA translation, although efficient virus translation and multiplication requires both connections. The complementary sequence stretches are invariant in all MNSV isolates, suggesting that the dual 5'-3' RNA:RNA interactions are required for optimal MNSV cap-independent translation and multiplication.
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.
Hirsh, J; Morgan, B A; Scholnick, S B
1986-01-01
We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA boxlike sequence used the normal RNA start points, indicating that this sequences is not required for start point specificity. Images PMID:3099170
Evidence of automatic processing in sequence learning using process-dissociation
Mong, Heather M.; McCabe, David P.; Clegg, Benjamin A.
2012-01-01
This paper proposes a way to apply process-dissociation to sequence learning in addition and extension to the approach used by Destrebecqz and Cleeremans (2001). Participants were trained on two sequences separated from each other by a short break. Following training, participants self-reported their knowledge of the sequences. A recognition test was then performed which required discrimination of two trained sequences, either under the instructions to call any sequence encountered in the experiment “old” (the inclusion condition), or only sequence fragments from one half of the experiment “old” (the exclusion condition). The recognition test elicited automatic and controlled process estimates using the process dissociation procedure, and suggested both processes were involved. Examining the underlying processes supporting performance may provide more information on the fundamental aspects of the implicit and explicit constructs than has been attainable through awareness testing. PMID:22679465
Gönner, Lorenz; Vitay, Julien; Hamker, Fred H.
2017-01-01
Hippocampal place-cell sequences observed during awake immobility often represent previous experience, suggesting a role in memory processes. However, recent reports of goals being overrepresented in sequential activity suggest a role in short-term planning, although a detailed understanding of the origins of hippocampal sequential activity and of its functional role is still lacking. In particular, it is unknown which mechanism could support efficient planning by generating place-cell sequences biased toward known goal locations, in an adaptive and constructive fashion. To address these questions, we propose a model of spatial learning and sequence generation as interdependent processes, integrating cortical contextual coding, synaptic plasticity and neuromodulatory mechanisms into a map-based approach. Following goal learning, sequential activity emerges from continuous attractor network dynamics biased by goal memory inputs. We apply Bayesian decoding on the resulting spike trains, allowing a direct comparison with experimental data. Simulations show that this model (1) explains the generation of never-experienced sequence trajectories in familiar environments, without requiring virtual self-motion signals, (2) accounts for the bias in place-cell sequences toward goal locations, (3) highlights their utility in flexible route planning, and (4) provides specific testable predictions. PMID:29075187
DOE Office of Scientific and Technical Information (OSTI.GOV)
Markussen, Turhan; Jonassen, Christine Monceyron; Numanovic, Sanela
2008-05-10
Infectious salmon anaemia virus (ISAV) is an orthomyxovirus causing a multisystemic, emerging disease in Atlantic salmon. Here we present, for the first time, detailed sequence analyses of the full-genome sequence of a presumed avirulent isolate displaying a full-length hemagglutinin-esterase (HE) gene (HPR0), and compare this with full-genome sequences of 11 Norwegian ISAV isolates from clinically diseased fish. These analyses revealed the presence of a virulence marker right upstream of the putative cleavage site R{sub 267} in the fusion (F) protein, suggesting a Q{sub 266} {yields} L{sub 266} substitution to be a prerequisite for virulence. To gain virulence in isolates lackingmore » this substitution, a sequence insertion near the cleavage site seems to be required. This strongly suggests the involvement of a protease recognition pattern at the cleavage site of the fusion protein as a determinant of virulence, as seen in highly pathogenic influenza A virus H5 or H7 and the paramyxovirus Newcastle disease virus.« less
Integration deficiencies associated with continuous limb movement sequences in Parkinson's disease.
Park, Jin-Hoon; Stelmach, George E
2009-11-01
The present study examined the extent to which Parkinson's disease (PD) influences integration of continuous limb movement sequences. Eight patients with idiopathic PD and 8 age-matched normal subjects were instructed to perform repetitive sequential aiming movements to specified targets under three-accuracy constraints: 1) low accuracy (W = 7 cm) - minimal accuracy constraint, 2) high accuracy (W = 0.64 cm) - maximum accuracy constraint, and 3) mixed accuracy constraint - one target of high accuracy and another target of low accuracy. The characteristic of sequential movements in the low accuracy condition was mostly cyclical, whereas in the high accuracy condition it was discrete in both groups. When the accuracy constraint was mixed, the sequential movements were executed by assembling discrete and cyclical movements in both groups, suggesting that for PD patients the capability to combine discrete and cyclical movements to meet a task requirement appears to be intact. However, such functional linkage was not as pronounced as was in normal subjects. Close examination of movement from the mixed accuracy condition revealed marked movement hesitations in the vicinity of the large target in PD patients, resulting in a bias toward discrete movement. These results suggest that PD patients may have deficits in ongoing planning and organizing processes during movement execution when the tasks require to assemble various accuracy requirements into more complex movement sequences.
NASA Astrophysics Data System (ADS)
Tremberger, G.; Dehipawala, Sunil; Cheung, E.; Holden, T.; Sullivan, R.; Nguyen, A.; Lieberman, D.; Cheung, T.
2015-09-01
All metallo-proteins need post-translation metal incorporation. In fact, the isotope ratio of Fe, Cu, and Zn in physiology and oncology have emerged as an important tool. The nickel containing F430 is the prosthetic group of the enzyme methyl coenzyme M reductase which catalyzes the release of methane in the final step of methano-genesis, a prime energy metabolism candidate for life exploration space mission in the solar system. The 3.5 Gyr early life sulfite reductase as a life switch energy metabolism had Fe-Mo clusters. The nitrogenase for nitrogen fixation 3 billion years ago had Mo. The early life arsenite oxidase needed for anoxygenic photosynthesis energy metabolism 2.8 billion years ago had Mo and Fe. The selection pressure in metal incorporation inside a protein would be quantifiable in terms of the related nucleotide sequence complexity with fractal dimension and entropy values. Simulation model showed that the studied metal-required energy metabolism sequences had at least ten times more selection pressure relatively in comparison to the horizontal transferred sequences in Mealybug, guided by the outcome histogram of the correlation R-sq values. The metal energy metabolism sequence group was compared to the circadian clock KaiC sequence group using magnesium atomic level bond shifting mechanism in the protein, and the simulation model would suggest a much higher selection pressure for the energy life switch sequence group. The possibility of using Kepler 444 as an example of ancient life in Galaxy with the associated exoplanets has been proposed and is further discussed in this report. Examples of arsenic metal bonding shift probed by Synchrotron-based X-ray spectroscopy data and Zn controlled FOXP2 regulated pathways in human and chimp brain studied tissue samples are studied in relationship to the sequence bioinformatics. The analysis results suggest that relatively large metal bonding shift amount is associated with low probability correlation R-sq outcome in the bioinformatics simulation.
Automated Finishing with Autofinish
Gordon, David; Desmarais, Cindy; Green, Phil
2001-01-01
Currently, the genome sequencing community is producing shotgun sequence data at a very high rate, but finishing (collecting additional directed sequence data to close gaps and improve the quality of the data) is not matching that rate. One reason for the difference is that shotgun sequencing is highly automated but finishing is not: Most finishing decisions, such as which directed reads to obtain and which specialized sequencing techniques to use, are made by people. If finishing rates are to increase to match shotgun sequencing rates, most finishing decisions also must be automated. The Autofinish computer program (which is part of the Consed computer software package) does this by automatically choosing finishing reads. Autofinish is able to suggest most finishing reads required for completion of each sequencing project, greatly reducing the amount of human attention needed. Autofinish sometimes completely finishes the project, with no human decisions required. It cannot solve the most complex problems, so we recommend that Autofinish be allowed to suggest reads for the first three rounds of finishing, and if the project still is not finished completely, a human finisher complete the work. We compared this Autofinish-Hybrid method of finishing against a human finisher in five different projects with a variety of shotgun depths by finishing each project twice—once with each method. This comparison shows that the Autofinish-Hybrid method saves many hours over a human finisher alone, while using roughly the same number and type of reads and closing gaps at roughly the same rate. Autofinish currently is in production use at several large sequencing centers. It is designed to be adaptable to the finishing strategy of the lab—it can finish using some or all of the following: resequencing reads, reverses, custom primer walks on either subclone templates or whole clone templates, PCR, or minilibraries. Autofinish has been used for finishing cDNA, genomic clones, and whole bacterial genomes (see http://www.phrap.org). PMID:11282977
Cheng, Jinkui; Lai, Jinsheng; Gong, Zhizhong
2016-01-01
DNA polymerase δ plays crucial roles in DNA repair and replication as well as maintaining genomic stability. However, the function of POLD2, the second small subunit of DNA polymerase δ, has not been characterized yet in Arabidopsis (Arabidopsis thaliana). During a genetic screen for release of transcriptional gene silencing, we identified a mutation in POLD2. Whole-genome bisulfite sequencing indicated that POLD2 is not involved in the regulation of DNA methylation. POLD2 genetically interacts with Ataxia Telangiectasia-mutated and Rad3-related and DNA polymerase α. The pold2-1 mutant exhibits genomic instability with a high frequency of homologous recombination. It also exhibits hypersensitivity to DNA-damaging reagents and short telomere length. Whole-genome chromatin immunoprecipitation sequencing and RNA sequencing analyses suggest that pold2-1 changes H3K27me3 and H3K4me3 modifications, and these changes are correlated with the gene expression levels. Our study suggests that POLD2 is required for maintaining genome integrity and properly establishing the epigenetic markers during DNA replication to modulate gene expression. PMID:27208288
Direct Estimation of Structure and Motion from Multiple Frames
1990-03-01
sequential frames in an image sequence. As a consequence, the information that can be extracted from a single optical flow field is limited to a snapshot of...researchers have developed techniques that extract motion and structure inform.4tion without computation of the optical flow. Best known are the "direct...operated iteratively on a sequence of images to recover structure. It required feature extraction and matching. Broida and Chellappa [9] suggested the use of
MRPrimer: a MapReduce-based method for the thorough design of valid and ranked primers for PCR.
Kim, Hyerin; Kang, NaNa; Chon, Kang-Wook; Kim, Seonho; Lee, NaHye; Koo, JaeHyung; Kim, Min-Soo
2015-11-16
Primer design is a fundamental technique that is widely used for polymerase chain reaction (PCR). Although many methods have been proposed for primer design, they require a great deal of manual effort to generate feasible and valid primers, including homology tests on off-target sequences using BLAST-like tools. That approach is inconvenient for many target sequences of quantitative PCR (qPCR) due to considering the same stringent and allele-invariant constraints. To address this issue, we propose an entirely new method called MRPrimer that can design all feasible and valid primer pairs existing in a DNA database at once, while simultaneously checking a multitude of filtering constraints and validating primer specificity. Furthermore, MRPrimer suggests the best primer pair for each target sequence, based on a ranking method. Through qPCR analysis using 343 primer pairs and the corresponding sequencing and comparative analyses, we showed that the primer pairs designed by MRPrimer are very stable and effective for qPCR. In addition, MRPrimer is computationally efficient and scalable and therefore useful for quickly constructing an entire collection of feasible and valid primers for frequently updated databases like RefSeq. Furthermore, we suggest that MRPrimer can be utilized conveniently for experiments requiring primer design, especially real-time qPCR. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural hot spots for the solubility of globular proteins
Ganesan, Ashok; Siekierska, Aleksandra; Beerten, Jacinte; Brams, Marijke; Van Durme, Joost; De Baets, Greet; Van der Kant, Rob; Gallardo, Rodrigo; Ramakers, Meine; Langenberg, Tobias; Wilkinson, Hannah; De Smet, Frederik; Ulens, Chris; Rousseau, Frederic; Schymkowitz, Joost
2016-01-01
Natural selection shapes protein solubility to physiological requirements and recombinant applications that require higher protein concentrations are often problematic. This raises the question whether the solubility of natural protein sequences can be improved. We here show an anti-correlation between the number of aggregation prone regions (APRs) in a protein sequence and its solubility, suggesting that mutational suppression of APRs provides a simple strategy to increase protein solubility. We show that mutations at specific positions within a protein structure can act as APR suppressors without affecting protein stability. These hot spots for protein solubility are both structure and sequence dependent but can be computationally predicted. We demonstrate this by reducing the aggregation of human α-galactosidase and protective antigen of Bacillus anthracis through mutation. Our results indicate that many proteins possess hot spots allowing to adapt protein solubility independently of structure and function. PMID:26905391
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.
Tian, Pengfei; Best, Robert B
2017-10-17
Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Failure to produce response variability with reinforcement
Schwartz, Barry
1982-01-01
Two experiments attempted to train pigeons to produce variable response sequences. In the first, naive pigeons were exposed to a procedure requiring four pecks on each of two keys in any order, with a reinforcer delivered only if a given sequence was different from the preceding one. In the second experiment, the same pigeons were exposed to this procedure after having been trained successfully to alternate between two specific response sequences. In neither case did any pigeon produce more than a few different sequences or obtain more than 50% of the possible reinforcers. Stereotyped sequences developed even though stereotypy was not reinforced. It is suggested that reinforcers have both hedonic and informative properties and that the hedonic properties are responsible for sterotyped repetition of reinforced responses, even when stereotypy is negatively related to reinforcer delivery. PMID:16812263
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates.
Guimier, Anne; Gabriel, George C; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D; Klena, Nikolai T; Jimenez, Gina; Miller, Neil A; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J; Baker, Candice N; Di Filippo, Sylvie; Peterson, Kevin A; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D; Farrow, Emily G; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A; Bonnet, Damien; Kingsmore, Stephen F; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W; Gordon, Christopher T
2015-11-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole-exome sequencing, whole-genome sequencing and high-throughput cohort resequencing, we identified recessive mutations in MMP21 (encoding matrix metallopeptidase 21) in nine index cases with heterotaxy. In addition, Mmp21-mutant mice and mmp21-morphant zebrafish displayed heterotaxy and abnormal cardiac looping, respectively, suggesting a new role for extracellular matrix remodeling in the establishment of laterality in vertebrates.
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates
Guimier, Anne; Gabriel, George C.; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D.; Klena, Nikolai T.; Jimenez, Gina; Miller, Neil A.; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J.; Baker, Candice N.; Di Filippo, Sylvie; Peterson, Kevin A.; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D.; Farrow, Emily G.; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A.; Bonnet, Damien; Kingsmore, Stephen F.; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W.; Gordon, Christopher T.
2017-01-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole exome sequencing, whole genome sequencing and high-throughput cohort resequencing we identified recessive mutations in matrix metallopeptidase 21 (MMP21), in nine index cases with heterotaxy. In addition, Mmp21 mutant mice and morphant zebrafish display heterotaxy and abnormal cardiac looping, respectively, suggesting a novel role for extra-cellular remodeling in the establishment of laterality in vertebrates. PMID:26437028
Superresolution restoration of an image sequence: adaptive filtering approach.
Elad, M; Feuer, A
1999-01-01
This paper presents a new method based on adaptive filtering theory for superresolution restoration of continuous image sequences. The proposed methodology suggests least squares (LS) estimators which adapt in time, based on adaptive filters, least mean squares (LMS) or recursive least squares (RLS). The adaptation enables the treatment of linear space and time-variant blurring and arbitrary motion, both of them assumed known. The proposed new approach is shown to be of relatively low computational requirements. Simulations demonstrating the superresolution restoration algorithms are presented.
Forgetting motor programmes: retrieval dynamics in procedural memory.
Tempel, Tobias; Frings, Christian
2014-01-01
When motor sequences are stored in memory in a categorised manner, selective retrieval of some sequences can induce forgetting of the non-retrieved sequences. We show that such retrieval-induced forgetting (RIF) occurs not only in cued recall but also in a test assessing memory indirectly by providing novel test cues without involving recall of items. Participants learned several sequential finger movements (SFMs), each consisting of the movement of two fingers of either the left or the right hand. Subsequently, they performed retrieval practice on half of the sequences of one hand. A final task then required participants to enter letter dyads. A subset of these dyads corresponded to the previously learned sequences. RIF was present in the response times during the entering of the dyads. The finding of RIF in the slowed-down execution of motor programmes overlapping with initially trained motor sequences suggests that inhibition resolved interference between procedural representations of the acquired motor sequences of one hand during retrieval practice.
Principles of protein folding--a perspective from simple exact models.
Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.
1995-01-01
General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
Repp, Bruno H
2004-10-01
In a task that requires in-phase synchronization of finger taps with an isochronous sequence of target tones that is interleaved with a sequence of distractor tones at various fixed phase relationships, the taps tend to be attracted to the distractor tones, especially when the distractor tones closely precede the target tones [Repp, B. H. (2003a). Phase attraction in sensorimotor synchronization with auditory sequences: Effects of single and periodic distractors on synchronization accuracy. Journal of Experimental Psychology: Human Perception and Performance, 29, 290-309]. The present research addressed two related questions about this distractor effect: (1) Is it a function of the absolute temporal separation or of the relative phase of the two stimulus sequences? (2) Is it the result of perceptual grouping (integration) of target and distractor tones or of simultaneous attraction to two independent sequences? In three experiments, distractor effects were compared across two different sequence rates. The results suggest that absolute temporal separation, not relative phase, is the critical variable. Experiment 3 also included an anti-phase tapping task that addressed the second question directly. The results suggest that the attraction of taps to distractor tones is caused mainly by temporal integration of target and distractor tones within a fixed window of 100-150 ms duration, with the earlier-occurring tone being weighted more strongly than the later-occurring one.
Hoffman, Brett; Li, Zhubing; Liu, Qiang
2015-08-01
Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
Widespread and Persistent Populations of a Major New Marine Actinomycete Taxon in Ocean Sediments
Mincer, Tracy J.; Jensen, Paul R.; Kauffman, Christopher A.; Fenical, William
2002-01-01
A major taxon of obligate marine bacteria within the order Actinomycetales has been discovered from ocean sediments. Populations of these bacteria (designated MAR 1) are persistent and widespread, spanning at least three distinct ocean systems. In this study, 212 actinomycete isolates possessing MAR 1 morphologies were examined and all but two displayed an obligate requirement of seawater for growth. Forty-five of these isolates, representing all observed seawater-requiring morphotypes, were partially sequenced and found to share characteristic small-subunit rRNA signature nucleotides between positions 207 and 468 (Escherichia coli numbering). Phylogenetic characterization of seven representative isolates based on almost complete sequences of genes encoding 16S rRNA (16S ribosomal DNA) yielded a monophyletic clade within the family Micromonosporaceae and suggests novelty at the genus level. This is the first evidence for the existence of widespread populations of obligate marine actinomycetes. Organic extracts from cultured members of this new group exhibit remarkable biological activity, suggesting that they represent a prolific resource for biotechnological applications. PMID:12324350
miRNA-embedded shRNAs for Lineage-specific BCL11A Knockdown and Hemoglobin F Induction
Guda, Swaroopa; Brendel, Christian; Renella, Raffaele; Du, Peng; Bauer, Daniel E; Canver, Matthew C; Grenier, Jennifer K; Grimson, Andrew W; Kamran, Sophia C; Thornton, James; de Boer, Helen; Root, David E; Milsom, Michael D; Orkin, Stuart H; Gregory, Richard I; Williams, David A
2015-01-01
RNA interference (RNAi) technology using short hairpin RNAs (shRNAs) expressed via RNA polymerase (pol) III promoters has been widely exploited to modulate gene expression in a variety of mammalian cell types. For certain applications, such as lineage-specific knockdown, embedding targeting sequences into pol II-driven microRNA (miRNA) architecture is required. Here, using the potential therapeutic target BCL11A, we demonstrate that pol III-driven shRNAs lead to significantly increased knockdown but also increased cytotoxcity in comparison to pol II-driven miRNA adapted shRNAs (shRNAmiR) in multiple hematopoietic cell lines. We show that the two expression systems yield mature guide strand sequences that differ by a 4 bp shift. This results in alternate seed sequences and consequently influences the efficacy of target gene knockdown. Incorporating a corresponding 4 bp shift into the guide strand of shRNAmiRs resulted in improved knockdown efficiency of BCL11A. This was associated with a significant de-repression of the hemoglobin target of BCL11A, human γ-globin or the murine homolog Hbb-y. Our results suggest the requirement for optimization of shRNA sequences upon incorporation into a miRNA backbone. These findings have important implications in future design of shRNAmiRs for RNAi-based therapy in hemoglobinopathies and other diseases requiring lineage-specific expression of gene silencing sequences. PMID:26080908
Quiet Quincy Quarter. Teacher's Guide [and] Student Materials.
ERIC Educational Resources Information Center
Zishka, Phyllis
This document suggests learning activities, teaching methods, objectives, and evaluation measures for a second grade consumer education unit on quarters. The unit, which requires approximately six hours of class time, reinforces basic social studies and mathematics skills including following sequences of numbers, distinguishing left from right,…
The Teaching of Electromagnetic Induction at Sixth Form Level
ERIC Educational Resources Information Center
Archenhold, W. F.
1974-01-01
Presents some ideas about teaching electromagnetic induction at sixth form level, including educational objectives, learning difficulties, syllabus requirements, selection of unit system, and sequence of material presentation. Suggests the Education Group of the Institute of Physics hold further discussions on these aspects before including the…
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes.
Haiminen, Niina; Feltus, F Alex; Parida, Laxmi
2011-04-15
We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.
Motor cortex is required for learning but not executing a motor skill
Kawai, Risa; Markman, Timothy; Poddar, Rajesh; Ko, Raymond; Fantana, Antoniu; Dhawale, Ashesh; Kampff, Adam R.; Ölveczky, Bence P.
2018-01-01
Motor cortex is widely believed to underlie the acquisition and execution of motor skills, yet its contributions to these processes are not fully understood. One reason is that studies on motor skills often conflate motor cortex’s established role in dexterous control with roles in learning and producing task-specific motor sequences. To dissociate these aspects, we developed a motor task for rats that trains spatiotemporally precise movement patterns without requirements for dexterity. Remarkably, motor cortex lesions had no discernible effect on the acquired skills, which were expressed in their distinct pre-lesion forms on the very first day of post-lesion training. Motor cortex lesions prior to training, however, rendered rats unable to acquire the stereotyped motor sequences required for the task. These results suggest a remarkable capacity of subcortical motor circuits to execute learned skills and a previously unappreciated role for motor cortex in ‘tutoring’ these circuits during learning. PMID:25892304
Primate amygdala neurons evaluate the progress of self-defined economic choice sequences
Grabenhorst, Fabian; Hernadi, Istvan; Schultz, Wolfram
2016-01-01
The amygdala is a prime valuation structure yet its functions in advanced behaviors are poorly understood. We tested whether individual amygdala neurons encode a critical requirement for goal-directed behavior: the evaluation of progress during sequential choices. As monkeys progressed through choice sequences toward rewards, amygdala neurons showed phasic, gradually increasing responses over successive choice steps. These responses occurred in the absence of external progress cues or motor preplanning. They were often specific to self-defined sequences, typically disappearing during instructed control sequences with similar reward expectation. Their build-up rate reflected prospectively the forthcoming choice sequence, suggesting adaptation to an internal plan. Population decoding demonstrated a high-accuracy progress code. These findings indicate that amygdala neurons evaluate the progress of planned, self-defined behavioral sequences. Such progress signals seem essential for aligning stepwise choices with internal plans. Their presence in amygdala neurons may inform understanding of human conditions with amygdala dysfunction and deregulated reward pursuit. DOI: http://dx.doi.org/10.7554/eLife.18731.001 PMID:27731795
Primate amygdala neurons evaluate the progress of self-defined economic choice sequences.
Grabenhorst, Fabian; Hernadi, Istvan; Schultz, Wolfram
2016-10-12
The amygdala is a prime valuation structure yet its functions in advanced behaviors are poorly understood. We tested whether individual amygdala neurons encode a critical requirement for goal-directed behavior: the evaluation of progress during sequential choices. As monkeys progressed through choice sequences toward rewards, amygdala neurons showed phasic, gradually increasing responses over successive choice steps. These responses occurred in the absence of external progress cues or motor preplanning. They were often specific to self-defined sequences, typically disappearing during instructed control sequences with similar reward expectation. Their build-up rate reflected prospectively the forthcoming choice sequence, suggesting adaptation to an internal plan. Population decoding demonstrated a high-accuracy progress code. These findings indicate that amygdala neurons evaluate the progress of planned, self-defined behavioral sequences. Such progress signals seem essential for aligning stepwise choices with internal plans. Their presence in amygdala neurons may inform understanding of human conditions with amygdala dysfunction and deregulated reward pursuit.
Teaching Drafting 101: What Comes First?
ERIC Educational Resources Information Center
Carkhuff, Don
2006-01-01
Employers require pristine drawings that convey clarity and precision for the production of goods. Can a change in sequence of instruction be expeditious and help teachers better prepare their students for the workplace? Research suggests that combining traditional drafting and computer-aided drafting (CAD) instruction makes sense. It is analogous…
Kong, Daochun; Coleman, Thomas R.; DePamphilis, Melvin L.
2003-01-01
Budding yeast (Saccharomyces cerevisiae) origin recognition complex (ORC) requires ATP to bind specific DNA sequences, whereas fission yeast (Schizosaccharomyces pombe) ORC binds to specific, asymmetric A:T-rich sites within replication origins, independently of ATP, and frog (Xenopus laevis) ORC seems to bind DNA non-specifically. Here we show that despite these differences, ORCs are functionally conserved. Firstly, SpOrc1, SpOrc4 and SpOrc5, like those from other eukaryotes, bound ATP and exhibited ATPase activity, suggesting that ATP is required for pre-replication complex (pre-RC) assembly rather than origin specificity. Secondly, SpOrc4, which is solely responsible for binding SpORC to DNA, inhibited up to 70% of XlORC-dependent DNA replication in Xenopus egg extract by preventing XlORC from binding to chromatin and assembling pre-RCs. Chromatin-bound SpOrc4 was located at AT-rich sequences. XlORC in egg extract bound preferentially to asymmetric A:T-sequences in either bare DNA or in sperm chromatin, and it recruited XlCdc6 and XlMcm proteins to these sequences. These results reveal that XlORC initiates DNA replication preferentially at the same or similar sites to those targeted in S.pombe. PMID:12840006
O'Brien, Frances G.; Yui Eto, Karina; Murphy, Riley J. T.; Fairhurst, Heather M.; Coombs, Geoffrey W.; Grubb, Warren B.; Ramsay, Joshua P.
2015-01-01
Staphylococcus aureus is a common cause of hospital, community and livestock-associated infections and is increasingly resistant to multiple antimicrobials. A significant proportion of antimicrobial-resistance genes are plasmid-borne, but only a minority of S. aureus plasmids encode proteins required for conjugative transfer or Mob relaxase proteins required for mobilisation. The pWBG749 family of S. aureus conjugative plasmids can facilitate the horizontal transfer of diverse antimicrobial-resistance plasmids that lack Mob genes. Here we reveal that these mobilisable plasmids carry copies of the pWBG749 origin-of-transfer (oriT) sequence and that these oriT sequences facilitate mobilisation by pWBG749. Sequences resembling the pWBG749 oriT were identified on half of all sequenced S. aureus plasmids, including the most prevalent large antimicrobial-resistance/virulence-gene plasmids, pIB485, pMW2 and pUSA300HOUMR. oriT sequences formed five subfamilies with distinct inverted-repeat-2 (IR2) sequences. pWBG749-family plasmids encoding each IR2 were identified and pWBG749 mobilisation was found to be specific for plasmids carrying matching IR2 sequences. Specificity of mobilisation was conferred by a putative ribbon-helix-helix-protein gene smpO. Several plasmids carried 2–3 oriT variants and pWBG749-mediated recombination occurred between distinct oriT sites during mobilisation. These observations suggest this relaxase-in trans mechanism of mobilisation by pWBG749-family plasmids is a common mechanism of plasmid dissemination in S. aureus. PMID:26243776
The role of RT carry-over for congruence sequence effects in masked priming.
Huber-Huber, Christoph; Ansorge, Ulrich
2017-05-01
The present study disentangles 2 sources of the congruence sequence effect with masked primes: congruence and response time of the previous trial (reaction time [RT] carry-over). Using arrows as primes and targets and a metacontrast masking procedure we found congruence as well as congruence sequence effects. In addition, congruence sequence effects decreased when RT carry-over was accounted for in a mixed model analysis, suggesting that RT carry-over contributes to congruence sequence effects in masked priming. Crucially, effects of previous trial congruence were not cancelled out completely indicating that RT carry-over and previous trial congruence are 2 sources feeding into the congruence sequence effect. A secondary task requiring response speed judgments demonstrated general awareness of response speed (Experiments 1), but removing this secondary task (Experiment 2) showed that RT carry-over effects were also present in single-task conditions. During (dual-task) prime-awareness test parts of both experiments, however, RT carry-over failed to modulate congruence effects, suggesting that some task sets of the participants can prevent the effect. The basic RT carry-over effects are consistent with the conflict adaptation account, with the adaptation to the statistics of the environment (ASE) model, and possibly with the temporal learning explanation. Additionally considering the task-dependence of RT carry-over, the results are most compatible with the conflict adaptation account. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Casey, Stephen P; Hall, Heather J; Stanley, Helen F; Vincent, Amanda C J
2004-02-01
Phylogenetic relationships among 93 specimens of 22 species of seahorses (genus Hippocampus) from the Atlantic and Indo-Pacific Oceans were analysed using cytochrome b gene sequence data. A maximum sequence divergence of 23.2% (Kimura 2-parameter model) suggests a pre-Tethyan origin for the genus. Despite a greater number of seahorse species in the Indo-Pacific than in the Atlantic Ocean, there was no compelling genetic evidence to support an Indo-Pacific origin for the genus Hippocampus. The phylogenetic data suggest that high diversity in the Indo-Pacific results from speciation events dating from the Pleistocene to the Miocene, or earlier. Both vicariance and dispersal events in structuring the current global distribution of seahorses. The results suggested that several species designations need re-evaluating, and further phylogeographic studies are required to determine patterns and processes of seahorse dispersal.
Diversity in Requirement of Genetic and Epigenetic Factors for Centromere Function in Fungi ▿
Roy, Babhrubahan; Sanyal, Kaustuv
2011-01-01
A centromere is a chromosomal region on which several proteins assemble to form the kinetochore. The centromere-kinetochore complex helps in the attachment of chromosomes to spindle microtubules to mediate segregation of chromosomes to daughter cells during mitosis and meiosis. In several budding yeast species, the centromere forms in a DNA sequence-dependent manner, whereas in most other fungi, factors other than the DNA sequence also determine the centromere location, as centromeres were able to form on nonnative sequences (neocentromeres) when native centromeres were deleted in engineered strains. Thus, in the absence of a common DNA sequence, the cues that have facilitated centromere formation on a specific DNA sequence for millions of years remain a mystery. Kinetochore formation is facilitated by binding of a centromere-specific histone protein member of the centromeric protein A (CENP-A) family that replaces a canonical histone H3 to form a specialized centromeric chromatin structure. However, the process of kinetochore formation on the rapidly evolving and seemingly diverse centromere DNAs in different fungal species is largely unknown. More interestingly, studies in various yeasts suggest that the factors required for de novo centromere formation (establishment) may be different from those required for maintenance (propagation) of an already established centromere. Apart from the DNA sequence and CENP-A, many other factors, such as posttranslational modification (PTM) of histones at centric and pericentric chromatin, RNA interference, and DNA methylation, are also involved in centromere formation, albeit in a species-specific manner. In this review, we discuss how several genetic and epigenetic factors influence the evolution of structure and function of centromeres in fungal species. PMID:21908596
Harris, Golda G.; Lombardi, Patrick M.; Pemberton, Travis A.; Matsui, Tsutomu; Weiss, Thomas M.; Cole, Kathryn E.; Köksal, Mustafa; Murphy, Frank V.; Vedula, L. Sangeetha; Chou, Wayne K.W.; Cane, David E.; Christianson, David W.
2015-01-01
Geosmin synthase from Streptomyces coelicolor (ScGS) catalyzes an unusual, metal-dependent terpenoid cyclization and fragmentation reaction sequence. Two distinct active sites are required for catalysis: the N-terminal domain catalyzes the ionization and cyclization of farnesyl diphosphate to form germacradienol and inorganic pyrophosphate (PPi), and the C-terminal domain catalyzes the protonation, cyclization, and fragmentation of germacradienol to form geosmin and acetone through a retro-Prins reaction. A unique αα domain architecture is predicted for ScGS based on amino acid sequence: each domain contains the metal-binding motifs typical of a class I terpenoid cyclase, and each domain requires Mg2+ for catalysis. Here, we report the X-ray crystal structure of the unliganded N-terminal domain of ScGS and the structure of its complex with 3 Mg2+ ions and alendronate. These structures highlight conformational changes required for active site closure and catalysis. Although neither full-length ScGS nor constructs of the C-terminal domain could be crystallized, homology models of the C-terminal domain were constructed based on ~36% sequence identity with the N-terminal domain. Small-angle X-ray scattering experiments yield low resolution molecular envelopes into which the N-terminal domain crystal structure and the C-terminal domain homology model were fit, suggesting possible αα domain architectures as frameworks for bifunctional catalysis. PMID:26598179
Modality dependence and intermodal transfer in the Corsi Spatial Sequence Task: Screen vs. Floor.
Röser, Andrea; Hardiess, Gregor; Mallot, Hanspeter A
2016-07-01
Four versions of the Corsi Spatial Sequence Task (CSST) were tested in a complete within-subject design, investigating whether participants' performance depends on the modality of task presentation and reproduction that put different demands on spatial processing. Presentation of the sequence (encoding phase) and the reproduction (recall phase) were each carried out either on a computer screen or on the floor of a room, involving actual walking in the recall phase. Combinations of the two different encoding and recall procedures result in the modality conditions Screen-Screen, Screen-Floor, Floor-Screen, and Floor-Floor. Results show the expected decrease in performance with increasing sequence length, which is likely due to processing limitations of working memory. We also found differences in performance between the modality conditions indicating different involvements of spatial working memory processes. Participants performed best in the Screen-Screen modality condition. Floor-Screen and Floor-Floor modality conditions require additional working memory resources for reference frame transformation and spatial updating, respectively; the resulting impairment of the performance was about the same in these two conditions. Finally, the Screen-Floor modality condition requires both types of additional spatial demands and led to the poorest performance. Therefore, we suggest that besides the well-known spatial requirements of CSST, additional working memory resources are demanded in walking CSST supporting processes such as spatial updating, mental rotation, reference frame transformation, and the control of walking itself.
Fingerprints of Modified RNA Bases from Deep Sequencing Profiles.
Kietrys, Anna M; Velema, Willem A; Kool, Eric T
2017-11-29
Posttranscriptional modifications of RNA bases are not only found in many noncoding RNAs but have also recently been identified in coding (messenger) RNAs as well. They require complex and laborious methods to locate, and many still lack methods for localized detection. Here we test the ability of next-generation sequencing (NGS) to detect and distinguish between ten modified bases in synthetic RNAs. We compare ultradeep sequencing patterns of modified bases, including miscoding, insertions and deletions (indels), and truncations, to unmodified bases in the same contexts. The data show widely varied responses to modification, ranging from no response, to high levels of mutations, insertions, deletions, and truncations. The patterns are distinct for several of the modifications, and suggest the future use of ultradeep sequencing as a fingerprinting strategy for locating and identifying modifications in cellular RNAs.
A fitness cost associated with the antibiotic resistance enzyme SME-1 beta-lactamase.
Marciano, David C; Karkouti, Omid Y; Palzkill, Timothy
2007-08-01
The bla(TEM-1) beta-lactamase gene has become widespread due to the selective pressure of beta-lactam use and its stable maintenance on transferable DNA elements. In contrast, bla(SME-1) is rarely isolated and is confined to the chromosome of carbapenem-resistant Serratia marcescens strains. Dissemination of bla(SME-1) via transfer to a mobile DNA element could hinder the use of carbapenems. In this study, bla(SME-1) was determined to impart a fitness cost upon Escherichia coli in multiple genetic contexts and assays. Genetic screens and designed SME-1 mutants were utilized to identify the source of this fitness cost. These experiments established that the SME-1 protein was required for the fitness cost but also that the enzyme activity of SME-1 was not associated with the fitness cost. The genetic screens suggested that the SME-1 signal sequence was involved in the fitness cost. Consistent with these findings, exchange of the SME-1 signal sequence for the TEM-1 signal sequence alleviated the fitness cost while replacing the TEM-1 signal sequence with the SME-1 signal sequence imparted a fitness cost to TEM-1 beta-lactamase. Taken together, these results suggest that fitness costs associated with some beta-lactamases may limit their dissemination.
A Fitness Cost Associated With the Antibiotic Resistance Enzyme SME-1 β-Lactamase
Marciano, David C.; Karkouti, Omid Y.; Palzkill, Timothy
2007-01-01
The blaTEM-1 β-lactamase gene has become widespread due to the selective pressure of β-lactam use and its stable maintenance on transferable DNA elements. In contrast, blaSME-1 is rarely isolated and is confined to the chromosome of carbapenem-resistant Serratia marcescens strains. Dissemination of blaSME-1 via transfer to a mobile DNA element could hinder the use of carbapenems. In this study, blaSME-1 was determined to impart a fitness cost upon Escherichia coli in multiple genetic contexts and assays. Genetic screens and designed SME-1 mutants were utilized to identify the source of this fitness cost. These experiments established that the SME-1 protein was required for the fitness cost but also that the enzyme activity of SME-1 was not associated with the fitness cost. The genetic screens suggested that the SME-1 signal sequence was involved in the fitness cost. Consistent with these findings, exchange of the SME-1 signal sequence for the TEM-1 signal sequence alleviated the fitness cost while replacing the TEM-1 signal sequence with the SME-1 signal sequence imparted a fitness cost to TEM-1 β-lactamase. Taken together, these results suggest that fitness costs associated with some β-lactamases may limit their dissemination. PMID:17565956
Neupauerová, Jana; Grečmalová, Dagmar; Seeman, Pavel; Laššuthová, Petra
2016-05-01
We describe a patient with early onset severe axonal Charcot-Marie-Tooth disease (CMT2) with dominant inheritance, in whom Sanger sequencing failed to detect a mutation in the mitofusin 2 (MFN2) gene because of a single nucleotide polymorphism (rs2236057) under the PCR primer sequence. The severe early onset phenotype and the family history with severely affected mother (died after delivery) was very suggestive of CMT2A and this suspicion was finally confirmed by a MFN2 mutation. The mutation p.His361Tyr was later detected in the patient by massively parallel sequencing with a gene panel for hereditary neuropathies. According to this information, new primers for amplification and sequencing were designed which bind away from the polymorphic sites of the patient's DNA. Sanger sequencing with these new primers then confirmed the heterozygous mutation in the MFN2 gene in this patient. This case report shows that massively parallel sequencing may in some rare cases be more sensitive than Sanger sequencing and highlights the importance of accurate primer design which requires special attention. © 2016 John Wiley & Sons Ltd/University College London.
Multiplex De Novo Sequencing of Peptide Antibiotics
NASA Astrophysics Data System (ADS)
Mohimani, Hosein; Liu, Wei-Ting; Yang, Yu-Liang; Gaudêncio, Susana P.; Fenical, William; Dorrestein, Pieter C.; Pevzner, Pavel A.
Proliferation of drug-resistant diseases raises the challenge of searching for new, more efficient antibiotics. Currently, some of the most effective antibiotics (i.e., Vancomycin and Daptomycin) are cyclic peptides produced by non-ribosomal biosynthetic pathways. The isolation and sequencing of cyclic peptide antibiotics, unlike the same activity with linear peptides, is time-consuming and error-prone. The dominant technique for sequencing cyclic peptides is NMR-based and requires large amounts (milligrams) of purified materials that, for most compounds, are not possible to obtain. Given these facts, there is a need for new tools to sequence cyclic NRPs using picograms of material. Since nearly all cyclic NRPs are produced along with related analogs, we develop a mass spectrometry approach for sequencing all related peptides at once (in contrast to the existing approach that analyzes individual peptides). Our results suggest that instead of attempting to isolate and NMR-sequence the most abundant compound, one should acquire spectra of many related compounds and sequence all of them simultaneously using tandem mass spectrometry. We illustrate applications of this approach by sequencing new variants of cyclic peptide antibiotics from Bacillus brevis, as well as sequencing a previously unknown familiy of cyclic NRPs produced by marine bacteria.
Fisher, R P; Topper, J N; Clayton, D A
1987-07-17
Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.
Nagano, Yukio; Furuhashi, Hirofumi; Inaba, Takehito; Sasaki, Yukiko
2001-01-01
Complementary DNA encoding a DNA-binding protein, designated PLATZ1 (plant AT-rich sequence- and zinc-binding protein 1), was isolated from peas. The amino acid sequence of the protein is similar to those of other uncharacterized proteins predicted from the genome sequences of higher plants. However, no paralogous sequences have been found outside the plant kingdom. Multiple alignments among these paralogous proteins show that several cysteine and histidine residues are invariant, suggesting that these proteins are a novel class of zinc-dependent DNA-binding proteins with two distantly located regions, C-x2-H-x11-C-x2-C-x(4–5)-C-x2-C-x(3–7)-H-x2-H and C-x2-C-x(10–11)-C-x3-C. In an electrophoretic mobility shift assay, the zinc chelator 1,10-o-phenanthroline inhibited DNA binding, and two distant zinc-binding regions were required for DNA binding. A protein blot with 65ZnCl2 showed that both regions are required for zinc-binding activity. The PLATZ1 protein non-specifically binds to A/T-rich sequences, including the upstream region of the pea GTPase pra2 and plastocyanin petE genes. Expression of the PLATZ1 repressed those of the reporter constructs containing the coding sequence of luciferase gene driven by the cauliflower mosaic virus (CaMV) 35S90 promoter fused to the tandem repeat of the A/T-rich sequences. These results indicate that PLATZ1 is a novel class of plant-specific zinc-dependent DNA-binding protein responsible for A/T-rich sequence-mediated transcriptional repression. PMID:11600698
Singh, Pushpendra; Benjak, Andrej; Schuenemann, Verena J.; Herbig, Alexander; Avanzi, Charlotte; Busso, Philippe; Nieselt, Kay; Krause, Johannes; Vera-Cabrera, Lucio; Cole, Stewart T.
2015-01-01
Mycobacterium lepromatosis is an uncultured human pathogen associated with diffuse lepromatous leprosy and a reactional state known as Lucio's phenomenon. By using deep sequencing with and without DNA enrichment, we obtained the near-complete genome sequence of M. lepromatosis present in a skin biopsy from a Mexican patient, and compared it with that of Mycobacterium leprae, which has undergone extensive reductive evolution. The genomes display extensive synteny and are similar in size (∼3.27 Mb). Protein-coding genes share 93% nucleotide sequence identity, whereas pseudogenes are only 82% identical. The events that led to pseudogenization of 50% of the genome likely occurred before divergence from their most recent common ancestor (MRCA), and both M. lepromatosis and M. leprae have since accumulated new pseudogenes or acquired specific deletions. Functional comparisons suggest that M. lepromatosis has lost several enzymes required for amino acid synthesis whereas M. leprae has a defective heme pathway. M. lepromatosis has retained all functions required to infect the Schwann cells of the peripheral nervous system and therefore may also be neuropathogenic. A phylogeographic survey of 227 leprosy biopsies by differential PCR revealed that 221 contained M. leprae whereas only six, all from Mexico, harbored M. lepromatosis. Phylogenetic comparisons indicate that M. lepromatosis is closer than M. leprae to the MRCA, and a Bayesian dating analysis suggests that they diverged from their MRCA approximately 13.9 Mya. Thus, despite their ancient separation, the two leprosy bacilli are remarkably conserved and still cause similar pathologic conditions. PMID:25831531
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
2011-01-01
Background We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. Results The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. Conclusions BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. PMID:21496274
Does order matter? Investigating the effect of sequence on glance duration during on-road driving
Roberts, Shannon C.; Reimer, Bryan; Mehler, Bruce
2017-01-01
Previous literature has shown that vehicle crash risks increases as drivers’ off-road glance duration increases. Many factors influence drivers’ glance duration such as individual differences, driving environment, or task characteristics. Theories and past studies suggest that glance duration increases as the task progresses, but the exact relationship between glance sequence and glance durations is not fully understood. The purpose of this study was to examine the effect of glance sequence on glance duration among drivers completing a visual-manual radio tuning task and an auditory-vocal based multi-modal navigation entry task. Eighty participants drove a vehicle on urban highways while completing radio tuning and navigation entry tasks. Forty participants drove under an experimental protocol that required three button presses followed by rotation of a tuning knob to complete the radio tuning task while the other forty participants completed the task with one less button press. Multiple statistical analyses were conducted to measure the effect of glance sequence on glance duration. Results showed that across both tasks and a variety of statistical tests, glance sequence had inconsistent effects on glance duration—the effects varied according to the number of glances, task type, and data set that was being evaluated. Results suggest that other aspects of the task as well as interface design effect glance duration and should be considered in the context of examining driver attention or lack thereof. All in all, interface design and task characteristics have a more influential impact on glance duration than glance sequence, suggesting that classical design considerations impacting driver attention, such as the size and location of buttons, remain fundamental in designing in-vehicle interfaces. PMID:28158301
Detecting and Characterizing Repeating Earthquake Sequences During Volcanic Eruptions
NASA Astrophysics Data System (ADS)
Tepp, G.; Haney, M. M.; Wech, A.
2017-12-01
A major challenge in volcano seismology is forecasting eruptions. Repeating earthquake sequences often precede volcanic eruptions or lava dome activity, providing an opportunity for short-term eruption forecasting. Automatic detection of these sequences can lead to timely eruption notification and aid in continuous monitoring of volcanic systems. However, repeating earthquake sequences may also occur after eruptions or along with magma intrusions that do not immediately lead to an eruption. This additional challenge requires a better understanding of the processes involved in producing these sequences to distinguish those that are precursory. Calculation of the inverse moment rate and concepts from the material failure forecast method can lead to such insights. The temporal evolution of the inverse moment rate is observed to differ for precursory and non-precursory sequences, and multiple earthquake sequences may occur concurrently. These observations suggest that sequences may occur in different locations or through different processes. We developed an automated repeating earthquake sequence detector and near real-time alarm to send alerts when an in-progress sequence is identified. Near real-time inverse moment rate measurements can further improve our ability to forecast eruptions by allowing for characterization of sequences. We apply the detector to eruptions of two Alaskan volcanoes: Bogoslof in 2016-2017 and Redoubt Volcano in 2009. The Bogoslof eruption produced almost 40 repeating earthquake sequences between its start in mid-December 2016 and early June 2017, 21 of which preceded an explosive eruption, and 2 sequences in the months before eruptive activity. Three of the sequences occurred after the implementation of the alarm in late March 2017 and successfully triggered alerts. The nearest seismometers to Bogoslof are over 45 km away, requiring a detector that can work with few stations and a relatively low signal-to-noise ratio. During the Redoubt eruption, earthquake sequences were observed in the months leading up to the eruptive activity beginning in March 2009 as well as immediately preceding 7 of the 19 explosive events. In contrast to Bogoslof, Redoubt has a local monitoring network which allows for better detection and more detailed analysis of the repeating earthquake sequences.
Grohmann, L; Brennicke, A; Schuster, W
1992-01-01
The Oenothera mitochondrial genome contains only a gene fragment for ribosomal protein S12 (rps12), while other plants encode a functional gene in the mitochondrion. The complete Oenothera rps12 gene is located in the nucleus. The transit sequence necessary to target this protein to the mitochondrion is encoded by a 5'-extension of the open reading frame. Comparison of the amino acid sequence encoded by the nuclear gene with the polypeptides encoded by edited mitochondrial cDNA and genomic sequences of other plants suggests that gene transfer between mitochondrion and nucleus started from edited mitochondrial RNA molecules. Mechanisms and requirements of gene transfer and activation are discussed. Images PMID:1454526
The Discovery of an Eccentric Millisecond Pulsar in the Galactic Plane
NASA Astrophysics Data System (ADS)
Champion, David J.; Ransom, Scott M.; Lazarus, Patrick; Camilo, Fernando; Kaspi, Victoria M.; Nice, David J.; Freire, Paulo C. C.; Cordes, James M.; Hessels, Jason W. T.; Bassa, Cees; Lorimer, Duncan R.; Stairs, Ingrid H.; van Leeuwen, Joeri; Arzoumnian, Zaven; Backer, Don C.; Bhat, N. D. Ramesh; Chatterjee, Shami; Crawford, Fronefield; Deneva, Julia S.; Faucher-Giguère, Claude-André; Gaensler, B. M.; Han, Jinlin; Jenet, Fredrick A.; Kasian, Laura; Kondratiev, Vlad I.; Kramer, Michael; Lazio, Joseph; McLaughlin, Maura A.; Stappers, Ben W.; Venkataraman, Arun; Vlemmings, Wouter
2008-02-01
The evolution of binary systems is governed by their orbital properties and the stellar density of the local environment. Studies of neutron stars in binary star systems offer unique insights into both these issues. In an Arecibo survey of the Galactic disk, we have found PSR J1903+0327, a radio emitting neutron star (a ``pulsar'') with a 2.15 ms rotation period, in a 95-day orbit around a massive companion. Observations in the infra-red suggests that the companion may be a main-sequence star. Theories requiring an origin in the Galactic disk cannot account for the extraordinarily high orbital eccentricity observed (0.44) or a main-sequence companion of a pulsar that has spin properties suggesting a prolonged accretion history. The most likely formation mechanism is an exchange interaction in a globular star cluster. This requires that the binary was either ejected from its parent globular cluster as a result of a three-body interaction, or that that cluster was disrupted by repeated passages through the disk of the Milky Way.
Sammond, Deanne W.; Payne, Christina M.; Brunecky, Roman; Himmel, Michael E.; Crowley, Michael F.; Beckham, Gregg T.
2012-01-01
Cellulase enzymes deconstruct cellulose to glucose, and are often comprised of glycosylated linkers connecting glycoside hydrolases (GHs) to carbohydrate-binding modules (CBMs). Although linker modifications can alter cellulase activity, the functional role of linkers beyond domain connectivity remains unknown. Here we investigate cellulase linkers connecting GH Family 6 or 7 catalytic domains to Family 1 or 2 CBMs, from both bacterial and eukaryotic cellulases to identify conserved characteristics potentially related to function. Sequence analysis suggests that the linker lengths between structured domains are optimized based on the GH domain and CBM type, such that linker length may be important for activity. Longer linkers are observed in eukaryotic GH Family 6 cellulases compared to GH Family 7 cellulases. Bacterial GH Family 6 cellulases are found with structured domains in either N to C terminal order, and similar linker lengths suggest there is no effect of domain order on length. O-glycosylation is uniformly distributed across linkers, suggesting that glycans are required along entire linker lengths for proteolysis protection and, as suggested by simulation, for extension. Sequence comparisons show that proline content for bacterial linkers is more than double that observed in eukaryotic linkers, but with fewer putative O-glycan sites, suggesting alternative methods for extension. Conversely, near linker termini where linkers connect to structured domains, O-glycosylation sites are observed less frequently, whereas glycines are more prevalent, suggesting the need for flexibility to achieve proper domain orientations. Putative N-glycosylation sites are quite rare in cellulase linkers, while an N-P motif, which strongly disfavors the attachment of N-glycans, is commonly observed. These results suggest that linkers exhibit features that are likely tailored for optimal function, despite possessing low sequence identity. This study suggests that cellulase linkers may exhibit function in enzyme action, and highlights the need for additional studies to elucidate cellulase linker functions. PMID:23139804
Liu, Q; Astell, C R
1996-10-01
During replication of the minute virus of mice (MVM) genome, a dimer replicative form (RF) intermediate is resolved into two monomer RF molecules in such a way as to retain a unique sequence within the left hand hairpin terminus of the viral genome. Although the proposed mechanism for resolution of the dimer RF remains uncertain, it likely involves site-specific nicking of the dimer bridge. The RF contains two double-stranded copies of the viral genome joined by the extended 3' hairpin. Minor sequence asymmetries within the 3' hairpin allow the two halves of the dimer bridge to be distinguished. The A half contains the sequence [sequence: see text], whereas the B half contains the sequence [sequence: see text]. Using an in vitro assay, we show that only the B half of the MVM dimer bridge is nicked site-specifically when incubated with crude NS-1 protein (expressed in insect cells) and mouse LA9 cellular extract. When highly purified NS-1, the major nonstructural protein of MVM, is used in this nicking reaction, there is an absolute requirement for the LA9 cellular extract, suggesting a cellular factor (or factors) is (are) required. A series of mutations were created in the putative host factor binding region (HFBR) on the B half of the MVM dimer bridge adjacent to the NS-1 binding site. Nicking assays of these B half mutants showed that two CG motifs displaced by 10 nucleotides are important for nicking. Gel mobility shift assays demonstrated that a host factor(s) can bind to the HFBR of the B half of the dimer bridge and efficient binding depends on the presence of both CG motifs. Competitor DNA containing the wild-type HFBR sequence is able to specifically inhibit nicking of the B half, indicating that the host factor(s) bound to the HFBR is(are) essential for site-specific nicking to occur.
Kamenova, Ivanka; Warfield, Linda
2014-01-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. PMID:24865972
Kamenova, Ivanka; Warfield, Linda; Hahn, Steven
2014-08-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Translational control of Nrf2 within the open reading frame
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perez-Leal, Oscar, E-mail: operez@temple.edu; Barrero, Carlos A.; Merali, Salim, E-mail: smerali@temple.edu
2013-07-19
Highlights: •Identification of a novel Nrf2 translational repression mechanism. •The repressor is within the 3′ portion of the Nrf2 ORF. •The translation of Nrf2 or eGFP is reduced by the regulatory element. •The translational repression can be reversed with synonymous codon substitutions. •The molecular mechanism requires the mRNA sequence, but not the encoded amino acids. -- Abstract: Nuclear Factor Erythroid 2-Related Factor 2 (Nrf2) is a transcription factor that is essential for the regulation of an effective antioxidant and detoxifying response. The regulation of its activity can occur at transcription, translation and post-translational levels. Evidence suggests that under environmental stressmore » conditions, new synthesis of Nrf2 is required – a process that is regulated by translational control and is not fully understood. Here we described the identification of a novel molecular process that under basal conditions strongly represses the translation of Nrf2 within the open reading frame (ORF). This mechanism is dependent on the mRNA sequence within the 3′ portion of the ORF of Nrf2 but not in the encoded amino acid sequence. The Nrf2 translational repression can be reversed with the use of synonymous codon substitutions. This discovery suggests an additional layer of control to explain the reason for the low Nrf2 concentration under quiescent state.« less
Fault Tree Based Diagnosis with Optimal Test Sequencing for Field Service Engineers
NASA Technical Reports Server (NTRS)
Iverson, David L.; George, Laurence L.; Patterson-Hine, F. A.; Lum, Henry, Jr. (Technical Monitor)
1994-01-01
When field service engineers go to customer sites to service equipment, they want to diagnose and repair failures quickly and cost effectively. Symptoms exhibited by failed equipment frequently suggest several possible causes which require different approaches to diagnosis. This can lead the engineer to follow several fruitless paths in the diagnostic process before they find the actual failure. To assist in this situation, we have developed the Fault Tree Diagnosis and Optimal Test Sequence (FTDOTS) software system that performs automated diagnosis and ranks diagnostic hypotheses based on failure probability and the time or cost required to isolate and repair each failure. FTDOTS first finds a set of possible failures that explain exhibited symptoms by using a fault tree reliability model as a diagnostic knowledge to rank the hypothesized failures based on how likely they are and how long it would take or how much it would cost to isolate and repair them. This ordering suggests an optimal sequence for the field service engineer to investigate the hypothesized failures in order to minimize the time or cost required to accomplish the repair task. Previously, field service personnel would arrive at the customer site and choose which components to investigate based on past experience and service manuals. Using FTDOTS running on a portable computer, they can now enter a set of symptoms and get a list of possible failures ordered in an optimal test sequence to help them in their decisions. If facilities are available, the field engineer can connect the portable computer to the malfunctioning device for automated data gathering. FTDOTS is currently being applied to field service of medical test equipment. The techniques are flexible enough to use for many different types of devices. If a fault tree model of the equipment and information about component failure probabilities and isolation times or costs are available, a diagnostic knowledge base for that device can be developed easily.
Using structure to explore the sequence alignment space of remote homologs.
Kuziemko, Andrew; Honig, Barry; Petrey, Donald
2011-10-01
Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is "optimal" in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are "suboptimal" in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for "modelability", we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended.
To What Extent Can Motor Imagery Replace Motor Execution While Learning a Fine Motor Skill?
Sobierajewicz, Jagna; Szarkiewicz, Sylwia; Przekoracka-Krawczyk, Anna; Jaśkowski, Wojciech; van der Lubbe, Rob
2016-01-01
Motor imagery is generally thought to share common mechanisms with motor execution. In the present study, we examined to what extent learning a fine motor skill by motor imagery may substitute physical practice. Learning effects were assessed by manipulating the proportion of motor execution and motor imagery trials. Additionally, learning effects were compared between participants with an explicit motor imagery instruction and a control group. A Go/NoGo discrete sequence production (DSP) task was employed, wherein a five-stimulus sequence presented on each trial indicated the required sequence of finger movements after a Go signal. In the case of a NoGo signal, participants either had to imagine carrying out the response sequence (the motor imagery group), or the response sequence had to be withheld (the control group). Two practice days were followed by a final test day on which all sequences had to be executed. Learning effects were assessed by computing response times (RTs) and the percentages of correct responses (PCs). The electroencephalogram (EEG ) was additionally measured on this test day to examine whether motor preparation and the involvement of visual short term memory (VST M) depended on the amount of physical/mental practice. Accuracy data indicated strong learning effects. However, a substantial amount of physical practice was required to reach an optimal speed. EEG results suggest the involvement of VST M for sequences that had less or no physical practice in both groups. The absence of differences between the motor imagery and the control group underlines the possibility that motor preparation may actually resemble motor imagery. PMID:28154614
To What Extent Can Motor Imagery Replace Motor Execution While Learning a Fine Motor Skill?
Sobierajewicz, Jagna; Szarkiewicz, Sylwia; Przekoracka-Krawczyk, Anna; Jaśkowski, Wojciech; van der Lubbe, Rob
2016-01-01
Motor imagery is generally thought to share common mechanisms with motor execution. In the present study, we examined to what extent learning a fine motor skill by motor imagery may substitute physical practice. Learning effects were assessed by manipulating the proportion of motor execution and motor imagery trials. Additionally, learning effects were compared between participants with an explicit motor imagery instruction and a control group. A Go/NoGo discrete sequence production (DSP) task was employed, wherein a five-stimulus sequence presented on each trial indicated the required sequence of finger movements after a Go signal. In the case of a NoGo signal, participants either had to imagine carrying out the response sequence (the motor imagery group), or the response sequence had to be withheld (the control group). Two practice days were followed by a final test day on which all sequences had to be executed. Learning effects were assessed by computing response times (RTs) and the percentages of correct responses (PCs). The electroencephalogram (EEG ) was additionally measured on this test day to examine whether motor preparation and the involvement of visual short term memory (VST M) depended on the amount of physical/mental practice. Accuracy data indicated strong learning effects. However, a substantial amount of physical practice was required to reach an optimal speed. EEG results suggest the involvement of VST M for sequences that had less or no physical practice in both groups. The absence of differences between the motor imagery and the control group underlines the possibility that motor preparation may actually resemble motor imagery.
Evaluation of electrical test conditions in MIL-M-38510 slash sheets
NASA Astrophysics Data System (ADS)
Sandgren, K.
1980-08-01
Adequacy of MIL-M-38510 slash sheet requirements for electrical test conditions in an automated test environment were evaluated. Military temperature range commercial devices of 13 types from 6 manufacturers were purchased. Software for testing these devices and for varying the test conditions was written for the Tektronix S-3260 test system. The devices were tested to evaluate the effects of pin-condition settling time, measurement sequence of the same and different D-C parameters, temperature sequence, differently defined temperature ambients, variable measurement conditions, sequence of time measurements, pin-application sequence, and undesignated pin condition ambiguity. An alternative to current tri-state enable and disable time measurements is proposed; S-3260 'open' and 'ground' conditions are characterized; and suggestions for changes in MIL-M-38510 slash sheet specifications and MIL-STD-883 test methods are proposed, both to correct errors and ambiguities and to facilitate the gathering of repeatable data on automated test equipment. Data obtained showed no sensitivity to measurement or temperature sequence nor to temperature ambient, provided that test times were not excessive. V sub ICP tests and some low current measurements required allowance for a pin condition settling time because of the test system speed. Some pin condition application sequences yielded incorrect measurements. Undefined terminal conditions of output pins were found to affect I sub OS and propagation delay time measurements. Truth table test results varied with test frequency and V sub IL for low-power Schottky devices.
Chromosome rearrangements via template switching between diverged repeated sequences
Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.
2014-01-01
Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
Green, Stefan J.; Rishishwar, Lavanya; Prakash, Om; Katz, Lee S.; Mariño-Ramírez, Leonardo; Jordan, I. King; Munk, Christine; Ivanova, Natalia; Mikhailova, Natalia; Watson, David B.; Brown, Steven D.; Palumbo, Anthony V.; Brooks, Scott C.
2012-01-01
We report the first genome sequences for six strains of Rhodanobacter species isolated from a variety of soil and subsurface environments. Three of these strains are capable of complete denitrification and three others are not. However, all six strains contain most of the genes required for the respiration of nitrate to gaseous nitrogen. The nondenitrifying members of the genus lack only the gene for nitrate reduction, the first step in the full denitrification pathway. The data suggest that the environmental role of bacteria from the genus Rhodanobacter should be reevaluated. PMID:22843592
NASA Astrophysics Data System (ADS)
Newman, Joan T.
Any change, particularly on a large scale like a sequence change in a district with 75,000 students, is difficult. However, with the advent of the new TAKS science test and the new requirements for high school graduation in the state of Texas, educators and students alike are engaged in innovative educational approaches to meet these requirements. This study investigated a different, non-traditional science sequence to investigate relationships among secondary core-science course sequencing, student science-reasoning performance, and classroom pedagogy. The methodology adopted in the study led to a deeper understanding of the successes and challenges faced by teachers in teaching conceptual physics and chemistry to 8 th and 9th grade students. The qualitative analysis suggested a difference in pedagogy employed by middle and high school science teachers and a need for secondary science teachers to enhance their content knowledge and pedagogical skills, as well as change their underlying attitudes and beliefs about the abilities of students. The study examined scores of 495 randomly chosen students following three different matriculation patterns within one large independent school district. The study indicated that students who follow a sequence with 9th grade IPC generally increase their science-reasoning skills as demonstrated on the 10th grade TAKS science test when these scores are compared with those of students who do not have 9th grade IPC in the science sequence.
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2011 CFR
2011-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2013 CFR
2013-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2012 CFR
2012-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2010 CFR
2010-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2014 CFR
2014-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
Sequences show rapid motor transfer and spatial translation in the oculomotor system.
Stainer, Matthew J; Carpenter, R H S; Brotchie, Peter; Anderson, Andrew J
2016-07-01
Every day we perform learnt sequences of actions that seem to happen almost without awareness. It has been argued that for learning such sequences parallel learning networks exist - one using spatial coordinates and one using motor coordinates - with sequence acquisition involving a progressive shift from the former to the latter as a sequence is rehearsed. When sequences are interrupted by an out-of-sequence target, there is a delay in the response to the target, and so here we transiently interrupt oculomotor sequences to probe the influence of oculomotor rehearsal and spatial coordinates in sequence acquisition. For our main experiments, we used a repeating sequences of eight targets in length that was first learnt either using saccadic eye movements (left/right), manual responses (left/right or up/down) or as a sequence of colour (blue/red) requiring no motor response. The sequence was immediately repeated for saccadic eye movements, during which the influence of on out-of-sequence target (an interruption) was assessed. When a sequence is learnt beforehand in an abstract way (for example, as a sequence of colours or of orthogonally mapped manual responses), interruptions are immediately disruptive to latency, suggesting neither motor rehearsal nor specific spatial coordinates are essential for encoding sequences of actions and that sequences - no matter how they are encoded - can be rapidly translated into oculomotor coordinates. The magnitude of a disruption does, however, correspond to how well a sequence is learnt: introducing an interruption to an extended sequence before it was reliably learnt reduces the magnitude of the latency disruption. Copyright © 2016 Elsevier Ltd. All rights reserved.
Diversity of phytases in the rumen.
Nakashima, Brenda A; McAllister, Tim A; Sharma, Ranjana; Selinger, L Brent
2007-01-01
Examples of a new class of phytase related to protein tyrosine phosphatases (PTP) were recently isolated from several anaerobic bacteria from the rumen of cattle. In this study, the diversity of PTP-like phytase gene sequences in the rumen was surveyed by using the polymerase chain reaction (PCR). Two sets of degenerate primers were used to amplify sequences from rumen fluid total community DNA and genomic DNA from nine bacterial isolates. Four novel PTP-like phytase sequences were retrieved from rumen fluid, whereas all nine of the anaerobic bacterial isolates investigated in this work contained PTP-like phytase sequences. One isolate, Selenomonas lacticifex, contained two distinct PTP-like phytase sequences, suggesting that multiple phytate hydrolyzing enzymes are present in this bacterium. The degenerate primer and PCR conditions described here, as well as novel sequences obtained in this study, will provide a valuable resource for future studies on this new class of phytase. The observed diversity of microbial phytases in the rumen may account for the ability of ruminants to derive a significant proportion of their phosphorus requirements from phytate.
Unusually long-lived pause required for regulation of a Rho-dependent transcription terminator.
Hollands, Kerry; Sevostiyanova, Anastasia; Groisman, Eduardo A
2014-05-13
Up to half of all transcription termination events in bacteria rely on the RNA-dependent helicase Rho. However, the nucleic acid sequences that promote Rho-dependent termination remain poorly characterized. Defining the molecular determinants that confer Rho-dependent termination is especially important for understanding how such terminators can be regulated in response to specific signals. Here, we identify an extraordinarily long-lived pause at the site where Rho terminates transcription in the 5'-leader region of the Mg(2+) transporter gene mgtA in Salmonella enterica. We dissect the sequence elements required for prolonged pausing in the mgtA leader and establish that the remarkable longevity of this pause is required for a riboswitch to stimulate Rho-dependent termination in the mgtA leader region in response to Mg(2+) availability. Unlike Rho-dependent terminators described previously, where termination occurs at multiple pause sites, there is a single site of transcription termination directed by Rho in the mgtA leader. Our data suggest that Rho-dependent termination events that are subject to regulation may require elements distinct from those operating at constitutive Rho-dependent terminators.
Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation
Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki
2014-01-01
The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments. PMID:24865297
Motor cortex is required for learning but not for executing a motor skill.
Kawai, Risa; Markman, Timothy; Poddar, Rajesh; Ko, Raymond; Fantana, Antoniu L; Dhawale, Ashesh K; Kampff, Adam R; Ölveczky, Bence P
2015-05-06
Motor cortex is widely believed to underlie the acquisition and execution of motor skills, but its contributions to these processes are not fully understood. One reason is that studies on motor skills often conflate motor cortex's established role in dexterous control with roles in learning and producing task-specific motor sequences. To dissociate these aspects, we developed a motor task for rats that trains spatiotemporally precise movement patterns without requirements for dexterity. Remarkably, motor cortex lesions had no discernible effect on the acquired skills, which were expressed in their distinct pre-lesion forms on the very first day of post-lesion training. Motor cortex lesions prior to training, however, rendered rats unable to acquire the stereotyped motor sequences required for the task. These results suggest a remarkable capacity of subcortical motor circuits to execute learned skills and a previously unappreciated role for motor cortex in "tutoring" these circuits during learning. Copyright © 2015 Elsevier Inc. All rights reserved.
Chutiwitoonchai, Nopporn; Kakisaka, Michinori; Yamada, Kazunori; Aida, Yoko
2014-01-01
The assembly of influenza virus progeny virions requires machinery that exports viral genomic ribonucleoproteins from the cell nucleus. Currently, seven nuclear export signal (NES) consensus sequences have been identified in different viral proteins, including NS1, NS2, M1, and NP. The present study examined the roles of viral NES consensus sequences and their significance in terms of viral replication and nuclear export. Mutation of the NP-NES3 consensus sequence resulted in a failure to rescue viruses using a reverse genetics approach, whereas mutation of the NS2-NES1 and NS2-NES2 sequences led to a strong reduction in viral replication kinetics compared with the wild-type sequence. While the viral replication kinetics for other NES mutant viruses were also lower than those of the wild-type, the difference was not so marked. Immunofluorescence analysis after transient expression of NP-NES3, NS2-NES1, or NS2-NES2 proteins in host cells showed that they accumulated in the cell nucleus. These results suggest that the NP-NES3 consensus sequence is mostly required for viral replication. Therefore, each of the hydrophobic (Φ) residues within this NES consensus sequence (Φ1, Φ2, Φ3, or Φ4) was mutated, and its viral replication and nuclear export function were analyzed. No viruses harboring NP-NES3 Φ2 or Φ3 mutants could be rescued. Consistent with this, the NP-NES3 Φ2 and Φ3 mutants showed reduced binding affinity with CRM1 in a pull-down assay, and both accumulated in the cell nucleus. Indeed, a nuclear export assay revealed that these mutant proteins showed lower nuclear export activity than the wild-type protein. Moreover, the Φ2 and Φ3 residues (along with other Φ residues) within the NP-NES3 consensus were highly conserved among different influenza A viruses, including human, avian, and swine. Taken together, these results suggest that the Φ2 and Φ3 residues within the NP-NES3 protein are important for its nuclear export function during viral replication.
Felsenstein, K M; Goff, S P
1992-01-01
The gag-pol polyprotein of the murine and feline leukemia viruses is expressed by translational readthrough of a UAG terminator codon at the 3' end of the gag gene. To explore the cis-acting sequence requirements for the readthrough event in vivo, we generated a library of mutants of the Moloney murine leukemia virus with point mutations near the terminator codon and tested the mutant viral DNAs for the ability to direct synthesis of the gag-pol fusion protein and formation of infectious virus. The analysis showed that sequences 3' to the terminator are necessary and sufficient for the process. The results do not support a role for one proposed stem-loop structure that includes the terminator but are consistent with the involvement of another stem-loop 3' to the terminator. One mutant, containing two compensatory changes in this stem structure, was temperature sensitive for replication and for formation of the gag-pol protein. The results suggest that RNA sequence and structure are critical determinants of translational readthrough in vivo. Images PMID:1404606
78 FR 76533 - New Mailing Standards for Domestic Mailing Services Products
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-18
... sequencing system (FSS), a mailer association suggested to change the required minimum for an FSS scheme... must make FSS scheme or FSS facility pallets when the quantity reaches 250 pounds, so that minimum is... Standard Mail, FSS scheme pallets may be entered at origin, or at DNDC, DSCF, or DFSS entry points, but the...
Phenotypic Checkpoints Regulate Neuronal Development
Ben-Ari, Yehezkel; Spitzer, Nicholas C.
2010-01-01
Nervous system development proceeds by sequential gene expression mediated by cascades of transcription factors in parallel with sequences of patterned network activity driven by receptors and ion channels. These sequences are cell type- and developmental stage-dependent and modulated by paracrine actions of substances released by neurons and glia. How and to what extent these sequences interact to enable neuronal network development is not understood. Recent evidence demonstrates that CNS development requires intermediate stages of differentiation providing functional feedback that influences gene expression. We suggest that embryonic neuronal functions constitute a series of phenotypic checkpoint signatures; neurons failing to express these functions are delayed or developmentally arrested. Such checkpoints are likely to be a general feature of neuronal development and may constitute presymptomatic signatures of neurological disorders when they go awry. PMID:20864191
Erb, Laurie; Liu, Jun; Ockerhausen, Jonathan; Kong, Qiongman; Garrad, Richard C.; Griffin, Korey; Neal, Chris; Krugh, Brent; Santiago-Pérez, Laura I.; González, Fernando A.; Gresham, Hattie D.; Turner, John T.; Weisman, Gary A.
2001-01-01
The P2Y2 nucleotide receptor (P2Y2R) contains the integrin-binding domain arginine-glycine-aspartic acid (RGD) in its first extracellular loop, raising the possibility that this G protein–coupled receptor interacts directly with an integrin. Binding of a peptide corresponding to the first extracellular loop of the P2Y2R to K562 erythroleukemia cells was inhibited by antibodies against αVβ3/β5 integrins and the integrin-associated thrombospondin receptor, CD47. Immunofluorescence of cells transfected with epitope-tagged P2Y2Rs indicated that αV integrins colocalized 10-fold better with the wild-type P2Y2R than with a mutant P2Y2R in which the RGD sequence was replaced with RGE. Compared with the wild-type P2Y2R, the RGE mutant required 1,000-fold higher agonist concentrations to phosphorylate focal adhesion kinase, activate extracellular signal–regulated kinases, and initiate the PLC-dependent mobilization of intracellular Ca2+. Furthermore, an anti-αV integrin antibody partially inhibited these signaling events mediated by the wild-type P2Y2R. Pertussis toxin, an inhibitor of Gi/o proteins, partially inhibited Ca2+ mobilization mediated by the wild-type P2Y2R, but not by the RGE mutant, suggesting that the RGD sequence is required for P2Y2R-mediated activation of Go, but not Gq. Since CD47 has been shown to associate directly with Gi/o family proteins, these results suggest that interactions between P2Y2Rs, integrins, and CD47 may be important for coupling the P2Y2R to Go. PMID:11331301
A purified transcription factor (TIF-IB) binds to essential sequences of the mouse rDNA promoter.
Clos, J; Buttgereit, D; Grummt, I
1986-01-01
A transcription factor that is specific for mouse rDNA has been partially purified from Ehrlich ascites cells. This factor [designated transcription initiation factor (TIF)-IB] is required for accurate in vitro synthesis of mouse rRNA in addition to RNA polymerase I and another regulatory factor, TIF-IA. TIF-IB activity is present in extracts both from growing and nongrowing cells in comparable amounts. Prebinding competition experiments with wild-type and mutant templates suggest that TIF-IB interacts with the core control element of the rDNA promoter, which is located immediately upstream of the initiation site. The specific binding of TIF-IB to the RNA polymerase I promoter is demonstrated by exonuclease III protection experiments. The 3' border of the sequences protected by TIF-IB is shown to be on the coding strand at position -21 and on the noncoding strand at position -7. The results suggest that direct binding of TIF-IB to sequences in the core promoter element is the mechanism by which this factor imparts promoter selectivity to RNA polymerase I. Images PMID:3456157
The space of ultrametric phylogenetic trees.
Gavryushkin, Alex; Drummond, Alexei J
2016-08-21
The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Analysis of Epstein-Barr Virus Genomes and Expression Profiles in Gastric Adenocarcinoma.
Borozan, Ivan; Zapatka, Marc; Frappier, Lori; Ferretti, Vincent
2018-01-15
Epstein-Barr virus (EBV) is a causative agent of a variety of lymphomas, nasopharyngeal carcinoma (NPC), and ∼9% of gastric carcinomas (GCs). An important question is whether particular EBV variants are more oncogenic than others, but conclusions are currently hampered by the lack of sequenced EBV genomes. Here, we contribute to this question by mining whole-genome sequences of 201 GCs to identify 13 EBV-positive GCs and by assembling 13 new EBV genome sequences, almost doubling the number of available GC-derived EBV genome sequences and providing the first non-Asian EBV genome sequences from GC. Whole-genome sequence comparisons of all EBV isolates sequenced to date (85 from tumors and 57 from healthy individuals) showed that most GC and NPC EBV isolates were closely related although American Caucasian GC samples were more distant, suggesting a geographical component. However, EBV GC isolates were found to contain some consistent changes in protein sequences regardless of geographical origin. In addition, transcriptome data available for eight of the EBV-positive GCs were analyzed to determine which EBV genes are expressed in GC. In addition to the expected latency proteins (EBNA1, LMP1, and LMP2A), specific subsets of lytic genes were consistently expressed that did not reflect a typical lytic or abortive lytic infection, suggesting a novel mechanism of EBV gene regulation in the context of GC. These results are consistent with a model in which a combination of specific latent and lytic EBV proteins promotes tumorigenesis. IMPORTANCE Epstein-Barr virus (EBV) is a widespread virus that causes cancer, including gastric carcinoma (GC), in a small subset of individuals. An important question is whether particular EBV variants are more cancer associated than others, but more EBV sequences are required to address this question. Here, we have generated 13 new EBV genome sequences from GC, almost doubling the number of EBV sequences from GC isolates and providing the first EBV sequences from non-Asian GC. We further identify sequence changes in some EBV proteins common to GC isolates. In addition, gene expression analysis of eight of the EBV-positive GCs showed consistent expression of both the expected latency proteins and a subset of lytic proteins that was not consistent with typical lytic or abortive lytic expression. These results suggest that novel mechanisms activate expression of some EBV lytic proteins and that their expression may contribute to oncogenesis. Copyright © 2018 American Society for Microbiology.
Rombel, I T; McMorran, B J; Lamont, I L
1995-02-20
Many bacteria respond to a lack of iron in the environment by synthesizing siderophores, which act as iron-scavenging compounds. Fluorescent pseudomonads synthesize strain-specific but chemically related siderophores called pyoverdines or pseudobactins. We have investigated the mechanisms by which iron controls expression of genes involved in pyoverdine metabolism in Pseudomonas aeruginosa. Transcription of these genes is repressed by the presence of iron in the growth medium. Three promoters from these genes were cloned and the activities of the promoters were dependent on the amounts of iron in the growth media. Two of the promoters were sequenced and the transcriptional start site were identified by S1 nuclease analysis. Sequences similar to the consensus binding site for the Fur repressor protein, which controls expression of iron-repressible genes in several gram-negative species, were not present in the promoters, suggesting that they are unlikely to have a high affinity for Fur. However, comparison of the promoter sequences with those of iron-regulated genes from other Pseudomonas species and also the iron-regulated exotoxin gene of P. aeruginosa allowed identification of a shared sequence element, with the consensus sequence (G/C)CTAAAT-CCC, which is likely to act as a binding site for a transcriptional activator protein. Mutations in this sequence greatly reduced the activities of the promoters characterized here as well as those of other iron-regulated promoters. The requirement for this motif in the promoters of iron-regulated genes of different Pseudomonas species indicates that similar mechanisms are likely to be involved in controlling expression of a range of iron-regulated genes in pseudomonads.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-29
... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
Hayes, Sidney; Horbay, Monique A.; Hayes, Connie
2012-01-01
Several earlier studies have described an unusual exclusion phenotype exhibited by cells with plasmids carrying a portion of the replication region of phage lambda. Cells exhibiting this inhibition phenotype (IP) prevent the plating of homo-immune and hybrid hetero-immune lambdoid phages. We have attempted to define aspects of IP, and show that it is directed to repλ phages. IP was observed in cells with plasmids containing a λ DNA fragment including oop, encoding a short OOP micro RNA, and part of the lambda origin of replication, oriλ, defined by iteron sequences ITN1-4 and an adjacent high AT-rich sequence. Transcription of the intact oop sequence from its promoter, pO is required for IP, as are iterons ITN3–4, but not the high AT-rich portion of oriλ. The results suggest that IP silencing is directed to theta mode replication initiation from an infecting repλ genome, or an induced repλ prophage. Phage mutations suppressing IP, i.e., Sip, map within, or adjacent to cro or in O, or both. Our results for plasmid based IP suggest the hypothesis that there is a natural mechanism for silencing early theta-mode replication initiation, i.e. the buildup of λ genomes with oop + oriλ+ sequence. PMID:22590552
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bahl, C.; Morisseau, C; Bomberger, J
Cystic fibrosis transmembrane conductance regulator (CFTR) inhibitory factor (Cif) is a virulence factor secreted by Pseudomonas aeruginosa that reduces the quantity of CFTR in the apical membrane of human airway epithelial cells. Initial sequence analysis suggested that Cif is an epoxide hydrolase (EH), but its sequence violates two strictly conserved EH motifs and also is compatible with other {alpha}/{beta} hydrolase family members with diverse substrate specificities. To investigate the mechanistic basis of Cif activity, we have determined its structure at 1.8-{angstrom} resolution by X-ray crystallography. The catalytic triad consists of residues Asp129, His297, and Glu153, which are conserved across themore » family of EHs. At other positions, sequence deviations from canonical EH active-site motifs are stereochemically conservative. Furthermore, detailed enzymatic analysis confirms that Cif catalyzes the hydrolysis of epoxide compounds, with specific activity against both epibromohydrin and cis-stilbene oxide, but with a relatively narrow range of substrate selectivity. Although closely related to two other classes of {alpha}/{beta} hydrolase in both sequence and structure, Cif does not exhibit activity as either a haloacetate dehalogenase or a haloalkane dehalogenase. A reassessment of the structural and functional consequences of the H269A mutation suggests that Cif's effect on host-cell CFTR expression requires the hydrolysis of an extended endogenous epoxide substrate.« less
Mammoth and Elephant Phylogenetic Relationships: Mammut Americanum, the Missing Outgroup
Orlando, Ludovic; Hänni, Catherine; Douady, Christophe J.
2007-01-01
At the morphological level, the woolly mammoth has most often been considered as the sister-species of Asian elephants, but at the DNA level, different studies have found support for proximity with African elephants. Recent reports have increased the available sequence data and apparently solved the discrepancy, finding mammoths to be most closely related to Asian elephants. However, we demonstrate here that the three competing topologies have similar likelihood, bayesian and parsimony supports. The analysis further suggests the inadequacy of using Sirenia or Hyracoidea as outgroups. We therefore argue that orthologous sequences from the extinct American mastodon will be required to definitively solve this long-standing question. PMID:19430604
Real-time myocardium segmentation for the assessment of cardiac function variation
NASA Astrophysics Data System (ADS)
Zoehrer, Fabian; Huellebrand, Markus; Chitiboi, Teodora; Oechtering, Thekla; Sieren, Malte; Frahm, Jens; Hahn, Horst K.; Hennemuth, Anja
2017-03-01
Recent developments in MRI enable the acquisition of image sequences with high spatio-temporal resolution. Cardiac motion can be captured without gating and triggering. Image size and contrast relations differ from conventional cardiac MRI cine sequences requiring new adapted analysis methods. We suggest a novel segmentation approach utilizing contrast invariant polar scanning techniques. It has been tested with 20 datasets of arrhythmia patients. The results do not differ significantly more between automatic and manual segmentations than between observers. This indicates that the presented solution could enable clinical applications of real-time MRI for the examination of arrhythmic cardiac motion in the future.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shevtsov, M. B.; Streeter, S. D.; Thresh, S.-J.
2015-02-01
The structure of the new class of controller proteins (exemplified by C.Csp231I) in complex with its 21 bp DNA-recognition sequence is presented, and the molecular basis of sequence recognition in this class of proteins is discussed. An unusual extended spacer between the dimer binding sites suggests a novel interaction between the two C-protein dimers. In a wide variety of bacterial restriction–modification systems, a regulatory ‘controller’ protein (or C-protein) is required for effective transcription of its own gene and for transcription of the endonuclease gene found on the same operon. We have recently turned our attention to a new class ofmore » controller proteins (exemplified by C.Csp231I) that have quite novel features, including a much larger DNA-binding site with an 18 bp (∼60 Å) spacer between the two palindromic DNA-binding sequences and a very different recognition sequence from the canonical GACT/AGTC. Using X-ray crystallography, the structure of the protein in complex with its 21 bp DNA-recognition sequence was solved to 1.8 Å resolution, and the molecular basis of sequence recognition in this class of proteins was elucidated. An unusual aspect of the promoter sequence is the extended spacer between the dimer binding sites, suggesting a novel interaction between the two C-protein dimers when bound to both recognition sites correctly spaced on the DNA. A U-bend model is proposed for this tetrameric complex, based on the results of gel-mobility assays, hydrodynamic analysis and the observation of key contacts at the interface between dimers in the crystal.« less
Potier, M; Dutriaux, A; Orti, R; Groet, J; Gibelin, N; Karadima, G; Lutfalla, G; Lynn, A; Van Broeckhoven, C; Chakravarti, A; Petersen, M; Nizetic, D; Delabar, J; Rossier, J
1998-08-01
Physical mapping across a duplication can be a tour de force if the region is larger than the size of a bacterial clone. This was the case of the 170- to 275-kb duplication present on the long arm of chromosome 21 in normal human at 21q11.1 (proximal region) and at 21q22.1 (distal region), which we described previously. We have constructed sequence-ready contigs of the two copies of the duplication of which all the clones are genuine representatives of one copy or the other. This required the identification of four duplicon polymorphisms that are copy-specific and nonallelic variations in the sequence of the STSs. Thirteen STSs were mapped inside the duplicated region and 5 outside but close to the boundaries. Among these STSs 10 were end clones from YACs, PACs, or cosmids, and the average interval between two markers in the duplicated region was 16 kb. Eight PACs and cosmids showing minimal overlaps were selected in both copies of the duplication. Comparative sequence analysis along the duplication showed three single-basepair changes between the two copies over 659 bp sequenced (4 STSs), suggesting that the duplication is recent (less than 4 mya). Two CpG islands were located in the duplication, but no genes were identified after a 36-kb cosmid from the proximal copy of the duplication was sequenced. The homology of this chromosome 21 duplicated region with the pericentromeric regions of chromosomes 13, 2, and 18 suggests that the mechanism involved is probably similar to pericentromeric-directed mechanisms described in interchromosomal duplications. Copyright 1998 Academic Press.
Comparative Analysis and Distribution of Omega-3 lcPUFA Biosynthesis Genes in Marine Molluscs
Surm, Joachim M.; Prentis, Peter J.; Pavasovic, Ana
2015-01-01
Recent research has identified marine molluscs as an excellent source of omega-3 long-chain polyunsaturated fatty acids (lcPUFAs), based on their potential for endogenous synthesis of lcPUFAs. In this study we generated a representative list of fatty acyl desaturase (Fad) and elongation of very long-chain fatty acid (Elovl) genes from major orders of Phylum Mollusca, through the interrogation of transcriptome and genome sequences, and various publicly available databases. We have identified novel and uncharacterised Fad and Elovl sequences in the following species: Anadara trapezia, Nerita albicilla, Nerita melanotragus, Crassostrea gigas, Lottia gigantea, Aplysia californica, Loligo pealeii and Chlamys farreri. Based on alignments of translated protein sequences of Fad and Elovl genes, the haeme binding motif and histidine boxes of Fad proteins, and the histidine box and seventeen important amino acids in Elovl proteins, were highly conserved. Phylogenetic analysis of aligned reference sequences was used to reconstruct the evolutionary relationships for Fad and Elovl genes separately. Multiple, well resolved clades for both the Fad and Elovl sequences were observed, suggesting that repeated rounds of gene duplication best explain the distribution of Fad and Elovl proteins across the major orders of molluscs. For Elovl sequences, one clade contained the functionally characterised Elovl5 proteins, while another clade contained proteins hypothesised to have Elovl4 function. Additional well resolved clades consisted only of uncharacterised Elovl sequences. One clade from the Fad phylogeny contained only uncharacterised proteins, while the other clade contained functionally characterised delta-5 desaturase proteins. The discovery of an uncharacterised Fad clade is particularly interesting as these divergent proteins may have novel functions. Overall, this paper presents a number of novel Fad and Elovl genes suggesting that many mollusc groups possess most of the required enzymes for the synthesis of lcPUFAs. PMID:26308548
Laimins, L; Holmgren-König, M; Khoury, G
1986-01-01
The enhancer elements from either simian virus 40 or murine sarcoma virus activate the expression of a transfected rat insulin 1 (rI1) gene when placed within 2.0 kilobases or less of the rI1 gene cap site. Inclusion of 4.0 kilobases of upstream rI1 sequence, however, results in a substantial reduction in the enhancer-dependent insulin gene expression. These observations suggested that a negative transcriptional regulatory element was present between 2.0 and 4.0 kilobases of the rI1 sequence. To test this notion, we employed a heterologous enhancer-dependent transcription assay in which the simian virus 40 72-base-pair repeat is linked to a human beta-globin gene. Addition of the upstream rI1 element to this system decreased the level of enhancer-dependent beta-globin transcription by a factor of 5 to 15. This rI1 "silencer" element functions in a manner relatively independent of position and orientation and requires a cis-dependent relationship to the transcription unit on which it acts. Thus, the silencer sequence seems to have a number of the characteristics of enhancer elements, and we suggest that it may function by the converse of the enhancer mechanism. The rI1 silencer sequence was identified as a member of a long interspersed rat repetitive family. Thus, a potential role for certain repetitive sequences interspersed throughout the eukaryotic genome may be to regulate gene expression by retaining transcriptional activity within defined domains. Images PMID:3010279
Transmembrane insertion of twin-arginine signal peptides is driven by TatC and regulated by TatB
Fröbel, Julia; Rose, Patrick; Lausberg, Frank; Blümmel, Anne-Sophie; Freudl, Roland; Müller, Matthias
2012-01-01
The twin-arginine translocation (Tat) pathway of bacteria and plant chloroplasts mediates the transmembrane transport of folded proteins, which harbour signal sequences with a conserved twin-arginine motif. Many Tat translocases comprise the three membrane proteins TatA, TatB and TatC. TatC was previously shown to be involved in recognizing twin-arginine signal peptides. Here we show that beyond recognition, TatC mediates the transmembrane insertion of a twin-arginine signal sequence, thereby translocating the signal sequence cleavage site across the bilayer. In the absence of TatB, this can lead to the removal of the signal sequence even from a translocation-incompetent substrate. Hence interaction of twin-arginine signal peptides with TatB counteracts their premature cleavage uncoupled from translocation. This capacity of TatB is not shared by the homologous TatA protein. Collectively our results suggest that TatC is an insertase for twin-arginine signal peptides and that translocation-proficient signal sequence recognition requires the concerted action of TatC and TatB. PMID:23250441
Transmembrane insertion of twin-arginine signal peptides is driven by TatC and regulated by TatB.
Fröbel, Julia; Rose, Patrick; Lausberg, Frank; Blümmel, Anne-Sophie; Freudl, Roland; Müller, Matthias
2012-01-01
The twin-arginine translocation (Tat) pathway of bacteria and plant chloroplasts mediates the transmembrane transport of folded proteins, which harbour signal sequences with a conserved twin-arginine motif. Many Tat translocases comprise the three membrane proteins TatA, TatB and TatC. TatC was previously shown to be involved in recognizing twin-arginine signal peptides. Here we show that beyond recognition, TatC mediates the transmembrane insertion of a twin-arginine signal sequence, thereby translocating the signal sequence cleavage site across the bilayer. In the absence of TatB, this can lead to the removal of the signal sequence even from a translocation-incompetent substrate. Hence interaction of twin-arginine signal peptides with TatB counteracts their premature cleavage uncoupled from translocation. This capacity of TatB is not shared by the homologous TatA protein. Collectively our results suggest that TatC is an insertase for twin-arginine signal peptides and that translocation-proficient signal sequence recognition requires the concerted action of TatC and TatB.
Turetschek, Reinhard; Lyon, David; Desalegn, Getinet; Kaul, Hans-Peter; Wienkoop, Stefanie
2016-01-01
The proteomic study of non-model organisms, such as many crop plants, is challenging due to the lack of comprehensive genome information. Changing environmental conditions require the study and selection of adapted cultivars. Mutations, inherent to cultivars, hamper protein identification and thus considerably complicate the qualitative and quantitative comparison in large-scale systems biology approaches. With this workflow, cultivar-specific mutations are detected from high-throughput comparative MS analyses, by extracting sequence polymorphisms with de novo sequencing. Stringent criteria are suggested to filter for confidential mutations. Subsequently, these polymorphisms complement the initially used database, which is ready to use with any preferred database search algorithm. In our example, we thereby identified 26 specific mutations in two cultivars of Pisum sativum and achieved an increased number (17 %) of peptide spectrum matches.
A unique TBX5 microdeletion with microinsertion detected in patient with Holt-Oram syndrome.
Morine, Mikio; Kohmoto, Tomohiro; Masuda, Kiyoshi; Inagaki, Hidehito; Watanabe, Miki; Naruto, Takuya; Kurahashi, Hiroki; Maeda, Kazuhisa; Imoto, Issei
2015-12-01
Holt-Oram syndrome (HOS) is an autosomal dominant condition characterized by upper limb and congenital heart defects and caused by numerous germline mutations of TBX5 producing preterminal stop codons. Here, we report on a novel and unusual heterozygous TBX5 microdeletion with microinsertion (microindel) mutation (c.627delinsGTGACTCAGGAAACGCTTTCCTGA), which is predicted to synthesize a truncated TBX5 protein, detected in a sporadic patient with clinical features of HOS prenatally diagnosed by ultrasonography. This uncommon and relatively large inserted sequence contains sequences derived from nearby but not adjacent templates on both sense and antisense strands, suggesting two possible models, which require no repeat sequences, causing this complex microindel through the bypass of large DNA adducts via an error-prone DNA polymerase-mediated translesion synthesis. © 2015 Wiley Periodicals, Inc.
Pollock, Steve V; Colombo, Sergio L; Prout, Davey L; Godfrey, Ashley C; Moroney, James V
2003-12-01
This report describes a Chlamydomonas reinhardtii mutant that lacks Rubisco activase (Rca). Using the BleR (bleomycin resistance) gene as a positive selectable marker for nuclear transformation, an insertional mutagenesis screen was performed to select for cells that required a high-CO2 atmosphere for optimal growth. The DNA flanking the BleR insert of one of the high-CO2-requiring strains was cloned using thermal asymmetric interlaced-polymerase chain reaction and inverse polymerase chain reaction and sequenced. The flanking sequence matched the C. reinhardtii Rca cDNA sequence previously deposited in the National Center for Biotechnology Information database. The loss of a functional Rca in the strain was confirmed by the absence of Rca mRNA and protein. The open reading frame for Rca was cloned and expressed in pSL18, a C. reinhardtii expression vector conferring paromomycin resistance. This construct partially complemented the mutant phenotype, supporting the hypothesis that the loss of Rca was the reason the mutant grew poorly in a low-CO2 atmosphere. Sequencing of the C. reinhardtii Rca gene revealed that it contains 10 exons ranging in size from 18 to 470 bp. Low-CO2-grown rca1 cultures had a growth rate and maximum rate of photosynthesis 60% of wild-type cells. Results obtained from experiments on a cia5 rca1 double mutant also suggest that the CO2-concentrating mechanism partially compensates for the absence of an active Rca in the green alga C. reinhardtii.
NASA Astrophysics Data System (ADS)
Bridges, Jon P.
Improving the STEM readiness of students from historically underserved groups is a moral and economic imperative requiring greater attention and effort than has been shown to date. The current literature suggests a high school science sequence beginning with physics and centered on developing conceptual understanding, using inquiry labs and modeling to allow students to explore new ideas, and addressing and correcting student misconceptions can increase student interest in and preparation for STEM careers. The purpose of this study was to determine if the science college readiness of historically underserved students can be improved by implementing an inquiry-based high school science sequence comprised of coursework in physics, chemistry, and biology for every student. The study used a retrospective cohort observational design to address the primary research question: are there differences between historically underserved students completing a Physics First science sequence and their peers completing a traditional science sequence in 1) science college-readiness test scores, 2) rates of science college-and career-readiness, and 3) interest in STEM? Small positive effects were found for all three outcomes for historically underserved students in the Physics First sequence.
Miura, Naoki; Kucho, Ken-Ichi; Noguchi, Michiko; Miyoshi, Noriaki; Uchiumi, Toshiki; Kawaguchi, Hiroaki; Tanimoto, Akihide
2014-01-01
The microminipig, which weighs less than 10 kg at an early stage of maturity, has been reported as a potential experimental model animal. Its extremely small size and other distinct characteristics suggest the possibility of a number of differences between the genome of the microminipig and that of conventional pigs. In this study, we analyzed the genomes of two healthy microminipigs using a next-generation sequencer SOLiD™ system. We then compared the obtained genomic sequences with a genomic database for the domestic pig (Sus scrofa). The mapping coverage of sequenced tag from the microminipig to conventional pig genomic sequences was greater than 96% and we detected no clear, substantial genomic variance from these data. The results may indicate that the distinct characteristics of the microminipig derive from small-scale alterations in the genome, such as Single Nucleotide Polymorphisms or translational modifications, rather than large-scale deletion or insertion polymorphisms. Further investigation of the entire genomic sequence of the microminipig with methods enabling deeper coverage is required to elucidate the genetic basis of its distinct phenotypic traits. Copyright © 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Takai, T; Nishita, Y; Iguchi-Ariga, S M; Ariga, H
1994-01-01
We have previously reported the human cDNA encoding MSSP-1, a sequence-specific double- and single-stranded DNA binding protein [Negishi, Nishita, Saëgusa, Kakizaki, Galli, Kihara, Tamai, Miyajima, Iguchi-Ariga and Ariga (1994) Oncogene, 9, 1133-1143]. MSSP-1 binds to a DNA replication origin/transcriptional enhancer of the human c-myc gene and has turned out to be identical with Scr2, a human protein which complements the defect of cdc2 kinase in S.pombe [Kataoka and Nojima (1994) Nucleic Acid Res., 22, 2687-2693]. We have cloned the cDNA for MSSP-2, another member of the MSSP family of proteins. The MSSP-2 cDNA shares highly homologous sequences with MSSP-1 cDNA, except for the insertion of 48 bp coding 16 amino acids near the C-terminus. Like MSSP-1, MSSP-2 has RNP-1 consensus sequences. The results of the experiments using bacterially expressed MSSP-2, and its deletion mutants, as histidine fusion proteins suggested that the binding specificity of MSSP-2 to double- and single-stranded DNA is the same as that of MSSP-1, and that the RNP consensus sequences are required for the DNA binding of the protein. MSSP-2 stimulated the DNA replication of an SV40-derived plasmid containing the binding sequence for MSSP-1 or -2. MSSP-2 is hence suggested to play an important role in regulation of DNA replication. Images PMID:7838710
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
Cabral, Henrique O; Vinck, Martin; Fouquet, Celine; Pennartz, Cyriel M A; Rondi-Reig, Laure; Battaglia, Francesco P
2014-01-22
Place coding in the hippocampus requires flexible combination of sensory inputs (e.g., environmental and self-motion information) with memory of past events. We show that mouse CA1 hippocampal spatial representations may either be anchored to external landmarks (place memory) or reflect memorized sequences of cell assemblies depending on the behavioral strategy spontaneously selected. These computational modalities correspond to different CA1 dynamical states, as expressed by theta and low- and high-frequency gamma oscillations, when switching from place to sequence memory-based processing. These changes are consistent with a shift from entorhinal to CA3 input dominance on CA1. In mice with a deletion of forebrain NMDA receptors, the ability of place cells to maintain a map based on sequence memory is selectively impaired and oscillatory dynamics are correspondingly altered, suggesting that oscillations contribute to selecting behaviorally appropriate computations in the hippocampus and that NMDA receptors are crucial for this function. Copyright © 2014 Elsevier Inc. All rights reserved.
Freimuth, P; Anderson, C W
1993-03-01
The sequence of a 1158-base pair fragment of the human adenovirus serotype 12 (Ad12) genome was determined. This segment encodes the precursors for virion components Mu and VI. Both Ad12 precursors contain two sequences that conform to a consensus sequence motif for cleavage by the endoproteinase of adenovirus 2 (Ad2). Analysis of the amino terminus of VI and of the peptide fragments found in Ad12 virions demonstrated that these sites are cleaved during Ad12 maturation. This observation suggests that the recognition motif for adenovirus endoproteinases is highly conserved among human serotypes. The adenovirus 2 endoproteinase polypeptide requires additional co-factors for activity (C. W. Anderson, Protein Expression Purif., 1993, 4, 8-15). Synthetic Ad12 or Ad2 pVI carboxy-terminal peptides each permitted efficient cleavage of an artificial endoproteinase substrate by recombinant Ad2 endoproteinase polypeptide.
Unusually long-lived pause required for regulation of a Rho-dependent transcription terminator
Hollands, Kerry; Sevostiyanova, Anastasia; Groisman, Eduardo A.
2014-01-01
Up to half of all transcription termination events in bacteria rely on the RNA-dependent helicase Rho. However, the nucleic acid sequences that promote Rho-dependent termination remain poorly characterized. Defining the molecular determinants that confer Rho-dependent termination is especially important for understanding how such terminators can be regulated in response to specific signals. Here, we identify an extraordinarily long-lived pause at the site where Rho terminates transcription in the 5′-leader region of the Mg2+ transporter gene mgtA in Salmonella enterica. We dissect the sequence elements required for prolonged pausing in the mgtA leader and establish that the remarkable longevity of this pause is required for a riboswitch to stimulate Rho-dependent termination in the mgtA leader region in response to Mg2+ availability. Unlike Rho-dependent terminators described previously, where termination occurs at multiple pause sites, there is a single site of transcription termination directed by Rho in the mgtA leader. Our data suggest that Rho-dependent termination events that are subject to regulation may require elements distinct from those operating at constitutive Rho-dependent terminators. PMID:24778260
A space-efficient algorithm for local similarities.
Huang, X Q; Hardison, R C; Miller, W
1990-10-01
Existing dynamic-programming algorithms for identifying similar regions of two sequences require time and space proportional to the product of the sequence lengths. Often this space requirement is more limiting than the time requirement. We describe a dynamic-programming local-similarity algorithm that needs only space proportional to the sum of the sequence lengths. The method can also find repeats within a single long sequence. To illustrate the algorithm's potential, we discuss comparison of a 73,360 nucleotide sequence containing the human beta-like globin gene cluster and a corresponding 44,594 nucleotide sequence for rabbit, a problem well beyond the capabilities of other dynamic-programming software.
Ruchko, Mykhaylo V; Gorodnya, Olena M; Pastukh, Viktor M; Swiger, Brad M; Middleton, Natavia S; Wilson, Glenn L; Gillespie, Mark N
2009-02-01
Reactive oxygen species (ROS) generated in hypoxic pulmonary artery endothelial cells cause transient oxidative base modifications in the hypoxia-response element (HRE) of the VEGF gene that bear a conspicuous relationship to induction of VEGF mRNA expression (K.A. Ziel et al., FASEB J. 19, 387-394, 2005). If such base modifications are indeed linked to transcriptional regulation, then they should be detected in HRE sequences associated with transcriptionally active nucleosomes. Southern blot analysis of the VEGF HRE associated with nucleosome fractions prepared by micrococcal nuclease digestion indicated that hypoxia redistributed some HRE sequences from multinucleosomes to transcriptionally active mono- and dinucleosome fractions. A simple PCR method revealed that VEGF HRE sequences harboring oxidative base modifications were found exclusively in mononucleosomes. Inhibition of hypoxia-induced ROS generation with myxathiozol prevented formation of oxidative base modifications but not the redistribution of HRE sequences into mono- and dinucleosome fractions. The histone deacetylase inhibitor trichostatin A caused retention of HRE sequences in compacted nucleosome fractions and prevented formation of oxidative base modifications. These findings suggest that the hypoxia-induced oxidant stress directed at the VEGF HRE requires the sequence to be repositioned into mononucleosomes and support the prospect that oxidative modifications in this sequence are an important step in transcriptional activation.
Roessler, Christian G.; Hall, Branwen M.; Anderson, William J.; Ingram, Wendy M.; Roberts, Sue A.; Montfort, William R.; Cordes, Matthew H. J.
2008-01-01
Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a “stepping-stone” method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and λ. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and λ. The domains show 40% sequence identity but differ by switching of α-helix to β-sheet in a C-terminal region spanning ≈25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization. PMID:18227506
HATAKEYAMA, YOSHINORI; SHIBUYA, NORIHIRO; NISHIYAMA, TAKASHI; NAKASHIMA, NOBUHIKO
2004-01-01
The intergenic region (IGR) located upstream of the capsid protein gene in dicistroviruses contains an internal ribosome entry site (IRES). Translation initiation mediated by the IRES does not require initiator methionine tRNA. Comparison of the IGRs among dicistroviruses suggested that Taura syndrome virus (TSV) and acute bee paralysis virus have an extra side stem loop in the predicted IRES. We examined whether the side stem is responsible for translation activity mediated by the IGR using constructs with compensatory mutations. In vitro translation analysis showed that TSV has an IGR-IRES that is structurally distinct from those previously described. Because IGR-IRES elements determine the translation initiation site by virtue of their own tertiary structure formation, the discovery of this initiation mechanism suggests the possibility that eukaryotic mRNAs might have more extensive coding regions than previously predicted. To test this hypothesis, we searched full-length cDNA databases and whole genome sequences of eukaryotes using the pattern matching program, Scan For Matches, with parameters that can extract sequences containing secondary structure elements resembling those of IGR-IRES. Our search yielded several sequences, but their predicted secondary structures were suggested to be unstable in comparison to those of dicistroviruses. These results suggest that RNAs structurally similar to dicistroviruses are not common. If some eukaryotic mRNAs are translated independently of an initiator methionine tRNA, their structures are likely to be significantly distinct from those of dicistroviruses. PMID:15100433
Uchida, Akira; Murugesapillai, Divakaran; Kastner, Markus; Wang, Yao; Lodeiro, Maria F; Prabhakar, Shaan; Oliver, Guinevere V; Arnold, Jamie J; Maher, L James; Williams, Mark C; Cameron, Craig E
2017-01-01
Human mtDNA contains three promoters, suggesting a need for differential expression of the mitochondrial genome. Studies of mitochondrial transcription have used a reductionist approach, perhaps masking differential regulation. Here we evaluate transcription from light-strand (LSP) and heavy-strand (HSP1) promoters using templates that mimic their natural context. These studies reveal sequences upstream, hypervariable in the human population (HVR3), and downstream of the HSP1 transcription start site required for maximal yield. The carboxy-terminal tail of TFAM is essential for activation of HSP1 but not LSP. Images of the template obtained by atomic force microscopy show that TFAM creates loops in a discrete region, the formation of which correlates with activation of HSP1; looping is lost in tail-deleted TFAM. Identification of HVR3 as a transcriptional regulatory element may contribute to between-individual variability in mitochondrial gene expression. The unique requirement of HSP1 for the TFAM tail may enable its regulation by post-translational modifications. DOI: http://dx.doi.org/10.7554/eLife.27283.001 PMID:28745586
Majiduddin, Fahd K; Palzkill, Timothy
2003-03-01
Carbapenem antibiotics have been used to counteract resistant strains of bacteria harboring beta-lactamases and extended-spectrum beta-lactamases. Four enzymes from the class A group of beta-lactamases, NMC-A, IMI-1, SME-1, and KPC-1, efficiently hydrolyze carbapenem antibiotics. Sequence comparisons and structural information indicate that cysteines at amino acid residues 69 and 238, which are conserved in all four of these enzymes, form a disulfide bond that is unique to these beta-lactamases. To test whether this disulfide bond is required for catalytic activity, the codons for residues Cys69 and Cys238 were randomized individually and simultaneously by PCR-based mutagenesis to create random replacement libraries for these positions. Mutants that were able to confer resistance to ampicillin, imipenem, or cefotaxime were selected from these libraries. The results indicate that positions Cys69 and Cys238 are critical for hydrolysis of all of the antibiotics tested, suggesting that the disulfide bond is generally required for this enzyme to catalyze the hydrolysis of beta-lactam antibiotics.
Majiduddin, Fahd K.; Palzkill, Timothy
2003-01-01
Carbapenem antibiotics have been used to counteract resistant strains of bacteria harboring β-lactamases and extended-spectrum β-lactamases. Four enzymes from the class A group of β-lactamases, NMC-A, IMI-1, SME-1, and KPC-1, efficiently hydrolyze carbapenem antibiotics. Sequence comparisons and structural information indicate that cysteines at amino acid residues 69 and 238, which are conserved in all four of these enzymes, form a disulfide bond that is unique to these β-lactamases. To test whether this disulfide bond is required for catalytic activity, the codons for residues Cys69 and Cys238 were randomized individually and simultaneously by PCR-based mutagenesis to create random replacement libraries for these positions. Mutants that were able to confer resistance to ampicillin, imipenem, or cefotaxime were selected from these libraries. The results indicate that positions Cys69 and Cys238 are critical for hydrolysis of all of the antibiotics tested, suggesting that the disulfide bond is generally required for this enzyme to catalyze the hydrolysis of β-lactam antibiotics. PMID:12604542
SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.
Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf
2015-08-01
RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of [Formula: see text]. Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity ([Formula: see text] quartic time). Breaking this barrier, we introduce the novel Sankoff-style algorithm 'sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)', which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff's original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. © The Author 2015. Published by Oxford University Press.
Classification of DNA nucleotides with transverse tunneling currents
NASA Astrophysics Data System (ADS)
Nyvold Pedersen, Jonas; Boynton, Paul; Di Ventra, Massimiliano; Jauho, Antti-Pekka; Flyvbjerg, Henrik
2017-01-01
It has been theoretically suggested and experimentally demonstrated that fast and low-cost sequencing of DNA, RNA, and peptide molecules might be achieved by passing such molecules between electrodes embedded in a nanochannel. The experimental realization of this scheme faces major challenges, however. In realistic liquid environments, typical currents in tunneling devices are of the order of picoamps. This corresponds to only six electrons per microsecond, and this number affects the integration time required to do current measurements in real experiments. This limits the speed of sequencing, though current fluctuations due to Brownian motion of the molecule average out during the required integration time. Moreover, data acquisition equipment introduces noise, and electronic filters create correlations in time-series data. We discuss how these effects must be included in the analysis of, e.g., the assignment of specific nucleobases to current signals. As the signals from different molecules overlap, unambiguous classification is impossible with a single measurement. We argue that the assignment of molecules to a signal is a standard pattern classification problem and calculation of the error rates is straightforward. The ideas presented here can be extended to other sequencing approaches of current interest.
Stepankiw, Nicholas; Kaidow, Akihiro; Boye, Erik; Bates, David
2010-01-01
Summary Replication initiation is a key event in the cell cycle of all organisms and oriC, the replication origin in Escherichia coli, serves as the prototypical model for this process. The minimal sequence required for oriC function was originally determined entirely from plasmid studies using cloned origin fragments, which have previously been shown to differ dramatically in sequence requirement from the chromosome. Using an in vivo recombineering strategy to exchange wt oriCs for mutated ones regardless of whether they are functional origins or not, we have determined the minimal origin sequence that will support chromosome replication. Nearly the entire right half of oriC could be deleted without loss of origin function, demanding a reassessment of existing models for initiation. Cells carrying the new DnaA box-depleted 163 bp minimal oriC exhibited little or no loss of fitness under slow-growth conditions, but were sensitive to rich medium, suggesting that the dense packing of initiator binding sites that is a hallmark of prokaryotic origins, has likely evolved to support the increased demands of multi-forked replication. PMID:19737351
Ganji, Rakesh; Murugapiran, Senthil K.; Ong, John C.; ...
2016-10-20
The draft genome of Thermocrinis jamiesonii GBS1 T is 1,315,625 bp in 10 contigs and encodes 1,463 predicted genes. The presence of sox genes and various glycoside hydrolases and the absence of uptake NiFe hydrogenases ( hyaB) are consistent with a requirement for thiosulfate and suggest the ability to use carbohydrate polymers.
Exploring the sequence-structure protein landscape in the glycosyltransferase family
Zhang, Ziding; Kochhar, Sunil; Grigorov, Martin
2003-01-01
To understand the molecular basis of glycosyltransferases’ (GTFs) catalytic mechanism, extensive structural information is required. Here, fold recognition methods were employed to assign 3D protein shapes (folds) to the currently known GTF sequences, available in public databases such as GenBank and Swissprot. First, GTF sequences were retrieved and classified into clusters, based on sequence similarity only. Intracluster sequence similarity was chosen sufficiently high to ensure that the same fold is found within a given cluster. Then, a representative sequence from each cluster was selected to compose a subset of GTF sequences. The members of this reduced set were processed by three different fold recognition methods: 3D-PSSM, FUGUE, and GeneFold. Finally, the results from different fold recognition methods were analyzed and compared to sequence-similarity search methods (i.e., BLAST and PSI-BLAST). It was established that the folds of about 70% of all currently known GTF sequences can be confidently assigned by fold recognition methods, a value which is higher than the fold identification rate based on sequence comparison alone (48% for BLAST and 64% for PSI-BLAST). The identified folds were submitted to 3D clustering, and we found that most of the GTF sequences adopt the typical GTF A or GTF B folds. Our results indicate a lack of evidence that new GTF folds (i.e., folds other than GTF A and B) exist. Based on cases where fold identification was not possible, we suggest several sequences as the most promising targets for a structural genomics initiative focused on the GTF protein family. PMID:14500887
Omics Metadata Management Software (OMMS).
Perez-Arriaga, Martha O; Wilson, Susan; Williams, Kelly P; Schoeniger, Joseph; Waymire, Russel L; Powell, Amy Jo
2015-01-01
Next-generation sequencing projects have underappreciated information management tasks requiring detailed attention to specimen curation, nucleic acid sample preparation and sequence production methods required for downstream data processing, comparison, interpretation, sharing and reuse. The few existing metadata management tools for genome-based studies provide weak curatorial frameworks for experimentalists to store and manage idiosyncratic, project-specific information, typically offering no automation supporting unified naming and numbering conventions for sequencing production environments that routinely deal with hundreds, if not thousands of samples at a time. Moreover, existing tools are not readily interfaced with bioinformatics executables, (e.g., BLAST, Bowtie2, custom pipelines). Our application, the Omics Metadata Management Software (OMMS), answers both needs, empowering experimentalists to generate intuitive, consistent metadata, and perform analyses and information management tasks via an intuitive web-based interface. Several use cases with short-read sequence datasets are provided to validate installation and integrated function, and suggest possible methodological road maps for prospective users. Provided examples highlight possible OMMS workflows for metadata curation, multistep analyses, and results management and downloading. The OMMS can be implemented as a stand alone-package for individual laboratories, or can be configured for webbased deployment supporting geographically-dispersed projects. The OMMS was developed using an open-source software base, is flexible, extensible and easily installed and executed. The OMMS can be obtained at http://omms.sandia.gov. The OMMS can be obtained at http://omms.sandia.gov.
Omics Metadata Management Software (OMMS)
Perez-Arriaga, Martha O; Wilson, Susan; Williams, Kelly P; Schoeniger, Joseph; Waymire, Russel L; Powell, Amy Jo
2015-01-01
Next-generation sequencing projects have underappreciated information management tasks requiring detailed attention to specimen curation, nucleic acid sample preparation and sequence production methods required for downstream data processing, comparison, interpretation, sharing and reuse. The few existing metadata management tools for genome-based studies provide weak curatorial frameworks for experimentalists to store and manage idiosyncratic, project-specific information, typically offering no automation supporting unified naming and numbering conventions for sequencing production environments that routinely deal with hundreds, if not thousands of samples at a time. Moreover, existing tools are not readily interfaced with bioinformatics executables, (e.g., BLAST, Bowtie2, custom pipelines). Our application, the Omics Metadata Management Software (OMMS), answers both needs, empowering experimentalists to generate intuitive, consistent metadata, and perform analyses and information management tasks via an intuitive web-based interface. Several use cases with short-read sequence datasets are provided to validate installation and integrated function, and suggest possible methodological road maps for prospective users. Provided examples highlight possible OMMS workflows for metadata curation, multistep analyses, and results management and downloading. The OMMS can be implemented as a stand alone-package for individual laboratories, or can be configured for webbased deployment supporting geographically-dispersed projects. The OMMS was developed using an open-source software base, is flexible, extensible and easily installed and executed. The OMMS can be obtained at http://omms.sandia.gov. Availability The OMMS can be obtained at http://omms.sandia.gov PMID:26124554
Illingworth, Christopher J R; Parkes, Kevin E; Snell, Christopher R; Mullineaux, Philip M; Reynolds, Christopher A
2008-03-01
Methods to determine periodicity in protein sequences are useful for inferring function. Fourier transformation is one approach but care is required to ensure the periodicity is genuine. Here we have shown that empirically-derived statistical tables can be used as a measure of significance. Genuine protein sequences data rather than randomly generated sequences were used as the statistical backdrop. The method has been applied to G-protein coupled receptor (GPCR) sequences, by Fourier transformation of hydrophobicity values, codon frequencies and the extent of over-representation of codon pairs; the latter being related to translational step times. Genuine periodicity was observed in the hydrophobicity whereas the apparent periodicity (as inferred from previously reported measures) in the translation step times was not validated statistically. GCR2 has recently been proposed as the plant GPCR receptor for the hormone abscisic acid. It has homology to the Lanthionine synthetase C-like family of proteins, an observation confirmed by fold recognition. Application of the Fourier transform algorithm to the GCR2 family revealed strongly predicted seven fold periodicity in hydrophobicity, suggesting why GCR2 has been reported to be a GPCR, despite negative indications in most transmembrane prediction algorithms. The underlying multiple sequence alignment, also required for the Fourier transform analysis of periodicity, indicated that the hydrophobic regions around the 7 GXXG motifs commence near the C-terminal end of each of the 7 inner helices of the alpha-toroid and continue to the N-terminal region of the helix. The results clearly explain why GCR2 has been understandably but erroneously predicted to be a GPCR.
Schoch, Conrad L; Robbertse, Barbara; Robert, Vincent; Vu, Duong; Cardinali, Gianluigi; Irinyi, Laszlo; Meyer, Wieland; Nilsson, R Henrik; Hughes, Karen; Miller, Andrew N; Kirk, Paul M; Abarenkov, Kessy; Aime, M Catherine; Ariyawansa, Hiran A; Bidartondo, Martin; Boekhout, Teun; Buyck, Bart; Cai, Qing; Chen, Jie; Crespo, Ana; Crous, Pedro W; Damm, Ulrike; De Beer, Z Wilhelm; Dentinger, Bryn T M; Divakar, Pradeep K; Dueñas, Margarita; Feau, Nicolas; Fliegerova, Katerina; García, Miguel A; Ge, Zai-Wei; Griffith, Gareth W; Groenewald, Johannes Z; Groenewald, Marizeth; Grube, Martin; Gryzenhout, Marieka; Gueidan, Cécile; Guo, Liangdong; Hambleton, Sarah; Hamelin, Richard; Hansen, Karen; Hofstetter, Valérie; Hong, Seung-Beom; Houbraken, Jos; Hyde, Kevin D; Inderbitzin, Patrik; Johnston, Peter R; Karunarathna, Samantha C; Kõljalg, Urmas; Kovács, Gábor M; Kraichak, Ekaphan; Krizsan, Krisztina; Kurtzman, Cletus P; Larsson, Karl-Henrik; Leavitt, Steven; Letcher, Peter M; Liimatainen, Kare; Liu, Jian-Kui; Lodge, D Jean; Luangsa-ard, Janet Jennifer; Lumbsch, H Thorsten; Maharachchikumbura, Sajeewa S N; Manamgoda, Dimuthu; Martín, María P; Minnis, Andrew M; Moncalvo, Jean-Marc; Mulè, Giuseppina; Nakasone, Karen K; Niskanen, Tuula; Olariaga, Ibai; Papp, Tamás; Petkovits, Tamás; Pino-Bodas, Raquel; Powell, Martha J; Raja, Huzefa A; Redecker, Dirk; Sarmiento-Ramirez, J M; Seifert, Keith A; Shrestha, Bhushan; Stenroos, Soili; Stielow, Benjamin; Suh, Sung-Oui; Tanaka, Kazuaki; Tedersoo, Leho; Telleria, M Teresa; Udayanga, Dhanushka; Untereiner, Wendy A; Diéguez Uribeondo, Javier; Subbarao, Krishna V; Vágvölgyi, Csaba; Visagie, Cobus; Voigt, Kerstin; Walker, Donald M; Weir, Bevan S; Weiß, Michael; Wijayawardene, Nalin N; Wingfield, Michael J; Xu, J P; Yang, Zhu L; Zhang, Ning; Zhuang, Wen-Ying; Federhen, Scott
2014-01-01
DNA phylogenetic comparisons have shown that morphology-based species recognition often underestimates fungal diversity. Therefore, the need for accurate DNA sequence data, tied to both correct taxonomic names and clearly annotated specimen data, has never been greater. Furthermore, the growing number of molecular ecology and microbiome projects using high-throughput sequencing require fast and effective methods for en masse species assignments. In this article, we focus on selecting and re-annotating a set of marker reference sequences that represent each currently accepted order of Fungi. The particular focus is on sequences from the internal transcribed spacer region in the nuclear ribosomal cistron, derived from type specimens and/or ex-type cultures. Re-annotated and verified sequences were deposited in a curated public database at the National Center for Biotechnology Information (NCBI), namely the RefSeq Targeted Loci (RTL) database, and will be visible during routine sequence similarity searches with NR_prefixed accession numbers. A set of standards and protocols is proposed to improve the data quality of new sequences, and we suggest how type and other reference sequences can be used to improve identification of Fungi. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353. Published by Oxford University Press 2013. This work is written by US Government employees and is in the public domain in the US.
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Planets around pulsars - Implications for planetary formation
NASA Technical Reports Server (NTRS)
Bodenheimer, Peter
1993-01-01
Data on planets around pulsars are summarized, and different models intended to explain the formation mechanism are described. Both theoretical and observational evidence suggest that very special circumstances are required for the formation of planetary systems around pulsars, namely, the prior presence of a millisecond pulsar with a close binary companion, probably a low mass main-sequence star. It is concluded that the discovery of two planets around PSR 1257+12 is important for better understanding the problems of dynamics and stellar evolution. The process of planetary formation should be learned through intensive studies of the properties of disks near young objects and application of techniques for detection of planets around main-sequence solar-type stars.
Lashbrook, C C; Gonzalez-Bosch, C; Bennett, A B
1994-01-01
Two structurally divergent endo-beta-1,4-glucanase (EGase) cDNAs were cloned from tomato. Although both cDNAs (Cel1 and Cel2) encode potentially glycosylated, basic proteins of 51 to 53 kD and possess multiple amino acid domains conserved in both plant and microbial EGases, Cel1 and Cel2 exhibit only 50% amino acid identity at the overall sequence level. Amino acid sequence comparisons to other plant EGases indicate that tomato Cel1 is most similar to bean abscission zone EGase (68%), whereas Cel2 exhibits greatest sequence identity to avocado fruit EGase (57%). Sequence comparisons suggest the presence of at least two structurally divergent EGase families in plants. Unlike ripening avocado fruit and bean abscission zones in which a single EGase mRNA predominates, EGase expression in tomato reflects the overlapping accumulation of both Cel1 and Cel2 transcripts in ripening fruit and in plant organs undergoing cell separation. Cel1 mRNA contributes significantly to total EGase mRNA accumulation within plant organs undergoing cell separation (abscission zones and mature anthers), whereas Cel2 mRNA is most abundant in ripening fruit. The overlapping expression of divergent EGase genes within a single species may suggest that multiple activities are required for the cooperative disassembly of cell wall components during fruit ripening, floral abscission, and anther dehiscence. PMID:7994180
Stars caught in the braking stage in young Magellanic Cloud clusters
NASA Astrophysics Data System (ADS)
D'Antona, Francesca; Milone, Antonino P.; Tailo, Marco; Ventura, Paolo; Vesperini, Enrico; di Criscienzo, Marcella
2017-08-01
The colour-magnitude diagrams of many Magellanic Cloud clusters (with ages up to 2 billion years) display extended turnoff regions where the stars leave the main sequence, suggesting the presence of multiple stellar populations with ages that may differ even by hundreds of millions of years 1,2,3 . A strongly debated question is whether such an extended turnoff is instead due to populations with different stellar rotations3,4,5,6 . The recent discovery of a 'split' main sequence in some younger clusters (~80-400 Myr) added another piece to this puzzle. The blue side of the main sequence is consistent with slowly rotating stellar models, and the red side consistent with rapidly rotating models7,8,9,10. However, a complete theoretical characterization of the observed colour-magnitude diagram also seemed to require an age spread9. We show here that, in the three clusters so far analysed, if the blue main-sequence stars are interpreted with models in which the stars have always been slowly rotating, they must be ~30% younger than the rest of the cluster. If they are instead interpreted as stars that were initially rapidly rotating but have later slowed down, the age difference disappears, and this 'braking' also helps to explain the apparent age differences of the extended turnoff. The age spreads in Magellanic Cloud clusters are thus a manifestation of rotational stellar evolution. Observational tests are suggested.
Evolutionary insight into the ionotropic glutamate receptor superfamily of photosynthetic organisms.
De Bortoli, Sara; Teardo, Enrico; Szabò, Ildikò; Morosinotto, Tomas; Alboresi, Alessandro
2016-11-01
Photosynthetic eukaryotes have a complex evolutionary history shaped by multiple endosymbiosis events that required a tight coordination between the organelles and the rest of the cell. Plant ionotropic glutamate receptors (iGLRs) form a large superfamily of proteins with a predicted or proven non-selective cation channel activity regulated by a broad range of amino acids. They are involved in different physiological processes such as C/N sensing, resistance against fungal infection, root and pollen tube growth and response to wounding and pathogens. Most of the present knowledge is limited to iGLRs located in plasma membranes. However, recent studies localized different iGLR isoforms to mitochondria and/or chloroplasts, suggesting the possibility that they play a specific role in bioenergetic processes. In this work, we performed a comparative analysis of GLR sequences from bacteria and various photosynthetic eukaryotes. In particular, novel types of selectivity filters of bacteria are reported adding new examples of the great diversity of the GLR superfamily. The highest variability in GLR sequences was found among the algal sequences (cryptophytes, diatoms, brown and green algae). GLRs of land plants are not closely related to the GLRs of green algae analyzed in this work. The GLR family underwent a great expansion in vascular plants. Among plant GLRs, Clade III includes sequences from Physcomitrella patens, Marchantia polymorpha and gymnosperms and can be considered the most ancient, while other clades likely emerged later. In silico analysis allowed the identification of sequences with a putative target to organelles. Sequences with a predicted localization to mitochondria and chloroplasts are randomly distributed among different type of GLRs, suggesting that no compartment-related specific function has been maintained across the species. Copyright © 2016 Elsevier B.V. All rights reserved.
Sánchez-Navarro, J A; Pallás, V
1997-01-01
The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
A Statistical Framework for the Functional Analysis of Metagenomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharon, Itai; Pati, Amrita; Markowitz, Victor
2008-10-01
Metagenomic studies consider the genetic makeup of microbial communities as a whole, rather than their individual member organisms. The functional and metabolic potential of microbial communities can be analyzed by comparing the relative abundance of gene families in their collective genomic sequences (metagenome) under different conditions. Such comparisons require accurate estimation of gene family frequencies. They present a statistical framework for assessing these frequencies based on the Lander-Waterman theory developed originally for Whole Genome Shotgun (WGS) sequencing projects. They also provide a novel method for assessing the reliability of the estimations which can be used for removing seemingly unreliable measurements.more » They tested their method on a wide range of datasets, including simulated genomes and real WGS data from sequencing projects of whole genomes. Results suggest that their framework corrects inherent biases in accepted methods and provides a good approximation to the true statistics of gene families in WGS projects.« less
40 CFR 86.1230-96 - Test sequence; general requirements.
Code of Federal Regulations, 2010 CFR
2010-07-01
... Petroleum Gas-Fueled and Methanol-Fueled Heavy-Duty Vehicles § 86.1230-96 Test sequence; general requirements. (a)(1) Gasoline- and methanol-fueled vehicles. The test sequence shown in figure M96-1 of this...
40 CFR 86.1230-96 - Test sequence; general requirements.
Code of Federal Regulations, 2011 CFR
2011-07-01
... Petroleum Gas-Fueled and Methanol-Fueled Heavy-Duty Vehicles § 86.1230-96 Test sequence; general requirements. (a)(1) Gasoline- and methanol-fueled vehicles. The test sequence shown in figure M96-1 of this...
Inferring Short-Range Linkage Information from Sequencing Chromatograms
Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas
2013-01-01
Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
Kim, Bo Kyung; Lim, Seoung Ok; Park, Yun Gyu
2008-08-01
The cyclic adenosine monophosphate-response element (CRE)-transcription factor complex participates in the regulation of viral gene expression and pathologic processes caused by various viruses. The hepatitis B virus (HBV) enhancer I directs liver-specific transcription of viral genes and contains a CRE sequence (HBV-CRE); however, whether the HBV-CRE and CRE-binding protein (CREB) are required for the HBV life cycle remains to be determined. This study was designed to investigate the role of CREB in HBV replication and gene expression. Sequence-comparison analysis of 984 HBVs reported worldwide showed that the HBV-CRE sequence is highly conserved, indicating the possibility that it plays an important role in the HBV life cycle. The binding of CREB to the HBV-CRE site was markedly inhibited by oligonucleotides containing HBV-CRE and consensus CRE sequences in vitro and in vivo. The HBV promoter activity was demonstrated to be dependent upon the transactivation activity of CREB. Treatment with CRE decoy oligonucleotides reduced HBV promoter activity, and this was reversed by CREB overexpression. The levels of viral transcripts, DNA, and antigens were remarkably decreased in response to the overexpression of CREB mutants or treatment with the CRE decoy oligonucleotides, whereas enhancing CREB activity increased the levels of viral transcripts. In addition, introduction of a three-base mutation into the HBV-CRE led to a marked reduction in HBV messenger RNA synthesis. Taken together, our results demonstrate that both replication and gene expression of HBV require a functional CREB and HBV-CRE. We have also demonstrated that CRE decoy oligonucleotides and the overexpression of CREB mutants can effectively block the HBV life cycle, suggesting that interventions against CREB activity could provide a new avenue to treat HBV infection.
Izawa, Kazuki; Kuwahara, Hirokazu; Kihara, Kumiko; Yuki, Masahiro; Lo, Nathan; Itoh, Takehiko; Ohkuma, Moriya; Hongoh, Yuichi
2016-10-13
"Candidatus Endomicrobium trichonymphae" (Bacteria; Elusimicrobia) is an obligate intracellular symbiont of the cellulolytic protist genus Trichonympha in the termite gut. A previous genome analysis of "Ca Endomicrobium trichonymphae" phylotype Rs-D17 (genomovar Ri2008), obtained from a Trichonympha agilis cell in the gut of the termite Reticulitermes speratus, revealed that its genome is small (1.1 Mb) and contains many pseudogenes; it is in the course of reductive genome evolution. Here we report the complete genome sequence of another Rs-D17 genomovar, Ti2015, obtained from a different T. agilis cell present in an R. speratus gut. These two genomovars share most intact protein-coding genes and pseudogenes, showing 98.6% chromosome sequence similarity. However, characteristic differences were found in their defense systems, which comprised restriction-modification and CRISPR/Cas systems. The repertoire of intact restriction-modification systems differed between the genomovars, and two of the three CRISPR/Cas loci in genomovar Ri2008 are pseudogenized or missing in genomovar Ti2015. These results suggest relaxed selection pressure for maintaining these defense systems. Nevertheless, the remaining CRISPR/Cas system in each genomovar appears to be active; none of the "spacer" sequences (112 in Ri2008 and 128 in Ti2015) were shared whereas the "repeat" sequences were identical. Furthermore, we obtained draft genomes of three additional endosymbiotic Endomicrobium phylotypes from different host protist species, and discovered multiple, intact CRISPR/Cas systems in each genome. Collectively, unlike bacteriome endosymbionts in insects, the Endomicrobium endosymbionts of termite-gut protists appear to require defense against foreign DNA, although the required level of defense has likely been reduced during their intracellular lives. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ferreira, Diogo C; van der Linden, Marx G; de Oliveira, Leandro C; Onuchic, José N; de Araújo, Antônio F Pereira
2016-04-01
Recent ab initio folding simulations for a limited number of small proteins have corroborated a previous suggestion that atomic burial information obtainable from sequence could be sufficient for tertiary structure determination when combined to sequence-independent geometrical constraints. Here, we use simulations parameterized by native burials to investigate the required amount of information in a diverse set of globular proteins comprising different structural classes and a wide size range. Burial information is provided by a potential term pushing each atom towards one among a small number L of equiprobable concentric layers. An upper bound for the required information is provided by the minimal number of layers L(min) still compatible with correct folding behavior. We obtain L(min) between 3 and 5 for seven small to medium proteins with 50 ≤ Nr ≤ 110 residues while for a larger protein with Nr = 141 we find that L ≥ 6 is required to maintain native stability. We additionally estimate the usable redundancy for a given L ≥ L(min) from the burial entropy associated to the largest folding-compatible fraction of "superfluous" atoms, for which the burial term can be turned off or target layers can be chosen randomly. The estimated redundancy for small proteins with L = 4 is close to 0.8. Our results are consistent with the above-average quality of burial predictions used in previous simulations and indicate that the fraction of approachable proteins could increase significantly with even a mild, plausible, improvement on sequence-dependent burial prediction or on sequence-independent constraints that augment the detectable redundancy during simulations. © 2016 Wiley Periodicals, Inc.
Regulation of expression of transgenes in developing fish.
Moav, B; Liu, Z; Caldovic, L D; Gross, M L; Faras, A J; Hackett, P B
1993-05-01
The transcriptional regulatory elements of the beta-actin gene of carp (Cyprinus carpio) have been examined in zebrafish and goldfish harbouring transgenes. The high sequence conservation of the putative regulatory elements in the beta-actin genes of animals suggested that their function would be conserved, so that transgenic constructs with the same transcriptional control elements would promote similar levels of transgene expression in different species of transgenic animals. To test this assumption, we analysed the temporal expression of a reporter gene under the control of transcriptional control sequences from the carp beta-actin gene in zebrafish (Brachydanio rerio) and goldfish (Carrasius auratus). Our results indicated that, contrary to expectations, combinations of different transcriptional control elements affected the level, duration, and onset of gene expression differently in developing zebrafish and goldfish. The major differences in expression of beta-actin/CAT (chloramphenicol acetyltransferase) constructs in zebrafish and goldfish were: (1) overall expression was almost 100-fold higher in goldfish than in zebrafish embryos, (2) the first intron had an enhancing effect on gene expression in zebrafish but not in goldfish, and (3) the serum-responsive/CArG-containing regulatory element in the proximal promoter was not always required for maximal CAT activity in goldfish, but was required in zebrafish. These results suggest that in the zebrafish, but not in the goldfish, there may be interactions between motifs in the proximal promoter and the first intron which appear to be required for maximal enhancement of transcription.
A Study of the Comparative Effectiveness of Zoology Prerequisites at Slippery Rock State College.
ERIC Educational Resources Information Center
Morrison, William Sechler
This study compared the effectiveness of three sequences of prerequisite courses required before taking zoology. Sequence 1 prerequisite courses consisted of general biology and human biology; Sequence 2 consisted of general biology; and Sequence 3 required cell biology. Zoology students in the spring of 1972 were pretest and a posttest. The mean…
Grawunder, U; Lieber, M R
1997-01-01
The recombination activating gene (RAG) 1 and 2 proteins are required for initiation of V(D)J recombination in vivo and have been shown to be sufficient to introduce DNA double-strand breaks at recombination signal sequences (RSSs) in a cell-free assay in vitro. RSSs consist of a highly conserved palindromic heptamer that is separated from a slightly less conserved A/T-rich nonamer by either a 12 or 23 bp spacer of random sequence. Despite the high sequence specificity of RAG-mediated cleavage at RSSs, direct binding of the RAG proteins to these sequences has been difficult to demonstrate by standard methods. Even when this can be demonstrated, questions about the order of events for an individual RAG-RSS complex will require methods that monitor aspects of the complex during transitions from one step of the reaction to the next. Here we have used template-independent DNA polymerase terminal deoxynucleotidyl transferase (TdT) in order to assess occupancy of the reaction intermediates by the RAG complex during the reaction. In addition, this approach allows analysis of the accessibility of end products of a RAG-catalyzed cleavage reaction for N nucleotide addition. The results indicate that RAG proteins form a long-lived complex with the RSS once the initial nick is generated, because the 3'-OH group at the nick remains obstructed for TdT-catalyzed N nucleotide addition. In contrast, the 3'-OH group generated at the signal end after completion of the cleavage reaction can be efficiently tailed by TdT, suggesting that the RAG proteins disassemble from the signal end after DNA double-strand cleavage has been completed. Therefore, a single RAG complex maintains occupancy from the first step (nick formation) to the second step (cleavage). In addition, the results suggest that N region diversity at V(D)J junctions within rearranged immunoglobulin and T cell receptor gene loci can only be introduced after the generation of RAG-catalyzed DNA double-strand breaks, i.e. during the DNA end joining phase of the V(D)J recombination reaction. PMID:9060432
Hierarchical Traces for Reduced NSM Memory Requirements
NASA Astrophysics Data System (ADS)
Dahl, Torbjørn S.
This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based reinforcement learning algorithm. A hierarchical memory representation reduces the memory requirements by allowing traces to share common sub-sequences. We present moderated mechanisms for estimating discounted future rewards and for dealing with hidden state using hierarchical memory. We also present an experimental analysis of how the sub-sequence length affects the memory compression achieved and show that the reduced memory requirements do not effect the speed of learning. Finally, we analyse and discuss the persistence of the sub-sequences independent of specific trace instances.
Rabah, Samar O; Lee, Chaehee; Hajrah, Nahid H; Makki, Rania M; Alharby, Hesham F; Alhebshi, Alawiah M; Sabir, Jamal S M; Jansen, Robert K; Ruhlman, Tracey A
2017-11-01
In plant evolution, intracellular gene transfer (IGT) is a prevalent, ongoing process. While nuclear and mitochondrial genomes are known to integrate foreign DNA via IGT and horizontal gene transfer (HGT), plastid genomes (plastomes) have resisted foreign DNA incorporation and only recently has IGT been uncovered in the plastomes of a few land plants. In this study, we completed plastome sequences for l0 crop species and describe a number of structural features including variation in gene and intron content, inversions, and expansion and contraction of the inverted repeat (IR). We identified a putative in cinnamon ( J. Presl) and other sequenced Lauraceae and an apparent functional transfer of to the nucleus of quinoa ( Willd.). In the orchard tree cashew ( L.), we report the insertion of an ∼6.7-kb fragment of mitochondrial DNA into the plastome IR. BLASTn analyses returned high identity hits to mitogenome sequences including an intact open reading frame. Using three plastome markers for five species of , we generated a phylogeny to investigate the distribution and timing of the insertion. Four species share the insertion, suggesting that this event occurred <20 million yr ago in a single clade in the genus. Our study extends the observation of mitochondrial to plastome IGT to include long-lived tree species. While previous studies have suggested possible mechanisms facilitating IGT to the plastome, more examples of this phenomenon, along with more complete mitogenome sequences, will be required before a common, or variable, mechanism can be elucidated. Copyright © 2017 Crop Science Society of America.
Zhang, Xiujun; Parry, Ronald J.
2007-01-01
The pyrrolomycins are a family of polyketide antibiotics, some of which contain a nitro group. To gain insight into the nitration mechanism associated with the formation of these antibiotics, the pyrrolomycin biosynthetic gene cluster from Actinosporangium vitaminophilum was cloned. Sequencing of ca. 56 kb of A. vitaminophilum DNA revealed 35 open reading frames (ORFs). Sequence analysis revealed a clear relationship between some of these ORFs and the biosynthetic gene cluster for pyoluteorin, a structurally related antibiotic. Since a gene transfer system could not be devised for A. vitaminophilum, additional proof for the identity of the cloned gene cluster was sought by cloning the pyrrolomycin gene cluster from Streptomyces sp. strain UC 11065, a transformable pyrrolomycin producer. Sequencing of ca. 26 kb of UC 11065 DNA revealed the presence of 17 ORFs, 15 of which exhibit strong similarity to ORFs in the A. vitaminophilum cluster as well as a nearly identical organization. Single-crossover disruption of two genes in the UC 11065 cluster abolished pyrrolomycin production in both cases. These results confirm that the genetic locus cloned from UC 11065 is essential for pyrrolomycin production, and they also confirm that the highly similar locus in A. vitaminophilum encodes pyrrolomycin biosynthetic genes. Sequence analysis revealed that both clusters contain genes encoding the two components of an assimilatory nitrate reductase. This finding suggests that nitrite is required for the formation of the nitrated pyrrolomycins. However, sequence analysis did not provide additional insights into the nitration process, suggesting the operation of a novel nitration mechanism. PMID:17158935
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes
Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok
2018-01-01
Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas. PMID:29872447
Knudsen, Gitte M; Nielsen, Jesper Boye; Marvig, Rasmus L; Ng, Yin; Worning, Peder; Westh, Henrik; Gram, Lone
2017-08-01
Whole genome sequencing is increasing used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence and genomic diversity along the food production chain including in food processing plants. We sequenced the genomes of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analysed the sequences together with 10 public available reference genomes to advance our understanding of interplant and intraplant genomic diversity of L. monocytogenes. Except for three persisting sequence types (ST) based on Multi Locus Sequence Typing being ST7, ST8 and ST121, long-term persistence of clonal groups was limited, and new clones were introduced continuously, potentially from raw materials. No particular gene could be linked to the persistence phenotype. Using time-based phylogenetic analyses of the persistent STs, we estimate the L. monocytogenes evolutionary rate to be 0.18-0.35 single nucleotide polymorphisms/year, suggesting that the persistent STs emerged approximately 100 years ago, which correlates with the onset of industrialization and globalization of the food market. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Genetic exchange between endogenous and exogenous LINE-1 repetitive elements in mouse cells.
Belmaaza, A; Wallenburg, J C; Brouillette, S; Gusew, N; Chartrand, P
1990-01-01
The repetitive LINE (L1) elements of the mouse, which are present at about 10(5) copies per genome and share over 80% of sequence homology, were examined for their ability to undergo genetic exchange with exogenous L1 sequences. The exogenous L1 sequences, carried by a shuttle vector, consisted of an internal fragment from L1Md-A2, a previously described member of the L1 family of the mouse. Using an assay that does not require the reconstitution of a selectable marker we found that this vector, in either circular or linear form, acquired DNA sequences from endogenous L1 elements at a frequency of 10(-3) to 10(-4) per rescued vector. Physical analysis of the acquired L1 sequences revealed that distinct endogenous L1 elements acted as donors and that different subfamilies participated. These results demonstrate that L1 elements are readily capable of genetic exchange. Apart from gene conversion events, the acquisition of L1 sequences outside the region of homology suggested that a second mechanism was also involved in the genetic exchange. A model which accounts for this mechanism is presented and its potential implication on the rearrangement of L1 elements is discussed. Images PMID:1978749
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes.
Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok
2018-01-01
Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas.
Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai
2017-11-23
The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
Leblanc, B; Read, C; Moss, T
1993-02-01
The interaction of the ribosomal transcription factor xUBF with the RNA polymerase I core promoter of Xenopus laevis has been studied both at the DNA and protein levels. It is shown that a single xUBF-DNA complex forms over the 40S initiation site (+1) and involves at least the DNA sequences between -20 and +60 bp. DNA sequences upstream of +10 and downstream of +18 are each sufficient to direct complex formation independently. HMG box 1 of xUBF independently recognizes the sequences -20 to -1 and +1 to +22 and the addition of the N-terminal dimerization domain to HMG box 1 stabilizes its interaction with these sequences approximately 10-fold. HMG boxes 2/3 interact with the DNA downstream of +22 and can independently position xUBF across the initiation site. The C-terminal segment of xUBF, HMG boxes 4, 5 or the acidic domain, directly or indirectly interact with HMG box 1, making the core promoter sequences between -11 and -15 hypersensitive to DNase. This interaction also requires the DNA sequences between +17 and +32, i.e. the HMG box 2/3 binding site. The data suggest extensive folding of the core promoter within the xUBF complex.
Angiuoli, Samuel V; White, James R; Matalka, Malcolm; White, Owen; Fricke, W Florian
2011-01-01
The widespread popularity of genomic applications is threatened by the "bioinformatics bottleneck" resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers.
Angiuoli, Samuel V.; White, James R.; Matalka, Malcolm; White, Owen; Fricke, W. Florian
2011-01-01
Background The widespread popularity of genomic applications is threatened by the “bioinformatics bottleneck” resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. Results We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Conclusions Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers. PMID:22028928
Yasuno, Rie; Wada, Hajime
1998-01-01
Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Petti, C. A.; Polage, C. R.; Schreckenberger, P.
2005-01-01
Traditional methods for microbial identification require the recognition of differences in morphology, growth, enzymatic activity, and metabolism to define genera and species. Full and partial 16S rRNA gene sequencing methods have emerged as useful tools for identifying phenotypically aberrant microorganisms. We report on three bacterial blood isolates from three different College of American Pathologists-certified laboratories that were referred to ARUP Laboratories for definitive identification. Because phenotypic identification suggested unusual organisms not typically associated with the submitted clinical diagnosis, consultation with the Medical Director was sought and further testing was performed including partial 16S rRNA gene sequencing. All three patients had endocarditis, and conventional methods identified isolates from patients A, B, and C as a Facklamia sp., Eubacterium tenue, and a Bifidobacterium sp. 16S rRNA gene sequencing identified the isolates as Enterococcus faecalis, Cardiobacterium valvarum, and Streptococcus mutans, respectively. We conclude that the initial identifications of these three isolates were erroneous, may have misled clinicians, and potentially impacted patient care. 16S rRNA gene sequencing is a more objective identification tool, unaffected by phenotypic variation or technologist bias, and has the potential to reduce laboratory errors. PMID:16333109
Lee, Seung-Hun; Kim, Kyoo-Tae; Kwon, Oh-Deog; Ock, Younsung; Kim, Taeil; Choi, Donghag; Kwak, Dongmi
2016-01-01
We describe for the first time the detection of Coxiella-like bacteria (CLB), Theileria luwenshuni, and T. ovis endosymbionts in blood-sucking deer keds. Eight deer keds attached to a Korean water deer were identified as Lipoptena fortisetosa (Diptera: Hippoboscidae) by morphological and genetic analyses. Among the endosymbionts assessed, CLB, Theileria luwenshuni, and T. ovis were identified in L. fortisetosa by PCR and nucleotide sequencing. Based on phylogeny, CLB 16S rRNA sequences were classified into clade B, sharing 99.4% identity with CLB from Haemaphysalis longicornis in South Korea. Although the virulence of CLB to vertebrates is still controversial, several studies have reported clinical symptoms in birds due to CLB infections. The 18S rRNA sequences of T. luwenshuni and T. ovis in this study were 98.8-100% identical to those in GenBank, and all of the obtained sequences of T. ovis and T. luwenshuni in this study were 100% identical to each other, respectively. Although further studies are required to positively confirm L. fortisetosa as a biological vector of these pathogens, strong genetic relationships among sequences from this and previous studies suggest potential transmission among mammalian hosts by ticks and keds.
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes
Huang, Yongjie; Mrázek, Jan
2014-01-01
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Gámez-Díaz, Laura; Sigmund, Elena C; Reiser, Veronika; Vach, Werner; Jung, Sophie; Grimbacher, Bodo
2018-01-01
The diagnosis of lipopolysaccharide-responsive beige-like-anchor-protein (LRBA) deficiency currently relies on gene sequencing approaches that do not support a timely diagnosis and clinical management. We developed a rapid and sensitive test for clinical implementation based on the detection of LRBA protein by flow cytometry in peripheral blood cells after stimulation. LRBA protein was assessed in a prospective cohort of 54 healthy donors and 57 patients suspected of LRBA deficiency. Receiver operating characteristics analysis suggested an LRBA:MFI ratio cutoff point of 2.6 to identify LRBA-deficient patients by FACS with 94% sensitivity and 80% specificity and to discriminate them from patients with a similar clinical picture but other disease-causing mutations. This easy flow cytometry-based assay allows a fast screening of patients with suspicion of LRBA deficiency reducing therefore the number of patients requiring LRBA sequencing and accelerating the treatment implementation. Detection of biallelic mutations in LRBA is however required for a definitive diagnosis.
Prigoda, Nadia L; Nassuth, Annette; Mable, Barbara K
2005-07-01
The highly divergent alleles of the SRK gene in outcrossing Arabidopsis lyrata have provided important insights into the evolutionary history of self-incompatibility (SI) alleles and serve as an ideal model for studies of the evolutionary and molecular interactions between alleles in cell-cell recognition systems in general. One tantalizing question is how new specificities arise in systems that require coordination between male and female components. Allelic recruitment via gene conversion has been proposed as one possibility, based on the division of DNA sequences at the SRK locus into two distinctive groups: (1) sequences whose relationships are not well resolved and display the long branch lengths expected for a gene under balancing selection (Class A); and (2) sequences falling into a well-supported group with shorter branch lengths (Class B) that are closely related to an unlinked paralogous locus. The purpose of this study was to determine if differences in phenotype (site of expression assayed using allele-specific reverse transcription-polymerase chain reaction) or function (dominance relationships assayed through controlled pollinations) accompany the sequence-based classification. Expression of Class A alleles was restricted to floral tissues, as predicted for genes involved in the SI response. In contrast, Class B alleles, despite being tightly linked to the SI phenotype, were unexpectedly expressed in both leaves and floral tissues; the same pattern found for a related unlinked paralogous sequence. Whereas Class A included haplotypes in three different dominance classes, all Class B haplotypes were found to be recessive to all except one Class A haplotype. In addition, mapping of expression and dominance patterns onto an S-domain-based genealogy suggested that allelic dominance may be determined more by evolutionary history than by frequency-dependent selection for lowered dominance as some theories suggest. The possibility that interlocus gene conversion might have contributed to allelic diversity is discussed.
Suwannasai, Nuttika; Martín, María P; Phosri, Cherdchai; Sihanonth, Prakitsin; Whalley, Anthony J S; Spouge, John L
2013-01-01
Thailand, a part of the Indo-Burma biodiversity hotspot, has many endemic animals and plants. Some of its fungal species are difficult to recognize and separate, complicating assessments of biodiversity. We assessed species diversity within the fungal genera Annulohypoxylon and Hypoxylon, which produce biologically active and potentially therapeutic compounds, by applying classical taxonomic methods to 552 teleomorphs collected from across Thailand. Using probability of correct identification (PCI), we also assessed the efficacy of automated species identification with a fungal barcode marker, ITS, in the model system of Annulohypoxylon and Hypoxylon. The 552 teleomorphs yielded 137 ITS sequences; in addition, we examined 128 GenBank ITS sequences, to assess biases in evaluating a DNA barcode with GenBank data. The use of multiple sequence alignment in a barcode database like BOLD raises some concerns about non-protein barcode markers like ITS, so we also compared species identification using different alignment methods. Our results suggest the following. (1) Multiple sequence alignment of ITS sequences is competitive with pairwise alignment when identifying species, so BOLD should be able to preserve its present bioinformatics workflow for species identification for ITS, and possibly therefore with at least some other non-protein barcode markers. (2) Automated species identification is insensitive to a specific choice of evolutionary distance, contributing to resolution of a current debate in DNA barcoding. (3) Statistical methods are available to address, at least partially, the possibility of expert misidentification of species. Phylogenetic trees discovered a cryptic species and strongly supported monophyletic clades for many Annulohypoxylon and Hypoxylon species, suggesting that ITS can contribute usefully to a barcode for these fungi. The PCIs here, derived solely from ITS, suggest that a fungal barcode will require secondary markers in Annulohypoxylon and Hypoxylon, however. The URL http://tinyurl.com/spouge-barcode contains computer programs and other supplementary material relevant to this article.
Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J
2018-01-16
DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq -specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5' -CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5' -ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequences of the structural architecture of the enzyme. New sequence specificity for Taq and Klenow polymerases were uncovered and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promotors. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.
NASA Technical Reports Server (NTRS)
Wallace, G. R.; Weathers, G. D.; Graf, E. R.
1973-01-01
The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.
Bartonella dromedarii sp. nov. isolated from domesticated camels (Camelus dromedarius) in Israel.
Rasis, Michal; Rudoler, Nir; Schwartz, David; Giladi, Michael
2014-11-01
Bartonella spp. are fastidious, Gram-negative bacilli that cause a wide spectrum of diseases in humans. Most Bartonella spp. have adapted to a specific host, generally a domestic or wild mammal. Dromedary camels (Camelus dromedarius) have become a focus of growing public-health interest because they have been identified as a reservoir host for the Middle East respiratory syndrome coronavirus. Nevertheless, data on camel zoonoses are limited. We aimed to study the occurrence of Bartonella bacteremia among dromedaries in Israel. Nine of 51 (17.6%) camels were found to be bacteremic with Bartonella spp.; bacteremia levels ranged from five to >1000 colony-forming units/mL. Phylogenetic reconstruction based on the concatenated sequences of gltA and rpoB genes demonstrated that the dromedary Bartonella isolates are closely related to other ruminant-derived Bartonella spp., with B. bovis being the nearest relative. Using electron microscopy, the novel isolates were shown to be flagellated, whereas B. bovis is nonflagellated. Sequence comparisons analysis of the housekeeping genes ftsZ, ribC, and groEL showed the highest homology to B. chomelii, B. capreoli, and B. birtlesii, respectively. Sequence analysis of the gltA and rpoB revealed ∼96% identity to B. bovis, a previously suggested cutoff value for sequence-based differentiation of Bartonella spp., suggesting that this approach does not have sufficient discriminatory power for differentiating ruminant-related Bartonella spp. A comprehensive multilocus sequence typing (MLST) analysis based on nine genetic loci (gltA, rpoB, ftsZ, internal transcribed spacer (ITS), 16S rRNA, ribC, groEL, nuoG, and SsrA) identified seven sequence types of the new dromedary isolates. This is the first description of a Bartonella sp. from camelids. On the basis of a distinct reservoir and ecological niche, sequence analyses, and expression of flagella, we designate these isolates as a novel Bartonella sp. named Bartonella dromedarii sp. nov. Further studies are required to explore its zoonotic potential.
Sala, Claudia; Forti, Francesca; Magnoni, Francesca; Ghisotti, Daniela
2008-01-01
Background In Mycobacterium tuberculosis and in Mycobacterium smegmatis the furA-katG loci, encoding the FurA regulatory protein and the KatG catalase-peroxidase, are highly conserved. In M. tuberculosis furA-katG constitute a single operon, whereas in M. smegmatis a single mRNA covering both genes could not be found. In both species, specific 5' ends have been identified: the first one, located upstream of the furA gene, corresponds to transcription initiation from the furA promoter; the second one is the katG mRNA 5' end, located in the terminal part of furA. Results In this work we demonstrate by in vitro transcription and by RNA polymerase Chromatin immunoprecipitation that no promoter is present in the M. smegmatis region covering the latter 5' end, suggesting that it is produced by specific processing of longer transcripts. Several DNA fragments of M. tuberculosis and M. smegmatis were inserted in a plasmid between the sigA promoter and the lacZ reporter gene, and expression of the reporter gene was measured. A polypurine sequence, located four bp upstream of the katG translation start codon, increased beta-galactosidase activity and stabilized the lacZ transcript. Mutagenesis of this sequence led to destabilization of the mRNA. Analysis of constructs, in which the polypurine sequence of M. smegmatis was followed by an increasing number of katG codons, demonstrated that mRNA stability requires translation of at least 20 amino acids. In order to define the requirements for the 5' processing of the katG transcript, we created several mutations in this region and analyzed the 5' ends of the transcripts: the distance from the polypurine sequence does not seem to influence the processing, neither the sequence around the cutting point. Only mutations which create a double stranded region around the processing site prevented RNA processing. Conclusion This is the first reported case in mycobacteria, in which both a polypurine sequence and translation initiation are shown to contribute to mRNA stability. The furA-katG mRNA is transcribed from the furA promoter and immediately processed; this processing is prevented by a double stranded RNA at the cutting site, suggesting that the endoribonuclease responsible for the cleavage cuts single stranded RNA. PMID:18394163
Seth, Meetu; Shirayama, Masaki; Gu, Weifeng; Ishidate, Takao; Conte, Darryl; Mello, Craig C
2013-12-23
Organisms can develop adaptive sequence-specific immunity by reexpressing pathogen-specific small RNAs that guide gene silencing. For example, the C. elegans PIWI-Argonaute/piwi-interacting RNA (piRNA) pathway recruits RNA-dependent RNA polymerase (RdRP) to foreign sequences to amplify a transgenerational small-RNA-induced epigenetic silencing signal (termed RNAe). Here, we provide evidence that, in addition to an adaptive memory of silenced sequences, C. elegans can also develop an opposing adaptive memory of expressed/self-mRNAs. We refer to this mechanism, which can prevent or reverse RNAe, as RNA-induced epigenetic gene activation (RNAa). We show that CSR-1, which engages RdRP-amplified small RNAs complementary to germline-expressed mRNAs, is required for RNAa. We show that a transgene with RNAa activity also exhibits accumulation of cognate CSR-1 small RNAs. Our findings suggest that C. elegans adaptively acquires and maintains a transgenerational CSR-1 memory that recognizes and protects self-mRNAs, allowing piRNAs to recognize foreign sequences innately, without the need for prior exposure
Mahony, Amelia; Fraser, Sarah; Groman, David B; Jones, Simon R M
2015-06-29
A PCR for the specific detection of the salmon brain parasite Myxobolus arcticus (Pugachev and Khokhlov, 1979) was developed using primers designed to amplify a 1363 base pair fragment of the small subunit rDNA. The assay did not amplify DNA from 5 other Myxobolus species or from 7 other myxozoan species belonging to 5 other genera. For juvenile sockeye salmon Oncorhynchus nerka (Walbaum) collected from Chilko Lake, British Columbia (BC), Canada, in 2011, the prevalence by PCR was 96%, in contrast to 71% by histological examination of brain tissue. In 2010, the histological prevalence was 52.5%. Sequence identity between M. arcticus from Chilko Lake and other sites in BC ranged from 99.7 to 99.8% and was 99.6% for a Japanese sequence. In contrast, an M. arcticus sequence from Norway shared 95.3% identity with the Chilko Lake sequence, suggesting misidentification of the parasite. Chilko Lake sockeye salmon were previously reported free of infection with M. arcticus, and more research is required to understand the processes involved in the local and global dispersion of this parasite.
Derakhshandeh, A; Zahraei Salehi, T; Tadjbakhsh, H; Karimi, V
2009-09-01
To identify, clone and sequence the iss (increased serum survival) gene from E. coli strain chi1378 isolated from Iranian poultry and to predict its protein product, Iss. The iss gene from E. coli strain chi1378 was amplified and cloned into the pTZ57R/T vector and sequenced. From the DNA sequence, the Iss predictive protein was evaluated using bioinformatics. Iss from strain chi1378 had 100% identity with other E. coli serotypes and isolates from different origins and also 98% identity with E. coli O157:H7 Iss protein. Phylogenetic analysis showed no significant different phylogenic groups among E. coli strains. The strong association of predicted Iss protein among different E. coli strains suggests that it could be a good antigen to control and detect avian pathogenic E. coli (APEC). Because the exact pathogenesis and the role of virulence factors are unknown, the Iss protein could be used as a target for vaccination in the future, but further research is required.
Seth, Meetu; Shirayama, Masaki; Gu, Weifeng; Ishidate, Takao; Conte, Darryl; Mello, Craig C.
2014-01-01
SUMMARY Organisms can develop adaptive sequence-specific immunity by re-expressing pathogen-specific small RNAs that guide gene silencing. For example, the C. elegans PIWI-Argonaute/piRNA pathway recruits RNA-dependent RNA polymerase RdRP to foreign sequences to amplify a trans-generational small RNA-induced epigenetic silencing signal (termed RNAe). Here we provide evidence that in addition to an adaptive memory of silenced sequences, C. elegans can also develop an opposing adaptive memory of expressed/self mRNAs. We refer to this mechanism, which can prevent or reverse RNAe as RNA-induced epigenetic gene activation (RNAa). We show that CSR-1, which engages RdRP-amplified small RNAs complementary to germline-expressed mRNAs, is required for RNAa. We show that a transgene with RNAa activity also exhibits accumulation of cognate CSR-1 small RNAs. Our findings suggest that C. elegans adaptively acquires and maintains a trans-generational CSR-1 memory that recognizes and protects self mRNAs, allowing piRNAs to recognize foreign sequences innately, without need for prior exposure. PMID:24360782
Hauler, Aron; Jonietz, Christian; Stoll, Birgit; Stoll, Katrin; Braun, Hans-Peter; Binder, Stefan
2013-05-01
The 5' ends of many mitochondrial transcripts are generated post-transcriptionally. Recently, we identified three RNA PROCESSING FACTORs required for 5' end maturation of different mitochondrial mRNAs in Arabidopsis thaliana. All of these factors are pentatricopeptide repeat proteins (PPRPs), highly similar to RESTORERs OF FERTILTY (RF), that rescue male fertility in cytoplasmic male-sterile lines from different species. Therefore, we suggested a general role of these RF-like PPRPs in mitochondrial 5' processing. We now identified RNA PROCESSING FACTOR 5, a PPRP not classified as an RF-like protein, required for the efficient 5' maturation of the nad6 and atp9 mRNAs as well as 26S rRNA. The precursor molecules of these RNAs share conserved sequence elements, approximately ranging from positions -50 to +9 relative to mature 5' mRNA termini, suggesting these sequences to be at least part of the cis elements required for processing. The knockout of RPF5 has only a moderate influence on 5' processing of atp9 mRNA, whereas the generation of the mature nad6 mRNA and 26S rRNA is almost completely abolished in the mutant. The latter leads to a 50% decrease of total 26S rRNA species, resulting in an imbalance between the large rRNA and 18S rRNA. Despite these severe changes in RNA levels and in the proportion between the 26S and 18S rRNAs, mitochondrial protein levels appear to be unaltered in the mutant, whereas seed germination capacity is markedly reduced. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Whole-Genome Sequencing of a Healthy Aging Cohort.
Erikson, Galina A; Bodian, Dale L; Rueda, Manuel; Molparia, Bhuvan; Scott, Erick R; Scott-Van Zeeland, Ashley A; Topol, Sarah E; Wineinger, Nathan E; Niederhuber, John E; Topol, Eric J; Torkamani, Ali
2016-05-05
Studies of long-lived individuals have revealed few genetic mechanisms for protection against age-associated disease. Therefore, we pursued genome sequencing of a related phenotype-healthy aging-to understand the genetics of disease-free aging without medical intervention. In contrast with studies of exceptional longevity, usually focused on centenarians, healthy aging is not associated with known longevity variants, but is associated with reduced genetic susceptibility to Alzheimer and coronary artery disease. Additionally, healthy aging is not associated with a decreased rate of rare pathogenic variants, potentially indicating the presence of disease-resistance factors. In keeping with this possibility, we identify suggestive common and rare variant genetic associations implying that protection against cognitive decline is a genetic component of healthy aging. These findings, based on a relatively small cohort, require independent replication. Overall, our results suggest healthy aging is an overlapping but distinct phenotype from exceptional longevity that may be enriched with disease-protective genetic factors. VIDEO ABSTRACT. Copyright © 2016 Elsevier Inc. All rights reserved.
Hand, Brian K.; Hether, Tyler D; Kovach, Ryan P.; Muhlfeld, Clint C.; Amish, Stephen J.; Boyer, Matthew C.; O’Rourke, Sean M.; Miller, Michael R.; Lowe, Winsor H.; Hohenlohe, Paul A.; Luikart, Gordon
2015-01-01
Invasive hybridization and introgression pose a serious threat to the persistence of many native species. Understanding the effects of hybridization on native populations (e.g., fitness consequences) requires numerous species-diagnostic loci distributed genome-wide. Here we used RAD sequencing to discover thousands of single-nucleotide polymorphisms (SNPs) that are diagnostic between rainbow trout (RBT, Oncorhynchus mykiss), the world’s most widely introduced fish, and native westslope cutthroat trout (WCT, O. clarkii lewisi) in the northern Rocky Mountains, USA. We advanced previous work that identified 4,914 species-diagnostic loci by using longer sequence reads (100 bp vs. 60 bp) and a larger set of individuals (n = 84). We sequenced RAD libraries for individuals from diverse sampling sources, including native populations of WCT and hatchery broodstocks of WCT and RBT. We also took advantage of a newly released reference genome assembly for RBT to align our RAD loci. In total, we discovered 16,788 putatively diagnostic SNPs, 10,267 of which we mapped to anchored chromosome locations on the RBT genome. A small portion of previously discovered putative diagnostic loci (325 of 4,914) were no longer diagnostic (i.e., fixed between species) based on our wider survey of non-hybridized RBT and WCT individuals. Our study suggests that RAD loci mapped to a draft genome assembly could provide the marker density required to identify genes and chromosomal regions influencing selection in admixed populations of conservation concern and evolutionary interest.
Denitrifying SUP05 Require Exogenous Nitrogen
NASA Astrophysics Data System (ADS)
Shah, V.; Chang, B. X.; Morris, R. M.
2016-02-01
Members of the SUP05 clade of gamma-proteobacteria are among the most abundant chemoautotrophs in the ocean. Environmental sequencing studies suggest that they have critical roles in mediating carbon fixation, denitrification, and sulfur oxidation in oxygen minimum zones (OMZs). They have evaded cultivation and little is known about the specific growth requirements or substrate ranges that determine their abundance, distribution and impact on marine biogeochemical cycles. We evaluated the genetic potential of an isolate from the SUP05 clade "Ca. Thioglobus autotrophica strain EF1" to fix carbon, reduce nitrogen and oxidize sulfur under anaerobic growth conditions. Growth experiments support genomic predictions, indicating that strain EF1 is a facultatively anaerobic sulfur-oxidizing chemolithoautotroph that reduces nitrate to nitrite and nitric oxide to nitrous oxide. These experiments also revealed that strain EF1 is limited for growth by ammonium, which indicates that it requires an exogenous source of nitrogen for biosynthesis. Evidence that SUP05 cells produce nitrite and nitrous oxide and require exogenous nitrogen suggests that they have important roles in nitrogen cycling and that their growth is ultimately limited by the degradation of sinking organic matter.
Garcia-Martin, Juan Antonio; Bayegan, Amir H; Dotu, Ivan; Clote, Peter
2016-10-19
RNA inverse folding is the problem of finding one or more sequences that fold into a user-specified target structure s 0 , i.e. whose minimum free energy secondary structure is identical to the target s 0 . Here we consider the ensemble of all RNA sequences that have low free energy with respect to a given target s 0 . We introduce the program RNAdualPF, which computes the dual partition function Z ∗ , defined as the sum of Boltzmann factors exp(-E(a,s 0 )/RT) of all RNA nucleotide sequences a compatible with target structure s 0 . Using RNAdualPF, we efficiently sample RNA sequences that approximately fold into s 0 , where additionally the user can specify IUPAC sequence constraints at certain positions, and whether to include dangles (energy terms for stacked, single-stranded nucleotides). Moreover, since we also compute the dual partition function Z ∗ (k) over all sequences having GC-content k, the user can require that all sampled sequences have a precise, specified GC-content. Using Z ∗ , we compute the dual expected energy 〈E ∗ 〉, and use it to show that natural RNAs from the Rfam 12.0 database have higher minimum free energy than expected, thus suggesting that functional RNAs are under evolutionary pressure to be only marginally thermodynamically stable. We show that C. elegans precursor microRNA (pre-miRNA) is significantly non-robust with respect to mutations, by comparing the robustness of each wild type pre-miRNA sequence with 2000 [resp. 500] sequences of the same GC-content generated by RNAdualPF, which approximately [resp. exactly] fold into the wild type target structure. We confirm and strengthen earlier findings that precursor microRNAs and bacterial small noncoding RNAs display plasticity, a measure of structural diversity. We describe RNAdualPF, which rapidly computes the dual partition function Z ∗ and samples sequences having low energy with respect to a target structure, allowing sequence constraints and specified GC-content. Using different inverse folding software, another group had earlier shown that pre-miRNA is mutationally robust, even controlling for compositional bias. Our opposite conclusion suggests a cautionary note that computationally based insights into molecular evolution may heavily depend on the software used. C/C++-software for RNAdualPF is available at http://bioinformatics.bc.edu/clotelab/RNAdualPF .
Guide-substrate base-pairing requirement for box H/ACA RNA-guided RNA pseudouridylation.
De Zoysa, Meemanage D; Wu, Guowei; Katz, Raviv; Yu, Yi-Tao
2018-06-05
Box H/ACA RNAs are a group of small RNAs found in abundance in eukaryotes (as well as in archaea). Although their sequences differ, eukaryotic box H/ACA RNAs all share the same unique hairpin-hinge-hairpin-tail structure. Almost all of them function as guides that primarily direct pseudouridylation of rRNAs and spliceosomal snRNAs at specific sites. Although box H/ACA RNA-guided pseudouridylation has been extensively studied, the detailed rules governing this reaction, especially those concerning the guide RNA-substrate RNA base-pairing interactions that determine the specificity and efficiency of pseudouridylation, are still not exactly clear. This is particularly relevant given that the lengths of the guide sequences involved in base-pairing vary from one box H/ACA RNA to another. Here, we carry out a detailed investigation into guide-substrate base-pairing interactions, and identify the minimum number of base-pairs (8), required for RNA-guided pseudouridylation. In addition, we find that the pseudouridylation pocket, present in each hairpin of box H/ACA RNA, exhibits flexibility in fitting slightly different substrate sequences. Our results are consistent across three independent pseudouridylation pockets tested, suggesting that our findings are generally applicable to box H/ACA RNA-guided RNA pseudouridylation. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Cheng, Huaixu; Luo, Zhidan; Lu, Mingsheng; Gao, Song; Wang, Shujun
2017-05-01
The hyperthermophilic α-amylase from Thermococcus sp. HJ21 does not require exogenous calcium ions for thermostability, and is a promising alternative to commercially available α-amylases to increase the efficiency of industrial processes like the liquefaction of starch. We analyzed the amino acid sequence of this α-amylase by sequence alignments and structural modeling, and found that this α-amylase closely resembles the α-amylase from Pyrococcus woesei. The gene of this α-amylase was cloned in Escherichia coli and the recombinant α-amylase was overexpressed and purified with a combined renaturation-purification procedure. We confirmed thermostability and exogenous calcium ion independency of the recombinant α-amylase and further investigated the mechanism of the independency using biochemical approaches. The results suggested that the α-amylase has a high calcium ion binding affinity that traps a calcium ion that would not dissociate at high temperatures, providing a direct explanation as to why the addition of calcium ions is not required for thermostability. Understanding of the mechanism offers a strong base on which to further engineer properties of this α-amylase for better potential applications in industrial processes.
Murthy, S C; Bhat, G P; Thimmappaya, B
1985-01-01
A molecular dissection of the adenovirus EIIA early (E) promoter was undertaken to study the sequence elements required for transcription and to examine the nucleotide sequences, if any, specific for its trans-activation by the viral pre-early EIA gene product. A chimeric gene in which the EIIA-E promoter region fused to the coding sequences of the bacterial chloramphenicol acetyltransferase (CAT) gene was used in transient assays to identify the transcriptional control regions. Deletion mapping studies revealed that the upstream DNA sequences up to -86 were sufficient for the optimal basal level transcription in HeLa cells and also for the EIA-induced transcription. A series of linker-scanning (LS) mutants were constructed to precisely identify the nucleotide sequences that control transcription. Analysis of these LS mutants allowed us to identify two regions of the promoter that are critical for the EIIA-E transcription. These regions are located between -29 and -21 (region I) and between -82 and -66 (region II). Mutations in region I affected initiation and appeared functionally similar to the "TATA" sequence of the commonly studied promoters. To examine whether or not the EIIA-E promoter contained DNA sequences specific for the trans-activation by the EIA, the LS mutants were analyzed in a cotransfection assay containing a plasmid carrying the EIA gene. CAT activity of all of the LS mutants was induced by the EIA gene in this assay, suggesting that the induction of transcription of the EIIA-E promoter by the EIA gene is not sequence-specific. Images PMID:3857577
Teichmann, A Lina; Nieuwenstein, Mark R; Rich, Anina N
2017-08-01
For digit-color synaesthetes, digits elicit vivid experiences of color that are highly consistent for each individual. The conscious experience of synaesthesia is typically unidirectional: Digits evoke colors but not vice versa. There is an ongoing debate about whether synaesthetes have a memory advantage over non-synaesthetes. One key question in this debate is whether synaesthetes have a general superiority or whether any benefit is specific to a certain type of material. Here, we focus on immediate serial recall and ask digit-color synaesthetes and controls to memorize digit and color sequences. We developed a sensitive staircase method manipulating presentation duration to measure participants' serial recall of both overlearned and novel sequences. Our results show that synaesthetes can activate digit information to enhance serial memory for color sequences. When color sequences corresponded to ascending or descending digit sequences, synaesthetes encoded these sequences at a faster rate than their non-synaesthetes counterparts and faster than non-structured color sequences. However, encoding color sequences is approximately 200 ms slower than encoding digit sequences directly, independent of group and condition, which shows that the translation process is time consuming. These results suggest memory advantages in synaesthesia require a modified dual-coding account, in which secondary (synaesthetically linked) information is useful only if it is more memorable than the primary information to be recalled. Our study further shows that duration thresholds are a sensitive method to measure subtle differences in serial recall performance. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Henry, Kelli F.; Kawashima, Tomokazu; Goldberg, Robert B.
2015-03-22
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean ( Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we usemore » site-directed mutagenesis experiments in transgenic tobacco globularstage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. Lastly, a homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Henry, Kelli F.; Kawashima, Tomokazu; Goldberg, Robert B.
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean ( Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we usemore » site-directed mutagenesis experiments in transgenic tobacco globularstage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. Lastly, a homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.« less
Henry, Kelli F; Kawashima, Tomokazu; Goldberg, Robert B
2015-06-01
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean (Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we use site-directed mutagenesis experiments in transgenic tobacco globular-stage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. A homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.
Frequency of the first feature in action sequences influences feature binding.
Mattson, Paul S; Fournier, Lisa R; Behmer, Lawrence P
2012-10-01
We investigated whether binding among perception and action feature codes is a preliminary step toward creating a more durable memory trace of an action event. If so, increasing the frequency of a particular event (e.g., a stimulus requiring a movement with the left or right hand in an up or down direction) should increase the strength and speed of feature binding for this event. The results from two experiments, using a partial-repetition paradigm, confirmed that feature binding increased in strength and/or occurred earlier for a high-frequency (e.g., left hand moving up) than for a low-frequency (e.g., right hand moving down) event. Moreover, increasing the frequency of the first-specified feature in the action sequence alone (e.g., "left" hand) increased the strength and/or speed of action feature binding (e.g., between the "left" hand and movement in an "up" or "down" direction). The latter finding suggests an update to the theory of event coding, as not all features in the action sequence equally determine binding strength. We conclude that action planning involves serial binding of features in the order of action feature execution (i.e., associations among features are not bidirectional but are directional), which can lead to a more durable memory trace. This is consistent with physiological evidence suggesting that serial order is preserved in an action plan executed from memory and that the first feature in the action sequence may be critical in preserving this serial order.
Means, A L; Farnham, P J
1990-02-01
We have identified a sequence element that specifies the position of transcription initiation for the dihydrofolate reductase gene. Unlike the functionally analogous TATA box that directs RNA polymerase II to initiate transcription 30 nucleotides downstream, the positioning element of the dihydrofolate reductase promoter is located directly at the site of transcription initiation. By using DNase I footprint analysis, we have shown that a protein binds to this initiator element. Transcription initiated at the dihydrofolate reductase initiator element when 28 nucleotides were inserted between it and all other upstream sequences, or when it was placed on either side of the DNA helix, suggesting that there is no strict spatial requirement between the initiator and an upstream element. Although neither a single Sp1-binding site nor a single initiator element was sufficient for transcriptional activity, the combination of one Sp1-binding site and the dihydrofolate reductase initiator element cloned into a plasmid vector resulted in transcription starting at the initiator element. We have also shown that the simian virus 40 late major initiation site has striking sequence homology to the dihydrofolate reductase initiation site and that the same, or a similar, protein binds to both sites. Examination of the sequences at other RNA polymerase II initiation sites suggests that we have identified an element that is important in the transcription of other housekeeping genes. We have thus named the protein that binds to the initiator element HIP1 (Housekeeping Initiator Protein 1).
Clinical evaluation incorporating a personal genome
Ashley, Euan A.; Butte, Atul J.; Wheeler, Matthew T.; Chen, Rong; Klein, Teri E.; Dewey, Frederick E.; Dudley, Joel T.; Ormond, Kelly E.; Pavlovic, Aleksandra; Hudgins, Louanne; Gong, Li; Hodges, Laura M.; Berlin, Dorit S.; Thorn, Caroline F.; Sangkuhl, Katrin; Hebert, Joan M.; Woon, Mark; Sagreiya, Hersh; Whaley, Ryan; Morgan, Alexander A.; Pushkarev, Dmitry; Neff, Norma F; Knowles, Joshua W.; Chou, Mike; Thakuria, Joseph; Rosenbaum, Abraham; Zaranek, Alexander Wait; Church, George; Greely, Henry T.; Quake, Stephen R.; Altman, Russ B.
2010-01-01
Background The cost of genomic information has fallen steeply but the path to clinical translation of risk estimates for common variants found in genome wide association studies remains unclear. Since the speed and cost of sequencing complete genomes is rapidly declining, more comprehensive means of analyzing these data in concert with rare variants for genetic risk assessment and individualisation of therapy are required. Here, we present the first integrated analysis of a complete human genome in a clinical context. Methods An individual with a family history of vascular disease and early sudden death was evaluated. Clinical assessment included risk prediction for coronary artery disease, screening for causes of sudden cardiac death, and genetic counselling. Genetic analysis included the development of novel methods for the integration of whole genome sequence data including 2.6 million single nucleotide polymorphisms and 752 copy number variations. The algorithm focused on predicting genetic risk of genes associated with known Mendelian disease, recognised drug responses, and pathogenicity for novel variants. In addition, since integration of risk ratios derived from case control studies is challenging, we estimated posterior probabilities from age and sex appropriate prior probability and likelihood ratios derived for each genotype. In addition, we developed a visualisation approach to account for gene-environment interactions and conditionally dependent risks. Findings We found increased genetic risk for myocardial infarction, type II diabetes and certain cancers. Rare variants in LPA are consistent with the family history of coronary artery disease. Pharmacogenomic analysis suggested a positive response to lipid lowering therapy, likely clopidogrel resistance, and a low initial dosing requirement for warfarin. Many variants of uncertain significance were reported. Interpretation Although challenges remain, our results suggest that whole genome sequencing can yield useful and clinically relevant information for individual patients, especially for those with a strong family history of significant disease. PMID:20435227
Rotational evolution of slow-rotator sequence stars
NASA Astrophysics Data System (ADS)
Lanzafame, A. C.; Spada, F.
2015-12-01
Context. The observed relationship between mass, age and rotation in open clusters shows the progressive development of a slow-rotator sequence among stars possessing a radiative interior and a convective envelope during their pre-main sequence and main-sequence evolution. After 0.6 Gyr, most cluster members of this type have settled on this sequence. Aims: The observed clustering on this sequence suggests that it corresponds to some equilibrium or asymptotic condition that still lacks a complete theoretical interpretation, and which is crucial to our understanding of the stellar angular momentum evolution. Methods: We couple a rotational evolution model, which takes internal differential rotation into account, with classical and new proposals for the wind braking law, and fit models to the data using a Monte Carlo Markov chain (MCMC) method tailored to the problem at hand. We explore to what extent these models are able to reproduce the mass and time dependence of the stellar rotational evolution on the slow-rotator sequence. Results: The description of the evolution of the slow-rotator sequence requires taking the transfer of angular momentum from the radiative core to the convective envelope into account. We find that, in the mass range 0.85-1.10 M⊙, the core-envelope coupling timescale for stars in the slow-rotator sequence scales as M-7.28. Quasi-solid body rotation is achieved only after 1-2 Gyr, depending on stellar mass, which implies that observing small deviations from the Skumanich law (P ∝ √{t}) would require period data of older open clusters than is available to date. The observed evolution in the 0.1-2.5 Gyr age range and in the 0.85-1.10 M⊙ mass range is best reproduced by assuming an empirical mass dependence of the wind angular momentum loss proportional to the convective turnover timescale and to the stellar moment of inertia. Period isochrones based on our MCMC fit provide a tool for inferring stellar ages of solar-like main-sequence stars from their mass and rotation period that is largely independent of the wind braking model adopted. These effectively represent gyro-chronology relationships that take the physics of the two-zone model for the stellar angular momentum evolution into account.
Torres-Escobar, Ascención; Juárez-Rodríguez, María Dolores; Lamont, Richard J.
2013-01-01
Autoinducer-2 (AI-2) is required for biofilm formation and virulence of the oral pathogen Aggregatibacter actinomycetemcomitans, and we previously showed that lsrB codes for a receptor for AI-2. The lsrB gene is expressed as part of the lsrACDBFG operon, which is divergently transcribed from an adjacent lsrRK operon. In Escherichia coli, lsrRK encodes a repressor and AI-2 kinase that function to regulate lsrACDBFG. To determine if lsrRK controls lsrACDBFG expression and influences biofilm growth of A. actinomycetemcomitans, we first defined the promoters for each operon. Transcriptional reporter plasmids containing the 255-bp lsrACDBFG-lsrRK intergenic region (IGR) fused to lacZ showed that essential elements of lsrR promoter reside 89 to 255 bp upstream from the lsrR start codon. Two inverted repeat sequences that represent potential binding sites for LsrR and two sequences resembling the consensus cyclic AMP receptor protein (CRP) binding site were identified in this region. Using electrophoretic mobility shift assay (EMSA), purified LsrR and CRP proteins were shown to bind probes containing these sequences. Surprisingly, the 255-bp IGR did not contain the lsrA promoter. Instead, a fragment encompassing nucleotides +1 to +159 of lsrA together with the 255-bp IGR was required to promote lsrA transcription. This suggests that a region within the lsrA coding sequence influences transcription, or alternatively that the start codon of A. actinomycetemcomitans lsrA has been incorrectly annotated. Transformation of ΔlsrR, ΔlsrK, ΔlsrRK, and Δcrp deletion mutants with lacZ reporters containing the lsrA or lsrR promoter showed that LsrR negatively regulates and CRP positively regulates both lsrACDBFG and lsrRK. However, in contrast to what occurs in E. coli, deletion of lsrK had no effect on the transcriptional activity of the lsrA or lsrR promoters, suggesting that another kinase may be capable of phosphorylating AI-2 in A. actinomycetemcomitans. Finally, biofilm formation of the ΔlsrR, ΔlsrRK, and Δcrp mutants was significantly reduced relative to that of the wild type, indicating that proper regulation of the lsr locus is required for optimal biofilm growth by A. actinomycetemcomitans. PMID:23104800
Sequential Exposure of Bortezomib and Vorinostat is Synergistic in Multiple Myeloma Cells
Nanavati, Charvi; Mager, Donald E.
2018-01-01
Purpose To examine the combination of bortezomib and vorinostat in multiple myeloma cells (U266) and xenografts, and to assess the nature of their potential interactions with semi-mechanistic pharmacodynamic models and biomarkers. Methods U266 proliferation was examined for a range of bortezomib and vorinostat exposure times and concentrations (alone and in combination). A non-competitive interaction model was used with interaction parameters that reflect the nature of drug interactions after simultaneous and sequential exposures. p21 and cleaved PARP were measured using immunoblotting to assess critical biomarker dynamics. For xenografts, data were extracted from literature and modeled with a PK/PD model with an interaction parameter. Results Estimated model parameters for simultaneous in vitro and xenograft treatments suggested additive drug effects. The sequence of bortezomib preincubation for 24 hours, followed by vorinostat for 24 hours, resulted in an estimated interaction term significantly less than 1, suggesting synergistic effects. p21 and cleaved PARP were also up-regulated the most in this sequence. Conclusions Semi-mechanistic pharmacodynamic modeling suggests synergistic pharmacodynamic interactions for the sequential administration of bortezomib followed by vorinostat. Increased p21 and cleaved PARP expression can potentially explain mechanisms of their enhanced effects, which require further PK/PD systems analysis to suggest an optimal dosing regimen. PMID:28101809
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Quasi-reinforcement: control of responding by a percentage-reinforcement schedule1
Neuringer, Allen J.; Chung, Shin-Ho
1967-01-01
When a variable-interval schedule of reinforcement was segmented into small fixed-interval components, with reinforcements following some components and brief blackouts following the others, rate of responding doubled and a positively accelerated pattern within each component was obtained. Presented according to this percentage reinforcement paradigm, the blackouts approximated the functions of a food reinforcer. These effects occurred only when the behavior sequence required to produce reinforcement was identical to that required to produce blackout. The quasi-reinforcing effects of these blackout stimuli suggest that a neutral stimulus need not occasion or accompany a primary reinforcer to acquire reinforcing properties. PMID:16811304
Chowdhury, M; Taylor, J P; Chang, C F; Rappaport, J; Khalili, K
1992-01-01
A specific RNA sequence located in the leader of all human immunodeficiency virus type 1 (HIV-1) mRNAs termed the transactivation response element, or TAR, is a primary target for induction of HIV-1 long terminal repeat activity by the HIV-1-derived trans-regulatory protein, Tat. Human neurotropic virus, JC virus (JCV), a causative agent of the degenerative demyelinating disease progressive multifocal leukoencephalopathy, contains sequences in the 5' end of the late RNA species with an extensive homology to HIV-1 TAR. In this study, we examined the possible role of the JCV-derived TAR-homologous sequence in Tat-mediated activation of the JCV late promoter (Tada et al., Proc. Natl. Acad. Sci. USA 87:3479-3483, 1990). Results from site-directed mutagenesis revealed that critical G residues required for the function of HIV-1 TAR that are conserved in the JCV TAR homolog play an important role in Tat activation of the JCV promoter. In addition, in vivo competition studies suggest that shared regulatory components mediate Tat activation of the JCV late and HIV-1 long terminal repeat promoters. Furthermore, we showed that the JCV-derived TAR sequence behaves in the same way as HIV-1 TAR in response to two distinct Tat mutants, one of which that has no ability to bind to HIV-1 TAR and another that lacks transcriptional activity on a responsive promoter. These results suggest that the TAR homolog of the JCV late promoter is responsive to HIV-1 Tat induction and thus may participate in the overall activation of the JCV late promoter mediated by this transactivation. Images PMID:1331525
Nine Suggestions for Improving Sequences. Technology.
ERIC Educational Resources Information Center
Muro, Don
1995-01-01
Maintains that many educators are using sequences to create accompaniments and practice tapes geared to student abilities. Describes musical instruction using Musical Instrument Digital Interface (MIDI). Discusses eight suggestions designed to make the process of sequencing more efficient. (CFR)
SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics
Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf
2015-01-01
Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n6). Subsequently, numerous faster ‘Sankoff-style’ approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity (≥ quartic time). Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm ‘sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)’, which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff’s original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. Availability and implementation: SPARSE is freely available at http://www.bioinf.uni-freiburg.de/Software/SPARSE. Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25838465
Pms2 Suppresses Large Expansions of the (GAA·TTC)n Sequence in Neuronal Tissues
Bourn, Rebecka L.; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A.; Bidichandani, Sanjay I.
2012-01-01
Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)n sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)n sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)n sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)n sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)n sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway. PMID:23071719
Pms2 suppresses large expansions of the (GAA·TTC)n sequence in neuronal tissues.
Bourn, Rebecka L; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A; Bidichandani, Sanjay I
2012-01-01
Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)(n) sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)(n) sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)(n) sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)(n) sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)(n) sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway.
Invasion and persistence of a selfish gene in the Cnidaria.
Goddard, Matthew R; Leigh, Jessica; Roger, Andrew J; Pemberton, Andrew J
2006-12-20
Homing endonuclease genes (HEGs) are superfluous, but are capable of invading populations that mix alleles by biasing their inheritance patterns through gene conversion. One model suggests that their long-term persistence is achieved through recurrent invasion. This circumvents evolutionary degeneration, but requires reasonable rates of transfer between species to maintain purifying selection. Although HEGs are found in a variety of microbes, we found the previous discovery of this type of selfish genetic element in the mitochondria of a sea anemone surprising. We surveyed 29 species of Cnidaria for the presence of the COXI HEG. Statistical analyses provided evidence for HEG invasion. We also found that 96 individuals of Metridium senile, from five different locations in the UK, had identical HEG sequences. This lack of sequence divergence illustrates the stable nature of Anthozoan mitochondria. Our data suggests this HEG conforms to the recurrent invasion model of evolution. Ordinarily such low rates of HEG transfer would likely be insufficient to enable major invasion. However, the slow rate of Anthozoan mitochondrial change lengthens greatly the time to HEG degeneration: this significantly extends the periodicity of the HEG life-cycle. We suggest that a combination of very low substitution rates and rare transfers facilitated metazoan HEG invasion.
Shuttle OFT Level C navigation requirements
NASA Technical Reports Server (NTRS)
1980-01-01
Detailed requirements for the orbital operations computer loads, OPS 2, and OPS 8 are given. These requirements represent the total on-orbit/rendezvous navigation baseline requirements for the following principal functions: on-orbital/rendezvous navigation sequencer; on-orbit/rendezvous UPP sequencer; on-orbit rendezvous navigation; on-orbit prediction; on-orbit user parameter processing; and landing Site update.
Quantifying the Impact of Additional Laboratory Tests on the Quality of a Geomechanical Model
NASA Astrophysics Data System (ADS)
Fillion, Marie-Hélène; Hadjigeorgiou, John
2017-05-01
In an open-pit mine operation, the design of safe and economically viable slopes can be significantly influenced by the quality and quantity of collected geomechanical data. In several mining jurisdictions, codes and standards are available for reporting exploration data, but similar codes or guidelines are not formally available or enforced for geotechnical design. Current recommendations suggest a target level of confidence in the rock mass properties used for slope design. As these guidelines are qualitative and somewhat subjective, questions arise regarding the minimum number of tests to perform in order to reach the proposed level of confidence. This paper investigates the impact of defining a priori the required number of laboratory tests to conduct on rock core samples based on the geomechanical database of an operating open-pit mine in South Africa. In this review, to illustrate the process, the focus is on uniaxial compressive strength properties. Available strength data for 2 project stages were analysed using the small-sampling theory and the confidence interval approach. The results showed that the number of specimens was too low to obtain a reliable strength value for some geotechnical domains even if more specimens than the minimum proposed by the ISRM suggested methods were tested. Furthermore, the testing sequence used has an impact on the minimum number of specimens required. Current best practice cannot capture all possibilities regarding the geomechanical property distributions, and there is a demonstrated need for a method to determine the minimum number of specimens required while minimising the influence of the testing sequence.
Xie, Bingkun; Yang, Wei; Ouyang, Yongchang; Chen, Lichan; Jiang, Hesheng; Liao, Yuying; Liao, D. Joshua
2016-01-01
Tens of thousands of chimeric RNAs have been reported. Most of them contain a short homologous sequence (SHS) at the joining site of the two partner genes but are not associated with a fusion gene. We hypothesize that many of these chimeras may be technical artifacts derived from SHS-caused mis-priming in reverse transcription (RT) or polymerase chain reactions (PCR). We cloned six chimeric complementary DNAs (cDNAs) formed by human mitochondrial (mt) 16S rRNA sequences at an SHS, which were similar to several expression sequence tags (ESTs).These chimeras, which could not be detected with cDNA protection assay, were likely formed because some regions of the 16S rRNA are reversely complementary to another region to form an SHS, which allows the downstream sequence to loop back and anneal at the SHS to prime the synthesis of its complementary strand, yielding a palindromic sequence that can form a hairpin-like structure.We identified a 16S rRNA that ended at the 4th nucleotide(nt) of the mt-tRNA-leu was dominant and thus should be the wild type. We also cloned a mouse Bcl2-Nek9 chimeric cDNA that contained a 5-nt unmatchable sequence between the two partners, contained two copies of the reverse primer in the same direction but did not contain the forward primer, making it unclear how this Bcl2-Nek9 was formed and amplified. Moreover, a cDNA was amplified because one primer has 4 nts matched to the template, suggesting that there may be many more artificial cDNAs than we have realized, because the nuclear and mt genomes have many more 4-nt than 5-nt or longer homologues. Altogether, the chimeric cDNAs we cloned are good examples suggesting that many cDNAs may be artifacts due to SHS-caused mis-priming and thus greater caution should be taken when new sequence is obtained from a technique involving DNA polymerization. PMID:27148738
NASA Technical Reports Server (NTRS)
1974-01-01
The instruments to be flown on the Earth Observatory Satellite (EOS) system are defined. The instruments will be used to support the Land Resources Management (LRM) mission of the EOS. Program planning information and suggested acquisition activities for obtaining the instruments are presented. The subjects considered are as follows: (1) the performance and interface of the Thematic Mapper (TM) and the High Resolution Pointing Imager (HRPI), (2) procedure for interfacing the TM and HRPI with the EOS satellite, (3) a space vehicle integration plan suggesting the steps and sequence of events required to carry out the interface activities, and (4) suggested agreements between the contractors for providing timely and equitable solution of problems at minimum cost.
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
2013-01-01
Background The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another. PMID:23870653
Yates, Kathleen B.; Bi, Kevin; Darko, Samuel; Godec, Jernej; Gerdemann, Ulrike; Swadling, Leo; Douek, Daniel C.; Klenerman, Paul; Barnes, Eleanor J.; Sharpe, Arlene H.
2017-01-01
Abstract The T cell compartment must contain diversity in both T cell receptor (TCR) repertoire and cell state to provide effective immunity against pathogens. However, it remains unclear how differences in the TCR contribute to heterogeneity in T cell state. Single cell RNA-sequencing (scRNA-seq) can allow simultaneous measurement of TCR sequence and global transcriptional profile from single cells. However, current methods for TCR inference from scRNA-seq are limited in their sensitivity and require long sequencing reads, thus increasing the cost and decreasing the number of cells that can be feasibly analyzed. Here we present TRAPeS, a publicly available tool that can efficiently extract TCR sequence information from short-read scRNA-seq libraries. We apply it to investigate heterogeneity in the CD8+ T cell response in humans and mice, and show that it is accurate and more sensitive than existing approaches. Coupling TRAPeS with transcriptome analysis of CD8+ T cells specific for a single epitope from Yellow Fever Virus (YFV), we show that the recently described ‘naive-like’ memory population have significantly longer CDR3 regions and greater divergence from germline sequence than do effector-memory phenotype cells. This suggests that TCR usage is associated with the differentiation state of the CD8+ T cell response to YFV. PMID:28934479
Correia, J J; Chacko, B M; Lam, S S; Lin, K
2001-02-06
SMAD proteins are known to oligomerize and hetero-associate during their activation and translocation to the nucleus for transcriptional control. Analytical ultracentrifuge studies on Smad3 and Smad4 protein constructs are presented to clarify the model of homo- and hetero-oligomerization and the role of phosphorylation in the activation process. These constructs all exhibit a tendency to form disulfide cross-linked aggregates, primarily dimers, and a strong reducing agent, TCEP, was found to be required to determine the best estimates for reversible association models and equilibrium constants. A Smad4 construct, S4AF, consisting of the middle linker (L) domain and the C-terminal (C) domain, is shown to be a monomer, while a Smad3 construct, S3LC, consisting of the LC domains, is shown to form a trimer with an affinity K(3) = (1.2-3.1) x 10(9) M(-2). A Smad3 construct that mimics phosphorylation at the C-terminal target sequence, S3LC(3E), has 17--35-fold enhanced ability to form trimer over that of the wild-type construct, S3LC. S4AF associates with either S3LC or S3LC(3E) to form a hetero-trimer. In each case, the hetero-trimer is favored over the formation of the homo-trimer. Despite high sequence homology between Smad3 and Smad4, a chimeric Smad4 construct with an engineered Smad3 C-terminal pseudo-phosphorylation sequence, S4AF(3E), shows no tendency to form trimer. This suggests a Smad4-specific sequence insert inhibits homo-trimer formation, or other domains or sequences in S3LC are required in addition to the target sequence to mediate the formation of trimer. These results represent a direct molecular measure of the importance of hetero-trimerization and phosphorylation in the TGF-beta-activated Smad protein signal transduction process.
Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda
2012-01-01
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda
2012-01-01
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448
Buermeyer, A B; Thompson, N E; Strasheim, L A; Burgess, R R; Farnham, P J
1992-05-01
We examined the ability of purified RNA polymerase (RNAP) II lacking the carboxy-terminal heptapeptide repeat domain (CTD), called RNAP IIB, to transcribe a variety of promoters in HeLa extracts in which endogenous RNAP II activity was inhibited with anti-CTD monoclonal antibodies. Not all promoters were efficiently transcribed by RNAP IIB, and transcription did not correlate with the in vitro strength of the promoter or with the presence of a consensus TATA box. This was best illustrated by the GC-rich, non-TATA box promoters of the bidirectional dihydrofolate reductase (DHFR)-REP-encoding locus. Whereas the REP promoter was transcribed by RNAP IIB, the DHFR promoter remained inactive after addition of RNAP IIB to the antibody-inhibited reactions. However, both promoters were efficiently transcribed when purified RNAP with an intact CTD was added. We analyzed a series of promoter deletions to identify which cis elements determine the requirement for the CTD of RNAP II. All of the promoter deletions of both DHFR and REP retained the characteristics of their respective full-length promoters, suggesting that the information necessary to specify the requirement for the CTD is contained within approximately 65 bp near the initiation site. Furthermore, a synthetic minimal promoter of DHFR, consisting of a single binding site for Sp1 and a binding site for the HIP1 initiator cloned into a bacterial vector sequence, required RNAP II with an intact CTD for activity in vitro. Since the synthetic minimal promoter of DHFR and the smallest REP promoter deletion are both activated by Sp1, the differential response in this assay does not result from upstream activators. However, the sequences around the start sites of DHFR and REP are not similar and our data suggest that they bind different proteins. Therefore, we propose that specific initiator elements are important for determination of the requirement of some promoters for the CTD.
Bonatelli, Isabel A S; Carstens, Bryan C; Moraes, Evandro M
2015-01-01
Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.
Bonatelli, Isabel A. S.; Carstens, Bryan C.; Moraes, Evandro M.
2015-01-01
Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms. PMID:26561396
Representation of item position in immediate serial recall: Evidence from intrusion errors.
Fischer-Baum, Simon; McCloskey, Michael
2015-09-01
In immediate serial recall, participants are asked to recall novel sequences of items in the correct order. Theories of the representations and processes required for this task differ in how order information is maintained; some have argued that order is represented through item-to-item associations, while others have argued that each item is coded for its position in a sequence, with position being defined either by distance from the start of the sequence, or by distance from both the start and the end of the sequence. Previous researchers have used error analyses to adjudicate between these different proposals. However, these previous attempts have not allowed researchers to examine the full set of alternative proposals. In the current study, we analyzed errors produced in 2 immediate serial recall experiments that differ in the modality of input (visual vs. aural presentation of words) and the modality of output (typed vs. spoken responses), using new analysis methods that allow for a greater number of alternative hypotheses to be considered. We find evidence that sequence positions are represented relative to both the start and the end of the sequence, and show a contribution of the end-based representation beyond the final item in the sequence. We also find limited evidence for item-to-item associations, suggesting that both a start-end positional scheme and item-to-item associations play a role in representing item order in immediate serial recall. (c) 2015 APA, all rights reserved).
A programmable CCD driver circuit for multiphase CCD operation
NASA Technical Reports Server (NTRS)
Ewin, Audrey J.; Reed, Kenneth V.
1989-01-01
A programmable CCD (charge-coupled device) driver circuit was designed to drive CCDs in multiphased modes. The purpose of the drive electronics is to operate developmental CCD imaging arrays for NASA's tiltable moderate resolution imaging spectrometer (MODIS-T). Five objectives for the driver were considered during its design: (1) the circuit drives CCD electrode voltages between 0 V and +30 V to produce reasonable potential wells, (2) the driving sequence is started with one input signal, (3) the driving sequence is started with one input signal, (4) the circuit allows programming of frame sequences required by arrays of any size, (5) it produces interfacing signals for the CCD and the DTF (detector test facility). Simulation of the driver verified its function with the master clock running up to 10 MHz. This suggests a maximum rate of 400,000 pixels/s. Timing and packaging parameters were verified. The design uses 54 TTL (transistor-transistor logic) chips. Two versions of hardware were fabricated: wirewrap and printed circuit board. Both were verified functionally with a logic analyzer.
Bormann, Tobias; Seyboth, Margret; Umarova, Roza; Weiller, Cornelius
2015-06-01
Studies on verbal learning in patients with impaired verbal short-term memory (vSTM) have revealed dissociations among types of verbal information. Patients with impaired vSTM are able to learn lists of known words but fail to acquire new word forms. This suggests that vSTM is involved in new word learning. The present study assessed both new word learning and the learning of digit sequences in two patients with impaired vSTM. In two experiments, participants were required to learn people's names, ages and professions, or their four digit 'phone numbers'. The STM patients were impaired on learning unknown family names and phone numbers, but managed to acquire other verbal information. In contrast, a patient with a severe verbal episodic memory impairment was impaired across information types. These results indicate verbal STM involvement in the learning of digit sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Bartels, Ronald H M A; Donk, Roland
2002-10-01
Postinjury cervical spine instability typically requires surgical treatment in the acute or semiacute stage. The authors, however, report on three patients with older (> 8 weeks) untreated bilateral cervical facet dislocation. In two patients they attempted a classic anterior-posterior-anterior approach but failed. The misalignment in the second stage of the procedure could not be corrected, and they had to add a fourth, posterior, stage. To avoid the fourth stage, thereby reducing operating time and risk of neurological damage while turning the patient, they propose the following sequence: 1) a posterior approach to perform a complete facetectomy bilaterally with no attempt to reduce the dislocation; 2) an anterior microscopic discectomy with reduction of the dislocation and anterior fixation; and 3) posterior fixation. This sequence of procedures was successfully performed in the third patient. Based on this experience, they suggest that in cases of nonacute bilateral cervical facet dislocations the operating sequence should be posterior-anterior-posterior.
The language faculty that wasn't: a usage-based account of natural language recursion
Christiansen, Morten H.; Chater, Nick
2015-01-01
In the generative tradition, the language faculty has been shrinking—perhaps to include only the mechanism of recursion. This paper argues that even this view of the language faculty is too expansive. We first argue that a language faculty is difficult to reconcile with evolutionary considerations. We then focus on recursion as a detailed case study, arguing that our ability to process recursive structure does not rely on recursion as a property of the grammar, but instead emerges gradually by piggybacking on domain-general sequence learning abilities. Evidence from genetics, comparative work on non-human primates, and cognitive neuroscience suggests that humans have evolved complex sequence learning skills, which were subsequently pressed into service to accommodate language. Constraints on sequence learning therefore have played an important role in shaping the cultural evolution of linguistic structure, including our limited abilities for processing recursive structure. Finally, we re-evaluate some of the key considerations that have often been taken to require the postulation of a language faculty. PMID:26379567
Physical Model of the Genotype-to-Phenotype Map of Proteins
NASA Astrophysics Data System (ADS)
Tlusty, Tsvi; Libchaber, Albert; Eckmann, Jean-Pierre
2017-04-01
How DNA is mapped to functional proteins is a basic question of living matter. We introduce and study a physical model of protein evolution which suggests a mechanical basis for this map. Many proteins rely on large-scale motion to function. We therefore treat protein as learning amorphous matter that evolves towards such a mechanical function: Genes are binary sequences that encode the connectivity of the amino acid network that makes a protein. The gene is evolved until the network forms a shear band across the protein, which allows for long-range, soft modes required for protein function. The evolution reduces the high-dimensional sequence space to a low-dimensional space of mechanical modes, in accord with the observed dimensional reduction between genotype and phenotype of proteins. Spectral analysis of the space of 1 06 solutions shows a strong correspondence between localization around the shear band of both mechanical modes and the sequence structure. Specifically, our model shows how mutations are correlated among amino acids whose interactions determine the functional mode.
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
1994-01-01
The apparatus that permits protein translocation across the internal thylakoid membranes of chloroplasts is completely unknown, even though these membranes have been the subject of extensive biochemical analysis. We have used a genetic approach to characterize the translocation of Chlamydomonas cytochrome f, a chloroplast-encoded protein that spans the thylakoid once. Mutations in the hydrophobic core of the cytochrome f signal sequence inhibit the accumulation of cytochrome f, lead to an accumulation of precursor, and impair the ability of Chlamydomonas cells to grow photosynthetically. One hydrophobic core mutant also reduces the accumulation of other thylakoid membrane proteins, but not those that translocate completely across the membrane. These results suggest that the signal sequence of cytochrome f is required and is involved in one of multiple insertion pathways. Suppressors of two signal peptide mutations describe at least two nuclear genes whose products likely describe the translocation apparatus, and selected second-site chloroplast suppressors further define regions of the cytochrome f signal peptide. PMID:8034740
The language faculty that wasn't: a usage-based account of natural language recursion.
Christiansen, Morten H; Chater, Nick
2015-01-01
In the generative tradition, the language faculty has been shrinking-perhaps to include only the mechanism of recursion. This paper argues that even this view of the language faculty is too expansive. We first argue that a language faculty is difficult to reconcile with evolutionary considerations. We then focus on recursion as a detailed case study, arguing that our ability to process recursive structure does not rely on recursion as a property of the grammar, but instead emerges gradually by piggybacking on domain-general sequence learning abilities. Evidence from genetics, comparative work on non-human primates, and cognitive neuroscience suggests that humans have evolved complex sequence learning skills, which were subsequently pressed into service to accommodate language. Constraints on sequence learning therefore have played an important role in shaping the cultural evolution of linguistic structure, including our limited abilities for processing recursive structure. Finally, we re-evaluate some of the key considerations that have often been taken to require the postulation of a language faculty.
Intrinsic sequence specificity of the Cas1 integrase directs new spacer acquisition
Rollie, Clare; Schneider, Stefanie; Brinkmann, Anna Sophie; Bolt, Edward L; White, Malcolm F
2015-01-01
The adaptive prokaryotic immune system CRISPR-Cas provides RNA-mediated protection from invading genetic elements. The fundamental basis of the system is the ability to capture small pieces of foreign DNA for incorporation into the genome at the CRISPR locus, a process known as Adaptation, which is dependent on the Cas1 and Cas2 proteins. We demonstrate that Cas1 catalyses an efficient trans-esterification reaction on branched DNA substrates, which represents the reverse- or disintegration reaction. Cas1 from both Escherichia coli and Sulfolobus solfataricus display sequence specific activity, with a clear preference for the nucleotides flanking the integration site at the leader-repeat 1 boundary of the CRISPR locus. Cas2 is not required for this activity and does not influence the specificity. This suggests that the inherent sequence specificity of Cas1 is a major determinant of the adaptation process. DOI: http://dx.doi.org/10.7554/eLife.08716.001 PMID:26284603
Lavery, Richard; Zakrzewska, Krystyna; Beveridge, David; Bishop, Thomas C.; Case, David A.; Cheatham, Thomas; Dixit, Surjit; Jayaram, B.; Lankas, Filip; Laughton, Charles; Maddocks, John H.; Michon, Alexis; Osman, Roman; Orozco, Modesto; Perez, Alberto; Singh, Tanya; Spackova, Nada; Sponer, Jiri
2010-01-01
It is well recognized that base sequence exerts a significant influence on the properties of DNA and plays a significant role in protein–DNA interactions vital for cellular processes. Understanding and predicting base sequence effects requires an extensive structural and dynamic dataset which is currently unavailable from experiment. A consortium of laboratories was consequently formed to obtain this information using molecular simulations. This article describes results providing information not only on all 10 unique base pair steps, but also on all possible nearest-neighbor effects on these steps. These results are derived from simulations of 50–100 ns on 39 different DNA oligomers in explicit solvent and using a physiological salt concentration. We demonstrate that the simulations are converged in terms of helical and backbone parameters. The results show that nearest-neighbor effects on base pair steps are very significant, implying that dinucleotide models are insufficient for predicting sequence-dependent behavior. Flanking base sequences can notably lead to base pair step parameters in dynamic equilibrium between two conformational sub-states. Although this study only provides limited data on next-nearest-neighbor effects, we suggest that such effects should be analyzed before attempting to predict the sequence-dependent behavior of DNA. PMID:19850719
Harper, B; McClain, S; Ganko, E W
2012-08-01
Global regulatory agencies require bioinformatic sequence analysis as part of their safety evaluation for transgenic crops. Analysis typically focuses on encoded proteins and adjacent endogenous flanking sequences. Recently, regulatory expectations have expanded to include all reading frames of the inserted DNA. The intent is to provide biologically relevant results that can be used in the overall assessment of safety. This paper evaluates the relevance of assessing the allergenic potential of all DNA reading frames found in common food genes using methods considered for the analysis of T-DNA sequences used in transgenic crops. FASTA and BLASTX algorithms were used to compare genes from maize, rice, soybean, cucumber, melon, watermelon, and tomato using international regulatory guidance. Results show that BLASTX for maize yielded 7254 alignments that exceeded allergen similarity thresholds and 210,772 alignments that matched eight or more consecutive amino acids with an allergen; other crops produced similar results. This analysis suggests that each nontransgenic crop has a much greater potential for allergenic risk than what has been observed clinically. We demonstrate that a meaningful safety assessment is unlikely to be provided by using methods with inherently high frequencies of false positive alignments when broadly applied to all reading frames of DNA sequence. Copyright © 2012 Elsevier Inc. All rights reserved.
Gummed-up memory: chewing gum impairs short-term recall.
Kozlov, Michail D; Hughes, Robert W; Jones, Dylan M
2012-01-01
Several studies have suggested that short-term memory is generally improved by chewing gum. However, we report the first studies to show that chewing gum impairs short-term memory for both item order and item identity. Experiment 1 showed that chewing gum reduces serial recall of letter lists. Experiment 2 indicated that chewing does not simply disrupt vocal-articulatory planning required for order retention: Chewing equally impairs a matched task that required retention of list item identity. Experiment 3 demonstrated that manual tapping produces a similar pattern of impairment to that of chewing gum. These results clearly qualify the assertion that chewing gum improves short-term memory. They also pose a problem for short-term memory theories asserting that forgetting is based on domain-specific interference given that chewing does not interfere with verbal memory any more than tapping. It is suggested that tapping and chewing reduce the general capacity to process sequences.
Sieira, Rodrigo; Comerci, Diego J.; Sánchez, Daniel O.; Ugalde, Rodolfo A.
2000-01-01
As part of a Brucella abortus 2308 genome project carried out in our laboratory, we identified, cloned, and sequenced a genomic DNA fragment containing a locus (virB) highly homologous to bacterial type IV secretion systems. The B. abortus virB locus is a collinear arrangement of 13 open reading frames (ORFs). Between virB1 and virB2 and downstream of ORF12, two degenerated, palindromic repeat sequences characteristic of Brucella intergenic regions were found. Gene reporter studies demonstrated that the B. abortus virB locus constitutes an operon transcribed from virB1 which is turned on during the stationary phase of growth. A B. abortus polar virB1 mutant failed to replicate in HeLa cells, indicating that the virB operon plays a critical role in intracellular multiplication. Mutants with polar and nonpolar mutations introduced in virB10 showed different behaviors in mice and in the HeLa cell infection assay, suggesting that virB10 per se is necessary for the correct function of this type IV secretion apparatus. Mouse infection assays demonstrated that the virB operon constitutes a major determinant of B. abortus virulence. It is suggested that putative effector molecules secreted by this type IV secretion system determine routing of B. abortus to an endoplasmic reticulum-related replication compartment. PMID:10940027
Sequence requirement of the ade6-4095 meiotic recombination hotspot in Schizosaccharomyces pombe.
Foulis, Steven J; Fowler, Kyle R; Steiner, Walter W
2018-02-01
Homologous recombination occurs at a greatly elevated frequency in meiosis compared to mitosis and is initiated by programmed double-strand DNA breaks (DSBs). DSBs do not occur at uniform frequency throughout the genome in most organisms, but occur preferentially at a limited number of sites referred to as hotspots. The location of hotspots have been determined at nucleotide-level resolution in both the budding and fission yeasts, and while several patterns have emerged regarding preferred locations for DSB hotspots, it remains unclear why particular sites experience DSBs at much higher frequency than other sites with seemingly similar properties. Short sequence motifs, which are often sites for binding of transcription factors, are known to be responsible for a number of hotspots. In this study we identified the minimum sequence required for activity of one of such motif identified in a screen of random sequences capable of producing recombination hotspots. The experimentally determined sequence, GGTCTRGACC, closely matches the previously inferred sequence. Full hotspot activity requires an effective sequence length of 9.5 bp, whereas moderate activity requires an effective sequence length of approximately 8.2 bp and shows significant association with DSB hotspots. In combination with our previous work, this result is consistent with a large number of different sequence motifs capable of producing recombination hotspots, and supports a model in which hotspots can be rapidly regenerated by mutation as they are lost through recombination.
Sequence specificity of the human mRNA N6-adenosine methylase in vitro.
Harper, J E; Miceli, S M; Roberts, R J; Manley, J L
1990-01-01
N6-adenosine methylation is a frequent modification of mRNAs and their precursors, but little is known about the mechanism of the reaction or the function of the modification. To explore these questions, we developed conditions to examine N6-adenosine methylase activity in HeLa cell nuclear extracts. Transfer of the methyl group from S-[3H methyl]-adenosylmethionine to unlabeled random copolymer RNA substrates of varying ribonucleotide composition revealed a substrate specificity consistent with a previously deduced consensus sequence, Pu[G greater than A]AC[A/C/U]. 32-P labeled RNA substrates of defined sequence were used to examine the minimum sequence requirements for methylation. Each RNA was 20 nucleotides long, and contained either the core consensus sequence GGACU, or some variation of this sequence. RNAs containing GGACU, either in single or multiple copies, were good substrates for methylation, whereas RNAs containing single base substitutions within the GGACU sequence gave dramatically reduced methylation. These results demonstrate that the N6-adenosine methylase has a strict sequence specificity, and that there is no requirement for extended sequences or secondary structures for methylation. Recognition of this sequence does not require an RNA component, as micrococcal nuclease pretreatment of nuclear extracts actually increased methylation efficiency. Images PMID:2216767
Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees.
Williams, Philip H; Eyles, Rod; Weiller, Georg
2012-01-01
MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require "read count" to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read-count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA:miRNA(∗) duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.
2012-01-01
Background Streptomyces species are widely distributed in natural habitats, such as soils, lakes, plants and some extreme environments. Replication loci of several Streptomyces theta-type plasmids have been reported, but are not characterized in details. Conjugation loci of some Streptomyces rolling-circle-type plasmids are identified and mechanism of conjugal transferring are described. Results We report the detection of a widely distributed Streptomyces strain Y27 and its indigenous plasmid pWTY27 from fourteen plants and four soil samples cross China by both culturing and nonculturing methods. The complete nucleotide sequence of pWTY27 consisted of 14,288 bp. A basic locus for plasmid replication comprised repAB genes and an adjacent iteron sequence, to a long inverted-repeat (ca. 105 bp) of which the RepA protein bound specifically in vitro, suggesting that RepA may recognize a second structure (e.g. a long stem-loop) of the iteron DNA. A plasmid containing the locus propagated in linear mode when the telomeres of a linear plasmid were attached, indicating a bi-directional replication mode for pWTY27. As for rolling-circle plasmids, a single traA gene and a clt sequence (covering 16 bp within traA and its adjacent 159 bp) on pWTY27 were required for plasmid transfer. TraA recognized and bound specifically to the two regions of the clt sequence, one containing all the four DC1 of 7 bp (TGACACC) and one DC2 (CCCGCCC) and most of IC1, and another covering two DC2 and part of IC1, suggesting formation of a high-ordered DNA-protein complex. Conclusions This work (i) isolates a widespread Streptomyces strain Y27 and sequences its indigenous theta-type plasmid pWTY27; (ii) identifies the replication and conjugation loci of pWTY27 and; (iii) characterizes the binding sequences of the RepA and TraA proteins. PMID:23134842
Mundy, N I; Kelly, J
2001-05-01
The evolutionary relationships of the lion tamarins (Leontopithecus) were investigated using nuclear interphotoreceptor retinol binding protein (IRBP) intron sequences. Phylogenetic reconstructions strongly support the monophyly of the genus, and a sister relationship between the golden lion tamarin, Leontopithecus rosalia, and the black lion tamarin, L. chrysopygus, to the exclusion of the golden-headed lion tamarin, L. chrysomelas. The most parsimonious evolutionary reconstruction suggests that the ancestral lion tamarin and the common ancestor of L. rosalia and L. chrysopygus had predominantly black coats. This reconstruction is not consistent with a theory of orthogenetic evolution of coat color that was based on coat color evolution in marmosets and tamarins. An alternative reconstruction that is consistent with metachromism requires that ancestral lion tamarins had agouti hairs. Copyright 2001 Wiley-Liss, Inc.
Sense transcription through the S region is essential for immunoglobulin class switch recombination
Haddad, Dania; Oruc, Zéliha; Puget, Nadine; Laviolette-Malirat, Nathalie; Philippe, Magali; Carrion, Claire; Le Bert, Marc; Khamlichi, Ahmed Amine
2011-01-01
Class switch recombination (CSR) occurs between highly repetitive sequences called switch (S) regions and is initiated by activation-induced cytidine deaminase (AID). CSR is preceded by a bidirectional transcription of S regions but the relative importance of sense and antisense transcription for CSR in vivo is unknown. We generated three mouse lines in which we attempted a premature termination of transcriptional elongation by inserting bidirectional transcription terminators upstream of Sμ, upstream of Sγ3 or downstream of Sγ3 sequences. The data show, at least for Sγ3, that sense transcriptional elongation across S region is absolutely required for CSR whereas its antisense counterpart is largely dispensable, strongly suggesting that sense transcription is sufficient for AID targeting to both DNA strands. PMID:21378751
BMPOS: a Flexible and User-Friendly Tool Sets for Microbiome Studies.
Pylro, Victor S; Morais, Daniel K; de Oliveira, Francislon S; Dos Santos, Fausto G; Lemos, Leandro N; Oliveira, Guilherme; Roesch, Luiz F W
2016-08-01
Recent advances in science and technology are leading to a revision and re-orientation of methodologies, addressing old and current issues under a new perspective. Advances in next generation sequencing (NGS) are allowing comparative analysis of the abundance and diversity of whole microbial communities, generating a large amount of data and findings at a systems level. The current limitation for biologists has been the increasing demand for computational power and training required for processing of NGS data. Here, we describe the deployment of the Brazilian Microbiome Project Operating System (BMPOS), a flexible and user-friendly Linux distribution dedicated to microbiome studies. The Brazilian Microbiome Project (BMP) has developed data analyses pipelines for metagenomic studies (phylogenetic marker genes), conducted using the two main high-throughput sequencing platforms (Ion Torrent and Illumina MiSeq). The BMPOS is freely available and possesses the entire requirement of bioinformatics packages and databases to perform all the pipelines suggested by the BMP team. The BMPOS may be used as a bootable live USB stick or installed in any computer with at least 1 GHz CPU and 512 MB RAM, independent of the operating system previously installed. The BMPOS has proved to be effective for sequences processing, sequences clustering, alignment, taxonomic annotation, statistical analysis, and plotting of metagenomic data. The BMPOS has been used during several metagenomic analyses courses, being valuable as a tool for training, and an excellent starting point to anyone interested in performing metagenomic studies. The BMPOS and its documentation are available at http://www.brmicrobiome.org .
A signal-arrest-release sequence mediates export and control of the phage P1 endolysin
Xu, Min; Struck, Douglas K.; Deaton, John; Wang, Ing-Nang; Young, Ry
2004-01-01
The Lyz endolysin of bacteriophage P1 was found to cause lysis of the host without a holin. Induction of a plasmid-cloned lyz resulted in lysis, and the lytic event could be triggered prematurely by treatments that dissipate the proton-motive force. Instead of requiring a holin, export was mediated by an N-terminal transmembrane domain (TMD) and required host sec function. Exported Lyz of identical SDS/PAGE mobility was found in both the membrane and periplasmic compartments, indicating that periplasmic Lyz was not generated by the proteolytic cleavage of the membrane-associated form. In gene fusion experiments, the Lyz TMD directed PhoA to both the membrane and periplasmic compartments, whereas the TMD of the integral membrane protein FtsI restricts Lyz to the membrane. Thus, the N-terminal domain of Lyz is both necessary and sufficient not only for export of this endolysin to the membrane but also for its release into the periplasm. The unusual N-terminal domain, rich in residues that are weakly hydrophobic, thus functions as a signal-arrest-release sequence, which first acts as a normal signal-arrest domain to direct the endolysin to the periplasm in membrane-tethered form and then allows it to be released as a soluble active enzyme in the periplasm. Examination of the protein sequences of related bacteriophage endolysins suggests that the presence of an N-terminal signal-arrest-release sequence is not unique to Lyz. These observations are discussed in relation to the role of holins in the control of host lysis by bacteriophage encoding a secretory endolysin. PMID:15090650
Study design requirements for RNA sequencing-based breast cancer diagnostics.
Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias
2016-02-01
Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.
Identification and application of self-binding zipper-like sequences in SARS-CoV spike protein.
Zhang, Si Min; Liao, Ying; Neo, Tuan Ling; Lu, Yanning; Liu, Ding Xiang; Vahlne, Anders; Tam, James P
2018-05-22
Self-binding peptides containing zipper-like sequences, such as the Leu/Ile zipper sequence within the coiled coil regions of proteins and the cross-β spine steric zippers within the amyloid-like fibrils, could bind to the protein-of-origin through homophilic sequence-specific zipper motifs. These self-binding sequences represent opportunities for the development of biochemical tools and/or therapeutics. Here, we report on the identification of a putative self-binding β-zipper-forming peptide within the severe acute respiratory syndrome-associated coronavirus spike (S) protein and its application in viral detection. Peptide array scanning of overlapping peptides covering the entire length of S protein identified 34 putative self-binding peptides of six clusters, five of which contained octapeptide core consensus sequences. The Cluster I consensus octapeptide sequence GINITNFR was predicted by the Eisenberg's 3D profile method to have high amyloid-like fibrillation potential through steric β-zipper formation. Peptide C6 containing the Cluster I consensus sequence was shown to oligomerize and form amyloid-like fibrils. Taking advantage of this, C6 was further applied to detect the S protein expression in vitro by fluorescence staining. Meanwhile, the coiled-coil-forming Leu/Ile heptad repeat sequences within the S protein were under-represented during peptide array scanning, in agreement with that long peptide lengths were required to attain high helix-mediated interaction avidity. The data suggest that short β-zipper-like self-binding peptides within the S protein could be identified through combining the peptide scanning and predictive methods, and could be exploited as biochemical detection reagents for viral infection. Copyright © 2018. Published by Elsevier Ltd.
Nakamura, Mikiko; Suzuki, Ayako; Akada, Junko; Tomiyoshi, Keisuke; Hoshida, Hisashi; Akada, Rinji
2015-12-01
Mammalian gene expression constructs are generally prepared in a plasmid vector, in which a promoter and terminator are located upstream and downstream of a protein-coding sequence, respectively. In this study, we found that front terminator constructs-DNA constructs containing a terminator upstream of a promoter rather than downstream of a coding region-could sufficiently express proteins as a result of end joining of the introduced DNA fragment. By taking advantage of front terminator constructs, FLAG substitutions, and deletions were generated using mutagenesis primers to identify amino acids specifically recognized by commercial FLAG antibodies. A minimal epitope sequence for polyclonal FLAG antibody recognition was also identified. In addition, we analyzed the sequence of a C-terminal Ser-Lys-Leu peroxisome localization signal, and identified the key residues necessary for peroxisome targeting. Moreover, front terminator constructs of hepatitis B surface antigen were used for deletion analysis, leading to the identification of regions required for the particle formation. Collectively, these results indicate that front terminator constructs allow for easy manipulations of C-terminal protein-coding sequences, and suggest that direct gene expression with PCR-amplified DNA is useful for high-throughput protein analysis in mammalian cells.
Sloma, A; Rufo, G A; Theriault, K A; Dwyer, M; Wilson, S W; Pero, J
1991-11-01
We have purified a minor extracellular serine protease from a strain of Bacillus subtilis bearing null mutations in five extracellular protease genes: apr, npr, epr, bpr, and mpr (A. Sloma, C. Rudolph, G. Rufo, Jr., B. Sullivan, K. Theriault, D. Ally, and J. Pero, J. Bacteriol. 172:1024-1029, 1990). During purification, this novel protease (Vpr) was found bound in a complex in the void volume after gel filtration chromatography. The amino-terminal sequence of the purified protein was determined, and an oligonucleotide probe was constructed on the basis of the amino acid sequence. This probe was used to clone the structural gene (vpr) for this protease. The gene encodes a primary product of 806 amino acids. The amino acid sequence of the mature protein was preceded by a signal sequence of approximately 28 amino acids and a prosequence of approximately 132 amino acids. The mature protein has a predicted molecular weight of 68,197; however, the isolated protein has an apparent molecular weight of 28,500, suggesting that Vpr undergoes C-terminal processing or proteolysis. The vpr gene maps in the ctrA-sacA-epr region of the chromosome and is not required for growth or sporulation.
Brain activation during anticipation of sound sequences.
Leaver, Amber M; Van Lare, Jennifer; Zielinski, Brandon; Halpern, Andrea R; Rauschecker, Josef P
2009-02-25
Music consists of sound sequences that require integration over time. As we become familiar with music, associations between notes, melodies, and entire symphonic movements become stronger and more complex. These associations can become so tight that, for example, hearing the end of one album track can elicit a robust image of the upcoming track while anticipating it in total silence. Here, we study this predictive "anticipatory imagery" at various stages throughout learning and investigate activity changes in corresponding neural structures using functional magnetic resonance imaging. Anticipatory imagery (in silence) for highly familiar naturalistic music was accompanied by pronounced activity in rostral prefrontal cortex (PFC) and premotor areas. Examining changes in the neural bases of anticipatory imagery during two stages of learning conditional associations between simple melodies, however, demonstrates the importance of fronto-striatal connections, consistent with a role of the basal ganglia in "training" frontal cortex (Pasupathy and Miller, 2005). Another striking change in neural resources during learning was a shift between caudal PFC earlier to rostral PFC later in learning. Our findings regarding musical anticipation and sound sequence learning are highly compatible with studies of motor sequence learning, suggesting common predictive mechanisms in both domains.
Brain Activation During Anticipation of Sound Sequences
Leaver, Amber M.; Van Lare, Jennifer; Zielinski, Brandon; Halpern, Andrea R.; Rauschecker, Josef P.
2010-01-01
Music consists of sound sequences that require integration over time. As we become familiar with music, associations between notes, melodies, and entire symphonic movements become stronger and more complex. These associations can become so tight that, for example, hearing the end of one album track can elicit a robust image of the upcoming track while anticipating it in total silence. Here we study this predictive “anticipatory imagery” at various stages throughout learning and investigate activity changes in corresponding neural structures using functional magnetic resonance imaging (fMRI). Anticipatory imagery (in silence) for highly familiar naturalistic music was accompanied by pronounced activity in rostral prefrontal cortex (PFC) and premotor areas. Examining changes in the neural bases of anticipatory imagery during two stages of learning conditional associations between simple melodies, however, demonstrates the importance of fronto-striatal connections, consistent with a role of the basal ganglia in “training” frontal cortex (Pasupathy and Miller, 2005). Another striking change in neural resources during learning was a shift between caudal PFC earlier to rostral PFC later in learning. Our findings regarding musical anticipation and sound sequence learning are highly compatible with studies of motor sequence learning, suggesting common predictive mechanisms in both domains. PMID:19244522
Interspecific Plastome Recombination Reflects Ancient Reticulate Evolution in Picea (Pinaceae).
Sullivan, Alexis R; Schiffthaler, Bastian; Thompson, Stacey Lee; Street, Nathaniel R; Wang, Xiao-Ru
2017-07-01
Plastid sequences are a cornerstone in plant systematic studies and key aspects of their evolution, such as uniparental inheritance and absent recombination, are often treated as axioms. While exceptions to these assumptions can profoundly influence evolutionary inference, detecting them can require extensive sampling, abundant sequence data, and detailed testing. Using advancements in high-throughput sequencing, we analyzed the whole plastomes of 65 accessions of Picea, a genus of ∼35 coniferous forest tree species, to test for deviations from canonical plastome evolution. Using complementary hypothesis and data-driven tests, we found evidence for chimeric plastomes generated by interspecific hybridization and recombination in the clade comprising Norway spruce (P. abies) and 10 other species. Support for interspecific recombination remained after controlling for sequence saturation, positive selection, and potential alignment artifacts. These results reconcile previous conflicting plastid-based phylogenies and strengthen the mounting evidence of reticulate evolution in Picea. Given the relatively high frequency of hybridization and biparental plastid inheritance in plants, we suggest interspecific plastome recombination may be more widespread than currently appreciated and could underlie reported cases of discordant plastid phylogenies. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The tripartite leader sequence is required for ectopic expression of HAdV-B and HAdV-E E3 CR1 genes.
Bair, Camden R; Kotha Lakshmi Narayan, Poornima; Kajon, Adriana E
2017-05-01
The unique repertoire of genes that characterizes the early region 3 (E3) of the different species of human adenovirus (HAdV) likely contributes to their distinct pathogenic traits. The function of many E3 CR1 proteins remains unknown possibly due to unidentified intrinsic properties that make them difficult to express ectopically. This study shows that the species HAdV-B- and HAdV-E-specific E3 CR1 genes can be expressed from vectors carrying the HAdV tripartite leader (TPL) sequence but not from traditional mammalian expression vectors. Insertion of the TPL sequence upstream of the HAdV-B and HAdV-E E3 CR1 open reading frames was sufficient to rescue protein expression from pCI-neo constructs in transfected 293T cells. The detection of higher levels of HAdV-B and HAdV-E E3 CR1 transcripts suggests that the TPL sequence may enhance gene expression at both the transcriptional and translational levels. Our findings will facilitate the characterization of additional AdV E3 proteins. Copyright © 2017 Elsevier Inc. All rights reserved.
Quadrupedal locomotor simulation: producing more realistic gaits using dual-objective optimization
Hirasaki, Eishi
2018-01-01
In evolutionary biomechanics it is often considered that gaits should evolve to minimize the energetic cost of travelling a given distance. In gait simulation this goal often leads to convincing gait generation. However, as the musculoskeletal models used get increasingly sophisticated, it becomes apparent that such a single goal can lead to extremely unrealistic gait patterns. In this paper, we explore the effects of requiring adequate lateral stability and show how this increases both energetic cost and the realism of the generated walking gait in a high biofidelity chimpanzee musculoskeletal model. We also explore the effects of changing the footfall sequences in the simulation so it mimics both the diagonal sequence walking gaits that primates typically use and also the lateral sequence walking gaits that are much more widespread among mammals. It is apparent that adding a lateral stability criterion has an important effect on the footfall phase relationship, suggesting that lateral stability may be one of the key drivers behind the observed footfall sequences in quadrupedal gaits. The observation that single optimization goals are no longer adequate for generating gait in current models has important implications for the use of biomimetic virtual robots to predict the locomotor patterns in fossil animals. PMID:29657790
Bonin, Tanor; Smilek, Daniel
2016-04-01
We evaluated whether task-irrelevant inharmonic music produces greater interference with cognitive performance than task-irrelevant harmonic music. Participants completed either an auditory (Experiment 1) or a visual (Experiment 2) version of the cognitively demanding 2-back task in which they were required to categorize each digit in a sequence of digits as either being a target (a digit also presented two positions earlier in the sequence) or a distractor (all other items). They were concurrently exposed to either task-irrelevant harmonic music (judged to be consonant), task-irrelevant inharmonic music (judged to be dissonant), or no music at all as a distraction. The main finding across both experiments was that performance on the 2-back task was worse when participants were exposed to inharmonic music than when they were exposed to harmonic music. Interestingly, performance on the 2-back task was generally the same regardless of whether harmonic music or no music was played. We suggest that inharmonic, dissonant music interferes with cognitive performance by requiring greater cognitive processing than harmonic, consonant music, and speculate about why this might be.
Nero, Thomas M; Dalia, Triana N; Wang, Joseph Che-Yen; Kysela, David T; Bochman, Matthew L; Dalia, Ankur B
2018-05-02
Acquisition of foreign DNA by natural transformation is an important mechanism of adaptation and evolution in diverse microbial species. Here, we characterize the mechanism of ComM, a broadly conserved AAA+ protein previously implicated in homologous recombination of transforming DNA (tDNA) in naturally competent Gram-negative bacterial species. In vivo, we found that ComM was required for efficient comigration of linked genetic markers in Vibrio cholerae and Acinetobacter baylyi, which is consistent with a role in branch migration. Also, ComM was particularly important for integration of tDNA with increased sequence heterology, suggesting that its activity promotes the acquisition of novel DNA sequences. In vitro, we showed that purified ComM binds ssDNA, oligomerizes into a hexameric ring, and has bidirectional helicase and branch migration activity. Based on these data, we propose a model for tDNA integration during natural transformation. This study provides mechanistic insight into the enigmatic steps involved in tDNA integration and uncovers the function of a protein required for this conserved mechanism of horizontal gene transfer.
Wang, Zhen; Anderson, Nicholas Scott; Benning, Christoph
2013-01-01
Chloroplast membrane lipid synthesis relies on the import of glycerolipids from the ER. The TGD (TriGalactosylDiacylglycerol) proteins are required for this lipid transfer process. The TGD1, -2, and -3 proteins form a putative ABC (ATP-binding cassette) transporter transporting ER-derived lipids through the inner envelope membrane of the chloroplast, while TGD4 binds phosphatidic acid (PtdOH) and resides in the outer chloroplast envelope. We identified two sequences in TGD4, amino acids 1–80 and 110–145, which are necessary and sufficient for PtdOH binding. Deletion of both sequences abolished PtdOH binding activity. We also found that TGD4 from 18:3 plants bound specifically and with increased affinity PtdOH. TGD4 did not interact with other proteins and formed a homodimer both in vitro and in vivo. Our results suggest that TGD4 is an integral dimeric β-barrel lipid transfer protein that binds PtdOH with its N terminus and contains dimerization domains at its C terminus. PMID:23297418
Complex Instruction Set Quantum Computing
NASA Astrophysics Data System (ADS)
Sanders, G. D.; Kim, K. W.; Holton, W. C.
1998-03-01
In proposed quantum computers, electromagnetic pulses are used to implement logic gates on quantum bits (qubits). Gates are unitary transformations applied to coherent qubit wavefunctions and a universal computer can be created using a minimal set of gates. By applying many elementary gates in sequence, desired quantum computations can be performed. This reduced instruction set approach to quantum computing (RISC QC) is characterized by serial application of a few basic pulse shapes and a long coherence time. However, the unitary matrix of the overall computation is ultimately a unitary matrix of the same size as any of the elementary matrices. This suggests that we might replace a sequence of reduced instructions with a single complex instruction using an optimally taylored pulse. We refer to this approach as complex instruction set quantum computing (CISC QC). One trades the requirement for long coherence times for the ability to design and generate potentially more complex pulses. We consider a model system of coupled qubits interacting through nearest neighbor coupling and show that CISC QC can reduce the time required to perform quantum computations.
NASA Technical Reports Server (NTRS)
Williams, Jacob; Davis, Elizabeth C.; Lee, David E.; Condon, Gerald L.; Dawn, Tim
2009-01-01
The Orion spacecraft will be required to perform a three-burn trans-Earth injection (TEI) maneuver sequence to return to Earth from low lunar orbit. The origin of this approach lies in the Constellation Program requirements for access to any lunar landing site location combined with anytime lunar departure. This paper documents the development of optimized databases used to rapidly model the performance requirements of the TEI three-burn sequence for an extremely large number of mission cases. It also discusses performance results for lunar departures covering a complete 18.6 year lunar nodal cycle as well as general characteristics of the optimized three-burn TEI sequence.
NASA Technical Reports Server (NTRS)
Ingels, F. M.; Schoggen, W. O.
1982-01-01
The design to achieve the required bit transition density for the Space Shuttle high rate multiplexes (HRM) data stream of the Space Laboratory Vehicle is reviewed. It contained a recommended circuit approach, specified the pseudo random (PN) sequence to be used and detailed the properties of the sequence. Calculations showing the probability of failing to meet the required transition density were included. A computer simulation of the data stream and PN cover sequence was provided. All worst case situations were simulated and the bit transition density exceeded that required. The Preliminary Design Review and the critical Design Review are documented. The Cover Sequence Generator (CSG) Encoder/Decoder design was constructed and demonstrated. The demonstrations were successful. All HRM and HRDM units incorporate the CSG encoder or CSG decoder as appropriate.
The complete sequence of Cymbidium mosaic virus from Vanilla fragrans in Hainan, China.
He, Zhen; Jiang, Dongmei; Liu, Aiqin; Sang, Liwei; Li, Wenfeng; Li, Shifang
2011-06-01
The complete nucleotide sequence of Cymbidium mosaic virus (CymMV) isolated from vanilla in Hainan province, China was determined for the first time. It comprised 6,224 nucleotides; sequence analysis suggested that the isolate we obtained was a member of the genus Potexvirus, and its sequence shared 86.67-96.61% identities with previously reported sequences. Phylogenetic analysis suggested that CymMV from vanilla fragrans was clustered into subgroup A and the isolates in this subgroup displayed little regional difference.
Breaking Lander-Waterman’s Coverage Bound
Nashta-ali, Damoun; Motahari, Seyed Abolfazl; Hosseinkhalaj, Babak
2016-01-01
Lander-Waterman’s coverage bound establishes the total number of reads required to cover the whole genome of size G bases. In fact, their bound is a direct consequence of the well-known solution to the coupon collector’s problem which proves that for such genome, the total number of bases to be sequenced should be O(G ln G). Although the result leads to a tight bound, it is based on a tacit assumption that the set of reads are first collected through a sequencing process and then are processed through a computation process, i.e., there are two different machines: one for sequencing and one for processing. In this paper, we present a significant improvement compared to Lander-Waterman’s result and prove that by combining the sequencing and computing processes, one can re-sequence the whole genome with as low as O(G) sequenced bases in total. Our approach also dramatically reduces the required computational power for the combined process. Simulation results are performed on real genomes with different sequencing error rates. The results support our theory predicting the log G improvement on coverage bound and corresponding reduction in the total number of bases required to be sequenced. PMID:27806058
Berger, Cordula; Parson, Walther
2009-06-01
The degradation state of some biological traces recovered from the crime scene requires the amplification of very short fragments to attain a useful mitochondrial (mt)DNA sequence. We have previously introduced two mini-multiplex assays that amplify 10 overlapping control region (CR) fragments in two separate multiplex PCRs, which brought successful CR consensus sequences from even highly degraded DNA extracts. This procedure requires a total of 20 sequencing reactions per sample, which is laborious and cost intensive. For only moderately degraded samples that we encounter more frequently with typical mtDNA casework material, we developed two new multiplex assays that use a subset of the mini-amplicon primers but embrace larger fragments (midis) and require only 10 sequencing reactions to build a double-stranded CR consensus sequence. We used a preceding mtDNA quantitation step by real-time PCR with two different target fragments (143 and 283 bp) that roughly correspond to the average fragment sizes of the different multiplex approaches to estimate size-dependent mtDNA quantities and to aid the choice of the appropriate PCR multiplexes with respect to quality of the results and required costs.
Kuroda, Hiroshi; Sugiura, Masahiro
2014-12-01
The chloroplast psbB operon includes five genes encoding photosystem II and cytochrome b 6 /f complex components. The psbN gene is located on the opposite strand. PsbN is localized in the thylakoid and is present even in the dark, although its level increases upon illumination and then decreases. However, the translation mechanism of the psbN mRNA remains unclear. Using an in vitro translation system from tobacco chloroplasts and a green fluorescent protein as a reporter protein, we show that translation occurs from a tobacco primary psbN 5'-UTR of 47 nucleotides (nt). Unlike many other chloroplast 5'-UTRs, the psbN 5'-UTR has two processing sites, at -39 and -24 upstream from the initiation site. Processing at -39 enhanced the translation rate fivefold. In contrast, processing at -24 did not affect the translation rate. These observations suggest that the two distinct processing events regulate, at least in part, the level of PsbN during development. The psbN 5'-UTR has no Shine-Dalgarno (SD)-like sequence. In vitro translation assays with excess amounts of the psbN 5'-UTR or with deleted psbN 5'-UTR sequences demonstrated that protein factors are required for translation and that their binding site is an 18 nt sequence in the 5'-UTR. Mobility shift assays using 10 other chloroplast 5'-UTRs suggested that common or similar proteins are involved in translation of a set of mRNAs lacking SD-like sequences.
2013-01-01
Background The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. Results We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. Conclusions These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies. PMID:23496952
Francis, Warren R; Christianson, Lynne M; Kiko, Rainer; Powers, Meghan L; Shaner, Nathan C; Haddock, Steven H D
2013-03-12
The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
Somyoonsap, Peechapack; Kitpreechavanich, Vichein
2013-01-01
A sequence-specific nicking endonuclease from Streptomyces designated as DC13 was purified to near homogeneity. Starting with 30 grams of wet cells, the enzyme was purified by ammonium sulfate fractionation, DEAE cellulose, and phenyl-Sepharose chromatography. The purified protein had a specific activity 1000 units/mg and migrated on SDS-PAGE gel with an estimated molecular weight of 71 kDa. Determination of subunit composition by gel filtration chromatography indicated that the native enzyme is a monomer. When incubated with different DNA substrates including pBluescript II KS, pUC118, pET-15b, and pET-26b, the enzyme converted these supercoiled plasmids to a mixture of open circular and linear DNA products, with the open circular DNA as the major cleavage product. Analysis of the kinetic of DNA cleavage showed that the enzyme appeared to cleave super-coiled plasmid in two distinct steps: a rapid cleavage of super-coiled plasmid to an open circular DNA followed a much slower step to linear DNA. The DNA cleavage reaction of the enzyme required Mg2+ as a cofactor. Based on the monomeric nature of the enzyme, the kinetics of DNA cleavage exhibited by the enzyme, and cofactor requirement, it is suggested here that the purified enzyme is a sequence-specific nicking endonuclease that is similar to type IIS restriction endonuclease. PMID:25937959
Developing eThread pipeline using SAGA-pilot abstraction for large-scale structural bioinformatics.
Ragothaman, Anjani; Boddu, Sairam Chowdary; Kim, Nayong; Feinstein, Wei; Brylinski, Michal; Jha, Shantenu; Kim, Joohyun
2014-01-01
While most of computational annotation approaches are sequence-based, threading methods are becoming increasingly attractive because of predicted structural information that could uncover the underlying function. However, threading tools are generally compute-intensive and the number of protein sequences from even small genomes such as prokaryotes is large typically containing many thousands, prohibiting their application as a genome-wide structural systems biology tool. To leverage its utility, we have developed a pipeline for eThread--a meta-threading protein structure modeling tool, that can use computational resources efficiently and effectively. We employ a pilot-based approach that supports seamless data and task-level parallelism and manages large variation in workload and computational requirements. Our scalable pipeline is deployed on Amazon EC2 and can efficiently select resources based upon task requirements. We present runtime analysis to characterize computational complexity of eThread and EC2 infrastructure. Based on results, we suggest a pathway to an optimized solution with respect to metrics such as time-to-solution or cost-to-solution. Our eThread pipeline can scale to support a large number of sequences and is expected to be a viable solution for genome-scale structural bioinformatics and structure-based annotation, particularly, amenable for small genomes such as prokaryotes. The developed pipeline is easily extensible to other types of distributed cyberinfrastructure.
Developing eThread Pipeline Using SAGA-Pilot Abstraction for Large-Scale Structural Bioinformatics
Ragothaman, Anjani; Feinstein, Wei; Jha, Shantenu; Kim, Joohyun
2014-01-01
While most of computational annotation approaches are sequence-based, threading methods are becoming increasingly attractive because of predicted structural information that could uncover the underlying function. However, threading tools are generally compute-intensive and the number of protein sequences from even small genomes such as prokaryotes is large typically containing many thousands, prohibiting their application as a genome-wide structural systems biology tool. To leverage its utility, we have developed a pipeline for eThread—a meta-threading protein structure modeling tool, that can use computational resources efficiently and effectively. We employ a pilot-based approach that supports seamless data and task-level parallelism and manages large variation in workload and computational requirements. Our scalable pipeline is deployed on Amazon EC2 and can efficiently select resources based upon task requirements. We present runtime analysis to characterize computational complexity of eThread and EC2 infrastructure. Based on results, we suggest a pathway to an optimized solution with respect to metrics such as time-to-solution or cost-to-solution. Our eThread pipeline can scale to support a large number of sequences and is expected to be a viable solution for genome-scale structural bioinformatics and structure-based annotation, particularly, amenable for small genomes such as prokaryotes. The developed pipeline is easily extensible to other types of distributed cyberinfrastructure. PMID:24995285
Crystal Structure of the Nipah Virus Phosphoprotein Tetramerization Domain
Bruhn, Jessica F.; Barnett, Katherine C.; Bibby, Jaclyn; Thomas, Jens M. H.; Keegan, Ronan M.; Rigden, Daniel J.; Bornholdt, Zachary A.
2014-01-01
The Nipah virus phosphoprotein (P) is multimeric and tethers the viral polymerase to the nucleocapsid. We present the crystal structure of the multimerization domain of Nipah virus P: a long, parallel, tetrameric, coiled coil with a small, α-helical cap structure. Across the paramyxoviruses, these domains share little sequence identity yet are similar in length and structural organization, suggesting a common requirement for scaffolding or spatial organization of the functions of P in the virus life cycle. PMID:24155387
Evolutionary Dynamics of the Gametologous CTNNB1 Gene on the Z and W Chromosomes of Snakes.
Laopichienpong, Nararat; Muangmai, Narongrit; Chanhome, Lawan; Suntrarachun, Sunutcha; Twilprawat, Panupon; Peyachoknagul, Surin; Srikulnath, Kornsorn
2017-03-01
Snakes exhibit genotypic sex determination with female heterogamety (ZZ males and ZW females), and the state of sex chromosome differentiation also varies among lineages. To investigate the evolutionary history of homologous genes located in the nonrecombining region of differentiated sex chromosomes in snakes, partial sequences of the gametologous CTNNB1 gene were analyzed for 12 species belonging to henophid (Cylindrophiidae, Xenopeltidae, and Pythonidae) and caenophid snakes (Viperidae, Elapidae, and Colubridae). Nonsynonymous/synonymous substitution ratios (Ka/Ks) in coding sequences were low (Ka/Ks < 1) between CTNNB1Z and CTNNB1W, suggesting that these 2 genes may have similar functional properties. However, frequencies of intron sequence substitutions and insertion–deletions were higher in CTNNB1Z than CTNNB1W, suggesting that Z-linked sequences evolved faster than W-linked sequences. Molecular phylogeny based on both intron and exon sequences showed the presence of 2 major clades: 1) Z-linked sequences of Caenophidia and 2) W-linked sequences of Caenophidia clustered with Z-linked sequences of Henophidia, which suggests that the sequence divergence between CTNNB1Z and CTNNB1W in Caenophidia may have occurred by the cessation of recombination after the split from Henophidia.
Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner
2011-04-20
Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.
Ivanov, E. L.; Sugawara, N.; Fishman-Lobell, J.; Haber, J. E.
1996-01-01
HO endonuclease-induced double-strand breaks (DSBs) within a direct duplication of Escherichia coli lacZ genes are repaired either by gene conversion or by single-strand annealing (SSA), with >80% being SSA. Previously it was demonstrated that the RAD52 gene is required for DSB-induced SSA. In the present study, the effects of other genes belonging to the RAD52 epistasis group were analyzed. We show that RAD51, RAD54, RAD55, and RAD57 genes are not required for SSA irrespective of whether recombination occurred in plasmid or chromosomal DNA. In both plasmid and chromosomal constructs with homologous sequences in direct orientation, the proportion of SSA events over gene conversion was significantly elevated in the mutant strains. However, gene conversion was not affected when the two lacZ sequences were in inverted orientation. These results suggest that there is a competition between SSA and gene conversion processes that favors SSA in the absence of RAD51, RAD54, RAD55 and RAD57. Mutations in RAD50 and XRS2 genes do not prevent the completion, but markedly retard the kinetics, of DSB repair by both mechanisms in the lacZ direct repeat plasmid, a result resembling the effects of these genes during mating-type (MAT) switching. PMID:8849880
Huguet-Tapia, Jose C.; Lefebure, Tristan; Badger, Jonathan H.; Guan, Dongli; Stanhope, Michael J.
2016-01-01
Streptomyces spp. are highly differentiated actinomycetes with large, linear chromosomes that encode an arsenal of biologically active molecules and catabolic enzymes. Members of this genus are well equipped for life in nutrient-limited environments and are common soil saprophytes. Out of the hundreds of species in the genus Streptomyces, a small group has evolved the ability to infect plants. The recent availability of Streptomyces genome sequences, including four genomes of pathogenic species, provided an opportunity to characterize the gene content specific to these pathogens and to study phylogenetic relationships among them. Genome sequencing, comparative genomics, and phylogenetic analysis enabled us to discriminate pathogenic from saprophytic Streptomyces strains; moreover, we calculated that the pathogen-specific genome contains 4,662 orthologs. Phylogenetic reconstruction suggested that Streptomyces scabies and S. ipomoeae share an ancestor but that their biosynthetic clusters encoding the required virulence factor thaxtomin have diverged. In contrast, S. turgidiscabies and S. acidiscabies, two relatively unrelated pathogens, possess highly similar thaxtomin biosynthesis clusters, which suggests that the acquisition of these genes was through lateral gene transfer. PMID:26826232
Li, Wenjun; Ortiz, Gabriel; Fournier, Pierre-Edouard; Gimenez, Gregory; Reed, David L; Pittendrigh, Barry; Raoult, Didier
2010-03-23
Genetic analyses of human lice have shown that the current taxonomic classification of head lice (Pediculus humanus capitis) and body lice (Pediculus humanus humanus) does not reflect their phylogenetic organization. Three phylotypes of head lice A, B and C exist but body lice have been observed only in phylotype A. Head and body lice have different behaviours and only the latter have been involved in outbreaks of infectious diseases including epidemic typhus, trench fever and louse borne recurrent fever. Recent studies suggest that body lice arose several times from head louse populations. By introducing a new genotyping technique, sequencing variable intergenic spacers which were selected from louse genomic sequence, we were able to evaluate the genotypic distribution of 207 human lice. Sequence variation of two intergenic spacers, S2 and S5, discriminated the 207 lice into 148 genotypes and sequence variation of another two intergenic spacers, PM1 and PM2, discriminated 174 lice into 77 genotypes. Concatenation of the four intergenic spacers discriminated a panel of 97 lice into 96 genotypes. These intergenic spacer sequence types were relatively specific geographically, and enabled us to identify two clusters in France, one cluster in Central Africa (where a large body louse outbreak has been observed) and one cluster in Russia. Interestingly, head and body lice were not genetically differentiated. We propose a hypothesis for the emergence of body lice, and suggest that humans with both low hygiene and head louse infestations provide an opportunity for head louse variants, able to ingest a larger blood meal (a required characteristic of body lice), to colonize clothing. If this hypothesis is ultimately supported, it would help to explain why poor human hygiene often coincides with outbreaks of body lice. Additionally, if head lice act as a reservoir for body lice, and that any social degradation in human populations may allow the formation of new populations of body lice, then head louse populations are potentially a greater threat to humans than previously assumed.
Frange, Pierre; Meyer, Laurence; Jung, Matthieu; Goujard, Cecile; Zucman, David; Abel, Sylvie; Hochedez, Patrick; Gousset, Marine; Gascuel, Olivier; Rouzioux, Christine; Chaix, Marie-Laure
2013-01-01
Objective Characterization of HIV-1 sequences in newly infected individuals is important for elucidating the mechanisms of viral sexual transmission. We report the identification of transmitted/founder viruses in eight pairs of HIV-1 sexually-infected patients enrolled at the time of primary infection (“recipients”) and their transmitting partners (“donors”). Methods Using a single genome-amplification approach, we compared quasispecies in donors and recipients on the basis of 316 and 376 C2V5 env sequences amplified from plasma viral RNA and PBMC-associated DNA, respectively. Results Both DNA and RNA sequences indicated very homogeneous viral populations in all recipients, suggesting transmission of a single variant, even in cases of recent sexually transmitted infections (STIs) in donors (n = 2) or recipients (n = 3). In all pairs, the transmitted/founder virus was derived from an infrequent variant population within the blood of the donor. The donor variant sequences most closely related to the recipient sequences were found in plasma samples in 3/8 cases and/or in PBMC samples in 6/8 cases. Although donors were exclusively (n = 4) or predominantly (n = 4) infected by CCR5-tropic (R5) strains, two recipients were infected with highly homogeneous CXCR4/dual-mixed-tropic (X4/DM) viral populations, identified in both DNA and RNA. The proportion of X4/DM quasispecies in donors was higher in cases of X4/DM than R5 HIV transmission (16.7–22.0% versus 0–2.6%), suggesting that X4/DM transmission may be associated with a threshold population of X4/DM circulating quasispecies in donors. Conclusions These suggest that a severe genetic bottleneck occurs during subtype B HIV-1 heterosexual and homosexual transmission. Sexually-transmitted/founder virus cannot be directly predicted by analysis of the donor’s quasispecies in plasma and/or PBMC. Additional studies are required to fully understand the traits that confer the capacity to transmit and establish infection, and determine the role of concomitant STIs in mitigating the genetic bottleneck in mucosal HIV transmission. PMID:23874894
Genotyping of Human Lice Suggests Multiple Emergences of Body Lice from Local Head Louse Populations
Li, Wenjun; Ortiz, Gabriel; Fournier, Pierre-Edouard; Gimenez, Gregory; Reed, David L.; Pittendrigh, Barry; Raoult, Didier
2010-01-01
Background Genetic analyses of human lice have shown that the current taxonomic classification of head lice (Pediculus humanus capitis) and body lice (Pediculus humanus humanus) does not reflect their phylogenetic organization. Three phylotypes of head lice A, B and C exist but body lice have been observed only in phylotype A. Head and body lice have different behaviours and only the latter have been involved in outbreaks of infectious diseases including epidemic typhus, trench fever and louse borne recurrent fever. Recent studies suggest that body lice arose several times from head louse populations. Methods and Findings By introducing a new genotyping technique, sequencing variable intergenic spacers which were selected from louse genomic sequence, we were able to evaluate the genotypic distribution of 207 human lice. Sequence variation of two intergenic spacers, S2 and S5, discriminated the 207 lice into 148 genotypes and sequence variation of another two intergenic spacers, PM1 and PM2, discriminated 174 lice into 77 genotypes. Concatenation of the four intergenic spacers discriminated a panel of 97 lice into 96 genotypes. These intergenic spacer sequence types were relatively specific geographically, and enabled us to identify two clusters in France, one cluster in Central Africa (where a large body louse outbreak has been observed) and one cluster in Russia. Interestingly, head and body lice were not genetically differentiated. Conclusions We propose a hypothesis for the emergence of body lice, and suggest that humans with both low hygiene and head louse infestations provide an opportunity for head louse variants, able to ingest a larger blood meal (a required characteristic of body lice), to colonize clothing. If this hypothesis is ultimately supported, it would help to explain why poor human hygiene often coincides with outbreaks of body lice. Additionally, if head lice act as a reservoir for body lice, and that any social degradation in human populations may allow the formation of new populations of body lice, then head louse populations are potentially a greater threat to humans than previously assumed. PMID:20351779
ERIC Educational Resources Information Center
Arafeh, Sousan
2016-01-01
Best practice in curriculum development and implementation requires that discipline-based standards or requirements embody both curricular and programme scopes and sequences. Ensuring these are present and aligned in course/programme content, activities and assessments to support student success requires formalised and systematised review and…
Cova, Marta; López-Gutiérrez, Borja; Artigas-Jerónimo, Sara; González-Díaz, Aida; Bandini, Giulia; Maere, Steven; Carretero-Paulet, Lorenzo; Izquierdo, Luis
2018-03-05
Apicomplexa form a phylum of obligate parasitic protozoa of great clinical and veterinary importance. These parasites synthesize glycoconjugates for their survival and infectivity, but the enzymatic steps required to generate the glycosylation precursors are not completely characterized. In particular, glucosamine-phosphate N-acetyltransferase (GNA1) activity, needed to produce the essential UDP-N-acetylglucosamine (UDP-GlcNAc) donor, has not been identified in any Apicomplexa. We scanned the genomes of Plasmodium falciparum and representatives from six additional main lineages of the phylum for proteins containing the Gcn5-related N-acetyltransferase (GNAT) domain. One family of GNAT-domain containing proteins, composed by a P. falciparum sequence and its six apicomplexan orthologs, rescued the growth of a yeast temperature-sensitive GNA1 mutant. Heterologous expression and in vitro assays confirmed the GNA1 enzymatic activity in all lineages. Sequence, phylogenetic and synteny analyses suggest an independent origin of the Apicomplexa-specific GNA1 family, parallel to the evolution of a different GNA1 family in other eukaryotes. The inability to disrupt an otherwise modifiable gene target suggests that the enzyme is essential for P. falciparum growth. The relevance of UDP-GlcNAc for parasite viability, together with the independent evolution and unique sequence features of Apicomplexa GNA1, highlights the potential of this enzyme as a selective therapeutic target against apicomplexans.
Sabooh, M Fazli; Iqbal, Nadeem; Khan, Mukhtaj; Khan, Muslim; Maqbool, H F
2018-05-01
This study examines accurate and efficient computational method for identification of 5-methylcytosine sites in RNA modification. The occurrence of 5-methylcytosine (m 5 C) plays a vital role in a number of biological processes. For better comprehension of the biological functions and mechanism it is necessary to recognize m 5 C sites in RNA precisely. The laboratory techniques and procedures are available to identify m 5 C sites in RNA, but these procedures require a lot of time and resources. This study develops a new computational method for extracting the features of RNA sequence. In this method, first the RNA sequence is encoded via composite feature vector, then, for the selection of discriminate features, the minimum-redundancy-maximum-relevance algorithm was used. Secondly, the classification method used has been based on a support vector machine by using jackknife cross validation test. The suggested method efficiently identifies m 5 C sites from non- m 5 C sites and the outcome of the suggested algorithm is 93.33% with sensitivity of 90.0 and specificity of 96.66 on bench mark datasets. The result exhibits that proposed algorithm shown significant identification performance compared to the existing computational techniques. This study extends the knowledge about the occurrence sites of RNA modification which paves the way for better comprehension of the biological uses and mechanism. Copyright © 2018 Elsevier Ltd. All rights reserved.
Foster, Patricia L.; Lee, Heewook; Popodi, Ellen; Townes, Jesse P.; Tang, Haixu
2015-01-01
A complete understanding of evolutionary processes requires that factors determining spontaneous mutation rates and spectra be identified and characterized. Using mutation accumulation followed by whole-genome sequencing, we found that the mutation rates of three widely diverged commensal Escherichia coli strains differ only by about 50%, suggesting that a rate of 1–2 × 10−3 mutations per generation per genome is common for this bacterium. Four major forces are postulated to contribute to spontaneous mutations: intrinsic DNA polymerase errors, endogenously induced DNA damage, DNA damage caused by exogenous agents, and the activities of error-prone polymerases. To determine the relative importance of these factors, we studied 11 strains, each defective for a major DNA repair pathway. The striking result was that only loss of the ability to prevent or repair oxidative DNA damage significantly impacted mutation rates or spectra. These results suggest that, with the exception of oxidative damage, endogenously induced DNA damage does not perturb the overall accuracy of DNA replication in normally growing cells and that repair pathways may exist primarily to defend against exogenously induced DNA damage. The thousands of mutations caused by oxidative damage recovered across the entire genome revealed strong local-sequence biases of these mutations. Specifically, we found that the identity of the 3′ base can affect the mutability of a purine by oxidative damage by as much as eightfold. PMID:26460006
Tavares, D; Tully, K; Dobner, P R
1999-10-15
The promoter region of the mouse high affinity neurotensin receptor (Ntr-1) gene was characterized, and sequences required for expression in neuroblastoma cell lines that express high affinity NT-binding sites were characterized. Me(2)SO-induced neuronal differentiation of N1E-115 neuroblastoma cells increased both the expression of the endogenous Ntr-1 gene and reporter genes driven by NTR-1 promoter sequences by 3-4-fold. Deletion analysis revealed that an 83-base pair promoter region containing the transcriptional start site is required for Me(2)SO activation. Detailed mutational analysis of this region revealed that a CACCC box and the central region of a large GC-rich palindrome are the crucial cis-regulatory elements required for Me(2)SO induction. The CACCC box is bound by at least one factor that is induced upon Me(2)SO treatment of N1E-115 cells. The Me(2)SO effect was found to be both selective and cell type-restricted. Basal expression in the neuroblastoma cell lines required a distinct set of sequences, including an Sp1-like sequence, and a sequence resembling an NGFI-A-binding site; however, a more distal 5' sequence was found to repress basal activity in N1E-115 cells. These results provide evidence that Ntr-1 gene regulation involves both positive and negative regulatory elements located in the 5'-flanking region and that Ntr-1 gene activation involves the coordinate activation or induction of several factors, including a CACCC box binding complex.
Changes in spinal reflex excitability associated with motor sequence learning.
Lungu, Ovidiu; Frigon, Alain; Piché, Mathieu; Rainville, Pierre; Rossignol, Serge; Doyon, Julien
2010-05-01
There is ample evidence that motor sequence learning is mediated by changes in brain activity. Yet the question of whether this form of learning elicits changes detectable at the spinal cord level has not been addressed. To date, studies in humans have revealed that spinal reflex activity may be altered during the acquisition of various motor skills, but a link between motor sequence learning and changes in spinal excitability has not been demonstrated. To address this issue, we studied the modulation of H-reflex amplitude evoked in the flexor carpi radialis muscle of 14 healthy individuals between blocks of movements that involved the implicit acquisition of a sequence versus other movements that did not require learning. Each participant performed the task in three conditions: "sequence"-externally triggered, repeating and sequential movements, "random"-similar movements, but performed in an arbitrary order, and "simple"- involving alternating movements in a left-right or up-down direction only. When controlling for background muscular activity, H-reflex amplitude was significantly more reduced in the sequence (43.8 +/- 1.47%. mean +/- SE) compared with the random (38.2 +/- 1.60%) and simple (31.5 +/- 1.82%) conditions, while the M-response was not different across conditions. Furthermore, H-reflex changes were observed from the beginning of the learning process up to when subjects reached asymptotic performance on the motor task. Changes also persisted for >60 s after motor activity ceased. Such findings suggest that the excitability in some spinal reflex circuits is altered during the implicit learning process of a new motor sequence.
The transition to increased automaticity during finger sequence learning in adult males who stutter.
Smits-Bandstra, Sarah; De Nil, Luc; Rochon, Elizabeth
2006-01-01
The present study compared the automaticity levels of persons who stutter (PWS) and persons who do not stutter (PNS) on a practiced finger sequencing task under dual task conditions. Automaticity was defined as the amount of attention required for task performance. Twelve PWS and 12 control subjects practiced finger tapping sequences under single and then dual task conditions. Control subjects performed the sequencing task significantly faster and less variably under single versus dual task conditions while PWS' performance was consistently slow and variable (comparable to the dual task performance of control subjects) under both conditions. Control subjects were significantly more accurate on a colour recognition distracter task than PWS under dual task conditions. These results suggested that control subjects transitioned to quick, accurate and increasingly automatic performance on the sequencing task after practice, while PWS did not. Because most stuttering treatment programs for adults include practice and automatization of new motor speech skills, findings of this finger sequencing study and future studies of speech sequence learning may have important implications for how to maximize stuttering treatment effectiveness. As a result of this activity, the participant will be able to: (1) Define automaticity and explain the importance of dual task paradigms to investigate automaticity; (2) Relate the proposed relationship between motor learning and automaticity as stated by the authors; (3) Summarize the reviewed literature concerning the performance of PWS on dual tasks; and (4) Explain why the ability to transition to automaticity during motor learning may have important clinical implications for stuttering treatment effectiveness.
Crack azimuths on Europa: The G1 lineament sequence revisited
Sarid, A.R.; Greenberg, R.; Hoppa, G.V.; Brown, D.M.; Geissler, P.
2005-01-01
The tectonic sequence in the anti-jovian area covered by regional mapping images from Galileo's orbit E15 is determined from a study of cross-cutting relationships among lineament features. The sequence is used to test earlier results from orbit G1, based on lower resolution images, which appeared to display a progressive change in azimuthal orientation over about 90?? in a clockwise sense. Such a progression is consistent with expected stress variations that would accompany plausible non-synchronous rotation. The more recent data provide a more complete record than the G1 data did. We find that to fit the sequence into a continual clockwise change of orientation would require at least 1000?? (> 5 cycles) of azimuthal rotation. If due to non-synchronous rotation of Europa, this result implies that we are seeing back further into the tectonic record than the G1 results had suggested. The three sets of orientations found by Geissler et al. now appear to have been spaced out over several cycles, not during a fraction of one cycle. While our more complete sequence of lineament formation is consistent with non-synchronous rotation, a statistical test shows that it cannot be construed as independent evidence. Other lines of evidence do support non-synchronous rotation, but azimuths of crack sequences do not show it, probably because only a couple of cracks form in a given region in any given non-synchronous rotation period. ?? 2004 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hiraiwa, Akikazu; Yamanaka, Katsuo; Kwok, W.W.
Although HLA genes have been shown to be associated with certain diseases, the basis for this association is unknown. Recent studies, however, have documented patterns of nucleotide sequence variation among some HLA genes associated with a particular disease. For rheumatoid arthritis, HLA genes in most patients have a shared nucleotide sequence encoding a key structural element of an HLA class II polypeptide; this sequence element is critical for the interaction of the HLA molecule with antigenic peptides and with responding T cells, suggestive of a direct role for this sequence element in disease susceptibility. The authors describe the serological andmore » cellular immunologic characteristics encoded by this rheumatoid arthritis-associated sequence element. Site-directed mutagenesis of the DRB1 gene was used to define amino acids critical for antibody and T-cell recognition of this structural element, focusing on residues that distinguish the rheumatoid arthritis-associated alleles Dw4 and Dw14 from a closely related allele, Dw10, not associated with disease. Both the gain and loss of rheumatoid arthritis-associated epitopes were highly dependent on three residues within a discrete domain of the HLA-DR molecule. Recognition was most strongly influenced by the following amino acids (in order): 70 > 71 > 67. Some alloreactive T-cell clones were also influenced by amino acid variation in portions of the DR molecule lying outside the shared sequence element.« less
Can you sequence ecology? Metagenomics of adaptive diversification.
Marx, Christopher J
2013-01-01
Few areas of science have benefited more from the expansion in sequencing capability than the study of microbial communities. Can sequence data, besides providing hypotheses of the functions the members possess, detect the evolutionary and ecological processes that are occurring? For example, can we determine if a species is adapting to one niche, or if it is diversifying into multiple specialists that inhabit distinct niches? Fortunately, adaptation of populations in the laboratory can serve as a model to test our ability to make such inferences about evolution and ecology from sequencing. Even adaptation to a single niche can give rise to complex temporal dynamics due to the transient presence of multiple competing lineages. If there are multiple niches, this complexity is augmented by segmentation of the population into multiple specialists that can each continue to evolve within their own niche. For a known example of parallel diversification that occurred in the laboratory, sequencing data gave surprisingly few obvious, unambiguous signs of the ecological complexity present. Whereas experimental systems are open to direct experimentation to test hypotheses of selection or ecological interaction, the difficulty in "seeing ecology" from sequencing for even such a simple system suggests translation to communities like the human microbiome will be quite challenging. This will require both improved empirical methods to enhance the depth and time resolution for the relevant polymorphisms and novel statistical approaches to rigorously examine time-series data for signs of various evolutionary and ecological phenomena within and between species.
Regulatory link between DNA methylation and active demethylation in Arabidopsis
Lei, Mingguang; Zhang, Huiming; Julian, Russell; Tang, Kai; Xie, Shaojun; Zhu, Jian-Kang
2015-01-01
De novo DNA methylation through the RNA-directed DNA methylation (RdDM) pathway and active DNA demethylation play important roles in controlling genome-wide DNA methylation patterns in plants. Little is known about how cells manage the balance between DNA methylation and active demethylation activities. Here, we report the identification of a unique RdDM target sequence, where DNA methylation is required for maintaining proper active DNA demethylation of the Arabidopsis genome. In a genetic screen for cellular antisilencing factors, we isolated several REPRESSOR OF SILENCING 1 (ros1) mutant alleles, as well as many RdDM mutants, which showed drastically reduced ROS1 gene expression and, consequently, transcriptional silencing of two reporter genes. A helitron transposon element (TE) in the ROS1 gene promoter negatively controls ROS1 expression, whereas DNA methylation of an RdDM target sequence between ROS1 5′ UTR and the promoter TE region antagonizes this helitron TE in regulating ROS1 expression. This RdDM target sequence is also targeted by ROS1, and defective DNA demethylation in loss-of-function ros1 mutant alleles causes DNA hypermethylation of this sequence and concomitantly causes increased ROS1 expression. Our results suggest that this sequence in the ROS1 promoter region serves as a DNA methylation monitoring sequence (MEMS) that senses DNA methylation and active DNA demethylation activities. Therefore, the ROS1 promoter functions like a thermostat (i.e., methylstat) to sense DNA methylation levels and regulates DNA methylation by controlling ROS1 expression. PMID:25733903
Evolution of Hsp70 Gene Expression: A Role for Changes in AT-Richness within Promoters
Ma, Ronghui; Zhang, Bo; Kang, Le
2011-01-01
In disparate organisms adaptation to thermal stress has been linked to changes in the expression of genes encoding heat-shock proteins (Hsp). The underlying genetics, however, remain elusive. We show here that two AT-rich sequence elements in the promoter region of the hsp70 gene of the fly Liriomyza sativae that are absent in the congeneric species, Liriomyza huidobrensis, have marked cis-regulatory consequences. We studied the cis-regulatory consequences of these elements (called ATRS1 and ATRS2) by measuring the constitutive and heat-shock-induced luciferase luminescence that they drive in cells transfected with constructs carrying them modified, deleted, or intact, in the hsp70 promoter fused to the luciferase gene. The elements affected expression level markedly and in different ways: Deleting ATRS1 augmented both the constitutive and the heat-shock-induced luminescence, suggesting that this element represses transcription. Interestingly, replacing the element with random sequences of the same length and A+T content delivered the wild-type luminescence pattern, proving that the element's high A+T content is crucial for its effects. Deleting ATRS2 decreased luminescence dramatically and almost abolished heat-shock inducibility and so did replacing the element with random sequences matching the element's length and A+T content, suggesting that ATRS2's effects on transcription and heat-shock inducibility involve a common mechanism requiring at least in part the element's specific primary structure. Finally, constitutive and heat-shock luminescence were reduced strongly when two putative binding sites for the Zeste transcription factor identified within ATRS2 were altered through site-directed mutagenesis, and the heat-shock-induced luminescence increased when Zeste was over-expressed, indicating that Zeste participates in the effects mapped to ATRS2 at least in part. AT-rich sequences are common in promoters and our results suggest that they should play important roles in regulatory evolution since they can affect expression markedly and constrain promoter DNA in at least two different ways. PMID:21655251
SL1 revisited: functional analysis of the structure and conformation of HIV-1 genome RNA.
Sakuragi, Sayuri; Yokoyama, Masaru; Shioda, Tatsuo; Sato, Hironori; Sakuragi, Jun-Ichi
2016-11-11
The dimer initiation site/dimer linkage sequence (DIS/DLS) region of HIV is located on the 5' end of the viral genome and suggested to form complex secondary/tertiary structures. Within this structure, stem-loop 1 (SL1) is believed to be most important and an essential key to dimerization, since the sequence and predicted secondary structure of SL1 are highly stable and conserved among various virus subtypes. In particular, a six-base palindromic sequence is always present at the hairpin loop of SL1 and the formation of kissing-loop structure at this position between the two strands of genomic RNA is suggested to trigger dimerization. Although the higher-order structure model of SL1 is well accepted and perhaps even undoubted lately, there could be stillroom for consideration to depict the functional SL1 structure while in vivo (in virion or cell). In this study, we performed several analyses to identify the nucleotides and/or basepairing within SL1 which are necessary for HIV-1 genome dimerization, encapsidation, recombination and infectivity. We unexpectedly found that some nucleotides that are believed to contribute the formation of the stem do not impact dimerization or infectivity. On the other hand, we found that one G-C basepair involved in stem formation may serve as an alternative dimer interactive site. We also report on our further investigation of the roles of the palindromic sequences on viral replication. Collectively, we aim to assemble a more-comprehensive functional map of SL1 on the HIV-1 viral life cycle. We discovered several possibilities for a novel structure of SL1 in HIV-1 DLS. The newly proposed structure model suggested that the hairpin loop of SL1 appeared larger, and genome dimerization process might consist of more complicated mechanism than previously understood. Further investigations would be still required to fully understand the genome packaging and dimerization of HIV.
Distribution of hepatitis B virus subgenotype F2a in São Paulo, Brazil.
Alvarado-Mora, Mónica V; Botelho-Lima, Livia S; Santana, Rubia A; Sitnik, Roberta; Ferreira, Paulo Abrão; do Amaral Mello, Francisco; Mangueira, Cristovão P; Carrilho, Flair J; Rebello Pinho, João R
2013-10-21
HBV genotype F is primarily found in indigenous populations from South America and is classified in four subgenotypes (F1 to F4). Subgenotype F2a is the most common in Brazil among genotype F cases. The aim of this study was to characterize HBV genotype F2a circulating in 16 patients from São Paulo, Brazil. Samples were collected between 2006 and 2012 and sent to Hospital Israelita Albert Einstein. A fragment of 1306 bp partially comprising HBsAg and DNA polymerase coding regions was amplified and sequenced. Viral sequences were genotyped by phylogenetic analysis using reference sequences from GenBank (n=198), including 80 classified as subgenotype F2a. Bayesian Markov chain Monte Carlo simulation implemented in BEAST v.1.5.4 was applied to obtain the best possible estimates using the model of nucleotide substitutions GTR+G+I. It were identified three groups of sequences of subgenotype F2a: 1) 10 sequences from São Paulo state; 2) 3 sequences from Rio de Janeiro and one from São Paulo states; 3) 8 sequences from the West Amazon Basin. These results showing for the first time the distribution of F2a subgenotype in Brazil. The spreading and the dynamic of subgenotype F2a in Brazil requires the study of a higher number of samples from different regions as it is unfold in almost all Brazilian populations studied so far. We cannot infer with certainty the origin of these different groups due to the lack of available sequences. Nevertheless, our data suggest that the common origin of these groups probably occurred a long time ago.
NASA Astrophysics Data System (ADS)
Allen, Prudence
2003-04-01
Young children typically perform more poorly on psychoacoustic tasks than do adults, with large individual differences. When performance is averaged across children within age groups, the data suggest a gradual change in performance with increasing age. However, an examination of individual data suggests that the performance matures more rapidly, although at different times for different children. The mechanisms of development responsible for these changes are likely very complex, involving both sensory and cognitive processes. This paper will discuss some previously suggested mechanisms including attention and cue weighting, as well as possibilities suggested from more recent studies in which learning effects were examined. In one task, a simple frequency discrimination was required, while in another the listener was required to extract regularities in complex sequences of sounds that varied from trial to trial. Results suggested that the ability to select and consistently employ an effective listening strategy was especially important in the performance of the more complex task, while simple stimulus exposure and motivation contributed to the simpler task. These factors are important for understanding the perceptual development and for the subsequent application of psychoacoustic findings to clinical populations. [Work supported by the NSERC and the Canadian Language and Literacy Research Network.
Oxley, P R; Oldroyd, B P
2009-04-01
Establishment of a closed population honey bee, Apis mellifera L. (Hymenoptera: Apidae), breeding program based on 'black' strains has been proposed for eastern Australia. Long-term success of such a program requires a high level of genetic variance. To determine the likely extent of genetic variation available, 50 colonies from 11 different commercial apiaries were sequenced in the mitochondrial cytochrome oxidase I and II intergenic region. Five distinct and novel mitotypes were identified. No colonies were found with the A. mellifera mellifera mitotype, which is often associated with undesirable feral strains. One group of mitotypes was consistent with a caucasica origin, two with carnica, and two with ligustica. The results suggest that there is sufficient genetic diversity to support a breeding program provided all these five sources were pooled.
Transposable element islands facilitate adaptation to novel environments in an invasive species
Schrader, Lukas; Kim, Jay W.; Ence, Daniel; Zimin, Aleksey; Klein, Antonia; Wyschetzki, Katharina; Weichselgartner, Tobias; Kemena, Carsten; Stökl, Johannes; Schultner, Eva; Wurm, Yannick; Smith, Christopher D.; Yandell, Mark; Heinze, Jürgen; Gadau, Jürgen; Oettler, Jan
2014-01-01
Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species. PMID:25510865
Invasion and Persistence of a Selfish Gene in the Cnidaria
Goddard, Matthew R.; Leigh, Jessica; Roger, Andrew J; Pemberton, Andrew J
2006-01-01
Background Homing endonuclease genes (HEGs) are superfluous, but are capable of invading populations that mix alleles by biasing their inheritance patterns through gene conversion. One model suggests that their long-term persistence is achieved through recurrent invasion. This circumvents evolutionary degeneration, but requires reasonable rates of transfer between species to maintain purifying selection. Although HEGs are found in a variety of microbes, we found the previous discovery of this type of selfish genetic element in the mitochondria of a sea anemone surprising. Methods/Principal Findings We surveyed 29 species of Cnidaria for the presence of the COXI HEG. Statistical analyses provided evidence for HEG invasion. We also found that 96 individuals of Metridium senile, from five different locations in the UK, had identical HEG sequences. This lack of sequence divergence illustrates the stable nature of Anthozoan mitochondria. Our data suggests this HEG conforms to the recurrent invasion model of evolution. Conclusions Ordinarily such low rates of HEG transfer would likely be insufficient to enable major invasion. However, the slow rate of Anthozoan mitochondrial change lengthens greatly the time to HEG degeneration: this significantly extends the periodicity of the HEG life-cycle. We suggest that a combination of very low substitution rates and rare transfers facilitated metazoan HEG invasion. PMID:17183657
Improving molecular diagnosis in epilepsy by a dedicated high-throughput sequencing platform.
Della Mina, Erika; Ciccone, Roberto; Brustia, Francesca; Bayindir, Baran; Limongelli, Ivan; Vetro, Annalisa; Iascone, Maria; Pezzoli, Laura; Bellazzi, Riccardo; Perotti, Gianfranco; De Giorgis, Valentina; Lunghi, Simona; Coppola, Giangennaro; Orcesi, Simona; Merli, Pietro; Savasta, Salvatore; Veggiotti, Pierangelo; Zuffardi, Orsetta
2015-03-01
We analyzed by next-generation sequencing (NGS) 67 epilepsy genes in 19 patients with different types of either isolated or syndromic epileptic disorders and in 15 controls to investigate whether a quick and cheap molecular diagnosis could be provided. The average number of nonsynonymous and splice site mutations per subject was similar in the two cohorts indicating that, even with relatively small targeted platforms, finding the disease gene is not an univocal process. Our diagnostic yield was 47% with nine cases in which we identified a very likely causative mutation. In most of them no interpretation would have been possible in absence of detailed phenotype and familial information. Seven out of 19 patients had a phenotype suggesting the involvement of a specific gene. Disease-causing mutations were found in six of these cases. Among the remaining patients, we could find a probably causative mutation only in three. None of the genes affected in the latter cases had been suspected a priori. Our protocol requires 8-10 weeks including the investigation of the parents with a cost per patient comparable to sequencing of 1-2 medium-to-large-sized genes by conventional techniques. The platform we used, although providing much less information than whole-exome or whole-genome sequencing, has the advantage that can also be run on 'benchtop' sequencers combining rapid turnaround times with higher manageability.
Detecting Coevolution in and among Protein Domains
Yeang, Chen-Hsiang; Haussler, David
2007-01-01
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Henle, E S; Han, Z; Tang, N; Rai, P; Luo, Y; Linn, S
1999-01-08
Preferential cleavage sites have been determined for Fe2+/H2O2-mediated oxidations of DNA. In 50 mM H2O2, preferential cleavages occurred at the nucleoside 5' to each of the dG moieties in the sequence RGGG, a sequence found in a majority of telomere repeats. Within a plasmid containing a (TTAGGG)81 human telomere insert, 7-fold more strand breakage occurred in the restriction fragment with the insert than in a similar-sized control fragment. This result implies that telomeric DNA could protect coding DNA from oxidative damage and might also link oxidative damage and iron load to telomere shortening and aging. In micromolar H2O2, preferential cleavage occurred at the thymidine within the sequence RTGR, a sequence frequently found to be required in promoters for normal responses of many procaryotic and eucaryotic genes to iron or oxygen stress. Computer modeling of the interaction of Fe2+ with RTGR in B-DNA suggests that due to steric hindrance with the thymine methyl, Fe2+ associates in a specific manner with the thymine flipped out from the base stack so as to allow an octahedrally-oriented coordination of the Fe2+ with the three purine N7 residues. Fe2+-dependent changes in NMR spectra of duplex oligonucleotides containing ATGA versus those containing AUGA or A5mCGA were consistent with this model.
Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi
2016-01-01
Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870
The histidine permease gene (HIP1) of Saccharomyces cerevisiae.
Tanaka, J; Fink, G R
1985-01-01
The histidine-specific permease gene (HIP1) of Saccharomyces cerevisiae has been mapped, cloned, and sequenced. The HIP1 gene maps to the right arm of chromosome VII, approx. 11 cM distal to the ADE3 gene. The gene was isolated as an 8.6-kb BamHI-Sau3A fragment by complementation of the histidine-specific permease deficiency in recipient yeast cells. We sequenced a 2.4-kb subfragment of this BamHI-Sau3A fragment containing the HIP1 gene and identified a 1596-bp open reading frame (ORF). We confirmed the assignment of the 1596-bp ORF as the HIP1 coding sequence by sequencing a hip1 nonsense mutation. Analysis of the amino acid (aa) sequence of the HIP1 gene reveals several hydrophobic stretches, but shows no obvious N-terminal signal peptide. We have constructed a deletion of the HIP1 gene in vitro and replaced the wild-type copy of the gene with this deletion. The hip1 deletion mutant can grow when it is supplemented with 30 mM histidine, 50 times the amount required for the growth of HIP1 cells. Revertants of this deletion mutant able to grow on a normal level of histidine arise by mutation in unlinked genes. Both these observations suggest that there are additional, low-affinity pathways for histidine uptake.
Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.
Grindley, N D; Joyce, C M
1980-01-01
The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245
Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P
1984-01-01
We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950
Discovery of Nuclear-Encoded Genes for the Neurotoxin Saxitoxin in Dinoflagellates
Stüken, Anke; Orr, Russell J. S.; Kellmann, Ralf; Murray, Shauna A.; Neilan, Brett A.; Jakobsen, Kjetill S.
2011-01-01
Saxitoxin is a potent neurotoxin that occurs in aquatic environments worldwide. Ingestion of vector species can lead to paralytic shellfish poisoning, a severe human illness that may lead to paralysis and death. In freshwaters, the toxin is produced by prokaryotic cyanobacteria; in marine waters, it is associated with eukaryotic dinoflagellates. However, several studies suggest that saxitoxin is not produced by dinoflagellates themselves, but by co-cultured bacteria. Here, we show that genes required for saxitoxin synthesis are encoded in the nuclear genomes of dinoflagellates. We sequenced >1.2×106 mRNA transcripts from the two saxitoxin-producing dinoflagellate strains Alexandrium fundyense CCMP1719 and A. minutum CCMP113 using high-throughput sequencing technology. In addition, we used in silico transcriptome analyses, RACE, qPCR and conventional PCR coupled with Sanger sequencing. These approaches successfully identified genes required for saxitoxin-synthesis in the two transcriptomes. We focused on sxtA, the unique starting gene of saxitoxin synthesis, and show that the dinoflagellate transcripts of sxtA have the same domain structure as the cyanobacterial sxtA genes. But, in contrast to the bacterial homologs, the dinoflagellate transcripts are monocistronic, have a higher GC content, occur in multiple copies, contain typical dinoflagellate spliced-leader sequences and eukaryotic polyA-tails. Further, we investigated 28 saxitoxin-producing and non-producing dinoflagellate strains from six different genera for the presence of genomic sxtA homologs. Our results show very good agreement between the presence of sxtA and saxitoxin-synthesis, except in three strains of A. tamarense, for which we amplified sxtA, but did not detect the toxin. Our work opens for possibilities to develop molecular tools to detect saxitoxin-producing dinoflagellates in the environment. PMID:21625593
Discovery of nuclear-encoded genes for the neurotoxin saxitoxin in dinoflagellates.
Stüken, Anke; Orr, Russell J S; Kellmann, Ralf; Murray, Shauna A; Neilan, Brett A; Jakobsen, Kjetill S
2011-01-01
Saxitoxin is a potent neurotoxin that occurs in aquatic environments worldwide. Ingestion of vector species can lead to paralytic shellfish poisoning, a severe human illness that may lead to paralysis and death. In freshwaters, the toxin is produced by prokaryotic cyanobacteria; in marine waters, it is associated with eukaryotic dinoflagellates. However, several studies suggest that saxitoxin is not produced by dinoflagellates themselves, but by co-cultured bacteria. Here, we show that genes required for saxitoxin synthesis are encoded in the nuclear genomes of dinoflagellates. We sequenced >1.2×10(6) mRNA transcripts from the two saxitoxin-producing dinoflagellate strains Alexandrium fundyense CCMP1719 and A. minutum CCMP113 using high-throughput sequencing technology. In addition, we used in silico transcriptome analyses, RACE, qPCR and conventional PCR coupled with Sanger sequencing. These approaches successfully identified genes required for saxitoxin-synthesis in the two transcriptomes. We focused on sxtA, the unique starting gene of saxitoxin synthesis, and show that the dinoflagellate transcripts of sxtA have the same domain structure as the cyanobacterial sxtA genes. But, in contrast to the bacterial homologs, the dinoflagellate transcripts are monocistronic, have a higher GC content, occur in multiple copies, contain typical dinoflagellate spliced-leader sequences and eukaryotic polyA-tails. Further, we investigated 28 saxitoxin-producing and non-producing dinoflagellate strains from six different genera for the presence of genomic sxtA homologs. Our results show very good agreement between the presence of sxtA and saxitoxin-synthesis, except in three strains of A. tamarense, for which we amplified sxtA, but did not detect the toxin. Our work opens for possibilities to develop molecular tools to detect saxitoxin-producing dinoflagellates in the environment.
Quantum sequencing: opportunities and challenges
NASA Astrophysics Data System (ADS)
di Ventra, Massimiliano
Personalized or precision medicine refers to the ability of tailoring drugs to the specific genome and transcriptome of each individual. It is however not yet feasible due the high costs and slow speed of present DNA sequencing methods. I will discuss a sequencing protocol that requires the measurement of the distributions of transverse tunneling currents during the translocation of single-stranded DNA into nanochannels. I will show that such a quantum sequencing approach can reach unprecedented speeds, without requiring any chemical preparation, amplification or labeling. I will discuss recent experiments that support these theoretical predictions, the advantages of this approach over other sequencing methods, and stress the challenges that need to be overcome to render it commercially viable.
Godkin, A; Friede, T; Davenport, M; Stevanovic, S; Willis, A; Jewell, D; Hill, A; Rammensee, H G
1997-06-01
HLA-DQ8 (A1*0301, B1*0302) and -DQ2 (A1*0501, B1*0201) are both associated with diseases such as insulin-dependent diabetes mellitus and coeliac disease. We used the technique of pool sequencing to look at the requirements of peptides binding to HLA-DQ8, and combined these data with naturally sequenced ligands and in vitro binding assays to describe a novel motif for HLA-DQ8. The motif, which has the same basic format as many HLA-DR molecules, consists of four or five anchor regions, in the positions from the N-terminus of the binding core of n, n + 3, n + 5/6 and n + 8, i.e. P1, P4, P6/7 and P9. P1 and P9 require negative or polar residues, with mainly aliphatic residues at P4 and P6/7. The features of the HLA-DQ8 motif were then compared to a pool sequence of peptides eluted from HLA-DQ2. A consensus motif for the binding of a common peptide which may be involved in disease pathogenesis is described. Neither of the disease-associated alleles HLA-DQ2 and -DQ8 have Asp at position 57 of the beta-chain. This Asp, if present, may form a salt bridge with an Arg at position 79 of the alpha-chain and so alter the binding specificity of P9. HLA-DQ2 and -DQ8 both appear to prefer negatively charged amino acids at P9. In contrast, HLA-DQ7 (A1*0301, B1*0301), which is not associated with diabetes, has Asp at beta 57, allowing positively charged amino acids at P9. This analysis of the sequence features of DQ-binding peptides suggests molecular characteristics which may be useful to predict epitopes involved in disease pathogenesis.
A remark on copy number variation detection methods.
Li, Shuo; Dou, Xialiang; Gao, Ruiqi; Ge, Xinzhou; Qian, Minping; Wan, Lin
2018-01-01
Copy number variations (CNVs) are gain and loss of DNA sequence of a genome. High throughput platforms such as microarrays and next generation sequencing technologies (NGS) have been applied for genome wide copy number losses. Although progress has been made in both approaches, the accuracy and consistency of CNV calling from the two platforms remain in dispute. In this study, we perform a deep analysis on copy number losses on 254 human DNA samples, which have both SNP microarray data and NGS data publicly available from Hapmap Project and 1000 Genomes Project respectively. We show that the copy number losses reported from Hapmap Project and 1000 Genome Project only have < 30% overlap, while these reports are required to have cross-platform (e.g. PCR, microarray and high-throughput sequencing) experimental supporting by their corresponding projects, even though state-of-art calling methods were employed. On the other hand, copy number losses are found directly from HapMap microarray data by an accurate algorithm, i.e. CNVhac, almost all of which have lower read mapping depth in NGS data; furthermore, 88% of which can be supported by the sequences with breakpoint in NGS data. Our results suggest the ability of microarray calling CNVs and the possible introduction of false negatives from the unessential requirement of the additional cross-platform supporting. The inconsistency of CNV reports from Hapmap Project and 1000 Genomes Project might result from the inadequate information containing in microarray data, the inconsistent detection criteria, or the filtration effect of cross-platform supporting. The statistical test on CNVs called from CNVhac show that the microarray data can offer reliable CNV reports, and majority of CNV candidates can be confirmed by raw sequences. Therefore, the CNV candidates given by a good caller could be highly reliable without cross-platform supporting, so additional experimental information should be applied in need instead of necessarily.
SEED 2: a user-friendly platform for amplicon high-throughput sequencing data analyses.
Vetrovský, Tomáš; Baldrian, Petr; Morais, Daniel; Berger, Bonnie
2018-02-14
Modern molecular methods have increased our ability to describe microbial communities. Along with the advances brought by new sequencing technologies, we now require intensive computational resources to make sense of the large numbers of sequences continuously produced. The software developed by the scientific community to address this demand, although very useful, require experience of the command-line environment, extensive training and have steep learning curves, limiting their use. We created SEED 2, a graphical user interface for handling high-throughput amplicon-sequencing data under Windows operating systems. SEED 2 is the only sequence visualizer that empowers users with tools to handle amplicon-sequencing data of microbial community markers. It is suitable for any marker genes sequences obtained through Illumina, IonTorrent or Sanger sequencing. SEED 2 allows the user to process raw sequencing data, identify specific taxa, produce of OTU-tables, create sequence alignments and construct phylogenetic trees. Standard dual core laptops with 8 GB of RAM can handle ca. 8 million of Illumina PE 300 bp sequences, ca. 4GB of data. SEED 2 was implemented in Object Pascal and uses internal functions and external software for amplicon data processing. SEED 2 is a freeware software, available at http://www.biomed.cas.cz/mbu/lbwrf/seed/ as a self-contained file, including all the dependencies, and does not require installation. Supplementary data contain a comprehensive list of supported functions. daniel.morais@biomed.cas.cz. Supplementary data are available at Bioinformatics online. © The Author(s) 2018. Published by Oxford University Press.
Georgiev, O; Birnstiel, M L
1985-01-01
Analysis of cDNA sequences obtained from the small nuclear RNA U7 has previously suggested specific contacts, by base pairing, between the conserved stem-loop structure and CAAGAAAGA sequence of the histone pre-mRNA and the 5'-terminal sequence of the U7 RNA during RNA processing. In order to test some aspects of the model we have created a series of linker scan, deletion and insertion mutants of the 3' terminus of a sea urchin H3 histone gene and have injected mutant DNAs or in vitro synthesized precursors into frog oocyte nuclei for interpretation. We find that, in addition to the stem-loop structure of the mRNA, the CAAGAAAGA spacer transcript within the histone pre-mRNA is required absolutely for RNA processing, as predicted from our model. Spacer sequences immediately downstream of the CAAGAAAGA motif are not complementary to U7 RNA. Nevertheless, they are necessary for obtaining a maximal rate of RNA processing, as is the ACCA sequence coding for the 3' terminus of the mature mRNA. An increase of distance between the mRNA palindrome and the CAAGAAAGA by as little as six nucleotides abolishes all processing. It may, therefore, be useful to regard both these sequence motifs as part of one and the same RNA processing signal with narrowly defined topologies. Interestingly, U7 RNA-dependent 3' processing of histone pre-mRNA can occur in RNA injection experiments only when the in vitro synthesized pre-mRNA contains sequence extensions well beyond the region of sequence complementarities to the U7 RNA. In addition to directing 3' processing the terminal mRNA sequences may have a role in histone mRNA stabilization in the cytoplasmic compartment. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:2410259
Biogeochemical and Molecular Signatures of Anaerobic Methane Oxidation in a Marine Sediment
Thomsen, Trine R.; Finster, Kai; Ramsing, Niels B.
2001-01-01
Anaerobic methane oxidation was investigated in 6-m-long cores of marine sediment from Aarhus Bay, Denmark. Measured concentration profiles for methane and sulfate, as well as in situ rates determined with isotope tracers, indicated that there was a narrow zone of anaerobic methane oxidation about 150 cm below the sediment surface. Methane could account for 52% of the electron donor requirement for the peak sulfate reduction rate detected in the sulfate-methane transition zone. Molecular signatures of organisms present in the transition zone were detected by using selective PCR primers for sulfate-reducing bacteria and for Archaea. One primer pair amplified the dissimilatory sulfite reductase (DSR) gene of sulfate-reducing bacteria, whereas another primer (ANME) was designed to amplify archaeal sequences found in a recent study of sediments from the Eel River Basin, as these bacteria have been suggested to be anaerobic methane oxidizers (K. U. Hinrichs, J. M. Hayes, S. P. Sylva, P. G. Brewer, and E. F. DeLong, Nature 398:802–805, 1999). Amplification with the primer pairs produced more amplificate of both target genes with samples from the sulfate-methane transition zone than with samples from the surrounding sediment. Phylogenetic analysis of the DSR gene sequences retrieved from the transition zone revealed that they all belonged to a novel deeply branching lineage of diverse DSR gene sequences not related to any previously described DSR gene sequence. In contrast, DSR gene sequences found in the top sediment were related to environmental sequences from other estuarine sediments and to sequences of members of the genera Desulfonema, Desulfococcus, and Desulfosarcina. Phylogenetic analysis of 16S rRNA sequences obtained with the primers targeting the archaeal group of possible anaerobic methane oxidizers revealed two clusters of ANME sequences, both of which were affiliated with sequences from the Eel River Basin. PMID:11282617
Biogeochemical and molecular signatures of anaerobic methane oxidation in a marine sediment.
Thomsen, T R; Finster, K; Ramsing, N B
2001-04-01
Anaerobic methane oxidation was investigated in 6-m-long cores of marine sediment from Aarhus Bay, Denmark. Measured concentration profiles for methane and sulfate, as well as in situ rates determined with isotope tracers, indicated that there was a narrow zone of anaerobic methane oxidation about 150 cm below the sediment surface. Methane could account for 52% of the electron donor requirement for the peak sulfate reduction rate detected in the sulfate-methane transition zone. Molecular signatures of organisms present in the transition zone were detected by using selective PCR primers for sulfate-reducing bacteria and for Archaea. One primer pair amplified the dissimilatory sulfite reductase (DSR) gene of sulfate-reducing bacteria, whereas another primer (ANME) was designed to amplify archaeal sequences found in a recent study of sediments from the Eel River Basin, as these bacteria have been suggested to be anaerobic methane oxidizers (K. U. Hinrichs, J. M. Hayes, S. P. Sylva, P. G. Brewer, and E. F. DeLong, Nature 398:802-805, 1999). Amplification with the primer pairs produced more amplificate of both target genes with samples from the sulfate-methane transition zone than with samples from the surrounding sediment. Phylogenetic analysis of the DSR gene sequences retrieved from the transition zone revealed that they all belonged to a novel deeply branching lineage of diverse DSR gene sequences not related to any previously described DSR gene sequence. In contrast, DSR gene sequences found in the top sediment were related to environmental sequences from other estuarine sediments and to sequences of members of the genera Desulfonema, Desulfococcus, and Desulfosarcina. Phylogenetic analysis of 16S rRNA sequences obtained with the primers targeting the archaeal group of possible anaerobic methane oxidizers revealed two clusters of ANME sequences, both of which were affiliated with sequences from the Eel River Basin.
Targeted sequencing of plant genomes
Mark D. Huynh
2014-01-01
Next-generation sequencing (NGS) has revolutionized the field of genetics by providing a means for fast and relatively affordable sequencing. With the advancement of NGS, wholegenome sequencing (WGS) has become more commonplace. However, sequencing an entire genome is still not cost effective or even beneficial in all cases. In studies that do not require a whole-...
A Simulation of DNA Sequencing Utilizing 3M Post-It[R] Notes
ERIC Educational Resources Information Center
Christensen, Doug
2009-01-01
An inexpensive and equipment free approach to teaching the technical aspects of DNA sequencing. The activity described requires an instructor with a familiarity of DNA sequencing technology but provides a straight forward method of teaching the technical aspects of sequencing in the absence of expensive sequencing equipment. The final sequence…
Adaptive efficient compression of genomes
2012-01-01
Modern high-throughput sequencing technologies are able to generate DNA sequences at an ever increasing rate. In parallel to the decreasing experimental time and cost necessary to produce DNA sequences, computational requirements for analysis and storage of the sequences are steeply increasing. Compression is a key technology to deal with this challenge. Recently, referential compression schemes, storing only the differences between a to-be-compressed input and a known reference sequence, gained a lot of interest in this field. However, memory requirements of the current algorithms are high and run times often are slow. In this paper, we propose an adaptive, parallel and highly efficient referential sequence compression method which allows fine-tuning of the trade-off between required memory and compression speed. When using 12 MB of memory, our method is for human genomes on-par with the best previous algorithms in terms of compression ratio (400:1) and compression speed. In contrast, it compresses a complete human genome in just 11 seconds when provided with 9 GB of main memory, which is almost three times faster than the best competitor while using less main memory. PMID:23146997
Structural basis of GSK-3 inhibition by N-terminal phosphorylation and by the Wnt receptor LRP6.
Stamos, Jennifer L; Chu, Matthew Ling-Hon; Enos, Michael D; Shah, Niket; Weis, William I
2014-03-18
Glycogen synthase kinase-3 (GSK-3) is a key regulator of many cellular signaling pathways. Unlike most kinases, GSK-3 is controlled by inhibition rather than by specific activation. In the insulin and several other signaling pathways, phosphorylation of a serine present in a conserved sequence near the amino terminus of GSK-3 generates an auto-inhibitory peptide. In contrast, Wnt/β-catenin signal transduction requires phosphorylation of Ser/Pro rich sequences present in the Wnt co-receptors LRP5/6, and these motifs inhibit GSK-3 activity. We present crystal structures of GSK-3 bound to its phosphorylated N-terminus and to two of the phosphorylated LRP6 motifs. A conserved loop unique to GSK-3 undergoes a dramatic conformational change that clamps the bound pseudo-substrate peptides, and reveals the mechanism of primed substrate recognition. The structures rationalize target sequence preferences and suggest avenues for the design of inhibitors selective for a subset of pathways regulated by GSK-3. DOI: http://dx.doi.org/10.7554/eLife.01998.001.
Halbedel, Sven; Prager, Rita; Fuchs, Stephan; Trost, Eva; Werner, Guido; Flieger, Antje
2018-06-01
Listeria monocytogenes causes foodborne outbreaks with high mortality. For improvement of outbreak cluster detection, the German consiliary laboratory for listeriosis implemented whole-genome sequencing (WGS) in 2015. A total of 424 human L. monocytogenes isolates collected in 2007 to 2017 were subjected to WGS and core-genome multilocus sequence typing (cgMLST). cgMLST grouped the isolates into 38 complexes, reflecting 4 known and 34 unknown disease clusters. Most of these complexes were confirmed by single nucleotide polymorphism (SNP) calling, but some were further differentiated. Interestingly, several cgMLST cluster types were further subtyped by pulsed-field gel electrophoresis, partly due to phage insertions in the accessory genome. Our results highlight the usefulness of cgMLST for routine cluster detection but also show that cgMLST complexes require validation by methods providing higher typing resolution. Twelve cgMLST clusters included recent cases, suggesting activity of the source. Therefore, the cgMLST nomenclature data presented here may support future public health actions. Copyright © 2018 American Society for Microbiology.
Targeted gene insertion for molecular medicine.
Voigt, Katrin; Izsvák, Zsuzsanna; Ivics, Zoltán
2008-11-01
Genomic insertion of a functional gene together with suitable transcriptional regulatory elements is often required for long-term therapeutical benefit in gene therapy for several genetic diseases. A variety of integrating vectors for gene delivery exist. Some of them exhibit random genomic integration, whereas others have integration preferences based on attributes of the targeted site, such as primary DNA sequence and physical structure of the DNA, or through tethering to certain DNA sequences by host-encoded cellular factors. Uncontrolled genomic insertion bears the risk of the transgene being silenced due to chromosomal position effects, and can lead to genotoxic effects due to mutagenesis of cellular genes. None of the vector systems currently used in either preclinical experiments or clinical trials displays sufficient preferences for target DNA sequences that would ensure appropriate and reliable expression of the transgene and simultaneously prevent hazardous side effects. We review in this paper the advantages and disadvantages of both viral and non-viral gene delivery technologies, discuss mechanisms of target site selection of integrating genetic elements (viruses and transposons), and suggest distinct molecular strategies for targeted gene delivery.
Deletion of ultraconserved elements yields viable mice
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Zhu, Yiwen; Visel, Axel
2007-07-15
Ultraconserved elements have been suggested to retainextended perfect sequence identity between the human, mouse, and ratgenomes due to essential functional properties. To investigate thenecessities of these elements in vivo, we removed four non-codingultraconserved elements (ranging in length from 222 to 731 base pairs)from the mouse genome. To maximize the likelihood of observing aphenotype, we chose to delete elements that function as enhancers in amouse transgenic assay and that are near genes that exhibit markedphenotypes both when completely inactivated in the mouse as well as whentheir expression is altered due to other genomic modifications.Remarkably, all four resulting lines of mice lackingmore » these ultraconservedelements were viable and fertile, and failed to reveal any criticalabnormalities when assayed for a variety of phenotypes including growth,longevity, pathology and metabolism. In addition more targeted screens,informed by the abnormalities observed in mice where genes in proximityto the investigated elements had been altered, also failed to revealnotable abnormalities. These results, while not inclusive of all thepossible phenotypic impact of the deleted sequences, indicate thatextreme sequence constraint does not necessarily reflect crucialfunctions required for viability.« less
Transposases are the most abundant, most ubiquitous genes in nature.
Aziz, Ramy K; Breitbart, Mya; Edwards, Robert A
2010-07-01
Genes, like organisms, struggle for existence, and the most successful genes persist and widely disseminate in nature. The unbiased determination of the most successful genes requires access to sequence data from a wide range of phylogenetic taxa and ecosystems, which has finally become achievable thanks to the deluge of genomic and metagenomic sequences. Here, we analyzed 10 million protein-encoding genes and gene tags in sequenced bacterial, archaeal, eukaryotic and viral genomes and metagenomes, and our analysis demonstrates that genes encoding transposases are the most prevalent genes in nature. The finding that these genes, classically considered as selfish genes, outnumber essential or housekeeping genes suggests that they offer selective advantage to the genomes and ecosystems they inhabit, a hypothesis in agreement with an emerging body of literature. Their mobile nature not only promotes dissemination of transposable elements within and between genomes but also leads to mutations and rearrangements that can accelerate biological diversification and--consequently--evolution. By securing their own replication and dissemination, transposases guarantee to thrive so long as nucleic acid-based life forms exist.
Dennison, Sarah Rachel; Harris, Frederick; Brandenburg, Klaus; Phoenix, David Andrew
2007-11-01
The barley yellow dwarf virus movement protein (BYDV-MP) requires its N-terminal sequence to promote the transport of viral RNA into the nuclear compartment of host plant cells. Here, graphical analysis predicts that this sequence would form a membrane interactive amphiphilic alpha-helix. Confirming this prediction, NT1, a peptide homologue of the BYDV-MP N-terminal sequence, was found to be alpha-helical (65%) in the presence of vesicles mimics of the nuclear membrane. The peptide increased the fluidity of these nuclear membrane mimics (rise in wavenumber of circa 0.5-1.0 cm(-1)) and induced surface pressure changes of 2 mN m(-1) in lipid monolayers with corresponding compositions. Taken with isotherm analysis these results suggest that BYDV-MP forms an N-terminal amphiphilic alpha-helix, which partitions into the nuclear membrane primarily through thermodynamically stable associations with the membrane lipid headgroup region. We speculate that these associations may play a role in targeting of the nuclear membrane by BYDM-MP.
Effects of Dopamine Medication on Sequence Learning with Stochastic Feedback in Parkinson's Disease
Seo, Moonsang; Beigi, Mazda; Jahanshahi, Marjan; Averbeck, Bruno B.
2010-01-01
A growing body of evidence suggests that the midbrain dopamine system plays a key role in reinforcement learning and disruption of the midbrain dopamine system in Parkinson's disease (PD) may lead to deficits on tasks that require learning from feedback. We examined how changes in dopamine levels (“ON” and “OFF” their dopamine medication) affect sequence learning from stochastic positive and negative feedback using Bayesian reinforcement learning models. We found deficits in sequence learning in patients with PD when they were “ON” and “OFF” medication relative to healthy controls, but smaller differences between patients “OFF” and “ON”. The deficits were mainly due to decreased learning from positive feedback, although across all participant groups learning was more strongly associated with positive than negative feedback in our task. The learning in our task is likely mediated by the relatively depleted dorsal striatum and not the relatively intact ventral striatum. Therefore, the changes we see in our task may be due to a strong loss of phasic dopamine signals in the dorsal striatum in PD. PMID:20740077
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies
NASA Astrophysics Data System (ADS)
Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.
2015-10-01
APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA.
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies.
Shlyakhtenko, Luda S; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S; Lyubchenko, Yuri L
2015-10-27
APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA.
Smith, Lindsay D.; Dickinson, Rachel L.; Lucas, Christian M.; Cousins, Alex; Malygin, Alexey A.; Weldon, Carika; Perrett, Andrew J.; Bottrill, Andrew R.; Searle, Mark S.; Burley, Glenn A.; Eperon, Ian C.
2014-01-01
Summary The use of oligonucleotides to activate the splicing of selected exons is limited by a poor understanding of the mechanisms affected. A targeted bifunctional oligonucleotide enhancer of splicing (TOES) anneals to SMN2 exon 7 and carries an exonic splicing enhancer (ESE) sequence. We show that it stimulates splicing specifically of intron 6 in the presence of repressing sequences in intron 7. Complementarity to the 5′ end of exon 7 increases U2AF65 binding, but the ESE sequence is required for efficient recruitment of U2 snRNP. The ESE forms at least three coexisting discrete states: a quadruplex, a complex containing only hnRNP F/H, and a complex enriched in the activator SRSF1. Neither hnRNP H nor quadruplex formation contributes to ESE activity. The results suggest that splicing limited by weak signals can be rescued by rapid exchange of TOES oligonucleotides in various complexes and raise the possibility that SR proteins associate transiently with ESEs. PMID:25263560
Yeh, Po-Yuan; Wu, Hung-Yi
2014-07-30
It has been demonstrated that, in addition to genomic RNA, sgmRNA is able to serve as a template for the synthesis of the negative-strand [(-)-strand] complement. However, the cis-acting elements on the positive-strand [(+)-strand] sgmRNA required for (-)-strand sgmRNA synthesis have not yet been systematically identified. In this study, we employed real-time quantitative reverse transcription polymerase chain reaction to analyze the cis-acting elements on bovine coronavirus (BCoV) sgmRNA 7 required for the synthesis of its (-)-strand counterpart by deletion mutagenesis. The major findings are as follows. (1) Deletion of the 5'-terminal leader sequence on sgmRNA 7 decreased the synthesis of the (-)-strand sgmRNA complement. (2) Deletions of the 3' untranslated region (UTR) bulged stem-loop showed no effect on (-)-strand sgmRNA synthesis; however, deletion of the 3' UTR pseudoknot decreased the yield of (-)-strand sgmRNA. (3) Nucleotides positioned from -15 to -34 of the sgmRNA 7 3'-terminal region are required for efficient (-)-strand sgmRNA synthesis. (4) Nucleotide species at the 3'-most position (-1) of sgmRNA 7 is correlated to the efficiency of (-)-strand sgmRNA synthesis. These results together suggest, in principle, that the 5'- and 3'-terminal sequences on sgmRNA 7 harbor cis-acting elements are critical for efficient (-)-strand sgmRNA synthesis in BCoV.
Automatic Debugging Support for UML Designs
NASA Technical Reports Server (NTRS)
Schumann, Johann; Swanson, Keith (Technical Monitor)
2001-01-01
Design of large software systems requires rigorous application of software engineering methods covering all phases of the software process. Debugging during the early design phases is extremely important, because late bug-fixes are expensive. In this paper, we describe an approach which facilitates debugging of UML requirements and designs. The Unified Modeling Language (UML) is a set of notations for object-orient design of a software system. We have developed an algorithm which translates requirement specifications in the form of annotated sequence diagrams into structured statecharts. This algorithm detects conflicts between sequence diagrams and inconsistencies in the domain knowledge. After synthesizing statecharts from sequence diagrams, these statecharts usually are subject to manual modification and refinement. By using the "backward" direction of our synthesis algorithm. we are able to map modifications made to the statechart back into the requirements (sequence diagrams) and check for conflicts there. Fed back to the user conflicts detected by our algorithm are the basis for deductive-based debugging of requirements and domain theory in very early development stages. Our approach allows to generate explanations oil why there is a conflict and which parts of the specifications are affected.
NASA Astrophysics Data System (ADS)
Campbell, Ian H.; Davies, D. Rhodri
2017-02-01
The changes that occur at the boundary between the Archean and Proterozoic eons are arguably the most fundamental to affect the evolution of Earth's continental crust. The principal component of Archean continental crust is Granite-Greenstone Terranes (GGTs), with granites always dominant. The greenstones consist of a lower sequence of submarine komatiites and basalts, which erupted onto a pre-existing Tonalite-Trondhjemite-Granodiorite (TTG) crust. These basaltic rocks pass upwards initially into evolved volcanic rocks, such as andesites and dacites and, subsequently, into reworked felsic pyroclastic material and immature sediments. This transition coincides with widespread emplacement of granitoids, which stabilised (cratonised) the continental crust. Proterozoic supra-crustal rocks, on the other hand, are dominated by extensive flat-lying platform sequences of mature sediments, which were deposited on stable cratonic basements, with basaltic rocks appreciably less abundant. The siliceous TTGs cannot be produced by direct melting of the mantle, with most hypotheses for their origin requiring them to be underlain by a complimentary dense amphibole-garnet-pyroxenite root, which we suggest acted as ballast to the early continents. Ubiquitous continental pillow basalts in Archean lower greenstone sequences require the early continental crust to have been sub-marine, whereas the appearance of abundant clastic sediments, at higher stratigraphic levels, shows that it had emerged above sea level by the time of sedimentation. We hypothesise that the production of komatiites and associated basalts, the rise of the continental crust, widespread melting of the continental crust, the onset of sedimentation and subsequent cratonisation form a continuum that is the direct result of removal of the continent's dense amphibole-garnet-pyroxenite roots, triggered at a regional scale by the arrival of a mantle plume at the base of the lithosphere. Our idealised calculations suggest that the removal of 40 km of the amphibole-garnet-pyroxenite root would have raised the average level of the continental crust by ∼3 km. The emergence of the continental crust was an essential precursor to the rise of oxygen, which started some 200 Myr later.
Effort in Multitasking: Local and Global Assessment of Effort.
Kiesel, Andrea; Dignath, David
2017-01-01
When performing multiple tasks in succession, self-organization of task order might be superior compared to external-controlled task schedules, because self-organization allows optimizing processing modes and thus reduces switch costs, and it increases commitment to task goals. However, self-organization is an additional executive control process that is not required if task order is externally specified and as such it is considered as time-consuming and effortful. To compare self-organized and externally controlled task scheduling, we suggest assessing global subjective and objectives measures of effort in addition to local performance measures. In our new experimental approach, we combined characteristics of dual tasking settings and task switching settings and compared local and global measures of effort in a condition with free choice of task sequence and a condition with cued task sequence. In a multi-tasking environment, participants chose the task order while the task requirement of the not-yet-performed task remained the same. This task preview allowed participants to work on the previously non-chosen items in parallel and resulted in faster responses and fewer errors in task switch trials than in task repetition trials. The free-choice group profited more from this task preview than the cued group when considering local performance measures. Nevertheless, the free-choice group invested more effort than the cued group when considering global measures. Thus, self-organization in task scheduling seems to be effortful even in conditions in which it is beneficiary for task processing. In a second experiment, we reduced the possibility of task preview for the not-yet-performed tasks in order to hinder efficient self-organization. Here neither local nor global measures revealed substantial differences between the free-choice and a cued task sequence condition. Based on the results of both experiments, we suggest that global assessment of effort in addition to local performance measures might be a useful tool for multitasking research.
Fast single-pass alignment and variant calling using sequencing data
USDA-ARS?s Scientific Manuscript database
Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...
NASA Technical Reports Server (NTRS)
Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.
1995-01-01
Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 +/- 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.
Landini, P; Volkert, M R
1995-04-07
The Escherichia coli aidB gene is part of the adaptive response to DNA methylation damage. Genes belonging to the adaptive response are positively regulated by the ada gene; the Ada protein acts as a transcriptional activator when methylated in one of its cysteine residues at position 69. Through DNaseI protection assays, we show that methylated Ada (meAda) is able to bind a DNA sequence between 40 and 60 base pairs upstream of the aidB transcriptional startpoint. Binding of meAda is necessary to activate transcription of the adaptive response genes; accordingly, in vitro transcription of aidB is dependent on the presence of meAda. Unmethylated Ada protein shows no protection against DNaseI digestion in the aidB promoter region nor does it promote aidB in vitro transcription. The aidB Ada-binding site shows only weak homology to the proposed consensus sequences for Ada-binding sites in E. coli (AAANNAA and AAAGCGCA) but shares a higher degree of similarity with the Ada-binding regions from other bacterial species, such as Salmonella typhimurium and Bacillus subtilis. Based on the comparison of five different Ada-dependent promoter regions, we suggest that a possible recognition sequence for meAda might be AATnnnnnnG-CAA. Higher concentrations of Ada are required for the binding of aidB than for the ada promoter, suggesting lower affinity of the protein for the aidB Ada-binding site. Common features in the Ada-binding regions of ada and aidB are a high A/T content, the presence of an inverted repeat structure, and their position relative to the transcriptional start site. We propose that these elements, in addition to the proposed recognition sequence, are important for binding of the Ada protein.
Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.
Magner, Abram; Duda, Jarosław; Szpankowski, Wojciech; Grama, Ananth
2016-06-01
Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.
Künzler, Markus; Gerstberger, Thomas; Stutz, Françoise; Bischoff, F. Ralf; Hurt, Ed
2000-01-01
The RanGTP-binding protein RanBP1, which is located in the cytoplasm, has been implicated in release of nuclear export complexes from the cytoplasmic side of the nuclear pore complex. Here we show that Yrb1 (the yeast homolog of RanBP1) shuttles between the nucleus and the cytoplasm. Nuclear import of Yrb1 is a facilitated process that requires a short basic sequence within the Ran-binding domain (RBD). By contrast, nuclear export of Yrb1 requires an intact RBD, which forms a ternary complex with the Xpo1 (Crm1) NES receptor in the presence of RanGTP. Nuclear export of Yrb1, however, is insensitive towards leptomycin B, suggesting a novel type of substrate recognition between Yrb1 and Xpo1. Taken together, these data suggest that ongoing nuclear import and export is an important feature of Yrb1 function in vivo. PMID:10825193
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Sloma, A; Rufo, G A; Theriault, K A; Dwyer, M; Wilson, S W; Pero, J
1991-01-01
We have purified a minor extracellular serine protease from a strain of Bacillus subtilis bearing null mutations in five extracellular protease genes: apr, npr, epr, bpr, and mpr (A. Sloma, C. Rudolph, G. Rufo, Jr., B. Sullivan, K. Theriault, D. Ally, and J. Pero, J. Bacteriol. 172:1024-1029, 1990). During purification, this novel protease (Vpr) was found bound in a complex in the void volume after gel filtration chromatography. The amino-terminal sequence of the purified protein was determined, and an oligonucleotide probe was constructed on the basis of the amino acid sequence. This probe was used to clone the structural gene (vpr) for this protease. The gene encodes a primary product of 806 amino acids. The amino acid sequence of the mature protein was preceded by a signal sequence of approximately 28 amino acids and a prosequence of approximately 132 amino acids. The mature protein has a predicted molecular weight of 68,197; however, the isolated protein has an apparent molecular weight of 28,500, suggesting that Vpr undergoes C-terminal processing or proteolysis. The vpr gene maps in the ctrA-sacA-epr region of the chromosome and is not required for growth or sporulation. Images FIG. 1 PMID:1938892
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
Population genetic of seahorses is confidently influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses under complex marine environments.
Lowry, Kym; Woodman, Andrew; Cook, Jonathan; Evans, David J.
2014-01-01
Recombination in enteroviruses provides an evolutionary mechanism for acquiring extensive regions of novel sequence, is suggested to have a role in genotype diversity and is known to have been key to the emergence of novel neuropathogenic variants of poliovirus. Despite the importance of this evolutionary mechanism, the recombination process remains relatively poorly understood. We investigated heterologous recombination using a novel reverse genetic approach that resulted in the isolation of intermediate chimeric intertypic polioviruses bearing genomes with extensive duplicated sequences at the recombination junction. Serial passage of viruses exhibiting such imprecise junctions yielded progeny with increased fitness which had lost the duplicated sequences. Mutations or inhibitors that changed polymerase fidelity or the coalescence of replication complexes markedly altered the yield of recombinants (but did not influence non-replicative recombination) indicating both that the process is replicative and that it may be possible to enhance or reduce recombination-mediated viral evolution if required. We propose that extant recombinants result from a biphasic process in which an initial recombination event is followed by a process of resolution, deleting extraneous sequences and optimizing viral fitness. This process has implications for our wider understanding of ‘evolution by duplication’ in the positive-strand RNA viruses. PMID:24945141
Directed evolution of the TALE N-terminal domain for recognition of all 5' bases.
Lamb, Brian M; Mercer, Andrew C; Barbas, Carlos F
2013-11-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5'-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5' T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5' T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes.
Camerlengo, Terry; Ozer, Hatice Gulcin; Onti-Srinivasan, Raghuram; Yan, Pearlly; Huang, Tim; Parvin, Jeffrey; Huang, Kun
2012-01-01
Next Generation Sequencing is highly resource intensive. NGS Tasks related to data processing, management and analysis require high-end computing servers or even clusters. Additionally, processing NGS experiments requires suitable storage space and significant manual interaction. At The Ohio State University's Biomedical Informatics Shared Resource, we designed and implemented a scalable architecture to address the challenges associated with the resource intensive nature of NGS secondary analysis built around Illumina Genome Analyzer II sequencers and Illumina's Gerald data processing pipeline. The software infrastructure includes a distributed computing platform consisting of a LIMS called QUEST (http://bisr.osumc.edu), an Automation Server, a computer cluster for processing NGS pipelines, and a network attached storage device expandable up to 40TB. The system has been architected to scale to multiple sequencers without requiring additional computing or labor resources. This platform provides demonstrates how to manage and automate NGS experiments in an institutional or core facility setting.
Identification and genomic characterization of a novel rat bocavirus from brown rats in China.
Lau, Susanna K P; Yeung, Hazel C; Li, Kenneth S M; Lam, Carol S F; Cai, Jian-Piao; Yuen, Ming-Chi; Wang, Ming; Zheng, Bo-Jian; Woo, Patrick C Y; Yuen, Kwok-Yung
2017-01-01
Despite recent discoveries of novel animal bocaparvoviruses, current understandings on the diversity and evolution of bocaparvoviruses are still limited. We report the identification and genome characterization of a novel bocaparvovirus, rat bocaparvovirus (RBoV), in brown rats (Rattus norvegicus) in China. RBoV was detected in 11.5%, 2.4%, 16.2% and 0.3% of alimentary, respiratory, spleen and kidney samples respectively, of 636 brown rats by PCR, but not in samples of other rodent species, suggesting that brown rats are the primary reservoir of RBoV. Six RBoV genomes sequenced from three brown rats revealed the presence of three ORFs, characteristic of bocaparvoviruses. Phylogenetic analysis showed that RBoV was distantly related to other bocaparvoviruses, forming a distinct cluster within the genus, with ≤55.5% nucleotide identities to the genome of ungulate bocaparvovirus 3, supporting its classification as a novel bocaparvovirus species. RBoV possessed a putative second exon encoding the C-terminal region of NS1 and conserved RNA splicing signals, similar to human bocaparvoviruses and canine bocaparvovirus. In contrast to human, feline and canine bocaparvoviruses which demonstrates inter/intra-host viral diversity, partial VP1/VP2 sequences of 49 RBoV strains demonstrated little inter-host genetic diversity, suggesting a single genetic group. Although the pathogenicity of RBoV remains to be determined, its presence in different host tissues suggests wide tissue tropism. RBoV represents the first bocaparvovirus in rodents with genome sequenced, which extends our knowledge on the host range of bocaparvoviruses. Further studies are required to better understand the epidemiology, genetic diversity and pathogenicity of bocaparvoviruses in different rodent populations. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steindorff, Andrei S.; Noronha, Elilane F.; Ulhoa, Cirano J.
2015-03-17
Biological control is a complex process which requires many mechanisms and a high diversity of biochemical pathways. The species of Trichoderma harzianum are well known for their biocontrol activity against many plant pathogens. To gain new insights into the biocontrol mechanism used by T. harzianum, we sequenced the isolate TR274 genome using Illumina. The assembly was performed using AllPaths-LG with a maximum coverage of 100x. The assembly resulted in 2282 contigs with a N50 of 37033bp. The genome size generated was 40.8 Mb and the GC content was 47.7%, similar to other Trichoderma genomes. Using the JGI Annotation Pipeline wemore » predicted 13,932 genes with a high transcriptome support. CEGMA tests suggested 100% genome completeness and 97.9% of RNA-SEQ reads were mapped to the genome. The phylogenetic comparison using orthologous proteins with all Trichoderma genomes sequenced at JGI, corroborates the Trichoderma (T. asperellum and T. atroviride), Longibrachiatum (T. reesei and T. longibrachiatum) and Pachibasium (T. harzianum and T. virens) section division described previously. The comparison between two Trichoderma harzianum species suggests a high genome similarity but some strain-specific expansions. Analyses of the secondary metabolites, CAZymes, transporters, proteases, transcription factors were performed. The Pachybasium section expanded virtually all categories analyzed compared with the other sections, specially Longibrachiatum section, that shows a clear contraction. These results suggests that these proteins families have an important role in their respective phenotypes. Future analysis will improve the understanding of this complex genus and give some insights about its lifestyle and the interactions with the environment.« less
Williams, N P; Mueller, P P; Hinnebusch, A G
1988-01-01
Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Congenital chloride diarrhea needs to be distinguished from Bartter and Gitelman syndrome.
Matsunoshita, Natsuki; Nozu, Kandai; Yoshikane, Masahide; Kawaguchi, Azusa; Fujita, Naoya; Morisada, Naoya; Ishimori, Shingo; Yamamura, Tomohiko; Minamikawa, Shogo; Horinouchi, Tomoko; Nakanishi, Keita; Fujimura, Junya; Ninchoji, Takeshi; Morioka, Ichiro; Nagase, Hiroaki; Taniguchi-Ikeda, Mariko; Kaito, Hiroshi; Iijima, Kazumoto
2018-05-30
Pseudo-Bartter/Gitelman syndrome (p-BS/GS) encompasses a clinically heterogeneous group of inherited or acquired disorders similar to Bartter syndrome (BS) or Gitelman syndrome (GS), both renal salt-losing tubulopathies. Phenotypic overlap frequently occurs between p-BS/GS and BS/GS, which are difficult to diagnose based on their clinical presentation and require genetic tests for accurate diagnosis. In addition, p-BS/GS can occur as a result of other inherited diseases such as cystic fibrosis, autosomal dominant hypocalcemia, Dent disease, or congenital chloride diarrhea (CCD). However, the detection of the variants in genes other than known BS/GS-causing genes by conventional Sanger sequencing requires substantial time and resources. We studied 27 cases clinically diagnosed with BS/GS, but with negative genetic tests for known BS/GS genes. We conducted targeted sequencing for 22 genes including genes responsible for tubulopathies and other inherited diseases manifesting with p-BS/GS symptoms. We detected the SLC26A3 gene variants responsible for CCD in two patients. In Patient 1, we found the SLC26A3 compound heterozygous variants: c.354delC and c.1008insT. In Patient 2, we identified the compound heterozygous variants: c.877G > A, p.(Glu293Lys), and c.1008insT. Our results suggest that a comprehensive genetic screening system using targeted sequencing is useful for the diagnosis of patients with p-BS/GS with alternative genetic origins.
Comparative transcriptome analysis of microsclerotia development in Nomuraea rileyi.
Song, Zhangyong; Yin, Youping; Jiang, Shasha; Liu, Juanjuan; Chen, Huan; Wang, Zhongkang
2013-06-19
Nomuraea rileyi is used as an environmental-friendly biopesticide. However, mass production and commercialization of this organism are limited due to its fastidious growth and sporulation requirements. When cultured in amended medium, we found that N. rileyi could produce microsclerotia bodies, replacing conidiophores as the infectious agent. However, little is known about the genes involved in microsclerotia development. In the present study, the transcriptomes were analyzed using next-generation sequencing technology to find the genes involved in microsclerotia development. A total of 4.69 Gb of clean nucleotides comprising 32,061 sequences was obtained, and 20,919 sequences were annotated (about 65%). Among the annotated sequences, only 5928 were annotated with 34 gene ontology (GO) functional categories, and 12,778 sequences were mapped to 165 pathways by searching against the Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) database. Furthermore, we assessed the transcriptomic differences between cultures grown in minimal and amended medium. In total, 4808 sequences were found to be differentially expressed; 719 differentially expressed unigenes were assigned to 25 GO classes and 1888 differentially expressed unigenes were assigned to 161 KEGG pathways, including 25 enrichment pathways. Subsequently, we examined the up-regulation or uniquely expressed genes following amended medium treatment, which were also expressed on the enrichment pathway, and found that most of them participated in mediating oxidative stress homeostasis. To elucidate the role of oxidative stress in microsclerotia development, we analyzed the diversification of unigenes using quantitative reverse transcription-PCR (RT-qPCR). Our findings suggest that oxidative stress occurs during microsclerotia development, along with a broad metabolic activity change. Our data provide the most comprehensive sequence resource available for the study of N. rileyi. We believe that the transcriptome datasets will serve as an important public information platform to accelerate studies on N. rileyi microsclerotia.
Molecular cloning of the pheromone biosynthesis-activating neuropeptide in Helicoverpa zea.
Davis, M T; Vakharia, V N; Henry, J; Kempe, T G; Raina, A K
1992-01-01
Pheromone biosynthesis-activating neuropeptide (PBAN) regulates sex pheromone biosynthesis in female Helicoverpa (Heliothis) zea. Two oligonucleotide probes representing two overlapping amino acid regions of PBAN were used to screen 2.5 x 10(5) recombinant plaques, and a positive recombinant clone was isolated. Sequence analysis of the isolated clone showed that the PBAN gene is interrupted after the codon encoding amino acid 14 by a 0.63-kilobase (kb) intron. Preceding the PBAN amino acid sequence is a 10-amino acid sequence containing a pentapeptide Phe-Thr-Pro-Arg-Leu, which is followed by a Gly-Arg-Arg processing site. Immediately after the PBAN amino acid sequence is a Gly-Arg processing site and a short stretch of 10 amino acids. This 10-amino acid sequence contains a repeat of the PBAN C-terminal pentapeptide Phe-Ser-Pro-Arg-Leu and is terminated by another Gly-Arg processing site. It is suggested that the PBAN gene in H. zea might carry, besides PBAN, a 7- and an 8-residue amidated peptide, which share with PBAN the core C-terminal pentapeptide Phe-(Ser or Thr)-Pro-Arg-Leu-NH2. The C-terminal pentapeptide sequence of PBAN represents the minimum sequence required for pheromonotropic activity in H. zea and also bears a high degree of homology to the pyrokinin family of insect peptides with myotropic activity. It is possible that the putative heptapeptide and octapeptide might be new members of the pyrokinin family, with pheromonotropic and/or myotropic activities. Thus, the PBAN gene products, besides affecting sexual behavior, might have broad influence on many biological processes in H. zea. Images PMID:1729680
Influence of gag and RRE Sequences on HIV-1 RNA Packaging Signal Structure and Function.
Kharytonchyk, Siarhei; Brown, Joshua D; Stilger, Krista; Yasin, Saif; Iyer, Aishwarya S; Collins, John; Summers, Michael F; Telesnitsky, Alice
2018-07-06
The packaging signal (Ψ) and Rev-responsive element (RRE) enable unspliced HIV-1 RNAs' export from the nucleus and packaging into virions. For some retroviruses, engrafting Ψ onto a heterologous RNA is sufficient to direct encapsidation. In contrast, HIV-1 RNA packaging requires 5' leader Ψ elements plus poorly defined additional features. We previously defined minimal 5' leader sequences competitive with intact Ψ for HIV-1 packaging, and here examined the potential roles of additional downstream elements. The findings confirmed that together, HIV-1 5' leader Ψ sequences plus a nuclear export element are sufficient to specify packaging. However, RNAs trafficked using a heterologous export element did not compete well with RNAs using HIV-1's RRE. Furthermore, some RNA additions to well-packaged minimal vectors rendered them packaging-defective. These defects were rescued by extending gag sequences in their native context. To understand these packaging defects' causes, in vitro dimerization properties of RNAs containing minimal packaging elements were compared to RNAs with sequence extensions that were or were not compatible with packaging. In vitro dimerization was found to correlate with packaging phenotypes, suggesting that HIV-1 evolved to prevent 5' leader residues' base pairing with downstream residues and misfolding of the packaging signal. Our findings explain why gag sequences have been implicated in packaging and show that RRE's packaging contributions appear more specific than nuclear export alone. Paired with recent work showing that sequences upstream of Ψ can dictate RNA folds, the current work explains how genetic context of minimal packaging elements contributes to HIV-1 RNA fate determination. Copyright © 2018 Elsevier Ltd. All rights reserved.
Cell type-specific termination of transcription by transposable element sequences.
Conley, Andrew B; Jordan, I King
2012-09-30
Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
Thomas, Sean; Martinez, L L Isadora Trejo; Westenberger, Scott J; Sturm, Nancy R
2007-05-24
The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for Trypanosoma cruzi, the causative agent of Chagas disease, however the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised. Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from T. cruzi whole genome shotgun sequencing data. With these sequences and all published T. cruzi minicircle sequences, 108 unique guide RNAs from all known T. cruzi minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in T. cruzi minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found. The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of T. cruzi guide RNAs. Based on these preferences, a method of predicting T. cruzi guide RNAs using only primary minicircle sequence data was created.
Driving on the surface of Mars with the rover sequencing and visualization program
NASA Technical Reports Server (NTRS)
Wright, J.; Hartman, F.; Cooper, B.; Maxwell, S.; Yen, J.; Morrison, J.
2005-01-01
Operating a rover on Mars is not possible using teleoperations due to the distance involved and the bandwith limitations. To operate these rovers requires sophisticated tools to make operators knowledgeable of the terrain, hazards, features of interest, and rover state and limitations, and to support building command sequences and rehearsing expected operations. This paper discusses how the Rover Sequencing and Visualization program and a small set of associated tools support this requirement.
Neville, B. Anne; Sheridan, Paul O.; Harris, Hugh M. B.; Coughlan, Simone; Flint, Harry J.; Duncan, Sylvia H.; Jeffery, Ian B.; Claesson, Marcus J.; Ross, R. Paul; Scott, Karen P.; O'Toole, Paul W.
2013-01-01
Some Eubacterium and Roseburia species are among the most prevalent motile bacteria present in the intestinal microbiota of healthy adults. These flagellate species contribute “cell motility” category genes to the intestinal microbiome and flagellin proteins to the intestinal proteome. We reviewed and revised the annotation of motility genes in the genomes of six Eubacterium and Roseburia species that occur in the human intestinal microbiota and examined their respective locus organization by comparative genomics. Motility gene order was generally conserved across these loci. Five of these species harbored multiple genes for predicted flagellins. Flagellin proteins were isolated from R. inulinivorans strain A2-194 and from E. rectale strains A1-86 and M104/1. The amino-termini sequences of the R. inulinivorans and E. rectale A1-86 proteins were almost identical. These protein preparations stimulated secretion of interleukin-8 (IL-8) from human intestinal epithelial cell lines, suggesting that these flagellins were pro-inflammatory. Flagellins from the other four species were predicted to be pro-inflammatory on the basis of alignment to the consensus sequence of pro-inflammatory flagellins from the β- and γ- proteobacteria. Many fliC genes were deduced to be under the control of σ28. The relative abundance of the target Eubacterium and Roseburia species varied across shotgun metagenomes from 27 elderly individuals. Genes involved in the flagellum biogenesis pathways of these species were variably abundant in these metagenomes, suggesting that the current depth of coverage used for metagenomic sequencing (3.13–4.79 Gb total sequence in our study) insufficiently captures the functional diversity of genomes present at low (≤1%) relative abundance. E. rectale and R. inulinivorans thus appear to synthesize complex flagella composed of flagellin proteins that stimulate IL-8 production. A greater depth of sequencing, improved evenness of sequencing and improved metagenome assembly from short reads will be required to facilitate in silico analyses of complete complex biochemical pathways for low-abundance target species from shotgun metagenomes. PMID:23935906
A survey and evaluations of histogram-based statistics in alignment-free sequence comparison.
Luczak, Brian B; James, Benjamin T; Girgis, Hani Z
2017-12-06
Since the dawn of the bioinformatics field, sequence alignment scores have been the main method for comparing sequences. However, alignment algorithms are quadratic, requiring long execution time. As alternatives, scientists have developed tens of alignment-free statistics for measuring the similarity between two sequences. We surveyed tens of alignment-free k-mer statistics. Additionally, we evaluated 33 statistics and multiplicative combinations between the statistics and/or their squares. These statistics are calculated on two k-mer histograms representing two sequences. Our evaluations using global alignment scores revealed that the majority of the statistics are sensitive and capable of finding similar sequences to a query sequence. Therefore, any of these statistics can filter out dissimilar sequences quickly. Further, we observed that multiplicative combinations of the statistics are highly correlated with the identity score. Furthermore, combinations involving sequence length difference or Earth Mover's distance, which takes the length difference into account, are always among the highest correlated paired statistics with identity scores. Similarly, paired statistics including length difference or Earth Mover's distance are among the best performers in finding the K-closest sequences. Interestingly, similar performance can be obtained using histograms of shorter words, resulting in reducing the memory requirement and increasing the speed remarkably. Moreover, we found that simple single statistics are sufficient for processing next-generation sequencing reads and for applications relying on local alignment. Finally, we measured the time requirement of each statistic. The survey and the evaluations will help scientists with identifying efficient alternatives to the costly alignment algorithm, saving thousands of computational hours. The source code of the benchmarking tool is available as Supplementary Materials. © The Author 2017. Published by Oxford University Press.
Bacterial cellulose synthesis mechanism of facultative anaerobe Enterobacter sp. FY-07.
Ji, Kaihua; Wang, Wei; Zeng, Bing; Chen, Sibin; Zhao, Qianqian; Chen, Yueqing; Li, Guoqiang; Ma, Ting
2016-02-25
Enterobacter sp. FY-07 can produce bacterial cellulose (BC) under aerobic and anaerobic conditions. Three potential BC synthesis gene clusters (bcsI, bcsII and bcsIII) of Enterobacter sp. FY-07 have been predicted using genome sequencing and comparative genome analysis, in which bcsIII was confirmed as the main contributor to BC synthesis by gene knockout and functional reconstitution methods. Protein homology, gene arrangement and gene constitution analysis indicated that bcsIII had high identity to the bcsI operon of Enterobacter sp. 638; however, its arrangement and composition were same as those of BC synthesizing operon of G. xylinum ATCC53582 except for the flanking sequences. According to the BC biosynthesizing process, oxygen is not directly involved in the reactions of BC synthesis, however, energy is required to activate intermediate metabolites and synthesize the activator, c-di-GMP. Comparative transcriptome and metabolite quantitative analysis demonstrated that under anaerobic conditions genes involved in the TCA cycle were downregulated, however, genes in the nitrate reduction and gluconeogenesis pathways were upregulated, especially, genes in three pyruvate metabolism pathways. These results suggested that Enterobacter sp. FY-07 could produce energy efficiently under anaerobic conditions to meet the requirement of BC biosynthesis.
Bacterial cellulose synthesis mechanism of facultative anaerobe Enterobacter sp. FY-07
Ji, Kaihua; Wang, Wei; Zeng, Bing; Chen, Sibin; Zhao, Qianqian; Chen, Yueqing; Li, Guoqiang; Ma, Ting
2016-01-01
Enterobacter sp. FY-07 can produce bacterial cellulose (BC) under aerobic and anaerobic conditions. Three potential BC synthesis gene clusters (bcsI, bcsII and bcsIII) of Enterobacter sp. FY-07 have been predicted using genome sequencing and comparative genome analysis, in which bcsIII was confirmed as the main contributor to BC synthesis by gene knockout and functional reconstitution methods. Protein homology, gene arrangement and gene constitution analysis indicated that bcsIII had high identity to the bcsI operon of Enterobacter sp. 638; however, its arrangement and composition were same as those of BC synthesizing operon of G. xylinum ATCC53582 except for the flanking sequences. According to the BC biosynthesizing process, oxygen is not directly involved in the reactions of BC synthesis, however, energy is required to activate intermediate metabolites and synthesize the activator, c-di-GMP. Comparative transcriptome and metabolite quantitative analysis demonstrated that under anaerobic conditions genes involved in the TCA cycle were downregulated, however, genes in the nitrate reduction and gluconeogenesis pathways were upregulated, especially, genes in three pyruvate metabolism pathways. These results suggested that Enterobacter sp. FY-07 could produce energy efficiently under anaerobic conditions to meet the requirement of BC biosynthesis. PMID:26911736
Iglói, Kinga; Doeller, Christian F.; Paradis, Anne-Lise; Benchenane, Karim; Berthoz, Alain; Burgess, Neil; Rondi-Reig, Laure
2015-01-01
To examine the cerebellar contribution to human spatial navigation we used functional magnetic resonance imaging and virtual reality. Our findings show that the sensory-motor requirements of navigation induce activity in cerebellar lobules and cortical areas known to be involved in the motor loop and vestibular processing. By contrast, cognitive aspects of navigation mainly induce activity in a different cerebellar lobule (VIIA Crus I). Our results demonstrate a functional link between cerebellum and hippocampus in humans and identify specific functional circuits linking lobule VIIA Crus I of the cerebellum to medial parietal, medial prefrontal, and hippocampal cortices in nonmotor aspects of navigation. They further suggest that Crus I belongs to 2 nonmotor loops, involved in different strategies: place-based navigation is supported by coherent activity between left cerebellar lobule VIIA Crus I and medial parietal cortex along with right hippocampus activity, while sequence-based navigation is supported by coherent activity between right lobule VIIA Crus I, medial prefrontal cortex, and left hippocampus. These results highlight the prominent role of the human cerebellum in both motor and cognitive aspects of navigation, and specify the cortico-cerebellar circuits by which it acts depending on the requirements of the task. PMID:24947462
Modifications of allergenicity linked to food technologies.
Moneret-Vautrin, D A
1998-01-01
The prevalence of food allergies (FA) has increased over the past fifteen years. The reasons suggested are changes in dietary behaviour and the evolution of food technologies. New cases of FA have been described with chayote, rambutan, arguta, pumpkin seeds, custard apple, and with mycoproteins from Fusarium.... Additives using food proteins are at high risk: caseinates, lysozyme, cochineal red, papaïn, alpha-amylase, lactase etc. Heating can reduce allergenicity or create neo-allergens, as well as storage, inducing the synthesis of allergenic stress or PR proteins. Aeroallergens (miles, moulds) contaminate foods and can induce allergic reactions. Involuntary contamination by peanut proteins on production lines is a problem which is not yet solved. Genetically modified plants are at risk of allergenicity, requiring methodological steps of investigations: the comparison of the amino-acid sequence of the transferred protein with the sequence of known allergens, the evaluation of thermo degradability and of the denaturation by pepsin and trypsin are required, as well as the study with sera from patients allergic to the plant producing the gene. The combination of enzymatic hydrolysis, heating, or the development of genetically modified plants may offer new alternatives towards hypoallergenic foods (57 references).
Swanson, H Lee; Xinhua Zheng; Jerman, Olga
2009-01-01
The purpose of the present study was to synthesize research that compares children with and without reading disabilities (RD) on measures of short-term memory (STM) and working memory (WM). Across a broad age, reading, and IQ range, 578 effect sizes (ESs) were computed, yielding a mean ES across studies of -.89 (SD = 1.03). A total of 257 ESs were in the moderate range for STM measures (M = -.61, 95% confidence range of -.65 to -.58), and 320 ESs were in the moderate range for WM measures (M = -.67, 95% confidence range of -.68 to -.64). The results indicated that children with RD were distinctively disadvantaged compared with average readers on (a) STM measures requiring the recall of phonemes and digit sequences and (b) WM measures requiring the simultaneous processing and storage of digits within sentence sequences and final words from unrelated sentences. No significant moderating effects emerged for age, IQ, or reading level on memory ESs. The findings indicated that domain-specific STM and WM differences between ability groups persisted across age, suggesting that a verbal deficit model that fails to efficiently draw resources from both a phonological and executive system underlies RD.
Structure and stability of the ankyrin domain of the Drosophila Notch receptor.
Zweifel, Mark E; Leahy, Daniel J; Hughson, Frederick M; Barrick, Doug
2003-11-01
The Notch receptor contains a conserved ankyrin repeat domain that is required for Notch-mediated signal transduction. The ankyrin domain of Drosophila Notch contains six ankyrin sequence repeats previously identified as closely matching the ankyrin repeat consensus sequence, and a putative seventh C-terminal sequence repeat that exhibits lower similarity to the consensus sequence. To better understand the role of the Notch ankyrin domain in Notch-mediated signaling and to examine how structure is distributed among the seven ankyrin sequence repeats, we have determined the crystal structure of this domain to 2.0 angstroms resolution. The seventh, C-terminal, ankyrin sequence repeat adopts a regular ankyrin fold, but the first, N-terminal ankyrin repeat, which contains a 15-residue insertion, appears to be largely disordered. The structure reveals a substantial interface between ankyrin polypeptides, showing a high degree of shape and charge complementarity, which may be related to homotypic interactions suggested from indirect studies. However, the Notch ankyrin domain remains largely monomeric in solution, demonstrating that this interface alone is not sufficient to promote tight association. Using the structure, we have classified reported mutations within the Notch ankyrin domain that are known to disrupt signaling into those that affect buried residues and those restricted to surface residues. We show that the buried substitutions greatly decrease protein stability, whereas the surface substitutions have only a marginal affect on stability. The surface substitutions are thus likely to interfere with Notch signaling by disrupting specific Notch-effector interactions and map the sites of these interactions.
Functions of the 3′ and 5′ genome RNA regions of members of the genus Flavivirus
Brinton, Margo A.; Basu, Mausumi
2015-01-01
The positive sense genomes of members of the genus Flavivirus in the family Flaviviridae are ~11 kb nts in length and have a 5′ type I cap but no 3′ poly A. The 5′ and 3′ terminal regions contain short conserved sequences that are proposed to be repeated remnants of an ancient sequence. However, the functions of most of these conserved sequences have not yet been determined. The terminal regions of the genome also contain multiple conserved RNA structures. Functional data for many of these structures has been obtained. Three sets of complementary 3′ and 5′ terminal region sequences, some of which are located in conserved RNA structures, interact to form a panhandle structure that is required for initiation of minus strand RNA synthesis with the 5′ terminal structure functioning as the promoter. How the switch from the terminal RNA structure base pairing to the long distance RNA-RNA interaction is triggered and regulated is not well understood but evidence suggests involvement of a cell protein binding to three sites on the 3′ terminal RNA structures and a cis-acting metastable 3′ RNA element in the 3′ terminal structure. Cell proteins may also be involved in facilitating exponential replication of nascent genomic RNA within replication vesicles at later times of infection cycle. Other conserved RNA structures and/or sequences in the 5′ and 3′ terminal regions have been proposed to regulate genome translation. Additional functions of the 5′ and 3′ terminal sequences have also been reported. PMID:25683510
GyrB sequence analysis and MALDI-TOF MS as identification tools for plant pathogenic Clavibacter.
Zaluga, Joanna; Heylen, Kim; Van Hoorde, Koenraad; Hoste, Bart; Van Vaerenbergh, Johan; Maes, Martine; De Vos, Paul
2011-09-01
The bacterial genus Clavibacter has only one species, Clavibacter michiganensis, containing five subspecies. All five are plant pathogens, among which three are recognized as quarantine pests (mentioned on the EPPO A2 list). Prevention of their introduction and epidemic outbreaks requires a reliable and accurate identification. Currently, identification of these bacteria is time consuming and often problematic, mainly because of cross-reactions with other plant-associated bacteria in immunological tests and false-negative results in PCR detection methods. Furthermore, distinguishing closely related subspecies is not straightforward. This study aimed at evaluating the use of matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) and a fragment of the gyrB sequence for the reliable and fast identification of the Clavibacter subspecies. Amplification and sequencing of gyrB using a single primer set had sufficient resolution and specificity to identify each subspecies based on both sequence similarities in cluster analyses and specific signatures within the sequences. All five subspecies also generated distinct and reproducible MALDI-TOF MS profiles, with unique and specific ion peaks for each subspecies, which could be used as biomarkers for identification. Results from both methods were in agreement and were able to distinguish the five Clavibacter subspecies from each other and from representatives of closely related Rathayibacter, Leifsonia or Curtobacterium species. Our study suggests that proteomic analysis using MALDI-TOF MS and gyrB sequence are powerful diagnostic tools for the accurate identification of Clavibacter plant pathogens. Copyright © 2011 Elsevier GmbH. All rights reserved.
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.
Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford
2017-10-01
Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Johnson, Timothy J; Kariyawasam, Subhashinie; Wannemuehler, Yvonne; Mangiamele, Paul; Johnson, Sara J; Doetkott, Curt; Skyberg, Jerod A; Lynne, Aaron M; Johnson, James R; Nolan, Lisa K
2007-04-01
Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include human uropathogenic E. coli (UPEC) and avian pathogenic E. coli (APEC). Regardless of host of origin, ExPEC strains share many traits. It has been suggested that these commonalities may enable APEC to cause disease in humans. Here, we begin to test the hypothesis that certain APEC strains possess potential to cause human urinary tract infection through virulence genotyping of 1,000 APEC and UPEC strains, generation of the first complete genomic sequence of an APEC (APEC O1:K1:H7) strain, and comparison of this genome to all available human ExPEC genomic sequences. The genomes of APEC O1 and three human UPEC strains were found to be remarkably similar, with only 4.5% of APEC O1's genome not found in other sequenced ExPEC genomes. Also, use of multilocus sequence typing showed that some of the sequenced human ExPEC strains were more like APEC O1 than other human ExPEC strains. This work provides evidence that at least some human and avian ExPEC strains are highly similar to one another, and it supports the possibility that a food-borne link between some APEC and UPEC strains exists. Future studies are necessary to assess the ability of APEC to overcome the hurdles necessary for such a food-borne transmission, and epidemiological studies are required to confirm that such a phenomenon actually occurs.
Ovaskainen, Otso; Schigel, Dmitry; Ali-Kovero, Heini; Auvinen, Petri; Paulin, Lars; Nordén, Björn; Nordén, Jenni
2013-01-01
Before the recent revolution in molecular biology, field studies on fungal communities were mostly confined to fruit bodies, whereas mycelial interactions were studied in the laboratory. Here we combine high-throughput sequencing with a fruit body inventory to study simultaneously mycelial and fruit body occurrences in a community of fungi inhabiting dead wood of Norway spruce. We studied mycelial occurrence by extracting DNA from wood samples followed by 454-sequencing of the ITS1 and ITS2 regions and an automated procedure for species identification. In total, we detected 198 species as mycelia and 137 species as fruit bodies. The correlation between mycelial and fruit body occurrences was high for the majority of the species, suggesting that high-throughput sequencing can successfully characterize the dominating fungal communities, despite possible biases related to sampling, PCR, sequencing and molecular identification. We used the fruit body and molecular data to test hypothesized links between life history and population dynamic parameters. We show that the species that have on average a high mycelial abundance also have a high fruiting rate and produce large fruit bodies, leading to a positive feedback loop in their population dynamics. Earlier studies have shown that species with specialized resource requirements are rarely seen fruiting, for which reason they are often classified as red-listed. We show with the help of high-throughput sequencing that some of these species are more abundant as mycelium in wood than what could be expected from their occurrence as fruit bodies. PMID:23575372
Learn more about the special construction scheduling/sequencing requirements and procedures necessary to assure achievement of designed Indoor Air Quality (IAQ) levels for the completed project required by the EPA IAQ Program.
Isaac, Peter; Laurie, David A.
2012-01-01
The timing of flowering during the year is an important adaptive character affecting reproductive success in plants and is critical to crop yield. Flowering time has been extensively manipulated in crops such as wheat (Triticum aestivum L.) during domestication, and this enables them to grow productively in a wide range of environments. Several major genes controlling flowering time have been identified in wheat with mutant alleles having sequence changes such as insertions, deletions or point mutations. We investigated genetic variants in commercial varieties of wheat that regulate flowering by altering photoperiod response (Ppd-B1 alleles) or vernalization requirement (Vrn-A1 alleles) and for which no candidate mutation was found within the gene sequence. Genetic and genomic approaches showed that in both cases alleles conferring altered flowering time had an increased copy number of the gene and altered gene expression. Alleles with an increased copy number of Ppd-B1 confer an early flowering day neutral phenotype and have arisen independently at least twice. Plants with an increased copy number of Vrn-A1 have an increased requirement for vernalization so that longer periods of cold are required to potentiate flowering. The results suggest that copy number variation (CNV) plays a significant role in wheat adaptation. PMID:22457747
Saetta, A; Fanourakis, G; Agapitos, E; Davaris, P S
2000-01-01
Several studies suggest that certain viral and bacterial pathogens may contribute to the process of atherogenesis. However, this relation between infectious agents and atherosclerosis has not yet been established with certainty. The aim of this study was to investigate the presence of CMV in carotid endarterectomies from 40 patients suffering from atherosclerosis using immunohistochemistry and the polymerase chain reaction (PCR). None of the specimens examined gave a positive result, indicating absence of CMV particles or CMV DNA sequences in the walls of carotid arteries. This finding suggests it is possible that CMV infection may not play a major role in the formation of atheroma. Therefore, further investigation is required in order to clarify the etiology of atherosclerosis.
Heussler, Gary E; Miller, Jon L; Price, Courtney E; Collins, Alan J; O'Toole, George A
2016-11-15
CRISPR (clustered regularly interspaced short palindromic repeat)-Cas (CRISPR-associated protein) systems are diverse and found in many archaea and bacteria. These systems have mainly been characterized as adaptive immune systems able to protect against invading mobile genetic elements, including viruses. The first step in this protection is acquisition of spacer sequences from the invader DNA and incorporation of those sequences into the CRISPR array, termed CRISPR adaptation. Progress in understanding the mechanisms and requirements of CRISPR adaptation has largely been accomplished using overexpression of cas genes or plasmid loss assays; little work has focused on endogenous CRISPR-acquired immunity from viral predation. Here, we developed a new biofilm-based assay system to enrich for Pseudomonas aeruginosa strains with new spacer acquisition. We used this assay to demonstrate that P. aeruginosa rapidly acquires spacers protective against DMS3vir, an engineered lytic variant of the Mu-like bacteriophage DMS3, through primed CRISPR adaptation from spacers present in the native CRISPR2 array. We found that for the P. aeruginosa type I-F system, the cas1 gene is required for CRISPR adaptation, recG contributes to (but is not required for) primed CRISPR adaptation, recD is dispensable for primed CRISPR adaptation, and finally, the ability of a putative priming spacer to prime can vary considerably depending on the specific sequences of the spacer. Our understanding of CRISPR adaptation has expanded largely through experiments in type I CRISPR systems using plasmid loss assays, mutants of Escherichia coli, or cas1-cas2 overexpression systems, but there has been little focus on studying the adaptation of endogenous systems protecting against a lytic bacteriophage. Here we describe a biofilm system that allows P. aeruginosa to rapidly gain spacers protective against a lytic bacteriophage. This approach has allowed us to probe the requirements for CRISPR adaptation in the endogenous type I-F system of P. aeruginosa Our data suggest that CRISPR-acquired immunity in a biofilm may be one reason that many P. aeruginosa strains maintain a CRISPR-Cas system. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Enzmann, Peter-Joachim; Castric, Jeannette; Bovo, Giuseppe; Thiery, Richard; Fichtner, Dieter; Schütze, Heike; Wahli, Thomas
2010-02-24
The fish pathogenic rhabdovirus infectious hematopoietic necrosis virus (IHNV) causes substantial losses in European aquaculture. IHNV was first detected in Europe in 1987 and has since undergone considerable spread. Phylogenetic analyses of the full G-gene sequences of 73 isolates obtained from 4 countries in Europe (France, n = 18; Italy, 9; Switzerland, 4; Germany, 42) enable determination of the evolution of the virus in Europe since the first detection, and identification of characteristic changes within the G-genes of European strains. Further, the database allows us to analyse the pathways of distribution in Europe over time. The results suggest that in most of the recent cases, spread of IHNV was related to trade of infected fish. The data further demonstrate that knowledge of the sequence is required to determine the source of infections in farms.
A rule of seven in Watson-Crick base-pairing of mismatched sequences.
Cisse, Ibrahim I; Kim, Hajin; Ha, Taekjip
2012-05-13
Sequence recognition through base-pairing is essential for DNA repair and gene regulation, but the basic rules governing this process remain elusive. In particular, the kinetics of annealing between two imperfectly matched strands is not well characterized, despite its potential importance in nucleic acid-based biotechnologies and gene silencing. Here we use single-molecule fluorescence to visualize the multiple annealing and melting reactions of two untethered strands inside a porous vesicle, allowing us to precisely quantify the annealing and melting rates. The data as a function of mismatch position suggest that seven contiguous base pairs are needed for rapid annealing of DNA and RNA. This phenomenological rule of seven may underlie the requirement for seven nucleotides of complementarity to seed gene silencing by small noncoding RNA and may help guide performance improvement in DNA- and RNA-based bio- and nanotechnologies, in which off-target effects can be detrimental.
Imbert, J; Zafarullah, M; Culotta, V C; Gedamu, L; Hamer, D
1989-01-01
Metallothionein (MT) gene promoters in higher eucaryotes contain multiple metal regulatory elements (MREs) that are responsible for the metal induction of MT gene transcription. We identified and purified to near homogeneity a 74-kilodalton mouse nuclear protein that specifically binds to certain MRE sequences. This protein, MBF-I, was purified employing as an affinity reagent a trout MRE that is shown to be functional in mouse cells but which lacks the G+C-rich and SP1-like sequences found in many mammalian MT gene promoters. Using point-mutated MREs, we showed that there is a strong correlation between DNA binding in vitro and MT gene regulation in vivo, suggesting a direct role of MBF-I in MT gene transcription. We also showed that MBF-I can induce MT gene transcription in vitro in a mouse extract and that this stimulation requires zinc. Images PMID:2586522
Cloning and characterization of the Schizosaccharomyces pombe tRNA:pseudouridine synthase Pus1p.
Hellmuth, K; Grosjean, H; Motorin, Y; Deinert, K; Hurt, E; Simos, G
2000-12-01
Saccharomyces cerevisiae cells that carry deletions in both the LOS1 (a tRNA export receptor) and the PUS1 (a tRNA:pseudouridine synthase) genes exhibit a thermosensitive growth defect. A Schizosaccharomyces pombe gene, named spPUS1, was cloned from a cDNA library by complementation of this conditional lethal phenotype. The corresponding protein, spPus1p, shows sequence similarity to S. cerevisiae and murine Pus1p as well as other known members of the pseudouridine synthase family. Accordingly, recombinant spPus1p can catalyze in vitro the formation of pseudouridines at positions 27, 28, 34, 35 and 36 of yeast tRNA transcripts. The sequence and functional conservation of the Pus1p proteins in fungi and mammalian species and their notable absence from prokaryotes suggest that this family of pseudouridine synthases is required for a eukaryote-specific step of tRNA biogenesis, such as nuclear export.
Rehearsal in immediate serial recall.
Tan, Lydia; Ward, Geoff
2008-06-01
We report for the first time overt rehearsal data in immediate serial recall (ISR) undertaken at three presentation rates (1, 2.5, and 5 sec/word). Two groups of participants saw lists of six words for ISR and were required either to engage in overt rehearsal or to remain silent after reading aloud the word list during its presentation. Typical ISR serial position effects were obtained for both groups, and recall increased with slower rates. When participants rehearsed, they tended to do so in a cumulative forward order up to Serial Position 4, after which the amount of rehearsal decreased substantially. There were similarities between rehearsal and recall data: Both broke down toward the end of longer sequences, and there were strong positive correlations between the maximum sequence of participants' rehearsals and their ISR performance. We interpret these data as suggesting that similar mechanisms underpin both rehearsal and recall in ISR.
Transcriptome Wide Annotation of Eukaryotic RNase III Reactivity and Degradation Signals
Gagnon, Jules; Lavoie, Mathieu; Catala, Mathieu; Malenfant, Francis; Elela, Sherif Abou
2015-01-01
Detection and validation of the RNA degradation signals controlling transcriptome stability are essential steps for understanding how cells regulate gene expression. Here we present complete genomic and biochemical annotations of the signals required for RNA degradation by the dsRNA specific ribonuclease III (Rnt1p) and examine its impact on transcriptome expression. Rnt1p cleavage signals are randomly distributed in the yeast genome, and encompass a wide variety of sequences, indicating that transcriptome stability is not determined by the recurrence of a fixed cleavage motif. Instead, RNA reactivity is defined by the sequence and structural context in which the cleavage sites are located. Reactive signals are often associated with transiently expressed genes, and their impact on RNA expression is linked to growth conditions. Together, the data suggest that Rnt1p reactivity is triggered by malleable RNA degradation signals that permit dynamic response to changes in growth conditions. PMID:25680180
Geisberger, Roland; Rada, Cristina; Neuberger, Michael S.
2009-01-01
The carboxyterminal region of activation-induced deaminase (AID) is required for its function in Ig class switch recombination (CSR) and also contains a nuclear-export sequence (NES). Here, based on an extensive fine-structure mutation analysis of the AID NES, as well as from AID chimeras bearing heterologous NESs, we show that while a functional NES is indeed essential for CSR, it is not sufficient. The precise nature of the NES is critical both for AID stabilization and CSR function: minor changes in the NES can perturb stabilization and CSR without jeopardizing nuclear export. The results indicate that the AID NES fulfills a function beyond simply providing a signal for nuclear export and suggest the possibility that the quality of exportin-binding may be critical to the stabilization of AID and its activity in CSR. PMID:19351893
Syed, Muhammad Ibrahim; Moorthy, Balaji T; Jenner, Andreas; Fetka, Ingrid; Jansen, Ralf-Peter
2018-05-17
Localization of mRNAs depends on specific RNA-binding proteins (RBPs) and critically contributes not only to cell polarization but also to basal cell function. The yeast RBP Khd1p binds to several hundred mRNAs, the majority of which encodes secreted or membrane proteins. We demonstrate that a subfraction of Khd1p associates with artificial liposomes and endoplasmic reticulum (ER), and that Khd1p endomembrane association is partially dependent on its binding to RNA. ER targeting of at least two mRNAs, MID2 and SLG1/WSC1, requires KHD1 but is independent of their translation. Together, our results suggest interdependence of Khd1p and mRNA for their targeting to the ER and presents additional evidence for signal sequence-independent, RBP-mediated mRNA targeting. © 2018 Federation of European Biochemical Societies.
Chain, Benjamin M; Noursadeghi, Mahdad; Gardener, Michelle; Tsang, Jhen; Wright, Edward
2008-10-23
The chemokine receptor CCR5 is required for cellular entry by many strains of HIV, and provides a potential target for molecules, including antibodies, designed to block HIV transmission. This study investigates a novel approach to stimulate antibodies to CCR5. Rabbits were immunised with chimaeric peptides which encode a short fragment of the N-terminal sequence of CCR5, as well as an unrelated T cell epitope from Tetanus toxoid. Immunisation with these chimaeric peptides generates a strong antibody response which is highly focused on the N-terminal CCR5 sequence. The antibody to the chimaeric peptide containing an N-terminal methionine also recognises the full length CCR5 receptor on the cell surface, albeit at higher concentrations. Further comparison of binding to intact CCR5 with binding to CCR5 peptide suggest that the receptor specific antibody generated represents a very small fragment of the total anti-peptide antibody. These findings are consistent with the hypothesis that the N-terminal peptide in the context of the intact receptor has a different structure to that of the synthetic peptide. Finally, the antibody was able to block HIV infection of macrophages in vitro. Thus results of this study suggest that N-terminal fragments of CCR5 may provide potential immunogens with which to generate blocking antibodies to this receptor, while avoiding the dangers of including T cell auto-epitopes.
Chain, Benjamin M.; Noursadeghi, Mahdad; Gardener, Michelle; Tsang, Jhen; Wright, Edward
2008-01-01
The chemokine receptor CCR5 is required for cellular entry by many strains of HIV, and provides a potential target for molecules, including antibodies, designed to block HIV transmission. This study investigates a novel approach to stimulate antibodies to CCR5. Rabbits were immunised with chimaeric peptides which encode a short fragment of the N-terminal sequence of CCR5, as well as an unrelated T cell epitope from Tetanus toxoid. Immunisation with these chimaeric peptides generates a strong antibody response which is highly focused on the N-terminal CCR5 sequence. The antibody to the chimaeric peptide containing an N-terminal methionine also recognises the full length CCR5 receptor on the cell surface, albeit at higher concentrations. Further comparison of binding to intact CCR5 with binding to CCR5 peptide suggest that the receptor specific antibody generated represents a very small fragment of the total anti-peptide antibody. These findings are consistent with the hypothesis that the N-terminal peptide in the context of the intact receptor has a different structure to that of the synthetic peptide. Finally, the antibody was able to block HIV infection of macrophages in vitro. Thus results of this study suggest that N-terminal fragments of CCR5 may provide potential immunogens with which to generate blocking antibodies to this receptor, while avoiding the dangers of including T cell auto-epitopes. PMID:18765264
Brok-Volchanskaya, Vera S; Kadyrov, Farid A; Sivogrivov, Dmitry E; Kolosov, Peter M; Sokolov, Andrey S; Shlyapnikov, Michael G; Kryukov, Valentine M; Granovsky, Igor E
2008-04-01
Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3' 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TpsiC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages.
Fossé, P; Motté, N; Roumier, A; Gabus, C; Muriaux, D; Darlix, J L; Paoletti, J
1996-12-24
Retroviral genomes consist of two identical RNA molecules joined noncovalently near their 5'-ends. Recently, two models have been proposed for RNA dimer formation on the basis of results obtained in vitro with human immunodeficiency virus type 1 RNA and Moloney murine leukemia virus RNA. It was first proposed that viral RNA dimerizes by forming an interstrand quadruple helix with purine tetrads. The second model postulates that RNA dimerization is initiated by a loop-loop interaction between the two RNA molecules. In order to better characterize the dimerization process of retroviral genomic RNA, we analyzed the in vitro dimerization of avian sarcoma-leukosis virus (ASLV) RNA using different transcripts. We determined the requirements for heterodimer formation, the thermal dissociation of RNA dimers, and the influence of antisense DNA oligonucleotides on dimer formation. Our results strongly suggest that purine tetrads are not involved in dimer formation. Data show that an autocomplementary sequence located upstream from the splice donor site and within a major packaging signal plays a crucial role in ASLV RNA dimer formation in vitro. This sequence is able to form a stem-loop structure, and phylogenetic analysis reveals that it is conserved in 28 different avian sarcoma and leukosis viruses. These results suggest that dimerization of ASLV RNA is initiated by a loop-loop interaction between two RNA molecules and provide an additional argument for the ubiquity of the dimerization process via loop-loop interaction.
NASA Technical Reports Server (NTRS)
Sherry, Lance; Feary, Michael; Polson, Peter; Fennell, Karl
2003-01-01
The Flight Management Computer (FMC) and its interface, the Multi-function Control and Display Unit (MCDU) have been identified by researchers and airlines as difficult to train and use. Specifically, airline pilots have described the "drinking from the fire-hose" effect during training. Previous research has identified memorized action sequences as a major factor in a user s ability to learn and operate complex devices. This paper discusses the use of a method to examine the quantity of memorized action sequences required to perform a sample of 102 tasks, using features of the Boeing 777 Flight Management Computer Interface. The analysis identified a large number of memorized action sequences that must be learned during training and then recalled during line operations. Seventy-five percent of the tasks examined require recall of at least one memorized action sequence. Forty-five percent of the tasks require recall of a memorized action sequence and occur infrequently. The large number of memorized action sequences may provide an explanation for the difficulties in training and usage of the automation. Based on these findings, implications for training and the design of new user-interfaces are discussed.
Tabata, Ryo; Kamiya, Takehiro; Shigenobu, Shuji; Yamaguchi, Katsushi; Yamada, Masashi; Hasebe, Mitsuyasu; Fujiwara, Toru; Sawa, Shinichiro
2013-01-01
Next-generation sequencing (NGS) technologies enable the rapid production of an enormous quantity of sequence data. These powerful new technologies allow the identification of mutations by whole-genome sequencing. However, most reported NGS-based mapping methods, which are based on bulked segregant analysis, are costly and laborious. To address these limitations, we designed a versatile NGS-based mapping method that consists of a combination of low- to medium-coverage multiplex SOLiD (Sequencing by Oligonucleotide Ligation and Detection) and classical genetic rough mapping. Using only low to medium coverage reduces the SOLiD sequencing costs and, since just 10 to 20 mutant F2 plants are required for rough mapping, the operation is simple enough to handle in a laboratory with limited space and funding. As a proof of principle, we successfully applied this method to identify the CTR1, which is involved in boron-mediated root development, from among a population of high boron requiring Arabidopsis thaliana mutants. Our work demonstrates that this NGS-based mapping method is a moderately priced and versatile method that can readily be applied to other model organisms. PMID:23104114
40 CFR 92.124 - Test sequence; general requirements.
Code of Federal Regulations, 2012 CFR
2012-07-01
.... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 21 2012-07-01 2012-07-01 false Test sequence; general requirements...
40 CFR 92.124 - Test sequence; general requirements.
Code of Federal Regulations, 2014 CFR
2014-07-01
.... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2014-07-01 2013-07-01 true Test sequence; general requirements. 92...
40 CFR 92.124 - Test sequence; general requirements.
Code of Federal Regulations, 2010 CFR
2010-07-01
.... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Test sequence; general requirements...
40 CFR 92.124 - Test sequence; general requirements.
Code of Federal Regulations, 2013 CFR
2013-07-01
.... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 21 2013-07-01 2013-07-01 false Test sequence; general requirements...
40 CFR 92.124 - Test sequence; general requirements.
Code of Federal Regulations, 2011 CFR
2011-07-01
.... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2011-07-01 2011-07-01 false Test sequence; general requirements...
A Novel Fission Yeast Gene, tht1 +, Is Required for the Fusion of Nuclear Envelopes during Karyogamy
Tange, Yoshie; Horio, Tetsuya; Shimanuki, Mizuki; Ding, Da-Qiao; Hiraoka, Yasushi; Niwa, Osami
1998-01-01
We have isolated a fission yeast karyogamy mutant, tht1, in which nuclear congression and the association of two spindle pole bodies occurs but the subsequent fusion of nuclear envelopes is blocked. The tht1 mutation does not prevent meiosis, so cells execute meiosis with two unfused nuclei, leading to the production of aberrant asci. The tht1 + gene was cloned and sequenced. Predicted amino acid sequence has no significant homology to previously known proteins but strongly suggests that it is a type I membrane protein. The tht1 + gene is dispensable for vegetative growth and expressed only in conjugating cells. Tht1p is a glycoprotein susceptible to endoglycosilase H digestion. Site- directed mutagenesis showed that the N-glycosylation site, as well as the COOH-terminal region of Tht1p, is essential for its function. A protease protection assay indicated that the COOH terminus is cytoplasmic. Immunocytological analysis using a HA-tagged Tht1p suggested that the protein is localized in nuclear envelopes and in the ER during karyogamy and that its levels are reduced in cells containing fused nuclei. PMID:9442101
Guillén, Natalia; Navarro, María A; Surra, Joaquín C; Arnal, Carmen; Fernández-Juan, Marta; Cebrián-Pérez, Jose Alvaro; Osada, Jesús
2007-02-15
Pig sphingomyelin synthase 1 (SMS1) cDNA was cloned, characterized and compared to the human ortholog. Porcine protein consists of 413 amino acids and displays a 97% sequence identity with human protein. A phylogenic tree of proteins reveals that porcine SMS1 is more closely related to bovine and rodent proteins than to human. Analysis of protein mass was higher than the theoretical prediction based on amino acid sequence suggesting a kind of posttranslational modification. Quantitative representation of tissue distribution obtained by real-time RT-PCR showed that it was widely expressed although important variations in levels were obtained among organs. Thus, the cardiovascular system, especially the heart, showed the highest value of all the tissues studied. Regional differences of expression were observed in the central nervous system and intestinal tract. Analysis of the hepatic mRNA and protein expressions of SMS1 following turpentine treatment revealed a progressive decrease in the former paralleled by a decrease in the protein concentration. These findings indicate the variation in expression in the different tissues might suggest a different requirement of Golgi sphingomyelin for the specific function in each organ and a regulation of the enzyme in response to turpentine-induced hepatic injury.
Akanuma, Satoshi
2017-08-06
Understanding the evolution of ancestral life, and especially the ability of some organisms to flourish in the variable environments experienced in Earth's early biosphere, requires knowledge of the characteristics and the environment of these ancestral organisms. Information about early life and environmental conditions has been obtained from fossil records and geological surveys. Recent advances in phylogenetic analysis, and an increasing number of protein sequences available in public databases, have made it possible to infer ancestral protein sequences possessed by ancient organisms. However, the in silico studies that assess the ancestral base content of ribosomal RNAs, the frequency of each amino acid in ancestral proteins, and estimate the environmental temperatures of ancient organisms, show conflicting results. The characterization of ancestral proteins reconstructed in vitro suggests that ancient organisms had very thermally stable proteins, and therefore were thermophilic or hyperthermophilic. Experimental data supports the idea that only thermophilic ancestors survived the catastrophic increase in temperature of the biosphere that was likely associated with meteorite impacts during the early history of Earth. In addition, by expanding the timescale and including more ancestral proteins for reconstruction, it appears as though the Earth's surface temperature gradually decreased over time, from Archean to present.
Sequences in the intergenic spacer influence RNA Pol I transcription from the human rRNA promoter
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, W.M.; Sylvester, J.E.
1994-09-01
In most eucaryotic species, ribosomal genes are tandemly repeated about 100-5000 times per haploid genome. The 43 Kb human rDNA repeat consists of a 13 Kb coding region for the 18S, 5.8S, 28S ribosomal RNAs (rRNAs) and transcribed spacers separated by a 30 Kb intergenic spacer. For species such as frog, mouse and rat, sequences in the intergenic spacer other than the gene promoter have been shown to modulate transcription of the ribosomal gene. These sequences are spacer promoters, enhancers and the terminator for spacer transcription. We are addressing whether the human ribosomal gene promoter is similarly influenced. In-vitro transcriptionmore » run-off assays have revealed that the 4.5 kb region (CBE), directly upstream of the gene promoter, has cis-stimulation and trans-competition properties. This suggests that the CBE fragment contains an enhancer(s) for ribosomal gene transcription. Further experiments have shown that a fragment ({approximately}1.6 kb) within the CBE fragment also has trans-competition function. Deletion subclones of this region are being tested to delineate the exact sequences responsible for these modulating activities. Previous sequence analysis and functional studies have revealed that CBE contains regions of DNA capable of adopting alternative structures such as bent DNA, Z-DNA, and triple-stranded DNA. Whether these structures are required for modulating transcription remains to be determined as does the specific DNA-protein interaction involved.« less
Smart, Matthew; Cornman, Robert S.; Iwanowicz, Deborah; McDermott-Kubeczko, Margaret; Pettis, Jeff S; Spivak, Marla S; Otto, Clint R.
2017-01-01
Taxonomic identification of pollen has historically been accomplished via light microscopy but requires specialized knowledge and reference collections, particularly when identification to lower taxonomic levels is necessary. Recently, next-generation sequencing technology has been used as a cost-effective alternative for identifying bee-collected pollen; however, this novel approach has not been tested on a spatially or temporally robust number of pollen samples. Here, we compare pollen identification results derived from light microscopy and DNA sequencing techniques with samples collected from honey bee colonies embedded within a gradient of intensive agricultural landscapes in the Northern Great Plains throughout the 2010–2011 growing seasons. We demonstrate that at all taxonomic levels, DNA sequencing was able to discern a greater number of taxa, and was particularly useful for the identification of infrequently detected species. Importantly, substantial phenological overlap did occur for commonly detected taxa using either technique, suggesting that DNA sequencing is an appropriate, and enhancing, substitutive technique for accurately capturing the breadth of bee-collected species of pollen present across agricultural landscapes. We also show that honey bees located in high and low intensity agricultural settings forage on dissimilar plants, though with overlap of the most abundantly collected pollen taxa. We highlight practical applications of utilizing sequencing technology, including addressing ecological issues surrounding land use, climate change, importance of taxa relative to abundance, and evaluating the impact of conservation program habitat enhancement efforts.
Iwanowicz, Deborah; Olson, Deanna H.; Adams, Michael J.; Adams, Cynthia; Anderson, Chauncey; Blaustein, Andrew R; Densmore, Christine L.; Figiel, Chester; Schill, William B.; Chestnut, Tara
2017-01-01
Taxonomic identification of pollen has historically been accomplished via light microscopy but requires specialized knowledge and reference collections, particularly when identification to lower taxonomic levels is necessary. Recently, next-generation sequencing technology has been used as a cost-effective alternative for identifying bee-collected pollen; however, this novel approach has not been tested on a spatially or temporally robust number of pollen samples. Here, we compare pollen identification results derived from light microscopy and DNA sequencing techniques with samples collected from honey bee colonies embedded within a gradient of intensive agricultural landscapes in the Northern Great Plains throughout the 2010–2011 growing seasons. We demonstrate that at all taxonomic levels, DNA sequencing was able to discern a greater number of taxa, and was particularly useful for the identification of infrequently detected species. Importantly, substantial phenological overlap did occur for commonly detected taxa using either technique, suggesting that DNA sequencing is an appropriate, and enhancing, substitutive technique for accurately capturing the breadth of bee-collected species of pollen present across agricultural landscapes. We also show that honey bees located in high and low intensity agricultural settings forage on dissimilar plants, though with overlap of the most abundantly collected pollen taxa. We highlight practical applications of utilizing sequencing technology, including addressing ecological issues surrounding land use, climate change, importance of taxa relative to abundance, and evaluating the impact of conservation program habitat enhancement efforts.
Zhou, Ligang; Zhou, Meixian; Sun, Chaomin; Han, Jing; Lu, Qiuhe; Zhou, Jian; Xiang, Hua
2008-08-01
The precise nick site in the double-strand origin (DSO) of pZMX201, a 1,668-bp rolling-circle replication (RCR) plasmid from the haloarchaeon Natrinema sp. CX2021, was determined by electron microscopy and DSO mapping. In this plasmid, DSO nicking occurred between residues C404 and G405 within a heptanucleotide sequence (TCTC/GGC) located in the stem region of an imperfect hairpin structure. This nick site sequence was conserved among the haloarchaeal RCR plasmids, including pNB101, suggesting that the DSO nick site might be the same for all members of this plasmid family. Interestingly, the DSOs of pZMX201 and pNB101 were found to be cross-recognized in RCR initiation and termination in a hybrid plasmid system. Mutation analysis of the DSO from pZMX201 (DSO(Z)) in this hybrid plasmid system revealed that: (i) the nucleotides in the middle of the conserved TCTCGGC sequence play more-important roles in the initiation and termination process; (ii) the left half of the hairpin structure is required for initiation but not for termination; and (iii) a 36-bp sequence containing TCTCGGC and the downstream sequence is essential and sufficient for termination. In conclusion, these haloarchaeal plasmids, with novel features that are different from the characteristics of both single-stranded DNA phages and bacterial RCR plasmids, might serve as a good model for studying the evolution of RCR replicons.
Characterization of a novel ADAM protease expressed by Pneumocystis carinii.
Kennedy, Cassie C; Kottom, Theodore J; Limper, Andrew H
2009-08-01
Pneumocystis species are opportunistic fungal pathogens that cause severe pneumonia in immunocompromised hosts. Recent evidence has suggested that unidentified proteases are involved in Pneumocystis life cycle regulation. Proteolytically active ADAM (named for "a disintegrin and metalloprotease") family molecules have been identified in some fungal organisms, such as Aspergillus fumigatus and Schizosaccharomyces pombe, and some have been shown to participate in life cycle regulation. Accordingly, we sought to characterize ADAM-like molecules in the fungal opportunistic pathogen, Pneumocystis carinii (PcADAM). After an in silico search of the P. carinii genomic sequencing project identified a 329-bp partial sequence with homology to known ADAM proteins, the full-length PcADAM sequence was obtained by PCR extension cloning, yielding a final coding sequence of 1,650 bp. Sequence analysis detected the presence of a typical ADAM catalytic active site (HEXXHXXGXXHD). Expression of PcADAM over the Pneumocystis life cycle was analyzed by Northern blot. Southern and contour-clamped homogenous electronic field blot analysis demonstrated its presence in the P. carinii genome. Expression of PcADAM was observed to be increased in Pneumocystis cysts compared to trophic forms. The full-length gene was subsequently cloned and heterologously expressed in Saccharomyces cerevisiae. Purified PcADAMp protein was proteolytically active in casein zymography, requiring divalent zinc. Furthermore, native PcADAMp extracted directly from freshly isolated Pneumocystis organisms also exhibited protease activity. This is the first report of protease activity attributable to a specific, characterized protein in the clinically important opportunistic fungal pathogen Pneumocystis.
Customisation of the exome data analysis pipeline using a combinatorial approach.
Pattnaik, Swetansu; Vaidyanathan, Srividya; Pooja, Durgad G; Deepak, Sa; Panda, Binay
2012-01-01
The advent of next generation sequencing (NGS) technologies have revolutionised the way biologists produce, analyse and interpret data. Although NGS platforms provide a cost-effective way to discover genome-wide variants from a single experiment, variants discovered by NGS need follow up validation due to the high error rates associated with various sequencing chemistries. Recently, whole exome sequencing has been proposed as an affordable option compared to whole genome runs but it still requires follow up validation of all the novel exomic variants. Customarily, a consensus approach is used to overcome the systematic errors inherent to the sequencing technology, alignment and post alignment variant detection algorithms. However, the aforementioned approach warrants the use of multiple sequencing chemistry, multiple alignment tools, multiple variant callers which may not be viable in terms of time and money for individual investigators with limited informatics know-how. Biologists often lack the requisite training to deal with the huge amount of data produced by NGS runs and face difficulty in choosing from the list of freely available analytical tools for NGS data analysis. Hence, there is a need to customise the NGS data analysis pipeline to preferentially retain true variants by minimising the incidence of false positives and make the choice of right analytical tools easier. To this end, we have sampled different freely available tools used at the alignment and post alignment stage suggesting the use of the most suitable combination determined by a simple framework of pre-existing metrics to create significant datasets.
Firth, Andrew E; Atkins, John F
2009-01-01
Japanese encephalitis, West Nile, Usutu and Murray Valley encephalitis viruses form a tight subgroup within the larger Flavivirus genus. These viruses utilize a single-polyprotein expression strategy, resulting in ~10 mature proteins. Plotting the conservation at synonymous sites along the polyprotein coding sequence reveals strong conservation peaks at the very 5' end of the coding sequence, and also at the 5' end of the sequence encoding the NS2A protein. Such peaks are generally indicative of functionally important non-coding sequence elements. The second peak corresponds to a predicted stable pseudoknot structure whose biological importance is supported by compensatory mutations that preserve the structure. The pseudoknot is preceded by a conserved slippery heptanucleotide (Y CCU UUU), thus forming a classical stimulatory motif for -1 ribosomal frameshifting. We hypothesize, therefore, that the functional importance of the pseudoknot is to stimulate a portion of ribosomes to shift -1 nt into a short (45 codon), conserved, overlapping open reading frame, termed foo. Since cleavage at the NS1-NS2A boundary is known to require synthesis of NS2A in cis, the resulting transframe fusion protein is predicted to be NS1-NS2AN-term-FOO. We hypothesize that this may explain the origin of the previously identified NS1 'extension' protein in JEV-group flaviviruses, known as NS1'. PMID:19196463
Neural-network-designed pulse sequences for robust control of singlet-triplet qubits
NASA Astrophysics Data System (ADS)
Yang, Xu-Chen; Yung, Man-Hong; Wang, Xin
2018-04-01
Composite pulses are essential for universal manipulation of singlet-triplet spin qubits. In the absence of noise, they are required to perform arbitrary single-qubit operations due to the special control constraint of a singlet-triplet qubit, while in a noisy environment, more complicated sequences have been developed to dynamically correct the error. Tailoring these sequences typically requires numerically solving a set of nonlinear equations. Here we demonstrate that these pulse sequences can be generated by a well-trained, double-layer neural network. For sequences designed for the noise-free case, the trained neural network is capable of producing almost exactly the same pulses known in the literature. For more complicated noise-correcting sequences, the neural network produces pulses with slightly different line shapes, but the robustness against noises remains comparable. These results indicate that the neural network can be a judicious and powerful alternative to existing techniques in developing pulse sequences for universal fault-tolerant quantum computation.
Elimination sequence optimization for SPAR
NASA Technical Reports Server (NTRS)
Hogan, Harry A.
1986-01-01
SPAR is a large-scale computer program for finite element structural analysis. The program allows user specification of the order in which the joints of a structure are to be eliminated since this order can have significant influence over solution performance, in terms of both storage requirements and computer time. An efficient elimination sequence can improve performance by over 50% for some problems. Obtaining such sequences, however, requires the expertise of an experienced user and can take hours of tedious effort to affect. Thus, an automatic elimination sequence optimizer would enhance productivity by reducing the analysts' problem definition time and by lowering computer costs. Two possible methods for automating the elimination sequence specifications were examined. Several algorithms based on the graph theory representations of sparse matrices were studied with mixed results. Significant improvement in the program performance was achieved, but sequencing by an experienced user still yields substantially better results. The initial results provide encouraging evidence that the potential benefits of such an automatic sequencer would be well worth the effort.
Garcia, J A; Harrich, D; Soultanakis, E; Wu, F; Mitsuyasu, R; Gaynor, R B
1989-01-01
The human immunodeficiency virus (HIV) type 1 LTR is regulated at the transcriptional level by both cellular and viral proteins. Using HeLa cell extracts, multiple regions of the HIV LTR were found to serve as binding sites for cellular proteins. An untranslated region binding protein UBP-1 has been purified and fractions containing this protein bind to both the TAR and TATA regions. To investigate the role of cellular proteins binding to both the TATA and TAR regions and their potential interaction with other HIV DNA binding proteins, oligonucleotide-directed mutagenesis of both these regions was performed followed by DNase I footprinting and transient expression assays. In the TATA region, two direct repeats TC/AAGC/AT/AGCTGC surround the TATA sequence. Mutagenesis of both of these direct repeats or of the TATA sequence interrupted binding over the TATA region on the coding strand, but only a mutation of the TATA sequence affected in vivo assays for tat-activation. In addition to TAR serving as the site of binding of cellular proteins, RNA transcribed from TAR is capable of forming a stable stem-loop structure. To determine the relative importance of DNA binding proteins as compared to secondary structure, oligonucleotide-directed mutations in the TAR region were studied. Local mutations that disrupted either the stem or loop structure were defective in gene expression. However, compensatory mutations which restored base pairing in the stem resulted in complete tat-activation. This indicated a significant role for the stem-loop structure in HIV gene expression. To determine the role of TAR binding proteins, mutations were constructed which extensively changed the primary structure of the TAR region, yet left stem base pairing, stem energy and the loop sequence intact. These mutations resulted in decreased protein binding to TAR DNA and defects in tat-activation, and revealed factor binding specifically to the loop DNA sequence. Further mutagenesis which inverted this stem and loop mutation relative to the HIV LTR mRNA start site resulted in even larger decreases in tat-activation. This suggests that multiple determinants, including protein binding, the loop sequence, and RNA or DNA secondary structure, are important in tat-activation and suggests that tat may interact with cellular proteins binding to DNA to increase HIV gene expression. Images PMID:2721501
Nuclear localization of Schizosaccharomyces pombe Mcm2/Cdc19p requires MCM complex assembly.
Pasion, S G; Forsburg, S L
1999-12-01
The minichromosome maintenance (MCM) proteins MCM2-MCM7 are conserved eukaryotic replication factors that assemble in a heterohexameric complex. In fission yeast, these proteins are nuclear throughout the cell cycle. In studying the mechanism that regulates assembly of the MCM complex, we analyzed the cis and trans elements required for nuclear localization of a single subunit, Mcm2p. Mutation of any single mcm gene leads to redistribution of wild-type MCM subunits to the cytoplasm, and this redistribution depends on an active nuclear export system. We identified the nuclear localization signal sequences of Mcm2p and showed that these are required for nuclear targeting of other MCM subunits. In turn, Mcm2p must associate with other MCM proteins for its proper localization; nuclear localization of MCM proteins thus requires assembly of MCM proteins in a complex. We suggest that coupling complex assembly to nuclear targeting and retention ensures that only intact heterohexameric MCM complexes remain nuclear.
Nuclear Localization of Schizosaccharomyces pombe Mcm2/Cdc19p Requires MCM Complex Assembly
Pasion, Sally G.; Forsburg, Susan L.
1999-01-01
The minichromosome maintenance (MCM) proteins MCM2–MCM7 are conserved eukaryotic replication factors that assemble in a heterohexameric complex. In fission yeast, these proteins are nuclear throughout the cell cycle. In studying the mechanism that regulates assembly of the MCM complex, we analyzed the cis and trans elements required for nuclear localization of a single subunit, Mcm2p. Mutation of any single mcm gene leads to redistribution of wild-type MCM subunits to the cytoplasm, and this redistribution depends on an active nuclear export system. We identified the nuclear localization signal sequences of Mcm2p and showed that these are required for nuclear targeting of other MCM subunits. In turn, Mcm2p must associate with other MCM proteins for its proper localization; nuclear localization of MCM proteins thus requires assembly of MCM proteins in a complex. We suggest that coupling complex assembly to nuclear targeting and retention ensures that only intact heterohexameric MCM complexes remain nuclear. PMID:10588642
Shotgun Protein Sequencing with Meta-contig Assembly*
Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno
2012-01-01
Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278
Shotgun protein sequencing with meta-contig assembly.
Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno
2012-10-01
Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
2010-01-01
Background Comparative genomics methods such as phylogenetic profiling can mine powerful inferences from inherently noisy biological data sets. We introduce Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL), a method that applies the Partial Phylogenetic Profiling (PPP) approach locally within a protein sequence to discover short sequence signatures associated with functional sites. The approach is based on the basic scoring mechanism employed by PPP, namely the use of binomial distribution statistics to optimize sequence similarity cutoffs during searches of partitioned training sets. Results Here we illustrate and validate the ability of the SIMBAL method to find functionally relevant short sequence signatures by application to two well-characterized protein families. In the first example, we partitioned a family of ABC permeases using a metabolic background property (urea utilization). Thus, the TRUE set for this family comprised members whose genome of origin encoded a urea utilization system. By moving a sliding window across the sequence of a permease, and searching each subsequence in turn against the full set of partitioned proteins, the method found which local sequence signatures best correlated with the urea utilization trait. Mapping of SIMBAL "hot spots" onto crystal structures of homologous permeases reveals that the significant sites are gating determinants on the cytosolic face rather than, say, docking sites for the substrate-binding protein on the extracellular face. In the second example, we partitioned a protein methyltransferase family using gene proximity as a criterion. In this case, the TRUE set comprised those methyltransferases encoded near the gene for the substrate RF-1. SIMBAL identifies sequence regions that map onto the substrate-binding interface while ignoring regions involved in the methyltransferase reaction mechanism in general. Neither method for training set construction requires any prior experimental characterization. Conclusions SIMBAL shows that, in functionally divergent protein families, selected short sequences often significantly outperform their full-length parent sequence for making functional predictions by sequence similarity, suggesting avenues for improved functional classifiers. When combined with structural data, SIMBAL affords the ability to localize and model functional sites. PMID:20102603
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.
Meinicke, Peter
2009-09-02
Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
Science Opportunity Analyzer (SOA): Not Just Another Pretty Face
NASA Technical Reports Server (NTRS)
Polanskey, Carol A.; Streiiffert, Barbara; O'Reilly, Taifun
2004-01-01
This viewgraph presentation reviews the Science Opportunity Analyzer (SOA). For the first time at JPL, the Cassini mission to Saturn is using distributed science operations for sequence generation. This means that scientist at other institutions has more responsibility to build the spacecraft sequence. Tools are required to support the sequence development. JPL tools required a complete configuration behind a firewall, and the tools that the user community had developed did not interface with the JPL tools. Therefore the SOA was created to bridge the gap between the remote scientists and the JPL operations teams. The presentation reviews the development of the SOA, and what was required of the system. The presentation reviews the functions that the SOA performed.
Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay
2013-01-01
Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing has encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms, however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage which are known to impact genome assembly are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E.coli (4.6 MB) S.kudriavzevii (11.18 MB) and C.elegans (100 MB) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing which will enable optimum utilization of sequencing as well as analysis resources.
Complete genome sequence of a potyvirus infecting yam beans (Pachyrhizus spp.) in Peru.
Fuentes, Segundo; Heider, Bettina; Tasso, Ruby Carolina; Romero, Elisa; Zum Felde, Thomas; Kreuze, Jan Frederik
2012-04-01
In 2010, yam beans in a field trial in Peru showed viral disease symptoms. Graft-transmission and positive ELISA results using potyvirus-specific antibodies suggested that the symptoms could be the result of a potyviral infection. Small interfering RNA (siRNA) were extracted from one of the samples and sent for high-throughput sequencing. The full genome of a new potyvirus could be assembled from the resulting siRNA sequences, and it was sufficiently different from other sequences to be considered a member of a new species, which we have designated Yam bean mosaic virus (YBMV). Sequence similarity suggests that YBMV has also been detected in yam beans in Indonesia.
Phillips, C; Gettings, K Butler; King, J L; Ballard, D; Bodner, M; Borsuk, L; Parson, W
2018-05-01
The STR sequence template file published in 2016 as part of the considerations from the DNA Commission of the International Society for Forensic Genetics on minimal STR sequence nomenclature requirements, has been comprehensively revised and audited using the latest GRCh38 genome assembly. The list of forensic STRs characterized was expanded by including supplementary autosomal, X- and Y-chromosome microsatellites in less common use for routine DNA profiling, but some likely to be adopted in future massively parallel sequencing (MPS) STR panels. We outline several aspects of sequence alignment and annotation that required care and attention to detail when comparing sequences to GRCh37 and GRCh38 assemblies, as well as the necessary matching of MPS-based allele descriptions to previously established repeat region structures described in initial sequencing studies of the less well known forensic STRs. The revised sequence guide is now available in a dynamically updated FTP format from the STRidER website with a date-stamped change log to allow users to explore their own MPS data with the most up-to-date forensic STR sequence information compiled in a simple guide. Copyright © 2018 Elsevier B.V. All rights reserved.
Pancoska, Petr; Moravek, Zdenek; Moll, Ute M
2004-01-01
Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.
Chen, Hui; Luthra, Rajyalakshmi; Goswami, Rashmi S; Singh, Rajesh R; Roy-Chowdhuri, Sinchita
2015-08-28
Application of next-generation sequencing (NGS) technology to routine clinical practice has enabled characterization of personalized cancer genomes to identify patients likely to have a response to targeted therapy. The proper selection of tumor sample for downstream NGS based mutational analysis is critical to generate accurate results and to guide therapeutic intervention. However, multiple pre-analytic factors come into play in determining the success of NGS testing. In this review, we discuss pre-analytic requirements for AmpliSeq PCR-based sequencing using Ion Torrent Personal Genome Machine (PGM) (Life Technologies), a NGS sequencing platform that is often used by clinical laboratories for sequencing solid tumors because of its low input DNA requirement from formalin fixed and paraffin embedded tissue. The success of NGS mutational analysis is affected not only by the input DNA quantity but also by several other factors, including the specimen type, the DNA quality, and the tumor cellularity. Here, we review tissue requirements for solid tumor NGS based mutational analysis, including procedure types, tissue types, tumor volume and fraction, decalcification, and treatment effects.
Wld S protein requires Nmnat activity and a short N-terminal sequence to protect axons in mice.
Conforti, Laura; Wilbrey, Anna; Morreale, Giacomo; Janeckova, Lucie; Beirowski, Bogdan; Adalbert, Robert; Mazzola, Francesca; Di Stefano, Michele; Hartley, Robert; Babetto, Elisabetta; Smith, Trevor; Gilley, Jonathan; Billington, Richard A; Genazzani, Armando A; Ribchester, Richard R; Magni, Giulio; Coleman, Michael
2009-02-23
The slow Wallerian degeneration (Wld(S)) protein protects injured axons from degeneration. This unusual chimeric protein fuses a 70-amino acid N-terminal sequence from the Ube4b multiubiquitination factor with the nicotinamide adenine dinucleotide-synthesizing enzyme nicotinamide mononucleotide adenylyl transferase 1. The requirement for these components and the mechanism of Wld(S)-mediated neuroprotection remain highly controversial. The Ube4b domain is necessary for the protective phenotype in mice, but precisely which sequence is essential and why are unclear. Binding to the AAA adenosine triphosphatase valosin-containing protein (VCP)/p97 is the only known biochemical property of the Ube4b domain. Using an in vivo approach, we show that removing the VCP-binding sequence abolishes axon protection. Replacing the Wld(S) VCP-binding domain with an alternative ataxin-3-derived VCP-binding sequence restores its protective function. Enzyme-dead Wld(S) is unable to delay Wallerian degeneration in mice. Thus, neither domain is effective without the function of the other. Wld(S) requires both of its components to protect axons from degeneration.
Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide
2011-09-01
Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.
Benslimane, A A; Dron, M; Hartmann, C; Rode, A
1986-01-01
Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Chen, C N; Su, Y; Baybayan, P; Siruno, A; Nagaraja, R; Mazzarella, R; Schlessinger, D; Chen, E
1996-01-01
Ordered shotgun sequencing (OSS) has been successfully carried out with an Xq25 YAC substrate. yWXD703 DNA was subcloned into lambda phage and sequences of insert ends of the lambda subclones were used to generate a map to select a minimum tiling path of clones to be completely sequenced. The sequence of 135 038 nt contains the entire ANT2 cDNA as well as four other candidates suggested by computer-assisted analyses. One of the putative genes is homologous to a gene implicated in Graves' disease and it, ANT2 and two others are confirmed by EST matches. The results suggest that OSS can be applied to YACs in accord with earlier simulations and further indicate that the sequence of the YAC accurately reflects the sequence of uncloned human DNA. PMID:8918809
A novel, privacy-preserving cryptographic approach for sharing sequencing data
Cassa, Christopher A; Miller, Rachel A; Mandl, Kenneth D
2013-01-01
Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual’s genetic sequence. This scheme requires access to a subset of an individual’s genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual’s genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual’s genetic sequence. PMID:23125421
Restricted transfer of learning between unimanual and bimanual finger sequences
Bai, Wenjun
2016-01-01
When training bimanual skills, such as playing piano, people sometimes practice each hand separately and at a later stage combine the movements of the two hands. This poses the critical question of whether motor skills can be acquired by separately practicing each subcomponent or should be trained as a whole. In the present study, we addressed this question by training human subjects for 4 days in a unimanual or bimanual version of the discrete sequence production task. Both groups were then tested on trained and untrained sequences on both unimanual and bimanual versions of the task. Surprisingly, we found no evidence of transfer from trained unimanual to bimanual or from trained bimanual to unimanual sequences. In half the participants, we also investigated whether cuing the sequences on the left and right hand with unique letters would change transfer. With these cues, untrained sequences that shared some components with the trained sequences were performed more quickly than sequences that did not. However, the amount of this transfer was limited to ∼10% of the overall sequence-specific learning gains. These results suggest that unimanual and bimanual sequences are learned in separate representations. Making participants aware of the interrelationship between sequences can induce some transferrable component, although the main component of the skill remains unique to unimanual or bimanual execution. NEW & NOTEWORTHY Studies in reaching movement demonstrated that approximately half of motor learning can transfer across unimanual and bimanual contexts, suggesting that neural representations for unimanual and bimanual movements are fairly overlapping at the level of elementary movement. In this study, we show that little or no transfer occurred across unimanual and bimanual sequential finger movements. This result suggests that bimanual sequences are represented at a level of the motor hierarchy that integrates movements of both hands. PMID:27974447
Khansa, Ibrahim; Hall, Courtney; Madhoun, Lauren L; Splaingard, Mark; Baylis, Adriane; Kirschner, Richard E; Pearson, Gregory D
2017-04-01
Pierre Robin sequence is characterized by mandibular retrognathia and glossoptosis resulting in airway obstruction and feeding difficulties. When conservative management fails, mandibular distraction osteogenesis or tongue-lip adhesion may be required to avoid tracheostomy. The authors' goal was to prospectively evaluate the airway and feeding outcomes of their comprehensive approach to Pierre Robin sequence, which includes conservative management, mandibular distraction osteogenesis, and tongue-lip adhesion. A longitudinal study of newborns with Pierre Robin sequence treated at a pediatric academic medical center between 2010 and 2015 was performed. Baseline feeding and respiratory data were collected. Patients underwent conservative management if they demonstrated sustainable weight gain without tube feeds, and if their airway was stable with positioning alone. Patients who required surgery underwent tongue-lip adhesion or mandibular distraction osteogenesis based on family and surgeon preference. Postoperative airway and feeding data were collected. Twenty-eight patients with Pierre Robin sequence were followed prospectively. Thirty-two percent had a syndrome. Ten underwent mandibular distraction osteogenesis, eight underwent tongue-lip adhesion, and 10 were treated conservatively. There were no differences in days to extubation or discharge, change in weight percentile, requirement for gastrostomy tube, or residual obstructive sleep apnea between the three groups. No patients required tracheostomy. The greatest reduction in apnea-hypopnea index occurred with mandibular distraction osteogenesis, followed by tongue-lip adhesion and conservative management. Careful selection of which patients with Pierre Robin sequence need surgery, and of the most appropriate surgical procedure for each patient, can minimize the need for postprocedure tracheostomy. A comprehensive approach to Pierre Robin sequence that includes conservative management, mandibular distraction osteogenesis, and tongue-lip adhesion can result in excellent airway and feeding outcomes. Therapeutic, II.
Martin, Guillaume; Baurens, Franc-Christophe; Droc, Gaëtan; Rouard, Mathieu; Cenci, Alberto; Kilian, Andrzej; Hastie, Alex; Doležel, Jaroslav; Aury, Jean-Marc; Alberti, Adriana; Carreel, Françoise; D'Hont, Angélique
2016-03-16
Recent advances in genomics indicate functional significance of a majority of genome sequences and their long range interactions. As a detailed examination of genome organization and function requires very high quality genome sequence, the objective of this study was to improve reference genome assembly of banana (Musa acuminata). We have developed a modular bioinformatics pipeline to improve genome sequence assemblies, which can handle various types of data. The pipeline comprises several semi-automated tools. However, unlike classical automated tools that are based on global parameters, the semi-automated tools proposed an expert mode for a user who can decide on suggested improvements through local compromises. The pipeline was used to improve the draft genome sequence of Musa acuminata. Genotyping by sequencing (GBS) of a segregating population and paired-end sequencing were used to detect and correct scaffold misassemblies. Long insert size paired-end reads identified scaffold junctions and fusions missed by automated assembly methods. GBS markers were used to anchor scaffolds to pseudo-molecules with a new bioinformatics approach that avoids the tedious step of marker ordering during genetic map construction. Furthermore, a genome map was constructed and used to assemble scaffolds into super scaffolds. Finally, a consensus gene annotation was projected on the new assembly from two pre-existing annotations. This approach reduced the total Musa scaffold number from 7513 to 1532 (i.e. by 80%), with an N50 that increased from 1.3 Mb (65 scaffolds) to 3.0 Mb (26 scaffolds). 89.5% of the assembly was anchored to the 11 Musa chromosomes compared to the previous 70%. Unknown sites (N) were reduced from 17.3 to 10.0%. The release of the Musa acuminata reference genome version 2 provides a platform for detailed analysis of banana genome variation, function and evolution. Bioinformatics tools developed in this work can be used to improve genome sequence assemblies in other species.
FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.
El-Manzalawy, Yasser; Abbas, Mostafa; Malluhi, Qutaibah; Honavar, Vasant
2016-01-01
A wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses are mediated by RNA-protein interactions. However, experimental determination of the structures of protein-RNA complexes is expensive and technically challenging. Hence, a number of computational tools have been developed for predicting protein-RNA interfaces. Some of the state-of-the-art protein-RNA interface predictors rely on position-specific scoring matrix (PSSM)-based encoding of the protein sequences. The computational efforts needed for generating PSSMs severely limits the practical utility of protein-RNA interface prediction servers. In this work, we experiment with two approaches, random sampling and sequence similarity reduction, for extracting a representative reference database of protein sequences from more than 50 million protein sequences in UniRef100. Our results suggest that random sampled databases produce better PSSM profiles (in terms of the number of hits used to generate the profile and the distance of the generated profile to the corresponding profile generated using the entire UniRef100 data as well as the accuracy of the machine learning classifier trained using these profiles). Based on our results, we developed FastRNABindR, an improved version of RNABindR for predicting protein-RNA interface residues using PSSM profiles generated using 1% of the UniRef100 sequences sampled uniformly at random. To the best of our knowledge, FastRNABindR is the only protein-RNA interface residue prediction online server that requires generation of PSSM profiles for query sequences and accepts hundreds of protein sequences per submission. Our approach for determining the optimal BLAST database for a protein-RNA interface residue classification task has the potential of substantially speeding up, and hence increasing the practical utility of, other amino acid sequence based predictors of protein-protein and protein-DNA interfaces.
2014-01-01
Background Protein sequence similarities to any types of non-globular segments (coiled coils, low complexity regions, transmembrane regions, long loops, etc. where either positional sequence conservation is the result of a very simple, physically induced pattern or rather integral sequence properties are critical) are pertinent sources for mistaken homologies. Regretfully, these considerations regularly escape attention in large-scale annotation studies since, often, there is no substitute to manual handling of these cases. Quantitative criteria are required to suppress events of function annotation transfer as a result of false homology assignments. Results The sequence homology concept is based on the similarity comparison between the structural elements, the basic building blocks for conferring the overall fold of a protein. We propose to dissect the total similarity score into fold-critical and other, remaining contributions and suggest that, for a valid homology statement, the fold-relevant score contribution should at least be significant on its own. As part of the article, we provide the DissectHMMER software program for dissecting HMMER2/3 scores into segment-specific contributions. We show that DissectHMMER reproduces HMMER2/3 scores with sufficient accuracy and that it is useful in automated decisions about homology for instructive sequence examples. To generalize the dissection concept for cases without 3D structural information, we find that a dissection based on alignment quality is an appropriate surrogate. The approach was applied to a large-scale study of SMART and PFAM domains in the space of seed sequences and in the space of UniProt/SwissProt. Conclusions Sequence similarity core dissection with regard to fold-critical and other contributions systematically suppresses false hits and, additionally, recovers previously obscured homology relationships such as the one between aquaporins and formate/nitrite transporters that, so far, was only supported by structure comparison. PMID:24890864
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Cui, Hongguang
2016-01-01
ABSTRACT The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. IMPORTANCE Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature proteins, with the majority of them having been at least partially functionally characterized. However, the functional role of a small protein named 6K1 remains obscure. In this study, we showed that deletion of 6K1 or a short motif/region of 6K1 in the full-length cDNA clones of plum pox virus abolishes viral replication and that mutation of the N- or C-terminal cleavage sites of 6K1 to prevent its release from the polyprotein greatly attenuates or completely inhibits viral replication, suggesting its important role in potyviral infection. We report that 6K1 forms punctate structures and targets the replication vesicles in PPV-infected plant leaf cells at the early infection stage. Our data reveal that 6K1 is an important viral protein of the potyviral replication complex. PMID:26962227
NASA Technical Reports Server (NTRS)
Gladden, Roy
2007-01-01
Version 2.0 of the autogen software has been released. "Autogen" (automated sequence generation) signifies both a process and software used to implement the process of automated generation of sequences of commands in a standard format for uplink to spacecraft. Autogen requires fewer workers than are needed for older manual sequence-generation processes and reduces sequence-generation times from weeks to minutes.
Butler-Cole, Christine; Wagner, Mary J; Da Silva, Melissa; Brown, Gordon D; Burke, Robert D; Upton, Chris
2007-07-24
Profilins are critical to cytoskeletal dynamics in eukaryotes; however, little is known about their viral counterparts. In this study, a poxviral profilin homolog, ectromelia virus strain Moscow gene 141 (ECTV-PH), was investigated by a variety of experimental and bioinformatics techniques to characterize its interactions with cellular and viral proteins. Profilin-like proteins are encoded by all orthopoxviruses sequenced to date, and share over 90% amino acid (aa) identity. Sequence comparisons show highest similarity to mammalian type 1 profilins; however, a conserved 3 aa deletion in mammalian type 3 and poxviral profilins suggests that these homologs may be more closely related. Structural analysis shows that ECTV-PH can be successfully modelled onto both the profilin 1 crystal structure and profilin 3 homology model, though few of the surface residues thought to be required for binding actin, poly(L-proline), and PIP2 are conserved. Immunoprecipitation and mass spectrometry identified two proteins that interact with ECTV-PH within infected cells: alpha-tropomyosin, a 38 kDa cellular actin-binding protein, and the 84 kDa product of vaccinia virus strain Western Reserve (VACV-WR) 148, which is the truncated VACV counterpart of the orthopoxvirus A-type inclusion (ATI) protein. Western and far-western blots demonstrated that the interaction with alpha-tropomyosin is direct, and immunofluorescence experiments suggest that ECTV-PH and alpha-tropomyosin may colocalize to structures that resemble actin tails and cellular protrusions. Sequence comparisons of the poxviral ATI proteins show that although full-length orthologs are only present in cowpox and ectromelia viruses, an ~ 700 aa truncated ATI protein is conserved in over 90% of sequenced orthopoxviruses. Immunofluorescence studies indicate that ECTV-PH localizes to cytoplasmic inclusion bodies formed by both truncated and full-length versions of the viral ATI protein. Furthermore, colocalization of ECTV-PH and truncated ATI protein to protrusions from the cell surface was observed. These results suggest a role for ECTV-PH in intracellular transport of viral proteins or intercellular spread of the virus. Broader implications include better understanding of the virus-host relationship and mechanisms by which cells organize and control the actin cytoskeleton.
de Castro, Minique Hilda; de Klerk, Daniel; Pienaar, Ronel; Rees, D Jasper G; Mans, Ben J
2017-08-10
Ticks secrete a diverse mixture of secretory proteins into the host to evade its immune response and facilitate blood-feeding, making secretory proteins attractive targets for the production of recombinant anti-tick vaccines. The largely neglected tick species, Rhipicephalus zambeziensis, is an efficient vector of Theileria parva in southern Africa but its available sequence information is limited. Next generation sequencing has advanced sequence availability for ticks in recent years and has assisted the characterisation of secretory proteins. This study focused on the de novo assembly and annotation of the salivary gland transcriptome of R. zambeziensis and the temporal expression of secretory protein transcripts in female and male ticks, before the onset of feeding and during early and late feeding. The sialotranscriptome of R. zambeziensis yielded 23,631 transcripts from which 13,584 non-redundant proteins were predicted. Eighty-six percent of these contained a predicted start and stop codon and were estimated to be putatively full-length proteins. A fifth (2569) of the predicted proteins were annotated as putative secretory proteins and explained 52% of the expression in the transcriptome. Expression analyses revealed that 2832 transcripts were differentially expressed among feeding time points and 1209 between the tick sexes. The expression analyses further indicated that 57% of the annotated secretory protein transcripts were differentially expressed. Dynamic expression profiles of secretory protein transcripts were observed during feeding of female ticks. Whereby a number of transcripts were upregulated during early feeding, presumably for feeding site establishment and then during late feeding, 52% of these were downregulated, indicating that transcripts were required at specific feeding stages. This suggested that secretory proteins are under stringent transcriptional regulation that fine-tunes their expression in salivary glands during feeding. No open reading frames were predicted for 7947 transcripts. This class represented 17% of the differentially expressed transcripts, suggesting a potential transcriptional regulatory function of long non-coding RNA in tick blood-feeding. The assembled sialotranscriptome greatly expands the sequence availability of R. zambeziensis, assists in our understanding of the transcription of secretory proteins during blood-feeding and will be a valuable resource for future vaccine candidate selection.
Rapid amplification of 5' complementary DNA ends (5' RACE).
2005-08-01
This method is used to extend partial cDNA clones by amplifying the 5' sequences of the corresponding mRNAs 1-3. The technique requires knowledge of only a small region of sequence within the partial cDNA clone. During PCR, the thermostable DNA polymerase is directed to the appropriate target RNA by a single primer derived from the region of known sequence; the second primer required for PCR is complementary to a general feature of the target-in the case of 5' RACE, to a homopolymeric tail added (via terminal transferase) to the 3' termini of cDNAs transcribed from a preparation of mRNA. This synthetic tail provides a primer-binding site upstream of the unknown 5' sequence of the target mRNA. The products of the amplification reaction are cloned into a plasmid vector for sequencing and subsequent manipulation.
Multi-Objective Optimization of Spacecraft Trajectories for Small-Body Coverage Missions
NASA Technical Reports Server (NTRS)
Hinckley, David, Jr.; Englander, Jacob; Hitt, Darren
2017-01-01
Visual coverage of surface elements of a small-body object requires multiple images to be taken that meet many requirements on their viewing angles, illumination angles, times of day, and combinations thereof. Designing trajectories capable of maximizing total possible coverage may not be useful since the image target sequence and the feasibility of said sequence given the rotation-rate limitations of the spacecraft are not taken into account. This work presents a means of optimizing, in a multi-objective manner, surface target sequences that account for such limitations.
Bodini, Margherita; Ronchini, Chiara; Giacò, Luciano; Russo, Anna; Melloni, Giorgio E. M.; Luzi, Lucilla; Sardella, Domenico; Volorio, Sara; Hasan, Syed K.; Ottone, Tiziana; Lavorgna, Serena; Lo-Coco, Francesco; Candoni, Anna; Fanin, Renato; Toffoletti, Eleonora; Iacobucci, Ilaria; Martinelli, Giovanni; Cignetti, Alessandro; Tarella, Corrado; Bernard, Loris; Pelicci, Pier Giuseppe
2015-01-01
The analyses carried out using 2 different bioinformatics pipelines (SomaticSniper and MuTect) on the same set of genomic data from 133 acute myeloid leukemia (AML) patients, sequenced inside the Cancer Genome Atlas project, gave discrepant results. We subsequently tested these 2 variant-calling pipelines on 20 leukemia samples from our series (19 primary AMLs and 1 secondary AML). By validating many of the predicted somatic variants (variant allele frequencies ranging from 100% to 5%), we observed significantly different calling efficiencies. In particular, despite relatively high specificity, sensitivity was poor in both pipelines resulting in a high rate of false negatives. Our findings raise the possibility that landscapes of AML genomes might be more complex than previously reported and characterized by the presence of hundreds of genes mutated at low variant allele frequency, suggesting that the application of genome sequencing to the clinic requires a careful and critical evaluation. We think that improvements in technology and workflow standardization, through the generation of clear experimental and bioinformatics guidelines, are fundamental to translate the use of next-generation sequencing from research to the clinic and to transform genomic information into better diagnosis and outcomes for the patient. PMID:25499761
Yeast One-Hybrid Gγ Recruitment System for Identification of Protein Lipidation Motifs
Fukuda, Nobuo; Doi, Motomichi; Honda, Shinya
2013-01-01
Fatty acids and isoprenoids can be covalently attached to a variety of proteins. These lipid modifications regulate protein structure, localization and function. Here, we describe a yeast one-hybrid approach based on the Gγ recruitment system that is useful for identifying sequence motifs those influence lipid modification to recruit proteins to the plasma membrane. Our approach facilitates the isolation of yeast cells expressing lipid-modified proteins via a simple and easy growth selection assay utilizing G-protein signaling that induces diploid formation. In the current study, we selected the N-terminal sequence of Gα subunits as a model case to investigate dual lipid modification, i.e., myristoylation and palmitoylation, a modification that is widely conserved from yeast to higher eukaryotes. Our results suggest that both lipid modifications are required for restoration of G-protein signaling. Although we could not differentiate between myristoylation and palmitoylation, N-terminal position 7 and 8 play some critical role. Moreover, we tested the preference for specific amino-acid residues at position 7 and 8 using library-based screening. This new approach will be useful to explore protein-lipid associations and to determine the corresponding sequence motifs. PMID:23922919
Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks*
Bandeira, Nuno
2016-01-01
Peptide and protein identification remains challenging in organisms with poorly annotated or rapidly evolving genomes, as are commonly encountered in environmental or biofuels research. Such limitations render tandem mass spectrometry (MS/MS) database search algorithms ineffective as they lack corresponding sequences required for peptide-spectrum matching. We address this challenge with the spectral networks approach to (1) match spectra of orthologous peptides across multiple related species and then (2) propagate peptide annotations from identified to unidentified spectra. We here present algorithms to assess the statistical significance of spectral alignments (Align-GF), reduce the impurity in spectral networks, and accurately estimate the error rate in propagated identifications. Analyzing three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides from highly divergent sequences from networks with dozens of variant peptides, including thousands of peptides in species lacking a sequenced genome. Our analysis further detected the presence of many novel putative peptides even in genomically characterized species, thus suggesting the possibility of gaps in our understanding of their proteomic and genomic expression. A web-based pipeline for spectral networks analysis is available at http://proteomics.ucsd.edu/software. PMID:27609420
Optimization of cDNA-AFLP experiments using genomic sequence data.
Kivioja, Teemu; Arvas, Mikko; Saloheimo, Markku; Penttilä, Merja; Ukkonen, Esko
2005-06-01
cDNA amplified fragment length polymorphism (cDNA-AFLP) is one of the few genome-wide level expression profiling methods capable of finding genes that have not yet been cloned or even predicted from sequence but have interesting expression patterns under the studied conditions. In cDNA-AFLP, a complex cDNA mixture is divided into small subsets using restriction enzymes and selective PCR. A large cDNA-AFLP experiment can require a substantial amount of resources, such as hundreds of PCR amplifications and gel electrophoresis runs, followed by manual cutting of a large number of bands from the gels. Our aim was to test whether this workload can be reduced by rational design of the experiment. We used the available genomic sequence information to optimize cDNA-AFLP experiments beforehand so that as many transcripts as possible could be profiled with a given amount of resources. Optimization of the selection of both restriction enzymes and selective primers for cDNA-AFLP experiments has not been performed previously. The in silico tests performed suggest that substantial amounts of resources can be saved by the optimization of cDNA-AFLP experiments.
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies
Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.
2015-01-01
APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA. PMID:26503602
Henz, Sonja; Kutz, Dieter F.; Werner, Jana; Hürster, Walter; Kolb, Florian P.; Nida-Ruemelin, Julian
2015-01-01
The aim of the study was to determine whether a deliberative process, leading to a motor action, is detectable in high density EEG recordings. Subjects were required to press one of two buttons. In a simple motor task the subject knew which button to press, whilst in a color-word Stroop task subjects had to press the right button with the right index finger when meaning and color coincided, or the left button with the left index finger when meaning and color were disparate. EEG recordings obtained during the simple motor task showed a sequence of positive (P) and negative (N) cortical potentials (P1-N1-P2) which are assumed to be related to the processing of the movement. The sequence of cortical potentials was similar in EEG recordings of subjects having to deliberate over how to respond, but the above sequence (P1-N1-P2) was preceded by slowly increasing negativity (N0), with N0 being assumed to represent the end of the deliberation process. Our data suggest the existence of neurophysiological correlates of deliberative processes. PMID:26190987
Sarkar, Mohosin; Liu, Yun; Qi, Junpeng; Peng, Haiyong; Morimoto, Jumpei; Rader, Christoph; Chiorazzi, Nicholas; Kodadek, Thomas
2016-04-01
Chronic lymphocytic leukemia (CLL) is a disease in which a single B-cell clone proliferates relentlessly in peripheral lymphoid organs, bone marrow, and blood. DNA sequencing experiments have shown that about 30% of CLL patients have stereotyped antigen-specific B-cell receptors (BCRs) with a high level of sequence homology in the variable domains of the heavy and light chains. These include many of the most aggressive cases that haveIGHV-unmutated BCRs whose sequences have not diverged significantly from the germ line. This suggests a personalized therapy strategy in which a toxin or immune effector function is delivered selectively to the pathogenic B-cells but not to healthy B-cells. To execute this strategy, serum-stable, drug-like compounds able to target the antigen-binding sites of most or all patients in a stereotyped subset are required. We demonstrate here the feasibility of this approach with the discovery of selective, high affinity ligands for CLL BCRs of the aggressive, stereotyped subset 7P that cross-react with the BCRs of several CLL patients in subset 7p, but not with BCRs from patients outside this subset. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Wang, Chun Guo; Chen, Xiao Qiang; Li, Hui; Zhao, Qian Cheng; Sun, De Ling; Song, Wen Qin
2008-02-01
Analysis of ISSR (Inter-Simple Sequence Repeat) and DDRT-PCR (Differential Display Reverse Transcriptase Polymerase Chain Reaction) was performed between cytoplasmic male sterility cauliflower ogura-A and its corresponding maintainer line ogura-B. Totally, 306 detectable bands were obtained by ISSR using thirty oligonucleotide primers. Commonly, six to twelve bands were produced per primer. Among all these primers only the amplification of primer ISSR3 was polymorphic, an 1100 bp specific band was only detected in maintainer line, named ISSR3(1100). Analysis of this sequence indicated that ISSR3(1100) was high homologous with the corresponding sequences of mitochondrial genome in Brassica napus and Arabidopsis thaliana,which suggested that ISSR3(1100) may derive from mitochondrial genome in cauliflower. To carry out DDRT-PCR analysis, three anchor primers and fifteen random primers were selected to combine. Totally, 1122 bands from 1 000 bp to 50 bp were detected. However, only four bands, named ogura-A 205, ogura-A383, ogura-B307 and ogura-B352, were confirmed to be different display in both lines. This result was further identified by reverse Northern dot blotting analysis. Among these four bands, ogura-A205 and ogura-A383 only express in cytoplasmic male sterility line, while ogura-B307 and ogura-B352 were only detected in maintainer line. Analysis of these sequences indicated that it was the first time that these four sequences were reported in cauliflower. Interestingly, ogura-A205 and ogura-B307 did not exhibit any similarities to other reported sequences in other species, more investigations were required to obtain further information. ogura-A383 and ogura-B352 were also two new sequences, they showed high similarities to corresponding chloroplast sequences of Arabidopsis thaliana and Brassica rapa subsp. pekinensis. So we speculated that these two sequences may derive from chloroplast genome. All these results obtained in this study offer new and significant information to investigate the molecular mechanism of cytoplasmic male sterility and fertile maintenance in cauliflower.
Cingulin Contains Globular and Coiled-Coil Domains and Interacts with Zo-1, Zo-2, Zo-3, and Myosin
Cordenonsi, Michelangelo; D'Atri, Fabio; Hammar, Eva; Parry, David A.D.; Kendrick-Jones, John; Shore, David; Citi, Sandra
1999-01-01
We characterized the sequence and protein interactions of cingulin, an M r 140–160-kD phosphoprotein localized on the cytoplasmic surface of epithelial tight junctions (TJ). The derived amino acid sequence of a full-length Xenopus laevis cingulin cDNA shows globular head (residues 1–439) and tail (1,326–1,368) domains and a central α-helical rod domain (440–1,325). Sequence analysis, electron microscopy, and pull-down assays indicate that the cingulin rod is responsible for the formation of coiled-coil parallel dimers, which can further aggregate through intermolecular interactions. Pull-down assays from epithelial, insect cell, and reticulocyte lysates show that an NH2-terminal fragment of cingulin (1–378) interacts in vitro with ZO-1 (K d ∼5 nM), ZO-2, ZO-3, myosin, and AF-6, but not with symplekin, and a COOH-terminal fragment (377–1,368) interacts with myosin and ZO-3. ZO-1 and ZO-2 immunoprecipitates contain cingulin, suggesting in vivo interactions. Full-length cingulin, but not NH2-terminal and COOH-terminal fragments, colocalizes with endogenous cingulin in transfected MDCK cells, indicating that sequences within both head and rod domains are required for TJ localization. We propose that cingulin is a functionally important component of TJ, linking the submembrane plaque domain of TJ to the actomyosin cytoskeleton. PMID:10613913
Pang, Siew Wai; Lahiri, Chandrajit; Poh, Chit Laa; Tan, Kuan Onn
2018-05-01
Paraneoplastic Ma Family (PNMA) comprises a growing number of family members which share relatively conserved protein sequences encoded by the human genome and is localized to several human chromosomes, including the X-chromosome. Based on sequence analysis, PNMA family members share sequence homology to the Gag protein of LTR retrotransposon, and several family members with aberrant protein expressions have been reported to be closely associated with the human Paraneoplastic Disorder (PND). In addition, gene mutations of specific members of PNMA family are known to be associated with human mental retardation or 3-M syndrome consisting of restrictive post-natal growth or dwarfism, and development of skeletal abnormalities. Other than sequence homology, the physiological function of many members in this family remains unclear. However, several members of this family have been characterized, including cell signalling events mediated by these proteins that are associated with apoptosis, and cancer in different cell types. Furthermore, while certain PNMA family members show restricted gene expression in the human brain and testis, other PNMA family members exhibit broader gene expression or preferential and selective protein interaction profiles, suggesting functional divergence within the family. Functional analysis of some members of this family have identified protein domains that are required for subcellular localization, protein-protein interactions, and cell signalling events which are the focus of this review paper. Copyright © 2018 Elsevier Inc. All rights reserved.
A yoga program for cognitive enhancement.
Brunner, Devon; Abramovitch, Amitai; Etherton, Joseph
2017-01-01
Recent studies suggest that yoga practice may improve cognitive functioning. Although preliminary data indicate that yoga improves working memory (WM), high-resolution information about the type of WM subconstructs, namely maintenance and manipulation, is not available. Furthermore, the association between cognitive enhancement and improved mindfulness as a result of yoga practice requires empirical examination. The aim of the present study is to assess the impact of a brief yoga program on WM maintenance, WM manipulation and attentive mindfulness. Measures of WM (Digit Span Forward, Backward, and Sequencing, and Letter-Number Sequencing) were administered prior to and following 6 sessions of yoga (N = 43). Additionally, the Mindfulness Attention Awareness Scale was administered to examine the potential impact of yoga practice on mindfulness, as well as the relationships among changes in WM and mindfulness. Analyses revealed significant improvement from pre- to post- training assessment on both maintenance WM (Digit Span Forward) and manipulation WM (Digit Span Backward and Letter-Number Sequencing). No change was found on Digit Span Sequencing. Improvement was also found on mindfulness scores. However, no correlation was observed between mindfulness and WM measures. A 6-session yoga program was associated with improvement on manipulation and maintenance WM measures as well as enhanced mindfulness scores. Additional research is needed to understand the extent of yoga-related cognitive enhancement and mechanisms by which yoga may enhance cognition, ideally by utilizing randomized controlled trials and more comprehensive neuropsychological batteries.
O'Sullivan, D J; O'Gara, F
1991-08-01
An iron-regulated promoter was cloned on a 2.1 kb Bg/II fragment from Pseudomonas sp. strain M114 and fused to the lacZ reporter gene. Iron-regulated lacZ expression from the resulting construct (pSP1) in strain M114 was mediated via the Fur-like repressor which also regulates siderophore production in this strain. A 390 bp StuI-PstI internal fragment contained the necessary information for iron-regulated promoter expression. This fragment was sequenced and the initiation point for transcription was determined by primer extension analysis. The region directly upstream of the transcription start point contained no significant homology to known promoter consensus sequences. However the -16 to -25 bp region contained homology to four other iron-regulated pseudomonad promoters. Deletion of bases downstream from the transcriptional start did not affect the iron-regulated expression of the promoter. The -37 and -43 bp regions exhibited some homology to the 19 bp Escherichia coli Fur-binding consensus sequence. When expressed in E. coli (via a cloned transacting factor from strain M114) lacZ expression from pSP1 was found to be regulated by iron. A region of greater than 77 bases but less than 131 upstream from the transcriptional start was found to be necessary for promoter activity, further suggesting that a transcriptional activator may be required for expression.
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
Population genetic of seahorses is confidently influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses under complex marine environments. PMID:25144384
Bridgewater, Laura C.; Walker, Marlan D.; Miller, Gwen C.; Ellison, Trevor A.; Holsinger, L. Daniel; Potter, Jennifer L.; Jackson, Todd L.; Chen, Reuben K.; Winkel, Vicki L.; Zhang, Zhaoping; McKinney, Sandra; de Crombrugghe, Benoit
2003-01-01
Expression of the type XI collagen gene Col11a2 is directed to cartilage by at least three chondrocyte-specific enhancer elements, two in the 5′ region and one in the first intron of the gene. The three enhancers each contain two heptameric sites with homology to the Sox protein-binding consensus sequence. The two sites are separated by 3 or 4 bp and arranged in opposite orientation to each other. Targeted mutational analyses of these three enhancers showed that in the intronic enhancer, as in the other two enhancers, both Sox sites in a pair are essential for enhancer activity. The transcription factor Sox9 binds as a dimer at the paired sites, and the introduction of insertion mutations between the sites demonstrated that physical interactions between the adjacently bound proteins are essential for enhancer activity. Additional mutational analyses demonstrated that although Sox9 binding at the paired Sox sites is necessary for enhancer activity, it alone is not sufficient. Adjacent DNA sequences in each enhancer are also required, and mutation of those sequences can eliminate enhancer activity without preventing Sox9 binding. The data suggest a new model in which adjacently bound proteins affect the DNA bend angle produced by Sox9, which in turn determines whether an active transcriptional enhancer complex is assembled. PMID:12595563
Directed evolution of the TALE N-terminal domain for recognition of all 5′ bases
Lamb, Brian M.; Mercer, Andrew C.; Barbas, Carlos F.
2013-01-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5′-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5′ T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5′ T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes. PMID:23980031
Dikow, Nicola; Nygren, Anders Oh; Schouten, Jan P; Hartmann, Carolin; Krämer, Nikola; Janssen, Bart; Zschocke, Johannes
2007-06-01
Standard methods used for genomic methylation analysis allow the detection of complete absence of either methylated or non-methylated alleles but are usually unable to detect changes in the proportion of methylated and unmethylated alleles. We compare two methods for quantitative methylation analysis, using the chromosome 15q11-q13 imprinted region as model. Absence of the non-methylated paternal allele in this region leads to Prader-Willi syndrome (PWS) whilst absence of the methylated maternal allele results in Angelman syndrome (AS). A proportion of AS is caused by mosaic imprinting defects which may be missed with standard methods and require quantitative analysis for their detection. Sequence-based quantitative methylation analysis (SeQMA) involves quantitative comparison of peaks generated through sequencing reactions after bisulfite treatment. It is simple, cost-effective and can be easily established for a large number of genes. However, our results support previous suggestions that methods based on bisulfite treatment may be problematic for exact quantification of methylation status. Methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) avoids bisulfite treatment. It detects changes in both CpG methylation as well as copy number of up to 40 chromosomal sequences in one simple reaction. Once established in a laboratory setting, the method is more accurate, reliable and less time consuming.
A DNA 'barcode blitz': rapid digitization and sequencing of a natural history collection.
Hebert, Paul D N; Dewaard, Jeremy R; Zakharov, Evgeny V; Prosser, Sean W J; Sones, Jayme E; McKeown, Jaclyn T A; Mantle, Beth; La Salle, John
2013-01-01
DNA barcoding protocols require the linkage of each sequence record to a voucher specimen that has, whenever possible, been authoritatively identified. Natural history collections would seem an ideal resource for barcode library construction, but they have never seen large-scale analysis because of concerns linked to DNA degradation. The present study examines the strength of this barrier, carrying out a comprehensive analysis of moth and butterfly (Lepidoptera) species in the Australian National Insect Collection. Protocols were developed that enabled tissue samples, specimen data, and images to be assembled rapidly. Using these methods, a five-person team processed 41,650 specimens representing 12,699 species in 14 weeks. Subsequent molecular analysis took about six months, reflecting the need for multiple rounds of PCR as sequence recovery was impacted by age, body size, and collection protocols. Despite these variables and the fact that specimens averaged 30.4 years old, barcode records were obtained from 86% of the species. In fact, one or more barcode compliant sequences (>487 bp) were recovered from virtually all species represented by five or more individuals, even when the youngest was 50 years old. By assembling specimen images, distributional data, and DNA barcode sequences on a web-accessible informatics platform, this study has greatly advanced accessibility to information on thousands of species. Moreover, much of the specimen data became publically accessible within days of its acquisition, while most sequence results saw release within three months. As such, this study reveals the speed with which DNA barcode workflows can mobilize biodiversity data, often providing the first web-accessible information for a species. These results further suggest that existing collections can enable the rapid development of a comprehensive DNA barcode library for the most diverse compartment of terrestrial biodiversity - insects.
McLane, K E; Weaver, W R; Lei, S; Chiappinelli, V A; Conti-Tronconi, B M
1993-07-13
kappa-Flavotoxin (kappa-FTX), a snake neurotoxin that is a selective antagonist of certain neuronal nicotinic acetylcholine receptors (AChRs), has recently been isolated and characterized [Grant, G. A., Frazier, M. W., & Chiappinelli, V. A. (1988) Biochemistry 27, 1532-1537]. Like the related snake toxin kappa-bungarotoxin (kappa-BTX), kappa-FTX binds with high affinity to alpha 3 subtypes of neuronal AChRs, even though there are distinct sequence differences between the two toxins. To further characterize the sequence regions of the neuronal AChR alpha 3 subunit involved in formation of the binding site for this family of kappa-neurotoxins, we investigated kappa-FTX binding to overlapping synthetic peptides screening the alpha 3 subunit sequence. A sequence region forming a "prototope" for kappa-FTX was identified within residues alpha 3 (51-70), confirming the suggestions of previous studies on the binding of kappa-BTX to the alpha 3 subunit [McLane, K. E., Tang, F., & Conti-Tronconi, B. M. (1990) J. Biol. Chem. 265, 1537-1544] and alpha-bungarotoxin to the Torpedo AChR alpha subunit [Conti-Tronconi, B. M., Tang, F., Diethelm, B. M., Spencer, S. R., Reinhardt-Maelicke, S., & Maelicke, A. (1990) Biochemistry 29, 6221-6230] that this sequence region is involved in formation of a cholinergic site. Single residue substituted analogues, where each residue of the sequence alpha 3 (51-70) was sequentially replaced by a glycine, were used to identify the amino acid side chains involved in the interaction of this prototope with kappa-FTX.(ABSTRACT TRUNCATED AT 250 WORDS)
Pusch, Carsten M; Bachmann, Lutz
2004-05-01
Proof of authenticity is the greatest challenge in palaeogenetic research, and many safeguards have become standard routine in laboratories specialized on ancient DNA research. Here we describe an as-yet unknown source of artifacts that will require special attention in the future. We show that ancient DNA extracts on their own can have an inhibitory and mutagenic effect under PCR. We have spiked PCR reactions including known human test DNA with 14 selected ancient DNA extracts from human and nonhuman sources. We find that the ancient DNA extracts inhibit the amplification of large fragments to different degrees, suggesting that the usual control against contaminations, i.e., the absence of long amplifiable fragments, is not sufficient. But even more important, we find that the extracts induce mutations in a nonrandom fashion. We have amplified a 148-bp stretch of the mitochondrial HVRI from contemporary human template DNA in spiked PCR reactions. Subsequent analysis of 547 sequences from cloned amplicons revealed that the vast majority (76.97%) differed from the correct sequence by single nucleotide substitutions and/or indels. In total, 34 positions of a 103-bp alignment are affected, and most mutations occur repeatedly in independent PCR amplifications. Several of the induced mutations occur at positions that have previously been detected in studies of ancient hominid sequences, including the Neandertal sequences. Our data imply that PCR-induced mutations are likely to be an intrinsic and general problem of PCR amplifications of ancient templates. Therefore, ancient DNA sequences should be considered with caution, at least as long as the molecular basis for the extract-induced mutations is not understood.
Next-generation sequencing for diagnosis of rare diseases in the neonatal intensive care unit.
Daoud, Hussein; Luco, Stephanie M; Li, Rui; Bareke, Eric; Beaulieu, Chandree; Jarinova, Olga; Carson, Nancy; Nikkel, Sarah M; Graham, Gail E; Richer, Julie; Armour, Christine; Bulman, Dennis E; Chakraborty, Pranesh; Geraghty, Michael; Lines, Matthew A; Lacaze-Masmonteil, Thierry; Majewski, Jacek; Boycott, Kym M; Dyment, David A
2016-08-09
Rare diseases often present in the first days and weeks of life and may require complex management in the setting of a neonatal intensive care unit (NICU). Exhaustive consultations and traditional genetic or metabolic investigations are costly and often fail to arrive at a final diagnosis when no recognizable syndrome is suspected. For this pilot project, we assessed the feasibility of next-generation sequencing as a tool to improve the diagnosis of rare diseases in newborns in the NICU. We retrospectively identified and prospectively recruited newborns and infants admitted to the NICU of the Children's Hospital of Eastern Ontario and the Ottawa Hospital, General Campus, who had been referred to the medical genetics or metabolics inpatient consult service and had features suggesting an underlying genetic or metabolic condition. DNA from the newborns and parents was enriched for a panel of clinically relevant genes and sequenced on a MiSeq sequencing platform (Illumina Inc.). The data were interpreted with a standard informatics pipeline and reported to care providers, who assessed the importance of genotype-phenotype correlations. Of 20 newborns studied, 8 received a diagnosis on the basis of next-generation sequencing (diagnostic rate 40%). The diagnoses were renal tubular dysgenesis, SCN1A-related encephalopathy syndrome, myotubular myopathy, FTO deficiency syndrome, cranioectodermal dysplasia, congenital myasthenic syndrome, autosomal dominant intellectual disability syndrome type 7 and Denys-Drash syndrome. This pilot study highlighted the potential of next-generation sequencing to deliver molecular diagnoses rapidly with a high success rate. With broader use, this approach has the potential to alter health care delivery in the NICU. © 2016 Canadian Medical Association or its licensors.
Gadala-Maria, Daniel; Yaari, Gur; Uduman, Mohamed; Kleinstein, Steven H
2015-02-24
Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.
Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J
2000-12-01
The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
Gumucio, D L; Rood, K L; Gray, T A; Riordan, M F; Sartor, C I; Collins, F S
1988-01-01
The molecular mechanisms responsible for the human fetal-to-adult hemoglobin switch have not yet been elucidated. Point mutations identified in the promoter regions of gamma-globin genes from individuals with nondeletion hereditary persistence of fetal hemoglobin (HPFH) may mark cis-acting sequences important for this switch, and the trans-acting factors which interact with these sequences may be integral parts in the puzzle of gamma-globin gene regulation. We have used gel retardation and footprinting strategies to define nuclear proteins which bind to the normal gamma-globin promoter and to determine the effect of HPFH mutations on the binding of a subset of these proteins. We have identified five proteins in human erythroleukemia cells (K562 and HEL) which bind to the proximal promoter region of the normal gamma-globin gene. One factor, gamma CAAT, binds the duplicated CCAAT box sequences; the -117 HPFH mutation increases the affinity of interaction between gamma CAAT and its cognate site. Two proteins, gamma CAC1 and gamma CAC2, bind the CACCC sequence. These proteins require divalent cations for binding. The -175 HPFH mutation interferes with the binding of a fourth protein, gamma OBP, which binds an octamer sequence (ATGCAAAT) in the normal gamma-globin promoter. The HPFH phenotype of the -175 mutation indicates that the octamer-binding protein may play a negative regulatory role in this setting. A fifth protein, EF gamma a, binds to sequences which overlap the octamer-binding site. The erythroid-specific distribution of EF gamma a and its close approximation to an apparent repressor-binding site suggest that it may be important in gamma-globin regulation. Images PMID:2468996
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reiche, B.; Frank, R.; Deutscher, J.
1988-08-23
Enzyme III/sup mtl/ is part of the mannitol phosphotransferase system of Staphylococcus aureus and Staphylococcus carnosus and is phosphorylated by phosphoenolpyruvate in a reaction sequence requiring enzyme I (phosphoenolpyruvate-protein phosphotransferase) and the histidine-containing protein HPr. In this paper, the authors report the isolation of III/sup mtl/ from both S. aureus and S. carnosus and the characterization of the active center. After phosphorylation of III/sup mtl/ with (/sup 32/P)PEP, enzyme I, and HPr, the phosphorylated protein was cleaved with endoproteinase GLu(C). The amino acid sequence of the S. aureus peptide carrying the phosphoryl group was found to be Gln-Val-Val-Ser-Thr-Phe-Met-Gly-Asn-Gly-Leu-Ala-Ile-Pro-His-Gly-Thr-Asp-Asp. The correspondingmore » peptide from S. carnosus shows an equal sequence except that the first residue is Ala instead of Gln. These peptides both contain a single histidyl residue which they assume to carry the phosphoryl group. All proteins of the PTS so far investigated indeed carry the phosphoryl group attached to a histidyl residue. According to sodium dodecyl sulfate gels, the molecular weight of the III/sup mtl/ proteins was found to be 15,000. They have also determined the N-terminal sequence of both proteins. Comparison of the III/sup mtl/ peptide sequences and the C-terminal part of the enzyme II/sup mtl/ of Escherichia coli reveals considerable sequence homology, which supports the suggestion that II/sup mtl/ of E. coli is a fusion protein of a soluble III protein with a membrane-bound enzyme II.« less
Defining and predicting structurally conserved regions in protein superfamilies
Huang, Ivan K.; Grishin, Nick V.
2013-01-01
Motivation: The structures of homologous proteins are generally better conserved than their sequences. This phenomenon is demonstrated by the prevalence of structurally conserved regions (SCRs) even in highly divergent protein families. Defining SCRs requires the comparison of two or more homologous structures and is affected by their availability and divergence, and our ability to deduce structurally equivalent positions among them. In the absence of multiple homologous structures, it is necessary to predict SCRs of a protein using information from only a set of homologous sequences and (if available) a single structure. Accurate SCR predictions can benefit homology modelling and sequence alignment. Results: Using pairwise DaliLite alignments among a set of homologous structures, we devised a simple measure of structural conservation, termed structural conservation index (SCI). SCI was used to distinguish SCRs from non-SCRs. A database of SCRs was compiled from 386 SCOP superfamilies containing 6489 protein domains. Artificial neural networks were then trained to predict SCRs with various features deduced from a single structure and homologous sequences. Assessment of the predictions via a 5-fold cross-validation method revealed that predictions based on features derived from a single structure perform similarly to ones based on homologous sequences, while combining sequence and structural features was optimal in terms of accuracy (0.755) and Matthews correlation coefficient (0.476). These results suggest that even without information from multiple structures, it is still possible to effectively predict SCRs for a protein. Finally, inspection of the structures with the worst predictions pinpoints difficulties in SCR definitions. Availability: The SCR database and the prediction server can be found at http://prodata.swmed.edu/SCR. Contact: 91huangi@gmail.com or grishin@chop.swmed.edu Supplementary information: Supplementary data are available at Bioinformatics Online PMID:23193223
Next-generation sequencing for diagnosis of rare diseases in the neonatal intensive care unit
Daoud, Hussein; Luco, Stephanie M.; Li, Rui; Bareke, Eric; Beaulieu, Chandree; Jarinova, Olga; Carson, Nancy; Nikkel, Sarah M.; Graham, Gail E.; Richer, Julie; Armour, Christine; Bulman, Dennis E.; Chakraborty, Pranesh; Geraghty, Michael; Lines, Matthew A.; Lacaze-Masmonteil, Thierry; Majewski, Jacek; Boycott, Kym M.; Dyment, David A.
2016-01-01
Background: Rare diseases often present in the first days and weeks of life and may require complex management in the setting of a neonatal intensive care unit (NICU). Exhaustive consultations and traditional genetic or metabolic investigations are costly and often fail to arrive at a final diagnosis when no recognizable syndrome is suspected. For this pilot project, we assessed the feasibility of next-generation sequencing as a tool to improve the diagnosis of rare diseases in newborns in the NICU. Methods: We retrospectively identified and prospectively recruited newborns and infants admitted to the NICU of the Children’s Hospital of Eastern Ontario and the Ottawa Hospital, General Campus, who had been referred to the medical genetics or metabolics inpatient consult service and had features suggesting an underlying genetic or metabolic condition. DNA from the newborns and parents was enriched for a panel of clinically relevant genes and sequenced on a MiSeq sequencing platform (Illumina Inc.). The data were interpreted with a standard informatics pipeline and reported to care providers, who assessed the importance of genotype–phenotype correlations. Results: Of 20 newborns studied, 8 received a diagnosis on the basis of next-generation sequencing (diagnostic rate 40%). The diagnoses were renal tubular dysgenesis, SCN1A-related encephalopathy syndrome, myotubular myopathy, FTO deficiency syndrome, cranioectodermal dysplasia, congenital myasthenic syndrome, autosomal dominant intellectual disability syndrome type 7 and Denys–Drash syndrome. Interpretation: This pilot study highlighted the potential of next-generation sequencing to deliver molecular diagnoses rapidly with a high success rate. With broader use, this approach has the potential to alter health care delivery in the NICU. PMID:27241786
Schönberger, Anna R; Hagelweide, Klara; Pelzer, Esther A; Fink, Gereon R; Schubotz, Ricarda I
2015-10-01
Cognitive impairment in Parkinson's disease (PD) is often attributed to dopamine deficiency in the prefrontal-basal ganglia-thalamo-cortical loops. Although recent studies point to a close interplay between motor and cognitive abilities in PD, the so-called "motor loop" connecting supplementary motor area (SMA) and putamen has been considered solely with regard to the patients' motor impairment. Our study challenges this view by testing patients with the serial prediction task (SPT), a cognitive task that requires participants to predict stimulus sequences and particularly engages premotor sites of the motor loop. We hypothesised that affection of the motor loop causes impaired SPT performance, especially when the internal sequence representation is challenged by suspension of external stimuli. As shown for motor tasks, we further expected this impairment to be compensated by hyperactivity of the lateral premotor cortex (PM). We tested 16 male PD patients ON and OFF dopaminergic medication and 16 male age-matched healthy controls in an functional Magnetic Resonance Imaging study. All subjects performed two versions of the SPT: one with on-going sequences (SPT0), and one with sequences containing non-informative wildcards (SPT+) increasing the demands on mnemonic sequence representation. Patients ON (compared to controls) revealed an impaired performance coming along with hypoactivity of SMA and putamen. Patients OFF compared to ON medication, while showing poorer performance, exhibited a significantly increased PM activity for SPT+ vs. SPT0. Furthermore, patients' performance positively co-varied with PM activity, corroborating a compensatory account. Our data reveal a contribution of the motor loop to cognitive impairment in PD, and suggest a close interplay of SMA and PM beyond motor control. Copyright © 2015 Elsevier Ltd. All rights reserved.
Pulseq: A rapid and hardware-independent pulse sequence prototyping framework.
Layton, Kelvin J; Kroboth, Stefan; Jia, Feng; Littin, Sebastian; Yu, Huijun; Leupold, Jochen; Nielsen, Jon-Fredrik; Stöcker, Tony; Zaitsev, Maxim
2017-04-01
Implementing new magnetic resonance experiments, or sequences, often involves extensive programming on vendor-specific platforms, which can be time consuming and costly. This situation is exacerbated when research sequences need to be implemented on several platforms simultaneously, for example, at different field strengths. This work presents an alternative programming environment that is hardware-independent, open-source, and promotes rapid sequence prototyping. A novel file format is described to efficiently store the hardware events and timing information required for an MR pulse sequence. Platform-dependent interpreter modules convert the file to appropriate instructions to run the sequence on MR hardware. Sequences can be designed in high-level languages, such as MATLAB, or with a graphical interface. Spin physics simulation tools are incorporated into the framework, allowing for comparison between real and virtual experiments. Minimal effort is required to implement relatively advanced sequences using the tools provided. Sequences are executed on three different MR platforms, demonstrating the flexibility of the approach. A high-level, flexible and hardware-independent approach to sequence programming is ideal for the rapid development of new sequences. The framework is currently not suitable for large patient studies or routine scanning although this would be possible with deeper integration into existing workflows. Magn Reson Med 77:1544-1552, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
Single Molecule Force Measurement for Protein Synthesis on the Ribosome
NASA Astrophysics Data System (ADS)
Uemura, Sotaro
2008-04-01
The ribosome is a molecular machine that translates the genetic code described on the messenger RNA (mRNA) into an amino acid sequence through repetitive cycles of transfer RNA (tRNA) selection, peptide bond formation and translocation. Although the detailed interactions between the translation components have been revealed by extensive structural and biochemical studies, it is not known how the precise regulation of macromolecular movements required at each stage of translation is achieved. Here we demonstrate an optical tweezer assay to measure the rupture force between a single ribosome complex and mRNA. The rupture force was compared between ribosome complexes assembled on an mRNA with and without a strong Shine-Dalgarno (SD) sequence. The removal of the SD sequence significantly reduced the rupture force, indicating that the SD interactions contribute significantly to the stability of the ribosomal complex on the mRNA in a pre-peptidyl transfer state. In contrast, the post-peptidyl transfer state weakened the rupture force as compared to the complex in a pre-peptidyl transfer state and it was the same for both the SD-containing and SD-deficient mRNAs. The results suggest that formation of the first peptide bond destabilizes the SD interaction, resulting in the weakening of the force with which the ribosome grips an mRNA. This might be an important requirement to facilitate movement of the ribosome along mRNA during the first translocation step. In this article, we discuss about the above new results including the introduction of the ribosome translation mechanism and the optical tweezer method.
Kim, Seongman; Dai, Gan; O’Callaghan, Dennis J.; Kim, Seong Kee
2012-01-01
The immediate-early protein (IEP), the major regulatory protein encoded by the IE gene of equine herpesvirus 1 (EHV-1), plays a crucial role as both transcription activator and repressor during a productive lytic infection. To investigate the mechanism by which the EHV-1 IEP inhibits its own promoter, IE promoter-luciferase reporter plasmids containing wild-type and mutant IEP-binding site (IEBS) were constructed and used for luciferase reporter assays. The IEP inhibited transcription from its own promoter in the presence of a consensus IEBS (5’-ATCGT-3’) located near the transcription initiation site but did not inhibit when the consensus sequence was deleted. To determine whether the distance between the TATA box and the IEBS affects transcriptional repression, the IEBS was displaced from the original site by the insertion of synthetic DNA sequences. Luciferase reporter assays revealed that the IEP is able to repress its own promoter when the IEBS is located within 26-bp from the TATA box. We also found that the proper orientation and position of the IEBS were required for the repression by the IEP. Interestingly, the level of repression was significantly reduced when a consensus TATA sequence was deleted from the promoter region, indicating that the IEP efficiently inhibits its own promoter in a TATA box-dependent manner. Taken together, these results suggest that the EHV-1 IEP delicately modulates autoregulation of its gene through the consensus IEBS that is near the transcription initiation site and the TATA box. PMID:22265772
Kim, Seongman; Dai, Gan; O'Callaghan, Dennis J; Kim, Seong Kee
2012-04-01
The immediate-early protein (IEP), the major regulatory protein encoded by the IE gene of equine herpesvirus 1 (EHV-1), plays a crucial role as both transcription activator and repressor during a productive lytic infection. To investigate the mechanism by which the EHV-1 IEP inhibits its own promoter, IE promoter-luciferase reporter plasmids containing wild-type and mutant IEP-binding site (IEBS) were constructed and used for luciferase reporter assays. The IEP inhibited transcription from its own promoter in the presence of a consensus IEBS (5'-ATCGT-3') located near the transcription initiation site but did not inhibit when the consensus sequence was deleted. To determine whether the distance between the TATA box and the IEBS affects transcriptional repression, the IEBS was displaced from the original site by the insertion of synthetic DNA sequences. Luciferase reporter assays revealed that the IEP is able to repress its own promoter when the IEBS is located within 26-bp from the TATA box. We also found that the proper orientation and position of the IEBS were required for the repression by the IEP. Interestingly, the level of repression was significantly reduced when a consensus TATA sequence was deleted from the promoter region, indicating that the IEP efficiently inhibits its own promoter in a TATA box-dependent manner. Taken together, these results suggest that the EHV-1 IEP delicately modulates autoregulation of its gene through the consensus IEBS that is near the transcription initiation site and the TATA box. Copyright © 2012. Published by Elsevier B.V.
Rabinovitz, Shiri; Kaufman, Yotam; Ludwig, Guy; Razin, Aharon; Shemer, Ruth
2012-01-01
The Prader-Willi syndrome/Angelman syndrome (PWS/AS) imprinted domain is regulated by a bipartite imprinting control center (IC) composed of a sequence around the SNRPN promoter (PWS-IC) and a 880-bp sequence located 35 kb upstream (AS-IC). The AS-IC imprint is established during gametogenesis and confers repression upon PWS-IC on the maternal allele. Mutation at PWS-IC on the paternal allele leads to gene silencing across the entire PWS/AS domain. This silencing implies that PWS-IC functions on the paternal allele as a bidirectional activator. Here we examine the mechanism by which PWS-IC activates the paternally expressed genes (PEGs) using transgenes that include the PWS-IC sequence in the presence or absence of AS-IC and NDN, an upstream PEG, as an experimental model. We demonstrate that PWS-IC is in fact an activator of NDN. This activation requires an unmethylated PWS-IC in the gametes and during early embryogenesis. PWS-IC is dispensable later in development. Interestingly, a similar activation of a nonimprinted gene (APOA1) was observed, implying that PWS-IC is a universal activator. To decipher the mechanism by which PWS-IC confers activation of remote genes, we performed methylated DNA immunoprecipitation (MeDIP) array analysis on lymphoblast cell lines that revealed dispersed, rather than continued differential methylation. However, chromatin conformation capture (3c) experiments revealed a physical interaction between PWS-IC and the PEGs, suggesting that activation of PEGs may require their proximity to PWS-IC. PMID:22529396
Thavabalasingam, Sathesan; O'Neil, Edward B; Lee, Andy C H
2018-05-22
Recent rodent work suggests the hippocampus may provide a temporal representation of event sequences, in which the order of events and the interval durations between them are encoded. There is, however, limited human evidence for the latter, in particular whether the hippocampus processes duration information pertaining to the passage of time rather than qualitative or quantitative changes in event content. We scanned participants while they made match-mismatch judgements on each trial between a study sequence of events and a subsequent test sequence. Participants explicitly remembered event order or interval duration information (Experiment 1), or monitored order only, with duration being manipulated implicitly (Experiment 2). Hippocampal study-test pattern similarity was significantly reduced by changes to order or duration in mismatch trials, even when duration was processed implicitly. Our findings suggest the human hippocampus processes short intervals within sequences and support the idea that duration information is integrated into hippocampal mnemonic representations. Copyright © 2018 Elsevier Inc. All rights reserved.
AfterQC: automatic filtering, trimming, error removing and quality control for fastq data.
Chen, Shifu; Huang, Tanxiao; Zhou, Yanqing; Han, Yue; Xu, Mingyan; Gu, Jia
2017-03-14
Some applications, especially those clinical applications requiring high accuracy of sequencing data, usually have to face the troubles caused by unavoidable sequencing errors. Several tools have been proposed to profile the sequencing quality, but few of them can quantify or correct the sequencing errors. This unmet requirement motivated us to develop AfterQC, a tool with functions to profile sequencing errors and correct most of them, plus highly automated quality control and data filtering features. Different from most tools, AfterQC analyses the overlapping of paired sequences for pair-end sequencing data. Based on overlapping analysis, AfterQC can detect and cut adapters, and furthermore it gives a novel function to correct wrong bases in the overlapping regions. Another new feature is to detect and visualise sequencing bubbles, which can be commonly found on the flowcell lanes and may raise sequencing errors. Besides normal per cycle quality and base content plotting, AfterQC also provides features like polyX (a long sub-sequence of a same base X) filtering, automatic trimming and K-MER based strand bias profiling. For each single or pair of FastQ files, AfterQC filters out bad reads, detects and eliminates sequencer's bubble effects, trims reads at front and tail, detects the sequencing errors and corrects part of them, and finally outputs clean data and generates HTML reports with interactive figures. AfterQC can run in batch mode with multiprocess support, it can run with a single FastQ file, a single pair of FastQ files (for pair-end sequencing), or a folder for all included FastQ files to be processed automatically. Based on overlapping analysis, AfterQC can estimate the sequencing error rate and profile the error transform distribution. The results of our error profiling tests show that the error distribution is highly platform dependent. Much more than just another new quality control (QC) tool, AfterQC is able to perform quality control, data filtering, error profiling and base correction automatically. Experimental results show that AfterQC can help to eliminate the sequencing errors for pair-end sequencing data to provide much cleaner outputs, and consequently help to reduce the false-positive variants, especially for the low-frequency somatic mutations. While providing rich configurable options, AfterQC can detect and set all the options automatically and require no argument in most cases.
Bellissimo, Daniel B; Christopherson, Pamela A; Flood, Veronica H; Gill, Joan Cox; Friedman, Kenneth D; Haberichter, Sandra L; Shapiro, Amy D; Abshire, Thomas C; Leissinger, Cindy; Hoots, W Keith; Lusher, Jeanne M; Ragni, Margaret V; Montgomery, Robert R
2012-03-01
Diagnosis and classification of VWD is aided by molecular analysis of the VWF gene. Because VWF polymorphisms have not been fully characterized, we performed VWF laboratory testing and gene sequencing of 184 healthy controls with a negative bleeding history. The controls included 66 (35.9%) African Americans (AAs). We identified 21 new sequence variations, 13 (62%) of which occurred exclusively in AAs and 2 (G967D, T2666M) that were found in 10%-15% of the AA samples, suggesting they are polymorphisms. We identified 14 sequence variations reported previously as VWF mutations, the majority of which were type 1 mutations. These controls had VWF Ag levels within the normal range, suggesting that these sequence variations might not always reduce plasma VWF levels. Eleven mutations were found in AAs, and the frequency of M740I, H817Q, and R2185Q was 15%-18%. Ten AA controls had the 2N mutation H817Q; 1 was homozygous. The average factor VIII level in this group was 99 IU/dL, suggesting that this variation may confer little or no clinical symptoms. This study emphasizes the importance of sequencing healthy controls to understand ethnic-specific sequence variations so that asymptomatic sequence variations are not misidentified as mutations in other ethnic or racial groups.
Matsubara, Kazumi; Uno, Yoshinobu; Srikulnath, Kornsorn; Seki, Risako; Nishida, Chizuko; Matsuda, Yoichi
2015-12-01
Highly repetitive DNA sequences of the centromeric heterochromatin provide valuable molecular cytogenetic markers for the investigation of genomic compartmentalization in the macrochromosomes and microchromosomes of sauropsids. Here, the relationship between centromeric heterochromatin and karyotype evolution was examined using cloned repetitive DNA sequences from two snake species, the habu snake (Protobothrops flavoviridis, Crotalinae, Viperidae) and Burmese python (Python bivittatus, Pythonidae). Three satellite DNA (stDNA) families were isolated from the heterochromatin of these snakes: 168-bp PFL-MspI from P. flavoviridis and 196-bp PBI-DdeI and 174-bp PBI-MspI from P. bivittatus. The PFL-MspI and PBI-DdeI sequences were localized to the centromeric regions of most chromosomes in the respective species, suggesting that the two sequences were the major components of the centromeric heterochromatin in these organisms. The PBI-MspI sequence was localized to the pericentromeric region of four chromosome pairs. The PFL-MspI and the PBI-DdeI sequences were conserved only in the genome of closely related species, Gloydius blomhoffii (Crotalinae) and Python molurus, respectively, although their locations on the chromosomes were slightly different. In contrast, the PBI-MspI sequence was also in the genomes of P. molurus and Boa constrictor (Boidae), and additionally localized to the centromeric regions of eight chromosome pairs in B. constrictor, suggesting that this sequence originated in the genome of a common ancestor of Pythonidae and Boidae, approximately 86 million years ago. The three stDNA sequences showed no genomic compartmentalization between the macrochromosomes and microchromosomes, suggesting that homogenization of the centromeric and/or pericentromeric stDNA sequences occurred in the macrochromosomes and microchromosomes of these snakes.
Comparisons of luminaires: Efficacies and system design
DOE Office of Scientific and Technical Information (OSTI.GOV)
Albright, L.D.; Both, A.J.
1994-12-31
After reviewing basic information, three design examples have been presented to demonstrate a process of supplemental lighting design. The sequences of each example suggest careful thought and analysis are required to obtain supplemental lighting designs that provide both high levels of PAR and suitable uniformity. The end results should suggest how an analysis can evolve to achieve desired results, and the types of tools and adjustments required. It appears possible to design research greenhouses and plant growth chambers to achieve a {+-}10% PAR uniformity using HIPS luminaires. Further, HPS luminaires (and, by extension, NEHD, etc.) are required to achieve highmore » PAR levels and have the decided advantage of providing the possibility of aiming, which reduces the region of the {open_quotes}edge effect{close_quotes}. Further, for designing plant lighting systems, a modification of the standard IES luminaire data file structure is potentially useful. Luminaire installation is an important factor to obtain PAR uniformity. Spacing and mounting height are critically important. Additionally, the mounting angle of each luminaire must be carefully adjusted to conform with design assumptions. This is true for both plant growth chambers and greenhouses. Surface reflectances are particularly important when designing for small lighted regions such as plant growth chambers and research greenhouses. It is not obvious, just from looking at a surface, what its reflectance is. It is suggested that an effort be mounted to develop valid surface reflectance data to be used by designers. The importance of the surfaces (particularly the walls) in achieving PAR uniformity suggests the importance of periodic cleaning/maintenance to retain initial reflectance values.« less
DHS-STEM Internship at Lawrence Livermore National Laboratory
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feldman, B
2008-08-18
This summer I had the fortunate opportunity through the DHS-STEM program to attend Lawrence Livermore National Laboratories (LLNL) to work with Tom Slezak on the bioinformatics team. The bioinformatics team, among other things, helps to develop TaqMan and microarray probes for the identification of pathogens. My main project at the laboratory was to test such probe identification capabilities against metagenomic (unsequenced) data from around the world. Using various sequence analysis tools (Vmatch and Blastall) and several we developed ourselves, about 120 metagenomic sequencing projects were compared against a collection of all completely sequenced genomes and Lawrence Livermore National Laboratory's (LLNL)more » current probe database. For the probes, the Blastall algorithms compared each individual metagenomic project using various parameters allowing for the natural ambiguities of in vitro hybridization (mismatches, deletions, insertions, hairpinning, etc.). A low level cutoff was used to eliminate poor sequence matches, and to leave a large variety of higher quality matches for future research into the hybridization of sequences with mutations and variations. Any hits with at least 80% base pair conservation over 80% of the length of the match. Because of the size of our whole genome database, we utilized the exact match algorithm of Vmatch to quickly search and compare genomes for exact matches with varying lower level limits on sequence length. I also provided preliminary feasibility analyses to support a potential industry-funded project to develop a multiplex assay on several genera and species. Each genus and species was evaluated based on the amount of sequenced genomes, amount of near neighbor sequenced genomes, presence of identifying genes--metabolistic or antibiotic resistant genes--and the availability of research on the identification of the specific genera or species. Utilizing the bioinformatic team's software, I was able to develop and/or update several TaqMan probes for these and develop a plan of identification for the more difficult ones. One suggestion for a genus with low conservation was to separate species into several groups and look for probes within these and then use a combination of probes to identify a genus. This has the added benefit of also providing subgenus identification in larger genera. During both projects I had developed a set of computer programs to simplify or consolidate several processes. These programs were constructed with the intent of being reused to either repeat these results, further this research, or to start a similar project. A big problem in the bioinformatic/sequencing field is the variability of data storage formats which make using data from various sources extremely difficult. Excluding for the moment the many errors present in online database genome sequences, there are still many difficulties in converting one data type into another successfully every time. Dealing with hundreds of files, each hundreds of megabytes, requires automation which in turn requires good data mining software. The programs I developed will help ease this issue and make more genomic sources available for use. With these programs it is extremely easy to gather the data, cleanse it, convert it and run it through some analysis software and even analyze the output of this software. When dealing with vast amounts of data it is vital for the researcher to optimize the process--which became clear to me with only ten weeks to work with. Due to the time constraint of the internship, I was unable to finish my metagenomic project; I did finish with success, my second project, discovering TaqMan identification for genera and species. Although I did not complete my first project I made significant findings along the way that suggest the need for further research on the subject. I found several instances of false positives in the metagenomic data from our microarrays which indicates the need to sequence more metagenomic samples. My initial research shows the importance of expanding our known metagenomic world; at this point there is always the likelihood of developing probes with unknown interactions because there is not enough sequencing. On the other hand my research did point out the sensitivity and quality of LLNL's microarrays when it identified a parvoviridae infection in a mosquito metagenomic sample from southern California. It also uniquely identified the presence of several species of the adenovirus which could mean that there was some archaic strain of the adenovirus present in the metagenomic sample or there was a contamination in the sample, requiring a further investigation to clarify.« less
Li, Ming; Wang, Rui; Zhao, Dahe; Xiang, Hua
2014-01-01
The clustered regularly interspaced short palindromic repeat (CRISPR)-Cas system mediates adaptive immunity against foreign nucleic acids in prokaryotes. However, efficient adaptation of a native CRISPR to purified viruses has only been observed for the type II-A system from a Streptococcus thermophilus industry strain, and rarely reported for laboratory strains. Here, we provide a second native system showing efficient adaptation. Infected by a newly isolated virus HHPV-2, Haloarcula hispanica type I-B CRISPR system acquired spacers discriminatively from viral sequences. Unexpectedly, in addition to Cas1, Cas2 and Cas4, this process also requires Cas3 and at least partial Cascade proteins, which are involved in interference and/or CRISPR RNA maturation. Intriguingly, a preexisting spacer partially matching a viral sequence is also required, and spacer acquisition from upstream and downstream sequences of its target sequence (i.e. priming protospacer) shows different strand bias. These evidences strongly indicate that adaptation in this system strictly requires a priming process. This requirement, if validated also true for other CRISPR systems as implied by our bioinformatic analysis, may help to explain failures to observe efficient adaptation to purified viruses in many laboratory strains, and the discrimination mechanism at the adaptation level that has confused scientists for years. PMID:24265226
Targeted RNA-Sequencing with Competitive Multiplex-PCR Amplicon Libraries
Blomquist, Thomas M.; Crawford, Erin L.; Lovett, Jennie L.; Yeo, Jiyoun; Stanoszek, Lauren M.; Levin, Albert; Li, Jia; Lu, Mei; Shi, Leming; Muldrew, Kenneth; Willey, James C.
2013-01-01
Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R2 = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R2 = 0.97–0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R2 = 0.96) and whole transcriptome RNA-sequencing following “traditional” library preparation using Illumina NGS kits (R2 = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 107-fold range was reduced more than 10,000-fold, from 2.3×109 to 1.4×105 sequencing reads. These studies demonstrate that the competitive multiplex-PCR amplicon library preparation method presented here provides the quality control, reproducibility, and reduced sequencing reads necessary for development and implementation of targeted quantitative RNA-sequencing biomarkers in molecular diagnostic testing. PMID:24236095
Simulations Using Random-Generated DNA and RNA Sequences
ERIC Educational Resources Information Center
Bryce, C. F. A.
1977-01-01
Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
Genetic recombination of tick-borne flaviviruses among wild-type strains.
Norberg, Peter; Roth, Anette; Bergström, Tomas
2013-06-05
Genetic recombination has been suggested to occur in mosquito-borne flaviviruses. In contrast, tick-borne flaviviruses have been thought to evolve in a clonal manner, although recent studies suggest that recombination occurs also for these viruses. We re-analyzed the data and found that previous conclusions on wild type recombination were probably falsely drawn due to misalignments of nucleotide sequences, ambiguities in GenBank sequences, or different laboratory culture histories suggestive of recombination events in laboratory. To evaluate if reliable predictions of wild type recombination of tick-borne flaviviruses can be made, we analyzed viral strains sequenced exclusively for this study, and other flavivirus sequences retrieved from GenBank. We detected genetic signals supporting recombination between viruses within the three clades of TBEV-Eu, TBEV-Sib and TBEV-Fe, respectively. Our results suggest that the tick-borne encephalitis viruses may undergo recombination under natural conditions, but that geographic barriers restrict most recombination events to involve only closely genetically related viruses. Copyright © 2013 Elsevier Inc. All rights reserved.
Evaluating imputation algorithms for low-depth genotyping-by-sequencing (GBS) data
USDA-ARS?s Scientific Manuscript database
Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-model species with few genomic resources, high-throughput sequencing (HTS) methods, such as Genotyping-By-Sequencing (GBS), offer an inexpensive alternative to array-based genotyping. Although affordabl...
Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin
2018-01-01
Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Archer, Stuart K; Shirokikh, Nikolay E; Preiss, Thomas
2015-04-01
Most applications for RNA-seq require the depletion of abundant transcripts to gain greater coverage of the underlying transcriptome. The sequences to be targeted for depletion depend on application and species and in many cases may not be supported by commercial depletion kits. This unit describes a method for generating RNA-seq libraries that incorporates probe-directed degradation (PDD), which can deplete any unwanted sequence set, with the low-bias split-adapter method of library generation (although many other library generation methods are in principle compatible). The overall strategy is suitable for applications requiring customized sequence depletion or where faithful representation of fragment ends and lack of sequence bias is paramount. We provide guidelines to rapidly design specific probes against the target sequence, and a detailed protocol for library generation using the split-adapter method including several strategies for streamlining the technique and reducing adapter dimer content. Copyright © 2015 John Wiley & Sons, Inc.
Discrete sequence prediction and its applications
NASA Technical Reports Server (NTRS)
Laird, Philip
1992-01-01
Learning from experience to predict sequences of discrete symbols is a fundamental problem in machine learning with many applications. We apply sequence prediction using a simple and practical sequence-prediction algorithm, called TDAG. The TDAG algorithm is first tested by comparing its performance with some common data compression algorithms. Then it is adapted to the detailed requirements of dynamic program optimization, with excellent results.
Yang, Teng-Chieh; Maluf, Nasib Karl
2012-02-21
Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Sill, Orriana C.; Smith, David M.
2012-01-01
In recent years, many animal models of memory have focused on one or more of the various components of episodic memory. For example, the odor sequence memory task requires subjects to remember individual items and events (the odors) and the temporal aspects of the experience (the sequence of odor presentation). The well-known spatial context coding function of the hippocampus, as exemplified by place cell firing, may reflect the ‘where’ component of episodic memory. In the present study, we added a contextual component to the odor sequence memory task by training rats to choose the earlier odor in one context and the later odor in another context and we compared the effects of temporary hippocampal lesions on performance of the original single context task and the new dual context task. Temporary lesions significantly impaired the single context task, although performance remained significantly above chance levels. In contrast, performance dropped all the way to chance when temporary lesions were used in the dual context task. These results demonstrate that rats can learn a dual context version of the odor sequence learning task which requires the use of contextual information along with the requirement to remember the ‘what’ and ‘when’ components of the odor sequence. Moreover, the additional requirement of context-dependent expression of the ‘what-when’ memory made the task fully dependent on the hippocampus. Moreover, the addition of the contextual component made the task fully dependent on the hippocampus. PMID:22687149
Approximate matching of regular expressions.
Myers, E W; Miller, W
1989-01-01
Given a sequence A and regular expression R, the approximate regular expression matching problem is to find a sequence matching R whose optimal alignment with A is the highest scoring of all such sequences. This paper develops an algorithm to solve the problem in time O(MN), where M and N are the lengths of A and R. Thus, the time requirement is asymptotically no worse than for the simpler problem of aligning two fixed sequences. Our method is superior to an earlier algorithm by Wagner and Seiferas in several ways. First, it treats real-valued costs, in addition to integer costs, with no loss of asymptotic efficiency. Second, it requires only O(N) space to deliver just the score of the best alignment. Finally, its structure permits implementation techniques that make it extremely fast in practice. We extend the method to accommodate gap penalties, as required for typical applications in molecular biology, and further refine it to search for sub-strings of A that strongly align with a sequence in R, as required for typical data base searches. We also show how to deliver an optimal alignment between A and R in only O(N + log M) space using O(MN log M) time. Finally, an O(MN(M + N) + N2log N) time algorithm is presented for alignment scoring schemes where the cost of a gap is an arbitrary increasing function of its length.
Deletion mapping of the Aequorea victoria green fluorescent protein.
Dopf, J; Horiagon, T M
1996-01-01
Aequorea victoria green fluorescent protein (GFP) is a promising fluorescent marker which is active in a diverse array of prokaryotic and eukaryotic organisms. A key feature underlying the versatility of GFP is its capacity to undergo heterocyclic chromophore formation by cyclization of a tripeptide present in its primary sequence and thereby acquiring fluorescent activity in a variety of intracellular environments. In order to define further the primary structure requirements for chromophore formation and fluorescence in GFP, a series of N- and C-terminal GFP deletion variant expression vectors were created using the polymerase chain reaction. Scanning spectrofluorometric analyses of crude soluble protein extracts derived from eleven GFP expression constructs revealed that amino acid (aa) residues 2-232, of a total of 238 aa in the native protein, were required for the characteristic emission and absorption spectra of native GFP. Heterocyclic chromophore formation was assayed by comparing the absorption spectrum of GFP deletion variants over the 300-500-nm range to the absorption spectra of full-length GFP and GFP deletion variants missing the chromophore substrate domain from the primary sequence. GFP deletion variants lacking fluorescent activity showed no evidence of heterocyclic ring structure formation when the soluble extracts of their bacterial expression hosts were studied at pH 7.9. These observations suggest that the primary structure requirements for the fluorescent activity of GFP are relatively extensive and are compatible with the view that much of the primary structure serves an autocatalytic function.
van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y
2018-04-17
Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.
Qiao, Qin; Ren, Zhumei; Zhao, Jiayuan; Yonezawa, Takahiro; Hasegawa, Masami; Crabbe, M. James C; Li, Jianqiang; Zhong, Yang
2013-01-01
Background The central function of chloroplasts is to carry out photosynthesis, and its gene content and structure are highly conserved across land plants. Parasitic plants, which have reduced photosynthetic ability, suffer gene losses from the chloroplast (cp) genome accompanied by the relaxation of selective constraints. Compared with the rapid rise in the number of cp genome sequences of photosynthetic organisms, there are limited data sets from parasitic plants. Principal Findings/Significance Here we report the complete sequence of the cp genome of Cistanche deserticola, a holoparasitic desert species belonging to the family Orobanchaceae. The cp genome of C. deserticola is greatly reduced both in size (102,657 bp) and in gene content, indicating that all genes required for photosynthesis suffer from gene loss and pseudogenization, except for psbM. The striking difference from other holoparasitic plants is that it retains almost a full set of tRNA genes, and it has lower dN/dS for most genes than another close holoparasitic plant, E. virginiana, suggesting that Cistanche deserticola has undergone fewer losses, either due to a reduced level of holoparasitism, or to a recent switch to this life history. We also found that the rpoC2 gene was present in two copies within C. deserticola. Its own copy has much shortened and turned out to be a pseudogene. Another copy, which was not located in its cp genome, was a homolog of the host plant, Haloxylon ammodendron (Chenopodiaceae), suggesting that it was acquired from its host via a horizontal gene transfer. PMID:23554920
Molecular Insights into the Ctenophore Genus Beroe in Europe: New Species, Spreading Invaders.
Johansson, Mattias L; Shiganova, Tamara A; Ringvold, Halldis; Stupnikova, Alexandra N; Heath, Daniel D; MacIsaac, Hugh J
2018-06-07
The genus Beroe Browne, 1756 (Ctenophora, Beroidae) occurs worldwide, with 25 currently-described species. Because the genus is poorly studied, the definitive number of species is uncertain. Recently, a possible new Beroe species was suggested based on internal transcribed spacer 1 (ITS1) sequences from samples collected in Svalbard, Norway. Another species, Beroe ovata, was introduced to Europe from North America, initially in the Black Sea and subsequently (and possibly secondarily) into the Mediterranean and Baltic Seas. In areas where ctenophores have been introduced, they have often had significant detrimental ecological effects. The potential for other cryptic and/or undescribed Beroe species, and history of spread of some species in the genus, give reason for additional study. When alive, morphological hallmarks may be challenging to spot and photograph owing to the animals' transparency and near-constant motion. We sampled and analyzed 109 putative Beroe specimens from Europe, using morphological and molecular approaches. DNA analyses were conducted using cytochrome oxidase 1 and internal transcribed spacer sequences, and together with published sequences from GenBank, phylogenetic relationships of the genus were explored. Our study suggests the presence of at least five genetic lineages of Beroe in Europe, of which three could be assigned to known species: Beroe gracilis Künne 1939; Beroe cucumis Fabricius, 1780 and Beroe ovata sensu Mayer, 1912. The other two lineages (here provisionally named Beroe "norvegica" and Beroe "anatoliensis") did not clearly coincide with any known species, and might therefore reflect new species, but confirmation of this requires further study.
Li, Xi; Zhang, Ti-Cao; Qiao, Qin; Ren, Zhumei; Zhao, Jiayuan; Yonezawa, Takahiro; Hasegawa, Masami; Crabbe, M James C; Li, Jianqiang; Zhong, Yang
2013-01-01
The central function of chloroplasts is to carry out photosynthesis, and its gene content and structure are highly conserved across land plants. Parasitic plants, which have reduced photosynthetic ability, suffer gene losses from the chloroplast (cp) genome accompanied by the relaxation of selective constraints. Compared with the rapid rise in the number of cp genome sequences of photosynthetic organisms, there are limited data sets from parasitic plants. PRINCIPAL FINDINGS/SIGNIFICANCE: Here we report the complete sequence of the cp genome of Cistanche deserticola, a holoparasitic desert species belonging to the family Orobanchaceae. The cp genome of C. deserticola is greatly reduced both in size (102,657 bp) and in gene content, indicating that all genes required for photosynthesis suffer from gene loss and pseudogenization, except for psbM. The striking difference from other holoparasitic plants is that it retains almost a full set of tRNA genes, and it has lower dN/dS for most genes than another close holoparasitic plant, E. virginiana, suggesting that Cistanche deserticola has undergone fewer losses, either due to a reduced level of holoparasitism, or to a recent switch to this life history. We also found that the rpoC2 gene was present in two copies within C. deserticola. Its own copy has much shortened and turned out to be a pseudogene. Another copy, which was not located in its cp genome, was a homolog of the host plant, Haloxylon ammodendron (Chenopodiaceae), suggesting that it was acquired from its host via a horizontal gene transfer.
Brok-Volchanskaya, Vera S.; Kadyrov, Farid A.; Sivogrivov, Dmitry E.; Kolosov, Peter M.; Sokolov, Andrey S.; Shlyapnikov, Michael G.; Kryukov, Valentine M.; Granovsky, Igor E.
2008-01-01
Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3′ 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TψC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages. PMID:18281701
Chang, D D; Clayton, D A
1986-01-01
Transcription of the heavy strand of mouse mitochondrial DNA starts from two closely spaced, distinct sites located in the displacement loop region of the genome. We report here an analysis of regulatory sequences required for faithful transcription from these two sites. Data obtained from in vitro assays demonstrated that a 51-base-pair region, encompassing nucleotides -40 to +11 of the downstream start site, contains sufficient information for accurate transcription from both start sites. Deletion of the 3' flanking sequences, including one or both start sites to -17, resulted in the initiation of transcription by the mitochondrial RNA polymerase from alternative sites within vector DNA sequences. This feature places the mouse heavy-strand promoter uniquely among other known mitochondrial promoters, all of which absolutely require cognate start sites for transcription. Comparison of the heavy-strand promoter with those of other vertebrate mitochondrial DNAs revealed a remarkably high rate of sequence divergence among species. Images PMID:3785226
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
MIPS bacterial genomes functional annotation benchmark dataset.
Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen
2005-05-15
Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab
A comprehensive quality control workflow for paired tumor-normal NGS experiments.
Schroeder, Christopher M; Hilke, Franz J; Löffler, Markus W; Bitzer, Michael; Lenz, Florian; Sturm, Marc
2017-06-01
Quality control (QC) is an important part of all NGS data analysis stages. Many available tools calculate QC metrics from different analysis steps of single sample experiments (raw reads, mapped reads and variant lists). Multi-sample experiments, as sequencing of tumor-normal pairs, require additional QC metrics to ensure validity of results. These multi-sample QC metrics still lack standardization. We therefore suggest a new workflow for QC of DNA sequencing of tumor-normal pairs. With this workflow well-known single-sample QC metrics and additional metrics specific for tumor-normal pairs can be calculated. The segmentation into different tools offers a high flexibility and allows reuse for other purposes. All tools produce qcML, a generic XML format for QC of -omics experiments. qcML uses quality metrics defined in an ontology, which was adapted for NGS. All QC tools are implemented in C ++ and run both under Linux and Windows. Plotting requires python 2.7 and matplotlib. The software is available under the 'GNU General Public License version 2' as part of the ngs-bits project: https://github.com/imgag/ngs-bits. christopher.schroeder@med.uni-tuebingen.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Hawkins, Charlene
2014-01-01
The Est1 (ever shorter telomeres 1) protein is an essential component of yeast telomerase, a ribonucleoprotein complex that restores the repetitive sequences at chromosome ends (telomeres) that would otherwise be lost during DNA replication. Previous work has shown that the telomerase RNA component (TLC1) transits through the cytoplasm during telomerase biogenesis, but mechanisms of protein import have not been addressed. Here we identify three nuclear localization sequences (NLSs) in Est1p. Mutation of the most N-terminal NLS in the context of full-length Est1p reduces Est1p nuclear localization and causes telomere shortening—phenotypes that are rescued by fusion with the NLS from the simian virus 40 (SV40) large-T antigen. In contrast to that of the TLC1 RNA, Est1p nuclear import is facilitated by Srp1p, the yeast homolog of importin α. The reduction in telomere length observed at the semipermissive temperature in a srp1 mutant strain is rescued by increased Est1p expression, consistent with a defect in Est1p nuclear import. These studies suggest that at least two nuclear import pathways are required to achieve normal telomere length homeostasis in yeast. PMID:24906415
Hashikawa, Naoya; Yamamoto, Noritaka; Sakurai, Hiroshi
2007-04-06
The hydrophobic repeat is a conserved structural motif of eukaryotic heat shock transcription factor (HSF) that enables HSF to form a homotrimer. Homotrimeric HSF binds to heat shock elements (HSEs) consisting of three inverted repeats of the sequence nGAAn. Sequences consisting of four or more nGAAn units are bound cooperatively by two HSF trimers. We show that in Saccharomyces cerevisiae cells oligomerization-defective Hsf1 is not able to bind HSEs with three units and is not extensively phosphorylated in response to stress; it is therefore unable to activate genes containing this type of HSE. Several lines of evidence indicate that oligomerization is a prerequisite for stress-induced hyperphosphorylation of Hsf1. In contrast, oligomerization and hyperphosphorylation are not necessary for gene activation via HSEs with four units. Intragenic suppressor screening of oligomerization-defective hsf1 showed that an interface between adjacent DNA-binding domains is important for the binding of Hsf1 to the HSE. We suggest that Saccharomyces cerevisiae HSEs with different structures are regulated differently; HSEs with three units require Hsf1 to be both oligomerized and hyperphosphorylated, whereas HSEs with four or more units do not require either.