Tan, Cheng; Takada, Shoji
2017-01-01
While nucleosome positioning on eukaryotic genome play important roles for genetic regulation, molecular mechanisms of nucleosome positioning and sliding along DNA are not well understood. Here we investigated thermally-activated spontaneous nucleosome sliding mechanisms developing and applying a coarse-grained molecular simulation method that incorporates both long-range electrostatic and short-range hydrogen-bond interactions between histone octamer and DNA. The simulations revealed two distinct sliding modes depending on the nucleosomal DNA sequence. A uniform DNA sequence showed frequent sliding with one base pair step in a rotation-coupled manner, akin to screw-like motions. On the contrary, a strong positioning sequence, the so-called 601 sequence, exhibits rare, abrupt transitions of five and ten base pair steps without rotation. Moreover, we evaluated the importance of hydrogen bond interactions on the sliding mode, finding that strong and weak bonds favor respectively the rotation-coupled and -uncoupled sliding movements. PMID:29194442
Bruce, A. Gregory; Ryan, Jonathan T.; Thomas, Mathew J.; Peng, Xinxia; Grundhoff, Adam; Tsai, Che-Chung
2013-01-01
The complete sequence of retroperitoneal fibromatosis-associated herpesvirus Macaca nemestrina (RFHVMn), the pig-tailed macaque homolog of Kaposi's sarcoma-associated herpesvirus (KSHV), was determined by next-generation sequence analysis of a Kaposi's sarcoma (KS)-like macaque tumor. Colinearity of genes was observed with the KSHV genome, and the core herpesvirus genes had strong sequence homology to the corresponding KSHV genes. RFHVMn lacked homologs of open reading frame 11 (ORF11) and KSHV ORFs K5 and K6, which appear to have been generated by duplication of ORFs K3 and K4 after the divergence of KSHV and RFHV. RFHVMn contained positional homologs of all other unique KSHV genes, although some showed limited sequence similarity. RFHVMn contained a number of candidate microRNA genes. Although there was little sequence similarity with KSHV microRNAs, one candidate contained the same seed sequence as the positional homolog, kshv-miR-K12-10a, suggesting functional overlap. RNA transcript splicing was highly conserved between RFHVMn and KSHV, and strong sequence conservation was noted in specific promoters and putative origins of replication, predicting important functional similarities. Sequence comparisons indicated that RFHVMn and KSHV developed in long-term synchrony with the evolution of their hosts, and both viruses phylogenetically group within the RV1 lineage of Old World primate rhadinoviruses. RFHVMn is the closest homolog of KSHV to be completely sequenced and the first sequenced RV1 rhadinovirus homolog of KSHV from a nonhuman Old World primate. The strong genetic and sequence similarity between RFHVMn and KSHV, coupled with similarities in biology and pathology, demonstrate that RFHVMn infection in macaques offers an important and relevant model for the study of KSHV in humans. PMID:24109218
Sequence and structural analyses of nuclear export signals in the NESdb database
Xu, Darui; Farmer, Alicia; Collett, Garen; Grishin, Nick V.; Chook, Yuh Min
2012-01-01
We compiled >200 nuclear export signal (NES)–containing CRM1 cargoes in a database named NESdb. We analyzed the sequences and three-dimensional structures of natural, experimentally identified NESs and of false-positive NESs that were generated from the database in order to identify properties that might distinguish the two groups of sequences. Analyses of amino acid frequencies, sequence logos, and agreement with existing NES consensus sequences revealed strong preferences for the Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern and for negatively charged amino acids in the nonhydrophobic positions of experimentally identified NESs but not of false positives. Strong preferences against certain hydrophobic amino acids in the hydrophobic positions were also revealed. These findings led to a new and more precise NES consensus. More important, three-dimensional structures are now available for 68 NESs within 56 different cargo proteins. Analyses of these structures showed that experimentally identified NESs are more likely than the false positives to adopt α-helical conformations that transition to loops at their C-termini and more likely to be surface accessible within their protein domains or be present in disordered or unobserved parts of the structures. Such distinguishing features for real NESs might be useful in future NES prediction efforts. Finally, we also tested CRM1-binding of 40 NESs that were found in the 56 structures. We found that 16 of the NES peptides did not bind CRM1, hence illustrating how NESs are easily misidentified. PMID:22833565
Biological function in the twilight zone of sequence conservation.
Ponting, Chris P
2017-08-16
Strong DNA conservation among divergent species is an indicator of enduring functionality. With weaker sequence conservation we enter a vast 'twilight zone' in which sequence subject to transient or lower constraint cannot be distinguished easily from neutrally evolving, non-functional sequence. Twilight zone functional sequence is illuminated instead by principles of selective constraint and positive selection using genomic data acquired from within a species' population. Application of these principles reveals that despite being biochemically active, most twilight zone sequence is not functional.
Silberhorn, Elisabeth; Schwartz, Uwe; Symelka, Anne; de Koning-Ward, Tania; Längst, Gernot
2016-01-01
The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns, like phased di-nucleotide repeats and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the plasmodium histones, compared to other mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the plasmodium nucleosomes are overall less stable, with increased temperature induced mobility, decreased salt stability of the histones H2A and H2B and considerable reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that plasmodium histone octamers form the shortest known nucleosome repeat length (155bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome. PMID:28033404
λ (Δim) -statistical convergence of order α
NASA Astrophysics Data System (ADS)
Colak, Rifat; Et, Mikail; Altin, Yavuz
2017-09-01
In this study, using the generalized difference operator Δim and a sequence λ = (λn) which is a non-decreasing sequence of positive numbers tending to ∞ such that λn+1 ≤ λn+1, λ1 = 1, we introduce the concepts of λ (Δim) -statistical convergence of order α (α ∈ (0, 1]) and strong λ (Δim) -Cesàro summablility of order α (α > 0). We establish some connections between λ (Δim) -statistical convergence of order α and strong λ (Δim) -Cesàro summablility of order α. It is shown that if a sequence is strongly λ (Δim) -Cesàro summable of order α, then it is λ (Δim) -statistically convergent of order β in case 0 < α ≤ β ≤ 1.
Effects of Dopamine Medication on Sequence Learning with Stochastic Feedback in Parkinson's Disease
Seo, Moonsang; Beigi, Mazda; Jahanshahi, Marjan; Averbeck, Bruno B.
2010-01-01
A growing body of evidence suggests that the midbrain dopamine system plays a key role in reinforcement learning and disruption of the midbrain dopamine system in Parkinson's disease (PD) may lead to deficits on tasks that require learning from feedback. We examined how changes in dopamine levels (“ON” and “OFF” their dopamine medication) affect sequence learning from stochastic positive and negative feedback using Bayesian reinforcement learning models. We found deficits in sequence learning in patients with PD when they were “ON” and “OFF” medication relative to healthy controls, but smaller differences between patients “OFF” and “ON”. The deficits were mainly due to decreased learning from positive feedback, although across all participant groups learning was more strongly associated with positive than negative feedback in our task. The learning in our task is likely mediated by the relatively depleted dorsal striatum and not the relatively intact ventral striatum. Therefore, the changes we see in our task may be due to a strong loss of phasic dopamine signals in the dorsal striatum in PD. PMID:20740077
Measurement of student attitudes in first year engineering - A mixed methods approach
NASA Astrophysics Data System (ADS)
Malik, Qaiser Hameed
This research study focused on freshman attitudes towards engineering in a newly implemented cornerstone sequence that emphasized holistic design experiences. The students' initial attitudes and changes in these attitudes were examined with the explanatory mixed methods approach that allows a sequential examination of the target population with two methods, using two sets of data, to investigate the treatment effects. In the quantitative phase, the study compared changes in freshman attitude towards engineering, between the new 'design sequence' group (composed of freshmen in the cornerstone sequence) and the prior 'traditional sequence' group (composed of all other freshmen), over the course of one semester. The data were collected in fall 2008 at two time intervals and changes in the two groups' attitudes were examined with repeated measures analysis of covariance models. The analyses reported here include data from 389 students out of the total population of 722 freshmen. The analyses revealed that engineering freshmen joined the program with positive or strongly positive attitudes towards engineering. Those strong attitudes were durable and resistant to change. Students in the design sequence group had higher ACT scores, enjoyed math and science the most, and did not believe engineering to be an exact science. However, no appreciable time-group interaction was observed. To validate the quantitative results, an interview protocol was developed to investigate initial freshman attitudes and changes, if any, that took place as a result of the new cornerstone sequence. One-on-one interviews with a sample of ten students out of the population of 272 freshmen revealed that freshmen in the cornerstone sequence entered the program full of enthusiasm and idealism, and with strongly positive attitudes towards engineering. The strong motivational factors included parental/teacher influences, childhood motivations, and high school extra-curricular experiences. The participants appreciated the team work and problem solving aspects of engineering; however, they reported negative experiences in the cornerstone sequence. Interestingly, their overall perception about engineering was not affected by any of the negative experiences. The qualitative phase substantiated the belief that strong attitudes are harder to change; they are durable, they have impact, and they are not significantly affected by a short treatment. The results of this mixed methods study indicate that changing student attitudes may not be an easy task. One must develop a better understanding of student attitudes in order to improve understanding of the fine-grained details of curriculum and its implementation to be able to develop more effective cornerstone design courses. Clearly, tight and focused quantitative studies complemented with a qualitative component provide a much broader and deeper insight into the learning that takes place in freshman courses. This research also documents the use of a longitudinal study to track the design sequence group and observe their performance in their junior and senior years. This would provide a better understanding of the long term effects of the new sequence.
Janova, Eva; Matiasovic, Jan; Vahala, Jiri; Vodicka, Roman; Van Dyk, Enette; Horin, Petr
2009-07-01
The major histocompatibility complex genes coding for antigen binding and presenting molecules are the most polymorphic genes in the vertebrate genome. We studied the DRA and DQA gene polymorphism of the family Equidae. In addition to 11 previously reported DRA and 24 DQA alleles, six new DRA sequences and 13 new DQA alleles were identified in the genus Equus. Phylogenetic analysis of both DRA and DQA sequences provided evidence for trans-species polymorphism in the family Equidae. The phylogenetic trees differed from species relationships defined by standard taxonomy of Equidae and from trees based on mitochondrial or neutral gene sequence data. Analysis of selection showed differences between the less variable DRA and more variable DQA genes. DRA alleles were more often shared by more species. The DQA sequences analysed showed strong amongst-species positive selection; the selected amino acid positions mostly corresponded to selected positions in rodent and human DQA genes.
Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong
2016-01-01
Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. PMID:27401230
Complete Genome Sequence of the Dairy Isolate Lactobacillus acidipiscis ACA-DC 1533
Kazou, Maria; Alexandraki, Voula; Pot, Bruno; Tsakalidou, Effie
2017-01-01
ABSTRACT Lactobacillus acidipiscis is a Gram-positive lactic acid bacterium belonging to the Lactobacillus salivarius clade. Here, we present the first complete genome sequence of L. acidipiscis isolated from traditional Greek Kopanisti cheese. Strain ACA-DC 1533 may play a key role in the strong organoleptic characteristics of Kopanisti cheese. PMID:28126948
Complete Genome Sequence of the Dairy Isolate Lactobacillus acidipiscis ACA-DC 1533.
Kazou, Maria; Alexandraki, Voula; Pot, Bruno; Tsakalidou, Effie; Papadimitriou, Konstantinos
2017-01-26
Lactobacillus acidipiscis is a Gram-positive lactic acid bacterium belonging to the Lactobacillus salivarius clade. Here, we present the first complete genome sequence of L. acidipiscis isolated from traditional Greek Kopanisti cheese. Strain ACA-DC 1533 may play a key role in the strong organoleptic characteristics of Kopanisti cheese. Copyright © 2017 Kazou et al.
Dubbed "Tom's T" by Dhruba Chattoraj, the unusually conserved thymine at position +7 in bacteriophage P1 plasmid RepA DNA binding sites rises above repressor and acceptor sequence logos. The T appears to represent base flipping prior to helix opening in this DNA replication initation protein.
Rhythm sensitivity in macaque monkeys
Selezneva, Elena; Deike, Susann; Knyazeva, Stanislava; Scheich, Henning; Brechmann, André; Brosch, Michael
2013-01-01
This study provides evidence that monkeys are rhythm sensitive. We composed isochronous tone sequences consisting of repeating triplets of two short tones and one long tone which humans perceive as repeating triplets of two weak and one strong beat. This regular sequence was compared to an irregular sequence with the same number of randomly arranged short and long tones with no such beat structure. To search for indication of rhythm sensitivity we employed an oddball paradigm in which occasional duration deviants were introduced in the sequences. In a pilot study on humans we showed that subjects more easily detected these deviants when they occurred in a regular sequence. In the monkeys we searched for spontaneous behaviors the animals executed concomitant with the deviants. We found that monkeys more frequently exhibited changes of gaze and facial expressions to the deviants when they occurred in the regular sequence compared to the irregular sequence. In addition we recorded neuronal firing and local field potentials from 175 sites of the primary auditory cortex during sequence presentation. We found that both types of neuronal signals differentiated regular from irregular sequences. Both signals were stronger in regular sequences and occurred after the onset of the long tones, i.e., at the position of the strong beat. Local field potential responses were also significantly larger for the durational deviants in regular sequences, yet in a later time window. We speculate that these temporal pattern-selective mechanisms with a focus on strong beats and their deviants underlie the perception of rhythm in the chosen sequences. PMID:24046732
Using DNA mechanics to predict in vitro nucleosome positions and formation energies
Morozov, Alexandre V.; Fortney, Karissa; Gaykalova, Daria A.; Studitsky, Vasily M.; Widom, Jonathan; Siggia, Eric D.
2009-01-01
In eukaryotic genomes, nucleosomes function to compact DNA and to regulate access to it both by simple physical occlusion and by providing the substrate for numerous covalent epigenetic tags. While competition with other DNA-binding factors and action of chromatin remodeling enzymes significantly affect nucleosome formation in vivo, nucleosome positions in vitro are determined by steric exclusion and sequence alone. We have developed a biophysical model, DNABEND, for the sequence dependence of DNA bending energies, and validated it against a collection of in vitro free energies of nucleosome formation and a set of in vitro nucleosome positions mapped at high resolution. We have also made a first ab initio prediction of nucleosomal DNA geometries, and checked its accuracy against the nucleosome crystal structure. We have used DNABEND to design both strong and weak histone- binding sequences, and measured the corresponding free energies of nucleosome formation. We find that DNABEND can successfully predict in vitro nucleosome positions and free energies, providing a physical explanation for the intrinsic sequence dependence of histone–DNA interactions. PMID:19509309
Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong
2016-10-01
Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P
1984-01-01
We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950
Matsuda, M; Tai, K; Moore, J E; Millar, B C; Murayama, O
2004-01-01
Nucleotide sequencing after TA cloning of the amplicon of the almost-full length recA gene from three strains of UPTC (A1, A2, and A3) isolated from seagulls in Northern Ireland, the phenotypical and genotypical characteristics of which have been demonstrated to be indistinguishable, clarified nucleotide differences at three nucleotide positions among the three strains. In conclusion, the nucleotide sequences of the recA gene were found to discriminate among the three strains of UPTC, A1, A2, and A3, which are indistinguishable phenotypically and genotypically. Thus, the present study strongly suggests that nucleotide sequence data of the amplicon of a suitable gene or region could aid in discriminating among isolates of the UPTC group, which are indistinguishable phenotypically and genotypically. Copyright 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Wu, Yonghua; Hadly, Elizabeth A; Teng, Wenjia; Hao, Yuyang; Liang, Wei; Liu, Yu; Wang, Haitao
2016-09-20
Owls (Strigiformes) represent a fascinating group of birds that are the ecological night-time counterparts to diurnal raptors (Accipitriformes). The nocturnality of owls, unusual within birds, has favored an exceptional visual system that is highly tuned for hunting at night, yet the molecular basis for this adaptation is lacking. Here, using a comparative evolutionary analysis of 120 vision genes obtained by retinal transcriptome sequencing, we found strong positive selection for low-light vision genes in owls, which contributes to their remarkable nocturnal vision. Not surprisingly, we detected gene loss of the violet/ultraviolet-sensitive opsin (SWS1) in all owls we studied, but two other color vision genes, the red-sensitive LWS and the blue-sensitive SWS2, were found to be under strong positive selection, which may be linked to the spectral tunings of these genes toward maximizing photon absorption in crepuscular conditions. We also detected the only other positively selected genes associated with motion detection in falcons and positively selected genes associated with bright-light vision and eye protection in other diurnal raptors (Accipitriformes). Our results suggest the adaptive evolution of vision genes reflect differentiated activity time and distinct hunting behaviors.
Wu, Yonghua; Hadly, Elizabeth A.; Teng, Wenjia; Hao, Yuyang; Liang, Wei; Liu, Yu; Wang, Haitao
2016-01-01
Owls (Strigiformes) represent a fascinating group of birds that are the ecological night-time counterparts to diurnal raptors (Accipitriformes). The nocturnality of owls, unusual within birds, has favored an exceptional visual system that is highly tuned for hunting at night, yet the molecular basis for this adaptation is lacking. Here, using a comparative evolutionary analysis of 120 vision genes obtained by retinal transcriptome sequencing, we found strong positive selection for low-light vision genes in owls, which contributes to their remarkable nocturnal vision. Not surprisingly, we detected gene loss of the violet/ultraviolet-sensitive opsin (SWS1) in all owls we studied, but two other color vision genes, the red-sensitive LWS and the blue-sensitive SWS2, were found to be under strong positive selection, which may be linked to the spectral tunings of these genes toward maximizing photon absorption in crepuscular conditions. We also detected the only other positively selected genes associated with motion detection in falcons and positively selected genes associated with bright-light vision and eye protection in other diurnal raptors (Accipitriformes). Our results suggest the adaptive evolution of vision genes reflect differentiated activity time and distinct hunting behaviors. PMID:27645106
Extracting DNA words based on the sequence features: non-uniform distribution and integrity.
Li, Zhi; Cao, Hongyan; Cui, Yuehua; Zhang, Yanbo
2016-01-25
DNA sequence can be viewed as an unknown language with words as its functional units. Given that most sequence alignment algorithms such as the motif discovery algorithms depend on the quality of background information about sequences, it is necessary to develop an ab initio algorithm for extracting the "words" based only on the DNA sequences. We considered that non-uniform distribution and integrity were two important features of a word, based on which we developed an ab initio algorithm to extract "DNA words" that have potential functional meaning. A Kolmogorov-Smirnov test was used for consistency test of uniform distribution of DNA sequences, and the integrity was judged by the sequence and position alignment. Two random base sequences were adopted as negative control, and an English book was used as positive control to verify our algorithm. We applied our algorithm to the genomes of Saccharomyces cerevisiae and 10 strains of Escherichia coli to show the utility of the methods. The results provide strong evidences that the algorithm is a promising tool for ab initio building a DNA dictionary. Our method provides a fast way for large scale screening of important DNA elements and offers potential insights into the understanding of a genome.
Organization and transient expression of the gene for human U11 snRNA
Clemens, Suter-Crazzolara; Walter, Keller
1991-01-01
The nucleotide sequence of U11 small nuclear RNA, a minor U RNA from HeLa cells, was determined. Computer analysis of the sequence (135 residues) predicts two strong hairpin loops which are separated by seventeen nucleotides containing an Sm binding site (AAUUUUUUGG). A synthetic gene was constructed in which the coding region of U11 RNA is under the control of a T7 promoter. This vector can be used to produce U11 RNA in vitro. Southern hybridization and PCR analysis of HeLa genomic DNA suggest that U11 RNA is encoded by a single copy gene, and that at least three genomic regions could be U11 RNA pseudogenes. A HeLa genomic copy of a U11 gene was isolated by inverted PCR. This gene contains the U11 RNA coding sequence and several sequence elements unique for the U RNA genes. These include a Distal Sequence Element (DSE, ATTTGCATA) present between positions −215 and −223 relative to the start of transcription; a Proximal Sequence Element (PSE, TTCACCTTTACCAAAAATG) located between positions −43 and −63 ; and a 3′box (GTTAGGCGAAATATTA) between positions +150 and +166. Transfection of HeLa cells with this gene revealed that it is functioning in vivo and can produce U11 RNA. PMID:1820214
Location of a major antigenic site involved in Ross River virus neutralization.
Vrati, S; Fernon, C A; Dalgarno, L; Weir, R C
1988-02-01
The location of a major antigenic domain involved in the neutralization of an alphavirus, Ross River virus, has been defined in terms of its position in the amino acid sequence of the E2 glycoprotein. The domain encompasses three topographically close epitopes which were identified using three E2-specific neutralizing monoclonal antibodies in competitive binding assays. Nucleotide sequencing of the structural protein genes of monoclonal antibody-selected antigenic variants showed that for each variant there was a single nucleotide change in the E2 gene leading to a nonconservative amino acid substitution in E2. Changes were at positions 216, 234, and 246-251 in the amino acid sequence. The epitopes are in a region of E2 which, though not strongly conserved as to sequence among Ross River virus, Semliki Forest virus, and Sindbis virus, is conserved in its hydropathy profile among the three alphaviruses. The epitopes lie between two asparagine-linked glycosylation sites (residues 200 and 262) in E2. They are conserved as to position between the mouse virulent T48 strain and the mouse avirulent NB5092 strain.
Strong FGFR3 staining is a marker for FGFR3 fusions in diffuse gliomas
Annala, Matti; Lehtinen, Birgitta; Kesseli, Juha; Haapasalo, Joonas; Ruusuvuori, Pekka; Yli-Harja, Olli; Visakorpi, Tapio; Haapasalo, Hannu; Nykter, Matti; Zhang, Wei
2017-01-01
Abstract Background Inhibitors of fibroblast growth factor receptors (FGFRs) have recently arisen as a promising treatment option for patients with FGFR alterations. Gene fusions involving FGFR3 and transforming acidic coiled-coil protein 3 (TACC3) have been detected in diffuse gliomas and other malignancies, and fusion-positive cases have responded well to FGFR inhibition. As high FGFR3 expression has been detected in fusion-positive tumors, we sought to determine the clinical significance of FGFR3 protein expression level as well as its potential for indicating FGFR3 fusions. Methods We performed FGFR3 immunohistochemistry on tissue microarrays containing 676 grades II–IV astrocytomas and 116 grades II–III oligodendroglial tumor specimens. Fifty-one cases were further analyzed using targeted sequencing. Results Moderate to strong FGFR3 staining was detected in gliomas of all grades, was more common in females, and was associated with poor survival in diffuse astrocytomas. Targeted sequencing identified FGFR3-TACC3 fusions and an FGFR3-CAMK2A fusion in 10 of 15 strongly stained cases, whereas no fusions were found in 36 negatively to moderately stained cases. Fusion-positive cases were predominantly female and negative for IDH and EGFR/PDGFRA/MET alterations. These and moderately stained cases show lower MIB-1 proliferation index than negatively to weakly stained cases. Furthermore, stronger FGFR3 expression was commonly observed in malignant tissue regions of lower cellularity in fusion-negative cases. Importantly, subregional negative FGFR3 staining was also observed in a few fusion-positive cases. Conclusions Strong FGFR3 protein expression is indicative of FGFR3 fusions and may serve as a clinically applicable predictive marker for treatment regimens based on FGFR inhibitors. PMID:28379477
Schultz, Sharon J; Zhang, Miaohua; Champoux, James J
2010-03-19
The RNase H activity of reverse transcriptase is required during retroviral replication and represents a potential target in antiviral drug therapies. Sequence features flanking a cleavage site influence the three types of retroviral RNase H activity: internal, DNA 3'-end-directed, and RNA 5'-end-directed. Using the reverse transcriptases of HIV-1 (human immunodeficiency virus type 1) and Moloney murine leukemia virus (M-MuLV), we evaluated how individual base preferences at a cleavage site direct retroviral RNase H specificity. Strong test cleavage sites (designated as between nucleotide positions -1 and +1) for the HIV-1 and M-MuLV enzymes were introduced into model hybrid substrates designed to assay internal or DNA 3'-end-directed cleavage, and base substitutions were tested at specific nucleotide positions. For internal cleavage, positions +1, -2, -4, -5, -10, and -14 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV significantly affected RNase H cleavage efficiency, while positions -7 and -12 for HIV-1 and positions -4, -9, and -11 for M-MuLV had more modest effects. DNA 3'-end-directed cleavage was influenced substantially by positions +1, -2, -4, and -5 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV. Cleavage-site distance from the recessed end did not affect sequence preferences for M-MuLV reverse transcriptase. Based on the identified sequence preferences, a cleavage site recognized by both HIV-1 and M-MuLV enzymes was introduced into a sequence that was otherwise resistant to RNase H. The isolated RNase H domain of M-MuLV reverse transcriptase retained sequence preferences at positions +1 and -2 despite prolific cleavage in the absence of the polymerase domain. The sequence preferences of retroviral RNase H likely reflect structural features in the substrate that favor cleavage and represent a novel specificity determinant to consider in drug design. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Bonen, Linda; Boer, Poppo H.; Gray, Michael W.
1984-01-01
We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
Gerbod, D; Edgcomb, V P; Noël, C; Delgado-Viscogliosi, P; Viscogliosi, E
2000-09-01
Small subunit rDNA genes were amplified by polymerase chain reaction using specific primers from mixed-population DNA obtained from the whole hindgut of the termite Calotermes flavicollis. Comparative sequence analysis of the clones revealed two kinds of sequences that were both from parabasalid symbionts. In a molecular tree inferred by distance, parsimony and likelihood methods, and including 27 parabasalid sequences retrieved from the data bases, the sequences of the group II (clones Cf5 and Cf6) were closely related to the Devescovinidae/Calonymphidae species and thus were assigned to the Devescovinidae Foaina. The sequence of the group I (clone Cf1) emerged within the Trichomonadinae and strongly clustered with Tetratrichomonas gallinarum. On the basis of morphological data, the Monocercomonadidae Hexamastix termitis might be the most likely origin of this sequence.
Rahal, M; Kervaire, B; Villard, J; Tiercy, J-M
2008-03-01
Human leukocyte antigen (HLA) typing by polymerase chain reaction-sequence-specific oligonucleotide (PCR-SSO) hybridization on solid phase (microbead assay) or polymerase chain reaction-sequence-specific primers (PCR-SSP) requires interpretation softwares to detect all possible allele combinations. These programs propose allele calls by taking into account false-positive or false-negative signal(s). The laboratory has the option to validate typing results in the presence of strongly cross-reacting or apparent false-negative signals. Alternatively, these seemingly aberrant signals may disclose novel variants. We report here four new HLA-B (B*5620 and B*5716) and HLA-DRB1 alleles (DRB1*110107 and DRB1*1474) that were detected by apparent false-negative or -positive hybridization or amplification patterns, and ultimately resolved by sequencing. To avoid allele misassignments, a comprehensive evaluation of acquired data as documented in a quality assurance system is therefore required to confirm unambiguous typing interpretation.
Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes
2015-01-01
Background It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. Results Here we performed the first large scale comprehensive genomics analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype) aggregated of positions with a significant conservation of folding energy signals (related to partially overlapping local genomic regions) were recognized. In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. Conclusions The fact that many of the positions with significant folding related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti- Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutations rates. PMID:26449467
Microbial evolution of sulphate reduction when lateral gene transfer is geographically restricted.
Chi Fru, E
2011-07-01
Lateral gene transfer (LGT) is an important mechanism by which micro-organisms acquire new functions. This process has been suggested to be central to prokaryotic evolution in various environments. However, the influence of geographical constraints on the evolution of laterally acquired genes in microbial metabolic evolution is not yet well understood. In this study, the influence of geographical isolation on the evolution of laterally acquired dissimilatory sulphite reductase (dsr) gene sequences in the sulphate-reducing micro-organisms (SRM) was investigated. Sequences on four continental blocks related to SRM known to have received dsr by LGT were analysed using standard phylogenetic and multidimensional statistical methods. Sequences related to lineages with large genetic diversity correlated positively with habitat divergence. Those affiliated to Thermodesulfobacterium indicated strong biogeographical delineation; hydrothermal-vent sequences clustered independently from hot-spring sequences. Some of the hydrothermal-vent and hot-spring sequences suggested to have been acquired from a common ancestral source may have diverged upon isolation within distinct habitats. In contrast, analysis of some Desulfotomaculum sequences indicated they could have been transferred from different ancestral sources but converged upon isolation within the same niche. These results hint that, after lateral acquisition of dsr genes, barriers to gene flow probably play a strong role in their subsequent evolution.
Våge, D I; Nieminen, M; Anderson, D G; Røed, K H
2014-10-01
The protein-coding region of melanocortin 1 receptor (MC1R) was sequenced to identify potential variation affecting coat color in reindeer (Rangifer tarandus). A T→C sequence variation at nucleotide position 218 (c.218T>C) causing an amino acid (aa) change from methionine to threonine at aa position 73 (p.Met73Thr) was identified. In addition, a T→G sequence variation was found at nucleotide position 839 (c.839T>G), causing phenylalanine to be exchanged by cysteine at aa position 280 (p.Phe280Cys). The two sequence variants (c.218C and c.839G) were found to be closely associated with a darker belly coat compared with animals not having any of these two variants. The aa acid change p.Met73Thr affects the same position as p.Met73Lys previously reported to give constitutive activation of MC1R in black sheep (Ovis aries), whereas p.Phe280Cys is identical to one of two variants previously reported to be associated with dark coat color in Arctic fox (Alopex lagopus), supporting that the two variants found in reindeer are functional. The complete absence of Thr73 and Cys280 among the 51 wild reindeer analyzed provides some evidence that these variants are more common in the domestic herds. © 2014 Stichting International Foundation for Animal Genetics.
EEG potentials associated with artificial grammar learning in the primate brain.
Attaheri, Adam; Kikuchi, Yukiko; Milne, Alice E; Wilson, Benjamin; Alter, Kai; Petkov, Christopher I
2015-09-01
Electroencephalography (EEG) has identified human brain potentials elicited by Artificial Grammar (AG) learning paradigms, which present participants with rule-based sequences of stimuli. Nonhuman animals are sensitive to certain AGs; therefore, evaluating which EEG Event Related Potentials (ERPs) are associated with AG learning in nonhuman animals could identify evolutionarily conserved processes. We recorded EEG potentials during an auditory AG learning experiment in two Rhesus macaques. The animals were first exposed to sequences of nonsense words generated by the AG. Then surface-based ERPs were recorded in response to sequences that were 'consistent' with the AG and 'violation' sequences containing illegal transitions. The AG violations strongly modulated an early component, potentially homologous to the Mismatch Negativity (mMMN), a P200 and a late frontal positivity (P500). The macaque P500 is similar in polarity and time of occurrence to a late EEG positivity reported in human AG learning studies but might differ in functional role. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Working Memory Replay Prioritizes Weakly Attended Events.
Jafarpour, Anna; Penny, Will; Barnes, Gareth; Knight, Robert T; Duzel, Emrah
2017-01-01
One view of working memory posits that maintaining a series of events requires their sequential and equal mnemonic replay. Another view is that the content of working memory maintenance is prioritized by attention. We decoded the dynamics for retaining a sequence of items using magnetoencephalography, wherein participants encoded sequences of three stimuli depicting a face, a manufactured object, or a natural item and maintained them in working memory for 5000 ms. Memory for sequence position and stimulus details were probed at the end of the maintenance period. Decoding of brain activity revealed that one of the three stimuli dominated maintenance independent of its sequence position or category; and memory was enhanced for the selectively replayed stimulus. Analysis of event-related responses during the encoding of the sequence showed that the selectively replayed stimuli were determined by the degree of attention at encoding. The selectively replayed stimuli had the weakest initial encoding indexed by weaker visual attention signals at encoding. These findings do not rule out sequential mnemonic replay but reveal that attention influences the content of working memory maintenance by prioritizing replay of weakly encoded events. We propose that the prioritization of weakly encoded stimuli protects them from interference during the maintenance period, whereas the more strongly encoded stimuli can be retrieved from long-term memory at the end of the delay period.
Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong
2009-03-31
The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of Megaleranthis chloroplast genomes are substantially different from that of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to an expanded IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus evidence 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions, respectively. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate the positional effects of the evolutionary modes of chloroplast genes. The genes evidencing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on the two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of the Megaleranthis to the Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculiforia to Trollius chosenensis Ohwi.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N
2014-03-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.
2014-01-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Exploring the sequence-function relationship in transcriptional regulation by the lac O1 operator.
Maity, Tuhin S; Jha, Ramesh K; Strauss, Charlie E M; Dunbar, John
2012-07-01
Understanding how binding of a transcription factor to an operator is influenced by the operator sequence is an ongoing quest. It facilitates discovery of alternative binding sites as well as tuning of transcriptional regulation. We investigated the behavior of the Escherichia coli Lac repressor (LacI) protein with a large set of lac O(1) operator variants. The 114 variants examined contained a mean of 2.9 (range 0-4) mutations at positions -4, -2, +2 and +4 in the minimally required 17 bp operator. The relative affinity of LacI for the operators was examined by quantifying expression of a GFP reporter gene and Rosetta structural modeling. The combinations of mutations in the operator sequence created a wide range of regulatory behaviors. We observed variations in the GFP fluorescent signal among the operator variants of more than an order of magnitude under both uninduced and induced conditions. We found that a single nucleotide change may result in changes of up to six- and 12-fold in uninduced and induced GFP signals, respectively. Among the four positions mutated, we found that nucleotide G at position -4 is strongly correlated with strong repression. By Rosetta modeling, we found a significant correlation between the calculated binding energy and the experimentally observed transcriptional repression strength for many operators. However, exceptions were also observed, underscoring the necessity for further improvement in biophysical models of protein-DNA interactions. © 2012 The Authors Journal compilation © 2012 FEBS.
Efficient error correction for next-generation sequencing of viral amplicons
2012-01-01
Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm PMID:22759430
Efficient error correction for next-generation sequencing of viral amplicons.
Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury
2012-06-25
Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
Accelerated Evolution of the ASPM Gene Controlling Brain Size Begins Prior to Human Brain Expansion
Solomon, Gregory; Gersch, William; Yoon, Young-Ho; Collura, Randall; Ruvolo, Maryellen; Barrett, J. Carl; Woods, C. Geoffrey; Walsh, Christopher A
2004-01-01
Primary microcephaly (MCPH) is a neurodevelopmental disorder characterized by global reduction in cerebral cortical volume. The microcephalic brain has a volume comparable to that of early hominids, raising the possibility that some MCPH genes may have been evolutionary targets in the expansion of the cerebral cortex in mammals and especially primates. Mutations in ASPM, which encodes the human homologue of a fly protein essential for spindle function, are the most common known cause of MCPH. Here we have isolated large genomic clones containing the complete ASPM gene, including promoter regions and introns, from chimpanzee, gorilla, orangutan, and rhesus macaque by transformation-associated recombination cloning in yeast. We have sequenced these clones and show that whereas much of the sequence of ASPM is substantially conserved among primates, specific segments are subject to high Ka/Ks ratios (nonsynonymous/synonymous DNA changes) consistent with strong positive selection for evolutionary change. The ASPM gene sequence shows accelerated evolution in the African hominoid clade, and this precedes hominid brain expansion by several million years. Gorilla and human lineages show particularly accelerated evolution in the IQ domain of ASPM. Moreover, ASPM regions under positive selection in primates are also the most highly diverged regions between primates and nonprimate mammals. We report the first direct application of TAR cloning technology to the study of human evolution. Our data suggest that evolutionary selection of specific segments of the ASPM sequence strongly relates to differences in cerebral cortical size. PMID:15045028
Aleza, Pablo; Cuenca, José; Hernández, María; Juárez, José; Navarro, Luis; Ollitrault, Patrick
2015-03-08
Mapping centromere locations in plant species provides essential information for the analysis of genetic structures and population dynamics. The centromere's position affects the distribution of crossovers along a chromosome and the parental heterozygosity restitution by 2n gametes is a direct function of the genetic distance to the centromere. Sexual polyploidisation is relatively frequent in Citrus species and is widely used to develop new seedless triploid cultivars. The study's objectives were to (i) map the positions of the centromeres of the nine Citrus clementina chromosomes; (ii) analyse the crossover interference in unreduced gametes; and (iii) establish the pattern of genetic recombination in haploid clementine gametes along each chromosome and its relationship with the centromere location and distribution of genic sequences. Triploid progenies were derived from unreduced megagametophytes produced by second-division restitution. Centromere positions were mapped genetically for all linkage groups using half-tetrad analysis. Inference of the physical locations of centromeres revealed one acrocentric, four metacentric and four submetacentric chromosomes. Crossover interference was observed in unreduced gametes, with variation seen between chromosome arms. For haploid gametes, a strong decrease in the recombination rate occurred in centromeric and pericentromeric regions, which contained a low density of genic sequences. In chromosomes VIII and IX, these low recombination rates extended beyond the pericentromeric regions. The genomic region corresponding to a genetic distance < 5cM from a centromere represented 47% of the genome and 23% of the genic sequences. The centromere positions of the nine citrus chromosomes were genetically mapped. Their physical locations, inferred from the genetic ones, were consistent with the sequence constitution and recombination pattern along each chromosome. However, regions with low recombination rates extended beyond the pericentromeric regions of some chromosomes into areas richer in genic sequences. The persistence of strong linkage disequilibrium between large numbers of genes promotes the stability of epistatic interactions and multilocus-controlled traits over successive generations but also maintains multi-trait associations. Identification of the centromere positions will allow the development of simple methods to analyse unreduced gamete formation mechanisms in a large range of genotypes and further modelling of genetic inheritance in sexual polyploidisation breeding schemes.
Sequence specificity of single-stranded DNA-binding proteins: a novel DNA microarray approach
Morgan, Hugh P.; Estibeiro, Peter; Wear, Martin A.; Max, Klaas E.A.; Heinemann, Udo; Cubeddu, Liza; Gallagher, Maurice P.; Sadler, Peter J.; Walkinshaw, Malcolm D.
2007-01-01
We have developed a novel DNA microarray-based approach for identification of the sequence-specificity of single-stranded nucleic-acid-binding proteins (SNABPs). For verification, we have shown that the major cold shock protein (CspB) from Bacillus subtilis binds with high affinity to pyrimidine-rich sequences, with a binding preference for the consensus sequence, 5′-GTCTTTG/T-3′. The sequence was modelled onto the known structure of CspB and a cytosine-binding pocket was identified, which explains the strong preference for a cytosine base at position 3. This microarray method offers a rapid high-throughput approach for determining the specificity and strength of ss DNA–protein interactions. Further screening of this newly emerging family of transcription factors will help provide an insight into their cellular function. PMID:17488853
Working Memory Replay Prioritizes Weakly Attended Events
Penny, Will; Knight, Robert T.; Duzel, Emrah
2017-01-01
Abstract One view of working memory posits that maintaining a series of events requires their sequential and equal mnemonic replay. Another view is that the content of working memory maintenance is prioritized by attention. We decoded the dynamics for retaining a sequence of items using magnetoencephalography, wherein participants encoded sequences of three stimuli depicting a face, a manufactured object, or a natural item and maintained them in working memory for 5000 ms. Memory for sequence position and stimulus details were probed at the end of the maintenance period. Decoding of brain activity revealed that one of the three stimuli dominated maintenance independent of its sequence position or category; and memory was enhanced for the selectively replayed stimulus. Analysis of event-related responses during the encoding of the sequence showed that the selectively replayed stimuli were determined by the degree of attention at encoding. The selectively replayed stimuli had the weakest initial encoding indexed by weaker visual attention signals at encoding. These findings do not rule out sequential mnemonic replay but reveal that attention influences the content of working memory maintenance by prioritizing replay of weakly encoded events. We propose that the prioritization of weakly encoded stimuli protects them from interference during the maintenance period, whereas the more strongly encoded stimuli can be retrieved from long-term memory at the end of the delay period. PMID:28824955
Kodama, T; Mori, K; Kawahara, T; Ringler, D J; Desrosiers, R C
1993-01-01
One rhesus macaque displayed severe encephalomyelitis and another displayed severe enterocolitis following infection with molecularly cloned simian immunodeficiency virus (SIV) strain SIVmac239. Little or no free anti-SIV antibody developed in these two macaques, and they died relatively quickly (4 to 6 months) after infection. Manifestation of the tissue-specific disease in these macaques was associated with the emergence of variants with high replicative capacity for macrophages and primary infection of tissue macrophages. The nature of sequence variation in the central region (vif, vpr, and vpx), the env gene, and the nef long terminal repeat (LTR) region in brain, colon, and other tissues was examined to see whether specific genetic changes were associated with SIV replication in brain or gut. Sequence analysis revealed strong conservation of the intergenic central region, nef, and the LTR. However, analysis of env sequences in these two macaques and one other revealed significant, interesting patterns of sequence variation. (i) Changes in env that were found previously to contribute to the replicative ability of SIVmac for macrophages in culture were present in the tissues of these animals. (ii) The greatest variability was located in the regions between V1 and V2 and from "V3" through C3 in gp120, which are different in location from the variable regions observed previously in animals with strong antibody responses and long-term persistent infection. (iii) The predominant sequence change of D-->N at position 385 in C3 is most surprising, since this change in both SIV and human immunodeficiency virus type 1 has been associated with dramatically diminished affinity for CD4 and replication in vitro. (iv) The nature of sequence changes at some positions (146, 178, 345, 385, and "V3") suggests that viral replication in brain and gut may be facilitated by specific sequence changes in env in addition to those that impart a general ability to replicate well in macrophages. These results demonstrate that complex selective pressures, including immune responses and varying cell and tissue specificity, can influence the nature of sequence changes in env. Images PMID:8411355
Structure and expression of the attacin genes in Hyalophora cecropia.
Sun, S C; Lindström, I; Lee, J Y; Faye, I
1991-02-26
To study the regulation of the immune genes in insects, we have cloned and sequenced the attacin gene locus of the giant silk moth Hyalophora cecropia. The locus contains one acidic and one basic attacin gene as well as two pseudogenes, which are remnants of basic attacin genes. A small insertion element was found within the locus. The two functional attacin genes are transcribed in opposite directions and have two introns inserted at homologous positions. A common sequence, GGGGATTCCT, is found at nucleotide position -48 in the acidic gene and at nucleotide position -58 in the basic gene. Interestingly, this decanucleotide is similar to the consensus of the NF-k B-binding site. Expression studies revealed that both attacins are strongly induced by phorbol 12-myristate 13-acetate, lipopolysaccharide and bacteria. However, only the acidic attacin gene showed a clear response to injury.
Rehearsal in immediate serial recall.
Tan, Lydia; Ward, Geoff
2008-06-01
We report for the first time overt rehearsal data in immediate serial recall (ISR) undertaken at three presentation rates (1, 2.5, and 5 sec/word). Two groups of participants saw lists of six words for ISR and were required either to engage in overt rehearsal or to remain silent after reading aloud the word list during its presentation. Typical ISR serial position effects were obtained for both groups, and recall increased with slower rates. When participants rehearsed, they tended to do so in a cumulative forward order up to Serial Position 4, after which the amount of rehearsal decreased substantially. There were similarities between rehearsal and recall data: Both broke down toward the end of longer sequences, and there were strong positive correlations between the maximum sequence of participants' rehearsals and their ISR performance. We interpret these data as suggesting that similar mechanisms underpin both rehearsal and recall in ISR.
PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.
Elati, Mohamed; Nicolle, Rémy; Junier, Ivan; Fernández, David; Fekih, Rim; Font, Julio; Képès, François
2013-02-01
Conventional approaches to predict transcriptional regulatory interactions usually rely on the definition of a shared motif sequence on the target genes of a transcription factor (TF). These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices, which may match large numbers of sites and produce an unreliable list of target genes. To improve the prediction of binding sites, we propose to additionally use the unrelated knowledge of the genome layout. Indeed, it has been shown that co-regulated genes tend to be either neighbors or periodically spaced along the whole chromosome. This study demonstrates that respective gene positioning carries significant information. This novel type of information is combined with traditional sequence information by a machine learning algorithm called PreCisIon. To optimize this combination, PreCisIon builds a strong gene target classifier by adaptively combining weak classifiers based on either local binding sequence or global gene position. This strategy generically paves the way to the optimized incorporation of any future advances in gene target prediction based on local sequence, genome layout or on novel criteria. With the current state of the art, PreCisIon consistently improves methods based on sequence information only. This is shown by implementing a cross-validation analysis of the 20 major TFs from two phylogenetically remote model organisms. For Bacillus subtilis and Escherichia coli, respectively, PreCisIon achieves on average an area under the receiver operating characteristic curve of 70 and 60%, a sensitivity of 80 and 70% and a specificity of 60 and 56%. The newly predicted gene targets are demonstrated to be functionally consistent with previously known targets, as assessed by analysis of Gene Ontology enrichment or of the relevant literature and databases.
Emergence of good conduct, scaling and zipf laws in human behavioral sequences in an online world.
Thurner, Stefan; Szell, Michael; Sinatra, Roberta
2012-01-01
We study behavioral action sequences of players in a massive multiplayer online game. In their virtual life players use eight basic actions which allow them to interact with each other. These actions are communication, trade, establishing or breaking friendships and enmities, attack, and punishment. We measure the probabilities for these actions conditional on previous taken and received actions and find a dramatic increase of negative behavior immediately after receiving negative actions. Similarly, positive behavior is intensified by receiving positive actions. We observe a tendency towards antipersistence in communication sequences. Classifying actions as positive (good) and negative (bad) allows us to define binary 'world lines' of lives of individuals. Positive and negative actions are persistent and occur in clusters, indicated by large scaling exponents α ~ 0.87 of the mean square displacement of the world lines. For all eight action types we find strong signs for high levels of repetitiveness, especially for negative actions. We partition behavioral sequences into segments of length n (behavioral 'words' and 'motifs') and study their statistical properties. We find two approximate power laws in the word ranking distribution, one with an exponent of κ ~ -1 for the ranks up to 100, and another with a lower exponent for higher ranks. The Shannon n-tuple redundancy yields large values and increases in terms of word length, further underscoring the non-trivial statistical properties of behavioral sequences. On the collective, societal level the timeseries of particular actions per day can be understood by a simple mean-reverting log-normal model.
Sequences Associated with Centromere Competency in the Human Genome
Hayden, Karen E.; Strome, Erin D.; Merrett, Stephanie L.; Lee, Hye-Ran; Rudd, M. Katharine
2013-01-01
Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance. PMID:23230266
Caballero, Javier; Peralta, Cecilia; Molla, Antonella; Del Valle, Eleodoro E; Caballero, Primitivo; Berry, Colin; Felipe, Verónica; Yaryura, Pablo; Palma, Leopoldo
2018-01-01
Bacillus cereus is a gram-positive, spore-forming bacterium possessing an important and historical record as a human-pathogenic bacterium. However, several strains of this species exhibit interesting potential to be used as plant growth-promoting rhizobacteria. Here, we report the draft genome sequence of B. cereus strain CITVM-11.1, which consists of 37 contig sequences, accounting for 5,746,486 bp (with a GC content of 34.8%) and 5,752 predicted protein-coding sequences. Several of them could potentially be involved in plant-bacterium interactions and may contribute to the strong antagonistic activity shown by this strain against the charcoal root rot fungus, Macrophomina phaseolina. This genomic sequence also showed a number of genes that may confer this strain resistance against several polluting heavy metals and for the bioconversion of mycotoxins. © 2018 S. Karger AG, Basel.
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
Strong Purifying Selection at Synonymous Sites in D. melanogaster
Lawrie, David S.; Messer, Philipp W.; Hershberg, Ruth; Petrov, Dmitri A.
2013-01-01
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated. PMID:23737754
A multigene phylogeny of Olpidium and its implications for early fungal evolution
2011-01-01
Background From a common ancestor with animals, the earliest fungi inherited flagellated zoospores for dispersal in water. Terrestrial fungi lost all flagellated stages and reproduce instead with nonmotile spores. Olpidium virulentus (= Olpidium brassicae), a unicellular fungus parasitizing vascular plant root cells, seemed anomalous. Although Olpidium produces zoospores, in previous phylogenetic studies it appeared nested among the terrestrial fungi. Its position was based mainly on ribosomal gene sequences and was not strongly supported. Our goal in this study was to use amino acid sequences from four genes to reconstruct the branching order of the early-diverging fungi with particular emphasis on the position of Olpidium. Results We concatenated sequences from the Ef-2, RPB1, RPB2 and actin loci for maximum likelihood and Bayesian analyses. In the resulting trees, Olpidium virulentus, O. bornovanus and non-flagellated terrestrial fungi formed a strongly supported clade. Topology tests rejected monophyly of the Olpidium species with any other clades of flagellated fungi. Placing Olpidium at the base of terrestrial fungi was also rejected. Within the terrestrial fungi, Olpidium formed a monophyletic group with the taxa traditionally classified in the phylum Zygomycota. Within Zygomycota, Mucoromycotina was robustly monophyletic. Although without bootstrap support, Monoblepharidomycetes, a small class of zoosporic fungi, diverged from the basal node in Fungi. The zoosporic phylum Blastocladiomycota appeared as the sister group to the terrestrial fungi plus Olpidium. Conclusions This study provides strong support for Olpidium as the closest living flagellated relative of the terrestrial fungi. Appearing nested among hyphal fungi, Olpidium's unicellular thallus may have been derived from ancestral hyphae. Early in their evolution, terrestrial hyphal fungi may have reproduced with zoospores. PMID:22085768
Bramsen, Jesper B.; Pakula, Malgorzata M.; Hansen, Thomas B.; Bus, Claus; Langkjær, Niels; Odadzic, Dalibor; Smicius, Romualdas; Wengel, Suzy L.; Chattopadhyaya, Jyoti; Engels, Joachim W.; Herdewijn, Piet; Wengel, Jesper; Kjems, Jørgen
2010-01-01
Small interfering RNAs (siRNAs) are now established as the preferred tool to inhibit gene function in mammalian cells yet trigger unintended gene silencing due to their inherent miRNA-like behavior. Such off-target effects are primarily mediated by the sequence-specific interaction between the siRNA seed regions (position 2–8 of either siRNA strand counting from the 5′-end) and complementary sequences in the 3′UTR of (off-) targets. It was previously shown that chemical modification of siRNAs can reduce off-targeting but only very few modifications have been tested leaving more to be identified. Here we developed a luciferase reporter-based assay suitable to monitor siRNA off-targeting in a high throughput manner using stable cell lines. We investigated the impact of chemically modifying single nucleotide positions within the siRNA seed on siRNA function and off-targeting using 10 different types of chemical modifications, three different target sequences and three siRNA concentrations. We found several differently modified siRNAs to exercise reduced off-targeting yet incorporation of the strongly destabilizing unlocked nucleic acid (UNA) modification into position 7 of the siRNA most potently reduced off-targeting for all tested sequences. Notably, such position-specific destabilization of siRNA–target interactions did not significantly reduce siRNA potency and is therefore well suited for future siRNA designs especially for applications in vivo where siRNA concentrations, expectedly, will be low. PMID:20453030
2012-01-01
Background RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Results Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. Conclusions This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates. PMID:22985019
Robles, José A; Qureshi, Sumaira E; Stephen, Stuart J; Wilson, Susan R; Burden, Conrad J; Taylor, Jennifer M
2012-09-17
RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates.
Parente, Daniel J; Ray, J Christian J; Swint-Kruse, Liskin
2015-12-01
As proteins evolve, amino acid positions key to protein structure or function are subject to mutational constraints. These positions can be detected by analyzing sequence families for amino acid conservation or for coevolution between pairs of positions. Coevolutionary scores are usually rank-ordered and thresholded to reveal the top pairwise scores, but they also can be treated as weighted networks. Here, we used network analyses to bypass a major complication of coevolution studies: For a given sequence alignment, alternative algorithms usually identify different, top pairwise scores. We reconciled results from five commonly-used, mathematically divergent algorithms (ELSC, McBASC, OMES, SCA, and ZNMI), using the LacI/GalR and 1,6-bisphosphate aldolase protein families as models. Calculations used unthresholded coevolution scores from which column-specific properties such as sequence entropy and random noise were subtracted; "central" positions were identified by calculating various network centrality scores. When compared among algorithms, network centrality methods, particularly eigenvector centrality, showed markedly better agreement than comparisons of the top pairwise scores. Positions with large centrality scores occurred at key structural locations and/or were functionally sensitive to mutations. Further, the top central positions often differed from those with top pairwise coevolution scores: instead of a few strong scores, central positions often had multiple, moderate scores. We conclude that eigenvector centrality calculations reveal a robust evolutionary pattern of constraints-detectable by divergent algorithms--that occur at key protein locations. Finally, we discuss the fact that multiple patterns coexist in evolutionary data that, together, give rise to emergent protein functions. © 2015 Wiley Periodicals, Inc.
The genetic basis of adaptive pigmentation variation in Drosophila melanogaster.
Pool, John E; Aquadro, Charles F
2007-07-01
In a broad survey of Drosophila melanogaster population samples, levels of abdominal pigmentation were found to be highly variable and geographically differentiated. A strong positive correlation was found between dark pigmentation and high altitude, suggesting adaptation to specific environments. DNA sequence polymorphism at the candidate gene ebony revealed a clear association with the pigmentation of homozygous third chromosome lines. The darkest lines sequenced had nearly identical haplotypes spanning 14.5 kb upstream of the protein-coding exons of ebony. Thus, natural selection may have elevated the frequency of an allele that confers dark abdominal pigmentation by influencing the regulation of ebony.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.
2015-10-15
Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helixmore » bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).« less
Bausher, Michael G; Singh, Nameirakpam D; Lee, Seung-Bum; Jansen, Robert K; Daniell, Henry
2006-01-01
Background The production of Citrus, the largest fruit crop of international economic value, has recently been imperiled due to the introduction of the bacterial disease Citrus canker. No significant improvements have been made to combat this disease by plant breeding and nuclear transgenic approaches. Chloroplast genetic engineering has a number of advantages over nuclear transformation; it not only increases transgene expression but also facilitates transgene containment, which is one of the major impediments for development of transgenic trees. We have sequenced the Citrus chloroplast genome to facilitate genetic improvement of this crop and to assess phylogenetic relationships among major lineages of angiosperms. Results The complete chloroplast genome sequence of Citrus sinensis is 160,129 bp in length, and contains 133 genes (89 protein-coding, 4 rRNAs and 30 distinct tRNAs). Genome organization is very similar to the inferred ancestral angiosperm chloroplast genome. However, in Citrus the infA gene is absent. The inverted repeat region has expanded to duplicate rps19 and the first 84 amino acids of rpl22. The rpl22 gene in the IRb region has a nonsense mutation resulting in 9 stop codons. This was confirmed by PCR amplification and sequencing using primers that flank the IR/LSC boundaries. Repeat analysis identified 29 direct and inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Comparison of protein-coding sequences with expressed sequence tags revealed six putative RNA edits, five of which resulted in non-synonymous modifications in petL, psbH, ycf2 and ndhA. Phylogenetic analyses using maximum parsimony (MP) and maximum likelihood (ML) methods of a dataset composed of 61 protein-coding genes for 30 taxa provide strong support for the monophyly of several major clades of angiosperms, including monocots, eudicots, rosids and asterids. The MP and ML trees are incongruent in three areas: the position of Amborella and Nymphaeales, relationship of the magnoliid genus Calycanthus, and the monophyly of the eurosid I clade. Both MP and ML trees provide strong support for the monophyly of eurosids II and for the placement of Citrus (Sapindales) sister to a clade including the Malvales/Brassicales. Conclusion This is the first complete chloroplast genome sequence for a member of the Rutaceae and Sapindales. Expansion of the inverted repeat region to include rps19 and part of rpl22 and presence of two truncated copies of rpl22 is unusual among sequenced chloroplast genomes. Availability of a complete Citrus chloroplast genome sequence provides valuable information on intergenic spacer regions and endogenous regulatory sequences for chloroplast genetic engineering. Phylogenetic analyses resolve relationships among several major clades of angiosperms and provide strong support for the monophyly of the eurosid II clade and the position of the Sapindales sister to the Brassicales/Malvales. PMID:17010212
DNA sequence templates adjacent nucleosome and ORC sites at gene amplification origins in Drosophila
Liu, Jun; Zimmer, Kurt; Rusch, Douglas B.; Paranjape, Neha; Podicheti, Ram; Tang, Haixu; Calvi, Brian R.
2015-01-01
Eukaryotic origins of DNA replication are bound by the origin recognition complex (ORC), which scaffolds assembly of a pre-replicative complex (pre-RC) that is then activated to initiate replication. Both pre-RC assembly and activation are strongly influenced by developmental changes to the epigenome, but molecular mechanisms remain incompletely defined. We have been examining the activation of origins responsible for developmental gene amplification in Drosophila. At a specific time in oogenesis, somatic follicle cells transition from genomic replication to a locus-specific replication from six amplicon origins. Previous evidence indicated that these amplicon origins are activated by nucleosome acetylation, but how this affects origin chromatin is unknown. Here, we examine nucleosome position in follicle cells using micrococcal nuclease digestion with Ilumina sequencing. The results indicate that ORC binding sites and other essential origin sequences are nucleosome-depleted regions (NDRs). Nucleosome position at the amplicons was highly similar among developmental stages during which ORC is or is not bound, indicating that being an NDR is not sufficient to specify ORC binding. Importantly, the data suggest that nucleosomes and ORC have opposite preferences for DNA sequence and structure. We propose that nucleosome hyperacetylation promotes pre-RC assembly onto adjacent DNA sequences that are disfavored by nucleosomes but favored by ORC. PMID:26227968
Gussakovsky, Daniel; Neustaeter, Haley; Spicer, Victor; Krokhin, Oleg V
2017-11-07
The development of a peptide retention prediction model for strong cation exchange (SCX) separation on a Polysulfoethyl A column is reported. Off-line 2D LC-MS/MS analysis (SCX-RPLC) of S. cerevisiae whole cell lysate was used to generate a retention dataset of ∼30 000 peptides, sufficient for identifying the major sequence-specific features of peptide retention mechanisms in SCX. In contrast to RPLC/hydrophilic interaction liquid chromatography (HILIC) separation modes, where retention is driven by hydrophobic/hydrophilic contributions of all individual residues, SCX interactions depend mainly on peptide charge (number of basic residues at acidic pH) and size. An additive model (incorporating the contributions of all 20 residues into the peptide retention) combined with a peptide length correction produces a 0.976 R 2 value prediction accuracy, significantly higher than the additive models for either HILIC or RPLC. Position-dependent effects on peptide retention for different residues were driven by the spatial orientation of tryptic peptides upon interaction with the negatively charged surface functional groups. The positively charged N-termini serve as a primary point of interaction. For example, basic residues (Arg, His, Lys) increase peptide retention when located closer to the N-terminus. We also found that hydrophobic interactions, which could lead to a mixed-mode separation mechanism, are largely suppressed at 20-30% of acetonitrile in the eluent. The accuracy of the final Sequence-Specific Retention Calculator (SSRCalc) SCX model (∼0.99 R 2 value) exceeds all previously reported predictors for peptide LC separations. This also provides a solid platform for method development in 2D LC-MS protocols in proteomics and peptide retention prediction filtering of false positive identifications.
Andrews, T Daniel; Gojobori, Takashi
2004-01-01
The PilE protein is the major component of the Neisseria meningitidis pilus, which is encoded by the pilE/pilS locus that includes an expressed gene and eight homologous silent fragments. The silent gene fragments have been shown to recombine through gene conversion with the expressed gene and thereby provide a means by which novel antigenic variants of the PilE protein can be generated. We have analyzed the evolutionary rate of the pilE gene using the nucleotide sequence of two complete pilE/pilS loci. The very high rate of evolution displayed by the PilE protein appears driven by both recombination and positive selection. Within the semivariable region of the pilE and pilS genes, recombination appears to occur within multiple small sequence blocks that lie between conserved sequence elements. Within the hypervariable region, positive selection was identified from comparison of the silent and expressed genes. The unusual gene conversion mechanism that operates at the pilE/pilS locus is a strategy employed by N. meningitidis to enhance mutation of certain regions of the PilE protein. The silent copies of the gene effectively allow "parallelized" evolution of pilE, thus enabling the encoded protein to rapidly explore a large area of sequence space in an effort to find novel antigenic variants.
Sun, Zhizeng; Mehta, Shrenik C; Adamski, Carolyn J; Gibbs, Richard A; Palzkill, Timothy
2016-09-12
CphA is a Zn(2+)-dependent metallo-β-lactamase that efficiently hydrolyzes only carbapenem antibiotics. To understand the sequence requirements for CphA function, single codon random mutant libraries were constructed for residues in and near the active site and mutants were selected for E. coli growth on increasing concentrations of imipenem, a carbapenem antibiotic. At high concentrations of imipenem that select for phenotypically wild-type mutants, the active-site residues exhibit stringent sequence requirements in that nearly all residues in positions that contact zinc, the substrate, or the catalytic water do not tolerate amino acid substitutions. In addition, at high imipenem concentrations a number of residues that do not directly contact zinc or substrate are also essential and do not tolerate substitutions. Biochemical analysis confirmed that amino acid substitutions at essential positions decreased the stability or catalytic activity of the CphA enzyme. Therefore, the CphA active - site is fragile to substitutions, suggesting active-site residues are optimized for imipenem hydrolysis. These results also suggest that resistance to inhibitors targeted to the CphA active site would be slow to develop because of the strong sequence constraints on function.
Asmal, Mohammed; Hellmann, Ina; Liu, Weimin; Keele, Brandon F.; Perelson, Alan S.; Bhattacharya, Tanmoy; Gnanakaran, S.; Daniels, Marcus; Haynes, Barton F.; Korber, Bette T.; Hahn, Beatrice H.; Shaw, George M.; Letvin, Norman L.
2011-01-01
Mucosal transmission of the human immunodeficiency virus (HIV) results in a bottleneck in viral genetic diversity. Gnanakaran and colleagues used a computational strategy to identify signature amino acids at particular positions in Envelope that were associated either with transmitted sequences sampled very early in infection, or sequences sampled during chronic infection. Among the strongest signatures observed was an enrichment for the stable presence of histidine at position 12 at transmission and in early infection, and a recurrent loss of histidine at position 12 in chronic infection. This amino acid lies within the leader peptide of Envelope, a region of the protein that has been shown to influence envelope glycoprotein expression and virion infectivity. We show a strong association between a positively charged amino acid like histidine at position 12 in transmitted/founder viruses with more efficient trafficking of the nascent envelope polypeptide to the endoplasmic reticulum and higher steady-state glycoprotein expression compared to viruses that have a non-basic position 12 residue, a substitution that was enriched among viruses sampled from chronically infected individuals. When expressed in the context of other viral proteins, transmitted envelopes with a basic amino acid position 12 were incorporated at higher density into the virus and exhibited higher infectious titers than did non-signature envelopes. These results support the potential utility of using a computational approach to examine large viral sequence data sets for functional signatures and indicate the importance of Envelope expression levels for efficient HIV transmission. PMID:21876761
Synthesis and hybridization of a series of biotinylated oligonucleotides.
Cook, A F; Vuocolo, E; Brakel, C L
1988-01-01
A series of oligonucleotides containing biotin-11-dUMP at various positions were synthesized and compared in quantitative, colorimetric hybridization-detection studies. A deoxyuridine phosphoramidite containing a protected allylamino sidearm was synthesized and used in standard, automated synthesis cycles to prepare oligonucleotides with allylamino residues at various positions within a standard 17-base sequence. Biotin substituents were subsequently attached to the allylamino sidearms by reaction with N-biotinyl-6-aminocaproic acid N-hydroxysuccinimide ester. These oligomers were hybridized to target DNA immobilized on microtiter wells (ELISA plates), and were detected with a streptavidin-biotinylated horseradish peroxidase complex using hydrogen peroxide as substrate and o-phenylenediamine as chromogen. We found that the sensitivity of detection of target DNA by biotin-labeled oligonucleotide probes was strongly dependent upon the position of the biotin label. Oligonucleotides containing biotin labels near or off the ends of the hybridizing sequence were more effective probes than oligonucleotides containing internal biotin labels. An additive effect of increasing numbers of biotin-dUMP residues was found for some labeling configurations. PMID:3375076
Abolbaghaei, Akram; Silke, Jordan R; Xia, Xuhua
2017-05-05
The 3' end of the small ribosomal RNAs (ssu rRNA) in bacteria is directly involved in the selection and binding of mRNA transcripts during translation initiation via well-documented interactions between a Shine-Dalgarno (SD) sequence located upstream of the initiation codon and an anti-SD (aSD) sequence at the 3' end of the ssu rRNA. Consequently, the 3' end of ssu rRNA (3'TAIL) is strongly conserved among bacterial species because a change in the region may impact the translation of many protein-coding genes. Escherichia coli and Bacillus subtilis differ in their 3' ends of ssu rRNA, being GAUC ACCUCCUUA 3' in E. coli and GAUC ACCUCCUU UCU3' or GAUC ACCUCCUU UCUA3' in B. subtilis Such differences in 3'TAIL lead to species-specific SDs (designated SD Ec for E. coli and SD Bs for B. subtilis ) that can form strong and well-positioned SD/aSD pairing in one species but not in the other. Selection mediated by the species-specific 3'TAIL is expected to favor SD Bs against SD Ec in B. subtilis , but favor SD Ec against SD Bs in E. coli Among well-positioned SDs, SD Ec is used more in E. coli than in B. subtilis , and SD Bs more in B. subtilis than in E. coli Highly expressed genes and genes of high translation efficiency tend to have longer SDs than lowly expressed genes and genes with low translation efficiency in both species, but more so in B. subtilis than in E. coli Both species overuse SDs matching the bolded part of the 3'TAIL shown above. The 3'TAIL difference contributes to the host specificity of phages. Copyright © 2017 Abolbaghaei et al.
Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.
2008-01-01
Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes ~90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions. PMID:17391015
Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N
2007-05-01
Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes approximately 90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions.
Conservation and diversification of Msx protein in metazoan evolution.
Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun
2008-01-01
Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Nonspatial Sequence Coding in CA1 Neurons
Allen, Timothy A.; Salz, Daniel M.; McKenzie, Sam
2016-01-01
The hippocampus is critical to the memory for sequences of events, a defining feature of episodic memory. However, the fundamental neuronal mechanisms underlying this capacity remain elusive. While considerable research indicates hippocampal neurons can represent sequences of locations, direct evidence of coding for the memory of sequential relationships among nonspatial events remains lacking. To address this important issue, we recorded neural activity in CA1 as rats performed a hippocampus-dependent sequence-memory task. Briefly, the task involves the presentation of repeated sequences of odors at a single port and requires rats to identify each item as “in sequence” or “out of sequence”. We report that, while the animals' location and behavior remained constant, hippocampal activity differed depending on the temporal context of items—in this case, whether they were presented in or out of sequence. Some neurons showed this effect across items or sequence positions (general sequence cells), while others exhibited selectivity for specific conjunctions of item and sequence position information (conjunctive sequence cells) or for specific probe types (probe-specific sequence cells). We also found that the temporal context of individual trials could be accurately decoded from the activity of neuronal ensembles, that sequence coding at the single-cell and ensemble level was linked to sequence memory performance, and that slow-gamma oscillations (20–40 Hz) were more strongly modulated by temporal context and performance than theta oscillations (4–12 Hz). These findings provide compelling evidence that sequence coding extends beyond the domain of spatial trajectories and is thus a fundamental function of the hippocampus. SIGNIFICANCE STATEMENT The ability to remember the order of life events depends on the hippocampus, but the underlying neural mechanisms remain poorly understood. Here we addressed this issue by recording neural activity in hippocampal region CA1 while rats performed a nonspatial sequence memory task. We found that hippocampal neurons code for the temporal context of items (whether odors were presented in the correct or incorrect sequential position) and that this activity is linked with memory performance. The discovery of this novel form of temporal coding in hippocampal neurons advances our fundamental understanding of the neurobiology of episodic memory and will serve as a foundation for our cross-species, multitechnique approach aimed at elucidating the neural mechanisms underlying memory impairments in aging and dementia. PMID:26843637
Turatsinze, Jean-Valery; Thomas-Chollier, Morgane; Defrance, Matthieu; van Helden, Jacques
2008-01-01
This protocol shows how to detect putative cis-regulatory elements and regions enriched in such elements with the regulatory sequence analysis tools (RSAT) web server (http://rsat.ulb.ac.be/rsat/). The approach applies to known transcription factors, whose binding specificity is represented by position-specific scoring matrices, using the program matrix-scan. The detection of individual binding sites is known to return many false predictions. However, results can be strongly improved by estimating P value, and by searching for combinations of sites (homotypic and heterotypic models). We illustrate the detection of sites and enriched regions with a study case, the upstream sequence of the Drosophila melanogaster gene even-skipped. This protocol is also tested on random control sequences to evaluate the reliability of the predictions. Each task requires a few minutes of computation time on the server. The complete protocol can be executed in about one hour.
Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M
1982-01-01
The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. Images PMID:6954460
Positional bias in variant calls against draft reference assemblies.
Briskine, Roman V; Shimizu, Kentaro K
2017-03-28
Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite lower quality of such assemblies, they allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As the variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from ends of a contig or scaffold sequence, and discovered unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias in both publicly available draft assemblies from Assemblathon 2 competition as well as in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering especially if higher-quality genome assembly cannot be achieved. Our findings can help other researchers to improve the quality of their variant data sets and reduce artefactual findings in downstream analyses.
Faragher, S G; Dalgarno, L
1986-07-20
The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.
Ketchum, Myles J; Weyand, Theodore G; Weed, Peter F; Winsauer, Peter J
2016-05-01
Learning is believed to be reflected in the activity of the hippocampus. However, neural correlates of learning have been difficult to characterize because hippocampal activity is integrated with ongoing behavior. To address this issue, male rats (n = 5) implanted with electrodes (n = 14) in the CA1 subfield responded during two tasks within a single test session. In one task, subjects acquired a new 3-response sequence (acquisition), whereas in the other task, subjects completed a well-rehearsed 3-response sequence (performance). Both tasks though could be completed using an identical response topography and used the same sensory stimuli and schedule of reinforcement. More important, comparing neural patterns during sequence acquisition to those during sequence performance allows for a subtractive approach whereby activity associated with learning could potentially be dissociated from the activity associated with ongoing behavior. At sites where CA1 activity was closely associated with behavior, the patterns of activity were differentially modulated by key position and the serial position of a response within the schedule of reinforcement. Temporal shifts between peak activity and responding on particular keys also occurred during sequence acquisition, but not during sequence performance. Ethanol disrupted CA1 activity while producing rate-decreasing effects in both tasks and error-increasing effects that were more selective for sequence acquisition than sequence performance. Ethanol also produced alterations in the magnitude of modulations and temporal pattern of CA1 activity, although these effects were not selective for sequence acquisition. Similar to ethanol, hippocampal micro-stimulation decreased response rate in both tasks and selectively increased the percentage of errors during sequence acquisition, and provided a more direct demonstration of hippocampal involvement during sequence acquisition. Together, these results strongly support the notion that ethanol disrupts sequence acquisition by disrupting hippocampal activity and that the hippocampus is necessary for the conditioned associations required for sequence acquisition. © 2015 Wiley Periodicals, Inc.
Ketchum, Myles J.; Weyand, Theodore G.; Weed, Peter F.; Winsauer, Peter J.
2015-01-01
Learning is believed to be reflected in the activity of the hippocampus. However, neural correlates of learning have been difficult to characterize because hippocampal activity is integrated with ongoing behavior. To address this issue, male rats (n=5) implanted with electrodes (n=14) in the CA1 subfield responded during two tasks within a single test session. In one task, subjects acquired a new 3-response sequence (acquisition), whereas in the other task, subjects completed a well-rehearsed 3-response sequence (performance). Both tasks though could be completed using an identical response topography and used the same sensory stimuli and schedule of reinforcement. More important, comparing neural patterns during sequence acquisition to those during sequence performance allows for a subtractive approach whereby activity associated with learning could potentially be dissociated from the activity associated with ongoing behavior. At sites where CA1 activity was closely associated with behavior, the patterns of activity were differentially modulated by key position and the serial position of a response within the schedule of reinforcement. Temporal shifts between peak activity and responding on particular keys also occurred during sequence acquisition, but not during sequence performance. Ethanol disrupted CA1 activity while producing rate-decreasing effects in both tasks and error-increasing effects that were more selective for sequence acquisition than sequence performance. Ethanol also produced alterations in the magnitude of modulations and temporal pattern of CA1 activity, although these effects were not selective for sequence acquisition. Similar to ethanol, hippocampal micro-stimulation decreased response rate in both tasks and selectively increased the percentage of errors during sequence acquisition, and provided a more direct demonstration of hippocampal involvement during sequence acquisition. Together, these results strongly support the notion that ethanol disrupts sequence acquisition by disrupting hippocampal activity and that the hippocampus is necessary for the conditioned associations required for sequence acquisition. PMID:26482846
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Uversky, Vladimir N.; Obradovic, Zoran
2008-01-01
Identifying relationships between function, amino acid sequence and protein structure represents a major challenge. In this study we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder. PMID:17391014
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Uversky, Vladimir N; Obradovic, Zoran
2007-05-01
Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical approach, outlines the major findings, and provides illustrative examples of biological processes and functions positively and negatively correlated with intrinsic disorder.
Chen, Hui; Deng, Qiang; Ng, Sock Hoon; Lee, Raphael Tze Chuen; Maurer-Stroh, Sebastian; Zhai, Weiwei
2016-12-01
Influenza viruses are often propagated in a diverse set of culturing media and additional substitutions known as passage adaptation can cause extra evolution in the target strain, leading to ineffective vaccines. Using 25,482 H3N2 HA1 sequences curated from Global Initiative on Sharing All Influenza Data and National Center for Biotechnology Information databases, we found that passage adaptation is a very dynamic process that changes over time and evolves in a seesaw like pattern. After crossing the species boundary from bird to human in 1968, the influenza H3N2 virus evolves to be better adapted to the human environment and passaging them in embryonated eggs (i.e., an avian environment) leads to increasingly stronger positive selection. On the contrary, passage adaptation to the mammalian cell lines changes from positive selection to negative selection. Using two statistical tests, we identified 19 codon positions around the receptor binding domain strongly contributing to passage adaptation in the embryonated egg. These sites show strong convergent evolution and overlap extensively with positively selected sites identified in humans, suggesting that passage adaptation can confound many of the earlier studies on influenza evolution. Interestingly, passage adaptation in recent years seems to target a few codon positions in antigenic surface epitopes, which makes it difficult to produce antigenically unaltered vaccines using embryonic eggs. Our study outlines another interesting scenario whereby both convergent and adaptive evolution are working in synchrony driving viral adaptation. Future studies from sequence analysis to vaccine production need to take careful consideration of passage adaptation. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Drögemüller, Cord; Tetens, Jens; Sigurdsson, Snaevar; Gentile, Arcangelo; Testoni, Stefania; Lindblad-Toh, Kerstin; Leeb, Tosso
2010-01-01
Arachnomelia is a monogenic recessive defect of skeletal development in cattle. The causative mutation was previously mapped to a ∼7 Mb interval on chromosome 5. Here we show that array-based sequence capture and massively parallel sequencing technology, combined with the typical family structure in livestock populations, facilitates the identification of the causative mutation. We re-sequenced the entire critical interval in a healthy partially inbred cow carrying one copy of the critical chromosome segment in its ancestral state and one copy of the same segment with the arachnomelia mutation, and we detected a single heterozygous position. The genetic makeup of several partially inbred cattle provides extremely strong support for the causality of this mutation. The mutation represents a single base insertion leading to a premature stop codon in the coding sequence of the SUOX gene and is perfectly associated with the arachnomelia phenotype. Our findings suggest an important role for sulfite oxidase in bone development. PMID:20865119
NASA Astrophysics Data System (ADS)
Koh, Eunjung; Song, Ha Jeong; Kwon, Na Young; Kim, Gi Won; Lee, Kwang Ho; Jo, Soyeon; Park, Sujin; Park, Jihyun; Park, Eun Kyeong; Hwang, Seung Yong
2017-06-01
Real time PCR is a standard method for identification of species. One of limitations of the qPCR is that there would be false-positive result due to mismatched hybridization between target sequence and probe depending on the annealing temperature in the PCR condition. As an alternative, fluorescence melting curve analysis (FMCA) could be applied for species identification. FMCA is based on a dual-labeled probe. Even with subtle difference of target sequence, there are visible melting temperature (Tm) shift. One of FMCA applications is distinguishing organisms distributed and consumed globally as popular food ingredients. Their prices are set by species or country of origin. However, counterfeiting or distributing them without any verification procedure are becoming social problems and threatening food safety. Besides distinguishing them in naked eye is very difficult and almost impossible in any processed form. Therefore, it is necessary to identify species in molecular level. In this research three species of squids which have 1-2 base pair differences each are selected as samples since they have the same issue. We designed a probe which perfectly matches with one species and the others mismatches 2 and 1 base pair respectively and labeled with fluorophore and quencher. In an experiment with a single probe, we successfully distinguished them by Tm shift depending on the difference of base pair. By combining FMCA and qPCR chip, smaller-scale assay with higher sensitivity and resolution could be possible, andc furthermore, enabling results analysis with smart phone would realize point-of-care testing (POCT).
Russo, M. Jean; Chmelka, Mary B.; Herrenkohl, Roy C.; Herrenkohl, Todd I.
2016-01-01
Introduction The social developmental processes by which child maltreatment increases risk for marijuana use are understudied. This study examined hypothesized parent and peer pathways linking preschool abuse and sexual abuse with adolescent and adult marijuana use. Methods Analyses used data from the Lehigh Longitudinal Study. Measures included child abuse (physical abuse, emotional abuse, domestic violence, and neglect) in preschool, sexual abuse up to age 18, adolescent (average age = 18 years) parental attachment and peer marijuana approval/use, as well as adolescent and adult (average age = 36 years) marijuana use. Results Confirming elevated risk due to child maltreatment, path analysis showed that sexual abuse was positively related to adolescent marijuana use, whereas preschool abuse was positively related to adult marijuana use. In support of mediation, it was found that both forms of maltreatment were negatively related to parental attachment, which was negatively related, in turn, to having peers who use and approve of marijuana use. Peer marijuana approval/use was a strong positive predictor of adolescent marijuana use, which was a strong positive predictor, in turn, of adult marijuana use. Conclusions Results support social developmental theories that hypothesize a sequence of events leading from child maltreatment experiences to lower levels of parental attachment and, in turn, higher levels of involvement with pro-marijuana peers and, ultimately, to both adolescent and adult marijuana use. This sequence of events suggests developmentally-timed intervention activities designed to prevent maltreatment as well as the initiation and progression of marijuana use among vulnerable individuals. PMID:27889563
Alex Mason, W; Jean Russo, M; Chmelka, Mary B; Herrenkohl, Roy C; Herrenkohl, Todd I
2017-03-01
The social developmental processes by which child maltreatment increases risk for marijuana use are understudied. This study examined hypothesized parent and peer pathways linking preschool abuse and sexual abuse with adolescent and adult marijuana use. Analyses used data from the Lehigh Longitudinal Study. Measures included child abuse (physical abuse, emotional abuse, domestic violence, and neglect) in preschool, sexual abuse up to age 18, adolescent (average age=18years) parental attachment and peer marijuana approval/use, as well as adolescent and adult (average age=36years) marijuana use. Confirming elevated risk due to child maltreatment, path analysis showed that sexual abuse was positively related to adolescent marijuana use, whereas preschool abuse was positively related to adult marijuana use. In support of mediation, it was found that both forms of maltreatment were negatively related to parental attachment, which was negatively related, in turn, to having peers who use and approve of marijuana use. Peer marijuana approval/use was a strong positive predictor of adolescent marijuana use, which was a strong positive predictor, in turn, of adult marijuana use. Results support social developmental theories that hypothesize a sequence of events leading from child maltreatment experiences to lower levels of parental attachment and, in turn, higher levels of involvement with pro-marijuana peers and, ultimately, to both adolescent and adult marijuana use. This sequence of events suggests developmentally-timed intervention activities designed to prevent maltreatment as well as the initiation and progression of marijuana use among vulnerable individuals. Copyright © 2016 Elsevier Ltd. All rights reserved.
The genetic basis of adaptive pigmentation variation in Drosophila melanogaster
Pool, John E.; Aquadro, Charles F.
2009-01-01
In a broad survey of Drosophila melanogaster population samples, levels of abdominal pigmentation were found to be highly variable and geographically differentiated. A strong positive correlation was found between dark pigmentation and high altitude, suggesting adaptation to specific environments. DNA sequence polymorphism at the candidate gene ebony revealed a clear association with the pigmentation of homozygous third chromosome lines. The darkest lines sequenced had nearly identical haplotypes spanning 14.5 kilobases upstream of the protein-coding exons of ebony. Thus, natural selection may have elevated the frequency of an allele that confers dark abdominal pigmentation by influencing the regulation of ebony. PMID:17614900
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Linking soil biology and chemistry in biological soil crust using isolate exometabolomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Swenson, Tami L.; Karaoz, Ulas; Swenson, Joel M.
Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). For this study, we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negativelymore » correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.« less
Linking soil biology and chemistry in biological soil crust using isolate exometabolomics
Swenson, Tami L.; Karaoz, Ulas; Swenson, Joel M.; ...
2018-01-02
Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). For this study, we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negativelymore » correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.« less
Koda, Hironori; Brazier, John Alan; Onishi, Ippei; Sasaki, Shigeki
2015-08-01
Hoechst 33258 derivatives with additional interacting moieties attached at the ends of branched linkers were synthesized, and their DNA binding properties were investigated with regard to the A3T3 repeat by measuring fluorescence spectra. The binding property of the ligand was investigated by fluorescence titration, and the titration data were analyzed using the McGhee-von Hippel method. Ligand 6Q with the quinolin-6-yloxyacetyl group and Ligand IQ with isoquinolin-6-yloxyacetyl group at the ends of the branched linkers exhibit highly positive cooperativity for the DNA having 5 A3T3 sites with 3 base-insertions between them with sequence selectivity. The strategy developed in this study may be generally applicable for designing ligands for repetitive DNA sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Linking soil biology and chemistry in biological soil crust using isolate exometabolomics.
Swenson, Tami L; Karaoz, Ulas; Swenson, Joel M; Bowen, Benjamin P; Northen, Trent R
2018-01-02
Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). Here we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negatively correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.
Repression of enhancer II activity by a negative regulatory element in the hepatitis B virus genome.
Lo, W Y; Ting, L P
1994-01-01
Enhancer II of human hepatitis B virus has dual functions in vivo. Located at nucleotides (nt) 1646 to 1741, it can stimulate the surface and X promoters from a downstream position. Moreover, the same sequence can also function as upstream regulatory element that activates the core promoter in a position- and orientation-dependent manner. In this study, we report the identification and characterization of a negative regulatory element (NRE) upstream of enhancer II (nt 1613 to 1636) which can repress both the enhancer and upstream stimulatory function of the enhancer II sequence in differentiated liver cells. This NRE has marginal inhibitory effect by itself but a strong repressive function in the presence of a functional enhancer II. Mutational analysis reveals that sequence from nt 1616 to 1621 is required for repression of enhancer activity by the NRE. Gel shift analysis reveals that this negative regulatory region can be recognized by a specific protein factor(s) present at the 0.4 M NaCl fraction of HepG2 nuclear extracts. The discovery of the NRE indicates that HBV gene transcription is controlled by combined effects of both positive and negative regulation. It also provides a unique system with which to study the mechanism of negative regulation of gene expression. Images PMID:8107237
Event-Related Potentials Elicited by Pre-Attentive Emotional Changes in Temporal Context
Fujimura, Tomomi; Okanoya, Kazuo
2013-01-01
The ability to detect emotional change in the environment is essential for adaptive behavior. The current study investigated whether event-related potentials (ERPs) can reflect emotional change in a visual sequence. To assess pre-attentive processing, we examined visual mismatch negativity (vMMN): the negative potentials elicited by a deviant (infrequent) stimulus embedded in a sequence of standard (frequent) stimuli. Participants in two experiments pre-attentively viewed visual sequences of Japanese kanji with different emotional connotations while ERPs were recorded. The visual sequence in Experiment 1 consisted of neutral standards and two types of emotional deviants with a strong and weak intensity. Although the results indicated that strongly emotional deviants elicited more occipital negativity than neutral standards, it was unclear whether these negativities were derived from emotional deviation in the sequence or from the emotional significance of the deviants themselves. In Experiment 2, the two identical emotional deviants were presented against different emotional standards. One type of deviants was emotionally incongruent with the standard and the other type of deviants was emotionally congruent with the standard. The results indicated that occipital negativities elicited by deviants resulted from perceptual changes in a visual sequence at a latency of 100–200 ms and from emotional changes at latencies of 200–260 ms. Contrary to the results of the ERP experiment, reaction times to deviants showed no effect of emotional context; negative stimuli were consistently detected more rapidly than were positive stimuli. Taken together, the results suggest that brain signals can reflect emotional change in a temporal context. PMID:23671693
Event-related potentials elicited by pre-attentive emotional changes in temporal context.
Fujimura, Tomomi; Okanoya, Kazuo
2013-01-01
The ability to detect emotional change in the environment is essential for adaptive behavior. The current study investigated whether event-related potentials (ERPs) can reflect emotional change in a visual sequence. To assess pre-attentive processing, we examined visual mismatch negativity (vMMN): the negative potentials elicited by a deviant (infrequent) stimulus embedded in a sequence of standard (frequent) stimuli. Participants in two experiments pre-attentively viewed visual sequences of Japanese kanji with different emotional connotations while ERPs were recorded. The visual sequence in Experiment 1 consisted of neutral standards and two types of emotional deviants with a strong and weak intensity. Although the results indicated that strongly emotional deviants elicited more occipital negativity than neutral standards, it was unclear whether these negativities were derived from emotional deviation in the sequence or from the emotional significance of the deviants themselves. In Experiment 2, the two identical emotional deviants were presented against different emotional standards. One type of deviants was emotionally incongruent with the standard and the other type of deviants was emotionally congruent with the standard. The results indicated that occipital negativities elicited by deviants resulted from perceptual changes in a visual sequence at a latency of 100-200 ms and from emotional changes at latencies of 200-260 ms. Contrary to the results of the ERP experiment, reaction times to deviants showed no effect of emotional context; negative stimuli were consistently detected more rapidly than were positive stimuli. Taken together, the results suggest that brain signals can reflect emotional change in a temporal context.
Lee, Patricia; Ng, Hwee L.; Yang, Otto O.
2012-01-01
Human immunodeficiency virus type 1 (HIV-1) Nef downregulates major histocompatibility complex class I (MHC-I), impairing the clearance of infected cells by CD8+ cytotoxic T lymphocytes (CTLs). While sequence motifs mediating this function have been determined by in vitro mutagenesis studies of laboratory-adapted HIV-1 molecular clones, it is unclear whether the highly variable Nef sequences of primary isolates in vivo rely on the same sequence motifs. To address this issue, nef quasispecies from nine chronically HIV-1-infected persons were examined for sequence evolution and altered MHC-I downregulatory function under Gag-specific CTL immune pressure in vitro. This selection resulted in decreased nef diversity and strong purifying selection. Site-by-site analysis identified 13 codons undergoing purifying selection and 1 undergoing positive selection. Of the former, only 6 have been reported to have roles in Nef function, including 4 associated with MHC-I downregulation. Functional testing of naturally occurring in vivo polymorphisms at the 7 sites with no previously known functional role revealed 3 mutations (A84D, Y135F, and G140R) that ablated MHC-I downregulation and 3 (N52A, S169I, and V180E) that partially impaired MHC-I downregulation. Globally, the CTL pressure in vitro selected functional Nef from the in vivo quasispecies mixtures that predominately lacked MHC-I downregulatory function at the baseline. Overall, these data demonstrate that CTL pressure exerts a strong purifying selective pressure for MHC-I downregulation and identifies novel functional motifs present in Nef sequences in vivo. PMID:22553319
Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates
Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas
2015-01-01
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834
Cao-Lormeau, Van-Mai; Lambrechts, Louis
2017-01-01
Abstract Like other pathogens with high mutation and replication rates, within-host dengue virus (DENV) populations evolve during infection of their main mosquito vector, Aedes aegypti. Within-host DENV evolution during transmission provides opportunities for adaptation and emergence of novel virus variants. Recent studies of DENV genetic diversity failed to detect convergent evolution of adaptive mutations in mosquito tissues such as midgut and salivary glands, suggesting that convergent positive selection is not a major driver of within-host DENV evolution in the vector. However, it is unknown whether this conclusion extends to the transmitted viral subpopulation because it is technically difficult to sequence DENV genomes in mosquito saliva. Here, we achieved DENV full-genome sequencing by pooling saliva samples collected non-sacrificially from 49 to 163 individual Ae. aegypti mosquitoes previously infected with one of two DENV-1 genotypes. We compared the transmitted viral subpopulations found in the pooled saliva samples collected in time series with the input viral population present in the infectious blood meal. In all pooled saliva samples examined, the full-genome consensus sequence of the input viral population was unchanged. Although the pooling strategy prevents analysis of individual saliva samples, our results demonstrate the lack of strong convergent positive selection during a single round of DENV transmission by Ae. aegypti. This finding reinforces the idea that genetic drift and purifying selection are the dominant evolutionary forces shaping within-host DENV genetic diversity during transmission by mosquitoes. PMID:29497564
Chi, Hongshu; Taik, Patricia; Foley, Emily J; Racicot, Alycia C; Gray, Hilary M; Guzzetta, Katherine E; Lin, Hsin-Yun; Song, Yen-Ling; Tung, Che-Huang; Zenke, Kosuke; Yoshinaga, Tomoyoshi; Cheng, Chao-Yin; Chang, Wei-Jen; Gong, Hui
2017-07-01
The ciliate protozoan Cryptocaryon irritans parasitizes marine fish and causes lethal white spot disease. Sporadic infections as well as large-scale outbreaks have been reported globally and the parasite's broad host range poses particular threat to the aquaculture and ornamental fish markets. In order to better understand C. irritans' population structure, we sequenced and compared mitochondrial cox-1, SSU rRNA, and ITS-1 sequences from 8 new isolates of C. irritans collected in China, Japan, and Taiwan. We detected two SSU rRNA haplotypes, which differ at three positions, separating the isolates into two main groups (I and II). Cox-1 sequences also support the division into two groups, and the cox-1 divergence between these two groups is unexpectedly high (9.28% for 1582 nucleotide positions). The divergence is much greater than that detected in Ichthyophthirius multifiliis, the ciliate protozoan causing freshwater white spot disease in fish, where intraspecies divergence on cox-1 sequence is only 1.95%. ITS-1 sequences derived from these eight isolates and from all other C. irritans isolates (deposited in the GenBank) not only support the two groups, but further suggest the presence of a third group with even greater sequence divergence. Finally, a small Ka/Ks ratio estimated from cox-1 sequences suggests that this gene in C. irritans remains under strong purifying selection. Taken together, the C. irritans species may consists of many subspecies and/or syngens. Further work is needed to determine if there is reproductive isolation between the groups we have defined. Copyright © 2017 Elsevier Inc. All rights reserved.
Passing faces: sequence-dependent variations in the perceptual processing of emotional faces.
Karl, Christian; Hewig, Johannes; Osinsky, Roman
2016-10-01
There is broad evidence that contextual factors influence the processing of emotional facial expressions. Yet temporal-dynamic aspects, inter alia how face processing is influenced by the specific order of neutral and emotional facial expressions, have been largely neglected. To shed light on this topic, we recorded electroencephalogram from 168 healthy participants while they performed a gender-discrimination task with angry and neutral faces. Our event-related potential (ERP) analyses revealed a strong emotional modulation of the N170 component, indicating that the basic visual encoding and emotional analysis of a facial stimulus happen, at least partially, in parallel. While the N170 and the late positive potential (LPP; 400-600 ms) were only modestly affected by the sequence of preceding faces, we observed a strong influence of face sequences on the early posterior negativity (EPN; 200-300 ms). Finally, the differing response patterns of the EPN and LPP indicate that these two ERPs represent distinct processes during face analysis: while the former seems to represent the integration of contextual information in the perception of a current face, the latter appears to represent the net emotional interpretation of a current face.
[Multiplexing mapping of human cDNAs]. Final report, September 1, 1991--February 28, 1994
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
Using PCR with automated product analysis, 329 human brain cDNA sequences have been assigned to individual human chromosomes. Primers were designed from single-pass cDNA sequences expressed sequence tags (ESTs). Primers were used in PCR reactions with DNA from somatic cell hybrid mapping panels as templates, often with multiplexing. Many ESTs mapped match sequence database records. To evaluate of these matches, the position of the primers relative to the matching region (In), the BLAST scores and the Poisson probability values of the EST/sequence record match were determined. In cases where the gene product was stringently identified by the sequence match hadmore » already been mapped, the gene locus determined by EST was consistent with the previous position which strongly supports the validity of assigning unknown genes to human chromosomes based on the EST sequence matches. In the present cases mapping the ESTs to a chromosome can also be considered to have mapped the known gene product: rolipram-sensitive cAMP phosphodiesterase, chromosome 1; protein phosphatase 2A{beta}, chromosome 4; alpha-catenin, chromosome 5; the ELE1 oncogene, chromosome 10q11.2 or q2.1-q23; MXII protein, chromosome l0q24-qter; ribosomal protein L18a homologue, chromosome 14; ribosomal protein L3, chromosome 17; and moesin, Xp11-cen. There were also ESTs mapped that were closely related to non-human sequence records. These matches therefore can be considered to identify human counterparts of known gene products, or members of known gene families. Examples of these include membrane proteins, translation-associated proteins, structural proteins, and enzymes. These data then demonstrate that single pass sequence information is sufficient to design PCR primers useful for assigning cDNA sequences to human chromosomes. When the EST sequence matches previous sequence database records, the chromosome assignments of the EST can be used to make preliminary assignments of the human gene to a chromosome.« less
Transposon Variants and Their Effects on Gene Expression in Arabidopsis
Wang, Xi; Weigel, Detlef; Smith, Lisa M.
2013-01-01
Transposable elements (TEs) make up the majority of many plant genomes. Their transcription and transposition is controlled through siRNAs and epigenetic marks including DNA methylation. To dissect the interplay of siRNA–mediated regulation and TE evolution, and to examine how TE differences affect nearby gene expression, we investigated genome-wide differences in TEs, siRNAs, and gene expression among three Arabidopsis thaliana accessions. Both TE sequence polymorphisms and presence of linked TEs are positively correlated with intraspecific variation in gene expression. The expression of genes within 2 kb of conserved TEs is more stable than that of genes next to variant TEs harboring sequence polymorphisms. Polymorphism levels of TEs and closely linked adjacent genes are positively correlated as well. We also investigated the distribution of 24-nt-long siRNAs, which mediate TE repression. TEs targeted by uniquely mapping siRNAs are on average farther from coding genes, apparently because they more strongly suppress expression of adjacent genes. Furthermore, siRNAs, and especially uniquely mapping siRNAs, are enriched in TE regions missing in other accessions. Thus, targeting by uniquely mapping siRNAs appears to promote sequence deletions in TEs. Overall, our work indicates that siRNA–targeting of TEs may influence removal of sequences from the genome and hence evolution of gene expression in plants. PMID:23408902
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N
2007-05-01
Currently, the understanding of the relationships between function, amino acid sequence, and protein structure continues to represent one of the major challenges of the modern protein science. As many as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bionformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200 000 proteins from the Swiss-Prot database, each annotated with at least one of the 875 functional keywords, was described in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V.N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Using this tool, we have found that out of the 710 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (see above). The second paper of the series was devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic, S.; Xie, H.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. J. Proteome Res. 2007, 5, 1899-1916). Protein structure and functionality can be modulated by various post-translational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes approximately 80 Swiss-Prot functional keywords that are related to ligands, post-translational modifications, and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins.
Streicher, Jeffrey W; Wiens, John J
2017-09-01
Squamate reptiles (lizards and snakes) are the most diverse group of terrestrial vertebrates, with more than 10 000 species. Despite considerable effort to resolve relationships among major squamates clades, some branches have remained difficult. Among the most vexing has been the placement of snakes among lizard families, with most studies yielding only weak support for the position of snakes. Furthermore, the placement of iguanian lizards has remained controversial. Here we used targeted sequence capture to obtain data from 4178 nuclear loci from ultraconserved elements from 32 squamate taxa (and five outgroups) including representatives of all major squamate groups. Using both concatenated and species-tree methods, we recover strong support for a sister relationship between iguanian and anguimorph lizards, with snakes strongly supported as the sister group of these two clades. These analyses strongly resolve the difficult placement of snakes within squamates and show overwhelming support for the contentious position of iguanians. More generally, we provide a strongly supported hypothesis of higher-level relationships in the most species-rich tetrapod clade using coalescent-based species-tree methods and approximately 100 times more loci than previous estimates. © 2017 The Author(s).
Phylogenetic position of the pentastomida and [pan]crustacean relationships
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lavrov, Dennis V.; Brown, Wesley M.; Boore, Jeffrey L.
2004-01-31
Pentastomids are a small group of vermiform animals with unique morphology and parasitic lifestyle. They are generally recognized as being related to the Arthropoda, however the nature of this relationship is controversial. We have determined the complete sequence of the mitochondrial DNA (mtDNA) of the pentastomid Armillifer armillatus and complete, or nearly complete, mtDNA sequences from representatives of four previously unsampled groups of Crustacea: Remipedia (Speleonectes tulumensis), Cephalocarida (Hutchinsoniella macracantha), Cirripedia (Pollicipes polymerus), and Branchiura (Argulus americanus). Analyses of the mtDNA gene arrangements and sequences determined in this study indicate unambiguously that pentastomids are a group of modified crustaceans likelymore » related to branchiurans. In addition, gene arrangement comparisons strongly support an unforeseen assemblage of pentastomids with maxillopod and cephalocarid crustaceans, to the exclusion of remipedes, branchiopods, malacos tracans and insects.« less
Purification and characterization of the restriction endonuclease RsrI, an isoschizomer of EcoRI.
Greene, P J; Ballard, B T; Stephenson, F; Kohr, W J; Rodriguez, H; Rosenberg, J M; Boyer, H W
1988-08-15
Rhodobacter sphaeroides strain 630 produces restriction enzyme RsrI which is an isoschizomer of EcoRI. We have purified this enzyme and initiated a comparison with the EcoRI endonuclease. The properties of RsrI are consistent with a reaction mechanism similar to that of EcoRI: the position of cleavage within the -GAATTC-site is identical, the MgCl2 optimum for the cleavage is identical, and the pH profile is similar. Methylation of the substrate sequence by the EcoRI methylase protects the site from cleavage by the RsrI endonuclease. RsrI cross-reacts strongly with anti-EcoRI serum indicating three-dimensional structural similarities. We have determined the sequence of 34 N terminal amino acids for RsrI and this sequence possesses significant similarity to the EcoRI N terminus.
Setiawan, Laurentia C; Gijsbers, Esther F; van Nuenen, Adrianus C; Kootstra, Neeltje A
2015-08-01
The HLA-B27 allele is over-represented among human immunodeficiency virus type 1-infected long-term non-progressors. In these patients, strong CTL responses targeting HLA-B27-restricted viral epitopes have been associated with long-term asymptomatic survival. Indeed, loss of control of viraemia in HLA-B27 patients has been associated with CTL escape at position 264 in the immunodominant KK10 epitope. This CTL escape mutation in the viral Gag protein has been associated with severe viral attenuation and may require the presence of compensatory mutations before emerging. Here, we studied sequence evolution within HLA-B27-restricted CTL epitopes in the viral Gag protein during the course of infection of seven HLA-B27-positive patients. Longitudinal gag sequences obtained at different time points around the time of AIDS diagnosis were obtained and analysed for the presence of mutations in epitopes restricted by HLA-B27, and for potential compensatory mutations. Sequence variations were observed in the HLA-B27-restricted CTL epitopes IK9 and DR11, and the immunodominant KK10 epitope. However, the presence of sequence variations in the HLA-B27-restricted CTL epitopes could not be associated with an increase in viraemia in the majority of the patients studied. Furthermore, we observed low genetic diversity in the gag region of the viral variants throughout the course of infection, which is indicative of low viral replication and corresponds to the low viral load observed in the HLA-B27-positive patients. These data indicated that control of viral replication can be maintained in HLA-B27-positive patients despite the emergence of viral mutations in HLA-B27-restricted epitopes.
Wang, Yin-qiu; Qian, Ya-ping; Yang, Su; Shi, Hong; Liao, Cheng-hong; Zheng, Hong-Kun; Wang, Jun; Lin, Alice A.; Cavalli-Sforza, L. Luca; Underhill, Peter A.; Chakraborty, Ranajit; Jin, Li; Su, Bing
2005-01-01
Pituitary adenylate cyclase-activating polypeptide (PACAP) is a neuropeptide abundantly expressed in the central nervous system and involved in regulating neurogenesis and neuronal signal transduction. The amino acid sequence of PACAP is extremely conserved across vertebrate species, indicating a strong functional constraint during the course of evolution. However, through comparative sequence analysis, we demonstrated that the PACAP precursor gene underwent an accelerated evolution in the human lineage since the divergence from chimpanzees, and the amino acid substitution rate in humans is at least seven times faster than that in other mammal species resulting from strong Darwinian positive selection. Eleven human-specific amino acid changes were identified in the PACAP precursors, which are conserved from murine to African apes. Protein structural analysis suggested that a putative novel neuropeptide might have originated during human evolution and functioned in the human brain. Our data suggested that the PACAP precursor gene underwent adaptive changes during human origin and may have contributed to the formation of human cognition. PMID:15834139
Strain rates, stress markers and earthquake clustering (Invited)
NASA Astrophysics Data System (ADS)
Fry, B.; Gerstenberger, M.; Abercrombie, R. E.; Reyners, M.; Eberhart-Phillips, D. M.
2013-12-01
The 2010-present Canterbury earthquakes comprise a well-recorded sequence in a relatively low strain-rate shallow crustal region. We present new scientific results to test the hypothesis that: Earthquake sequences in low-strain rate areas experience high stress drop events, low-post seismic relaxation, and accentuated seismic clustering. This hypothesis is based on a physical description of the aftershock process in which the spatial distribution of stress accumulation and stress transfer are controlled by fault strength and orientation. Following large crustal earthquakes, time dependent forecasts are often developed by fitting parameters defined by Omori's aftershock decay law. In high-strain rate areas, simple forecast models utilizing a single p-value fit observed aftershock sequences well. In low-strain rate areas such as Canterbury, assumptions of simple Omori decay may not be sufficient to capture the clustering (sub-sequence) nature exhibited by the punctuated rise in activity following significant child events. In Canterbury, the moment release is more clustered than in more typical Omori sequences. The individual earthquakes in these clusters also exhibit somewhat higher stress drops than in the average crustal sequence in high-strain rate regions, suggesting the earthquakes occur on strong Andersonian-oriented faults, possibly juvenile or well-healed . We use the spectral ratio procedure outlined in (Viegas et al., 2010) to determine corner frequencies and Madariaga stress-drop values for over 800 events in the sequence. Furthermore, we will discuss the relevance of tomographic results of Reyners and Eberhart-Phillips (2013) documenting post-seismic stress-driven fluid processes following the three largest events in the sequence as well as anisotropic patterns in surface wave tomography (Fry et al., 2013). These tomographic studies are both compatible with the hypothesis, providing strong evidence for the presence of widespread and hydrated regional upper crustal cracking parallel to sub-parallel to the dominant transverse failure plane in the sequence. Joint interpretation of the three separate datasets provide a positive first attempt at testing our fundamental hypothesis.
Compound haplotypes at Xp11.23 and human population growth in Eurasia.
Alonso, S; Armour, J A L
2004-09-01
To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.
Meghdadi, Hossein; Khosravi, Azar D.; Ghadiri, Ata A.; Sina, Amir H.; Alami, Ameneh
2015-01-01
Present study was aimed to examine the diagnostic utility of polymerase chain reaction (PCR) and nested PCR techniques for the detection of Mycobacterium tuberculosis (MTB) DNA in samples from patients with extra pulmonary tuberculosis (EPTB). In total 80 formalin-fixed, paraffin-embedded (FFPE) samples comprising 70 samples with definite diagnosis of EPTB and 10 samples from known non- EPTB on the basis of histopathology examination, were included in the study. PCR amplification targeting IS6110, rpoB gene and nested PCR targeting the rpoB gene were performed on the extracted DNAs from 80 FFPE samples. The strong positive samples were directly sequenced. For negative samples and those with weak band in nested-rpoB PCR, TA cloning was performed by cloning the products into the plasmid vector with subsequent sequencing. The 95% confidence intervals (CI) for the estimates of sensitivity and specificity were calculated for each method. Fourteen (20%), 34 (48.6%), and 60 (85.7%) of the 70 positive samples confirmed by histopathology, were positive by rpoB-PCR, IS6110-PCR, and nested-rpoB PCR, respectively. By performing TA cloning on samples that yielded weak (n = 8) or negative results (n = 10) in the PCR methods, we were able to improve their quality for later sequencing. All samples with weak band and 7 out of 10 negative samples, showed strong positive results after cloning. So nested-rpoB PCR cloning revealed positivity in 67 out of 70 confirmed samples (95.7%). The sensitivity of these combination methods was calculated as 95.7% in comparison with histopathology examination. The CI for sensitivity of the PCR methods were calculated as 11.39–31.27% for rpoB-PCR, 36.44–60.83% for IS6110- PCR, 75.29–92.93% for nested-rpoB PCR, and 87.98–99.11% for nested-rpoB PCR cloning. The 10 true EPTB negative samples by histopathology, were negative by all tested methods including cloning and were used to calculate the specificity of the applied methods. The CI for 100% specificity of each PCR method were calculated as 69.15–100%. Our results indicated that nested-rpoB PCR combined with TA cloning and sequencing is a preferred method for the detection of MTB DNA in EPTB samples with high sensitivity and specificity which confirm the histopathology results. PMID:26191059
Meghdadi, Hossein; Khosravi, Azar D; Ghadiri, Ata A; Sina, Amir H; Alami, Ameneh
2015-01-01
Present study was aimed to examine the diagnostic utility of polymerase chain reaction (PCR) and nested PCR techniques for the detection of Mycobacterium tuberculosis (MTB) DNA in samples from patients with extra pulmonary tuberculosis (EPTB). In total 80 formalin-fixed, paraffin-embedded (FFPE) samples comprising 70 samples with definite diagnosis of EPTB and 10 samples from known non- EPTB on the basis of histopathology examination, were included in the study. PCR amplification targeting IS6110, rpoB gene and nested PCR targeting the rpoB gene were performed on the extracted DNAs from 80 FFPE samples. The strong positive samples were directly sequenced. For negative samples and those with weak band in nested-rpoB PCR, TA cloning was performed by cloning the products into the plasmid vector with subsequent sequencing. The 95% confidence intervals (CI) for the estimates of sensitivity and specificity were calculated for each method. Fourteen (20%), 34 (48.6%), and 60 (85.7%) of the 70 positive samples confirmed by histopathology, were positive by rpoB-PCR, IS6110-PCR, and nested-rpoB PCR, respectively. By performing TA cloning on samples that yielded weak (n = 8) or negative results (n = 10) in the PCR methods, we were able to improve their quality for later sequencing. All samples with weak band and 7 out of 10 negative samples, showed strong positive results after cloning. So nested-rpoB PCR cloning revealed positivity in 67 out of 70 confirmed samples (95.7%). The sensitivity of these combination methods was calculated as 95.7% in comparison with histopathology examination. The CI for sensitivity of the PCR methods were calculated as 11.39-31.27% for rpoB-PCR, 36.44-60.83% for IS6110- PCR, 75.29-92.93% for nested-rpoB PCR, and 87.98-99.11% for nested-rpoB PCR cloning. The 10 true EPTB negative samples by histopathology, were negative by all tested methods including cloning and were used to calculate the specificity of the applied methods. The CI for 100% specificity of each PCR method were calculated as 69.15-100%. Our results indicated that nested-rpoB PCR combined with TA cloning and sequencing is a preferred method for the detection of MTB DNA in EPTB samples with high sensitivity and specificity which confirm the histopathology results.
Musinova, Yana R; Kananykhina, Eugenia Y; Potashnikova, Daria M; Lisitsyna, Olga M; Sheval, Eugene V
2015-01-01
The majority of known nucleolar proteins are freely exchanged between the nucleolus and the surrounding nucleoplasm. One way proteins are retained in the nucleoli is by the presence of specific amino acid sequences, namely nucleolar localization signals (NoLSs). The mechanism by which NoLSs retain proteins inside the nucleoli is still unclear. Here, we present data showing that the charge-dependent (electrostatic) interactions of NoLSs with nucleolar components lead to nucleolar accumulation as follows: (i) known NoLSs are enriched in positively charged amino acids, but the NoLS structure is highly heterogeneous, and it is not possible to identify a consensus sequence for this type of signal; (ii) in two analyzed proteins (NF-κB-inducing kinase and HIV-1 Tat), the NoLS corresponds to a region that is enriched for positively charged amino acid residues; substituting charged amino acids with non-charged ones reduced the nucleolar accumulation in proportion to the charge reduction, and nucleolar accumulation efficiency was strongly correlated with the predicted charge of the tested sequences; and (iii) sequences containing only lysine or arginine residues (which were referred to as imitative NoLSs, or iNoLSs) are accumulated in the nucleoli in a charge-dependent manner. The results of experiments with iNoLSs suggested that charge-dependent accumulation inside the nucleoli was dependent on interactions with nucleolar RNAs. The results of this work are consistent with the hypothesis that nucleolar protein accumulation by NoLSs can be determined by the electrostatic interaction of positively charged regions with nucleolar RNAs rather than by any sequence-specific mechanism. Copyright © 2014 Elsevier B.V. All rights reserved.
A coevolution analysis for identifying protein-protein interactions by Fourier transform.
Yin, Changchuan; Yau, Stephen S-T
2017-01-01
Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI).
A coevolution analysis for identifying protein-protein interactions by Fourier transform
Yin, Changchuan; Yau, Stephen S. -T.
2017-01-01
Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI). PMID:28430779
Detection by PCR of pathogenic protozoa in raw and drinkable water samples in Colombia.
Triviño-Valencia, Jessica; Lora, Fabiana; Zuluaga, Juan David; Gomez-Marin, Jorge E
2016-05-01
We evaluated the presence of DNA of Giardia, Toxoplasma, and Cryptosporidium by PCR, and of Giardia and Cryptosporidium genera by immunofluorescence antibody test (IFAT), in water samples, before, during, and after plant treatment for drinkable water. We applied this method in 38 samples of 10 l of water taken from each of the water treatment steps and in 8 samples taken at home (only for Toxoplasma PCR) in Quindio region in Colombia. There were 8 positive samples for Cryptosporidium parvum (21 %), 4 for Cryptosporidium hominis (10.5 %), 27 for Toxoplasma gondii (58.6 %), 2 for Giardia duodenalis assemblage A (5.2 %), and 5 for G. duodenalis assemblage B (13.1 %). By IFAT, 23 % were positive for Giardia and 21 % for Cryptosporidium. An almost perfect agreement was found between IFAT and combined results of PCR, by Kappa composite proportion analysis. PCR positive samples were significantly more frequent in untreated raw water for C. parvum (p = 0.02). High mean of fecal coliforms, high pH values, and low mean of chlorine residuals were strongly correlated with PCR positivity for G. duodenalis assemblage B. High pH value was correlated with PCR positivity for C. parvum. Phylogenetic analysis of DNA sequences was possible, showing water and human clinical sequences for Toxoplasma within the same phylogenetic group for B1 repeated sequence. PCR assay is complementary to IFAT assay for monitoring of protozoa in raw and drinkable water, enabling species identification and to look for phylogenetic analysis in protozoa from human and environmental sources.
Coletta, Andrea; Desideri, Alessandro
2013-01-01
Camptothecin (CPT) is a topoisomerase IB (TopIB) selective inhibitor whose derivatives are currently used in cancer therapy. TopIB cleaves DNA at any sequence, but in the presence of CPT the only stabilized protein–DNA covalent complex is the one having a thymine in position −1 with respect to the cleavage site. A metadynamics simulation of two TopIB–DNA–CPT ternary complexes differing for the presence of a thymine or a cytosine in position −1 indicates the occurrence of two different drug’s unbinding pathways. The free-energy difference between the bound state and the transition state is large when a thymine is present in position −1 and is strongly reduced in presence of a cytosine, in line with the different drug stabilization properties of the two systems. Such a difference is strictly related to the changes in the hydrogen bond network between the protein, the DNA and the drug in the two systems, indicating a direct role of the protein in determining the specificity of the cleavage site sequence stabilized by the CPT. Calculations carried out in presence of one compound of the indenoisoquinoline family (NSC314622) indicate a comparable energy difference between the bound and the transition state independently of the presence of a thymine or a cytosine in position −1, in line with the experimental results. PMID:24003027
Subbotin, S A; Vierstraete, A; De Ley, P; Rowe, J; Waeyenberge, L; Moens, M; Vanfleteren, J R
2001-10-01
The ITS1, ITS2, and 5.8S gene sequences of nuclear ribosomal DNA from 40 taxa of the family Heteroderidae (including the genera Afenestrata, Cactodera, Heterodera, Globodera, Punctodera, Meloidodera, Cryphodera, and Thecavermiculatus) were sequenced and analyzed. The ITS regions displayed high levels of sequence divergence within Heteroderinae and compared to outgroup taxa. Unlike recent findings in root knot nematodes, ITS sequence polymorphism does not appear to complicate phylogenetic analysis of cyst nematodes. Phylogenetic analyses with maximum-parsimony, minimum-evolution, and maximum-likelihood methods were performed with a range of computer alignments, including elision and culled alignments. All multiple alignments and phylogenetic methods yielded similar basic structure for phylogenetic relationships of Heteroderidae. The cyst-forming nematodes are represented by six main clades corresponding to morphological characters and host specialization, with certain clades assuming different positions depending on alignment procedure and/or method of phylogenetic inference. Hypotheses of monophyly of Punctoderinae and Heteroderinae are, respectively, strongly and moderately supported by the ITS data across most alignments. Close relationships were revealed between the Avenae and the Sacchari groups and between the Humuli group and the species H. salixophila within Heteroderinae. The Goettingiana group occupies a basal position within this subfamily. The validity of the genera Afenestrata and Bidera was tested and is discussed based on molecular data. We conclude that ITS sequence data are appropriate for studies of relationships within the different species groups and less so for recovery of more ancient speciations within Heteroderidae. Copyright 2001 Academic Press.
Han, Limin; Chen, Chen; Wang, Zhezhi
2018-01-01
Epipremnum aureum is an important foliage plant in the Araceae family. In this study, we have sequenced the complete chloroplast genome of E. aureum by using Illumina Hiseq sequencing platforms. This genome is a double-stranded circular DNA sequence of 164,831 bp that contains 35.8% GC. The two inverted repeats (IRa and IRb; 26,606 bp) are spaced by a small single-copy region (22,868 bp) and a large single-copy region (88,751 bp). The chloroplast genome has 131 (113 unique) functional genes, including 86 (79 unique) protein-coding genes, 37 (30 unique) tRNA genes, and eight (four unique) rRNA genes. Tandem repeats comprise the majority of the 43 long repetitive sequences. In addition, 111 simple sequence repeats are present, with mononucleotides being the most common type and di- and tetranucleotides being infrequent events. Positive selection pressure on rps12 in the E. aureum chloroplast has been demonstrated via synonymous and nonsynonymous substitution rates and selection pressure sites analyses. Ycf15 and infA are pseudogenes in this species. We constructed a Maximum Likelihood phylogenetic tree based on the complete chloroplast genomes of 38 species from 13 families. Those results strongly indicated that E. aureum is positioned as the sister of Colocasia esculenta within the Araceae family. This work may provide information for further study of the molecular phylogenetic relationships within Araceae, as well as molecular markers and breeding novel varieties by chloroplast genetic-transformation of E. aureum in particular. PMID:29529038
Chaires, J B; Herrera, J E; Waring, M J
1990-07-03
Results from a high-resolution deoxyribonuclease I (DNase I) footprinting titration procedure are described that identify preferred daunomycin binding sites within the 160 bp tyr T DNA fragment. We have obtained single-bond resolution at 65 of the 160 potential binding sites within the tyr T fragment and have examined the effect of 0-3.0 microM total daunomycin concentration on the susceptibility of these sites toward digestion by DNase I. Four types of behavior are observed: (i) protection from DNase I cleavage; (ii) protection, but only after reaching a critical total daunomycin concentration; (iii) enhanced cleavage; (iv) no effect of added drug. Ten sites were identified as the most strongly protected on the basis of the magnitude of the reduction of their digestion product band areas in the presence of daunomycin. These were identified as the preferred daunomycin binding sites. Seven of these 10 sites are found at the end of the triplet sequences 5'ATGC and 5'ATCG, where the notation AT indicates that either A or T may occupy the position. The remaining three strongly protected sites are found at the ends of the triplet sequence 5'ATCAT. Of the preferred daunomycin binding sites we identify in this study, the sequence 5'ATCG is consistent with the specificity predicted by the theoretical studies of Chen et al. [Chen, K.-X., Gresh, N., & Pullman, B. (1985) J. Biomol. Struct. Dyn. 3, 445-466] and is the very sequence to which daunomycin is observed to be bound in two recent X-ray crystallographic studies. Solution studies, theoretical studies, and crystallographic studies have thus converged to provide a consistent and coherent picture of the sequence preference of this important anticancer antibiotic.
Harmsen, Dag; Singer, Christian; Rothgänger, Jörg; Tønjum, Tone; Sybren de Hoog, Gerrit; Shah, Haroun; Albert, Jürgen; Frosch, Matthias
2001-01-01
Fast and reliable identification of microbial isolates is a fundamental goal of clinical microbiology. However, in the case of some fastidious gram-negative bacterial species, classical phenotype identification based on either metabolic, enzymatic, or serological methods is difficult, time-consuming, and/or inadequate. 16S or 23S ribosomal DNA (rDNA) bacterial sequencing will most often result in accurate speciation of isolates. Therefore, the objective of this study was to find a hypervariable rDNA stretch, flanked by strongly conserved regions, which is suitable for molecular species identification of members of the Neisseriaceae and Moraxellaceae. The inter- and intrageneric relationships were investigated using comparative sequence analysis of PCR-amplified partial 16S and 23S rDNAs from a total of 94 strains. When compared to the type species of the genera Acinetobacter, Moraxella, and Neisseria, an average of 30 polymorphic positions was observed within the partial 16S rDNA investigated (corresponding to Escherichia coli positions 54 to 510) for each species and an average of 11 polymorphic positions was observed within the 202 nucleotides of the 23S rDNA gene (positions 1400 to 1600). Neisseria macacae and Neisseria mucosa subsp. mucosa (ATCC 19696) had identical 16S and 23S rDNA sequences. Species clusters were heterogeneous in both genes in the case of Acinetobacter lwoffii, Moraxella lacunata, and N. mucosa. Neisseria meningitidis isolates failed to cluster only in the 23S rDNA subset. Our data showed that the 16S rDNA region is more suitable than the partial 23S rDNA for the molecular diagnosis of Neisseriaceae and Moraxellaceae and that a reference database should include more than one strain of each species. All sequence chromatograms and taxonomic and disease-related information are available as part of our ribosomal differentiation of medical microorganisms (RIDOM) web-based service (http://www.ridom.hygiene.uni-wuerzburg.de/). Users can submit a sequence and conduct a similarity search against the RIDOM reference database for microbial identification purposes. PMID:11230407
Characterization of 67P/Churyumov-Gerasimenko interior from CONSERT signal amplitude variability
NASA Astrophysics Data System (ADS)
Zine, Sonia; Kofman, Wlodek; Herique, Alain; Hahnel, Ronny; Plettemeier, Dirk; Rogez, Yves; Statz, Christoph; Ciarletti, Valerie
2016-04-01
The bistatic radar CONSERT on Rosetta and Philae operated for 9 hours during Philae's First Science Sequence (FSS), on 12 and 13 November 2014. A strong signal was detected for 30 min at the beginning of the sequence, and for 80 min at the end. The signal propagated through the smaller lobe of the nucleus, with a length of propagation ranging between 200 and 800m, and a rapid decrease of its amplitude. First results have been published, based on the study of the signal propagation delay and the propagation path (Kofman et al., Science 2015; Ciarletti et al, A&A, 2015). This work focuses on the study of the signal amplitude, which shows variability throughout the acquisition sequence. The cause of this variability is twofold: (1) losses within the comet interior; (2) depolarization due to both antennas' varying relative attitudes. We simulate the depolarization by taking into account Rosetta's position and attitude on its orbit and by making assumptions on Philae's position, attitude, and close environment on the comet (dielectric properties). Then we assess the variability due to losses within the medium, and infer a characterization of the composition of the comet interior.
NASA Technical Reports Server (NTRS)
Callantine, Todd J.; Bienert, Nancy; Borade, Abhay; Gabriel, Conrad; Gujral, Vimmy; Jobe, Kim; Martin, Lynne; Omar, Faisal; Prevot, Thomas; Mercer, Joey
2016-01-01
A human-in-the-loop simulation study addressed terminal-area controller-workstation interface variations for interoperability between three new capabilities being introduced by the FAA. The capabilities are Terminal Sequencing and Spacing (TSAS), Automated Terminal Proximity Alert (ATPA), and wake-separation recategorization, or 'RECAT.' TSAS provides controllers with Controller-Managed Spacing (CMS) tools, including slot markers, speed advisories, and early/late indications, together with runway assignments and sequence numbers. ATPA provides automatic monitor, warning, and alert cones to inform controllers about spacing between aircraft on approach. ATPA cones are sized according to RECAT, an improved method of specifying wake-separation standards. The objective of the study was to identify potential issues and provide recommendations for integrating TSAS with ATPA and RECAT. Participants controlled arrival traffic under seven different display configurations, then tested an 'exploratory' configuration developed with participant input. All the display conditions were workable and acceptable, but controllers strongly preferred having the CMS tools available on Feeder positions, and both CMS tools and ATPA available on Final positions. Controllers found the integrated systems favorable and liked being able to tailor configurations to individual preferences.
Gupta, R S; Aitken, K; Falah, M; Singh, B
1994-01-01
The genes for two different 70-kDa heat shock protein (HSP70) homologs have been cloned and sequenced from the protozoan Giardia lamblia. On the basis of their sequence features, one of these genes corresponds to the cytoplasmic form of HSP70. The second gene, on the basis of its characteristic N-terminal hydrophobic signal sequence and C-terminal endoplasmic reticulum (ER) retention sequence (Lys-Asp-Glu-Leu), is the equivalent of ER-resident GRP78 or the Bip family of proteins. Phylogenetic trees based on HSP70 sequences show that G. lamblia homologs show the deepest divergence among eukaryotic species. The identification of a GRP78 or Bip homolog in G. lamblia strongly suggests the existence of ER in this ancient eukaryote. Detailed phylogenetic analyses of HSP70 sequences by boot-strap neighbor-joining and maximum-parsimony methods show that the cytoplasmic and ER homologs form distinct subfamilies that evolved from a common eukaryotic ancestor by gene duplication that occurred very early in the evolution of eukaryotic cells. It is postulated that because of the essential "molecular chaperone" function of these proteins in translocation of other proteins across membranes, duplication of their genes accompanied the evolution of ER or nucleus in the eukaryotic cell ancestor. The presence in all eukaryotic cytoplasmic HSP70 homologs (including the cognate, heat-induced, and ER forms) of a number of autapomorphic sequence signatures that are not present in any prokaryotic or organellar homologs provides strong evidence regarding the monophyletic nature of eukaryotic lineage. Further, all eukaryotic HSP70 homologs share in common with the Gram-negative group of eubacteria a number of sequence features that are not present in any archaebacterium or Gram-positive bacterium, indicating their evolution from this group of organisms. Some implications of these findings regarding the evolution of eukaryotic cells and ER are discussed. Images PMID:8159675
Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao
2018-06-15
Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Weadick, Cameron J; Chang, Belinda S W
2009-05-01
Within the vertebrate eye, betagamma crystallins are extremely stable lens proteins that are uniquely adapted to increase refractory power while maintaining transparency. Unlike alpha crystallins, which are well-characterized, multifunctional proteins that have important functions both in and out of the lens, betagamma lens crystallins are a diverse group of proteins with no clear ancestral or contemporary nonlens role. We carried out phylogenetic and molecular evolutionary analyses of the betagamma-crystallin superfamily in order to study the evolutionary history of the gamma N crystallins, a recently discovered, biochemically atypical family suggested to possess a divergent or ancestral function. By including nonlens, betagamma-motif-containing sequences in our analysis as outgroups, we confirmed the phylogenetic position of the gamma N family as sister to other gamma crystallins. Using maximum likelihood codon models to estimate lineage-specific nonsynonymous-to-synonymous rate ratios revealed strong positive selection in all of the early lineages within the betagamma family, with the striking exception of the lineage leading to the gamma N crystallins which was characterized by strong purifying selection. Branch-site analysis, used to identify candidate sites involved in functional divergence between gamma N crystallins and its sister clade containing all other gamma crystallins, identified several positively selected changes at sites of known functional importance in the betagamma crystallin protein structure. Further analyses of a fish-specific gamma N crystallin gene duplication revealed a more recent episode of positive selection in only one of the two descendant lineages (gamma N2). Finally, from the guppy, Poecilia reticulata, we isolated complete gamma N1 and gamma N2 coding sequence data from cDNA and partial coding sequence data from genomic DNA in order to confirm the presence of a novel gamma N2 intron, discovered through data mining of two pufferfish genomes. We conclude that the function of the gamma N family likely resembles the ancestral vertebrate betagamma crystallin more than other betagamma families. Furthermore, owing to the presence of an additional intron in some fish gamma N2 crystallins, and the inferred action of positive selection following the fish-specific gamma N duplication, we suggest that further study of fish gamma N crystallins will be critical in further elucidating possible ancestral functions of gamma N crystallins and any nonstructural role they may have.
Schimelman, Jacob B; Dryden, Daniel M; Poudel, Lokendra; Krawiec, Katherine E; Ma, Yingfang; Podgornik, Rudolf; Parsegian, V Adrian; Denoyer, Linda K; Ching, Wai-Yim; Steinmetz, Nicole F; French, Roger H
2015-02-14
The role of base pair composition and stacking sequence in the optical properties and electronic transitions of DNA is of fundamental interest. We present and compare the optical properties of DNA oligonucleotides (AT)10, (AT)5(GC)5, and (AT-GC)5 using both ab initio methods and UV-vis molar absorbance measurements. Our data indicate a strong dependence of both the position and intensity of UV absorbance features on oligonucleotide composition and stacking sequence. The partial densities of states for each oligonucleotide indicate that the valence band edge arises from a feature associated with the PO4(3-) complex anion, and the conduction band edge arises from anti-bonding states in DNA base pairs. The results show a strong correspondence between the ab initio and experimentally determined optical properties. These results highlight the benefit of full spectral analysis of DNA, as opposed to reductive methods that consider only the 260 nm absorbance (A260) or simple purity ratios, such as A260/A230 or A260/A280, and suggest that the slope of the absorption edge onset may provide a useful metric for the degree of base pair stacking in DNA. These insights may prove useful for applications in biology, bioelectronics, and mesoscale self-assembly.
Probing energetics of Abeta fibril elongation by molecular dynamics simulations.
Takeda, Takako; Klimov, Dmitri K
2009-06-03
Using replica exchange molecular dynamics simulations and an all-atom implicit solvent model, we probed the energetics of Abeta(10-40) fibril growth. The analysis of the interactions between incoming Abeta peptides and the fibril led us to two conclusions. First, considerable variations in fibril binding propensities are observed along the Abeta sequence. The peptides in the fibril and those binding to its edge interact primarily through their N-termini. Therefore, the mutations affecting the Abeta positions 10-23 are expected to have the largest impact on fibril elongation compared with those occurring in the C-terminus and turn. Second, we performed weak perturbations of the binding free energy landscape by scanning partial deletions of side-chain interactions at various Abeta sequence positions. The results imply that strong side-chain interactions--in particular, hydrophobic contacts--impede fibril growth by favoring disordered docking of incoming peptides. Therefore, fibril elongation may be promoted by moderate reduction of Abeta hydrophobicity. The comparison with available experimental data is presented.
Density by moduli and Wijsman lacunary statistical convergence of sequences of sets.
Bhardwaj, Vinod K; Dhawan, Shweta
2017-01-01
The main object of this paper is to introduce and study a new concept of f -Wijsman lacunary statistical convergence of sequences of sets, where f is an unbounded modulus. The definition of Wijsman lacunary strong convergence of sequences of sets is extended to a definition of Wijsman lacunary strong convergence with respect to a modulus for sequences of sets and it is shown that, under certain conditions on a modulus f , the concepts of Wijsman lacunary strong convergence with respect to a modulus f and f -Wijsman lacunary statistical convergence are equivalent on bounded sequences. We further characterize those θ for which [Formula: see text], where [Formula: see text] and [Formula: see text] denote the sets of all f -Wijsman lacunary statistically convergent sequences and f -Wijsman statistically convergent sequences, respectively.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhengqiu, C.; Penaflor, C.; Kuehl, J.V.
2006-06-01
The magnoliids represent the largest basal angiosperm clade with four orders, 19 families and 8,500 species. Although several recent angiosperm molecular phylogenies have supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence has resulted in phylogenies supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. This is one of the most important remaining issues concerning relationships among basal angiosperms. We sequenced the chloroplast genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales),more » and Piper (Piperales), and used these data in combination with 32 other completed angiosperm chloroplast genomes to assess phylogenetic relationships among magnoliids. The Drimys and Piper chloroplast genomes are nearly identical in size at 160,606 and 160,624 bp, respectively. The genomes include a pair of inverted repeats of 26,649 bp (Drimys) and 27,039 (Piper), separated by a small single copy region of 18,621 (Drimys) and 18,878 (Piper) and a large single copy region of 88,685 bp (Drimys) and 87,666 bp (Piper). The gene order of both taxa is nearly identical to many other unrearranged angiosperm chloroplast genomes, including Calycanthus, the other published magnoliid genome. Comparisons of angiosperm chloroplast genomes indicate that GC content is not uniformly distributed across the genome. Overall GC content ranges from 34-39%, and coding regions have a substantially higher GC content than non-coding regions (both intergenic spacers and introns). Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Across the genome, GC content is highest in the inverted repeat due to the presence of rRNA genes and lowest in the small single copy region where most NADH genes are located. Phylogenetic analyses using maximum parsimony and maximum likelihood methods were performed on DNA sequences of 61 protein-coding genes. Trees from both analyses provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. The phylogenies also provided moderate to strong support for the basal position of Amborella, and a sister relationship of magnoliids to a clade that includes monocots and eudicots. The complete sequences of three magnoliid chloroplast genomes provide new data from the largest basal angiosperm clade. Evolutionary comparisons of these new genome sequences, combined with other published angiosperm genome, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.« less
Rivera-Rivera, Carlos J.; Montoya-Burgos, Juan I.
2016-01-01
Phylogenetic inference artifacts can occur when sequence evolution deviates from assumptions made by the models used to analyze them. The combination of strong model assumption violations and highly heterogeneous lineage evolutionary rates can become problematic in phylogenetic inference, and lead to the well-described long-branch attraction (LBA) artifact. Here, we define an objective criterion for assessing lineage evolutionary rate heterogeneity among predefined lineages: the result of a likelihood ratio test between a model in which the lineages evolve at the same rate (homogeneous model) and a model in which different lineage rates are allowed (heterogeneous model). We implement this criterion in the algorithm Locus Specific Sequence Subsampling (LS³), aimed at reducing the effects of LBA in multi-gene datasets. For each gene, LS³ sequentially removes the fastest-evolving taxon of the ingroup and tests for lineage rate homogeneity until all lineages have uniform evolutionary rates. The sequences excluded from the homogeneously evolving taxon subset are flagged as potentially problematic. The software implementation provides the user with the possibility to remove the flagged sequences for generating a new concatenated alignment. We tested LS³ with simulations and two real datasets containing LBA artifacts: a nucleotide dataset regarding the position of Glires within mammals and an amino-acid dataset concerning the position of nematodes within bilaterians. The initially incorrect phylogenies were corrected in all cases upon removing data flagged by LS³. PMID:26912812
Testing the new animal phylogeny: a phylum level molecular analysis of the animal kingdom.
Bourlat, Sarah J; Nielsen, Claus; Economou, Andrew D; Telford, Maximilian J
2008-10-01
The new animal phylogeny inferred from ribosomal genes some years ago has prompted a number of radical rearrangements of the traditional, morphology based metazoan tree. The two main bilaterian clades, Deuterostomia and Protostomia, find strong support, but the protostomes consist of two sister groups, Ecdysozoa and Lophotrochozoa, not seen in morphology based trees. Although widely accepted, not all recent molecular phylogenetic analyses have supported the tripartite structure of the new animal phylogeny. Furthermore, even if the small ribosomal subunit (SSU) based phylogeny is correct, there is a frustrating lack of resolution of relationships between the phyla that make up the three clades of this tree. To address this issue, we have assembled a dataset including a large number of aligned sequence positions as well as a broad sampling of metazoan phyla. Our dataset consists of sequence data from ribosomal and mitochondrial genes combined with new data from protein coding genes (5139 amino acid and 3524 nucleotide positions in total) from 37 representative taxa sampled across the Metazoa. Our data show strong support for the basic structure of the new animal phylogeny as well as for the Mandibulata including Myriapoda. We also provide some resolution within the Lophotrochozoa, where we confirm support for a monophyletic clade of Echiura, Sipuncula and Annelida and surprising evidence of a close relationship between Brachiopoda and Nemertea.
A transcriptome approach to ecdysozoan phylogeny.
Borner, Janus; Rehm, Peter; Schill, Ralph O; Ebersberger, Ingo; Burmester, Thorsten
2014-11-01
The monophyly of Ecdysozoa, which comprise molting phyla, has received strong support from several lines of evidence. However, the internal relationships of Ecdysozoa are still contended. We generated expressed sequence tags from a priapulid (penis worm), a kinorhynch (mud dragon), a tardigrade (water bear) and five chelicerate taxa by 454 transcriptome sequencing. A multigene alignment was assembled from 63 taxa, which comprised after matrix optimization 24,249 amino acid positions with high data density (2.6% gaps, 19.1% missing data). Phylogenetic analyses employing various models support the monophyly of Ecdysozoa. A clade combining Priapulida and Kinorhyncha (i.e. Scalidophora) was recovered as the earliest branch among Ecdysozoa. We conclude that Cycloneuralia, a taxon erected to combine Priapulida, Kinorhyncha and Nematoda (and others), are paraphyletic. Rather Arthropoda (including Onychophora) are allied with Nematoda and Tardigrada. Within Arthropoda, we found strong support for most clades, including monophyletic Mandibulata and Pancrustacea. The phylogeny within the Euchelicerata remained largely unresolved. There is conflicting evidence on the position of tardigrades: While Bayesian and maximum likelihood analyses of only slowly evolving genes recovered Tardigrada as a sister group to Arthropoda, analyses of the full data set, and of subsets containing genes evolving at fast and intermediate rates identified a clade of Tardigrada and Nematoda. Notably, the latter topology is also supported by the analyses of indel patterns. Copyright © 2014 Elsevier Inc. All rights reserved.
Glare, E M; Paton, J C; Premier, R R; Lawrence, A J; Nisbet, I T
1990-01-01
A tandemly repeated 1,046-base-pair (bp) ClaI DNA fragment from Bordetella pertussis was cloned into Escherichia coli by using the vector pUC19. This fragment, when isolated, hybridized strongly to DNA from all 100 clinical isolates of B. pertussis tested. It was shown to have homology to single-copy sequences in Bordetella bronchiseptica but not Bordetella parapertussis and did not hybridize to lysate blots of a wide range of other bacteria, including members of the closely related genera Pasteurella, Alcaligenes, and Haemophilus. The 1,046-bp fragment was sequenced, and complementary synthetic oligonucleotides flanking a 153-bp region within the repeated element were used as primers for specific amplification of this region using the polymerase chain reaction (PCR). This procedure was then applied to the rapid (5-h) detection of B. pertussis in nasopharyngeal secretions collected from 332 children with suspected pertussis. The test yielded positive results in a total of 98 samples, compared with 66 for culture and 33 for direct immunofluorescence (IF). All of the IF-positive samples were PCR positive, as were 63 of the samples from which B. pertussis was eventually cultured. Two hundred thirty-one specimens which were negative by IF and culture were also negative in the PCR assay. However, 33 culture- and IF-negative specimens were positive by PCR assay. Several of these specimens were collected from close contacts of culture-proven pertussis patients, were follow-up specimens from such patients, or were from patients with serological evidence of pertussis and therefore may be true-rather than false-positives. Images PMID:2229381
First report of sasX-positive methicillin-resistant Staphylococcus aureus in Japan.
Nakaminami, Hidemasa; Ito, Teruyo; Han, Xiao; Ito, Ayumu; Matsuo, Miki; Uehara, Yuki; Baba, Tadashi; Hiramatsu, Keiichi; Noguchi, Norihisa
2017-09-01
SasX is a known virulence factor of Staphylococcus aureus involved in colonisation and immune evasion of the bacterium. The sasX gene, which is located on the ϕSPβ prophage, is frequently found in the sequence type (ST) 239 S. aureus lineage, which is the predominant healthcare-associated clone in Asian countries. In Japan, ST239 clones have rarely been identified, and sasX-positive strains have not been reported to date. Here, we report the first identification of 18 sasX-positive methicillin-resistant S. aureus (MRSA) strains in Japanese hospitals between 2009 and 2011. All sasX-positive isolates belonged to an ST239-staphylococcal cassette chromosome mec type III (ST239-III) lineage. However, we were unable to identify additional sasX-positive MRSA strains from 2012 to 2016, indicating that the small epidemic of sasX-positive isolates observed in this study was temporary. The sequence surrounding sasX in the strain TOHH628 lacked 51 genes that encode phage packaging and structural proteins, and no bacteriophage was induced by mitomycin C. Additionally, in the TOHH628 strain, the region (64.6 kb) containing sasX showed high identity to the ϕSPβ-like element (71.3 kb) of the Taiwanese MRSA strain Z172. The data strongly suggest that the present sasX-positive isolates found in Japanese hospitals were transmitted incidentally from other countries. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Molecular identification and transcriptional regulation of porcine IFIT2 gene.
Yang, Xiuqin; Jing, Xiaoyan; Song, Yanfang; Zhang, Caixia; Liu, Di
2018-04-06
IFN-induced protein with tetratricopeptide repeats 2 (IFIT2) plays important roles in host defense against viral infection as revealed by studies in humans and mice. However, little is known on porcine IFIT2 (pIFIT2). Here, we performed molecular cloning, expression profile, and transcriptional regulation analysis of pIFIT2. pIFIT2 gene, located on chromosome 14, is composed of two exons and have a complete coding sequence of 1407 bp. The encoded polypeptide, 468 aa in length, has three tetratricopeptide repeat motifs. pIFIT2 gene was unevenly distributed in all eleven tissues studied with the most abundance in spleen. Poly(I:C) treatment notably strongly upregulated the mRNA level and promoter activity of pIFIT2 gene. Upstream sequence of 1759 bp from the start codon which was assigned +1 here has promoter activity, and deltaEF1 acts as transcription repressor through binding to sequences at position - 1774 to - 1764. Minimal promoter region exists within nucleotide position - 162 and - 126. Two adjacent interferon-stimulated response elements (ISREs) and two nuclear factor (NF)-κB binding sites were identified within position - 310 and - 126. The ISRE elements act alone and in synergy with the one closer to start codon having more strength, so do the NF-κB binding sites. Synergistic effect was also found between the ISRE and NF-κB binding sites. Additionally, a third ISRE element was identified within position - 1661 to - 1579. These findings will contribute to clarifying the antiviral effect and underlying mechanisms of pIFIT2.
Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.
2008-01-01
Currently, the understanding of the relationships between function, amino acid sequence and protein structure continues to represent one of the major challenges of the modern protein science. As much as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bioinformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200,000 proteins from Swiss-Prot database, each annotated with at least one of the 875 functional keywords was described in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Using this tool, we have found that out of the 711 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). The second paper of the series was devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic S., Xie H., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. II. Cellular components, domains, technical terms, developmental processes and coding sequence diversities correlated with long disordered regions. J. Proteome Res.). Protein structure and functionality can be modulated by various posttranslational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/ misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes ~80 Swiss-Prot functional keywords that are related to ligands, posttranslational modifications and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins. PMID:17391016
Weak Negative and Positive Selection and the Drift Load at Splice Sites
Denisov, Stepan V.; Bazykin, Georgii A.; Sutormin, Roman; Favorov, Alexander V.; Mironov, Andrey A.; Gelfand, Mikhail S.; Kondrashov, Alexey S.
2014-01-01
Splice sites (SSs) are short sequences that are crucial for proper mRNA splicing in eukaryotic cells, and therefore can be expected to be shaped by strong selection. Nevertheless, in mammals and in other intron-rich organisms, many of the SSs often involve nonconsensus (Nc), rather than consensus (Cn), nucleotides, and beyond the two critical nucleotides, the SSs are not perfectly conserved between species. Here, we compare the SS sequences between primates, and between Drosophila fruit flies, to reveal the pattern of selection acting at SSs. Cn-to-Nc substitutions are less frequent, and Nc-to-Cn substitutions are more frequent, than neutrally expected, indicating, respectively, negative and positive selection. This selection is relatively weak (1 < |4Nes| < 4), and has a similar efficiency in primates and in Drosophila. Within some nucleotide positions, the positive selection in favor of Nc-to-Cn substitutions is weaker than the negative selection maintaining already established Cn nucleotides; this difference is due to site-specific negative selection favoring current Nc nucleotides. In general, however, the strength of negative selection protecting the Cn alleles is similar in magnitude to the strength of positive selection favoring replacement of Nc alleles, as expected under the simple nearly neutral turnover. In summary, although a fraction of the Nc nucleotides within SSs is maintained by selection, the abundance of deleterious nucleotides in this class suggests a substantial genome-wide drift load. PMID:24966225
Self-organized, highly luminescent CdSe nanorod-DNA complexes.
Artemyev, Mikhail; Kisiel, Dmitry; Abmiotko, Sergey; Antipina, Maria N; Khomutov, Gennady B; Kislov, Vladimir V; Rakhnyanskaya, Anna A
2004-09-01
DNA molecules are useful building blocks and nanotemplates for controllable fabrication of various bioinorganic nanostructures due to their unique physical-chemical properties and recognition capabilities and the synthetic availability of desired nucleotide sequences and length. We have synthesized novel DNA complexes with positively charged, highly luminescent CdSe nanorods that can be self-organized into filamentary, netlike, or spheroidal nanostructures. DNA-CdSe-nanorod filaments possess strongly linearly polarized photoluminescence due to the unidirectional orientation of nanorods along the filaments. Copyright 2004 American Chemical Society
Choroid plexus papilloma in a beluga whale (Delphinapterus leucas).
Thomas, Christian; Mergl, June; Gehring, Erica; Paulus, Werner; Martineau, Daniel; Hasselblatt, Martin
2016-07-01
We report herein a choroid plexus papilloma in a beluga whale (Delphinapterus leucas). This case was positive for choroid plexus tumor marker Kir7.1 on immunohistochemistry. These results and the high conservation of Kir7.1 across species at the amino acid sequence level strongly suggest that antibodies directed against Kir7.1 not only can be employed for the diagnosis of choroid plexus tumors in cetaceans, but are also likely to be diagnostically useful in other animal species. © 2016 The Author(s).
Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R; McQueen, Philip G; Yang, Andrew X; Mizuguchi, Takeshi; Grewal, Shiv I S; Levin, Henry L
2015-11-01
Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and -9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. Copyright © 2015 by the Genetics Society of America.
Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R.; McQueen, Philip G.; Yang, Andrew X.; Mizuguchi, Takeshi; Grewal, Shiv I. S.; Levin, Henry L.
2015-01-01
Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and −9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. PMID:26358720
Schadt, Eric E.; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H.; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A.; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew
2013-01-01
Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types. PMID:23093720
Schadt, Eric E; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew
2013-01-01
Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.
Overcoming a nucleosomal barrier to replication
Chang, Han-Wen; Pandey, Manjula; Kulaeva, Olga I.; Patel, Smita S.; Studitsky, Vasily M.
2016-01-01
Efficient overcoming and accurate maintenance of chromatin structure and associated histone marks during DNA replication are essential for normal functioning of the daughter cells. However, the molecular mechanisms of replication through chromatin are unknown. We have studied traversal of uniquely positioned mononucleosomes by T7 replisome in vitro. Nucleosomes present a strong, sequence-dependent barrier for replication, with particularly strong pausing of DNA polymerase at the +(31–40) and +(41–65) regions of the nucleosomal DNA. The exonuclease activity of T7 DNA polymerase increases the overall rate of progression of the replisome through a nucleosome, likely by resolving nonproductive complexes. The presence of nucleosome-free DNA upstream of the replication fork facilitates the progression of DNA polymerase through the nucleosome. After replication, at least 50% of the nucleosomes assume an alternative conformation, maintaining their original positions on the DNA. Our data suggest a previously unpublished mechanism for nucleosome maintenance during replication, likely involving transient formation of an intranucleosomal DNA loop. PMID:27847876
Pekar-Zlotin, Marina; Hirsch, Fred R; Soussan-Gutman, Lior; Ilouze, Maya; Dvir, Addie; Boyle, Theresa; Wynes, Murry; Miller, Vincent A; Lipson, Doron; Palmer, Gary A; Ali, Siraj M; Dekel, Shlomi; Brenner, Ronen; Bunn, Paul A; Peled, Nir
2015-03-01
The U.S. Food and Drug Administration-approved method for detecting EML4-ALK rearrangement is fluorescence in situ hybridization (FISH); however, data supporting the use of immunohistochemistry (IHC) for that purpose are accumulating. Previous studies that compared FISH and IHC considered FISH the gold standard, but none compared data with the results of next-generation sequencing (NGS) analysis. We studied FISH and IHC (D5F3 antibody) systematically for EML4-ALK rearrangement in 51 lung adenocarcinoma patients, followed by NGS in case of discordance. Of 51 patients, 4 were positive with FISH (7.8%), and 8 were positive with IHC (15.7%). Three were positive with both. NGS confirmed that four of the five patients who were positive with IHC and negative with FISH were positive for ALK. Two were treated by crizotinib, with progression-free survival of 18 and 6 months. Considering NGS as the most accurate test, the sensitivity and specificity were 42.9% and 97.7%, respectively, for FISH and 100% and 97.7%, respectively, for IHC. The FISH-based method of detecting EML4-ALK rearrangement in lung cancer may miss a significant number of patients who could benefit from targeted ALK therapy. Screening for EML4-ALK rearrangement by IHC should be strongly considered, and NGS is recommended in borderline cases. Two patients who were negative with FISH and positive with IHC were treated with crizotinib and responded to therapy. ©AlphaMed Press.
Lee, Seung-Hun; Kim, Kyoo-Tae; Kwon, Oh-Deog; Ock, Younsung; Kim, Taeil; Choi, Donghag; Kwak, Dongmi
2016-01-01
We describe for the first time the detection of Coxiella-like bacteria (CLB), Theileria luwenshuni, and T. ovis endosymbionts in blood-sucking deer keds. Eight deer keds attached to a Korean water deer were identified as Lipoptena fortisetosa (Diptera: Hippoboscidae) by morphological and genetic analyses. Among the endosymbionts assessed, CLB, Theileria luwenshuni, and T. ovis were identified in L. fortisetosa by PCR and nucleotide sequencing. Based on phylogeny, CLB 16S rRNA sequences were classified into clade B, sharing 99.4% identity with CLB from Haemaphysalis longicornis in South Korea. Although the virulence of CLB to vertebrates is still controversial, several studies have reported clinical symptoms in birds due to CLB infections. The 18S rRNA sequences of T. luwenshuni and T. ovis in this study were 98.8-100% identical to those in GenBank, and all of the obtained sequences of T. ovis and T. luwenshuni in this study were 100% identical to each other, respectively. Although further studies are required to positively confirm L. fortisetosa as a biological vector of these pathogens, strong genetic relationships among sequences from this and previous studies suggest potential transmission among mammalian hosts by ticks and keds.
Choudhary, Kumari S.; Mih, Nathan; Monk, Jonathan; Kavvas, Erol; Yurkovich, James T.; Sakoulas, George; Palsson, Bernhard O.
2018-01-01
Two-component systems (TCSs) consist of a histidine kinase and a response regulator. Here, we evaluated the conservation of the AgrAC TCS among 149 completely sequenced Staphylococcus aureus strains. It is composed of four genes: agrBDCA. We found that: (i) AgrAC system (agr) was found in all but one of the 149 strains, (ii) the agr positive strains were further classified into four agr types based on AgrD protein sequences, (iii) the four agr types not only specified the chromosomal arrangement of the agr genes but also the sequence divergence of AgrC histidine kinase protein, which confers signal specificity, (iv) the sequence divergence was reflected in distinct structural properties especially in the transmembrane region and second extracellular binding domain, and (v) there was a strong correlation between the agr type and the virulence genomic profile of the organism. Taken together, these results demonstrate that bioinformatic analysis of the agr locus leads to a classification system that correlates with the presence of virulence factors and protein structural properties. PMID:29887846
NASA Astrophysics Data System (ADS)
Fillman, Jake
2017-03-01
We study Jacobi matrices that are uniformly approximated by periodic operators. We show that if the rate of approximation is sufficiently rapid, then the associated quantum dynamics are ballistic in a rather strong sense; namely, the (normalized) Heisenberg evolution of the position operator converges strongly to a self-adjoint operator that is injective on the space of absolutely summable sequences. In particular, this means that all transport exponents corresponding to well-localized initial states are equal to one. Our result may be applied to a class of quantum many-body problems. Specifically, we establish a lower bound on the Lieb-Robinson velocity for an isotropic XY spin chain on the integers with limit-periodic couplings.
Disappearance of nucleosome positioning in mitotic chromatin in vivo.
Komura, Jun-ichiro; Ono, Tetsuya
2005-04-15
During mitosis, transcription is silenced and most transcription factors are displaced from their recognition sequences. By in vivo footprinting analysis, we have confirmed and extended previous studies showing loss of transcription factors from an RNA polymerase II promoter (c-FOS) and, for the first time, an RNA polymerase III promoter (U6) in HeLa cells. Because little was known about nucleosomal organization in mitotic chromosomes, we performed footprinting analysis for nucleosomes on these promoters in interphase and mitotic cells. During interphase, each of the promoters had a positioned nucleosome in the region intervening between proximal promoter elements and distal enhancer elements, but the strong nucleosome positioning disappeared during mitosis. Thus, the nucleosomal organization that appears to facilitate transcription in interphase cells may be lost in mitotic cells, and nucleosome positioning during mitosis does not seem to be a major component of the epigenetic mechanisms to mark genes for rapid reactivation after this phase.
Mourier, Pierre A J; Guichard, Olivier Y; Herman, Fréderic; Sizun, Philippe; Viskov, Christian
2017-03-08
Low Molecular Weight Heparins (LMWH) are complex anticoagulant drugs that mainly inhibit the blood coagulation cascade through indirect interaction with antithrombin. While inhibition of the factor Xa is well described, little is known about the polysaccharide structure inhibiting thrombin. In fact, a minimal chain length of 18 saccharides units, including an antithrombin (AT) binding pentasaccharide, is mandatory to form the active ternary complex for LMWH obtained by alkaline β-elimination (e.g., enoxaparin). However, the relationship between structure of octadecasaccharides and their thrombin inhibition has not been yet assessed on natural compounds due to technical hurdles to isolate sufficiently pure material. We report the preparation of five octadecasaccharides by using orthogonal separation methods including size exclusion, AT affinity, ion pairing and strong anion exchange chromatography. Each of these octadecasaccharides possesses two AT binding pentasaccharide sequences located at various positions. After structural elucidation using enzymatic sequencing and NMR, in vitro aFXa and aFIIa were determined. The biological activities reveal the critical role of each pentasaccharide sequence position within the octadecasaccharides and structural requirements to inhibit thrombin. Significant differences in potency, such as the twenty-fold magnitude difference observed between two regioisomers, further highlights the importance of depolymerisation process conditions on LMWH biological activity.
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands
NASA Astrophysics Data System (ADS)
Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree
2018-05-01
In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS, base pair is introduced as a constituent in different set of 12-mer and 14-mer DNA sequences for the molecular dynamics (MD) simulation in explicit water solvent. AMBER 14 force field was employed for each set of duplex during the 200ns production-dynamics simulation in orthogonal-box-water solvent by the Particle-Mesh-Ewald (PME) method in infinite periodic boundary conditions (PBC) to determine conformational parameters of the complex. The force-field parameters of modified base-pair were calculated by Gaussian-code using Hartree-Fock /ab-initio methodology. RMSD Results reveal that the conformation of the duplex is sequence dependent and the binding energy of the complex depends on the position of the modified base-pair in the nucleic acid strand. We found that non-bonding energy had a significant contribution to stabilising such type of duplex in comparison to electrostatic energy. The distortion produced within strands by such type of base-pair was local and destabilised the duplex integrity near to substitution, moreover the binding energy of duplex depends on the position of substitution of hydrophobic base-pair and the DNA sequence and strongly supports the corresponding experimental study.
Identification of immunodominant antigens for the laboratory diagnosis of toxocariasis.
Zhan, Bin; Ajmera, Ravi; Geiger, Stefan Michael; Gonçalves, Marco Túlio Porto; Liu, Zhuyun; Wei, Junfei; Wilkins, Patricia P; Fujiwara, Ricardo; Gazzinelli-Guimaraes, Pedro Henrique; Bottazzi, Maria Elena; Hotez, Peter
2015-12-01
To identify immunodominant antigens of Toxocara canis recognised by Toxocara-infected sera as recombinant reagents for immunodiagnosis of toxocariasis. Pooled sera from human cases of toxocariasis were used to identify immunodominant antigens by immunoscreening a T. canis larval expression cDNA library. The positive clones were sequenced to reveal the identity of the antigens. The recombinant proteins were expressed in E. coli and then used to confirm their immunoreaction with sera of humans with toxocariasis. Two chosen antigens were also used to differentiate Toxocara infection from other helminth infections in mice. Eleven antigens with immunodiagnostic potential were identified, including two C-type lectins (CTLs) that reacted strongly with the Toxocara-positive serum pool. The first CTL (Tc-CTL-1) is the same as TES-32, previously identified as a major immunodominant component of TES; the second CTL (Tc-CTL-2) is a novel C-type lectin sharing 83% amino acid sequence identity within the functional domain of Tc-CTL-1. The E. coli-expressed recombinant Tc-CTL-1 was strongly recognised by the Toxocara-positive serum pool or sera from animals experimentally infected with T. canis. Reactivity with recombinant Tc-CTL-1 was higher when the unreduced protein was used in an enzyme-linked immunosorbent assay (ELISA), dot-blot assay or Western blot test compared to the protein under reduced condition. Both recombinant Tc-CTL-1- and Tc-CTL-2-based ELISAs were able to differentiate T. canis infection from other helminth infections in experimentally infected mice. Both Tc-CTL-1 and Tc-CTL-2 were able to differentiate Toxocara infection from other helminth infections and could potentially be used as sensitive and specific immunodiagnostic antigens. © 2015 John Wiley & Sons Ltd.
Wu, Leeying; Oficjalska, Katarzyna; Lambert, Matthew; Fennell, Brian J; Darmanin-Sheehan, Alfredo; Ní Shúilleabháin, Deirdre; Autin, Bénédicte; Cummins, Emma; Tchistiakova, Lioudmila; Bloom, Laird; Paulsen, Janet; Gill, Davinder; Cunningham, Orla; Finlay, William J J
2012-01-01
Examination of 1269 unique naive chicken V(H) sequences showed that the majority of positions in the framework (FW) regions were maintained as germline, with high mutation rates observed in the CDRs. Many FW mutations could be clearly related to the modulation of CDR structure or the V(H)-V(L) interface. CDRs 1 and 2 of the V(H) exhibited frequent mutation in solvent-exposed positions, but conservation of common structural residues also found in human CDRs at the same positions. In comparison with humans and mice, the chicken CDR3 repertoire was skewed toward longer sequences, was dominated by small amino acids (G/S/A/C/T), and had higher cysteine (chicken, 9.4%; human, 1.6%; and mouse, 0.25%) but lower tyrosine content (chicken, 9.2%; human, 16.8%; and mouse 26.4%). A strong correlation (R(2) = 0.97) was observed between increasing CDR3 length and higher cysteine content. This suggests that noncanonical disulfides are strongly favored in chickens, potentially increasing CDR stability and complexity in the topology of the combining site. The probable formation of disulfide bonds between CDR3 and CDR1, FW2, or CDR2 was also observed, as described in camelids. All features of the naive repertoire were fully replicated in the target-selected, phage-displayed repertoire. The isolation of a chicken Fab with four noncanonical cysteines in the V(H) that exhibits 64 nM (K(D)) binding affinity for its target proved these constituents to be part of the humoral response, not artifacts. This study supports the hypothesis that disulfide bond-constrained CDR3s are a structural diversification strategy in the restricted germline v-gene repertoire of chickens.
Hiiesalu, Inga; Pärtel, Meelis; Davison, John; Gerhold, Pille; Metsis, Madis; Moora, Mari; Öpik, Maarja; Vasar, Martti; Zobel, Martin; Wilson, Scott D
2014-07-01
Although experiments show a positive association between vascular plant and arbuscular mycorrhizal fungal (AMF) species richness, evidence from natural ecosystems is scarce. Furthermore, there is little knowledge about how AMF richness varies with belowground plant richness and biomass. We examined relationships among AMF richness, above- and belowground plant richness, and plant root and shoot biomass in a native North American grassland. Root-colonizing AMF richness and belowground plant richness were detected from the same bulk root samples by 454-sequencing of the AMF SSU rRNA and plant trnL genes. In total we detected 63 AMF taxa. Plant richness was 1.5 times greater belowground than aboveground. AMF richness was significantly positively correlated with plant species richness, and more strongly with below- than aboveground plant richness. Belowground plant richness was positively correlated with belowground plant biomass and total plant biomass, whereas aboveground plant richness was positively correlated only with belowground plant biomass. By contrast, AMF richness was negatively correlated with belowground and total plant biomass. Our results indicate that AMF richness and plant belowground richness are more strongly related with each other and with plant community biomass than with the plant aboveground richness measures that have been almost exclusively considered to date. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.
Tian, Pengfei; Best, Robert B
2017-10-17
Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Maillard, J C; Martinez, D; Bensaid, A
1996-07-23
One hundred and twenty-seven Brahman cattle from several locations in Martinique (FWI), reared under different environmental conditions, were followed over three years and checked for clinical signs of dermatophilosis. To confirm that these animals had been in contact with the pathogen Dermatophilus congolensis, their sera were tested by ELISA. On the basis of this epidemiological study, 12 animals were classified as resistant (seropositive without clinical signs), belonging to herds in which the prevalence of the disease ranged from 25 to nearly 98%. Eighteen animals classified as highly susceptible displayed severe characteristic skin lesions. These 30 selected animals were typed for class I antigens of the major histocompatibility complex (MHC). MHC class II genes were analyzed using the polymerase chain reaction (PCR) and restriction fragment length polymorphism (RFLP) techniques, on the exon 2 of the bovine leucocyte antigen (BoLA) DRB3 gene. Several alleles were found, according to patterns provided by the restriction enzymes used: Fnu 4HI, Dpn II, Hae III, and Rsa I. A particular sequence "EIAY" at amino acid positions 66/67/74/78 located in the antigen recognition sites (ARS) was found in the 12 animals classified as resistant, and 10 of them displayed also class I BoLA-A8 specificity. On the other hand, only 3 out of the 18 susceptible animals showed simultaneously the BoLA-DRB3 "EIAY" sequence and BoLA-A8 specificity. Interestingly, a serine residue at position 30 of the ARS was found in 8 of the susceptible animals and was completely absent from all resistant animals. Furthermore, in a same animal, the serine at position 30 and the EIAY sequence were never found simultaneously on the same haplotype. These results show a strong correlation between the resistant character to dermatophilosis and the association of MHC haplotypes: the BoLA-A8 specificity and the BoLA-DRB3 "EIAY" sequence at ARS positions 66/67/74/78 with the lack of serine in position 30. To confirm these results, family segregation studies are in progress and some interesting observations have been obtained.
Rivera-Rivera, Carlos J; Montoya-Burgos, Juan I
2016-06-01
Phylogenetic inference artifacts can occur when sequence evolution deviates from assumptions made by the models used to analyze them. The combination of strong model assumption violations and highly heterogeneous lineage evolutionary rates can become problematic in phylogenetic inference, and lead to the well-described long-branch attraction (LBA) artifact. Here, we define an objective criterion for assessing lineage evolutionary rate heterogeneity among predefined lineages: the result of a likelihood ratio test between a model in which the lineages evolve at the same rate (homogeneous model) and a model in which different lineage rates are allowed (heterogeneous model). We implement this criterion in the algorithm Locus Specific Sequence Subsampling (LS³), aimed at reducing the effects of LBA in multi-gene datasets. For each gene, LS³ sequentially removes the fastest-evolving taxon of the ingroup and tests for lineage rate homogeneity until all lineages have uniform evolutionary rates. The sequences excluded from the homogeneously evolving taxon subset are flagged as potentially problematic. The software implementation provides the user with the possibility to remove the flagged sequences for generating a new concatenated alignment. We tested LS³ with simulations and two real datasets containing LBA artifacts: a nucleotide dataset regarding the position of Glires within mammals and an amino-acid dataset concerning the position of nematodes within bilaterians. The initially incorrect phylogenies were corrected in all cases upon removing data flagged by LS³. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Prediction suppression and surprise enhancement in monkey inferotemporal cortex.
Ramachandran, Suchitra; Meyer, Travis; Olson, Carl R
2017-07-01
Exposing monkeys, over the course of days and weeks, to pairs of images presented in fixed sequence, so that each leading image becomes a predictor for the corresponding trailing image, affects neuronal visual responsiveness in area TE. At the end of the training period, neurons respond relatively weakly to a trailing image when it appears in a trained sequence and, thus, confirms prediction, whereas they respond relatively strongly to the same image when it appears in an untrained sequence and, thus, violates prediction. This effect could arise from prediction suppression (reduced firing in response to the occurrence of a probable event) or surprise enhancement (elevated firing in response to the omission of a probable event). To identify its cause, we compared firing under the prediction-confirming and prediction-violating conditions to firing under a prediction-neutral condition. The results provide strong evidence for prediction suppression and limited evidence for surprise enhancement. NEW & NOTEWORTHY In predictive coding models of the visual system, neurons carry signed prediction error signals. We show here that monkey inferotemporal neurons exhibit prediction-modulated firing, as posited by these models, but that the signal is unsigned. The response to a prediction-confirming image is suppressed, and the response to a prediction-violating image may be enhanced. These results are better explained by a model in which the visual system emphasizes unpredicted events than by a predictive coding model. Copyright © 2017 the American Physiological Society.
Grace, Christy R.; Ferreira, Antonio M.; Waddell, M. Brett; Ridout, Granger; Naeve, Deanna; Leuze, Michael; LoCascio, Philip F.; Panetta, John C.; Wilkinson, Mark R.; Pui, Ching-Hon; Naeve, Clayton W.; Uberbacher, Edward C.; Bonten, Erik J.; Evans, William E.
2016-01-01
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA) and typically down-regulating their stability or translation. Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence (i.e., NMR, FRET, SPR) that purine or pyrimidine-rich microRNAs of appropriate length and sequence form triple-helical structures with purine-rich sequences of duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p<2.2 × 10−16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. This work has thus revealed a new mechanism by which microRNAs could interact with gene promoter regions to modify gene transcription. PMID:26844769
Zhao, Shanrong; Zhang, Ying; Gamini, Ramya; Zhang, Baohong; von Schack, David
2018-03-19
To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.
Patterns and rates of intron divergence between humans and chimpanzees
Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi
2007-01-01
Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804
Hybrid DNA virus in Chinese patients with seronegative hepatitis discovered by deep sequencing.
Xu, Baoyan; Zhi, Ning; Hu, Gangqing; Wan, Zhihong; Zheng, Xiaobin; Liu, Xiaohong; Wong, Susan; Kajigaya, Sachiko; Zhao, Keji; Mao, Qing; Young, Neal S
2013-06-18
Seronegative hepatitis--non-A, non-B, non-C, non-D, non-E hepatitis--is poorly characterized but strongly associated with serious complications. We collected 92 sera specimens from patients with non-A-E hepatitis in Chongqing, China between 1999 and 2007. Ten sera pools were screened by Solexa deep sequencing. We discovered a 3,780-bp contig present in all 10 pools that yielded BLASTx E scores of 7e-05-0.008 against parvoviruses. The complete sequence of the in silico-assembled 3,780-bp contig was confirmed by gene amplification of overlapping regions over almost the entire genome, and the virus was provisionally designated NIH-CQV. Further analysis revealed that the contig was composed of two major ORFs. By protein BLAST, ORF1 and ORF2 were most homologous to the replication-associated protein of bat circovirus and the capsid protein of porcine parvovirus, respectively. Phylogenetic analysis indicated that NIH-CQV is located at the interface of Parvoviridae and Circoviridae. Prevalence of NIH-CQV in patients was determined by quantitative PCR. Sixty-three of 90 patient samples (70%) were positive, but all those from 45 healthy controls were negative. Average virus titer in the patient specimens was 1.05 e4 copies/µL. Specific antibodies against NIH-CQV were sought by immunoblotting. Eighty-four percent of patients were positive for IgG, and 31% were positive for IgM; in contrast, 78% of healthy controls were positive for IgG, but all were negative for IgM. Although more work is needed to determine the etiologic role of NIH-CQV in human disease, our data indicate that a parvovirus-like virus is highly prevalent in a cohort of patients with non-A-E hepatitis.
Evolution of major histocompatibility complex class I and class II genes in the brown bear
2012-01-01
Background Major histocompatibility complex (MHC) proteins constitute an essential component of the vertebrate immune response, and are coded by the most polymorphic of the vertebrate genes. Here, we investigated sequence variation and evolution of MHC class I and class II DRB, DQA and DQB genes in the brown bear Ursus arctos to characterise the level of polymorphism, estimate the strength of positive selection acting on them, and assess the extent of gene orthology and trans-species polymorphism in Ursidae. Results We found 37 MHC class I, 16 MHC class II DRB, four DQB and two DQA alleles. We confirmed the expression of several loci: three MHC class I, two DRB, two DQB and one DQA. MHC class I also contained two clusters of non-expressed sequences. MHC class I and DRB allele frequencies differed between northern and southern populations of the Scandinavian brown bear. The rate of nonsynonymous substitutions (dN) exceeded the rate of synonymous substitutions (dS) at putative antigen binding sites of DRB and DQB loci and, marginally significantly, at MHC class I loci. Models of codon evolution supported positive selection at DRB and MHC class I loci. Both MHC class I and MHC class II sequences showed orthology to gene clusters found in the giant panda Ailuropoda melanoleuca. Conclusions Historical positive selection has acted on MHC class I, class II DRB and DQB, but not on the DQA locus. The signal of historical positive selection on the DRB locus was particularly strong, which may be a general feature of caniforms. The presence of MHC class I pseudogenes may indicate faster gene turnover in this class through the birth-and-death process. South–north population structure at MHC loci probably reflects origin of the populations from separate glacial refugia. PMID:23031405
Evolution of major histocompatibility complex class I and class II genes in the brown bear.
Kuduk, Katarzyna; Babik, Wiesław; Bojarska, Katarzyna; Sliwińska, Ewa B; Kindberg, Jonas; Taberlet, Pierre; Swenson, Jon E; Radwan, Jacek
2012-10-02
Major histocompatibility complex (MHC) proteins constitute an essential component of the vertebrate immune response, and are coded by the most polymorphic of the vertebrate genes. Here, we investigated sequence variation and evolution of MHC class I and class II DRB, DQA and DQB genes in the brown bear Ursus arctos to characterise the level of polymorphism, estimate the strength of positive selection acting on them, and assess the extent of gene orthology and trans-species polymorphism in Ursidae. We found 37 MHC class I, 16 MHC class II DRB, four DQB and two DQA alleles. We confirmed the expression of several loci: three MHC class I, two DRB, two DQB and one DQA. MHC class I also contained two clusters of non-expressed sequences. MHC class I and DRB allele frequencies differed between northern and southern populations of the Scandinavian brown bear. The rate of nonsynonymous substitutions (dN) exceeded the rate of synonymous substitutions (dS) at putative antigen binding sites of DRB and DQB loci and, marginally significantly, at MHC class I loci. Models of codon evolution supported positive selection at DRB and MHC class I loci. Both MHC class I and MHC class II sequences showed orthology to gene clusters found in the giant panda Ailuropoda melanoleuca. Historical positive selection has acted on MHC class I, class II DRB and DQB, but not on the DQA locus. The signal of historical positive selection on the DRB locus was particularly strong, which may be a general feature of caniforms. The presence of MHC class I pseudogenes may indicate faster gene turnover in this class through the birth-and-death process. South-north population structure at MHC loci probably reflects origin of the populations from separate glacial refugia.
Pandey, Gaurav; Pandey, Om P; Rogers, Angela J; Ahsen, Mehmet E; Hoffman, Gabriel E; Raby, Benjamin A; Weiss, Scott T; Schadt, Eric E; Bunyavanich, Supinda
2018-06-11
Asthma is a common, under-diagnosed disease affecting all ages. We sought to identify a nasal brush-based classifier of mild/moderate asthma. 190 subjects with mild/moderate asthma and controls underwent nasal brushing and RNA sequencing of nasal samples. A machine learning-based pipeline identified an asthma classifier consisting of 90 genes interpreted via an L2-regularized logistic regression classification model. This classifier performed with strong predictive value and sensitivity across eight test sets, including (1) a test set of independent asthmatic and control subjects profiled by RNA sequencing (positive and negative predictive values of 1.00 and 0.96, respectively; AUC of 0.994), (2) two independent case-control cohorts of asthma profiled by microarray, and (3) five cohorts with other respiratory conditions (allergic rhinitis, upper respiratory infection, cystic fibrosis, smoking), where the classifier had a low to zero misclassification rate. Following validation in large, prospective cohorts, this classifier could be developed into a nasal biomarker of asthma.
The predicted secondary structures of class I fructose-bisphosphate aldolases.
Sawyer, L; Fothergill-Gilmore, L A; Freemont, P S
1988-01-01
The results of several secondary-structure prediction programs were combined to produce an estimate of the regions of alpha-helix, beta-sheet and reverse turns for fructose-bisphosphate aldolases from human and rat muscle and liver, from Trypanosoma brucei and from Drosophila melanogaster. All the aldolase sequences gave essentially the same pattern of secondary-structure predictions despite having sequences up to 50% different. One exception to this pattern was an additional strongly predicted helix in the rat liver and Drosophila enzymes. Regions of relatively high sequence variation generally were predicted as reverse turns, and probably occur as surface loops. Most of the positions corresponding to exon boundaries are located between regions predicted to have secondary-structural elements consistent with a compact structure. The predominantly alternating alpha/beta structure predicted is consistent with the alpha/beta-barrel structure indicated by preliminary high-resolution X-ray diffraction studies on rabbit muscle aldolase [Sygusch, Beaudry & Allaire (1986) Biophys. J. 49, 287a]. Images Fig. 1. (cont.) Fig. 1. PMID:3128269
Programmable and highly resolved in vitro detection of 5-methylcytosine by TALEs.
Kubik, Grzegorz; Schmidt, Moritz J; Penner, Johanna E; Summerer, Daniel
2014-06-02
Gene expression is extensively regulated by specific patterns of genomic 5-methylcytosine (mC), but the ability to directly detect this modification at user-defined genomic loci is limited. One reason is the lack of molecules that discriminate between mC and cytosine (C) and at the same time provide inherent, programmable sequence-selectivity. Programmable transcription-activator-like effectors (TALEs) have been observed to exhibit mC-sensitivity in vivo, but to only a limited extent in vitro. We report an mC-detection assay based on TALE control of DNA replication that displays unexpectedly strong mC-discrimination ability in vitro. The status and level of mC modification at single positions in oligonucleotides can be determined unambiguously by this assay, independently of the overall target sequence. Moreover, discrimination is reliably observed for positions bound by N-terminal and central regions of TALEs. This indicates the wide scope and robustness of the approach for highly resolved mC detection and enabled the detection of a single mC in a large, eukaryotic genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Diagnosis of Barmah Forest Virus Infection by a Nested Real-Time SYBR Green RT-PCR Assay
Hueston, Linda; Toi, Cheryl S.; Jeoffreys, Neisha; Sorrell, Tania; Gilbert, Gwendolyn
2013-01-01
Barmah Forest virus (BFV) is a mosquito borne (+) ssRNA alphavirus found only in Australia. It causes rash, myalgia and arthralgia in humans and is usually diagnosed serologically. We developed a real-time PCR assay to detect BFV in an effort to improve diagnosis early in the course of infection. The limit of detection was 16 genome equivalents with a specificity of 100%. Fifty five serum samples from BFV-infected patients were tested by the PCR. 52 of 53 antibody-positive samples were PCR negative. Two culture-positive (neutralizing antibody negative) samples were positive on first round PCR, while one sample (IgM and neutralizing antibody strongly positive, IgG negative) was positive on second round PCR, suggesting that viral RNA is detectable and transiently present in early infection. PCR can provide results faster than culture, is capable of high throughput and by sequencing the PCR product strain variants can be characterized. PMID:23935816
Vibration-Rotation Bands of HF and DF
1977-09-23
98 IZa. Comparison of Observed and Calculated Line Positions of HF, Av = I Sequence ........................... 99 f2b. Comparison of Observed and...Calculated Line Positions of HF, Av = 2 Sequence ........................... 102 12c. Comparison of Observed and Calculated Line Positions of HF, Av = 3...Sequence ........................... 107 i2d. Comparison of Observed and Calculated Line Positions ofHF, Av = 4 Sequence ........................... fi
Xiong, Lan; Dion, Patrick; Montplaisir, Jacques; Levchenko, Anastasia; Thibodeau, Pascale; Karemera, Liliane; Rivière, Jean-Baptiste; St-Onge, Judith; Gaspar, Claudia; Dubé, Marie-Pierre; Desautels, Alex; Turecki, Gustavo; Rouleau, Guy A
2007-10-05
Converging evidence from clinical observations, brain imaging and pathological findings strongly indicate impaired brain iron regulation in restless legs syndrome (RLS). Animal models with mutation in (DMT1) divalent metal transporter 1 gene, an important brain iron transporter, demonstrate a similar iron deficiency profile as found in RLS brain. The human DMT1 gene, mapped to chromosome 12q near the RLS1 locus, qualifies as an excellent functional and possible positional candidate for RLS. DMT1 protein levels were assessed in lymphoblastoid cell lines from RLS patients and controls. Linkage analyses were carried out with markers flanking and within the DMT1 gene. Selected patient samples from RLS families with compatible linkage to the RLS1 locus on 12q were fully sequenced in both the coding regions and the long stretches of UTR sequences. Finally, selected sequence variants were further studied in case/control and family-based association tests. A clinical association of anemia and RLS was further confirmed in this study. There was no detectable difference in DMT1 protein levels between RLS patient lymphoblastoid cell lines and normal controls. Non-parametric linkage analyses failed to identify any significant linkage signals within the DMT1 gene region. Sequencing of selected patients did not detect any sequence variant(s) compatible with DMT1 harboring RLS causative mutation(s). Further studies did not find any association between ten SNPs, spanning the whole DMT1 gene region, and RLS affection status. Finally, two DMT1 intronic SNPs showed positive association with RLS in patients with a history of anemia, when compared to RLS patients without anemia. (c) 2007 Wiley-Liss, Inc.
Wyllie, David H; Sanderson, Nicholas; Myers, Richard; Peto, Tim; Robinson, Esther; Crook, Derrick W; Smith, E Grace; Walker, A Sarah
2018-06-06
Contact tracing requires reliable identification of closely related bacterial isolates. When we noticed the reporting of artefactual variation between M. tuberculosis isolates during routine next generation sequencing of Mycobacterium spp, we investigated its basis in 2,018 consecutive M. tuberculosis isolates. In the routine process used, clinical samples were decontaminated and inoculated into broth cultures; from positive broth cultures DNA was extracted, sequenced, reads mapped, and consensus sequences determined. We investigated the process of consensus sequence determination, which selects the most common nucleotide at each position. Having determined the high-quality read depth and depth of minor variants across 8,006 M. tuberculosis genomic regions, we quantified the relationship between the minor variant depth and the amount of non-Mycobacterial bacterial DNA, which originates from commensal microbes killed during sample decontamination. In the presence of non-Mycobacterial bacterial DNA, we found significant increases in minor variant frequencies of more than 1.5 fold in 242 regions covering 5.1% of the M. tuberculosis genome. Included within these were four high variation regions strongly influenced by the amount of non-Mycobacterial bacterial DNA. Excluding these four regions from pairwise distance comparisons reduced biologically implausible variation from 5.2% to 0% in an independent validation set derived from 226 individuals. Thus, we have demonstrated an approach identifying critical genomic regions contributing to clinically relevant artefactual variation in bacterial similarity searches. The approach described monitors the outputs of the complex multi-step laboratory and bioinformatics process, allows periodic process adjustments, and will have application to quality control of routine bacterial genomics. Copyright © 2018 Wyllie et al.
Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R
Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a targeted customized panel for next-generation sequencing Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced 11 samples of patients with severe HTG (TG>885 mg/dL-10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate, and it allowed to confirm genetic variants previously identified in all positive controls with primary severe HTG. Only 1 patient of 7 with HTG was found to be carrier of a homozygous pathogenic mutation of the third novel mutation of LMF1 gene (c.1380C>G-p.Y460X). The clinical and molecular familial cascade screening allowed the identification of 2 additional affected siblings and 7 heterozygous carriers of the mutation. We showed that our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical compared with traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging and clinical resequencing should mainly intended for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Vandenbol, M; Jauniaux, J C; Grenson, M
1989-11-15
The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.
Evolution of heliobacteria: implications for photosynthetic reaction center complexes
NASA Technical Reports Server (NTRS)
Vermaas, W. F.; Blankenship, R. E. (Principal Investigator)
1994-01-01
The evolutionary position of the heliobacteria, a group of green photosynthetic bacteria with a photosynthetic apparatus functionally resembling Photosystem I of plants and cyanobacteria, has been investigated with respect to the evolutionary relationship to Gram-positive bacteria and cyanobacteria. On the basis of 16S rRNA sequence analysis, the heliobacteria appear to be most closely related to Gram-positive bacteria, but also an evolutionary link to cyanobacteria is evident. Interestingly, a 46-residue domain including the putative sixth membrane-spanning region of the heliobacterial reaction center protein show rather strong similarity (33% identity and 72% similarity) to a region including the sixth membrane-spanning region of the CP47 protein, a chlorophyll-binding core antenna polypeptide of Photosystem II. The N-terminal half of the heliobacterial reaction center polypeptide shows a moderate sequence similarity (22% identity over 232 residues) with the CP47 protein, which is significantly more than the similarity with the Photosystem I core polypeptides in this region. An evolutionary model for photosynthetic reaction center complexes is discussed, in which an ancestral homodimeric reaction center protein (possibly resembling the heliobacterial reaction center protein) with 11 membrane-spanning regions per polypeptide has diverged to give rise to the core of Photosystem I, Photosystem II, and of the photosynthetic apparatus in green, purple, and heliobacteria.
Ravi, Anuradha; Estensmo, Eva Lena F; Abée-Lund, Trine M L'; Foley, Steven L; Allgaier, Bernhard; Martin, Camilia R; Claud, Erika C; Rudi, Knut
2017-11-01
BackgroundThe preterm infant gut microbiota is vulnerable to different biotic and abiotic factors. Although the development of this microbiota has been extensively studied, the mobilome-i.e. the mobile genetic elements (MGEs) in the gut microbiota-has not been considered. Therefore, the aim of this study was to investigate the association of the mobilome with birth weight and hospital location in the preterm infant gut microbiota.MethodsThe data set consists of fecal samples from 62 preterm infants with and without necrotizing enterocolitis (NEC) from three different hospitals. We analyzed the gut microbiome by using 16S rRNA amplicon sequencing, shot-gun metagenome sequencing, and quantitative PCR. Predictive models and other data analyses were performed using MATLAB and QIIME.ResultSThe microbiota composition was significantly different between NEC-positive and NEC-negative infants and significantly different between hospitals. An operational taxanomic unit (OTU) showed strong positive and negative correlation with NEC and birth weight, respectively, whereas none showed significance for mode of delivery. Metagenome analyses revealed high levels of conjugative plasmids with MGEs and virulence genes. Results from quantitative PCR showed that the plasmid signature genes were significantly different between hospitals and in NEC-positive infants.ConclusionOur results point toward an association of the mobilome with hospital location in preterm infants.
Transcriptome profile and unique genetic evolution of positively selected genes in yak lungs.
Lan, DaoLiang; Xiong, XianRong; Ji, WenHui; Li, Jian; Mipam, Tserang-Donko; Ai, Yi; Chai, ZhiXin
2018-04-01
The yak (Bos grunniens), which is a unique bovine breed that is distributed mainly in the Qinghai-Tibetan Plateau, is considered a good model for studying plateau adaptability in mammals. The lungs are important functional organs that enable animals to adapt to their external environment. However, the genetic mechanism underlying the adaptability of yak lungs to harsh plateau environments remains unknown. To explore the unique evolutionary process and genetic mechanism of yak adaptation to plateau environments, we performed transcriptome sequencing of yak and cattle (Bos taurus) lungs using RNA-Seq technology and a subsequent comparison analysis to identify the positively selected genes in the yak. After deep sequencing, a normal transcriptome profile of yak lung that containing a total of 16,815 expressed genes was obtained, and the characteristics of yak lungs transcriptome was described by functional analysis. Furthermore, Ka/Ks comparison statistics result showed that 39 strong positively selected genes are identified from yak lungs. Further GO and KEGG analysis was conducted for the functional annotation of these genes. The results of this study provide valuable data for further explorations of the unique evolutionary process of high-altitude hypoxia adaptation in yaks in the Tibetan Plateau and the genetic mechanism at the molecular level.
Paugh, Steven W.; Coss, David R.; Bao, Ju; ...
2016-02-04
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA). Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence that microRNAs form triple-helical structures with duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show thatmore » several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p<2.2 x 10 -16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. As a result, this work has thus revealed a new mechanism by which microRNAs can interact with gene promoter regions to modify gene transcription.« less
Determining divergence times with a protein clock: update and reevaluation
NASA Technical Reports Server (NTRS)
Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)
1997-01-01
A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.
Abraham, Paul E; Wang, Xiaojing; Ranjan, Priya; Nookaew, Intawat; Zhang, Bing; Tuskan, Gerald A; Hettich, Robert L
2015-12-04
Next-generation sequencing has transformed the ability to link genotypes to phenotypes and facilitates the dissection of genetic contribution to complex traits. However, it is challenging to link genetic variants with the perturbed functional effects on proteins encoded by such genes. Here we show how RNA sequencing can be exploited to construct genotype-specific protein sequence databases to assess natural variation in proteins, providing information about the molecular toolbox driving cellular processes. For this study, we used two natural genotypes selected from a recent genome-wide association study of Populus trichocarpa, an obligate outcrosser with tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs), as well as insertions and deletions. We profiled the frequency of 128 types of naturally occurring amino acid substitutions, including both expected (neutral) and unexpected (non-neutral) SAAPs, with a subset occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. By zeroing in on the molecular signatures of these important regions that might have previously been uncharacterized, we now provide a high-resolution molecular inventory that should improve accessibility and subsequent identification of natural protein variants in future genotype-to-phenotype studies.
Conserved noncoding sequences conserve biological networks and influence genome evolution.
Xie, Jianbo; Qian, Kecheng; Si, Jingna; Xiao, Liang; Ci, Dong; Zhang, Deqiang
2018-05-01
Comparative genomics approaches have identified numerous conserved cis-regulatory sequences near genes in plant genomes. Despite the identification of these conserved noncoding sequences (CNSs), our knowledge of their functional importance and selection remains limited. Here, we used a combination of DNA methylome analysis, microarray expression analyses, and functional annotation to study these sequences in the model tree Populus trichocarpa. Methylation in CG contexts and non-CG contexts was lower in CNSs, particularly CNSs in the 5'-upstream regions of genes, compared with other sites in the genome. We observed that CNSs are enriched in genes with transcription and binding functions, and this also associated with syntenic genes and those from whole-genome duplications, suggesting that cis-regulatory sequences play a key role in genome evolution. We detected a significant positive correlation between CNS number and protein interactions, suggesting that CNSs may have roles in the evolution and maintenance of biological networks. The divergence of CNSs indicates that duplication-degeneration-complementation drives the subfunctionalization of a proportion of duplicated genes from whole-genome duplication. Furthermore, population genomics confirmed that most CNSs are under strong purifying selection and only a small subset of CNSs shows evidence of adaptive evolution. These findings provide a foundation for future studies exploring these key genomic features in the maintenance of biological networks, local adaptation, and transcription.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paugh, Steven W.; Coss, David R.; Bao, Ju
MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA). Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence that microRNAs form triple-helical structures with duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show thatmore » several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p<2.2 x 10 -16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. As a result, this work has thus revealed a new mechanism by which microRNAs can interact with gene promoter regions to modify gene transcription.« less
Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd
2009-01-01
Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains. PMID:19411428
Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd
2009-07-01
Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains.
Law, Audrey D; Fisher, Colin; Jack, Anne; Moe, Luke A
2016-07-01
Tobacco-specific nitrosamines are carcinogenic N-nitrosamine compounds present at very low levels in freshly harvested tobacco leaves that accumulate during leaf curing. Formation of N-nitrosamine compounds is associated with high nitrate levels in the leaf at harvest, and nitrate is presumed to be the source from which the N-nitrosation species originates. More specifically, nitrite is considered to be a direct precursor, and nitrite is linked with N-nitrosation in many environmental matrices where it occurs via microbial nitrate reduction. Here, we initiate work exploring the role of leaf microbial communities in formation of tobacco-specific nitrosamines. Leaves from burley tobacco line TN90H were air cured under various temperature and relative humidity levels, and 22 cured tobacco samples were analyzed for their microbial communities and leaf chemistry. Analysis of nitrate, nitrite, and total tobacco-specific nitrosamine levels revealed a strong positive correlation between the three variables, as well as a strong positive correlation with increasing relative humidity during cure conditions. 16S rRNA gene amplicon sequencing was used to assess microbial communities in each of the samples. In most samples, Proteobacteria predominated at the phylum level, accounting for >90 % of the OTUs. However, a distinct shift was noted among members of the high tobacco-specific nitrosamine group, with increases in Firmicutes and Actinobacteria. Several OTUs were identified that correlate strongly (positive and negative) with tobacco-specific nitrosamine content. Copy number of bacterial nitrate reductase genes, obtained using quantitative PCR, did not correlate strongly with tobacco-specific nitrosamine content. Incomplete denitrification is potentially implicated in tobacco-specific nitrosamine levels.
Soo Shin, Jane Hae
2017-01-01
Abstract Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974
Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C
2017-06-20
Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Phylodynamics of classical swine fever virus with emphasis on Ecuadorian strains.
Garrido Haro, A D; Barrera Valle, M; Acosta, A; J Flores, F
2018-06-01
Classic swine fever virus (CSFV) is a Pestivirus from the Flaviviridae family that affects pigs worldwide and is endemic in several Latin American countries. However, there are still some countries in the region, including Ecuador, for which CSFV molecular information is lacking. To better understand the epidemiology of CSFV in the Americas, sequences from CSFVs from Ecuador were generated and a phylodynamic analysis of the virus was performed. Sequences for the full-length glycoprotein E2 gene of twenty field isolates were obtained and, along with sequences from strains previously described in the Americas and from the most representative strains worldwide, were used to analyse the phylodynamics of the virus. Bayesian methods were used to test several molecular clock and demographic models. A calibrated ultrametric tree and a Bayesian skyline were constructed, and codons associated with positive selection involving immune scape were detected. The best model according to Bayes factors was the strict molecular clock and Bayesian skyline model, which shows that CSFV has an evolution rate of 3.2 × 10 -4 substitutions per site per year. The model estimates the origin of CSFV in the mid-1500s. There is a strong spatial structure for CSFV in the Americas, indicating that the virus is moving mainly through neighbouring countries. The genetic diversity of CSFV has increased constantly since its appearance, with a slight decrease in mid-twentieth century, which coincides, with eradication campaigns in North America. Even though there is no evidence of strong directional evolution of the E2 gene in CSFV, codons 713, 761, 762 and 975 appear to be selected positively and could be related to virulence or pathogenesis. These results reveal how CSFV has spread and evolved since it first appeared in the Americas and provide important information for attaining the goal of eradication of this virus in Latin America. © 2018 Blackwell Verlag GmbH.
Analysis of MSH3 in endometrial cancers with defective DNA mismatch repair.
Swisher, E M; Mutch, D G; Herzog, T J; Rader, J S; Kowalski, L D; Elbendary, A; Goodfellow, P J
1998-01-01
To clarify the origin of defective mismatch repair (MMR) in sporadic endometrial cancers with microsatellite instability (MSI), a thorough mutation analysis was performed on the human mismatch repair gene MSH3. Twenty-eight MSI-positive endometrial cancers were investigated for mutations in the human mismatch repair gene MSH3 using single-strand conformation variant (SSCV) analysis of all 24 exons. All variants were sequenced. Loss of heterozygosity was investigated at all MSH3 polymorphisms discovered. A subset of tumors were investigated for methylation of the 5' promoter region of MSH3 using Southern blot hybridization. An identical single-base deletion (delta A) predicted to result in a truncated proteins was discovered in six tumors (21.4%). This deletion occurs in a string of eight consecutive adenosine residues (A8). Because simple repeat sequences are unstable in cells with defective MMR, the observed mutation may be an effect, rather than a cause, of MSI. Evidence of inactivation of the second MSH3 allele in tumors with the delta A mutation would strongly support a causal role for these MSH3 mutations. However, there was no evidence of a second mutation, loss of sequences, or methylation of the promoter region in any of the tumors with the delta A mutation. Although the delta A mutation is a frequent event in sporadic MSI-positive endometrial cancers, it may not be causally associated with defective DNA MMR.
Effect of amino acid substitution on biological activity of cyanophlyctin-β and brevinin-2R
NASA Astrophysics Data System (ADS)
Ghorani-Azam, Adel; Balali-Mood, Mahdi; Aryan, Ehsan; Karimi, Gholamreza; Riahi-Zanjani, Bamdad
2018-04-01
Antimicrobial peptides (AMPs), as ancient immune components, are found in almost all types of living organisms. They are bioactive components with strong antibacterial, antiviral, and anti-tumor properties. In this study, we designed three sequences of antimicrobial peptides to study the effects of structural changes in biological activity compared with original peptides, cyanophlyctin β, and brevinin-2R. For antibacterial activity, two Gram-positive (Staphylococcus aureus and S. epidermidis) and two Gram-negative bacteria (Escherichia coli and Pseudomonas aeroginosa) were assayed. Unlike cyanophlyctin β and brevinin-2R, the synthesized peptide (brevinin-M1, brevinin-M2 and brevinin-M3) showed no considerable antibacterial properties. Hemolytic activity of these peptides was also ignorable even at very high concentrations of 2 mg/ml. However, after proteolytic digestion by trypsin, the peptides showed antibacterial activity comparable to their original template sequences. Structural prediction suggested that the motif sequence responsible for antibacterial activity may be re-exposed to bacterial cell membrane after proteolytic digestion. Also, findings showed that only a small change in primary sequence and therefore structure of peptides may result in a significant alteration in biological activity.
Weidow, Christopher A; Bae, Hee-Sung; Chauhan, Ashvini; Ogram, Andrew
2015-04-01
The diversity of a gene family encoding Actinobacterial aromatic ring oxygenases (AAROs) was detected by the PCR-cloning approach using a newly designed PCR primer set. The distribution of AAROs was investigated in 11 soils representing different land management and vegetation zones and was correlated with several geochemical parameters including pH, organic matter (OM), total Kjeldahl nitrogen (TKN), and nitrogen oxides (NO(x)-N: mostly NO3(-)-N). The distribution of individual clades encoding enzymes with potentially different substrates were correlated with different environmental factors, suggesting differential environmental controls on the distribution of specific enzymes as well as sequence diversity. For example, individual clades associated with phthalate dioxygenases were either strongly negatively correlated with pH, or not correlated with pH but showed strong positive correlation with organic carbon content. A large number of clones clustering in a clade related to PAH oxygenases were positively correlated with pH and nitrogen, but not with organic matter. This analysis may yield insight into the ecological forces driving the distribution of these catabolic genes.
Gerstmann-Straüssler-Scheinker disease: novel PRNP mutation and VGKC-complex antibodies.
Jones, Matthew; Odunsi, Sola; du Plessis, Daniel; Vincent, Angela; Bishop, Matthew; Head, Mark W; Ironside, James W; Gow, David
2014-06-10
To describe a unique case of Gerstmann-Straüssler-Scheinker (GSS) disease caused by a novel prion protein (PRNP) gene mutation and associated with strongly positive voltage-gated potassium channel (VGKC)-complex antibodies (Abs). Clinical data were gathered from retrospective review of the case notes. Postmortem neuropathologic examination was performed, and DNA was extracted from frozen brain tissue for full sequence analysis of the PRNP gene. The patient was diagnosed in life with VGKC-complex Ab-associated encephalitis based on strongly positive VGKC-complex Ab titers but no detectable LGI1 or CASPR2 Abs. He died despite 1 year of aggressive immunosuppressive treatment. The neuropathologic diagnosis was GSS disease, and a novel mutation, P84S, in the PRNP gene was found. VGKC-complex Abs are described in an increasingly broad range of clinical syndromes, including progressive encephalopathies, and may be amenable to treatment with immunosuppression. However, the failure to respond to aggressive immunotherapy warns against VGKC-complex Abs being pathogenic, and their presence does not preclude the possibility of prion disease. © 2014 American Academy of Neurology.
Gerstmann-Straüssler-Scheinker disease
Jones, Matthew; Odunsi, Sola; du Plessis, Daniel; Vincent, Angela; Bishop, Matthew; Head, Mark W.; Ironside, James W.
2014-01-01
Objective: To describe a unique case of Gerstmann-Straüssler-Scheinker (GSS) disease caused by a novel prion protein (PRNP) gene mutation and associated with strongly positive voltage-gated potassium channel (VGKC)-complex antibodies (Abs). Methods: Clinical data were gathered from retrospective review of the case notes. Postmortem neuropathologic examination was performed, and DNA was extracted from frozen brain tissue for full sequence analysis of the PRNP gene. Results: The patient was diagnosed in life with VGKC-complex Ab–associated encephalitis based on strongly positive VGKC-complex Ab titers but no detectable LGI1 or CASPR2 Abs. He died despite 1 year of aggressive immunosuppressive treatment. The neuropathologic diagnosis was GSS disease, and a novel mutation, P84S, in the PRNP gene was found. Conclusion: VGKC-complex Abs are described in an increasingly broad range of clinical syndromes, including progressive encephalopathies, and may be amenable to treatment with immunosuppression. However, the failure to respond to aggressive immunotherapy warns against VGKC-complex Abs being pathogenic, and their presence does not preclude the possibility of prion disease. PMID:24814844
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G; Hemmingsen, Sean M; Dumonceaux, Tim J
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes.
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G.; Hemmingsen, Sean M.
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes. PMID:28257512
Dual optical mechanical position tracker
NASA Astrophysics Data System (ADS)
Everett, S. L., Jr.
1985-06-01
This patent application describes an apparatus for retaining control of moving carriage impact dot matrix print heads when subjected to strong external forces such as shock and/or vibration. Position and direction of carriage movement is provided by a photo emitter-sensor assembly and a slotted timing wheel or disc having a plurality of equally spaced slots whose slot width is equal to the slot separation. The slot width is sufficient to frame a pair of side-by-side emitters which operate in conjunction with a pair of side-by-side sensors on the other side of the timing wheel. The order or sequence in which the sensors receive photo energy from their respective emitters indicates the direction of rotation of the timing wheel while simultaneous reception of photo energy by the side-by-side sensors provides an indication of valid rest position of the carriage drive motor.
Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael
2017-02-01
We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.
Smith, B L; Flores, A; Dechaine, J; Krepela, J; Bergdall, A; Ferrieri, P
2004-05-01
R proteins were first identified by Lancefield in group B Streptococcus (GBS) as resistant to trypsin at pH8 and sensitive to pepsin at pH2. The R4 protein found predominantly in type III and some type II and V invasive isolates conforms to these criteria. The Rib protein, although structurally and epidemiologically similar to R4, was reported as resistant to both proteases. We report here the gene encoding the R4 protein from a type III group B streptococcal isolate (76-043) well characterized in our laboratory. Trypsin extracted GBS proteins were assayed for protease sensitivities by double-diffusion Ouchterlony using varying conditions for the enzyme pepsin. Standard haemoglobin assay was used to examine pepsin enzymatic activity. Thirty clinical isolates of varying protein profiles identified by double-diffusion from our reference strain laboratory were screened by PCR and Southern technique. SDS-PAGE gel purified R4 amino acid sequences were determined and used to design oligonucleotide primers for screening a 76-043 genomic library. R4 was sensitive to pepsin at pH2 but appeared resistant at pH4, the reported pH used for Rib. By standard haemoglobin assay and trypsin extract studies of R4 protein, pepsin was shown to be active at pH2, yet easily inactivated; assays of GBS surface proteins are critical at pH2. Of the amino acids initially sequenced from R4, 88 per cent (61/69) showed identity to Rib; the r4 nucleotide sequence was identical to that of rib. All isolates with strong positive protein reactions for R4 were positive in both PCR and Southern technique, whereas isolates expressing alpha, beta, R1/R4, and R5 (BPS) protein profiles were not. Sequenced PCR products aligned with identity to the R4 and Rib nucleotide sequences and confirmed the identity of these proteins and their molecular sequences.
Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R
2011-01-01
The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.
Theodoropoulos, D; Poulos, C; Gatos, D; Cordopatis, P; Escher, E; Mizrahi, J; Regoli, D; Dalietos, D; Furst, A; Lee, T D
1985-10-01
Four cyclic analogues of the C-terminal hepta- or hexapeptide of substance P were prepared by the solution method. The cyclizations were obtained by substituting with cysteine the residues normally present in positions 5 or 6 or 11 of substance P and by subsequent disulfide bond formation. The final products were identified by ordinary analytical procedures and advanced mass spectroscopy. The biological activities were determined on three bioassays: the guinea pig ileum, the guinea pig trachea and the rabbit mesenteric vein. Results obtained with these assays indicate that all peptides with a disulfide bridgehead in position 11 are inactive and that a cycle between positions 5 and 6 already strongly reduces the biological activity. The acyclic precursors containing thiol protection groups display weak biological activities. These results further underline the importance of the side chain in position 11 of substance P and suggest that optimal biological activities may require a linear peptide sequence.
Trotier, Aurélien J; Lefrançois, William; Van Renterghem, Kris; Franconi, Jean-Michel; Thiaudière, Eric; Miraux, Sylvain
2015-07-07
To show that 3D sequences with ultra-short echo times (UTEs) can generate a positive contrast whatever the magnetic field (4.7, 7 or 9.4 T) and whatever Ultra Small Particles of Iron Oxide (USPIO) concentration injected and to use it for 3D time-resolved imaging of the murine cardiovascular system with high spatial and temporal resolutions. Three different concentrations (50, 200 and 500 μmol Fe/kg) of USPIO were injected in mice and static images of the middle part of the animals were acquired at 4.7, 7 and 9.4 T pre and post-contrast with UTE (TE/TR = 0.05/4.5 ms) sequences. Signal-to-Noise Ratio (SNR) and Contrast-to-Noise Ratio (CNR) of blood and static tissus were evaluated before and after contrast agent injection. 3D-cine images (TE/TR = 0.05/3.5 ms, scan time < 12 min) at 156 μm isotropic resolution of the mouse cardiopulmonary system were acquired prospectively with the UTE sequence for the three magnetic fields and with an USPIO dose of 200 μmol Fe/kg. SNR, CNR and signal homogeneity of blood were measured. High spatial (104 μm) or temporal (3.5 ms) resolution 3D-cine imaging (scan time < 35 min) isotropic resolution were also performed at 7 T with a new sequence encoding scheme. UTE imaging generated positive contrast and higher SNR and CNR whatever the magnetic field and the USPIO concentration used compared to pre-contrast images. Time-resolved 3D acquisition enables high blood SNR (66.6 ± 4.5 at 7 T) and CNR (33.2 ± 4.2 at 7 T) without flow or motion artefact. Coronary arteries and aortic valve were visible on images acquired at 104 μm resolution. We have demonstrated that by combining the injection of iron nanoparticles with 3D-cine UTE sequences, it was possible to generate a strong positive contrast between blood and surrounding tissues. These properties were exploited to produce images of the cardiovascular system in small animals at high magnetic fields with a high spatial and temporal resolution. This approach might be useful to measure the functional cardiac parameters or to assess anatomical modifications to the blood vessels in cardio-vascular disease models.
Efficient identification of Y chromosome sequences in the human and Drosophila genomes.
Carvalho, Antonio Bernardo; Clark, Andrew G
2013-11-01
Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
Inoue, Jun G; Miya, Masaki; Lam, Kevin; Tay, Boon-Hui; Danks, Janine A; Bell, Justin; Walker, Terrence I; Venkatesh, Byrappa
2010-11-01
With our increasing ability for generating whole-genome sequences, comparative analysis of whole genomes has become a powerful tool for understanding the structure, function, and evolutionary history of human and other vertebrate genomes. By virtue of their position basal to bony vertebrates, cartilaginous fishes (class Chondrichthyes) are a valuable outgroup in comparative studies of vertebrates. Recently, a holocephalan cartilaginous fish, the elephant shark, Callorhinchus milii (Subclass Holocephali: Order Chimaeriformes), has been proposed as a model genome, and low-coverage sequence of its genome has been generated. Despite such an increasing interest, the evolutionary history of the modern holocephalans-a previously successful and diverse group but represented by only 39 extant species-and their relationship with elasmobranchs and other jawed vertebrates has been poorly documented largely owing to a lack of well-preserved fossil materials after the end-Permian about 250 Ma. In this study, we assembled the whole mitogenome sequences for eight representatives from all the three families of the modern holocephalans and investigated their phylogenetic relationships and evolutionary history. Unambiguously aligned sequences from these holocephalans together with 17 other vertebrates (9,409 nt positions excluding entire third codon positions) were subjected to partitioned maximum likelihood analysis. The resulting tree strongly supported a single origin of the modern holocephalans and their sister-group relationship with elasmobranchs. The mitogenomic tree recovered the most basal callorhinchids within the chimaeriforms, which is sister to a clade comprising the remaining two families (rhinochimaerids and chimaerids). The timetree derived from a relaxed molecular clock Bayesian method suggests that the holocephalans originated in the Silurian about 420 Ma, having survived from the end-Permian (250 Ma) mass extinction and undergoing familial diversifications during the late Jurassic to early Cretaceous (170-120 Ma). This postulated evolutionary scenario agrees well with that based on the paleontological observations.
Information Entropy of Influenza A Segment 7
NASA Astrophysics Data System (ADS)
Thompson, William A.; Fan, Shaohua; Weltman, Joel K.
2008-12-01
Information entropy (H) is a measure of uncertainty at each position within in a sequence of nucleotides.H was used to characterize a set of influenza A segment 7 nucleotide sequences. Nucleotide locations of high entropy were identified near the 5’ start of all of the sequences and the sequences were assigned to subsets according to synonymous nucleotide variants at those positions: either uracil at position six (U6), cytosine at position six (C6), adenine (A12) at position 12, guanine at position 12 (G12), adenine at position 15 (A15) or cytosine (C15) at position 15. H values were found to be correlated/corresponding (Kendall tau) along the lengths of the nucleotide segments of the subset pairs at each position. However, the H values of each subset of sequences were statistically distinguishable from those of the other member of the pair (Kolmogorov-Smirnov test). The joint probability of uncorrelated distributions of U6 and C6 sequences to viral subtypes and to viral host species was 34 times greater than for the A12:G12 subset pair and 214 times greater than for the A15:C15 pair. This result indicates that the high entropy position six of segment 7 is either a reporter or a sentinel location. The fact that not one of the H5N1 sequences in the dataset was a member of the C6 subset, but all 125 H5N1 sequences are members of the U6 subset suggests a non-random sentinel function.
Hellberg, M E; Moy, G W; Vacquier, V D
2000-03-01
Male-specific proteins have increasingly been reported as targets of positive selection and are of special interest because of the role they may play in the evolution of reproductive isolation. We report the rapid interspecific divergence of cDNA encoding a major acrosomal protein of unknown function (TMAP) of sperm from five species of teguline gastropods. A mitochondrial DNA clock (calibrated by congeneric species divided by the Isthmus of Panama) estimates that these five species diverged 2-10 MYA. Inferred amino acid sequences reveal a propeptide that has diverged rapidly between species. The mature protein has diverged faster still due to high nonsynonymous substitution rates (> 25 nonsynonymous substitutions per site per 10(9) years). cDNA encoding the mature protein (89-100 residues) shows evidence of positive selection (Dn/Ds > 1) for 4 of 10 pairwise species comparisons. cDNA and predicted secondary-structure comparisons suggest that TMAP is neither orthologous nor paralogous to abalone lysin, and thus marks a second, phylogenetically independent, protein subject to strong positive selection in free-spawning marine gastropods. In addition, an internal repeat in one species (Tegula aureotincta) produces a duplicated cleavage site which results in two alternatively processed mature proteins differing by nine amino acid residues. Such alternative processing may provide a mechanism for introducing novel amino acid sequence variation at the amino-termini of proteins. Highly divergent TMAP N-termini from two other tegulines (Tegula regina and Norrisia norrisii) may have originated by such a mechanism.
Emotional valence and arousal interact in attentional control.
Jefferies, Lisa N; Smilek, Daniel; Eich, Eric; Enns, James T
2008-03-01
A recent study demonstrated that observers' ability to identify targets in a rapid visual sequence was enhanced when they simultaneously listened to happy music. In the study reported here, we examined how the emotion-attention relationship is influenced by changes in both mood valence (negative vs. positive) and arousal (low vs. high). We used a standard induction procedure to generate calm, happy, sad, and anxious moods in participants. Results for an attentional blink task showed no differences in first-target accuracy, but second-target accuracy was highest for participants with low arousal and negative affect (sad), lowest for those with strong arousal and negative affect (anxious), and intermediate for those with positive affect regardless of their arousal (calm, happy). We discuss implications of this valence-arousal interaction for the control of visual attention.
Cell penetrating peptides: a comparative transport analysis for 474 sequence motifs.
Ramaker, Katrin; Henkel, Maik; Krause, Thorsten; Röckendorf, Niels; Frey, Andreas
2018-11-01
Delivering reagents into cells is a key demand in molecular medicine. The vehicle of choice is often cell penetrating peptides (CPPs), which can ferry conjugated cargo across membranes. Although numerous peptides have been shown to promote such uptake events, there has been no comprehensive comparison of individual performance under standardized conditions. We have devised a method to rapidly analyze the ability of a multitude of CPP conjugates to carry a model cargo into HeLa cells. Sequence information for 474 CPPs was collected from literature sources, and the respective peptides were synthesized and modified with carboxyfluorescein (FAM) as model cargo. All candidates were evaluated in an identical uptake test, and transport was quantified using cellular fluorescence intensities. Substantial differences in the ability to carry the fluorophore into the cells were observed, with transport performance differing by a factor of 70 between the best CPP investigated and cargo alone. Strong correlations were observed between uptake efficiency and both sequence length and the presence of positive net charge. A compilation of the 20 top performers with regard to cargo delivery performance and cell compatibility is provided.
Berg, Thomas; Hopwood, John J
2002-03-16
alpha-Mannosidosis is a lysosomal storage disorder caused by deficient activity of the lysosomal alpha-mannosidase. We report here the sequencing and expression of the lysosomal alpha-mannosidase cDNA from normal and alpha-mannosidosis guinea pigs. The amino acid sequence of the guinea pig enzyme displayed 82-85% identity to the lysosomal alpha-mannosidase in other mammals. The cDNA of the alpha-mannosidosis guinea pig contained a missense mutation, 679C>T, leading to substitution of arginine by tryptophan at amino acid position 227 (R227W). The R227W allele segregated with the alpha-mannosidosis genotype in the guinea pig colony and introduction of R227W into the wild-type sequence eliminated the production of recombinant alpha-mannosidase activity in heterologous expression studies. Furthermore, the guinea pig mutation has been found in human patients. Our results strongly indicate that the 679C>T mutation causes alpha-mannosidosis and suggest that the guinea pig will be an excellent model for investigation of pathogenesis and evaluation of therapeutic strategies for human alpha-mannosidosis.
DNA Sequence-Mediated, Evolutionarily Rapid Redistribution of Meiotic Recombination Hotspots
Wahls, Wayne P.; Davidson, Mari K.
2011-01-01
Hotspots regulate the position and frequency of Spo11 (Rec12)-initiated meiotic recombination, but paradoxically they are suicidal and are somehow resurrected elsewhere in the genome. After the DNA sequence-dependent activation of hotspots was discovered in fission yeast, nearly two decades elapsed before the key realizations that (A) DNA site-dependent regulation is broadly conserved and (B) individual eukaryotes have multiple different DNA sequence motifs that activate hotspots. From our perspective, such findings provide a conceptually straightforward solution to the hotspot paradox and can explain other, seemingly complex features of meiotic recombination. We describe how a small number of single-base-pair substitutions can generate hotspots de novo and dramatically alter their distribution in the genome. This model also shows how equilibrium rate kinetics could maintain the presence of hotspots over evolutionary timescales, without strong selective pressures invoked previously, and explains why hotspots localize preferentially to intergenic regions and introns. The model is robust enough to account for all hotspots of humans and chimpanzees repositioned since their divergence from the latest common ancestor. PMID:22084420
NASA Astrophysics Data System (ADS)
Yu, Jianzhong; Ma, Xiaolei; Pan, Kehou; Yang, Guanpin; Yu, Wengong
2010-07-01
We constructed and characterized a normalized cDNA library of Nannochloropsis oculata CS-179, and obtained 905 nonredundant sequences (NRSs) ranging from 431-1 756 bp in length. Among them, 496 were very similar to nonredundant ones in the GenBank ( E ≤1.0e-05), and 349 ESTs had significant hits with the clusters of eukaryotic orthologous groups (KOG). Bases G and/or C at the third position of codons of 14 amino acid residues suggested a strong bias in the conserved domain of 362 NRSs (>60%). We also identified the unigenes encoding phosphorus and nitrogen transporters, suggesting that N. oculata could efficiently transport and metabolize phosphorus and nitrogen, and recognized the unigenes that involved in biosynthesis and storage of both fatty acids and polyunsaturated fatty acids (PUFAs), which will facilitate the demonstration of eicosapentaenoic acid (EPA) biosynthesis pathway of N. oculata. In comparison with the original cDNA library, the normalized library significantly increased the efficiencies of random sequencing and rarely expressed genes discovering, and decreased the frequency of abundant gene sequences.
Introns: The Functional Benefits of Introns in Genomes.
Jo, Bong-Seok; Choi, Sun Shim
2015-12-01
The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.
Krylov, V; Tlapáková, T; Mácha, J; Curlej, J; Ryban, L; Chrenek, P
2008-01-01
For chromosomal localization of the hFVIII human transgene in F2 and F3 generation of transgenic rabbits, FISH-TSA was applied. A short cDNA probe (1250 bp) targeted chromosomes 3, 7, 8, 9 and 18 of an F2 male (animal 1-3-8). Two transgenic offspring (F3) revealed signal positions in chromosome 3 and chromosomes 3 and 7, respectively. Sequencing and structure analysis of the rabbit orthologous gene revealed high similarity to its human counterpart. Part of the sequenced cDNA (1310 bp) served as a probe for FISH-TSA analysis. The rabbit gene was localized in the q arm terminus of the X chromosome. This result is in agreement with reciprocal chromosome painting between the rabbit and the human. The presented FISH-TSA method provides strong signals without any interspecies reactivity.
Hippocampal awake replay in fear memory retrieval
Wu, Chun-Ting; Haggerty, Daniel; Kemere, Caleb; Ji, Daoyun
2017-01-01
Hippocampal place cells are key to episodic memories. How these cells participate in memory retrieval remains unclear. Here, after rats acquired a fear memory by receiving mild foot-shocks at a shock zone of a track, we analyzed place cells when the animals were placed back to the track and displayed an apparent memory retrieval behavior: avoidance of the shock zone. We found that place cells representing the shock zone were reactivated, despite the fact that the animals did not enter the shock zone. This reactivation occurred in ripple-associated awake replay of place cell sequences encoding the paths from the animal’s current positions to the shock zone, but not in place cell sequences within individual cycles of theta oscillation. The result reveals a specific place cell pattern underlying the inhibitory avoidance behavior and provides strong evidence for the involvement of awake replay in fear memory retrieval. PMID:28218916
Whole-genome resequencing reveals signatures of selection and timing of duck domestication.
Zhang, Zebin; Jia, Yaxiong; Almeida, Pedro; Mank, Judith E; van Tuinen, Marcel; Wang, Qiong; Jiang, Zhihua; Chen, Yu; Zhan, Kai; Hou, Shuisheng; Zhou, Zhengkui; Li, Huifang; Yang, Fangxi; He, Yong; Ning, Zhonghua; Yang, Ning; Qu, Lujiang
2018-04-01
The genetic basis of animal domestication remains poorly understood, and systems with substantial phenotypic differences between wild and domestic populations are useful for elucidating the genetic basis of adaptation to new environments as well as the genetic basis of rapid phenotypic change. Here, we sequenced the whole genome of 78 individual ducks, from two wild and seven domesticated populations, with an average sequencing depth of 6.42X per individual. Our population and demographic analyses indicate a complex history of domestication, with early selection for separate meat and egg lineages. Genomic comparison of wild to domesticated populations suggests that genes that affect brain and neuronal development have undergone strong positive selection during domestication. Our FST analysis also indicates that the duck white plumage is the result of selection at the melanogenesis-associated transcription factor locus. Our results advance the understanding of animal domestication and selection for complex phenotypic traits.
Cesarman, E; Chang, Y; Moore, P S; Said, J W; Knowles, D M
1995-05-04
DNA fragments that appeared to belong to an unidentified human herpesvirus were recently found in more than 90 percent of Kaposi's sarcoma lesions associated with the acquired immunodeficiency syndrome (AIDS). These fragments were also found in 6 of 39 tissue samples without Kaposi's sarcoma, including 3 malignant lymphomas, from patients with AIDS, but not in samples from patients without AIDS. We examined the DNA of 193 lymphomas from 42 patients with AIDS and 151 patients who did not have AIDS. We searched the DNA for sequences of Kaposi's sarcoma-associated herpesvirus (KSHV) by Southern blot hybridization, the polymerase chain reaction (PCR), or both. The PCR products in the positive samples were sequences and compared with the KSHV sequences in Kaposi's sarcoma tissues from patients with AIDS. KSHV sequences were identified in eight lymphomas in patients infected with the human immunodeficiency virus. All eight, and only these eight, were body-cavity-based lymphomas--that is, they were characterized by pleural, pericardial, or peritoneal lymphomatous effusions. All eight lymphomas also contained the Epstein-Barr viral genome. KSHV sequences were not found in the other 185 lymphomas. KSHV sequences were 40 to 80 times more abundant in the body-cavity-based lymphomas than in the Kaposi's sarcoma lesions. A high degree of conservation of KSHV sequences in Kaposi's sarcoma and in the eight lymphomas suggests the presence of the same agent in both lesions. The recently discovered KSHV DNA sequences occur in an unusual subgroup of AIDS-related B-cell lymphomas, but not in any other lymphoid neoplasm studied thus far. Our finding strongly suggests that a novel herpesvirus has a pathogenic role in AIDS-related body-cavity-based lymphomas.
Day, Ryan; Qu, Xiaotao; Swanson, Rosemarie; Bohannan, Zach; Bliss, Robert
2011-01-01
Abstract Most current template-based structure prediction methods concentrate on finding the correct backbone conformation and then packing sidechains within that backbone. Our packing-based method derives distance constraints from conserved relative packing groups (RPGs). In our refinement approach, the RPGs provide a level of resolution that restrains global topology while allowing conformational sampling. In this study, we test our template-based structure prediction method using 51 prediction units from CASP7 experiments. RPG-based constraints are able to substantially improve approximately two-thirds of starting templates. Upon deeper investigation, we find that true positive spatial constraints, especially those non-local in sequence, derived from the RPGs were important to building nearer native models. Surprisingly, the fraction of incorrect or false positive constraints does not strongly influence the quality of the final candidate. This result indicates that our RPG-based true positive constraints sample the self-consistent, cooperative interactions of the native structure. The lack of such reinforcing cooperativity explains the weaker effect of false positive constraints. Generally, these findings are encouraging indications that RPGs will improve template-based structure prediction. PMID:21210729
New Potent Membrane-Targeting Antibacterial Peptides from Viral Capsid Proteins
Dias, Susana A.; Freire, João M.; Pérez-Peinado, Clara; Domingues, Marco M.; Gaspar, Diana; Vale, Nuno; Gomes, Paula; Andreu, David; Henriques, Sónia T.; Castanho, Miguel A. R. B.; Veiga, Ana S.
2017-01-01
The increasing prevalence of multidrug-resistant bacteria urges the development of new antibacterial agents. With a broad spectrum activity, antimicrobial peptides have been considered potential antibacterial drug leads. Using bioinformatic tools we have previously shown that viral structural proteins are a rich source for new bioactive peptide sequences, namely antimicrobial and cell-penetrating peptides. Here, we test the efficacy and mechanism of action of the most promising peptides among those previously identified against both Gram-positive and Gram-negative bacteria. Two cell-penetrating peptides, vCPP 0769 and vCPP 2319, have high antibacterial activity against Staphylococcus aureus, MRSA, Escherichia coli, and Pseudomonas aeruginosa, being thus multifunctional. The antibacterial mechanism of action of the two most active viral protein-derived peptides, vAMP 059 and vCPP 2319, was studied in detail. Both peptides act on both Gram-positive S. aureus and Gram-negative P. aeruginosa, with bacterial cell death occurring within minutes. Also, these peptides cause bacterial membrane permeabilization and damage of the bacterial envelope of P. aeruginosa cells. Overall, the results show that structural viral proteins are an abundant source for membrane-active peptides sequences with strong antibacterial properties. PMID:28522994
Human HMG box transcription factor HBP1: a role in hCD2 LCR function.
Zhuma, T; Tyrrell, R; Sekkali, B; Skavdis, G; Saveliev, A; Tolaini, M; Roderick, K; Norton, T; Smerdon, S; Sedgwick, S; Festenstein, R; Kioussis, D
1999-01-01
The locus control region (LCR) of the human CD2 gene (hCD2) confers T cell-specific, copy-dependent and position-independent gene expression in transgenic mice. This LCR consists of a strong T cell-specific enhancer and an element without enhancer activity (designated HSS3), which is required for prevention of position effect variegation (PEV) in transgenic mice. Here, we identified the HMG box containing protein-1 (HBP1) as a factor binding to HSS3 of the hCD2 LCR. Within the LCR, HBP1 binds to a novel TTCATTCATTCA sequence that is higher in affinity than other recently reported HBP1-binding sites. Mice transgenic for a hCD2 LCR construct carrying a deletion of the HBP1-binding sequences show a propensity for PEV if the transgene integrates in a heterochromatic region of the chromosome such as the centromere or telomere. We propose that HBP1 plays an important role in chromatin opening and remodelling activities by binding to and bending the DNA, thus allowing DNA-protein and/or protein-protein interactions, which increase the probability of establishing an active locus. PMID:10562551
Detecting Recombination Hotspots from Patterns of Linkage Disequilibrium.
Wall, Jeffrey D; Stevison, Laurie S
2016-08-09
With recent advances in DNA sequencing technologies, it has become increasingly easy to use whole-genome sequencing of unrelated individuals to assay patterns of linkage disequilibrium (LD) across the genome. One type of analysis that is commonly performed is to estimate local recombination rates and identify recombination hotspots from patterns of LD. One method for detecting recombination hotspots, LDhot, has been used in a handful of species to further our understanding of the basic biology of recombination. For the most part, the effectiveness of this method (e.g., power and false positive rate) is unknown. In this study, we run extensive simulations to compare the effectiveness of three different implementations of LDhot. We find large differences in the power and false positive rates of these different approaches, as well as a strong sensitivity to the window size used (with smaller window sizes leading to more accurate estimation of hotspot locations). We also compared our LDhot simulation results with comparable simulation results obtained from a Bayesian maximum-likelihood approach for identifying hotspots. Surprisingly, we found that the latter computationally intensive approach had substantially lower power over the parameter values considered in our simulations. Copyright © 2016 Wall and Stevison.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.
Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Ishihara, Yuko; Tanaka, Yukie; Kobayashi, Seiichiro; Kawamura, Koji; Nakasone, Hideki; Gomyo, Ayumi; Hayakawa, Jin; Tamaki, Masaharu; Akahoshi, Yu; Harada, Naonori; Kusuda, Machiko; Kameda, Kazuaki; Ugai, Tomotaka; Wada, Hidenori; Sakamoto, Kana; Sato, Miki; Terasako-Saito, Kiriko; Kikuchi, Misato; Kimura, Shun-Ichi; Tanihara, Aki; Kako, Shinichi; Uchimaru, Kaoru; Kanda, Yoshinobu
2017-10-01
We previously reported that the T-cell receptor (TCR) repertoire of human T-cell lymphotropic virus type 1 (HTLV-1) Tax 301-309 -specific CD8 + cytotoxic T cells (Tax 301-309 -CTLs) was highly restricted and a particular amino acid sequence motif, the PDR motif, was conserved among HLA-A*24:02-positive (HLA-A*24:02 + ) adult T-cell leukemia/lymphoma (ATL) patients who had undergone allogeneic hematopoietic cell transplantation (allo-HSCT). Furthermore, we found that donor-derived PDR + CTLs selectively expanded in ATL long-term HSCT survivors with strong CTL activity against HTLV-1. On the other hand, the TCR repertoires in Tax 301-309 -CTLs of asymptomatic HTLV-1 carriers (ACs) remain unclear. In this study, we directly identified the DNA sequence of complementarity-determining region 3 (CDR3) of the TCR-β chain of Tax 301-309 -CTLs at the single-cell level and compared not only the TCR repertoires but also the frequencies and phenotypes of Tax 301-309 -CTLs between ACs and ATL patients. We did not observe any essential difference in the frequencies of Tax 301-309 -CTLs between ACs and ATL patients. In the single-cell TCR repertoire analysis of Tax 301-309 -CTLs, 1,458 Tax 301-309 -CTLs and 140 clones were identified in this cohort. Tax 301-309 -CTLs showed highly restricted TCR repertoires with a strongly biased usage of BV7, and PDR, the unique motif in TCR-β CDR3, was exclusively observed in all ACs and ATL patients. However, there was no correlation between PDR + CTL frequencies and HTLV-1 proviral load (PVL). In conclusion, we have identified, for the first time, a unique amino acid sequence, PDR, as a public TCR-CDR3 motif against Tax in HLA-A*24:02 + HTLV-1-infected individuals. Further investigations are warranted to elucidate the role of the PDR + CTL response in the progression from carrier state to ATL. IMPORTANCE ATL is an aggressive T-cell malignancy caused by HTLV-1 infection. The HTLV-1 regulatory protein Tax aggressively promotes the proliferation of HTLV-1-infected lymphocytes and is also a major target antigen for CD8 + CTLs. In our previous evaluation of Tax 301-309 -CTLs, we found that a unique amino acid sequence motif, PDR, in CDR3 of the TCR-β chain of Tax 301-309 -CTLs was conserved among ATL patients after allo-HSCT. Furthermore, the PDR + Tax 301-309 -CTL clones selectively expanded and showed strong cytotoxic activities against HTLV-1. On the other hand, it remains unclear how Tax 301-309 -CTL repertoire exists in ACs. In this study, we comprehensively compared Tax-specific TCR repertoires at the single-cell level between ACs and ATL patients. Tax 301-309 -CTLs showed highly restricted TCR repertoires with a strongly biased usage of BV7, and PDR, the unique motif in TCR-β CDR3, was conserved in all ACs and ATL patients, regardless of clinical subtype in HTLV-1 infection. Copyright © 2017 American Society for Microbiology.
Ishihara, Yuko; Tanaka, Yukie; Kobayashi, Seiichiro; Kawamura, Koji; Nakasone, Hideki; Gomyo, Ayumi; Hayakawa, Jin; Tamaki, Masaharu; Akahoshi, Yu; Harada, Naonori; Kusuda, Machiko; Kameda, Kazuaki; Ugai, Tomotaka; Wada, Hidenori; Sakamoto, Kana; Sato, Miki; Terasako-Saito, Kiriko; Kikuchi, Misato; Kimura, Shun-ichi; Tanihara, Aki; Kako, Shinichi; Uchimaru, Kaoru
2017-01-01
ABSTRACT We previously reported that the T-cell receptor (TCR) repertoire of human T-cell lymphotropic virus type 1 (HTLV-1) Tax301-309-specific CD8+ cytotoxic T cells (Tax301-309-CTLs) was highly restricted and a particular amino acid sequence motif, the PDR motif, was conserved among HLA-A*24:02-positive (HLA-A*24:02+) adult T-cell leukemia/lymphoma (ATL) patients who had undergone allogeneic hematopoietic cell transplantation (allo-HSCT). Furthermore, we found that donor-derived PDR+ CTLs selectively expanded in ATL long-term HSCT survivors with strong CTL activity against HTLV-1. On the other hand, the TCR repertoires in Tax301-309-CTLs of asymptomatic HTLV-1 carriers (ACs) remain unclear. In this study, we directly identified the DNA sequence of complementarity-determining region 3 (CDR3) of the TCR-β chain of Tax301-309-CTLs at the single-cell level and compared not only the TCR repertoires but also the frequencies and phenotypes of Tax301-309-CTLs between ACs and ATL patients. We did not observe any essential difference in the frequencies of Tax301-309-CTLs between ACs and ATL patients. In the single-cell TCR repertoire analysis of Tax301-309-CTLs, 1,458 Tax301-309-CTLs and 140 clones were identified in this cohort. Tax301-309-CTLs showed highly restricted TCR repertoires with a strongly biased usage of BV7, and PDR, the unique motif in TCR-β CDR3, was exclusively observed in all ACs and ATL patients. However, there was no correlation between PDR+ CTL frequencies and HTLV-1 proviral load (PVL). In conclusion, we have identified, for the first time, a unique amino acid sequence, PDR, as a public TCR-CDR3 motif against Tax in HLA-A*24:02+ HTLV-1-infected individuals. Further investigations are warranted to elucidate the role of the PDR+ CTL response in the progression from carrier state to ATL. IMPORTANCE ATL is an aggressive T-cell malignancy caused by HTLV-1 infection. The HTLV-1 regulatory protein Tax aggressively promotes the proliferation of HTLV-1-infected lymphocytes and is also a major target antigen for CD8+ CTLs. In our previous evaluation of Tax301-309-CTLs, we found that a unique amino acid sequence motif, PDR, in CDR3 of the TCR-β chain of Tax301-309-CTLs was conserved among ATL patients after allo-HSCT. Furthermore, the PDR+ Tax301-309-CTL clones selectively expanded and showed strong cytotoxic activities against HTLV-1. On the other hand, it remains unclear how Tax301-309-CTL repertoire exists in ACs. In this study, we comprehensively compared Tax-specific TCR repertoires at the single-cell level between ACs and ATL patients. Tax301-309-CTLs showed highly restricted TCR repertoires with a strongly biased usage of BV7, and PDR, the unique motif in TCR-β CDR3, was conserved in all ACs and ATL patients, regardless of clinical subtype in HTLV-1 infection. PMID:28724766
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.
Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-09-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets
Savitski, Mikhail M.; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-01-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target–decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target–decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The “picked” protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The “picked” target–decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used “classic” protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. PMID:25987413
Tamiru, Amanuel; Bruce, Toby J A; Richter, Annett; Woodcock, Christine M; Midega, Charles A O; Degenhardt, Jörg; Kelemu, Segenet; Pickett, John A; Khan, Zeyaur R
2017-04-01
Maize ( Zea mays ) emits volatile terpenes in response to insect feeding and egg deposition to defend itself against harmful pests. However, maize cultivars differ strongly in their ability to produce the defense signal. To further understand the agroecological role and underlying genetic mechanisms for variation in terpene emission among maize cultivars, we studied the production of an important signaling component ( E )-caryophyllene in a South American maize landrace Braz1006 possessing stemborer Chilo partellus egg inducible defense trait, in comparison with the European maize line Delprim and North American inbred line B73. The ( E) - caryophyllene production level and transcript abundance of TPS23, terpene synthase responsible for ( E) - caryophyllene formation, were compared between Braz1006, Delprim, and B73 after mimicked herbivory. Braz1006-TPS23 was heterologously expressed in E. coli , and amino acid sequences were determined. Furthermore, electrophysiological and behavioral responses of a key parasitic wasp Cotesia sesamiae to C . partellus egg-induced Braz1006 volatiles were determined using coupled gas chromatography electroantennography and olfactometer bioassay studies. After elicitor treatment, Braz1006 released eightfold higher ( E) -caryophyllene than Delprim, whereas no ( E) -caryophyllene was detected in B73. The superior (E)- caryophyllene production by Braz1006 was positively correlated with high transcript levels of TPS23 in the landrace compared to Delprim. TPS23 alleles from Braz1006 showed dissimilarities at different sequence positions with Delprim and B73 and encodes an active enzyme. Cotesia sesamiae was attracted to egg-induced volatiles from Braz1006 and synthetic (E)- caryophyllene. The variation in ( E) -caryophyllene emission between Braz1006 and Delprim is positively correlated with induced levels of TPS23 transcripts. The enhanced TPS23 activity and corresponding ( E) -caryophyllene production by the maize landrace could be attributed to the differences in amino acid sequence with the other maize lines. This study suggested that the same analogous genes could have contrasting expression patterns in different maize genetic backgrounds. The current findings provide valuable insight not only into genetic mechanisms underlying variation in defense signal production but also the prospect of introgressing the novel defense traits into elite maize varieties for effective and ecologically sound protection of crops against damaging insect pests.
Öhrmalm, Christina; Jobs, Magnus; Eriksson, Ronnie; Golbob, Sultan; Elfaitouri, Amal; Benachenhou, Farid; Strømme, Maria; Blomberg, Jonas
2010-01-01
One of the main problems in nucleic acid-based techniques for detection of infectious agents, such as influenza viruses, is that of nucleic acid sequence variation. DNA probes, 70-nt long, some including the nucleotide analog deoxyribose-Inosine (dInosine), were analyzed for hybridization tolerance to different amounts and distributions of mismatching bases, e.g. synonymous mutations, in target DNA. Microsphere-linked 70-mer probes were hybridized in 3M TMAC buffer to biotinylated single-stranded (ss) DNA for subsequent analysis in a Luminex® system. When mismatches interrupted contiguous matching stretches of 6 nt or longer, it had a strong impact on hybridization. Contiguous matching stretches are more important than the same number of matching nucleotides separated by mismatches into several regions. dInosine, but not 5-nitroindole, substitutions at mismatching positions stabilized hybridization remarkably well, comparable to N (4-fold) wobbles in the same positions. In contrast to shorter probes, 70-nt probes with judiciously placed dInosine substitutions and/or wobble positions were remarkably mismatch tolerant, with preserved specificity. An algorithm, NucZip, was constructed to model the nucleation and zipping phases of hybridization, integrating both local and distant binding contributions. It predicted hybridization more exactly than previous algorithms, and has the potential to guide the design of variation-tolerant yet specific probes. PMID:20864443
Studying the genetic basis of speciation in high gene flow marine invertebrates
2016-01-01
A growing number of genes responsible for reproductive incompatibilities between species (barrier loci) exhibit the signals of positive selection. However, the possibility that genes experiencing positive selection diverge early in speciation and commonly cause reproductive incompatibilities has not been systematically investigated on a genome-wide scale. Here, I outline a research program for studying the genetic basis of speciation in broadcast spawning marine invertebrates that uses a priori genome-wide information on a large, unbiased sample of genes tested for positive selection. A targeted sequence capture approach is proposed that scores single-nucleotide polymorphisms (SNPs) in widely separated species populations at an early stage of allopatric divergence. The targeted capture of both coding and non-coding sequences enables SNPs to be characterized at known locations across the genome and at genes with known selective or neutral histories. The neutral coding and non-coding SNPs provide robust background distributions for identifying FST-outliers within genes that can, in principle, identify specific mutations experiencing diversifying selection. If natural hybridization occurs between species, the neutral coding and non-coding SNPs can provide a neutral admixture model for genomic clines analyses aimed at finding genes exhibiting strong blocks to introgression. Strongylocentrotid sea urchins are used as a model system to outline the approach but it can be used for any group that has a complete reference genome available. PMID:29491951
van Verk, Marcel C; Pappaioannou, Dimitri; Neeleman, Lyda; Bol, John F; Linthorst, Huub J M
2008-04-01
PR-1a is a salicylic acid-inducible defense gene of tobacco (Nicotiana tabacum). One-hybrid screens identified a novel tobacco WRKY transcription factor (NtWRKY12) with specific binding sites in the PR-1a promoter at positions -564 (box WK(1)) and -859 (box WK(2)). NtWRKY12 belongs to the class of transcription factors in which the WRKY sequence is followed by a GKK rather than a GQK sequence. The binding sequence of NtWRKY12 (WK box TTTTCCAC) deviated significantly from the consensus sequence (W box TTGAC[C/T]) shown to be recognized by WRKY factors with the GQK sequence. Mutation of the GKK sequence in NtWRKY12 into GQK or GEK abolished binding to the WK box. The WK(1) box is in close proximity to binding sites in the PR-1a promoter for transcription factors TGA1a (as-1 box) and Myb1 (MBSII box). Expression studies with PR-1a promoterbeta-glucuronidase (GUS) genes in stably and transiently transformed tobacco indicated that NtWRKY12 and TGA1a act synergistically in PR-1a expression induced by salicylic acid and bacterial elicitors. Cotransfection of Arabidopsis thaliana protoplasts with 35SNtWRKY12 and PR-1aGUS promoter fusions showed that overexpression of NtWRKY12 resulted in a strong increase in GUS expression, which required functional WK boxes in the PR-1a promoter.
Schlesinger, D H; Hay, D I
1977-03-10
The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
2013-01-01
Background Next-generation-sequencing (NGS) technologies combined with a classic DNA barcoding approach have enabled fast and credible measurement for biodiversity of mixed environmental samples. However, the PCR amplification involved in nearly all existing NGS protocols inevitably introduces taxonomic biases. In the present study, we developed new Illumina pipelines without PCR amplifications to analyze terrestrial arthropod communities. Results Mitochondrial enrichment directly followed by Illumina shotgun sequencing, at an ultra-high sequence volume, enabled the recovery of Cytochrome c Oxidase subunit 1 (COI) barcode sequences, which allowed for the estimation of species composition at high fidelity for a terrestrial insect community. With 15.5 Gbp Illumina data, approximately 97% and 92% were detected out of the 37 input Operational Taxonomic Units (OTUs), whether the reference barcode library was used or not, respectively, while only 1 novel OTU was found for the latter. Additionally, relatively strong correlation between the sequencing volume and the total biomass was observed for species from the bulk sample, suggesting a potential solution to reveal relative abundance. Conclusions The ability of the new Illumina PCR-free pipeline for DNA metabarcoding to detect small arthropod specimens and its tendency to avoid most, if not all, false positives suggests its great potential in biodiversity-related surveillance, such as in biomonitoring programs. However, further improvement for mitochondrial enrichment is likely needed for the application of the new pipeline in analyzing arthropod communities at higher diversity. PMID:23587339
Detecting Coevolution in and among Protein Domains
Yeang, Chen-Hsiang; Haussler, David
2007-01-01
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Osipiuk, J; Joachimiak, A
1997-09-12
We propose that the dnaK operon of Thermus thermophilus HB8 is composed of three functionally linked genes: dnaK, grpE, and dnaJ. The dnaK and dnaJ gene products are most closely related to their cyanobacterial homologs. The DnaK protein sequence places T. thermophilus in the plastid Hsp70 subfamily. In contrast, the grpE translated sequence is most similar to GrpE from Clostridium acetobutylicum, a Gram-positive anaerobic bacterium. A single promoter region, with homology to the Escherichia coli consensus promoter sequences recognized by the sigma70 and sigma32 transcription factors, precedes the postulated operon. This promoter is heat-shock inducible. The dnaK mRNA level increased more than 30 times upon 10 min of heat shock (from 70 degrees C to 85 degrees C). A strong transcription terminating sequence was found between the dnaK and grpE genes. The individual genes were cloned into pET expression vectors and the thermophilic proteins were overproduced at high levels in E. coli and purified to homogeneity. The recombinant T. thermophilus DnaK protein was shown to have a weak ATP-hydrolytic activity, with an optimum at 90 degrees C. The ATPase was stimulated by the presence of GrpE and DnaJ. Another open reading frame, coding for ClpB heat-shock protein, was found downstream of the dnaK operon.
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence
2017-01-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana. We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays, although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. PMID:28223399
Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.
2015-01-01
This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Is Mutation Random or Targeted?: No Evidence for Hypermutability in Snail Toxin Genes.
Roy, Scott W
2016-10-01
Ever since Luria and Delbruck, the notion that mutation is random with respect to fitness has been foundational to modern biology. However, various studies have claimed striking exceptions to this rule. One influential case involves toxin-encoding genes in snails of the genus Conus, termed conotoxins, a large gene family that undergoes rapid diversification of their protein-coding sequences by positive selection. Previous reconstructions of the sequence evolution of conotoxin genes claimed striking patterns: (1) elevated synonymous change, interpreted as being due to targeted "hypermutation" in this region; (2) elevated transversion-to-transition ratios, interpreted as reflective of the particular mechanism of hypermutation; and (3) much lower rates of synonymous change in the codons encoding several highly conserved cysteine residues, interpreted as strong position-specific codon bias. This work has spawned a variety of studies on the potential mechanisms of hypermutation and on causes for cysteine codon bias, and has inspired hypermutation hypotheses for various other fast-evolving genes. Here, I show that all three findings are likely to be artifacts of statistical reconstruction. First, by simulating nonsynonymous change I show that high rates of dN can lead to overestimation of dS. Second, I show that there is no evidence for any of these three patterns in comparisons of closely related conotoxin sequences, suggesting that the reported findings are due to breakdown of statistical methods at high levels of sequence divergence. The current findings suggest that mutation and codon bias in conotoxin genes may not be atypical, and that random mutation and selection can explain the evolution of even these exceptional loci. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Muhairi, Salama Al; Hosani, Farida Al; Eltahir, Yassir M; Mulla, Mariam Al; Yusof, Mohammed F; Serhan, Wissam S; Hashem, Farouq M; Elsayed, Elsaeid A; Marzoug, Bahaaeldin A; Abdelazim, Assem S
2016-12-01
The objective of this research was to investigate the prevalence of Middle East respiratory syndrome coronavirus (MERS-CoV) infection primarily in dromedary camel farms and the relationship of those infections with infections in humans in the Emirate of Abu Dhabi. Nasal swabs from 1113 dromedary camels (39 farms) and 34 sheep (1 farm) and sputum samples from 2 MERS-CoV-infected camel farm owners and 1 MERS-CoV-infected sheep farm owner were collected. Samples from camels and humans underwent real-time reverse-transcription quantitative PCR screening to detect MERS-CoV. In addition, sequencing and phylogenetic analysis of partially characterized MERS-CoV genome fragments obtained from camels were performed. Among the 40 farms, 6 camel farms were positive for MERS-CoV; the virus was not detected in the single sheep farm. The maximum duration of viral shedding from infected camels was 2 weeks after the first positive test result as detected in nasal swabs and in rectal swabs obtained from infected calves. Three partial camel sequences characterized in this study (open reading frames 1a and 1ab, Spike1, Spike2, and ORF4b) together with the corresponding regions of previously reported MERS-CoV sequence obtained from one farm owner were clustering together within the larger MERS-CoV sequences cluster containing human and camel isolates reported for the Arabian Peninsula. Data provided further evidence of the zoonotic potential of MERS-CoV infection and strongly suggested that camels may have a role in the transmission of the virus to humans.
A new sensitive PCR assay for one-step detection of 12 IDH1/2 mutations in glioma.
Catteau, Aurélie; Girardi, Hélène; Monville, Florence; Poggionovo, Cécile; Carpentier, Sabrina; Frayssinet, Véronique; Voss, Jesse; Jenkins, Robert; Boisselier, Blandine; Mokhtari, Karima; Sanson, Marc; Peyro-Saint-Paul, Hélène; Giannini, Caterina
2014-06-02
Mutations in isocitrate dehydrogenase genes IDH1 or IDH2 are frequent in glioma, and IDH mutation status is a strong diagnostic and prognostic marker. Current IDH mutation screening is performed with an immunohistochemistry (IHC) assay specific for IDH1 R132H, the most common mutation. Sequencing is recommended as a second-step test for IHC-negative or -equivocal cases. We developed and validated a new real-time quantitative polymerase chain reaction (PCR) assay for single-step detection of IDH1 R132H and 11 rare IDH1/2 mutations in formalin-fixed paraffin-embedded (FFPE) glioma samples. Performance of the IDH1/2 PCR assay was compared to IHC and Sanger sequencing. The IDH1/2 PCR assay combines PCR clamping for detection of 7 IDH1 and 5 IDH2 mutations, and Amplification Refractory Mutation System technology for specific identification of the 3 most common mutations (IDH1 R132H, IDH1 R132C, IDH2 R172K). Analytical sensitivity of the PCR assay for mutation detection was <5% for 11/12 mutations (mean: 3.3%), and sensitivity for mutation identification was very high (0.8% for IDH1 R132H; 1.2% for IDH1 R132C; 0.6% for IDH2 R172K). Assay performance was further validated on 171 clinical glioma FFPE samples; of these, 147 samples met the selection criteria and 146 DNA samples were successfully extracted. IDH1/2 status was successfully obtained in 91% of cases. All but one positive IDH1 R132H-IHC cases were concordantly detected by PCR and 3 were not detected by sequencing. Among the IHC-negative cases (n = 72), PCR detected 12 additional rare mutations (10 IDH1, 2 IDH2). All mutations detected by sequencing (n = 67) were concordantly detected by PCR and 5/66 sequencing-negative cases were PCR-positive (overall concordance: 96%). Analysis of synthetic samples representative of the 11 rare IDH1/2 mutations detected by the assay produced 100% correct results. The new IDH1/2 PCR assay has a high technical success rate and is more sensitive than Sanger sequencing. Positive concordance was 98% with IHC for IDH1 R132H detection and 100% with sequencing. The PCR assay can reliably be performed on FFPE samples and has a faster turnaround time than current IDH mutation detection algorithms. The assay should facilitate implementation of a comprehensive IDH1/2 testing protocol in routine clinical practice.
WebLogo: A Sequence Logo Generator
Crooks, Gavin E.; Hon, Gary; Chandonia, John-Marc; Brenner, Steven E.
2004-01-01
WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. PMID:15173120
Payá-Milans, Miriam; Venegas-Calerón, Mónica; Salas, Joaquín J; Garcés, Rafael; Martínez-Force, Enrique
2015-03-01
The acyl-[acyl carrier protein]:sn-1-glycerol-3-phosphate acyltransferase (GPAT; E.C. 2.3.1.15) catalyzes the first step of glycerolipid assembly within the stroma of the chloroplast. In the present study, the sunflower (Helianthus annuus, L.) stromal GPAT was cloned, sequenced and characterized. We identified a single ORF of 1344base pairs that encoded a GPAT sharing strong sequence homology with the plastidial GPAT from Arabidopsis thaliana (ATS1, At1g32200). Gene expression studies showed that the highest transcript levels occurred in green tissues in which chloroplasts are abundant. The corresponding mature protein was heterologously overexpressed in Escherichia coli for purification and biochemical characterization. In vitro assays using radiolabelled acyl-ACPs and glycerol-3-phosphate as substrates revealed a strong preference for oleic versus palmitic acid, and weak activity towards stearic acid. The positional fatty acid composition of relevant chloroplast phospholipids from sunflower leaves did not reflect the in vitro GPAT specificity, suggesting a more complex scenario with mixed substrates at different concentrations, competition with other acyl-ACP consuming enzymatic reactions, etc. In summary, this study has confirmed the affinity of this enzyme which would partly explain the resistance to cold temperatures observed in sunflower plants. Copyright © 2015 Elsevier Ltd. All rights reserved.
Torstensson, Anders; Dinasquet, Julie; Chierici, Melissa; Fransson, Agneta; Riemann, Lasse; Wulff, Angela
2015-10-01
Due to climate change, sea ice experiences changes in terms of extent and physical properties. In order to understand how sea ice microbial communities are affected by changes in physicochemical properties of the ice, we used 454-sequencing of 16S and 18S rRNA genes to examine environmental control of microbial diversity and composition in Antarctic sea ice. We observed a high diversity and richness of bacteria, which were strongly negatively correlated with temperature and positively with brine salinity. We suggest that bacterial diversity in sea ice is mainly controlled by physicochemical properties of the ice, such as temperature and salinity, and that sea ice bacterial communities are sensitive to seasonal and environmental changes. For the first time in Antarctic interior sea ice, we observed a strong eukaryotic dominance of the dinoflagellate phylotype SL163A10, comprising 63% of the total sequences. This phylotype is known to be kleptoplastic and could be a significant primary producer in sea ice. We conclude that mixotrophic flagellates may play a greater role in the sea ice microbial ecosystem than previously believed, and not only during the polar night but also during summer when potential food sources are abundant. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Ikota, Hayato; Nobusawa, Sumihito; Arai, Hideo; Kato, Yukinari; Ishizawa, Keisuke; Hirose, Takanori; Yokoo, Hideaki
2015-10-01
Glioma cells with the isocitrate dehydrogenase (IDH) 1 G395A mutation are strongly immunopositive for mIDH1(R132H), an antibody against mutant IDH1(R132H) (clone H09). However, we encountered some gliomas which were ambiguously positive for mIDH1(R132H) despite having the IDH1 G395A mutation. The aim of this study was to establish an evaluation procedure of IDH1 status by immunohistochemistry. Forty-three diffusely infiltrating gliomas were studied, and four of eight anaplastic oligoastrocytomas with the IDH1 G395A mutation were modestly or weakly positive for both the mIDH1(R132H) and an antibody against wild type IDH1, RcMab-1. Based on our staining results, the IDH1 expression of both wild and mutated types seemed to be codominant and also to be evenly suppressed under a certain condition. We propose a procedure for determining IDH1 status. If a glioma is weakly positive for mIDH1(R132H), immunohistochemistry for RcMab-1 should be performed. If the tumor cells are strongly positive for RcMab-1, the IDH1 G395A mutation is judged to be absent on the grounds that IDH1 expression is not suppressed. If the tumor cells are weakly positive for both mIDH1(R132H) and RcMab-1, then a conclusion should be made after DNA sequencing. This procedure is useful for practical evaluation of IDH1 status.
CNV-TV: a robust method to discover copy number variation from short sequencing reads.
Duan, Junbo; Zhang, Ji-Gang; Deng, Hong-Wen; Wang, Yu-Ping
2013-05-02
Copy number variation (CNV) is an important structural variation (SV) in human genome. Various studies have shown that CNVs are associated with complex diseases. Traditional CNV detection methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution. The next generation sequencing (NGS) technique promises a higher resolution detection of CNVs and several methods were recently proposed for realizing such a promise. However, the performances of these methods are not robust under some conditions, e.g., some of them may fail to detect CNVs of short sizes. There has been a strong demand for reliable detection of CNVs from high resolution NGS data. A novel and robust method to detect CNV from short sequencing reads is proposed in this study. The detection of CNV is modeled as a change-point detection from the read depth (RD) signal derived from the NGS, which is fitted with a total variation (TV) penalized least squares model. The performance (e.g., sensitivity and specificity) of the proposed approach are evaluated by comparison with several recently published methods on both simulated and real data from the 1000 Genomes Project. The experimental results showed that both the true positive rate and false positive rate of the proposed detection method do not change significantly for CNVs with different copy numbers and lengthes, when compared with several existing methods. Therefore, our proposed approach results in a more reliable detection of CNVs than the existing methods.
[Learning and Repetive Reproduction of Memorized Sequences by the Right and the Left Hand].
Bobrova, E V; Lyakhovetskii, V A; Bogacheva, I N
2015-01-01
An important stage of learning a new skill is repetitive reproduction of one and the same sequence of movements, which plays a significant role in forming of the movement stereotypes. Two groups of right-handers repeatedly memorized (6-10 repetitions) the sequences of their hand transitions by experimenter in 6 positions, firstly by the right hand (RH), and then--by the left hand (LH) or vice versa. Random sequences previously unknown to the volunteers were reproduced in the 11 series. Modified sequences were tested in the 2nd and 3rd series, where the same elements' positions were presented in different order. The processes of repetitive sequence reproduction were similar for RH and LH. However, the learning of the modified sequences differed: Information about elements' position disregarding the reproduction order was used only when LH initiated task performing. This information was not used when LH followed RH and when RH performed the task. Consequently, the type of information coding activated by LH helped learn the positions of sequence elements, while the type of information coding activated by RH prevented learning. It is supposedly connected with the predominant role of right hemisphere in the processes of positional coding and motor learning.
Augmented brain function by coordinated reset stimulation with slowly varying sequences.
Zeitler, Magteld; Tass, Peter A
2015-01-01
Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS.
Augmented brain function by coordinated reset stimulation with slowly varying sequences
Zeitler, Magteld; Tass, Peter A.
2015-01-01
Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS. PMID:25873867
Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions.
Seo, Heewon; Park, Yoomi; Min, Byung Joo; Seo, Myung Eui; Kim, Ju Han
2017-01-01
The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants reported by the Ion Proton sequencer from 27 whole-exome sequencing data but that are not present in either the 1000 Genomes Project and the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.
Song, Da-Feng; Zhu, Mu-Yuan; Gu, Qing
2014-01-01
The aim of this study is to investigate the antimicrobial potential of Lactobacillus plantarum ZJ5, a strain isolated from fermented mustard with a broad range of inhibitory activity against both Gram-positive and Gram-negative bacteria. Here we present the peptide plantaricin ZJ5 (PZJ5), which is an extreme pH and heat-stable. However, it can be digested by pepsin and proteinase K. This peptide has strong activity against Staphylococcus aureus. PZJ5 has been purified using a multi-step process, including ammonium sulfate precipitation, cation-exchange chromatography, hydrophobic interactions and reverse-phase chromatography. The molecular mass of the peptide was found to be 2572.9 Da using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). The primary structure of this peptide was determined using amino acid sequencing and DNA sequencing, and these analyses revealed that the DNA sequence translated as a 44-residue precursor containing a 22-amino-acid N-terminal extension that was of the double-glycine type. The bacteriocin sequence exhibited no homology with known bacteriocins when compared with those available in the database, indicating that it was a new class IId bacteriocin. PZJ5 from a food-borne strain may be useful as a promising probiotic candidate. PMID:25147943
2015-01-01
DNA oxidation by reactive oxygen species is nonrandom, potentially leading to accumulation of nucleobase damage and mutations at specific sites within the genome. We now present the first quantitative data for sequence-dependent formation of structurally defined oxidative nucleobase adducts along p53 gene-derived DNA duplexes using a novel isotope labeling-based approach. Our results reveal that local nucleobase sequence context differentially alters the yields of 2,2,4-triamino-2H-oxal-5-one (Z) and 8-oxo-7,8-dihydro-2′-deoxyguanosine (OG) in double stranded DNA. While both lesions are overproduced within endogenously methylated MeCG dinucleotides and at 5′ Gs in runs of several guanines, the formation of Z (but not OG) is strongly preferred at solvent-exposed guanine nucleobases at duplex ends. Targeted oxidation of MeCG sequences may be caused by a lowered ionization potential of guanine bases paired with MeC and the preferential intercalation of riboflavin photosensitizer adjacent to MeC:G base pairs. Importantly, some of the most frequently oxidized positions coincide with the known p53 lung cancer mutational “hotspots” at codons 245 (GGC), 248 (CGG), and 158 (CGC) respectively, supporting a possible role of oxidative degradation of DNA in the initiation of lung cancer. PMID:24571128
Song, Da-Feng; Zhu, Mu-Yuan; Gu, Qing
2014-01-01
The aim of this study is to investigate the antimicrobial potential of Lactobacillus plantarum ZJ5, a strain isolated from fermented mustard with a broad range of inhibitory activity against both Gram-positive and Gram-negative bacteria. Here we present the peptide plantaricin ZJ5 (PZJ5), which is an extreme pH and heat-stable. However, it can be digested by pepsin and proteinase K. This peptide has strong activity against Staphylococcus aureus. PZJ5 has been purified using a multi-step process, including ammonium sulfate precipitation, cation-exchange chromatography, hydrophobic interactions and reverse-phase chromatography. The molecular mass of the peptide was found to be 2572.9 Da using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). The primary structure of this peptide was determined using amino acid sequencing and DNA sequencing, and these analyses revealed that the DNA sequence translated as a 44-residue precursor containing a 22-amino-acid N-terminal extension that was of the double-glycine type. The bacteriocin sequence exhibited no homology with known bacteriocins when compared with those available in the database, indicating that it was a new class IId bacteriocin. PZJ5 from a food-borne strain may be useful as a promising probiotic candidate.
Ventura, Marco; Jankovic, Ivana; Walker, D. Carey; Pridmore, R. David; Zink, Ralf
2002-01-01
We have identified and sequenced the genes encoding the aggregation-promoting factor (APF) protein from six different strains of Lactobacillus johnsonii and Lactobacillus gasseri. Both species harbor two apf genes, apf1 and apf2, which are in the same orientation and encode proteins of 257 to 326 amino acids. Multiple alignments of the deduced amino acid sequences of these apf genes demonstrate a very strong sequence conservation of all of the genes with the exception of their central regions. Northern blot analysis showed that both genes are transcribed, reaching their maximum expression during the exponential phase. Primer extension analysis revealed that apf1 and apf2 harbor a putative promoter sequence that is conserved in all of the genes. Western blot analysis of the LiCl cell extracts showed that APF proteins are located on the cell surface. Intact cells of L. johnsonii revealed the typical cell wall architecture of S-layer-carrying gram-positive eubacteria, which could be selectively removed with LiCl treatment. In addition, the amino acid composition, physical properties, and genetic organization were found to be quite similar to those of S-layer proteins. These results suggest that APF is a novel surface protein of the Lactobacillus acidophilus B-homology group which might belong to an S-layer-like family. PMID:12450842
Valette, Julien; Guillermier, Martine; Boumezbeur, Fawzi; Poupon, Cyril; Amadon, Alexis; Hantraye, Philippe; Lebon, Vincent
2006-03-01
To map B(0) distortions throughout the monkey brain in the two positions commonly used for NMR studies (the prone sphinx position and the supine position) in order to test the hypothesis that B(0) homogeneity in the sphinx position is significantly improved as compared to the supine position. Three macaque monkeys were installed in the two positions in a 3T whole-body MR system without shim correction. B(0) maps were acquired using a 3D gradient double-echo sequence, and field dispersion throughout the brain was quantified. In addition, field maps and localized (1)H spectra were acquired after first-order shimming was performed. The field maps collected in the three animals were highly reproducible. B(0) dispersion throughout the brain was typically two to three times greater in the supine position than in the sphinx position. Although first-order shimming proved relatively more efficient in the supine position, B(0) dispersion still remained greater in the supine than in the sphinx position. These findings can be explained by the thickness of outer brain tissues. This work demonstrates that the sphinx position is highly favorable in terms of B(0) homogeneity. It should prove useful for NMR exploration of the monkey brain, particularly at high fields where B(0) inhomogeneity associated with susceptibility artifacts is increased. (c) 2006 Wiley-Liss, Inc.
DNA unzipping phase diagram calculated via replica theory.
Roland, C Brian; Hatch, Kristi Adamson; Prentiss, Mara; Shakhnovich, Eugene I
2009-05-01
We show how single-molecule unzipping experiments can provide strong evidence that the zero-force melting transition of long molecules of natural dsDNA should be classified as a phase transition of the higher-order type (continuous). Toward this end, we study a statistical-mechanics model for the fluctuating structure of a long molecule of dsDNA, and compute the equilibrium phase diagram for the experiment in which the molecule is unzipped under applied force. We consider a perfect-matching dsDNA model, in which the loops are volume-excluding chains with arbitrary loop exponent c . We include stacking interactions, hydrogen bonds, and main-chain entropy. We include sequence heterogeneity at the level of random sequences; in particular, there is no correlation in the base-pairing (bp) energy from one sequence position to the next. We present heuristic arguments to demonstrate that the low-temperature macrostate does not exhibit degenerate ergodicity breaking. We use this claim to understand the results of our replica-theoretic calculation of the equilibrium properties of the system. As a function of temperature, we obtain the minimal force at which the molecule separates completely. This critical-force curve is a line in the temperature-force phase diagram that marks the regions where the molecule exists primarily as a double helix versus the region where the molecule exists as two separate strands. We compare our random-sequence model to magnetic tweezer experiments performed on the 48 502 bp genome of bacteriophage lambda . We find good agreement with the experimental data, which is restricted to temperatures between 24 and 50 degrees C . At higher temperatures, the critical-force curve of our random-sequence model is very different for that of the homogeneous-sequence version of our model. For both sequence models, the critical force falls to zero at the melting temperature T_{c} like |T-T_{c}|;{alpha} . For the homogeneous-sequence model, alpha=1/2 almost exactly, while for the random-sequence model, alpha approximately 0.9 . Importantly, the shape of the critical-force curve is connected, via our theory, to the manner in which the helix fraction falls to zero at T_{c} . The helix fraction is the property that is used to classify the melting transition as a type of phase transition. In our calculation, the shape of the critical-force curve holds strong evidence that the zero-force melting transition of long natural dsDNA should be classified as a higher-order (continuous) phase transition. Specifically, the order is 3rd or greater.
[Isolation and identification of rumen bacteria for cellulolytic enzyme production].
Aihemaiti, Maierhaba; Zhen, Fan; Li, Yuezhong; Aibaidoula, Gulisimayi; Yimit, Wusiman
2013-05-04
We screened aerobic bacteria with cellulolytic activity from ruminal fluid of sheep, cattle and camel in Xinjiang. Fresh ruminal fluid was inoculated on sterilized sodium carboxymethylcellulose agar plates. Highly cellulolytic aerobic bacteria were screened out by using Congo red staining and liquid secondary screening culture media. The combination of morphological and biochemical test with 16SrDNA sequence analysis were used to classify the strains. Enzymatic activities of four strains with strong cellulose-decomposing abilities were studied under different culture conditions. Out 84 isolated cellulolytic strains, 40 exhibited strong abilities in decomposing cellulose. They are including 37 Gram-negative isolates and 3 Gram-positive strains. Identification of these 40 strains shows that they belong to 11 species of 6 genera, 16 strains in Stenotrophomonas maltophilia, 10 Ochrobactrum, 5 Sphingobacterium, 3 Microbacterium, 3 Paracoccus and 2 Pseudomonas. The results of the enzymatic studies of four strains with strong cellulolytic abilities indicates that the strains have the best enzyme producing property when straw powder was chosen as the carbon source; the pH at 5.5 -6.0 and temperature at 37 degrees C. The strains with highly cellulolytic abilities isolated from ruminal fluid show strong abilities in cellulose decomposition.
Characterization and Identification of Productivity-Associated Rhizobacteria in Wheat
Habiger, Joshua
2012-01-01
The rhizosphere is populated by a numerous and diverse array of rhizobacteria, and many impact productivity in largely unknown ways. Here we characterize the rhizobacterial community in a wheat variety categorized according to shoot biomass using 16S rRNA pyrosequencing abundance data. Plants were grown in homogenized field soil under greenhouse conditions, and DNA was extracted and pyrosequenced, resulting in 29,007 quality sequences. Operational taxonomic units (OTUs) that were significantly associated with biomass productivity were identified using an exact test adjusted for the false-discovery rate. The productivity deviation expressed as a percentage of the total mean square for regression (PMSR) was determined for each OTU. Out of 719 OTUs, 42 showed significant positive associations and 39 showed significant negative associations (q value, ≤0.05). OTUs with the greatest net positive associations, by genus, were as follows: Duganella, OTU 43 and OTU 3; Janthinobacterium, OTU 278; Pseudomonas, OTU 588; and Cellvibrio, OTU 1847. Those with negative associations were as follows: Bacteria, OTU 273; Chryseobacterium, OTU 508; Proteobacteria, OTU 249; and Enterobacter, OTU 357. Shoot biomass productivity was strongly correlated with the balance between the overall abundances of positive- and negative-productivity-associated OTUs. High-productivity rhizospheres contained 9.2 significant positives for every negatively associated rhizobacterium, while low-productivity rhizospheres showed 2.3 significant negatives for every positively associated rhizobacterium. Overall rhizobacterial community diversity as measured by the Chao1, Shannon, and Simpson indexes was nonlinearly related to productivity, closely fitting a wavelike cubic equation. We conclude that shoot biomass productivity is strongly related to the ratio of positive- to negative-productivity-associated rhizobacteria in the rhizosphere. This study identifies significant OTUs composing the productive and unproductive rhizobacterial communities. PMID:22504815
The origin of chow chows in the light of the East Asian breeds.
Yang, Hechuan; Wang, Guodong; Wang, Meng; Ma, Yaping; Yin, Tingting; Fan, Ruoxi; Wu, Hong; Zhong, Li; Irwin, David M; Zhai, Weiwei; Zhang, Yaping
2017-02-16
East Asian dog breeds are one of the most ancient groups of dogs that radiated after the domestication of the dog and represent the most basal lineages of dog evolution. Among these, the Chow Chow is an ancient breed that embodies very distinct morphological and physiological features, such as sturdy build, dense coat, and blue/purple tongue. Using a Restricted site Associated DNA (RAD) sequencing approach, we sequenced the genomes of nine Chow Chows from China. Combined with a dataset of 37 canid whole genome sequencing (WGS) from several published works, we found that the Chow Chow is one of the most basal lineages, which originated together with other East Asian breeds, such as the Shar-Pei and Akita. Demographic analysis found that Chow Chows originated from the Chinese indigenous dog about 8300 years ago. The bottleneck leading to Chow Chows was not strong and genetic migration between Chow Chows and other populations is low. Two classes of genes show strong evidence of positive selection along the Chow Chow lineage, namely genes related to metabolism and digestion as well as muscle/heart development and differentiation. Dog breeds from East Asia, including the Chow Chow, originated from Chinese indigenous dogs very early in time. The genetic bottleneck leading to Chow Chows and migrations with other populations are found to be quite mild. Our current study represents an early endeavor to characterize the origin of East Asian dog breeds and establishes an important reference point for understanding the origin of ancient breeds in Asia.
Ishibashi, J; Saido-Sakanaka, H; Yang, J; Sagisaka, A; Yamakawa, M
1999-12-01
A novel member of the insect defensins, a family of antibacterial peptides, was purified from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros, immunized with Escherichia coli. A full-size cDNA was cloned by combining reverse-transcription PCR (RT-PCR), and 5'- and 3'-rapid amplification of cDNA ends (RACE). Analysis of the O. rhinoceros defensin gene expression showed it to be expressed in the fat body and hemocyte, midgut and Malpighian tubules. O. rhinoceros defensin showed strong antibacterial activity against Staphylococcus aureus. A 9-mer peptide amidated at its C-terminus, AHCLAICRK-NH2 (Ala22-Lys30-NH2), was synthesized based on the deduced amino-acid sequence, assumed to be an active site sequence by analogy with the sequence of a defensin isolated from larvae of the beetle Allomyrina dichotoma. This peptide showed antibacterial activity against S. aureus, methicillin-resistant S. aureus, E. coli and Pseudomonas aeruginosa. We further modified this oligopeptide and synthesized five 9-mer peptides, ALRLAIRKR-NH2, ALLLAIRKR-NH2, AWLLAIRKR-NH2, ALYLAIRKR-NH2 and ALWLAIRKR-NH2. These oligopeptides showed strong antibacterial activity against Gram-negative and Gram-positive bacteria. The antibacterial effect of Ala22-Lys30-NH2 analogues was due to its interaction with bacterial membranes, judging from the leakage of liposome-entrapped glucose. These Ala22-Lys30-NH2 analogues did not show haemolytic activity and did not inhibit the growth of murine fibroblast cells or macrophages, except for AWLLAIRKR-NH2.
Poon, Art F. Y; Kosakovsky Pond, Sergei L.; Bennett, Phil; Richman, Douglas D; Leigh Brown, Andrew J.; Frost, Simon D. W
2007-01-01
CD8+ cytotoxic T-lymphocytes (CTLs) perform a critical role in the immune control of viral infections, including those caused by human immunodeficiency virus type 1 (HIV-1) and hepatitis C virus (HCV). As a result, genetic variation at CTL epitopes is strongly influenced by host-specific selection for either escape from the immune response, or reversion due to the replicative costs of escape mutations in the absence of CTL recognition. Under strong CTL-mediated selection, codon positions within epitopes may immediately “toggle” in response to each host, such that genetic variation in the circulating virus population is shaped by rapid adaptation to immune variation in the host population. However, this hypothesis neglects the substantial genetic variation that accumulates in virus populations within hosts. Here, we evaluate this quantity for a large number of HIV-1– (n ≥ 3,000) and HCV-infected patients (n ≥ 2,600) by screening bulk RT-PCR sequences for sequencing “mixtures” (i.e., ambiguous nucleotides), which act as site-specific markers of genetic variation within each host. We find that nonsynonymous mixtures are abundant and significantly associated with codon positions under host-specific CTL selection, which should deplete within-host variation by driving the fixation of the favored variant. Using a simple model, we demonstrate that this apparently contradictory outcome can be explained by the transmission of unfavorable variants to new hosts before they are removed by selection, which occurs more frequently when selection and transmission occur on similar time scales. Consequently, the circulating virus population is shaped by the transmission rate and the disparity in selection intensities for escape or reversion as much as it is shaped by the immune diversity of the host population, with potentially serious implications for vaccine design. PMID:17397261
Improved analyses of human mtDNA sequences support a recent African origin for Homo sapiens.
Penny, D; Steel, M; Waddell, P J; Hendy, M D
1995-09-01
New quantitative methods are applied to the 135 human mitochondrial sequences from the Vigilant et al. data set. General problems in analyzing large numbers of short sequences are discussed, and an improved strategy is suggested. A key feature is to focus not on individual trees but on the general "landscape" of trees. Over 1,000 searches were made from random starting trees with only one tree (a local optimum) being retained each time, thereby ensuring optima were found independently. A new tree comparison metric was developed that is unaffected by rearrangements of trees around many very short internal edges. Use of this metric showed that downweighting hypervariable sites revealed more evolutionary structure than studies that weighted all sites equally. Our results are consistent with convergence toward a global optimum. Crucial features are that the best optima show very strong regional differentiation, a common group of 49 African sequences is found in all the best optima, and the best optima contain the 16 !Kung sequences in a separate group of San people. The other 86 sequences form a heterogeneous mixture of Africans, Europeans, Australopapuans, and Asians. Thus all major human lineages occur in Africa, but only a subset occurs in the rest of the world. The existence of these African-only groups strongly contradicts multiregional theories for the origin of Homo sapiens that require widespread migration and interbreeding over the entire range of H. erectus. Only when the multiregional model is rejected is it appropriate to consider the root, based on a single locus, to be the center of origin of a population (otherwise different loci could give alternative geographic positions for the root). For this data, several methods locate the root within the group of 49 African sequences and are thus consistent with the recent African origin of H. sapiens. We demonstrate that the time of the last common ancestor cannot be the time of major expansion in human numbers, and our results are thus also consistent with recent models that differentiate between the last common ancestor, expansion out of Africa, and the major expansion in human populations. Such a two-phase model is consistent with a wide range of molecular and archeological evidence.
Huang, Yili; Feng, Hao; Lu, Hang; Zeng, Yanhua
2017-07-01
It is believed that sphingomonads are ubiquitously distributed in environments. However detailed information about their community structure and their co-relationship with environmental parameters remain unclear. In this study, novel sphingomonads-specific primers based on the 16S rRNA gene were designed to investigate the distribution of sphingomonads in 10 different niches. Both in silico and in-practice tests on pure cultures and environmental samples showed that Sph384f/Sph701r was an efficient primer set. Illumina MiSeq sequencing revealed that community structures of sphingomonads were significantly different among the 10 samples, although 12 sphingomonad genera were present in all samples. Based on RDA analysis and Monte Carlo permutation test, sphingomonad community structure was significantly correlated with limnetic and marine habitat types. Among these niches, the genus Sphingomicrobium showed strong positive correlation with marine habitats, whereas genera Sphingobium, Novosphingobium, Sphingopyxis, and Sphingorhabdus showed strong positive correlation with limnetic habitats. Our study provided direct evidence that sphingomonads are ubiquitously distributed in environments, and revealed for the first time that their community structure can be correlated with habitats.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...
2017-07-18
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Cladaras, C; Hadzopoulou-Cladaras, M; Nolte, R T; Atkinson, D; Zannis, V I
1986-01-01
We have isolated and sequenced overlapping cDNA clones covering the entire sequence of human apolipoprotein B-100 (apoB-100). DNA sequence analysis and determination of the mRNA transcription initiation site by S1 nuclease mapping showed that the apoB mRNA consists of 14,112 nucleotides including the 5' and 3' untranslated regions which are 128 and 301 nucleotides respectively. The DNA-derived protein sequence shows that apoB-100 is 513,000 daltons and contains 4560 amino acids including a 24-amino-acid-long signal peptide. The mol. wt of apoB-100 implies that there is one apoB molecule per LDL particle. Computer analysis of the predicted secondary structure of the protein showed that some of the potential alpha helical and beta sheet structures are amphipathic, whereas others have non-amphipathic neutral to apolar character. These latter regions may contribute to the formation of the lipid-binding domains of apoB-100. The protein contains 25 cysteines and 20 potential N-glycosylation sites. The majority of cysteines are distributed in the amino terminal portion of the protein. Four of the potential glycosylation sites are in predicted beta turn structures and may represent true glycosylation positions. ApoB lacks the tandem repeats which are characteristic of other apolipoproteins. The mean hydrophobicity the mean value of H1 and helical hydrophobic moment the mean value of microH profiles of apoB showed the presence of several potential helical regions with strong polar character and high hydrophobic moment. The region with the highest hydrophobic moment, between amino acid residues 3352 and 3369, contains five closely spaced, positively charged residues, and has sequence homology to the LDL receptor binding site of apoE. This region is flanked by three neighbouring regions with positively charged amino acids and high hydrophobic moment that are located between residues 3174 and 3681. One or more of these closely spaced apoB sequences may be involved in the formation of the LDL receptor-binding domain of apoB-100. Blotting analysis of intestinal RNA and hybridization of the blots with carboxy apoB cDNA probes produced a single 15-kb hybridization band whereas hybridization with amino terminal probes produced two hybridization bands of 15 and 8 kb. Our data indicate that both forms of apoB mRNA contain common sequences which extend from the amino terminal of apoB-100 to the vicinity of nucleotide residue 6300. These two messages may have resulted from differential splicing of the same primary apoB mRNA transcript. Images Fig. 4. Fig. 6. PMID:3030729
NASA Astrophysics Data System (ADS)
Frei, R.; Gaucher, C.
2007-12-01
Positive δ13C carbonate values, combined with the occurrence of Fe-rich cherts (oxide-facies BIF) and organic-rich black shales within the late Ediacaran (ca. 580-560 Ma) Yerbal Fm. of the Arroyo del Soldato Group (Uruguay) are compatible with paleoclimatic models which postulate that enhanced bioproductivity due to higher availability of nutrient (P, N, Fe) was essential for controlling Neoproterozoic glaciations. Tracing of associated redox processes (f.e. linked to oxygenation of bottom waters in restricted basins) that might have been responsible for the deposition of Fe-rich cherts (BIFs) is therefore an important tool to better understand the seawater changes during cold-warm periods. Besides the traditionally used Fe and Mo isotopic systems, the redox-sensitive element Cr (Cr(III); Cr(IV)) and its stable isotopes offer another complementary system to trace paleo-redox processes. We have applied Cr stable isotope systematics to a sequence of samples from a late Ediacaran sedimentary sequence in Uruguay, using a 52Cr-54Cr double spike (Schoenberg et al., Chem..Geol., subm.). The middle Yerbal Fm. is dominated by organic-rich, black shales and black dolostones (δ53Cr = -0.05‰), followed by organic-rich cherts (δ53Cr = +1.83 - +4.49 ‰) and BIF (δ53Cr = -0.31 +0.90 ‰) gradually changing into Fe-bearing, organic-rich cherts and shales (δ53Cr = -0.28 - -0.01 ‰), and another sequence with BIF and organic-rich cherts topped by carbonates of the lower Polanco Fm. (δ53Cr = -0.17 to -0.27 ‰). The strongly positively fractionated Cr isotopic signatures in organic-rich and Fe-rich cherts in the Yerbal Fm. may point to significant oxidation processes either directly in the seawater column and/or during early diagenetic processes at the sediment-water interface. While these strongly positive δ53Cr values are the first to be reported from Neoproterozoic sedimentary sequence, the exact nature of the chemical process that produced these anomalies is not yet understood. However, the occurrence of these anomalies in organic-rich and Fe-rich chemical sediments that were deposited in a period following a glacial (Gaskiers?) event is compatible with "Snowball Earth" scenarios whereby impulsive oxidation of the upper seawater was in response to ice cover retraction which allowed booming of the biosphere and concomitant oxidation of accumulated Fe2+ and subsequent precipitation of the Fe-oxyhydroxides to form the "BIF" during such epochs. Schoenberg et al. (subm.) The stable Cr isotope inventory of solid earth reservoirs determined by double-spike MC-ICP-MS. Chemical Geology
Inferring Short-Range Linkage Information from Sequencing Chromatograms
Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas
2013-01-01
Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crooks, Gavin E.
WebLogo is a web based application designed to make the generation of sequence logos as easy and painless as possible. Sequesnce logos are a graphical representation of an amino acid or nucleic acid multiple sequence alignment developed by Tom Schneider and Mike Stephens. Each logo consists of stacks of symbols, one stack for each position in the sequence. The overall height of the stack indicates the sequence conservation at that position, while the height of symbols within the stack indicates the relative frequency of each amino or nucleic acid at that position. In general, a sequence logo provides a richermore » and more precise description of, for example, a binding site, than would a consensus sequence.« less
Xie, XiaoTing; Bil, Joanna; Shantz, Emily; Hammermueller, Jutta; Nagy, Eva; Turner, Patricia V
2017-09-01
Lapine rotavirus and astrovirus have been associated with disease in rabbits, and there is strong evidence of zoonotic transmission of lapine hepatitis E virus (HEV). Outbreaks of enteritis are common on commercial meat farms, resulting in poor welfare, high rabbit mortality, and significant financial losses for rabbit producers. Currently, none of these viruses are routinely tested by diagnostic laboratories. In this study, we assessed the prevalence of rotavirus, astrovirus, and HEV RNA in 205 pooled and individual fecal samples from healthy Canadian laboratory, companion, shelter and commercial meat rabbit populations. Viral RNA were extracted and amplified via RT-PCR using virus-specific primers. Positive samples from the first cohort of samples tested were sequenced and aligned to previously identified viruses to confirm the products. Almost 45% (13/29) of the surveyed commercial rabbit farms were astrovirus-positive. Three commercial meat rabbit samples were positive for rotavirus, and either astrovirus or HEV RNA was also detected. Three companion rabbit samples also tested positive for lapine HEV. Samples from specific pathogen-free laboratory animals were negative for all viruses. Sequencing results showed highest identity to rotavirus A strain 30-96, lapine astrovirus strain 2208 and lapine HEV strain CMC-1. These results permit a better understanding of the prevalence of rotavirus, astrovirus, and hepatitis E virus in Canadian domestic rabbit populations, and continued screening for viruses may help to reduce risk of zoonotic agent transmission as well as providing a better understanding of potential causative agents of rabbit enteritis. Copyright © 2017 Elsevier B.V. All rights reserved.
Liu, Qingpo
2009-03-01
In C. elegans, four C2H2 zinc-finger proteins (ZIM-1, ZIM-2, ZIM-3, and HIM-8), which are arranged in tandem, mediate chromosome-specific pairing and synapsis during meiosis. The zim/him-8 genes from three Caenorhabditis species were contrasted in an effort to investigate the mechanisms driving their evolution. Here it is shown that the preservation of higher degree of sequence similarity in the N-terminal portion, particularly in several regions within the second exon between paralogous zim genes (especially between zim-1 and zim-3), is due to independent interparalogue gene conversions. However, the evolutionary force is not uniformly strong across species. The present data reveal that more frequent gene conversion events have occurred in C. elegans, whereas only gene conversions between zim-1 and zim-3 are detected in C. remanei. Although gene conversions are predicted to be present among zim-1, zim-2, and zim-3 in C. briggsae, the conversion tracts between zim-1/zim-2 and zim-2/zim-3 are very short. Moreover, positive selection analysis was performed on the basis of the significantly discordant phylogenies reconstructed using the N- and C-terminal sequences, respectively. Several codon sites located in the regions that are supposed not to have experienced gene conversions are predicted to be under the influence of positive selection. In comparison, stronger positive selection has acted on the C-terminal region relative to the N-terminal region. Thus, the zim/him-8 genes that evolve concertedly have also been shown to undergo adaptive diversifying selection.
Accuracy of maxillary positioning after standard and inverted orthognathic sequencing.
Ritto, Fabio G; Ritto, Thiago G; Ribeiro, Danilo Passeado; Medeiros, Paulo José; de Moraes, Márcio
2014-05-01
This study aimed to compare the accuracy of maxillary positioning after bimaxillary orthognathic surgery, using 2 sequences. A total of 80 cephalograms (40 preoperative and 40 postoperative) from 40 patients were analyzed. Group 1 included radiographs of patients submitted to conventional sequence, whereas group 2 patients were submitted to inverted sequence. The final position of the maxillary central incisor was obtained after vertical and horizontal measurements of the tracings, and it was compared with what had been planned. The null hypothesis, which stated that there would be no difference between the groups, was tested. After applying the Welch t test for comparison of mean differences between maxillary desired and achieved position, considering a statistical significance of 5% and a 2-tailed test, the null hypothesis was not rejected (P > .05). Thus, there was no difference in the accuracy of maxillary positioning between groups. Conventional and inverted sequencing proved to be reliable in positioning the maxilla after LeFort I osteotomy in bimaxillary orthognathic surgeries. Copyright © 2014 Elsevier Inc. All rights reserved.
Janecek, Elisabeth; Streichan, Sabine; Strube, Christina
2012-10-18
Rickettsioses are caused by pathogenic species of the genus Rickettsia and play an important role as emerging diseases. The bacteria are transmitted to mammal hosts including humans by arthropod vectors. Since detection, especially in tick vectors, is usually based on PCR with genus-specific primers to include different occurring Rickettsia species, subsequent species identification is mainly achieved by Sanger sequencing. In the present study a real-time pyrosequencing approach was established with the objective to differentiate between species occurring in German Ixodes ticks, which are R. helvetica, R. monacensis, R. massiliae, and R. felis. Tick material from a quantitative real-time PCR (qPCR) based study on Rickettsia-infections in I. ricinus allowed direct comparison of both sequencing techniques, Sanger and real-time pyrosequencing. A sequence stretch of rickettsial citrate synthase (gltA) gene was identified to contain divergent single nucleotide polymorphism (SNP) sites suitable for Rickettsia species differentiation. Positive control plasmids inserting the respective target sequence of each Rickettsia species of interest were constructed for initial establishment of the real-time pyrosequencing approach using Qiagen's PSQ 96MA Pyrosequencing System operating in a 96-well format. The approach included an initial amplification reaction followed by the actual pyrosequencing, which is traceable by pyrograms in real-time. Afterwards, real-time pyrosequencing was applied to 263 Ixodes tick samples already detected Rickettsia-positive in previous qPCR experiments. Establishment of real-time pyrosequencing using positive control plasmids resulted in accurate detection of all SNPs in all included Rickettsia species. The method was then applied to 263 Rickettsia-positive Ixodes ricinus samples, of which 153 (58.2%) could be identified for their species (151 R. helvetica and 2 R. monacensis) by previous custom Sanger sequencing. Real-time pyrosequencing identified all Sanger-determined ticks as well as 35 previously undifferentiated ticks resulting in a total number of 188 (71.5%) identified samples. Pyrosequencing sensitivity was found to be strongly dependent on gltA copy numbers in the reaction setup. Whereas less than 101 copies in the initial amplification reaction resulted in identification of 15.1% of the samples only, the percentage increased to 54.2% at 101-102 copies, to 95.6% at >102-103 copies and reached 100% samples identified for their Rickettsia species if more than 103 copies were present in the template. The established real-time pyrosequencing approach represents a reliable method for detection and differentiation of Rickettsia spp. present in I. ricinus diagnostic material and prevalence studies. Furthermore, the method proved to be faster, more cost-effective as well as more sensitive than custom Sanger sequencing with simultaneous high specificity.
Xiong, Jie; Gao, Shan; Dui, Wen; Yang, Wentao; Chen, Xiao; Taverna, Sean D; Pearlman, Ronald E; Ashlock, Wendy; Miao, Wei; Liu, Yifan
2016-12-01
The ciliate protozoan Tetrahymena thermophila contains two types of structurally and functionally differentiated nuclei: the transcriptionally active somatic macronucleus (MAC) and the transcriptionally silent germ-line micronucleus (MIC). Here, we demonstrate that MAC features well-positioned nucleosomes downstream of transcription start sites and flanking splice sites. Transcription-associated trans-determinants promote nucleosome positioning in MAC. By contrast, nucleosomes in MIC are dramatically delocalized. Nucleosome occupancy in MAC and MIC are nonetheless highly correlated with each other, as well as with in vitro reconstitution and predictions based upon DNA sequence features, revealing unexpectedly strong contributions from cis-determinants. In particular, well-positioned nucleosomes are often matched with GC content oscillations. As many nucleosomes are coordinately accommodated by both cis- and trans-determinants, we propose that their distribution is shaped by the impact of these nucleosomes on the mutational and transcriptional landscape, and driven by evolutionary selection. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Counting Patterns in Degenerated Sequences
NASA Astrophysics Data System (ADS)
Nuel, Grégory
Biological sequences like DNA or proteins, are always obtained through a sequencing process which might produce some uncertainty. As a result, such sequences are usually written in a degenerated alphabet where some symbols may correspond to several possible letters (ex: IUPAC DNA alphabet). When counting patterns in such degenerated sequences, the question that naturally arises is: how to deal with degenerated positions ? Since most (usually 99%) of the positions are not degenerated, it is considered harmless to discard the degenerated positions in order to get an observation, but the exact consequences of such a practice are unclear. In this paper, we introduce a rigorous method to take into account the uncertainty of sequencing for biological sequences (DNA, Proteins). We first introduce a Forward-Backward approach to compute the marginal distribution of the constrained sequence and use it both to perform a Expectation-Maximization estimation of parameters, as well as deriving a heterogeneous Markov distribution for the constrained sequence. This distribution is hence used along with known DFA-based pattern approaches to obtain the exact distribution of the pattern count under the constraints. As an illustration, we consider a EST dataset from the EMBL database. Despite the fact that only 1% of the positions in this dataset are degenerated, we show that not taking into account these positions might lead to erroneous observations, further proving the interest of our approach.
Sjö, Nicolai Christian; von Buchwald, Christian; Cassonnet, Patricia; Norrild, Bodil; Prause, Jan Ulrik; Vinding, Troels; Heegaard, Steffen
2007-08-01
To examine conjunctival papilloma and normal conjunctival tissue for the presence of human papillomavirus (HPV). Archival paraffin wax-embedded tissue from 165 conjunctival papillomas and from 20 histological normal conjunctival biopsy specimens was analysed for the presence of HPV by PCR. Specimens considered HPV positive using consensus primers, but with a negative or uncertain PCR result using type-specific HPV probes, were analysed with DNA sequencing. HPV was present in 86 of 106 (81%) beta-globin-positive papillomas. HPV type 6 was positive in 80 cases, HPV type 11 was identified in 5 cases and HPV type 45 was present in a single papilloma. All the 20 normal conjunctival biopsy specimens were beta-globin positive and HPV negative. There is a strong association between HPV and conjunctival papilloma. The study presents the largest material of conjunctival papilloma investigated for HPV and the first investigation of HPV in normal conjunctival tissue. HPV types 6 and 11 are the most common HPV types in conjunctival papilloma. This also is the first report of HPV type 45 in conjunctival papilloma.
Faham, Malek; Carlton, Victoria; Moorhead, Martin; Zheng, Jianbiao; Klinger, Mark; Pepin, Francois; Asbury, Thomas; Vignali, Marissa; Emerson, Ryan O; Robins, Harlan S; Ireland, James; Baechler-Gillespie, Emily; Inman, Robert D
2017-04-01
Ankylosing spondylitis (AS), a chronic inflammatory disorder, has a notable association with HLA-B27. One hypothesis suggests that a common antigen that binds to HLA-B27 is important for AS disease pathogenesis. This study was undertaken to determine sequences and motifs that are shared among HLA-B27-positive AS patients, using T cell repertoire next-generation sequencing. To identify motifs enriched among B27-positive AS patients, we performed T cell receptor β (TCRβ) repertoire sequencing on samples from 191 B27-positive AS patients, 43 B27-negative AS patients, and 227 controls, and we obtained >77 million TCRβ clonotype sequences. First, we assessed whether any of 50 previously published sequences were enriched in B27-positive AS patients. We then used training and test cohorts to identify discovered motifs that were enriched in B27-positive AS patients versus controls. Six previously published and 11 discovered motifs were enriched in the B27-positive AS samples as compared to controls. After combining motifs related by sequence, we identified a total of 15 independent motifs. Both the full set of 15 motifs and a set of 6 published motifs were enriched in the B27-positive AS patients as compared to B27-positive healthy individuals (P = 0.049 and P = 0.001, respectively). Using an independent cohort, we validated that at least some of these motifs were associated with AS, and not simply with B27-positive status. We identified TCRβ motifs that are enriched in B27-positive AS patients as compared to B27-positive healthy controls. This suggests that a common antigen, presented by HLA-B27 and detected by CD8+ T cells, may be associated with AS disease pathogenesis. © 2016, American College of Rheumatology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gregory, S. G.; Hillenbrand, L. A.; Donati, J.-F.
2012-08-20
Zeeman-Doppler imaging studies have shown that the magnetic fields of T Tauri stars can be significantly more complex than a simple dipole and can vary markedly between sources. We collect and summarize the magnetic field topology information obtained to date and present Hertzsprung-Russell (H-R) diagrams for the stars in the sample. Intriguingly, the large-scale field topology of a given pre-main-sequence (PMS) star is strongly dependent upon the stellar internal structure, with the strength of the dipole component of its multipolar magnetic field decaying rapidly with the development of a radiative core. Using the observational data as a basis, we arguemore » that the general characteristics of the global magnetic field of a PMS star can be determined from its position in the H-R diagram. Moving from hotter and more luminous to cooler and less luminous stars across the PMS of the H-R diagram, we present evidence for four distinct magnetic topology regimes. Stars with large radiative cores, empirically estimated to be those with a core mass in excess of {approx}40% of the stellar mass, host highly complex and dominantly non-axisymmetric magnetic fields, while those with smaller radiative cores host axisymmetric fields with field modes of higher order than the dipole dominant (typically, but not always, the octupole). Fully convective stars above {approx}> 0.5 M{sub Sun} appear to host dominantly axisymmetric fields with strong (kilo-Gauss) dipole components. Based on similarities between the magnetic properties of PMS stars and main-sequence M-dwarfs with similar internal structures, we speculate that a bistable dynamo process operates for lower mass stars ({approx}< 0.5 M{sub Sun} at an age of a few Myr) and that they will be found to host a variety of magnetic field topologies. If the magnetic topology trends across the H-R diagram are confirmed, they may provide a new method of constraining PMS stellar evolution models.« less
Physics behind the mechanical nucleosome positioning code
NASA Astrophysics Data System (ADS)
Zuiddam, Martijn; Everaers, Ralf; Schiessel, Helmut
2017-11-01
The positions along DNA molecules of nucleosomes, the most abundant DNA-protein complexes in cells, are influenced by the sequence-dependent DNA mechanics and geometry. This leads to the "nucleosome positioning code", a preference of nucleosomes for certain sequence motives. Here we introduce a simplified model of the nucleosome where a coarse-grained DNA molecule is frozen into an idealized superhelical shape. We calculate the exact sequence preferences of our nucleosome model and find it to reproduce qualitatively all the main features known to influence nucleosome positions. Moreover, using well-controlled approximations to this model allows us to come to a detailed understanding of the physics behind the sequence preferences of nucleosomes.
An intuitive graphical webserver for multiple-choice protein sequence search.
Banky, Daniel; Szalkai, Balazs; Grolmusz, Vince
2014-04-10
Every day tens of thousands of sequence searches and sequence alignment queries are submitted to webservers. The capitalized word "BLAST" becomes a verb, describing the act of performing sequence search and alignment. However, if one needs to search for sequences that contain, for example, two hydrophobic and three polar residues at five given positions, the query formation on the most frequently used webservers will be difficult. Some servers support the formation of queries with regular expressions, but most of the users are unfamiliar with their syntax. Here we present an intuitive, easily applicable webserver, the Protein Sequence Analysis server, that allows the formation of multiple choice queries by simply drawing the residues to their positions; if more than one residue are drawn to the same position, then they will be nicely stacked on the user interface, indicating the multiple choice at the given position. This computer-game-like interface is natural and intuitive, and the coloring of the residues makes possible to form queries requiring not just certain amino acids in the given positions, but also small nonpolar, negatively charged, hydrophobic, positively charged, or polar ones. The webserver is available at http://psa.pitgroup.org. Copyright © 2014 Elsevier B.V. All rights reserved.
Identification of sequence motifs significantly associated with antisense activity.
McQuisten, Kyle A; Peek, Andrew S
2007-06-07
Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic mediators to speed the process along like the RNA Induced Silencing Complex (RISC) in RNAi. The independence of motif position and antisense activity also allows us to bypass consideration of this feature in the modelling process, promoting model efficiency and reducing the chance of overfitting when predicting antisense activity. The increase in SVR correlation with significant features compared to nearest-neighbour features indicates that thermodynamics alone is likely not the only factor in determining antisense efficiency.
Discovering frequently recurring movement sequences in team-sport athlete spatiotemporal data.
Sweeting, Alice J; Aughey, Robert J; Cormack, Stuart J; Morgan, Stuart
2017-12-01
Athlete external load is typically analysed from predetermined movement thresholds. The combination of movement sequences and differences in these movements between playing positions is also currently unknown. This study developed a method to discover the frequently recurring movement sequences across playing position during matches. The external load of 12 international female netball athletes was collected by a local positioning system during four national-level matches. Velocity, acceleration and angular velocity were calculated from positional (X, Y) data, clustered via one-dimensional k-means and assigned a unique alphabetic label. Combinations of velocity, acceleration and angular velocity movement were compared using the Levenshtein distance and similarities computed by the longest common substring problem. The contribution of each movement sequence, according to playing position and relative to the wider data set, was then calculated via the Minkowski distance. A total of 10 frequently recurring combinations of movement were discovered, regardless of playing position. Only the wing attack, goal attack and goal defence playing positions are closely related. We developed a technique to discover the movement sequences, according to playing position, performed by elite netballers. This methodology can be extended to discover the frequently recurring movements within other team sports and across levels of competition.
Nullomers and High Order Nullomers in Genomic Sequences
Vergni, Davide; Santoni, Daniele
2016-01-01
A nullomer is an oligomer that does not occur as a subsequence in a given DNA sequence, i.e. it is an absent word of that sequence. The importance of nullomers in several applications, from drug discovery to forensic practice, is now debated in the literature. Here, we investigated the nature of nullomers, whether their absence in genomes has just a statistical explanation or it is a peculiar feature of genomic sequences. We introduced an extension of the notion of nullomer, namely high order nullomers, which are nullomers whose mutated sequences are still nullomers. We studied different aspects of them: comparison with nullomers of random sequences, CpG distribution and mean helical rise. In agreement with previous results we found that the number of nullomers in the human genome is much larger than expected by chance. Nevertheless antithetical results were found when considering a random DNA sequence preserving dinucleotide frequencies. The analysis of CpG frequencies in nullomers and high order nullomers revealed, as expected, a high CpG content but it also highlighted a strong dependence of CpG frequencies on the dinucleotide position, suggesting that nullomers have their own peculiar structure and are not simply sequences whose CpG frequency is biased. Furthermore, phylogenetic trees were built on eleven species based on both the similarities between the dinucleotide frequencies and the number of nullomers two species share, showing that nullomers are fairly conserved among close species. Finally the study of mean helical rise of nullomers sequences revealed significantly high mean rise values, reinforcing the hypothesis that those sequences have some peculiar structural features. The obtained results show that nullomers are the consequence of the peculiar structure of DNA (also including biased CpG frequency and CpGs islands), so that the hypermutability model, also taking into account CpG islands, seems to be not sufficient to explain nullomer phenomenon. Finally, high order nullomers could emphasize those features that already make simple nullomers useful in several applications. PMID:27906971
Peng, Tao; Xue, Chenghai; Bi, Jianning; Li, Tingting; Wang, Xiaowo; Zhang, Xuegong; Li, Yanda
2008-04-26
Alternative splicing expands transcriptome diversity and plays an important role in regulation of gene expression. Previous studies focus on the regulation of a single cassette exon, but recent experiments indicate that multiple cassette exons within a gene may interact with each other. This interaction can increase the potential to generate various transcripts and adds an extra layer of complexity to gene regulation. Several cases of exon interaction have been discovered. However, the extent to which the cassette exons coordinate with each other remains unknown. Based on EST data, we employed a metric of correlation coefficients to describe the interaction between two adjacent cassette exons and then categorized these exon pairs into three different groups by their interaction (correlation) patterns. Sequence analysis demonstrates that strongly-correlated groups are more conserved and contain a higher proportion of pairs with reading frame preservation in a combinatorial manner. Multiple genome comparison further indicates that different groups of correlated pairs have different evolutionary courses: (1) The vast majority of positively-correlated pairs are old, (2) most of the weakly-correlated pairs are relatively young, and (3) negatively-correlated pairs are a mixture of old and young events. We performed a large-scale analysis of interactions between adjacent cassette exons. Compared with weakly-correlated pairs, the strongly-correlated pairs, including both the positively and negatively correlated ones, show more evidence that they are under delicate splicing control and tend to be functionally important. Additionally, the positively-correlated pairs bear strong resemblance to constitutive exons, which suggests that they may evolve from ancient constitutive exons, while negatively and weakly correlated pairs are more likely to contain newly emerging exons.
Koneva, Lada A; Zhang, Yanxiao; Virani, Shama; Hall, Pelle B; McHugh, Jonathan B; Chepeha, Douglas B; Wolf, Gregory T; Carey, Thomas E; Rozek, Laura S; Sartor, Maureen A
2018-01-01
The incidence of human papillomavirus (HPV)-related oropharynx cancer has steadily increased over the past two decades and now represents a majority of oropharyngeal cancer cases. Integration of the HPV genome into the host genome is a common event during carcinogenesis that has clinically relevant effects if the viral early genes are transcribed. Understanding the impact of HPV integration on clinical outcomes of head and neck squamous cell carcinoma (HNSCC) is critical for implementing deescalated treatment approaches for HPV + HNSCC patients. RNA sequencing (RNA-seq) data from HNSCC tumors ( n = 84) were used to identify and characterize expressed integration events, which were overrepresented near known head and neck, lung, and urogenital cancer genes. Five genes were recurrent, including CD274 (PD-L1) A significant number of genes detected to have integration events were found to interact with Tp63, ETS, and/or FOX1A. Patients with no detected integration had better survival than integration-positive and HPV - patients. Furthermore, integration-negative tumors were characterized by strongly heightened signatures for immune cells, including CD4 + , CD3 + , regulatory, CD8 + T cells, NK cells, and B cells, compared with integration-positive tumors. Finally, genes with elevated expression in integration-negative specimens were strongly enriched with immune-related gene ontology terms, while upregulated genes in integration-positive tumors were enriched for keratinization, RNA metabolism, and translation. Implications: These findings demonstrate the clinical relevancy of expressed HPV integration, which is characterized by a change in immune response and/or aberrant expression of the integration-harboring cancer-related genes, and suggest strong natural selection for tumor cells with expressed integration events in key carcinogenic genes. Mol Cancer Res; 16(1); 90-102. ©2017 AACR . ©2017 American Association for Cancer Research.
On causal roles and selected effects: our genome is mostly junk.
Doolittle, W Ford; Brunet, Tyler D P
2017-12-05
The idea that much of our genome is irrelevant to fitness-is not the product of positive natural selection at the organismal level-remains viable. Claims to the contrary, and specifically that the notion of "junk DNA" should be abandoned, are based on conflating meanings of the word "function". Recent estimates suggest that perhaps 90% of our DNA, though biochemically active, does not contribute to fitness in any sequence-dependent way, and possibly in no way at all. Comparisons to vertebrates with much larger and smaller genomes (the lungfish and the pufferfish) strongly align with such a conclusion, as they have done for the last half-century.
Jabbar, Abdul; Papini, Roberto; Ferrini, Nadia; Gasser, Robin B
2012-10-01
A 9 year-old male, neutered cat with a history of a sudden onset of lethargy, anorexia and respiratory distress was presented in a veterinary practice in Lucca, Italy. A clinical examination revealed that the cat was severely dehydrated, and had pale mucous membranes and tachypnoea. No pain or discomfort was detected at the time of physical examination. The cat was administered fluids, antibiotics and supportive therapy, but died overnight. The owner of the cat requested for a post mortem examination to be conducted. At necropsy, acephalic structures, consistent with proliferative tapeworm (cestode) larvae, were detected in the thoracic cavity on pleural surfaces. As these larvae could not be identified to genus or species by microscopy, a PCR-based sequencing-phylogenetic approach was used. Part of the cytochrome c oxidase subunit 1 gene was PCR-amplified from genomic DNAs from five individual larvae and sequenced; all five sequences obtained were identical. This consensus sequence was aligned (over 355 nucleotide positions) with homologous sequences representing a range of cestodes (including Echinococcus granulosus, Echinococcus multilocularis, Hymenolepis microstoma, Mesocestoides spp. and Taenia saginata) from previously published studies and then subjected to phylogenetic analysis. The sequence representing the larval cestode from the affected cat grouped, with strong statistical support, with those representing Mesocestoides corti and Mesocestoides lineatus. Therefore, a definitive diagnosis of pleural proliferative larval mesocestoidiasis could be made. This study illustrates the value of using molecular tools to directly assist clinical and pathological investigations of cestodiases of animals. Copyright © 2012 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Van Strien, Jan W.
2004-01-01
To investigate whether concurrent nonverbal sound sequences would affect visual-hemifield lexical processing, lexical-decision performance of 24 strongly right-handed students (12 men, 12 women) was measured in three conditions: baseline, concurrent neutral sound sequence, and concurrent emotional sound sequence. With the neutral sequence,…
Centromere location in Arabidopsis is unaltered by extreme divergence in CENH3 protein sequence.
Maheshwari, Shamoni; Ishii, Takayoshi; Brown, C Titus; Houben, Andreas; Comai, Luca
2017-03-01
During cell division, spindle fibers attach to chromosomes at centromeres. The DNA sequence at regional centromeres is fast evolving with no conserved genetic signature for centromere identity. Instead CENH3, a centromere-specific histone H3 variant, is the epigenetic signature that specifies centromere location across both plant and animal kingdoms. Paradoxically, CENH3 is also adaptively evolving. An ongoing question is whether CENH3 evolution is driven by a functional relationship with the underlying DNA sequence. Here, we demonstrate that despite extensive protein sequence divergence, CENH3 histones from distant species assemble centromeres on the same underlying DNA sequence. We first characterized the organization and diversity of centromere repeats in wild-type Arabidopsis thaliana We show that A. thaliana CENH3-containing nucleosomes exhibit a strong preference for a unique subset of centromeric repeats. These sequences are largely missing from the genome assemblies and represent the youngest and most homogeneous class of repeats. Next, we tested the evolutionary specificity of this interaction in a background in which the native A. thaliana CENH3 is replaced with CENH3s from distant species. Strikingly, we find that CENH3 from Lepidium oleraceum and Zea mays , although specifying epigenetically weaker centromeres that result in genome elimination upon outcrossing, show a binding pattern on A. thaliana centromere repeats that is indistinguishable from the native CENH3. Our results demonstrate positional stability of a highly diverged CENH3 on independently evolved repeats, suggesting that the sequence specificity of centromeres is determined by a mechanism independent of CENH3. © 2017 Maheshwari et al.; Published by Cold Spring Harbor Laboratory Press.
Isoforms of the major peanut allergen Ara h 2: IgE binding in children with peanut allergy.
Hales, Belinda J; Bosco, Anthony; Mills, Kristina L; Hazell, Lee A; Loh, Richard; Holt, Patrick G; Thomas, Wayne R
2004-10-01
The major peanut allergen Ara h 2 consists of two isoforms, namely Ara h 2.0101 and Ara h 2.0201. The recently identified Ara h 2.0201 isoform contains an extra 12 amino acids including an extra copy of the reported immunodominant epitope DPYSPS. This study aimed to evaluate the IgE binding of the two Ara h 2 isoforms. Ten clones of Ara h 2 were sequenced to assess the relative frequency of the Ara h 2 isoforms and to identify whether there was further variation in the Ara h 2 sequence. IgE binding to Ara h 2.0101 and Ara h 2.0201 was measured for 70 peanut-allergic children using an IgE DELFIA assay to quantitate specific IgE binding. A competition assay was used to measure whether Ara h 2.0201 contained IgE epitopes other than those found for Ara h 2.0101. The original Ara h 2.0101 sequence was found for 6/10 clones and Ara h 2.0201 was found for 2/10 clones. Ara h 2.0201 had the expected insertion of 12 amino acids as well as substitutions at positions 40 (40G) and 142 (142E). Two new isoforms were identified as different polymorphisms of position 142. One Ara h 2.01 clone (Ara h 2.0102) contained 142E and one Ara h 2.02 clone (Ara h 2.0202) contained 142D. A polymorphism that was previously identified by other investigators at position 77 (77Q or 77R) was not found for any of the 10 sequences. Although the level of IgE binding to Ara h 2.0201 of individual patients was frequently higher than the binding to Ara h 2.0101 (p < 0.01), there was a strong correlation in binding to both isoforms (r = 0.987, p < 0.0001) and when analyzed as a group the means were similar. Ara h 2.0101 was not as efficient at blocking reactivity to Ara h 2.0201 indicating there is an additional IgE specificity for the Ara h 2.0201 isoform. Ara h 2.0201 has similar but higher IgE binding than the originally sequenced Ara h 2.0101 isoform and contains other IgE specificities.
A highly divergent Puumala virus lineage in southern Poland.
Rosenfeld, Ulrike M; Drewes, Stephan; Ali, Hanan Sheikh; Sadowska, Edyta T; Mikowska, Magdalena; Heckel, Gerald; Koteja, Paweł; Ulrich, Rainer G
2017-05-01
Puumala virus (PUUV) represents one of the most important hantaviruses in Central Europe. Phylogenetic analyses of PUUV strains indicate a strong genetic structuring of this hantavirus. Recently, PUUV sequences were identified in the natural reservoir, the bank vole (Myodes glareolus), collected in the northern part of Poland. The objective of this study was to evaluate the presence of PUUV in bank voles from southern Poland. A total of 72 bank voles were trapped in 2009 at six sites in this part of Poland. RT-PCR and IgG-ELISA analyses detected three PUUV positive voles at one trapping site. The PUUV-infected animals were identified by cytochrome b gene analysis to belong to the Carpathian and Eastern evolutionary lineages of bank vole. The novel PUUV S, M and L segment nucleotide sequences showed the closest similarity to sequences of the Russian PUUV lineage from Latvia, but were highly divergent to those previously found in northern Poland, Slovakia and Austria. In conclusion, the detection of a highly divergent PUUV lineage in southern Poland indicates the necessity of further bank vole monitoring in this region allowing rational public health measures to prevent human infections.
A report on the outbreak of Zika virus on Easter Island, South Pacific, 2014.
Tognarelli, J; Ulloa, S; Villagra, E; Lagos, J; Aguayo, C; Fasce, R; Parra, B; Mora, J; Becerra, N; Lagos, N; Vera, L; Olivares, B; Vilches, M; Fernández, J
2016-03-01
Zika virus (ZIKV) is an emerging mosquito-borne flavivirus circulating in Asia and Africa. In 2013, a large outbreak was reported on the archipelago of French Polynesia. In this study, we report the detection and molecular characterization of Zika virus for the first time in Chile from an outbreak among the inhabitants of Easter Island. A total of 89 samples from patients suspected of having ZIKV infection were collected between the period from January to May, 2014. Molecular diagnosis of the virus was performed by RT-PCR followed by the sequencing of the region containing the NS5 gene. A comparison of the viral nucleic acid sequence with those of other strains of ZIKA virus was performed using the MEGA software. Fifty-one samples were found positive for ZIKV by RT-PCR analysis. Further analysis of the NS5 gene revealed that the ZIKV strains identified in Easter Island were most closely related to those found in French Polynesia (99.8 to 99.9% nt and 100% aa sequence identity). These results strongly suggest that the transmission pathway leading to the introduction of Zika virus on Easter Island has its origin in French Polynesia.
Lopes-Santos, Lucilene; Castro, Daniel Bedo Assumpção; Ferreira-Tonin, Mariana; Corrêa, Daniele Bussioli Alves; Weir, Bevan Simon; Park, Duckchul; Ottoboni, Laura Maria Mariscal; Neto, Júlio Rodrigues; Destéfano, Suzete Aparecida Lanza
2017-06-01
The phylogenetic classification of the species Burkholderia andropogonis within the Burkholderia genus was reassessed using 16S rRNA gene phylogenetic analysis and multilocus sequence analysis (MLSA). Both phylogenetic trees revealed two main groups, named A and B, strongly supported by high bootstrap values (100%). Group A encompassed all of the Burkholderia species complex, whi.le Group B only comprised B. andropogonis species, with low percentage similarities with other species of the genus, from 92 to 95% for 16S rRNA gene sequences and 83% for conserved gene sequences. Average nucleotide identity (ANI), tetranucleotide signature frequency, and percentage of conserved proteins POCP analyses were also carried out, and in the three analyses B. andropogonis showed lower values when compared to the other Burkholderia species complex, near 71% for ANI, from 0.484 to 0.724 for tetranucleotide signature frequency, and around 50% for POCP, reinforcing the distance observed in the phylogenetic analyses. Our findings provide an important insight into the taxonomy of B. andropogonis. It is clear from the results that this bacterial species exhibits genotypic differences and represents a new genus described herein as Robbsia andropogonis gen. nov., comb. nov.
Martin, Donald S; Wright, André-Denis G; Barta, John R; Desser, Sherwin S
2002-06-01
Phylogenetic relationships within the kinetoplastid flagellates were inferred from comparisons of small-subunit ribosomal RNA gene sequences. These included 5 new gene sequences, Trypanosoma fallisi (2,239 bp), Trypanosoma chattoni (2,180 bp), Trypanosoma mega (2,211 bp), Trypanosoma neveulemairei (2,197 bp), and Trypanosoma ranarum (2,203 bp). Trees produced using maximum-parsimony and distance-matrix methods (least-squares, neighbor-joining, and maximum-likelihood), supported by strong bootstrap and quartet-puzzle analyses, indicated that the trypanosomes are a monophyletic group that divides into 2 major lineages, the salivarian trypanosomes and the nonsalivarian trypanosomes. The nonsalivarian trypanosomes further divide into 2 lineages, 1 containing trypanosomes of birds, mammals, and reptiles and the other containing trypanosomes of fish, reptiles, and anurans. Among the giant trypanosomes, T. chattoni is clearly shown to be distantly related to all the other anuran trypanosome species. Trypanosoma mega is closely associated with T. fallisi and T. ranarum, whereas T. neveulemairei and Trypanosoma rotatorium are sister taxa. The branching order of the anuran trypanosomes suggests that some toad trypanosomes may have evolved by host switching from frogs to toads.
Inhibition of trypanosomal cysteine proteinases by their propeptides.
Lalmanach, G; Lecaille, F; Chagas, J R; Authié, E; Scharfstein, J; Juliano, M A; Gauthier, F
1998-09-25
The ability of the prodomains of trypanosomal cysteine proteinases to inhibit their active form was studied using a set of 23 overlapping 15-mer peptides covering the whole prosequence of congopain, the major cysteine proteinase of Trypanosoma congolense. Three consecutive peptides with a common 5-mer sequence YHNGA were competitive inhibitors of congopain. A shorter synthetic peptide consisting of this 5-mer sequence flanked by two Ala residues (AYHNGAA) also inhibited purified congopain. No residue critical for inhibition was identified in this sequence, but a significant improvement in Ki value was obtained upon N-terminal elongation. Procongopain-derived peptides did not inhibit lysosomal cathepsins B and L but did inhibit native cruzipain (from Dm28c clone epimastigotes), the major cysteine proteinase of Trypanosoma cruzi, the proregion of which also contains the sequence YHNGA. The positioning of the YHNGA inhibitory sequence within the prosegment of trypanosomal proteinases is similar to that covering the active site in the prosegment of cysteine proteinases, the three-dimensional structure of which has been resolved. This strongly suggests that trypanosomal proteinases, despite their long C-terminal extension, have a prosegment that folds similarly to that in related mammal and plant cysteine proteinases, resulting in reverse binding within the active site. Such reverse binding could also occur for short procongopain-derived inhibitory peptides, based on their resistance to proteolysis and their ability to retain inhibitory activity after prolonged incubation. In contrast, homologous peptides in related cysteine proteinases did not inhibit trypanosomal proteinases and were rapidly cleaved by these enzymes.
Sekizuka, Tsuyoshi; Yokoi, Taeko; Murayama, Ohoshi; Millar, B Cherie; Moore, Johne; Matsuda, Motoo
2005-08-01
A newly constructed primer pair (lari-Af/lari-Ar) designed to generate a product of the flagellin (flaA) gene for urease-negative Campylobacter lari produced a PCR amplicon of about 1700 bp for 16 isolates from 7 seagulls, 5 humans, 3 food animals and one mussel in Japan and Northern Ireland. Nucleotide sequencing and alignments of the flaA amplicons from these isolates demonstrated that the deduced amino acid sequences of the possible open reading frame were 564-572 amino acid residues in length with calculated molecular weights of 58,804 to 59,463. The deduced amino acid sequence similarity analysis strongly suggested that the ORF of the flaA from the 16 isolates showed 70-75% sequence similarities to those of Campylobacter jejuni isolates. The approximate Mr of the flagellin purified from some of the isolates of urease-negative C. lari was estimated to range from 59.6 to 61.8 kDa. Thus, flagellin from the isolates of urease-negative C. lari was shown for the first time to have a molecular size similar to those of C. jejuni and Campylobacter coli isolates, but to be different from the shorter flaA and smaller flagellin of urease-positive thermophilic Campylobacter (UPTC) isolates. Flagellins from C. lari spp., consisting of the two representative taxa of urease-negative C. lari and UPTC, thus show genotypic and phenotypic diversity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya
The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya; ...
2015-10-20
The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
Ni, Lianghong; Zhao, Zhili; Xu, Hongxi; Chen, Shilin; Dorje, Gaawe
2016-02-15
Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. The genetic and molecular data about it is deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, as the first sequenced member of the family Gentianaceae. The cp genome is 148,991bp in length, including a large single copy (LSC) region of 81,240bp, a small single copy (SSC) region of 17,085bp and a pair of inverted repeats (IRs) of 25,333bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.
Weak beacon detection for air-to-ground optical wireless link establishment.
Han, Yaoqiang; Dang, Anhong; Tang, Junxiong; Guo, Hong
2010-02-01
In an air-to-ground free-space optical communication system, strong background interference seriously affects the beacon detection, which makes it difficult to establish the optical link. In this paper, we propose a correlation beacon detection scheme under strong background interference conditions. As opposed to traditional beacon detection schemes, the beacon is modulated by an m-sequence at the transmitting terminal with a digital differential matched filter (DDMF) array introduced at the receiving end to detect the modulated beacon. This scheme is capable of suppressing both strong interference and noise by correlation reception of the received image sequence. In addition, the DDMF array enables each pixel of the image sensor to have its own DDMF of the same structure to process its received image sequence in parallel, thus it makes fast beacon detection possible. Theoretical analysis and an outdoor experiment have been demonstrated and show that the proposed scheme can realize fast and effective beacon detection under strong background interference conditions. Consequently, the required beacon transmission power can also be reduced dramatically.
Wang, Li; Deng, Xiangying; Liu, Haican; Zhao, Lanhua; You, Xiaolong; Dai, Pei; Wan, Kanglin; Zeng, Yanhua
2016-11-01
Mycobacterium tuberculosis is an obligate pathogenic bacterial species in the family of Mycobacteriaceae and attracts excessive immune responses which cause pathology of the lungs in active tuberculosis. The lack of more sensitive and effective diagnosis reagents advocates a further recognition for the fast diagnostic and immunological measures for tuberculosis. Here, two 12-mer peptides with core sequences of SVSVGMKPSPRP (CS1) and TMGFTAPRFPHY (CS2) were screened from a phage display random peptide library using the purified mixed tuberculosis-positive serum as a target. Enzyme-linked immunosorbent assay (ELISA) and dot immunobinding assay verified that positive phages exhibited strong binding affinity to mixed tuberculosis-positive serum. BLAST analysis showed that the two sequences may be mimotopes of the Mycobacterium tuberculosis The diagnostic potential for two synthetic mimotope peptides CS1 and CS2 was evaluated using different panels of serum samples (n = 181) by ELISA, and the diagnostic parameters were calculated. CS1 and CS2 achieved sensitivity of 89.41% and 85.88%, and specificities were 90.63% and 87.50%, respectively. We hypothesized that the diagnostic based on CS1 and CS2 may become a promising strategy to enhance the detection of Mycobacterium tuberculosis infection due to higher specificity and sensitivity. Therefore, CS1 and CS2 may possess potentials to provide an experimental basis for the diagnosis of tuberculosis. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Object tracking with adaptive HOG detector and adaptive Rao-Blackwellised particle filter
NASA Astrophysics Data System (ADS)
Rosa, Stefano; Paleari, Marco; Ariano, Paolo; Bona, Basilio
2012-01-01
Scenarios for a manned mission to the Moon or Mars call for astronaut teams to be accompanied by semiautonomous robots. A prerequisite for human-robot interaction is the capability of successfully tracking humans and objects in the environment. In this paper we present a system for real-time visual object tracking in 2D images for mobile robotic systems. The proposed algorithm is able to specialize to individual objects and to adapt to substantial changes in illumination and object appearance during tracking. The algorithm is composed by two main blocks: a detector based on Histogram of Oriented Gradient (HOG) descriptors and linear Support Vector Machines (SVM), and a tracker which is implemented by an adaptive Rao-Blackwellised particle filter (RBPF). The SVM is re-trained online on new samples taken from previous predicted positions. We use the effective sample size to decide when the classifier needs to be re-trained. Position hypotheses for the tracked object are the result of a clustering procedure applied on the set of particles. The algorithm has been tested on challenging video sequences presenting strong changes in object appearance, illumination, and occlusion. Experimental tests show that the presented method is able to achieve near real-time performances with a precision of about 7 pixels on standard video sequences of dimensions 320 × 240.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kewley, Lisa J.; Dopita, Michael A.; Sutherland, Ralph
We use the chemical evolution predictions of cosmological hydrodynamic simulations with our latest theoretical stellar population synthesis, photoionization, and shock models to predict the strong line evolution of ensembles of galaxies from z = 3 to the present day. In this paper, we focus on the brightest optical emission-line ratios, [N II]/H{alpha} and [O III]/H{beta}. We use the optical diagnostic Baldwin-Phillips-Terlevich (BPT) diagram as a tool for investigating the spectral properties of ensembles of active galaxies. We use four redshift windows chosen to exploit new near-infrared multi-object spectrographs. We predict how the BPT diagram will appear in these four redshiftmore » windows given different sets of assumptions. We show that the position of star-forming galaxies on the BPT diagram traces the interstellar medium conditions and radiation field in galaxies at a given redshift. Galaxies containing active galactic nucleus (AGN) form a mixing sequence with purely star-forming galaxies. This mixing sequence may change dramatically with cosmic time, due to the metallicity sensitivity of the optical emission-lines. Furthermore, the position of the mixing sequence may probe metallicity gradients in galaxies as a function of redshift, depending on the size of the AGN narrow-line region. We apply our latest slow shock models for gas shocked by galactic-scale winds. We show that at high redshift, galactic wind shocks are clearly separated from AGN in line ratio space. Instead, shocks from galactic winds mimic high metallicity starburst galaxies. We discuss our models in the context of future large near-infrared spectroscopic surveys.« less
Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.
1998-01-01
Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763
Population genetics of Enterocytozoon bieneusi in captive giant pandas of China.
Li, Wei; Song, Yuan; Zhong, Zhijun; Huang, Xiangming; Wang, Chengdong; Li, Caiwu; Yang, Haidi; Liu, Haifeng; Ren, Zhihua; Lan, Jingchao; Wu, Kongju; Peng, Guangneng
2017-10-18
Most studies on Enterocytozoon bieneusi are conducted based on the internal transcribed spacer (ITS) region of the rRNA gene, whereas some have examined E. bieneusi population structures. Currently, the population genetics of this pathogen in giant panda remains unknown. The objective of this study was to determine the E. bieneusi population in captive giant pandas in China. We examined 69 E. bieneusi-positive specimens from captive giant pandas in China using five loci (ITS, MS1, MS3, MS4 and MS7) to infer E. bieneusi population genetics. For multilocus genotype (MLG) analysis of E. bieneusi-positive isolates, the MS1, MS3, MS4, and MS7 microsatellite and minisatellite loci were amplified and sequenced in 48, 45, 50 and 47 specimens, respectively, generating ten, eight, nine and five types. We successfully amplified 36 specimens and sequenced all five loci, forming 24 MLGs. Multilocus sequence analysis revealed a strong and significant linkage disequilibrium (LD), indicating a clonal population. This result was further supported by measurements of pairwise intergenic LD and a standardized index of association (I S A ) from allelic profile data. The analysis in STRUCTURE suggested three subpopulations in E. bieneusi, further confirmed using right's fixation index (F ST ). Subpopulations 1 and 2 exhibited an epidemic structure, whereas subpopulation 3 had a clonal structure. Our results describe E. bieneusi population genetics in giant pandas for the first time, improving the current understanding E. bieneusi epidemiology in the studied region. These data also benefit future studies exploring potential transmission risks from pandas to other animals, including humans.
Valentini, Françoise A; Robain, Gilberte; Marti, Brigitte G
2012-01-01
Our purpose was, applying a strictly defined protocol for urethral profilometry, 1) to test the repeatability of same session rest maximum urethral closure pressure (MUCP) and 2) to search for correlation between women complaint and the changes in MUCP value (rest and dynamic tests). A population of 140 consecutive women referred for evaluation of lower urinary tract dysfunction was stratified in 4 groups according with the urinary symptoms: stress, urge, mixed incontinence and continent and in each group in 3 age groups (young, middle age and old). The sequence of tests recorded in supine position was: urethral pressure profile at rest bladder empty, after bladder filling at 250 mL (reference test), stress profile, fatigability (before (rest) and after 10 successive strong coughs), then in standing position. In all groups, there was no significant difference between the two MUCP values at rest bladder filled. In the three incontinent groups, MUCP was higher bladder empty than bladder filled (p < 0.05) except in the young sub-group. Stress incontinence led to significant decrease of MUCP during dynamic tests in the young group. MUCP was not modified after fatigability test in women with urge complaint whatever age. When recorded following a strictly defined protocol, MUCP at rest bladder filled has a good repeatability in individual. However a complex sequence of tests during urethral pressure profilometry remains discussed in middle-age and old age-groups, it allows specifying the stress component of incontinence in young women and the urgency component in all age-groups.
Nishtala, P S; Chyou, T-Y
2017-04-01
Prescription sequence symmetry analyses (PSSA) is a ubiquitous tool employed in pharmacoepidemiological research to predict adverse drug reactions (ADRs). Several studies have reported the advantage of PSSA as a method that can be applied to a large prescription database with computational ease. The objective of this study was to validate New Zealand (NZ) prescription database as a potential source for identifying ADRs using the PSSA method. We analysed de-identified individual-level prescription data for people aged 65 years and above for the period 2005 to 2014 from the pharmaceutical collections supplied by the NZ Ministry of Health. We selected six positive controls that have been previously investigated and reported for causing ADRs. The six positive controls identified were amiodarone (repeated twice), frusemide, simvastatin, lithium and fluticasone. Amiodarone and lithium have been reported to induce thyroid dysfunction. Simvastatin reported to cause muscle cramps while fluticasone is well documented to cause oral candidiasis. Thyroxine was identified as a marker drug to treat hypothyroidism associated with amiodarone and lithium. Carbimazole was identified as a marker drug to treat hyperthyroidism associated with amiodarone use. Quinine sulphate was identified as a marker drug to treat muscle cramps associated with statins. In addition, we also analysed six negative controls that are unlikely to be associated with ADRs. The main outcome measure is to determine associations with ADRs using adjusted sequence ratios (ASR), and 95% confidence intervals RESULTS AND DISCUSSION: Our analyses confirmed a significant signal for all six positive controls. Significant positive associations were noted for amiodarone [ASR = 3·57, 95% CI (3·17-4·02)], and lithium chloride induced hypothyroidism [ASR = 3·43, 95% CI (2·55-4·70)]. Amiodarone was also strongly associated with hyperthyroidism [ASR = 8·81 95% CI (5·86-13·77)]. Simvastatin was associated with muscle cramps [ASR = 1·69, 95% CI (1·61-1·77)]. Fluticasone was positively associated with oral candidiasis [ASR = 2·34, 95% CI (2·19-2·50)]. Frusemide was associated with hypokalaemia [ASR = 2·94, 95% CI (2·83-3·05]). No strong associations were noted for the negative pairs. It is important to highlight that PSSA automatically controls for all confounding factors including unknown and unmeasured confounding variables, plus the effect of temporal trend in prescriptions, and hence allows a more robust ADR detection especially when confounding factors are difficult to determine or measure. New Zealand prescription database can be a potential source to identify ADRs engaging the PSSA method, and this could complement pharmacovigilance surveillance in NZ. The PSSA can be an important method for post-marketing surveillance and monitoring of ADRs which have relatively short latency. However, the predictive validity of PSSA will be compromised in certain scenarios, particularly when sample size is small, when new drugs are in the market and data are sparse. © 2016 John Wiley & Sons Ltd.
Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H
2007-02-01
Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.
BANNAI, Hiroshi; NEMOTO, Manabu; TSUJIMURA, Koji; YAMANAKA, Takashi; MAEDA, Ken; KONDO, Takashi
2015-01-01
To increase the sensitivity of an enzyme-linked immunosorbent assay (ELISA) for equine herpesvirus type 4 (EHV-4) that uses a 12-mer peptide of glycoprotein G (gG4-12-mer: MKNNPIYSEGSL) [4], we used a longer peptide consisting of a 24-mer repeat sequence (gG4-24-mer: MKNNPIYSEGSLMLNVQHDDSIHT) as an antigen. Sera of horses experimentally infected with EHV-4 reacted much more strongly to the gG4-24-mer peptide than to the gG4-12-mer peptide. We used peptide ELISAs to test paired sera from horses naturally infected with EHV-4 (n=40). gG4-24-mer ELISA detected 37 positive samples (92.5%), whereas gG4-12-mer ELISA detected only 28 (70.0%). gG4-24-mer ELISA was much more sensitive than gG4-12-mer ELISA. PMID:26424485
Bannai, Hiroshi; Nemoto, Manabu; Tsujimura, Koji; Yamanaka, Takashi; Maeda, Ken; Kondo, Takashi
2016-02-01
To increase the sensitivity of an enzyme-linked immunosorbent assay (ELISA) for equine herpesvirus type 4 (EHV-4) that uses a 12-mer peptide of glycoprotein G (gG4-12-mer: MKNNPIYSEGSL) [4], we used a longer peptide consisting of a 24-mer repeat sequence (gG4-24-mer: MKNNPIYSEGSLMLNVQHDDSIHT) as an antigen. Sera of horses experimentally infected with EHV-4 reacted much more strongly to the gG4-24-mer peptide than to the gG4-12-mer peptide. We used peptide ELISAs to test paired sera from horses naturally infected with EHV-4 (n=40). gG4-24-mer ELISA detected 37 positive samples (92.5%), whereas gG4-12-mer ELISA detected only 28 (70.0%). gG4-24-mer ELISA was much more sensitive than gG4-12-mer ELISA.
Shankla, Manish; Aksimentiev, Aleksei
2014-01-01
Control over interactions with biomolecules holds the key to applications of graphene in biotechnology. One such application is nanopore sequencing, where a DNA molecule is electrophoretically driven through a graphene nanopore. Here, we investigate how interactions of single-stranded DNA and a graphene membrane can be controlled by electrically biasing the membrane. The results of our molecular dynamics simulations suggest that electric charge on graphene can force a DNA homopolymer to adopt a range of strikingly different conformations. The conformational response is sensitive to even very subtle nucleotide modifications, such as DNA methylation. The speed of DNA motion through a graphene nanopore is strongly affected by the graphene charge: a positive charge accelerates the motion whereas a negative charge arrests it. As a possible application of the effect, we demonstrate stop-and-go transport of DNA controlled by the charge of graphene. Such on-demand transport of DNA is essential for realizing nanopore sequencing. PMID:25296960
Evolutionary dynamics of Newcastle disease virus
Miller, P.J.; Kim, L.M.; Ip, Hon S.; Afonso, C.L.
2009-01-01
A comprehensive dataset of NDV genome sequences was evaluated using bioinformatics to characterize the evolutionary forces affecting NDV genomes. Despite evidence of recombination in most genes, only one event in the fusion gene of genotype V viruses produced evolutionarily viable progenies. The codon-associated rate of change for the six NDV proteins revealed that the highest rate of change occurred at the fusion protein. All proteins were under strong purifying (negative) selection; the fusion protein displayed the highest number of amino acids under positive selection. Regardless of the phylogenetic grouping or the level of virulence, the cleavage site motif was highly conserved implying that mutations at this site that result in changes of virulence may not be favored. The coding sequence of the fusion gene and the genomes of viruses from wild birds displayed higher yearly rates of change in virulent viruses than in viruses of low virulence, suggesting that an increase in virulence may accelerate the rate of NDV evolution. ?? 2009 Elsevier Inc.
Two complete chloroplast genome sequences of Cannabis sativa varieties.
Oh, Hyehyun; Seo, Boyoung; Lee, Seunghwan; Ahn, Dong-Ha; Jo, Euna; Park, Jin-Kyoung; Min, Gi-Sik
2016-07-01
In this study, we determined the complete chloroplast (cp) genomes from two varieties of Cannabis sativa. The genome sizes were 153,848 bp (the Korean non-drug variety, Cheungsam) and 153,854 bp (the African variety, Yoruba Nigeria). The genome structures were identical with 131 individual genes [86 protein-coding genes (PCGs), eight rRNA, and 37 tRNA genes]. Further, except for the presence of an intron in the rps3 genes of two C. sativa varieties, the cp genomes of C. sativa had conservative features similar to that of all known species in the order Rosales. To verify the position of C. sativa within the order Rosales, we conducted phylogenetic analysis by using concatenated sequences of all PCGs from 17 complete cp genomes. The resulting tree strongly supported monophyly of Rosales. Further, the family Cannabaceae, represented by C. sativa, showed close relationship with the family Moraceae. The phylogenetic relationship outlined in our study is well congruent with those previously shown for the order Rosales.
NASA Astrophysics Data System (ADS)
Shankla, Manish; Aksimentiev, Aleksei
2014-10-01
Control over interactions with biomolecules holds the key to applications of graphene in biotechnology. One such application is nanopore sequencing, where a DNA molecule is electrophoretically driven through a graphene nanopore. Here we investigate how interactions of single-stranded DNA and a graphene membrane can be controlled by electrically biasing the membrane. The results of our molecular dynamics simulations suggest that electric charge on graphene can force a DNA homopolymer to adopt a range of strikingly different conformations. The conformational response is sensitive to even very subtle nucleotide modifications, such as DNA methylation. The speed of DNA motion through a graphene nanopore is strongly affected by the graphene charge: a positive charge accelerates the motion, whereas a negative charge arrests it. As a possible application of the effect, we demonstrate stop-and-go transport of DNA controlled by the charge of graphene. Such on-demand transport of DNA is essential for realizing nanopore sequencing.
Du, Q S; Ma, Y; Xie, N Z; Huang, R B
2014-01-01
In the design of peptide inhibitors the huge possible variety of the peptide sequences is of high concern. In collaboration with the fast accumulation of the peptide experimental data and database, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR) one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are the functions of physicochemical properties and the sequence positions. In the prediction equation two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized based on the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of new designed peptide inhibitors. The two-level prediction network can be applied to the peptide inhibitor design that may aim for different target proteins, or different positions of a protein. A notable advantage of the two-level statistical algorithm is that there is no need for host protein structural information. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions
Chica, Claudia; Diella, Francesca; Gibson, Toby J.
2009-01-01
Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
Accetto, Tomaž; Avguštin, Gorazd
2011-01-01
The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of SD sequence, drop in GC content and consequently reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, non-specific nuclease NucB. Further, NucB level in P. bryantii TC1-1, contrary to hMGFP level in Escherichia coli, was five times lower when SD sequence formed part of the secondary structure with a folding energy -5,2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during the translation initiation in P. bryanttii TC1-1 and possibly other members of phylum Bacteroidetes although the anti SD sequence is present in 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to unstructured 5′-UTR as opposed to coding region which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of region preceding the start codons may be used as taxonomical markers. Depending on whether complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
Mohammed, Manal A F; Galbraith, Sareen E; Radford, Alan D; Dove, Winifred; Takasaki, Tomohiko; Kurane, Ichiro; Solomon, Tom
2011-07-01
Japanese encephalitis virus (JEV) is the most important cause of epidemic encephalitis worldwide but its origin is unknown. Epidemics of encephalitis suggestive of Japanese encephalitis (JE) were described in Japan from the 1870s onwards. Four genotypes of JEV have been characterised and representatives of each genotype have been fully sequenced. Based on limited information, a single isolate from Malaysia is thought to represent a putative fifth genotype. We have determined the complete nucleotide and amino acid sequence of Muar strain and compared it with other fully sequenced JEV genomes. Muar was the least similar, with nucleotide divergence ranging from 20.2 to 21.2% and amino acid divergence ranging from 8.5 to 9.9%. Phylogenetic analysis of Muar strain revealed that it does represent a distinct fifth genotype of JEV. We elucidated Muar signature amino acids in the envelope (E) protein, including E327 Glu on the exposed lateral surface of the putative receptor binding domain which distinguishes Muar strain from the other four genotypes. Evolutionary analysis of full-length JEV genomes revealed that the mean evolutionary rate is 4.35 × 10(-4) (3.4906 × 10(-4) to 5.303 × 10(-4)) nucleotides substitutions per site per year and suggests JEV originated from its ancestral virus in the mid 1500s in the Indonesia-Malaysia region and evolved there into different genotypes, which then spread across Asia. No strong evidence for positive selection was found between JEV strains of the five genotypes and the E gene has generally been subjected to strong purifying selection. Copyright © 2011 Elsevier B.V. All rights reserved.
Chen, Tsung-Chi; Huang, Ching-Wen; Kuo, Yan-Wen; Liu, Fang-Lin; Yuan, Chao-Hsiu Hsuan; Hsu, Hei-Ti; Yeh, Shyi-Dong
2006-12-01
ABSTRACT The NSs protein of Watermelon silver mottle virus (WSMoV) was expressed by a Zucchini yellow mosaic virus (ZYMV) vector in squash. The expressed NSs protein with a histidine tag and an additional NIa protease cleavage sequence was isolated by Ni(2+)-NTA resins as a free-form protein and further eluted after sodium dodecyl sulfate-polyacrylamide gel electrophoresis for production of rabbit antiserum and mouse monoclonal antibodies (MAbs). The rabbit antiserum strongly reacted with the NSs crude antigen of WSMoV and weakly reacted with that of a high-temperature-recovered gloxinia isolate (HT-1) of Capsicum chlorosis virus (CaCV), but not with that of Calla lily chlorotic spot virus (CCSV). In contrast, the MAbs reacted strongly with all crude NSs antigens of WSMoV, CaCV, and CCSV. Various deletions of the NSs open reading frame were constructed and expressed by ZYMV vector. Results indicate that all three MAbs target the 89- to 125-amino-acid (aa) region of WSMoV NSs protein. Two indispensable residues of cysteine and lysine were essential for MAbs recognition. Sequence comparison of the deduced MAbs-recognized region with the reported tospoviral NSs proteins revealed the presence of a consensus sequence VRKPGVKNTGCKFTMHNQIFNPN (denoted WNSscon), at the 98- to 120-aa position of NSs proteins, sharing 86 to 100% identities among those of WSMoV, CaCV, CCSV, and Peanut bud necrosis virus. A synthetic WNSscon peptide reacted with the MAbs and verified that the epitopes are present in the 98- to 120-aa region of WSMoV NSs protein. The WSMoV sero-group-specific NSs MAbs provide a means for reliable identification of tospoviruses in this large serogroup.
A model of directional selection applied to the evolution of drug resistance in HIV-1.
Seoighe, Cathal; Ketwaroo, Farahnaz; Pillay, Visva; Scheffler, Konrad; Wood, Natasha; Duffet, Rodger; Zvelebil, Marketa; Martinson, Neil; McIntyre, James; Morris, Lynn; Hide, Winston
2007-04-01
Understanding how pathogens acquire resistance to drugs is important for the design of treatment strategies, particularly for rapidly evolving viruses such as HIV-1. Drug treatment can exert strong selective pressures and sites within targeted genes that confer resistance frequently evolve far more rapidly than the neutral rate. Rapid evolution at sites that confer resistance to drugs can be used to help elucidate the mechanisms of evolution of drug resistance and to discover or corroborate novel resistance mutations. We have implemented standard maximum likelihood methods that are used to detect diversifying selection and adapted them for use with serially sampled reverse transcriptase (RT) coding sequences isolated from a group of 300 HIV-1 subtype C-infected women before and after single-dose nevirapine (sdNVP) to prevent mother-to-child transmission. We have also extended the standard models of codon evolution for application to the detection of directional selection. Through simulation, we show that the directional selection model can provide a substantial improvement in sensitivity over models of diversifying selection. Five of the sites within the RT gene that are known to harbor mutations that confer resistance to nevirapine (NVP) strongly supported the directional selection model. There was no evidence that other mutations that are known to confer NVP resistance were selected in this cohort. The directional selection model, applied to serially sampled sequences, also had more power than the diversifying selection model to detect selection resulting from factors other than drug resistance. Because inference of selection from serial samples is unlikely to be adversely affected by recombination, the methods we describe may have general applicability to the analysis of positive selection affecting recombining coding sequences when serially sampled data are available.
de Bruin, Donny; Bossert, Nelli; Aartsma-Rus, Annemieke; Bouwmeester, Dirk
2018-04-06
Short nucleic acid oligomers have found a wide range of applications in experimental physics, biology and medicine, and show potential for the treatment of acquired and genetic diseases. These applications rely heavily on the predictability of hybridization through Watson-Crick base pairing to allow positioning on a nanometer scale, as well as binding to the target transcripts, but also off-target binding to transcripts with partial homology. These effects are of particular importance in the development of therapeutic oligonucleotides, where off-target effects caused by the binding of mismatched sequences need to be avoided. We employ a novel method of probing DNA hybridization using optically active DNA-stabilized silver clusters (Ag-DNA) to measure binding efficiencies through a change in fluorescence intensity. In this way we can determine their location-specific sensitivity to individual mismatches in the sequence. The results reveal a strong dependence of the hybridization on the location of the mismatch, whereby mismatches close to the edges and center show a relatively minor impact. In parallel, we propose a simple model for calculating the annealing ratios of mismatched DNA sequences, which supports our experimental results. The primary result shown in this work is a demonstration of a novel technique to measure DNA hybridization using fluorescent Ag-DNA. With this technique, we investigated the effect of mismatches on the hybridization efficiency, and found a significant dependence on the location of individual mismatches. These effects are strongly influenced by the length of the used oligonucleotides. The novel probe method based on fluorescent Ag-DNA functions as a reliable tool in measuring this behavior. As a secondary result, we formulated a simple model that is consistent with the experimental data.
Guo, Yabin; Levin, Henry L
2010-02-01
The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress.
Guo, Yabin; Levin, Henry L.
2010-01-01
The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress. PMID:20040583
Planes, S; Doherty, P J; Bernardi, G
2001-11-11
Acanthochromis polyacanthus is an unusual tropical marine damselfish that uniquely lacks pelagic larvae and has lost the capacity for broad-scale dispersal among coral reefs. On the modern Great Barrier Reef (GBR), three color morphs meet and hydridize at two zones of secondary contact. Allozyme electrophoreses revealed strong differences between morphs from the southern zone but few differences between morphs from the northern counterpart, thus suggesting different contact histories. We explore the phylogeography of Acanthochromis polyacanthus with mitochondrial cytochrome b region sequences (alignment of 565 positions) obtained from 126 individuals representing seven to 12 fish from 13 sites distributed over 12 reefs of the GBR and the Coral Sea. The samples revealed three major clades: (1) black fish collected from the southern GBR; (2) bicolored fish collected from the GBR and one reef (Osprey) from the northern Coral Sea; (3) black and white monomorphs collected from six reefs in the Coral Sea. All three clades were well supported (72-100%) by bootstrap analyses. Sequence divergences were very high between the major clades (mean = 7.6%) as well as within them (2.0-3.6%). Within clades, most reefs segregated as monophyletic assemblages. This was revealed both by phylogenetic analyses and AMOVAs that showed that 72-90% of the variance originated from differences among groups, whereas only 5-13% originated within populations. These patterns are discussed in relation to the known geological history of coral reefs of the GBR and the Coral Sea. Finally, we ask whether the monospecific status of Acanthochromis should be revisited because the sequence divergences found among our samples is substantially greater than those recorded among well-recognized species in other reef fishes.
Clarke, Brian; Mahapatra, Mana; Friedgut, Orly; Bumbarov, Velizar
2017-01-01
Peste-des-petits ruminants (PPR) is one of the most important infectious diseases of domesticated small ruminants. From the initial identification in 1942 in West Africa, PPR virus (PPRV) has spread throughout much of the developing world. PPRV is now considered endemic throughout Africa, with the notable exception of South Africa, the Middle-East and Israel, as well as South-, East-, and Central Asia. Despite this widespread dispersal, the evolution and transmission of PPRV in endemic populations is not well understood. This understanding will be critical in the planning of rational measures to eradicate PPRV by the planned time as defined by the FAO and OIE. To further advance the understanding of the evolution of PPRV the full genome sequence of 18 viruses isolated from Israel from consecutive years between 1997–2014 were generated. This data set is unique and crucial for the understanding of the evolution of PPRV, as it represents the first set of full-length sequence data available from consecutive years from a single geographic location. Analysis of these full genome sequences shows 96.2–99.9% nucleotide conservation across the Israel isolates and further demonstrates the strong purifying selection pressures on PPRV within Israel and globally. Four amino acid substitutions indicative of putative positive selection were additionally identified within the Israel isolates. The mean substitution rate per site per year was estimated to be 9.22 x 10−4 (95% HPD 6.206 x 10−4–1.26 x 10−3). Using Bayesian and phylogenetic analyses we further demonstrate that the PPRV isolates from Israel belongs to linage IV and form a single strong regional cluster within all other lineage IV viruses circulating worldwide implying a single incursion into Israel. PMID:28545149
Clarke, Brian; Mahapatra, Mana; Friedgut, Orly; Bumbarov, Velizar; Parida, Satya
2017-01-01
Peste-des-petits ruminants (PPR) is one of the most important infectious diseases of domesticated small ruminants. From the initial identification in 1942 in West Africa, PPR virus (PPRV) has spread throughout much of the developing world. PPRV is now considered endemic throughout Africa, with the notable exception of South Africa, the Middle-East and Israel, as well as South-, East-, and Central Asia. Despite this widespread dispersal, the evolution and transmission of PPRV in endemic populations is not well understood. This understanding will be critical in the planning of rational measures to eradicate PPRV by the planned time as defined by the FAO and OIE. To further advance the understanding of the evolution of PPRV the full genome sequence of 18 viruses isolated from Israel from consecutive years between 1997-2014 were generated. This data set is unique and crucial for the understanding of the evolution of PPRV, as it represents the first set of full-length sequence data available from consecutive years from a single geographic location. Analysis of these full genome sequences shows 96.2-99.9% nucleotide conservation across the Israel isolates and further demonstrates the strong purifying selection pressures on PPRV within Israel and globally. Four amino acid substitutions indicative of putative positive selection were additionally identified within the Israel isolates. The mean substitution rate per site per year was estimated to be 9.22 x 10-4 (95% HPD 6.206 x 10-4-1.26 x 10-3). Using Bayesian and phylogenetic analyses we further demonstrate that the PPRV isolates from Israel belongs to linage IV and form a single strong regional cluster within all other lineage IV viruses circulating worldwide implying a single incursion into Israel.
Genetic variability and evolutionary dynamics of viruses of the family Closteroviridae
Rubio, Luis; Guerri, José; Moreno, Pedro
2013-01-01
RNA viruses have a great potential for genetic variation, rapid evolution and adaptation. Characterization of the genetic variation of viral populations provides relevant information on the processes involved in virus evolution and epidemiology and it is crucial for designing reliable diagnostic tools and developing efficient and durable disease control strategies. Here we performed an updated analysis of sequences available in Genbank and reviewed present knowledge on the genetic variability and evolutionary processes of viruses of the family Closteroviridae. Several factors have shaped the genetic structure and diversity of closteroviruses. (I) A strong negative selection seems to be responsible for the high genetic stability in space and time for some viruses. (2) Long distance migration, probably by human transport of infected propagative plant material, have caused that genetically similar virus isolates are found in distant geographical regions. (3) Recombination between divergent sequence variants have generated new genotypes and plays an important role for the evolution of some viruses of the family Closteroviridae. (4) Interaction between virus strains or between different viruses in mixed infections may alter accumulation of certain strains. (5) Host change or virus transmission by insect vectors induced changes in the viral population structure due to positive selection of sequence variants with higher fitness for host-virus or vector-virus interaction (adaptation) or by genetic drift due to random selection of sequence variants during the population bottleneck associated to the transmission process. PMID:23805130
Lederer, Christoph; Heider, Dominik; van den Boom, Johannes; Hoffmann, Daniel; Mueller, Jonathan W.; Bayer, Peter
2011-01-01
Peptidyl-prolyl cis/trans isomerases (PPIases) are enzymes assisting protein folding and protein quality control in organisms of all kingdoms of life. In contrast to the other sub-classes of PPIases, the cyclophilins and the FK-506 binding proteins, little was formerly known about the parvulin type of PPIase in Archaea. Recently, the first solution structure of an archaeal parvulin, the PinA protein from Cenarchaeum symbiosum, was reported. Investigation of occurrence and frequency of PPIase sequences in numerous archaeal genomes now revealed a strong tendency for thermophilic microorganisms to reduce the number of PPIases. Single-domain parvulins were mostly found in the genomes of recently proposed deep-branching archaeal subgroups, the Thaumarchaeota and the ARMANs (archaeal Richmond Mine acidophilic nanoorganisms). Hence, we used the parvulin sequence to reclassify available archaeal metagenomic contigs, thereby, adding new members to these subgroups. A combination of genomic background analysis and phylogenetic approaches of parvulin sequences suggested that the assigned sequences belong to at least two distinct groups of Thaumarchaeota. Finally, machine learning approaches were applied to identify amino acid residues that separate archaeal and bacterial parvulin proteins from each other. When mapped onto the recent PinA solution structure, most of these positions form a cluster at one site of the protein possibly indicating a different functionality of the two groups of parvulin proteins. PMID:22065628
Carissimo, Guillaume; Eiglmeier, Karin; Reveillaud, Julie; Holm, Inge; Diallo, Mawlouth; Diallo, Diawo; Vantaux, Amélie; Kim, Saorin; Ménard, Didier; Siv, Sovannaroth; Belda, Eugeni; Bischoff, Emmanuel; Antoniewski, Christophe; Vernick, Kenneth D.
2016-01-01
Mosquitoes of the Anopheles gambiae complex display strong preference for human bloodmeals and are major malaria vectors in Africa. However, their interaction with viruses or role in arbovirus transmission during epidemics has been little examined, with the exception of O’nyong-nyong virus, closely related to Chikungunya virus. Deep-sequencing has revealed different RNA viruses in natural insect viromes, but none have been previously described in the Anopheles gambiae species complex. Here, we describe two novel insect RNA viruses, a Dicistrovirus and a Cypovirus, found in laboratory colonies of An. gambiae taxa using small-RNA deep sequencing. Sequence analysis was done with Metavisitor, an open-source bioinformatic pipeline for virus discovery and de novo genome assembly. Wild-collected Anopheles from Senegal and Cambodia were positive for the Dicistrovirus and Cypovirus, displaying high sequence identity to the laboratory-derived virus. Thus, the Dicistrovirus (Anopheles C virus, AnCV) and Cypovirus (Anopheles Cypovirus, AnCPV) are components of the natural virome of at least some anopheline species. Their possible influence on mosquito immunity or transmission of other pathogens is unknown. These natural viruses could be developed as models for the study of Anopheles-RNA virus interactions in low security laboratory settings, in an analogous manner to the use of rodent malaria parasites for studies of mosquito anti-parasite immunity. PMID:27138938
DOE Office of Scientific and Technical Information (OSTI.GOV)
Birge, J. R.; Qi, L.; Wei, Z.
In this paper we give a variant of the Topkis-Veinott method for solving inequality constrained optimization problems. This method uses a linearly constrained positive semidefinite quadratic problem to generate a feasible descent direction at each iteration. Under mild assumptions, the algorithm is shown to be globally convergent in the sense that every accumulation point of the sequence generated by the algorithm is a Fritz-John point of the problem. We introduce a Fritz-John (FJ) function, an FJ1 strong second-order sufficiency condition (FJ1-SSOSC), and an FJ2 strong second-order sufficiency condition (FJ2-SSOSC), and then show, without any constraint qualification (CQ), that (i) ifmore » an FJ point z satisfies the FJ1-SSOSC, then there exists a neighborhood N(z) of z such that, for any FJ point y element of N(z) {l_brace}z {r_brace} , f{sub 0}(y) {ne} f{sub 0}(z) , where f{sub 0} is the objective function of the problem; (ii) if an FJ point z satisfies the FJ2-SSOSC, then z is a strict local minimum of the problem. The result (i) implies that the entire iteration point sequence generated by the method converges to an FJ point. We also show that if the parameters are chosen large enough, a unit step length can be accepted by the proposed algorithm.« less
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
Lee, Seung-Bum; Kaittanis, Charalambos; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry
2006-01-01
Background Cotton (Gossypium hirsutum) is the most important fiber crop grown in 90 countries. In 2004–2005, US farmers planted 79% of the 5.7-million hectares of nuclear transgenic cotton. Unfortunately, genetically modified cotton has the potential to hybridize with other cultivated and wild relatives, resulting in geographical restrictions to cultivation. However, chloroplast genetic engineering offers the possibility of containment because of maternal inheritance of transgenes. The complete chloroplast genome of cotton provides essential information required for genetic engineering. In addition, the sequence data were used to assess phylogenetic relationships among the major clades of rosids using cotton and 25 other completely sequenced angiosperm chloroplast genomes. Results The complete cotton chloroplast genome is 160,301 bp in length, with 112 unique genes and 19 duplicated genes within the IR, containing a total of 131 genes. There are four ribosomal RNAs, 30 distinct tRNA genes and 17 intron-containing genes. The gene order in cotton is identical to that of tobacco but lacks rpl22 and infA. There are 30 direct and 24 inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Most of the direct repeats are within intergenic spacer regions, introns and a 72 bp-long direct repeat is within the psaA and psaB genes. Comparison of protein coding sequences with expressed sequence tags (ESTs) revealed nucleotide substitutions resulting in amino acid changes in ndhC, rpl23, rpl20, rps3 and clpP. Phylogenetic analysis of a data set including 61 protein-coding genes using both maximum likelihood and maximum parsimony were performed for 28 taxa, including cotton and five other angiosperm chloroplast genomes that were not included in any previous phylogenies. Conclusion Cotton chloroplast genome lacks rpl22 and infA and contains a number of dispersed direct and inverted repeats. RNA editing resulted in amino acid changes with significant impact on their hydropathy. Phylogenetic analysis provides strong support for the position of cotton in the Malvales in the eurosids II clade sister to Arabidopsis in the Brassicales. Furthermore, there is strong support for the placement of the Myrtales sister to the eurosid I clade, although expanded taxon sampling is needed to further test this relationship. PMID:16553962
Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise
2011-01-01
Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157
Liakhovetskiĭ, V A; Bobrova, E V; Skopin, G N
2012-01-01
Transposition errors during the reproduction of a hand movement sequence make it possible to receive important information on the internal representation of this sequence in the motor working memory. Analysis of such errors showed that learning to reproduce sequences of the left-hand movements improves the system of positional coding (coding ofpositions), while learning of the right-hand movements improves the system of vector coding (coding of movements). Learning of the right-hand movements after the left-hand performance involved the system of positional coding "imposed" by the left hand. Learning of the left-hand movements after the right-hand performance activated the system of vector coding. Transposition errors during learning to reproduce movement sequences can be explained by neural network using either vector coding or both vector and positional coding.
Real time simulation of computer-assisted sequencing of terminal area operations
NASA Technical Reports Server (NTRS)
Dear, R. G.
1981-01-01
A simulation was developed to investigate the utilization of computer assisted decision making for the task of sequencing and scheduling aircraft in a high density terminal area. The simulation incorporates a decision methodology termed Constrained Position Shifting. This methodology accounts for aircraft velocity profiles, routes, and weight classes in dynamically sequencing and scheduling arriving aircraft. A sample demonstration of Constrained Position Shifting is presented where six aircraft types (including both light and heavy aircraft) are sequenced to land at Denver's Stapleton International Airport. A graphical display is utilized and Constrained Position Shifting with a maximum shift of four positions (rearward or forward) is compared to first come, first serve with respect to arrival at the runway. The implementation of computer assisted sequencing and scheduling methodologies is investigated. A time based control concept will be required and design considerations for such a system are discussed.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun
2012-01-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences.1 As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity.” Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050
Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun
2012-02-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity." Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.
Transcriptome-Mining for Single-Copy Nuclear Markers in Ferns
Rothfels, Carl J.; Larsson, Anders; Li, Fay-Wei; Sigel, Erin M.; Huiet, Layne; Burge, Dylan O.; Ruhsam, Markus; Graham, Sean W.; Stevenson, Dennis W.; Wong, Gane Ka-Shu; Korall, Petra; Pryer, Kathleen M.
2013-01-01
Background Molecular phylogenetic investigations have revolutionized our understanding of the evolutionary history of ferns—the second-most species-rich major group of vascular plants, and the sister clade to seed plants. The general absence of genomic resources available for this important group of plants, however, has resulted in the strong dependence of these studies on plastid data; nuclear or mitochondrial data have been rarely used. In this study, we utilize transcriptome data to design primers for nuclear markers for use in studies of fern evolutionary biology, and demonstrate the utility of these markers across the largest order of ferns, the Polypodiales. Principal Findings We present 20 novel single-copy nuclear regions, across 10 distinct protein-coding genes: ApPEFP_C, cryptochrome 2, cryptochrome 4, DET1, gapCpSh, IBR3, pgiC, SQD1, TPLATE, and transducin. These loci, individually and in combination, show strong resolving power across the Polypodiales phylogeny, and are readily amplified and sequenced from our genomic DNA test set (from 15 diploid Polypodiales species). For each region, we also present transcriptome alignments of the focal locus and related paralogs—curated broadly across ferns—that will allow researchers to develop their own primer sets for fern taxa outside of the Polypodiales. Analyses of sequence data generated from our genomic DNA test set reveal strong effects of partitioning schemes on support levels and, to a much lesser extent, on topology. A model partitioned by codon position is strongly favored, and analyses of the combined data yield a Polypodiales phylogeny that is well-supported and consistent with earlier studies of this group. Conclusions The 20 single-copy regions presented here more than triple the single-copy nuclear regions available for use in ferns. They provide a much-needed opportunity to assess plastid-derived hypotheses of relationships within the ferns, and increase our capacity to explore aspects of fern evolution previously unavailable to scientific investigation. PMID:24116189
Molecular evolution of psbA gene in ferns: unraveling selective pressure and co-evolutionary pattern
2012-01-01
Background The photosynthetic oxygen-evolving photo system II (PS II) produces almost the entire oxygen in the atmosphere. This unique biochemical system comprises a functional core complex that is encoded by psbA and other genes. Unraveling the evolutionary dynamics of this gene is of particular interest owing to its direct role in oxygen production. psbA underwent gene duplication in leptosporangiates, in which both copies have been preserved since. Because gene duplication is often followed by the non-fictionalization of one of the copies and its subsequent erosion, preservation of both psbA copies pinpoint functional or regulatory specialization events. The aim of this study was to investigate the molecular evolution of psbA among fern lineages. Results We sequenced psbA , which encodes D1 protein in the core complex of PSII, in 20 species representing 8 orders of extant ferns; then we searched for selection and convolution signatures in psbA across the 11 fern orders. Collectively, our results indicate that: (1) selective constraints among D1 protein relaxed after the duplication in 4 leptosporangiate orders; (2) a handful positively selected codons were detected within species of single copy psbA, but none in duplicated ones; (3) a few sites among D1 protein were involved in co-evolution process which may intimate significant functional/structural communications between them. Conclusions The strong competition between ferns and angiosperms for light may have been the main cause for a continuous fixation of adaptive amino acid changes in psbA , in particular after its duplication. Alternatively, a single psbA copy may have undergone bursts of adaptive changes at the molecular level to overcome angiosperms competition. The strong signature of positive Darwinian selection in a major part of D1 protein is testament to this. At the same time, species own two psbA copies hardly have positive selection signals among the D1 protein coding sequences. In this study, eleven co-evolving sites have been detected via different molecules, which may be more important than others. PMID:22899792
Pedersen, Niels C; Liu, Hongwei; Durden, Monica; Lyons, Leslie A
2016-03-01
A previous study demonstrated the existence of a natural resistance to feline infectious peritonitis virus (FIPV) among 36% of randomly bred laboratory cats. A genome wide association study (GWAS) on this population suggested that resistance was polygenic but failed to identify any strong specific associations. In order to enhance the power of GWAS or whole genome sequencing to identify strong genetic associations, a decision was made to positively select for resistance over three generations. The inbreeding experiment began with a genetically related parental (P) population consisting of three toms and four queens identified from among the survivors of the earlier study and belonging to a closely related subgroup (B). The subsequent effects of inbreeding were measured using 42 genome-wide STR markers. P generation cats produced 57 first filial (F1) kittens, only five of which (9.0%) demonstrated a natural resistance to FIPV infection. One of these five F1 survivors was then used to produce six F1/P-backcrosses kittens, only one of which proved resistant to FIP. Six of eight of the F1 and F1/P survivors succumbed to a secondary exposure 4-12 months later. Therefore, survival after both primary and secondary infection was decreased rather than increased by positive selection for resistance. The common genetic factor associated with this diminished resistance was a loss of heterozygosity. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Phytophthora-ID.org: A sequence-based Phytophthora identification tool
N.J. Grünwald; F.N. Martin; M.M. Larsen; C.M. Sullivan; C.M. Press; M.D. Coffey; E.M. Hansen; J.L. Parke
2010-01-01
Contemporary species identification relies strongly on sequence-based identification, yet resources for identification of many fungal and oomycete pathogens are rare. We developed two web-based, searchable databases for rapid identification of Phytophthora spp. based on sequencing of the internal transcribed spacer (ITS) or the cytochrome oxidase...
Chang, Yun-Juan; Peacock, Aaron D.; Long, Philip E.; Stephen, John R.; McKinley, James P.; Macnaughton, Sarah J.; Hussain, A. K. M. Anwar; Saxton, Arnold M.; White, David C.
2001-01-01
Microbially mediated reduction and immobilization of U(VI) to U(IV) plays a role in both natural attenuation and accelerated bioremediation of uranium-contaminated sites. To realize bioremediation potential and accurately predict natural attenuation, it is important to first understand the microbial diversity of such sites. In this paper, the distribution of sulfate-reducing bacteria (SRB) in contaminated groundwater associated with a uranium mill tailings disposal site at Shiprock, N.Mex., was investigated. Two culture-independent analyses were employed: sequencing of clone libraries of PCR-amplified dissimilatory sulfite reductase (DSR) gene fragments and phospholipid fatty acid (PLFA) biomarker analysis. A remarkable diversity among the DSR sequences was revealed, including sequences from δ-Proteobacteria, gram-positive organisms, and the Nitrospira division. PLFA analysis detected at least 52 different mid-chain-branched saturate PLFA and included a high proportion of 10me16:0. Desulfotomaculum and Desulfotomaculum-like sequences were the most dominant DSR genes detected. Those belonging to SRB within δ-Proteobacteria were mainly recovered from low-uranium (≤302 ppb) samples. One Desulfotomaculum-like sequence cluster overwhelmingly dominated high-U (>1,500 ppb) sites. Logistic regression showed a significant influence of uranium concentration over the dominance of this cluster of sequences (P = 0.0001). This strong association indicates that Desulfotomaculum has remarkable tolerance and adaptation to high levels of uranium and suggests the organism's possible involvement in natural attenuation of uranium. The in situ activity level of Desulfotomaculum in uranium-contaminated environments and its comparison to the activities of other SRB and other functional groups should be an important area for future research. PMID:11425735
Scop3D: three-dimensional visualization of sequence conservation.
Vermeire, Tessa; Vermaere, Stijn; Schepens, Bert; Saelens, Xavier; Van Gucht, Steven; Martens, Lennart; Vandermarliere, Elien
2015-04-01
The integration of a protein's structure with its known sequence variation provides insight on how that protein evolves, for instance in terms of (changing) function or immunogenicity. Yet, collating the corresponding sequence variants into a multiple sequence alignment, calculating each position's conservation, and mapping this information back onto a relevant structure is not straightforward. We therefore built the Sequence Conservation on Protein 3D structure (scop3D) tool to perform these tasks automatically. The output consists of two modified PDB files in which the B-values for each position are replaced by the percentage sequence conservation, or the information entropy for each position, respectively. Furthermore, text files with absolute and relative amino acid occurrences for each position are also provided, along with snapshots of the protein from six distinct directions in space. The visualization provided by scop3D can for instance be used as an aid in vaccine development or to identify antigenic hotspots, which we here demonstrate based on an analysis of the fusion proteins of human respiratory syncytial virus and mumps virus. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A
2008-01-01
Background Chloroplast genome sequences are extremely informative about species-interrelationships owing to its non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in buckwheat chloroplast genome is similar to Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the bayesian phylogeny inference based on first two codon positions Amborella united with Nymphaeales. Conclusion Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids. PMID:18492277
kpLogo: positional k-mer analysis reveals hidden specificity in biological sequences
2017-01-01
Abstract Motifs of only 1–4 letters can play important roles when present at key locations within macromolecules. Because existing motif-discovery tools typically miss these position-specific short motifs, we developed kpLogo, a probability-based logo tool for integrated detection and visualization of position-specific ultra-short motifs from a set of aligned sequences. kpLogo also overcomes the limitations of conventional motif-visualization tools in handling positional interdependencies and utilizing ranked or weighted sequences increasingly available from high-throughput assays. kpLogo can be found at http://kplogo.wi.mit.edu/. PMID:28460012
Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F
2016-01-01
The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.
enoLOGOS: a versatile web tool for energy normalized sequence logos
Workman, Christopher T.; Yin, Yutong; Corcoran, David L.; Ideker, Trey; Stormo, Gary D.; Benos, Panayiotis V.
2005-01-01
enoLOGOS is a web-based tool that generates sequence logos from various input sources. Sequence logos have become a popular way to graphically represent DNA and amino acid sequence patterns from a set of aligned sequences. Each position of the alignment is represented by a column of stacked symbols with its total height reflecting the information content in this position. Currently, the available web servers are able to create logo images from a set of aligned sequences, but none of them generates weighted sequence logos directly from energy measurements or other sources. With the advent of high-throughput technologies for estimating the contact energy of different DNA sequences, tools that can create logos directly from binding affinity data are useful to researchers. enoLOGOS generates sequence logos from a variety of input data, including energy measurements, probability matrices, alignment matrices, count matrices and aligned sequences. Furthermore, enoLOGOS can represent the mutual information of different positions of the consensus sequence, a unique feature of this tool. Another web interface for our software, C2H2-enoLOGOS, generates logos for the DNA-binding preferences of the C2H2 zinc-finger transcription factor family members. enoLOGOS and C2H2-enoLOGOS are accessible over the web at . PMID:15980495
Does CTCF mediate between nuclear organization and gene expression?
Ohlsson, Rolf; Lobanenkov, Victor; Klenova, Elena
2010-01-01
The multifunctional zinc-finger protein CCCTC-binding factor (CTCF) is a very strong candidate for the role of coordinating the expression level of coding sequences with their three-dimensional position in the nucleus, apparently responding to a "code" in the DNA itself. Dynamic interactions between chromatin fibers in the context of nuclear architecture have been implicated in various aspects of genome functions. However, the molecular basis of these interactions still remains elusive and is a subject of intense debate. Here we discuss the nature of CTCF-DNA interactions, the CTCF-binding specificity to its binding sites and the relationship between CTCF and chromatin, and we examine data linking CTCF with gene regulation in the three-dimensional nuclear space. We discuss why these features render CTCF a very strong candidate for the role and propose a unifying model, the "CTCF code," explaining the mechanistic basis of how the information encrypted in DNA may be interpreted by CTCF into diverse nuclear functions.
Obwaller, A; Duchêne, M; Bruhn, H; Steipe, B; Tripp, C; Kraft, D; Wiedermann, G; Auer, H; Aspöck, H
2001-05-01
Myosins from nematode parasites elicit strong humoral and cellular immune responses and have been investigated as vaccine candidates. In this study we cloned and sequenced a cDNA coding for myosin heavy chain from Toxocara canis, a nematode parasite of canids which may also infect humans and cause various unspecific symptoms. To determine the major antigenic regions the myosin heavy chain was systematically dissected into ten overlapping recombinant fusion polypeptides which were purified by metal chelate chromatography. Single fragments were then tested for their IgG reactivity in sera from toxocarosis patients and healthy probands. Two regions, one region at the mid to carboxy-terminal end of the head domain and one region in the rod domain, were identified as major antigens, which in combination were positive with 86% of the sera. The other domains were less reactive. This shows that the patients' IgG reactivity was not directed evenly against all parts of the molecule, but was rather clustered in few regions.
The female urinary microbiome in urgency urinary incontinence.
Pearce, Meghan M; Zilliox, Michael J; Rosenfeld, Amy B; Thomas-White, Krystal J; Richter, Holly E; Nager, Charles W; Visco, Anthony G; Nygaard, Ingrid E; Barber, Matthew D; Schaffer, Joseph; Moalli, Pamela; Sung, Vivian W; Smith, Ariana L; Rogers, Rebecca; Nolen, Tracy L; Wallace, Dennis; Meikle, Susan F; Gai, Xiaowu; Wolfe, Alan J; Brubaker, Linda
2015-09-01
The purpose of this study was to characterize the urinary microbiota in women who are planning treatment for urgency urinary incontinence and to describe clinical associations with urinary symptoms, urinary tract infection, and treatment outcomes. Catheterized urine samples were collected from multisite randomized trial participants who had no clinical evidence of urinary tract infection; 16S ribosomal RNA gene sequencing was used to dichotomize participants as either DNA sequence-positive or sequence-negative. Associations with demographics, urinary symptoms, urinary tract infection risk, and treatment outcomes were determined. In sequence-positive samples, microbiotas were characterized on the basis of their dominant microorganisms. More than one-half (51.1%; 93/182) of the participants' urine samples were sequence-positive. Sequence-positive participants were younger (55.8 vs 61.3 years old; P = .0007), had a higher body mass index (33.7 vs 30.1 kg/m(2); P = .0009), had a higher mean baseline daily urgency urinary incontinence episodes (5.7 vs 4.2 episodes; P < .0001), responded better to treatment (decrease in urgency urinary incontinence episodes, -4.4 vs -3.3; P = .0013), and were less likely to experience urinary tract infection (9% vs 27%; P = .0011). In sequence-positive samples, 8 major bacterial clusters were identified; 7 clusters were dominated not only by a single genus, most commonly Lactobacillus (45%) or Gardnerella (17%), but also by other taxa (25%). The remaining cluster had no dominant genus (13%). DNA sequencing confirmed urinary bacterial DNA in many women with urgency urinary incontinence who had no signs of infection. Sequence status was associated with baseline urgency urinary incontinence episodes, treatment response, and posttreatment urinary tract infection risk. Copyright © 2015 Elsevier Inc. All rights reserved.
Ferreira, L M; Hazlewood, G P; Barker, P J; Gilbert, H J
1991-01-01
A genomic library of Pseudomonas fluorescens subsp. cellulosa DNA was constructed in pUC18 and Escherichia coli recombinants expressing 4-methylumbelliferyl beta-D-cellobioside-hydrolysing activity (MUCase) were isolated. Enzyme produced by MUCase-positive clones did not hydrolyse either cellobiose or cellotriose but converted cellotetraose into cellobiose and cleaved cellopentaose and cellohexaose, producing a mixture of cellobiose and cellotriose. There was no activity against CM-cellulose, insoluble cellulose or xylan. On this basis, the enzyme is identified as an endo-acting cellodextrinase and is designated cellodextrinase C (CELC). Nucleotide sequencing of the gene (celC) which directs the synthesis of CELC revealed an open reading frame of 2153 bp, encoding a protein of Mr 80,189. The deduced primary sequence of CELC was confirmed by the Mr of purified CELC (77,000) and by the experimentally determined N-terminus of the enzyme which was identical with residues 38-47 of the translated sequence. The N-terminal region of CELC showed strong homology with endoglucanase, xylanases and an arabinofuranosidase of Ps. fluorescens subsp. cellulosa; homologous sequences included highly conserved serine-rich regions. Full-length CELC bound tightly to crystalline cellulose. Truncated forms of celC from which the DNA sequence encoding the conserved domain had been deleted, directed the synthesis of a functional cellodextrinase that did not bind to crystalline cellulose. This is consistent with the N-terminal region of CELC comprising a non-catalytic cellulose-binding domain which is distinct from the catalytic domain. The role of the cellulose-binding region is discussed. Images Fig. 2. Fig. 6. PMID:1953673
Improving pairwise comparison of protein sequences with domain co-occurrence
Gascuel, Olivier
2018-01-01
Comparing and aligning protein sequences is an essential task in bioinformatics. More specifically, local alignment tools like BLAST are widely used for identifying conserved protein sub-sequences, which likely correspond to protein domains or functional motifs. However, to limit the number of false positives, these tools are used with stringent sequence-similarity thresholds and hence can miss several hits, especially for species that are phylogenetically distant from reference organisms. A solution to this problem is then to integrate additional contextual information to the procedure. Here, we propose to use domain co-occurrence to increase the sensitivity of pairwise sequence comparisons. Domain co-occurrence is a strong feature of proteins, since most protein domains tend to appear with a limited number of other domains on the same protein. We propose a method to take this information into account in a typical BLAST analysis and to construct new domain families on the basis of these results. We used Plasmodium falciparum as a case study to evaluate our method. The experimental findings showed an increase of 14% of the number of significant BLAST hits and an increase of 25% of the proteome area that can be covered with a domain. Our method identified 2240 new domains for which, in most cases, no model of the Pfam database could be linked. Moreover, our study of the quality of the new domains in terms of alignment and physicochemical properties show that they are close to that of standard Pfam domains. Source code of the proposed approach and supplementary data are available at: https://gite.lirmm.fr/menichelli/pairwise-comparison-with-cooccurrence PMID:29293498
Schultz, Anna Charlotte; Fonager, Jannik; Ethelberg, Steen; Dalgaard, Camilla; Adelhardt, Marianne; Engberg, Jørgen H.; Fischer, Thea Kølsen; Lassen, Sofie Gillesberg
2014-01-01
In December 2012, an outbreak of acute gastrointestinal illness occurred in a geographical distinct area in Denmark covering 368 households. A combined microbiological, epidemiological and environmental investigation was initiated to understand the outbreak magnitude, pathogen(s) and vehicle in order to control the outbreak. Norovirus GII.4 New Orleans 2009 variant was detected in 15 of 17 individual stool samples from 14 households. Norovirus genomic material from water samples was detected and quantified and sequencing of longer parts of the viral capsid region (>1000 nt) were applied to patient and water samples. All five purposely selected water samples tested positive for norovirus GII in levels up to 1.8×104 genomic units per 200 ml. Identical norovirus sequences were found in all 5 sequenced stool samples and 1 sequenced water sample, a second sequenced water sample showed 1 nt (<0.1%) difference. In a cohort study, including 256 participants, cases were defined as residents of the area experiencing diarrhoea or vomiting onset on 12–14 December 2012. We found an attack rate of 51%. Being a case was associated with drinking tap-water on 12–13 December (relative risk = 6.0, 95%CI: 1.6–22) and a dose-response relation for the mean glasses of tap-water consumed was observed. Environmental investigations suggested contamination from a sewage pipe to the drinking water due to fall in pressure during water supply system renovations. The combined microbiological, epidemiological and environmental investigations strongly indicates the outbreak was caused by norovirus contamination of the water supply system. PMID:25222495
Crossing fitness canyons by a finite population
NASA Astrophysics Data System (ADS)
Saakian, David B.; Bratus, Alexander S.; Hu, Chin-Kun
2017-06-01
We consider the Wright-Fisher model of the finite population evolution on a fitness landscape defined in the sequence space by a path of nearly neutral mutations. We study a specific structure of the fitness landscape: One of the intermediate mutations on the mutation path results in either a large fitness value (climbing up a fitness hill) or a low fitness value (crossing a fitness canyon), the rest of the mutations besides the last one are neutral, and the last sequence has much higher fitness than any intermediate sequence. We derive analytical formulas for the first arrival time of the mutant with two point mutations. For the first arrival problem for the further mutants in the case of canyon crossing, we analytically deduce how the mean first arrival time scales with the population size and fitness difference. The location of the canyon on the path of sequences has a crucial role. If the canyon is at the beginning of the path, then it significantly prolongs the first arrival time; otherwise it just slightly changes it. Furthermore, the fitness hill at the beginning of the path strongly prolongs the arrival time period; however, the hill located near the end of the path shortens it. We optimize the first arrival time by applying a nonzero selection to the intermediate sequences. We extend our results and provide a scaling for the valley crossing time via the depth of the canyon and population size in the case of a fitness canyon at the first position. Our approach is useful for understanding some complex evolution systems, e.g., the evolution of cancer.
Sjö, Nicolai Christian; von Buchwald, Christian; Cassonnet, Patricia; Norrild, Bodil; Prause, Jan Ulrik; Vinding, Troels; Heegaard, Steffen
2007-01-01
Aim To examine conjunctival papilloma and normal conjunctival tissue for the presence of human papillomavirus (HPV). Methods Archival paraffin wax‐embedded tissue from 165 conjunctival papillomas and from 20 histological normal conjunctival biopsy specimens was analysed for the presence of HPV by PCR. Specimens considered HPV positive using consensus primers, but with a negative or uncertain PCR result using type‐specific HPV probes, were analysed with DNA sequencing. Results HPV was present in 86 of 106 (81%) β‐globin‐positive papillomas. HPV type 6 was positive in 80 cases, HPV type 11 was identified in 5 cases and HPV type 45 was present in a single papilloma. All the 20 normal conjunctival biopsy specimens were β‐globin positive and HPV negative. Conclusion There is a strong association between HPV and conjunctival papilloma. The study presents the largest material of conjunctival papilloma investigated for HPV and the first investigation of HPV in normal conjunctival tissue. HPV types 6 and 11 are the most common HPV types in conjunctival papilloma. This also is the first report of HPV type 45 in conjunctival papilloma. PMID:17166894
BlackOPs: increasing confidence in variant detection through mappability filtering.
Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil
2013-10-01
Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.
A thrifty variant in CREBRF strongly influences body mass index in Samoans.
Minster, Ryan L; Hawley, Nicola L; Su, Chi-Ting; Sun, Guangyun; Kershaw, Erin E; Cheng, Hong; Buhule, Olive D; Lin, Jerome; Reupena, Muagututi'a Sefuiva; Viali, Satupa'itea; Tuitele, John; Naseri, Take; Urban, Zsolt; Deka, Ranjan; Weeks, Daniel E; McGarvey, Stephen T
2016-09-01
Samoans are a unique founder population with a high prevalence of obesity, making them well suited for identifying new genetic contributors to obesity. We conducted a genome-wide association study (GWAS) in 3,072 Samoans, discovered a variant, rs12513649, strongly associated with body mass index (BMI) (P = 5.3 × 10(-14)), and replicated the association in 2,102 additional Samoans (P = 1.2 × 10(-9)). Targeted sequencing identified a strongly associated missense variant, rs373863828 (p.Arg457Gln), in CREBRF (meta P = 1.4 × 10(-20)). Although this variant is extremely rare in other populations, it is common in Samoans (frequency of 0.259), with an effect size much larger than that of any other known common BMI risk variant (1.36-1.45 kg/m(2) per copy of the risk-associated allele). In comparison to wild-type CREBRF, the Arg457Gln variant when overexpressed selectively decreased energy use and increased fat storage in an adipocyte cell model. These data, in combination with evidence of positive selection of the allele encoding p.Arg457Gln, support a 'thrifty' variant hypothesis as a factor in human obesity.
Shiau, LieJune; Schwalger, Tilo; Lindner, Benjamin
2015-06-01
We study the spike statistics of an adaptive exponential integrate-and-fire neuron stimulated by white Gaussian current noise. We derive analytical approximations for the coefficient of variation and the serial correlation coefficient of the interspike interval assuming that the neuron operates in the mean-driven tonic firing regime and that the stochastic input is weak. Our result for the serial correlation coefficient has the form of a geometric sequence and is confirmed by the comparison to numerical simulations. The theory predicts various patterns of interval correlations (positive or negative at lag one, monotonically decreasing or oscillating) depending on the strength of the spike-triggered and subthreshold components of the adaptation current. In particular, for pure subthreshold adaptation we find strong positive ISI correlations that are usually ascribed to positive correlations in the input current. Our results i) provide an alternative explanation for interspike-interval correlations observed in vivo, ii) may be useful in fitting point neuron models to experimental data, and iii) may be instrumental in exploring the role of adaptation currents for signal detection and signal transmission in single neurons.
Human leukocyte antigen (HLA) pharmacogenomic tests: potential and pitfalls.
Daly, Ann K
2014-02-01
Adverse drug reactions involving a range of prescribed drugs and affecting the skin, liver and other organs show strong associations with particular HLA alleles. For some reactions, HLA typing prior to prescription, so that those positive for the risk allele are not given the drug associated with the reaction, shows high positive and negative predictive values. The best example of clinical implementation relates to the hypersensitivity reaction induced by the anti-HIV drug abacavir. When this reaction is phenotyped accurately, 100% of those who develop it are positive for HLA-B*57:01. Drug regulators worldwide now recommend genotyping for HLA-B*57:01 before abacavir is prescribed. Serious skin rashes including Stevens-Johnson syndrome and toxic epidermal necrosis can be induced by carbamazepine and other anticonvulsant drugs. In certain East Asians, these reactions are significantly associated with HLA-B*15:02, and typing for this allele is now recommended prior to carbamazepine prescription in these populations. Other HLA associations have been described for skin rash induced by carbamazepine, allopurinol and nevirapine and for liver injury induced by flucloxacillin, amoxicillin-clavulanate, lapatanib, lumiracoxib and ticlopidine. However, the predictive values for typing HLA alleles associated with these adverse reactions are lower. Clinical implementation therefore seems unlikely. Performing HLA typing is relatively complex compared with genotyping assays for single nucleotide polymorphisms. With emphasis on HLA-B*57:01, the approaches used commonly, including use of sequence-specific oligonucleotide PCR primers and DNA sequencing are considered, together with their successful implementation. Genotyping single nucleotide polymorphisms tagging HLA alleles is a simpler alternative to HLA typing but appears insufficiently accurate for clinical use.
Xu, Dezhong; Wang, Bo; Zhang, Lei; Li, Duan; Xiao, Dan; Li, Fan; Zhang, Jingxia; Yan, Yongping
2013-01-01
Objective To investigate the prevalence of occult HBV infection (OBI) among children and to characterize virology of occult HBV, we conducted an epidemiological survey. Methods 186 HB-vaccinated infants born to HBsAg-positive mothers were included in the study. Serological tests for HBV markers were performed using commercial ELISA kits. Real-time quantitative PCR and nested PCR were used to detect HBV DNA. PCR products of the C and pre-S/S regions were sequenced and analyzed. Results 1.61% (3/186) infants were HBsAg positive, and 4.92% (9/183) infants were considered as occult infection. The viral load of mothers was associated with occult infection (P = 0.020). Incomplete three-dose injections of HB vaccine was associated with HBV infection (P = 0.022). Six OBI infants were positive for anti-HBs, but their titers were not greater than 100 mIU/mL. Seven isolated HBV pre-S/S sequences were obtained from nine OBI infants. Three of the sequences were genotype C, and four of the sequences were genotype C/D. Escape mutation S143L was found in the four sequences of genotype C/D. All seven sequences lacked G145R and other escape mutation in S region. Conclusions Occult HBV infection was detected in anti-HBs positive infants born to HBsAg-positive mothers in China. Occult infection was associated with absent anti-HBs or with low anti-HBs level, high maternal viral loads and escape mutations in the S gene. PMID:23951004
Murdock, Andrew G
2008-05-01
Closely related outgroups are optimal for rooting phylogenetic trees; however, such ideal outgroups are not always available. A phylogeny of the marattioid ferns (Marattiaceae), an ancient lineage with no close relatives, was reconstructed using nucleotide sequences of multiple chloroplast regions (rps4 + rps4-trnS spacer, trnS-trnG spacer + trnG intron, rbcL, atpB), from 88 collections, selected to cover the broadest possible range of morphologies and geographic distributions within the extant taxa. Because marattioid ferns are phylogenetically isolated from other lineages, and internal branches are relatively short, rooting was problematic. Root placement was strongly affected by long-branch attraction under maximum parsimony and by model choice under maximum likelihood. A multifaceted approach to rooting was employed to isolate the sources of bias and produce a consensus root position. In a statistical comparison of all possible root positions with three different outgroups, most root positions were not significantly less optimal than the maximum likelihood root position, including the consensus root position. This phylogeny has several important taxonomic implications for marattioid ferns: Marattia in the broad sense is paraphyletic; the Hawaiian endemic Marattia douglasii is most closely related to tropical American taxa; and Angiopteris is monophyletic only if Archangiopteris and Macroglossum are included.
Harper, Marc; Gronenberg, Luisa; Liao, James; Lee, Christopher
2014-01-01
Discovering all the genetic causes of a phenotype is an important goal in functional genomics. We combine an experimental design for detecting independent genetic causes of a phenotype with a high-throughput sequencing analysis that maximizes sensitivity for comprehensively identifying them. Testing this approach on a set of 24 mutant strains generated for a metabolic phenotype with many known genetic causes, we show that this pathway-based phenotype sequencing analysis greatly improves sensitivity of detection compared with previous methods, and reveals a wide range of pathways that can cause this phenotype. We demonstrate our approach on a metabolic re-engineering phenotype, the PEP/OAA metabolic node in E. coli, which is crucial to a substantial number of metabolic pathways and under renewed interest for biofuel research. Out of 2157 mutations in these strains, pathway-phenoseq discriminated just five gene groups (12 genes) as statistically significant causes of the phenotype. Experimentally, these five gene groups, and the next two high-scoring pathway-phenoseq groups, either have a clear connection to the PEP metabolite level or offer an alternative path of producing oxaloacetate (OAA), and thus clearly explain the phenotype. These high-scoring gene groups also show strong evidence of positive selection pressure, compared with strictly neutral selection in the rest of the genome.
A new earthworm cellulase and its possible role in the innate immunity.
Park, In Yong; Cha, Ju Roung; Ok, Suk-Mi; Shin, Chuog; Kim, Jin-Se; Kwak, Hee-Jin; Yu, Yun-Sang; Kim, Yu-Kyung; Medina, Brenda; Cho, Sung-Jin; Park, Soon Cheol
2017-02-01
A new endogenous cellulase (Ean-EG) from the earthworm, Eisenia andrei and its expression pattern are demonstrated. Based on a deduced amino acid sequence, the open reading frame (ORF) of Ean-EG consisted of 1368 bps corresponding to a polypeptide of 456 amino acid residues in which is contained the conserved region specific to GHF9 that has the essential amino acid residues for enzyme activity. In multiple alignments and phylogenetic analysis, the deduced amino acid sequence of Ean- EG showed the highest sequence similarity (about 79%) to that of an annelid (Pheretima hilgendorfi) and could be clustered together with other GHF9 cellulases, indicating that Ean-EG could be categorized as a member of the GHF9 to which most animal cellulases belong. The histological expression pattern of Ean-EG mRNA using in situ hybridization revealed that the most distinct expression was observed in epithelial cells with positive hybridization signal in epidermis, chloragogen tissue cells, coelomic cell-aggregate, and even blood vessel, which could strongly support the fact that at least in the earthworm, Eisenia andrei, cellulase function must not be limited to digestive process but be possibly extended to the innate immunity. Copyright © 2016 Elsevier Ltd. All rights reserved.
Liu, Yuan; Cui, Zhaoxia
2010-06-01
Given the commercial and ecological importance of the Asian paddle crab, Charybdis japonica, there is a clearly need for genetic and molecular research on this species. Here, we present the complete mitochondrial genome sequence of C. japonica, determined by the long-polymerase chain reaction and primer walking sequencing method. The entire genome is 15,738 bp in length, encoding a standard set of 13 protein-coding genes, two ribosomal RNA genes, and 22 transfer RNA genes, plus the putative control region, which is typical for metazoans. The total A+T content of the genome is 69.2%, lower than the other brachyuran crabs except for Callinectes sapidus. The gene order is identical to the published marine brachyurans and differs from the ancestral pancrustacean order by only the position of the tRNA ( His ) gene. Phylogenetic analyses using the concatenated nucleotide and amino acid sequences of 13 protein-coding genes strongly support the monophyly of Dendrobranchiata and Pleocyemata, which is consistent with the previous taxonomic classification. However, the systematic status of Charybdis within subfamily Thalamitinae of family Portunidae is not supported. C. japonica, as the first species of Charybdis with complete mitochondrial genome available, will provide important information on both genomics and molecular ecology of the group.
Aeromagnetic survey map of Sacramento Valley, California
Langenheim, Victoria E.
2015-01-01
Three aeromagnetic surveys were flown to improve understanding of the geology and structure in the Sacramento Valley. The resulting data serve as a basis for geophysical interpretations, and support geological mapping, water and mineral resource investigations, and other topical studies. Local spatial variations in the Earth's magnetic field (evident as anomalies on aeromagnetic maps) reflect the distribution of magnetic minerals, primarily magnetite, in the underlying rocks. In many cases the volume content of magnetic minerals can be related to rock type, and abrupt spatial changes in the amount of magnetic minerals commonly mark lithologic or structural boundaries. Bodies of serpentinite and other mafic and ultramafic rocks tend to produce the most intense positive magnetic anomalies (for example, in the northwest part of the map). These rock types are the inferred sources, concealed beneath weakly magnetic, valley-fill deposits, of the most prominent magnetic features in the map area, the magnetic highs that extend along the valley axis. Cenozoic volcanic rocks are also an important source of magnetic anomalies and coincide with short-wavelength anomalies that can be either positive (strong central positive anomaly flanked by lower-amplitude negative anomalies) or negative (strong central negative anomaly flanked by lower-amplitude positive anomalies), reflecting the contribution of remanent magnetization. Rocks with more felsic compositions or even some sedimentary units also can cause measurable magnetic anomalies. For example, the long, linear, narrow north-trending anomalies (with amplitudes of <50 nanoteslas [nT]) along the western margin of the valley coincide with exposures of the Mesozoic Great Valley sequence. Note that isolated, short-wavelength anomalies, such as those in the city of Sacramento and along some of the major roads, are caused by manmade features.
Emmett, Sarah; Boros, Samuel; Whiteman, David C; Porceddu, Sandro V; Panizza, Benedict J; Antonsson, Annika
2018-06-01
A significant proportion of mucosal squamous cell carcinomas of the head and neck (HNSCC; particularly of the oropharynx) are directly attributable to the human papillomavirus (HPV). The increase in the incidence of HPV-related tumours has been postulated to be due to changing sexual practices in the community. We analysed 136 formalin-fixed paraffin-embedded squamous cell carcinomas from the oral cavity (n=40) and oropharynx (n=96) recruited from the Princess Alexandra Hospital (Brisbane, Australia). Samples were analysed for the presence of HPV DNA using a combination of mucosal HPV general primer GP+ PCR and sequencing; p 16INK4a expression was assessed by immunohistochemistry. Each patient completed a questionnaire detailing their lifestyle factors, such as tobacco smoking and alcohol consumption, marital status, and sexual behaviour and history. The HPV DNA prevalence was 5 % in the oral cavity cancers and 72 % in the oropharyngeal cancers (P<0.0001). HPV-16 was the most commonly detected HPV type (found in 91 % of all HPV-positive tumours). There was a strong correlation between HPV DNA positivity and positive p16 INK4a staining in oropharyngeal tumours (P<0.0001). Having an HPV-related tumour was associated with being married or having been married previously (P=0.046), an increasing number of passionate kissing partners (P=0.046), ever having given oral sex (P=0.0007) and an increasing number of oral sex partners (P=0.0015). This study found a higher prevalence of HPV in oropharyngeal compared to oral cavity tumours, with a strong association being identified between oral sex behaviours and HPV-positive tumours. Further research is needed to establish that vaccines will reduce the transmission and carriage of oropharyngeal HPV infections.
Adungo, Ferdinard; Kamau, David; Inoue, Shingo; Hayasaka, Daisuke; Posadas-Herrera, Guillermo; Sang, Rosemary; Mwau, Matilu
2016-01-01
Yellow fever (YF) is an acute hemorrhagic viral infection transmitted by mosquitoes in Africa and South America. The major challenge in YF disease detection and confirmation of outbreaks in Africa is the limited availability of reference laboratories and the persistent lack of access to diagnostic tests. We used wild-type YF virus sequences to generate recombinant envelope protein in an Escherichia coli expression system. Both the recombinant protein and sucrose gradient-purified YF vaccine virus 17D (YF-17D) were used to immunize BALB/c mice to generate monoclonal antibodies (MAbs). Eight MAbs were established and systematically characterized by indirect enzyme-linked immunosorbent assay (ELISA), Western blot analysis, and immunofluorescence assay (IFA). The established MAbs showed strong reactivity with wild-type YF virus and recombinant protein with no detectable cross-reactivity to dengue virus or Japanese encephalitis virus. Epitope mapping showed strong binding of three MAbs to amino acid positions 1 to 51, while two MAbs mapped to amino acid positions 52 to 135 of the envelope protein. The remaining three MAbs did not show reactivity to envelope fragments. The established MAbs exert no neutralization against wild-type YF and 17D viruses (titer of <10 for both strains). The applicability of MAbs 8H3 and 3F4 was further evaluated using IgM capture ELISA. A total of 49 serum samples were analyzed, among which 12 positive patient and vaccinee samples were correctly identified. Using serum samples that were 2-fold serially diluted, the IgM capture ELISA was able to detect all YF-positive samples. Furthermore, MAb-based antigen detection ELISA enabled the detection of virus in culture supernatants containing titers of about 1,000 focus-forming units. PMID:27307452
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adamson, J; Chang, Z; Cai, J
Purpose: To develop a robust MRI sequence to measure BOLD breath hold induced contrast in context of breast radiotherapy. Methods: Two sequences were selected from prior studies as candidates to measure BOLD contrast attributable to breath holding within the breast: (1) T2* based Gradient Echo EPI (TR/TE = 500/41ms, flip angle = 60°), and (2) T2 based Single Shot Fast Spin Echo (SSFSE) (TR/TE = 3000/60ms). We enrolled ten women post-lumpectomy for breast cancer who were undergoing treatment planning for whole breast radiotherapy. Each session utilized a 1.5T GE MRI and 4 channel breast coil with the subject immobilized pronemore » on a custom board. For each sequence, 1–3 planes of the lumpectomy breast were imaged continuously during a background measurement (1min) and intermittent breath holds (20–40s per breath hold, 3–5 holds per sequence). BOLD contrast was quantified as correlation of changes in per-pixel intensity with the breath hold schedule convolved with a hemodynamic response function. Subtle motion was corrected using a deformable registration algorithm. Correlation with breath-holding was considered significant if p<0.001. Results: The percentage of the breast ROI with positive BOLD contrast measured by the two sequences were in agreement with a correlation coefficient of R=0.72 (p=0.02). While both sequences demonstrated areas with strong BOLD response, the response was more systematic throughout the breast for the SSFSE (T2) sequence (% breast with response in the same direction: 51.2%±0.7% for T2* vs. 68.1%±16% for T2). In addition, the T2 sequence was less prone to magnetic susceptibility artifacts, especially in presence of seroma, and provided a more robust image with little distortion or artifacts. Conclusion: A T2 SSFSE sequence shows promise for measuring BOLD contrast in the context of breast radiotherapy utilizing a breath hold technique. Further study in a larger patient cohort is warranted to better refine this novel technique.« less
2013-01-01
Background The narrow-leafed lupin, Lupinus angustifolius L., is a grain legume species with a relatively compact genome. The species has 2n = 40 chromosomes and its genome size is 960 Mbp/1C. During the last decade, L. angustifolius genomic studies have achieved several milestones, such as molecular-marker development, linkage maps, and bacterial artificial chromosome (BAC) libraries. Here, these resources were integratively used to identify and sequence two gene-rich regions (GRRs) of the genome. Results The genome was screened with a probe representing the sequence of a microsatellite fragment length polymorphism (MFLP) marker linked to Phomopsis stem blight resistance. BAC clones selected by hybridization were subjected to restriction fingerprinting and contig assembly, and 232 BAC-ends were sequenced and annotated. BAC fluorescence in situ hybridization (BAC-FISH) identified eight single-locus clones. Based on physical mapping, cytogenetic localization, and BAC-end annotation, five clones were chosen for sequencing. Within the sequences of clones that hybridized in FISH to a single-locus, two large GRRs were identified. The GRRs showed strong and conserved synteny to Glycine max duplicated genome regions, illustrated by both identical gene order and parallel orientation. In contrast, in the clones with dispersed FISH signals, more than one-third of sequences were transposable elements. Sequenced, single-locus clones were used to develop 12 genetic markers, increasing the number of L. angustifolius chromosomes linked to appropriate linkage groups by five pairs. Conclusions In general, probes originating from MFLP sequences can assist genome screening and gene discovery. However, such probes are not useful for positional cloning, because they tend to hybridize to numerous loci. GRRs identified in L. angustifolius contained a low number of interspersed repeats and had a high level of synteny to the genome of the model legume G. max. Our results showed that not only was the gene nucleotide sequence conserved between soybean and lupin GRRs, but the order and orientation of particular genes in syntenic blocks was homologous, as well. These findings will be valuable to the forthcoming sequencing of the lupin genome. PMID:23379841
Knutson, Stacy T.; Westwood, Brian M.; Leuthaeuser, Janelle B.; Turner, Brandon E.; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D.; Harper, Angela F.; Brown, Shoshana D.; Morris, John H.; Ferrin, Thomas E.; Babbitt, Patricia C.
2017-01-01
Abstract Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow mechanistic determinant identification—amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two‐Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure‐Function Linkage Database, SFLD) self‐identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self‐identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well‐curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP‐identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F‐measure and performance analysis on the enolase search results and comparison to GEMMA and SCI‐PHY demonstrate that TuLIP avoids the over‐division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. PMID:28054422
Knutson, Stacy T; Westwood, Brian M; Leuthaeuser, Janelle B; Turner, Brandon E; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D; Harper, Angela F; Brown, Shoshana D; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C; Fetrow, Jacquelyn S
2017-04-01
Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow mechanistic determinant identification-amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two-Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure-Function Linkage Database, SFLD) self-identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self-identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well-curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP-identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F-measure and performance analysis on the enolase search results and comparison to GEMMA and SCI-PHY demonstrate that TuLIP avoids the over-division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. © 2017 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Ana o 2, a major cashew (Anacardium occidentale L.) nut allergen of the legumin family.
Wang, Fang; Robotham, Jason M; Teuber, Suzanne S; Sathe, Shridhar K; Roux, Kenneth H
2003-09-01
We recently cloned and described a vicilin and showed it to be a major cashew allergen. Additional IgE-reactive cashew peptides of the legumin group and 2S albumin families have also been reported. Here, we attempt to clone, express and characterize a second major cashew allergen. A cashew cDNA library was screened with human IgE and rabbit IgG anti-cashew extract antisera, and a reactive nonvicilin clone was sequenced and expressed as a fusion protein in Escherichia coli. Immunoblotting was used to screen for reactivity with patients' sera, and inhibition of immunoblotting was used to identify the corresponding native peptides in cashew nut extract. The identified allergen was subjected to linear epitope mapping using SPOTs solid-phase synthetic peptide technology. Sequence analysis showed the selected clone, designated Ana o 2, to encode for a member of the legumin family (an 11S globulin) of seed storage proteins. By IgE immunoblotting, 13 of 21 sera (62%) from cashew-allergic patients were reactive. Immunoblot inhibition data showed that the native Ana o 2 constitutes a major band at approximately 33 kD and a minor band at approximately 53 kD. Probing of overlapping synthetic peptides with pooled human cashew-allergic sera identified 22 reactive peptides, 7 of which gave strong signals. Several Ana o 2 epitopes were shown to overlap those of the peanut legumin group allergen, Ara h 3, in position but with little sequence similarity. Greater positional overlap and identity was observed between Ana o 2 and soybean glycinin epitopes. We conclude that this legumin-like protein is a major allergen in cashew nut. Copyright 2003 S. Karger AG, Basel
Zooanthroponotic transmission of rotavirus in Haryana State of Northern India.
Choudhary, P; Minakshi, P; Ranjan, K; Basanti, B
Rotaviruses are the major cause of severe gastroenteritis and mortality in young children and animals. Due to segmented nature of dsRNA genome and wide host range, vast genetic and antigenic diversity exists amongst different isolates of rotaviruses. A total of 230 fecal ovine and caprine samples collected from organized farms and villages in Haryana were screened for rotavirus detection. Samples were screened by latex agglutination test and RNA-PAGE followed by RT-PCR and nucleic acid sequencing. The latex agglutination test showed 25 newborn lamb and 4 kid fecal samples positive for rotavirus. However, RNA-PAGE showed only 9 lamb fecal samples positive for rotavirus. All the samples were subjected to RT-PCR employing vp4 and vp7 gene specific primers of group A rotavirus of ovine, bovine and human origin. Only two samples from lamb (Sheep18/Hisar/2013 and Sheep22/Hisar/2013) showed vp4 and vp7 gene specific amplification with human group A rotavirus (GAR) specific primer. However, they did not show any amplification with ovine and bovine rotavirus specific primers. The nucleotide as well as deduced amino acid sequence analysis of vp4 gene of these isolates showed >98/97% and vp7 gene >95/94% nt/aa identity with human GAR from different regions of the world. Based on nucleotide similarity search, Sheep18/Hisar/2013 and Sheep22/Hisar/2013 isolates were genotyped as G1P[8] and G1P[4]. Phylogenetic analysis also confirmed that these isolates were clustered closely with human rotaviruses from different regions of the world. Earlier, higher prevalence of human rotaviruses was reported from the sample collecting area. The amplification of ovine samples with human rotavirus gene specific primers, sequence identity and phylogenetic analysis strongly suggests the zoonotic transmission of human GAR to sheep.
Hsieh, PingHsun; Veeramah, Krishna R.; Lachance, Joseph; Tishkoff, Sarah A.; Wall, Jeffrey D.; Hammer, Michael F.; Gutenkunst, Ryan N.
2016-01-01
African Pygmies practicing a mobile hunter-gatherer lifestyle are phenotypically and genetically diverged from other anatomically modern humans, and they likely experienced strong selective pressures due to their unique lifestyle in the Central African rainforest. To identify genomic targets of adaptation, we sequenced the genomes of four Biaka Pygmies from the Central African Republic and jointly analyzed these data with the genome sequences of three Baka Pygmies from Cameroon and nine Yoruba famers. To account for the complex demographic history of these populations that includes both isolation and gene flow, we fit models using the joint allele frequency spectrum and validated them using independent approaches. Our two best-fit models both suggest ancient divergence between the ancestors of the farmers and Pygmies, 90,000 or 150,000 yr ago. We also find that bidirectional asymmetric gene flow is statistically better supported than a single pulse of unidirectional gene flow from farmers to Pygmies, as previously suggested. We then applied complementary statistics to scan the genome for evidence of selective sweeps and polygenic selection. We found that conventional statistical outlier approaches were biased toward identifying candidates in regions of high mutation or low recombination rate. To avoid this bias, we assigned P-values for candidates using whole-genome simulations incorporating demography and variation in both recombination and mutation rates. We found that genes and gene sets involved in muscle development, bone synthesis, immunity, reproduction, cell signaling and development, and energy metabolism are likely to be targets of positive natural selection in Western African Pygmies or their recent ancestors. PMID:26888263
Ultrashort echo time magnetization transfer (UTE-MT) imaging of cortical bone.
Chang, Eric Y; Bae, Won C; Shao, Hongda; Biswas, Reni; Li, Shihong; Chen, Jun; Patil, Shantanu; Healey, Robert; D'Lima, Darryl D; Chung, Christine B; Du, Jiang
2015-07-01
Magnetization transfer (MT) imaging is one way to indirectly assess pools of protons with fast transverse relaxation. However, conventional MT imaging sequences are not applicable to short T2 tissues such as cortical bone. Ultrashort echo time (UTE) sequences with TE values as low as 8 µs can detect signals from different water components in cortical bone. In this study we aim to evaluate two-dimensional UTE-MT imaging of cortical bone and its application in assessing cortical bone porosity as measured by micro-computed tomography (μCT) and biomechanical properties. In total, 38 human cadaveric distal femur and proximal tibia bones were sectioned to produce 122 rectangular pieces of cortical bone for quantitative UTE-MT MR imaging, μCT, and biomechanical testing. Off-resonance saturation ratios (OSRs) with a series of MT pulse frequency offsets (Δf) were calculated and compared with porosity assessed with μCT, as well as elastic (modulus, yield stress, and strain) and failure (ultimate stress, failure strain, and energy) properties, using Pearson correlation and linear regression. A moderately strong negative correlation was observed between OSR and μCT porosity (R(2) = 0.46-0.51), while a moderate positive correlation was observed between OSR and yield stress (R(2) = 0.25-0.30) and failure stress (R(2) = 0.31-0.35), and a weak positive correlation (R(2) = 0.09-0.12) between OSR and Young's modulus at all off-resonance saturation frequencies. OSR determined with the UTE-MT sequence provides quantitative information on cortical bone and is sensitive to μCT porosity and biomechanical function. Copyright © 2015 John Wiley & Sons, Ltd.
Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choe, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B; Gupta, Neha; Kohane, Isaac S; Green, Robert C; Kong, Sek Won
2014-08-01
As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false-positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but costly. Here, we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false-negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous single nucleotide variants (SNVs); 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery in NA12878, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and an ensemble genotyping would be essential to minimize false-positive DNM candidates. © 2014 WILEY PERIODICALS, INC.
Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choi, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B.; Gupta, Neha; Kohane, Isaac S.; Green, Robert C.; Kong, Sek Won
2014-01-01
As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but costly. Here we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous SNVs; 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and ensemble genotyping would be essential to minimize false positive DNM candidates. PMID:24829188
Representation of item position in immediate serial recall: Evidence from intrusion errors.
Fischer-Baum, Simon; McCloskey, Michael
2015-09-01
In immediate serial recall, participants are asked to recall novel sequences of items in the correct order. Theories of the representations and processes required for this task differ in how order information is maintained; some have argued that order is represented through item-to-item associations, while others have argued that each item is coded for its position in a sequence, with position being defined either by distance from the start of the sequence, or by distance from both the start and the end of the sequence. Previous researchers have used error analyses to adjudicate between these different proposals. However, these previous attempts have not allowed researchers to examine the full set of alternative proposals. In the current study, we analyzed errors produced in 2 immediate serial recall experiments that differ in the modality of input (visual vs. aural presentation of words) and the modality of output (typed vs. spoken responses), using new analysis methods that allow for a greater number of alternative hypotheses to be considered. We find evidence that sequence positions are represented relative to both the start and the end of the sequence, and show a contribution of the end-based representation beyond the final item in the sequence. We also find limited evidence for item-to-item associations, suggesting that both a start-end positional scheme and item-to-item associations play a role in representing item order in immediate serial recall. (c) 2015 APA, all rights reserved).
False positives complicate ancient pathogen identifications using high-throughput shotgun sequencing
2014-01-01
Background Identification of historic pathogens is challenging since false positives and negatives are a serious risk. Environmental non-pathogenic contaminants are ubiquitous. Furthermore, public genetic databases contain limited information regarding these species. High-throughput sequencing may help reliably detect and identify historic pathogens. Results We shotgun-sequenced 8 16th-century Mixtec individuals from the site of Teposcolula Yucundaa (Oaxaca, Mexico) who are reported to have died from the huey cocoliztli (‘Great Pestilence’ in Nahautl), an unknown disease that decimated native Mexican populations during the Spanish colonial period, in order to identify the pathogen. Comparison of these sequences with those deriving from the surrounding soil and from 4 precontact individuals from the site found a wide variety of contaminant organisms that confounded analyses. Without the comparative sequence data from the precontact individuals and soil, false positives for Yersinia pestis and rickettsiosis could have been reported. Conclusions False positives and negatives remain problematic in ancient DNA analyses despite the application of high-throughput sequencing. Our results suggest that several studies claiming the discovery of ancient pathogens may need further verification. Additionally, true single molecule sequencing’s short read lengths, inability to sequence through DNA lesions, and limited ancient-DNA-specific technical development hinder its application to palaeopathology. PMID:24568097
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Query-seeded iterative sequence similarity searching improves selectivity 5–20-fold
Li, Weizhong; Lopez, Rodrigo
2017-01-01
Abstract Iterative similarity search programs, like psiblast, jackhmmer, and psisearch, are much more sensitive than pairwise similarity search methods like blast and ssearch because they build a position specific scoring model (a PSSM or HMM) that captures the pattern of sequence conservation characteristic to a protein family. But models are subject to contamination; once an unrelated sequence has been added to the model, homologs of the unrelated sequence will also produce high scores, and the model can diverge from the original protein family. Examination of alignment errors during psiblast PSSM contamination suggested a simple strategy for dramatically reducing PSSM contamination. psiblast PSSMs are built from the query-based multiple sequence alignment (MSA) implied by the pairwise alignments between the query model (PSSM, HMM) and the subject sequences in the library. When the original query sequence residues are inserted into gapped positions in the aligned subject sequence, the resulting PSSM rarely produces alignment over-extensions or alignments to unrelated sequences. This simple step, which tends to anchor the PSSM to the original query sequence and slightly increase target percent identity, can reduce the frequency of false-positive alignments more than 20-fold compared with psiblast and jackhmmer, with little loss in search sensitivity. PMID:27923999
V, Pavana Jyothi; S, Akila; Selvan, Malini K; Naidu, Hariprasad; Raghunathan, Shwethaa; Kota, Sathish; Sundaram, R C Raja; Rana, Samir Kumar; Raj, G Dhinakar; Srinivasan, V A; Mohana Subramanian, B
2016-12-01
Canine parvovirus (CPV) is a non-enveloped single stranded DNA virus with an icosahedral capsid. Mini-sequencing based CPV typing was developed earlier to detect and differentiate all the CPV types and FPV in a single reaction. This technique was further evaluated in the present study by performing the mini-sequencing directly from fecal samples which avoided tedious virus isolation steps by cell culture system. Fecal swab samples were collected from 84 dogs with enteritis symptoms, suggestive of parvoviral infection from different locations across India. Seventy six of these samples were positive by PCR; the subsequent mini-sequencing reaction typed 74 of them as type 2a virus, and 2 samples as type 2b. Additionally, 25 of the positive samples were typed by cycle sequencing of PCR products. Direct CPV typing from fecal samples using mini-sequencing showed 100% correlation with CPV typing by cycle sequencing. Moreover, CPV typing was achieved by mini-sequencing even with faintly positive PCR amplicons which was not possible by cycle sequencing. Therefore, the mini-sequencing technique is recommended for regular epidemiological follow up of CPV types, since the technique is rapid, highly sensitive and high capacity method for CPV typing. Copyright © 2016. Published by Elsevier B.V.
Zhao, Dayong; Shen, Feng; Zeng, Jin; Huang, Rui; Yu, Zhongbo; Wu, Qinglong L
2016-12-15
Association network approaches have recently been proposed as a means for exploring the associations between bacterial communities. In the present study, high-throughput sequencing was employed to investigate the seasonal variations in the composition of bacterioplankton communities in six eutrophic urban lakes of Nanjing City, China. Over 150,000 16S rRNA sequences were derived from 52 water samples, and correlation-based network analyses were conducted. Our results demonstrated that the architecture of the co-occurrence networks varied in different seasons. Cyanobacteria played various roles in the ecological networks during different seasons. Co-occurrence patterns revealed that members of Cyanobacteria shared a very similar niche and they had weak positive correlations with other phyla in summer. To explore the effect of environmental factors on species-species co-occurrence networks and to determine the most influential environmental factors, the original positive network was simplified by module partitioning and by calculating module eigengenes. Module eigengene analysis indicated that temperature only affected some Cyanobacteria; the rest were mainly affected by nitrogen associated factors throughout the year. Cyanobacteria were dominant in summer which may result from strong co-occurrence patterns and suitable living conditions. Overall, this study has improved our understanding of the roles of Cyanobacteria and other bacterioplankton in ecological networks. Copyright © 2016 Elsevier B.V. All rights reserved.
Meuleman, Wouter; Peric-Hupkes, Daan; Kind, Jop; Beaudry, Jean-Bernard; Pagie, Ludo; Kellis, Manolis; Reinders, Marcel; Wessels, Lodewyk; van Steensel, Bas
2013-02-01
In metazoans, the nuclear lamina is thought to play an important role in the spatial organization of interphase chromosomes, by providing anchoring sites for large genomic segments named lamina-associated domains (LADs). Some of these LADs are cell-type specific, while many others appear constitutively associated with the lamina. Constitutive LADs (cLADs) may contribute to a basal chromosome architecture. By comparison of mouse and human lamina interaction maps, we find that the sizes and genomic positions of cLADs are strongly conserved. Moreover, cLADs are depleted of synteny breakpoints, pointing to evolutionary selective pressure to keep cLADs intact. Paradoxically, the overall sequence conservation is low for cLADs. Instead, cLADs are universally characterized by long stretches of DNA of high A/T content. Cell-type specific LADs also tend to adhere to this "A/T rule" in embryonic stem cells, but not in differentiated cells. This suggests that the A/T rule represents a default positioning mechanism that is locally overruled during lineage commitment. Analysis of paralogs suggests that during evolution changes in A/T content have driven the relocation of genes to and from the nuclear lamina, in tight association with changes in expression level. Taken together, these results reveal that the spatial organization of mammalian genomes is highly conserved and tightly linked to local nucleotide composition.
Rodent phylogeny revised: analysis of six nuclear genes from all major rodent clades
Blanga-Kanfi, Shani; Miranda, Hector; Penn, Osnat; Pupko, Tal; DeBry, Ronald W; Huchon, Dorothée
2009-01-01
Background Rodentia is the most diverse order of placental mammals, with extant rodent species representing about half of all placental diversity. In spite of many morphological and molecular studies, the family-level relationships among rodents and the location of the rodent root are still debated. Although various datasets have already been analyzed to solve rodent phylogeny at the family level, these are difficult to combine because they involve different taxa and genes. Results We present here the largest protein-coding dataset used to study rodent relationships. It comprises six nuclear genes, 41 rodent species, and eight outgroups. Our phylogenetic reconstructions strongly support the division of Rodentia into three clades: (1) a "squirrel-related clade", (2) a "mouse-related clade", and (3) Ctenohystrica. Almost all evolutionary relationships within these clades are also highly supported. The primary remaining uncertainty is the position of the root. The application of various models and techniques aimed to remove non-phylogenetic signal was unable to solve the basal rodent trifurcation. Conclusion Sequencing and analyzing a large sequence dataset enabled us to resolve most of the evolutionary relationships among Rodentia. Our findings suggest that the uncertainty regarding the position of the rodent root reflects the rapid rodent radiation that occurred in the Paleocene rather than the presence of conflicting phylogenetic and non-phylogenetic signals in the dataset. PMID:19341461
Dabert, Miroslawa; Dastych, Hieronymus; Hohberg, Karin; Dabert, Jacek
2014-01-01
The systematics of Eutardigrada, the largest lineage among the three classes of the phylum Tardigrada, is based mainly on the morphology of the leg claws and of the buccal apparatus. However, three members of the rarely recorded and poorly known limno-terrestrial eutardigrade genus Apodibius have no claws on their strongly reduced legs, a unique character among all tardigrades. This absence of all claws makes the systematic position of Apodibius one of the most enigmatic among the whole class. Until now all known associates of the genus Apodibius have been located in the incertae sedis species group or, quite recently, included into the Necopinatidae family. In the present study, phylogenetic analyses of 18S and 28S rRNA sequence data from 31 tardigrade species representing four parachelan superfamilies (Isohypsibioidea, Hypsibioidea, Macrobiotoidea, Eohypsibioidea), the apochelan Milnesium tardigradum, and the type species of the genus Apodibius, A. confusus, indicated close relationship of the Apodibius with tardigrade species recently included in the superfamily Isohypsibioidea. This result was well-supported and consistent across all markers (separate 18S rRNA, 28S rRNA, and combined 18S rRNA+28S rRNA datasets) and methods (MP, ML) applied. Copyright © 2013 Elsevier Inc. All rights reserved.
Lin, Shu-Xiang; Wang, Wei; Guo, Wei; Yang, Hong-Jiang; Ma, Bai-Cheng; Fang, Yu-Lian; Xu, Yong-Sheng
2017-07-01
To investigate the relationship of KI polyomavirus (KIPyV) and WU polyomavirus (WUPyV) with acute respiratory infection in children in Tianjin, China. A total of 3 730 nasopharyngeal secretions were collected from hospitalized children with acute respiratory infection in Tianjin Children's Hospital from January 2011 to December 2013. Viral nucleic acid was extracted, and virus infection (KIPyV and WUPyV) was determined by PCR. Some KIPyV-positive and WUPyV-positive PCR products were subjected to sequencing. Sequencing results were aligned with the known gene sequences of KIPyV and WUPyV to construct a phylogenetic tree. Amplified VP1 fragments of KIPyV were inserted into the cloning vector (PUCm-T) transformed into E. coli competent cells. Positive clones were identified by PCR and sequencing. The nucleotide sequences were submitted to GenBank. In addition, another seven common respiratory viruses in all samples were detected by direct immunofluorescence assay. In the 3 730 specimens, the KIPyV-positive rate was 12.14% (453/3 730) and the WUPyV-positive rate was 1.69% (63/3 730). The mean infection rate of KIPyV was significantly higher in June and July, while the mean infection rate of WUPyV peaked in February and March. Most of the KIPyV-positive or WUPyV-positive children were <3 years. The co-infections with KIPyV, WUPyV, and other respiratory viruses were observed in the children. The co-infection rate was 2.31% (86/3 730) and there were nine cases of co-infections with WUPyV and KIPyV. Thirty-five KIPyV-positive and twelve WUPyV-positive PCR products were sequenced and the alignment analysis showed that they had high homology with the known sequences (94%-100% vs 95%-100%). The VP1 gene sequences obtained from two KIPyV strains in this study were recorded in GenBank with the accession numbers of KY465925 and KY465926. For some children with acute respiratory infection in Tianjin, China, the acute respiratory infection may be associated with KIPyV and WUPyV infections. KIPyV infection is common in summer, and WUPyV infection in spring. The epidemic strains in Tianjin have a high homology with those in other regions.
Cheng, Peng; Liu, Bo; Su, Yi; Hu, Yao; Hong, Yahui; Yi, Xinxin; Chen, Lei; Su, Shengying; Chu, Jeffrey S C; Chen, Nansheng; Xiong, Xingyao
2017-04-19
Efficient biomass bioconversion is a promising solution to alternative energy resources and environmental issues associated with lignocellulosic wastes. The Trichoderma species of cellulolytic fungi have strong cellulose-degrading capability, and their cellulase systems have been extensively studied. Currently, a major limitation of Trichoderma strains is their low production of β-glucosidases. We isolated two Trichoderma hamatum strains YYH13 and YYH16 with drastically different cellulose degrading efficiencies. YYH13 has higher cellobiose-hydrolyzing efficiency. To understand mechanisms underlying such differences, we sequenced the genomes of YYH13 and YYH16, which are essentially identical (38.93 and 38.92 Mb, respectively) and are similar to that of the T. hamatum strain GD12. Using GeneMark-ES, we annotated 11,316 and 11,755 protein-coding genes in YYH13 and YYH16, respectively. Comparative analysis identified 13 functionally important genes in YYH13 under positive selection. Through examining orthologous relationships, we identified 172,655, and 320 genome-specific genes in YYH13, YYH16, and GD12, respectively. We found 15 protease families that show differences between YYH13 and YYH16. Enzymatic tests showed that exoglucanase, endoglucanase, and β-glucosidase activities were higher in YYH13 than YYH16. Additionally, YYH13 contains 10 families of carbohydrate-active enzymes, including GH1, GH3, GH18, GH35, and GH55 families of chitinases, glucosidases, galactosidases, and glucanases, which are subject to stronger positive selection pressure. Furthermore, we found that the β-glucosidase gene (YYH1311079) and pGEX-KG/YYH1311079 bacterial expression vector may provide valuable insight for designing β-glucosidase with higher cellobiose-hydrolyzing efficiencies. This study suggests that the YYH13 strain of T. hamatum has the potential to serve as a model organism for producing cellulase because of its strong ability to efficiently degrade cellulosic biomass. The genome sequences of YYH13 and YYH16 represents a valuable resource for studying efficient production of biofuels.
Nucleotide sequences of Japanese isolates of citrus vein enation virus.
Nakazono-Nagaoka, Eiko; Fujikawa, Takashi; Iwanami, Toru
2017-03-01
The genomic sequences of five Japanese isolates of citrus vein enation virus (CVEV) isolates that induce vein enation were determined and compared with that of the Spanish isolate VE-1. The nucleotide sequences of all Japanese isolates were 5,983 nt in length. The genomic RNA of Japanese isolates had five potential open reading frames (ORF 0, ORF 1, ORF 2, ORF 3, and ORF 5) in the positive-sense strand. The nucleotide sequence identity among the Japanese isolates and Spanish isolate VE-1 ranged from 98.0% to 99.8%. Comparison of the partial amino acid sequences of ten Japanese isolates and three Spanish isolates suggested that four amino acid residues, at positions of 83, 104, and 113 in ORF 2 and position 41 in ORF 5, might be unique to some Japanese isolates.
Folmar, L.D.; Denslow, N.D.; Wallace, R.A.; LaFleur, G.; Gross, T.S.; Bonomelli, S.; Sullivan, C.V.
1995-01-01
N-terminal amino acid sequences for vitellogenin (Vtg) from six species of teleost fish (striped bass, mummichog, pinfish, brown bullhead, medaka, yellow perch and the sturgeon) are compared with published N-terminal Vtg sequences for the lamprey, clawed frog and domestic chicken. Striped bass and mummichog had 100% identical amino acids between positions 7 and 21, while pinfish, brown bullhead, sturgeon, lamprey, Xenopus and chicken had 87%, 93%, 60%, 47%, 47-60%) for four transcripts and had 40% identical, respectively, with striped bass for the same positions. Partial sequences obtained for medaka and yellow perch were 100% identical between positions 5 to 10. The potential utility of this conserved sequence for studies on the biochemistry, molecular biology and pathology of vitellogenesis is discussed.
Molecular epidemiology of Powassan virus in North America.
Pesko, Kendra N; Torres-Perez, Fernando; Hjelle, Brian L; Ebel, Gregory D
2010-11-01
Powassan virus (POW) is a tick-borne flavivirus distributed in Canada, the northern USA and the Primorsky region of Russia. POW is the only tick-borne flavivirus endemic to the western hemisphere, where it is transmitted mainly between Ixodes cookei and groundhogs (Marmota monax). Deer tick virus (DTV), a genotype of POW that has been frequently isolated from deer ticks (Ixodes scapularis), appears to be maintained in an enzootic cycle between these ticks and white-footed mice (Peromyscus leucopus). DTV has been isolated from ticks in several regions of North America, including the upper Midwest and the eastern seaboard. The incidence of human disease due to POW is apparently increasing. Previous analysis of tick-borne flaviviruses endemic to North America have been limited to relatively short genome fragments. We therefore assessed the evolutionary dynamics of POW using newly generated complete and partial genome sequences. Maximum-likelihood and Bayesian phylogenetic inferences showed two well-supported, reciprocally monophyletic lineages corresponding to POW and DTV. Bayesian skyline plots based on year-of-sampling data indicated no significant population size change for either virus lineage. Statistical model-based selection analyses showed evidence of purifying selection in both lineages. Positive selection was detected in NS-5 sequences for both lineages and envelope sequences for POW. Our findings confirm that POW and DTV sequences are relatively stable over time, which suggests strong evolutionary constraint, and support field observations that suggest that tick-borne flavivirus populations are extremely stable in enzootic foci.
Analysis of genomic sequences by Chaos Game Representation.
Almeida, J S; Carriço, J A; Maretzek, A; Noble, P A; Fletcher, M
2001-05-01
Chaos Game Representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to find the coordinates for their position in a continuous space. This distribution of positions has two properties: it is unique, and the source sequence can be recovered from the coordinates such that distance between positions measures similarity between the corresponding sequences. The possibility of using the latter property to identify succession schemes have been entirely overlooked in previous studies which raises the possibility that CGR may be upgraded from a mere representation technique to a sequence modeling tool. The distribution of positions in the CGR plane were shown to be a generalization of Markov chain probability tables that accommodates non-integer orders. Therefore, Markov models are particular cases of CGR models rather than the reverse, as currently accepted. In addition, the CGR generalization has both practical (computational efficiency) and fundamental (scale independence) advantages. These results are illustrated by using Escherichia coli K-12 as a test data-set, in particular, the genes thrA, thrB and thrC of the threonine operon.
Wilding, C.S.; Weetman, D.; Rippon, E.J.; Steen, K.; Mawejje, H.D.; Barsukov, I.; Donnelly, M.J.
2014-01-01
Insecticide resistance is a major impediment to the control of vectors and pests of public health importance and is a strongly selected trait capable of rapid spread, sometimes even between closely-related species. Elucidating the mechanisms generating insecticide resistance in mosquito vectors of disease, and understanding the spread of resistance within and between populations and species are vital for the development of robust resistance management strategies. Here we studied the mechanisms of resistance in two sympatric members of the Anopheles gambiae species complex – the major vector of malaria in sub-Saharan Africa – in order to understand how resistance has developed and spread in eastern Uganda, a region with some of the highest levels of malaria. In eastern Uganda, where the mosquitoes Anopheles arabiensis and An. gambiae can be found sympatrically, low levels of hybrids (0.4%) occur, offering a route for introgression of adaptively important variants between species. In independent microarray studies of insecticide resistance, Gste4, an insect-specific glutathione S-transferase, was among the most significantly up-regulated genes in both species. To test the hypothesis of interspecific introgression, we sequenced 2.3kbp encompassing Gste4. Whilst this detailed sequencing ruled out introgression, we detected strong positive selection acting on Gste4. However, these sequences, followed by haplotype-specific qPCR, showed that the apparent up-regulation in An. arabiensis is a result of allelic variation across the microarray probe binding sites which artefactually elevates the gene expression signal. Thus, face-value acceptance of microarray data can be misleading and it is advisable to conduct a more detailed investigation of the causes and nature of such signal. The identification of positive selection acting on this locus led us to functionally express and characterise allelic variants of GSTE4. Although the in vitro data do not support a direct role for GSTE4 in metabolism, they do support a role for this enzyme in insecticide sequestration. Thus, the demonstration of a role for an up-regulated gene in metabolic resistance to insecticides should not be limited to simply whether it can metabolise insecticide; such a strict criterion would argue against the involvement of GSTE4 despite the weight of evidence to the contrary. PMID:25213601
NASA Astrophysics Data System (ADS)
Houdayer, Cyril; Isono, Yusuke
2016-12-01
We investigate the asymptotic structure of (possibly type III) crossed product von Neumann algebras {M = B rtimes Γ} arising from arbitrary actions {Γ \\curvearrowright B} of bi-exact discrete groups (e.g. free groups) on amenable von Neumann algebras. We prove a spectral gap rigidity result for the central sequence algebra {N' \\cap M^ω} of any nonamenable von Neumann subalgebra with normal expectation {N subset M}. We use this result to show that for any strongly ergodic essentially free nonsingular action {Γ \\curvearrowright (X, μ)} of any bi-exact countable discrete group on a standard probability space, the corresponding group measure space factor {L^∞(X) rtimes Γ} has no nontrivial central sequence. Using recent results of Boutonnet et al. (Local spectral gap in simple Lie groups and applications, 2015), we construct, for every {0 < λ ≤ 1}, a type {III_λ} strongly ergodic essentially free nonsingular action {F_∞ \\curvearrowright (X_λ, μ_λ)} of the free group {{F}_∞} on a standard probability space so that the corresponding group measure space type {III_λ} factor {L^∞(X_λ, μ_λ) rtimes F_∞} has no nontrivial central sequence by our main result. In particular, we obtain the first examples of group measure space type {III} factors with no nontrivial central sequence.
Selection and Screening of DNA Aptamers for Inorganic Nanomaterials.
Zhou, Yibo; Huang, Zhicheng; Yang, Ronghua; Liu, Juewen
2018-02-21
Searching for DNA sequences that can strongly and selectively bind to inorganic surfaces is a long-standing topic in bionanotechnology, analytical chemistry and biointerface research. This can be achieved either by aptamer selection starting with a very large library of ≈10 14 random DNA sequences, or by careful screening of a much smaller library (usually from a few to a few hundred) with rationally designed sequences. Unlike typical molecular targets, inorganic surfaces often have quite strong DNA adsorption affinities due to polyvalent binding and even chemical interactions. This leads to a very high background binding making aptamer selection difficult. Screening, on the other hand, can be designed to compare relative binding affinities of different DNA sequences and could be more appropriate for inorganic surfaces. The resulting sequences have been used for DNA-directed assembly, sorting of carbon nanotubes, and DNA-controlled growth of inorganic nanomaterials. It was recently discovered that poly-cytosine (C) DNA can strongly bind to a diverse range of nanomaterials including nanocarbons (graphene oxide and carbon nanotubes), various metal oxides and transition-metal dichalcogenides. In this Concept article, we articulate the need for screening and potential artifacts associated with traditional aptamer selection methods for inorganic surfaces. Representative examples of application are discussed, and a few future research opportunities are proposed towards the end of this article. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Moebius Sequence and Autism Spectrum Disorders--Less Frequently Associated than Formerly Thought
ERIC Educational Resources Information Center
Briegel, Wolfgang; Schimek, Martina; Kamp-Becker, Inge
2010-01-01
Moebius sequence is a rare congenital disorder usually defined as a combination of facial weakness with impairment of ocular abduction. It is questionable, whether there is a strong association of the sequence with autism spectrum disorders (ASDs) as suggested in some earlier case reports and studies. Twenty-two participants with Moebius sequence…
Jiang, Ke; Zhang, Peng
2011-01-01
TRPA1 is a calcium ion channel protein recently identified as the infrared receptor in pit organ-containing snakes. Therefore, understanding the molecular evolution of TRPA1 may help to illuminate the origin of “heat vision” in snakes and reveal the molecular mechanism of infrared sensitivity for TRPA1. To this end, we sequenced the infrared sensory gene TRPA1 in 24 snake species, representing nine snake families and multiple non-snake outgroups. We found that TRPA1 is under strong positive selection in the pit-bearing snakes studied, but not in other non-pit snakes and non-snake vertebrates. As a comparison, TRPV1, a gene closely related to TRPA1, was found to be under strong purifying selection in all the species studied, with no difference in the strength of selection between pit-bearing snakes and non-pit snakes. This finding demonstrates that the adaptive evolution of TRPA1 specifically occurred within the pit-bearing snakes and may be related to the functional modification for detecting infrared radiation. In addition, by comparing the TRPA1 protein sequences, we identified 11 amino acid sites that were diverged in pit-bearing snakes but conserved in non-pit snakes and other vertebrates, 21 sites that were diverged only within pit-vipers but conserved in the remaining snakes. These specific amino acid substitutions may be potentially functional important for infrared sensing. PMID:22163322
Genome-wide signals of positive selection in human evolution
Enard, David; Messer, Philipp W.; Petrov, Dmitri A.
2014-01-01
The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes. PMID:24619126
Adaptive signals in algal Rubisco reveal a history of ancient atmospheric carbon dioxide.
Young, J N; Rickaby, R E M; Kapralov, M V; Filatov, D A
2012-02-19
Rubisco, the most abundant enzyme on the Earth and responsible for all photosynthetic carbon fixation, is often thought of as a highly conserved and sluggish enzyme. Yet, different algal Rubiscos demonstrate a range of kinetic properties hinting at a history of evolution and adaptation. Here, we show that algal Rubisco has indeed evolved adaptively during ancient and distinct geological periods. Using DNA sequences of extant marine algae of the red and Chromista lineage, we define positive selection within the large subunit of Rubisco, encoded by rbcL, to occur basal to the radiation of modern marine groups. This signal of positive selection appears to be responding to changing intracellular concentrations of carbon dioxide (CO(2)) triggered by physiological adaptations to declining atmospheric CO(2). Within the ecologically important Haptophyta (including coccolithophores) and Bacillariophyta (diatoms), positive selection occurred consistently during periods of falling Phanerozoic CO(2) and suggests emergence of carbon-concentrating mechanisms. During the Proterozoic, a strong signal of positive selection after secondary endosymbiosis occurs at the origin of the Chromista lineage (approx. 1.1 Ga), with further positive selection events until 0.41 Ga, implying a significant and continuous decrease in atmospheric CO(2) encompassing the Cryogenian Snowball Earth events. We surmise that positive selection in Rubisco has been caused by declines in atmospheric CO(2) and hence acts as a proxy for ancient atmospheric CO(2).
Adaptive signals in algal Rubisco reveal a history of ancient atmospheric carbon dioxide
Young, J. N.; Rickaby, R. E. M.; Kapralov, M. V.; Filatov, D. A.
2012-01-01
Rubisco, the most abundant enzyme on the Earth and responsible for all photosynthetic carbon fixation, is often thought of as a highly conserved and sluggish enzyme. Yet, different algal Rubiscos demonstrate a range of kinetic properties hinting at a history of evolution and adaptation. Here, we show that algal Rubisco has indeed evolved adaptively during ancient and distinct geological periods. Using DNA sequences of extant marine algae of the red and Chromista lineage, we define positive selection within the large subunit of Rubisco, encoded by rbcL, to occur basal to the radiation of modern marine groups. This signal of positive selection appears to be responding to changing intracellular concentrations of carbon dioxide (CO2) triggered by physiological adaptations to declining atmospheric CO2. Within the ecologically important Haptophyta (including coccolithophores) and Bacillariophyta (diatoms), positive selection occurred consistently during periods of falling Phanerozoic CO2 and suggests emergence of carbon-concentrating mechanisms. During the Proterozoic, a strong signal of positive selection after secondary endosymbiosis occurs at the origin of the Chromista lineage (approx. 1.1 Ga), with further positive selection events until 0.41 Ga, implying a significant and continuous decrease in atmospheric CO2 encompassing the Cryogenian Snowball Earth events. We surmise that positive selection in Rubisco has been caused by declines in atmospheric CO2 and hence acts as a proxy for ancient atmospheric CO2. PMID:22232761
Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Takemae, Hitoshi; Simking, Pacharathon; Jittapalapong, Sathaporn; Igarashi, Ikuo; Yokoyama, Naoaki
2016-07-01
Babesia bovis, an intraerythrocytic protozoan parasite, causes severe clinical disease in cattle worldwide. The genetic diversity of parasite antigens often results in different immune profiles in infected animals, hindering efforts to develop immune control methodologies against the B. bovis infection. In this study, we analyzed the genetic diversity of the merozoite surface antigen-1 (msa-1) gene using 162 B. bovis-positive blood DNA samples sourced from cattle populations reared in different geographical regions of Thailand. The identity scores shared among 93 msa-1 gene sequences isolated by PCR amplification were 43.5-100%, and the similarity values among the translated amino acid sequences were 42.8-100%. Of 23 total clades detected in our phylogenetic analysis, Thai msa-1 gene sequences occurred in 18 clades; seven among them were composed of sequences exclusively from Thailand. To investigate differential antigenicity of isolated MSA-1 proteins, we expressed and purified eight recombinant MSA-1 (rMSA-1) proteins, including an rMSA-1 from B. bovis Texas (T2Bo) strain and seven rMSA-1 proteins based on the Thai msa-1 sequences. When these antigens were analyzed in a western blot assay, anti-T2Bo cattle serum strongly reacted with the rMSA-1 from T2Bo, as well as with three other rMSA-1 proteins that shared 54.9-68.4% sequence similarity with T2Bo MSA-1. In contrast, no or weak reactivity was observed for the remaining rMSA-1 proteins, which shared low sequence similarity (35.0-39.7%) with T2Bo MSA-1. While demonstrating the high genetic diversity of the B. bovis msa-1 gene in Thailand, the present findings suggest that the genetic diversity results in antigenicity variations among the MSA-1 antigens of B. bovis in Thailand. Copyright © 2016 Elsevier B.V. All rights reserved.
Fletcher, Simon P; Ali, Iraj K; Kaminski, Ann; Digard, Paul; Jackson, Richard J
2002-01-01
Classical swine fever virus (CSFV) is a member of the pestivirus family, which shares many features in common with hepatitis C virus (HCV). It is shown here that CSFV has an exceptionally efficient cis-acting internal ribosome entry segment (IRES), which, like that of HCV, is strongly influenced by the sequences immediately downstream of the initiation codon, and is optimal with viral coding sequences in this position. Constructs that retained 17 or more codons of viral coding sequence exhibited full IRES activity, but with only 12 codons, activity was approximately 66% of maximum in vitro (though close to maximum in transfected BHK cells), whereas with just 3 codons or fewer, the activity was only approximately 15% of maximum. The minimal coding region elements required for high activity were exchanged between HCV and CSFV. Although maximum activity was observed in each case with the homologous combination of coding region and 5' UTR, the heterologous combinations were sufficiently active to rule out a highly specific functional interplay between the 5' UTR and coding sequences. On the other hand, inversion of the coding sequences resulted in low IRES activity, particularly with the HCV coding sequences. RNA structure probing showed that the efficiency of internal initiation of these chimeric constructs correlated most closely with the degree of single-strandedness of the region around and immediately downstream of the initiation codon. The low activity IRESs could not be rescued by addition of supplementary eIF4A (the initiation factor with ATP-dependent RNA helicase activity). The extreme sensitivity to secondary structure around the initiation codon is likely to be due to the fact that the eIF4F complex (which has eIF4A as one of its subunits) is not required for and does not participate in initiation on these IRESs. PMID:12515388
The last Viking King: a royal maternity case solved by ancient DNA analysis.
Dissing, Jørgen; Binladen, Jonas; Hansen, Anders; Sejrsen, Birgitte; Willerslev, Eske; Lynnerup, Niels
2007-02-14
The last of the Danish Viking Kings, Sven Estridsen, died in a.d. 1074 and is entombed in Roskilde Cathedral with other Danish kings and queens. Sven's mother, Estrid, is entombed in a pillar across the chancel. However, while there is no reasonable doubt about the identity of Sven, there have been doubts among historians whether the woman entombed was indeed Estrid. To shed light on this problem, we have extracted and analysed mitochondrial DNA (mtDNA) from pulp of teeth from each of the two royals. Four overlapping DNA-fragments covering about 400bp of hypervariable region 1 (HVR-1) of the D-loop were PCR amplified, cloned and a number of clones with each segment were sequenced. Also a segment containing the H/non-H specific nucleotide 7028 was sequenced. Consensus sequences were determined and D-loop results were replicated in an independent laboratory. This allowed the assignment of King Sven Estridsen to haplogroup H; Estrid's sequence differed from that of Sven at two positions in HVR-1, 16093T-->C and 16304T-->C, indicating that she belongs to subgroup H5a. Given the maternal inheritance of mtDNA, offspring will have the same mtDNA sequence as their mother with the exception of rare cases where the sequence has been altered by a germ line mutation. Therefore, the observation of two sequence differences makes it highly unlikely that the entombed woman was the mother of Sven. In addition, physical examination of the skeleton and the teeth strongly indicated that this woman was much younger (approximately 35 years) at the time of death than the 70 years history records tell. Although the entombed woman cannot be the Estrid, she may well be one of Sven's two daughters-in-law who were also called Estrid and who both became queens.
Lazar, Zbigniew; Rossignol, Tristan; Verbeke, Jonathan; Crutz-Le Coq, Anne-Marie; Nicaud, Jean-Marc; Robak, Małgorzata
2013-11-01
Yarrowia lipolytica requires the expression of a heterologous invertase to grow on a sucrose-based substrate. This work reports the construction of an optimized invertase expression cassette composed of Saccharomyces cerevisiae Suc2p secretion signal sequence followed by the SUC2 sequence and under the control of the strong Y. lipolytica pTEF promoter. This new construction allows a fast and optimal cleavage of sucrose into glucose and fructose and allows cells to reach the maximum growth rate. Contrary to pre-existing constructions, the expression of SUC2 is not sensitive to medium composition in this context. The strain JMY2593, expressing this new cassette with an optimized secretion signal sequence and a strong promoter, produces 4,519 U/l of extracellular invertase in bioreactor experiments compared to 597 U/l in a strain expressing the former invertase construction. The expression of this cassette strongly improved production of invertase and is suitable for simultaneously high production level of citric acid from sucrose-based media.
Molecular Methods To Improve Diagnosis and Identification of Mucormycosis▿
Hammond, Sarah P.; Bialek, Ralf; Milner, Danny A.; Petschnigg, Eva M.; Baden, Lindsey R.; Marty, Francisco M.
2011-01-01
Mucormycosis is difficult to diagnose. Samples from suspected cases often fail to grow Mucorales in microbiologic cultures. We identified all hematologic malignancy and stem cell transplant patients diagnosed with proven mucormycosis between 2001 and 2009 at Brigham and Women's Hospital/Dana-Farber Cancer Institute. Seminested PCR targeting Mucorales 18S ribosomal DNA and sequencing were performed on formalin-fixed paraffin-embedded tissue samples. Of 29 cases of mucormycosis, 27 had tissue samples available for PCR and sequencing. Mucorales PCR was positive in 22. Among 12 culture-positive cases, 10 were PCR positive and sequencing was concordant with culture results to the genus level in 9. Among 15 culture-negative cases, PCR was positive and sequencing allowed genus identification in 12. Mucorales PCR is useful for confirmation of the diagnosis of mucormycosis and for further characterization of the infection in cases where cultures are negative. PMID:21508149
Molecular methods to improve diagnosis and identification of mucormycosis.
Hammond, Sarah P; Bialek, Ralf; Milner, Danny A; Petschnigg, Eva M; Baden, Lindsey R; Marty, Francisco M
2011-06-01
Mucormycosis is difficult to diagnose. Samples from suspected cases often fail to grow Mucorales in microbiologic cultures. We identified all hematologic malignancy and stem cell transplant patients diagnosed with proven mucormycosis between 2001 and 2009 at Brigham and Women's Hospital/Dana-Farber Cancer Institute. Seminested PCR targeting Mucorales 18S ribosomal DNA and sequencing were performed on formalin-fixed paraffin-embedded tissue samples. Of 29 cases of mucormycosis, 27 had tissue samples available for PCR and sequencing. Mucorales PCR was positive in 22. Among 12 culture-positive cases, 10 were PCR positive and sequencing was concordant with culture results to the genus level in 9. Among 15 culture-negative cases, PCR was positive and sequencing allowed genus identification in 12. Mucorales PCR is useful for confirmation of the diagnosis of mucormycosis and for further characterization of the infection in cases where cultures are negative.
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
Draft genome sequence of Therminicola potens strain JR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Byrne-Bailey, K.G.; Wrighton, K.C.; Melnyk, R.A.
'Thermincola potens' strain JR is one of the first Gram-positive dissimilatory metal-reducing bacteria (DMRB) for which there is a complete genome sequence. Consistent with the physiology of this organism, preliminary annotation revealed an abundance of multiheme c-type cytochromes that are putatively associated with the periplasm and cell surface in a Gram-positive bacterium. Here we report the complete genome sequence of strain JR.
Fils-Lycaon, B R; Wiersma, P A; Eastwell, K C; Sautiere, P
1996-05-01
A 29-kD polypeptide is the most abundant soluble protein in ripe cherry fruit (Prunus avium L); accumulation begins at the onset of ripening as the fruit turns from yellow to red. This protein was extracted from ripe cherries and purified by size-exclusion and ion-exchange chromatography. Antibodies to the purified protein were used to screen a cDNA library from ripe cherries. Numerous recombinant plaques reacted positively with the antibodies; the DNA sequence of representative clones encoded a polypeptide of 245 amino acid residues. A signal peptide was indicated, and the predicted mature protein corresponded to the purified protein in size (23.3 kD, by mass spectrometry) and isoelectric point (4.2). A search of known protein sequences revealed a strong similarity between this polypeptide and the thaumatin family of pathogenesis-related proteins. The cherry thaumatin-like protein does not have a sweet taste, and no antifungal activity was seen in preliminary assays. Expression of the protein appears to be regulated at the gene level, with mRNA levels at their highest in the ripe fruit.
Fils-Lycaon, B R; Wiersma, P A; Eastwell, K C; Sautiere, P
1996-01-01
A 29-kD polypeptide is the most abundant soluble protein in ripe cherry fruit (Prunus avium L); accumulation begins at the onset of ripening as the fruit turns from yellow to red. This protein was extracted from ripe cherries and purified by size-exclusion and ion-exchange chromatography. Antibodies to the purified protein were used to screen a cDNA library from ripe cherries. Numerous recombinant plaques reacted positively with the antibodies; the DNA sequence of representative clones encoded a polypeptide of 245 amino acid residues. A signal peptide was indicated, and the predicted mature protein corresponded to the purified protein in size (23.3 kD, by mass spectrometry) and isoelectric point (4.2). A search of known protein sequences revealed a strong similarity between this polypeptide and the thaumatin family of pathogenesis-related proteins. The cherry thaumatin-like protein does not have a sweet taste, and no antifungal activity was seen in preliminary assays. Expression of the protein appears to be regulated at the gene level, with mRNA levels at their highest in the ripe fruit. PMID:8685266
The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Ruiying; Zheng, Han; Preamplume, Gan
The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less
Human papillomavirus 33 worldwide genetic variation and associated risk of cervical cancer
Chen, Alyce A.; Heideman, Daniëlle A.M.; Boon, Debby; Chen, Zigui; Burk, Robert D.; De Vuyst, Hugo; Gheit, Tarik; Snijders, Peter J.F.; Tommasino, Massimo; Franceschi, Silvia; Clifford, Gary M.
2014-01-01
Human papillomavirus (HPV) 33, a member of the HPV16-related alpha-9 species group, is found in approximately 5% of cervical cancers worldwide. The current study aimed to characterize the genetic diversity of HPV33 and to explore the association of HPV33 variants with the risk for cervical cancer. Taking advantage of the International Agency for Research on Cancer biobank, we sequenced the entire E6 and E7 open reading frames of 213 HPV33-positive cervical samples from 30 countries. We identified 28 HPV33 variants that formed 5 phylogenetic groups: the previously identified A1, A2, and B (sub) lineages and the novel A3 and C (sub)lineages. The A1 sublineage was strongly over-represented in cervical cases compared to controls in both Africa and Europe. In conclusion, we provide a classification system for HPV33 variants based on the sequence of E6 and E7 and suggest that the association of HPV33 with cervical cancer may differ by variant (sub)lineage. PMID:24314666
An enriched multimedia eBook application to facilitate learning of anatomy.
Stirling, Allan; Birt, James
2014-01-01
This pilot study compared the use of an enriched multimedia eBook with traditional methods for teaching the gross anatomy of the heart and great vessels. Seventy-one first-year students from an Australian medical school participated in the study. Students' abilities were examined by pretest, intervention, and post-test measurements. Perceptions and attitudes toward eBook technology were examined by survey questions. Results indicated a strongly positive user experience coupled with increased marks; however, there were no statistically significant results for the eBook method of delivery alone outperforming the traditional anatomy practical session. Results did show a statistically significant difference in the final marks achieved based on the sequencing of the learning modalities. With initial interaction with the multimedia content followed by active experimentation in the anatomy lab, students' performance was improved in the final test. Obtained data support the role of eBook technology in modern anatomy curriculum being a useful adjunct to traditional methods. Further study is needed to investigate the importance of sequencing of teaching interventions. © 2013 American Association of Anatomists.
Smeragliuolo, Anna H.; Long, John Davis; Bumanlag, Silverio Joseph; He, Victor; Lampe, Anna
2017-01-01
The objective of this study was to determine whether kinematic data collected by the Microsoft Kinect 2 (MK2) could be used to quantify postural stability in healthy subjects. Twelve subjects were recruited for the project, and were instructed to perform a sequence of simple postural stability tasks. The movement sequence was performed as subjects were seated on top of a force platform, and the MK2 was positioned in front of them. This sequence of tasks was performed by each subject under three different postural conditions: “both feet on the ground” (1), “One foot off the ground” (2), and “both feet off the ground” (3). We compared force platform and MK2 data to quantify the degree to which the MK2 was returning reliable data across subjects. We then applied a novel machine-learning paradigm to the MK2 data in order to determine the extent to which data from the MK2 could be used to reliably classify different postural conditions. Our initial comparison of force plate and MK2 data showed a strong agreement between the two devices, with strong Pearson correlations between the trunk centroids “Spine_Mid” (0.85 ± 0.06), “Neck” (0.86 ± 0.07) and “Head” (0.87 ± 0.07), and the center of pressure centroid inferred by the force platform. Mean accuracy for the machine learning classifier from MK2 was 97.0%, with a specific classification accuracy breakdown of 90.9%, 100%, and 100% for conditions 1 through 3, respectively. Mean accuracy for the machine learning classifier derived from the force platform data was lower at 84.4%. We conclude that data from the MK2 has sufficient information content to allow us to classify sequences of tasks being performed under different levels of postural stability. Future studies will focus on validating this protocol on large populations of individuals with actual balance impairments in order to create a toolkit that is clinically validated and available to the medical community. PMID:28196139
Dehbandi, Behdad; Barachant, Alexandre; Smeragliuolo, Anna H; Long, John Davis; Bumanlag, Silverio Joseph; He, Victor; Lampe, Anna; Putrino, David
2017-01-01
The objective of this study was to determine whether kinematic data collected by the Microsoft Kinect 2 (MK2) could be used to quantify postural stability in healthy subjects. Twelve subjects were recruited for the project, and were instructed to perform a sequence of simple postural stability tasks. The movement sequence was performed as subjects were seated on top of a force platform, and the MK2 was positioned in front of them. This sequence of tasks was performed by each subject under three different postural conditions: "both feet on the ground" (1), "One foot off the ground" (2), and "both feet off the ground" (3). We compared force platform and MK2 data to quantify the degree to which the MK2 was returning reliable data across subjects. We then applied a novel machine-learning paradigm to the MK2 data in order to determine the extent to which data from the MK2 could be used to reliably classify different postural conditions. Our initial comparison of force plate and MK2 data showed a strong agreement between the two devices, with strong Pearson correlations between the trunk centroids "Spine_Mid" (0.85 ± 0.06), "Neck" (0.86 ± 0.07) and "Head" (0.87 ± 0.07), and the center of pressure centroid inferred by the force platform. Mean accuracy for the machine learning classifier from MK2 was 97.0%, with a specific classification accuracy breakdown of 90.9%, 100%, and 100% for conditions 1 through 3, respectively. Mean accuracy for the machine learning classifier derived from the force platform data was lower at 84.4%. We conclude that data from the MK2 has sufficient information content to allow us to classify sequences of tasks being performed under different levels of postural stability. Future studies will focus on validating this protocol on large populations of individuals with actual balance impairments in order to create a toolkit that is clinically validated and available to the medical community.
Implications of the dependence of the elastic properties of DNA on nucleotide sequence.
Olson, Wilma K; Swigon, David; Coleman, Bernard D
2004-07-15
Recent advances in structural biochemistry have provided evidence that not only the geometric properties but also the elastic moduli of duplex DNA are strongly dependent on nucleotide sequence in a way that is not accounted for by classical rod models of the Kirchhoff type. A theory of sequence-dependent DNA elasticity is employed here to calculate the dependence of the equilibrium configurations of circular DNA on the binding of ligands that can induce changes in intrinsic twist at a single base-pair step. Calculations are presented of the influence on configurations of the assumed values and distribution along the DNA of intrinsic roll and twist and a modulus coupling roll to twist. Among the results obtained are the following. For minicircles formed from intrinsically straight DNA, the distribution of roll-twist coupling strongly affects the dependence of the total elastic energy Psi on the amount alpha of imposed untwisting, and that dependence can be far from quadratic. (In fact, for a periodic distribution of roll-twist coupling with a period equal to the intrinsic helical repeat length, Psi can be essentially independent of alpha for -90 degrees < alpha <90 degrees.) When the minicircle is homogeneous and without roll-twist coupling, but with uniform positive intrinsic roll, the point at which Psi attains its minimum value shifts towards negative values of alpha. It is remarked that there are cases in which one can relate graphs of Psi versus alpha to the 'effective values' of bending and twisting moduli and helical repeat length obtained from measurements of equilibrium distributions of topoisomers and probabilities of ring closure. For a minicircle formed from DNA that has an 'S' shape when stress-free, the graphs of Psi versus alpha have maxima at alpha = 0. As the binding of a twisting agent to such a minicircle results in a net decrease in Psi, the affinity of the twisting agent for binding to the minicircle is greater than its affinity for binding to unconstrained DNA with the same sequence.
USDA-ARS?s Scientific Manuscript database
Escherichia coli O157:H7 strain 86-24 does not produce curli fimbriae, but can give rise to curli-positive isolates at a variable frequency. Here, we report the whole-genome sequences of curli-negative and curli-positive isolates of strain 86-24....
Bayles, Darrell O.; Alt, David P.; Looft, Torey
2016-01-01
Escherichia coli O157:H7 strain 86-24 does not produce curli fimbriae, but gives rise to curli-positive isolates at a variable frequency. Here, we report the complete genome sequences of curli-negative and curli-positive isolates of strain 86-24. PMID:27979932
Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas
2009-01-01
Background and Aims Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. Large intercalary telomere-like arrays suggest chromosomal rearrangements in D. aphyllum while the number and localization of rRNA genes as well as the species-specific distribution pattern of an abundant microsatellite reflect the genomic diversity of the three Dendrobium species. PMID:19635741
Deep sequencing in library selection projects: what insight does it bring?
Glanville, J; D'Angelo, S; Khan, T A; Reddy, S T; Naranjo, L; Ferrara, F; Bradbury, A R M
2015-08-01
High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Deep sequencing in library selection projects: what insight does it bring?
Glanville, J; D’Angelo, S; Khan, T.A.; Reddy, S. T.; Naranjo, L.; Ferrara, F.; Bradbury, A.R.M.
2015-01-01
High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. PMID:26451649
Atibalentja, N; Noel, G R; Domier, L L
2000-03-01
A 1341 bp sequence of the 16S rDNA of an undescribed species of Pasteuria that parasitizes the soybean cyst nematode, Heterodera glycines, was determined and then compared with a homologous sequence of Pasteuria ramosa, a parasite of cladoceran water fleas of the family Daphnidae. The two Pasteuria sequences, which diverged from each other by a dissimilarity index of 7%, also were compared with the 16S rDNA sequences of 30 other bacterial species to determine the phylogenetic position of the genus Pasteuria among the Gram-positive eubacteria. Phylogenetic analyses using maximum-likelihood, maximum-parsimony and neighbour-joining methods showed that the Heterodera glycines-infecting Pasteuria and its sister species, P. ramosa, form a distinct line of descent within the Alicyclobacillus group of the Bacillaceae. These results are consistent with the view that the genus Pasteuria is a deeply rooted member of the Clostridium-Bacillus-Streptococcus branch of the Gram-positive eubacteria, neither related to the actinomycetes nor closely related to true endospore-forming bacteria.
Ferry, Alissa L; Fló, Ana; Brusini, Perrine; Cattarossi, Luigi; Macagno, Francesco; Nespor, Marina; Mehler, Jacques
2016-05-01
To understand language, humans must encode information from rapid, sequential streams of syllables - tracking their order and organizing them into words, phrases, and sentences. We used Near-Infrared Spectroscopy (NIRS) to determine whether human neonates are born with the capacity to track the positions of syllables in multisyllabic sequences. After familiarization with a six-syllable sequence, the neonate brain responded to the change (as shown by an increase in oxy-hemoglobin) when the two edge syllables switched positions but not when two middle syllables switched positions (Experiment 1), indicating that they encoded the syllables at the edges of sequences better than those in the middle. Moreover, when a 25 ms pause was inserted between the middle syllables as a segmentation cue, neonates' brains were sensitive to the change (Experiment 2), indicating that subtle cues in speech can signal a boundary, with enhanced encoding of the syllables located at the edges of that boundary. These findings suggest that neonates' brains can encode information from multisyllabic sequences and that this encoding is constrained. Moreover, subtle segmentation cues in a sequence of syllables provide a mechanism with which to accurately encode positional information from longer sequences. Tracking the order of syllables is necessary to understand language and our results suggest that the foundations for this encoding are present at birth. © 2015 John Wiley & Sons Ltd.
Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier
2016-01-04
The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Harrisson, K; Pavlova, A; Gan, H M; Lee, Y P; Austin, C M; Sunnucks, P
2016-06-01
Climatic differences across a taxon's range may be associated with specific bioenergetic demands and may result in genetics-based metabolic adaptation, particularly in aquatic ectothermic organisms that rely on heat exchange with the environment to regulate key physiological processes. Extending down the east coast of Australia, the Great Dividing Range (GDR) has a strong influence on climate and the evolutionary history of freshwater fish species. Despite the GDR acting as a strong contemporary barrier to fish movement, many species, and species with shared ancestries, are found on both sides of the GDR, indicative of historical dispersal events. We sequenced complete mitogenomes from the four extant species of the freshwater cod genus Maccullochella, two of which occur on the semi-arid, inland side of the GDR, and two on the mesic coastal side. We constructed a dated phylogeny and explored the relative influences of purifying and positive selection in the evolution of mitogenome divergence among species. Results supported mid- to late-Pleistocene divergence of Maccullochella across the GDR (220-710 thousand years ago), bringing forward previously reported dates. Against a background of pervasive purifying selection, we detected potentially functionally relevant fixed amino acid differences across the GDR. Although many amino acid differences between inland and coastal species may have become fixed under relaxed purifying selection in coastal environments rather than positive selection, there was evidence of episodic positive selection acting on specific codons in the Mary River coastal lineage, which has consistently experienced the warmest and least extreme climate in the genus.
Harrisson, K; Pavlova, A; Gan, H M; Lee, Y P; Austin, C M; Sunnucks, P
2016-01-01
Climatic differences across a taxon's range may be associated with specific bioenergetic demands and may result in genetics-based metabolic adaptation, particularly in aquatic ectothermic organisms that rely on heat exchange with the environment to regulate key physiological processes. Extending down the east coast of Australia, the Great Dividing Range (GDR) has a strong influence on climate and the evolutionary history of freshwater fish species. Despite the GDR acting as a strong contemporary barrier to fish movement, many species, and species with shared ancestries, are found on both sides of the GDR, indicative of historical dispersal events. We sequenced complete mitogenomes from the four extant species of the freshwater cod genus Maccullochella, two of which occur on the semi-arid, inland side of the GDR, and two on the mesic coastal side. We constructed a dated phylogeny and explored the relative influences of purifying and positive selection in the evolution of mitogenome divergence among species. Results supported mid- to late-Pleistocene divergence of Maccullochella across the GDR (220–710 thousand years ago), bringing forward previously reported dates. Against a background of pervasive purifying selection, we detected potentially functionally relevant fixed amino acid differences across the GDR. Although many amino acid differences between inland and coastal species may have become fixed under relaxed purifying selection in coastal environments rather than positive selection, there was evidence of episodic positive selection acting on specific codons in the Mary River coastal lineage, which has consistently experienced the warmest and least extreme climate in the genus. PMID:26883183
Isolation and Identification of Active Compounds from Papaya Plants and Activities as Antimicrobial
NASA Astrophysics Data System (ADS)
Prasetya, A. T.; Mursiti, S.; Maryan, S.; Jati, N. K.
2018-04-01
Extraction and isolation of papaya seeds and leaves (Carica papaya L) has been performed using n-hexane and ethanol solvents. Further isolation of the extract obtained using ethyl acetate and diethyl ether solvents. The result of the phytochemical test of papaya extract obtained by mixture of an active compound of flavonoids, alkaloids, tannins, steroids, and saponins. Ethyl acetate isolates containing only flavonoids and diethyl ether isolates contain only alkaloids. Extracts and isolates from papaya plants had gram-positive antibacterial activity greater than the gram-negative bacteria, but both did not have antifungal activity. Papaya extracts have greater antibacterial activity than flavonoid isolates and alkaloid isolates. Strong antibacterial inhibitory sequences are extracts of papaya plants, flavonoid isolates, and alkaloid isolates.
Genotyping of feline leukemia virus in Mexican housecats.
Ramírez, Hugo; Autran, Marcela; García, M Martha; Carmona, M Ángel; Rodríguez, Cecilia; Martínez, H Alejandro
2016-04-01
Feline leukemia virus (FeLV) is a retrovirus with variable rates of infection globally. DNA was obtained from cats' peripheral blood mononuclear cells, and proviral DNA of pol and env genes was detected using PCR. Seventy-six percent of cats scored positive for FeLV using env-PCR; and 54 %, by pol-PCR. Phylogenetic analysis of both regions identified sequences that correspond to a group that includes endogenous retroviruses. They form an independent branch and, therefore, a new group of endogenous viruses. Cat gender, age, outdoor access, and cohabitation with other cats were found to be significant risk factors associated with the disease. This strongly suggests that these FeLV genotypes are widely distributed in the studied feline population in Mexico.
Funke, Guido; Lawson, Paul A.; Collins, Matthew D.
1998-01-01
Four strains of an unknown coryneform bacterium were isolated in pure culture from females with urinary tract infections. Strong urease activity and the ability to slowly ferment maltose but not glucose were the most significant phenotypic features of this catalase-positive, nonmotile, nonlipophilic, rod-shaped bacterium which served to distinguish it from all other presently defined coryneform bacteria. Chemotaxonomic investigations demonstrated that the unknown bacterium belonged to the genus Corynebacterium. Comparative 16S rRNA gene sequence analysis revealed that the isolates were genealogically identical and represented a new subline within the genus Corynebacterium, for which the designation Corynebacterium riegelii sp. nov. is proposed. The type strain of Corynebacterium riegelii is CCUG 38180 (DSM 44326, CIP 105310). PMID:9508284
Endemic Transmission of Visceral Leishmaniasis in Bhutan
Yangzom, Thinley; Cruz, Israel; Bern, Caryn; Argaw, Daniel; den Boer, Margriet; Vélez, Iván Dario; Bhattacharya, Sujit K.; Molina, Ricardo; Alvar, Jorge
2012-01-01
Visceral leishmaniasis was first reported in Bhutan in 2006. We conducted studies of the parasite, possible vectors and reservoirs, and leishmanin skin test and risk factor surveys in three villages. Nineteen cases were reported from seven districts. Parasite typing yielded two novel microsatellite sequences, both related to Indian L. donovani. In one case village, 40 (18.5%) of 216 participants had positive leishmanin skin test results, compared with 3 (4.2%) of 72 in the other case village and 0 of 108 in the control village. Positive results were strongly associated with the village and increasing age. None of the tested dogs were infected. Eighteen sand flies were collected, 13 Phlebotomus species and 5 Sergentomyia species; polymerase chain reaction for leishmanial DNA was negative. This assessment suggests that endemic visceral leishmaniasis transmission has occurred in diverse locations in Bhutan. Surveillance, case investigations, and further parasite, vector, and reservoir studies are needed. The potential protective impact of bed nets should be evaluated. PMID:23091191
Detection and Genome Analysis of a Lineage III Peste Des Petits Ruminants Virus in Kenya in 2011.
Dundon, W G; Kihu, S M; Gitao, G C; Bebora, L C; John, N M; Oyugi, J O; Loitsch, A; Diallo, A
2017-04-01
In May 2011 in Turkana County, north-western Kenya, tissue samples were collected from goats suspected of having died of peste des petits ruminant (PPR) disease, an acute viral disease of small ruminants. The samples were processed and tested by reverse transcriptase PCR for the presence of PPR viral RNA. The positive samples were sequenced and identified as belonging to peste des petits ruminants virus (PPRV) lineage III. Full-genome analysis of one of the positive samples revealed that the virus causing disease in Kenya in 2011 was 95.7% identical to the full genome of a virus isolated in Uganda in 2012 and that a segment of the viral fusion gene was 100% identical to that of a virus circulating in Tanzania in 2013. These data strongly indicate transboundary movement of lineage III viruses between Eastern Africa countries and have significant implications for surveillance and control of this important disease as it moves southwards in Africa. © 2015 Blackwell Verlag GmbH.
Listening to the customer: implementing a patient satisfaction measurement system.
Cohen, L; Delaney, P; Boston, P
1994-01-01
Patient satisfaction is an important issue in positioning ambulatory medical services. An effective patient satisfaction measurement program not only helps hospital managers improve the quality of clinical and administrative activities, but also helps the hospital remain viable in increasingly competitive markets. A method for the design and measurement of patient satisfaction with outpatient Endoscopy Lab services is described in this article. The survey focuses on the sequence of events experienced by the patient. Outcome measures of primary interest include global patient satisfaction and the likelihood of using the service again if given a choice. Analysis of patient responses shows that global satisfaction with the outpatient experience is positively associated with service return intention. Additional analysis shows that facility cleanliness, privacy and nurse attention are most strongly associated with global patient satisfaction. Results underscore the importance of various service attributes on patient satisfaction and return intention and of the need to further expand the uses of patient satisfaction measurement in the outpatient Endoscopy Lab.
Bezos, Javier; Álvarez-Carrión, Beatriz; Rodríguez-Bertos, Antonio; Fernández-Manzano, Álvaro; de Juan, Lucía; Huguet, Cristina; Briones, Víctor; Romero, Beatriz
2016-12-01
The infection caused by the zoonotic opportunistic pathogen Mycobacterium avium subsp. hominissuis (Mah) was reported for the first time in a pet ferret. Both owners were HIV-positive. Euthanasia of the pet was recommended due to medical reasons and as a preventive action. Disseminated and open tuberculosis lesions were observed in the gastrointestinal and respiratory systems of the ferret. Ecographic and radiographic surveys showed a severe generalized lymphadenopathy, strong thickening of the gastric wall and peritoneum layer. The histopathological findings revealed a disseminated, granulomatous, chronic inflammation affecting the gastrointestinal tract, lungs, lymphoid tissues (spleen, tonsils and lymph nodes) and liver. Ziehl-Neelsen staining displayed the presence of positive acid-fast bacilli within these granulomas. Bacteriology and sequencing of the isolates yielded Mah sequevar code 3. Ferrets can act as reservoirs of mycobacteria exposing their owners to the infection, which is of major concern in immunodeficient individuals, as those HIV-infected. Copyright © 2016 Elsevier Ltd. All rights reserved.
Phylogenetic affinity of tree shrews to Glires is attributed to fast evolution rate.
Lin, Jiannan; Chen, Guangfeng; Gu, Liang; Shen, Yuefeng; Zheng, Meizhu; Zheng, Weisheng; Hu, Xinjie; Zhang, Xiaobai; Qiu, Yu; Liu, Xiaoqing; Jiang, Cizhong
2014-02-01
Previous phylogenetic analyses have led to incongruent evolutionary relationships between tree shrews and other suborders of Euarchontoglires. What caused the incongruence remains elusive. In this study, we identified 6845 orthologous genes between seventeen placental mammals. Tree shrews and Primates were monophyletic in the phylogenetic trees derived from the first or/and second codon positions whereas tree shrews and Glires formed a monophyly in the trees derived from the third or all codon positions. The same topology was obtained in the phylogeny inference using the slowly and fast evolving genes, respectively. This incongruence was likely attributed to the fast substitution rate in tree shrews and Glires. Notably, sequence GC content only was not informative to resolve the controversial phylogenetic relationships between tree shrews, Glires, and Primates. Finally, estimation in the confidence of the tree selection strongly supported the phylogenetic affiliation of tree shrews to Primates as a monophyly. Copyright © 2013 Elsevier Inc. All rights reserved.
Active site remodeling switches HIV specificity of antiretroviral TRIMCyp
Price, Amanda J; Marzetta, Flavia; Lammers, Michael; Ylinen, Laura M J; Schaller, Torsten; Wilson, Sam J; Towers, Greg J; James, Leo C
2011-01-01
TRIMCyps are primate antiretroviral proteins that potently inhibit HIV replication. Here we describe how rhesus macaque TRIMCyp (RhTC) has evolved to target and restrict HIV-2. We show that the ancestral cyclophilin A (CypA) domain of RhTC targets HIV-2 capsid with weak affinity, which is strongly increased in RhTC by two mutations (D66N and R69H) at the expense of HIV-1 binding. These mutations disrupt a constraining intramolecular interaction in CypA, triggering the complete restructuring (>16 Å) of an active site loop. This new configuration discriminates between divergent HIV-1 and HIV-2 loop conformations mediated by capsid residue 88. Viral sensitivity to RhTC restriction can be conferred or abolished by mutating position 88. Furthermore, position 88 determines the susceptibility of naturally occurring HIV-1 sequences to restriction. Our results reveal the complex molecular, structural and thermodynamic changes that underlie the ongoing evolutionary race between virus and host. PMID:19767750
The three-dimensional architecture of a bacterial genome and its alteration by genetic perturbation.
Umbarger, Mark A; Toro, Esteban; Wright, Matthew A; Porreca, Gregory J; Baù, Davide; Hong, Sun-Hae; Fero, Michael J; Zhu, Lihua J; Marti-Renom, Marc A; McAdams, Harley H; Shapiro, Lucy; Dekker, Job; Church, George M
2011-10-21
We have determined the three-dimensional (3D) architecture of the Caulobacter crescentus genome by combining genome-wide chromatin interaction detection, live-cell imaging, and computational modeling. Using chromosome conformation capture carbon copy (5C), we derive ~13 kb resolution 3D models of the Caulobacter genome. The resulting models illustrate that the genome is ellipsoidal with periodically arranged arms. The parS sites, a pair of short contiguous sequence elements known to be involved in chromosome segregation, are positioned at one pole, where they anchor the chromosome to the cell and contribute to the formation of a compact chromatin conformation. Repositioning these elements resulted in rotations of the chromosome that changed the subcellular positions of most genes. Such rotations did not lead to large-scale changes in gene expression, indicating that genome folding does not strongly affect gene regulation. Collectively, our data suggest that genome folding is globally dictated by the parS sites and chromosome segregation. Copyright © 2011 Elsevier Inc. All rights reserved.
Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC
2006-01-01
Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France, nucleotide sequence differences were demonstrated at the six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Generalized statistical convergence of order β for sequences of fuzzy numbers
NASA Astrophysics Data System (ADS)
Altınok, Hıfsı; Karakaş, Abdulkadir; Altın, Yavuz
2018-01-01
In the present paper, we introduce the concepts of Δm-statistical convergence of order β for sequences of fuzzy numbers and strongly Δm-summable of order β for sequences of fuzzy numbers by using a modulus function f and taking supremum on metric d for 0 < β ≤ 1 and give some inclusion relations between them.
Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements
Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.
2008-01-01
Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679
Akins, R A; Grant, D M; Stohl, L L; Bottorff, D A; Nargang, F E; Lambowitz, A M
1988-11-05
The Mauriceville and Varkud mitochondrial plasmids of Neurospora are closely related, closed circular DNAs (3.6 and 3.7 kb, respectively; 1 kb = 10(3) bases or base-pairs), whose characteristics suggest relationships to mitochondrial DNA introns and retrotransposons. Here, we characterized the structure of the Varkud plasmid, determined its complete nucleotide sequence and mapped its major transcripts. The Mauriceville and Varkud plasmids have more than 97% positional identity. Both plasmids contain a 710 amino acid open reading frame that encodes a reverse transcriptase-like protein. The amino acid sequence of this open reading frame is strongly conserved between the two plasmids (701/710 amino acids) as expected for a functionally important protein. Both plasmids have a 0.4 kb region that contains five PstI palindromes and a direct repeat of approximately 160 base-pairs. Comparison of sequences in this region suggests that the Varkud plasmid has diverged less from a common ancestor than has the Mauriceville plasmid. Two major transcripts of the Varkud plasmid were detected by Northern hybridization experiments: a full-length linear RNA of 3.7 kb and an additional prominent transcript of 4.9 kb, 1.2 kb longer than monomer plasmid. Remarkably, we find that the 4.9 kb transcript is a hybrid RNA consisting of the full-length 3.7 kb Varkud plasmid transcript plus a 5' leader of 1.2 kb that is derived from the 5' end of the mitochondrial small rRNA. This and other findings suggest that the Varkud plasmid, like certain RNA viruses, has a mechanism for joining heterologous RNAs to the 5' end of its major transcript, and that, under some circumstances, nucleotide sequences in mitochondria may be recombined at the RNA level.
Miller, Melissa A; Burgess, Tristan L; Dodd, Erin M; Rhyan, Jack C; Jang, Spencer S; Byrne, Barbara A; Gulland, Frances M D; Murray, Michael J; Toy-Choutka, Sharon; Conrad, Patricia A; Field, Cara L; Sidor, Inga F; Smith, Woutrina A
2017-04-01
We characterize Brucella infection in a wild southern sea otter ( Enhydra lutris nereis) with osteolytic lesions similar to those reported in other marine mammals and humans. This otter stranded twice along the central California coast, US over a 1-yr period and was handled extensively at two wildlife rehabilitation facilities, undergoing multiple surgeries and months of postsurgical care. Ultimately the otter was euthanized due to severe, progressive neurologic disease. Necropsy and postmortem radiographs revealed chronic, severe osteoarthritis spanning the proximal interphalangeal joint of the left hind fifth digit. Numerous coccobacilli within the joint were strongly positive on Brucella immunohistochemical labelling, and Brucella sp. was isolated in pure culture from this lesion. Sparse Brucella-immunopositive bacteria were also observed in the cytoplasm of a pulmonary vascular monocyte, and multifocal granulomas were observed in the spinal cord and liver on histopathology. Findings from biochemical characterization, 16S ribosomal DNA, and bp26 gene sequencing of the bacterial isolate were identical to those from marine-origin brucellae isolated from cetaceans and phocids. Although omp2a gene sequencing revealed 100% homology with marine Brucella spp. infecting pinnipeds, whales, and humans, omp2b gene sequences were identical only to pinniped-origin isolates. Multilocus sequence typing classified the sea otter isolate as ST26, a sequence type previously associated only with cetaceans. Our data suggest that the sea otter Brucella strain represents a novel marine lineage that is distinct from both Brucella pinnipedialis and Brucella ceti. Prior reports document the zoonotic potential of the marine brucellae. Isolation of Brucella sp. from a stranded sea otter highlights the importance of wearing personal protective equipment when handling sea otters and other marine mammals as part of wildlife conservation and rehabilitation efforts.
Comparative RNA sequencing reveals substantial genetic variation in endangered primates
Perry, George H.; Melsted, Páll; Marioni, John C.; Wang, Ying; Bainer, Russell; Pickrell, Joseph K.; Michelini, Katelyn; Zehr, Sarah; Yoder, Anne D.; Stephens, Matthew; Pritchard, Jonathan K.; Gilad, Yoav
2012-01-01
Comparative genomic studies in primates have yielded important insights into the evolutionary forces that shape genetic diversity and revealed the likely genetic basis for certain species-specific adaptations. To date, however, these studies have focused on only a small number of species. For the majority of nonhuman primates, including some of the most critically endangered, genome-level data are not yet available. In this study, we have taken the first steps toward addressing this gap by sequencing RNA from the livers of multiple individuals from each of 16 mammalian species, including humans and 11 nonhuman primates. Of the nonhuman primate species, five are lemurs and two are lorisoids, for which little or no genomic data were previously available. To analyze these data, we developed a method for de novo assembly and alignment of orthologous gene sequences across species. We assembled an average of 5721 gene sequences per species and characterized diversity and divergence of both gene sequences and gene expression levels. We identified patterns of variation that are consistent with the action of positive or directional selection, including an 18-fold enrichment of peroxisomal genes among genes whose regulation likely evolved under directional selection in the ancestral primate lineage. Importantly, we found no relationship between genetic diversity and endangered status, with the two most endangered species in our study, the black and white ruffed lemur and the Coquerel's sifaka, having the highest genetic diversity among all primates. Our observations imply that many endangered lemur populations still harbor considerable genetic variation. Timely efforts to conserve these species alongside their habitats have, therefore, strong potential to achieve long-term success. PMID:22207615
Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V
2007-01-01
Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yu, Zhilin; Erbas, Aykut; Tantakitti, Faifan
Co-assembly of binary systems driven by specific non-covalent interactions can greatly expand the structural and functional space of supramolecular nanostructures. We report here on the self-assembly of peptide amphiphiles and fatty acids driven primarily by anion-π interactions. The peptide sequences investigated were functionalized with a perfluorinated phenylalanine residue to promote anion-π interactions with carboxylate headgroups in fatty acids. These interactions were verified here by NMR and circular dichroism experiments as well as investigated using atomistic simulations. Positioning the aromatic units close to the N-terminus of the peptide backbone near the hydrophobic core of cylindrical nanofibers leads to strong anion-π interactionsmore » between both components. With a low content of dodecanoic acid in this position, the cylindrical morphology is preserved. However, as the aromatic units are moved along the peptide backbone away from the hydrophobic core, the interactions with dodecanoic acid transform the cylindrical supramolecular morphology into ribbon-like structures. Increasing the ratio of dodecanoic acid to PA leads to either the formation of large vesicles in the binary systems where the anion-π interactions are strong, or a heterogeneous mixture of assemblies when the peptide amphiphiles associate weakly with dodecanoic acid. Our findings reveal how co-assembly involving designed specific interactions can drastically change supramolecular morphology and even cross from nano to micro scales.« less
Fernandez-Pol, J Alberto
2016-01-01
The ribosomal protein metallopanstimulin-1 (MPS1/S27) serves critical survival purposes in cell division, in normal and cancerous cells; for this reason, selective pressures of evolution have conserved the DNA sequences encoding MPS1/S27 in Archaea and Eukariotic cells. The expression of MPS1/S27 protein in human adult cerebellum has not been established. The presence of MPS1/S27, was screened in paraffin-embedded human adult brain specimens processed for tissue inmunohistochemistry. Affinity-purified specific antibodies were directed against the N-terminus of MPS1. The antibodies to MPS1 detected Purkinje cells (PC) and their dendrites. In PC, MPS1 antigen-positive staining was found in: the nucleolus, which was strongly stained; ribosomes attached to the external nuclear membrane; cytoplasm of PC, with strong staining in a punctuate fashion; the soma-attached large dendrite trunks of PC, which were MPS1 antigen-positive; and the granular cell layer, where cellular staining in a few cells that appeared to resemble smaller PC was observed. Since MPS1 is involved in cell division, DNA repair, and ribosomal biogenesis, it may be a useful antigen for studying processes such as protein synthesis, oncogenesis, regeneration, aging, and perhaps diseases of the human cerebellum. Copyright© 2016, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
Garamszegi, László Zsolt
2014-09-03
The major histocompatibility complex (MHC) is the most polymorphic genetic region in vertebrates, but the origin of such genetic diversity remains unresolved. Several studies have demonstrated at the within-population level that individuals harbouring particular alleles can be less or more susceptible to malaria, but these do not allow strong generalization. Here worldwide data on the frequencies of several hundred MHC alleles of the human leucocyte antigen (HLA) system in relation to malaria risk at the between-population level were analysed in a phylogenetic framework, and results for different alleles were quantitatively summarized in a meta-analysis. There was an overall positive relationship between malaria pressure and the frequency of several HLA alleles indicating that allele frequencies increase in countries with strong malaria pressure. Nevertheless, considerable heterogeneity was observed across alleles, and some alleles showed a remarkable negative relationship with malaria risk. When heterogeneities were partitioned into different organization groups of the MHC, the strongest positive relationships were detected for alleles of the HLA-A and HLA-B loci, but there were also differences between MHC supertypes that constitute functionally distinct nucleotide sequences. Finally, the number of MHC alleles that are maintained within countries was also related to malaria risk. Therefore, malaria represents a key selection pressure for the human MHC and has left clear evolutionary footprints on both the frequencies and the number of alleles observed in different countries.
Sharma, Vijay K; Bayles, Darrell O; Alt, David P; Looft, Torey
2016-12-15
Escherichia coli O157:H7 strain 86-24 does not produce curli fimbriae, but gives rise to curli-positive isolates at a variable frequency. Here, we report the complete genome sequences of curli-negative and curli-positive isolates of strain 86-24. Copyright © 2016 Sharma et al.
Harpur, Brock A; Kent, Clement F; Molodtsova, Daria; Lebon, Jonathan M D; Alqarni, Abdulaziz S; Owayss, Ayman A; Zayed, Amro
2014-02-18
Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees.
Harpur, Brock A.; Kent, Clement F.; Molodtsova, Daria; Lebon, Jonathan M. D.; Alqarni, Abdulaziz S.; Owayss, Ayman A.; Zayed, Amro
2014-01-01
Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees. PMID:24488971
ADEPT, a dynamic next generation sequencing data error-detection program with trimming
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feng, Shihai; Lo, Chien-Chi; Li, Po-E
Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the truemore » positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.« less
ADEPT, a dynamic next generation sequencing data error-detection program with trimming
Feng, Shihai; Lo, Chien-Chi; Li, Po-E; ...
2016-02-29
Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the truemore » positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.« less
Gu, Zhining; Guo, Wei; Li, Chaoyang; Zhu, Xinyan; Guo, Tao
2018-01-01
Pedestrian dead reckoning (PDR) positioning algorithms can be used to obtain a target’s location only for movement with step features and not for driving, for which the trilateral Bluetooth indoor positioning method can be used. In this study, to obtain the precise locations of different states (pedestrian/car) using the corresponding positioning algorithms, we propose an adaptive method for switching between the PDR and car indoor positioning algorithms based on multilayer time sequences (MTSs). MTSs, which consider the behavior context, comprise two main aspects: filtering of noisy data in small-scale time sequences and using a state chain to reduce the time delay of algorithm switching in large-scale time sequences. The proposed method can be expected to realize the recognition of stationary, walking, driving, or other states; switch to the correct indoor positioning algorithm; and improve the accuracy of localization compared to using a single positioning algorithm. Our experiments show that the recognition of static, walking, driving, and other states improves by 5.5%, 45.47%, 26.23%, and 21% on average, respectively, compared with convolutional neural network (CNN) method. The time delay decreases by approximately 0.5–8.5 s for the transition between states and by approximately 24 s for the entire process. PMID:29495503
Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László
2005-01-01
DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291
Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B; Tóth, Gábor; Ortutay, Csaba P; Patthy, László
2005-01-01
DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21,061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically.
Strong foreshock signal preceding the L'Aquila (Italy) earthquake (M_w~6.3) of 6~April~2009
NASA Astrophysics Data System (ADS)
Papadopoulos, G. A.; Charalampakis, M.; Fokaefs, A.; Minadakis, G.
2010-01-01
We used the earthquake catalogue of INGV extending from 1 January 2006 to 30 June 2009 to detect significant changes before and after the 6 April 2009 L'Aquila mainshock (Mw=6.3) in the seismicity rate, r (events/day), and in b-value. The statistical z-test and Utsu-test were applied to identify significant changes. From the beginning of 2006 up to the end of October 2008 the activity was relatively stable and remained in the state of background seismicity (r=1.14, b=1.09). From 28 October 2008 up to 26 March 2009, r increased significantly to 2.52 indicating weak foreshock sequence; the b-value did not changed significantly. The weak foreshock sequence was spatially distributed within the entire seismogenic area. In the last 10 days before the mainshock, strong foreshock signal became evident in space (dense epicenter concentration in the hanging-wall of the Paganica fault), in time (drastic increase of r to 21.70 events/day) and in size (b-value dropped significantly to 0.68). The significantly high seismicity rate and the low b-value in the entire foreshock sequence make a substantial difference from the background seismicity. Also, the b-value of the strong foreshock stage (last 10 days before mainshock) was significantly lower than that in the aftershock sequence. Our results indicate the important value of the foreshock sequences for the prediction of the mainshock.
The evolutionary history of the DMRT3 'Gait keeper' haplotype.
Staiger, E A; Almén, M S; Promerová, M; Brooks, S; Cothran, E G; Imsland, F; Jäderkvist Fegraeus, K; Lindgren, G; Mehrabani Yeganeh, H; Mikko, S; Vega-Pla, J L; Tozaki, T; Rubin, C J; Andersson, L
2017-10-01
A previous study revealed a strong association between the DMRT3:Ser301STOP mutation in horses and alternate gaits as well as performance in harness racing. Several follow-up studies have confirmed a high frequency of the mutation in gaited horse breeds and an effect on gait quality. The aim of this study was to determine when and where the mutation arose, to identify additional potential causal mutations and to determine the coalescence time for contemporary haplotypes carrying the stop mutation. We utilized sequences from 89 horses representing 26 breeds to identify 102 SNPs encompassing the DMRT3 gene that are in strong linkage disequilibrium with the stop mutation. These 102 SNPs were genotyped in an additional 382 horses representing 72 breeds, and we identified 14 unique haplotypes. The results provided conclusive evidence that DMRT3:Ser301STOP is causal, as no other sequence polymorphisms showed an equally strong association to locomotion traits. The low sequence diversity among mutant chromosomes demonstrated that they must have diverged from a common ancestral sequence within the last 10 000 years. Thus, the mutation occurred either just before domestication or more likely some time after domestication and then spread across the world as a result of selection on locomotion traits. © 2017 Stichting International Foundation for Animal Genetics.
Isolation and whole genome analysis of endospore-forming bacteria from heroin.
Kalinowski, Jörn; Ahrens, Björn; Al-Dilaimi, Arwa; Winkler, Anika; Wibberg, Daniel; Schleenbecker, Uwe; Rückert, Christian; Wölfel, Roman; Grass, Gregor
2018-01-01
Infections caused by endospore-forming bacteria have been associated with severe illness and death among persons who inject drugs. Analysis of the bacteria residing in heroin has thus been biased towards species that affect human health. Similarly, exploration of the bacterial diversity of seized street market heroin correlated with the skin microflora of recreational heroin users insofar as different Staphylococus spp. or typical environmental endospore formers including Bacillus cereus and other Bacilli outside the B. cereus sensu lato group as well as diverse Clostridia were identified. In this work 82 samples of non-street market ("wholesale") heroin originating from the German Federal Criminal Police Office's heroin analysis program seized during the period between 2009 and 2014 were analyzed for contaminating bacteria. Without contact with the end user and with only little contaminations introduced by final processing, adulteration and cutting this heroin likely harbors original microbiota from the drug's original source or trafficking route. We found this drug to be only sparsely populated with retrievable heterotrophic, aerobic bacteria. In total, 68 isolates were retrieved from 49 out of 82 samples analyzed (60% culture positive). All isolates were endospore-forming, Gram-positive Bacilli. Completely absent were non-endospore-formers or Gram-negatives. The three most predominant species were Bacillus clausii, Bacillus (para)licheniformis, and Terribacillus saccharophilus. Whole genome sequencing of these 68 isolates was performed using Illumina technology. Sequence data sets were assembled and annotated using an automated bioinformatics pipeline. Average nucleotide identity (ANI) values were calculated for all draft genomes and all close to identical genomes (ANI>99.5%) were compared to the forensic data of the seized drug, showing positive correlations that strongly warrant further research on this subject. Copyright © 2017 Elsevier B.V. All rights reserved.
Lin, Yi G; Weadick, Cameron J; Santini, Francesco; Chang, Belinda S W
2013-12-01
Transducin is a heterotrimeric G protein that plays a critical role in phototransduction in the rod and cone photoreceptor cells of the vertebrate retina. Rods, highly sensitive cells that recover from photoactivation slowly, underlie dim-light vision, whereas cones are less sensitive, recover more quickly, and underlie bright-light vision. Transducin deactivation is a critical step in photoreceptor recovery and may underlie the functional distinction between rods and cones. Rods and cones possess distinct transducin α subunits, yet they share a common deactivation mechanism, the GTPase activating protein (GAP) complex. Here, we used codon models to examine patterns of sequence evolution in rod (GNAT1) and cone (GNAT2) α subunits. Our results indicate that purifying selection is the dominant force shaping GNAT1 and GNAT2 evolution, but that GNAT2 has additionally been subject to positive selection operating at multiple phylogenetic scales; phylogeny-wide analysis identified several sites in the GNAT2 helical domain as having substantially elevated dN/dS estimates, and branch-site analysis identified several nearby sites as targets of strong positive selection during early vertebrate history. Examination of aligned GNAT and GAP complex crystal structures revealed steric clashes between several positively selected sites and the deactivating GAP complex. This suggests that GNAT2 sequence variation could play an important role in adaptive evolution of the vertebrate visual system via effects on photoreceptor deactivation kinetics and provides an alternative perspective to previous work that focused instead on the effect of GAP complex concentration. Our findings thus further the understanding of the molecular biology, physiology, and evolution of vertebrate visual systems.
Head-body righting reflex from the supine position and preparatory eye movements.
Troiani, Diana; Ferraresi, Aldo; Manni, Ermanno
2005-05-01
Saccular and utricular maculae can provide information on the supine static position, considering that both have pronounced curved structures with hair cells having a variety of polarization vectors that enable them to sense an inverted position and thus direct the righting reflex. The vestibular system is essential for the structuring of motor behaviour, senses linear and angular acceleration and has a strong influence on posture and balance at rest, during locomotion and in head body righting reflexes. Using guinea pigs in the supine position with a symmetrical head and trunk position, the ocular position was analysed to ascertain whether any ocular movement that occurred would adopt a spatial deviation indicative of the subsequent head and body righting. The characteristics of the righting reflex (direction, latency, duration and velocity) were analysed in guinea pigs from position signals obtained from search coils implanted in the eye, head and pelvis. The animals were kept in a supine position for a few seconds or even minutes with the eyes in a stable primary position and the head and body symmetrical and immobile. The righting reflex took place either immediately or after a slow deviation of the eyes. In both cases the righting sequence (eyes, head, body) was stereotyped and consistent. The direction of head and body righting was along the longitudinal axis of the animal and was either clockwise or anticlockwise and the direction of righting was related to the direction of the eye deviation. The ocular deviation and the direction of deviation that initiated and determined the direction of the righting reflex could be explained by possible otolithic activation.
Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer
2004-01-01
Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
Genome sequence of a urease-positive Campylobacter lari strain
USDA-ARS?s Scientific Manuscript database
Campylobacter lari is frequently isolated from shore birds and can cause illness in humans. Here we report the draft whole genome sequence of an urease-positive strain of C. lari that was isolated in estuarial water on the coast of Delaware, USA....
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
Detecting false positive sequence homology: a machine learning approach.
Fujimoto, M Stanley; Suvorov, Anton; Jensen, Nicholas O; Clement, Mark J; Bybee, Seth M
2016-02-24
Accurate detection of homologous relationships of biological sequences (DNA or amino acid) amongst organisms is an important and often difficult task that is essential to various evolutionary studies, ranging from building phylogenies to predicting functional gene annotations. There are many existing heuristic tools, most commonly based on bidirectional BLAST searches that are used to identify homologous genes and combine them into two fundamentally distinct classes: orthologs and paralogs. Due to only using heuristic filtering based on significance score cutoffs and having no cluster post-processing tools available, these methods can often produce multiple clusters constituting unrelated (non-homologous) sequences. Therefore sequencing data extracted from incomplete genome/transcriptome assemblies originated from low coverage sequencing or produced by de novo processes without a reference genome are susceptible to high false positive rates of homology detection. In this paper we develop biologically informative features that can be extracted from multiple sequence alignments of putative homologous genes (orthologs and paralogs) and further utilized in context of guided experimentation to verify false positive outcomes. We demonstrate that our machine learning method trained on both known homology clusters obtained from OrthoDB and randomly generated sequence alignments (non-homologs), successfully determines apparent false positives inferred by heuristic algorithms especially among proteomes recovered from low-coverage RNA-seq data. Almost ~42 % and ~25 % of predicted putative homologies by InParanoid and HaMStR respectively were classified as false positives on experimental data set. Our process increases the quality of output from other clustering algorithms by providing a novel post-processing method that is both fast and efficient at removing low quality clusters of putative homologous genes recovered by heuristic-based approaches.
Rossi, Mari; El-Khechen, Dima; Black, Mary Helen; Farwell Hagman, Kelly D; Tang, Sha; Powis, Zöe
2017-05-01
Exome sequencing has recently been proved to be a successful diagnostic method for complex neurodevelopmental disorders. However, the diagnostic yield of exome sequencing for autism spectrum disorders has not been extensively evaluated in large cohorts to date. We performed diagnostic exome sequencing in a cohort of 163 individuals with autism spectrum disorder (66.3%) or autistic features (33.7%). The diagnostic yield observed in patients in our cohort was 25.8% (42 of 163) for positive or likely positive findings in characterized disease genes, while a candidate genetic etiology was reported for an additional 3.3% (4 of 120) of patients. Among the positive findings in the patients with autism spectrum disorder or autistic features, 61.9% were the result of de novo mutations. Patients presenting with psychiatric conditions or ataxia or paraplegia in addition to autism spectrum disorder or autistic features were significantly more likely to receive positive results compared with patients without these clinical features (95.6% vs 27.1%, P < 0.0001; 83.3% vs 21.2%, P < 0.0001, respectively). The majority of the positive findings were in recently identified autism spectrum disorder genes, supporting the importance of diagnostic exome sequencing for patients with autism spectrum disorder or autistic features as the causative genes might evade traditional sequential or panel testing. These results suggest that diagnostic exome sequencing would be an efficient primary diagnostic method for patients with autism spectrum disorders or autistic features. Moreover, our data may aid clinicians to better determine which subset of patients with autism spectrum disorder with additional clinical features would benefit the most from diagnostic exome sequencing. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Zhang, ZhiZhuo; Chang, Cheng Wei; Hugo, Willy; Cheung, Edwin; Sung, Wing-Kin
2013-03-01
Although de novo motifs can be discovered through mining over-represented sequence patterns, this approach misses some real motifs and generates many false positives. To improve accuracy, one solution is to consider some additional binding features (i.e., position preference and sequence rank preference). This information is usually required from the user. This article presents a de novo motif discovery algorithm called SEME (sampling with expectation maximization for motif elicitation), which uses pure probabilistic mixture model to model the motif's binding features and uses expectation maximization (EM) algorithms to simultaneously learn the sequence motif, position, and sequence rank preferences without asking for any prior knowledge from the user. SEME is both efficient and accurate thanks to two important techniques: the variable motif length extension and importance sampling. Using 75 large-scale synthetic datasets, 32 metazoan compendium benchmark datasets, and 164 chromatin immunoprecipitation sequencing (ChIP-Seq) libraries, we demonstrated the superior performance of SEME over existing programs in finding transcription factor (TF) binding sites. SEME is further applied to a more difficult problem of finding the co-regulated TF (coTF) motifs in 15 ChIP-Seq libraries. It identified significantly more correct coTF motifs and, at the same time, predicted coTF motifs with better matching to the known motifs. Finally, we show that the learned position and sequence rank preferences of each coTF reveals potential interaction mechanisms between the primary TF and the coTF within these sites. Some of these findings were further validated by the ChIP-Seq experiments of the coTFs. The application is available online.
Lam, Kathy N; Charles, Trevor C
2015-01-01
Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity.
TLR-2 Recognizes Propionibacterium acnes CAMP Factor 1 from Highly Inflammatory Strains
Ollagnier, Guillaume; Désiré, Nathalie; Sayon, Sophie; Raingeaud, Jöel; Marcelin, Anne-Geneviève; Calvez, Vincent; Khammari, Amir; Batteux, Frédéric; Dréno, Brigitte; Dupin, Nicolas
2016-01-01
Background Propionibacterium acnes (P. acnes) is an anaerobic, Gram-positive bacteria encountered in inflammatory acne lesions, particularly in the pilosebaceous follicle. P. acnes triggers a strong immune response involving keratinocytes, sebocytes and monocytes, the target cells during acne development. Lipoteicoic acid and peptidoglycan induce the inflammatory reaction, but no P. acnes surface protein interacting with Toll-like receptors has been identified. P. acnes surface proteins have been extracted by lithium stripping and shown to induce CXCL8 production by keratinocytes. Methodology and principal findings Far-western blotting identified two surface proteins, of 24.5- and 27.5-kDa in size, specifically recognized by TLR2. These proteins were characterized, by LC-MS/MS, as CAMP factor 1 devoid of its signal peptide sequence, as shown by N-terminal sequencing. Purified CAMP factor 1 induces CXCL8 production by activating the CXCL8 gene promoter, triggering the synthesis of CXCL8 mRNA. Antibodies against TLR2 significantly decreased the CXCL8 response. For the 27 P. acnes strains used in this study, CAMP1-TLR2 binding intensity was modulated and appeared to be strong in type IB and II strains, which produced large amounts of CXCL8, whereas most of the type IA1 and IA2 strains presented little or no CAMP1-TLR2 binding and low levels of CXCL8 production. The nucleotide sequence of CAMP factor displays a major polymorphism, defining two distinct genetic groups corresponding to CAMP factor 1 with 14 amino-acid changes from strains phylotyped II with moderate and high levels of CAMP1-TLR2 binding activity, and CAMP factor 1 containing 0, 1 or 2 amino-acid changes from strains phylotyped IA1, IA2, or IB presenting no, weak or moderate CAMP1-TLR2 binding. Conclusions Our findings indicate that CAMP factor 1 may contribute to P. acnes virulence, by amplifying the inflammation reaction through direct interaction with TLR2. PMID:27902761
TLR-2 Recognizes Propionibacterium acnes CAMP Factor 1 from Highly Inflammatory Strains.
Lheure, Coralie; Grange, Philippe Alain; Ollagnier, Guillaume; Morand, Philippe; Désiré, Nathalie; Sayon, Sophie; Corvec, Stéphane; Raingeaud, Jöel; Marcelin, Anne-Geneviève; Calvez, Vincent; Khammari, Amir; Batteux, Frédéric; Dréno, Brigitte; Dupin, Nicolas
2016-01-01
Propionibacterium acnes (P. acnes) is an anaerobic, Gram-positive bacteria encountered in inflammatory acne lesions, particularly in the pilosebaceous follicle. P. acnes triggers a strong immune response involving keratinocytes, sebocytes and monocytes, the target cells during acne development. Lipoteicoic acid and peptidoglycan induce the inflammatory reaction, but no P. acnes surface protein interacting with Toll-like receptors has been identified. P. acnes surface proteins have been extracted by lithium stripping and shown to induce CXCL8 production by keratinocytes. Far-western blotting identified two surface proteins, of 24.5- and 27.5-kDa in size, specifically recognized by TLR2. These proteins were characterized, by LC-MS/MS, as CAMP factor 1 devoid of its signal peptide sequence, as shown by N-terminal sequencing. Purified CAMP factor 1 induces CXCL8 production by activating the CXCL8 gene promoter, triggering the synthesis of CXCL8 mRNA. Antibodies against TLR2 significantly decreased the CXCL8 response. For the 27 P. acnes strains used in this study, CAMP1-TLR2 binding intensity was modulated and appeared to be strong in type IB and II strains, which produced large amounts of CXCL8, whereas most of the type IA1 and IA2 strains presented little or no CAMP1-TLR2 binding and low levels of CXCL8 production. The nucleotide sequence of CAMP factor displays a major polymorphism, defining two distinct genetic groups corresponding to CAMP factor 1 with 14 amino-acid changes from strains phylotyped II with moderate and high levels of CAMP1-TLR2 binding activity, and CAMP factor 1 containing 0, 1 or 2 amino-acid changes from strains phylotyped IA1, IA2, or IB presenting no, weak or moderate CAMP1-TLR2 binding. Our findings indicate that CAMP factor 1 may contribute to P. acnes virulence, by amplifying the inflammation reaction through direct interaction with TLR2.
Salazar, Gerardo A.; Cabrera, Lidia I.; Madriñán, Santiago; Chase, Mark W.
2009-01-01
Background and Aims Phylogenetic relationships of subtribes Cranichidinae and Prescottiinae, two diverse groups of neotropical terrestrial orchids, are not satisfactorily understood. A previous molecular phylogenetic study supported monophyly for Cranichidinae, but Prescottiinae consisted of two clades not sister to one another. However, that analysis included only 11 species and eight genera of these subtribes. Here, plastid and nuclear DNA sequences are analysed for an enlarged sample of genera and species of Cranichidinae and Prescottiinae with the aim of clarifying their relationships, evaluating the phylogenetic position of the monospecific genera Exalaria, Ocampoa and Pseudocranichis and examining the value of various structural traits as taxonomic markers. Methods Approx. 6000 bp of nucleotide sequences from nuclear ribosomal (ITS) and plastid DNA (rbcL, matK-trnK and trnL-trnF) were analysed with cladistic parsimony and Bayesian inference for 45 species/14 genera of Cranichidinae and Prescottiinae (plus suitable outgroups). The utility of flower orientation, thickenings of velamen cell walls, hamular viscidium and pseudolabellum to mark clades recovered by the molecular analysis was assessed by tracing these characters on the molecular trees. Key Results Spiranthinae, Cranichidinae, paraphyletic Prescottia (with Pseudocranichis embedded), and a group of mainly Andean ‘prescottioid’ genera (the ‘Stenoptera clade’) were strongly supported. Relationships among these clades were unresolved by parsimony but the Bayesian tree provided moderately strong support for the resolution (Spiranthinae–(Stenoptera clade-(Prescottia/Pseudocranichis–Cranichidinae))). Three of the four structural characters mark clades on the molecular trees, but the possession of a pseudolabellum is variable in the polyphyletic Ponthieva. Conclusions No evidence was found for monophyly of Prescottiinae and the reinstatement of Cranichidinae s.l. (including the genera of ‘Prescottiinae’) is favoured. Cranichidinae s.l. are diagnosed by non-resupinate flowers. Lack of support from parsimony for relationships among the major clades of core spiranthids is suggestive of a rapid morphological radiation or a slow rate of molecular evolution. PMID:19136493
A thrifty variant in CREBRF strongly influences body mass index in Samoans
Kershaw, Erin E; Cheng, Hong; Buhule, Olive D; Lin, Jerome; Reupena, Muagututi‘a Sefuiva; Viali, Satupa‘itea; Tuitele, John; Naseri, Take; Urban, Zsolt; Deka, Ranjan; Weeks, Daniel E; McGarvey, Stephen T
2016-01-01
Samoans are a unique founder population with a high prevalence of obesity1–3, making them well suited for identifying new genetic contributors to obesity4. We conducted a genome-wide association study (GWAS) in 3,072 Samoans, discovered a variant, rs12513649, strongly associated with body mass index (BMI) (P = 5.3 × 10−14), and replicated the association in 2,102 additional Samoans (P = 1.2 × 10−9). Targeted sequencing identified a strongly associated missense variant, rs373863828 (p.Arg457Gln), in CREBRF (meta P = 1.4 × 10−20). Although this variant is extremely rare in other populations, it is common in Samoans (frequency of 0.259), with an effect size much larger than that of any other known common BMI risk variant (1.36–1.45 kg/m2 per copy of the risk-associated allele). In comparison to wild-type CREBRF, the Arg457Gln variant when overexpressed selectively decreased energy use and increased fat storage in an adipocyte cell model. These data, in combination with evidence of positive selection of the allele encoding p.Arg457Gln, support a ‘thrifty’ variant hypothesis as a factor in human obesity. PMID:27455349
Yatabe, Yoko; Kane, Nolan C.; Scotti-Saintagne, Caroline; Rieseberg, Loren H.
2007-01-01
Plant species may remain morphologically distinct despite gene exchange with congeners, yet little is known about the genomewide pattern of introgression among species. Here we analyze the effects of persistent gene flow on genomic differentiation between the sympatric sunflower species Helianthus annuus and H. petiolaris. While the species are strongly isolated in testcrosses, genetic distances at 108 microsatellite loci and 14 sequenced genes are highly variable and much lower (on average) than for more closely related but historically allopatric congeners. Our analyses failed to detect a positive association between levels of genetic differentiation and chromosomal rearrangements (as reported in a prior publication) or proximity to QTL for morphological differences or hybrid sterility. However, a significant increase in differentiation was observed for markers within 5 cM of chromosomal breakpoints. Together, these results suggest that islands of differentiation between these two species are small, except in areas of low recombination. Furthermore, only microsatellites associated with ESTs were identified as outlier loci in tests for selection, which might indicate that the ESTs themselves are the targets of selection rather than linked genes (or that coding regions are not randomly distributed). In general, these results indicate that even strong and genetically complex reproductive barriers cannot prevent widespread introgression. PMID:17277373
Complete Genome Sequence of the Electricity-Producing “Thermincola potens” Strain JR▿
Byrne-Bailey, Kathryne G.; Wrighton, Kelly C.; Melnyk, Ryan A.; Agbo, Peter; Hazen, Terry C.; Coates, John D.
2010-01-01
“Thermincola potens” strain JR is one of the first Gram-positive dissimilatory metal-reducing bacteria (DMRB) for which there is a complete genome sequence. Consistent with the physiology of this organism, preliminary annotation revealed an abundance of multiheme c-type cytochromes that are putatively associated with the periplasm and cell surface in a Gram-positive bacterium. Here we report the complete genome sequence of strain JR. PMID:20525829
Sequence verification as quality-control step for production of cDNA microarrays.
Taylor, E; Cogdell, D; Coombes, K; Hu, L; Ramdas, L; Tabor, A; Hamilton, S; Zhang, W
2001-07-01
To generate cDNA arrays in our core laboratory, we amplified about 2300 PCR products from a human, sequence-verified cDNA clone library. As a quality-control step, we sequenced the PCR products immediately before printing. The sequence information was used to search the GenBank database to confirm the identities. Although these clones were previously sequence verified by the company, we found that only 79% of the clones matched the original database after handling. Our experience strongly indicates the necessity to sequence verify the clones at the final stage before printing on microarray slides and to modify the gene list accordingly.
Hirsh, Vera
2018-01-01
Four epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs), erlotinib, gefitinib, afatinib and osimertinib, are currently available for the management of EGFR mutation-positive non-small-cell lung cancer (NSCLC), with others in development. Although tumors are exquisitely sensitive to these agents, acquired resistance is inevitable. Furthermore, emerging data indicate that first- (erlotinib and gefitinib), second- (afatinib) and third-generation (osimertinib) EGFR TKIs differ in terms of efficacy and tolerability profiles. Therefore, there is a strong imperative to optimize the sequence of TKIs in order to maximize their clinical benefit. Osimertinib has demonstrated striking efficacy as a second-line treatment option in patients with T790M-positive tumors, and also confers efficacy and tolerability advantages over first-generation TKIs in the first-line setting. However, while accrual of T790M is the most predominant mechanism of resistance to erlotinib, gefitinib and afatinib, resistance mechanisms to osimertinib have not been clearly elucidated, meaning that possible therapy options after osimertinib failure are not clear. At present, few data comparing sequential regimens in patients with EGFR mutation-positive NSCLC are available and prospective clinical trials are required. This article reviews the similarities and differences between EGFR TKIs, and discusses key considerations when assessing optimal sequential therapy with these agents for the treatment of EGFR mutation-positive NSCLC. PMID:29383041
VanTieghem, Michelle R.; Gabard-Durnam, Laurel; Goff, Bonnie; Flannery, Jessica; Humphreys, Kathryn L.; Telzer, Eva H.; Caldera, Christina; Louie, Jennifer Y.; Shapiro, Mor; Bolger, Niall; Tottenham, Nim
2018-01-01
Institutional caregiving is associated with significant deviations from species-expected caregiving, altering the normative sequence of attachment formation and placing children at risk for long-term emotional difficulties. However, little is known about factors that can promote resilience following early institutional caregiving. In the current study, we investigated how adaptations in affective processing (i.e. positive valence bias) and family-level protective factors (i.e. secure parent-child relationships) moderate risk for internalizing symptoms in Previously Institutionalized (PI) youth. Children and adolescents with and without a history of institutional care performed a laboratory-based affective processing task and self-reported measures of parent-child relationship security. PI youth were more likely than comparison youth to show positive valence biases when interpreting ambiguous facial expressions. Both positive valence bias and parent-child relationship security moderated the association between institutional care and parent-reported internalizing symptoms, such that greater positive valence bias and more secure parent-child relationships predicted fewer symptoms in PI youth. However, when both factors were tested concurrently, parent-child relationship security more strongly moderated the link between PI status and internalizing symptoms. These findings suggest that both individual-level adaptations in affective processing and family-level factors of secure parent-child relationships may ameliorate risk for internalizing psychopathology following early institutional caregiving. PMID:28401841
Vantieghem, Michelle R; Gabard-Durnam, Laurel; Goff, Bonnie; Flannery, Jessica; Humphreys, Kathryn L; Telzer, Eva H; Caldera, Christina; Louie, Jennifer Y; Shapiro, Mor; Bolger, Niall; Tottenham, Nim
2017-05-01
Institutional caregiving is associated with significant deviations from species-expected caregiving, altering the normative sequence of attachment formation and placing children at risk for long-term emotional difficulties. However, little is known about factors that can promote resilience following early institutional caregiving. In the current study, we investigated how adaptations in affective processing (i.e., positive valence bias) and family-level protective factors (i.e., secure parent-child relationships) moderate risk for internalizing symptoms in previously institutionalized (PI) youth. Children and adolescents with and without a history of institutional care performed a laboratory-based affective processing task and self-reported measures of parent-child relationship security. PI youth were more likely than comparison youth to show positive valence biases when interpreting ambiguous facial expressions. Both positive valence bias and parent-child relationship security moderated the association between institutional care and parent-reported internalizing symptoms, such that greater positive valence bias and more secure parent-child relationships predicted fewer symptoms in PI youth. However, when both factors were tested concurrently, parent-child relationship security more strongly moderated the link between PI status and internalizing symptoms. These findings suggest that both individual-level adaptations in affective processing and family-level factors of secure parent-child relationships may ameliorate risk for internalizing psychopathology following early institutional caregiving.
Positive Streptobacillus moniliformis PCR in guinea pigs likely due to Leptotrichia spp.
Boot, Ron; Van de Berg, Lia; Reubsaet, Frans A G; Vlemminx, Maurice J
2008-04-30
Streptobacillus moniliformis is a zoonotic bacterium. We obtained positive S. moniliformis PCR results in oral swab samples from guinea pigs from an experimental colony and the breeding colony of origin. Comparison of the DNA sequence of an amplicon with deposited 16S rDNA sequences revealed that Leptotrichia sp. can be the source of a false positive S. moniliformis PCR outcome.
Antonescu, Cristina R; Agaram, Narasimhan P; Sung, Yun-Shao; Zhang, Lei; Swanson, David; Dickson, Brendan C
2018-04-01
ACTB-GLI1 fusions have been reported as the pathognomonic genetic abnormality defining an unusual subset of actin-positive, perivascular myoid tumors, known as "pericytoma with the t(7;12) translocation." In addition, GLI1 oncogenic activation through a related MALAT1-GLI1 gene fusion has been recently reported in 2 unrelated gastric tumors, namely plexiform fibromyxoma and gastroblastoma. Triggered by unexpected targeted RNA-sequencing results detecting GLI1-related fusions in a group of malignant neoplasms with round to epithelioid morphology, and frequently strong S100 protein immunoreactivity, we investigated their clinicopathologic features in relation to other known pathologic entities sharing similar genetics. On the basis of a combined approach of targeted RNA sequencing and fluorescence in situ hybridization screening, we identified 6 cases with GLI1 gene fusions, including 4 fused to ACTB, 1 with MALAT1 and 1 with PTCH1 gene. Patients had a mean age of 36 years at diagnosis (range, 16 to 79 y) and slight female predilection all except 1 tumor originated in the soft tissue. Microscopically, the tumors had a monomorphic epithelioid phenotype arranged in a distinctive nested or cord-like architecture, separated by thin septae and delicate capillary network. All except 2 cases were strongly positive for S100 protein, whereas being negative for SOX10, SMA, and EMA. Only 1 tumor showed focal cytokeratin positivity in rare cells. Although the tumors showed some resemblance to pericytic/glomus tumors or myoepithelial tumors, the immunoprofile was not supportive of either lineage. Moreover, in contrast to the benign course of so-called pericytoma with t(7;12), 3 patients in this series developed metastatic disease to either lymph nodes or lung. In fact the only patient with lung metastases showed a novel PTCH1-GLI1 gene fusion. It remains to be determined whether these tumors represent a clinically and immunohistologically distinct subset of pericytoma, or an altogether novel soft tissue sarcoma. Our findings open new opportunities for targeted therapy, as tumors with GLI1 oncogenic activation, and subsequent PTCH1 overexpression, might be sensitive to sonic hedgehog pathway inhibitors.
NASA Astrophysics Data System (ADS)
Jézéquel, Tangi; Silvestre, Virginie; Dinis, Katy; Giraudeau, Patrick; Akoka, Serge
2018-04-01
Isotope ratio monitoring by 13C NMR spectrometry (irm-13C NMR) provides the complete 13C intramolecular position-specific composition at natural abundance. It represents a powerful tool to track the (bio)chemical pathway which has led to the synthesis of targeted molecules, since it allows Position-specific Isotope Analysis (PSIA). Due to the very small composition range (which represents the range of variation of the isotopic composition of a given nuclei) of 13C natural abundance values (50‰), irm-13C NMR requires a 1‰ accuracy and thus highly quantitative analysis by 13C NMR. Until now, the conventional strategy to determine the position-specific abundance xi relies on the combination of irm-MS (isotopic ratio monitoring Mass Spectrometry) and 13C quantitative NMR. However this approach presents a serious drawback since it relies on two different techniques and requires to measure separately the signal of all the carbons of the analyzed compound, which is not always possible. To circumvent this constraint, we recently proposed a new methodology to perform 13C isotopic analysis using an internal reference method and relying on NMR only. The method combines a highly quantitative 1H NMR pulse sequence (named DWET) with a 13C isotopic NMR measurement. However, the recently published DWET sequence is unsuited for samples with short T1, which forms a serious limitation for irm-13C NMR experiments where a relaxing agent is added. In this context, we suggest two variants of the DWET called Multi-WET and Profiled-WET, developed and optimized to reach the same accuracy of 1‰ with a better immunity towards T1 variations. Their performance is evaluated on the determination of the 13C isotopic profile of vanillin. Both pulse sequences show a 1‰ accuracy with an increased robustness to pulse miscalibrations compared to the initial DWET method. This constitutes a major advance in the context of irm-13C NMR since it is now possible to perform isotopic analysis with high relaxing agent concentrations, leading to a strong reduction of the overall experiment time.
Meli, Marina L.; Cattori, Valentino; Martínez, Fernando; López, Guillermo; Vargas, Astrid; Simón, Miguel A.; Zorrilla, Irene; Muñoz, Alvaro; Palomares, Francisco; López-Bao, Jose V.; Pastor, Josep; Tandon, Ravi; Willi, Barbara; Hofmann-Lehmann, Regina; Lutz, Hans
2009-01-01
Background The Iberian lynx (Lynx pardinus) is considered the most endangered felid species in the world. In order to save this species, the Spanish authorities implemented a captive breeding program recruiting lynxes from the wild. In this context, a retrospective survey on prevalence of selected feline pathogens in free-ranging lynxes was initiated. Methodology/ Principal Findings We systematically analyzed the prevalence and importance of seven viral, one protozoan (Cytauxzoon felis), and several bacterial (e.g., hemotropic mycoplasma) infections in 77 of approximately 200 remaining free-ranging Iberian lynxes of the Doñana and Sierra Morena areas, in Southern Spain, between 2003 and 2007. With the exception of feline immunodeficiency virus (FIV), evidence of infection by all tested feline pathogens was found in Iberian lynxes. Fourteen lynxes were feline leukemia virus (FeLV) provirus-positive; eleven of these were antigenemic (FeLV p27 positive). All 14 animals tested negative for other viral infections. During a six-month period in 2007, six of the provirus-positive antigenemic lynxes died. Infection with FeLV but not with other infectious agents was associated with mortality (p<0.001). Sequencing of the FeLV surface glycoprotein gene revealed a common origin for ten of the eleven samples. The ten sequences were closely related to FeLV-A/61E, originally isolated from cats in the USA. Endogenous FeLV sequences were not detected. Conclusions/Significance It was concluded that the FeLV infection most likely originated from domestic cats invading the lynx's habitats. Data available regarding the time frame, co-infections, and outcome of FeLV-infections suggest that, in contrast to the domestic cat, the FeLV strain affecting the lynxes in 2007 is highly virulent to this species. Our data argue strongly for vaccination of lynxes and domestic cats in and around lynx's habitats in order to prevent further spread of the virus as well as reduction the domestic cat population if the lynx population is to be maintained. PMID:19270739
Melton, Emily Denise; Sorokin, Dimitry Y.; Overmars, Lex; ...
2017-09-21
Dethiobacter alkaliphilus strain AHT1 T is an anaerobic, sulfidogenic, moderately salt-tolerant alkaliphilic chemolithotroph isolated from hypersaline soda lake sediments in northeastern Mongolia. It is thus a Gram-positive bacterium with low GC content, within the phylum Firmicutes. We report its draft genome sequence, which consists of 34 contigs with a total sequence length of 3.12 Mbp. D. alkaliphilus strain AHT1 T was sequenced by the Joint Genome Institute (JGI) as part of the Community Science Program due to its relevance to bioremediation and biotechnological applications.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Melton, Emily Denise; Sorokin, Dimitry Y.; Overmars, Lex
Dethiobacter alkaliphilus strain AHT1 T is an anaerobic, sulfidogenic, moderately salt-tolerant alkaliphilic chemolithotroph isolated from hypersaline soda lake sediments in northeastern Mongolia. It is thus a Gram-positive bacterium with low GC content, within the phylum Firmicutes. We report its draft genome sequence, which consists of 34 contigs with a total sequence length of 3.12 Mbp. D. alkaliphilus strain AHT1 T was sequenced by the Joint Genome Institute (JGI) as part of the Community Science Program due to its relevance to bioremediation and biotechnological applications.
Dubé, Marie-Pier; Castonguay, Yves; Cloutier, Jean; Michaud, Josée; Bertrand, Annick
2013-03-01
Dehydrin defines a complex family of intrinsically disordered proteins with potential adaptive value with regard to freeze-induced cell dehydration. Search within an expressed sequence tags library from cDNAs of cold-acclimated crowns of alfalfa (Medicago sativa spp. sativa L.) identified transcripts putatively encoding K(3)-type dehydrins. Analysis of full-length coding sequences unveiled two highly homologous sequence variants, K(3)-A and K(3)-B. An increase in the frequency of genotypes yielding positive genomic amplification of the K(3)-dehydrin variants in response to selection for superior tolerance to freezing and the induction of their expression at low temperature strongly support a link with cold adaptation. The presence of multiple allelic forms within single genotypes and independent segregation indicate that the two K(3) dehydrin variants are encoded by distinct genes located at unlinked loci. The co-inheritance of the K(3)-A dehydrin with a Y(2)K(4) dehydrin restriction fragment length polymorphism with a demonstrated impact on freezing tolerance suggests the presence of a genome domain where these functionally related genes are located. These results provide additional evidence that dehydrin play important roles with regard to tolerance to subfreezing temperatures. They also underscore the value of recurrent selection to help identify variants within a large multigene family in allopolyploid species like alfalfa.
Identification of probabilities.
Vitányi, Paul M B; Chater, Nick
2017-02-01
Within psychology, neuroscience and artificial intelligence, there has been increasing interest in the proposal that the brain builds probabilistic models of sensory and linguistic input: that is, to infer a probabilistic model from a sample. The practical problems of such inference are substantial: the brain has limited data and restricted computational resources. But there is a more fundamental question: is the problem of inferring a probabilistic model from a sample possible even in principle? We explore this question and find some surprisingly positive and general results. First, for a broad class of probability distributions characterized by computability restrictions, we specify a learning algorithm that will almost surely identify a probability distribution in the limit given a finite i.i.d. sample of sufficient but unknown length. This is similarly shown to hold for sequences generated by a broad class of Markov chains, subject to computability assumptions. The technical tool is the strong law of large numbers. Second, for a large class of dependent sequences, we specify an algorithm which identifies in the limit a computable measure for which the sequence is typical, in the sense of Martin-Löf (there may be more than one such measure). The technical tool is the theory of Kolmogorov complexity. We analyze the associated predictions in both cases. We also briefly consider special cases, including language learning, and wider theoretical implications for psychology.
Dissecting substrate specificities of the mitochondrial AFG3L2 protease.
Ding, Bojian; Martin, Dwight W; Rampello, Anthony J; Glynn, Steven E
2018-06-22
Human AFG3L2 is a compartmental AAA+ protease that performs ATP-fueled degradation at the matrix face of the inner mitochondrial membrane. Identifying how AFG3L2 selects substrates from the diverse complement of matrix-localized proteins is essential for understanding mitochondrial protein biogenesis and quality control. Here, we create solubilized forms of AFG3L2 to examine the enzyme's substrate specificity mechanisms. We show that conserved residues within the pre-sequence of the mitochondrial ribosomal protein, MrpL32, target the subunit to the protease for processing into a mature form. Moreover, these residues can act as a degron, delivering diverse model proteins to AFG3L2 for degradation. By determining the sequence of degra-dation products from multiple substrates using mass spectrometry, we construct a peptidase specificity pro-file that displays constrained product lengths and is dominated by the identity of the residue at the P1' posi-tion, with a strong preference for hydrophobic and small polar residues. This specificity profile is validated by examining the cleavage of both fluorogenic reporter peptides and full polypeptide substrates bearing different P1' residues. Together, these results demonstrate that AFG3L2 contains multiple modes of specificity, dis-criminating between potential substrates by recognizing accessible degron sequences, and performing peptide bond cleavage at preferred patterns of residues within the compartmental chamber.
Sheft, Stanley; Norris, Molly; Spanos, George; Radasevich, Katherine; Formsma, Paige; Gygi, Brian
2016-01-01
Objective Sounds in everyday environments tend to follow one another as events unfold over time. The tacit knowledge of contextual relationships among environmental sounds can influence their perception. We examined the effect of semantic context on the identification of sequences of environmental sounds by adults of varying age and hearing abilities, with an aim to develop a nonspeech test of auditory cognition. Method The familiar environmental sound test (FEST) consisted of 25 individual sounds arranged into ten five-sound sequences: five contextually coherent and five incoherent. After hearing each sequence, listeners identified each sound and arranged them in the presentation order. FEST was administered to young normal-hearing, middle-to-older normal-hearing, and middle-to-older hearing-impaired adults (Experiment 1), and to postlingual cochlear-implant users and young normal-hearing adults tested through vocoder-simulated implants (Experiment 2). Results FEST scores revealed a strong positive effect of semantic context in all listener groups, with young normal-hearing listeners outperforming other groups. FEST scores also correlated with other measures of cognitive ability, and for CI users, with the intelligibility of speech-in-noise. Conclusions Being sensitive to semantic context effects, FEST can serve as a nonspeech test of auditory cognition for diverse listener populations to assess and potentially improve everyday listening skills. PMID:27893791
Impact of elicited mood on movement expressivity during a fitness task.
Giraud, Tom; Focone, Florian; Isableu, Brice; Martin, Jean-Claude; Demulier, Virginie
2016-10-01
The purpose of the present study was to evaluate the impact of four mood conditions (control, positive, negative, aroused) on movement expressivity during a fitness task. Motion capture data from twenty individuals were recorded as they performed a predefined motion sequence. Moods were elicited using task-specific scenarii to keep a valid context. Movement qualities inspired by Effort-Shape framework (Laban & Ullmann, 1971) were computed (i.e., Impulsiveness, Energy, Directness, Jerkiness and Expansiveness). A reduced number of computed features from each movement quality was selected via Principal Component Analyses. Analyses of variance and Generalized Linear Mixed Models were used to identify movement characteristics discriminating the four mood conditions. The aroused mood condition was strongly associated with increased mean Energy compared to the three other conditions. The positive and negative mood conditions showed more subtle differences interpreted as a result of their moderate activation level. Positive mood was associated with more impulsive movements and negative mood was associated with more tense movements (i.e., reduced variability and increased Jerkiness). Findings evidence the key role of movement qualities in capturing motion signatures of moods and highlight the importance of task context in their interpretations. Copyright © 2016 Elsevier B.V. All rights reserved.
Gattuso, Hugo; Durand, Elodie; Bignon, Emmanuelle; Morell, Christophe; Georgakilas, Alexandros G; Dumont, Elise; Chipot, Christophe; Dehez, François; Monari, Antonio
2016-10-06
In the present contribution, the interaction between damaged DNA and repair enzymes is examined by means of molecular dynamics simulations. More specifically, we consider clustered abasic DNA lesions processed by the primary human apurinic/apyrimidinic (AP) endonuclease, APE1. Our results show that, in stark contrast with the corresponding bacterial endonucleases, human APE1 imposes strong geometrical constraints on the DNA duplex. As a consequence, the level of recognition and, hence, the repair rate is higher. Important features that guide the DNA/protein interactions are the presence of an extended positively charged region and of a molecular tweezers that strongly constrains DNA. Our results are on very good agreement with the experimentally determined repair rate of clustered abasic lesions. The lack of repair for one particular arrangement of the two abasic sites is also explained considering the peculiar destabilizing interaction between the recognition region and the second lesion, resulting in a partial opening of the molecular tweezers and, thus, a less stable complex. This contribution cogently establishes the molecular bases for the recognition and repair of clustered DNA lesions by means of human endonucleases.
Yang, Xiaojun; Wang, Xiaohong; Liang, Zhijuan; Zhang, Xiaoya; Wang, Yanbo; Wang, Zhenhai
2014-05-01
To study the species and amount of bacteria in sputum of patients with ventilator-associated pneumonia (VAP) by using 16S rDNA sequencing analysis, and to explore the new method for etiologic diagnosis of VAP. Bronchoalveolar lavage sputum samples were collected from 31 patients with VAP. Bacterial DNA of the samples were extracted and identified by polymerase chain reaction (PCR). At the same time, sputum specimens were processed for routine bacterial culture. The high flux sequencing experiment was conducted on PCR positive samples with 16S rDNA macro genome sequencing technology, and sequencing results were analyzed using bioinformatics, then the results between the sequencing and bacteria culture were compared. (1) 550 bp of specific DNA sequences were amplified in sputum specimens from 27 cases of the 31 patients with VAP, and they were used for sequencing analysis. 103 856 sequences were obtained from those sputum specimens using 16S rDNA sequencing, yielding approximately 39 Mb of raw data. Tag sequencing was able to inform genus level in all 27 samples. (2) Alpha-diversity analysis showed that sputum samples of patients with VAP had significantly higher variability and richness in bacterial species (Shannon index values 1.20, Simpson index values 0.48). Rarefaction curve analysis showed that there were more species that were not detected by sequencing from some VAP sputum samples. (3) Analysis of 27 sputum samples with VAP by using 16S rDNA sequences yielded four phyla: namely Acitinobacteria, Bacteroidetes, Firmicutes, Proteobacteria. With genus as a classification, it was found that the dominant species included Streptococcus 88.9% (24/27), Limnohabitans 77.8% (21/27), Acinetobacter 70.4% (19/27), Sphingomonas 63.0% (17/27), Prevotella 63.0% (17/27), Klebsiella 55.6% (15/27), Pseudomonas 55.6% (15/27), Aquabacterium 55.6% (15/27), and Corynebacterium 55.6% (15/27). (4) Pyrophosphate sequencing discovered that Prevotella, Limnohabitans, Aquabacterium, Sphingomonas might not be detected by routine bacteria culture. Among seven species which were identified by both methods, pyrophosphate sequencing yielded higher positive rate than that of ordinary bacteria culture [Streptococcus: 88.9% (24/27) vs. 18.5% (5/27), Klebsiella: 55.6% (15/27) vs. 18.5% (5/27), Acinetobacter: 70.4% (19/27) vs. 37.0% (10/27), Corynebacterium: 55.6% (15/27) vs. 7.4% (2/27), P<0.05 or P<0.01]. Sequencing positive rate was found to increase positive rate for culture of Pseudomonas [55.6% (15/27) vs. 25.9% (7/27), P=0.050]. No significant differences were observed between sequencing and ordinary bacteria culture for detection Staphylococcus [7.4% (2/27) vs. 11.1% (3/27)] and Neisseria bacteria genera [18.5% (5/27) vs. 3.7% (1/27), both P>0.05]. 16S rDNA sequencing analysis confirmed that pathogenic bacteria in sputum of VAP were complicated with multiple drug resistant strains. Compared with routine bacterial culture, pyrophosphate sequencing had higher positive rate in detecting pathogens. 16S rDNA gene sequencing technology may become a new method for etiological diagnosis of VAP.
Divergent positive selection in rhodopsin from lake and riverine cichlid fishes.
Schott, Ryan K; Refvik, Shannon P; Hauser, Frances E; López-Fernández, Hernán; Chang, Belinda S W
2014-05-01
Studies of cichlid evolution have highlighted the importance of visual pigment genes in the spectacular radiation of the African rift lake cichlids. Recent work, however, has also provided strong evidence for adaptive diversification of riverine cichlids in the Neotropics, which inhabit environments of markedly different spectral properties from the African rift lakes. These ecological and/or biogeographic differences may have imposed divergent selective pressures on the evolution of the cichlid visual system. To test these hypotheses, we investigated the molecular evolution of the dim-light visual pigment, rhodopsin. We sequenced rhodopsin from Neotropical and African riverine cichlids and combined these data with published sequences from African cichlids. We found significant evidence for positive selection using random sites codon models in all cichlid groups, with the highest levels in African lake cichlids. Tests using branch-site and clade models that partitioned the data along ecological (lake, river) and/or biogeographic (African, Neotropical) boundaries found significant evidence of divergent selective pressures among cichlid groups. However, statistical comparisons among these models suggest that ecological, rather than biogeographic, factors may be responsible for divergent selective pressures that have shaped the evolution of the visual system in cichlids. We found that branch-site models did not perform as well as clade models for our data set, in which there was evidence for positive selection in the background. One of our most intriguing results is that the amino acid sites found to be under positive selection in Neotropical and African lake cichlids were largely nonoverlapping, despite falling into the same three functional categories: spectral tuning, retinal uptake/release, and rhodopsin dimerization. Taken together, these results would imply divergent selection across cichlid clades, but targeting similar functions. This study highlights the importance of molecular investigations of ecologically important groups and the flexibility of clade models in explicitly testing ecological hypotheses.
Positive selection in the SLC11A1 gene in the family Equidae.
Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr
2016-05-01
Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.
Superactive amidated COOH-terminal glucagon analogues with no methionine or tryptophan.
Murphy, W A; Coy, D H; Lance, V A
1986-01-01
The functions of the Trp-25 and Met-27 residues and the free carboxy terminus of glucagon have been debated for many years. Despite some semi-synthetic data to the contrary, comparison of the glucagon sequence with the other 5 members of this family of peptides, all of them amides and particularly growth hormone-releasing factor(1-29) amide and its recently described analogues, suggests that alterations to these positions should be quite well tolerated in terms of biological activity. To test this prediction, [Phe-25,Leu-27]-glucagon amide was synthesized in high yield and was found to actually have superior glycogenolytic activity (196%) to glucagon in the rat. Replacement of Gly-4 by D-Phe, which has been shown to give much enhanced glycogenolytic activity than glucagon itself, also increased the activity of [D-Phe-4,Phe-25,Leu-27]-glucagon amide (518%). The L-Phe-4-analogue, [Phe-4,25,Leu-27]-glucagon amide, in contrast, was 20 times less active (30%), strongly suggesting the presence of a beta-bend in this N-terminal region of glucagon. This was supported by Chou-Fasman structural predictions which indicate extensive folding in the 1-15 region. Indeed, additional conformational restriction by substitution of D-Ser in position 2 of glucagon also increased activity to 226%. [D-Gln-3]-glucagon was slightly less active (74%) than glucagon. Chou-Fasman calculations on glucagon were compared to similar treatments of the VIP, secretin, PHI, and GRF(1-29) sequences.
Fanning, T; Singer, M
1987-01-01
Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227
The Arabidopsis lyrata genome sequence and the basis of rapid genome size change
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.
2011-04-29
In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspectmore » centromeres, intergenic regions, transposable elements, gene family number is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.« less
Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A
1997-05-01
The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases
Arand, Julia; Spieler, David; Karius, Tommy; Branco, Miguel R.; Meilinger, Daniela; Meissner, Alexander; Jenuwein, Thomas; Xu, Guoliang; Leonhardt, Heinrich; Wolf, Verena; Walter, Jörn
2012-01-01
The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs. PMID:22761581
Tennessen, Jacob A.; Govindarajulu, Rajanikanth; Ashman, Tia-Lynn; Liston, Aaron
2014-01-01
Whole-genome duplications are radical evolutionary events that have driven speciation and adaptation in many taxa. Higher-order polyploids have complex histories often including interspecific hybridization and dynamic genomic changes. This chromosomal reshuffling is poorly understood for most polyploid species, despite their evolutionary and agricultural importance, due to the challenge of distinguishing homologous sequences from each other. Here, we use dense linkage maps generated with targeted sequence capture to improve the diploid strawberry (Fragaria vesca) reference genome and to disentangle the subgenomes of the wild octoploid progenitors of cultivated strawberry, Fragaria virginiana and Fragaria chiloensis. Our novel approach, POLiMAPS (Phylogenetics Of Linkage-Map-Anchored Polyploid Subgenomes), leverages sequence reads to associate informative interhomeolog phylogenetic markers with linkage groups and reference genome positions. In contrast to a widely accepted model, we find that one of the four subgenomes originates with the diploid cytoplasm donor F. vesca, one with the diploid Fragaria iinumae, and two with an unknown ancestor close to F. iinumae. Extensive unidirectional introgression has converted F. iinumae-like subgenomes to be more F. vesca-like, but never the reverse, due either to homoploid hybridization in the F. iinumae-like diploid ancestors or else strong selection spreading F. vesca-like sequence among subgenomes through homeologous exchange. In addition, divergence between homeologous chromosomes has been substantially augmented by interchromosomal rearrangements. Our phylogenetic approach reveals novel aspects of the complicated web of genetic exchanges that occur during polyploid evolution and suggests a path forward for unraveling other agriculturally and ecologically important polyploid genomes. PMID:25477420
RNA degradation and models for post-transcriptional gene-silencing.
Meins, F
2000-06-01
Post-transcriptional gene silencing (PTGS) is a form of stable but potentially reversible epigenetic modification, which frequently occurs in transgenic plants. The interaction in trans of genes with similar transcribed sequences results in sequence-specific degradation of RNAs derived from the genes involved. Highly expressed single-copy loci, transcribed inverted repeats, and poorly transcribed complex loci can act as sources of signals that trigger PTGS. In some cases, mobile, sequence-specific silencing signals can move from cell to cell or even over long distances in the plant. Several current models hold that silencing signals are 'aberrant' RNAs (aRNA), which differ in some way from normal mRNAs. The most likely candidates are small antisense RNAs (asRNA) and double-stranded RNAs (dsRNA). Direct evidence that these or other aRNAs found in silent tissues can induce PTGS is still lacking. Most current models assume that silencing signals interact with target RNAs in a sequence-specific fashion. This results in degradation, usually in the cytoplasm, by exonucleolytic as well as endonucleolytic pathways, which are not necessarily PTGS-specific. Biochemical-switch models hold that the silent state is maintained by a positive auto-regulatory loop. One possibility is that concentrations of hypothetical silencing signals above a critical threshold trigger their own production by self-replication, by degradation of target RNAs, or by a combination of both mechanisms. These models can account for the stability, reversibility and multiplicity of silent states; the strong influence of transcription rate of target genes on the incidence and stability of silencing, and the amplification and systemic propagation of motile silencing signals.
Nucleosome Positioning and Epigenetics
NASA Astrophysics Data System (ADS)
Schwab, David; Bruinsma, Robijn
2008-03-01
The role of chromatin structure in gene regulation has recently taken center stage in the field of epigenetics, phenomena that change the phenotype without changing the DNA sequence. Recent work has also shown that nucleosomes, a complex of DNA wrapped around a histone octamer, experience a sequence dependent energy landscape due to the variation in DNA bend stiffness with sequence composition. In this talk, we consider the role nucleosome positioning might play in the formation of heterochromatin, a compact form of DNA generically responsible for gene silencing. In particular, we discuss how different patterns of nucleosome positions, periodic or random, could either facilitate or suppress heterochromatin stability and formation.
Position specific variation in the rate of evolution in transcription factor binding sites
Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B
2003-01-01
Background The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282