Predicting residue-wise contact orders in proteins by support vector regression.
Song, Jiangning; Burrage, Kevin
2006-10-03
The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
Arneth, Borros
2012-10-01
As possible mechanisms to explain the emergence of autoimmune diseases, the current author has suggested in earlier papers two new pathways: the "protein localization hypothesis" and the "protein traffic hypothesis". The "protein localization hypothesis" states that an autoimmune disease develops if a protein accumulates in a previously unoccupied compartment, that did not previously contain that protein. Similarly, the "protein traffic hypothesis" states that a sudden error within the transport of a certain protein leads to the emergence of an autoimmune disease. The current article discusses the usefulness of the different commercially available transgenic murine models of diabetes mellitus type 1 to confirm the aforementioned hypotheses. This discussion shows that several transgenic murine models of diabetes mellitus type 1 are in-line and confirm the aforementioned hypotheses. Furthermore, these hypotheses are additionally inline with the occurrence of several newly discovered protein sequences, the so-called trepitope sequences. These sequences modulate the immune response to certain proteins. The current study analyzed to what extent the hypotheses are supported by the occurrence of these new sequences. Thereby the occurrence of the trepitope sequences provides additional evidence supporting the aforementioned hypotheses. Both the "protein localization hypothesis" and the "protein traffic hypothesis" have the potential to lead to new causal therapy concepts. The "protein localization hypothesis" and the "protein traffic hypothesis" provide conceptional explanations for the diabetes mouse models as well as for the newly discovered trepitope sequences. Copyright © 2012 Elsevier Ltd. All rights reserved.
Zhang, Li; Liao, Bo; Li, Dachao; Zhu, Wen
2009-07-21
Apoptosis, or programmed cell death, plays an important role in development of an organism. Obtaining information on subcellular location of apoptosis proteins is very helpful to understand the apoptosis mechanism. In this paper, based on the concept that the position distribution information of amino acids is closely related with the structure and function of proteins, we introduce the concept of distance frequency [Matsuda, S., Vert, J.P., Ueda, N., Toh, H., Akutsu, T., 2005. A novel representation of protein sequences for prediction of subcellular location using support vector machines. Protein Sci. 14, 2804-2813] and propose a novel way to calculate distance frequencies. In order to calculate the local features, each protein sequence is separated into p parts with the same length in our paper. Then we use the novel representation of protein sequences and adopt support vector machine to predict subcellular location. The overall prediction accuracy is significantly improved by jackknife test.
Eddy, Sean R.
2008-01-01
Sequence database searches require accurate estimation of the statistical significance of scores. Optimal local sequence alignment scores follow Gumbel distributions, but determining an important parameter of the distribution (λ) requires time-consuming computational simulation. Moreover, optimal alignment scores are less powerful than probabilistic scores that integrate over alignment uncertainty (“Forward” scores), but the expected distribution of Forward scores remains unknown. Here, I conjecture that both expected score distributions have simple, predictable forms when full probabilistic modeling methods are used. For a probabilistic model of local sequence alignment, optimal alignment bit scores (“Viterbi” scores) are Gumbel-distributed with constant λ = log 2, and the high scoring tail of Forward scores is exponential with the same constant λ. Simulation studies support these conjectures over a wide range of profile/sequence comparisons, using 9,318 profile-hidden Markov models from the Pfam database. This enables efficient and accurate determination of expectation values (E-values) for both Viterbi and Forward scores for probabilistic local alignments. PMID:18516236
Schmedes, Sarah E; King, Jonathan L; Budowle, Bruce
2015-01-01
Whole-genome data are invaluable for large-scale comparative genomic studies. Current sequencing technologies have made it feasible to sequence entire bacterial genomes with relative ease and time with a substantially reduced cost per nucleotide, hence cost per genome. More than 3,000 bacterial genomes have been sequenced and are available at the finished status. Publically available genomes can be readily downloaded; however, there are challenges to verify the specific supporting data contained within the download and to identify errors and inconsistencies that may be present within the organizational data content and metadata. AutoCurE, an automated tool for bacterial genome database curation in Excel, was developed to facilitate local database curation of supporting data that accompany downloaded genomes from the National Center for Biotechnology Information. AutoCurE provides an automated approach to curate local genomic databases by flagging inconsistencies or errors by comparing the downloaded supporting data to the genome reports to verify genome name, RefSeq accession numbers, the presence of archaea, BioProject/UIDs, and sequence file descriptions. Flags are generated for nine metadata fields if there are inconsistencies between the downloaded genomes and genomes reports and if erroneous or missing data are evident. AutoCurE is an easy-to-use tool for local database curation for large-scale genome data prior to downstream analyses.
Using GBrowse 2.0 to visualize and share next-generation sequence data
2013-01-01
GBrowse is a mature web-based genome browser that is suitable for deployment on both public and private web sites. It supports most of genome browser features, including qualitative and quantitative (wiggle) tracks, track uploading, track sharing, interactive track configuration, semantic zooming and limited smooth track panning. As of version 2.0, GBrowse supports next-generation sequencing (NGS) data by providing for the direct display of SAM and BAM sequence alignment files. SAM/BAM tracks provide semantic zooming and support both local and remote data sources. This article provides step-by-step instructions for configuring GBrowse to display NGS data. PMID:23376193
Song, Jiangning; Burrage, Kevin; Yuan, Zheng; Huber, Thomas
2006-03-09
The majority of peptide bonds in proteins are found to occur in the trans conformation. However, for proline residues, a considerable fraction of Prolyl peptide bonds adopt the cis form. Proline cis/trans isomerization is known to play a critical role in protein folding, splicing, cell signaling and transmembrane active transport. Accurate prediction of proline cis/trans isomerization in proteins would have many important applications towards the understanding of protein structure and function. In this paper, we propose a new approach to predict the proline cis/trans isomerization in proteins using support vector machine (SVM). The preliminary results indicated that using Radial Basis Function (RBF) kernels could lead to better prediction performance than that of polynomial and linear kernel functions. We used single sequence information of different local window sizes, amino acid compositions of different local sequences, multiple sequence alignment obtained from PSI-BLAST and the secondary structure information predicted by PSIPRED. We explored these different sequence encoding schemes in order to investigate their effects on the prediction performance. The training and testing of this approach was performed on a newly enlarged dataset of 2424 non-homologous proteins determined by X-Ray diffraction method using 5-fold cross-validation. Selecting the window size 11 provided the best performance for determining the proline cis/trans isomerization based on the single amino acid sequence. It was found that using multiple sequence alignments in the form of PSI-BLAST profiles could significantly improve the prediction performance, the prediction accuracy increased from 62.8% with single sequence to 69.8% and Matthews Correlation Coefficient (MCC) improved from 0.26 with single local sequence to 0.40. Furthermore, if coupled with the predicted secondary structure information by PSIPRED, our method yielded a prediction accuracy of 71.5% and MCC of 0.43, 9% and 0.17 higher than the accuracy achieved based on the singe sequence information, respectively. A new method has been developed to predict the proline cis/trans isomerization in proteins based on support vector machine, which used the single amino acid sequence with different local window sizes, the amino acid compositions of local sequence flanking centered proline residues, the position-specific scoring matrices (PSSMs) extracted by PSI-BLAST and the predicted secondary structures generated by PSIPRED. The successful application of SVM approach in this study reinforced that SVM is a powerful tool in predicting proline cis/trans isomerization in proteins and biological sequence analysis.
2010-01-01
Background Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The "Folmer region" detects a single taxon using a 3% divergence threshold. Methods To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (<100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (~798 - 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapá state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage. Conclusions Strong support for combined data with significant differentiation detected in the COI and absent in the nDNA suggest that the divergence is recent, and detectable only by the faster evolving mtDNA. A within subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara. PMID:20929572
DOE Office of Scientific and Technical Information (OSTI.GOV)
Randall, Graham L.; Zechiedrich, E. L.; Pettitt, Bernard M.
2009-09-01
To understand how underwinding and overwinding the DNA helix affects its structure, we simulated 19 independent DNA systems with fixed degrees of twist using molecular dynamics in a system that does not allow writhe. Underwinding DNA induced spontaneous, sequence-dependent base flipping and local denaturation, while overwinding DNA induced the formation of Pauling-like DNA (P-DNA). The winding resulted in a bimodal state simultaneously including local structural failure and B-form DNA for both underwinding and extreme overwinding. Our simulations suggest that base flipping and local denaturation may provide a landscape influencing protein recognition of DNA sequence to affect, for examples, replication, transcriptionmore » and recombination. Additionally, our findings help explain results from singlemolecule experiments and demonstrate that elastic rod models are strictly valid on average only for unstressed or overwound DNA up to P-DNA formation. Finally, our data support a model in which base flipping can result from torsional stress.« less
Galbany-Casals, M; Carnicero-Campmany, P; Blanco-Moreno, J M; Smissen, R D
2012-09-01
Hybridisation is considered an important evolutionary phenomenon in Gnaphalieae, but contemporary hybridisation has been little explored within the tribe. Here, hybridisation between Helichrysum orientale and Helichrysum stoechas is studied at two different localities in the islands of Crete and Rhodes (Greece). Using three different types of molecular data (AFLP, nrDNA ITS sequences and cpDNA ndhF sequences) and morphological data, the aim is to provide simultaneous and direct comparisons between molecular and morphological variation among the parental species and the studied hybrid populations. AFLP profiles, ITS sequences and morphological data support the existence of hybrids at the two localities studied, shown as morphological and genetic intermediates between the parental species. Chloroplast DNA sequences show that both parental species can act either as pollen donor or as maternal parent. Fertility of hybrids is demonstrated by the viability of seeds produced by hybrids from both localities, and the detection of a backcross specimen to H. orientale. Although there is general congruence of morphological and molecular data, the analysis of morphology and ITS sequences can fail to detect backcross hybrids. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Score distributions of gapped multiple sequence alignments down to the low-probability tail
NASA Astrophysics Data System (ADS)
Fieth, Pascal; Hartmann, Alexander K.
2016-08-01
Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain result for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
An, Ji-Yong; Meng, Fan-Rong; You, Zhu-Hong; Fang, Yu-Hong; Zhao, Yu-Jun; Zhang, Ming
2016-01-01
We propose a novel computational method known as RVM-LPQ that combines the Relevance Vector Machine (RVM) model and Local Phase Quantization (LPQ) to predict PPIs from protein sequences. The main improvements are the results of representing protein sequences using the LPQ feature representation on a Position Specific Scoring Matrix (PSSM), reducing the influence of noise using a Principal Component Analysis (PCA), and using a Relevance Vector Machine (RVM) based classifier. We perform 5-fold cross-validation experiments on Yeast and Human datasets, and we achieve very high accuracies of 92.65% and 97.62%, respectively, which is significantly better than previous works. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM) classifier on the Yeast dataset. The experimental results demonstrate that our RVM-LPQ method is obviously better than the SVM-based method. The promising experimental results show the efficiency and simplicity of the proposed method, which can be an automatic decision support tool for future proteomics research.
Angiuoli, Samuel V; White, James R; Matalka, Malcolm; White, Owen; Fricke, W Florian
2011-01-01
The widespread popularity of genomic applications is threatened by the "bioinformatics bottleneck" resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers.
Angiuoli, Samuel V.; White, James R.; Matalka, Malcolm; White, Owen; Fricke, W. Florian
2011-01-01
Background The widespread popularity of genomic applications is threatened by the “bioinformatics bottleneck” resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. Results We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Conclusions Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers. PMID:22028928
Al-Qurainy, F; Khan, S; Nadeem, M; Tarroum, M; Alaklabi, A
2013-03-11
The rare and endangered plants of any country are important genetic resources that often require urgent conservation measures. Assessment of phylogenetic relationships and evaluation of genetic diversity is very important prior to implementation of conservation strategies for saving rare and endangered plant species. We used internal transcribed spacer sequences of nuclear ribosomal DNA for the evaluation of sequence identity from the available taxa in the GenBank database by using the Basic Local Alignment Search Tool (BLAST). Two rare plant species viz, Heliotropium strigosum claded with H. pilosum (98% branch support) and Pancratium tortuosum claded with P. tenuifolium (61% branch support) clearly. However, some species, viz Scadoxus multiflorus, Commiphora myrrha and Senecio hadiensis showed close relationships with more than one species. We conclude that nuclear ribosomal internal transcribed spacer sequences are useful markers for phylogenetic study of these rare plant species in Saudi Arabia.
Protein location prediction using atomic composition and global features of the amino acid sequence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com; Nair, Achuthsankar S.
2010-01-22
Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectivelymore » used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.« less
Bentley, Anna M.; Normand, Guillaume; Hoyt, Jonathan
2007-01-01
The mitotic cyclins promote cell division by binding and activating cyclin-dependent kinases (CDKs). Each cyclin has a unique pattern of subcellular localization that plays a vital role in regulating cell division. During mitosis, cyclin B1 is known to localize to centrosomes, microtubules, and chromatin. To determine the mechanisms of cyclin B1 localization in M phase, we imaged full-length and mutant versions of human cyclin B1-enhanced green fluorescent protein in live cells by using spinning disk confocal microscopy. In addition to centrosome, microtubule, and chromatin localization, we found that cyclin B1 also localizes to unattached kinetochores after nuclear envelope breakdown. Kinetochore recruitment of cyclin B1 required the kinetochore proteins Hec1 and Mad2, and it was stimulated by microtubule destabilization. Mutagenesis studies revealed that cyclin B1 is recruited to kinetochores through both CDK1-dependent and -independent mechanisms. In contrast, localization of cyclin B1 to chromatin and centrosomes is independent of CDK1 binding. The N-terminal domain of cyclin B1 is necessary and sufficient for chromatin association, whereas centrosome recruitment relies on sequences within the cyclin box. Our data support a role for cyclin B1 function at unattached kinetochores, and they demonstrate that separable and distinct sequence elements target cyclin B1 to kinetochores, chromatin, and centrosomes during mitosis. PMID:17881737
Piccolo, Brian D; Wankhade, Umesh D; Chintapalli, Sree V; Bhattacharyya, Sudeepa; Chunqiao, Luo; Shankar, Kartik
2018-03-15
Dynamic assessment of microbial ecology (DAME) is a Shiny-based web application for interactive analysis and visualization of microbial sequencing data. DAME provides researchers not familiar with R programming the ability to access the most current R functions utilized for ecology and gene sequencing data analyses. Currently, DAME supports group comparisons of several ecological estimates of α-diversity and β-diversity, along with differential abundance analysis of individual taxa. Using the Shiny framework, the user has complete control of all aspects of the data analysis, including sample/experimental group selection and filtering, estimate selection, statistical methods and visualization parameters. Furthermore, graphical and tabular outputs are supported by R packages using D3.js and are fully interactive. DAME was implemented in R but can be modified by Hypertext Markup Language (HTML), Cascading Style Sheets (CSS), and JavaScript. It is freely available on the web at https://acnc-shinyapps.shinyapps.io/DAME/. Local installation and source code are available through Github (https://github.com/bdpiccolo/ACNC-DAME). Any system with R can launch DAME locally provided the shiny package is installed. bdpiccolo@uams.edu.
UCSC genome browser: deep support for molecular biomedical research.
Mangan, Mary E; Williams, Jennifer M; Lathe, Scott M; Karolchik, Donna; Lathe, Warren C
2008-01-01
The volume and complexity of genomic sequence data, and the additional experimental data required for annotation of the genomic context, pose a major challenge for display and access for biomedical researchers. Genome browsers organize this data and make it available in various ways to extract useful information to advance research projects. The UCSC Genome Browser is one of these resources. The official sequence data for a given species forms the framework to display many other types of data such as expression, variation, cross-species comparisons, and more. Visual representations of the data are available for exploration. Data can be queried with sequences. Complex database queries are also easily achieved with the Table Browser interface. Associated tools permit additional query types or access to additional data sources such as images of in situ localizations. Support for solving researcher's issues is provided with active discussion mailing lists and by providing updated training materials. The UCSC Genome Browser provides a source of deep support for a wide range of biomedical molecular research (http://genome.ucsc.edu).
Angiuoli, Samuel V; Matalka, Malcolm; Gussman, Aaron; Galens, Kevin; Vangala, Mahesh; Riley, David R; Arze, Cesar; White, James R; White, Owen; Fricke, W Florian
2011-08-30
Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.
Saini, Harsh; Raicar, Gaurav; Dehzangi, Abdollah; Lal, Sunil; Sharma, Alok
2015-12-07
Protein subcellular localization is an important topic in proteomics since it is related to a protein׳s overall function, helps in the understanding of metabolic pathways, and in drug design and discovery. In this paper, a basic approximation technique from natural language processing called the linear interpolation smoothing model is applied for predicting protein subcellular localizations. The proposed approach extracts features from syntactical information in protein sequences to build probabilistic profiles using dependency models, which are used in linear interpolation to determine how likely is a sequence to belong to a particular subcellular location. This technique builds a statistical model based on maximum likelihood. It is able to deal effectively with high dimensionality that hinders other traditional classifiers such as Support Vector Machines or k-Nearest Neighbours without sacrificing performance. This approach has been evaluated by predicting subcellular localizations of Gram positive and Gram negative bacterial proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Nyaga, Martin M; Tan, Yi; Seheri, Mapaseka L; Halpin, Rebecca A; Akopov, Asmik; Stucker, Karla M; Fedorova, Nadia B; Shrivastava, Susmita; Duncan Steele, A; Mwenda, Jason M; Pickett, Brett E; Das, Suman R; Jeffrey Mphahlele, M
2018-05-18
Rotavirus A (RVA) exhibits a wide genotype diversity globally. Little is known about the genetic composition of genotype P[6] from Africa. This study investigated possible evolutionary mechanisms leading to genetic diversity of genotype P[6] VP4 sequences. Phylogenetic analyses on 167 P[6] VP4 full-length sequences were conducted, which included six porcine-origin sequences. Of the 167 sequences, 57 were newly acquired through whole genome sequencing as part of this study. The other 110 sequences were all publicly-available global P[6] VP4 full-length sequences downloaded from GenBank. The strength of association between the phenotypic features and the phylogeny was also determined. A number of reassortment and mixed infections of RVA genotype P[6] strains were observed in this study. Phylogenetic analyses demostrated the extensive genetic diversity that exists among human P[6] strains, porcine-like strains, their concomitant clades/subclades and estimated that P[6] VP4 gene has a higher substitution rate with the mean of 1.05E-3 substitutions/site/year. Further, the phylogenetic analyses indicated that genotype P[6] strains were endemic in Africa, characterised by an extensive genetic diversity and long-time local evolution of the viruses. This was also supported by phylogeographic clustering and G-genotype clustering of the P[6] strains when Bayesian Tip-association Significance testing (BaTS) was applied, clearly supporting that the viruses evolved locally in Africa instead of spatial mixing among different regions. Overall, the results demonstrated that multiple mechanisms such as reassortment events, various mutations and possibly interspecies transmission account for the enormous diversity of genotype P[6] strains in Africa. These findings highlight the need for continued global surveillance of rotavirus diversity. Copyright © 2018 Elsevier B.V. All rights reserved.
Protein subcellular localization prediction using artificial intelligence technology.
Nair, Rajesh; Rost, Burkhard
2008-01-01
Proteins perform many important tasks in living organisms, such as catalysis of biochemical reactions, transport of nutrients, and recognition and transmission of signals. The plethora of aspects of the role of any particular protein is referred to as its "function." One aspect of protein function that has been the target of intensive research by computational biologists is its subcellular localization. Proteins must be localized in the same subcellular compartment to cooperate toward a common physiological function. Aberrant subcellular localization of proteins can result in several diseases, including kidney stones, cancer, and Alzheimer's disease. To date, sequence homology remains the most widely used method for inferring the function of a protein. However, the application of advanced artificial intelligence (AI)-based techniques in recent years has resulted in significant improvements in our ability to predict the subcellular localization of a protein. The prediction accuracy has risen steadily over the years, in large part due to the application of AI-based methods such as hidden Markov models (HMMs), neural networks (NNs), and support vector machines (SVMs), although the availability of larger experimental datasets has also played a role. Automatic methods that mine textual information from the biological literature and molecular biology databases have considerably sped up the process of annotation for proteins for which some information regarding function is available in the literature. State-of-the-art methods based on NNs and HMMs can predict the presence of N-terminal sorting signals extremely accurately. Ab initio methods that predict subcellular localization for any protein sequence using only the native amino acid sequence and features predicted from the native sequence have shown the most remarkable improvements. The prediction accuracy of these methods has increased by over 30% in the past decade. The accuracy of these methods is now on par with high-throughput methods for predicting localization, and they are beginning to play an important role in directing experimental research. In this chapter, we review some of the most important methods for the prediction of subcellular localization.
Conn, Jan E.; Moreno, Marta; Saavedra, Marlon; Bickersmith, Sara A.; Knoll, Elisabeth; Fernandez, Roberto; Vera, Hubert; Burrus, Roxanne G.; Lescano, Andres G.; Sanchez, Juan Francisco; Rivera, Esteban; Vinetz, Joseph M.
2013-01-01
Anopheline specimens were collected in 2011 by human landing catch, Shannon and CDC traps from the malaria endemic localities of Santa Rosa and San Pedro in Madre de Dios Department, Peru. Most specimens were either Anopheles (Nyssorhynchus) benarrochi B or An. (Nys.) rangeli, confirmed by polymerase chain reaction-restriction fragment length polymorphism-internal transcribed spacer 2 (PCR-RFLP-ITS2) and, for selected individuals, ITS2 sequences. A few specimens from Lupuna, Loreto Department, northern Amazonian Peru, were also identified as An. benarrochi B. A statistical parsimony network using ITS2 sequences confirmed that all Peruvian An. benarrochi B analyzed were identical to those in GenBank from Putumayo, southern Colombia. Sequences of the mtDNA COI BOLD region of specimens from all three Peruvian localities were connected using a statistical parsimony network, although there were multiple mutation steps between northern and southern Peruvian sequences. A Bayesian inference of concatenated Peruvian sequences of ITS2+COI detected a single clade with very high support for all An. benarrochi B except one individual from Lupuna that was excluded. No samples were positive for Plasmodium by CytB-PCR. PMID:23243107
2011-01-01
Background Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. Results We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. Conclusion The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing. PMID:21878105
SPAR: small RNA-seq portal for analysis of sequencing experiments.
Kuksa, Pavel P; Amlie-Wolf, Alexandre; Katanic, Živadin; Valladares, Otto; Wang, Li-San; Leung, Yuk Yee
2018-05-04
The introduction of new high-throughput small RNA sequencing protocols that generate large-scale genomics datasets along with increasing evidence of the significant regulatory roles of small non-coding RNAs (sncRNAs) have highlighted the urgent need for tools to analyze and interpret large amounts of small RNA sequencing data. However, it remains challenging to systematically and comprehensively discover and characterize sncRNA genes and specifically-processed sncRNA products from these datasets. To fill this gap, we present Small RNA-seq Portal for Analysis of sequencing expeRiments (SPAR), a user-friendly web server for interactive processing, analysis, annotation and visualization of small RNA sequencing data. SPAR supports sequencing data generated from various experimental protocols, including smRNA-seq, short total RNA sequencing, microRNA-seq, and single-cell small RNA-seq. Additionally, SPAR includes publicly available reference sncRNA datasets from our DASHR database and from ENCODE across 185 human tissues and cell types to produce highly informative small RNA annotations across all major small RNA types and other features such as co-localization with various genomic features, precursor transcript cleavage patterns, and conservation. SPAR allows the user to compare the input experiment against reference ENCODE/DASHR datasets. SPAR currently supports analyses of human (hg19, hg38) and mouse (mm10) sequencing data. SPAR is freely available at https://www.lisanwanglab.org/SPAR.
Pereira, J O P; Freitas, B M; Jorge, D M M; Torres, D C; Soares, C E A; Grangeiro, T B
2009-01-01
Melipona quinquefasciata is a ground-nesting South American stingless bee whose geographic distribution was believed to comprise only the central and southern states of Brazil. We obtained partial sequences (about 500-570 bp) of first internal transcribed spacer (ITS1) nuclear ribosomal DNA from Melipona specimens putatively identified as M. quinquefasciata collected from different localities in northeastern Brazil. To confirm the taxonomic identity of the northeastern samples, specimens from the state of Goiás (Central region of Brazil) were included for comparison. All sequences were deposited in GenBank (accession numbers EU073751-EU073759). The mean nucleotide divergence (excluding sites with insertions/deletions) in the ITS1 sequences was only 1.4%, ranging from 0 to 4.1%. When the sites with insertions/deletions were also taken into account, sequence divergences varied from 0 to 5.3%. In all pairwise comparisons, the ITS1 sequence from the specimens collected in Goiás was most divergent compared to the ITS1 sequences of the bees from the other locations. However, neighbor-joining phylogenetic analysis showed that all ITS1 sequences from northeastern specimens along with the sample of Goiás were resolved in a single clade with a bootstrap support of 100%. The ITS1 sequencing data thus support the occurrence of M. quinquefasciata in northeast Brazil.
Wasabi: An Integrated Platform for Evolutionary Sequence Analysis and Data Visualization.
Veidenberg, Andres; Medlar, Alan; Löytynoja, Ari
2016-04-01
Wasabi is an open source, web-based environment for evolutionary sequence analysis. Wasabi visualizes sequence data together with a phylogenetic tree within a modern, user-friendly interface: The interface hides extraneous options, supports context sensitive menus, drag-and-drop editing, and displays additional information, such as ancestral sequences, associated with specific tree nodes. The Wasabi environment supports reproducibility by automatically storing intermediate analysis steps and includes built-in functions to share data between users and publish analysis results. For computational analysis, Wasabi supports PRANK and PAGAN for phylogeny-aware alignment and alignment extension, and it can be easily extended with other tools. Along with drag-and-drop import of local files, Wasabi can access remote data through URL and import sequence data, GeneTrees and EPO alignments directly from Ensembl. To demonstrate a typical workflow using Wasabi, we reproduce key findings from recent comparative genomics studies, including a reanalysis of the EGLN1 gene from the tiger genome study: These case studies can be browsed within Wasabi at http://wasabiapp.org:8000?id=usecases. Wasabi runs inside a web browser and does not require any installation. One can start using it at http://wasabiapp.org. All source code is licensed under the AGPLv3. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Kamatuka, Kenta; Hattori, Masahiro; Sugiyama, Tomoyasu
2016-12-01
RNA interference (RNAi) screening is extensively used in the field of reverse genetics. RNAi libraries constructed using random oligonucleotides have made this technology affordable. However, the new methodology requires exploration of the RNAi target gene information after screening because the RNAi library includes non-natural sequences that are not found in genes. Here, we developed a web-based tool to support RNAi screening. The system performs short hairpin RNA (shRNA) target prediction that is informed by comprehensive enquiry (SPICE). SPICE automates several tasks that are laborious but indispensable to evaluate the shRNAs obtained by RNAi screening. SPICE has four main functions: (i) sequence identification of shRNA in the input sequence (the sequence might be obtained by sequencing clones in the RNAi library), (ii) searching the target genes in the database, (iii) demonstrating biological information obtained from the database, and (iv) preparation of search result files that can be utilized in a local personal computer (PC). Using this system, we demonstrated that genes targeted by random oligonucleotide-derived shRNAs were not different from those targeted by organism-specific shRNA. The system facilitates RNAi screening, which requires sequence analysis after screening. The SPICE web application is available at http://www.spice.sugysun.org/.
New steroid 5alpha-reductase type I (SRD5A1) homologous sequences on human chromosomes 6 and 8.
Eminović, I; Liović, M; Prezelj, J; Kocijancic, A; Rozman, D; Komel, R
2001-01-01
To date, two genes encoding 5alpha-reductase isoenzymes are known (type I, type II), and one type I pseudogene. The divergent localization of these genes and the still not fully understood function of the encoded enzymes as well as the perplexing results we obtained after sequencing PCR-amplified SRD5A1 gene fragments (out of genomic DNA), made us assume that, in addition to the known SRD5A1 gene, one or more different human 5alpha-reductase type I coding genes may exist. Our research provide the first evidence for the existence of two new SRD5A1 related, previously unidentified sequences in the human genome. These sequences which were localized to chromosomes 6 and 8 are highly homologous (> 99%) to SRD5A1, and also do not contain any deletions or insertions that are otherwise a characteristic of the SRD5API pseudogene. Our results imply that these sequences may be either coding parts of yet unknown, active SRD5A1 genes, and/or of previously unidentified pseudogenes. These findings additionally support data of Chen et al. who confirmed the existence of various SRD5A1 proteins in cultured human skin cells.
Aquatic environmental DNA detects seasonal fish abundance and habitat preference in an urban estuary
Soboleva, Lyubov; Charlop-Powers, Zachary
2017-01-01
The difficulty of censusing marine animal populations hampers effective ocean management. Analyzing water for DNA traces shed by organisms may aid assessment. Here we tested aquatic environmental DNA (eDNA) as an indicator of fish presence in the lower Hudson River estuary. A checklist of local marine fish and their relative abundance was prepared by compiling 12 traditional surveys conducted between 1988–2015. To improve eDNA identification success, 31 specimens representing 18 marine fish species were sequenced for two mitochondrial gene regions, boosting coverage of the 12S eDNA target sequence to 80% of local taxa. We collected 76 one-liter shoreline surface water samples at two contrasting estuary locations over six months beginning in January 2016. eDNA was amplified with vertebrate-specific 12S primers. Bioinformatic analysis of amplified DNA, using a reference library of GenBank and our newly generated 12S sequences, detected most (81%) locally abundant or common species and relatively few (23%) uncommon taxa, and corresponded to seasonal presence and habitat preference as determined by traditional surveys. Approximately 2% of fish reads were commonly consumed species that are rare or absent in local waters, consistent with wastewater input. Freshwater species were rarely detected despite Hudson River inflow. These results support further exploration and suggest eDNA will facilitate fine-scale geographic and temporal mapping of marine fish populations at relatively low cost. PMID:28403183
Description and physical localization of the bovine survival of motor neuron gene (SMN).
Pietrowski, D; Goldammer, T; Meinert, S; Schwerin, M; Förster, M
1998-01-01
Proximal spinal muscular atrophy (SMA) is an autosomal recessive disease in humans and other mammals, characterized by degeneration of anterior horn cells of the spinal cord. In humans, the survival of motor neuron gene (SMN) has been recognized as the SMA-determining gene and has been mapped to 5q13. In cattle, SMA is a recurrent, inherited disease that plays an important economic role in breeding programs of Brown Swiss stock. Now we have identified the full- length cDNA sequence of the bovine SMN gene. Molecular analysis and characterization of the sequence documents 85% identity to its human counterpart and three evolutionarily conserved domains in different species. Physical mapping data reveals that bovine SMN is localized to chromosome region 20q12-->q13, supporting the conserved synteny of this chromosomal region between humans and cattle.
A model of individualized canonical microcircuits supporting cognitive operations
Peterson, Andre D. H.; Haueisen, Jens; Knösche, Thomas R.
2017-01-01
Major cognitive functions such as language, memory, and decision-making are thought to rely on distributed networks of a large number of basic elements, called canonical microcircuits. In this theoretical study we propose a novel canonical microcircuit model and find that it supports two basic computational operations: a gating mechanism and working memory. By means of bifurcation analysis we systematically investigate the dynamical behavior of the canonical microcircuit with respect to parameters that govern the local network balance, that is, the relationship between excitation and inhibition, and key intrinsic feedback architectures of canonical microcircuits. We relate the local behavior of the canonical microcircuit to cognitive processing and demonstrate how a network of interacting canonical microcircuits enables the establishment of spatiotemporal sequences in the context of syntax parsing during sentence comprehension. This study provides a framework for using individualized canonical microcircuits for the construction of biologically realistic networks supporting cognitive operations. PMID:29200435
Full-genome sequence and analysis of a novel human rhinovirus strain within a divergent HRV-A clade.
Rathe, Jennifer A; Liu, Xinyue; Tallon, Luke J; Gern, James E; Liggett, Stephen B
2010-01-01
Genome sequences of human rhinoviruses (HRV) have primarily been from stocks collected in the 1960s, with genomes and phylogeny of modern HRVs remaining undefined. Here, two modern isolates (hrv-A101 and hrv-A101-v1) collected approximately 8 years apart were sequenced in their entirety. Incorporation into our full-genome HRV alignment with subsequent phylogenetic network inference indicated that these represent a unique HRV-A, localized within a distinct divergent clade. They appear to have resulted from recombination of the hrv-65 and hrv-78 lineages. These results support our contention that there are unrecognized distinct HRV-A strains, and that recombination is evident in currently circulating strains.
Hawlitschek, Oliver; Porch, Nick; Hendrich, Lars; Balke, Michael
2011-02-09
DNA sequencing techniques used to estimate biodiversity, such as DNA barcoding, may reveal cryptic species. However, disagreements between barcoding and morphological data have already led to controversy. Species delimitation should therefore not be based on mtDNA alone. Here, we explore the use of nDNA and bioclimatic modelling in a new species of aquatic beetle revealed by mtDNA sequence data. The aquatic beetle fauna of Australia is characterised by high degrees of endemism, including local radiations such as the genus Antiporus. Antiporus femoralis was previously considered to exist in two disjunct, but morphologically indistinguishable populations in south-western and south-eastern Australia. We constructed a phylogeny of Antiporus and detected a deep split between these populations. Diagnostic characters from the highly variable nuclear protein encoding arginine kinase gene confirmed the presence of two isolated populations. We then used ecological niche modelling to examine the climatic niche characteristics of the two populations. All results support the status of the two populations as distinct species. We describe the south-western species as Antiporus occidentalis sp.n. In addition to nDNA sequence data and extended use of mitochondrial sequences, ecological niche modelling has great potential for delineating morphologically cryptic species.
Hawlitschek, Oliver; Porch, Nick; Hendrich, Lars; Balke, Michael
2011-01-01
Background DNA sequencing techniques used to estimate biodiversity, such as DNA barcoding, may reveal cryptic species. However, disagreements between barcoding and morphological data have already led to controversy. Species delimitation should therefore not be based on mtDNA alone. Here, we explore the use of nDNA and bioclimatic modelling in a new species of aquatic beetle revealed by mtDNA sequence data. Methodology/Principal Findings The aquatic beetle fauna of Australia is characterised by high degrees of endemism, including local radiations such as the genus Antiporus. Antiporus femoralis was previously considered to exist in two disjunct, but morphologically indistinguishable populations in south-western and south-eastern Australia. We constructed a phylogeny of Antiporus and detected a deep split between these populations. Diagnostic characters from the highly variable nuclear protein encoding arginine kinase gene confirmed the presence of two isolated populations. We then used ecological niche modelling to examine the climatic niche characteristics of the two populations. All results support the status of the two populations as distinct species. We describe the south-western species as Antiporus occidentalis sp.n. Conclusion/Significance In addition to nDNA sequence data and extended use of mitochondrial sequences, ecological niche modelling has great potential for delineating morphologically cryptic species. PMID:21347370
Providing Epistemic Support For Assessments Through Mobile-Supported Sharing Activities1
Raclaw, Joshua; Robles, Jessica S.; DiDomenico, Stephen M.
2017-01-01
This paper examines how participants in face-to-face conversation employ mobile phones as a resource for social action. We focus on what we call mobile-supported sharing activities, in which participants use a mobile phone to share text or images with others by voicing text aloud from their mobile or providing others with visual access to the device’s display screen. Drawing from naturalistic video recordings, we focus on how mobile-supported sharing activities invite assessments by providing access to an object that is not locally accessible to the participants. Such practices make relevant co-participants’ assessment of these objects and allow for different forms of co-participation across sequence types. We additionally examine how the organization of assessments during these sharing activities displays sensitivity to preference structure. The analysis illustrates the relevance of embodiment, local objects, and new communicative technologies to the production of action in co-present interaction. Data are in American English. PMID:28936031
Compartmentalization of the yeast meiotic nucleus revealed by analysis of ectopic recombination.
Schlecht, Hélène B; Lichten, Michael; Goldman, Alastair S H
2004-11-01
As yeast cells enter meiosis, chromosomes move from a centromere-clustered (Rabl) to a telomere-clustered (bouquet) configuration and then to states of progressive homolog pairing where telomeres are more dispersed. It is uncertain at which stage of this process sequences commit to recombine with each other. Previous analyses using recombination between dispersed homologous sequences (ectopic recombination) support the view that, on average, homologs are aligned end to end by the time of commitment to recombination. We have undertaken further analyses incorporating new inserts, chromosome rearrangements, an alternate mode of recombination initiation, and mutants that disrupt nuclear structure or telomere metabolism. Our findings support previous conclusions and reveal that distance from the nearest telomere is an important parameter influencing recombination between dispersed sequences. In general, the farther dispersed sequences are from their nearest telomere, the less likely they are to engage in ectopic recombination. Neither the mode of initiating recombination nor the formation of the bouquet appears to affect this relationship. We suggest that aspects of telomere localization and behavior influence the organization and mobility of chromosomes along their entire length, during a critical period of meiosis I prophase that encompasses the homology search.
This synthetic, multi-scale approach will generate a sequence of spatially explicit maps that will provide science guidance to support strategic decision-making regarding the spatially-distributed risk of, and possible adaptation to, the spread of invasive species at local to ...
Shotgun metagenomic data streams: surfing without fear
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berendzen, Joel R
2010-12-06
Timely information about bio-threat prevalence, consequence, propagation, attribution, and mitigation is needed to support decision-making, both routinely and in a crisis. One DNA sequencer can stream 25 Gbp of information per day, but sampling strategies and analysis techniques are needed to turn raw sequencing power into actionable knowledge. Shotgun metagenomics can enable biosurveillance at the level of a single city, hospital, or airplane. Metagenomics characterizes viruses and bacteria from complex environments such as soil, air filters, or sewage. Unlike targeted-primer-based sequencing, shotgun methods are not blind to sequences that are truly novel, and they can measure absolute prevalence. Shotgun metagenomicmore » sampling can be non-invasive, efficient, and inexpensive while being informative. We have developed analysis techniques for shotgun metagenomic sequencing that rely upon phylogenetic signature patterns. They work by indexing local sequence patterns in a manner similar to web search engines. Our methods are laptop-fast and favorable scaling properties ensure they will be sustainable as sequencing methods grow. We show examples of application to soil metagenomic samples.« less
Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.
2015-01-01
Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
2015-01-01
DNA oxidation by reactive oxygen species is nonrandom, potentially leading to accumulation of nucleobase damage and mutations at specific sites within the genome. We now present the first quantitative data for sequence-dependent formation of structurally defined oxidative nucleobase adducts along p53 gene-derived DNA duplexes using a novel isotope labeling-based approach. Our results reveal that local nucleobase sequence context differentially alters the yields of 2,2,4-triamino-2H-oxal-5-one (Z) and 8-oxo-7,8-dihydro-2′-deoxyguanosine (OG) in double stranded DNA. While both lesions are overproduced within endogenously methylated MeCG dinucleotides and at 5′ Gs in runs of several guanines, the formation of Z (but not OG) is strongly preferred at solvent-exposed guanine nucleobases at duplex ends. Targeted oxidation of MeCG sequences may be caused by a lowered ionization potential of guanine bases paired with MeC and the preferential intercalation of riboflavin photosensitizer adjacent to MeC:G base pairs. Importantly, some of the most frequently oxidized positions coincide with the known p53 lung cancer mutational “hotspots” at codons 245 (GGC), 248 (CGG), and 158 (CGC) respectively, supporting a possible role of oxidative degradation of DNA in the initiation of lung cancer. PMID:24571128
NASA Astrophysics Data System (ADS)
Verma, Gaurav; Chawla, Sanjeev; Nagarajan, Rajakumar; Iqbal, Zohaib; Albert Thomas, M.; Poptani, Harish
2017-04-01
Two-dimensional localized correlated spectroscopy (2D L-COSY) offers greater spectral dispersion than conventional one-dimensional (1D) MRS techniques, yet long acquisition times and limited post-processing support have slowed its clinical adoption. Improving acquisition efficiency and developing versatile post-processing techniques can bolster the clinical viability of 2D MRS. The purpose of this study was to implement a non-uniformly weighted sampling (NUWS) scheme for faster acquisition of 2D-MRS. A NUWS 2D L-COSY sequence was developed for 7T whole-body MRI. A phantom containing metabolites commonly observed in the brain at physiological concentrations was scanned ten times with both the NUWS scheme of 12:48 duration and a 17:04 constant eight-average sequence using a 32-channel head coil. 2D L-COSY spectra were also acquired from the occipital lobe of four healthy volunteers using both the proposed NUWS and the conventional uniformly-averaged L-COSY sequence. The NUWS 2D L-COSY sequence facilitated 25% shorter acquisition time while maintaining comparable SNR in humans (+0.3%) and phantom studies (+6.0%) compared to uniform averaging. NUWS schemes successfully demonstrated improved efficiency of L-COSY, by facilitating a reduction in scan time without affecting signal quality.
Molecular epidemiology of Plum pox virus in Japan.
Maejima, Kensaku; Himeno, Misako; Komatsu, Ken; Takinami, Yusuke; Hashimoto, Masayoshi; Takahashi, Shuichiro; Yamaji, Yasuyuki; Oshima, Kenro; Namba, Shigetou
2011-05-01
For a molecular epidemiological study based on complete genome sequences, 37 Plum pox virus (PPV) isolates were collected from the Kanto region in Japan. Pair-wise analyses revealed that all 37 Japanese isolates belong to the PPV-D strain, with low genetic diversity (less than 0.8%). In phylogenetic analysis of the PPV-D strain based on complete nucleotide sequences, the relationships of the PPV-D strain were reconstructed with high resolution: at the global level, the American, Canadian, and Japanese isolates formed their own distinct monophyletic clusters, suggesting that the routes of viral entry into these countries were independent; at the local level, the actual transmission histories of PPV were precisely reconstructed with high bootstrap support. This is the first description of the molecular epidemiology of PPV based on complete genome sequences.
A Concept and Implementation of Optimized Operations of Airport Surface Traffic
NASA Technical Reports Server (NTRS)
Jung, Yoon C.; Hoang, Ty; Montoya, Justin; Gupta, Gautam; Malik, Waqar; Tobias, Leonard
2010-01-01
This paper presents a new concept of optimized surface operations at busy airports to improve the efficiency of taxi operations, as well as reduce environmental impacts. The suggested system architecture consists of the integration of two decoupled optimization algorithms. The Spot Release Planner provides sequence and timing advisories to tower controllers for releasing departure aircraft into the movement area to reduce taxi delay while achieving maximum throughput. The Runway Scheduler provides take-off sequence and arrival runway crossing sequence to the controllers to maximize the runway usage. The description of a prototype implementation of this integrated decision support tool for the airport control tower controllers is also provided. The prototype decision support tool was evaluated through a human-in-the-loop experiment, where both the Spot Release Planner and Runway Scheduler provided advisories to the Ground and Local Controllers. Initial results indicate the average number of stops made by each departure aircraft in the departure runway queue was reduced by more than half when the controllers were using the advisories, which resulted in reduced taxi times in the departure queue.
Terminator Detection by Support Vector Machine Utilizing aStochastic Context-Free Grammar
DOE Office of Scientific and Technical Information (OSTI.GOV)
Francis-Lyon, Patricia; Cristianini, Nello; Holbrook, Stephen
2006-12-30
A 2-stage detector was designed to find rho-independent transcription terminators in the Escherichia coli genome. The detector includes a Stochastic Context Free Grammar (SCFG) component and a Support Vector Machine (SVM) component. To find terminators, the SCFG searches the intergenic regions of nucleotide sequence for local matches to a terminator grammar that was designed and trained utilizing examples of known terminators. The grammar selects sequences that are the best candidates for terminators and assigns them a prefix, stem-loop, suffix structure using the Cocke-Younger-Kasaami (CYK) algorithm, modified to incorporate energy affects of base pairing. The parameters from this inferred structure aremore » passed to the SVM classifier, which distinguishes terminators from non-terminators that score high according to the terminator grammar. The SVM was trained with negative examples drawn from intergenic sequences that include both featureless and RNA gene regions (which were assigned prefix, stem-loop, suffix structure by the SCFG), so that it successfully distinguishes terminators from either of these. The classifier was found to be 96.4% successful during testing.« less
Calculating electronic correlation effects from densities of transitions
NASA Astrophysics Data System (ADS)
Haydock, Roger
Adding a localized electron to a system of interacting electrons induces a density of transitions described by the time-independent Heisenberg equation. Sequences of these transitions generate interacting states whose total energy is the sum of energies of the constituent transitions. A calculation of magnetic moments for itinerant electrons with Ising interactions illustrates this method. supported by the H. V. Snyder Gift to the University of Oregon.
Fujimoto, Kayo; Coghill, Lyndon M; Weier, Christopher A; Hwang, Lu-Yu; Kim, Ju Yeong; Schneider, John A; Metzker, Michael L; Brown, Jeremy M
2017-09-01
We explore the phylogenetic relationships among HIV sequences sampled from young adult black men who have sex with men (YAB-MSM), who are connected through peer referral/social ties and who attend common venues. Using 196 viral sequences sampled from the peripheral blood mononuclear cells of 10 individuals, our preliminary phylogenetic results indicate that these socially connected YAB-MSM are infected with distantly related viruses and provide no evidence for viral transmission between network members. Our results suggest that HIV-prevention strategies that target young adult MSM should extend beyond their network members and local community.
The origin of Chinese domestic horses revealed with novel mtDNA variants.
Yang, Yunzhou; Zhu, Qiyun; Liu, Shuqin; Zhao, Chunjiang; Wu, Changxin
2017-01-01
The origin of domestic horses in China was a controversial issue and several hypotheses including autochthonous domestication, introduction from other areas, and multiple-origins from both introduction and local wild horse introgression have been proposed, but none of them have been fully supported by DNA data. In the present study, mitochondrial DNA (mtDNA) sequences of 714 Chinese indigenous horses were analyzed. The results showed that Chinese domestic horses harbor some novel mtDNA haplogroups and suggested that local domestication events may have occurred, but they are not the dominant haplogroups and the geographical distributions of the novel mtDNA haplogroups were rather restricted. Conclusively, our results support the hypothesis that the domestic horses in China originated from both the introduced horses from outside of China and the local wild horses' introgression into the domestic populations. Results of genetic diversity analysis suggested a possibility that the introduced horses entered China through northern regions from the Eurasian steppe. © 2016 Japanese Society of Animal Science.
Mumps virus F gene and HN gene sequencing as a molecular tool to study mumps virus transmission.
Gouma, Sigrid; Cremer, Jeroen; Parkkali, Saara; Veldhuijzen, Irene; van Binnendijk, Rob S; Koopmans, Marion P G
2016-11-01
Various mumps outbreaks have occurred in the Netherlands since 2004, particularly among persons who had received 2 doses of measles, mumps, and rubella (MMR) vaccination. Genomic typing of pathogens can be used to track outbreaks, but the established genotyping of mumps virus based on the small hydrophobic (SH) gene sequences did not provide sufficient resolution. Therefore, we expanded the sequencing to include fusion (F) gene and haemagglutinin-neuraminidase (HN) gene sequences in addition to the SH gene sequences from 109 mumps virus genotype G strains obtained between 2004 and mid 2015 in the Netherlands. When the molecular information from these 3 genes was combined, we were able to identify separate mumps virus clusters and track mumps virus transmission. The analyses suggested that multiple mumps virus introductions occurred in the Netherlands between 2004 and 2015 resulting in several mumps outbreaks throughout this period, whereas during some local outbreaks the molecular data pointed towards endemic circulation. Combined analysis of epidemiological data and sequence data collected in 2015 showed good support for the phylogenetic clustering. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio
The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...
2016-03-09
The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
Local alignment of two-base encoded DNA sequence
Homer, Nils; Merriman, Barry; Nelson, Stanley F
2009-01-01
Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732
Compressed multi-block local binary pattern for object tracking
NASA Astrophysics Data System (ADS)
Li, Tianwen; Gao, Yun; Zhao, Lei; Zhou, Hao
2018-04-01
Both robustness and real-time are very important for the application of object tracking under a real environment. The focused trackers based on deep learning are difficult to satisfy with the real-time of tracking. Compressive sensing provided a technical support for real-time tracking. In this paper, an object can be tracked via a multi-block local binary pattern feature. The feature vector was extracted based on the multi-block local binary pattern feature, which was compressed via a sparse random Gaussian matrix as the measurement matrix. The experiments showed that the proposed tracker ran in real-time and outperformed the existed compressive trackers based on Haar-like feature on many challenging video sequences in terms of accuracy and robustness.
Evidence of repeated wildfires prior to human occupation on San Nicolas Island, California
Pigati, Jeffrey S.; McGeehin, John P.; Skipp, Gary L.; Muhs, Daniel R.
2014-01-01
Understanding how early humans on the California Channel Islands might have changed local fire regimes requires a baseline knowledge of the frequency of natural wildfires on the islands prior to human occupation. A sedimentary sequence that was recently discovered in a small canyon on San Nicolas Island contains evidence of at least 24 burn events that date to between ~37 and 25 ka (thousands of calibrated 14C years before present), well before humans entered North America. The evidence includes abundant macroscopic charcoal, blackened sediments, and discrete packages of oxidized, reddish-brown sediments that are similar in appearance to sedimentary features called “fire areas” on Santa Rosa Island and elsewhere. Massive fine-grained sediments that contain the burn evidence are interpreted as sheetwash deposits and are interbedded with coarse-grained, clast-supported alluvial sediments and matrix supported sands, pebbles, and cobbles that represent localized debris flows. These sedimentary sequences suggest that the catchment area above our study site underwent multiple cycles of relative quiescence that were interrupted by fire and followed by slope instability and mass wasting events. Our 14C-based chronology dates these cycles to well before the arrival of humans on the Channel Islands and shows that natural wildfires occurred here, at a minimum, every 300–500 years prior to human occupation.
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions
Chica, Claudia; Diella, Francesca; Gibson, Toby J.
2009-01-01
Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
CloVR-Comparative: automated, cloud-enabled comparative microbial genome sequence analysis pipeline.
Agrawal, Sonia; Arze, Cesar; Adkins, Ricky S; Crabtree, Jonathan; Riley, David; Vangala, Mahesh; Galens, Kevin; Fraser, Claire M; Tettelin, Hervé; White, Owen; Angiuoli, Samuel V; Mahurkar, Anup; Fricke, W Florian
2017-04-27
The benefit of increasing genomic sequence data to the scientific community depends on easy-to-use, scalable bioinformatics support. CloVR-Comparative combines commonly used bioinformatics tools into an intuitive, automated, and cloud-enabled analysis pipeline for comparative microbial genomics. CloVR-Comparative runs on annotated complete or draft genome sequences that are uploaded by the user or selected via a taxonomic tree-based user interface and downloaded from NCBI. CloVR-Comparative runs reference-free multiple whole-genome alignments to determine unique, shared and core coding sequences (CDSs) and single nucleotide polymorphisms (SNPs). Output includes short summary reports and detailed text-based results files, graphical visualizations (phylogenetic trees, circular figures), and a database file linked to the Sybil comparative genome browser. Data up- and download, pipeline configuration and monitoring, and access to Sybil are managed through CloVR-Comparative web interface. CloVR-Comparative and Sybil are distributed as part of the CloVR virtual appliance, which runs on local computers or the Amazon EC2 cloud. Representative datasets (e.g. 40 draft and complete Escherichia coli genomes) are processed in <36 h on a local desktop or at a cost of <$20 on EC2. CloVR-Comparative allows anybody with Internet access to run comparative genomics projects, while eliminating the need for on-site computational resources and expertise.
Thomas, Sean; Martinez, L L Isadora Trejo; Westenberger, Scott J; Sturm, Nancy R
2007-05-24
The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for Trypanosoma cruzi, the causative agent of Chagas disease, however the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised. Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from T. cruzi whole genome shotgun sequencing data. With these sequences and all published T. cruzi minicircle sequences, 108 unique guide RNAs from all known T. cruzi minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in T. cruzi minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found. The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of T. cruzi guide RNAs. Based on these preferences, a method of predicting T. cruzi guide RNAs using only primary minicircle sequence data was created.
Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration
Thorvaldsdóttir, Helga; Mesirov, Jill P.
2013-01-01
Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today’s sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license. PMID:22517427
Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.
Thorvaldsdóttir, Helga; Robinson, James T; Mesirov, Jill P
2013-03-01
Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today's sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license.
Pan, Xiaoyong; Shen, Hong-Bin
2018-05-02
RNA-binding proteins (RBPs) take over 5∼10% of the eukaryotic proteome and play key roles in many biological processes, e.g. gene regulation. Experimental detection of RBP binding sites is still time-intensive and high-costly. Instead, computational prediction of the RBP binding sites using pattern learned from existing annotation knowledge is a fast approach. From the biological point of view, the local structure context derived from local sequences will be recognized by specific RBPs. However, in computational modeling using deep learning, to our best knowledge, only global representations of entire RNA sequences are employed. So far, the local sequence information is ignored in the deep model construction process. In this study, we present a computational method iDeepE to predict RNA-protein binding sites from RNA sequences by combining global and local convolutional neural networks (CNNs). For the global CNN, we pad the RNA sequences into the same length. For the local CNN, we split a RNA sequence into multiple overlapping fixed-length subsequences, where each subsequence is a signal channel of the whole sequence. Next, we train deep CNNs for multiple subsequences and the padded sequences to learn high-level features, respectively. Finally, the outputs from local and global CNNs are combined to improve the prediction. iDeepE demonstrates a better performance over state-of-the-art methods on two large-scale datasets derived from CLIP-seq. We also find that the local CNN run 1.8 times faster than the global CNN with comparable performance when using GPUs. Our results show that iDeepE has captured experimentally verified binding motifs. https://github.com/xypan1232/iDeepE. xypan172436@gmail.com or hbshen@sjtu.edu.cn. Supplementary data are available at Bioinformatics online.
Olson, J C
1993-01-01
Diphtheria toxin (DT) and Pseudomonas aeruginosa exotoxin A have the same molecular mechanism of toxicity; both toxins ADP-ribosylate a modified histidine residue in elongation factor 2. To help identify amino acids involved in this reaction, sequences in DT that share homology with P. aeruginosa exotoxin A were synthesized and examined for a role in the ADP-ribosyltransferase reaction. By using this approach, residues 32 to 54 of DT were found to define an epitope associated with antibody-mediated inhibition of DT enzyme activity. This lends further support to the notion that residues in this region of DT are involved in the enzymatic reaction. PMID:8423159
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.
Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael
2018-05-25
Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
SeqDepot: streamlined database of biological sequences and precomputed features.
Ulrich, Luke E; Zhulin, Igor B
2014-01-15
Assembling and/or producing integrated knowledge of sequence features continues to be an onerous and redundant task despite a large number of existing resources. We have developed SeqDepot-a novel database that focuses solely on two primary goals: (i) assimilating known primary sequences with predicted feature data and (ii) providing the most simple and straightforward means to procure and readily use this information. Access to >28.5 million sequences and 300 million features is provided through a well-documented and flexible RESTful interface that supports fetching specific data subsets, bulk queries, visualization and searching by MD5 digests or external database identifiers. We have also developed an HTML5/JavaScript web application exemplifying how to interact with SeqDepot and Perl/Python scripts for use with local processing pipelines. Freely available on the web at http://seqdepot.net/. RESTaccess via http://seqdepot.net/api/v1. Database files and scripts maybe downloaded from http://seqdepot.net/download.
Wheeler, David
2007-01-01
GenBank(R) is a comprehensive database of publicly available DNA sequences for more than 205,000 named organisms and for more than 60,000 within the embryophyta, obtained through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Daily data exchange with the European Molecular Biology Laboratory (EMBL) in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the National Center for Biotechnology Information (NCBI) retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases with taxonomy, genome, mapping, protein structure, and domain information and the biomedical journal literature through PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available through FTP. GenBank usage scenarios ranging from local analyses of the data available through FTP to online analyses supported by the NCBI Web-based tools are discussed. To access GenBank and its related retrieval and analysis services, go to the NCBI Homepage at http://www.ncbi.nlm.nih.gov.
David, Sophia; Afshar, Baharak; Mentasti, Massimo; Ginevra, Christophe; Podglajen, Isabelle; Harris, Simon R; Chalker, Victoria J; Jarraud, Sophie; Harrison, Timothy G; Parkhill, Julian
2017-05-01
Legionnaires' disease is an important cause of hospital-acquired pneumonia and is caused by infection with the bacterium Legionella. Because current typing methods often fail to resolve the infection source in possible nosocomial cases, we aimed to determine whether whole-genome sequencing (WGS) could be used to support or refute suspected links between cases and hospitals. We focused on cases involving a major nosocomial-associated strain, L. pneumophila sequence type (ST) 1. WGS data from 229 L. pneumophila ST1 isolates were analyzed, including 99 isolates from the water systems of 17 hospitals and 42 clinical isolates from patients with confirmed or suspected hospital-acquired infections, as well as isolates obtained from or associated with community-acquired sources of Legionnaires' disease. Phylogenetic analysis demonstrated that all hospitals from which multiple isolates were obtained have been colonized by 1 or more distinct ST1 populations. However, deep sampling of 1 hospital also revealed the existence of substantial diversity and ward-specific microevolution within the population. Across all hospitals, suspected links with cases were supported with WGS, although the degree of support was dependent on the depth of environmental sampling and available contextual information. Finally, phylogeographic analysis revealed that hospitals have been seeded with L. pneumophila via both local and international spread of ST1. WGS can be used to support or refute suspected links between hospitals and Legionnaires' disease cases. However, deep hospital sampling is frequently required due to the potential coexistence of multiple populations, existence of substantial diversity, and similarity of hospital isolates to local populations. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America.
Sirius PSB: a generic system for analysis of biological sequences.
Koh, Chuan Hock; Lin, Sharene; Jedd, Gregory; Wong, Limsoon
2009-12-01
Computational tools are essential components of modern biological research. For example, BLAST searches can be used to identify related proteins based on sequence homology, or when a new genome is sequenced, prediction models can be used to annotate functional sites such as transcription start sites, translation initiation sites and polyadenylation sites and to predict protein localization. Here we present Sirius Prediction Systems Builder (PSB), a new computational tool for sequence analysis, classification and searching. Sirius PSB has four main operations: (1) Building a classifier, (2) Deploying a classifier, (3) Search for proteins similar to query proteins, (4) Preliminary and post-prediction analysis. Sirius PSB supports all these operations via a simple and interactive graphical user interface. Besides being a convenient tool, Sirius PSB has also introduced two novelties in sequence analysis. Firstly, genetic algorithm is used to identify interesting features in the feature space. Secondly, instead of the conventional method of searching for similar proteins via sequence similarity, we introduced searching via features' similarity. To demonstrate the capabilities of Sirius PSB, we have built two prediction models - one for the recognition of Arabidopsis polyadenylation sites and another for the subcellular localization of proteins. Both systems are competitive against current state-of-the-art models based on evaluation of public datasets. More notably, the time and effort required to build each model is greatly reduced with the assistance of Sirius PSB. Furthermore, we show that under certain conditions when BLAST is unable to find related proteins, Sirius PSB can identify functionally related proteins based on their biophysical similarities. Sirius PSB and its related supplements are available at: http://compbio.ddns.comp.nus.edu.sg/~sirius.
The SAMI Galaxy Survey: spatially resolving the main sequence of star formation
NASA Astrophysics Data System (ADS)
Medling, Anne M.; Cortese, Luca; Croom, Scott M.; Green, Andrew W.; Groves, Brent; Hampton, Elise; Ho, I.-Ting; Davies, Luke J. M.; Kewley, Lisa J.; Moffett, Amanda J.; Schaefer, Adam L.; Taylor, Edward; Zafar, Tayyaba; Bekki, Kenji; Bland-Hawthorn, Joss; Bloom, Jessica V.; Brough, Sarah; Bryant, Julia J.; Catinella, Barbara; Cecil, Gerald; Colless, Matthew; Couch, Warrick J.; Drinkwater, Michael J.; Driver, Simon P.; Federrath, Christoph; Foster, Caroline; Goldstein, Gregory; Goodwin, Michael; Hopkins, Andrew; Lawrence, J. S.; Leslie, Sarah K.; Lewis, Geraint F.; Lorente, Nuria P. F.; Owers, Matt S.; McDermid, Richard; Richards, Samuel N.; Sharp, Robert; Scott, Nicholas; Sweet, Sarah M.; Taranu, Dan S.; Tescari, Edoardo; Tonini, Chiara; van de Sande, Jesse; Walcher, C. Jakob; Wright, Angus
2018-04-01
We present the ˜800 star formation rate maps for the Sydney-AAO Multi-object Integral field spectrograph (SAMI) Galaxy Survey based on H α emission maps, corrected for dust attenuation via the Balmer decrement, that are included in the SAMI Public Data Release 1. We mask out spaxels contaminated by non-stellar emission using the [O III]/H β, [N II]/H α, [S II]/H α, and [O I]/H α line ratios. Using these maps, we examine the global and resolved star-forming main sequences of SAMI galaxies as a function of morphology, environmental density, and stellar mass. Galaxies further below the star-forming main sequence are more likely to have flatter star formation profiles. Early-type galaxies split into two populations with similar stellar masses and central stellar mass surface densities. The main-sequence population has centrally concentrated star formation similar to late-type galaxies, while galaxies >3σ below the main sequence show significantly reduced star formation most strikingly in the nuclear regions. The split populations support a two-step quenching mechanism, wherein halo mass first cuts off the gas supply and remaining gas continues to form stars until the local stellar mass surface density can stabilize the reduced remaining fuel against further star formation. Across all morphologies, galaxies in denser environments show a decreased specific star formation rate from the outside in, supporting an environmental cause for quenching, such as ram-pressure stripping or galaxy interactions.
Suzuki, Masahiro; Segawa, Takahiro; Mori, Hiroshi; Akiyoshi, Ayumi; Ootsuki, Ryo; Kurihara, Akira; Sakayama, Hidetoshi; Kitayama, Taiju; Abe, Tsuyoshi; Kogame, Kazuhiro; Kawai, Hiroshi; Nozaki, Hisayoshi
2016-01-01
Liagora japonica is a red algal species distributed in temperate regions of Japan. This species has not been collected from its type locality on the Pacific coast of Japan since 1927 and seems to have become extinct in this area. For molecular characterization of L. japonica, we extracted DNA from the topotype material of L. japonica collected in 1927, analyzed seven genes using Illumina next-generation sequencing, and compared these data with sequences from modern samples of similar red algae collected from the Japan Sea coast of Japan. Both morphological and molecular data from modern samples and historical specimens (including the lectotype and topotype) suggest that the specimens from the Pacific and Japan Sea coasts of Japan should be treated as a single species, and that L. japonica is phylogenetically separated from the genus Liagora. Based on the phylogenetic results and examination of reproductive structures, we propose Otohimella japonica gen. et comb. nov., characterized morphologically by diffuse carposporophytes, undivided carposporangia, and involucral filaments initiated only from the cortical cell on the supporting cell. PMID:27388436
The identification and functional annotation of RNA structures conserved in vertebrates
Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan
2017-01-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
Gioio, Anthony E.
2017-01-01
Abstract Tyrosine hydroxylase (TH) is the enzyme that catalyzes the rate-limiting step in the biosynthesis of the catecholamine neurotransmitters. In a previous communication, evidence was provided that TH mRNA is trafficked to the axon, where it is locally translated. In addition, a 50-bp sequence element in the 3′untranslated region (3’UTR) of TH mRNA was identified that directs TH mRNA to distal axons (i.e., zip-code). In the present study, the hypothesis was tested that local translation of TH plays an important role in the biosynthesis of the catecholamine neurotransmitters in the axon and/or presynaptic nerve terminal. Toward this end, a targeted deletion of the axonal transport sequence element was developed, using the lentiviral delivery of the CRISPR/Cas9 system, and two guide RNA (gRNA) sequences flanking the 50-bp cis-acting regulatory element in rat superior cervical ganglion (SCG) neurons. Deletion of the axonal transport element reduced TH mRNA levels in the distal axons and reduced the axonal protein levels of TH and TH activity as measured by phosphorylation of SER40 in SCG neurons. Moreover, deletion of the zip-code diminished the axonal levels of dopamine (DA) and norepinephrine (NE). Conversely, the local translation of exogenous TH mRNA in the distal axon enhanced TH levels and activity, and elevated axonal NE levels. Taken together, these results provide direct evidence to support the hypothesis that TH mRNA trafficking and local synthesis of TH play an important role in the synthesis of catecholamines in the axon and presynaptic terminal. PMID:28630892
Aschrafi, Armaz; Gioio, Anthony E; Dong, Lijin; Kaplan, Barry B
2017-01-01
Tyrosine hydroxylase (TH) is the enzyme that catalyzes the rate-limiting step in the biosynthesis of the catecholamine neurotransmitters. In a previous communication, evidence was provided that TH mRNA is trafficked to the axon, where it is locally translated. In addition, a 50-bp sequence element in the 3'untranslated region (3'UTR) of TH mRNA was identified that directs TH mRNA to distal axons (i.e., zip-code). In the present study, the hypothesis was tested that local translation of TH plays an important role in the biosynthesis of the catecholamine neurotransmitters in the axon and/or presynaptic nerve terminal. Toward this end, a targeted deletion of the axonal transport sequence element was developed, using the lentiviral delivery of the CRISPR/Cas9 system, and two guide RNA (gRNA) sequences flanking the 50-bp cis- acting regulatory element in rat superior cervical ganglion (SCG) neurons. Deletion of the axonal transport element reduced TH mRNA levels in the distal axons and reduced the axonal protein levels of TH and TH activity as measured by phosphorylation of SER40 in SCG neurons. Moreover, deletion of the zip-code diminished the axonal levels of dopamine (DA) and norepinephrine (NE). Conversely, the local translation of exogenous TH mRNA in the distal axon enhanced TH levels and activity, and elevated axonal NE levels. Taken together, these results provide direct evidence to support the hypothesis that TH mRNA trafficking and local synthesis of TH play an important role in the synthesis of catecholamines in the axon and presynaptic terminal.
MotionFlow: Visual Abstraction and Aggregation of Sequential Patterns in Human Motion Tracking Data.
Jang, Sujin; Elmqvist, Niklas; Ramani, Karthik
2016-01-01
Pattern analysis of human motions, which is useful in many research areas, requires understanding and comparison of different styles of motion patterns. However, working with human motion tracking data to support such analysis poses great challenges. In this paper, we propose MotionFlow, a visual analytics system that provides an effective overview of various motion patterns based on an interactive flow visualization. This visualization formulates a motion sequence as transitions between static poses, and aggregates these sequences into a tree diagram to construct a set of motion patterns. The system also allows the users to directly reflect the context of data and their perception of pose similarities in generating representative pose states. We provide local and global controls over the partition-based clustering process. To support the users in organizing unstructured motion data into pattern groups, we designed a set of interactions that enables searching for similar motion sequences from the data, detailed exploration of data subsets, and creating and modifying the group of motion patterns. To evaluate the usability of MotionFlow, we conducted a user study with six researchers with expertise in gesture-based interaction design. They used MotionFlow to explore and organize unstructured motion tracking data. Results show that the researchers were able to easily learn how to use MotionFlow, and the system effectively supported their pattern analysis activities, including leveraging their perception and domain knowledge.
A phylogenetic study of Laeliinae (Orchidaceae) based on combined nuclear and plastid DNA sequences
van den Berg, Cássio; Higgins, Wesley E.; Dressler, Robert L.; Whitten, W. Mark; Soto-Arenas, Miguel A.; Chase, Mark W.
2009-01-01
Background and Aims Laeliinae are a neotropical orchid subtribe with approx. 1500 species in 50 genera. In this study, an attempt is made to assess generic alliances based on molecular phylogenetic analysis of DNA sequence data. Methods Six DNA datasets were gathered: plastid trnL intron, trnL-F spacer, matK gene and trnK introns upstream and dowstream from matK and nuclear ITS rDNA. Data were analysed with maximum parsimony (MP) and Bayesian analysis with mixed models (BA). Key Results Although relationships between Laeliinae and outgroups are well supported, within the subtribe sequence variation is low considering the broad taxonomic range covered. Localized incongruence between the ITS and plastid trees was found. A combined tree followed the ITS trees more closely, but the levels of support obtained with MP were low. The Bayesian analysis recovered more well-supported nodes. The trees from combined MP and BA allowed eight generic alliances to be recognized within Laeliinae, all of which show trends in morphological characters but lack unambiguous synapomorphies. Conclusions By using combined plastid and nuclear DNA data in conjunction with mixed-models Bayesian inference, it is possible to delimit smaller groups within Laeliinae and discuss general patterns of pollination and hybridization compatibility. Furthermore, these small groups can now be used for further detailed studies to explain morphological evolution and diversification patterns within the subtribe. PMID:19423551
Hänfling, B; Dümpelmann, C; Bogutskaya, N G; Brandl, R; Brändle, M
2009-12-01
Genetic variation and geographical structuring of vimba Vimba vimba were analysed across 26 sites (80 individuals) by means of mtDNA sequences (cyt b gene, mitochondrial control region) to localize hypothesized glacial refugia and to reconstruct postglacial recoloniation routes. Although genetic diversity among sequenced individuals was low, a combined analysis of the two sequenced fragments revealed a western (central and northern Europe: Danube, Elbe and lakes of Sweden) and an eastern clade (eastern Europe: Dnieper-South Bug, Don, Neman). Furthermore, a number of divergent ancestral haplotypes distributed around the Black and Caspian Seas became apparent. Mismatch analyses supported a sudden expansion model for the populations of the western clade between 50 and 10 000 bp. Overall, the study provides strong evidence for a northward and westward expansion of V. vimba from two refugial regions located in the Danubian drainage and the northern Pontic regions respectively.
Phylogeography and origin of Chinese domestic chicken.
Wu, Y P; Huo, J H; Xie, J F; Liu, L X; Wei, Q P; Xie, M G; Kang, Z F; Ji, H Y; Ma, Y H
2014-04-01
The loss of local chicken breeds as result of replacement with cosmopolitan breeds indicates the need for conservation measures to protect the future of local genetic stocks. The aim of this study is to describe the patterns of polymorphism of the hypervariable control region of mitochondrial DNA (HVR1) in domestic chicken in China's Jiangxi province to investigate genetic diversity, genetic structure and phylo-dynamics. To this end, we sequenced the mtDNA HVR1 in 231 chickens including 22 individuals which belonged to previously published sequences. A neighbor-joining tree revealed that these samples clustered into five lineages (Lineages A, B, C, E and G). The highest haplotype diversity and nucleotide diversity were both found in Anyi tile-liked gray breed. We estimated that the most recent common ancestor of the local chicken existed approximately 16 million years ago. The mismatch distribution analysis showed two major peaks at positions 4 and 9, while the neutrality test (Tajima's D = -2.19, p < 0.05) and Fu's F-statistics (-8.59, p < 0.05) revealed a significant departure from the neutrality assumption. These results support the idea that domestication of chickens facilitated population increases. Results of a global AMOVA indicated that there was no obvious geographic structure among the local chicken breeds analyzed in this study. The data obtained in this study will assist future conservation management of local breeds and also reveals intriguing implications for the history of human population movements and commerce.
Zhou, Mu; Zhang, Qiao; Xu, Kunjie; Tian, Zengshan; Wang, Yanmeng; He, Wei
2015-01-01
Due to the wide deployment of wireless local area networks (WLAN), received signal strength (RSS)-based indoor WLAN localization has attracted considerable attention in both academia and industry. In this paper, we propose a novel page rank-based indoor mapping and localization (PRIMAL) by using the gene-sequenced unlabeled WLAN RSS for simultaneous localization and mapping (SLAM). Specifically, first of all, based on the observation of the motion patterns of the people in the target environment, we use the Allen logic to construct the mobility graph to characterize the connectivity among different areas of interest. Second, the concept of gene sequencing is utilized to assemble the sporadically-collected RSS sequences into a signal graph based on the transition relations among different RSS sequences. Third, we apply the graph drawing approach to exhibit both the mobility graph and signal graph in a more readable manner. Finally, the page rank (PR) algorithm is proposed to construct the mapping from the signal graph into the mobility graph. The experimental results show that the proposed approach achieves satisfactory localization accuracy and meanwhile avoids the intensive time and labor cost involved in the conventional location fingerprinting-based indoor WLAN localization. PMID:26404274
Synchronized excitability in a network enables generation of internal neuronal sequences
Wang, Yingxue; Roth, Zachary; Pastalkova, Eva
2016-01-01
Hippocampal place field sequences are supported by sensory cues and network internal mechanisms. In contrast, sharp-wave (SPW) sequences, theta sequences, and episode field sequences are internally generated. The relationship of these sequences to memory is unclear. SPW sequences have been shown to support learning and have been assumed to also support episodic memory. Conversely, we demonstrate these SPW sequences were present in trained rats even after episodic memory was impaired and after other internal sequences – episode field and theta sequences – were eliminated. SPW sequences did not support memory despite continuing to ‘replay’ all task-related sequences – place- field and episode field sequences. Sequence replay occurred selectively during synchronous increases of population excitability -- SPWs. Similarly, theta sequences depended on the presence of repeated synchronized waves of excitability – theta oscillations. Thus, we suggest that either intermittent or rhythmic synchronized changes of excitability trigger sequential firing of neurons, which in turn supports learning and/or memory. DOI: http://dx.doi.org/10.7554/eLife.20697.001 PMID:27677848
Angus, Robert B.; Ribera, Ignacio; Jia, Fenglong
2017-01-01
Abstract Karyotypes are given for Boreonectes emmerichi (Falkenström, 1936) from its type locality at Kangding, China, and for B. alpestris (Dutton & Angus, 2007) from the St Gotthard and San Bernardino passes in the Swiss Alps. A phylogeny based on sequence data from a combination of mitochondrial and nuclear genes recovered western Palaearctic species of Boreonectes as monophyletic with strong support. Boreonectes emmerichi was placed as sister to the north American forms of B. griseostriatus (De Geer, 1774), although with low support. The diversity of Palaearctic species of the B. griseostriatus species group is discussed. PMID:28919958
Angus, Robert B; Ribera, Ignacio; Jia, Fenglong
2017-01-01
Karyotypes are given for Boreonectes emmerichi (Falkenström, 1936) from its type locality at Kangding, China, and for B. alpestris (Dutton & Angus, 2007) from the St Gotthard and San Bernardino passes in the Swiss Alps. A phylogeny based on sequence data from a combination of mitochondrial and nuclear genes recovered western Palaearctic species of Boreonectes as monophyletic with strong support. Boreonectes emmerichi was placed as sister to the north American forms of B. griseostriatus (De Geer, 1774), although with low support. The diversity of Palaearctic species of the B. griseostriatus species group is discussed.
Differentially Private Frequent Sequence Mining via Sampling-based Candidate Pruning
Xu, Shengzhi; Cheng, Xiang; Li, Zhengyi; Xiong, Li
2016-01-01
In this paper, we study the problem of mining frequent sequences under the rigorous differential privacy model. We explore the possibility of designing a differentially private frequent sequence mining (FSM) algorithm which can achieve both high data utility and a high degree of privacy. We found, in differentially private FSM, the amount of required noise is proportionate to the number of candidate sequences. If we could effectively reduce the number of unpromising candidate sequences, the utility and privacy tradeoff can be significantly improved. To this end, by leveraging a sampling-based candidate pruning technique, we propose a novel differentially private FSM algorithm, which is referred to as PFS2. The core of our algorithm is to utilize sample databases to further prune the candidate sequences generated based on the downward closure property. In particular, we use the noisy local support of candidate sequences in the sample databases to estimate which sequences are potentially frequent. To improve the accuracy of such private estimations, a sequence shrinking method is proposed to enforce the length constraint on the sample databases. Moreover, to decrease the probability of misestimating frequent sequences as infrequent, a threshold relaxation method is proposed to relax the user-specified threshold for the sample databases. Through formal privacy analysis, we show that our PFS2 algorithm is ε-differentially private. Extensive experiments on real datasets illustrate that our PFS2 algorithm can privately find frequent sequences with high accuracy. PMID:26973430
DLocalMotif: a discriminative approach for discovering local motifs in protein sequences.
Mehdi, Ahmed M; Sehgal, Muhammad Shoaib B; Kobe, Bostjan; Bailey, Timothy L; Bodén, Mikael
2013-01-01
Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery. This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. http://bioinf.scmb.uq.edu.au/dlocalmotif/
Prabhu, Yogikala; Burgos, Patricia V.; Schindler, Christina; Farías, Ginny G.; Magadár, Javier G.; Bonifacino, Juan S.
2012-01-01
The β-site amyloid precursor protein (APP)–cleaving enzyme 1 (BACE1) is a transmembrane aspartyl protease that catalyzes the proteolytic processing of APP and other plasma membrane protein precursors. BACE1 cycles between the trans-Golgi network (TGN), the plasma membrane, and endosomes by virtue of signals contained within its cytosolic C-terminal domain. One of these signals is the DXXLL-motif sequence DISLL, which controls transport between the TGN and endosomes via interaction with GGA proteins. Here we show that the DISLL sequence is embedded within a longer [DE]XXXL[LI]-motif sequence, DDISLL, which mediates internalization from the plasma membrane by interaction with the clathrin-associated, heterotetrameric adaptor protein 2 (AP-2) complex. Mutation of this signal or knockdown of either AP-2 or clathrin decreases endosomal localization and increases plasma membrane localization of BACE1. Remarkably, internalization-defective BACE1 is able to cleave an APP mutant that itself cannot be delivered to endosomes. The drug brefeldin A reversibly prevents BACE1-catalyzed APP cleavage, ruling out that this reaction occurs in the endoplasmic reticulum (ER) or ER–Golgi intermediate compartment. Taken together, these observations support the notion that BACE1 is capable of cleaving APP in late compartments of the secretory pathway. PMID:22553349
Modliszewski, Jennifer L; Thomas, David T; Fan, Chuanzhu; Crawford, Daniel J; Depamphilis, Claude W; Xiang, Qiu-Yun Jenny
2006-03-01
Knowledge regarding the origin and maintenance of hybrid zones is critical for understanding the evolutionary outcomes of natural hybridization. To evaluate the contribution of historical contact vs. long-distance gene flow in the formation of a broad hybrid zone in central and northern Georgia that involves Aesculus pavia, A. sylvatica, and A. flava, three cpDNA regions (matK, trnD-trnT, and trnH-trnK) were analyzed. The maternal inheritance of cpDNA in Aesculus was confirmed via sequencing of matK from progeny of controlled crosses. Restriction site analyses identified 21 unique haplotypes among 248 individuals representing 29 populations from parental species and hybrids. Haplotypes were sequenced for all cpDNA regions. Restriction site and sequence data were subjected to phylogeographic and population genetic analyses. Considerable cpDNA variation was detected in the hybrid zone, as well as ancestral cpDNA polymorphism; furthermore, the distribution of haplotypes indicates limited interpopulation gene flow via seeds. The genealogy and structure of genetic variation further support the historical presence of A. pavia in the Piedmont, although they are at present locally extinct. In conjunction with previous allozyme studies, the cpDNA data suggest that the hybrid zone originated through historical local gene flow, yet is maintained by periodic long-distance pollen dispersal.
Protein functional features are reflected in the patterns of mRNA translation speed.
López, Daniel; Pazos, Florencio
2015-07-09
The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
Global mapping of DNA conformational flexibility on Saccharomyces cerevisiae.
Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella
2015-04-01
In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3'UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3'-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites.
Global Mapping of DNA Conformational Flexibility on Saccharomyces cerevisiae
Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella
2015-01-01
In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3’UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3’-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites. PMID:25860149
Effect of sequence-dependent rigidity on plectoneme localization in dsDNA
NASA Astrophysics Data System (ADS)
Medalion, Shlomi; Rabin, Yitzhak
2016-04-01
We use Monte-Carlo simulations to study the effect of variable rigidity on plectoneme formation and localization in supercoiled double-stranded DNA. We show that the presence of soft sequences increases the number of plectoneme branches and that the edges of the branches tend to be localized at these sequences. We propose an experimental approach to test our results in vitro, and discuss the possible role played by plectoneme localization in the search process of transcription factors for their targets (promoter regions) on the bacterial genome.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martinsson, T.; Vujic, M.; Tomkinson, B.
1993-08-01
The authors have assigned the human tripeptidyl peptidase II (TPP2) gene to chromosome region 13q32-q33 using two different methods. First, a full-length TPP2 cDNA was used as a probe on Southern blots of DNA from a panel of human/rodent somatic cell hybrids. The TPP2 sequences were found to segregate with the human chromosome 13. Second, fluorescence in situ hybridization analysis was performed with the same probe. This analysis supported the chromosome 13 localization and further refined it to region 13q32-q33. 20 refs., 2 figs.
Origins and Widespread Distribution of Co-existing Polyploids in Arnica cordifolia (Asteraceae)
Kao, Rebecca Hufft
2008-01-01
Background and Aims Polyploidy is a central force structuring genetic diversity in angiosperms, but its ecological significance and modes of origin are not fully understood. This work investigated the patterns of coexistence and molecular relatedness of polyploids in the perennial herb, Arnica cordifolia. Methods The local- and broad-scale distributions of cytotypes were analysed using flow cytometry. Samples were collected from both roadside and understorey habitats to test the hypothesis of niche separation between triploids and tetraploids. The nuclear rDNA internal transcribed spacer (ITS) and plastid rpl16 spacer, trnL intron plus trnL-trnF spacer and trnK 3' intron regions were sequenced. Key Results Broad-scale sampling established that both triploids and tetraploids were common throughout the range of the species, pentaploids were rare, and diploids were not found. Local-scale sampling revealed coexistence of both triploids and tetraploids within the majority of sites. Triploids and tetraploids were equally represented in the understorey and roadside habitat. Triploids were more variable than tetraploids, but both cytotypes shared polymorphisms in ITS. Conclusions Coexistence of cytotypes appears to be the norm in A. cordifolia, but habitat differentiation (roadside vs. understorey) is not supported as a coexistence mechanism. Molecular analyses supported multiple events creating triploids but revealed a lack of variation in the tetraploids. Additionally, sequence polymorphisms in ITS suggested a hybridization event prior to polyploidization. PMID:17993653
Origins and widespread distribution of co-existing Polyploids in Arnica cordifolia (Asteraceae).
Kao, Rebecca Hufft
2008-01-01
Polyploidy is a central force structuring genetic diversity in angiosperms, but its ecological significance and modes of origin are not fully understood. This work investigated the patterns of coexistence and molecular relatedness of polyploids in the perennial herb, Arnica cordifolia. The local- and broad-scale distributions of cytotypes were analysed using flow cytometry. Samples were collected from both roadside and understorey habitats to test the hypothesis of niche separation between triploids and tetraploids. The nuclear rDNA internal transcribed spacer (ITS) and plastid rpl16 spacer, trnL intron plus trnL-trnF spacer and trnK 3' intron regions were sequenced. Broad-scale sampling established that both triploids and tetraploids were common throughout the range of the species, pentaploids were rare, and diploids were not found. Local-scale sampling revealed coexistence of both triploids and tetraploids within the majority of sites. Triploids and tetraploids were equally represented in the understorey and roadside habitat. Triploids were more variable than tetraploids, but both cytotypes shared polymorphisms in ITS. Coexistence of cytotypes appears to be the norm in A. cordifolia, but habitat differentiation (roadside vs. understorey) is not supported as a coexistence mechanism. Molecular analyses supported multiple events creating triploids but revealed a lack of variation in the tetraploids. Additionally, sequence polymorphisms in ITS suggested a hybridization event prior to polyploidization.
Lari, Martina; Rizzi, Ermanno; Mona, Stefano; Corti, Giorgio; Catalano, Giulio; Chen, Kefei; Vernesi, Cristiano; Larson, Greger; Boscato, Paolo; De Bellis, Gianluca; Cooper, Alan; Caramelli, David; Bertorelle, Giorgio
2011-01-31
Bos primigenius, the aurochs, is the wild ancestor of modern cattle breeds and was formerly widespread across Eurasia and northern Africa. After a progressive decline, the species became extinct in 1627. The origin of modern taurine breeds in Europe is debated. Archaeological and early genetic evidence point to a single Near Eastern origin and a subsequent spread during the diffusion of herding and farming. More recent genetic data are instead compatible with local domestication events or at least some level of local introgression from the aurochs. Here we present the analysis of the complete mitochondrial genome of a pre-Neolithic Italian aurochs. In this study, we applied a combined strategy employing both multiplex PCR amplifications and 454 pyrosequencing technology to sequence the complete mitochondrial genome of an 11,450-year-old aurochs specimen from Central Italy. Phylogenetic analysis of the aurochs mtDNA genome supports the conclusions from previous studies of short mtDNA fragments--namely that Italian aurochsen were genetically very similar to modern cattle breeds, but highly divergent from the North-Central European aurochsen. Complete mitochondrial genome sequences are now available for several modern cattle and two pre-Neolithic mtDNA genomes from very different geographic areas. These data suggest that previously identified sub-groups within the widespread modern cattle mitochondrial T clade are polyphyletic, and they support the hypothesis that modern European breeds have multiple geographic origins.
Cirulli, Elizabeth T; Noor, Mohamed A F
2007-01-01
Ectopic exchange between transposable elements or other repetitive sequences along a chromosome can produce chromosomal inversions. As a result, genome sequence studies typically find sequence similarity between corresponding inversion breakpoint regions. Here, we identify and investigate the breakpoint regions of the X chromosome inversion distinguishing Drosophila mojavensis and Drosophila arizonae. We localize one inversion breakpoint to 13.7 kb and localize the other to a 1-Mb interval. Using this localization and assuming microsynteny between Drosophila melanogaster and D. arizonae, we pinpoint likely positions of the inversion breakpoints to windows of less than 3000 bp. These breakpoints define the size of the inversion to approximately 11 Mb. However, in contrast to many other studies, we fail to find significant sequence similarity between the 2 breakpoint regions. The localization of these inversion breakpoints will facilitate future genetic and molecular evolutionary studies in this species group, an emerging model system for ecological genetics.
The eukaryotic signal sequence, YGRL, targets the chlamydial inclusion
Kabeiseman, Emily J.; Cichos, Kyle H.; Moore, Elizabeth R.
2014-01-01
Understanding how host proteins are targeted to pathogen-specified organelles, like the chlamydial inclusion, is fundamentally important to understanding the biogenesis of these unique subcellular compartments and how they maintain autonomy within the cell. Syntaxin 6, which localizes to the chlamydial inclusion, contains an YGRL signal sequence. The YGRL functions to return syntaxin 6 to the trans-Golgi from the plasma membrane, and deletion of the YGRL signal sequence from syntaxin 6 also prevents the protein from localizing to the chlamydial inclusion. YGRL is one of three YXXL (YGRL, YQRL, and YKGL) signal sequences which target proteins to the trans-Golgi. We designed various constructs of eukaryotic proteins to test the specificity and propensity of YXXL sequences to target the inclusion. The YGRL signal sequence redirects proteins (e.g., Tgn38, furin, syntaxin 4) that normally do not localize to the chlamydial inclusion. Further, the requirement of the YGRL signal sequence for syntaxin 6 localization to inclusions formed by different species of Chlamydia is conserved. These data indicate that there is an inherent property of the chlamydial inclusion, which allows it to recognize the YGRL signal sequence. To examine whether this “inherent property” was protein or lipid in nature, we asked if deletion of the YGRL signal sequence from syntaxin 6 altered the ability of the protein to interact with proteins or lipids. Deletion or alteration of the YGRL from syntaxin 6 does not appreciably impact syntaxin 6-protein interactions, but does decrease syntaxin 6-lipid interactions. Intriguingly, data also demonstrate that YKGL or YQRL can successfully substitute for YGRL in localization of syntaxin 6 to the chlamydial inclusion. Importantly and for the first time, we are establishing that a eukaryotic signal sequence targets the chlamydial inclusion. PMID:25309881
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments.
Daily, Jeff
2016-02-10
Sequence alignment algorithms are a key component of many bioinformatics applications. Though various fast Smith-Waterman local sequence alignment implementations have been developed for x86 CPUs, most are embedded into larger database search tools. In addition, fast implementations of Needleman-Wunsch global sequence alignment and its semi-global variants are not as widespread. This article presents the first software library for local, global, and semi-global pairwise intra-sequence alignments and improves the performance of previous intra-sequence implementations. A faster intra-sequence local pairwise alignment implementation is described and benchmarked, including new global and semi-global variants. Using a 375 residue query sequence a speed of 136 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon E5-2670 24-core processor system, the highest reported for an implementation based on Farrar's 'striped' approach. Rognes's SWIPE optimal database search application is still generally the fastest available at 1.2 to at best 2.4 times faster than Parasail for sequences shorter than 500 amino acids. However, Parasail was faster for longer sequences. For global alignments, Parasail's prefix scan implementation is generally the fastest, faster even than Farrar's 'striped' approach, however the opal library is faster for single-threaded applications. The software library is designed for 64 bit Linux, OS X, or Windows on processors with SSE2, SSE41, or AVX2. Source code is available from https://github.com/jeffdaily/parasail under the Battelle BSD-style license. Applications that require optimal alignment scores could benefit from the improved performance. For the first time, SIMD global, semi-global, and local alignments are available in a stand-alone C library.
Niche specialization of terrestrial archaeal ammonia oxidizers.
Gubry-Rangin, Cécile; Hai, Brigitte; Quince, Christopher; Engel, Marion; Thomson, Bruce C; James, Phillip; Schloter, Michael; Griffiths, Robert I; Prosser, James I; Nicol, Graeme W
2011-12-27
Soil pH is a major determinant of microbial ecosystem processes and potentially a major driver of evolution, adaptation, and diversity of ammonia oxidizers, which control soil nitrification. Archaea are major components of soil microbial communities and contribute significantly to ammonia oxidation in some soils. To determine whether pH drives evolutionary adaptation and community structure of soil archaeal ammonia oxidizers, sequences of amoA, a key functional gene of ammonia oxidation, were examined in soils at global, regional, and local scales. Globally distributed database sequences clustered into 18 well-supported phylogenetic lineages that dominated specific soil pH ranges classified as acidic (pH <5), acido-neutral (5 ≤ pH <7), or alkalinophilic (pH ≥ 7). To determine whether patterns were reproduced at regional and local scales, amoA gene fragments were amplified from DNA extracted from 47 soils in the United Kingdom (pH 3.5-8.7), including a pH-gradient formed by seven soils at a single site (pH 4.5-7.5). High-throughput sequencing and analysis of amoA gene fragments identified an additional, previously undiscovered phylogenetic lineage and revealed similar pH-associated distribution patterns at global, regional, and local scales, which were most evident for the five most abundant clusters. Archaeal amoA abundance and diversity increased with soil pH, which was the only physicochemical characteristic measured that significantly influenced community structure. These results suggest evolution based on specific adaptations to soil pH and niche specialization, resulting in a global distribution of archaeal lineages that have important consequences for soil ecosystem function and nitrogen cycling.
SubCellProt: predicting protein subcellular localization using machine learning approaches.
Garg, Prabha; Sharma, Virag; Chaudhari, Pradeep; Roy, Nilanjan
2009-01-01
High-throughput genome sequencing projects continue to churn out enormous amounts of raw sequence data. However, most of this raw sequence data is unannotated and, hence, not very useful. Among the various approaches to decipher the function of a protein, one is to determine its localization. Experimental approaches for proteome annotation including determination of a protein's subcellular localizations are very costly and labor intensive. Besides the available experimental methods, in silico methods present alternative approaches to accomplish this task. Here, we present two machine learning approaches for prediction of the subcellular localization of a protein from the primary sequence information. Two machine learning algorithms, k Nearest Neighbor (k-NN) and Probabilistic Neural Network (PNN) were used to classify an unknown protein into one of the 11 subcellular localizations. The final prediction is made on the basis of a consensus of the predictions made by two algorithms and a probability is assigned to it. The results indicate that the primary sequence derived features like amino acid composition, sequence order and physicochemical properties can be used to assign subcellular localization with a fair degree of accuracy. Moreover, with the enhanced accuracy of our approach and the definition of a prediction domain, this method can be used for proteome annotation in a high throughput manner. SubCellProt is available at www.databases.niper.ac.in/SubCellProt.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50)
Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey
2017-01-01
Abstract Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. PMID:29048578
Study of cnidarian-algal symbiosis in the "omics" age.
Meyer, Eli; Weis, Virginia M
2012-08-01
The symbiotic associations between cnidarians and dinoflagellate algae (Symbiodinium) support productive and diverse ecosystems in coral reefs. Many aspects of this association, including the mechanistic basis of host-symbiont recognition and metabolic interaction, remain poorly understood. The first completed genome sequence for a symbiotic anthozoan is now available (the coral Acropora digitifera), and extensive expressed sequence tag resources are available for a variety of other symbiotic corals and anemones. These resources make it possible to profile gene expression, protein abundance, and protein localization associated with the symbiotic state. Here we review the history of "omics" studies of cnidarian-algal symbiosis and the current availability of sequence resources for corals and anemones, identifying genes putatively involved in symbiosis across 10 anthozoan species. The public availability of candidate symbiosis-associated genes leaves the field of cnidarian-algal symbiosis poised for in-depth comparative studies of sequence diversity and gene expression and for targeted functional studies of genes associated with symbiosis. Reviewing the progress to date suggests directions for future investigations of cnidarian-algal symbiosis that include (i) sequencing of Symbiodinium, (ii) proteomic analysis of the symbiosome membrane complex, (iii) glycomic analysis of Symbiodinium cell surfaces, and (iv) expression profiling of the gastrodermal cells hosting Symbiodinium.
NASA Astrophysics Data System (ADS)
Poletti, Enea; Veronese, Elisa; Calabrese, Massimiliano; Bertoldo, Alessandra; Grisan, Enrico
2012-02-01
The automatic segmentation of brain tissues in magnetic resonance (MR) is usually performed on T1-weighted images, due to their high spatial resolution. T1w sequence, however, has some major downsides when brain lesions are present: the altered appearance of diseased tissues causes errors in tissues classification. In order to overcome these drawbacks, we employed two different MR sequences: fluid attenuated inversion recovery (FLAIR) and double inversion recovery (DIR). The former highlights both gray matter (GM) and white matter (WM), the latter highlights GM alone. We propose here a supervised classification scheme that does not require any anatomical a priori information to identify the 3 classes, "GM", "WM", and "background". Features are extracted by means of a local multi-scale texture analysis, computed for each pixel of the DIR and FLAIR sequences. The 9 textures considered are average, standard deviation, kurtosis, entropy, contrast, correlation, energy, homogeneity, and skewness, evaluated on a neighborhood of 3x3, 5x5, and 7x7 pixels. Hence, the total number of features associated to a pixel is 56 (9 textures x3 scales x2 sequences +2 original pixel values). The classifier employed is a Support Vector Machine with Radial Basis Function as kernel. From each of the 4 brain volumes evaluated, a DIR and a FLAIR slice have been selected and manually segmented by 2 expert neurologists, providing 1st and 2nd human reference observations which agree with an average accuracy of 99.03%. SVM performances have been assessed with a 4-fold cross-validation, yielding an average classification accuracy of 98.79%.
Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme
2013-07-01
The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. Supplementary data are available at Bioinformatics online.
NASA Astrophysics Data System (ADS)
Atehortúa, Angélica; Zuluaga, Maria A.; Ourselin, Sébastien; Giraldo, Diana; Romero, Eduardo
2016-03-01
An accurate ventricular function quantification is important to support evaluation, diagnosis and prognosis of several cardiac pathologies. However, expert heart delineation, specifically for the right ventricle, is a time consuming task with high inter-and-intra observer variability. A fully automatic 3D+time heart segmentation framework is herein proposed for short-axis-cardiac MRI sequences. This approach estimates the heart using exclusively information from the sequence itself without tuning any parameters. The proposed framework uses a coarse-to-fine approach, which starts by localizing the heart via spatio-temporal analysis, followed by a segmentation of the basal heart that is then propagated to the apex by using a non-rigid-registration strategy. The obtained volume is then refined by estimating the ventricular muscle by locally searching a prior endocardium- pericardium intensity pattern. The proposed framework was applied to 48 patients datasets supplied by the organizers of the MICCAI 2012 Right Ventricle segmentation challenge. Results show the robustness, efficiency and competitiveness of the proposed method both in terms of accuracy and computational load.
Weigand, Hannah; Weiss, Martina; Cai, Huimin; Li, Yongping; Yu, Lili; Zhang, Christine; Leese, Florian
2018-08-15
Local adaptation is of fundamental importance for populations to cope with fast, human-mediated environmental changes. In the past, analyses of local adaptation were restricted to few model species. Nowadays, due to the increased affordability of high-throughput sequencing, local adaptation can be studied much easier by searching for patterns of positive selection using genomic data. In the present study, we analysed effects of wastewater treatment plant and ore mining effluents on stream invertebrate populations. The two different anthropogenic stressors have impacted on stream ecosystems over different time scales and with different potencies. As target organisms we selected two macroinvertebrate species with different life histories and dispersal capacities: the caddisfly Glossosoma conformis and the flatworm Dugesia gonocephala. We applied a genome-wide genetic marker technique, termed ddRAD (double digest restriction site associated DNA) sequencing, to identify local adaptation. Ten and 18% of all loci were identified as candidate loci for local adaptation in D. gonocephala and G. conformis, respectively. However, after stringent re-evaluation of the genomic data, strong evidence for local adaptation remained only for one population of the flatworm D. gonocephala affected by high copper concentration from ore mining. One of the corresponding candidate loci is arnt, a gene associated with the response to xenobiotics and potentially involved in metal detoxification. Our results support the hypotheses that local adaptation is more likely to play a central role in environments impacted by a stronger stressor for a longer time and that it is more likely to occur in species with lower migration rates. However, these findings have to be interpreted cautiously, as several confounding factors may have limited the possibility to detect local adaptation. Our study highlights how genomic tools can be used to study the adaptability and thus resistance of natural populations to changing environments and we discuss prospects and limitations of the methods. Copyright © 2018 Elsevier B.V. All rights reserved.
Hu, Long; Xu, Zhiyu; Hu, Boqin; Lu, Zhi John
2017-01-09
Recent genomic studies suggest that novel long non-coding RNAs (lncRNAs) are specifically expressed and far outnumber annotated lncRNA sequences. To identify and characterize novel lncRNAs in RNA sequencing data from new samples, we have developed COME, a coding potential calculation tool based on multiple features. It integrates multiple sequence-derived and experiment-based features using a decompose-compose method, which makes it more accurate and robust than other well-known tools. We also showed that COME was able to substantially improve the consistency of predication results from other coding potential calculators. Moreover, COME annotates and characterizes each predicted lncRNA transcript with multiple lines of supporting evidence, which are not provided by other tools. Remarkably, we found that one subgroup of lncRNAs classified by such supporting features (i.e. conserved local RNA secondary structure) was highly enriched in a well-validated database (lncRNAdb). We further found that the conserved structural domains on lncRNAs had better chance than other RNA regions to interact with RNA binding proteins, based on the recent eCLIP-seq data in human, indicating their potential regulatory roles. Overall, we present COME as an accurate, robust and multiple-feature supported method for the identification and characterization of novel lncRNAs. The software implementation is available at https://github.com/lulab/COME. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
fRMSDPred: Predicting Local RMSD Between Structural Fragments Using Sequence Information
2007-04-04
machine learning approaches for estimating the RMSD value of a pair of protein fragments. These estimated fragment-level RMSD values can be used to construct the alignment, assess the quality of an alignment, and identify high-quality alignment segments. We present algorithms to solve this fragment-level RMSD prediction problem using a supervised learning framework based on support vector regression and classification that incorporates protein profiles, predicted secondary structure, effective information encoding schemes, and novel second-order pairwise exponential kernel
Processing Dynamic Image Sequences from a Moving Sensor.
1984-02-01
65 Roadsign Image Sequence ..... ................ ... 70 Roadsign Sequence with Redundant Features .. ........ . 79 Roadsign Subimage...Selected Feature Error Values .. ........ 66 2c. Industrial Image Selected Feature Local Search Values. .. .... 67 3ab. Roadsign Image Error Values...72 3c. Roadsign Image Local Search Values ............. 73 4ab. Roadsign Redundant Feature Error Values. ............ 8 4c. Roadsign
A computational genomics pipeline for prokaryotic sequencing projects.
Kislyuk, Andrey O; Katz, Lee S; Agrawal, Sonia; Hagen, Matthew S; Conley, Andrew B; Jayaraman, Pushkala; Nelakuditi, Viswateja; Humphrey, Jay C; Sammons, Scott A; Govil, Dhwani; Mair, Raydel D; Tatti, Kathleen M; Tondella, Maria L; Harcourt, Brian H; Mayer, Leonard W; Jordan, I King
2010-08-01
New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data. We present a self-contained, automated high-throughput open source genome sequencing and computational genomics pipeline suitable for prokaryotic sequencing projects. The pipeline has been used at the Georgia Institute of Technology and the Centers for Disease Control and Prevention for the analysis of Neisseria meningitidis and Bordetella bronchiseptica genomes. The pipeline is capable of enhanced or manually assisted reference-based assembly using multiple assemblers and modes; gene predictor combining; and functional annotation of genes and gene products. Because every component of the pipeline is executed on a local machine with no need to access resources over the Internet, the pipeline is suitable for projects of a sensitive nature. Annotation of virulence-related features makes the pipeline particularly useful for projects working with pathogenic prokaryotes. The pipeline is licensed under the open-source GNU General Public License and available at the Georgia Tech Neisseria Base (http://nbase.biology.gatech.edu/). The pipeline is implemented with a combination of Perl, Bourne Shell and MySQL and is compatible with Linux and other Unix systems.
Jedlicki, Ana; Fernández, Gonzalo; Astorga, Marcela; Oyarzún, Pablo; Toro, Jorge E.; Navarro, Jorge M.; Martínez, Víctor
2012-01-01
Background and aims On the basis of morphological evidence, the species involved in South American Pacific coast harmful algal blooms (HABs) has been traditionally recognized as Alexandrium catenella (Dinophyceae). However, these observations have not been confirmed using evidence based on genomic sequence variability. Our principal objective was to accurately determine the species of Alexandrium involved in local HABs in order to implement a real-time polymerase chain reaction (PCR) assay for its rapid and easy detection on filter-feeding shellfish, such as mussels. Methodology For species-specific determination, the intergenic spacer 1 (ITS1), 5.8S subunit, ITS2 and the hypervariable genomic regions D1–D5 of the large ribosomal subunit of local strains were sequenced and compared with two data sets of other Alexandrium sequences. Species-specific primers were used to amplify signature sequences within the genomic DNA of the studied species by conventional and real-time PCR. Principal results Phylogenetic analysis determined that the Chilean strain falls into Group I of the tamarensis complex. Our results support the allocation of the Chilean Alexandrium species as a toxic Alexandrium tamarense rather than A. catenella, as currently defined. Once local species were determined to belong to Group I of the tamarensis complex, a highly sensitive and accurate real-time PCR procedure was developed to detect dinoflagellate presence in Mytilus spp. (Bivalvia) samples after being fed (challenged) in vitro with the Chilean Alexandrium strain. The results show that real-time PCR is useful to detect Alexandrium intake in filter-feeding molluscs. Conclusions It has been shown that the classification of local Alexandrium using morphological evidence is not very accurate. Molecular methods enabled the HAB dinoflagellate species of the Chilean coast to be assigned as A. tamarense rather than A. catenella. Real-time PCR analysis based on A. tamarense primers allowed the detection of dinoflagellate DNA in Mytilus spp. samples exposed to this alga. Through the specific assignment of dinoflagellate species involved in HABs, more reliable preventive policies can be implemented. PMID:23259043
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts
Lu, Tzu-Chiao; Leu, Jun-Yi; Lin, Wen-Chang
2017-01-01
Abstract Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network. PMID:28981695
The identification and functional annotation of RNA structures conserved in vertebrates.
Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan
2017-08-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns
2013-01-01
Background It is well known that the search for homologous RNAs is more effective if both sequence and structure information is incorporated into the search. However, current tools for searching with RNA sequence-structure patterns cannot fully handle mutations occurring on both these levels or are simply not fast enough for searching large sequence databases because of the high computational costs of the underlying sequence-structure alignment problem. Results We present new fast index-based and online algorithms for approximate matching of RNA sequence-structure patterns supporting a full set of edit operations on single bases and base pairs. Our methods efficiently compute semi-global alignments of structural RNA patterns and substrings of the target sequence whose costs satisfy a user-defined sequence-structure edit distance threshold. For this purpose, we introduce a new computing scheme to optimally reuse the entries of the required dynamic programming matrices for all substrings and combine it with a technique for avoiding the alignment computation of non-matching substrings. Our new index-based methods exploit suffix arrays preprocessed from the target database and achieve running times that are sublinear in the size of the searched sequences. To support the description of RNA molecules that fold into complex secondary structures with multiple ordered sequence-structure patterns, we use fast algorithms for the local or global chaining of approximate sequence-structure pattern matches. The chaining step removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our improved online algorithm is faster than the best previous method by up to factor 45. Our best new index-based algorithm achieves a speedup of factor 560. Conclusions The presented methods achieve considerable speedups compared to the best previous method. This, together with the expected sublinear running time of the presented index-based algorithms, allows for the first time approximate matching of RNA sequence-structure patterns in large sequence databases. Beyond the algorithmic contributions, we provide with RaligNAtor a robust and well documented open-source software package implementing the algorithms presented in this manuscript. The RaligNAtor software is available at http://www.zbh.uni-hamburg.de/ralignator. PMID:23865810
Network command processing system overview
NASA Technical Reports Server (NTRS)
Nam, Yon-Woo; Murphy, Lisa D.
1993-01-01
The Network Command Processing System (NCPS) developed for the National Aeronautics and Space Administration (NASA) Ground Network (GN) stations is a spacecraft command system utilizing a MULTIBUS I/68030 microprocessor. This system was developed and implemented at ground stations worldwide to provide a Project Operations Control Center (POCC) with command capability for support of spacecraft operations such as the LANDSAT, Shuttle, Tracking and Data Relay Satellite, and Nimbus-7. The NCPS consolidates multiple modulation schemes for supporting various manned/unmanned orbital platforms. The NCPS interacts with the POCC and a local operator to process configuration requests, generate modulated uplink sequences, and inform users of the ground command link status. This paper presents the system functional description, hardware description, and the software design.
Maleki, Ehsan; Babashah, Hossein; Koohi, Somayyeh; Kavehvash, Zahra
2017-07-01
This paper presents an optical processing approach for exploring a large number of genome sequences. Specifically, we propose an optical correlator for global alignment and an extended moiré matching technique for local analysis of spatially coded DNA, whose output is fed to a novel three-dimensional artificial neural network for local DNA alignment. All-optical implementation of the proposed 3D artificial neural network is developed and its accuracy is verified in Zemax. Thanks to its parallel processing capability, the proposed structure performs local alignment of 4 million sequences of 150 base pairs in a few seconds, which is much faster than its electrical counterparts, such as the basic local alignment search tool.
Damkliang, Kasikrit; Tandayya, Pichaya; Sangket, Unitsa; Pasomsub, Ekawat
2016-11-28
At the present, coding sequence (CDS) has been discovered and larger CDS is being revealed frequently. Approaches and related tools have also been developed and upgraded concurrently, especially for phylogenetic tree analysis. This paper proposes an integrated automatic Taverna workflow for the phylogenetic tree inferring analysis using public access web services at European Bioinformatics Institute (EMBL-EBI) and Swiss Institute of Bioinformatics (SIB), and our own deployed local web services. The workflow input is a set of CDS in the Fasta format. The workflow supports 1,000 to 20,000 numbers in bootstrapping replication. The workflow performs the tree inferring such as Parsimony (PARS), Distance Matrix - Neighbor Joining (DIST-NJ), and Maximum Likelihood (ML) algorithms of EMBOSS PHYLIPNEW package based on our proposed Multiple Sequence Alignment (MSA) similarity score. The local web services are implemented and deployed into two types using the Soaplab2 and Apache Axis2 deployment. There are SOAP and Java Web Service (JWS) providing WSDL endpoints to Taverna Workbench, a workflow manager. The workflow has been validated, the performance has been measured, and its results have been verified. Our workflow's execution time is less than ten minutes for inferring a tree with 10,000 replicates of the bootstrapping numbers. This paper proposes a new integrated automatic workflow which will be beneficial to the bioinformaticians with an intermediate level of knowledge and experiences. All local services have been deployed at our portal http://bioservices.sci.psu.ac.th.
Damkliang, Kasikrit; Tandayya, Pichaya; Sangket, Unitsa; Pasomsub, Ekawat
2016-03-01
At the present, coding sequence (CDS) has been discovered and larger CDS is being revealed frequently. Approaches and related tools have also been developed and upgraded concurrently, especially for phylogenetic tree analysis. This paper proposes an integrated automatic Taverna workflow for the phylogenetic tree inferring analysis using public access web services at European Bioinformatics Institute (EMBL-EBI) and Swiss Institute of Bioinformatics (SIB), and our own deployed local web services. The workflow input is a set of CDS in the Fasta format. The workflow supports 1,000 to 20,000 numbers in bootstrapping replication. The workflow performs the tree inferring such as Parsimony (PARS), Distance Matrix - Neighbor Joining (DIST-NJ), and Maximum Likelihood (ML) algorithms of EMBOSS PHYLIPNEW package based on our proposed Multiple Sequence Alignment (MSA) similarity score. The local web services are implemented and deployed into two types using the Soaplab2 and Apache Axis2 deployment. There are SOAP and Java Web Service (JWS) providing WSDL endpoints to Taverna Workbench, a workflow manager. The workflow has been validated, the performance has been measured, and its results have been verified. Our workflow's execution time is less than ten minutes for inferring a tree with 10,000 replicates of the bootstrapping numbers. This paper proposes a new integrated automatic workflow which will be beneficial to the bioinformaticians with an intermediate level of knowledge and experiences. The all local services have been deployed at our portal http://bioservices.sci.psu.ac.th.
NASA Astrophysics Data System (ADS)
Astuti, Winny; Qomarun; Febela, Alfa; Putri, Rufia A.; Mukaromah, H.
2018-03-01
Concept of friendly city mentions about better employment options and more opportunities of the local communities. Tourism development and creative industry effectively enhance regional economic development and community welfare. Kampoeng Jayengan is located in Serengan District, Surakarta, which the local community was originally come from Banjar in 1746; with the main economic activity was Jewels Trader. Jayengan Jewels Kampoeng (JKP) will support development of Surakarta as a creative city integrated with other creative tourism Kampoengs. In fact, JKP has been launched by the Secretary of Surakarta Government in 2015 indicated the Government’s attention and support to development of the area. This research aims for Developing Attraction Simulation based on the local economic activities of the communities in order to stimulate Economic friendly city. The methods used mixed methods combining quantitative through field observation and qualitative approach through interview and FGD. The results developed two directions of Tourism Attraction destination, the West and the East Direction. The sequence process of the jewels craft making is challenging for creating the path of tourism attraction in JKP. This implies for increasing tourist visits, enhancing regional economic development and community welfare.
Simmons, Mark P; Goloboff, Pablo A
2013-10-01
Empirical and simulated examples are used to demonstrate an artifact caused by undersampling optimal trees in data matrices that consist mostly or entirely of locally sampled (as opposed to globally, for most or all terminals) characters. The artifact is that unsupported clades consisting entirely of terminals scored for the same locally sampled partition may be resolved and assigned high resampling support-despite their being properly unsupported (i.e., not resolved in the strict consensus of all optimal trees). This artifact occurs despite application of random-addition sequences for stepwise terminal addition. The artifact is not necessarily obviated with thorough conventional branch swapping methods (even tree-bisection-reconnection) when just a single tree is held, as is sometimes implemented in parsimony bootstrap pseudoreplicates, and in every GARLI, PhyML, and RAxML pseudoreplicate and search for the most likely tree for the matrix as a whole. Hence GARLI, RAxML, and PhyML-based likelihood results require extra scrutiny, particularly when they provide high resolution and support for clades that are entirely unsupported by methods that perform more thorough searches, as in most parsimony analyses. Copyright © 2013 Elsevier Inc. All rights reserved.
WegoLoc: accurate prediction of protein subcellular localization using weighted Gene Ontology terms.
Chi, Sang-Mun; Nam, Dougu
2012-04-01
We present an accurate and fast web server, WegoLoc for predicting subcellular localization of proteins based on sequence similarity and weighted Gene Ontology (GO) information. A term weighting method in the text categorization process is applied to GO terms for a support vector machine classifier. As a result, WegoLoc surpasses the state-of-the-art methods for previously used test datasets. WegoLoc supports three eukaryotic kingdoms (animals, fungi and plants) and provides human-specific analysis, and covers several sets of cellular locations. In addition, WegoLoc provides (i) multiple possible localizations of input protein(s) as well as their corresponding probability scores, (ii) weights of GO terms representing the contribution of each GO term in the prediction, and (iii) a BLAST E-value for the best hit with GO terms. If the similarity score does not meet a given threshold, an amino acid composition-based prediction is applied as a backup method. WegoLoc and User's guide are freely available at the website http://www.btool.org/WegoLoc smchiks@ks.ac.kr; dougnam@unist.ac.kr Supplementary data is available at http://www.btool.org/WegoLoc.
De Oliveira, T; Miller, R; Tarin, M; Cassol, S
2003-01-01
Sequence databases encode a wealth of information needed to develop improved vaccination and treatment strategies for the control of HIV and other important pathogens. To facilitate effective utilization of these datasets, we developed a user-friendly GDE-based LINUX interface that reduces input/output file formatting. GDE was adapted to the Linux operating system, bioinformatics tools were integrated with microbe-specific databases, and up-to-date GDE menus were developed for several clinically important viral, bacterial and parasitic genomes. Each microbial interface was designed for local access and contains Genbank, BLAST-formatted and phylogenetic databases. GDE-Linux is available for research purposes by direct application to the corresponding author. Application-specific menus and support files can be downloaded from (http://www.bioafrica.net).
Global and Local Pitch Perception in Children with Developmental Dyslexia
ERIC Educational Resources Information Center
Ziegler, Johannes C.; Pech-Georgel, Catherine; George, Florence; Foxton, Jessica M.
2012-01-01
This study investigated global versus local pitch pattern perception in children with dyslexia aged between 8 and 11 years. Children listened to two consecutive 4-tone pitch sequences while performing a same/different task. On the different trials, sequences either preserved the contour (local condition) or they violated the contour (global…
Wong, Emily S. W.; Hardy, Margaret C.; Wood, David; Bailey, Timothy; King, Glenn F.
2013-01-01
Spider neurotoxins are commonly used as pharmacological tools and are a popular source of novel compounds with therapeutic and agrochemical potential. Since venom peptides are inherently toxic, the host spider must employ strategies to avoid adverse effects prior to venom use. It is partly for this reason that most spider toxins encode a protective proregion that upon enzymatic cleavage is excised from the mature peptide. In order to identify the mature toxin sequence directly from toxin transcripts, without resorting to protein sequencing, the propeptide cleavage site in the toxin precursor must be predicted bioinformatically. We evaluated different machine learning strategies (support vector machines, hidden Markov model and decision tree) and developed an algorithm (SpiderP) for prediction of propeptide cleavage sites in spider toxins. Our strategy uses a support vector machine (SVM) framework that combines both local and global sequence information. Our method is superior or comparable to current tools for prediction of propeptide sequences in spider toxins. Evaluation of the SVM method on an independent test set of known toxin sequences yielded 96% sensitivity and 100% specificity. Furthermore, we sequenced five novel peptides (not used to train the final predictor) from the venom of the Australian tarantula Selenotypus plumipes to test the accuracy of the predictor and found 80% sensitivity and 99.6% 8-mer specificity. Finally, we used the predictor together with homology information to predict and characterize seven groups of novel toxins from the deeply sequenced venom gland transcriptome of S. plumipes, which revealed structural complexity and innovations in the evolution of the toxins. The precursor prediction tool (SpiderP) is freely available on ArachnoServer (http://www.arachnoserver.org/spiderP.html), a web portal to a comprehensive relational database of spider toxins. All training data, test data, and scripts used are available from the SpiderP website. PMID:23894279
Liang, Yunyun; Liu, Sanyang; Zhang, Shengli
2016-12-01
Apoptosis, or programed cell death, plays a central role in the development and homeostasis of an organism. Obtaining information on subcellular location of apoptosis proteins is very helpful for understanding the apoptosis mechanism. The prediction of subcellular localization of an apoptosis protein is still a challenging task, and existing methods mainly based on protein primary sequences. In this paper, we introduce a new position-specific scoring matrix (PSSM)-based method by using detrended cross-correlation (DCCA) coefficient of non-overlapping windows. Then a 190-dimensional (190D) feature vector is constructed on two widely used datasets: CL317 and ZD98, and support vector machine is adopted as classifier. To evaluate the proposed method, objective and rigorous jackknife cross-validation tests are performed on the two datasets. The results show that our approach offers a novel and reliable PSSM-based tool for prediction of apoptosis protein subcellular localization. Copyright © 2016 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bateman, Nicholas W.; The John P. Murtha Cancer Center, Walter Reed National Military Medical Center, 8901 Wisconsin Avenue, Bethesda 20889, MD; Shoji, Yutaka
2016-01-01
AT-rich interactive domain-containing protein 1A (ARID1A) is a recently identified nuclear tumor suppressor frequently altered in solid tumor malignancies. We have identified a bipartite-like nuclear localization sequence (NLS) that contributes to nuclear import of ARID1A not previously described. We functionally confirm activity using GFP constructs fused with wild-type or mutant NLS sequences. We further show that cyto-nuclear localized, bipartite NLS mutant ARID1A exhibits greater stability than nuclear-localized, wild-type ARID1A. Identification of this undescribed functional NLS within ARID1A contributes vital insights to rationalize the impact of ARID1A missense mutations observed in patient tumors. - Highlights: • We have identified a bipartitemore » nuclear localization sequence (NLS) in ARID1A. • Confirmation of the NLS was performed using GFP constructs. • NLS mutant ARID1A exhibits greater stability than wild-type ARID1A.« less
CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data
Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver
2011-01-01
State of the art (DNA) sequencing methods applied in “Omics” studies grant insight into the ‘blueprints’ of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaboration. However, the context in which these studies are conducted often gets lost, because experimental data, as well as information about the environment are rarely submitted along with the sequence data. If these contextual or metadata are missing, key opportunities of comparison and analysis across studies and habitats are hampered or even impossible. To address this problem, the Genomic Standards Consortium (GSC) promotes checklists and standards to better describe our sequence data collection and to promote the capturing, exchange and integration of sequence data with contextual data. In a recent community effort the GSC has developed a series of recommendations for contextual data that should be submitted along with sequence data. To support the scientific community to significantly enhance the quality and quantity of contextual data in the public sequence data repositories, specialized software tools are needed. In this work we present CDinFusion, a web-based tool to integrate contextual and sequence data in (Multi)FASTA format prior to submission. The tool is open source and available under the Lesser GNU Public License 3. A public installation is hosted and maintained at the Max Planck Institute for Marine Microbiology at http://www.megx.net/cdinfusion. The tool may also be installed locally using the open source code available at http://code.google.com/p/cdinfusion. PMID:21935468
A basic analysis toolkit for biological sequences
Giancarlo, Raffaele; Siragusa, Alessandro; Siragusa, Enrico; Utro, Filippo
2007-01-01
This paper presents a software library, nicknamed BATS, for some basic sequence analysis tasks. Namely, local alignments, via approximate string matching, and global alignments, via longest common subsequence and alignments with affine and concave gap cost functions. Moreover, it also supports filtering operations to select strings from a set and establish their statistical significance, via z-score computation. None of the algorithms is new, but although they are generally regarded as fundamental for sequence analysis, they have not been implemented in a single and consistent software package, as we do here. Therefore, our main contribution is to fill this gap between algorithmic theory and practice by providing an extensible and easy to use software library that includes algorithms for the mentioned string matching and alignment problems. The library consists of C/C++ library functions as well as Perl library functions. It can be interfaced with Bioperl and can also be used as a stand-alone system with a GUI. The software is available at under the GNU GPL. PMID:17877802
Small-target leak detection for a closed vessel via infrared image sequences
NASA Astrophysics Data System (ADS)
Zhao, Ling; Yang, Hongjiu
2017-03-01
This paper focus on a leak diagnosis and localization method based on infrared image sequences. Some problems on high probability of false warning and negative affect for marginal information are solved by leak detection. An experimental model is established for leak diagnosis and localization on infrared image sequences. The differential background prediction is presented to eliminate the negative affect of marginal information on test vessel based on a kernel regression method. A pipeline filter based on layering voting is designed to reduce probability of leak point false warning. A synthesize leak diagnosis and localization algorithm is proposed based on infrared image sequences. The effectiveness and potential are shown for developed techniques through experimental results.
A Robust Crowdsourcing-Based Indoor Localization System.
Zhou, Baoding; Li, Qingquan; Mao, Qingzhou; Tu, Wei
2017-04-14
WiFi fingerprinting-based indoor localization has been widely used due to its simplicity and can be implemented on the smartphones. The major drawback of WiFi fingerprinting is that the radio map construction is very labor-intensive and time-consuming. Another drawback of WiFi fingerprinting is the Received Signal Strength (RSS) variance problem, caused by environmental changes and device diversity. RSS variance severely degrades the localization accuracy. In this paper, we propose a robust crowdsourcing-based indoor localization system (RCILS). RCILS can automatically construct the radio map using crowdsourcing data collected by smartphones. RCILS abstracts the indoor map as the semantics graph in which the edges are the possible user paths and the vertexes are the location where users may take special activities. RCILS extracts the activity sequence contained in the trajectories by activity detection and pedestrian dead-reckoning. Based on the semantics graph and activity sequence, crowdsourcing trajectories can be located and a radio map is constructed based on the localization results. For the RSS variance problem, RCILS uses the trajectory fingerprint model for indoor localization. During online localization, RCILS obtains an RSS sequence and realizes localization by matching the RSS sequence with the radio map. To evaluate RCILS, we apply RCILS in an office building. Experiment results demonstrate the efficiency and robustness of RCILS.
A Robust Crowdsourcing-Based Indoor Localization System
Zhou, Baoding; Li, Qingquan; Mao, Qingzhou; Tu, Wei
2017-01-01
WiFi fingerprinting-based indoor localization has been widely used due to its simplicity and can be implemented on the smartphones. The major drawback of WiFi fingerprinting is that the radio map construction is very labor-intensive and time-consuming. Another drawback of WiFi fingerprinting is the Received Signal Strength (RSS) variance problem, caused by environmental changes and device diversity. RSS variance severely degrades the localization accuracy. In this paper, we propose a robust crowdsourcing-based indoor localization system (RCILS). RCILS can automatically construct the radio map using crowdsourcing data collected by smartphones. RCILS abstracts the indoor map as the semantics graph in which the edges are the possible user paths and the vertexes are the location where users may take special activities. RCILS extracts the activity sequence contained in the trajectories by activity detection and pedestrian dead-reckoning. Based on the semantics graph and activity sequence, crowdsourcing trajectories can be located and a radio map is constructed based on the localization results. For the RSS variance problem, RCILS uses the trajectory fingerprint model for indoor localization. During online localization, RCILS obtains an RSS sequence and realizes localization by matching the RSS sequence with the radio map. To evaluate RCILS, we apply RCILS in an office building. Experiment results demonstrate the efficiency and robustness of RCILS. PMID:28420108
A space-efficient algorithm for local similarities.
Huang, X Q; Hardison, R C; Miller, W
1990-10-01
Existing dynamic-programming algorithms for identifying similar regions of two sequences require time and space proportional to the product of the sequence lengths. Often this space requirement is more limiting than the time requirement. We describe a dynamic-programming local-similarity algorithm that needs only space proportional to the sum of the sequence lengths. The method can also find repeats within a single long sequence. To illustrate the algorithm's potential, we discuss comparison of a 73,360 nucleotide sequence containing the human beta-like globin gene cluster and a corresponding 44,594 nucleotide sequence for rabbit, a problem well beyond the capabilities of other dynamic-programming software.
NASA Astrophysics Data System (ADS)
Barrett, Samuel; Starnberger, Reinhard; Spötl, Christoph; Brauer, Achim; Tjallingii, Rik; Dulski, Peter; Abfalterer, Christof
2015-04-01
The sequence of pre-LGM lacustrine sediments at Baumkirchen (Austria) provides a key record in Alpine Quaternary stratigraphy. These sediments from within the boundary of the Alps potentially provide unique insights into the regional paleoclimate. Recent drilling revealed at least ~250m (the base was not reached) of almost entirely mm- to cm-scale lacustrine sediments. The laminated sediments are comprised of alternations between clayey silt and event layers of medium silt to fine sand. The sequence is interrupted only by a short section of gravel supported in an unlaminated clay-rich matrix. Optically stimulated luminescence dating identifies two distinct sequences: the upper sequence spanning mid-late Marine Isotope Stage (MIS) 3 (~33 to ~45 ka BP), agreeing with existing calibrated radiocarbon ages, and the lower section dating to MIS 4 (~59 to ~73 ka BP). Whether the hiatus is an erosional unconformity, or if the sequences represent two separate lake phases is unclear. Although the precise location of the hiatus is hard to identify, the gravel-rich section lies at the very top of the lower sequence. Pebbles in these gravels are largely angular and contain a significant proportion of non-local, regional lithologies. Such gravels are absent in the remainder of the entire 250 m-thick sequence and hence suggest a unique event rather than e.g. an interfingering local delta gravel foresets with the basin sediments. The gravels are therefore likely to be ice-rafted debris from icebergs from nearby glaciers calving into the lake. This therefore represents the first sedimentological evidence of a MIS 4 ice advance in the Eastern Alps. X-ray fluorescence analysis (ITRAX core scanning) of event layers indicates a strong change in the geochemical composition from generally K, Zr and Ti-rich layers in the upper sequence to mainly Ca and/or Si-rich layers in the lower sequence. X-ray diffraction analysis shows the Ca and Si signals to be controlled by carbonate (both calcite and dolomite) and quartz, respectively. This suggests a change in dominant sediment source and may indicate a change in catchment or paleolake configuration, re-raising the long outstanding question of how the lake or lakes were dammed.
D-loop haplotype diversity in Brazilian horse breeds
Ianella, Patrícia; Albuquerque, Maria do Socorro Maués; Paiva, Samuel Rezende; do Egito, Andréa Alves; Almeida, Leonardo Daniel; Sereno, Fabiana T. P. S.; Carvalho, Luiz Felipe Ramos; Mariante, Arthur da Silva; McManus, Concepta Margaret
2017-01-01
Abstract The first horses were brought to Brazil by the colonizers after 1534. Over the centuries, these animals evolved and adapted to local environmental conditions usually unsuitable for exotic breeds, thereby originating locally adapted Brazilian breeds. The present work represents the first description of maternal genetic diversity in these horse breeds based on D-loop sequences. A D-Loop HSV-I fragment of 252 bp, from 141 horses belonging to ten Brazilian breeds / genetic groups (locally adapted and specialized breeds) were analysed. Thirty-five different haplotypes belonging to 18 haplogroups were identified with 33 polymorphic sites. Haplotype diversity (varying from 0.20 to 0.96) and nucleotide diversity (varying from 0.0039 to 0.0239) was lower for locally adapted than for specialized breeds, with the same pattern observed for FST values. Haplogroups identified in Brazilian breeds are in agreement with previous findings in South American samples. The low variability observed mainly in locally adapted breeds, indicates that, to ensure conservation of these breeds, careful reproductive management is needed. Additional genetic characterization studies are required to support accurate decision-making. PMID:28863209
2011-01-01
Background Bos primigenius, the aurochs, is the wild ancestor of modern cattle breeds and was formerly widespread across Eurasia and northern Africa. After a progressive decline, the species became extinct in 1627. The origin of modern taurine breeds in Europe is debated. Archaeological and early genetic evidence point to a single Near Eastern origin and a subsequent spread during the diffusion of herding and farming. More recent genetic data are instead compatible with local domestication events or at least some level of local introgression from the aurochs. Here we present the analysis of the complete mitochondrial genome of a pre-Neolithic Italian aurochs. Results In this study, we applied a combined strategy employing both multiplex PCR amplifications and 454 pyrosequencing technology to sequence the complete mitochondrial genome of an 11,450-year-old aurochs specimen from Central Italy. Phylogenetic analysis of the aurochs mtDNA genome supports the conclusions from previous studies of short mtDNA fragments - namely that Italian aurochsen were genetically very similar to modern cattle breeds, but highly divergent from the North-Central European aurochsen. Conclusions Complete mitochondrial genome sequences are now available for several modern cattle and two pre-Neolithic mtDNA genomes from very different geographic areas. These data suggest that previously identified sub-groups within the widespread modern cattle mitochondrial T clade are polyphyletic, and they support the hypothesis that modern European breeds have multiple geographic origins. PMID:21281509
Mapping a nucleolar targeting sequence of an RNA binding nucleolar protein, Nop25
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fujiwara, Takashi; Suzuki, Shunji; Kanno, Motoko
2006-06-10
Nop25 is a putative RNA binding nucleolar protein associated with rRNA transcription. The present study was undertaken to determine the mechanism of Nop25 localization in the nucleolus. Deletion experiments of Nop25 amino acid sequence showed Nop25 to contain a nuclear targeting sequence in the N-terminal and a nucleolar targeting sequence in the C-terminal. By expressing derivative peptides from the C-terminal as GFP-fusion proteins in the cells, a lysine and arginine residue-enriched peptide (KRKHPRRAQDSTKKPPSATRTSKTQRRRR) allowed a GFP-fusion protein to be transported and fully retained in the nucleolus. When the peptide was fused with cMyc epitope and expressed in the cells, amore » cMyc epitope was then detected in the nucleolus. Nop25 did not localize in the nucleolus by deletion of the peptide from Nop25. Furthermore, deletion of a subdomain (KRKHPRRAQ) in the peptide or amino acid substitution of lysine and arginine residues in the subdomain resulted in the loss of Nop25 nucleolar localization. These results suggest that the lysine and arginine residue-enriched peptide is the most prominent nucleolar targeting sequence of Nop25 and that the long stretch of basic residues might play an important role in the nucleolar localization of Nop25. Although Nop25 contained putative SUMOylation, phosphorylation and glycosylation sites, the amino acid substitution in these sites had no effect on the nucleolar localization, thus suggesting that these post-translational modifications did not contribute to the localization of Nop25 in the nucleolus. The treatment of the cells, which expressed a GFP-fusion protein with a nucleolar targeting sequence of Nop25, with RNase A resulted in a complete dislocation of the protein from the nucleolus. These data suggested that the nucleolar targeting sequence might therefore play an important role in the binding of Nop25 to RNA molecules and that the RNA binding of Nop25 might be essential for the nucleolar localization of Nop25.« less
Photophysical Characterization of Enhanced 6-Methylisoxanthopterin Fluorescence in Duplex DNA.
Moreno, Andrew; Knee, J L; Mukerji, Ishita
2016-12-08
The structure and dynamic motions of bases in DNA duplexes and other constructs are important for understanding mechanisms of selectivity and recognition of DNA-binding proteins. The fluorescent guanine analogue, 6-methylisoxanthopterin 6-MI, is well suited to this purpose as it exhibits an unexpected 3- to 4-fold increase in relative quantum yield upon duplex formation when incorporated into the following sequences: ATFAA, AAFTA, or ATFTA (where F represents 6-MI). To better understand some of the factors leading to the 6-MI fluorescence increase upon duplex formation, we characterized the effect of local sequence and structural perturbations on 6-MI photophysics through temperature melts, quantum yield measurements, fluorescence quenching assays, and fluorescence lifetime measurements. By examining 21 sequences we have determined that the duplex-enhanced fluorescence (DEF) depends on the composition of bases adjacent to 6-MI and the presence of adenines at locations n ± 2 from the probe. Investigation of duplex stability and local solvent accessibility measurements support a model in which the DEF arises from a constrained geometry of 6-MI in the duplex, which remains H-bonded to cytosine, stacked with adjacent bases and inaccessible to quenchers. Perturbation of DNA structure through the introduction of an unpaired base 3' to 6-MI or a mismatched basepair increases 6-MI dynamic motion leading to fluorescence quenching and a reduction in quantum yield. Molecular dynamics simulations suggest the enhanced fluorescence results from a greater degree of twist at the X-F step relative to the quenched duplexes examined. These results point to a model where adenine residues located at n ± 2 from 6-MI induce a structural geometry with greater twist in the duplex that hinders local motion reducing dynamic quenching and producing an increase in 6-MI fluorescence.
Niche specialization of terrestrial archaeal ammonia oxidizers
Gubry-Rangin, Cécile; Hai, Brigitte; Quince, Christopher; Engel, Marion; Thomson, Bruce C.; James, Phillip; Schloter, Michael; Griffiths, Robert I.; Prosser, James I.; Nicol, Graeme W.
2011-01-01
Soil pH is a major determinant of microbial ecosystem processes and potentially a major driver of evolution, adaptation, and diversity of ammonia oxidizers, which control soil nitrification. Archaea are major components of soil microbial communities and contribute significantly to ammonia oxidation in some soils. To determine whether pH drives evolutionary adaptation and community structure of soil archaeal ammonia oxidizers, sequences of amoA, a key functional gene of ammonia oxidation, were examined in soils at global, regional, and local scales. Globally distributed database sequences clustered into 18 well-supported phylogenetic lineages that dominated specific soil pH ranges classified as acidic (pH <5), acido-neutral (5≤ pH <7), or alkalinophilic (pH ≥7). To determine whether patterns were reproduced at regional and local scales, amoA gene fragments were amplified from DNA extracted from 47 soils in the United Kingdom (pH 3.5–8.7), including a pH-gradient formed by seven soils at a single site (pH 4.5–7.5). High-throughput sequencing and analysis of amoA gene fragments identified an additional, previously undiscovered phylogenetic lineage and revealed similar pH-associated distribution patterns at global, regional, and local scales, which were most evident for the five most abundant clusters. Archaeal amoA abundance and diversity increased with soil pH, which was the only physicochemical characteristic measured that significantly influenced community structure. These results suggest evolution based on specific adaptations to soil pH and niche specialization, resulting in a global distribution of archaeal lineages that have important consequences for soil ecosystem function and nitrogen cycling. PMID:22158986
Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures
Bechtel, Jason M; Wittenschlaeger, Thomas; Dwyer, Trisha; Song, Jun; Arunachalam, Sasi; Ramakrishnan, Sadeesh K; Shepard, Samuel; Fedorov, Alexei
2008-01-01
Background Genomes possess different levels of non-randomness, in particular, an inhomogeneity in their nucleotide composition. Inhomogeneity is manifest from the short-range where neighboring nucleotides influence the choice of base at a site, to the long-range, commonly known as isochores, where a particular base composition can span millions of nucleotides. A separate genomic issue that has yet to be thoroughly elucidated is the role that RNA secondary structure (SS) plays in gene expression. Results We present novel data and approaches that show that a mid-range inhomogeneity (~30 to 1000 nt) not only exists in mammalian genomes but is also significantly associated with strong RNA SS. A whole-genome bioinformatics investigation of local SS in a set of 11,315 non-redundant human pre-mRNA sequences has been carried out. Four distinct components of these molecules (5'-UTRs, exons, introns and 3'-UTRs) were considered separately, since they differ in overall nucleotide composition, sequence motifs and periodicities. For each pre-mRNA component, the abundance of strong local SS (< -25 kcal/mol) was a factor of two to ten greater than a random expectation model. The randomization process preserves the short-range inhomogeneity of the corresponding natural sequences, thus, eliminating short-range signals as possible contributors to any observed phenomena. Conclusion We demonstrate that the excess of strong local SS in pre-mRNAs is linked to the little explored phenomenon of genomic mid-range inhomogeneity (MRI). MRI is an interdependence between nucleotide choice and base composition over a distance of 20–1000 nt. Additionally, we have created a public computational resource to support further study of genomic MRI. PMID:18549495
Lin, H; Rao, V B; Black, L W
1999-06-04
Bacteriophage DNA packaging results from an ATP-driven translocation of concatemeric DNA into the prohead by the phage terminase complexed with the portal vertex dodecamer of the prohead. Functional domains of the bacteriophage T4 terminase and portal gene 20 product (gp20) were determined by mutant analysis and sequence localization within the structural genes. Interaction regions of the portal vertex and large terminase subunit (gp17) were determined by genetic (terminase-portal intergenic suppressor mutations), biochemical (column retention of gp17 and inhibition of in vitro DNA packaging by gp20 peptides), and immunological (co-immunoprecipitation of polymerized gp20 peptide and gp17) studies. The specificity of the interaction was tested by means of a phage T4 HOC (highly antigenicoutercapsid protein) display system in which wild-type, cs20, and scrambled portal peptide sequences were displayed on the HOC protein of phage T4. Binding affinities of these recombinant phages as determined by the retention of these phages by a His-tag immobilized gp17 column, and by co-immunoprecipitation with purified terminase supported the specific nature of the portal protein and terminase interaction sites. In further support of specificity, a gp20 peptide corresponding to a portion of the identified site inhibited packaging whereas the scrambled sequence peptide did not block DNA packaging in vitro. The portal interaction site is localized to 28 residues in the central portion of the linear sequence of gp20 (524 residues). As judged by two pairs of intergenic portal-terminase suppressor mutations, two separate regions of the terminase large subunit gp17 (central and COOH-terminal) interact through hydrophobic contacts at the portal site. Although the terminase apparently interacts with this gp20 portal peptide, polyclonal antibody against the portal peptide appears unable to access it in the native structure, suggesting intimate association of gp20 and gp17 possibly internalizes terminase regions within the portal in the packasome complex. Both similarities and differences are seen in comparison to analogous sites which have been identified in phages T3 and lambda. Copyright 1999 Academic Press.
TCGA Expedition: A Data Acquisition and Management System for TCGA Data
Chandran, Uma R.; Medvedeva, Olga P.; Barmada, M. Michael; Blood, Philip D.; Chakka, Anish; Luthra, Soumya; Ferreira, Antonio; Wong, Kim F.; Lee, Adrian V.; Zhang, Zhihui; Budden, Robert; Scott, J. Ray; Berndt, Annerose; Berg, Jeremy M.; Jacobson, Rebecca S.
2016-01-01
Background The Cancer Genome Atlas Project (TCGA) is a National Cancer Institute effort to profile at least 500 cases of 20 different tumor types using genomic platforms and to make these data, both raw and processed, available to all researchers. TCGA data are currently over 1.2 Petabyte in size and include whole genome sequence (WGS), whole exome sequence, methylation, RNA expression, proteomic, and clinical datasets. Publicly accessible TCGA data are released through public portals, but many challenges exist in navigating and using data obtained from these sites. We developed TCGA Expedition to support the research community focused on computational methods for cancer research. Data obtained, versioned, and archived using TCGA Expedition supports command line access at high-performance computing facilities as well as some functionality with third party tools. For a subset of TCGA data collected at University of Pittsburgh, we also re-associate TCGA data with de-identified data from the electronic health records. Here we describe the software as well as the architecture of our repository, methods for loading of TCGA data to multiple platforms, and security and regulatory controls that conform to federal best practices. Results TCGA Expedition software consists of a set of scripts written in Bash, Python and Java that download, extract, harmonize, version and store all TCGA data and metadata. The software generates a versioned, participant- and sample-centered, local TCGA data directory with metadata structures that directly reference the local data files as well as the original data files. The software supports flexible searches of the data via a web portal, user-centric data tracking tools, and data provenance tools. Using this software, we created a collaborative repository, the Pittsburgh Genome Resource Repository (PGRR) that enabled investigators at our institution to work with all TCGA data formats, and to interrogate these data with analysis pipelines, and associated tools. WGS data are especially challenging for individual investigators to use, due to issues with downloading, storage, and processing; having locally accessible WGS BAM files has proven invaluable. Conclusion Our open-source, freely available TCGA Expedition software can be used to create a local collaborative infrastructure for acquiring, managing, and analyzing TCGA data and other large public datasets. PMID:27788220
A computational genomics pipeline for prokaryotic sequencing projects
Kislyuk, Andrey O.; Katz, Lee S.; Agrawal, Sonia; Hagen, Matthew S.; Conley, Andrew B.; Jayaraman, Pushkala; Nelakuditi, Viswateja; Humphrey, Jay C.; Sammons, Scott A.; Govil, Dhwani; Mair, Raydel D.; Tatti, Kathleen M.; Tondella, Maria L.; Harcourt, Brian H.; Mayer, Leonard W.; Jordan, I. King
2010-01-01
Motivation: New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data. Results: We present a self-contained, automated high-throughput open source genome sequencing and computational genomics pipeline suitable for prokaryotic sequencing projects. The pipeline has been used at the Georgia Institute of Technology and the Centers for Disease Control and Prevention for the analysis of Neisseria meningitidis and Bordetella bronchiseptica genomes. The pipeline is capable of enhanced or manually assisted reference-based assembly using multiple assemblers and modes; gene predictor combining; and functional annotation of genes and gene products. Because every component of the pipeline is executed on a local machine with no need to access resources over the Internet, the pipeline is suitable for projects of a sensitive nature. Annotation of virulence-related features makes the pipeline particularly useful for projects working with pathogenic prokaryotes. Availability and implementation: The pipeline is licensed under the open-source GNU General Public License and available at the Georgia Tech Neisseria Base (http://nbase.biology.gatech.edu/). The pipeline is implemented with a combination of Perl, Bourne Shell and MySQL and is compatible with Linux and other Unix systems. Contact: king.jordan@biology.gatech.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20519285
Local Estimators for Spacecraft Formation Flying
NASA Technical Reports Server (NTRS)
Fathpour, Nanaz; Hadaegh, Fred Y.; Mesbahi, Mehran; Nabi, Marzieh
2011-01-01
A formation estimation architecture for formation flying builds upon the local information exchange among multiple local estimators. Spacecraft formation flying involves the coordination of states among multiple spacecraft through relative sensing, inter-spacecraft communication, and control. Most existing formation flying estimation algorithms can only be supported via highly centralized, all-to-all, static relative sensing. New algorithms are needed that are scalable, modular, and robust to variations in the topology and link characteristics of the formation exchange network. These distributed algorithms should rely on a local information-exchange network, relaxing the assumptions on existing algorithms. In this research, it was shown that only local observability is required to design a formation estimator and control law. The approach relies on breaking up the overall information-exchange network into sequence of local subnetworks, and invoking an agreement-type filter to reach consensus among local estimators within each local network. State estimates were obtained by a set of local measurements that were passed through a set of communicating Kalman filters to reach an overall state estimation for the formation. An optimization approach was also presented by means of which diffused estimates over the network can be incorporated in the local estimates obtained by each estimator via local measurements. This approach compares favorably with that obtained by a centralized Kalman filter, which requires complete knowledge of the raw measurement available to each estimator.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50).
Williams, John L; Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey
2017-10-01
Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2 n = 50) and the swamp (2 n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. © The Author 2017. Published by Oxford University Press.
Peyretaillade, E; Broussolle, V; Peyret, P; Méténier, G; Gouy, M; Vivarès, C P
1998-06-01
An intronless gene encoding a protein of 592 amino acid residues with similarity to 70-kDa heat shock proteins (HSP70s) has been cloned and sequenced from the amitochondrial protist Encephalitozoon cuniculi (phylum Microsporidia). Southern blot analyses show the presence of a single gene copy located on chromosome XI. The encoded protein exhibits an N-terminal hydrophobic leader sequence and two motifs shared by proteobacterial and mitochondrially expressed HSP70 homologs. Phylogenetic analysis using maximum likelihood and evolutionary distances place the E. cuniculi sequence in the cluster of mitochondrially expressed HSP70s, with a higher evolutionary rate than those of homologous sequences. Similar results were obtained after cloning a fragment of the homologous gene in the closely related species E. hellem. The presence of a nuclear targeting signal-like sequence supports a role of the Encephalitozoon HSP70 as a molecular chaperone of nuclear proteins. No evidence for cytosolic or endoplasmic reticulum forms of HSP70 was obtained through PCR amplification. These data suggest that Encephalitozoon species have evolved from an ancestor bearing mitochondria, which is in disagreement with the postulated presymbiotic origin of Microsporidia. The specific role and intracellular localization of the mitochondrial HSP70-like protein remain to be elucidated.
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daily, Jeffrey A.
Sequence alignment algorithms are a key component of many bioinformatics applications. Though various fast Smith-Waterman local sequence alignment implementations have been developed for x86 CPUs, most are embedded into larger database search tools. In addition, fast implementations of Needleman-Wunsch global sequence alignment and its semi-global variants are not as widespread. This article presents the first software library for local, global, and semi-global pairwise intra-sequence alignments and improves the performance of previous intra-sequence implementations. As a result, a faster intra-sequence pairwise alignment implementation is described and benchmarked. Using a 375 residue query sequence a speed of 136 billion cell updates permore » second (GCUPS) was achieved on a dual Intel Xeon E5-2670 12-core processor system, the highest reported for an implementation based on Farrar’s ’striped’ approach. When using only a single thread, parasail was 1.7 times faster than Rognes’s SWIPE. For many score matrices, parasail is faster than BLAST. The software library is designed for 64 bit Linux, OS X, or Windows on processors with SSE2, SSE41, or AVX2. Source code is available from https://github.com/jeffdaily/parasail under the Battelle BSD-style license. In conclusion, applications that require optimal alignment scores could benefit from the improved performance. For the first time, SIMD global, semi-global, and local alignments are available in a stand-alone C library.« less
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments
Daily, Jeffrey A.
2016-02-10
Sequence alignment algorithms are a key component of many bioinformatics applications. Though various fast Smith-Waterman local sequence alignment implementations have been developed for x86 CPUs, most are embedded into larger database search tools. In addition, fast implementations of Needleman-Wunsch global sequence alignment and its semi-global variants are not as widespread. This article presents the first software library for local, global, and semi-global pairwise intra-sequence alignments and improves the performance of previous intra-sequence implementations. As a result, a faster intra-sequence pairwise alignment implementation is described and benchmarked. Using a 375 residue query sequence a speed of 136 billion cell updates permore » second (GCUPS) was achieved on a dual Intel Xeon E5-2670 12-core processor system, the highest reported for an implementation based on Farrar’s ’striped’ approach. When using only a single thread, parasail was 1.7 times faster than Rognes’s SWIPE. For many score matrices, parasail is faster than BLAST. The software library is designed for 64 bit Linux, OS X, or Windows on processors with SSE2, SSE41, or AVX2. Source code is available from https://github.com/jeffdaily/parasail under the Battelle BSD-style license. In conclusion, applications that require optimal alignment scores could benefit from the improved performance. For the first time, SIMD global, semi-global, and local alignments are available in a stand-alone C library.« less
Schiavo, Giuseppina; Hoffmann, Orsolya Ivett; Ribani, Anisa; Utzeri, Valerio Joe; Ghionda, Marco Ciro; Bertolini, Francesca; Geraci, Claudia; Bovo, Samuele; Fontanesi, Luca
2017-10-01
Nuclear DNA sequences of mitochondrial origin (numts) are derived by insertion of mitochondrial DNA (mtDNA), into the nuclear genome. In this study, we provide, for the first time, a genome picture of numts inserted in the pig nuclear genome. The Sus scrofa reference nuclear genome (Sscrofa10.2) was aligned with circularized and consensus mtDNA sequences using LAST software. A total of 430 numt sequences that may represent 246 different numt integration events (57 numt regions determined by at least two numt sequences and 189 singletons) were identified, covering about 0.0078% of the nuclear genome. Numt integration events were correlated (0.99) to the chromosome length. The longest numt sequence (about 11 kbp) was located on SSC2. Six numts were sequenced and PCR amplified in pigs of European commercial and local pig breeds, of the Chinese Meishan breed and in European wild boars. Three of them were polymorphic for the presence or absence of the insertion. Surprisingly, the estimated age of insertion of two of the three polymorphic numts was more ancient than that of the speciation time of the Sus scrofa, supporting that these polymorphic sites were originated from interspecies admixture that contributed to shape the pig genome. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences
Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.
2017-01-01
An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Complete Sequence and Analysis of Coconut Palm (Cocos nucifera) Mitochondrial Genome.
Aljohi, Hasan Awad; Liu, Wanfei; Lin, Qiang; Zhao, Yuhui; Zeng, Jingyao; Alamer, Ali; Alanazi, Ibrahim O; Alawad, Abdullah O; Al-Sadi, Abdullah M; Hu, Songnian; Yu, Jun
2016-01-01
Coconut (Cocos nucifera L.), a member of the palm family (Arecaceae), is one of the most economically important crops in tropics, serving as an important source of food, drink, fuel, medicine, and construction material. Here we report an assembly of the coconut (C. nucifera, Oman local Tall cultivar) mitochondrial (mt) genome based on next-generation sequencing data. This genome, 678,653bp in length and 45.5% in GC content, encodes 72 proteins, 9 pseudogenes, 23 tRNAs, and 3 ribosomal RNAs. Within the assembly, we find that the chloroplast (cp) derived regions account for 5.07% of the total assembly length, including 13 proteins, 2 pseudogenes, and 11 tRNAs. The mt genome has a relatively large fraction of repeat content (17.26%), including both forward (tandem) and inverted (palindromic) repeats. Sequence variation analysis shows that the Ti/Tv ratio of the mt genome is lower as compared to that of the nuclear genome and neutral expectation. By combining public RNA-Seq data for coconut, we identify 734 RNA editing sites supported by at least two datasets. In summary, our data provides the second complete mt genome sequence in the family Arecaceae, essential for further investigations on mitochondrial biology of seed plants.
Bateman, Nicholas W; Shoji, Yutaka; Conrads, Kelly A; Stroop, Kevin D; Hamilton, Chad A; Darcy, Kathleen M; Maxwell, George L; Risinger, John I; Conrads, Thomas P
2016-01-01
AT-rich interactive domain-containing protein 1A (ARID1A) is a recently identified nuclear tumor suppressor frequently altered in solid tumor malignancies. We have identified a bipartite-like nuclear localization sequence (NLS) that contributes to nuclear import of ARID1A not previously described. We functionally confirm activity using GFP constructs fused with wild-type or mutant NLS sequences. We further show that cyto-nuclear localized, bipartite NLS mutant ARID1A exhibits greater stability than nuclear-localized, wild-type ARID1A. Identification of this undescribed functional NLS within ARID1A contributes vital insights to rationalize the impact of ARID1A missense mutations observed in patient tumors. Copyright © 2015 Elsevier Inc. All rights reserved.
Dimensions of biodiversity in the Earth mycobiome.
Peay, Kabir G; Kennedy, Peter G; Talbot, Jennifer M
2016-07-01
Fungi represent a large proportion of the genetic diversity on Earth and fungal activity influences the structure of plant and animal communities, as well as rates of ecosystem processes. Large-scale DNA-sequencing datasets are beginning to reveal the dimensions of fungal biodiversity, which seem to be fundamentally different to bacteria, plants and animals. In this Review, we describe the patterns of fungal biodiversity that have been revealed by molecular-based studies. Furthermore, we consider the evidence that supports the roles of different candidate drivers of fungal diversity at a range of spatial scales, as well as the role of dispersal limitation in maintaining regional endemism and influencing local community assembly. Finally, we discuss the ecological mechanisms that are likely to be responsible for the high heterogeneity that is observed in fungal communities at local scales.
A hypothetical learning trajectory for conceptualizing matrices as linear transformations
NASA Astrophysics Data System (ADS)
Andrews-Larson, Christine; Wawro, Megan; Zandieh, Michelle
2017-08-01
In this paper, we present a hypothetical learning trajectory (HLT) aimed at supporting students in developing flexible ways of reasoning about matrices as linear transformations in the context of introductory linear algebra. In our HLT, we highlight the integral role of the instructor in this development. Our HLT is based on the 'Italicizing N' task sequence, in which students work to generate, compose, and invert matrices that correspond to geometric transformations specified within the problem context. In particular, we describe the ways in which the students develop local transformation views of matrix multiplication (focused on individual mappings of input vectors to output vectors) and extend these local views to more global views in which matrices are conceptualized in terms of how they transform a space in a coordinated way.
Exaptation of Bornavirus-Like Nucleoprotein Elements in Afrotherians
Kobayashi, Yuki; Horie, Masayuki; Nakano, Ayumi; Murata, Koichi; Itou, Takuya; Suzuki, Yoshiyuki
2016-01-01
Endogenous bornavirus-like nucleoprotein elements (EBLNs), the nucleotide sequence elements derived from the nucleoprotein gene of ancient bornavirus-like viruses, have been identified in many animal genomes. Here we show evidence that EBLNs encode functional proteins in their host. Some afrotherian EBLNs were observed to have been maintained for more than 83.3 million years under negative selection. Splice variants were expressed from the genomic loci of EBLNs in elephant, and some were translated into proteins. The EBLN proteins appeared to be localized to the rough endoplasmic reticulum in African elephant cells, in contrast to the nuclear localization of bornavirus N. These observations suggest that afrotherian EBLNs have acquired a novel function in their host. Interestingly, genomic sequences of the first exon and its flanking regions in these EBLN loci were homologous to those of transmembrane protein 106B (TMEM106B). The upstream region of the first exon in the EBLN loci exhibited a promoter activity, suggesting that the ability of these EBLNs to be transcribed in the host cell was gained through capturing a partial duplicate of TMEM106B. In conclusion, our results strongly support for exaptation of EBLNs to encode host proteins in afrotherians. PMID:27518265
Distinguishing thrust sequences in gravity-driven fold and thrust belts
NASA Astrophysics Data System (ADS)
Alsop, G. I.; Weinberger, R.; Marco, S.
2018-04-01
Piggyback or foreland-propagating thrust sequences, where younger thrusts develop in the footwalls of existing thrusts, are generally assumed to be the typical order of thrust development in most orogenic settings. However, overstep or 'break-back' sequences, where later thrusts develop above and in the hangingwalls of earlier thrusts, may potentially form during cessation of movement in gravity-driven mass transport deposits (MTDs). In this study, we provide a detailed outcrop-based analysis of such an overstep thrust sequence developed in an MTD in the southern Dead Sea Basin. Evidence that may be used to discriminate overstep thrusting from piggyback thrust sequences within the gravity-driven fold and thrust belt includes upright folds and forethrusts that are cut by younger overlying thrusts. Backthrusts form ideal markers that are also clearly offset and cut by overlying younger forethrusts. Portions of the basal detachment to the thrust system are folded and locally imbricated in footwall synclines below forethrust ramps, and these geometries also support an overstep sequence. However, new 'short-cut' basal detachments develop below these synclines, indicating that movement continued on the basal detachment rather than it being abandoned as in classic overstep sequences. Further evidence for 'synchronous thrusting', where movement on more than one thrust occurs at the same time, is provided by displacement patterns on sequences of thrust ramp imbricates that systematically increases downslope towards the toe of the MTD. Older thrusts that initiate downslope in the broadly overstep sequence continue to move and therefore accrue greater displacements during synchronous thrusting. Our study provides a template to help distinguish different thrust sequences in both orogenic settings and gravity-driven surficial systems, with displacement patterns potentially being imaged in seismic sections across offshore MTDs.
Simulating protein folding initiation sites using an alpha-carbon-only knowledge-based force field
Buck, Patrick M.; Bystroff, Christopher
2015-01-01
Protein folding is a hierarchical process where structure forms locally first, then globally. Some short sequence segments initiate folding through strong structural preferences that are independent of their three-dimensional context in proteins. We have constructed a knowledge-based force field in which the energy functions are conditional on local sequence patterns, as expressed in the hidden Markov model for local structure (HMMSTR). Carbon-alpha force field (CALF) builds sequence specific statistical potentials based on database frequencies for α-carbon virtual bond opening and dihedral angles, pairwise contacts and hydrogen bond donor-acceptor pairs, and simulates folding via Brownian dynamics. We introduce hydrogen bond donor and acceptor potentials as α-carbon probability fields that are conditional on the predicted local sequence. Constant temperature simulations were carried out using 27 peptides selected as putative folding initiation sites, each 12 residues in length, representing several different local structure motifs. Each 0.6 μs trajectory was clustered based on structure. Simulation convergence or representativeness was assessed by subdividing trajectories and comparing clusters. For 21 of the 27 sequences, the largest cluster made up more than half of the total trajectory. Of these 21 sequences, 14 had cluster centers that were at most 2.6 Å root mean square deviation (RMSD) from their native structure in the corresponding full-length protein. To assess the adequacy of the energy function on nonlocal interactions, 11 full length native structures were relaxed using Brownian dynamics simulations. Equilibrated structures deviated from their native states but retained their overall topology and compactness. A simple potential that folds proteins locally and stabilizes proteins globally may enable a more realistic understanding of hierarchical folding pathways. PMID:19137613
Algorithms for optimizing cross-overs in DNA shuffling.
He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris
2012-03-21
DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Random digital encryption secure communication system
NASA Technical Reports Server (NTRS)
Doland, G. D. (Inventor)
1982-01-01
The design of a secure communication system is described. A product code, formed from two pseudorandom sequences of digital bits, is used to encipher or scramble data prior to transmission. The two pseudorandom sequences are periodically changed at intervals before they have had time to repeat. One of the two sequences is transmitted continuously with the scrambled data for synchronization. In the receiver portion of the system, the incoming signal is compared with one of two locally generated pseudorandom sequences until correspondence between the sequences is obtained. At this time, the two locally generated sequences are formed into a product code which deciphers the data from the incoming signal. Provision is made to ensure synchronization of the transmitting and receiving portions of the system.
Song, Jiangning; Yuan, Zheng; Tan, Hao; Huber, Thomas; Burrage, Kevin
2007-12-01
Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications. We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide
Adaptive Local Realignment of Protein Sequences.
DeBlasio, Dan; Kececioglu, John
2018-06-11
While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein's entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising, which finds global parameter settings for an aligner, to now adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment has been implemented within the Opal aligner using the Facet accuracy estimator.
SW#db: GPU-Accelerated Exact Sequence Similarity Database Search.
Korpar, Matija; Šošić, Martin; Blažeka, Dino; Šikić, Mile
2015-01-01
In recent years we have witnessed a growth in sequencing yield, the number of samples sequenced, and as a result-the growth of publicly maintained sequence databases. The increase of data present all around has put high requirements on protein similarity search algorithms with two ever-opposite goals: how to keep the running times acceptable while maintaining a high-enough level of sensitivity. The most time consuming step of similarity search are the local alignments between query and database sequences. This step is usually performed using exact local alignment algorithms such as Smith-Waterman. Due to its quadratic time complexity, alignments of a query to the whole database are usually too slow. Therefore, the majority of the protein similarity search methods prior to doing the exact local alignment apply heuristics to reduce the number of possible candidate sequences in the database. However, there is still a need for the alignment of a query sequence to a reduced database. In this paper we present the SW#db tool and a library for fast exact similarity search. Although its running times, as a standalone tool, are comparable to the running times of BLAST, it is primarily intended to be used for exact local alignment phase in which the database of sequences has already been reduced. It uses both GPU and CPU parallelization and was 4-5 times faster than SSEARCH, 6-25 times faster than CUDASW++ and more than 20 times faster than SSW at the time of writing, using multiple queries on Swiss-prot and Uniref90 databases.
MetaBar - a tool for consistent contextual data acquisition and standards compliant submission.
Hankeln, Wolfgang; Buttigieg, Pier Luigi; Fink, Dennis; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver
2010-06-30
Environmental sequence datasets are increasing at an exponential rate; however, the vast majority of them lack appropriate descriptors like sampling location, time and depth/altitude: generally referred to as metadata or contextual data. The consistent capture and structured submission of these data is crucial for integrated data analysis and ecosystems modeling. The application MetaBar has been developed, to support consistent contextual data acquisition. MetaBar is a spreadsheet and web-based software tool designed to assist users in the consistent acquisition, electronic storage, and submission of contextual data associated to their samples. A preconfigured Microsoft Excel spreadsheet is used to initiate structured contextual data storage in the field or laboratory. Each sample is given a unique identifier and at any stage the sheets can be uploaded to the MetaBar database server. To label samples, identifiers can be printed as barcodes. An intuitive web interface provides quick access to the contextual data in the MetaBar database as well as user and project management capabilities. Export functions facilitate contextual and sequence data submission to the International Nucleotide Sequence Database Collaboration (INSDC), comprising of the DNA DataBase of Japan (DDBJ), the European Molecular Biology Laboratory database (EMBL) and GenBank. MetaBar requests and stores contextual data in compliance to the Genomic Standards Consortium specifications. The MetaBar open source code base for local installation is available under the GNU General Public License version 3 (GNU GPL3). The MetaBar software supports the typical workflow from data acquisition and field-sampling to contextual data enriched sequence submission to an INSDC database. The integration with the megx.net marine Ecological Genomics database and portal facilitates georeferenced data integration and metadata-based comparisons of sampling sites as well as interactive data visualization. The ample export functionalities and the INSDC submission support enable exchange of data across disciplines and safeguarding contextual data.
MetaBar - a tool for consistent contextual data acquisition and standards compliant submission
2010-01-01
Background Environmental sequence datasets are increasing at an exponential rate; however, the vast majority of them lack appropriate descriptors like sampling location, time and depth/altitude: generally referred to as metadata or contextual data. The consistent capture and structured submission of these data is crucial for integrated data analysis and ecosystems modeling. The application MetaBar has been developed, to support consistent contextual data acquisition. Results MetaBar is a spreadsheet and web-based software tool designed to assist users in the consistent acquisition, electronic storage, and submission of contextual data associated to their samples. A preconfigured Microsoft® Excel® spreadsheet is used to initiate structured contextual data storage in the field or laboratory. Each sample is given a unique identifier and at any stage the sheets can be uploaded to the MetaBar database server. To label samples, identifiers can be printed as barcodes. An intuitive web interface provides quick access to the contextual data in the MetaBar database as well as user and project management capabilities. Export functions facilitate contextual and sequence data submission to the International Nucleotide Sequence Database Collaboration (INSDC), comprising of the DNA DataBase of Japan (DDBJ), the European Molecular Biology Laboratory database (EMBL) and GenBank. MetaBar requests and stores contextual data in compliance to the Genomic Standards Consortium specifications. The MetaBar open source code base for local installation is available under the GNU General Public License version 3 (GNU GPL3). Conclusion The MetaBar software supports the typical workflow from data acquisition and field-sampling to contextual data enriched sequence submission to an INSDC database. The integration with the megx.net marine Ecological Genomics database and portal facilitates georeferenced data integration and metadata-based comparisons of sampling sites as well as interactive data visualization. The ample export functionalities and the INSDC submission support enable exchange of data across disciplines and safeguarding contextual data. PMID:20591175
HIV-1 diversity, transmission dynamics and primary drug resistance in Angola.
Bártolo, Inês; Zakovic, Suzana; Martin, Francisco; Palladino, Claudia; Carvalho, Patrícia; Camacho, Ricardo; Thamm, Sven; Clemente, Sofia; Taveira, Nuno
2014-01-01
To assess HIV-1 diversity, transmission dynamics and prevalence of transmitted drug resistance (TDR) in Angola, five years after ART scale-up. Population sequencing of the pol gene was performed on 139 plasma samples collected in 2009 from drug-naive HIV-1 infected individuals living in Luanda. HIV-1 subtypes were determined using phylogenetic analysis. Drug resistance mutations were identified using the Calibrated Population Resistance Tool (CPR). Transmission networks were determined using phylogenetic analysis of all Angolan sequences present in the databases. Evolutionary trends were determined by comparison with a similar survey performed in 2001. 47.1% of the viruses were pure subtypes (all except B), 47.1% were recombinants and 5.8% were untypable. The prevalence of subtype A decreased significantly from 2001 to 2009 (40.0% to 10.8%, P = 0.0019) while the prevalence of unique recombinant forms (URFs) increased > 2-fold (40.0% to 83.1%, P < 0.0001). The most frequent URFs comprised untypable sequences with subtypes H (U/H, n = 7, 10.8%), A (U/A, n = 6, 9.2%) and G (G/U, n = 4, 6.2%). Newly identified U/H recombinants formed a highly supported monophyletic cluster suggesting a local and common origin. TDR mutation K103N was found in one (0.7%) patient (1.6% in 2001). Out of the 364 sequences sampled for transmission network analysis, 130 (35.7%) were part of a transmission network. Forty eight transmission clusters were identified; the majority (56.3%) comprised sequences sampled in 2008-2010 in Luanda which is consistent with a locally fuelled epidemic. Very low genetic distance was found in 27 transmission pairs sampled in the same year, suggesting recent transmission events. Transmission of drug resistant strains was still negligible in Luanda in 2009, five years after the scale-up of ART. The dominance of small and recent transmission clusters and the emergence of new URFs are consistent with a rising HIV-1 epidemics mainly driven by heterosexual transmission.
2014-01-01
Background It is important to predict the quality of a protein structural model before its native structure is known. The method that can predict the absolute local quality of individual residues in a single protein model is rare, yet particularly needed for using, ranking and refining protein models. Results We developed a machine learning tool (SMOQ) that can predict the distance deviation of each residue in a single protein model. SMOQ uses support vector machines (SVM) with protein sequence and structural features (i.e. basic feature set), including amino acid sequence, secondary structures, solvent accessibilities, and residue-residue contacts to make predictions. We also trained a SVM model with two new additional features (profiles and SOV scores) on 20 CASP8 targets and found that including them can only improve the performance when real deviations between native and model are higher than 5Å. The SMOQ tool finally released uses the basic feature set trained on 85 CASP8 targets. Moreover, SMOQ implemented a way to convert predicted local quality scores into a global quality score. SMOQ was tested on the 84 CASP9 single-domain targets. The average difference between the residue-specific distance deviation predicted by our method and the actual distance deviation on the test data is 2.637Å. The global quality prediction accuracy of the tool is comparable to other good tools on the same benchmark. Conclusion SMOQ is a useful tool for protein single model quality assessment. Its source code and executable are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/. PMID:24776231
Cao, Renzhi; Wang, Zheng; Wang, Yiheng; Cheng, Jianlin
2014-04-28
It is important to predict the quality of a protein structural model before its native structure is known. The method that can predict the absolute local quality of individual residues in a single protein model is rare, yet particularly needed for using, ranking and refining protein models. We developed a machine learning tool (SMOQ) that can predict the distance deviation of each residue in a single protein model. SMOQ uses support vector machines (SVM) with protein sequence and structural features (i.e. basic feature set), including amino acid sequence, secondary structures, solvent accessibilities, and residue-residue contacts to make predictions. We also trained a SVM model with two new additional features (profiles and SOV scores) on 20 CASP8 targets and found that including them can only improve the performance when real deviations between native and model are higher than 5Å. The SMOQ tool finally released uses the basic feature set trained on 85 CASP8 targets. Moreover, SMOQ implemented a way to convert predicted local quality scores into a global quality score. SMOQ was tested on the 84 CASP9 single-domain targets. The average difference between the residue-specific distance deviation predicted by our method and the actual distance deviation on the test data is 2.637Å. The global quality prediction accuracy of the tool is comparable to other good tools on the same benchmark. SMOQ is a useful tool for protein single model quality assessment. Its source code and executable are available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/.
Rotondi, Kenneth S; Gierasch, Lila M
2003-07-08
The experiments described here explore the role of local sequence in the folding of cellular retinoic acid binding protein I (CRABP I). This is a 136-residue, 10-stranded, antiparallel beta-barrel protein with seven beta-hairpins and is a member of the intracellular lipid binding protein (iLBP) family. The relative roles of local and global sequence information in governing the folding of this class of proteins are not well-understood. In question is whether the beta-turns are locally defined by short-range interactions within their sequences, and are thus able to play an active role in reducing the conformational space available to the folding chain, or whether the turns are passive, relying upon global forces to form. Short (six- and seven-residue) peptides corresponding to the seven CRABP I turns were analyzed by circular dichroism and NMR for their tendencies to take up the conformations they adopt in the context of the native protein. The results indicate that two of the peptides, encompassing turns III and IV in CRABP I, have a strong intrinsic bias to form native turns. Intriguingly, these turns are on linked hairpins in CRABP I and represent the best-conserved turns in the iLBP family. These results suggest that local sequence may play an important role in narrowing the conformational ensemble of CRABP I during folding.
Lin, Yanqin; Lin, Liangjie; Wei, Zhiliang; Zhong, Jianhui; Chen, Zhong
2016-12-01
To acquire single voxel localized one-dimensional 1 H magnetic resonance spectroscopy (MRS) without J coupling modulations, free from amplitude and phase distortions. A pulse sequence, named PRESSIR, is developed for volume localized MRS without J modulations at arbitrary echo time (TE). The J coupling evolution is suppressed by the J-refocused module that uses a 90° pulse at the midpoint of a double spin echo. The localization performance of the PRESSIR sequence was tested with a two-compartment phantom. The proposed sequence shows similar voxel localization accuracy as PRESS. Both PRESSIR and PRESS sequences were performed on MRS brain phantom and pig brain tissue. PRESS spectra suffer from amplitude and phase distortions due to J modulations, especially under moderate and long TEs, while PRESSIR spectra are almost free from distortions. The PRESSIR sequence proposed herein enables the acquisition of single voxel in-phase MRS within a single scan. It allows an enhanced signal intensity of J coupling metabolites and reducing undesired broad resonances with short T2s while suppressing J modulations. Moreover, it provides an approach for direct measurement of nonoverlapping J coupling peaks and of transverse relaxation times T2s. Magn Reson Med 76:1661-1667, 2016. © 2015 International Society for Magnetic Resonance in Medicine. © 2015 International Society for Magnetic Resonance in Medicine.
DraGnET: Software for storing, managing and analyzing annotated draft genome sequence data
2010-01-01
Background New "next generation" DNA sequencing technologies offer individual researchers the ability to rapidly generate large amounts of genome sequence data at dramatically reduced costs. As a result, a need has arisen for new software tools for storage, management and analysis of genome sequence data. Although bioinformatic tools are available for the analysis and management of genome sequences, limitations still remain. For example, restrictions on the submission of data and use of these tools may be imposed, thereby making them unsuitable for sequencing projects that need to remain in-house or proprietary during their initial stages. Furthermore, the availability and use of next generation sequencing in industrial, governmental and academic environments requires biologist to have access to computational support for the curation and analysis of the data generated; however, this type of support is not always immediately available. Results To address these limitations, we have developed DraGnET (Draft Genome Evaluation Tool). DraGnET is an open source web application which allows researchers, with no experience in programming and database management, to setup their own in-house projects for storing, retrieving, organizing and managing annotated draft and complete genome sequence data. The software provides a web interface for the use of BLAST, allowing users to perform preliminary comparative analysis among multiple genomes. We demonstrate the utility of DraGnET for performing comparative genomics on closely related bacterial strains. Furthermore, DraGnET can be further developed to incorporate additional tools for more sophisticated analyses. Conclusions DraGnET is designed for use either by individual researchers or as a collaborative tool available through Internet (or Intranet) deployment. For genome projects that require genome sequencing data to initially remain proprietary, DraGnET provides the means for researchers to keep their data in-house for analysis using local programs or until it is made publicly available, at which point it may be uploaded to additional analysis software applications. The DraGnET home page is available at http://www.dragnet.cvm.iastate.edu and includes example files for examining the functionalities, a link for downloading the DraGnET setup package and a link to the DraGnET source code hosted with full documentation on SourceForge. PMID:20175920
NASA Astrophysics Data System (ADS)
Manduca, C. A.
2017-12-01
To develop a diverse geoscience workforce, the EarthConnections collective impact alliance is developing regionally focused, Earth education pathways. These pathways support and guide students from engagement in relevant, Earth-related science at an early age through the many steps and transitions to geoscience-related careers. Rooted in existing regional activities, pathways are developed using a process that engages regional stakeholders and community members with EarthConnections partners. Together they connect, sequence, and create multiple learning opportunities that link geoscience education and community service to address one or more local geoscience issues. Three initial pilots are demonstrating different starting points and strategies for creating pathways that serve community needs while supporting geoscience education. The San Bernardino pilot is leveraging existing academic relationships and programs; the Atlanta pilot is building into existing community activities; and the Oklahoma Tribal Nations pilot is co-constructing a pathway focus and approach. The project is using pathway mapping and a collective impact framework to support and monitor progress. The goal is to develop processes and activities that can help other communities develop similar community-based geoscience pathways. By intertwining Earth education with local community service we aspire to increase the resilience of communities in the face of environmental hazards and limited Earth resources.
Identification of a nuclear localization sequence in the polyomavirus capsid protein VP2
NASA Technical Reports Server (NTRS)
Chang, D.; Haynes, J. I. 2nd; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)
1992-01-01
A nuclear localization signal (NLS) has been identified in the C-terminal (Glu307-Glu-Asp-Gly-Pro-Gln-Lys-Lys-Lys-Arg-Arg-Leu318) amino acid sequence of the polyomavirus minor capsid protein VP2. The importance of this amino acid sequence for nuclear transport of newly synthesized VP2 was demonstrated by a genetic "subtractive" study using the constructs pSG5VP2 (expressing full-length VP2) and pSG5 delta 3VP2 (expressing truncated VP2, lacking amino acids Glu307-Leu318). These constructs were transfected into COS-7 cells, and the intracellular localization of the VP2 protein was determined by indirect immunofluorescence. These studies revealed that the full-length VP2 was localized in the nucleus, while the truncated VP2 protein was localized in the cytoplasm and not transported to the nucleus. A biochemical "additive" approach was also used to determine whether this sequence could target nonnuclear proteins to the nucleus. A synthetic peptide identical to VP2 amino acids Glu307-Leu318 was cross-linked to the nonnuclear proteins bovine serum albumin (BSA) or immunoglobulin G (IgG). The conjugates were then labeled with fluorescein isothiocyanate and microinjected into the cytoplasm of NIH 3T6 cells. Both conjugates localized in the nucleus of the microinjected cells, whereas unconjugated BSA and IgG remained in the cytoplasm. Taken together, these genetic subtractive and biochemical additive approaches have identified the C-terminal sequence of polyoma-virus VP2 (containing amino acids Glu307-Leu318) as the NLS of this protein.
Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
Brown, Robert; Pasaniuc, Bogdan
2014-01-01
Inferring the ancestry at each locus in the genome of recently admixed individuals (e.g., Latino Americans) plays a major role in medical and population genetic inferences, ranging from finding disease-risk loci, to inferring recombination rates, to mapping missing contigs in the human genome. Although many methods for local ancestry inference have been proposed, most are designed for use with genotyping arrays and fail to make use of the full spectrum of data available from sequencing. In addition, current haplotype-based approaches are very computationally demanding, requiring large computational time for moderately large sample sizes. Here we present new methods for local ancestry inference that leverage continent-specific variants (CSVs) to attain increased performance over existing approaches in sequenced admixed genomes. A key feature of our approach is that it incorporates the admixed genomes themselves jointly with public datasets, such as 1000 Genomes, to improve the accuracy of CSV calling. We use simulations to show that our approach attains accuracy similar to widely used computationally intensive haplotype-based approaches with large decreases in runtime. Most importantly, we show that our method recovers comparable local ancestries, as the 1000 Genomes consensus local ancestry calls in the real admixed individuals from the 1000 Genomes Project. We extend our approach to account for low-coverage sequencing and show that accurate local ancestry inference can be attained at low sequencing coverage. Finally, we generalize CSVs to sub-continental population-specific variants (sCSVs) and show that in some cases it is possible to determine the sub-continental ancestry for short chromosomal segments on the basis of sCSVs. PMID:24743331
Pérez de Rosas, Alicia R.; Restelli, María F.; Fernández, Cintia J.; Blariza, María J.; García, Beatriz A.
2017-01-01
Here we apply inter-simple sequence repeat (ISSR) markers to explore the fine-scale genetic structure and dispersal in populations of Triatoma infestans. Five selected primers from 30 primers were used to amplify ISSRs by polymerase chain reaction. A total of 90 polymorphic bands were detected across 134 individuals captured from 11 peridomestic sites from the locality of San Martín (Capayán Department, Catamarca Province, Argentina). Significant levels of genetic differentiation suggest limited gene flow among sampling sites. Spatial autocorrelation analysis confirms that dispersal occurs on the scale of ∼469 m, suggesting that insecticide spraying should be extended at least within a radius of ∼500 m around the infested area. Moreover, Bayesian clustering algorithms indicated genetic exchange among different sites analyzed, supporting the hypothesis of an important role of peridomestic structures in the process of reinfestation. PMID:28115670
Khosravani, Azar; Rastegar-Pouyani, Eskandar; Rastegar-Pouyani, Nasrullah; Oraie, Hamzeh; Papenfuss, Theodore J
2017-12-19
Mitochondrial COI sequences were used to investigate species delimitation within the genus Bunopus in Iran. A dataset with a final sequence length of 633 nucleotides including 100 specimens from 31 geographically distant localities across Iran were generated. The result demonstrated that two major clades with strong support can be identified within the genus Bunopus in Iran. Clade A includes Bunopus crassicaudus and two new entities, eastern populations (subclade A2,1) and Shahdad populations (subclade A2,2). The second clade comprises western and southwestern populations (subclade B1,1), Arabian populations (subclade B1,2) and south and southeast populations in Iran, to which Bunopus tuberculatus (subclade B2) is assigned. In addition to Bunopus crassicaudus and B. tuberculatus, three new candidate species in Iran can easily be identified based on the DNA barcoding approach.
Buttenfield, B.P.; Stanislawski, L.V.; Brewer, C.A.
2011-01-01
This paper reports on generalization and data modeling to create reduced scale versions of the National Hydrographic Dataset (NHD) for dissemination through The National Map, the primary data delivery portal for USGS. Our approach distinguishes local differences in physiographic factors, to demonstrate that knowledge about varying terrain (mountainous, hilly or flat) and varying climate (dry or humid) can support decisions about algorithms, parameters, and processing sequences to create generalized, smaller scale data versions which preserve distinct hydrographic patterns in these regions. We work with multiple subbasins of the NHD that provide a range of terrain and climate characteristics. Specifically tailored generalization sequences are used to create simplified versions of the high resolution data, which was compiled for 1:24,000 scale mapping. Results are evaluated cartographically and metrically against a medium resolution benchmark version compiled for 1:100,000, developing coefficients of linear and areal correspondence.
NASA Astrophysics Data System (ADS)
Maier, Oskar; Wilms, Matthias; von der Gablentz, Janina; Krämer, Ulrike; Handels, Heinz
2014-03-01
Automatic segmentation of ischemic stroke lesions in magnetic resonance (MR) images is important in clinical practice and for neuroscientific trials. The key problem is to detect largely inhomogeneous regions of varying sizes, shapes and locations. We present a stroke lesion segmentation method based on local features extracted from multi-spectral MR data that are selected to model a human observer's discrimination criteria. A support vector machine classifier is trained on expert-segmented examples and then used to classify formerly unseen images. Leave-one-out cross validation on eight datasets with lesions of varying appearances is performed, showing our method to compare favourably with other published approaches in terms of accuracy and robustness. Furthermore, we compare a number of feature selectors and closely examine each feature's and MR sequence's contribution.
muBLASTP: database-indexed protein sequence search on multicore CPUs.
Zhang, Jing; Misra, Sanchit; Wang, Hao; Feng, Wu-Chun
2016-11-04
The Basic Local Alignment Search Tool (BLAST) is a fundamental program in the life sciences that searches databases for sequences that are most similar to a query sequence. Currently, the BLAST algorithm utilizes a query-indexed approach. Although many approaches suggest that sequence search with a database index can achieve much higher throughput (e.g., BLAT, SSAHA, and CAFE), they cannot deliver the same level of sensitivity as the query-indexed BLAST, i.e., NCBI BLAST, or they can only support nucleotide sequence search, e.g., MegaBLAST. Due to different challenges and characteristics between query indexing and database indexing, the existing techniques for query-indexed search cannot be used into database indexed search. muBLASTP, a novel database-indexed BLAST for protein sequence search, delivers identical hits returned to NCBI BLAST. On Intel Haswell multicore CPUs, for a single query, the single-threaded muBLASTP achieves up to a 4.41-fold speedup for alignment stages, and up to a 1.75-fold end-to-end speedup over single-threaded NCBI BLAST. For a batch of queries, the multithreaded muBLASTP achieves up to a 5.7-fold speedups for alignment stages, and up to a 4.56-fold end-to-end speedup over multithreaded NCBI BLAST. With a newly designed index structure for protein database and associated optimizations in BLASTP algorithm, we re-factored BLASTP algorithm for modern multicore processors that achieves much higher throughput with acceptable memory footprint for the database index.
SOLAR ERUPTION AND LOCAL MAGNETIC PARAMETERS
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Jeongwoo; Chae, Jongchul; Liu, Chang
It is now a common practice to use local magnetic parameters such as magnetic decay index for explaining solar eruptions from active regions, but there can be an alternative view that the global properties of the source region should be counted as a more important factor. We discuss this issue based on Solar Dynamics Observatory observations of the three successive eruptions within 1.5 hr from the NOAA active region 11444 and the magnetic parameters calculated using the nonlinear force-free field model. Two violent eruptions occurred in the regions with relatively high magnetic twist number (0.5–1.5) and high decay index (0.9–1.1)more » at the nominal height of the filament (12″) and otherwise a mild eruption occurred, which supports the local-parameter paradigm. Our main point is that the time sequence of the eruptions did not go with these parameters. It is argued that an additional factor, in the form of stabilizing force, should operate to determine the onset of the first eruption and temporal behaviors of subsequent eruptions. As supporting evidence, we report that the heating and fast plasma flow continuing for a timescale of an hour was the direct cause for the first eruption and that the unidirectional propagation of the disturbance determined the timing of subsequent eruptions. Both of these factors are associated with the overall magnetic structure rather than local magnetic properties of the active region.« less
Predicting the host of influenza viruses based on the word vector.
Xu, Beibei; Tan, Zhiying; Li, Kenli; Jiang, Taijiao; Peng, Yousong
2017-01-01
Newly emerging influenza viruses continue to threaten public health. A rapid determination of the host range of newly discovered influenza viruses would assist in early assessment of their risk. Here, we attempted to predict the host of influenza viruses using the Support Vector Machine (SVM) classifier based on the word vector, a new representation and feature extraction method for biological sequences. The results show that the length of the word within the word vector, the sequence type (DNA or protein) and the species from which the sequences were derived for generating the word vector all influence the performance of models in predicting the host of influenza viruses. In nearly all cases, the models built on the surface proteins hemagglutinin (HA) and neuraminidase (NA) (or their genes) produced better results than internal influenza proteins (or their genes). The best performance was achieved when the model was built on the HA gene based on word vectors (words of three-letters long) generated from DNA sequences of the influenza virus. This results in accuracies of 99.7% for avian, 96.9% for human and 90.6% for swine influenza viruses. Compared to the method of sequence homology best-hit searches using the Basic Local Alignment Search Tool (BLAST), the word vector-based models still need further improvements in predicting the host of influenza A viruses.
A Toolkit for ARB to Integrate Custom Databases and Externally Built Phylogenies
Essinger, Steven D.; Reichenberger, Erin; Morrison, Calvin; ...
2015-01-21
Researchers are perpetually amassing biological sequence data. The computational approaches employed by ecologists for organizing this data (e.g. alignment, phylogeny, etc.) typically scale nonlinearly in execution time with the size of the dataset. This often serves as a bottleneck for processing experimental data since many molecular studies are characterized by massive datasets. To keep up with experimental data demands, ecologists are forced to choose between continually upgrading expensive in-house computer hardware or outsourcing the most demanding computations to the cloud. Outsourcing is attractive since it is the least expensive option, but does not necessarily allow direct user interaction with themore » data for exploratory analysis. Desktop analytical tools such as ARB are indispensable for this purpose, but they do not necessarily offer a convenient solution for the coordination and integration of datasets between local and outsourced destinations. Therefore, researchers are currently left with an undesirable tradeoff between computational throughput and analytical capability. To mitigate this tradeoff we introduce a software package to leverage the utility of the interactive exploratory tools offered by ARB with the computational throughput of cloud-based resources. Our pipeline serves as middleware between the desktop and the cloud allowing researchers to form local custom databases containing sequences and metadata from multiple resources and a method for linking data outsourced for computation back to the local database. Furthermore, a tutorial implementation of the toolkit is provided in the supporting information, S1 Tutorial.« less
A Toolkit for ARB to Integrate Custom Databases and Externally Built Phylogenies
Essinger, Steven D.; Reichenberger, Erin; Morrison, Calvin; Blackwood, Christopher B.; Rosen, Gail L.
2015-01-01
Researchers are perpetually amassing biological sequence data. The computational approaches employed by ecologists for organizing this data (e.g. alignment, phylogeny, etc.) typically scale nonlinearly in execution time with the size of the dataset. This often serves as a bottleneck for processing experimental data since many molecular studies are characterized by massive datasets. To keep up with experimental data demands, ecologists are forced to choose between continually upgrading expensive in-house computer hardware or outsourcing the most demanding computations to the cloud. Outsourcing is attractive since it is the least expensive option, but does not necessarily allow direct user interaction with the data for exploratory analysis. Desktop analytical tools such as ARB are indispensable for this purpose, but they do not necessarily offer a convenient solution for the coordination and integration of datasets between local and outsourced destinations. Therefore, researchers are currently left with an undesirable tradeoff between computational throughput and analytical capability. To mitigate this tradeoff we introduce a software package to leverage the utility of the interactive exploratory tools offered by ARB with the computational throughput of cloud-based resources. Our pipeline serves as middleware between the desktop and the cloud allowing researchers to form local custom databases containing sequences and metadata from multiple resources and a method for linking data outsourced for computation back to the local database. A tutorial implementation of the toolkit is provided in the supporting information, S1 Tutorial. Availability: http://www.ece.drexel.edu/gailr/EESI/tutorial.php. PMID:25607539
Bedbrook, Claire N; Yang, Kevin K; Rice, Austin J; Gradinaru, Viviana; Arnold, Frances H
2017-10-01
There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well.
Rice, Austin J.; Gradinaru, Viviana; Arnold, Frances H.
2017-01-01
There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well. PMID:29059183
A 3D approximate maximum likelihood localization solver
DOE Office of Scientific and Technical Information (OSTI.GOV)
2016-09-23
A robust three-dimensional solver was needed to accurately and efficiently estimate the time sequence of locations of fish tagged with acoustic transmitters and vocalizing marine mammals to describe in sufficient detail the information needed to assess the function of dam-passage design alternatives and support Marine Renewable Energy. An approximate maximum likelihood solver was developed using measurements of time difference of arrival from all hydrophones in receiving arrays on which a transmission was detected. Field experiments demonstrated that the developed solver performed significantly better in tracking efficiency and accuracy than other solvers described in the literature.
Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji
2006-02-28
Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of "chimera proteins." In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study
Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji
2006-01-01
Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of “chimera proteins.” In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape. PMID:16488978
New powerful statistics for alignment-free sequence comparison under a pattern transfer model.
Liu, Xuemei; Wan, Lin; Li, Jing; Reinert, Gesine; Waterman, Michael S; Sun, Fengzhu
2011-09-07
Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D*2 and D(s)2 showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D*2 and D(s)2 by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model. Copyright © 2011 Elsevier Ltd. All rights reserved.
New Powerful Statistics for Alignment-free Sequence Comparison Under a Pattern Transfer Model
Liu, Xuemei; Wan, Lin; Li, Jing; Reinert, Gesine; Waterman, Michael S.; Sun, Fengzhu
2011-01-01
Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D2∗ and D2s showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D2∗ and D2s by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model. PMID:21723298
Global and local pitch perception in children with developmental dyslexia.
Ziegler, Johannes C; Pech-Georgel, Catherine; George, Florence; Foxton, Jessica M
2012-03-01
This study investigated global versus local pitch pattern perception in children with dyslexia aged between 8 and 11 years. Children listened to two consecutive 4-tone pitch sequences while performing a same/different task. On the different trials, sequences either preserved the contour (local condition) or they violated the contour (global condition). Compared to normally developing children, dyslexics showed robust pitch perception deficits in the local but not the global condition. This finding was replicated in a simple pitch direction task, which minimizes sequencing and short term memory. Results are consistent with a left-hemisphere deficit in dyslexia because local pitch changes are supposedly processed by the left hemisphere, whereas global pitch changes are processed by the right hemisphere. The present data suggest a link between impaired pitch processing and abnormal phonological development in children with dyslexia, which makes pitch pattern processing a potent tool for early diagnosis and remediation of dyslexia. Copyright © 2011 Elsevier Inc. All rights reserved.
Accounting for Protein Subcellular Localization: A Compartmental Map of the Rat Liver Proteome*
Jadot, Michel; Boonen, Marielle; Thirion, Jaqueline; Wang, Nan; Xing, Jinchuan; Zhao, Caifeng; Tannous, Abla; Qian, Meiqian; Zheng, Haiyan; Everett, John K.; Moore, Dirk F.; Sleat, David E.; Lobel, Peter
2017-01-01
Accurate knowledge of the intracellular location of proteins is important for numerous areas of biomedical research including assessing fidelity of putative protein-protein interactions, modeling cellular processes at a system-wide level and investigating metabolic and disease pathways. Many proteins have not been localized, or have been incompletely localized, partly because most studies do not account for entire subcellular distribution. Thus, proteins are frequently assigned to one organelle whereas a significant fraction may reside elsewhere. As a step toward a comprehensive cellular map, we used subcellular fractionation with classic balance sheet analysis and isobaric labeling/quantitative mass spectrometry to assign locations to >6000 rat liver proteins. We provide quantitative data and error estimates describing the distribution of each protein among the eight major cellular compartments: nucleus, mitochondria, lysosomes, peroxisomes, endoplasmic reticulum, Golgi, plasma membrane and cytosol. Accounting for total intracellular distribution improves quality of organelle assignments and assigns proteins with multiple locations. Protein assignments and supporting data are available online through the Prolocate website (http://prolocate.cabm.rutgers.edu). As an example of the utility of this data set, we have used organelle assignments to help analyze whole exome sequencing data from an infant dying at 6 months of age from a suspected neurodegenerative lysosomal storage disorder of unknown etiology. Sequencing data was prioritized using lists of lysosomal proteins comprising well-established residents of this organelle as well as novel candidates identified in this study. The latter included copper transporter 1, encoded by SLC31A1, which we localized to both the plasma membrane and lysosome. The patient harbors two predicted loss of function mutations in SLC31A1, suggesting that this may represent a heretofore undescribed recessive lysosomal storage disease gene. PMID:27923875
Accounting for Protein Subcellular Localization: A Compartmental Map of the Rat Liver Proteome.
Jadot, Michel; Boonen, Marielle; Thirion, Jaqueline; Wang, Nan; Xing, Jinchuan; Zhao, Caifeng; Tannous, Abla; Qian, Meiqian; Zheng, Haiyan; Everett, John K; Moore, Dirk F; Sleat, David E; Lobel, Peter
2017-02-01
Accurate knowledge of the intracellular location of proteins is important for numerous areas of biomedical research including assessing fidelity of putative protein-protein interactions, modeling cellular processes at a system-wide level and investigating metabolic and disease pathways. Many proteins have not been localized, or have been incompletely localized, partly because most studies do not account for entire subcellular distribution. Thus, proteins are frequently assigned to one organelle whereas a significant fraction may reside elsewhere. As a step toward a comprehensive cellular map, we used subcellular fractionation with classic balance sheet analysis and isobaric labeling/quantitative mass spectrometry to assign locations to >6000 rat liver proteins. We provide quantitative data and error estimates describing the distribution of each protein among the eight major cellular compartments: nucleus, mitochondria, lysosomes, peroxisomes, endoplasmic reticulum, Golgi, plasma membrane and cytosol. Accounting for total intracellular distribution improves quality of organelle assignments and assigns proteins with multiple locations. Protein assignments and supporting data are available online through the Prolocate website (http://prolocate.cabm.rutgers.edu). As an example of the utility of this data set, we have used organelle assignments to help analyze whole exome sequencing data from an infant dying at 6 months of age from a suspected neurodegenerative lysosomal storage disorder of unknown etiology. Sequencing data was prioritized using lists of lysosomal proteins comprising well-established residents of this organelle as well as novel candidates identified in this study. The latter included copper transporter 1, encoded by SLC31A1, which we localized to both the plasma membrane and lysosome. The patient harbors two predicted loss of function mutations in SLC31A1, suggesting that this may represent a heretofore undescribed recessive lysosomal storage disease gene. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
2014-01-01
Background Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site’s rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site’s Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site’s LPD with its rate of evolution. Results We consider two models: an empirical Flexibility Model and a mechanistic Stress Model. The Flexibility Model postulates a linear increase of site-specific rate of evolution with dynamical flexibility. The Stress Model, introduced here, models mutations as random perturbations of the protein’s potential energy landscape, for which we use simple Elastic Network Models (ENMs). To account for natural selection we assume a single active conformation and use basic statistical physics to derive a linear relationship between site-specific evolutionary rates and the local stress of the mutant’s active conformation. We compare both models on a large and diverse dataset of enzymes. In a protein-by-protein study we found that the Stress Model outperforms the Flexibility Model for most proteins. Pooling all proteins together we show that the Stress Model is strongly supported by the total weight of evidence. Moreover, it accounts for the observed nonlinear dependence of sequence variability on flexibility. Finally, when mutational stress is controlled for, there is very little remaining correlation between sequence variability and dynamical flexibility. Conclusions We developed a mechanistic Stress Model of evolution according to which the rate of evolution of a site is predicted to depend linearly on the local mutational stress of the active conformation. Such local stress is proportional to LPD, so that this model explains the relationship between LPD and evolutionary rate. Moreover, the model also accounts for the nonlinear dependence between evolutionary rate and dynamical flexibility. PMID:24716445
Layer, Franziska; Sanchini, Andrea; Strommenger, Birgit; Cuny, Christiane; Breier, Ann-Christin; Proquitté, Hans; Bührer, Christoph; Schenkel, Karl; Bätzing-Feigenbaum, Jörg; Greutelaers, Benedikt; Nübel, Ulrich; Gastmeier, Petra; Eckmanns, Tim; Werner, Guido
2015-10-01
Outbreaks of Staphylococcus aureus are common in neonatal intensive care units (NICUs). Usually they are documented for methicillin-resistant strains, while reports involving methicillin-susceptible S. aureus (MSSA) strains are rare. In this study we report the epidemiological and molecular investigation of an MSSA outbreak in a NICU among preterm neonates. Infection control measures and interventions were commissioned by the Local Public Health Authority and supported by the Robert Koch Institute. To support epidemiological investigations molecular typing was done by spa-typing and Multilocus sequence typing; the relatedness of collected isolates was further elucidated by DNA SmaI-macrorestriction, microarray analysis and bacterial whole genome sequencing. A total of 213 neonates, 123 healthcare workers and 205 neonate parents were analyzed in the period November 2011 to November 2012. The outbreak strain was characterized as a MSSA spa-type t021, able to produce toxic shock syndrome toxin-1 and Enterotoxin A. We identified seventeen neonates (of which two died from toxic shock syndrome), four healthcare workers and three parents putatively involved in the outbreak. Whole-genome sequencing permitted to exclude unrelated cases from the outbreak and to discuss the role of healthcare workers as a reservoir of S. aureus on the NICU. Genome comparisons also indicated the presence of the respective clone on the ward months before the first colonized/infected neonates were detected. Copyright © 2015 Elsevier GmbH. All rights reserved.
STELLAR: fast and exact local alignments
2011-01-01
Background Large-scale comparison of genomic sequences requires reliable tools for the search of local alignments. Practical local aligners are in general fast, but heuristic, and hence sometimes miss significant matches. Results We present here the local pairwise aligner STELLAR that has full sensitivity for ε-alignments, i.e. guarantees to report all local alignments of a given minimal length and maximal error rate. The aligner is composed of two steps, filtering and verification. We apply the SWIFT algorithm for lossless filtering, and have developed a new verification strategy that we prove to be exact. Our results on simulated and real genomic data confirm and quantify the conjecture that heuristic tools like BLAST or BLAT miss a large percentage of significant local alignments. Conclusions STELLAR is very practical and fast on very long sequences which makes it a suitable new tool for finding local alignments between genomic sequences under the edit distance model. Binaries are freely available for Linux, Windows, and Mac OS X at http://www.seqan.de/projects/stellar. The source code is freely distributed with the SeqAn C++ library version 1.3 and later at http://www.seqan.de. PMID:22151882
Complete Sequence and Analysis of Coconut Palm (Cocos nucifera) Mitochondrial Genome
Zhao, Yuhui; Zeng, Jingyao; Alamer, Ali; Alanazi, Ibrahim O.; Alawad, Abdullah O.; Al-Sadi, Abdullah M.; Hu, Songnian; Yu, Jun
2016-01-01
Coconut (Cocos nucifera L.), a member of the palm family (Arecaceae), is one of the most economically important crops in tropics, serving as an important source of food, drink, fuel, medicine, and construction material. Here we report an assembly of the coconut (C. nucifera, Oman local Tall cultivar) mitochondrial (mt) genome based on next-generation sequencing data. This genome, 678,653bp in length and 45.5% in GC content, encodes 72 proteins, 9 pseudogenes, 23 tRNAs, and 3 ribosomal RNAs. Within the assembly, we find that the chloroplast (cp) derived regions account for 5.07% of the total assembly length, including 13 proteins, 2 pseudogenes, and 11 tRNAs. The mt genome has a relatively large fraction of repeat content (17.26%), including both forward (tandem) and inverted (palindromic) repeats. Sequence variation analysis shows that the Ti/Tv ratio of the mt genome is lower as compared to that of the nuclear genome and neutral expectation. By combining public RNA-Seq data for coconut, we identify 734 RNA editing sites supported by at least two datasets. In summary, our data provides the second complete mt genome sequence in the family Arecaceae, essential for further investigations on mitochondrial biology of seed plants. PMID:27736909
NASA Technical Reports Server (NTRS)
Hwang, I.; Harper, J. F.; Liang, F.; Sze, H.
2000-01-01
To investigate how calmodulin regulates a unique subfamily of Ca(2+) pumps found in plants, we examined the kinetic properties of isoform ACA2 identified in Arabidopsis. A recombinant ACA2 was expressed in a yeast K616 mutant deficient in two endogenous Ca(2+) pumps. Orthovanadate-sensitive (45)Ca(2+) transport into vesicles isolated from transformants demonstrated that ACA2 is a Ca(2+) pump. Ca(2+) pumping by the full-length protein (ACA2-1) was 4- to 10-fold lower than that of the N-terminal truncated ACA2-2 (Delta2-80), indicating that the N-terminal domain normally acts to inhibit the pump. An inhibitory sequence (IC(50) = 4 microM) was localized to a region within valine-20 to leucine-44, because a peptide corresponding to this sequence lowered the V(max) and increased the K(m) for Ca(2+) of the constitutively active ACA2-2 to values comparable to the full-length pump. The peptide also blocked the activity (IC(50) = 7 microM) of a Ca(2+) pump (AtECA1) belonging to a second family of Ca(2+) pumps. This inhibitory sequence appears to overlap with a calmodulin-binding site in ACA2, previously mapped between aspartate-19 and arginine-36 (J.F. Harper, B. Hong, I. Hwang, H.Q. Guo, R. Stoddard, J.F. Huang, M.G. Palmgren, H. Sze inverted question mark1998 J Biol Chem 273: 1099-1106). These results support a model in which the pump is kept "unactivated" by an intramolecular interaction between an autoinhibitory sequence located between residues 20 and 44 and a site in the Ca(2+) pump core that is highly conserved between different Ca(2+) pump families. Results further support a model in which activation occurs as a result of Ca(2+)-induced binding of calmodulin to a site overlapping or immediately adjacent to the autoinhibitory sequence.
Burns, Mercedes; Hedin, Marshal; Tsurusaki, Nobuo
2018-01-01
Naturally occurring population variation in reproductive mode presents an opportunity for researchers to test hypotheses regarding the evolution of sex. Asexual reproduction frequently assumes a geographical pattern, in which parthenogenesis-dominated populations are more broadly dispersed than their sexual conspecifics. We evaluate the geographical distribution of genomic signatures associated with parthenogenesis using nuclear and mitochondrial DNA sequence data from two Japanese harvestman sister taxa, Leiobunum manubriatum and Leiobunum globosum . Asexual reproduction is putatively facultative in these species, and female-biased localities are common in habitat margins. Past karyotypic and current cytometric work indicates L. globosum is entirely tetraploid, while L. manubriatum may be either diploid or tetraploid. We estimated species phylogeny, genetic differentiation, diversity, and mitonuclear discordance in females collected across the species range in order to identify range expansion toward marginal habitat, potential for hybrid origin, and persistence of asexual lineages. Our results point to northward expansion of a tetraploid ancestor of L. manubriatum and L. globosum , coupled with support for greater male gene flow in southern L. manubriatum localities. Specimens from localities in the Tohoku and Hokkaido regions were indistinct, particularly those of L. globosum , potentially due to little mitochondrial differentiation or haplotypic variation. Although L. manubriatum overlaps with L. globosum across its entire range, L. globosum was reconstructed as monophyletic with strong support using mtDNA, and marginal support with nuclear loci. Ultimately, we find evidence for continued sexual reproduction in both species and describe opportunities to clarify the rate and mechanism of parthenogenesis.
Matsubara, Kazumi; Uno, Yoshinobu; Srikulnath, Kornsorn; Seki, Risako; Nishida, Chizuko; Matsuda, Yoichi
2015-12-01
Highly repetitive DNA sequences of the centromeric heterochromatin provide valuable molecular cytogenetic markers for the investigation of genomic compartmentalization in the macrochromosomes and microchromosomes of sauropsids. Here, the relationship between centromeric heterochromatin and karyotype evolution was examined using cloned repetitive DNA sequences from two snake species, the habu snake (Protobothrops flavoviridis, Crotalinae, Viperidae) and Burmese python (Python bivittatus, Pythonidae). Three satellite DNA (stDNA) families were isolated from the heterochromatin of these snakes: 168-bp PFL-MspI from P. flavoviridis and 196-bp PBI-DdeI and 174-bp PBI-MspI from P. bivittatus. The PFL-MspI and PBI-DdeI sequences were localized to the centromeric regions of most chromosomes in the respective species, suggesting that the two sequences were the major components of the centromeric heterochromatin in these organisms. The PBI-MspI sequence was localized to the pericentromeric region of four chromosome pairs. The PFL-MspI and the PBI-DdeI sequences were conserved only in the genome of closely related species, Gloydius blomhoffii (Crotalinae) and Python molurus, respectively, although their locations on the chromosomes were slightly different. In contrast, the PBI-MspI sequence was also in the genomes of P. molurus and Boa constrictor (Boidae), and additionally localized to the centromeric regions of eight chromosome pairs in B. constrictor, suggesting that this sequence originated in the genome of a common ancestor of Pythonidae and Boidae, approximately 86 million years ago. The three stDNA sequences showed no genomic compartmentalization between the macrochromosomes and microchromosomes, suggesting that homogenization of the centromeric and/or pericentromeric stDNA sequences occurred in the macrochromosomes and microchromosomes of these snakes.
García, O; Fregel, R; Larruga, J M; Álvarez, V; Yurrebaso, I; Cabrera, V M; González, A M
2011-01-01
It has been proposed that the distribution patterns and coalescence ages found in Europeans for mitochondrial DNA (mtDNA) haplogroups V, H1 and H3 are the result of a post-glacial expansion from a Franco-Cantabrian refuge that recolonized central and northern areas. In contrast, in this refined mtDNA study of the Cantabrian Cornice that contributes 413 partial and 9 complete new mtDNA sequences, including a large Basque sample and a sample of Asturians, no experimental evidence was found to support the human refuge-expansion theory. In fact, all measures of gene diversity point to the Cantabrian Cornice in general and the Basques in particular, as less polymorphic for V, H1 and H3 than other southern regions in Iberia or in Central Europe. Genetic distances show the Cantabrian Cornice is a very heterogeneous region with significant local differences. The analysis of several minor subhaplogroups, based on complete sequences, also suggests different focal expansions over a local and peninsular range that did not affect continental Europe. Furthermore, all detected clinal trends show stronger longitudinal than latitudinal profiles. In Northern Iberia, it seems that the highest diversity values for some haplogroups with Mesolithic coalescence ages are centred on the Mediterranean side, including Catalonia and South-eastern France.
TABLE D - WMO AND LOCAL (NCEP) DESCRIPTORS AS WELL AS THOSE AWAITING
sequences common to satellite observations None 3 05 Meteorological or hydrological sequences common to Vertical sounding sequences (conventional data) None 3 10 Vertical sounding sequences (satellite data) None (satellite data) None 3 13 Sequences common to image data None 3 14 Reserved None 3 15 Oceanographic report
The red-sequence of 72 WINGS local galaxy clusters
NASA Astrophysics Data System (ADS)
Valentinuzzi, T.; Poggianti, B. M.; Fasano, G.; D'Onofrio, M.; Moretti, A.; Ramella, M.; Biviano, A.; Fritz, J.; Varela, J.; Bettoni, D.; Vulcani, B.; Moles, M.; Couch, W. J.; Dressler, A.; Kjærgaard, P.; Omizzolo, A.; Cava, A.
2011-12-01
We study the color - magnitude red sequence and blue fraction of 72 X-ray selected galaxy clusters at z = 0.04-0.07 from the WINGS survey, searching for correlations between the characteristics of the red sequence (RS) and the environment. We consider the slope and scatter of the red sequence, the number ratio of red luminous-to-faint galaxies, the blue fraction, and the fractions of ellipticals, S0s, and spirals that compose the RS. None of these quantities correlate with the cluster velocity dispersion, X-ray luminosity, number of cluster substructures, BCG prevalence over next brightest galaxies, and the spatial concentration of ellipticals. The properties of the RS, instead, depend strongly on local galaxy density. Higher density regions have a smaller RS scatter, a higher luminous-to-faint ratio, a lower blue fraction, and a lower spiral fraction on the RS. Our results clearly illustrate the prominent effect of the local density in setting the epoch when galaxies become passive and join the red sequence, as opposed to the mass of the galaxy host structure.
Chirp- and random-based coded ultrasonic excitation for localized blood-brain barrier opening
Kamimura, HAS; Wang, S; Wu, S-Y; Karakatsani, ME; Acosta, C; Carneiro, AAO; Konofagou, EE
2015-01-01
Chirp- and random-based coded excitation methods have been proposed to reduce standing wave formation and improve focusing of transcranial ultrasound. However, no clear evidence has been shown to support the benefits of these ultrasonic excitation sequences in vivo. This study evaluates the chirp and periodic selection of random frequency (PSRF) coded-excitation methods for opening the blood-brain barrier (BBB) in mice. Three groups of mice (n=15) were injected with polydisperse microbubbles and sonicated in the caudate putamen using the chirp/PSRF coded (bandwidth: 1.5-1.9 MHz, peak negative pressure: 0.52 MPa, duration: 30 s) or standard ultrasound (frequency: 1.5 MHz, pressure: 0.52 MPa, burst duration: 20 ms, duration: 5 min) sequences. T1-weighted contrast-enhanced MRI scans were performed to quantitatively analyze focused ultrasound induced BBB opening. The mean opening volumes evaluated from the MRI were 9.38±5.71 mm3, 8.91±3.91 mm3 and 35.47 ± 5.10 mm3 for the chirp, random and regular sonications, respectively. The mean cavitation levels were 55.40±28.43 V.s, 63.87±29.97 V.s and 356.52±257.15 V.s for the chirp, random and regular sonications, respectively. The chirp and PSRF coded pulsing sequences improved the BBB opening localization by inducing lower cavitation levels and smaller opening volumes compared to results of the regular sonication technique. Larger bandwidths were associated with more focused targeting but were limited by the frequency response of the transducer, the skull attenuation and the microbubbles optimal frequency range. The coded methods could therefore facilitate highly localized drug delivery as well as benefit other transcranial ultrasound techniques that use higher pressure levels and higher precision to induce the necessary bioeffects in a brain region while avoiding damage to the surrounding healthy tissue. PMID:26394091
Climate and edaphic controllers influence rhizosphere community assembly for a wild annual grass
Nuccio, Erin E.; Anderson-Furgeson, James; Estera, Katerina Y.; ...
2016-05-09
The interface between roots and soil, known as the rhizosphere, is a dynamic habitat in the soil ecosystem. Unraveling the factors that control rhizosphere community assembly is a key starting point for understanding the diversity of plant-microbial interactions that occur in soil. The goals of this study were to determine how environmental factors shape rhizosphere microbial communities, such as local soil characteristics and the regional climate, and to determine the relative influence of the rhizosphere on microbial community assembly compared to the pressures imposed by the local and regional environment. We identified the bacteria present in the soil immediately adjacentmore » to the roots of wild oat (Avena spp.) in three California grasslands using deep Illumina 16S sequencing. Rhizosphere communities were more similar to each other than to the surrounding soil communities from which they were derived, despite the fact that the grasslands studied were separated by hundreds of kilometers. The rhizosphere was the dominant factor structuring bacterial community composition (38% variance explained), and was comparable in magnitude to the combined local and regional effects (22% and 21%, respectively). Rhizosphere communities were most influenced by factors related to the regional climate (soil moisture and temperature), while background soil communities were more influenced by soil characteristics (pH, CEC, exchangeable cations, clay content). The Avena core microbiome was strongly phylogenetically clustered according to the metrics NRI and NTI, which indicates that selective processes likely shaped these communities. Furthermore, 17% of these taxa were not detectable in the background soil, even with a robust sequencing depth of approximately 70,000 sequences per sample. In conclusion, these results support the hypothesis that roots select less abundant or possibly rare populations in the soil microbial community, which appear to be lineages of bacteria that have made a physiological tradeoff for rhizosphere competence at the expense of their competitiveness in non-rhizosphere soil.« less
Climate and edaphic controllers influence rhizosphere community assembly for a wild annual grass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nuccio, Erin E.; Anderson-Furgeson, James; Estera, Katerina Y.
The interface between roots and soil, known as the rhizosphere, is a dynamic habitat in the soil ecosystem. Unraveling the factors that control rhizosphere community assembly is a key starting point for understanding the diversity of plant-microbial interactions that occur in soil. The goals of this study were to determine how environmental factors shape rhizosphere microbial communities, such as local soil characteristics and the regional climate, and to determine the relative influence of the rhizosphere on microbial community assembly compared to the pressures imposed by the local and regional environment. We identified the bacteria present in the soil immediately adjacentmore » to the roots of wild oat (Avena spp.) in three California grasslands using deep Illumina 16S sequencing. Rhizosphere communities were more similar to each other than to the surrounding soil communities from which they were derived, despite the fact that the grasslands studied were separated by hundreds of kilometers. The rhizosphere was the dominant factor structuring bacterial community composition (38% variance explained), and was comparable in magnitude to the combined local and regional effects (22% and 21%, respectively). Rhizosphere communities were most influenced by factors related to the regional climate (soil moisture and temperature), while background soil communities were more influenced by soil characteristics (pH, CEC, exchangeable cations, clay content). The Avena core microbiome was strongly phylogenetically clustered according to the metrics NRI and NTI, which indicates that selective processes likely shaped these communities. Furthermore, 17% of these taxa were not detectable in the background soil, even with a robust sequencing depth of approximately 70,000 sequences per sample. In conclusion, these results support the hypothesis that roots select less abundant or possibly rare populations in the soil microbial community, which appear to be lineages of bacteria that have made a physiological tradeoff for rhizosphere competence at the expense of their competitiveness in non-rhizosphere soil.« less
Sequencing intractable DNA to close microbial genomes.
Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A
2012-01-01
Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Characterization of a nuclear localization signal in the foot-and-mouth disease virus polymerase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sanchez-Aparicio, Maria Teresa; Rosas, Maria Flora; Sobrino, Francisco, E-mail: fsobrino@cbm.uam.es
2013-09-15
We have experimentally tested whether the MRKTKLAPT sequence in FMDV 3D protein (residues 16 to 24) can act as a nuclear localization signal (NLS). Mutants with substitutions in two basic residues within this sequence, K18E and K20E, were generated. A decreased nuclear localization was observed in transiently expressed 3D and its precursor 3CD, suggesting a role of K18 and K20 in nuclear targeting. Fusion of MRKTKLAPT to the green fluorescence protein (GFP) increased the nuclear localization of GFP, which was not observed when GFP was fused to the 3D mutated sequences. These results indicate that the sequence MRKTKLAPT can bemore » functionally considered as a NLS. When introduced in a FMDV full length RNA replacements K18E and K20E led to production of revertant viruses that replaced the acidic residues introduced (E) by K, suggesting that the presence of lysins at positions 18 and 20 of 3D is essential for virus multiplication. - Highlights: • The FMDV 3D polymerase contains a nuclear localization signal. • Replacements K18E and K20E decrease nuclear localization of 3D and its precursor 3CD. • Fusion of the MRKTKLAPT 3D motif to GFP increases the nuclear localization of GFP. • Replacements K18E and K20E abolish the ability of MRKTKLAPT to relocate GFP. • RNAs harboring replacements K18E and K20E lead to recovery of revertant FMDVs.« less
Musculoskeletal MRI findings of juvenile localized scleroderma.
Eutsler, Eric P; Horton, Daniel B; Epelman, Monica; Finkel, Terri; Averill, Lauren W
2017-04-01
Juvenile localized scleroderma comprises a group of autoimmune conditions often characterized clinically by an area of skin hardening. In addition to superficial changes in the skin and subcutaneous tissues, juvenile localized scleroderma may involve the deep soft tissues, bones and joints, possibly resulting in functional impairment and pain in addition to cosmetic changes. There is literature documenting the spectrum of findings for deep involvement of localized scleroderma (fascia, muscles, tendons, bones and joints) in adults, but there is limited literature for the condition in children. We aimed to document the spectrum of musculoskeletal magnetic resonance imaging (MRI) findings of both superficial and deep juvenile localized scleroderma involvement in children and to evaluate the utility of various MRI sequences for detecting those findings. Two radiologists retrospectively evaluated 20 MRI studies of the extremities in 14 children with juvenile localized scleroderma. Each imaging sequence was also given a subjective score of 0 (not useful), 1 (somewhat useful) or 2 (most useful for detecting the findings). Deep tissue involvement was detected in 65% of the imaged extremities. Fascial thickening and enhancement were seen in 50% of imaged extremities. Axial T1, axial T1 fat-suppressed (FS) contrast-enhanced and axial fluid-sensitive sequences were rated most useful. Fascial thickening and enhancement were the most commonly encountered deep tissue findings in extremity MRIs of children with juvenile localized scleroderma. Because abnormalities of the skin, subcutaneous tissues and fascia tend to run longitudinally in an affected limb, axial T1, axial fluid-sensitive and axial T1-FS contrast-enhanced sequences should be included in the imaging protocol.
Vences, Miguel; Rasoloariniaina, Jean R; Riemann, Jana C
2018-02-08
The genus Typhleotris contains three poorly known blind fish species, inhabiting aquifers in the limestone plateau of south-western Madagascar. Until recently these species were known from only few localities, and their pattern of genetic differentiation remains poorly studied. In this study we analyse 122 Typhleotris tissue samples collected from 12 localities, spanning the entire known range of the genus, and use DNA sequences to assign these samples to the three species known. The phylogeny based on the mitochondrial marker cox1 revealed three main clades corresponding to the three species: Typhleotris madagascariensis, T. mararybe and T. pauliani, differing by uncorrected pairwise sequence divergences of 6.3-9.8%. The distribution ranges of the three species overlapped widely: T. mararybe was collected only in a southern group of localities, T. madagascariensis was found in both the southern and the central group of localities, and T. pauliani occurred from the northernmost site to the southern group of localities; yet the three species did not share haplotypes in two nuclear genes, except for three individuals that we hypothesize are hybrids of T. pauliani with T. madagascariensis and T. mararybe. This pattern of concordant mitochondrial and nuclear divergence despite sympatry strongly supports the status of all three taxa as separate species. Phylogeographic structure was obvious in T. madagascariensis, with two separate shallow mitochondrial clades occupying (1) the central vs. (2) the southern group of populations, and in T. pauliani, with separate mitochondrial clades for (1) the northern vs. (2) the central/southern populations. The widespread occurrence of these three cave fish species suggests that the aquifers in south-western Madagascar have at least in the past allowed episodic dispersal and gene flow of subterraneous organisms, whereas the phylogeographic pattern of T. madagascariensis and T. pauliani provides evidence for isolation and loss of connectivity in the more recent past.
Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min
2011-01-01
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507
Qin, Shaobo; Mo, Yunming; Jiang, Ke; Cai, Bo; Xie, Feng; Jiang, Jianping; Murphy, Robert W; Li, Jia-Tang; Wang, Yuezhao
2015-01-01
Due to small body sizes, superficial similarities in morphologies, and obscure activity behaviors, the phylogeny and taxonomy of species in the genus Liuixalus were very troublesome. Some species might comprise a complex of cryptic species. To investigate the species of group, we constructed the matrilineal genealogy of the genus using 16s rRNA mitochondrial DNA sequences. Analyses recovered six well supported matrilines that involved L. romeri, L. ocellatus, L. hainanus, L. calcarius, Liuixalus shiwandashan sp. nov. and Liuixalus jinxiuensis sp. nov., though the historical relationships among them remained unresolved. Currently, Liuixalus included 4 species, distributed eastwards from northern Vietnam to Hong Kong, China. Based on genealogical and morphological distinctiveness, we described Liuixalus jinxiuensis sp. nov. from the type locality Mt. Dayao, Jinxiu, China and Liuixalus shiwandashan sp. nov. from the type locality Mt. Shiwanda, China. A combination of morphological measurements, genetic, bioacoustic and osteological analysis was served to diagnose the new taxa.
NASA Astrophysics Data System (ADS)
Liang, Yunyun; Liu, Sanyang; Zhang, Shengli
2017-02-01
Apoptosis is a fundamental process controlling normal tissue homeostasis by regulating a balance between cell proliferation and death. Predicting subcellular location of apoptosis proteins is very helpful for understanding its mechanism of programmed cell death. Prediction of apoptosis protein subcellular location is still a challenging and complicated task, and existing methods mainly based on protein primary sequences. In this paper, we propose a new position-specific scoring matrix (PSSM)-based model by using Geary autocorrelation function and detrended cross-correlation coefficient (DCCA coefficient). Then a 270-dimensional (270D) feature vector is constructed on three widely used datasets: ZD98, ZW225 and CL317, and support vector machine is adopted as classifier. The overall prediction accuracies are significantly improved by rigorous jackknife test. The results show that our model offers a reliable and effective PSSM-based tool for prediction of apoptosis protein subcellular localization.
Geomagnetism-Aided Indoor Wi-Fi Radio-Map Construction via Smartphone Crowdsourcing.
Li, Wen; Wei, Dongyan; Lai, Qifeng; Li, Xianghong; Yuan, Hong
2018-05-08
Wi-Fi radio-map construction is an important phase in indoor fingerprint localization systems. Traditional methods for Wi-Fi radio-map construction have the problems of being time-consuming and labor-intensive. In this paper, an indoor Wi-Fi radio-map construction method is proposed which utilizes crowdsourcing data contributed by smartphone users. We draw indoor pathway map and construct Wi-Fi radio-map without requiring manual site survey, exact floor layout and extra infrastructure support. The key novelty is that it recognizes road segments from crowdsourcing traces by a cluster based on magnetism sequence similarity and constructs an indoor pathway map with Wi-Fi signal strengths annotated on. Through experiments in real world indoor areas, the method is proved to have good performance on magnetism similarity calculation, road segment clustering and pathway map construction. The Wi-Fi radio maps constructed by crowdsourcing data are validated to provide competitive indoor localization accuracy.
Geomagnetism-Aided Indoor Wi-Fi Radio-Map Construction via Smartphone Crowdsourcing
Li, Wen; Wei, Dongyan; Lai, Qifeng; Li, Xianghong; Yuan, Hong
2018-01-01
Wi-Fi radio-map construction is an important phase in indoor fingerprint localization systems. Traditional methods for Wi-Fi radio-map construction have the problems of being time-consuming and labor-intensive. In this paper, an indoor Wi-Fi radio-map construction method is proposed which utilizes crowdsourcing data contributed by smartphone users. We draw indoor pathway map and construct Wi-Fi radio-map without requiring manual site survey, exact floor layout and extra infrastructure support. The key novelty is that it recognizes road segments from crowdsourcing traces by a cluster based on magnetism sequence similarity and constructs an indoor pathway map with Wi-Fi signal strengths annotated on. Through experiments in real world indoor areas, the method is proved to have good performance on magnetism similarity calculation, road segment clustering and pathway map construction. The Wi-Fi radio maps constructed by crowdsourcing data are validated to provide competitive indoor localization accuracy. PMID:29738454
Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies
Sundquist, Andreas; Ronaghi, Mostafa; Tang, Haixu; Pevzner, Pavel; Batzoglou, Serafim
2007-01-01
While recently developed short-read sequencing technologies may dramatically reduce the sequencing cost and eventually achieve the $1000 goal for re-sequencing, their limitations prevent the de novo sequencing of eukaryotic genomes with the standard shotgun sequencing protocol. We present SHRAP (SHort Read Assembly Protocol), a sequencing protocol and assembly methodology that utilizes high-throughput short-read technologies. We describe a variation on hierarchical sequencing with two crucial differences: (1) we select a clone library from the genome randomly rather than as a tiling path and (2) we sample clones from the genome at high coverage and reads from the clones at low coverage. We assume that 200 bp read lengths with a 1% error rate and inexpensive random fragment cloning on whole mammalian genomes is feasible. Our assembly methodology is based on first ordering the clones and subsequently performing read assembly in three stages: (1) local assemblies of regions significantly smaller than a clone size, (2) clone-sized assemblies of the results of stage 1, and (3) chromosome-sized assemblies. By aggressively localizing the assembly problem during the first stage, our method succeeds in assembling short, unpaired reads sampled from repetitive genomes. We tested our assembler using simulated reads from D. melanogaster and human chromosomes 1, 11, and 21, and produced assemblies with large sets of contiguous sequence and a misassembly rate comparable to other draft assemblies. Tested on D. melanogaster and the entire human genome, our clone-ordering method produces accurate maps, thereby localizing fragment assembly and enabling the parallelization of the subsequent steps of our pipeline. Thus, we have demonstrated that truly inexpensive de novo sequencing of mammalian genomes will soon be possible with high-throughput, short-read technologies using our methodology. PMID:17534434
A map of human genome variation from population-scale sequencing.
Abecasis, Gonçalo R; Altshuler, David; Auton, Adam; Brooks, Lisa D; Durbin, Richard M; Gibbs, Richard A; Hurles, Matt E; McVean, Gil A
2010-10-28
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.
Li, Shijun; Ehrhardt, David W.; Rhee, Seung Y.
2006-01-01
Cells are organized into a complex network of subcellular compartments that are specialized for various biological functions. Subcellular location is an important attribute of protein function. To facilitate systematic elucidation of protein subcellular location, we analyzed experimentally verified protein localization data of 1,300 Arabidopsis (Arabidopsis thaliana) proteins. The 1,300 experimentally verified proteins are distributed among 40 different compartments, with most of the proteins localized to four compartments: mitochondria (36%), nucleus (28%), plastid (17%), and cytosol (13.3%). About 19% of the proteins are found in multiple compartments, in which a high proportion (36.4%) is localized to both cytosol and nucleus. Characterization of the overrepresented Gene Ontology molecular functions and biological processes suggests that the Golgi apparatus and peroxisome may play more diverse functions but are involved in more specialized processes than other compartments. To support systematic empirical determination of protein subcellular localization using a technology called fluorescent tagging of full-length proteins, we developed a database and Web application to provide preselected green fluorescent protein insertion position and primer sequences for all Arabidopsis proteins to study their subcellular localization and to store experimentally verified protein localization images, videos, and their annotations of proteins generated using the fluorescent tagging of full-length proteins technology. The database can be searched, browsed, and downloaded using a Web browser at http://aztec.stanford.edu/gfp/. The software can also be downloaded from the same Web site for local installation. PMID:16617091
Prm3p is a pheromone-induced peripheral nuclear envelope protein required for yeast nuclear fusion.
Shen, Shu; Tobery, Cynthia E; Rose, Mark D
2009-05-01
Nuclear membrane fusion is the last step in the mating pathway of the yeast Saccharomyces cerevisiae. We adapted a bioinformatics approach to identify putative pheromone-induced membrane proteins potentially required for nuclear membrane fusion. One protein, Prm3p, was found to be required for nuclear membrane fusion; disruption of PRM3 caused a strong bilateral defect, in which nuclear congression was completed but fusion did not occur. Prm3p was localized to the nuclear envelope in pheromone-responding cells, with significant colocalization with the spindle pole body in zygotes. A previous report, using a truncated protein, claimed that Prm3p is localized to the inner nuclear envelope. Based on biochemistry, immunoelectron microscopy and live cell microscopy, we find that functional Prm3p is a peripheral membrane protein exposed on the cytoplasmic face of the outer nuclear envelope. In support of this, mutations in a putative nuclear localization sequence had no effect on full-length protein function or localization. In contrast, point mutations and deletions in the highly conserved hydrophobic carboxy-terminal domain disrupted both protein function and localization. Genetic analysis, colocalization, and biochemical experiments indicate that Prm3p interacts directly with Kar5p, suggesting that nuclear membrane fusion is mediated by a protein complex.
Molzan, Manuela; Ottmann, Christian
2013-03-01
Myeloid leukemia factor 1 (MLF1) is associated with the development of leukemic diseases such as acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). However, information on the physiological function of MLF1 is limited and mostly derived from studies identifying MLF1 interaction partners like CSN3, MLF1IP, MADM, Manp and the 14-3-3 proteins. The 14-3-3-binding site surrounding S34 is one of the only known functional features of the MLF1 sequence, along with one nuclear export sequence (NES) and two nuclear localization sequences (NLS). It was recently shown that the subcellular localization of mouse MLF1 is dependent on 14-3-3 proteins. Based on these findings, we investigated whether the subcellular localization of human MLF1 was also directly 14-3-3-dependent. Live cell imaging with GFP-fused human MLF1 was used to study the effects of mutations and deletions on its subcellular localization. Surprisingly, we found that the subcellular localization of full-length human MLF1 is 14-3-3-independent, and is probably regulated by other as-yet-unknown proteins.
SU-F-T-350: Continuous Leaf Optimization (CLO) for IMRT Leaf Sequencing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Long, T; Chen, M; Jiang, S
Purpose: To study a new step-and-shoot IMRT leaf sequencing model that avoids the two main pitfalls of conventional leaf sequencing: (1) target fluence being stratified into a fixed number of discrete levels and/or (2) aperture leaf positions being restricted to a discrete set of locations. These assumptions induce error into the sequence or reduce the feasible region of potential plans, respectively. Methods: We develop a one-dimensional (single leaf pair) methodology that does not make assumptions (1) or (2) that can be easily extended to a multi-row model. The proposed continuous leaf optimization (CLO) methodology takes in an existing set ofmore » apertures and associated intensities, or solution “seed,” and improves the plan without the restrictiveness of 1or (2). It then uses a first-order descent algorithm to converge onto a locally optimal solution. A seed solution can come from models that assume (1) and (2), thus allowing the CLO model to improve upon existing leaf sequencing methodologies. Results: The CLO model was applied to 208 generated target fluence maps in one dimension. In all cases for all tested sequencing strategies, the CLO model made improvements on the starting seed objective function. The CLO model also was able to keep MUs low. Conclusion: The CLO model can improve upon existing leaf sequencing methods by avoiding the restrictions of (1) and (2). By allowing for more flexible leaf positioning, error can be reduced when matching some target fluence. This study lays the foundation for future models and solution methodologies that can incorporate continuous leaf positions explicitly into the IMRT treatment planning model. Supported by Cancer Prevention & Research Institute of Texas (CPRIT) - ID RP150485.« less
Design Thinking for mHealth Application Co-Design to Support Heart Failure Self-Management.
Woods, Leanna; Cummings, Elizabeth; Duff, Jed; Walker, Kim
2017-01-01
Heart failure is a prevalent, progressive chronic disease costing in excess of $1billion per year in Australia alone. Disease self-management has positive implications for the patient and decreases healthcare usage. However, adherence to recommended guidelines is challenging and existing literature reports sub-optimal adherence. mHealth applications in chronic disease education have the potential to facilitate patient enablement for disease self-management. To the best of our knowledge no heart failure self-management application is available for safe use by our patients. In this paper, we present the process established to co-design a mHealth application in support of heart-failure self-management. For this development, an interdisciplinary team systematically proceeds through the phases of Stanford University's Design Thinking process; empathise, define, ideate, prototype and test with a user-centred philosophy. Using this clinician-led heart failure app research as a case study, we describe a sequence of procedures to engage with local patients, carers, software developers, eHealth experts and clinical colleagues to foster rigorously developed and locally relevant patient-facing mHealth solutions. Importantly, patients are engaged in each stage with ethnographic interviews, a series of workshops and multiple re-design iterations.
Barcellos, Leonardo H; Palmeiro, Marina Lobato; Naconecy, Marcos M; Geremia, Tomás; Cervieri, André; Shinkai, Rosemary S
2018-05-17
To compare the effects of different screw-tightening sequences and torque applications on stresses in implant-supported fixed complete dentures supported by five abutments. Strain gauges fixed to the abutments were used to test the sequences 2-4-3-1-5; 1-2-3-4-5; 3-2-4-1-5; and 2-5-4-1-3 with direct 10-Ncm torque or progressive torque (5 + 10 Ncm). Data were analyzed using analysis of variance and standardized effect size. No effects of tightening sequence or torque application were found except for the sequence 3-2-4-1-5 and some small to moderate effect sizes. Screw-tightening sequences and torque application modes have only a marginal effect on residual stresses.
Sanchez, Daniel J; Reber, Paul J
2012-04-01
The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the implicit sequence learning mechanism, participants performed the serial interception sequence learning (SISL) task with covertly embedded repeating sequences that were much longer than most previous studies: ranging from 30 to 60 (Experiment 1) and 60 to 90 (Experiment 2) items in length. Robust sequence-specific learning was observed for sequences up to 80 items in length, extending the known capacity of implicit sequence learning. In Experiment 3, 12-item repeating sequences were embedded among increasing amounts of irrelevant nonrepeating sequences (from 20 to 80% of training trials). Despite high levels of irrelevant trials, learning occurred across conditions. A comparison of learning rates across all three experiments found a surprising degree of constancy in the rate of learning regardless of sequence length or embedded noise. Sequence learning appears to be constant with the logarithm of the number of sequence repetitions practiced during training. The consistency in learning rate across experiments and conditions implies that the mechanisms supporting implicit sequence learning are not capacity-constrained by very long sequences nor adversely affected by high rates of irrelevant sequences during training.
DOE Office of Scientific and Technical Information (OSTI.GOV)
FitzGerald, Michael
2012-06-01
Michael FitzGerald on "A rapid whole genome sequencing and analysis system supporting genomic epidemiology" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
FitzGerald, Michael
2018-01-11
Michael FitzGerald on "A rapid whole genome sequencing and analysis system supporting genomic epidemiology" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH.
Volk, Jochen; Herrmann, Torsten; Wüthrich, Kurt
2008-07-01
MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness.
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Barendt, Pamela A.; Shah, Najaf A.; Barendt, Gregory A.; Kothari, Parth A.; Sarkar, Casim A.
2013-01-01
While the ribosome has evolved to function in complex intracellular environments, these contexts do not easily allow for the study of its inherent capabilities. We have used a synthetic, well-defined, Escherichia coli (E. coli)-based translation system in conjunction with ribosome display, a powerful in vitro selection method, to identify ribosome binding sites (RBSs) that can promote the efficient translation of messenger RNAs (mRNAs) with a leader length representative of natural E. coli mRNAs. In previous work, we used a longer leader sequence and unexpectedly recovered highly efficient cytosine-rich sequences with complementarity to the 16S ribosomal RNA (rRNA) and similarity to eukaryotic RBSs. In the current study, Shine-Dalgarno (SD) sequences were prevalent but non-SD sequences were also heavily enriched and were dominated by novel guanine- and uracil-rich motifs which showed statistically significant complementarity to the 16S rRNA. Additionally, only SD motifs exhibited position-dependent decreases in sequence entropy, indicating that non-SD motifs likely operate by increasing the local concentration of ribosomes in the vicinity of the start codon, rather than by a position-dependent mechanism. These results further support the putative generality of mRNA-rRNA complementarity in facilitating mRNA translation, but also suggest that context (e.g., leader length and composition) dictates the specific subset of possible RBSs that are used for efficient translation of a given transcript. PMID:23427812
Biclustering as a method for RNA local multiple sequence alignment.
Wang, Shu; Gutell, Robin R; Miranker, Daniel P
2007-12-15
Biclustering is a clustering method that simultaneously clusters both the domain and range of a relation. A challenge in multiple sequence alignment (MSA) is that the alignment of sequences is often intended to reveal groups of conserved functional subsequences. Simultaneously, the grouping of the sequences can impact the alignment; precisely the kind of dual situation biclustering is intended to address. We define a representation of the MSA problem enabling the application of biclustering algorithms. We develop a computer program for local MSA, BlockMSA, that combines biclustering with divide-and-conquer. BlockMSA simultaneously finds groups of similar sequences and locally aligns subsequences within them. Further alignment is accomplished by dividing both the set of sequences and their contents. The net result is both a multiple sequence alignment and a hierarchical clustering of the sequences. BlockMSA was tested on the subsets of the BRAliBase 2.1 benchmark suite that display high variability and on an extension to that suite to larger problem sizes. Also, alignments were evaluated of two large datasets of current biological interest, T box sequences and Group IC1 Introns. The results were compared with alignments computed by ClustalW, MAFFT, MUCLE and PROBCONS alignment programs using Sum of Pairs (SPS) and Consensus Count. Results for the benchmark suite are sensitive to problem size. On problems of 15 or greater sequences, BlockMSA is consistently the best. On none of the problems in the test suite are there appreciable differences in scores among BlockMSA, MAFFT and PROBCONS. On the T box sequences, BlockMSA does the most faithful job of reproducing known annotations. MAFFT and PROBCONS do not. On the Intron sequences, BlockMSA, MAFFT and MUSCLE are comparable at identifying conserved regions. BlockMSA is implemented in Java. Source code and supplementary datasets are available at http://aug.csres.utexas.edu/msa/
Paleoenvironmental conditions in the late Paleogene, Sumatra, Indonesia
NASA Astrophysics Data System (ADS)
O'shea, Natalie; Arthur Bettis, E.; Zaim, Yahdi; Rizal, Yan; Aswan, Aswan; Gunnell, Gregg F.; Zonneveld, John-Paul; Ciochon, Russell L.
2015-11-01
A stratified paleosol sequence exposed in an open pit mine in central Sumatra provides a record of the paleoenvironmental conditions in the lower reaches of a large river system in the late Paleogene (latest Eocene or Oligocene). Morphological, geochemical, and stable isotope data suggest that the sequence represents a mosaic of local environmental conditions changing from estuarine to riverine up section. Weakly expressed soils formed on low-lying estuary surfaces, while more well expressed soils formed on higher, better drained surfaces. Peatlands (coal) with clayey subsoils were along the estuary margins. Well-expressed soils with evidence of clay translocation and chemical weathering become more common higher in the section where alluvial deposits associated with a meandering river are dominant. Stable carbon isotope ratios support a paleolandscape dominated by C3 plants with input by C4 vegetation limited to a few intervals. Finally, whole-rock geochemistry suggests moderate chemical weathering consistent with a tropical locality. This multi-proxy paleoenvironmental reconstruction suggests a highly productive lowland forest environment at this locality in the late Paleogene and provides the first direct examination of the terrestrial environment in Sumatra at this time. The limited fossil record in Island Southeast Asia during this time period is likely a result of poor bone and shell preservation in tropical forest environments combined with a general lack of systematic prospecting. However, our continuing work in this area has produced a relatively diverse assemblage of fossil vertebrates, now including fishes, amphibians, turtles, crocodiles, and mammals, as well as a growing diversity of fossil plants.
Wielstra, B; Arntzen, J W
2014-05-27
We collected nuclear DNA data (52 markers) with next-generation sequencing for nine Triturus newt specimens, including the holotype and two of the paratypes of T. arntzeni, from the type locality at Vrtovać in eastern Serbia. We compare these data to a reference set composed of the four crested newt species distributed in eastern Serbia namely T. cristatus, T. dobrogicus, T. ivanbureschi and T. macedonicus to determine to which of these species the newts from the type locality of T. arntzeni should be attributed. The majority of alleles in individuals from Vrtovać is derived from T. macedonicus, but a considerable number of T. ivanbureschi alleles is also present; alleles typical for T. cristatus and T. dobrogicus are found at low frequency. Accordingly, we interpret Vrtovać as a T. macedonicus - T. ivanbureschi hybrid population, albeit not composed of F1 hybrids but of genetically admixed individuals derived through multiple generations of backcrossing. The data support the notion that the name T. arntzeni should not be applied to a species newly distinguished in T. karelinii sensu lato (to which the name T. ivanbureschi has been given). We conclude that because of the hybrid nature of the individuals from Vrtovać, the name T. arntzeni should be placed not only in the synonymy of T. macedonicus but also in the synonymy of T. ivanbureschi. In this study we demonstrate that next-generation sequencing can provide high quality data for type material with degraded DNA and therefore can play an important role in taxonomy.
Gong, Wei; Russell, Michael; Suzuki, Keiko; Riabowol, Karl
2006-04-01
ING1 is a type II tumor suppressor that affects cell growth, stress signaling, apoptosis, and DNA repair by altering chromatin structure and regulating transcription. Decreased ING1 expression is seen in several human cancers, and mislocalization has been noted in diverse types of cancer cells. Aberrant targeting may, therefore, functionally inactivate ING1. Bioinformatics analysis identified a sequence between the nuclear localization sequence and plant homeodomain domains of ING1 that closely matched the binding motif of 14-3-3 proteins that target cargo proteins to specific subcellular locales. We find that the widely expressed p33(ING1b) splicing isoform of ING1 interacts with members of the 14-3-3 family of proteins and that this interaction is regulated by the phosphorylation status of ING1. 14-3-3 binding resulted in significant amounts of p33(ING1b) protein being tethered in the cytoplasm. As shown previously, ectopic expression of p33(ING1b) increased levels of the p21(Waf1) cyclin-dependent kinase inhibitor upon UV-induced DNA damage. Overexpression of 14-3-3 inhibited the up-regulation of p21(Waf1) by p33(ING1b), consistent with the idea that mislocalization blocks at least one of ING1's biological activities. These data support the idea that the 14-3-3 proteins play a crucial role in regulating the activity of p33(ING1b) by directing its subcellular localization.
Subcellular localization of transiently expressed fluorescent fusion proteins.
Collings, David A
2013-01-01
The recent and massive expansion in plant genomics data has generated a large number of gene sequences for which two seemingly simple questions need to be answered: where do the proteins encoded by these genes localize in cells, and what do they do? One widespread approach to answering the localization question has been to use particle bombardment to transiently express unknown proteins tagged with green fluorescent protein (GFP) or its numerous derivatives. Confocal fluorescence microscopy is then used to monitor the localization of the fluorescent protein as it hitches a ride through the cell. The subcellular localization of the fusion protein, if not immediately apparent, can then be determined by comparison to localizations generated by fluorescent protein fusions to known signalling sequences and proteins, or by direct comparison with fluorescent dyes. This review aims to be a tour guide for researchers wanting to travel this hitch-hiker's path, and for reviewers and readers who wish to understand their travel reports. It will describe some of the technology available for visualizing protein localizations, and some of the experimental approaches for optimizing and confirming localizations generated by particle bombardment in onion epidermal cells, the most commonly used experimental system. As the non-conservation of signal sequences in heterologous expression systems such as onion, and consequent mis-targeting of fusion proteins, is always a potential problem, the epidermal cells of the Argenteum mutant of pea are proposed as a model system.
Feature co-localization landscape of the human genome
Ng, Siu-Kin; Hu, Taobo; Long, Xi; Chan, Cheuk-Hin; Tsang, Shui-Ying; Xue, Hong
2016-01-01
Although feature co-localizations could serve as useful guide-posts to genome architecture, a comprehensive and quantitative feature co-localization map of the human genome has been lacking. Herein we show that, in contrast to the conventional bipartite division of genomic sequences into genic and inter-genic regions, pairwise co-localizations of forty-two genomic features in the twenty-two autosomes based on 50-kb to 2,000-kb sequence windows indicate a tripartite zonal architecture comprising Genic zones enriched with gene-related features and Alu-elements; Proximal zones enriched with MIR- and L2-elements, transcription-factor-binding-sites (TFBSs), and conserved-indels (CIDs); and Distal zones enriched with L1-elements. Co-localizations between single-nucleotide-polymorphisms (SNPs) and copy-number-variations (CNVs) reveal a fraction of sequence windows displaying steeply enhanced levels of SNPs, CNVs and recombination rates that point to active adaptive evolution in such pathways as immune response, sensory perceptions, and cognition. The strongest positive co-localization observed between TFBSs and CIDs suggests a regulatory role of CIDs in cooperation with TFBSs. The positive co-localizations of cancer somatic CNVs (CNVT) with all Proximal zone and most Genic zone features, in contrast to the distinctly more restricted co-localizations exhibited by germline CNVs (CNVG), reveal disparate distributions of CNVTs and CNVGs indicative of dissimilarity in their underlying mechanisms. PMID:26854351
Unipro UGENE: a unified bioinformatics toolkit.
Okonechnikov, Konstantin; Golosova, Olga; Fursov, Mikhail
2012-04-15
Unipro UGENE is a multiplatform open-source software with the main goal of assisting molecular biologists without much expertise in bioinformatics to manage, analyze and visualize their data. UGENE integrates widely used bioinformatics tools within a common user interface. The toolkit supports multiple biological data formats and allows the retrieval of data from remote data sources. It provides visualization modules for biological objects such as annotated genome sequences, Next Generation Sequencing (NGS) assembly data, multiple sequence alignments, phylogenetic trees and 3D structures. Most of the integrated algorithms are tuned for maximum performance by the usage of multithreading and special processor instructions. UGENE includes a visual environment for creating reusable workflows that can be launched on local resources or in a High Performance Computing (HPC) environment. UGENE is written in C++ using the Qt framework. The built-in plugin system and structured UGENE API make it possible to extend the toolkit with new functionality. UGENE binaries are freely available for MS Windows, Linux and Mac OS X at http://ugene.unipro.ru/download.html. UGENE code is licensed under the GPLv2; the information about the code licensing and copyright of integrated tools can be found in the LICENSE.3rd_party file provided with the source bundle.
2011-01-01
Background Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galápagos Rift. Results Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. PMID:21489281
Coykendall, D.K.; Johnson, S.B.; Karl, S.A.; Lutz, R.A.; Vrijenhoek, R.C.
2011-01-01
Background: Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galpagos Rift. Results: Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions: Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. ?? 2011 Coykendall et al; licensee BioMed Central Ltd.
The ENCODE Project at UC Santa Cruz.
Thomas, Daryl J; Rosenbloom, Kate R; Clawson, Hiram; Hinrichs, Angie S; Trumbower, Heather; Raney, Brian J; Karolchik, Donna; Barber, Galt P; Harte, Rachel A; Hillman-Jackson, Jennifer; Kuhn, Robert M; Rhead, Brooke L; Smith, Kayla E; Thakkapallayil, Archana; Zweig, Ann S; Haussler, David; Kent, W James
2007-01-01
The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.
Nielsen, Rasmus
2017-01-01
Admixture—the mixing of genomes from divergent populations—is increasingly appreciated as a central process in evolution. To characterize and quantify patterns of admixture across the genome, a number of methods have been developed for local ancestry inference. However, existing approaches have a number of shortcomings. First, all local ancestry inference methods require some prior assumption about the expected ancestry tract lengths. Second, existing methods generally require genotypes, which is not feasible to obtain for many next-generation sequencing projects. Third, many methods assume samples are diploid, however a wide variety of sequencing applications will fail to meet this assumption. To address these issues, we introduce a novel hidden Markov model for estimating local ancestry that models the read pileup data, rather than genotypes, is generalized to arbitrary ploidy, and can estimate the time since admixture during local ancestry inference. We demonstrate that our method can simultaneously estimate the time since admixture and local ancestry with good accuracy, and that it performs well on samples of high ploidy—i.e. 100 or more chromosomes. As this method is very general, we expect it will be useful for local ancestry inference in a wider variety of populations than what previously has been possible. We then applied our method to pooled sequencing data derived from populations of Drosophila melanogaster on an ancestry cline on the east coast of North America. We find that regions of local recombination rates are negatively correlated with the proportion of African ancestry, suggesting that selection against foreign ancestry is the least efficient in low recombination regions. Finally we show that clinal outlier loci are enriched for genes associated with gene regulatory functions, consistent with a role of regulatory evolution in ecological adaptation of admixed D. melanogaster populations. Our results illustrate the potential of local ancestry inference for elucidating fundamental evolutionary processes. PMID:28045893
Local extinction and unintentional rewilding of bighorn sheep (Ovis canadensis) on a desert island
Wilder, Benjamin T.; Betancourt, Julio L.; Epps, Clinton W.; Crowhurst, Rachel S.; Mead, Jim I.; Ezcurra, Exequiel
2014-01-01
Bighorn sheep (Ovis canadensis) were not known to live on Tiburón Island, the largest island in the Gulf of California and Mexico, prior to the surprisingly successful introduction of 20 individuals as a conservation measure in 1975. Today, a stable island population of ~500 sheep supports limited big game hunting and restocking of depleted areas on the Mexican mainland. We discovered fossil dung morphologically similar to that of bighorn sheep in a dung mat deposit from Mojet Cave, in the mountains of Tiburón Island. To determine the origin of this cave deposit we compared pellet shape to fecal pellets of other large mammals, and extracted DNA to sequence mitochondrial DNA fragments at the 12S ribosomal RNA and control regions. The fossil dung was 14C-dated to 1476–1632 calendar years before present and was confirmed as bighorn sheep by morphological and ancient DNA (aDNA) analysis. 12S sequences closely or exactly matched known bighorn sheep sequences; control region sequences exactly matched a haplotype described in desert bighorn sheep populations in southwest Arizona and southern California and showed subtle differentiation from the extant Tiburón population. Native desert bighorn sheep previously colonized this land-bridge island, most likely during the Pleistocene, when lower sea levels connected Tiburón to the mainland. They were extirpated sometime in the last ~1500 years, probably due to inherent dynamics of isolated populations, prolonged drought, and (or) human overkill. The reintroduced population is vulnerable to similar extinction risks. The discovery presented here refutes conventional wisdom that bighorn sheep are not native to Tiburón Island, and establishes its recent introduction as an example of unintentional rewilding, defined here as the introduction of a species without knowledge that it was once native and has since gone locally extinct.
Sequence-Specific Targeting of Dosage Compensation in Drosophila Favors an Active Chromatin Context
Gelbart, Marnie; Tolstorukov, Michael Y.; Plachetka, Annette; Kharchenko, Peter V.; Jung, Youngsook L.; Gorchakov, Andrey A.; Larschan, Erica; Gu, Tingting; Minoda, Aki; Riddle, Nicole C.; Schwartz, Yuri B.; Elgin, Sarah C. R.; Karpen, Gary H.; Pirrotta, Vincenzo; Kuroda, Mitzi I.; Park, Peter J.
2012-01-01
The Drosophila MSL complex mediates dosage compensation by increasing transcription of the single X chromosome in males approximately two-fold. This is accomplished through recognition of the X chromosome and subsequent acetylation of histone H4K16 on X-linked genes. Initial binding to the X is thought to occur at “entry sites” that contain a consensus sequence motif (“MSL recognition element” or MRE). However, this motif is only ∼2 fold enriched on X, and only a fraction of the motifs on X are initially targeted. Here we ask whether chromatin context could distinguish between utilized and non-utilized copies of the motif, by comparing their relative enrichment for histone modifications and chromosomal proteins mapped in the modENCODE project. Through a comparative analysis of the chromatin features in male S2 cells (which contain MSL complex) and female Kc cells (which lack the complex), we find that the presence of active chromatin modifications, together with an elevated local GC content in the surrounding sequences, has strong predictive value for functional MSL entry sites, independent of MSL binding. We tested these sites for function in Kc cells by RNAi knockdown of Sxl, resulting in induction of MSL complex. We show that ectopic MSL expression in Kc cells leads to H4K16 acetylation around these sites and a relative increase in X chromosome transcription. Collectively, our results support a model in which a pre-existing active chromatin environment, coincident with H3K36me3, contributes to MSL entry site selection. The consequences of MSL targeting of the male X chromosome include increase in nucleosome lability, enrichment for H4K16 acetylation and JIL-1 kinase, and depletion of linker histone H1 on active X-linked genes. Our analysis can serve as a model for identifying chromatin and local sequence features that may contribute to selection of functional protein binding sites in the genome. PMID:22570616
Local extinction and unintentional rewilding of bighorn sheep (Ovis canadensis) on a desert island.
Wilder, Benjamin T; Betancourt, Julio L; Epps, Clinton W; Crowhurst, Rachel S; Mead, Jim I; Ezcurra, Exequiel
2014-01-01
Bighorn sheep (Ovis canadensis) were not known to live on Tiburón Island, the largest island in the Gulf of California and Mexico, prior to the surprisingly successful introduction of 20 individuals as a conservation measure in 1975. Today, a stable island population of ∼500 sheep supports limited big game hunting and restocking of depleted areas on the Mexican mainland. We discovered fossil dung morphologically similar to that of bighorn sheep in a dung mat deposit from Mojet Cave, in the mountains of Tiburón Island. To determine the origin of this cave deposit we compared pellet shape to fecal pellets of other large mammals, and extracted DNA to sequence mitochondrial DNA fragments at the 12S ribosomal RNA and control regions. The fossil dung was 14C-dated to 1476-1632 calendar years before present and was confirmed as bighorn sheep by morphological and ancient DNA (aDNA) analysis. 12S sequences closely or exactly matched known bighorn sheep sequences; control region sequences exactly matched a haplotype described in desert bighorn sheep populations in southwest Arizona and southern California and showed subtle differentiation from the extant Tiburón population. Native desert bighorn sheep previously colonized this land-bridge island, most likely during the Pleistocene, when lower sea levels connected Tiburón to the mainland. They were extirpated sometime in the last ∼1500 years, probably due to inherent dynamics of isolated populations, prolonged drought, and (or) human overkill. The reintroduced population is vulnerable to similar extinction risks. The discovery presented here refutes conventional wisdom that bighorn sheep are not native to Tiburón Island, and establishes its recent introduction as an example of unintentional rewilding, defined here as the introduction of a species without knowledge that it was once native and has since gone locally extinct.
Local Extinction and Unintentional Rewilding of Bighorn Sheep (Ovis canadensis) on a Desert Island
Wilder, Benjamin T.; Betancourt, Julio L.; Epps, Clinton W.; Crowhurst, Rachel S.; Mead, Jim I.; Ezcurra, Exequiel
2014-01-01
Bighorn sheep (Ovis canadensis) were not known to live on Tiburón Island, the largest island in the Gulf of California and Mexico, prior to the surprisingly successful introduction of 20 individuals as a conservation measure in 1975. Today, a stable island population of ∼500 sheep supports limited big game hunting and restocking of depleted areas on the Mexican mainland. We discovered fossil dung morphologically similar to that of bighorn sheep in a dung mat deposit from Mojet Cave, in the mountains of Tiburón Island. To determine the origin of this cave deposit we compared pellet shape to fecal pellets of other large mammals, and extracted DNA to sequence mitochondrial DNA fragments at the 12S ribosomal RNA and control regions. The fossil dung was 14C-dated to 1476–1632 calendar years before present and was confirmed as bighorn sheep by morphological and ancient DNA (aDNA) analysis. 12S sequences closely or exactly matched known bighorn sheep sequences; control region sequences exactly matched a haplotype described in desert bighorn sheep populations in southwest Arizona and southern California and showed subtle differentiation from the extant Tiburón population. Native desert bighorn sheep previously colonized this land-bridge island, most likely during the Pleistocene, when lower sea levels connected Tiburón to the mainland. They were extirpated sometime in the last ∼1500 years, probably due to inherent dynamics of isolated populations, prolonged drought, and (or) human overkill. The reintroduced population is vulnerable to similar extinction risks. The discovery presented here refutes conventional wisdom that bighorn sheep are not native to Tiburón Island, and establishes its recent introduction as an example of unintentional rewilding, defined here as the introduction of a species without knowledge that it was once native and has since gone locally extinct. PMID:24646515
Filatov, Gleb; Bauwens, Bruno; Kertész-Farkas, Attila
2018-05-07
Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Here, we present a new convolutional kernel function for protein sequences called the LZW-Kernel. It is based on code words identified with the Lempel-Ziv-Welch (LZW) universal text compressor. The LZW-Kernel is an alignment-free method, it is always symmetric, is positive, always provides 1.0 for self-similarity and it can directly be used with Support Vector Machines (SVMs) in classification problems, contrary to normalized compression distance (NCD), which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly plausible for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with BLAST scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel, and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed, three times faster than BLAST and several magnitudes faster than SW or LAK in our tests. LZW-Kernel is implemented as a standalone C code and is a free open-source program distributed under GPLv3 license and can be downloaded from https://github.com/kfattila/LZW-Kernel. akerteszfarkas@hse.ru. Supplementary data are available at Bioinformatics Online.
Theos, Alexander C.; Watt, Brenda; Harper, Dawn C.; Janczura, Karolina J.; Theos, Sarah C.; Herman, Kathryn E.; Marks, Michael S.
2013-01-01
SUMMARY Proteolytic fragments of the pigment cell-specific glycoprotein, PMEL, form the amyloid fibrillar matrix underlying melanins in melanosomes. The fibrils form within multivesicular endosomes to which PMEL is selectively sorted and that serve as melanosome precursors. GPNMB is a tissue-restricted glycoprotein with substantial sequence homology to PMEL but no known function, and was proposed to localize to non-fibrillar domains of distinct melanosome subcompartments in melanocytes. Here we confirm that GPNMB localizes to compartments distinct from the PMEL-containing multivesicular premelanosomes or late endosomes in melanocytes and HeLa cells, respectively, and is largely absent from fibrils. Using domain swapping, the unique PMEL localization is ascribed to its PKD domain, whereas the homologous PKD domain of GPNMB lacks apparent sorting function. The difference likely reflects extensive modification of the GPNMB PKD domain by N-glycosylation, nullifying its sorting function. These results reveal the molecular basis for the distinct trafficking and morphogenetic properties of PMEL and GPNMB, and support a deterministic function of the PMEL PKD domain in both protein sorting and amyloidogenesis. PMID:23452376
Spatial confinement induces hairpins in nicked circular DNA
Japaridze, Aleksandre; Orlandini, Enzo; Smith, Kathleen Beth; Gmür, Lucas; Valle, Francesco; Micheletti, Cristian
2017-01-01
Abstract In living cells, DNA is highly confined in space with the help of condensing agents, DNA binding proteins and high levels of supercoiling. Due to challenges associated with experimentally studying DNA under confinement, little is known about the impact of spatial confinement on the local structure of the DNA. Here, we have used well characterized slits of different sizes to collect high resolution atomic force microscopy images of confined circular DNA with the aim of assessing the impact of the spatial confinement on global and local conformational properties of DNA. Our findings, supported by numerical simulations, indicate that confinement imposes a large mechanical stress on the DNA as evidenced by a pronounced anisotropy and tangent–tangent correlation function with respect to non-constrained DNA. For the strongest confinement we observed nanometer sized hairpins and interwound structures associated with the nicked sites in the DNA sequence. Based on these findings, we propose that spatial DNA confinement in vivo can promote the formation of localized defects at mechanically weak sites that could be co-opted for biological regulatory functions. PMID:28201616
A novel extracellular matrix protein from tomato associated with lignified secondary cell walls.
Domingo, C; Gómez, M D; Cañas, L; Hernández-Yago, J; Conejero, V; Vera, P
1994-01-01
A cDNA clone representing a novel cell wall protein was isolated from a tomato cDNA library. The deduced amino acid sequence shows that the encoded protein is very small (88 amino acids), contains an N-terminal hydrophobic signal peptide, and is enriched in lysine and tyrosine. We have designated this protein TLRP for tyrosine- and lysine-rich protein. RNA gel blot hybridization identified TLRP transcripts constitutively present in roots, stems, and leaves from tomato plants. The encoded protein seems to be highly insolubilized in the cell wall, and we present evidence that this protein is specifically localized in the modified secondary cell walls of the xylem and in cells of the sclerenchyma. In addition, the protein is localized in the protective periderm layer of the growing root. The highly localized deposition in cells destined to give support and protection to the plant indicates that this cell wall protein alone and/or in collaboration with other cell wall structural proteins may have a specialized structural function by mechanically strengthening the walls. PMID:7919979
Dynamical Localization for Discrete Anderson Dirac Operators
NASA Astrophysics Data System (ADS)
Prado, Roberto A.; de Oliveira, César R.; Carvalho, Silas L.
2017-04-01
We establish dynamical localization for random Dirac operators on the d-dimensional lattice, with d\\in { 1, 2, 3} , in the three usual regimes: large disorder, band edge and 1D. These operators are discrete versions of the continuous Dirac operators and consist in the sum of a discrete free Dirac operator with a random potential. The potential is a diagonal matrix formed by different scalar potentials, which are sequences of independent and identically distributed random variables according to an absolutely continuous probability measure with bounded density and of compact support. We prove the exponential decay of fractional moments of the Green function for such models in each of the above regimes, i.e., (j) throughout the spectrum at larger disorder, (jj) for energies near the band edges at arbitrary disorder and (jjj) in dimension one, for all energies in the spectrum and arbitrary disorder. Dynamical localization in theses regimes follows from the fractional moments method. The result in the one-dimensional regime contrast with one that was previously obtained for 1D Dirac model with Bernoulli potential.
NASA Technical Reports Server (NTRS)
Chang, D.; Haynes, J. I. 2nd; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)
1992-01-01
A nuclear localization signal (NLS) has been identified in the N-terminal (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) amino acid sequence of the polyomavirus major capsid protein VP1. The importance of this amino acid sequence for nuclear transport of VP1 protein was demonstrated by a genetic "subtractive" study using the constructs pSG5VP1 (full-length VP1) and pSG5 delta 5'VP1 (truncated VP1, lacking amino acids Ala1-Cys11). These constructs were used to transfect COS-7 cells, and expression and intracellular localization of the VP1 protein was visualized by indirect immunofluorescence. These studies revealed that the full-length VP1 was expressed and localized in the nucleus, while the truncated VP1 protein was localized in the cytoplasm and not transported to the nucleus. These findings were substantiated by an "additive" approach using FITC-labeled conjugates of synthetic peptides homologous to the NLS of VP1 cross-linked to bovine serum albumin or immunoglobulin G. Both conjugates localized in the nucleus after microinjection into the cytoplasm of 3T6 cells. The importance of individual amino acids found in the basic sequence (Lys3-Arg-Lys5) of the NLS was also investigated. This was accomplished by synthesizing three additional peptides in which lysine-3 was substituted with threonine, arginine-4 was substituted with threonine, or lysine-5 was substituted with threonine. It was found that lysine-3 was crucial for nuclear transport, since substitution of this amino acid with threonine prevented nuclear localization of the microinjected, FITC-labeled conjugate.
Yang, Zhihui; Mammel, Mark; Papafragkou, Efstathia; Hida, Kaoru; Elkins, Christopher A; Kulka, Michael
2017-11-16
Next generation sequencing (NGS) holds promise as a single application for both detection and sequence identification of foodborne viruses; however, technical challenges remain due to anticipated low quantities of virus in contaminated food. In this study, with a focus on data analysis using several bioinformatics tools, we applied NGS toward amplification-independent detection and identification of norovirus at low copy (<10 3 copies) or within multiple strains from produce. Celery samples were inoculated with human norovirus (stool suspension) either as a single norovirus strain, a mixture of strains (GII.4 and GII.6), or a mixture of different species (hepatitis A virus and norovirus). Viral RNA isolation and recovery was confirmed by RT-qPCR, and optimized for library generation and sequencing without amplification using the Illumina MiSeq platform. Extracts containing either a single virus or a two-virus mixture were analyzed using two different analytic approaches to achieve virus detection and identification. First an overall assessment of viral genome coverage for samples varying in copy numbers (1.1×10 3 to 1.7×10 7 ) and genomic content (single or multiple strains in various ratios) was completed by reference-guided mapping. Not unexpectedly, this targeted approach to identification was successful in correctly mapping reads, thus identifying each virus contained in the inoculums even at low copy (estimated at 12 copies). For the second (metagenomic) approach, samples were treated as "unknowns" for data analyses using (i) a sequence-based alignment with a local database, (ii) an "in-house" k-mer tool, (iii) a commercially available metagenomics bioinformatic analysis platform cosmosID, and (iv) an open-source program Kraken. Of the four metagenomics tools applied in this study, only the local database alignment and in-house k-mer tool were successful in detecting norovirus (as well as HAV) at low copy (down to <10 3 copies) and within a mixture of virus strains or species. The results of this investigation provide support for continued investigation into the development and integration of these analytical tools for identification and detection of foodborne viruses. Published by Elsevier B.V.
Highly multiplexed subcellular RNA sequencing in situ
Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Yang, Joyce L.; Terry, Richard; Jeanty, Sauveur S. F.; Li, Chao; Amamoto, Ryoji; Peters, Derek T.; Turczyk, Brian M.; Marblestone, Adam H.; Inverso, Samuel A.; Bernard, Amy; Mali, Prashant; Rios, Xavier; Aach, John; Church, George M.
2014-01-01
Understanding the spatial organization of gene expression with single nucleotide resolution requires localizing the sequences of expressed RNA transcripts within a cell in situ. Here we describe fluorescent in situ RNA sequencing (FISSEQ), in which stably cross-linked cDNA amplicons are sequenced within a biological sample. Using 30-base reads from 8,742 genes in situ, we examined RNA expression and localization in human primary fibroblasts using a simulated wound healing assay. FISSEQ is compatible with tissue sections and whole mount embryos, and reduces the limitations of optical resolution and noisy signals on single molecule detection. Our platform enables massively parallel detection of genetic elements, including gene transcripts and molecular barcodes, and can be used to investigate cellular phenotype, gene regulation, and environment in situ. PMID:24578530
Gruber, Karl; Schöning, Caspar; Otte, Marianne; Kinuthia, Wanja; Hasselmann, Martin
2013-09-01
Identifying the forces shaping intraspecific phenotypic and genotypic divergence are of key importance in evolutionary biology. Phenotypic divergence may result from local adaptation or, especially in species with strong gene flow, from pronounced phenotypic plasticity. Here, we examine morphological and genetic divergence among populations of the western honey bee Apis mellifera in the topographically heterogeneous East African region. The currently accepted "mountain refugia hypothesis" states that populations living in disjunct montane forests belong to a different lineage than those in savanna habitats surrounding these forests. We obtained microsatellite data, mitochondrial sequences, and morphometric data from worker honey bees collected from feral colonies in three montane forests and corresponding neighboring savanna regions in Kenya. Honey bee colonies from montane forests showed distinct worker morphology compared with colonies in savanna areas. Mitochondrial sequence data did not support the existence of the two currently accepted subspecies. Furthermore, analyses of the microsatellite data with a Bayesian clustering method did not support the existence of two source populations as it would be expected under the mountain refugia scenario. Our findings suggest that phenotypic plasticity rather than distinct ancestry is the leading cause behind the phenotypic divergence observed between montane forest and savanna honey bees. Our study thus corroborates the idea that high gene flow may select for increased plasticity.
Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing.
Zhang, Jianjun; Fujimoto, Junya; Zhang, Jianhua; Wedge, David C; Song, Xingzhi; Zhang, Jiexin; Seth, Sahil; Chow, Chi-Wan; Cao, Yu; Gumbs, Curtis; Gold, Kathryn A; Kalhor, Neda; Little, Latasha; Mahadeshwar, Harshad; Moran, Cesar; Protopopov, Alexei; Sun, Huandong; Tang, Jiabin; Wu, Xifeng; Ye, Yuanqing; William, William N; Lee, J Jack; Heymach, John V; Hong, Waun Ki; Swisher, Stephen; Wistuba, Ignacio I; Futreal, P Andrew
2014-10-10
Cancers are composed of populations of cells with distinct molecular and phenotypic features, a phenomenon termed intratumor heterogeneity (ITH). ITH in lung cancers has not been well studied. We applied multiregion whole-exome sequencing (WES) on 11 localized lung adenocarcinomas. All tumors showed clear evidence of ITH. On average, 76% of all mutations and 20 out of 21 known cancer gene mutations were identified in all regions of individual tumors, which suggested that single-region sequencing may be adequate to identify the majority of known cancer gene mutations in localized lung adenocarcinomas. With a median follow-up of 21 months after surgery, three patients have relapsed, and all three patients had significantly larger fractions of subclonal mutations in their primary tumors than patients without relapse. These data indicate that a larger subclonal mutation fraction may be associated with increased likelihood of postsurgical relapse in patients with localized lung adenocarcinomas. Copyright © 2014, American Association for the Advancement of Science.
SCALCE: boosting sequence compression algorithms using locally consistent encoding.
Hach, Faraz; Numanagic, Ibrahim; Alkan, Can; Sahinalp, S Cenk
2012-12-01
The high throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for the computational infrastructure. Data management, storage and analysis have become major logistical obstacles for those adopting the new platforms. The requirement for large investment for this purpose almost signalled the end of the Sequence Read Archive hosted at the National Center for Biotechnology Information (NCBI), which holds most of the sequence data generated world wide. Currently, most HTS data are compressed through general purpose algorithms such as gzip. These algorithms are not designed for compressing data generated by the HTS platforms; for example, they do not take advantage of the specific nature of genomic sequence data, that is, limited alphabet size and high similarity among reads. Fast and efficient compression algorithms designed specifically for HTS data should be able to address some of the issues in data management, storage and communication. Such algorithms would also help with analysis provided they offer additional capabilities such as random access to any read and indexing for efficient sequence similarity search. Here we present SCALCE, a 'boosting' scheme based on Locally Consistent Parsing technique, which reorganizes the reads in a way that results in a higher compression speed and compression rate, independent of the compression algorithm in use and without using a reference genome. Our tests indicate that SCALCE can improve the compression rate achieved through gzip by a factor of 4.19-when the goal is to compress the reads alone. In fact, on SCALCE reordered reads, gzip running time can improve by a factor of 15.06 on a standard PC with a single core and 6 GB memory. Interestingly even the running time of SCALCE + gzip improves that of gzip alone by a factor of 2.09. When compared with the recently published BEETL, which aims to sort the (inverted) reads in lexicographic order for improving bzip2, SCALCE + gzip provides up to 2.01 times better compression while improving the running time by a factor of 5.17. SCALCE also provides the option to compress the quality scores as well as the read names, in addition to the reads themselves. This is achieved by compressing the quality scores through order-3 Arithmetic Coding (AC) and the read names through gzip through the reordering SCALCE provides on the reads. This way, in comparison with gzip compression of the unordered FASTQ files (including reads, read names and quality scores), SCALCE (together with gzip and arithmetic encoding) can provide up to 3.34 improvement in the compression rate and 1.26 improvement in running time. Our algorithm, SCALCE (Sequence Compression Algorithm using Locally Consistent Encoding), is implemented in C++ with both gzip and bzip2 compression options. It also supports multithreading when gzip option is selected, and the pigz binary is available. It is available at http://scalce.sourceforge.net. fhach@cs.sfu.ca or cenk@cs.sfu.ca Supplementary data are available at Bioinformatics online.
Cell type-specific termination of transcription by transposable element sequences.
Conley, Andrew B; Jordan, I King
2012-09-30
Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
Memetic algorithms for de novo motif-finding in biomedical sequences.
Bi, Chengpeng
2012-09-01
The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies in the algorithm design are employed that are to not only efficiently explore the multiple sequence local alignment space, but also effectively uncover the molecular signals. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few of other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. In the simulation, it shows that MaMotif is the most time-efficient algorithm compared with others, that is, it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm is compared favorably or superior to other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary microRNA sequences. The memetic motif-finding algorithm is effectively designed and implemented, and its applications demonstrate it is not only time-efficient, but also exhibits excellent performance while compared with other popular algorithms. Copyright © 2012 Elsevier B.V. All rights reserved.
Fink, Inge R; Pietretti, Danilo; Voogdt, Carlos G P; Westphal, Adrie H; Savelkoul, Huub F J; Forlenza, Maria; Wiegertjes, Geert F
2016-09-01
Toll-like receptors (TLRs) are fundamental components of innate immunity that play significant roles in the defence against pathogen invasion. In this study, we present the molecular characterization of the full-length coding sequence of tlr1, tlr2a and tlr2b from common carp (Cyprinus carpio). Each is encoded within a single exon and contains a conserved number of leucine-rich repeats, a transmembrane region and an intracellular TIR domain for signalling. Indeed, sequence, phylogenetic and synteny analysis of carp tlr1, tlr2a and tlr2b support that these genes are orthologues of mammalian TLR1 and TLR2. The tlr genes are expressed in various immune organs and cell types. Furthermore, the carp sequences exhibited a good three-dimensional fit with the heterodimer structure of human TLR1-TLR2, including the potential to bind to the ligand Pam3CSK4. This supports the possible formation of carp Tlr1-Tlr2 heterodimers. However, we were unable to demonstrate Tlr1/Tlr2-mediated ligand binding in transfected cell lines through NF-κB activation, despite showing the expression and co-localization of Tlr1 and Tlr2. We discuss possible limitations when studying ligand-specific activation of NF-κB after expression of Tlr1 and/or Tlr2 in human but also fish cell lines and we propose alternative future strategies for studying ligand-binding properties of fish Tlrs. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Fischer, Iris; Steige, Kim A.; Stephan, Wolfgang; Mboup, Mamadou
2013-01-01
The wild tomato species Solanum chilense and S. peruvianum are a valuable non-model system for studying plant adaptation since they grow in diverse environments facing many abiotic constraints. Here we investigate the sequence evolution of regulatory regions of drought and cold responsive genes and their expression regulation. The coding regions of these genes were previously shown to exhibit signatures of positive selection. Expression profiles and sequence evolution of regulatory regions of members of the Asr (ABA/water stress/ripening induced) gene family and the dehydrin gene pLC30-15 were analyzed in wild tomato populations from contrasting environments. For S. chilense, we found that Asr4 and pLC30-15 appear to respond much faster to drought conditions in accessions from very dry environments than accessions from more mesic locations. Sequence analysis suggests that the promoter of Asr2 and the downstream region of pLC30-15 are under positive selection in some local populations of S. chilense. By investigating gene expression differences at the population level we provide further support of our previous conclusions that Asr2, Asr4, and pLC30-15 are promising candidates for functional studies of adaptation. Our analysis also demonstrates the power of the candidate gene approach in evolutionary biology research and highlights the importance of wild Solanum species as a genetic resource for their cultivated relatives. PMID:24205149
Brabant, Magali; Baux, Ludwig; Casimir, Richard; Briand, Jean Paul; Chaloin, Olivier; Porceddu, Mathieu; Buron, Nelly; Chauvier, David; Lassalle, Myriam; Lecoeur, Hervé; Langonné, Alain; Dupont, Sylvie; Déas, Olivier; Brenner, Catherine; Rebouillat, Dominique; Muller, Sylviane; Borgne-Sanchez, Annie; Jacotot, Etienne
2009-10-01
Dengue viruses belong to the Flavivirus family and are responsible for hemorrhagic fever in Human. Dengue virus infection triggers apoptosis especially through the expression of the small membrane (M) protein. Using isolated mitochondria, we found that synthetic peptides containing the C-terminus part of the M ectodomain caused apoptosis-related mitochondrial membrane permeabilization (MMP) events. These events include matrix swelling and the dissipation of the mitochondrial transmembrane potential (DeltaPsi(m)). Protein M Flavivirus sequence alignments and helical wheel projections reveal a conserved distribution of charged residues. Moreover, when combined to the cell penetrating HIV-1 Tat peptide transduction domain (Tat-PTD), this sequence triggers a caspase-dependent cell death associated with DeltaPsi(m) loss and cytochrome c release. Mutational approaches coupled to functional screening on isolated mitochondria resulted in the selection of a protein M derived sequence containing nine residues with potent MMP-inducing properties on isolated mitochondria. A chimeric peptide composed of a Tat-PTD linked to the 9-mer entity triggers MMP and cell death. Finally, local administration of this chimeric peptide induces growth inhibition of xenograft prostate PC3 tumors in immuno-compromised mice, and significantly enhances animal survival. Together, these findings support the notion of using viral genomes as valuable sources to discover mitochondria-targeted sequences that may lead to the development of new anticancer compounds.
Molecular phylogeny of noctilucoid dinoflagellates (Noctilucales, Dinophyceae).
Gómez, Fernando; Moreira, David; López-García, Purificación
2010-07-01
The order Noctilucales or class Noctiluciphyceae encompasses three families of aberrant dinoflagellates (Noctilucaceae, Leptodiscaceae and Kofoidiniaceae) that, at least in some life stages, lack typical dinoflagellate characters such as the ribbon-like transversal flagellum or condensed chromosomes. Noctiluca scintillans, the first dinoflagellate to be described, has been intensively investigated. However, its phylogenetic position based on the small subunit ribosomal DNA (SSU rDNA) sequence is unstable and controversial. Noctiluca has been placed either as an early diverging lineage that diverged after Oxyrrhis and before the dinokaryotes -core dinoflagellates- or as a recent lineage branching from unarmoured dino fl agellates in the order Gymnodiniales. So far, the lack of other noctilucoid sequences has hampered the elucidation of their phylogenetic relationships to other dino fl agellates. Furthermore, even the monophyly of the noctilucoids remained uncertain. We have determined SSU rRNA gene sequences for Kofoidiniaceae, those of the type Spatulodinium (=Gymnodinium) pseudonoctiluca and another Spatulodinium species, as well as of two species of Kofoidinium, and the first gene sequence of Leptodiscaceae, that of Abedinium (=Leptophyllus) dasypus. These taxa were collected from their type localities, the English Channel and the NW Mediterranean Sea, respectively. Phylogenetic analyses place the Noctilucales as a monophyletic group at a basal position close to parasites of the Marine Alveolate Group I (MAGI) and the Syndiniales (MAGII), before the core of dinokaryotic dinoflagellates, although with moderate support. 2010 Elsevier GmbH. All rights reserved.
Arabidopsis intragenomic conserved noncoding sequence
Thomas, Brian C.; Rapaka, Lakshmi; Lyons, Eric; Pedersen, Brent; Freeling, Michael
2007-01-01
After the most recent tetraploidy in the Arabidopsis lineage, most gene pairs lost one, but not both, of their duplicates. We manually inspected the 3,179 retained gene pairs and their surrounding gene space still present in the genome using a custom-made viewer application. The display of these pairs allowed us to define intragenic conserved noncoding sequences (CNSs), identify exon annotation errors, and discover potentially new genes. Using a strict algorithm to sort high-scoring pair sequences from the bl2seq data, we created a database of 14,944 intragenomic Arabidopsis CNSs. The mean CNS length is 31 bp, ranging from 15 to 285 bp. There are ≈1.7 CNSs associated with a typical gene, and Arabidopsis CNSs are found in all areas around exons, most frequently in the 5′ upstream region. Gene ontology classifications related to transcription, regulation, or “response to …” external or endogenous stimuli, especially hormones, tend to be significantly overrepresented among genes containing a large number of CNSs, whereas protein localization, transport, and metabolism are common among genes with no CNSs. There is a 1.5% overlap between these CNSs and the 218,982 putative RNAs in the Arabidopsis Small RNA Project database, allowing for two mismatches. These CNSs provide a unique set of noncoding sequences enriched for function. CNS function is implied by evolutionary conservation and independently supported because CNS-richness predicts regulatory gene ontology categories. PMID:17301222
Swain, Timothy D
2018-01-01
The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.
Kowasupat, Chanon; Panijpan, Bhinyo; Laosinchai, Parames; Ruenwongsa, Pintip; Phongdara, Amornrat; Wanna, Warapond; Senapin, Saengchan; Phiwsaiya, Kornsunee
2014-01-01
In Thailand, there are currently five recognized species members of the bubble-nesting Betta genus, namely Betta splendens, B. smaragdina, B. imbellis, B. mahachaiensis and B. siamorientalis. In 2010, we indicated the possibility, based on COI barcoding evidence, that there might be two additional species, albeit cryptic, related to the type-locality B. smaragdina in some provinces in the northeast of Thailand. In the present study, after a more extensive survey of the northeast, and phylogenetic analyses based on COI and ITS1 sequences, the B. smaragdina group may be composed of at least 3 cryptic species members. The phylogenetic positions of these B. smaragdina group members in the bubble-nesting bettas' tree together with those of their congeners have been consolidated by better DNA sequence quality and phylogenetic analyses. With a better supported tree, the species statuses of B. siamorientalis and the Cambodian B. smaragdina-like fish, B. stiktos, are also confirmed. PMID:25606392
Epigenetic Instability due to Defective Replication of Structured DNA
Sarkies, Peter; Reams, Charlie; Simpson, Laura J.; Sale, Julian E.
2010-01-01
Summary The accurate propagation of histone marks during chromosomal replication is proposed to rely on the tight coupling of replication with the recycling of parental histones to the daughter strands. Here, we show in the avian cell line DT40 that REV1, a key regulator of DNA translesion synthesis at the replication fork, is required for the maintenance of repressive chromatin marks and gene silencing in the vicinity of DNA capable of forming G-quadruplex (G4) structures. We demonstrate a previously unappreciated requirement for REV1 in replication of G4 forming sequences and show that transplanting a G4 forming sequence into a silent locus leads to its derepression in REV1-deficient cells. Together, our observations support a model in which failure to maintain processive DNA replication at G4 DNA in REV1-deficient cells leads to uncoupling of DNA synthesis from histone recycling, resulting in localized loss of repressive chromatin through biased incorporation of newly synthesized histones. PMID:21145480
Discrete Circuits Support Generalized versus Context-Specific Vocal Learning in the Songbird.
Tian, Lucas Y; Brainard, Michael S
2017-12-06
Motor skills depend on the reuse of individual gestures in multiple sequential contexts (e.g., a single phoneme in different words). Yet optimal performance requires that a given gesture be modified appropriately depending on the sequence in which it occurs. To investigate the neural architecture underlying such context-dependent modifications, we studied Bengalese finch song, which, like speech, consists of variable sequences of "syllables." We found that when birds are instructed to modify a syllable in one sequential context, learning generalizes across contexts; however, if unique instruction is provided in different contexts, learning is specific for each context. Using localized inactivation of a cortical-basal ganglia circuit specialized for song, we show that this balance between generalization and specificity reflects a hierarchical organization of neural substrates. Primary motor circuitry encodes a core syllable representation that contributes to generalization, while top-down input from cortical-basal ganglia circuitry biases this representation to enable context-specific learning. Copyright © 2017 Elsevier Inc. All rights reserved.
The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew
Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenicmore » DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.« less
Jobke, B.; Bolbos, R.; Saadat, E.; Cheng, J.; Li, X.; Majumdar, S.
2012-01-01
The application of biomolecular magnetic resonance imaging becomes increasingly important in the context of early cartilage changes in degenerative and inflammatory joint disease before gross morphological changes become apparent. In this limited technical report, we investigate the correlation of MRI T1, T2 and T1
Cantharellus violaceovinosus, a new species from tropical Quercus forests in eastern Mexico
Herrera, Mariana; Bandala, Victor M.; Montoya, Leticia
2018-01-01
Abstract During explorations of tropical oak forests in central Veracruz (eastern Mexico), the authors discovered a Cantharellus species that produces basidiomes with strikingly violet pileus and a hymenium with yellow, raised gill-like folds. It is harvested locally and valued as a prized edible wild mushroom. Systematic multiyear sampling of basidiomes allowed the recording of the morphological variation exhibited by fresh fruit bodies in different growth stages, which supports the recognition of this Cantharellus species from others in the genus. Two molecular phylogenetic analyses based on a set of sequences of species of all major clades in Cantharellus, one including sequences of the transcription elongation factor 1-alpha (tef-1α) and a combined tef-1α and nLSU region (the large subunit of the ribosome), confirm the isolated position of the new species in a clade close to C. lewisii from USA, in the subgenus Cantharellus. Detailed macroscopic and microscopic descriptions, accompanied by illustrations and a taxonomic discussion are presented. PMID:29681739
Cantharellus violaceovinosus, a new species from tropical Quercus forests in eastern Mexico.
Herrera, Mariana; Bandala, Victor M; Montoya, Leticia
2018-01-01
During explorations of tropical oak forests in central Veracruz (eastern Mexico), the authors discovered a Cantharellus species that produces basidiomes with strikingly violet pileus and a hymenium with yellow, raised gill-like folds. It is harvested locally and valued as a prized edible wild mushroom. Systematic multiyear sampling of basidiomes allowed the recording of the morphological variation exhibited by fresh fruit bodies in different growth stages, which supports the recognition of this Cantharellus species from others in the genus. Two molecular phylogenetic analyses based on a set of sequences of species of all major clades in Cantharellus , one including sequences of the transcription elongation factor 1-alpha (tef-1α) and a combined tef-1α and nLSU region (the large subunit of the ribosome), confirm the isolated position of the new species in a clade close to C. lewisii from USA, in the subgenus Cantharellus. Detailed macroscopic and microscopic descriptions, accompanied by illustrations and a taxonomic discussion are presented.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bejerman, Nicolás, E-mail: n.bejerman@uq.edu.au; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072; Giolitti, Fabián
Summary: We have determined the full-length 14,491-nucleotide genome sequence of a new plant rhabdovirus, alfalfa dwarf virus (ADV). Seven open reading frames (ORFs) were identified in the antigenomic orientation of the negative-sense, single-stranded viral RNA, in the order 3′-N-P-P3-M-G-P6-L-5′. The ORFs are separated by conserved intergenic regions and the genome coding region is flanked by complementary 3′ leader and 5′ trailer sequences. Phylogenetic analysis of the nucleoprotein amino acid sequence indicated that this alfalfa-infecting rhabdovirus is related to viruses in the genus Cytorhabdovirus. When transiently expressed as GFP fusions in Nicotiana benthamiana leaves, most ADV proteins accumulated in the cellmore » periphery, but unexpectedly P protein was localized exclusively in the nucleus. ADV P protein was shown to have a homotypic, and heterotypic nuclear interactions with N, P3 and M proteins by bimolecular fluorescence complementation. ADV appears unique in that it combines properties of both cytoplasmic and nuclear plant rhabdoviruses. - Highlights: • The complete genome of alfalfa dwarf virus is obtained. • An integrated localization and interaction map for ADV is determined. • ADV has a genome sequence similarity and evolutionary links with cytorhabdoviruses. • ADV protein localization and interaction data show an association with the nucleus. • ADV combines properties of both cytoplasmic and nuclear plant rhabdoviruses.« less
Iwamoto, Susumu; Tokumasu, Seiji; Suyama, Yoshihisa; Kakishima, Makoto
2005-01-01
We investigated intraspecific diversity and genetic structures of a saprotrophic fungus--Thysanophora penicillioides--based on sequences of nuclear ribosomal internal transcribed spacer (ITS) in 15 discontinuous Abies mariesii forests of Japan. In such a well-defined morphological species, numerous unexpected ITS variations were revealed: 12 ITS sequence types detected in 254 isolates collected from 15 local populations were classified into five ITS sequence groups. Maximally, four ITS groups consisted of seven ITS types coexisting in one population. However, group 1 was dominant with approximately 65%; in particular, one haplotype, 1a, was most dominant with approximately 60% in respective populations. Therefore, few differences were recognized in genetic structure among local populations, implying that the gene flow of each lineage of the fungus occurs among local populations without geographic limitations. However, minor haplotypes in some ITS groups were found only in restricted areas, suggesting that they might expand steadily from their places of origin to neighboring A. mariesii forests. Aggregating sequence data of seven European strains and four North American strains from various substrates to those of Japanese strains, 18 ITS sequence types and 28 variable sites were recognized. They were clustered into nine lineages by phylogenetic analyses of the beta-tubulin and combined ITS and beta-tubulin datasets. According to phylogenetic species recognition by the concordance of genealogies, respective lineages correspond to phylogenetic species. Plural phylogenetic species coexist in a local population in an A. mariesii forest in Japan.
Canadian Support for Elementary and Secondary Education (1998-1999).
ERIC Educational Resources Information Center
Jefferson, Anne L.
Financial support of elementary and secondary education in Canada has gone from absolute local dependence to local with some government support to local with substantial government support to current total government support with restricted, if any allowed, local support. This has been challenged on constitutional grounds in two provinces, Alberta…
Nyakatura, John A; Heymann, Eckhard W
2010-03-01
The adoption of a specific gait sequence pattern during symmetrical locomotion has been proposed to have been a key advantage for the exploitation of the fine branch niche in early primates. Diverse aspects of primate locomotion have been extensively studied in technically equipped laboratory settings, but evolutionary conclusions derived from these investigations have rarely been verified in wild primates. Bridging the gap from the lab to the field, we conducted an actual performance determination of symmetrical gaits in two free-ranging tamarin species (Saguinus mystax and Saguinus fuscicollis) of Amazonian Peru by analyzing high-speed video recordings of naturally occurring locomotor bouts. Tamarins arguably represent viable models for aspects of early primate locomotion. We tested three specific hypotheses derived from laboratory studies to test for the influence of support size and orientation and to gain further insight into the functional significance of primate gait sequence patterns: (1) The tamarins utilize symmetrical gaits at a higher rate on small supports than on larger ones. (2) During symmetrical locomotion on small supports, diagonal sequences are utilized at a higher rate than on larger supports. (3) On inclines, diagonal sequences are predominantly used and on declines, lateral sequences are predominantly used. Our results corroborated hypotheses 1 and 3. We found no clear support for hypothesis 2. In conclusion, our results add to the notion that primate gait plasticity, rather than uniform adoption of diagonal sequence gaits, enabled early primates to accommodate different support types and effectively exploit the small branch niche. Copyright 2009 Elsevier Ltd. All rights reserved.
Hirsch, B; Endris, V; Lassmann, S; Weichert, W; Pfarr, N; Schirmacher, P; Kovaleva, V; Werner, M; Bonzheim, I; Fend, F; Sperveslage, J; Kaulich, K; Zacher, A; Reifenberger, G; Köhrer, K; Stepanow, S; Lerke, S; Mayr, T; Aust, D E; Baretton, G; Weidner, S; Jung, A; Kirchner, T; Hansmann, M L; Burbat, L; von der Wall, E; Dietel, M; Hummel, M
2018-04-01
The simultaneous detection of multiple somatic mutations in the context of molecular diagnostics of cancer is frequently performed by means of amplicon-based targeted next-generation sequencing (NGS). However, only few studies are available comparing multicenter testing of different NGS platforms and gene panels. Therefore, seven partner sites of the German Cancer Consortium (DKTK) performed a multicenter interlaboratory trial for targeted NGS using the same formalin-fixed, paraffin-embedded (FFPE) specimen of molecularly pre-characterized tumors (n = 15; each n = 5 cases of Breast, Lung, and Colon carcinoma) and a colorectal cancer cell line DNA dilution series. Detailed information regarding pre-characterized mutations was not disclosed to the partners. Commercially available and custom-designed cancer gene panels were used for library preparation and subsequent sequencing on several devices of two NGS different platforms. For every case, centrally extracted DNA and FFPE tissue sections for local processing were delivered to each partner site to be sequenced with the commercial gene panel and local bioinformatics. For cancer-specific panel-based sequencing, only centrally extracted DNA was analyzed at seven sequencing sites. Subsequently, local data were compiled and bioinformatics was performed centrally. We were able to demonstrate that all pre-characterized mutations were re-identified correctly, irrespective of NGS platform or gene panel used. However, locally processed FFPE tissue sections disclosed that the DNA extraction method can affect the detection of mutations with a trend in favor of magnetic bead-based DNA extraction methods. In conclusion, targeted NGS is a very robust method for simultaneous detection of various mutations in FFPE tissue specimens if certain pre-analytical conditions are carefully considered.
NASA Astrophysics Data System (ADS)
Beaulieu, Jacques-Louis de; Brugiapaglia, Elisabetta; Joannin, Sébastien; Guiter, Frédéric; Zanchetta, Giovanni; Wulf, Sabine; Peyron, Odile; Bernardo, Liliana; Didier, Julien; Stock, Agnès; Rius, Damien; Magny, Michel
2017-02-01
Retrospective science such as palaeoecology deeply depends on the preservation of archives in sensitive places. As an example, mountains of medium altitude from Mediterranean peninsulas have long been identified by biogeographers as refuges zones allowing the survival of European temperate taxa during the ice ages, but archives to validate this hypothesis are scarce, especially in Southern Italy. Here we present a new sequence from Lago Trifoglietti (1048 m a.s.l.) in the Calabrian Mountains, which covers the Late Glacial Interstadial (LGI, corresponding to the Bölling-Alleröd period in northern-central Europe) and the transition to the Holocene. The independent chronology based on seven radiocarbon dates is supported by the evidence of three tephra layers already identified in other regional sequences. During the LGI, besides the high diversity of non arboreal pollen grains, a great number of pollens of temperate forest trees are present or abundant (mostly deciduous oaks and fir). These assemblages suggest that the site was above but not far from the upper limit of diversified woodland stands. They confirm a local survival during the last glacial. The Younger Dryas is not marked by major changes, and oak percentages are even higher, suggesting a resilient expansion at lower altitude. Surprisingly the site remains above the timberline until an aridity crisis centered at 11,100 cal 14C yr PB, which is correlated with the Preboreal Oscillation (PBO). This event is immediately followed by the local settlement of a dense fir and beech forest around the lake. A comparison with other Italian key sequences aims at explaining the climate forcing factors that governed this original vegetation dynamic. Further investigations using additional proxies are needed for a more robust climate reconstruction.
Feldkaemper, Marita P; Burkhardt, Eva; Schaeffel, Frank
2004-09-01
Myopia is a condition in which the eye is too long for the focal length of cornea and lens. Analysis of the messengers that are released by the retina to control axial eye growth in the animal model of the chicken revealed that glucagon-immunoreactive amacrine cells are involved in the retinal image processing that controls the growth of the sclera. It was found that the amount of retinal glucagon mRNA increased during treatment with positive lenses and pharmacological studies supported the idea that glucagon may act as a stop signal for eye growth. Glucagon exerts its regulatory effects by binding to a single type of glucagon receptor. In this study, we have sequenced the chicken glucagon receptor and compared its DNA and amino acid sequence with the human and mouse homologues. After sequencing about 80% of the receptor, we found a homology between 79.4 and 75.6% on cDNA level. At the protein level, about 73% of the amino acids were identical. Moreover, the cellular localization and regulation of the glucagon receptor in the chick retina was studied. In situ hybridization studies showed that many cells in the ganglion cell layer and inner nuclear layer, and some cells in the outer nuclear layer, express the receptor mRNA. Injection of the glucagon agonist Lys17,18,Glu21-glucagon induced a down-regulation of glucagon receptor mRNA content. Since the mouse would be an attractive mammalian model to study the biochemical and genetic basis of myopia, and because recent studies have demonstrated that form deprivation myopia can be induced, the expression of preproglucagon and glucagon receptor genes were also studied in the mouse retina and were found to be expressed.
Characterization and Prediction of Protein Phosphorylation Hotspots in Arabidopsis thaliana.
Christian, Jan-Ole; Braginets, Rostyslav; Schulze, Waltraud X; Walther, Dirk
2012-01-01
The regulation of protein function by modulating the surface charge status via sequence-locally enriched phosphorylation sites (P-sites) in so called phosphorylation "hotspots" has gained increased attention in recent years. We set out to identify P-hotspots in the model plant Arabidopsis thaliana. We analyzed the spacing of experimentally detected P-sites within peptide-covered regions along Arabidopsis protein sequences as available from the PhosPhAt database. Confirming earlier reports (Schweiger and Linial, 2010), we found that, indeed, P-sites tend to cluster and that distributions between serine and threonine P-sites to their respected closest next P-site differ significantly from those for tyrosine P-sites. The ability to predict P-hotspots by applying available computational P-site prediction programs that focus on identifying single P-sites was observed to be severely compromised by the inevitable interference of nearby P-sites. We devised a new approach, named HotSPotter, for the prediction of phosphorylation hotspots. HotSPotter is based primarily on local amino acid compositional preferences rather than sequence position-specific motifs and uses support vector machines as the underlying classification engine. HotSPotter correctly identified experimentally determined phosphorylation hotspots in A. thaliana with high accuracy. Applied to the Arabidopsis proteome, HotSPotter-predicted 13,677 candidate P-hotspots in 9,599 proteins corresponding to 7,847 unique genes. Hotspot containing proteins are involved predominantly in signaling processes confirming the surmised modulating role of hotspots in signaling and interaction events. Our study provides new bioinformatics means to identify phosphorylation hotspots and lays the basis for further investigating novel candidate P-hotspots. All phosphorylation hotspot annotations and predictions have been made available as part of the PhosPhAt database at http://phosphat.mpimp-golm.mpg.de.
Andronesi, Ovidiu C.; Ramadan, Saadallah; Mountford, Carolyn E.; Sorensen, A. Gregory
2011-01-01
Novel low-power adiabatic sequences are demonstrated for in-vivo localized two-dimensional (2D) correlated MR spectroscopy, such as COSY (Correlated Spectroscopy) and TOCSY (Total Correlated Spectroscopy). The design is based on three new elements for in-vivo 2D MRS: the use of gradient modulated constant adiabaticity GOIA-W(16,4) pulses for i) localization (COSY and TOCSY) and ii) mixing (TOCSY), and iii) the use of longitudinal mixing (z-filter) for magnetization transfer during TOCSY. GOIA-W(16,4) provides accurate signal localization, and more importantly, lowers the SAR for both TOCSY mixing and localization. Longitudinal mixing improves considerably (five-folds) the efficiency of TOCSY transfer. These are markedly different from previous 1D editing TOCSY sequences using spatially non-selective pulses and transverse mixing. Fully adiabatic (adiabatic mixing with adiabatic localization) and semi-adiabatic (adiabatic mixing with non-adiabatic localization) methods for 2D TOCSY are compared. Results are presented for simulations, phantoms, and in-vivo 2D spectra from healthy volunteers and patients with brain tumors obtained on 3T clinical platforms equipped with standard hardware. To the best of our knowledge this is the first demonstration of in-vivo adiabatic 2D TOCSY and fully adiabatic 2D COSY. It is expected that these methodological developments will advance the in-vivo applicability of multi(spectrally)dimensional MRS to reliably identify metabolic biomarkers. PMID:20890988
33 CFR 385.30 - Master Implementation Sequencing Plan.
Code of Federal Regulations, 2014 CFR
2014-07-01
... 33 Navigation and Navigable Waters 3 2014-07-01 2014-07-01 false Master Implementation Sequencing... Incorporating New Information Into the Plan § 385.30 Master Implementation Sequencing Plan. (a) Not later than... of Environmental Protection, and other Federal, State, and local agencies, develop a Master...
33 CFR 385.30 - Master Implementation Sequencing Plan.
Code of Federal Regulations, 2013 CFR
2013-07-01
... 33 Navigation and Navigable Waters 3 2013-07-01 2013-07-01 false Master Implementation Sequencing... Incorporating New Information Into the Plan § 385.30 Master Implementation Sequencing Plan. (a) Not later than... of Environmental Protection, and other Federal, State, and local agencies, develop a Master...
33 CFR 385.30 - Master Implementation Sequencing Plan.
Code of Federal Regulations, 2012 CFR
2012-07-01
... 33 Navigation and Navigable Waters 3 2012-07-01 2012-07-01 false Master Implementation Sequencing... Incorporating New Information Into the Plan § 385.30 Master Implementation Sequencing Plan. (a) Not later than... of Environmental Protection, and other Federal, State, and local agencies, develop a Master...
Next-generation sequencing for targeted discovery of rare mutations in rice
USDA-ARS?s Scientific Manuscript database
Advances in DNA sequencing (i.e., next-generation sequencing, NGS) have greatly increased the power and efficiency of detecting rare mutations in large mutant populations. Targeting Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach for identifying gene mutations resulting fro...
McCutchen-Maloney, Sandra L.
2002-01-01
DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon
2017-12-15
The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Pan, Wen-Yu; Lin, Kun-Ju; Huang, Chieh-Cheng; Chiang, Wei-Lun; Lin, Yu-Jung; Lin, Wei-Chih; Chuang, Er-Yuan; Chang, Yen; Sung, Hsing-Wen
2016-09-01
Combination chemotherapy with multiple drugs commonly requires several injections on various schedules, and the probability that the drug molecules reach the diseased tissues at the proper time and effective therapeutic concentrations is very low. This work elucidates an injectable co-delivery system that is based on cationic liposomes that are adsorbed on anionic hollow microspheres (Lipos-HMs) via electrostatic interaction, from which the localized sequence-specific release of a chemopreventive agent (1,25(OH)2D3) and an anticancer drug (doxorubicin; DOX) can be thermally driven in a time-controllable manner by an externally applied high-frequency magnetic field (HFMF). Lipos-HMs can greatly promote the accumulation of reactive oxygen species (ROS) in tumor cells by reducing their cytoplasmic expression of an antioxidant enzyme (superoxide dismutase) by 1,25(OH)2D3, increasing the susceptibility of cancer cells to the cytotoxic action of DOX. In nude mice that bear xenograft tumors, treatment with Lipos-HMs under exposure to HFMF effectively inhibits tumor growth and is the most effective therapeutic intervention among all the investigated. These empirical results demonstrate that the synergistic anticancer effects of sequential release of 1,25(OH)2D3 and DOX from the Lipos-HMs may have potential for maximizing DOX cytotoxicity, supporting more effective cancer treatment. Copyright © 2016 Elsevier Ltd. All rights reserved.
Zhang, Shuai; Lun, Zhao-Rong; Wu, Zhong-Dao; Fan, Chia-Kwung; Brown, Christopher L.; Cheng, Po-Ching; Peng, Shih-Yi; Yang, Ting-Bao
2017-01-01
Angiostrongylus cantonensis is of increasing public health importance as the main zoonotic pathogen causing eosinophilic meningitis or meningoencephalitis, which has been documented all over the world. However, there are very limited studies about its phylogeography and spread pattern. In the present study, the phylogeography of A. cantonensis in southern China (including Taiwan) and partial areas of Southeast Asia were studied based on the sequences of complete mitochondrial cytochrome b (Cytb) gene. A total of 520 individuals of A. cantonensis obtained from 13 localities were sequenced for the analyses and grouped into 42 defined haplotypes. The phylogenetic tree (NJ tree and BI tree) revealed a characteristic distribution pattern of the four main lineages, with detectable geographic structure. Genetic differentiation among populations was significant, but demographic expansion could not be detected by either neutrality tests or mismatch distribution analysis, which implied a low gene flow among the local populations in different regions where the samples were collected. Two unique lineages of the A. cantonensis population in Taiwan were detected, which suggests its multiple origin in the island. Populations in Hekou (China) and Laos showed the highest genetic diversities, which were supported by both genetic diversity indices and AMOVA. These results together infer that the area around Thailand or Hekou in Yunnan province, China are the most likely origins of Angiostrongylus cantonensis. PMID:28827809
Peng, Jian; He, Zhang-Ping; Zhang, Shuai; Lun, Zhao-Rong; Wu, Zhong-Dao; Fan, Chia-Kwung; Brown, Christopher L; Cheng, Po-Ching; Peng, Shih-Yi; Yang, Ting-Bao
2017-08-01
Angiostrongylus cantonensis is of increasing public health importance as the main zoonotic pathogen causing eosinophilic meningitis or meningoencephalitis, which has been documented all over the world. However, there are very limited studies about its phylogeography and spread pattern. In the present study, the phylogeography of A. cantonensis in southern China (including Taiwan) and partial areas of Southeast Asia were studied based on the sequences of complete mitochondrial cytochrome b (Cytb) gene. A total of 520 individuals of A. cantonensis obtained from 13 localities were sequenced for the analyses and grouped into 42 defined haplotypes. The phylogenetic tree (NJ tree and BI tree) revealed a characteristic distribution pattern of the four main lineages, with detectable geographic structure. Genetic differentiation among populations was significant, but demographic expansion could not be detected by either neutrality tests or mismatch distribution analysis, which implied a low gene flow among the local populations in different regions where the samples were collected. Two unique lineages of the A. cantonensis population in Taiwan were detected, which suggests its multiple origin in the island. Populations in Hekou (China) and Laos showed the highest genetic diversities, which were supported by both genetic diversity indices and AMOVA. These results together infer that the area around Thailand or Hekou in Yunnan province, China are the most likely origins of Angiostrongylus cantonensis.
Anchor-Free Localization Method for Mobile Targets in Coal Mine Wireless Sensor Networks
Pei, Zhongmin; Deng, Zhidong; Xu, Shuo; Xu, Xiao
2009-01-01
Severe natural conditions and complex terrain make it difficult to apply precise localization in underground mines. In this paper, an anchor-free localization method for mobile targets is proposed based on non-metric multi-dimensional scaling (Multi-dimensional Scaling: MDS) and rank sequence. Firstly, a coal mine wireless sensor network is constructed in underground mines based on the ZigBee technology. Then a non-metric MDS algorithm is imported to estimate the reference nodes’ location. Finally, an improved sequence-based localization algorithm is presented to complete precise localization for mobile targets. The proposed method is tested through simulations with 100 nodes, outdoor experiments with 15 ZigBee physical nodes, and the experiments in the mine gas explosion laboratory with 12 ZigBee nodes. Experimental results show that our method has better localization accuracy and is more robust in underground mines. PMID:22574048
Anchor-free localization method for mobile targets in coal mine wireless sensor networks.
Pei, Zhongmin; Deng, Zhidong; Xu, Shuo; Xu, Xiao
2009-01-01
Severe natural conditions and complex terrain make it difficult to apply precise localization in underground mines. In this paper, an anchor-free localization method for mobile targets is proposed based on non-metric multi-dimensional scaling (Multi-dimensional Scaling: MDS) and rank sequence. Firstly, a coal mine wireless sensor network is constructed in underground mines based on the ZigBee technology. Then a non-metric MDS algorithm is imported to estimate the reference nodes' location. Finally, an improved sequence-based localization algorithm is presented to complete precise localization for mobile targets. The proposed method is tested through simulations with 100 nodes, outdoor experiments with 15 ZigBee physical nodes, and the experiments in the mine gas explosion laboratory with 12 ZigBee nodes. Experimental results show that our method has better localization accuracy and is more robust in underground mines.
Guo, Kang-kang; Tang, Qing-hai; Zhang, Yan-ming; Kang, Kai; He, Lei
2011-05-18
The membrane topology and molecular mechanisms for endoplasmic reticulum (ER) localization of classical swine fever virus (CSFV) non-structural 2 (NS2) protien is unclear. We attempted to elucidate the subcellular localization, and the molecular mechanisms responsible for the localization of this protein in our study. The NS2 gene was amplified by reverse transcription polymerase chain reaction, with the transmembrane region and hydrophilicity of the NS2 protein was predicted by bioinformatics analysis. Twelve cDNAs of the NS2 gene were amplified by the PCR deletion method and cloned into a eukaryotic expression vector, which was transfected into a swine umbilical vein endothelial cell line (SUVEC). Subcellular localization of the NS2 protein was characterized by confocal microscopy, and western blots were carried out to analyze protein expression. Our results showed that the -NH2 terminal of the CSFV NS2 protein was highly hydrophobic and the protein localized in the ER. At least four transmembrane regions and two internal signal peptide sequences (amino acids103-138 and 220-262) were identified and thought to be critical for its trans-localization to the ER. This is the first study to identify the internal signal peptide sequences of the CSFV NS2 protein and its subcellular localization, providing the foundation for further exploration of this protein's function of this protein and its role in CSFV pathogenesis.
The right inferior frontal gyrus processes nested non-local dependencies in music.
Cheung, Vincent K M; Meyer, Lars; Friederici, Angela D; Koelsch, Stefan
2018-02-28
Complex auditory sequences known as music have often been described as hierarchically structured. This permits the existence of non-local dependencies, which relate elements of a sequence beyond their temporal sequential order. Previous studies in music have reported differential activity in the inferior frontal gyrus (IFG) when comparing regular and irregular chord-transitions based on theories in Western tonal harmony. However, it is unclear if the observed activity reflects the interpretation of hierarchical structure as the effects are confounded by local irregularity. Using functional magnetic resonance imaging (fMRI), we found that violations to non-local dependencies in nested sequences of three-tone musical motifs in musicians elicited increased activity in the right IFG. This is in contrast to similar studies in language which typically report the left IFG in processing grammatical syntax. Effects of increasing auditory working demands are moreover reflected by distributed activity in frontal and parietal regions. Our study therefore demonstrates the role of the right IFG in processing non-local dependencies in music, and suggests that hierarchical processing in different cognitive domains relies on similar mechanisms that are subserved by domain-selective neuronal subpopulations.
Segers, Laurent; Tiete, Jelmer; Braeken, An; Touhafi, Abdellah
2014-01-01
Indoor localization of persons and objects poses a great engineering challenge. Previously developed localization systems demonstrate the use of wideband techniques in ultrasound ranging systems. Direct sequence and frequency hopping spread spectrum ultrasound signals have been proven to achieve a high level of accuracy. A novel ranging method using the frequency hopping spread spectrum with finite impulse response filtering will be investigated and compared against the direct sequence spread spectrum. In the first setup, distances are estimated in a single-access environment, while in the second setup, two senders and one receiver are used. During the experiments, the micro-electromechanical systems are used as ultrasonic sensors, while the senders were implemented using field programmable gate arrays. Results show that in a single-access environment, the direct sequence spread spectrum method offers slightly better accuracy and precision performance compared to the frequency hopping spread spectrum. When two senders are used, measurements point out that the frequency hopping spread spectrum is more robust to near-far effects than the direct sequence spread spectrum. PMID:24553084
Learning of goal-relevant and -irrelevant complex visual sequences in human V1.
Rosenthal, Clive R; Mallik, Indira; Caballero-Gaudes, Cesar; Sereno, Martin I; Soto, David
2018-06-12
Learning and memory are supported by a network involving the medial temporal lobe and linked neocortical regions. Emerging evidence indicates that primary visual cortex (i.e., V1) may contribute to recognition memory, but this has been tested only with a single visuospatial sequence as the target memorandum. The present study used functional magnetic resonance imaging to investigate whether human V1 can support the learning of multiple, concurrent complex visual sequences involving discontinous (second-order) associations. Two peripheral, goal-irrelevant but structured sequences of orientated gratings appeared simultaneously in fixed locations of the right and left visual fields alongside a central, goal-relevant sequence that was in the focus of spatial attention. Pseudorandom sequences were introduced at multiple intervals during the presentation of the three structured visual sequences to provide an online measure of sequence-specific knowledge at each retinotopic location. We found that a network involving the precuneus and V1 was involved in learning the structured sequence presented at central fixation, whereas right V1 was modulated by repeated exposure to the concurrent structured sequence presented in the left visual field. The same result was not found in left V1. These results indicate for the first time that human V1 can support the learning of multiple concurrent sequences involving complex discontinuous inter-item associations, even peripheral sequences that are goal-irrelevant. Copyright © 2018. Published by Elsevier Inc.
USDA-ARS?s Scientific Manuscript database
Rice seeds of the temperate japonica cultivar Kitaake were mutagenized with sodium azide alone and in combination with methyl nitrosourea. Using the reduced representation sequencing method Restriction Enzyme Sequence Comparative Analysis (RESCAN), the mutation densities, types and local sequence co...
Localization and quantitative co-localization of enamelin with amelogenin.
Gallon, Victoria; Chen, Lisha; Yang, Xiudong; Moradian-Oldak, Janet
2013-08-01
Enamelin and amelogenin are vital proteins in enamel formation. The cooperative function of these two proteins controls crystal nucleation and morphology in vitro. We quantitatively analyzed the co-localization between enamelin and amelogenin by confocal microscopy and using two antibodies, one raised against a sequence in the porcine 32 kDa enamelin region and the other raised against full-length recombinant mouse amelogenin. We further investigated the interaction of the porcine 32 kDa enamelin and recombinant amelogenin using immuno-gold labeling. This study reports the quantitative co-localization results for postnatal days 1-8 mandibular mouse molars. We show that amelogenin and enamelin are secreted into the extracellular matrix on the cuspal slopes of the molars at day 1 and that secretion continues to at least day 8. Quantitative co-localization analysis (QCA) was performed in several different configurations using large (45 μm height, 33 μm width) and small (7 μm diameter) regions of interest to elucidate any patterns. Co-localization patterns in day 8 samples revealed that enamelin and amelogenin co-localize near the secretory face of the ameloblasts and appear to be secreted approximately in a 1:1 ratio. The degree of co-localization decreases as the enamel matures, both along the secretory face of ameloblasts and throughout the entire thickness of the enamel. Immuno-reactivity against enamelin is concentrated along the secretory face of ameloblasts, supporting the theory that this protein together with amelogenin is intimately involved in mineral induction at the beginning of enamel formation. Copyright © 2013 Elsevier Inc. All rights reserved.
Local backbone structure prediction of proteins
De Brevern, Alexandre G.; Benros, Cristina; Gautier, Romain; Valadié, Hélène; Hazout, Serge; Etchebest, Catherine
2004-01-01
Summary A statistical analysis of the PDB structures has led us to define a new set of small 3D structural prototypes called Protein Blocks (PBs). This structural alphabet includes 16 PBs, each one is defined by the (φ, Ψ) dihedral angles of 5 consecutive residues. The amino acid distributions observed in sequence windows encompassing these PBs are used to predict by a Bayesian approach the local 3D structure of proteins from the sole knowledge of their sequences. LocPred is a software which allows the users to submit a protein sequence and performs a prediction in terms of PBs. The prediction results are given both textually and graphically. PMID:15724288
Serotype-specific differences in dengue virus non-structural protein 5 nuclear localization.
Hannemann, Holger; Sung, Po-Yu; Chiu, Han-Chen; Yousuf, Amjad; Bird, Jim; Lim, Siew Pheng; Davidson, Andrew D
2013-08-02
The four serotypes of dengue virus (DENV-1 to -4) cause the most important arthropod-borne viral disease of humans. DENV non-structural protein 5 (NS5) contains enzymatic activities required for capping and replication of the viral RNA genome that occurs in the host cytoplasm. However, previous studies have shown that DENV-2 NS5 accumulates in the nucleus during infection. In this study, we examined the nuclear localization of NS5 for all four DENV serotypes. We demonstrate for the first time that there are serotypic differences in NS5 nuclear localization. Whereas the DENV-2 and -3 proteins accumulate in the nucleus, DENV-1 and -4 NS5 are predominantly if not exclusively localized to the cytoplasm. Comparative studies on the DENV-2 and -4 NS5 proteins revealed that the difference in DENV-4 NS5 nuclear localization was not due to rapid nuclear export but rather the lack of a functional nuclear localization sequence. Interaction studies using DENV-2 and -4 NS5 and human importin-α isoforms failed to identify an interaction that supported the differential nuclear localization of NS5. siRNA knockdown of the human importin-α isoform KPNA2, corresponding to the murine importin-α isoform previously shown to bind to DENV-2 NS5, did not substantially affect DENV-2 NS5 nuclear localization, whereas knockdown of importin-β did. The serotypic differences in NS5 nuclear localization did not correlate with differences in IL-8 gene expression. The results show that NS5 nuclear localization is not strictly required for virus replication but is more likely to have an auxiliary function in the life cycle of specific DENV serotypes.
Serotype-specific Differences in Dengue Virus Non-structural Protein 5 Nuclear Localization*
Hannemann, Holger; Sung, Po-Yu; Chiu, Han-Chen; Yousuf, Amjad; Bird, Jim; Lim, Siew Pheng; Davidson, Andrew D.
2013-01-01
The four serotypes of dengue virus (DENV-1 to -4) cause the most important arthropod-borne viral disease of humans. DENV non-structural protein 5 (NS5) contains enzymatic activities required for capping and replication of the viral RNA genome that occurs in the host cytoplasm. However, previous studies have shown that DENV-2 NS5 accumulates in the nucleus during infection. In this study, we examined the nuclear localization of NS5 for all four DENV serotypes. We demonstrate for the first time that there are serotypic differences in NS5 nuclear localization. Whereas the DENV-2 and -3 proteins accumulate in the nucleus, DENV-1 and -4 NS5 are predominantly if not exclusively localized to the cytoplasm. Comparative studies on the DENV-2 and -4 NS5 proteins revealed that the difference in DENV-4 NS5 nuclear localization was not due to rapid nuclear export but rather the lack of a functional nuclear localization sequence. Interaction studies using DENV-2 and -4 NS5 and human importin-α isoforms failed to identify an interaction that supported the differential nuclear localization of NS5. siRNA knockdown of the human importin-α isoform KPNA2, corresponding to the murine importin-α isoform previously shown to bind to DENV-2 NS5, did not substantially affect DENV-2 NS5 nuclear localization, whereas knockdown of importin-β did. The serotypic differences in NS5 nuclear localization did not correlate with differences in IL-8 gene expression. The results show that NS5 nuclear localization is not strictly required for virus replication but is more likely to have an auxiliary function in the life cycle of specific DENV serotypes. PMID:23770669
NASA Astrophysics Data System (ADS)
Márton, Emö; Madzin, Jozef; Bučová, Jana; Grabowski, Jacek; Plašienka, Dušan; Aubrecht, Roman
2017-04-01
The Hronic (Choč) units form the highest cover nappe system of the Central Western Carpathians which was emplaced over the Fatric (Krížna) nappe system during the Late Cretaceous. The Permian (red beds and lava flows) and Triassic (sediments) rocks, the main targets of our study, were affected only by diagenetic or very low-grade, burial-related recrystallization and were tilted and transported together. The pre-late Cretaceous sequence is overlapped by Paleogene mainly flysch sequences. Three laboratories (Bratislava, Budapest and Warsaw) were involved in standard paleomagnetic processing and AMS measurements of the samples, while Curie-points were determined in Budapest. The site/locality mean paleomagnetic directions obtained were significantly different from the local direction of the present Earth magnetic field, indicating the long term stability of the paleomagnetic signal. The magnetic fabrics varied from un-oriented to dominantly schistose with well-defined lineations. The latter were normally subhorizontal, although subvertical maxima also occurred among the Triassic sediments. Shallow inclinations, after tilt corrections, suggest near-equatorial position for most of the Permian and Lower Triassic, while around 20°N for the Middle-Upper Triassic localities. The paleomagnetic declinations are interpreted in terms of CW tectonic rotations, which are normally larger for the Permian than for the Triassic samples, although there are some differences within the same age groups. This may be attributed to differential movements during nappe emplacement or subsequent tectonic disturbances. For two localities from the Paleogene cover sequence of the Hronic units, close to the main sampling area (Low Tatra Mts) of the present study documented fairly large CCW rotations, thus obtained additional evidence for the general CCW rotation of the Central Western Carpathians during the Cenozoic. Thus, we conclude that the Cenozoic CCW rotation was pre-dated by large CW rotations, probably connected to the nappe emplacement. In addition, a pre-Jurassic moderate CW rotation is inferred from the difference in declinations between Triassic and Permian palaeomagnetic declinations. Acknowledgement: This work was financially supported by the Slovak Research and Development Agency under the contract No. APVV-0212-12 and by the Hungarian Scientific Research Fund OTKA K105245.
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
Danielsson, Frida; Wiking, Mikaela; Mahdessian, Diana; Skogs, Marie; Ait Blal, Hammou; Hjelmare, Martin; Stadler, Charlotte; Uhlén, Mathias; Lundberg, Emma
2013-01-04
One of the major challenges of a chromosome-centric proteome project is to explore in a systematic manner the potential proteins identified from the chromosomal genome sequence, but not yet characterized on a protein level. Here, we describe the use of RNA deep sequencing to screen human cell lines for RNA profiles and to use this information to select cell lines suitable for characterization of the corresponding gene product. In this manner, the subcellular localization of proteins can be analyzed systematically using antibody-based confocal microscopy. We demonstrate the usefulness of selecting cell lines with high expression levels of RNA transcripts to increase the likelihood of high quality immunofluorescence staining and subsequent successful subcellular localization of the corresponding protein. The results show a path to combine transcriptomics with affinity proteomics to characterize the proteins in a gene- or chromosome-centric manner.
Streamwise-Localized Solutions with natural 1-fold symmetry
NASA Astrophysics Data System (ADS)
Altmeyer, Sebastian; Willis, Ashley; Hof, Björn
2014-11-01
It has been proposed in recent years that turbulence is organized around unstable invariant solutions, which provide the building blocks of the chaotic dynamics. In direct numerical simulations of pipe flow we show that when imposing a minimal symmetry constraint (reflection in an axial plane only) the formation of turbulence can indeed be explained by dynamical systems concepts. The hypersurface separating laminar from turbulent motion, the edge of turbulence, is spanned by the stable manifolds of an exact invariant solution, a periodic orbit of a spatially localized structure. The turbulent states themselves (turbulent puffs in this case) are shown to arise in a bifurcation sequence from a related localized solution (the upper branch orbit). The rather complex bifurcation sequence involves secondary Hopf bifurcations, frequency locking and a period doubling cascade until eventually turbulent puffs arise. In addition we report preliminary results of the transition sequence for pipe flow without symmetry constraints.
Ouyang, Ping; Zhang, He; Fan, Zhaolan; Wei, Pei; Huang, Zhigang; Wang, Sen; Li, Tao
2016-11-05
NKX2.5 plays important roles in heart development. Being a transcription factor, NKX2.5 exerts its biological functions in nucleus. However, the sequence motif that localize NKX2.5 into nucleus is still not clear. Here, we found a R/K-rich sequence motif from Q187 to R197 (QNRRYKCKRQR) was required for exclusive nuclear localization of NKX2.5. Eight truncated plasmids (E109X, Q149X, Q170X, Q187X, Q198X, Y256X, Y259X, and C264X) which were associated with congenital heart disease (CHD) were constructed. Compared with the wild type NKX2.5, the proteins E109X, Q149X, Q170X, Q187X without intact homeodomain (HD) showed no transcriptional activity while Q198X, Y256X, Y259X and C264X with intact HD showed 50 to 66% transcriptional activity. E109X, Q149X, Q170X, Q187X without intact HD localized in the cytoplasm and nucleus simultaneously and Q198X, Y256X, Y259X and C264X with intact HD localized completely in nucleus. These results inferred the indispensability of 187QNRRYKCKRQR197 in exclusive nucleus localization. Additionally, this sequence motif was very conservative among human, mouse and rat, indicating this motif was important for NKX2.5 function. Thus, we concluded that R/K-rich sequence motif 187QNRRYKCKRQR197 played a central role for NKX2.5 nuclear localization. Our findings provided a clue to understand the mechanisms between the truncated NKX2.5 mutants and CHD. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pichon, L.; Carn, G.; Bouric, P.
1996-03-01
Positional cloning strategies for the hemochromatosis gene have previously concentrated on a target area restricted to a maximum genomic expanse of 400 kb around the HLA-A and HLA-F loci. Recently, the candidate region has been extended to 2-3 Mb on the distal side of the MHC. In this study, 10 coding sequences [hemochromatosis candidate genes (HCG) I to X] were isolated by cDNA selection using YACs covering the HLA-A/HLA-F subregion. Two of these (HCG II and HCG IV) belong to multigene families, as well as other sequences already described in this region, i.e., P5, pMC 6.7, and HLA class I.more » Fingerprinting of the four YACSs overlapping the region was performed and allowed partial localization of the different multigene family sequences on each YAC without defining their exact positions. Fingerprinting on cosmids isolated from the ICRF chromosome 6-specific cosmid library allowed more precise localization of the redundant sequences in all of the multigene families and revealed their apparent organization in clusters. Further examination of these intertwined sequences demonstrated that this structural organization resulted from a succession of complex phenomena, including duplications and contractions. This study presents a precise description of the structural organization of the HLA-A/HLA-F region and a determination of the sequences involved in the megabase size polymorphism observed among the A3, A24, and A31 haplotypes. 29 refs., 2 figs., 2 tabs.« less
Genomic Organization of the Drosophila Telomere RetrotransposableElements
DOE Office of Scientific and Technical Information (OSTI.GOV)
George, J.A.; DeBaryshe, P.G.; Traverse, K.L.
2006-10-16
The emerging sequence of the heterochromatic portion of the Drosophila melanogaster genome, with the most recent update of euchromatic sequence, gives the first genome-wide view of the chromosomal distribution of the telomeric retrotransposons, HeT-A, TART, and Tahre. As expected, these elements are entirely excluded from euchromatin, although sequence fragments of HeT-A and TART 3 untranslated regions are found in nontelomeric heterochromatin on the Y chromosome. The proximal ends of HeT-A/TART arrays appear to be a transition zone because only here do other transposable elements mix in the array. The sharp distinction between the distribution of telomeric elements and that ofmore » other transposable elements suggests that chromatin structure is important in telomere element localization. Measurements reported here show (1) D. melanogaster telomeres are very long, in the size range reported for inbred mouse strains (averaging 46 kb per chromosome end in Drosophila stock 2057). As in organisms with telomerase, their length varies depending on genotype. There is also slight under-replication in polytene nuclei. (2) Surprisingly, the relationship between the number of HeT-A and TART elements is not stochastic but is strongly correlated across stocks, supporting the idea that the two elements are interdependent. Although currently assembled portions of the HeT-A/TART arrays are from the most-proximal part of long arrays, {approx}61% of the total HeT-A sequence in these regions consists of intact, potentially active elements with little evidence of sequence decay, making it likely that the content of the telomere arrays turns over more extensively than has been thought.« less
Dupé, Aurélien; Dumas, Carole; Papadopoulou, Barbara
2015-01-01
Alba-domain proteins are RNA-binding proteins found in archaea and eukaryotes and recently studied in protozoan parasites where they play a role in the regulation of virulence factors and stage-specific proteins. This work describes in silico structural characterization, cellular localization and biochemical analyses of Alba-domain proteins in Leishmania infantum. We show that in contrast to other protozoa, Leishmania have two Alba-domain proteins, LiAlba1 and LiAlba3, representative of the Rpp20- and the Rpp25-like eukaryotic subfamilies, respectively, which share several sequence and structural similarities but also important differences with orthologs in other protozoa, especially in sequences targeted for post-translational modifications. LiAlba1 and LiAlba3 proteins form a complex interacting with other RNA-binding proteins, ribosomal subunits, and translation factors as supported by co-immunoprecipitation and sucrose gradient sedimentation analysis. A higher co-sedimentation of Alba proteins with ribosomal subunits was seen upon conditions of decreased translation, suggesting a role of these proteins in translational repression. The Leishmania Alba-domain proteins display differential cellular localization throughout the parasite development. In the insect promastigote stage, Alba proteins co-localize predominantly to the cytoplasm but they translocate to the nucleolus and the flagellum upon amastigote differentiation in the mammalian host and are found back to the cytoplasm once amastigote differentiation is completed. Heat-shock, a major signal of amastigote differentiation, triggers Alba translocation to the nucleolus and the flagellum. Purification of the Leishmania flagellum confirmed LiAlba3 enrichment in this organelle during amastigote differentiation. Moreover, partial characterization of the Leishmania flagellum proteome of promastigotes and differentiating amastigotes revealed the presence of other RNA-binding proteins, as well as differences in the flagellum composition between these two parasite lifestages. Shuttling of Alba-domain proteins between the cytoplasm and the nucleolus or the flagellum throughout the parasite life cycle suggests that these RNA-binding proteins participate in several distinct regulatory pathways controlling developmental gene expression in Leishmania.
Dupé, Aurélien; Dumas, Carole; Papadopoulou, Barbara
2015-01-01
Alba-domain proteins are RNA-binding proteins found in archaea and eukaryotes and recently studied in protozoan parasites where they play a role in the regulation of virulence factors and stage-specific proteins. This work describes in silico structural characterization, cellular localization and biochemical analyses of Alba-domain proteins in Leishmania infantum. We show that in contrast to other protozoa, Leishmania have two Alba-domain proteins, LiAlba1 and LiAlba3, representative of the Rpp20- and the Rpp25-like eukaryotic subfamilies, respectively, which share several sequence and structural similarities but also important differences with orthologs in other protozoa, especially in sequences targeted for post-translational modifications. LiAlba1 and LiAlba3 proteins form a complex interacting with other RNA-binding proteins, ribosomal subunits, and translation factors as supported by co-immunoprecipitation and sucrose gradient sedimentation analysis. A higher co-sedimentation of Alba proteins with ribosomal subunits was seen upon conditions of decreased translation, suggesting a role of these proteins in translational repression. The Leishmania Alba-domain proteins display differential cellular localization throughout the parasite development. In the insect promastigote stage, Alba proteins co-localize predominantly to the cytoplasm but they translocate to the nucleolus and the flagellum upon amastigote differentiation in the mammalian host and are found back to the cytoplasm once amastigote differentiation is completed. Heat-shock, a major signal of amastigote differentiation, triggers Alba translocation to the nucleolus and the flagellum. Purification of the Leishmania flagellum confirmed LiAlba3 enrichment in this organelle during amastigote differentiation. Moreover, partial characterization of the Leishmania flagellum proteome of promastigotes and differentiating amastigotes revealed the presence of other RNA-binding proteins, as well as differences in the flagellum composition between these two parasite lifestages. Shuttling of Alba-domain proteins between the cytoplasm and the nucleolus or the flagellum throughout the parasite life cycle suggests that these RNA-binding proteins participate in several distinct regulatory pathways controlling developmental gene expression in Leishmania. PMID:26334886
Regulation of the heat shock response under anoxia in the turtle, Trachemys scripta elegans.
Krivoruchko, Anastasia; Storey, Kenneth B
2010-03-01
The effects of 20 h of anoxic submergence in cold water and 5 h of aerobic recovery on the heat shock response were analyzed in four organs of the anoxia-tolerant turtle Trachemys scripta elegans. Immunoblotting was used to analyze levels of active and inactive forms of the heat shock transcription factor 1 (HSF1), nuclear translocation of HSF1, and the levels of six heat shock proteins (HSPs). PCR was also used to retrieve the turtle HSF1 nucleotide sequence; its deduced amino acid sequence showed 97% identity with chicken HSF1. White skeletal muscle showed a strong fivefold increase in the amount of active HSF1 under anoxic conditions as well as an 80% increase in nuclear localization. This was accompanied by upregulation of five HSPs by 1.8- to 2.9-fold: Hsp25, Hsp40, Hsp70, Hsc70, and Hsp90, the latter two remained elevated after 5 h of aerobic recovery. Kidney and liver showed little change in active HSF1 content during anoxia and recovery, but a significant increase in the nuclear localization of HSF1 during anoxia. This supported enhanced expression of three HSPs in kidney (Hsp40, Hsc70, and Hsp90) and four in liver (Hsp40, Hsp60, Hsp70, Hsc70). Heart displayed a strong increase in active HSF1 during anoxia and recovery (6.6- to 6.8-fold higher than control) and increased nuclear localization but heart HSP levels did not rise. The data demonstrate organ-specific regulation of HSPs during anoxia exposure and aerobic recovery in T. s. elegans and suggest that the heat shock response is an important aspect of cytoprotection during facultative anaerobiosis, particularly with regard to underwater hibernation of turtles in cold water.
Kemppainen, Petri; Knight, Christopher G; Sarma, Devojit K; Hlaing, Thaung; Prakash, Anil; Maung Maung, Yan Naung; Somboon, Pradya; Mahanta, Jagadish; Walton, Catherine
2015-09-01
Recent advances in sequencing allow population-genomic data to be generated for virtually any species. However, approaches to analyse such data lag behind the ability to generate it, particularly in nonmodel species. Linkage disequilibrium (LD, the nonrandom association of alleles from different loci) is a highly sensitive indicator of many evolutionary phenomena including chromosomal inversions, local adaptation and geographical structure. Here, we present linkage disequilibrium network analysis (LDna), which accesses information on LD shared between multiple loci genomewide. In LD networks, vertices represent loci, and connections between vertices represent the LD between them. We analysed such networks in two test cases: a new restriction-site-associated DNA sequence (RAD-seq) data set for Anopheles baimaii, a Southeast Asian malaria vector; and a well-characterized single nucleotide polymorphism (SNP) data set from 21 three-spined stickleback individuals. In each case, we readily identified five distinct LD network clusters (single-outlier clusters, SOCs), each comprising many loci connected by high LD. In A. baimaii, further population-genetic analyses supported the inference that each SOC corresponds to a large inversion, consistent with previous cytological studies. For sticklebacks, we inferred that each SOC was associated with a distinct evolutionary phenomenon: two chromosomal inversions, local adaptation, population-demographic history and geographic structure. LDna is thus a useful exploratory tool, able to give a global overview of LD associated with diverse evolutionary phenomena and identify loci potentially involved. LDna does not require a linkage map or reference genome, so it is applicable to any population-genomic data set, making it especially valuable for nonmodel species. © 2015 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
NASA Astrophysics Data System (ADS)
Meyer, Sam; Everaers, Ralf
2015-02-01
The histone-DNA interaction in the nucleosome is a fundamental mechanism of genomic compaction and regulation, which remains largely unknown despite increasing structural knowledge of the complex. In this paper, we propose a framework for the extraction of a nanoscale histone-DNA force-field from a collection of high-resolution structures, which may be adapted to a larger class of protein-DNA complexes. We applied the procedure to a large crystallographic database extended by snapshots from molecular dynamics simulations. The comparison of the structural models first shows that, at histone-DNA contact sites, the DNA base-pairs are shifted outwards locally, consistent with locally repulsive forces exerted by the histones. The second step shows that the various force profiles of the structures under analysis derive locally from a unique, sequence-independent, quadratic repulsive force-field, while the sequence preferences are entirely due to internal DNA mechanics. We have thus obtained the first knowledge-derived nanoscale interaction potential for histone-DNA in the nucleosome. The conformations obtained by relaxation of nucleosomal DNA with high-affinity sequences in this potential accurately reproduce the experimental values of binding preferences. Finally we address the more generic binding mechanisms relevant to the 80% genomic sequences incorporated in nucleosomes, by computing the conformation of nucleosomal DNA with sequence-averaged properties. This conformation differs from those found in crystals, and the analysis suggests that repulsive histone forces are related to local stretch tension in nucleosomal DNA, mostly between adjacent contact points. This tension could play a role in the stability of the complex.
He, Ji; Dai, Xinbin; Zhao, Xuechun
2007-02-09
BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Personal BLAST Navigator (PLAN) is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1) query and target sequence database management, (2) automated high-throughput BLAST searching, (3) indexing and searching of results, (4) filtering results online, (5) managing results of personal interest in favorite categories, (6) automated sequence annotation (such as NCBI NR and ontology-based annotation). PLAN integrates, by default, the Decypher hardware-based BLAST solution provided by Active Motif Inc. with a greatly improved efficiency over conventional BLAST software. BLAST results are visualized by spreadsheets and graphs and are full-text searchable. BLAST results and sequence annotations can be exported, in part or in full, in various formats including Microsoft Excel and FASTA. Sequences and BLAST results are organized in projects, the data publication levels of which are controlled by the registered project owners. In addition, all analytical functions are provided to public users without registration. PLAN has proved a valuable addition to the community for automated high-throughput BLAST searches, and, more importantly, for knowledge discovery, management and sharing based on sequence alignment results. The PLAN web interface is platform-independent, easily configurable and capable of comprehensive expansion, and user-intuitive. PLAN is freely available to academic users at http://bioinfo.noble.org/plan/. The source code for local deployment is provided under free license. Full support on system utilization, installation, configuration and customization are provided to academic users.
He, Ji; Dai, Xinbin; Zhao, Xuechun
2007-01-01
Background BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Results Personal BLAST Navigator (PLAN) is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1) query and target sequence database management, (2) automated high-throughput BLAST searching, (3) indexing and searching of results, (4) filtering results online, (5) managing results of personal interest in favorite categories, (6) automated sequence annotation (such as NCBI NR and ontology-based annotation). PLAN integrates, by default, the Decypher hardware-based BLAST solution provided by Active Motif Inc. with a greatly improved efficiency over conventional BLAST software. BLAST results are visualized by spreadsheets and graphs and are full-text searchable. BLAST results and sequence annotations can be exported, in part or in full, in various formats including Microsoft Excel and FASTA. Sequences and BLAST results are organized in projects, the data publication levels of which are controlled by the registered project owners. In addition, all analytical functions are provided to public users without registration. Conclusion PLAN has proved a valuable addition to the community for automated high-throughput BLAST searches, and, more importantly, for knowledge discovery, management and sharing based on sequence alignment results. The PLAN web interface is platform-independent, easily configurable and capable of comprehensive expansion, and user-intuitive. PLAN is freely available to academic users at . The source code for local deployment is provided under free license. Full support on system utilization, installation, configuration and customization are provided to academic users. PMID:17291345
Copy Number Variation across European Populations
Chen, Wanting; Hayward, Caroline; Wright, Alan F.; Hicks, Andrew A.; Vitart, Veronique; Knott, Sara; Wild, Sarah H.; Pramstaller, Peter P.; Wilson, James F.; Rudan, Igor; Porteous, David J.
2011-01-01
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations. PMID:21829696
Plasmonic Nanostructures for Nano-Scale Bio-Sensing
Chung, Taerin; Lee, Seung-Yeol; Song, Eui Young; Chun, Honggu; Lee, Byoungho
2011-01-01
The optical properties of various nanostructures have been widely adopted for biological detection, from DNA sequencing to nano-scale single molecule biological function measurements. In particular, by employing localized surface plasmon resonance (LSPR), we can expect distinguished sensing performance with high sensitivity and resolution. This indicates that nano-scale detections can be realized by using the shift of resonance wavelength of LSPR in response to the refractive index change. In this paper, we overview various plasmonic nanostructures as potential sensing components. The qualitative descriptions of plasmonic nanostructures are supported by the physical phenomena such as plasmonic hybridization and Fano resonance. We present guidelines for designing specific nanostructures with regard to wavelength range and target sensing materials. PMID:22346679
Karched, Maribasappa; Furgang, David; Planet, Paul J; DeSalle, Rob; Fine, Daniel H
2012-03-01
Aggregatibacter actinomycetemcomitans is implicated in localized aggressive periodontitis. We report the first genome sequence of an A. actinomycetemcomitans strain isolated from an Old World primate.
Boutin, Arnaud; Pinsard, Basile; Boré, Arnaud; Carrier, Julie; Fogel, Stuart M; Doyon, Julien
2018-04-01
Sleep benefits motor memory consolidation. This mnemonic process is thought to be mediated by thalamo-cortical spindle activity during NREM-stage2 sleep episodes as well as changes in striatal and hippocampal activity. However, direct experimental evidence supporting the contribution of such sleep-dependent physiological mechanisms to motor memory consolidation in humans is lacking. In the present study, we combined EEG and fMRI sleep recordings following practice of a motor sequence learning (MSL) task to determine whether spindle oscillations support sleep-dependent motor memory consolidation by transiently synchronizing and coordinating specialized cortical and subcortical networks. To that end, we conducted EEG source reconstruction on spindle epochs in both cortical and subcortical regions using novel deep-source localization techniques. Coherence-based metrics were adopted to estimate functional connectivity between cortical and subcortical structures over specific frequency bands. Our findings not only confirm the critical and functional role of NREM-stage2 sleep spindles in motor skill consolidation, but provide first-time evidence that spindle oscillations [11-17 Hz] may be involved in sleep-dependent motor memory consolidation by locally reactivating and functionally binding specific task-relevant cortical and subcortical regions within networks including the hippocampus, putamen, thalamus and motor-related cortical regions. Copyright © 2018 Elsevier Inc. All rights reserved.
Brinker, C. Jeffrey; Carnes, Eric C.; Ashley, Carlee Erin; Willman, Cheryl L.
2017-02-28
The present invention is directed to protocells for specific targeting of hepatocellular and other cancer cells which comprise a nanoporous silica core with a supported lipid bilayer; at least one agent which facilitates cancer cell death (such as a traditional small molecule, a macromolecular cargo (e.g. siRNA or a protein toxin such as ricin toxin A-chain or diphtheria toxin A-chain) and/or a histone-packaged plasmid DNA disposed within the nanoporous silica core (preferably supercoiled in order to more efficiently package the DNA into protocells) which is optionally modified with a nuclear localization sequence to assist in localizing protocells within the nucleus of the cancer cell and the ability to express peptides involved in therapy (apoptosis/cell death) of the cancer cell or as a reporter, a targeting peptide which targets cancer cells in tissue to be treated such that binding of the protocell to the targeted cells is specific and enhanced and a fusogenic peptide that promotes endosomal escape of protocells and encapsulated DNA. Protocells according to the present invention may be used to treat cancer, especially including hepatocellular (liver) cancer using novel binding peptides (c-MET peptides) which selectively bind to hepatocellular tissue or to function in diagnosis of cancer, including cancer treatment and drug discovery.
NASA Astrophysics Data System (ADS)
Sitaula, R. P.; Aschoff, J.
2013-12-01
Regional-scale sequence stratigraphic correlation, well log analysis, syntectonic unconformity mapping, isopach maps, and depositional environment maps of the upper Mesaverde Group (UMG) in Uinta basin, Utah suggest higher accommodation in northeastern part (Natural Buttes area) and local development of lacustrine facies due to increased subsidence caused by uplift of San Rafael Swell (SRS) in southern and Uinta Uplift in northern parts. Recently discovered lacustrine facies in Natural Buttes area are completely different than the dominant fluvial facies in outcrops along Book Cliffs and could have implications for significant amount of tight-gas sand production from this area. Data used for sequence stratigraphic correlation, isopach maps and depositional environmental maps include > 100 well logs, 20 stratigraphic profiles, 35 sandstone thin sections and 10 outcrop-based gamma ray profiles. Seven 4th order depositional sequences (~0.5 my duration) are identified and correlated within UMG. Correlation was constructed using a combination of fluvial facies and stacking patterns in outcrops, chert-pebble conglomerates and tidally influenced strata. These surfaces were extrapolated into subsurface by matching GR profiles. GR well logs and core log of Natural Buttes area show intervals of coarsening upward patterns suggesting possible lacustrine intervals that might contain high TOC. Locally, younger sequences are completely truncated across SRS whereas older sequences are truncated and thinned toward SRS. The cycles of truncation and thinning represent phases of SRS uplift. Thinning possibly related with the Uinta Uplift is also observed in northwestern part. Paleocurrents are consistent with interpretation of periodic segmentation and deflection of sedimentation. Regional paleocurrents are generally E-NE-directed in Sequences 1-4, and N-directed in Sequences 5-7. From isopach maps and paleocurrent direction it can be interpreted that uplift of SRS changed route of sediment supply from west to southwest. Locally, paleocurrents are highly variable near SRS further suggesting UMG basin-fill was partitioned by uplift of SRS. Sandstone composition analysis also suggests the uplift of SRS causing the variation of source rocks in upper sequences than the lower sequences. In conclusion, we suggest that Uinta basin was episodically partitioned during the deposition of UMG due to uplift of Laramide structures in the basin and accommodation was localized in northeastern part. Understanding of structural controls on accommodation, sedimentation patterns and depositional environments will aid prediction of the best-producing gas reservoirs.
BLAST and FASTA similarity searching for multiple sequence alignment.
Pearson, William R
2014-01-01
BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry-homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.
Rapid evolution of cis-regulatory sequences via local point mutations
NASA Technical Reports Server (NTRS)
Stone, J. R.; Wray, G. A.
2001-01-01
Although the evolution of protein-coding sequences within genomes is well understood, the same cannot be said of the cis-regulatory regions that control transcription. Yet, changes in gene expression are likely to constitute an important component of phenotypic evolution. We simulated the evolution of new transcription factor binding sites via local point mutations. The results indicate that new binding sites appear and become fixed within populations on microevolutionary timescales under an assumption of neutral evolution. Even combinations of two new binding sites evolve very quickly. We predict that local point mutations continually generate considerable genetic variation that is capable of altering gene expression.
Al-Kindi, Adila; Kizhakkedath, Praseetha; Xu, Huifang; John, Anne; Sayegh, Abeer Al; Ganesh, Anuradha; Al-Awadi, Maha; Al-Anbouri, Lamya; Al-Gazali, Lihadh; Leitinger, Birgit; Ali, Bassam R
2014-04-11
The rare autosomal genetic disorder, Spondylo-meta-epiphyseal dysplasia with short limbs and abnormal calcifications (SMED-SL), is reported to be caused by missense or splice site mutations in the human discoidin domain receptor 2 (DDR2) gene. Previously our group has established that trafficking defects and loss of ligand binding are the underlying cellular mechanisms of several SMED-SL causing mutations. Here we report the clinical characteristics of two siblings of consanguineous marriage with suspected SMED-SL and identification of a novel disease-causing mutation in the DDR2 gene. Clinical evaluation and radiography were performed to evaluate the patients. All the coding exons and splice sites of the DDR2 gene were sequenced by Sanger sequencing. Subcellular localization of the mutated DDR2 protein was determined by confocal microscopy, deglycosylation assay and Western blotting. DDR2 activity was measured by collagen activation and Western analysis. In addition to the typical features of SMED-SL, one of the patients has an eye phenotype including visual impairment due to optic atrophy. DNA sequencing revealed a novel homozygous dinucleotide deletion mutation (c.2468_2469delCT) on exon 18 of the DDR2 gene in both patients. The mutation resulted in a frameshift leading to an amino acid change at position S823 and a predicted premature termination of translation (p.S823Cfs*2). Subcellular localization of the mutant protein was analyzed in mammalian cell lines, and it was found to be largely retained in the endoplasmic reticulum (ER), which was further supported by its N-glycosylation profile. In keeping with its cellular mis-localization, the mutant protein was found to be deficient in collagen-induced receptor activation, suggesting protein trafficking defects as the major cellular mechanism underlying the loss of DDR2 function in our patients. Our results indicate that the novel mutation results in defective trafficking of the DDR2 protein leading to loss of function and disease. This confirms our previous findings that DDR2 missense mutations occurring at the kinase domain result in retention of the mutant protein in the ER.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Witek, Matthew; Blomain, Erik S.; Magee, Michael S.
Purpose: Radiation therapy (RT) is thought to produce clinical responses in cancer patients, not only through direct toxicity to cancer cells and supporting tumor stroma cells, but also through activation of immunologic effectors. More recently, RT has potentiated the local and systemic effects of cancer immunotherapy (IT). However, combination regimens that maximize immunologic and clinical efficacy remain undefined. Methods and Materials: We evaluated the impact of local RT on adenoviral-mediated vaccination against the colorectal cancer antigen GUCY2C (Ad5-GUCY2C) in a murine subcutaneous tumor model using mouse CT26 colon cancer cells (CT26-GUCY2C). Immune responses were assessed by ELISpot, and clinical responsesmore » were assessed by tumor size and incidence. Results: The specific sequence of tumor-directed RT preceding Ad5-GUCY2C IT transformed inactive therapeutic Ad5-GUCY2C vaccination into a curative vaccine. GUCY2C-specific T cell responses were amplified (P<.05), tumor eradication was maximized (P<.01), and tumor volumes were minimized (P<.001) in mice whose tumors were irradiated before, compared with after, Ad5-GUCY2C vaccination. The immunologic and antitumor efficacy of Ad5-GUCY2C was amplified comparably by unfractionated (8 Gy × 1), or biologically equivalent doses of fractionated (3.5 Gy × 3), RT. The antitumor effects of sequential RT and IT (RT-IT) depended on expression of GUCY2C by tumor cells and the adenoviral vaccine vector, and tumor volumes were inversely related to the magnitude of GUCY2C-specific T cell responses. Moreover, mice cured of CT26-GUCY2C tumors by RT-IT showed long-lasting antigen-dependent protection, resisting tumors formed by GUCY2C-expressing 4T1 breast cancer cells inoculated 50 days after CT26 cells. Conclusions: Optimal sequencing of RT and IT amplifies antigen-specific local and systemic immune responses, revealing novel acute and long-term therapeutic antitumor protection. These observations underscore the importance of modality sequence optimization before the initiation of clinical trials of RT and IT to maximize immune and antitumor responses.« less
Cross cultural differences in unconscious knowledge.
Kiyokawa, Sachiko; Dienes, Zoltán; Tanaka, Daisuke; Yamada, Ayumi; Crowe, Louise
2012-07-01
Previous studies have indicated cross cultural differences in conscious processes, such that Asians have a global preference and Westerners a more analytical one. We investigated whether these biases also apply to unconscious knowledge. In Experiment 1, Japanese and UK participants memorized strings of large (global) letters made out of small (local) letters. The strings constituted one sequence of letters at a global level and a different sequence at a local level. Implicit learning occurred at the global and not the local level for the Japanese but equally at both levels for the English. In Experiment 2, the Japanese preference for global over local processing persisted even when structure existed only at the local but not global level. In Experiment 3, Japanese and UK participants were asked to attend to just one of the levels, global or local. Now the cultural groups performed similarly, indicating that the bias largely reflects preference rather than ability (although the data left room for residual ability differences). In Experiment 4, the greater global advantage of Japanese rather English was confirmed for strings made of Japanese kana rather than Roman letters. That is, the cultural difference is not due to familiarity of the sequence elements. In sum, we show for the first time that cultural biases strongly affect the type of unconscious knowledge people acquire. Copyright © 2012 Elsevier B.V. All rights reserved.
Endo, Megumi; Hirose, Mamiko; Honda, Masanao; Koga, Hiroyuki; Morino, Yoshiaki; Kiyomoto, Masato; Wada, Hiroshi
2018-06-15
The marine environment around Japan experienced significant changes during the Cenozoic Era. In this study, we report findings suggesting that this dynamic history left behind traces in the genome of the Japanese sand dollar species Peronella japonica and P. rubra. Although mitochondrial Cytochrome C Oxidase I sequences did not indicate fragmentation of the current local populations of P. japonica around Japan, two different types of intron sequence were found in the Alx1 locus. We inferred that past fragmentation of the populations account for the presence of two types of nuclear sequences as alleles in the Alx1 intron of P. japonica. It is likely that the split populations have intermixed in recent times; hence, we did not detect polymorphisms in the sequences reflecting the current localization of the species. In addition, we found two allelic sequences of theAlx1 intron in the sister species P. rubra. The divergence times of the two types of Alx1 intron sequences were estimated at approximately 14.9 and 4.0 million years ago for P. japonica and P. rubra, respectively. Our study indicates that information from the intron sequences of nuclear genes can enhance our understanding of past genetic events in organisms. Copyright © 2018 Elsevier B.V. All rights reserved.
Two distinct DNA sequences recognized by transcription factors represent enthalpy and entropy optima
Yin, Yimeng; Das, Pratyush K; Jolma, Arttu; Zhu, Fangjie; Popov, Alexander; Xu, You; Nilsson, Lennart
2018-01-01
Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS. PMID:29638214
Infrared thermal facial image sequence registration analysis and verification
NASA Astrophysics Data System (ADS)
Chen, Chieh-Li; Jian, Bo-Lin
2015-03-01
To study the emotional responses of subjects to the International Affective Picture System (IAPS), infrared thermal facial image sequence is preprocessed for registration before further analysis such that the variance caused by minor and irregular subject movements is reduced. Without affecting the comfort level and inducing minimal harm, this study proposes an infrared thermal facial image sequence registration process that will reduce the deviations caused by the unconscious head shaking of the subjects. A fixed image for registration is produced through the localization of the centroid of the eye region as well as image translation and rotation processes. Thermal image sequencing will then be automatically registered using the two-stage genetic algorithm proposed. The deviation before and after image registration will be demonstrated by image quality indices. The results show that the infrared thermal image sequence registration process proposed in this study is effective in localizing facial images accurately, which will be beneficial to the correlation analysis of psychological information related to the facial area.
Brown, Amanda M V; Kent, Michael L; Adamson, Martin L
2010-07-26
Loma salmonae is a microsporidian parasite prevalent in wild and farmed salmon species of the genus Oncorhynchus. This study compared ribosomal RNA (rDNA) and elongation factor-1 alpha (EF-1alpha) gene sequences to look for variation that may provide a basis for distinguishing populations. Specimens were collected from laboratory, captive (sea netpen farm and freshwater hatchery) and wild populations of fish. The host range included rainbow trout O. mykiss, Pacific salmon Oncorhynchus spp. and brook trout Salvelinus fontinalis from British Columbia, Prince Edward Island, Canada, from California, Colorado, Idaho, U.S.A. and from Chile. Both loci suggested that a variant in S. fontinalis (named 'SV') was a separate species. This was supported by the absence of similar variants in the source material (isolated from laboratory-held O. tshawytscha) and high divergence (1.4 to 2.3% in the rDNA and EF-1alpha) from L. salmonae in the type host and locality (0. mykiss in California). L. salmonae from freshwater and anadromous Oncorhynchus spp. were distinguished, providing a basis on which to evaluate possible sources of infection and suggesting geographic boundaries are important. Higher genetic variation occurred among samples of freshwater origin and from a sea netpen farm in Chile, suggesting these environments may present greater population diversity. Invariance in rDNA sequence across 17 samples from anadromous salmon in rivers, lakes, ocean, farms and hatcheries supports the hypothesis that marine transmission occurs and effectively prevents population substructuring caused by freshwater transmission.
DeepLoc: prediction of protein subcellular localization using deep learning.
Almagro Armenteros, José Juan; Sønderby, Casper Kaae; Sønderby, Søren Kaae; Nielsen, Henrik; Winther, Ole
2017-11-01
The prediction of eukaryotic protein subcellular localization is a well-studied topic in bioinformatics due to its relevance in proteomics research. Many machine learning methods have been successfully applied in this task, but in most of them, predictions rely on annotation of homologues from knowledge databases. For novel proteins where no annotated homologues exist, and for predicting the effects of sequence variants, it is desirable to have methods for predicting protein properties from sequence information only. Here, we present a prediction algorithm using deep neural networks to predict protein subcellular localization relying only on sequence information. At its core, the prediction model uses a recurrent neural network that processes the entire protein sequence and an attention mechanism identifying protein regions important for the subcellular localization. The model was trained and tested on a protein dataset extracted from one of the latest UniProt releases, in which experimentally annotated proteins follow more stringent criteria than previously. We demonstrate that our model achieves a good accuracy (78% for 10 categories; 92% for membrane-bound or soluble), outperforming current state-of-the-art algorithms, including those relying on homology information. The method is available as a web server at http://www.cbs.dtu.dk/services/DeepLoc. Example code is available at https://github.com/JJAlmagro/subcellular_localization. The dataset is available at http://www.cbs.dtu.dk/services/DeepLoc/data.php. jjalma@dtu.dk. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
García-Vásquez, Adriana; Pinacho-Pinacho, Carlos Daniel; Martínez-Ramírez, Emilio; Rubio-Godoy, Miguel
2018-08-01
In the present study, two new species of Gyrodactylus are described from Profundulus oaxacae, a fish endemic to the Pacific slope of Oaxaca State, Mexico. Fishes were collected within their distribution range in 5 localities in the Atoyac-Verde River. Gyrodactylus montealbani n. sp. and G. zapoteco n. sp. were erected and characterized morphologically (sclerites of the attachment apparatus and the male copulatory organ) and molecularly (sequences of the Internal Transcribed Spacer region of rDNA). The haptoral sclerites of the new species are similar to those of Gyrodactylus iunuri and Gyrodactylus tepari, both recently described from the goodeid fish Goodea atripinnis, from the Mexican States of Jalisco and Querétaro, respectively; and to Gyrodactylus xtachuna described from the poeciliid Poeciliopsis gracilis in Veracruz State, Mexico - nonetheless, these species can all be discriminated based on their marginal hook morphology. Specimens of G. montealbani n. sp. and G. zapoteco n. sp. were sequenced, and were aligned with sequences of 25 other Gyrodactylus spp. Both Maximum likelihood and Bayesian inference analyses indicated that the two new species are members of independent, well-supported lineages - these are the first Gyrodactylus species described from Profundulus oaxacae. Copyright © 2018 Elsevier B.V. All rights reserved.
Hackenberg, Michael; Rodríguez-Ezpeleta, Naiara; Aransay, Ana M.
2011-01-01
We present a new version of miRanalyzer, a web server and stand-alone tool for the detection of known and prediction of new microRNAs in high-throughput sequencing experiments. The new version has been notably improved regarding speed, scope and available features. Alignments are now based on the ultrafast short-read aligner Bowtie (granting also colour space support, allowing mismatches and improving speed) and 31 genomes, including 6 plant genomes, can now be analysed (previous version contained only 7). Differences between plant and animal microRNAs have been taken into account for the prediction models and differential expression of both, known and predicted microRNAs, between two conditions can be calculated. Additionally, consensus sequences of predicted mature and precursor microRNAs can be obtained from multiple samples, which increases the reliability of the predicted microRNAs. Finally, a stand-alone version of the miRanalyzer that is based on a local and easily customized database is also available; this allows the user to have more control on certain parameters as well as to use specific data such as unpublished assemblies or other libraries that are not available in the web server. miRanalyzer is available at http://bioinfo2.ugr.es/miRanalyzer/miRanalyzer.php. PMID:21515631
Byrne, L; Fisher, I; Peters, T; Mather, A; Thomson, N; Rosner, B; Bernard, H; McKeown, P; Cormican, M; Cowden, J; Aiyedun, V; Lane, C
2014-08-07
In November 2011, the presence of Salmonella Newport in a ready-to-eat watermelon slice was confirmed as part of a local food survey in England. In late December 2011, cases of S. Newport were reported in England, Wales, Northern Ireland, Scotland, Ireland and Germany. During the outbreak, 63 confirmed cases of S. Newport were reported across all six countries with isolates indistinguishable by pulsed-field gel electrophoresis from the watermelon isolate.A subset of outbreak isolates were whole-genome sequenced and were identical to, or one single nucleotide polymorphism different from the watermelon isolate.In total, 46 confirmed cases were interviewed of which 27 reported watermelon consumption. Further investigations confirmed the outbreak was linked to the consumption of watermelon imported from Brazil.Although numerous Salmonella outbreaks associated with melons have been reported in the United States and elsewhere, this is the first of its kind in Europe.Expansion of the melon import market from Brazil represents a potential threat for future outbreaks. Whole genome sequencing is rapidly becoming more accessible and can provide a compelling level of evidence of linkage between human cases and sources of infection,to support public health interventions in global food markets.
Byrne, L; Fisher, I; Peters, T; Mather, A; Thomson, N; Rosner, B; Bernard, H; McKeown, P; Cormican, M; Cowden, J; Aiyedun, V; Lane, C
2015-01-01
In November 2011, the presence of Salmonella Newport in a ready-to-eat watermelon slice was confirmed as part of a local food survey in England. In late December 2011, cases of S. Newport were reported in England, Wales, Northern Ireland, Scotland, Ireland and Germany. During the outbreak, 63 confirmed cases of S. Newport were reported across all six countries with isolates indistinguishable by pulsed-field gel electrophoresis from the watermelon isolate. A subset of outbreak isolates were whole-genome sequenced and were identical to, or one single nucleotide polymorphism different from the watermelon isolate. In total, 46 confirmed cases were interviewed of which 27 reported watermelon consumption. Further investigations confirmed the outbreak was linked to the consumption of watermelon imported from Brazil. Although numerous Salmonella outbreaks associated with melons have been reported in the United States and elsewhere, this is the first of its kind in Europe. Expansion of the melon import market from Brazil represents a potential threat for future outbreaks. Whole genome sequencing is rapidly becoming more accessible and can provide a compelling level of evidence of linkage between human cases and sources of infection, to support public health interventions in global food markets. PMID:25138971
Li, Liqi; Luo, Qifa; Xiao, Weidong; Li, Jinhui; Zhou, Shiwen; Li, Yongsheng; Zheng, Xiaoqi; Yang, Hua
2017-02-01
Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to the protein transportation, organelle localization, and functions, therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein-protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is in urgent need. In this study, a support vector machine (SVM)-based method was proposed through integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and Matthews Correlation Coefficient of 0.9773 for a benchmark dataset. The result indicates the efficiency and accuracy of our method in prediction of palmitoylation sites based on protein sequences.
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands
NASA Astrophysics Data System (ADS)
Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree
2018-05-01
In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS, base pair is introduced as a constituent in different set of 12-mer and 14-mer DNA sequences for the molecular dynamics (MD) simulation in explicit water solvent. AMBER 14 force field was employed for each set of duplex during the 200ns production-dynamics simulation in orthogonal-box-water solvent by the Particle-Mesh-Ewald (PME) method in infinite periodic boundary conditions (PBC) to determine conformational parameters of the complex. The force-field parameters of modified base-pair were calculated by Gaussian-code using Hartree-Fock /ab-initio methodology. RMSD Results reveal that the conformation of the duplex is sequence dependent and the binding energy of the complex depends on the position of the modified base-pair in the nucleic acid strand. We found that non-bonding energy had a significant contribution to stabilising such type of duplex in comparison to electrostatic energy. The distortion produced within strands by such type of base-pair was local and destabilised the duplex integrity near to substitution, moreover the binding energy of duplex depends on the position of substitution of hydrophobic base-pair and the DNA sequence and strongly supports the corresponding experimental study.
Bödeker, Inga T M; Clemmensen, Karina E; de Boer, Wietse; Martin, Francis; Olson, Åke; Lindahl, Björn D
2014-07-01
In northern forests, belowground sequestration of nitrogen (N) in complex organic pools restricts nutrient availability to plants. Oxidative extracellular enzymes produced by ectomycorrhizal fungi may aid plant N acquisition by providing access to N in macromolecular complexes. We test the hypotheses that ectomycorrhizal Cortinarius species produce Mn-dependent peroxidases, and that the activity of these enzymes declines at elevated concentrations of inorganic N. In a boreal pine forest and a sub-arctic birch forest, Cortinarius DNA was assessed by 454-sequencing of ITS amplicons and related to Mn-peroxidase activity in humus samples with- and without previous N amendment. Transcription of Cortinarius Mn-peroxidase genes was investigated in field samples. Phylogenetic analyses of Cortinarius peroxidase amplicons and genome sequences were performed. We found a significant co-localization of high peroxidase activity and DNA from Cortinarius species. Peroxidase activity was reduced by high ammonium concentrations. Amplification of mRNA sequences indicated transcription of Cortinarius Mn-peroxidase genes under field conditions. The Cortinarius glaucopus genome encodes 11 peroxidases - a number comparable to many white-rot wood decomposers. These results support the hypothesis that some ectomycorrhizal fungi--Cortinarius species in particular--may play an important role in decomposition of complex organic matter, linked to their mobilization of organically bound N. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
EnsMart: A Generic System for Fast and Flexible Access to Biological Data
Kasprzyk, Arek; Keefe, Damian; Smedley, Damian; London, Darin; Spooner, William; Melsopp, Craig; Hammond, Martin; Rocca-Serra, Philippe; Cox, Tony; Birney, Ewan
2004-01-01
The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets. A wide variety of complex queries, on various types of annotations, for numerous species are supported. These can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site, or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to `non-Ensembl' data sets. PMID:14707178
Phylogenetic profiles reveal structural/functional determinants of TRPC3 signal-sensing antennae
Ko, Kyung Dae; Bhardwaj, Gaurav; Hong, Yoojin; Chang, Gue Su; Kiselyov, Kirill
2009-01-01
Biochemical assessment of channel structure/function is incredibly challenging. Developing computational tools that provide these data would enable translational research, accelerating mechanistic experimentation for the bench scientist studying ion channels. Starting with the premise that protein sequence encodes information about structure, function and evolution (SF&E), we developed a unified framework for inferring SF&E from sequence information using a knowledge-based approach. The Gestalt Domain Detection Algorithm-Basic Local Alignment Tool (GDDA-BLAST) provides phylogenetic profiles that can model, ab initio, SF&E relationships of biological sequences at the whole protein, single domain and single-amino acid level.1,2 In our recent paper,4 we have applied GDDA-BLAST analysis to study canonical TRP (TRPC) channels1 and empirically validated predicted lipid-binding and trafficking activities contained within the TRPC3 TRP_2 domain of unknown function. Overall, our in silico, in vitro, and in vivo experiments support a model in which TRPC3 has signal-sensing antennae which are adorned with lipid-binding, trafficking and calmodulin regulatory domains. In this Addendum, we correlate our functional domain analysis with the cryo-EM structure of TRPC3.3 In addition, we synthesize recent studies with our new findings to provide a refined model on the mechanism(s) of TRPC3 activation/deactivation. PMID:19704910
You, Min Kyoung; Kim, Jin Hwa; Lee, Yeo Jin; Jeong, Ye Sol; Ha, Sun-Hwa
2016-12-22
Plastoglobules (PGs) are thylakoid membrane microdomains within plastids that are known as specialized locations of carotenogenesis. Three rice phytoene synthase proteins (OsPSYs) involved in carotenoid biosynthesis have been identified. Here, the N-terminal 80-amino-acid portion of OsPSY2 (PTp) was demonstrated to be a chloroplast-targeting peptide by displaying cytosolic localization of OsPSY2(ΔPTp):mCherry in rice protoplast, in contrast to chloroplast localization of OsPSY2:mCherry in a punctate pattern. The peptide sequence of a PTp was predicted to harbor two transmembrane domains eligible for a putative PG-targeting signal. To assess and enhance the PG-targeting ability of PTp, the original PTp DNA sequence ( PTp ) was modified to a synthetic DNA sequence ( stPTp ), which had 84.4% similarity to the original sequence. The motivation of this modification was to reduce the GC ratio from 75% to 65% and to disentangle the hairpin loop structures of PTp . These two DNA sequences were fused to the sequence of the synthetic green fluorescent protein (sGFP) and drove GFP expression with different efficiencies. In particular, the RNA and protein levels of stPTp-sGFP were slightly improved to 1.4-fold and 1.3-fold more than those of sGFP, respectively. The green fluorescent signals of their mature proteins were all observed as speckle-like patterns with slightly blurred stromal signals in chloroplasts. These discrete green speckles of PTp - sGFP and stPTp - sGFP corresponded exactly to the red fluorescent signal displayed by OsPSY2:mCherry in both etiolated and greening protoplasts and it is presumed to correspond to distinct PGs. In conclusion, we identified PTp as a transit peptide sequence facilitating preferential translocation of foreign proteins to PGs, and developed an improved PTp sequence, a s tPTp , which is expected to be very useful for applications in plant biotechnologies requiring precise micro-compartmental localization in plastids.
2010-01-01
Background Comparative genomics methods such as phylogenetic profiling can mine powerful inferences from inherently noisy biological data sets. We introduce Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL), a method that applies the Partial Phylogenetic Profiling (PPP) approach locally within a protein sequence to discover short sequence signatures associated with functional sites. The approach is based on the basic scoring mechanism employed by PPP, namely the use of binomial distribution statistics to optimize sequence similarity cutoffs during searches of partitioned training sets. Results Here we illustrate and validate the ability of the SIMBAL method to find functionally relevant short sequence signatures by application to two well-characterized protein families. In the first example, we partitioned a family of ABC permeases using a metabolic background property (urea utilization). Thus, the TRUE set for this family comprised members whose genome of origin encoded a urea utilization system. By moving a sliding window across the sequence of a permease, and searching each subsequence in turn against the full set of partitioned proteins, the method found which local sequence signatures best correlated with the urea utilization trait. Mapping of SIMBAL "hot spots" onto crystal structures of homologous permeases reveals that the significant sites are gating determinants on the cytosolic face rather than, say, docking sites for the substrate-binding protein on the extracellular face. In the second example, we partitioned a protein methyltransferase family using gene proximity as a criterion. In this case, the TRUE set comprised those methyltransferases encoded near the gene for the substrate RF-1. SIMBAL identifies sequence regions that map onto the substrate-binding interface while ignoring regions involved in the methyltransferase reaction mechanism in general. Neither method for training set construction requires any prior experimental characterization. Conclusions SIMBAL shows that, in functionally divergent protein families, selected short sequences often significantly outperform their full-length parent sequence for making functional predictions by sequence similarity, suggesting avenues for improved functional classifiers. When combined with structural data, SIMBAL affords the ability to localize and model functional sites. PMID:20102603
Chibani, Kamel; Tarrago, Lionel; Gualberto, José Manuel; Wingsle, Gunnar; Rey, Pascal; Jacquot, Jean-Pierre; Rouhier, Nicolas
2012-01-01
Plant thioredoxins (Trxs) constitute a complex family of thiol oxidoreductases generally sharing a WCGPC active site sequence. Some recently identified plant Trxs (Clot, Trx-like1 and -2, Trx-lilium1, -2, and -3) display atypical active site sequences with altered residues between the two conserved cysteines. The transcript expression patterns, subcellular localizations, and biochemical properties of some representative poplar (Populus spp.) isoforms were investigated. Measurements of transcript levels for the 10 members in poplar organs indicate that most genes are constitutively expressed. Using transient expression of green fluorescent protein fusions, Clot and Trx-like1 were found to be mainly cytosolic, whereas Trx-like2.1 was located in plastids. All soluble recombinant proteins, except Clot, exhibited insulin reductase activity, although with variable efficiencies. Whereas Trx-like2.1 and Trx-lilium2.2 were efficiently regenerated both by NADPH-Trx reductase and glutathione, none of the proteins were reduced by the ferredoxin-Trx reductase. Only Trx-like2.1 supports the activity of plastidial thiol peroxidases and methionine sulfoxide reductases employing a single cysteine residue for catalysis and using a glutathione recycling system. The second active site cysteine of Trx-like2.1 is dispensable for this reaction, indicating that the protein possesses a glutaredoxin-like activity. Interestingly, the Trx-like2.1 active site replacement, from WCRKC to WCGPC, suppresses its capacity to use glutathione as a reductant but is sufficient to allow the regeneration of target proteins employing two cysteines for catalysis, indicating that the nature of the residues composing the active site sequence is crucial for substrate selectivity/recognition. This study provides another example of the cross talk existing between the glutathione/glutaredoxin and Trx-dependent pathways. PMID:22523226
Lahrech, H; Briguet, A
1990-11-01
It is shown that the modified stimulated echo sequence, [theta](+/- x +/- y)-t1-[theta](+ x)-t2/2-[2 theta](+ x)-t2/2- [theta](+ x)-t1-Acq(+/- x +/- y), denoted as MSTE[2 theta]x according to the exciter phase of the 2 theta pulse, is able to perform proton spectral editing without difference spectra. On the other hand, this sequence appears to be suitable for spatial localization. Sensitivity and spatial selectivity of MSTE and conventional stimulated echo sequence (STE) are briefly compared. MSTE is applied to editing lactate in the rat brain using the locally restricted excitation of a surface coil.
Shi, Ruijia; Xu, Cunshuan
2011-06-01
The study of rat proteins is an indispensable task in experimental medicine and drug development. The function of a rat protein is closely related to its subcellular location. Based on the above concept, we construct the benchmark rat proteins dataset and develop a combined approach for predicting the subcellular localization of rat proteins. From protein primary sequence, the multiple sequential features are obtained by using of discrete Fourier analysis, position conservation scoring function and increment of diversity, and these sequential features are selected as input parameters of the support vector machine. By the jackknife test, the overall success rate of prediction is 95.6% on the rat proteins dataset. Our method are performed on the apoptosis proteins dataset and the Gram-negative bacterial proteins dataset with the jackknife test, the overall success rates are 89.9% and 96.4%, respectively. The above results indicate that our proposed method is quite promising and may play a complementary role to the existing predictors in this area.
Jiang, Ke; Cai, Bo; Xie, Feng; Jiang, Jianping; Murphy, Robert W.; Li, Jia-Tang; Wang, Yuezhao
2015-01-01
Due to small body sizes, superficial similarities in morphologies, and obscure activity behaviors, the phylogeny and taxonomy of species in the genus Liuixalus were very troublesome. Some species might comprise a complex of cryptic species. To investigate the species of group, we constructed the matrilineal genealogy of the genus using 16s rRNA mitochondrial DNA sequences. Analyses recovered six well supported matrilines that involved L. romeri, L. ocellatus, L. hainanus, L. calcarius, Liuixalus shiwandashan sp. nov. and Liuixalus jinxiuensis sp. nov., though the historical relationships among them remained unresolved. Currently, Liuixalus included 4 species, distributed eastwards from northern Vietnam to Hong Kong, China. Based on genealogical and morphological distinctiveness, we described Liuixalus jinxiuensis sp. nov. from the type locality Mt. Dayao, Jinxiu, China and Liuixalus shiwandashan sp. nov. from the type locality Mt. Shiwanda, China. A combination of morphological measurements, genetic, bioacoustic and osteological analysis was served to diagnose the new taxa. PMID:26305377
Brief communication: DNA from early Holocene American dog.
Tito, Raul Y; Belknap, Samuel L; Sobolik, Kristin D; Ingraham, Robert C; Cleeland, Lauren M; Lewis, Cecil M
2011-08-01
We present the oldest genetically identified dog in the Americas, directly dated to 9,260 ± 170 Cal. B.P. The DNA was extracted from an occipital condyle imbedded in a human paleofecal sample from Hinds Cave in southwest Texas. A 368 base pair fragment of the mitochondrial genome control region was sequenced. These data were analyzed with comparable data, which included other ancient dogs and extant dogs, wolves and coyotes from around the world. Compiled with published data, our results characterize ancient American dogs within clades rooted by Eurasian wolves. In the Americas, these data provide no evidence of local interbreeding with wolves. This is a departure from the genetic pattern in other areas of the world where interbreeding with local wolf populations is apparent. Our discovery of domestic dog bone in a human paleofecal sample provides the earliest direct evidence for human consumption of dogs in the New World. These data support the hypothesis that dogs were a food source for early Paleoamericans. Copyright © 2011 Wiley-Liss, Inc.
Opto-numerical procedures supporting dynamic lower limbs monitoring and their medical diagnosis
NASA Astrophysics Data System (ADS)
Witkowski, Marcin; Kujawińska, Malgorzata; Rapp, Walter; Sitnik, Robert
2006-01-01
New optical full-field shape measurement systems allow transient shape capture at rates between 15 and 30 Hz. These frequency rates are enough to monitor controlled movements used e.g. for medical examination purposes. In this paper we present a set of algorithms which may be applied for processing of data gathered by fringe projection method implemented for lower limbs shape measurement. The purpose of presented algorithms is to locate anatomical structures based on the limb shape and its deformation in time. The algorithms are based on local surface curvature calculation and analysis of curvature maps changes during the measurement sequence. One of anatomical structure of high medical interest that is possible to scan and analyze, is patella. Tracking of patella position and orientation under dynamic conditions may lead to detect pathological patella movements and help in knee joint disease diagnosis. Therefore the usefulness of the algorithms developed was proven at examples of patella localization and monitoring.
Pre-calculated protein structure alignments at the RCSB PDB website.
Prlic, Andreas; Bliven, Spencer; Rose, Peter W; Bluhm, Wolfgang F; Bizon, Chris; Godzik, Adam; Bourne, Philip E
2010-12-01
With the continuous growth of the RCSB Protein Data Bank (PDB), providing an up-to-date systematic structure comparison of all protein structures poses an ever growing challenge. Here, we present a comparison tool for calculating both 1D protein sequence and 3D protein structure alignments. This tool supports various applications at the RCSB PDB website. First, a structure alignment web service calculates pairwise alignments. Second, a stand-alone application runs alignments locally and visualizes the results. Third, pre-calculated 3D structure comparisons for the whole PDB are provided and updated on a weekly basis. These three applications allow users to discover novel relationships between proteins available either at the RCSB PDB or provided by the user. A web user interface is available at http://www.rcsb.org/pdb/workbench/workbench.do. The source code is available under the LGPL license from http://www.biojava.org. A source bundle, prepared for local execution, is available from http://source.rcsb.org andreas@sdsc.edu; pbourne@ucsd.edu.
2-D to 3-D global/local finite element analysis of cross-ply composite laminates
NASA Technical Reports Server (NTRS)
Thompson, D. Muheim; Griffin, O. Hayden, Jr.
1990-01-01
An example of two-dimensional to three-dimensional global/local finite element analysis of a laminated composite plate with a hole is presented. The 'zoom' technique of global/local analysis is used, where displacements of the global/local interface from the two-dimensional global model are applied to the edges of the three-dimensional local model. Three different hole diameters, one, three, and six inches, are considered in order to compare the effect of hole size on the three-dimensional stress state around the hole. In addition, three different stacking sequences are analyzed for the six inch hole case in order to study the effect of stacking sequence. The existence of a 'critical' hole size, where the interlaminar stresses are maximum, is indicated. Dispersion of plies at the same angle, as opposed to clustering, is found to reduce the magnitude of some interlaminar stress components and increase others.
SCALCE: boosting sequence compression algorithms using locally consistent encoding
Hach, Faraz; Numanagić, Ibrahim; Sahinalp, S Cenk
2012-01-01
Motivation: The high throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for the computational infrastructure. Data management, storage and analysis have become major logistical obstacles for those adopting the new platforms. The requirement for large investment for this purpose almost signalled the end of the Sequence Read Archive hosted at the National Center for Biotechnology Information (NCBI), which holds most of the sequence data generated world wide. Currently, most HTS data are compressed through general purpose algorithms such as gzip. These algorithms are not designed for compressing data generated by the HTS platforms; for example, they do not take advantage of the specific nature of genomic sequence data, that is, limited alphabet size and high similarity among reads. Fast and efficient compression algorithms designed specifically for HTS data should be able to address some of the issues in data management, storage and communication. Such algorithms would also help with analysis provided they offer additional capabilities such as random access to any read and indexing for efficient sequence similarity search. Here we present SCALCE, a ‘boosting’ scheme based on Locally Consistent Parsing technique, which reorganizes the reads in a way that results in a higher compression speed and compression rate, independent of the compression algorithm in use and without using a reference genome. Results: Our tests indicate that SCALCE can improve the compression rate achieved through gzip by a factor of 4.19—when the goal is to compress the reads alone. In fact, on SCALCE reordered reads, gzip running time can improve by a factor of 15.06 on a standard PC with a single core and 6 GB memory. Interestingly even the running time of SCALCE + gzip improves that of gzip alone by a factor of 2.09. When compared with the recently published BEETL, which aims to sort the (inverted) reads in lexicographic order for improving bzip2, SCALCE + gzip provides up to 2.01 times better compression while improving the running time by a factor of 5.17. SCALCE also provides the option to compress the quality scores as well as the read names, in addition to the reads themselves. This is achieved by compressing the quality scores through order-3 Arithmetic Coding (AC) and the read names through gzip through the reordering SCALCE provides on the reads. This way, in comparison with gzip compression of the unordered FASTQ files (including reads, read names and quality scores), SCALCE (together with gzip and arithmetic encoding) can provide up to 3.34 improvement in the compression rate and 1.26 improvement in running time. Availability: Our algorithm, SCALCE (Sequence Compression Algorithm using Locally Consistent Encoding), is implemented in C++ with both gzip and bzip2 compression options. It also supports multithreading when gzip option is selected, and the pigz binary is available. It is available at http://scalce.sourceforge.net. Contact: fhach@cs.sfu.ca or cenk@cs.sfu.ca Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23047557
Cell proliferation downregulated by TGF-β2-triggered G1/S checkpoint in clinical CAFs
Wu, Jinliang; Fu, Rong; Liu, Zongzhi; Li, Guochao; Huang, Xiaolei; Xue, Yang; Xu, Yan; Sun, Yingli; Zhao, Jiangmin; Mi, Jun
2017-01-01
ABSTRACT The metabolic reprogramming is indispensible for the fast growth of tumor cells. The metabolism of CAFs is reprogrammed to aerobic glycolysis too. However, it is not clear whether this metabolic reprogramming promotes the growth of CAFs themselves. In this study, we found that the proliferation rate of CAFs was slower than NAFs, which was determined by cell counting, BrdU assay and flow cytometry analysis. Moreover, we found TGF-β signaling regulated cell growth of CAF through RNA-sequencing analysis and Western blot, which was further supported by the observation that TGF-β2 was highly expressed in colon cancer tissues. In the end, we demonstrated that CAFs were critical to tumor cell proliferation, which was supported by the evidence of their close localization in clinical tumor tissue and tumor promoting effect in mice. In brief, our data have manifested that the proliferation rate is decreased in CAFs, which enable CAFs generate more intermediate metabolites to support tumor cells growth, suggesting CAFs is an ideal target for tumor therapy. PMID:27880067
Karched, Maribasappa; Furgang, David; Planet, Paul J.; DeSalle, Rob
2012-01-01
Aggregatibacter actinomycetemcomitans is implicated in localized aggressive periodontitis. We report the first genome sequence of an A. actinomycetemcomitans strain isolated from an Old World primate. PMID:22328766
Gruber, Karl; Schöning, Caspar; Otte, Marianne; Kinuthia, Wanja; Hasselmann, Martin
2013-01-01
Identifying the forces shaping intraspecific phenotypic and genotypic divergence are of key importance in evolutionary biology. Phenotypic divergence may result from local adaptation or, especially in species with strong gene flow, from pronounced phenotypic plasticity. Here, we examine morphological and genetic divergence among populations of the western honey bee Apis mellifera in the topographically heterogeneous East African region. The currently accepted “mountain refugia hypothesis” states that populations living in disjunct montane forests belong to a different lineage than those in savanna habitats surrounding these forests. We obtained microsatellite data, mitochondrial sequences, and morphometric data from worker honey bees collected from feral colonies in three montane forests and corresponding neighboring savanna regions in Kenya. Honey bee colonies from montane forests showed distinct worker morphology compared with colonies in savanna areas. Mitochondrial sequence data did not support the existence of the two currently accepted subspecies. Furthermore, analyses of the microsatellite data with a Bayesian clustering method did not support the existence of two source populations as it would be expected under the mountain refugia scenario. Our findings suggest that phenotypic plasticity rather than distinct ancestry is the leading cause behind the phenotypic divergence observed between montane forest and savanna honey bees. Our study thus corroborates the idea that high gene flow may select for increased plasticity. PMID:24223262
Rhythm sensitivity in macaque monkeys
Selezneva, Elena; Deike, Susann; Knyazeva, Stanislava; Scheich, Henning; Brechmann, André; Brosch, Michael
2013-01-01
This study provides evidence that monkeys are rhythm sensitive. We composed isochronous tone sequences consisting of repeating triplets of two short tones and one long tone which humans perceive as repeating triplets of two weak and one strong beat. This regular sequence was compared to an irregular sequence with the same number of randomly arranged short and long tones with no such beat structure. To search for indication of rhythm sensitivity we employed an oddball paradigm in which occasional duration deviants were introduced in the sequences. In a pilot study on humans we showed that subjects more easily detected these deviants when they occurred in a regular sequence. In the monkeys we searched for spontaneous behaviors the animals executed concomitant with the deviants. We found that monkeys more frequently exhibited changes of gaze and facial expressions to the deviants when they occurred in the regular sequence compared to the irregular sequence. In addition we recorded neuronal firing and local field potentials from 175 sites of the primary auditory cortex during sequence presentation. We found that both types of neuronal signals differentiated regular from irregular sequences. Both signals were stronger in regular sequences and occurred after the onset of the long tones, i.e., at the position of the strong beat. Local field potential responses were also significantly larger for the durational deviants in regular sequences, yet in a later time window. We speculate that these temporal pattern-selective mechanisms with a focus on strong beats and their deviants underlie the perception of rhythm in the chosen sequences. PMID:24046732
Yip, Cyril C Y; Lo, Janice Y C; Sridhar, Siddharth; Lung, David C; Luk, Shik; Chan, Kwok-Hung; Chan, Jasper F W; Cheng, Vincent C C; Woo, Patrick C Y; Yuen, Kwok-Yung; Lau, Susanna K P
2017-05-16
A fatal case associated with enterovirus D68 (EV-D68) infection affecting a 10-year-old boy was reported in Hong Kong in 2014. To examine if a new strain has emerged in Hong Kong, we sequenced the partial genome of the EV-D68 strain identified from the fatal case and the complete VP1, and partial 5'UTR and 2C sequences of nine additional EV-D68 strains isolated from patients in Hong Kong. Sequence analysis indicated that a cluster of strains including the previously recognized A2 strains should belong to a separate clade, clade D, which is further divided into subclades D1 and D2. Among the 10 EV-D68 strains, 7 (including the fatal case) belonged to the previously described, newly emerged subclade B3, 2 belonged to subclade B1, and 1 belonged to subclade D1. Three EV-D68 strains, each from subclades B1, B3, and D1, were selected for complete genome sequencing and recombination analysis. While no evidence of recombination was noted among local strains, interclade recombination was identified in subclade D2 strains detected in mainland China in 2008 with VP2 acquired from clade A. This study supports the reclassification of subclade A2 into clade D1, and demonstrates interclade recombination between clades A and D2 in EV-D68 strains from China.
Characterization of a Novel Polerovirus Infecting Maize in China
Chen, Sha; Jiang, Guangzhuang; Wu, Jianxiang; Liu, Yong; Qian, Yajuan; Zhou, Xueping
2016-01-01
A novel virus, tentatively named Maize Yellow Mosaic Virus (MaYMV), was identified from the field-grown maize plants showing yellow mosaic symptoms on the leaves collected from the Yunnan Province of China by the deep sequencing of small RNAs. The complete 5642 nucleotide (nt)-long genome of the MaYMV shared the highest nucleotide sequence identity (73%) to Maize Yellow Dwarf Virus-RMV. Sequence comparisons and phylogenetic analyses suggested that MaYMV represents a new member of the genus Polerovirus in the family Luteoviridae. Furthermore, the P0 protein encoded by MaYMV was demonstrated to inhibit both local and systemic RNA silencing by co-infiltration assays using transgenic Nicotiana benthamiana line 16c carrying the GFP reporter gene, which further supported the identification of a new polerovirus. The biologically-active cDNA clone of MaYMV was generated by inserting the full-length cDNA of MaYMV into the binary vector pCB301. RT-PCR and Northern blot analyses showed that this clone was systemically infectious upon agro-inoculation into N. benthamiana. Subsequently, 13 different isolates of MaYMV from field-grown maize plants in different geographical locations of Yunnan and Guizhou provinces of China were sequenced. Analyses of their molecular variation indicate that the 3′ half of P3–P5 read-through protein coding region was the most variable, whereas the coat protein- (CP-) and movement protein- (MP-)coding regions were the most conserved. PMID:27136578
Characterization of a Novel Polerovirus Infecting Maize in China.
Chen, Sha; Jiang, Guangzhuang; Wu, Jianxiang; Liu, Yong; Qian, Yajuan; Zhou, Xueping
2016-04-28
A novel virus, tentatively named Maize Yellow Mosaic Virus (MaYMV), was identified from the field-grown maize plants showing yellow mosaic symptoms on the leaves collected from the Yunnan Province of China by the deep sequencing of small RNAs. The complete 5642 nucleotide (nt)-long genome of the MaYMV shared the highest nucleotide sequence identity (73%) to Maize Yellow Dwarf Virus-RMV. Sequence comparisons and phylogenetic analyses suggested that MaYMV represents a new member of the genus Polerovirus in the family Luteoviridae. Furthermore, the P0 protein encoded by MaYMV was demonstrated to inhibit both local and systemic RNA silencing by co-infiltration assays using transgenic Nicotiana benthamiana line 16c carrying the GFP reporter gene, which further supported the identification of a new polerovirus. The biologically-active cDNA clone of MaYMV was generated by inserting the full-length cDNA of MaYMV into the binary vector pCB301. RT-PCR and Northern blot analyses showed that this clone was systemically infectious upon agro-inoculation into N. benthamiana. Subsequently, 13 different isolates of MaYMV from field-grown maize plants in different geographical locations of Yunnan and Guizhou provinces of China were sequenced. Analyses of their molecular variation indicate that the 3' half of P3-P5 read-through protein coding region was the most variable, whereas the coat protein- (CP-) and movement protein- (MP-)coding regions were the most conserved.
Biogeography of Burkholderia pseudomallei in the Torres Strait Islands of Northern Australia
Baker, Anthony; Mayo, Mark; Owens, Leigh; Burgess, Graham; Norton, Robert; McBride, William John Hannan; Currie, Bart J.
2013-01-01
It has been hypothesized that biogeographical boundaries are a feature of Burkholderia pseudomallei ecology, and they impact the epidemiology of melioidosis on a global scale. This study examined the relatedness of B. pseudomallei sourced from islands in the Torres Strait of Northern Australia to determine if the geography of isolated island communities is a determinant of the organisms' dispersal. Environmental sampling on Badu Island in the Near Western Island cluster recovered a single clone. An additional 32 clinical isolates from the region were sourced. Isolates were characterized using multilocus sequence typing and a multiplex PCR targeting the flagellum gene cluster. Gene cluster analysis determined that 69% of the isolates from the region encoded the ancestral Burkholderia thailandensis-like flagellum and chemotaxis gene cluster, a proportion significantly lower than that reported from mainland Australia and consistent with observations of isolates from southern Papua New Guinea. A goodness-of-fit test indicated that there was geographic localization of sequence types throughout the archipelago, with the exception of Thursday Island, the economic and cultural hub of the region. Sequence types common to mainland Australia and Papua New Guinea were identified. These findings demonstrate for the first time an environmental reservoir for B. pseudomallei in the Torres Strait, and multilocus sequence typing suggests that the organism is not randomly distributed throughout this region and that seawater may provide a barrier to dispersal of the organism. Moreover, these findings support an anthropogenic dispersal hypothesis for the spread of B. pseudomallei throughout this region. PMID:23698533
Roy, Catherine; Foudi, Fatah; Charton, Jeanne; Jung, Michel; Lang, Hervé; Saussine, Christian; Jacqmin, Didier
2013-04-01
The aim of this retrospective study was to determine the respective accuracies of three types of functional MRI sequences-diffusion-weighted imaging (DWI), dynamic contrast-enhanced (DCE) MRI, and 3D (1)H-MR spectroscopy (MRS)-in the depiction of local prostate cancer recurrence after two different initial therapy options. From a cohort of 83 patients with suspicion of local recurrence based on prostate-specific antigen (PSA) kinetics who were imaged on a 3-T MRI unit using an identical protocol including the three functional sequences with an endorectal coil, we selected 60 patients (group A, 28 patients who underwent radical prostatectomy; group B, 32 patients who underwent external-beam radiation) who had local recurrence ascertained on the basis of a transrectal ultrasound-guided biopsy results and a reduction in PSA level after salvage therapy. All patients presented with a local relapse. Sensitivity with T2-weighted MRI and 3D (1)H-MRS sequences was 57% and 53%, respectively, for group A and 71% and 78%, respectively, for group B. DCE-MRI alone showed a sensitivity of 100% and 96%, respectively, for groups A and B. DWI alone had a higher sensitivity for group B (96%) than for group A (71%). The combination of T2-weighted imaging plus DWI plus DCE-MRI provided a sensitivity as high as 100% in group B. The performance of functional imaging sequences for detecting recurrence is different after radical prostatectomy and external-beam radiotherapy. DCE-MRI is a valid and efficient tool to detect prostate cancer recurrence in radical prostatectomy as well as in external-beam radiotherapy. The combination of DCE-MRI and DWI is highly efficient after radiation therapy. Three-dimensional (1)H-MRS needs to be improved. Even though it is not accurate enough, T2-weighted imaging remains essential for the morphologic analysis of the area.
Insights gained from relating cumulative seismic moments to fluid injection activities
NASA Astrophysics Data System (ADS)
McGarr, A.; Barbour, A. J.
2017-12-01
The three earthquakes with magnitudes of 5 or greater that were induced in Oklahoma during 2016 motivated efforts to improve our understanding of how fluid injection operations are related to earthquake activity. In this study, we have addressed the question of whether the volume of fluid injected down wells within 10 km of the mainshock of an induced earthquake sequence can account for its total moment release. Specifically, is the total moment release equal to, or less than, twice the product of the shear modulus and the total volume injected (McGarr, JGR, 2014, equation 7)? In contrast to McGarr's (2014, equation 13) relationship for the maximum moment, M0(max), the relationship for the total moment release has the advantage of being independent of the magnitude distribution. We find that the three sequences in Oklahoma in 2016, M5.1 Fairview, M5.8 Pawnee, M5.0 Cushing, and the 2011 M5.7 Prague sequence all adhere to this relationship. We also found that eight additional sequences of earthquakes induced by various fluid injection activities, widely distributed worldwide, show the same relationship between total moment-release and injected volume. Thus, for injected volumes ranging from 103 up to 107 cubic m, the moment release of an induced earthquake sequence appears to be similarly limited. These results imply that M0(max) for a sequence induced by fluid injection could be as high as twice the product of the shear modulus and the injected volume if the mainshock in the sequence accounts for nearly all of the total moment, as was the case for the 2016 Pawnee M5.8 mainshock. This new upper bound for maximum moment is twice what was proposed by McGarr (2014, equation 13). Our new results also support the assumption in our analysis that the induced earthquake rupture is localized to the seismogenic region that is weakened owing to a pore pressure increase of the order of a seismic stress drop.
Development of a Novel Technology for Label Free DNA Sequencing
2012-05-21
of the C-H bond stretch vibrations in the planes of the corresponding DNA bases , and in the higher-frequency side, sequence-identifier region is...composed of the N-H bond stretch vibrations in the planes of the corresponding DNA bases . In addition, the sequence-identifier dividing region almost...regions are localized at the corresponding DNA bases and exhibit a definable dependence on the sequence form of the codons under study. Final
NASA Astrophysics Data System (ADS)
Khalaf, Ezz El Din Abdel Hakim
Fatira area in the Central Eastern Desert, Egypt, is a composite terrane consisting of Neoproterozoic volcanics and sediments laid down in submarine to subaerial environment, intruded by voluminous old to young granitic rocks. The various lithofacies of the study area can be grouped in three distinct lithostratigraphic sequences, which are described here in stratigraphic order, from base to top as the Fatira El Beida, Fatira El Zarqa and Gabal Fatira sequences. Each depositional sequence, is intimately related to volcanic activity separated by time intervals of volcanic inactivity, such as marked hiatuses, reworked volcaniclasts, and or turbidite sedimentation. Four submarine facies groups have been recognized within the oldest, folded eruption sequence of Fatira El Beida. The southern part of the study area is occupied by sheet lava (SL), pillow lavas (PL), pillow breccias (PB), and overlying Bouma turbiditic volcaniclastites (VC). The four facies groups of Fatira El Beida sequence occur in a predictable upward-deepening succession, essentially from base to top, an SL-PL-PB-VC stacking pattern. The coeval tholeiitic mafic and felsic volcaniclastic rocks of this sequence indicate an extensional back-arc tectonic setting. The El Beida depositional sequence appears to fit a submarine-fan and slope-apron environment in an intra-arc site. The Fatira El Zarqa sequence involves a large volume of subaerial calc-alkaline intermediate to felsic volcanics and an unconformably overlying siliciclastic succession comprising clast-supported conglomerates (Gm), massive sandstone sheet floods (Sm) and mudstones (FI), together with a lateritic argillite paleosol (P) top formed in an alluvial-fan system. The youngest rock of Gabal Fatira sequence comprises anorogenic trachydacites and rhyolites with locally emergent domes associated with autobrecciation and sill-dyke rock swarms that could be interpreted as feeders and subvolcanic intrusions. Unconformity and lithofacies assemblages define seven events and three unconformity-bounded tectonic stages that record uplift-subsidence cycles in the study area. A proximal-distal relationship has been established within the depositional products, based on the relative dominance of erosional and depositional features.
de Miranda, R L; O'Dwyer, L H; de Castro, J R; Metzger, B; Rubini, A S; Mundim, A V; Eyal, O; Talmi-Frank, D; Cury, M C; Baneth, G
2014-10-01
The objective of this survey was to investigate the prevalence of Hepatozoon infection in dogs in the rural and urban areas of Uberlândia, Brazil by PCR and molecular characterization. DNA was obtained from blood samples collected from 346 local dogs from both genders and various ages. Seventeen PCR products from positive blood samples of urban dogs and 13 from the rural dogs were sequenced. Partial sequences of the 18S rRNA gene indicated that all 30 dogs were infected with Hepatozoon canis similar in sequence to H. canis from southern Europe. Four local dog sequences were submitted to GenBank (accessions JN835188; KF692038; KF692039; KF692040). This study indicates that H. canis is the cause of canine hepatozoonosis in Uberlândia and that infection is similarly widespread in rural and urban dogs. Copyright © 2014. Published by Elsevier Ltd.
Kwasigroch, Jean Marc; Rooman, Marianne
2006-07-15
Prelude&Fugue are bioinformatics tools aiming at predicting the local 3D structure of a protein from its amino acid sequence in terms of seven backbone torsion angle domains, using database-derived potentials. Prelude(&Fugue) computes all lowest free energy conformations of a protein or protein region, ranked by increasing energy, and possibly satisfying some interresidue distance constraints specified by the user. (Prelude&)Fugue detects sequence regions whose predicted structure is significantly preferred relative to other conformations in the absence of tertiary interactions. These programs can be used for predicting secondary structure, tertiary structure of short peptides, flickering early folding sequences and peptides that adopt a preferred conformation in solution. They can also be used for detecting structural weaknesses, i.e. sequence regions that are not optimal with respect to the tertiary fold. http://babylone.ulb.ac.be/Prelude_and_Fugue.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tuskan, Gerald A; Gunter, Lee E; DiFazio, Stephen P
The 18S-28S rDNA and 5S rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 18S-28S rDNA sites and one 5S rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis -type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones selected from 2 linkage groups based on genome sequence assembly (LG-I and LG-VI) were localized on 2 chromosomes, as expected. BACs from LG-I hybridized to the longest chromosome in the complement. All BAC positions were found to be concordant with sequencemore » assembly positions. BAC-FISH will be useful for delineating each of the Populus trichocarpa chromosomes and improving the sequence assembly of this model angiosperm tree species.« less
NASA Astrophysics Data System (ADS)
Negi, Sanjay S.; Paul, Ajay; Cesca, Simone; Kamal; Kriegerowski, Marius; Mahesh, P.; Gupta, Sandeep
2017-08-01
In order to understand present day earthquake kinematics at the Indian plate boundary, we analyse seismic broadband data recorded between 2007 and 2015 by the regional network in the Garhwal-Kumaun region, northwest Himalaya. We first estimate a local 1-D velocity model for the computation of reliable Green's functions, based on 2837 P-wave and 2680 S-wave arrivals from 251 well located earthquakes. The resulting 1-D crustal structure yields a 4-layer velocity model down to the depths of 20 km. A fifth homogeneous layer extends down to 46 km, constraining the Moho using travel-time distance curve method. We then employ a multistep moment tensor (MT) inversion algorithm to infer seismic moment tensors of 11 moderate earthquakes with Mw magnitude in the range 4.0-5.0. The method provides a fast MT inversion for future monitoring of local seismicity, since Green's functions database has been prepared. To further support the moment tensor solutions, we additionally model P phase beams at seismic arrays at teleseismic distances. The MT inversion result reveals the presence of dominant thrust fault kinematics persisting along the Himalayan belt. Shallow low and high angle thrust faulting is the dominating mechanism in the Garhwal-Kumaun Himalaya. The centroid depths for these moderate earthquakes are shallow between 1 and 12 km. The beam modeling result confirm hypocentral depth estimates between 1 and 7 km. The updated seismicity, constrained source mechanism and depth results indicate typical setting of duplexes above the mid crustal ramp where slip is confirmed along out-of-sequence thrusting. The involvement of Tons thrust sheet in out-of-sequence thrusting indicate Tons thrust to be the principal active thrust at shallow depth in the Himalayan region. Our results thus support the critical taper wedge theory, where we infer the microseismicity cluster as a result of intense activity within the Lesser Himalayan Duplex (LHD) system.
Corominas, Jordi; Colijn, Johanna M; Geerlings, Maartje J; Pauper, Marc; Bakker, Bjorn; Amin, Najaf; Lores Motta, Laura; Kersten, Eveline; Garanto, Alejandro; Verlouw, Joost A M; van Rooij, Jeroen G J; Kraaij, Robert; de Jong, Paulus T V M; Hofman, Albert; Vingerling, Johannes R; Schick, Tina; Fauser, Sascha; de Jong, Eiko K; van Duijn, Cornelia M; Hoyng, Carel B; Klaver, Caroline C W; den Hollander, Anneke I
2018-04-26
Genome-wide association studies and targeted sequencing studies of candidate genes have identified common and rare variants that are associated with age-related macular degeneration (AMD). Whole-exome sequencing (WES) studies allow a more comprehensive analysis of rare coding variants across all genes of the genome and will contribute to a better understanding of the underlying disease mechanisms. To date, the number of WES studies in AMD case-control cohorts remains scarce and sample sizes are limited. To scrutinize the role of rare protein-altering variants in AMD cause, we performed the largest WES study in AMD to date in a large European cohort consisting of 1125 AMD patients and 1361 control participants. Genome-wide case-control association study of WES data. One thousand one hundred twenty-five AMD patients and 1361 control participants. A single variant association test of WES data was performed to detect variants that are associated individually with AMD. The cumulative effect of multiple rare variants with 1 gene was analyzed using a gene-based CMC burden test. Immunohistochemistry was performed to determine the localization of the Col8a1 protein in mouse eyes. Genetic variants associated with AMD. We detected significantly more rare protein-altering variants in the COL8A1 gene in patients (22/2250 alleles [1.0%]) than in control participants (11/2722 alleles [0.4%]; P = 7.07×10 -5 ). The association of rare variants in the COL8A1 gene is independent of the common intergenic variant (rs140647181) near the COL8A1 gene previously associated with AMD. We demonstrated that the Col8a1 protein localizes at Bruch's membrane. This study supported a role for protein-altering variants in the COL8A1 gene in AMD pathogenesis. We demonstrated the presence of Col8a1 in Bruch's membrane, further supporting the role of COL8A1 variants in AMD pathogenesis. Protein-altering variants in COL8A1 may alter the integrity of Bruch's membrane, contributing to the accumulation of drusen and the development of AMD. Copyright © 2018 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
Houseknecht, D.W.; Bird, K.J.
2004-01-01
Beaufortian strata (Jurassic-Lower Cretaceous) in the National Petroleum Reserve in Alaska (NPRA) are a focus of exploration since the 1994 discovery of the nearby Alpine oil field (>400 MMBO). These strata include the Kingak Shale, a succession of depositional sequences influenced by rift opening of the Arctic Ocean Basin. Interpretation of sequence stratigraphy and depositional facies from a regional two-dimensional seismic grid and well data allows the definition of four sequence sets that each displays unique stratal geometries and thickness trends across NPRA. A Lower to Middle Jurassic sequence set includes numerous transgressive-regressive sequences that collectively built a clastic shelf in north-central NPRA. Along the south-facing, lobate shelf margin, condensed shales in transgressive systems tracts downlap and coalesce into a basinal condensed section that is likely an important hydrocarbon source rock. An Oxfordian-Kimmeridgian sequence set, deposited during pulses of uplift on the Barrow arch, includes multiple transgressive-regressive sequences that locally contain well-winnowed, shoreface sandstones at the base of transgressive systems tracts. These shoreface sandstones and overlying shales, deposited during maximum flooding, form stratigraphic traps that are the main objective of exploration in the Alpine play in NPRA. A Valanginian sequence set includes at least two transgressive-regressive sequences that display relatively distal characteristics, suggesting high relative sea level. An important exception is the presence of a basal transgressive systems tract that locally contains shoreface sandstones of reservoir quality. A Hauterivian sequence set includes two transgressive-regressive sequences that constitute a shelf-margin wedge developed as the result of tectonic uplift along the Barrow arch during rift opening of the Arctic Ocean Basin. This sequence set displays stratal geometries suggesting incision and synsedimentary collapse of the shelf margin. ?? 2004. The American Association of Petroleum Geologists. All rights reserved.
Time fluctuation analysis of forest fire sequences
NASA Astrophysics Data System (ADS)
Vega Orozco, Carmen D.; Kanevski, Mikhaïl; Tonini, Marj; Golay, Jean; Pereira, Mário J. G.
2013-04-01
Forest fires are complex events involving both space and time fluctuations. Understanding of their dynamics and pattern distribution is of great importance in order to improve the resource allocation and support fire management actions at local and global levels. This study aims at characterizing the temporal fluctuations of forest fire sequences observed in Portugal, which is the country that holds the largest wildfire land dataset in Europe. This research applies several exploratory data analysis measures to 302,000 forest fires occurred from 1980 to 2007. The applied clustering measures are: Morisita clustering index, fractal and multifractal dimensions (box-counting), Ripley's K-function, Allan Factor, and variography. These algorithms enable a global time structural analysis describing the degree of clustering of a point pattern and defining whether the observed events occur randomly, in clusters or in a regular pattern. The considered methods are of general importance and can be used for other spatio-temporal events (i.e. crime, epidemiology, biodiversity, geomarketing, etc.). An important contribution of this research deals with the analysis and estimation of local measures of clustering that helps understanding their temporal structure. Each measure is described and executed for the raw data (forest fires geo-database) and results are compared to reference patterns generated under the null hypothesis of randomness (Poisson processes) embedded in the same time period of the raw data. This comparison enables estimating the degree of the deviation of the real data from a Poisson process. Generalizations to functional measures of these clustering methods, taking into account the phenomena, were also applied and adapted to detect time dependences in a measured variable (i.e. burned area). The time clustering of the raw data is compared several times with the Poisson processes at different thresholds of the measured function. Then, the clustering measure value depends on the threshold which helps to understand the time pattern of the studied events. Our findings detected the presence of overdensity of events in particular time periods and showed that the forest fire sequences in Portugal can be considered as a multifractal process with a degree of time-clustering of the events. Key words: time sequences, Morisita index, fractals, multifractals, box-counting, Ripley's K-function, Allan Factor, variography, forest fires, point process. Acknowledgements This work was partly supported by the SNFS Project No. 200021-140658, "Analysis and Modelling of Space-Time Patterns in Complex Regions". References - Kanevski M. (Editor). 2008. Advanced Mapping of Environmental Data: Geostatistics, Machine Learning and Bayesian Maximum Entropy. London / Hoboken: iSTE / Wiley. - Telesca L. and Pereira M.G. 2010. Time-clustering investigation of fire temporal fluctuations in Portugal, Nat. Hazards Earth Syst. Sci., vol. 10(4): 661-666. - Vega Orozco C., Tonini M., Conedera M., Kanevski M. (2012) Cluster recognition in spatial-temporal sequences: the case of forest fires, Geoinformatica, vol. 16(4): 653-673.
Position-dependent effects of polylysine on Sec protein transport.
Liang, Fu-Cheng; Bageshwar, Umesh K; Musser, Siegfried M
2012-04-13
The bacterial Sec protein translocation system catalyzes the transport of unfolded precursor proteins across the cytoplasmic membrane. Using a recently developed real time fluorescence-based transport assay, the effects of the number and distribution of positive charges on the transport time and transport efficiency of proOmpA were examined. As expected, an increase in the number of lysine residues generally increased transport time and decreased transport efficiency. However, the observed effects were highly dependent on the polylysine position in the mature domain. In addition, a string of consecutive positive charges generally had a more significant effect on transport time and efficiency than separating the charges into two or more charged segments. Thirty positive charges distributed throughout the mature domain resulted in effects similar to 10 consecutive charges near the N terminus of the mature domain. These data support a model in which the local effects of positive charge on the translocation kinetics dominate over total thermodynamic constraints. The rapid translocation kinetics of some highly charged proOmpA mutants suggest that the charge is partially shielded from the electric field gradient during transport, possibly by the co-migration of counter ions. The transport times of precursors with multiple positively charged sequences, or "pause sites," were fairly well predicted by a local effect model. However, the kinetic profile predicted by this local effect model was not observed. Instead, the transport kinetics observed for precursors with multiple polylysine segments support a model in which translocation through the SecYEG pore is not the rate-limiting step of transport.
Position-dependent Effects of Polylysine on Sec Protein Transport*
Liang, Fu-Cheng; Bageshwar, Umesh K.; Musser, Siegfried M.
2012-01-01
The bacterial Sec protein translocation system catalyzes the transport of unfolded precursor proteins across the cytoplasmic membrane. Using a recently developed real time fluorescence-based transport assay, the effects of the number and distribution of positive charges on the transport time and transport efficiency of proOmpA were examined. As expected, an increase in the number of lysine residues generally increased transport time and decreased transport efficiency. However, the observed effects were highly dependent on the polylysine position in the mature domain. In addition, a string of consecutive positive charges generally had a more significant effect on transport time and efficiency than separating the charges into two or more charged segments. Thirty positive charges distributed throughout the mature domain resulted in effects similar to 10 consecutive charges near the N terminus of the mature domain. These data support a model in which the local effects of positive charge on the translocation kinetics dominate over total thermodynamic constraints. The rapid translocation kinetics of some highly charged proOmpA mutants suggest that the charge is partially shielded from the electric field gradient during transport, possibly by the co-migration of counter ions. The transport times of precursors with multiple positively charged sequences, or “pause sites,” were fairly well predicted by a local effect model. However, the kinetic profile predicted by this local effect model was not observed. Instead, the transport kinetics observed for precursors with multiple polylysine segments support a model in which translocation through the SecYEG pore is not the rate-limiting step of transport. PMID:22367204
White, Clayton M.; Sonsthagen, Sarah A.; Sage, George K.; Anderson, Clifford; Talbot, Sandra L.
2013-01-01
The ability to successfully colonize and persist in diverse environments likely requires broad morphological and behavioral plasticity and adaptability, and this may partly explain why the Peregrine Falcon (Falco peregrinus) exhibits a large range of morphological characteristics across their global distribution. Regional and local differences within Peregrine Falcons were sufficiently variable that ∼75 subspecies have been described; many were subsumed, and currently 19 are generally recognized. We used sequence information from the control region of the mitochondrial genome to test for concordance between genetic structure and representatives of 12 current subspecies and from two areas where subspecies distributions overlap. Haplotypes were broadly shared among subspecies, and all geographic locales shared a widely distributed common haplotype (FalconCR2). Haplotypes were distributed in a star-like phylogeny, consistent with rapid expansion of a recently derived species, with observed genetic patterns congruent with incomplete lineage sorting and/or differential rates of evolution on morphology and neutral genetic characters. Hierarchical analyses of molecular variance did not uncover genetic partitioning at the continental level, despite strong population-level structure (FST = 0.228). Similar analyses found weak partitioning, albeit significant, among subspecies (FCT = 0.138). All reconstructions placed the hierofalcons' (Gyrfalcon [F. rusticolus] and Saker Falcon [F. cherrug]) haplotypes in a well-supported clade either basal or unresolved with respect to the Peregrine Falcon. In addition, haplotypes representing Taita Falcon (F. fasciinucha) were placed within the Peregrine Falcon clade.
NASA Astrophysics Data System (ADS)
Oosthuizen, Carel J.; Cowley, Paul D.; Kyle, Scotty R.; Bloomer, Paulette
2016-12-01
Physical and/or physiological constraints are assumed to isolate fish populations confined to or dependent on estuarine habitats. Strong isolation by distance is thus expected to affect connectivity. Such structuring has important implications for sustainable utilisation and replenishment of estuarine stocks that are heavily exploited. Here we present a preliminary investigation of the phylogenetic relationships of the riverbream (Acanthopagrus species) along the southern African coast and the geographic genetic structure of what appears to be a locally endemic species or lineage. Mitochondrial DNA (mtDNA) cytochrome b sequences support the notion that the species occurring along the southern African coast is A. vagus and not A. berda as previously thought. Yet, the taxonomy of this widespread Indo-West Pacific species or species-complex requires more in-depth investigation. No genetic differentiation was detected among estuarine populations of A. vagus based on the analyses of mtDNA ND2 gene sequences and 10 polymorphic nuclear microsatellite markers. The star-like genealogy and statistical analyses are consistent with a recent population expansion event. Spatial analyses of microsatellite genotypes fail to reject the null hypothesis of panmixia, indicative of a recent population expansion or ongoing gene flow between different estuaries. The northern localities were identified as containing most of the observed variation. This study not only provides insight into the phylogenetic relationship of A. vagus relative to other Acanthopagrus species but also sheds light on the demographic history and contemporary gene flow of the species.
Estimation of primate speciation dates using local molecular clocks.
Yoder, A D; Yang, Z
2000-07-01
Protein-coding genes of the mitochondrial genomes from 31 mammalian species were analyzed to estimate the speciation dates within primates and also between rats and mice. Three calibration points were used based on paleontological data: one at 20-25 MYA for the hominoid/cercopithecoid divergence, one at 53-57 MYA for the cetacean/artiodactyl divergence, and the third at 110-130 MYA for the metatherian/eutherian divergence. Both the nucleotide and the amino acid sequences were analyzed, producing conflicting results. The global molecular clock was clearly violated for both the nucleotide and the amino acid data. Models of local clocks were implemented using maximum likelihood, allowing different evolutionary rates for some lineages while assuming rate constancy in others. Surprisingly, the highly divergent third codon positions appeared to contain phylogenetic information and produced more sensible estimates of primate divergence dates than did the amino acid sequences. Estimated dates varied considerably depending on the data type, the calibration point, and the substitution model but differed little among the four tree topologies used. We conclude that the calibration derived from the primate fossil record is too recent to be reliable; we also point out a number of problems in date estimation when the molecular clock does not hold. Despite these obstacles, we derived estimates of primate divergence dates that were well supported by the data and were generally consistent with the paleontological record. Estimation of the mouse-rat divergence date, however, was problematic.
Kocot, Kevin M; Citarella, Mathew R; Moroz, Leonid L; Halanych, Kenneth M
2013-01-01
Molecular phylogenetics relies on accurate identification of orthologous sequences among the taxa of interest. Most orthology inference programs available for use in phylogenomics rely on small sets of pre-defined orthologs from model organisms or phenetic approaches such as all-versus-all sequence comparisons followed by Markov graph-based clustering. Such approaches have high sensitivity but may erroneously include paralogous sequences. We developed PhyloTreePruner, a software utility that uses a phylogenetic approach to refine orthology inferences made using phenetic methods. PhyloTreePruner checks single-gene trees for evidence of paralogy and generates a new alignment for each group containing only sequences inferred to be orthologs. Importantly, PhyloTreePruner takes into account support values on the tree and avoids unnecessarily deleting sequences in cases where a weakly supported tree topology incorrectly indicates paralogy. A test of PhyloTreePruner on a dataset generated from 11 completely sequenced arthropod genomes identified 2,027 orthologous groups sampled for all taxa. Phylogenetic analysis of the concatenated supermatrix yielded a generally well-supported topology that was consistent with the current understanding of arthropod phylogeny. PhyloTreePruner is freely available from http://sourceforge.net/projects/phylotreepruner/.
Thomsen, Rune; Pallesen, Jonatan; Daugaard, Tina F; Børglum, Anders D; Nielsen, Anders L
2013-11-01
Subcellular RNA localization plays an important role in development, cell differentiation, and cell migration. For a comprehensive description of the population of protrusion localized mRNAs in astrocytes we separated protrusions from cell bodies in a Boyden chamber and performed high-throughput direct RNA sequencing. The mRNAs with localization in astrocyte protrusions encode proteins belonging to a variety of functional groups indicating involvement of RNA localization for a palette of cellular functions. The mRNA encoding the intermediate filament protein Nestin was among the identified mRNAs. By RT-qPCR and RNA FISH analysis we confirmed Nestin mRNA localization in cell protrusions and also protrusion localization of Nestin protein. Nestin mRNA localization was dependent of Fragile X mental retardation syndrome proteins Fmrp and Fxr1, and the Nestin 3'-UTR was sufficient to mediate protrusion mRNA localization. The mRNAs for two other intermediate filament proteins in astrocytes, Gfap and Vimentin, have moderate and no protrusion localization, respectively, showing that individual intermediate filament components have different localization mechanisms. The correlated localization of Nestin mRNA with Nestin protein in cell protrusions indicates the presence of a regulatory mechanism at the mRNA localization level for the Nestin intermediate filament protein with potential importance for astrocyte functions during brain development and maintenance. Copyright © 2013 Wiley Periodicals, Inc.
BnNHL18A shows a localization change by stress-inducing chemical treatments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Suk-Bae; Ham, Byung-Kook; Park, Jeong Mee
2006-01-06
The two genes, named BnNHL18A and BnNHL18B, showing sequence homology with Arabidopsis NDR1/HIN1-like (NHL) genes, were isolated from cDNA library prepared with oilseed rape (Brassica napus) seedlings treated with NaCl. The transcript level of BnNHL18A was increased by sodium chloride, ethephon, hydrogen peroxide, methyl jasmonate, or salicylic acid treatment. The coding regions of BnNHL18A and BnNHL18B contain a sarcolipin (SLN)-like sequence. Analysis of the localization of smGFP fusion proteins showed that BnNHL18A is mainly localized to endoplasmic reticulum (ER). This result suggests that the SLN-like sequence plays a role in retaining proteins in ER membrane in plants. In response tomore » NaCl, hydrogen peroxide, ethephon, and salicylic acid treatments, the protein localization of BnNHL18A was changed. Our findings suggest a common function of BnNHL18A in biotic and abiotic stresses, and demonstrate the presence of the shared mechanism of protein translocalization between the responses to plant pathogen and to osmotic stress.« less
NASA Astrophysics Data System (ADS)
Lazo, Edmundo; Saavedra, Eduardo; Humire, Fernando; Castro, Cristobal; Cortés-Cortés, Francisco
2015-09-01
We study the localization properties of direct transmission lines when we distribute two values of inductances LA and LB according to a generalized Thue-Morse aperiodic sequence generated by the inflation rule: A → ABm-1, B → BAm-1, m ≥ 2 and integer. We regain the usual Thue-Morse sequence for m = 2. We numerically study the changes produced in the localization properties of the I (ω) electric current function with increasing m values. We demonstrate that the m = 2 case does not belong to the family m ≥ 3, because when m changes from m = 2 to m = 3, the number of extended states decreases significantly. However, for m ≫ 3, the localization properties become similar to the m = 2 case. Also, the
Michalovova, M; Vyskot, B; Kejnovsky, E
2013-10-01
We analysed the size, relative age and chromosomal localization of nuclear sequences of plastid and mitochondrial origin (NUPTs-nuclear plastid DNA and NUMTs-nuclear mitochondrial DNA) in six completely sequenced plant species. We found that the largest insertions showed lower divergence from organelle DNA than shorter insertions in all species, indicating their recent origin. The largest NUPT and NUMT insertions were localized in the vicinity of the centromeres in the small genomes of Arabidopsis and rice. They were also present in other chromosomal regions in the large genomes of soybean and maize. Localization of NUPTs and NUMTs correlated positively with distribution of transposable elements (TEs) in Arabidopsis and sorghum, negatively in grapevine and soybean, and did not correlate in rice or maize. We propose a model where new plastid and mitochondrial DNA sequences are inserted close to centromeres and are later fragmented by TE insertions and reshuffled away from the centromere or removed by ectopic recombination. The mode and tempo of TE dynamism determines the turnover of NUPTs and NUMTs resulting in their species-specific chromosomal distributions.
NASA Astrophysics Data System (ADS)
Nestares, Oscar; Miravet, Carlos; Santamaria, Javier; Fonolla Navarro, Rafael
1999-05-01
Automatic object segmentation in highly noisy image sequences, composed by a translating object over a background having a different motion, is achieved through joint motion-texture analysis. Local motion and/or texture is characterized by the energy of the local spatio-temporal spectrum, as different textures undergoing different translational motions display distinctive features in their 3D (x,y,t) spectra. Measurements of local spectrum energy are obtained using a bank of directional 3rd order Gaussian derivative filters in a multiresolution pyramid in space- time (10 directions, 3 resolution levels). These 30 energy measurements form a feature vector describing texture-motion for every pixel in the sequence. To improve discrimination capability and reduce computational cost, we automatically select those 4 features (channels) that best discriminate object from background, under the assumptions that the object is smaller than the background and has a different velocity or texture. In this way we reject features irrelevant or dominated by noise, that could yield wrong segmentation results. This method has been successfully applied to sequences with extremely low visibility and for objects that are even invisible for the eye in absence of motion.
Sequence independent amplification of DNA
Bohlander, S.K.
1998-03-24
The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example, the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei. 25 figs.
Sequence independent amplification of DNA
Bohlander, Stefan K.
1998-01-01
The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei.
Hills, Ronald D.; Kathuria, Sagar V.; Wallace, Louise A.; Day, Iain J.; Brooks, Charles L.; Matthews, C. Robert
2010-01-01
The thermodynamic hypothesis of Anfinsen postulates that structures and stabilities of globular proteins are determined by their amino acid sequences. Chain topology, however, is known to influence the folding reaction, in that motifs with a preponderance of local interactions typically fold more rapidly than those with a larger fraction of non-local interactions. Together, the topology and sequence can modulate the energy landscape and influence the rate at which the protein folds to the native conformation. To explore the relationship of sequence and topology in the folding of βα–repeat proteins, which are dominated by local interactions, a combined experimental and simulation analysis was performed on two members of the flavodoxin-like, α/β/α sandwich fold. Spo0F and the N-terminal receiver domain of NtrC (NT-NtrC) have similar topologies but low sequence identity, enabling a test of the effects of sequence on folding. Experimental results demonstrated that both response-regulator proteins fold via parallel channels through highly structured sub-millisecond intermediates before accessing their cis prolyl peptide bond-containing native conformations. Global analysis of the experimental results preferentially places these intermediates off the productive folding pathway. Sequence-sensitive Gō-model simulations conclude that frustration in the folding in Spo0F, corresponding to the appearance of the off-pathway intermediate, reflects competition for intra-subdomain van der Waals contacts between its N- and C-terminal subdomains. The extent of transient, premature structure appears to correlate with the number of isoleucine, leucine and valine (ILV) side-chains that form a large sequence-local cluster involving the central β-sheet and helices α2, α3 and α4. The failure to detect the off-pathway species in the simulations of NT-NtrC may reflect the reduced number of ILV side-chains in its corresponding hydrophobic cluster. The location of the hydrophobic clusters in the structure may also be related to the differing functional properties of these response regulators. Comparison with the results of previous experimental and simulation analyses on the homologous CheY argues that prematurely-folded unproductive intermediates are a common property of the βα-repeat motif. PMID:20226790
A Generative Angular Model of Protein Structure Evolution
Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun
2017-01-01
Abstract Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Vlahovicek, K; Munteanu, M G; Pongor, S
1999-01-01
Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporate sequence dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations quite sensitively depends on the sequence, for example the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions, respectively, that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http:@www.icgeb.trieste.it/dna).
Memory-efficient dynamic programming backtrace and pairwise local sequence alignment.
Newberg, Lee A
2008-08-15
A backtrace through a dynamic programming algorithm's intermediate results in search of an optimal path, or to sample paths according to an implied probability distribution, or as the second stage of a forward-backward algorithm, is a task of fundamental importance in computational biology. When there is insufficient space to store all intermediate results in high-speed memory (e.g. cache) existing approaches store selected stages of the computation, and recompute missing values from these checkpoints on an as-needed basis. Here we present an optimal checkpointing strategy, and demonstrate its utility with pairwise local sequence alignment of sequences of length 10,000. Sample C++-code for optimal backtrace is available in the Supplementary Materials. Supplementary data is available at Bioinformatics online.
Design of Conditionally Active STATs: Insights into STAT Activation and Gene Regulatory Function
Milocco, Lawrence H.; Haslam, Jennifer A.; Rosen, Jonathan; Seidel, H. Martin
1999-01-01
The STAT (signal transducer and activator of transcription) signaling pathway is activated by a large number of cytokines and growth factors. We sought to design a conditionally active STAT that could not only provide insight into basic questions about STAT function but also serve as a powerful tool to determine the precise biological role of STATs. To this end, we have developed a conditionally active STAT by fusing STATs with the ligand-binding domain of the estrogen receptor (ER). We have demonstrated that the resulting STAT-ER chimeras are estrogen-inducible transcription factors that retain the functional and biochemical characteristics of the cognate wild-type STATs. In addition, these tools have allowed us to evaluate separately the contribution of tyrosine phosphorylation and dimerization to STAT function. We have for the first time provided experimental data supporting the model that the only apparent role of STAT tyrosine phosphorylation is to drive dimerization, as dimerization alone is sufficient to unmask a latent STAT nuclear localization sequence and induce nuclear translocation, sequence-specific DNA binding, and transcriptional activity. PMID:10082558
Cao, Ya-Nan; Wang, Ian J; Chen, Lu-Yao; Ding, Yan-Qian; Liu, Lu-Xian; Qiu, Ying-Xiong
2018-04-17
The relative roles of geography, climate and ecology in driving population divergence and (incipient) speciation has so far been largely neglected in studies addressing the evolution of East Asia's island flora. Here, we employed chloroplast and ribosomal DNA sequences and restriction site-associated DNA sequencing (RADseq) loci to investigate the phylogeography and drivers of population divergence of Neolitsea sericea. These data sets support the subdivision of N. sericea populations into the Southern and Northern lineages across the 'Tokara gap'. Two distinct sublineages were further identified for the Northern lineage of N. sericea from the RADseq data. RADseq was also used along with approximate Bayesian computation to show that the current distribution and differentiation of N. sericea populations resulted from a combination of relatively ancient migration and successive vicariant events that likely occurred during the mid to late Pleistocene. Landscape genomic analyses showed that, apart from geographic barriers, barrier, potentially local adaptation to different climatic conditions appears to be one of the major drivers for lineage diversification of N. sericea. Copyright © 2018 Elsevier Inc. All rights reserved.
Structural evolution of Arsia Mons, Pavonis Mons, and Ascreus Mons Tharsis region of Mars
NASA Technical Reports Server (NTRS)
Crumpler, L. S.; Aubele, J. C.
1978-01-01
Analysis of Viking Orbiter data suggests that Arsia Mons, Pavonis Mons, and Ascreus Mons, three large shield volcanoes of the Tharsis volcanoes of Mars, have had similar evolutionary trends. Arsia Mons appears to have developed in the following sequence: (1) construction of a main shield volcano, (2) outbreak of parasitic eruption centers on the northeast and southwest flanks, (3) volcano-tectonic subsidence of the summit and formation of concentric fractures and grabens, possibly by evacuation of an underlying magma chamber during eruption of copious lavas from parasitic eruption centers on the northeast and southwest flanks, and (4) continued volcanism along a fissure or rift bisecting the main shield, resulting in flooding of the floor of the volcano-tectonic depression and inundation of the northeast and southwest flanks by voluminous lavas locally forming parasitic shields. In terms of this sequence Pavonis Mons has developed to stage (3) and Ascreus Mons has evolved to stage (2). This interpretation is supported by crater frequency-diameter distributions in the 0.1- to 3.0-km diameter range.
Wright, David L; Magnuson, Curt E; Black, Charles B
2005-09-01
Individuals practiced two unique discrete sequence production tasks that differed in their relative time profile in either a blocked or random practice schedule. Each participant was subsequently administered a "precuing" protocol to examine the cost of initially compiling or modifying the plan for an upcoming movement's relative timing. The findings indicated that, in general, random practice facilitated the programming of the required movement timing, and this was accomplished while exhibiting greater accuracy in movement production. Participants exposed to random practice exhibited the greatest motor programming benefit, when a modification to an already prepared movement timing profile was required. When movement timing was only partially constructed prior to the imperative signal, the individuals who were trained in blocked and random practice formats accrued a similar cost to complete the programming process. These data provide additional support for the recent claim of Immink & Wright (2001) that at least some of the benefit from experience in a random as opposed to blocked training context can be localized to superior development and implementation of the motor programming process before executing the movement.
Jobke, Bjoern; Bolbos, Radu; Saadat, Ehsan; Cheng, Jonathan; Li, Xiaojuan; Majumdar, Sharmila
2013-01-01
The application of biomolecular magnetic resonance imaging becomes increasingly important in the context of early cartilage changes in degenerative and inflammatory joint disease before gross morphological changes become apparent. In this limited technical report, we investigate the correlation of MRI T1, T2 and T1ρ relaxation times with quantitative biochemical measurements of proteoglycan and collagen contents of cartilage in close synopsis with histologic morphology. A recently developed MRI sequence, T1ρ, was able to detect early intracartilaginous degeneration quantitatively and also qualitatively by color mapping demonstrating a higher sensitivity than standard T2-weighted sequences. The results correlated highly with reduced proteoglycan content and disrupted collagen architecture as measured by biochemistry and histology. The findings lend support to a clinical implementation that allows rapid visual capturing of pathology on a local, millimeter level. Further information about articular cartilage quality otherwise not detectable in vivo, via normal inspection, is needed for orthopedic treatment decisions in the present and future. Copyright © 2013 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Alaniz Rodrigues, Marcos; Dumont, Luiz Felipe Cestari; dos Santos, Cléverson Rannieri Meira; D'Incao, Fernando; Weiss, Steven; Froufe, Elsa
2017-10-01
For the first time, a molecular approach was used to evaluate the phylogenetic structure of the disjunct native American distribution of the blue crab Callinectes sapidus. Population structure was investigated by sequencing 648bp of the Cytochrome oxidase subunit 1 (COI), in a total of 138 sequences stemming from individual samples from both the northern and southern hemispheres of the Western Atlantic distribution of the species. A Bayesian approach was used to construct a phylogenetic tree for all samples, and a 95% confidence parsimony network was created to depict the relationship among haplotypes. Results revealed two highly distinct lineages, one containing all samples from the United States and some from Brazil (lineage 1) and the second restricted to Brazil (lineage 2). In addition, gene flow (at least for females) was detected among estuaries at local scales and there is evidence for shared haplotypes in the south. Furthermore, the findings of this investigation support the contemporary introduction of haplotypes that have apparently spread from the south to the north Atlantic.
Solovyeva, Evgeniya N; Dunayev, Evgeniy N; Nazarov, Roman A; Mehdi Radjabizadeh; Poyarkov, Nikolay A
2018-01-01
The morphological and genetic variation of a wide-ranging Secret Toad-headed agama, Phrynocephalus mystaceus that inhabits sand deserts of south-eastern Europe, Middle East, Middle Asia, and western China is reviewed. Based on the morphological differences and high divergence in COI (mtDNA) gene sequences a new subspecies of Ph. mystaceus is described from Khorasan Razavi Province in Iran. Partial sequences of COI mtDNA gene of 31 specimens of Ph. mystaceus from 17 localities from all major parts of species range were analyzed. Genetic distances show a deep divergence between Ph. mystaceus khorasanus ssp. n. from Khorasan Razavi Province and all other populations of Ph. mystaceus . The new subspecies can be distinguished from other populations of Ph. mystaceus by a combination of several morphological features. Molecular and morphological analyses do not support the validity of other Ph. mystaceus subspecies described from Middle Asia and Caspian basin. Geographic variations in the Ph. mystaceus species complex and the status of previously described subspecies were discussed.
Instructional versus schedule control of humans' choices in situations of diminishing returns
Hackenberg, Timothy D.; Joker, Veronica R.
1994-01-01
Four adult humans chose repeatedly between a fixed-time schedule (of points later exchangeable for money) and a progressive-time schedule that began at 0 s and increased by a fixed number of seconds with each point delivered by that schedule. Each point delivered by the fixed-time schedule reset the requirements of the progressive-time schedule to its minimum value. Subjects were provided with instructions that specified a particular sequence of choices. Under the initial conditions, the instructions accurately specified the optimal choice sequence. Thus, control by instructions and optimal control by the programmed contingencies both supported the same performance. To distinguish the effects of instructions from schedule sensitivity, the correspondence between the instructed and optimal choice patterns was gradually altered across conditions by varying the step size of the progressive-time schedule while maintaining the same instructions. Step size was manipulated, typically in 1-s units, first in an ascending and then in a descending sequence of conditions. Instructions quickly established control in all 4 subjects but, by narrowing the range of choice patterns, they reduced subsequent sensitivity to schedule changes. Instructional control was maintained across the ascending sequence of progressive-time values for each subject, but eventually diminished, giving way to more schedule-appropriate patterns. The transition from instruction-appropriate to schedule-appropriate behavior was characterized by an increase in the variability of choice patterns and local increases in point density. On the descending sequence of progressive-time values, behavior appeared to be schedule sensitive, sometimes even optimally sensitive, but it did not always change systematically with the contingencies, suggesting the involvement of other factors. PMID:16812747
Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren
2011-05-01
The mobile behavior of 1481 peptides in ion mobility spectrometry (IMS), which are generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. sequence-based approach and structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression is employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison on the resulting statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach can give the QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, though the QSSR modeling using sequence-based approach is not needed for the preparation of the minimization structures of peptides before the modeling, it would be considerably efficient as compared to that using structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Kanamori, Keiko; Ross, Brian D.
1999-08-01
Three-dimensional image-selected in vivo spectroscopy (ISIS) was combined with phase-cycled 1H-15N heteronuclear multiple-quantum coherence (HMQC) transfer NMR for localized selective observation of protons J-coupled to 15N in phantoms and in vivo. The ISIS-HMQC sequence, supplemented by jump-return water suppression, permitted localized selective observation of 2-5 μmol of [15Nindole]tryptophan, a precursor of the neurotransmitter serotonin, through the 15N-coupled proton in 20-40 min of acquisition in vitro at 4.7 T. In vivo, the amide proton of [5-15N]glutamine was selectively observed in the brain of spontaneously breathing 15NH4+-infused rats, using a volume probe with homogeneous 1H and 15N fields. Signal recovery after three-dimensional localization was 72-82% in phantoms and 59 ± 4% in vivo. The result demonstrates that localized selective observation of 15N-coupled protons, with complete cancellation of all other protons except water, can be achieved in spontaneously breathing animals by the ISIS-HMQC sequence. This sequence performs both volume selection and heteronuclear editing through an addition/subtraction scheme and predicts the highest intrinsic sensitivity for detection of 15N-coupled protons in the selected volume. The advantages and limitations of this method for in vivo application are compared to those of other localized editing techniques currently in use for non-exchanging protons.
Sadowska, Agnieszka; Paukszto, Lukasz; Nynca, Anna; Szczerbal, Izabela; Orlowska, Karina; Swigonska, Sylwia; Ruszkowska, Monika; Molcan, Tomasz; Jastrzebski, Jan P; Panasiewicz, Grzegorz; Ciereszko, Renata E
2017-03-01
Aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor best known for mediating xenobiotic-induced toxicity. AhR requires aryl hydrocarbon receptor nuclear translocator (ARNT) to form an active transcription complex and promote the activation of genes which have dioxin responsive element in their regulatory regions. The present study was performed to determine the complete cDNA sequences of porcine AhR and ARNT genes and their chromosomal localization. Total RNA from porcine livers were used to obtain the sequence of the entire porcine transcriptome by next-generation sequencing (NGS; lllumina HiSeq2500). In addition, both, in silico analysis and fluorescence in situ hybridization (FISH) were used to determine chromosomal localization of porcine AhR and ARNT genes. In silico analysis of nucleotide sequences showed that there were two transcript variants of AhR and ARNT genes in the pig. In addition, computer analysis revealed that AhR gene in the pig is located on chromosome 9 and ARNT on chromosome 4. The results of FISH experiment confirmed the localization of porcine AhR and ARNT genes. In the present study, for the first time, the full cDNAs of AhR and ARNT were demonstrated in the pig. In future, it would be interesting to determine the tissue distribution of AhR and ARNT transcript variants in the pig and to test whether these variants are associated with different biological functions and/or different activation pathways.
(Pea)nuts and bolts of visual narrative: Structure and meaning in sequential image comprehension
Cohn, Neil; Paczynski, Martin; Jackendoff, Ray; Holcomb, Phillip J.; Kuperberg, Gina R.
2012-01-01
Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: 1) Normal sequences with both structure and meaning, 2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), 3) Structural Only sequences (narrative structure but no semantic relatedness), and 4) Scrambled sequences of randomly-ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning. PMID:22387723
Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; ...
2014-11-10
Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies in this paper expand on those observations through protein structure, mutagenesis andmore » cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. Finally, MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia.« less
Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P.; Hefty, P. Scott
2015-01-01
Summary Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis, and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient E. coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. PMID:25382739
Kemege, Kyle E; Hickey, John M; Barta, Michael L; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P; Hefty, P Scott
2015-02-01
Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. © 2014 John Wiley & Sons Ltd.
A Shellcode Detection Method Based on Full Native API Sequence and Support Vector Machine
NASA Astrophysics Data System (ADS)
Cheng, Yixuan; Fan, Wenqing; Huang, Wei; An, Jing
2017-09-01
Dynamic monitoring the behavior of a program is widely used to discriminate between benign program and malware. It is usually based on the dynamic characteristics of a program, such as API call sequence or API call frequency to judge. The key innovation of this paper is to consider the full Native API sequence and use the support vector machine to detect the shellcode. We also use the Markov chain to extract and digitize Native API sequence features. Our experimental results show that the method proposed in this paper has high accuracy and low detection rate.
Mahoney, J. Matthew; Titiz, Ali S.; Hernan, Amanda E.; Scott, Rod C.
2016-01-01
Hippocampal neural systems consolidate multiple complex behaviors into memory. However, the temporal structure of neural firing supporting complex memory consolidation is unknown. Replay of hippocampal place cells during sleep supports the view that a simple repetitive behavior modifies sleep firing dynamics, but does not explain how multiple episodes could be integrated into associative networks for recollection during future cognition. Here we decode sequential firing structure within spike avalanches of all pyramidal cells recorded in sleeping rats after running in a circular track. We find that short sequences that combine into multiple long sequences capture the majority of the sequential structure during sleep, including replay of hippocampal place cells. The ensemble, however, is not optimized for maximally producing the behavior-enriched episode. Thus behavioral programming of sequential correlations occurs at the level of short-range interactions, not whole behavioral sequences and these short sequences are assembled into a large and complex milieu that could support complex memory consolidation. PMID:26866597
DNA sequence chromatogram browsing using JAVA and CORBA.
Parsons, J D; Buehler, E; Hillier, L
1999-03-01
DNA sequence chromatograms (traces) are the primary data source for all large-scale genomic and expressed sequence tags (ESTs) sequencing projects. Access to the sequencing trace assists many later analyses, for example contig assembly and polymorphism detection, but obtaining and using traces is problematic. Traces are not collected and published centrally, they are much larger than the base calls derived from them, and viewing them requires the interactivity of a local graphical client with local data. To provide efficient global access to DNA traces, we developed a client/server system based on flexible Java components integrated into other applications including an applet for use in a WWW browser and a stand-alone trace viewer. Client/server interaction is facilitated by CORBA middleware which provides a well-defined interface, a naming service, and location independence. [The software is packaged as a Jar file available from the following URL: http://www.ebi.ac.uk/jparsons. Links to working examples of the trace viewers can be found at http://corba.ebi.ac.uk/EST. All the Washington University mouse EST traces are available for browsing at the same URL.
Teaching Civil Procedure with the Aid of Local Tort Litigation.
ERIC Educational Resources Information Center
Anderson, Lloyd C.; Kirkwood, Charles E.
1987-01-01
A course in civil procedure using local tort litigation and classroom simulation of the trial has been enthusiastically recevied by students and useful in teaching appropriate procedure, sequencing, questioning, and professional cooperation. (MSE)
Computer-assisted segmentation of white matter lesions in 3D MR images using support vector machine.
Lao, Zhiqiang; Shen, Dinggang; Liu, Dengfeng; Jawad, Abbas F; Melhem, Elias R; Launer, Lenore J; Bryan, R Nick; Davatzikos, Christos
2008-03-01
Brain lesions, especially white matter lesions (WMLs), are associated with cardiac and vascular disease, but also with normal aging. Quantitative analysis of WML in large clinical trials is becoming more and more important. In this article, we present a computer-assisted WML segmentation method, based on local features extracted from multiparametric magnetic resonance imaging (MRI) sequences (ie, T1-weighted, T2-weighted, proton density-weighted, and fluid attenuation inversion recovery MRI scans). A support vector machine classifier is first trained on expert-defined WMLs, and is then used to classify new scans. Postprocessing analysis further reduces false positives by using anatomic knowledge and measures of distance from the training set. Cross-validation on a population of 35 patients from three different imaging sites with WMLs of varying sizes, shapes, and locations tests the robustness and accuracy of the proposed segmentation method, compared with the manual segmentation results from two experienced neuroradiologists.
DXplain: a Web-based diagnostic decision support system for medical students.
London, S
1998-01-01
DXplain is a diagnostic decision support program, with a new World Wide Web interface, designed to help medical students and physicians formulate differential diagnoses based on clinical findings. It covers over 2000 diseases and 5000 clinical manifestations. DXplain suggests possible diagnoses, and provides brief descriptions of every disease in the database. Not all diseases are included, nor does DXplain take into account preexisting conditions or the chronological sequence of clinical manifestations. Despite these limitations, it is a useful educational tool, particularly for problem-based learning (PBL) cases and for students in clinical rotations, as it fills a niche not adequately covered by MEDLINE or medical texts. The system is relatively self-explanatory, requiring little or no end-user training. Medical libraries offering, or planning to offer, their users access to Web-based materials and resources may find this system a valuable addition to their electronic collections. Should it prove popular with the local users, provision of access may also establish or enhance the library's image as a partner in medical education.
González-Rocha, Gerardo; Muñoz-Cartes, Gabriel; Canales-Aguirre, Cristian B.; Lima, Celia A.; Domínguez-Yévenes, Mariana; Bello-Toledo, Helia
2017-01-01
It has been proposed that Antarctic environments select microorganisms with unique biochemical adaptations, based on the tenet ‘Everything is everywhere, but, the environment selects’ by Baas-Becking. However, this is a hypothesis that has not been extensively evaluated. This study evaluated the fundamental prediction contained in this hypothesis—in the sense that species are structured in the landscape according to their local habitats-, using as study model the phylogenetic diversity of the culturable bacteria of Fildes Peninsula (King George Island, Antarctica). Eighty bacterial strains isolated from 10 different locations in the area, were recovered. Based on phylogenetic analysis of 16S rRNA gene sequences, the isolates were grouped into twenty-six phylotypes distributed in three main clades, of which only six are exclusive to Antarctica. Results showed that phylotypes do not group significantly by habitat type; however, local habitat types had phylogenetic signal, which support the phylogenetic niche conservatism hypothesis and not a selective role of the environment like the Baas-Becking hypothesis suggests. We propose that, more than habitat selection resulting in new local adaptations and diversity, local historical colonization and species sorting (i.e. differences in speciation and extinction rates that arise by interaction of species level traits with the environment) play a fundamental role on the culturable bacterial diversity in Antarctica. PMID:28632790
González-Rocha, Gerardo; Muñoz-Cartes, Gabriel; Canales-Aguirre, Cristian B; Lima, Celia A; Domínguez-Yévenes, Mariana; Bello-Toledo, Helia; Hernández, Cristián E
2017-01-01
It has been proposed that Antarctic environments select microorganisms with unique biochemical adaptations, based on the tenet 'Everything is everywhere, but, the environment selects' by Baas-Becking. However, this is a hypothesis that has not been extensively evaluated. This study evaluated the fundamental prediction contained in this hypothesis-in the sense that species are structured in the landscape according to their local habitats-, using as study model the phylogenetic diversity of the culturable bacteria of Fildes Peninsula (King George Island, Antarctica). Eighty bacterial strains isolated from 10 different locations in the area, were recovered. Based on phylogenetic analysis of 16S rRNA gene sequences, the isolates were grouped into twenty-six phylotypes distributed in three main clades, of which only six are exclusive to Antarctica. Results showed that phylotypes do not group significantly by habitat type; however, local habitat types had phylogenetic signal, which support the phylogenetic niche conservatism hypothesis and not a selective role of the environment like the Baas-Becking hypothesis suggests. We propose that, more than habitat selection resulting in new local adaptations and diversity, local historical colonization and species sorting (i.e. differences in speciation and extinction rates that arise by interaction of species level traits with the environment) play a fundamental role on the culturable bacterial diversity in Antarctica.
Hicks, Martin J; Funato, Kosuke; Wang, Lan; Aronowitz, Eric; Dyke, Jonathan P; Ballon, Douglas J; Havlicek, David F; Frenk, Esther Z; De, Bishnu P; Chiuchiolo, Maria J; Sondhi, Dolan; Hackett, Neil R; Kaminsky, Stephen M; Tabar, Viviane; Crystal, Ronald G
2015-01-01
The median survival of glioblastoma multiforme (GBM) is approximately 1 year. Following surgical removal, systemic therapies are limited by the blood-brain barrier. To circumvent this, we developed a method to modify neurons with the genetic sequence for therapeutic monoclonal antibodies using adeno-associated virus (AAV) gene transfer vectors, directing persistent, local expression in the tumor milieu. The human U87MG GBM cell line or patient-derived early passage GBM cells were administered to the striatum of NOD/SCID immunodeficient mice. AAVrh.10BevMab, an AAVrh.10-based vector coding for bevacizumab (Avastin), an anti-human vascular endothelial growth factor (VEGF) monoclonal antibody, was delivered to the area of the GBM xenograft. Localized expression of bevacizumab was demonstrated by quantitative PCR, ELISA and western blotting. Immunohistochemistry showed that bevacizumab was expressed in neurons. Concurrent administration of AAVrh.10BevMab with the U87MG tumor reduced tumor blood vessel density and tumor volume, and increased survival. Administration of AAVrh.10BevMab 1 week after U87MG xenograft reduced growth and increased survival. Studies with patient-derived early passage GBM primary cells showed a reduction in primary tumor burden with an increased survival. These data support the strategy of AAV-mediated central nervous system gene therapy to treat GBM, overcoming the blood-brain barrier through local, persistent delivery of an anti-angiogenesis monoclonal antibody.
Nair, Pradeep S; John, Eugene B
2007-01-01
Aligning specific sequences against a very large number of other sequences is a central aspect of bioinformatics. With the widespread availability of personal computers in biology laboratories, sequence alignment is now often performed locally. This makes it necessary to analyse the performance of personal computers for sequence aligning bioinformatics benchmarks. In this paper, we analyse the performance of a personal computer for the popular BLAST and FASTA sequence alignment suites. Results indicate that these benchmarks have a large number of recurring operations and use memory operations extensively. It seems that the performance can be improved with a bigger L1-cache.
Yordanova, Juliana; Kolev, Vasil; Bruns, Eike; Kirov, Roumen; Verleger, Rolf
2017-11-01
The present study explored the sleep mechanisms which may support awareness of hidden regularities. Before sleep, 53 participants learned implicitly a lateralized variant of the serial response-time task in order to localize sensorimotor encoding either in the left or right hemisphere and induce implicit regularity representations. Electroencephalographic (EEG) activity was recorded at multiple electrodes during both task performance and sleep, searching for lateralized traces of the preceding activity during learning. Sleep EEG analysis focused on region-specific slow (9-12 Hz) and fast (13-16 Hz) sleep spindles during nonrapid eye movement sleep. Fast spindle activity at those motor regions that were activated during learning increased with the amount of postsleep awareness. Independently of side of learning, spindle activity at right frontal and fronto-central regions was involved: there, fast spindles increased with the transformation of sequence knowledge from implicit before sleep to explicit after sleep, and slow spindles correlated with individual abilities of gaining awareness. These local modulations of sleep spindles corresponded to regions with greater presleep activation in participants with postsleep explicit knowledge. Sleep spindle mechanisms are related to explicit awareness (1) by tracing the activation of motor cortical and right-hemisphere regions which had stronger involvement already during learning and (2) by recruitment of individually consolidated processing modules in the right hemisphere. The integration of different sleep spindle mechanisms with functional states during wake collectively supports the gain of awareness of previously experienced regularities, with a special role for the right hemisphere. © Sleep Research Society 2017. Published by Oxford University Press [on behalf of the Sleep Research Society].
What is that mysterious booming sound?
Hill, David P.
2011-01-01
The residents of coastal North Carolina are occasionally treated to sequences of booming sounds of unknown origin. The sounds are often energetic enough to rattle windows and doors. A recent sequence occurred in early January 2011 during clear weather with no evidence of local thunder storms. Queries by a local reporter (Colin Hackman of the NBC affiliate WETC in Wilmington, North Carolina, personal communication 2011) seemed to eliminate common anthropogenic sources such as sonic booms or quarry blasts. So the commonly asked question, “What's making these booming sounds?” remained (and remains) unanswered.
Nuclear localization of Merkel cell polyomavirus large T antigen in Merkel cell carcinoma
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakamura, Tomoyuki; Sato, Yuko; Watanabe, Daisuke
2010-03-15
To clarify whether mutations in the large T gene encoded by Merkel cell polyomavirus affect the expression and function of large T antigen in Merkel cell carcinoma cases, we investigated the expression of large T antigen in vitro and in vivo. Immunohistochemistry using a rabbit polyclonal antibody revealed that large T antigen was expressed in the nuclei of Merkel cell carcinoma cells with Merkel cell polyomavirus infection. Deletion mutant analyses identified an Arg-Lys-Arg-Lys sequence (amino acids 277-280) as a nuclear localization signal in large T antigen. Sequence analyses revealed that there were no mutations in the nuclear localization signal inmore » any of the eleven Merkel cell polyomavirus strains examined. Furthermore, stop codons were not observed in the upstream of the nuclear localization signal in any of the Merkel cell carcinoma cases examined. These data suggest that the nuclear localization signal is highly conserved and functional in Merkel cell carcinoma cases.« less
Advanced surface-enhanced Raman gene probe systems and methods thereof
Vo-Dinh, Tuan
2001-01-01
The subject invention is a series of methods and systems for using the Surface-Enhanced Raman (SER)-labeled Gene Probe for hybridization, detection and identification of SER-labeled hybridized target oligonucleotide material comprising the steps of immobilizing SER-labeled hybridized target oligonucleotide material on a support means, wherein the SER-labeled hybridized target oligonucleotide material comprise a SER label attached either to a target oligonucleotide of unknown sequence or to a gene probe of known sequence complementary to the target oligonucleotide sequence, the SER label is unique for the target oligonucleotide strands of a particular sequence wherein the SER-labeled oligonucleotide is hybridized to its complementary oligonucleotide strand, then the support means having the SER-labeled hybridized target oligonucleotide material adsorbed thereon is SERS activated with a SERS activating means, then the support means is analyzed.
Driving on the surface of Mars with the rover sequencing and visualization program
NASA Technical Reports Server (NTRS)
Wright, J.; Hartman, F.; Cooper, B.; Maxwell, S.; Yen, J.; Morrison, J.
2005-01-01
Operating a rover on Mars is not possible using teleoperations due to the distance involved and the bandwith limitations. To operate these rovers requires sophisticated tools to make operators knowledgeable of the terrain, hazards, features of interest, and rover state and limitations, and to support building command sequences and rehearsing expected operations. This paper discusses how the Rover Sequencing and Visualization program and a small set of associated tools support this requirement.
Lee, Wonhoon; Park, Jongsun; Choi, Jaeyoung; Jung, Kyongyong; Park, Bongsoo; Kim, Donghan; Lee, Jaeyoung; Ahn, Kyohun; Song, Wonho; Kang, Seogchan; Lee, Yong-Hwan; Lee, Seunghwan
2009-01-01
Background Sequences and organization of the mitochondrial genome have been used as markers to investigate evolutionary history and relationships in many taxonomic groups. The rapidly increasing mitochondrial genome sequences from diverse insects provide ample opportunities to explore various global evolutionary questions in the superclass Hexapoda. To adequately support such questions, it is imperative to establish an informatics platform that facilitates the retrieval and utilization of available mitochondrial genome sequence data. Results The Insect Mitochondrial Genome Database (IMGD) is a new integrated platform that archives the mitochondrial genome sequences from 25,747 hexapod species, including 112 completely sequenced and 20 nearly completed genomes and 113,985 partially sequenced mitochondrial genomes. The Species-driven User Interface (SUI) of IMGD supports data retrieval and diverse analyses at multi-taxon levels. The Phyloviewer implemented in IMGD provides three methods for drawing phylogenetic trees and displays the resulting trees on the web. The SNP database incorporated to IMGD presents the distribution of SNPs and INDELs in the mitochondrial genomes of multiple isolates within eight species. A newly developed comparative SNU Genome Browser supports the graphical presentation and interactive interface for the identified SNPs/INDELs. Conclusion The IMGD provides a solid foundation for the comparative mitochondrial genomics and phylogenetics of insects. All data and functions described here are available at the web site . PMID:19351385
Dissection of a nuclear localization signal.
Hodel, M R; Corbett, A H; Hodel, A E
2001-01-12
The regulated process of protein import into the nucleus of a eukaryotic cell is mediated by specific nuclear localization signals (NLSs) that are recognized by protein import receptors. This study seeks to decipher the energetic details of NLS recognition by the receptor importin alpha through quantitative analysis of variant NLSs. The relative importance of each residue in two monopartite NLS sequences was determined using an alanine scanning approach. These measurements yield an energetic definition of a monopartite NLS sequence where a required lysine residue is followed by two other basic residues in the sequence K(K/R)X(K/R). In addition, the energetic contributions of the second basic cluster in a bipartite NLS ( approximately 3 kcal/mol) as well as the energy of inhibition of the importin alpha importin beta-binding domain ( approximately 3 kcal/mol) were also measured. These data allow the generation of an energetic scale of nuclear localization sequences based on a peptide's affinity for the importin alpha-importin beta complex. On this scale, a functional NLS has a binding constant of approximately 10 nm, whereas a nonfunctional NLS has a 100-fold weaker affinity of 1 microm. Further correlation between the current in vitro data and in vivo function will provide the foundation for a comprehensive quantitative model of protein import.
Single-Molecule Denaturation Mapping of Genomic DNA in Nanofluidic Channels
NASA Astrophysics Data System (ADS)
Reisner, Walter; Larsen, Niels; Kristensen, Anders; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik
2009-03-01
We have developed a new DNA barcoding technique based on the partial denaturation of extended fluorescently labeled DNA molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions and can be explained by calculations of sequence-dependent melting probability. We believe this melting mapping technology is the first optically based single molecule technique sensitive to genome wide sequence variation that does not require an additional enzymatic labeling or restriction scheme.
Bastien, Olivier; Maréchal, Eric
2008-08-07
Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2) following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory) is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure). Homologous sequences were considered as systems 1) having a high redundancy of information reflected by the magnitude of their alignment scores, 2) which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a constant rate, corresponding to the information hazard rate, and that pairwise sequence alignment scores should follow a Gumbel distribution, which parameters could find some theoretical rationale. In particular, one parameter corresponds to the information hazard rate. Extreme value distribution of alignment scores, assessed from high scoring segments pairs following the Karlin-Altschul model, can also be deduced from the Reliability Theory applied to molecular sequences. It reflects the redundancy of information between homologous sequences, under functional conservative pressure. This model also provides a link between concepts of biological sequence analysis and of systems biology.
NASA Astrophysics Data System (ADS)
Park, Byeolteo; Myung, Hyun
2014-12-01
With the development of unconventional gas, the technology of directional drilling has become more advanced. Underground localization is the key technique of directional drilling for real-time path following and system control. However, there are problems such as vibration, disconnection with external infrastructure, and magnetic field distortion. Conventional methods cannot solve these problems in real time or in various environments. In this paper, a novel underground localization algorithm using a re-measurement of the sequence of the magnetic field and pose graph SLAM (simultaneous localization and mapping) is introduced. The proposed algorithm exploits the property of the drilling system that the body passes through the previous pass. By comparing the recorded measurement from one magnetic sensor and the current re-measurement from another magnetic sensor, the proposed algorithm predicts the pose of the drilling system. The performance of the algorithm is validated through simulations and experiments.
Salinas-Gamboa, Rigel; Johnson, Susan D; Sánchez-León, Nidia; Koltunow, Anna M G; Vielle-Calzada, Jean-Philippe
2016-06-01
Cowpea reproductive tools. Vigna unguiculata L. Walp. (cowpea) is recognized as a major legume food crop in Africa, but seed yields remain low in most varieties adapted to local conditions. The development of hybrid cowpea seed that could be saved after each generation, enabling significant yield increases, will require manipulation of reproductive development from a sexual to an asexual mode. To develop new technologies that could support the biotechnological manipulation of reproductive development in cowpea, we examined gametogenesis and seed formation in two transformable, African-adapted, day-length-insensitive varieties. Here, we show that these two varieties exhibit distinct morphological and phenological traits but share a common developmental sequence in terms of ovule formation and gametogenesis. We present a reproductive calendar that allows prediction of male and female gametogenesis on the basis of sporophytic parameters related to floral bud size and reproductive organ development, determining that gametogenesis occurs more rapidly in the anther than in the ovule. We also show that the mode of megagametogenesis is of the Polygonum-type and not Oenothera-type, as previously reported. Finally, we developed a whole-mount immunolocalization protocol and applied it to detect meiotic proteins in the cowpea megaspore mother cell, opening opportunities for comparing the dynamics of protein localization during male and female meiosis, as well as other reproductive events in this emerging legume model system.
Analysis of levels of support and resonance demonstrated by an elite singing teacher
NASA Astrophysics Data System (ADS)
Scherer, Ronald C.; Radhakrishnan, Nandhakumar; Poulimenos, Andreas
2003-04-01
This was a study of levels of singing expertise demonstrated by an elite operatic singer and teacher. This approach may prove advantageous because the teacher demonstrates what he thinks is important, not what the nonsinging scientist thinks should be important. Two pedagogical sequences were studied: (1) the location of support-glottis (poor), chest (better), abdomen (best); (2) locations of resonance-hard palate/straight tone (poor), mouth (better), sinus/head (best). Measures were obtained for a single frequency (196 Hz), the vowel /ae/, and for mezzo-forte loudness using the /pae pae pae/ technique. Sequence differences: The support sequence was characterized by formant frequency lowering suggestive of vocal tract lengthening. The resonance sequence was characterized by flow (AC, mean flow) and abduction increases. Sequence similarities: The best locations had the widest F2 bandwidths. The better and best locations had the largest dB difference between F2 and F3. Although acoustic power increased through the sequences, the acoustic efficiency was not a discriminating factor. Open and speed quotients were not differentiating. The flow resistance was highest and aerodynamic power the lowest for the first of each sequence. Combined data: The maximum flow declination rate correlated highly with the AC flow (r=-0.92) and SPL (r=0.901).
Hong, Soon Gyu; Cramer, Robert A; Lawrence, Christopher B; Pryor, Barry M
2005-02-01
A gene for the Alternaria major allergen, Alt a 1, was amplified from 52 species of Alternaria and related genera, and sequence information was used for phylogenetic study. Alt a 1 gene sequences evolved 3.8 times faster and contained 3.5 times more parsimony-informative sites than glyceraldehyde-3-phosphate dehydrogenase (gpd) sequences. Analyses of Alt a 1 gene and gpd exon sequences strongly supported grouping of Alternaria spp. and related taxa into several species-groups described in previous studies, especially the infectoria, alternata, porri, brassicicola, and radicina species-groups and the Embellisia group. The sonchi species-group was newly suggested in this study. Monophyly of the Nimbya group was moderately supported, and monophyly of the Ulocladium group was weakly supported. Relationships among species-groups and among closely related species of the same species-group were not fully resolved. However, higher resolution could be obtained using Alt a 1 sequences or a combined dataset than using gpd sequences alone. Despite high levels of variation in amino acid sequences, results of in silico prediction of protein secondary structure for Alt a 1 demonstrated a high degree of structural similarity for most of the species suggesting a conservation of function.
NASA Technical Reports Server (NTRS)
Koskela, P. E.; Bollman, W. E.; Freeman, J. E.; Helton, M. R.; Reichert, R. J.; Travers, E. S.; Zawacki, S. J.
1973-01-01
The activities of the following members of the Navigation Team are recorded: the Science Sequence Design Group, responsible for preparing the final science sequence designs; the Advanced Sequence Planning Group, responsible for sequence planning; and the Science Recommendation Team (SRT) representatives, responsible for conducting the necessary sequence design interfaces with the teams during the mission. The interface task included science support in both advance planning and daily operations. Science sequences designed during the mission are also discussed.
ERIC Educational Resources Information Center
Sanchez, Daniel J.; Reber, Paul J.
2012-01-01
The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the…
Sequence of mammalian fossils, including hominoid teeth, from the Bubing Basin caves, South China.
Wang, Wei; Potts, Richard; Baoyin, Yuan; Huang, Weiwen; Cheng, Hai; Edwards, R Lawrence; Ditchfield, Peter
2007-04-01
A Plio-Pleistocene to Holocene faunal sequence has been recovered from four carefully excavated caves in the Bubing Basin, adjacent to the larger Bose Basin of South China. The caves vary in elevation; we suggest that the higher caves were formed and filled with sediments prior to the lower caves. The highest deposits, which are from Mohui Cave, contain hominoid teeth and other fossilized remains of mammalian taxa most similar to late Pliocene and early Pleistocene faunas. Wuyun Cave ( approximately 50m lower in elevation than Mohui) contains a late middle Pleistocene fauna, which is supported by U-series age constraints from 350 to 200ka. Lower Pubu Cave ( approximately 23m below Wuyun) is assigned to the late Pleistocene, while the Cunkong Cave (the lowest, approximately 2m lower elevation than Lower Pubu) preserves a Holocene fauna. The four faunal assemblages indicate species-level changes in Ailuropoda, Stegodon, and Sus, the appearance of Elephas, the local disappearance of Stegodon, and the migration of Equus hemionus to South China. These initial results of our work call into question the continued value of the Stegodon/Ailuropoda Fauna, a category long used to characterize the Pleistocene faunas of South China. Excavation of karstic caves of varying elevation within the basins of South China holds promise for defining local sequences of mammalian fossils that can be used to investigate faunal variations related to climate change, biogeographic events, and evolutionary change over the past two million years. Stable isotopic analysis of a small sample of mammalian teeth from Bubing Basin caves is consistent with 100% C(3) vegetation in the Bubing/Bose region, with certain delta(13)C values consistent with a canopied woodland or forest. A preliminary assessment of the hominoid teeth indicates the presence of diverse molar and premolar morphologies including dental remains of Gigantopithecus blacki and a sample with similarities to the teeth reported from Longgupo.
De novo assembly and phasing of a Korean human genome.
Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon
2016-10-13
Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.
Fukumori, F; Saint, C P
1997-01-01
A 9,233-bp HindIII fragment of the aromatic amine catabolic plasmid pTDN1, isolated from a derivative of Pseudomonas putida mt-2 (UCC22), confers the ability to degrade aniline on P. putida KT2442. The fragment encodes six open reading frames which are arranged in the same direction. Their 5' upstream region is part of the direct-repeat sequence of pTDN1. Nucleotide sequence of 1.8 kb of the repeat sequence revealed only a single base pair change compared to the known sequence of IS1071 which is involved in the transposition of the chlorobenzoate genes (C. Nakatsu, J. Ng, R. Singh, N. Straus, and C. Wyndham, Proc. Natl. Acad. Sci. USA 88:8312-8316, 1991). Four open reading frames encode proteins with considerable homology to proteins found in other aromatic-compound degradation pathways. On the basis of sequence similarity, these genes are proposed to encode the large and small subunits of aniline oxygenase (tdnA1 and tdnA2, respectively), a reductase (tdnB), and a LysR-type regulatory gene (tdnR). The putative large subunit has a conserved [2Fe-2S]R Rieske-type ligand center. Two genes, tdnQ and tdnT, which may be involved in amino group transfer, are localized upstream of the putative oxygenase genes. The tdnQ gene product shares about 30% similarity with glutamine synthetases; however, a pUC-based plasmid carrying tdnQ did not support the growth of an Escherichia coli glnA strain in the absence of glutamine. TdnT possesses domains that are conserved among amidotransferases. The tdnQ, tdnA1, tdnA2, tdnB, and tdnR genes are essential for the conversion of aniline to catechol. PMID:8990291
Geographic mosaic of symbiont selectivity in a genus of epiphytic cyanolichens
Fedrowitz, Katja; Kaasalainen, Ulla; Rikkinen, Jouko
2012-01-01
In symbiotic systems, patterns of symbiont diversity and selectivity are crucial for the understanding of fundamental ecological processes such as dispersal and establishment. The lichen genus Nephroma (Peltigerales, Ascomycota) has a nearly cosmopolitan distribution and is thus an attractive model for the study of symbiotic interactions over a wide range of spatial scales. In this study, we analyze the genetic diversity of Nephroma mycobionts and their associated Nostoc photobionts within a global framework. The study is based on Internal Transcribed Spacer (ITS) sequences of fungal symbionts and tRNALeu (UAA) intron sequences of cyanobacterial symbionts. The full data set includes 271 Nephroma and 358 Nostoc sequences, with over 150 sequence pairs known to originate from the same lichen thalli. Our results show that all bipartite Nephroma species associate with one group of Nostoc different from Nostoc typically found in tripartite Nephroma species. This conserved association appears to have been inherited from the common ancestor of all extant species. While specific associations between some symbiont genotypes can be observed over vast distances, both symbionts tend to show genetic differentiation over wide geographic scales. Most bipartite Nephroma species share their Nostoc symbionts with one or more other fungal taxa, and no fungal species associates solely with a single Nostoc genotype, supporting the concept of functional lichen guilds. Symbiont selectivity patterns within these lichens are best described as a geographic mosaic, with higher selectivity locally than globally. This may reflect specific habitat preferences of particular symbiont combinations, but also the influence of founder effects. PMID:23139887
Bose, Tanay; Reynolds, Don R; Berbee, Mary L
2014-01-01
Sooty molds (Capnodiaceae) are saprotrophs on the surfaces of leaves, and they take their nutrients from honeydew exuded by sap-sucking insects. We describe and illustrate the sooty mold Fumiglobus pieridicola sp. nov., which, to the dismay of gardeners, forms a thick black mycelial coating on the leaves and twigs of ornamental Japanese andromeda (Pieris japonica) in western North America. As a mitosporic species with a pycnidium that lacks an elongated neck and has at most a rudimentary stalk, the species belongs in the genus Fumiglobus. Although locally common, we found no specimens identified under Fumiglobus or its synonyms in regional herbaria and no record of any similar fungus in host indices. Our species differs from others in Fumiglobus in having smaller pycnidia and conidia and in having intercalary as well as apical pycnidia. We determined partial 18S and 28S ribosomal gene sequences for F. pieridicola, the first for any Fumiglobus species. Sequence analysis provides strong bootstrap support for including Fumiglobus within Capnodiaceae. We also determined 18S and 28S sequences for the type species of the mitosporic genus Conidiocarpus, also in Capnodiaceae. We confirm that Conidiocarpus is the anamorph of Phragmocapnias. Following the rules of nomenclatural priority, we propose the new combinations Conidiocarpus asiaticus, Conidiocarpus betle, Conidiocarpus callitris, Conidiocarpus fuliginodes, Conidiocarpus heliconiae, Conidiocarpus imperspicuus and Conidiocarpus siamensis. We hope that describing the mystery fungus from our region and providing sequences for its molecular identification will lead to new studies on its biology and distribution. © 2014 by The Mycological Society of America.
Petrography of the Paleogene Volcanic Rocks of the Sierra Maestra, Southeastern Cuba
NASA Astrophysics Data System (ADS)
Bemis, V. L.
2006-12-01
This study is a petrographic analysis of over 200 specimens of the Paleogene volcanic rocks of the Sierra Maestra (Southerneastern Cuba), a key structure in the framework of the northern Caribbean plate boundary evolution. The purpose of this study is to understand the eruptive processes and the depositional environments. The volcanic sequence in the lower part of the Sierra Maestra begins with highly porphyritic pillow lavas, topped by massive tuffs and autoclastic flows. The presence of broken phenocrystals, palagonitic glass and hyaloclastites in this section of the sequence suggests that the prevalent mode of eruption was explosive. The absence of welding in the tuffs suggests that the rocks were emplaced in a deep submarine environment. Coherent flows, much less common than the massive tuffs, show evidence of autoclastic fracturing, also indicating low temperature-submarine environments. These observations support the hypothesis that the Sierra Maestra sequence may be neither part of the Great Antilles Arc of the Mesozoic nor any other fully developed volcanic arc, rather a 250 km long, submarine eruptive system of dikes, flows and sills, most likely a back-arc structure. The volcanic rocks of the upper sequence are all very fine grained, reworked volcaniclastic materials, often with the structures of distal turbidities, in mode and texture similar to those drilled on the Cayman Rise. This study suggests that the Sierra Maestra most likely records volcanism of diverse sources: a local older submarine source, and one or more distal younger sources, identifiable with the pan-Caribbean volcanic events of the Tertiary.
NASA Astrophysics Data System (ADS)
Daneshian, Jahanbakhsh; Ramezani Dana, Leila; Sadler, Peter
2017-01-01
Benthic foraminifera species commonly outnumber planktic species in the type area of the Lower Miocene Qom Formation, in north central Iran, where it records the Tethyan link between the eastern Mediterranean and Indo- Pacific provinces. Because measured sections preserve very different sequences of first and last occurrences of these species, no single section provides a completely suitable baseline for correlation. To resolve this problem, we combined bioevents from three stratigraphic sections into a single composite sequence by constrained optimization (CONOP). The composite section arranges the first and last appearance events (FAD and LAD) of 242 foraminifera in an optimal order that minimizes the implied diachronism between sections. The composite stratigraphic ranges of the planktic foraminifera support a practical biozonation which reveals substantial local changes of accumulation rate during Aquitanian to Burdigalian times. Traditional biozone boundaries emerge little changed but an order of magnitude more correlations can be interpolated. The top of the section at Dobaradar is younger than previously thought and younger than sections at Dochah and Tigheh Reza-Abad. The latter two sections probably extend older into the Aquitanian than the Dobaradar section, but likely include a hiatus near the base of the Burdigalian. The bounding contacts with the Upper Red and Lower Red Formations are shown to be diachronous.
Seismic tomography of the area of the 2010 Beni-Ilmane earthquake sequence, north-central Algeria.
Abacha, Issam; Koulakov, Ivan; Semmane, Fethi; Yelles-Chaouche, Abd Karim
2014-01-01
The region of Beni-Ilmane (District of M'sila, north-central Algeria) was the site of an earthquake sequence that started on 14 May 2010. This sequence, which lasted several months, was triggered by conjugate E-W reverse and N-S dextral faulting. To image the crustal structure of these active faults, we used a set of 1406 well located aftershocks events and applied the local tomography software (LOTOS) algorithm, which includes absolute source location, optimization of the initial 1D velocity model, and iterative tomographic inversion for 3D seismic P- and S-wave velocities (and the Vp/Vs ratio), and source parameters. The patterns of P-wave low-velocity anomalies correspond to the alignments of faults determined from geological evidence, and the P-wave high-velocity anomalies may represent rigid blocks of the upper crust that are not deformed by regional stresses. The S-wave low-velocity anomalies coincide with the aftershock area, where relatively high values of Vp/Vs ratio (1.78) are observed compared with values in the surrounding areas (1.62-1.66). These high values may indicate high fluid contents in the aftershock area. These fluids could have been released from deeper levels by fault movements during earthquakes and migrated rapidly upwards. This hypothesis is supported by vertical sections across the study area show that the major Vp/Vs anomalies are located above the seismicity clusters.
Brain processing of meter and rhythm in music. Electrophysiological evidence of a common network.
Kuck, Heleln; Grossbach, Michael; Bangert, Marc; Altenmüller, Eckart
2003-11-01
To determine cortical structures involved in "global" meter and "local" rhythm processing, slow brain potentials (DC potentials) were recorded from the scalp of 18 musically trained subjects while listening to pairs of monophonic sequences with both metric structure and rhythmic variations. The second sequence could be either identical to or different from the first one. Differences were either of a metric or a rhythmic nature. The subjects' task was to judge whether the sequences were identical or not. During processing of the auditory tasks, brain activation patterns along with the subjects' performance were assessed using 32-channel DC electroencephalography. Data were statistically analyzed using MANOVA. Processing of both meter and rhythm produced sustained cortical activation over bilateral frontal and temporal brain regions. A shift towards right hemispheric activation was pronounced during presentation of the second stimulus. Processing of rhythmic differences yielded a more centroparietal activation compared to metric processing. These results do not support Lerdhal and Jackendoff's two-component model, predicting a dissociation of left hemispheric rhythm and right hemispheric meter processing. We suggest that the uniform right temporofrontal predominance reflects auditory working memory and a pattern recognition module, which participates in both rhythm and meter processing. More pronounced parietal activation during rhythm processing may be related to switching of task-solving strategies towards mental imagination of the score.
Mehta, Sanjay R.; Murrell, Ben; Anderson, Christy M.; Kosakovsky Pond, Sergei L.; Wertheim, Joel O.; Young, Jason A.; Freitas, Lorri; Richman, Douglas D.; Mathews, W. Chris; Scheffler, Konrad; Little, Susan J.; Smith, Davey M.
2016-01-01
Background. Because recently infected individuals disproportionately contribute to the spread of human immunodeficiency virus (HIV), we evaluated the impact of a primary HIV screening program (the Early Test) implemented in San Diego. Methods. The Early Test program used combined nucleic acid and serology testing to screen for primary infection targeting local high-risk individuals. Epidemiologic, HIV sequence, and geographic data were obtained from the San Diego County Department of Public Health and the Early Test program. Poisson regression analysis was performed to determine whether the Early Test program was temporally and geographically associated with changes in incident HIV diagnoses. Transmission chains were inferred by phylogenetic analysis of sequence data. Results. Over time, a decrease in incident HIV diagnoses was observed proportional to the number primary HIV infections diagnosed in each San Diego region (P < .001). Molecular network analyses also showed that transmission chains were more likely to terminate in regions where the program was marketed (P = .002). Although, individuals in these zip codes had infection diagnosed earlier (P = .08), they were not treated earlier (P = .83). Conclusions. These findings suggests that early HIV diagnoses by this primary infection screening program probably contributed to the observed decrease in new HIV diagnoses in San Diego, and they support the expansion and evaluation of similar programs. PMID:27174704
Ferri, Gianmarco; Alù, Milena; Corradini, Beatrice; Beduschi, Giovanni
2009-09-01
Forensic botany can provide significant supporting evidence during criminal investigations. However, it is still an underutilized field of investigation with its most common application limited to identifying specific as well as suspected illegal plants. The ubiquitous presence of plant species can be useful in forensics, but the absence of an accurate identification system remains the major obstacle to the present inability to routinely and correctly identify trace botanical evidence. Many plant materials cannot be identified and differentiated to the species level by traditional morphological characteristics when botanical specimens are degraded and lack physical features. By taking advantage of a universal barcode system, DNA sequencing, and other biomolecular techniques used routinely in forensic investigations, two chloroplast DNA regions were evaluated for their use as "barcoding" markers for plant identification in the field of forensics. We therefore investigated the forensic use of two non-coding plastid regions, psbA-trnH and trnL-trnF, to create a multimarker system for species identification that could be useful throughout the plant kingdom. The sequences from 63 plants belonging to our local flora were submitted and registered on the GenBank database. Sequence comparison to set up the level of identification (species, genus, or family) through Blast algorithms allowed us to assess the suitability of this method. The results confirmed the effectiveness of our botanic universal multimarker assay in forensic investigations.
Hsu, Shu-Hui; Cao, Yue; Lawrence, Theodore S.; Tsien, Christina; Feng, Mary; Grodzki, David M.; Balter, James M.
2015-01-01
Accurate separation of air and bone is critical for creating synthetic CT from MRI to support Radiation Oncology workflow. This study compares two different ultrashort echo-time sequences in the separation of air from bone, and evaluates post-processing methods that correct intensity nonuniformity of images and account for intensity gradients at tissue boundaries to improve this discriminatory power. CT and MRI scans were acquired on 12 patients under an institution review board-approved prospective protocol. The two MRI sequences tested were ultra-short TE imaging using 3D radial acquisition (UTE), and using pointwise encoding time reduction with radial acquisition (PETRA). Gradient nonlinearity correction was applied to both MR image volumes after acquisition. MRI intensity nonuniformity was corrected by vendor-provided normalization methods, and then further corrected using the N4itk algorithm. To overcome the intensity-gradient at air-tissue boundaries, spatial dilations, from 0 to 4 mm, were applied to threshold-defined air regions from MR images. Receiver operating characteristic (ROC) analyses, by comparing predicted (defined by MR images) versus “true” regions of air and bone (defined by CT images), were performed with and without residual bias field correction and local spatial expansion. The post-processing corrections increased the areas under the ROC curves (AUC) from 0.944 ± 0.012 to 0.976 ± 0.003 for UTE images, and from 0.850 ± 0.022 to 0.887 ± 0.012 for PETRA images, compared to without corrections. When expanding the threshold-defined air volumes, as expected, sensitivity of air identification decreased with an increase in specificity of bone discrimination, but in a non-linear fashion. A 1-mm air mask expansion yielded AUC increases of 1% and 4% for UTE and PETRA images, respectively. UTE images had significantly greater discriminatory power in separating air from bone than PETRA images. Post-processing strategies improved the discriminatory power of air from bone for both UTE and PETRA images, and reduced the difference between the two imaging sequences. Both postprocessed UTE and PETRA images demonstrated sufficient power to discriminate air from bone to support synthetic CT generation from MRI data. PMID:25776205
Kaltimbacher, Valérie; Bonnet, Crystel; Lecoeuvre, Gaëlle; Forster, Valérie; Sahel, José-Alain; Corral-Debrinski, Marisol
2006-01-01
As previously established in yeast, two sequences within mRNAs are responsible for their specific localization to the mitochondrial surface—the region coding for the mitochondrial targeting sequence and the 3′UTR. This phenomenon is conserved in human cells. Therefore, we decided to use mRNA localization as a tool to address to mitochondria, a protein that is not normally imported. For this purpose, we associated a nuclear recoded ATP6 gene with the mitochondrial targeting sequence and the 3′UTR of the nuclear SOD2 gene, which mRNA exclusively localizes to the mitochondrial surface in HeLa cells. The ATP6 gene is naturally located into the organelle and encodes a highly hydrophobic protein of the respiratory chain complex V. In this study, we demonstrated that hybrid ATP6 mRNAs, as the endogenous SOD2 mRNA, localize to the mitochondrial surface in human cells. Remarkably, fusion proteins localize to mitochondria in vivo. Indeed, ATP6 precursors synthesized in the cytoplasm were imported into mitochondria in a highly efficient way, especially when both the MTS and the 3′UTR of the SOD2 gene were associated with the re-engineered ATP6 gene. Hence, these data indicate that mRNA targeting to the mitochondrial surface represents an attractive strategy for allowing the mitochondrial import of proteins originally encoded by the mitochondrial genome without any amino acid change in the protein that could interfere with its biologic activity. PMID:16751614
Guzzi, Pietro Hiram; Milenkovic, Tijana
2018-05-01
Analogous to genomic sequence alignment that allows for across-species transfer of biological knowledge between conserved sequence regions, biological network alignment can be used to guide the knowledge transfer between conserved regions of molecular networks of different species. Hence, biological network alignment can be used to redefine the traditional notion of a sequence-based homology to a new notion of network-based homology. Analogous to genomic sequence alignment, there exist local and global biological network alignments. Here, we survey prominent and recent computational approaches of each network alignment type and discuss their (dis)advantages. Then, as it was recently shown that the two approach types are complementary, in the sense that they capture different slices of cellular functioning, we discuss the need to reconcile the two network alignment types and present a recent first step in this direction. We conclude with some open research problems on this topic and comment on the usefulness of network alignment in other domains besides computational biology.
NASA Astrophysics Data System (ADS)
Yasuda, Jun; Yoshizawa, Shin; Umemura, Shin-ichiro
2016-07-01
Sonodynamic treatment is a method of treating cancer using reactive oxygen species (ROS) generated by cavitation bubbles in collaboration with a sonosensitizer at a target tissue. In this treatment method, both localized ROS generation and ROS generation with high efficiency are important. In this study, a triggered high-intensity focused ultrasound (HIFU) sequence, which consists of a short, extremely high intensity pulse immediately followed by a long, moderate-intensity burst, was employed for the efficient generation of ROS. In experiments, a solution sealed in a chamber was exposed to a triggered HIFU sequence. Then, the distribution of generated ROS was observed by the luminol reaction, and the amount of generated ROS was quantified using KI method. As a result, the localized ROS generation was demonstrated by light emission from the luminol reaction. Moreover, it was demonstrated that the triggered HIFU sequence has higher efficiency of ROS generation by both the KI method and the luminol reaction emission.
Entropic fluctuations in DNA sequences
NASA Astrophysics Data System (ADS)
Thanos, Dimitrios; Li, Wentian; Provata, Astero
2018-03-01
The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows to extract meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.
Hong, Changjin; Tewfik, Ahmed H
2009-01-01
Recomputation of the previously evaluated similarity results between biological sequences becomes inevitable when researchers realize errors in their sequenced data or when the researchers have to compare nearly similar sequences, e.g., in a family of proteins. We present an efficient scheme for updating local sequence alignments with an affine gap model. In principle, using the previous matching result between two amino acid sequences, we perform a forward-backward alignment to generate heuristic searching bands which are bounded by a set of suboptimal paths. Given a correctly updated sequence, we initially predict a new score of the alignment path for each contour to select the best candidates among them. Then, we run the Smith-Waterman algorithm in this confined space. Furthermore, our heuristic alignment for an updated sequence shows that it can be further accelerated by using reusable dynamic programming (rDP), our prior work. In this study, we successfully validate "relative node tolerance bound" (RNTB) in the pruned searching space. Furthermore, we improve the computational performance by quantifying the successful RNTB tolerance probability and switch to rDP on perturbation-resilient columns only. In our searching space derived by a threshold value of 90 percent of the optimal alignment score, we find that 98.3 percent of contours contain correctly updated paths. We also find that our method consumes only 25.36 percent of the runtime cost of sparse dynamic programming (sDP) method, and to only 2.55 percent of that of a normal dynamic programming with the Smith-Waterman algorithm.
Local contextual processing of abstract and meaningful real-life images in professional athletes.
Fogelson, Noa; Fernandez-Del-Olmo, Miguel; Acero, Rafael Martín
2012-05-01
We investigated the effect of abstract versus real-life meaningful images from sports on local contextual processing in two groups of professional athletes. Local context was defined as the occurrence of a short predictive series of stimuli occurring before delivery of a target event. EEG was recorded in 10 professional basketball players and 9 professional athletes of individual sports during three sessions. In each session, a different set of visual stimuli were presented: triangles facing left, up, right, or down; four images of a basketball player throwing a ball; four images of a baseball player pitching a baseball. Stimuli consisted of 15 % targets and 85 % of equal numbers of three types of standards. Recording blocks consisted of targets preceded by randomized sequences of standards and by sequences including a predictive sequence signaling the occurrence of a subsequent target event. Subjects pressed a button in response to targets. In all three sessions, reaction times and peak P3b latencies were shorter for predicted targets compared with random targets, the last most informative stimulus of the predictive sequence induced a robust P3b, and N2 amplitude was larger for random targets compared with predicted targets. P3b and N2 peak amplitudes were larger in the professional basketball group in comparison with professional athletes of individual sports, across the three sessions. The findings of this study suggest that local contextual information is processed similarly for abstract and for meaningful images and that professional basketball players seem to allocate more attentional resources in the processing of these visual stimuli.
Scene-based nonuniformity correction using local constant statistics.
Zhang, Chao; Zhao, Wenyi
2008-06-01
In scene-based nonuniformity correction, the statistical approach assumes all possible values of the true-scene pixel are seen at each pixel location. This global-constant-statistics assumption does not distinguish fixed pattern noise from spatial variations in the average image. This often causes the "ghosting" artifacts in the corrected images since the existing spatial variations are treated as noises. We introduce a new statistical method to reduce the ghosting artifacts. Our method proposes a local-constant statistics that assumes that the temporal signal distribution is not constant at each pixel but is locally true. This considers statistically a constant distribution in a local region around each pixel but uneven distribution in a larger scale. Under the assumption that the fixed pattern noise concentrates in a higher spatial-frequency domain than the distribution variation, we apply a wavelet method to the gain and offset image of the noise and separate out the pattern noise from the spatial variations in the temporal distribution of the scene. We compare the results to the global-constant-statistics method using a clean sequence with large artificial pattern noises. We also apply the method to a challenging CCD video sequence and a LWIR sequence to show how effective it is in reducing noise and the ghosting artifacts.
de Melo, Ivan S.; Jimenez-Nuñez, Maria D.; Iglesias, Concepción; Campos-Caro, Antonio; Moreno-Sanchez, David; Ruiz, Felix A.; Bolívar, Jorge
2013-01-01
NOA36/ZNF330 is an evolutionarily well-preserved protein present in the nucleolus and mitochondria of mammalian cells. We have previously reported that the pro-apoptotic activity of this protein is mediated by a characteristic cysteine-rich domain. We now demonstrate that the nucleolar localization of NOA36 is due to a highly-conserved nucleolar localization signal (NoLS) present in residues 1–33. This NoLS is a sequence containing three clusters of two or three basic amino acids. We fused the amino terminal of NOA36 to eGFP in order to characterize this putative NoLS. We show that a cluster of three lysine residues at positions 3 to 5 within this sequence is critical for the nucleolar localization. We also demonstrate that the sequence as found in human is capable of directing eGFP to the nucleolus in several mammal, fish and insect cells. Moreover, this NoLS is capable of specifically directing the cytosolic yeast enzyme polyphosphatase to the target of the nucleolus of HeLa cells, wherein its enzymatic activity was detected. This NoLS could therefore serve as a very useful tool as a nucleolar marker and for directing particular proteins to the nucleolus in distant animal species. PMID:23516598
Xu, Kai; Nagy, Peter D
2017-04-01
Membranous structures derived from various organelles are important for replication of plus-stranded RNA viruses. Although the important roles of co-opted host proteins in RNA virus replication have been appreciated for a decade, the equally important functions of cellular lipids in virus replication have been gaining full attention only recently. Previous work with Tomato bushy stunt tombusvirus (TBSV) in model host yeast has revealed essential roles for phosphatidylethanolamine and sterols in viral replication. To further our understanding of the role of sterols in tombusvirus replication, in this work we showed that the TBSV p33 and p92 replication proteins could bind to sterols in vitro The sterol binding by p33 is supported by cholesterol recognition/interaction amino acid consensus (CRAC) and CARC-like sequences within the two transmembrane domains of p33. Mutagenesis of the critical Y amino acids within the CRAC and CARC sequences blocked TBSV replication in yeast and plant cells. We also showed the enrichment of sterols in the detergent-resistant membrane (DRM) fractions obtained from yeast and plant cells replicating TBSV. The DRMs could support viral RNA synthesis on both the endogenous and exogenous templates. A lipidomic approach showed the lack of enhancement of sterol levels in yeast and plant cells replicating TBSV. The data support the notion that the TBSV replication proteins are associated with sterol-rich detergent-resistant membranes in yeast and plant cells. Together, the results obtained in this study and the previously published results support the local enrichment of sterols around the viral replication proteins that is critical for TBSV replication. IMPORTANCE One intriguing aspect of viral infections is their dependence on efficient subcellular assembly platforms serving replication, virion assembly, or virus egress via budding out of infected cells. These assembly platforms might involve sterol-rich membrane microdomains, which are heterogeneous and highly dynamic nanoscale structures usurped by various viruses. Here, we demonstrate that TBSV p33 and p92 replication proteins can bind to sterol in vitro Mutagenesis analysis of p33 within the CRAC and CARC sequences involved in sterol binding shows the important connection between the abilities of p33 to bind to sterol and to support TBSV replication in yeast and plant cells. Together, the results further strengthen the model that cellular sterols are essential as proviral lipids during viral replication. Copyright © 2017 American Society for Microbiology.
Coetzer, Willem G.; Downs, Colleen T.; Perrin, Mike R.; Willows-Munro, Sandi
2015-01-01
The taxonomic position of the Cape Parrot (Poicephalus robustus robustus) has been the focus of much debate. A number of authors suggest that the Cape Parrot should be viewed as a distinct species separate from the other two P. robustus subspecies (P. r. fuscicollis and P. r. suahelicus). These recommendations were based on morphological, ecological, and behavioural assessments. In this study we investigated the validity of these recommendations using multilocus DNA analyses. We genotyped 138 specimens from five Poicephalus species (P. cryptoxanthus, P. gulielmi, P. meyeri, P. robustus, and P. rueppellii) using 11 microsatellite loci. Additionally, two mitochondrial (cytochrome oxidase I gene and 16S ribosomal RNA) and one nuclear intron (intron 7 of the β-fibrinogen gene) markers were amplified and sequenced. Bayesian clustering analysis and pairwise FST analysis of microsatellite data identified P. r. robustus as genetically distinct from the other P. robustus subspecies. Phylogenetic and molecular clock analyses on sequence data also supported the microsatellite analyses, placing P. r. robustus in a distinct clade separate from the other P. robustus subspecies. Molecular clock analysis places the most recent common ancestor between P. r. robustus and P. r. fuscicollis / P. r. suahelicus at 2.13 to 2.67 million years ago. Our results all support previous recommendations to elevate the Cape Parrot to species level. This will facilitate better planning and implementation of international and local conservation management strategies for the Cape Parrot. PMID:26267261
Saranathan, Vinodkumar; Hamilton, Deborah; Powell, George V N; Kroodsma, Donald E; Prum, Richard O
2007-09-01
Vocal learning is thought to have evolved in three clades of birds (parrots, hummingbirds, and oscine passerines), and three clades of mammals (whales, bats, and primates). Behavioural data indicate that, unlike other suboscine passerines, the three-wattled bellbird Procnias tricarunculata (Cotingidae) is capable of vocal learning. Procnias tricarunculata shows conspicuous vocal ontogeny, striking geographical variation in song, and rapid temporal change in song within a population. Deprivation studies of vocal development in P. tricarunculata are impractical. Here, we report evidence from mitochondrial DNA sequences and nuclear microsatellite loci that genetic variation within and among the four allopatric breeding populations of P. tricarunculata is not congruent with variation in vocal behaviour. Sequences of the mitochondrial DNA control region document extensive haplotype sharing among localities and song types, and no phylogenetic resolution of geographical populations or behavioural groups. The vocally differentiated, allopatric breeding populations of P. tricarunculata are only weakly genetically differentiated populations, and are not distinct taxa. Mitochondrial DNA and microsatellite variation show small (2.9% and 13.5%, respectively) but significant correlation with geographical distance, but no significant residual variation by song type. Estimates of the strength of selection that would be needed to maintain the observed geographical pattern in vocal differentiation if songs were genetically based are unreasonably high, further discrediting the hypothesis of a genetic origin of vocal variation. These data support a fourth, phylogenetically independent origin of avian vocal learning in Procnias. Geographical variations in P. tricarunculata vocal behaviour are likely culturally evolved dialects.
Zhang, Yufan; Maximova, Siela N; Guiltinan, Mark J
2015-01-01
In plants, the conversion of stearoyl-ACP to oleoyol-ACP is catalyzed by a plastid-localized soluble stearoyl-acyl carrier protein (ACP) desaturase (SAD). The activity of SAD significantly impacts the ratio of saturated and unsaturated fatty acids, and is thus a major determinant of fatty acid composition. The cacao genome contains eight putative SAD isoforms with high amino acid sequence similarities and functional domain conservation with SAD genes from other species. Sequence variation in known functional domains between different SAD family members suggested that these eight SAD isoforms might have distinct functions in plant development, a hypothesis supported by their diverse expression patterns in various cacao tissues. Notably, TcSAD1 is universally expressed across all the tissues, and its expression pattern in seeds is highly correlated with the dramatic change in fatty acid composition during seed maturation. Interestingly, TcSAD3 and TcSAD4 appear to be exclusively and highly expressed in flowers, functions of which remain unknown. To test the function of TcSAD1 in vivo, transgenic complementation of the Arabidopsis ssi2 mutant was performed, demonstrating that TcSAD1 successfully rescued all AtSSI2 related phenotypes further supporting the functional orthology between these two genes. The identification of the major SAD gene responsible for cocoa butter biosynthesis provides new strategies for screening for novel genotypes with desirable fatty acid compositions, and for use in breeding programs to help pyramid genes for quality and other traits such as disease resistance.
Ezra, Elishai; Maor, Idan; Bavli, Danny; Shalom, Itai; Levy, Gahl; Prill, Sebastian; Jaeger, Magnus S; Nahmias, Yaakov
2015-08-01
Microfluidic applications range from combinatorial synthesis to high throughput screening, with platforms integrating analog perfusion components, digitally controlled micro-valves and a range of sensors that demand a variety of communication protocols. Currently, discrete control units are used to regulate and monitor each component, resulting in scattered control interfaces that limit data integration and synchronization. Here, we present a microprocessor-based control unit, utilizing the MS Gadgeteer open framework that integrates all aspects of microfluidics through a high-current electronic circuit that supports and synchronizes digital and analog signals for perfusion components, pressure elements, and arbitrary sensor communication protocols using a plug-and-play interface. The control unit supports an integrated touch screen and TCP/IP interface that provides local and remote control of flow and data acquisition. To establish the ability of our control unit to integrate and synchronize complex microfluidic circuits we developed an equi-pressure combinatorial mixer. We demonstrate the generation of complex perfusion sequences, allowing the automated sampling, washing, and calibrating of an electrochemical lactate sensor continuously monitoring hepatocyte viability following exposure to the pesticide rotenone. Importantly, integration of an optical sensor allowed us to implement automated optimization protocols that require different computational challenges including: prioritized data structures in a genetic algorithm, distributed computational efforts in multiple-hill climbing searches and real-time realization of probabilistic models in simulated annealing. Our system offers a comprehensive solution for establishing optimization protocols and perfusion sequences in complex microfluidic circuits.
Banno, Masaki; Komiyama, Yusuke; Cao, Wei; Oku, Yuya; Ueki, Kokoro; Sumikoshi, Kazuya; Nakamura, Shugo; Terada, Tohru; Shimizu, Kentaro
2017-02-01
Several methods have been proposed for protein-sugar binding site prediction using machine learning algorithms. However, they are not effective to learn various properties of binding site residues caused by various interactions between proteins and sugars. In this study, we classified sugars into acidic and nonacidic sugars and showed that their binding sites have different amino acid occurrence frequencies. By using this result, we developed sugar-binding residue predictors dedicated to the two classes of sugars: an acid sugar binding predictor and a nonacidic sugar binding predictor. We also developed a combination predictor which combines the results of the two predictors. We showed that when a sugar is known to be an acidic sugar, the acidic sugar binding predictor achieves the best performance, and showed that when a sugar is known to be a nonacidic sugar or is not known to be either of the two classes, the combination predictor achieves the best performance. Our method uses only amino acid sequences for prediction. Support vector machine was used as a machine learning algorithm and the position-specific scoring matrix created by the position-specific iterative basic local alignment search tool was used as the feature vector. We evaluated the performance of the predictors using five-fold cross-validation. We have launched our system, as an open source freeware tool on the GitHub repository (https://doi.org/10.5281/zenodo.61513). Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
WebLogo: A Sequence Logo Generator
Crooks, Gavin E.; Hon, Gary; Chandonia, John-Marc; Brenner, Steven E.
2004-01-01
WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. PMID:15173120
2013-01-01
Background The arylamine N-acetyltransferases (NATs) are a unique family of enzymes widely distributed in nature that play a crucial role in the detoxification of aromatic amine xenobiotics. Considering the temporal changes in the levels and toxicity of environmentally available chemicals, the metabolic function of NATs is likely to be under adaptive evolution to broaden or change substrate specificity over time, making NATs a promising subject for evolutionary analyses. In this study, we trace the molecular evolutionary history of the NAT gene family during the last ~450 million years of vertebrate evolution and define the likely role of gene duplication, gene conversion and positive selection in the evolutionary dynamics of this family. Results A phylogenetic analysis of 77 NAT sequences from 38 vertebrate species retrieved from public genomic databases shows that NATs are phylogenetically unstable genes, characterized by frequent gene duplications and losses even among closely related species, and that concerted evolution only played a minor role in the patterns of sequence divergence. Local signals of positive selection are detected in several lineages, probably reflecting response to changes in xenobiotic exposure. We then put a special emphasis on the study of the last ~85 million years of primate NAT evolution by determining the NAT homologous sequences in 13 additional primate species. Our phylogenetic analysis supports the view that the three human NAT genes emerged from a first duplication event in the common ancestor of Simiiformes, yielding NAT1 and an ancestral NAT gene which in turn, duplicated in the common ancestor of Catarrhini, giving rise to NAT2 and the NATP pseudogene. Our analysis suggests a main role of purifying selection in NAT1 protein evolution, whereas NAT2 was predicted to mostly evolve under positive selection to change its amino acid sequence over time. These findings are consistent with a differential role of the two human isoenzymes and support the involvement of NAT1 in endogenous metabolic pathways. Conclusions This study provides unequivocal evidence that the NAT gene family has evolved under a dynamic process of birth-and-death evolution in vertebrates, consistent with previous observations made in fungi. PMID:23497148
Mutryn, Marie F; Brannick, Erin M; Fu, Weixuan; Lee, William R; Abasht, Behnam
2015-05-21
Improvements in poultry production within the past 50 years have led to increased muscle yield and growth rate, which may be contributing to an increased rate and development of new muscle disorders in chickens. Previously reported muscle disorders and conditions are generally associated with poor meat quality traits and have a significant negative economic impact on the poultry industry. Recently, a novel myopathy phenotype has emerged which is characterized by palpably "hard" or tough breast muscle. The objective of this study is to identify the underlying biological mechanisms that contribute to this emerging muscle disorder colloquially referred to as "Wooden Breast", through the use of RNA-sequencing technology. We constructed cDNA libraries from five affected and six unaffected breast muscle samples from a line of commercial broiler chickens. After paired-end sequencing of samples using the Illumina Hiseq platform, we used Tophat to align the resulting sequence reads to the chicken reference genome and then used Cufflinks to find significant changes in gene transcript expression between each group. By comparing our gene list to previously published histology findings on this disorder and using Ingenuity Pathways Analysis (IPA®), we aim to develop a characteristic gene expression profile for this novel disorder through analyzing genes, gene families, and predicted biological pathways. Over 1500 genes were differentially expressed between affected and unaffected birds. There was an average of approximately 98 million reads per sample, across all samples. Results from the IPA analysis suggested "Diseases and Disorders" such as connective tissue disorders, "Molecular and Cellular Functions" such as cellular assembly and organization, cellular function and maintenance, and cellular movement, "Physiological System Development and Function" such as tissue development, and embryonic development, and "Top Canonical Pathways" such as, coagulation system, axonal guidance signaling, and acute phase response signaling, are associated with the Wooden Breast disease. There is convincing evidence by RNA-seq analysis to support localized hypoxia, oxidative stress, increased intracellular calcium, as well as the possible presence of muscle fiber-type switching, as key features of Wooden Breast Disease, which are supported by reported microscopic lesions of the disease.
NASA Astrophysics Data System (ADS)
Cheloni, D.; D'Agostino, N.; D'Anastasio, E.; Selvaggi, G.
2012-08-01
In this study, we revisit the mechanism of the 1976 Friuli (NE Italy) earthquake sequence (main shocks Mw 6.4, 5.9 and 6.0). We present a new source model that simultaneously fits all the available geodetic measurements of the observed deformation. We integrate triangulation measurements, which have never been previously used in the source modelling of this sequence, with high-precision levelling that covers the epicentral area. We adopt a mixed linear/non-linear optimization scheme, in which we iteratively search for the best-fitting solution by performing several linear slip inversions while varying fault location using a grid search method. Our preferred solution consists of a shallow north-dipping fault plane with assumed azimuth of 282° and accommodating a reverse dextral slip of about 1 m. The estimated geodetic moment is 6.6 × 1018 Nm (Mw 6.5), in agreement with seismological estimates. Yet, our preferred model shows that the geodetic solution is consistent with the activation of a single fault system during the entire sequence, the surface expression of which could be associated with the Buia blind thrust, supporting the hypothesis that the main activity of the Eastern Alps occurs close to the relief margin, as observed in other mountain belts. The retrieved slip pattern consists of a main coseismic patch located 3-5 km depth, in good agreement with the distribution of the main shocks. Additional slip is required in the shallower portions of the fault to reproduce the local uplift observed in the region characterized by Quaternary active folding. We tentatively interpret this patch as postseismic deformation (afterslip) occurring at the edge of the main coseismic patch. Finally, our rupture plane spatially correlates with the area of the locked fault determined from interseismic measurements, supporting the hypothesis that interseismic slip on the creeping dislocation causes strain to accumulate on the shallow (above ˜10 km depth) locked section. Assuming that all the long-term accommodation between Adria and Eurasia is seismically released, a time span of 500-700 years of strain-accumulating plate motion would result in a 1976-like earthquake.
USDA-ARS?s Scientific Manuscript database
Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
Kuipers, A G J; Kamstra, S A; de Jeu, M J; Visser, R G F
2002-01-01
Highly repetitive DNA sequences were isolated from genomic DNA libraries of Alstroemeria psittacina and A. inodora. Among the repetitive sequences that were isolated, tandem repeats as well as dispersed repeats could be discerned. The tandem repeats belonged to a family of interlinked Sau3A subfragments with sizes varying from 68-127 bp, and constituted a larger HinfI repeat of approximately 400 bp. Southern hybridization showed a similar molecular organization of the tandem repeats in each of the Brazilian Alstroemeria species tested. None of the repeats hybridized with DNA from Chilean Alstroemeria species, which indicates that they are specific for the Brazilian species. In-situ localization studies revealed the tandem repeats to be localized in clusters on the chromosomes of A. inodora and A. psittacina: distal hybridization sites were found on chromosome arms 2PS, 6PL, 7PS, 7PL and 8PL, interstitial sites on chromosome arms 2PL, 3PL, 4PL and 5PL. The applicability of the tandem repeats for cytogenetic analysis of interspecific hybrids and their role in heterochromatin organization are discussed.
Yin, Yan-hui; Li, Bi-chun; Wei, Guang-hui; Zhu, Cai-ye; Li, Wei; Zhang, Ya-ni; Du, Li-xin; Cao, Wen-guang
2012-05-01
The aim of this study was to clone the heart-type fatty acid binding protein (H-FABP) gene of Xuhuai goat, to explore it bioinformatically, and analyze the subcellular localization using enhanced green fluorescent protein (EGFP). The results showed that the coding sequence (CDS) length of Xuhuai goat H-FABP gene was 402 bp, encoding 133 amino acids (GenBank accession number AY466498.1). The H-FABP cDNA coding sequence was compared with the corresponding region of human, chicken, brown rat, cow, wild boar, donkey, and zebrafish. The similarity were 89%, 76%, 85%, 84%, 93%, 91%, 70%, respectively. For the corresponding amino acid sequences, the similarity were 90%, 79%, 88%, 97%, 95%, 94%, 72%, respectively. This study did not find the signal peptide region in the H-FABP protein; it revealed that H-FABP protein might be a nonsecreted protein. H-FABP expression was detected in vitro by reverse transcription-polymerase chain reaction (RT-PCR), and the EGFP-H-FABP fusion protein was localized to the cytoplasm. The gene could also be transiently and permanently expressed in mice.
Anderson, Jenn; Kuehl, Rebecca A; Mehltretter Drury, Sara A; Tschetter, Lois; Schwaegerl, Mary; Yoder, Julia; Gullickson, Heidi; Lamp, Jamison; Bachman, Charlotte; Hildreth, Marilyn
2017-12-01
Empirical evidence demonstrates myriad benefits of breastfeeding for mother and child, along with benefits to businesses that support breastfeeding. Federal and state legislation requires workplace support for pumping and provides protections for public breastfeeding. Yet, many are unaware of these laws, and thus, support systems remain underdeveloped. We used a community-based approach to spread awareness about the evidence-based benefits of breastfeeding and breastfeeding support. We worked to improve breastfeeding support at the local hospital, among local employers, and throughout the broader community. Our coalition representing the hospital, the chamber of commerce, the university, and local lactation consultants used a public deliberation model for dissemination. We held focus groups, hosted a public conversation, spoke to local organizations, and promoted these efforts through local media. The hospital achieved Baby-Friendly status and opened a Baby Café. Breastfeeding support in the community improved through policies, designated pumping spaces, and signage that supports public breastfeeding at local businesses. Community awareness of the benefits of breastfeeding and breastfeeding support increased; the breastfeeding support coalition remains active. The public deliberation process for dissemination engaged the community with evidence-based promotion of breastfeeding support, increased agency, and produced sustainable results tailored to the community's unique needs.
ISRNA: an integrative online toolkit for short reads from high-throughput sequencing data.
Luo, Guan-Zheng; Yang, Wei; Ma, Ying-Ke; Wang, Xiu-Jie
2014-02-01
Integrative Short Reads NAvigator (ISRNA) is an online toolkit for analyzing high-throughput small RNA sequencing data. Besides the high-speed genome mapping function, ISRNA provides statistics for genomic location, length distribution and nucleotide composition bias analysis of sequence reads. Number of reads mapped to known microRNAs and other classes of short non-coding RNAs, coverage of short reads on genes, expression abundance of sequence reads as well as some other analysis functions are also supported. The versatile search functions enable users to select sequence reads according to their sub-sequences, expression abundance, genomic location, relationship to genes, etc. A specialized genome browser is integrated to visualize the genomic distribution of short reads. ISRNA also supports management and comparison among multiple datasets. ISRNA is implemented in Java/C++/Perl/MySQL and can be freely accessed at http://omicslab.genetics.ac.cn/ISRNA/.
Navascués, Miguel; Vaxevanidou, Zafeiro; González-Martínez, Santiago C; Climent, José; Gil, Luis; Emerson, Brent C
2006-01-01
Chloroplast microsatellites are becoming increasingly popular markers for population genetic studies in plants, but there has been little focus on their potential for demographic inference. In this work the utility of chloroplast microsatellites for the study of population expansions was explored. First, we investigated the power of mismatch distribution analysis and the FS test with coalescent simulations of different demographic scenarios. We then applied those methods to empirical data obtained for the Canary Island pine (Pinus canariensis). The results of the simulations showed that chloroplast microsatellites are sensitive to sudden population growth. The power of the FS test and accuracy of demographic parameter estimates, such as the time of expansion, were reduced proportionally to the level of homoplasy within the data. The analysis of Canary Island pine chloroplast microsatellite data indicated population expansions for almost all sample localities. Demographic expansions at the island level can be explained by the colonisation of the archipelago by the pine, while population expansions of different ages in different localities within an island appear to be the result of local extinctions and recolonisation dynamics. Comparable mitochondrial DNA sequence data from a parasite of P. canariensis, the weevil Brachyderes rugatus, supports this scenario, suggesting a key role for volcanism in the evolution of pine forest communities in the Canary Islands. PMID:16911194
Sequences Associated with Centromere Competency in the Human Genome
Hayden, Karen E.; Strome, Erin D.; Merrett, Stephanie L.; Lee, Hye-Ran; Rudd, M. Katharine
2013-01-01
Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance. PMID:23230266
Inter-Comparison of CHARM Data and WSR-88D Storm Integrated Rainfall
NASA Technical Reports Server (NTRS)
Jedlovec, Gary J.; Meyer, Paul J.; Guillory, Anthony R.; Stellman, Keith; Limaye, Ashutosh; Arnold, James E. (Technical Monitor)
2002-01-01
A localized precipitation network has been established over a 4000 sq km region of northern Alabama in support of local weather and climate research at the Global Hydrology and Climate Center (GHCC) in Huntsville. This Cooperative Huntsville-Area Rainfall Measurement (CHARM) network is comprised of over 80 volunteers who manually take daily rainfall measurements from 85 sites. The network also incorporates 20 automated gauges that report data at 1-5 minute intervals on a 24 h a day basis. The average spacing of the gauges in the network is about 6 kin, however coverage in some regions benefit from gauges every 1-2 km. The 24 h rainfall totals from the CHARM network have been used to validate Stage III rainfall estimates of daily and storm totals derived from the WSR-88D radars that cover northern Alabama. The Stage III rainfall product is produced by the Lower Mississippi River Forecast Center (LMRFC) in support of their daily forecast operations. The intercomparisons between the local rain gauge and the radar estimates have been useful to understand the accuracy and utility of the Stage III data. Recently, the Stage III and CHARM rainfall measurements have been combined to produce an hourly rainfall dataset at each CHARM observation site. The procedure matches each CHARM site with a time sequence of Stage III radar estimates of precipitation. Hourly stage III rainfall estimates were used to partition the rain gauge values to the time interval over which they occurred. The new hourly rain gauge dataset is validated at selected points where 1-5 minute rainfall measurements have been made. This procedure greatly enhances the utility of the CHARM data for local weather and hydrologic modeling studies. The conference paper will present highlights of the Stage III intercomparison and some examples of the combined radar / rain gauge product demonstrating its accuracy and utility in deriving an hourly rainfall product from the 24 h CHARM totals.
Morse, Solon F; Bush, Sarah E; Patterson, Bruce D; Dick, Carl W; Gruwell, Matthew E; Dittmar, Katharina
2013-05-01
Bat flies are a diverse clade of obligate ectoparasites on bats. Like most blood-feeding insects, they harbor endosymbiotic prokaryotes, but the origins and nature of these symbioses are still poorly understood. To expand the knowledge of bacterial associates in bat flies, the diversity and evolution of the dominant endosymbionts in six of eight nominal subfamilies of bat flies (Streblidae and Nycteribiidae) were studied. Furthermore, the localization of endosymbionts and their transmission across developmental stages within the family Streblidae were explored. The results show diverse microbial associates in bat flies, with at least four ancestral invasions of distantly related microbial lineages throughout bat fly evolution. Phylogenetic relationships support the presence of at least two novel symbiont lineages (here clades B and D), and extend the geographic and taxonomic range of a previously documented lineage ("Candidatus Aschnera chinzeii"; here clade A). Although these lineages show reciprocally monophyletic clusters with several bat fly host clades, their phylogenetic relationships generally do not reflect current bat fly taxonomy or phylogeny. However, within some endosymbiont clades, congruent patterns of symbiont-host divergence are apparent. Other sequences identified in this study fall into the widely distributed, highly invasive, insect-associated Arsenophonus lineage and may be the result of symbiont replacements and/or transient infections (here clade C). Vertical transmission of endosymbionts of clades B and D is supported by fluorescent signal (fluorescent in situ hybridization [FISH]) and microbial DNA detection across developmental stages. The fluorescent bacterial signal is consistently localized within structures resembling bacteriomes, although their anatomical position differs by host fly clade. In summary, the results suggest an obligate host-endosymbiont relationship for three of the four known symbiont clades associated with bat flies (clades A, B, and D).
Morse, Solon F.; Bush, Sarah E.; Patterson, Bruce D.; Dick, Carl W.; Gruwell, Matthew E.
2013-01-01
Bat flies are a diverse clade of obligate ectoparasites on bats. Like most blood-feeding insects, they harbor endosymbiotic prokaryotes, but the origins and nature of these symbioses are still poorly understood. To expand the knowledge of bacterial associates in bat flies, the diversity and evolution of the dominant endosymbionts in six of eight nominal subfamilies of bat flies (Streblidae and Nycteribiidae) were studied. Furthermore, the localization of endosymbionts and their transmission across developmental stages within the family Streblidae were explored. The results show diverse microbial associates in bat flies, with at least four ancestral invasions of distantly related microbial lineages throughout bat fly evolution. Phylogenetic relationships support the presence of at least two novel symbiont lineages (here clades B and D), and extend the geographic and taxonomic range of a previously documented lineage (“Candidatus Aschnera chinzeii”; here clade A). Although these lineages show reciprocally monophyletic clusters with several bat fly host clades, their phylogenetic relationships generally do not reflect current bat fly taxonomy or phylogeny. However, within some endosymbiont clades, congruent patterns of symbiont-host divergence are apparent. Other sequences identified in this study fall into the widely distributed, highly invasive, insect-associated Arsenophonus lineage and may be the result of symbiont replacements and/or transient infections (here clade C). Vertical transmission of endosymbionts of clades B and D is supported by fluorescent signal (fluorescent in situ hybridization [FISH]) and microbial DNA detection across developmental stages. The fluorescent bacterial signal is consistently localized within structures resembling bacteriomes, although their anatomical position differs by host fly clade. In summary, the results suggest an obligate host-endosymbiont relationship for three of the four known symbiont clades associated with bat flies (clades A, B, and D). PMID:23435889
Somaraju Chalasani, Madhavi Latha; Muppirala, Madhavi; G Ponnam, Surya Prakash; Kannabiran, Chitra; Swarup, Ghanshyam
2013-01-01
Mutations in the eye lens gap junction protein connexin 50 cause cataract. Earlier we identified a frameshift mutant of connexin 50 (c.670insA; p.Thr203AsnfsX47) in a family with autosomal recessive cataract. The mutant protein is smaller and contains 46 aberrant amino acids at the C-terminus after amino acid 202. Here, we have analysed this frameshift mutant and observed that it localized to the endoplasmic reticulum (ER) but not in the plasma membrane. Moreover, overexpression of the mutant resulted in disintegration of the ER-Golgi intermediate compartment (ERGIC), reduction in the level of ERGIC-53 protein and breakdown of the Golgi in many cells. Overexpression of the frameshift mutant partially inhibited the transport of wild type connexin 50 to the plasma membrane. A deletion mutant lacking the aberrant sequence showed predominant localization in the ER and inhibited anterograde protein transport suggesting, therefore, that the aberrant sequence is not responsible for improper localization of the frameshift mutant. Further deletion analysis showed that the fourth transmembrane domain and a membrane proximal region (231-294 amino acids) of the cytoplasmic domain are needed for transport from the ER and localization to the plasma membrane. Our results show that a frameshift mutant of connexin 50 mislocalizes to the ER and causes disintegration of the ERGIC and Golgi. We have also identified a sequence of connexin 50 crucial for transport from the ER and localization to the plasma membrane.
Xun, Weibing; Xu, Zhihui; Li, Wei; Ren, Yi; Huang, Ting; Ran, Wei; Wang, Boren; Shen, Qirong; Zhang, Ruifu
2016-09-01
Natural ecosystems comprise the planet's wild plant and animal resources, but large tracts of land have been converted to agroecosystems to support the demand for agricultural products. This conversion limits the number of plant species and decreases the soil biological diversity. Here we used high-throughput 16S rRNA gene sequencing to evaluate the responses of soil bacterial communities in long-term converted and fertilized red soils (a type of Ferralic Cambisol). We observed that soil bacterial diversity was strongly affected by different types of fertilization management. Oligotrophic bacterial taxa demonstrated large relative abundances in chemically fertilized soil, whereas copiotrophic bacterial taxa were found in large relative abundances in organically fertilized and fallow management soils. Only organic-inorganic fertilization exhibited the same local taxonomic and phylogenetic diversity as that of a natural ecosystem. However, the independent use of organic or inorganic fertilizer reduced local taxonomic and phylogenetic diversity and caused biotic homogenization. This study demonstrated that the homogenization of bacterial communities caused by natural-to-agricultural ecosystem conversion can be mitigated by employing rational organic-inorganic fertilization management.
2014-01-01
Background The rare autosomal genetic disorder, Spondylo-meta-epiphyseal dysplasia with short limbs and abnormal calcifications (SMED-SL), is reported to be caused by missense or splice site mutations in the human discoidin domain receptor 2 (DDR2) gene. Previously our group has established that trafficking defects and loss of ligand binding are the underlying cellular mechanisms of several SMED-SL causing mutations. Here we report the clinical characteristics of two siblings of consanguineous marriage with suspected SMED-SL and identification of a novel disease-causing mutation in the DDR2 gene. Methods Clinical evaluation and radiography were performed to evaluate the patients. All the coding exons and splice sites of the DDR2 gene were sequenced by Sanger sequencing. Subcellular localization of the mutated DDR2 protein was determined by confocal microscopy, deglycosylation assay and Western blotting. DDR2 activity was measured by collagen activation and Western analysis. Results In addition to the typical features of SMED-SL, one of the patients has an eye phenotype including visual impairment due to optic atrophy. DNA sequencing revealed a novel homozygous dinucleotide deletion mutation (c.2468_2469delCT) on exon 18 of the DDR2 gene in both patients. The mutation resulted in a frameshift leading to an amino acid change at position S823 and a predicted premature termination of translation (p.S823Cfs*2). Subcellular localization of the mutant protein was analyzed in mammalian cell lines, and it was found to be largely retained in the endoplasmic reticulum (ER), which was further supported by its N-glycosylation profile. In keeping with its cellular mis-localization, the mutant protein was found to be deficient in collagen-induced receptor activation, suggesting protein trafficking defects as the major cellular mechanism underlying the loss of DDR2 function in our patients. Conclusions Our results indicate that the novel mutation results in defective trafficking of the DDR2 protein leading to loss of function and disease. This confirms our previous findings that DDR2 missense mutations occurring at the kinase domain result in retention of the mutant protein in the ER. PMID:24725993
Kim, Taeho; Kim, Jiyeon; Nadler, Steven A; Park, Joong-Ki
2016-05-01
Testing hypotheses of monophyly for different nematode groups in the context of broad representation of nematode diversity is central to understanding the patterns and processes of nematode evolution. Herein sequence information from mitochondrial genomes is used to test the monophyly of diplogasterids, which includes an important nematode model organism. The complete mitochondrial genome sequence of Koerneria sudhausi, a representative of Diplogasteromorpha, was determined and used for phylogenetic analyses along with 60 other nematode species. The mtDNA of K. sudhausi is comprised of 16,005 bp that includes 36 genes (12 protein-coding genes, 2 ribosomal RNA genes and 22 transfer RNA genes) encoded in the same direction. Phylogenetic trees inferred from amino acid and nucleotide sequence data for the 12 protein-coding genes strongly supported the sister relationship of K. sudhausi with Pristionchus pacificus, supporting Diplogasteromorpha. The gene order of K. sudhausi is identical to that most commonly found in members of the Rhabditomorpha + Ascaridomorpha + Diplogasteromorpha clade, with an exception of some tRNA translocations. Both the gene order pattern and sequence-based phylogenetic analyses support a close relationship between the diplogasterid species and Rhabditomorpha. The nesting of the two diplogasteromorph species within Rhabditomorpha is consistent with most molecular phylogenies for the group, but inconsistent with certain morphology-based hypotheses that asserted phylogenetic affinity between diplogasteromorphs and tylenchomorphs. Phylogenetic analysis of mitochondrial genome sequences strongly supports monophyly of the diplogasteromorpha.
Dasgupta, Anindita; Battala, Madhusudana; Saggurti, Niranjan; Nair, Saritha; Naik, D D; Silverman, Jay G; Balaiah, Donta; Raj, Anita
2013-02-15
Women living in slum communities in India too often contend with depression. Local social support in other national contexts has been shown to reduce such risks. Less research in this area has been done in India and specifically with monogamous wives. This study involved a household sample of women reporting husband's partner violence or heavy alcohol use (N=220). Participants were assessed on high social support in the community, and number of depressed days in the past 30 day (dichotomized as 10+ vs.<10 day). Logistic regression analyses assessed associations between local social support and depression, adjusting for demographics, spousal violence, and husband risky alcohol use. High local social support was reported by 40% of women; 33% reported never having local social support. Women with high local social support were significantly less likely to report depression (AOR: 0.53, 95% CI: 0.30-0.94), even after adjusting for husbands' recent spousal violence perpetration and his risky alcohol use, both of which were also significantly associated with depression. The data analyzed for this study are vulnerable to self-report and recall biases, as well as issues around social desirability. These analyses are further limited due to the use of single item measures to assess depression and local social support. Local social support reduces risk for depression among women in Mumbai contending with husbands' spousal violence and risky alcohol use. These findings support the likely utility of community-based social support building to reduce risk for depression among this vulnerable population of women. Copyright © 2012 Elsevier B.V. All rights reserved.
Cumulative Weighing of Time in Intertemporal Tradeoffs
2016-01-01
We examine preferences for sequences of delayed monetary gains. In the experimental literature, two prominent models have been advanced as psychological descriptions of preferences for sequences. In one model, the instantaneous utilities of the outcomes in a sequence are discounted as a function of their delays, and assembled into a discounted utility of the sequence. In the other model, the accumulated utility of the outcomes in a sequence is considered along with utility or disutility from improvement in outcome utilities and utility or disutility from the spreading of outcome utilities. Drawing on three threads of evidence concerning preferences for sequences of monetary gains, we propose that the accumulated utility of the outcomes in a sequence is traded off against the duration of utility accumulation. In our first experiment, aggregate choice behavior provides qualitative support for the tradeoff model. In three subsequent experiments, one of which incentivized, disaggregate choice behavior provides quantitative support for the tradeoff model in Bayesian model contests. One thread of evidence motivating the tradeoff model is that, when, in the choice between two single dated outcomes, it is conveyed that receiving less sooner means receiving nothing later, preference for receiving more later increases, but when it is conveyed that receiving more later means receiving nothing sooner, preference is left unchanged. Our results show that this asymmetric hidden-zero effect is indeed driven by those supporting the tradeoff model. The tradeoff model also accommodates all remaining evidence on preferences for sequences of monetary gains. PMID:27560853
de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V
2015-03-01
The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Udaya Sekhar, Nagothu
2003-12-01
Conservationists in the recent years view local peoples' support for protected areas management as an important element of biodiversity conservation. This is often linked to the direct benefits, which local communities get from the protected areas. These benefits could be in the form of biomass resources, park funds diverted to local villages by state agencies and revenue from wildlife tourism. There are a very few studies which have attempted to study the direct relationship between benefits from wildlife tourism and local support for conservation. In India, wildlife tourism is restricted, and mostly controlled by state and private agencies. Wildlife conservation policy does not view tourism in protected areas as a source of revenue for the local communities. The present study examines the local people's attitudes towards wildlife tourism and the impact of benefits from tourism on the local support for Sariska Tiger Reserve (STR), India. STR is a flagship for tourism where protected areas are increasingly being visited and where local support for wildlife tourism has not been studied adequately. Results indicate that two-thirds of the respondents were positive towards tourism and support for conservation. The respondents were aware that more tourism benefits are possible from a well-conserved protected area. There appears to be correlation between benefits obtained by local people from wildlife tourism and other sources, and support for protected area existence, suggesting that benefits impact people's attitudes towards conservation. Some of the main problems are the unequal distribution of tourism benefits, lack of locals' involvement in tourism and development. There is a need to clearly address these issues, so that protected areas may get the support of local people, which may lead to sustainable development.
Onuț-Brännström, Ioana; Benjamin, Mitchell; Scofield, Douglas G; Heiðmarsson, Starri; Andersson, Martin G I; Lindström, Eva S; Johannesson, Hanna
2018-03-13
In this study, we explored the diversity of green algal symbionts (photobionts) in sympatric populations of the cosmopolitan lichen-forming fungi Thamnolia and Cetraria. We sequenced with both Sanger and Ion Torrent High-Throughput Sequencing technologies the photobiont ITS-region of 30 lichen thalli from two islands: Iceland and Öland. While Sanger recovered just one photobiont genotype from each thallus, the Ion Torrent data recovered 10-18 OTUs for each pool of 5 lichen thalli, suggesting that individual lichens can contain heterogeneous photobiont populations. Both methods showed evidence for photobiont sharing between Thamnolia and Cetraria on Iceland. In contrast, our data suggest that on Öland the two mycobionts associate with distinct photobiont communities, with few shared OTUs revealed by Ion Torrent sequencing. Furthermore, by comparing our sequences with public data, we identified closely related photobionts from geographically distant localities. Taken together, we suggest that the photobiont composition in Thamnolia and Cetraria results from both photobiont-mycobiont codispersal and local acquisition during mycobiont establishment and/or lichen growth. We hypothesize that this is a successful strategy for lichens to be flexible in the use of the most adapted photobiont for the environment.
Detection of PIWI and piRNAs in the mitochondria of mammalian cancer cells.
Kwon, ChangHyuk; Tak, Hyosun; Rho, Mina; Chang, Hae Ryung; Kim, Yon Hui; Kim, Kyung Tae; Balch, Curt; Lee, Eun Kyung; Nam, Seungyoon
2014-03-28
Piwi-interacting RNAs (piRNAs) are 26-31 nt small noncoding RNAs that are processed from their longer precursor transcripts by Piwi proteins. Localization of Piwi and piRNA has been reported mostly in nucleus and cytoplasm of higher eukaryotes germ-line cells, where it is believed that known piRNA sequences are located in repeat regions of nuclear genome in germ-line cells. However, localization of PIWI and piRNA in mammalian somatic cell mitochondria yet remains largely unknown. We identified 29 piRNA sequence alignments from various regions of the human mitochondrial genome. Twelve out 29 piRNA sequences matched stem-loop fragment sequences of seven distinct tRNAs. We observed their actual expression in mitochondria subcellular fractions by inspecting mitochondrial-specific small RNA-Seq datasets. Of interest, the majority of the 29 piRNAs overlapped with multiple longer transcripts (expressed sequence tags) that are unique to the human mitochondrial genome. The presence of mature piRNAs in mitochondria was detected by qRT-PCR of mitochondrial subcellular RNAs. Further validation showed detection of Piwi by colocalization using anti-Piwil1 and mitochondria organelle-specific protein antibodies. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Code of Federal Regulations, 2010 CFR
2010-10-01
... access universal service support for areas served by price cap local exchange carriers. 54.806 Section 54... Administrator of interstate access universal service support for areas served by price cap local exchange... calculate the Interstate Access Universal Service Support for areas served by price cap local exchange...
Nuclear Retention Elements of U3 Small Nucleolar RNA
Speckmann, Wayne; Narayanan, Aarthi; Terns, Rebecca; Terns, Michael P.
1999-01-01
The processing and methylation of precursor rRNA is mediated by the box C/D small nucleolar RNAs (snoRNAs). These snoRNAs differ from most cellular RNAs in that they are not exported to the cytoplasm. Instead, these RNAs are actively retained in the nucleus where they assemble with proteins into mature small nucleolar ribonucleoprotein particles and are targeted to their intranuclear site of action, the nucleolus. In this study, we have identified the cis-acting sequences responsible for the nuclear retention of U3 box C/D snoRNA by analyzing the nucleocytoplasmic distributions of an extensive panel of U3 RNA variants after injection of the RNAs into Xenopus oocyte nuclei. Our data indicate the importance of two conserved sequence motifs in retaining U3 RNA in the nucleus. The first motif is comprised of the conserved box C′ and box D sequences that characterize the box C/D family. The second motif contains conserved box sequences B and C. Either motif is sufficient for nuclear retention, but disruption of both motifs leads to mislocalization of the RNAs to the cytoplasm. Variant RNAs that are not retained also lack 5′ cap hypermethylation and fail to associate with fibrillarin. Furthermore, our results indicate that nuclear retention of U3 RNA does not simply reflect its nucleolar localization. A fragment of U3 containing the box B/C motif is not localized to nucleoli but retained in coiled bodies. Thus, nuclear retention and nucleolar localization are distinct processes with differing sequence requirements. PMID:10567566
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Thanh, Duy Pham; Bodhidatta, Ladaporn; Mason, Carl Jeffries; Srijan, Apichai; Rabaa, Maia A.; Vinh, Phat Voong; Thanh, Tuyen Ha; Thwaites, Guy E.; Baker, Stephen; Holt, Kathryn E.
2017-01-01
Vaccines against Salmonella Typhi, the causative agent of typhoid fever, are commonly used by travellers, however, there are few examples of national immunization programs in endemic areas. There is therefore a paucity of data on the impact of typhoid immunization programs on localised populations of S. Typhi. Here we have used whole genome sequencing (WGS) to characterise 44 historical bacterial isolates collected before and after a national typhoid immunization program that was implemented in Thailand in 1977 in response to a large outbreak; the program was highly effective in reducing typhoid case numbers. Thai isolates were highly diverse, including 10 distinct phylogenetic lineages or genotypes. Novel prophage and plasmids were also detected, including examples that were previously only reported in Shigella sonnei and Escherichia coli. The majority of S. Typhi genotypes observed prior to the immunization program were not observed following it. Post-vaccine era isolates were more closely related to S. Typhi isolated from neighbouring countries than to earlier Thai isolates, providing no evidence for the local persistence of endemic S. Typhi following the national immunization program. Rather, later cases of typhoid appeared to be caused by the occasional importation of common genotypes from neighbouring Vietnam, Laos, and Cambodia. These data show the value of WGS in understanding the impacts of vaccination on pathogen populations and provide support for the proposal that large-scale typhoid immunization programs in endemic areas could result in lasting local disease elimination, although larger prospective studies are needed to test this directly. PMID:28060810
Meteor localization via statistical analysis of spatially temporal fluctuations in image sequences
NASA Astrophysics Data System (ADS)
Kukal, Jaromír.; Klimt, Martin; Šihlík, Jan; Fliegel, Karel
2015-09-01
Meteor detection is one of the most important procedures in astronomical imaging. Meteor path in Earth's atmosphere is traditionally reconstructed from double station video observation system generating 2D image sequences. However, the atmospheric turbulence and other factors cause spatially-temporal fluctuations of image background, which makes the localization of meteor path more difficult. Our approach is based on nonlinear preprocessing of image intensity using Box-Cox and logarithmic transform as its particular case. The transformed image sequences are then differentiated along discrete coordinates to obtain statistical description of sky background fluctuations, which can be modeled by multivariate normal distribution. After verification and hypothesis testing, we use the statistical model for outlier detection. Meanwhile the isolated outlier points are ignored, the compact cluster of outliers indicates the presence of meteoroids after ignition.
Advective transport observations with MODPATH-OBS--documentation of the MODPATH observation process
Hanson, R.T.; Kauffman, L.K.; Hill, M.C.; Dickinson, J.E.; Mehl, S.W.
2013-01-01
The MODPATH-OBS computer program described in this report is designed to calculate simulated equivalents for observations related to advective groundwater transport that can be represented in a quantitative way by using simulated particle-tracking data. The simulated equivalents supported by MODPATH-OBS are (1) distance from a source location at a defined time, or proximity to an observed location; (2) time of travel from an initial location to defined locations, areas, or volumes of the simulated system; (3) concentrations used to simulate groundwater age; and (4) percentages of water derived from contributing source areas. Although particle tracking only simulates the advective component of conservative transport, effects of non-conservative processes such as retardation can be approximated through manipulation of the effective-porosity value used to calculate velocity based on the properties of selected conservative tracers. This program can also account for simple decay or production, but it cannot account for diffusion. Dispersion can be represented through direct simulation of subsurface heterogeneity and the use of many particles. MODPATH-OBS acts as a postprocessor to MODPATH, so that the sequence of model runs generally required is MODFLOW, MODPATH, and MODPATH-OBS. The version of MODFLOW and MODPATH that support the version of MODPATH-OBS presented in this report are MODFLOW-2005 or MODFLOW-LGR, and MODPATH-LGR. MODFLOW-LGR is derived from MODFLOW-2005, MODPATH 5, and MODPATH 6 and supports local grid refinement. MODPATH-LGR is derived from MODPATH 5. It supports the forward and backward tracking of particles through locally refined grids and provides the output needed for MODPATH_OBS. For a single grid and no observations, MODPATH-LGR results are equivalent to MODPATH 5. MODPATH-LGR and MODPATH-OBS simulations can use nearly all of the capabilities of MODFLOW-2005 and MODFLOW-LGR; for example, simulations may be steady-state, transient, or a combination. Though the program name MODPATH-OBS specifically refers to observations, the program also can be used to calculate model prediction of observations. MODPATH-OBS is primarily intended for use with separate programs that conduct sensitivity analysis, data needs assessment, parameter estimation, and uncertainty analysis, such as UCODE_2005, and PEST. In many circumstances, refined grids in selected parts of a model are important to simulated hydraulics, detailed inflows and outflows, or other system characteristics. MODFLOW-LGR and MODPATH-LGR support accurate local grid refinement in which both mass (flows) and energy (head) are conserved across the local grid boundary. MODPATH-OBS is designed to take advantage of these capabilities. For example, particles tracked between a pumping well and a nearby stream, which are simulated poorly if a river and well are located in a single large grid cell, can be simulated with improved accuracy using a locally refined grid in MODFLOW-LGR, MODPATH-LGR, and MODPATH-OBS. The locally-refined-grid approach can provide more accurate simulated equivalents to observed transport between the well and the river. The documentation presented here includes a brief discussion of previous work, description of the methods, and detailed descriptions of the required input files and how the output files are typically used.
Wireless Avionics Packet to Support Fault Tolerance for Flight Applications
NASA Technical Reports Server (NTRS)
Block, Gary L.; Whitaker, William D.; Dillon, James W.; Lux, James P.; Ahmad, Mohammad
2009-01-01
In this protocol and packet format, data traffic is monitored by all network interfaces to determine the health of transmitter and subsystems. When failures are detected, the network inter face applies its recover y policies to provide continued service despite the presence of faults. The protocol, packet format, and inter face are independent of the data link technology used. The current demonstration system supports both commercial off-the-shelf wireless connections and wired Ethernet connections. Other technologies such as 1553 or serial data links can be used for the network backbone. The Wireless Avionics packet is divided into three parts: a header, a data payload, and a checksum. The header has the following components: magic number, version, quality of service, time to live, sending transceiver, function code, payload length, source Application Data Interface (ADI) address, destination ADI address, sending node address, target node address, and a sequence number. The magic number is used to identify WAV packets, and allows the packet format to be updated in the future. The quality of service field allows routing decisions to be made based on this value and can be used to route critical management data over a dedicated channel. The time to live value is used to discard misrouted packets while the source transceiver is updated at each hop. This information is used to monitor the health of each transceiver in the network. To identify the packet type, the function code is used. Besides having a regular data packet, the system supports diagnostic packets for fault detection and isolation. The payload length specifies the number of data bytes in the payload, and this supports variable-length packets in the network. The source ADI is the address of the originating interface. This can be used by the destination application to identify the originating source of the packet where the address consists of a subnet, subsystem class within the subnet, a subsystem unit, and the local ADI number. The destination ADI is used to route the packet to its ultimate destination. At each hop, the sending interface uses the destination address to determine the next node for the data. The sending node is the node address of the interface that is broadcasting the packet. This field is used to determine the health of the subsystem that is sending the packet. In the case of a packet that traverses several intermediate nodes, it may be the node address of the intermediate node. The target node is the node address of the next hop for the packet. It may be an intermediate node, or the final destination for the packet. The sequence number is used to identify duplicate packets. Because each interface has multiple transceivers, the same packet will appear at both receivers. The sequence number allows the interface to correlate the reception and forward a single, unique packet for additional processing. The subnet field allows data traffic to be partitioned into segregated local networks to support large networks while keeping each subnet at a manageable size. This also keeps the routing table small enough so routing can be done by a simple table lookup in an FPGA device. The subsystem class identifies members of a set of redundant subsystems, and, in a hot standby configuration, all members of the subsystem class will receive the data packets. Only the active subsystem will generate data traffic. Specific units in a class of redundant units can be identified and, if the hot standby configuration is not used, packets will be directed to a specific subsystem unit.
NASA Astrophysics Data System (ADS)
Jiao, Yong; Wakakuwa, Eyuri; Ogawa, Tomohiro
2018-02-01
We consider asymptotic convertibility of an arbitrary sequence of bipartite pure states into another by local operations and classical communication (LOCC). We adopt an information-spectrum approach to address cases where each element of the sequences is not necessarily a tensor power of a bipartite pure state. We derive necessary and sufficient conditions for the LOCC convertibility of one sequence to another in terms of spectral entropy rates of entanglement of the sequences. Based on these results, we also provide simple proofs for previously known results on the optimal rates of entanglement concentration and dilution of general sequences of bipartite pure states.
Working Memory Capacity and Stroop Interference: Global versus Local Indices of Executive Control
ERIC Educational Resources Information Center
Meier, Matt E.; Kane, Michael J.
2013-01-01
Two experiments examined the relations among working memory capacity (WMC), congruency-sequence effects, proportion-congruency effects, and the color-word Stroop effect to test whether congruency-sequence effects might inform theoretical claims regarding WMC's prediction of Stroop interference. In Experiment 1, subjects completed either a…
The Metallicity Evolution of Low-mass Galaxies: New Constraints at Intermediate Redshift
NASA Astrophysics Data System (ADS)
Henry, Alaina; Martin, Crystal L.; Finlator, Kristian; Dressler, Alan
2013-06-01
We present abundance measurements from 26 emission-line-selected galaxies at z ~ 0.6-0.7. By reaching stellar masses as low as 108 M ⊙, these observations provide the first measurement of the intermediate-redshift mass-metallicity (MZ) relation below 109 M ⊙. For the portion of our sample above M > 109 M ⊙ (8/26 galaxies), we find good agreement with previous measurements of the intermediate-redshift MZ relation. Compared to the local relation, we measure an evolution that corresponds to a 0.12 dex decrease in oxygen abundances at intermediate redshifts. This result confirms the trend that metallicity evolution becomes more significant toward lower stellar masses, in keeping with a downsizing scenario where low-mass galaxies evolve onto the local MZ relation at later cosmic times. We show that these galaxies follow the local fundamental metallicity relation, where objects with higher specific (mass-normalized) star formation rates (SFRs) have lower metallicities. Furthermore, we show that the galaxies in our sample lie on an extrapolation of the SFR-M * relation (the star-forming main sequence). Leveraging the MZ relation and star-forming main sequence (and combining our data with higher-mass measurements from the literature), we test models that assume an equilibrium between mass inflow, outflow, and star formation. We find that outflows are required to describe the data. By comparing different outflow prescriptions, we show that momentum, driven winds can describe the MZ relation; however, this model underpredicts the amount of star formation in low-mass galaxies. This disagreement may indicate that preventive feedback from gas heating has been overestimated, or it may signify a more fundamental deviation from the equilibrium assumption. Some of the data presented herein were obtained at the W. M. Keck Observatory, which is operated as a scientific partnership among the California Institute of Technology, the University of California, and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W. M. Keck Foundation.
Zou, Lingyun; Wang, Zhengzhi; Huang, Jiaomin
2007-12-01
Subcellular location is one of the key biological characteristics of proteins. Position-specific profiles (PSP) have been introduced as important characteristics of proteins in this article. In this study, to obtain position-specific profiles, the Position Specific Iterative-Basic Local Alignment Search Tool (PSI-BLAST) has been used to search for protein sequences in a database. Position-specific scoring matrices are extracted from the profiles as one class of characteristics. Four-part amino acid compositions and 1st-7th order dipeptide compositions have also been calculated as the other two classes of characteristics. Therefore, twelve characteristic vectors are extracted from each of the protein sequences. Next, the characteristic vectors are weighed by a simple weighing function and inputted into a BP neural network predictor named PSP-Weighted Neural Network (PSP-WNN). The Levenberg-Marquardt algorithm is employed to adjust the weight matrices and thresholds during the network training instead of the error back propagation algorithm. With a jackknife test on the RH2427 dataset, PSP-WNN has achieved a higher overall prediction accuracy of 88.4% rather than the prediction results by the general BP neural network, Markov model, and fuzzy k-nearest neighbors algorithm on this dataset. In addition, the prediction performance of PSP-WNN has been evaluated with a five-fold cross validation test on the PK7579 dataset and the prediction results have been consistently better than those of the previous method on the basis of several support vector machines, using compositions of both amino acids and amino acid pairs. These results indicate that PSP-WNN is a powerful tool for subcellular localization prediction. At the end of the article, influences on prediction accuracy using different weighting proportions among three characteristic vector categories have been discussed. An appropriate proportion is considered by increasing the prediction accuracy.
Clynne, Michael A.; Christiansen, Robert L.; Trimble, Deborah A.; McGeehin, John P.
2008-01-01
This contribution reports radiocarbon ages obtained from charcoal, wood and other samples collected between 1979 and 2001 in Lassen Volcanic National Park and vicinity and a few samples from other nearby localities. Most of the samples are from the Chaos Crags and Cinder Cone eruptive sequences. Brief summaries are given of the Chaos Crags and Cinder Cone eruptive sequences.
Processing Translational Motion Sequences.
1982-10-01
the initial ROADSIGN image using a (del)**2g mask with a width of 5 pixels The distinctiveness values were computed using features which were 5x5 pixel...the initial step size of the local search quite large. 34 4. EX P R g NTg The following experiments were performed using the roadsign and industrial...the initial image of the sequence. The third experiment involves processing the roadsign image sequence using the features extracted at the positions
Roux-Rouquie, M; Marilley, M
2000-09-15
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf
2014-01-01
CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB PMID:25281234
Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf
2014-01-01
CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB. © The Author(s) 2014. Published by Oxford University Press.
NASA Astrophysics Data System (ADS)
Ghosh, Subhajit; Bose, Santanu; Mandal, Nibir; Das, Animesh
2018-03-01
This study integrates field evidence with laboratory experiments to show the mechanical effects of a lithologically contrasting stratigraphic sequence on the development of frontal thrusts: Main Boundary Thrust (MBT) and Daling Thrust (DT) in the Darjeeling-Sikkim Himalaya (DSH). We carried out field investigations mainly along two river sections in the DSH: Tista-Kalijhora and Mahanadi, covering an orogen-parallel stretch of 20 km. Our field observations suggest that the coal-shale dominated Gondwana sequence (sandwiched between the Daling Group in the north and Siwaliks in the south) has acted as a mechanically weak horizon to localize the MBT and DT. We simulated a similar mechanical setting in scaled model experiments to validate our field interpretation. In experiments, such a weak horizon at a shallow depth perturbs the sequential thrust progression, and causes a thrust to localize in the vicinity of the weak zone, splaying from the basal detachment. We correlate this weak-zone-controlled thrust with the DT, which accommodates a large shortening prior to activation of the weak zone as a new detachment with ongoing horizontal shortening. The entire shortening in the model is then transferred to this shallow detachment to produce a new sequence of thrust splays. Extrapolating this model result to the natural prototype, we show that the mechanically weak Gondwana Sequence has caused localization of the DT and MBT in the mountain front of DSH.
MannDB: A microbial annotation database for protein characterization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C; Lam, M; Smith, J
2006-05-19
MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less
Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.
Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera
2017-01-23
Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.
Persinger, M A
1993-06-01
Both theoretical and empirical observations suggest that significant alterations in self-concept should occur following most closed head injuries because of diffuse synaptic modification within the temporofrontal regions; this loss of the sense of self should evoke a grief-like response sequence and should encourage paranormal/religious experiences during the subsequent months to years. The marked consistency between phenomenological experiences and the results of neuropsychological assessments of 56 adults who had sustained brain injuries supported this hypothesis. Subsequent reports by these patients indicated that clinical translation of posttraumatic experiences into rational neurobehavioral terms and interventions tailored for the individual's specific pattern of brain "dysfunction" may facilitate adaptation during the grieving period.
Bentley, Amy R.; Chen, Guanjie; Shriner, Daniel; Doumatey, Ayo P.; Zhou, Jie; Huang, Hanxia; Mullikin, James C.; Blakesley, Robert W.; Hansen, Nancy F.; Bouffard, Gerard G.; Cherukuri, Praveen F.; Maskeri, Baishali; Young, Alice C.; Adeyemo, Adebowale; Rotimi, Charles N.
2014-01-01
Although a considerable proportion of serum lipids loci identified in European ancestry individuals (EA) replicate in African Americans (AA), interethnic differences in the distribution of serum lipids suggest that some genetic determinants differ by ethnicity. We conducted a comprehensive evaluation of five lipid candidate genes to identify variants with ethnicity-specific effects. We sequenced ABCA1, LCAT, LPL, PON1, and SERPINE1 in 48 AA individuals with extreme serum lipid concentrations (high HDLC/low TG or low HDLC/high TG). Identified variants were genotyped in the full population-based sample of AA (n = 1694) and tested for an association with serum lipids. rs328 (LPL) and correlated variants were associated with higher HDLC and lower TG. Interestingly, a stronger effect was observed on a “European” vs. “African” genetic background at this locus. To investigate this effect, we evaluated the region among West Africans (WA). For TG, the effect size among WA was the same in AA with only African local ancestry (2–3% lower TG), while the larger association among AA with local European ancestry matched previous reports in EA (10%). For HDLC, there was no association with rs328 in AA with only African local ancestry or in WA, while the association among AA with European local ancestry was much greater than what has been observed for EA (15 vs. ∼5 mg/dl), suggesting an interaction with an environmental or genetic factor that differs by ethnicity. Beyond this ancestry effect, the importance of African ancestry-focused, sequence-based work was also highlighted by serum lipid associations of variants that were in higher frequency (or present only) among those of African ancestry. By beginning our study with the sequence variation present in AA individuals, investigating local ancestry effects, and seeking replication in WA, we were able to comprehensively evaluate the role of a set of candidate genes in serum lipids in AA. PMID:24603370
NASA Astrophysics Data System (ADS)
Guimarães, L. F.; Raposo, M. I. B.; Janasi, V. A.; Cañón-Tapia, E.; Polo, L. A.
2018-04-01
In the Southern portion of the Paraná-Etendeka Magmatic Province in Brazil, extensive silicic (dacite-rhyolite) deposits occur at the top of a sequence of low-Ti pahoehoe to rubbly basalts. The internal architecture of the silicic deposits and their eruptive style, as well as the location of their sources are still unsatisfactorily known. In an attempt to provide independent evidence for flow directions in deposits previously characterized as effusive, and test the hypothesis of local sources, we carried out anisotropy of magnetic susceptibility (AMS) studies on the two main silicic units (Caxias do Sul dacites and Santa Maria Rhyolites) with the best exposures in an area previously mapped in detail. Magnetic anisotropies were determined on oriented cylindrical specimens from a total of 28 sites. Rock magnetism properties indicate that "pseudo-single-domain" magnetite carries the fabrics and the remanence. Magnetic fabrics were determined by applying anisotropy of low-field magnetic susceptibility (AMS) and anisotropy of anhysteretic remanent magnetization (AARM). Both AMS and AARM tensors are coaxial, indicating that the AMS fabric is not affected by the effect of magnetite single-domain grains. Magnetic data from several dacitic coulées (Caxias do Sul unit) indicate flows from SE to NW. The location and spatial distribution of these lavas support the hypothesis of local sources, aligned along a NE-SW trend. These data are in agreement with the alignments of structures (dome-shaped hills) observed in field work and DEM images. On the other hand, magnetic data obtained in Santa Maria rhyolites indicate that flow directions in two different areas are distinct (towards NW/NE and W), suggesting that they derived from different emission centers. So, regarding the silicic volcanism in the studied region, our data do not support the model which classifies the entire silicic volcanism of the province as extensive rheomorphic pyroclastic deposits released from a central conduit. In contrast, we propose the occurrence of local volcanic events, implying in the existence of different sources, possibly characterized by local emission centers.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fields, C.A.
1994-09-01
This Report concludes the DOE Human Genome Program project, ``Identification of Genes in Anonymous DNA Sequence.`` The central goals of this project have been (1) understanding the problem of identifying genes in anonymous sequences, and (2) development of tools, primarily the automated identification system gm, for identifying genes. The activities supported under the previous award are summarized here to provide a single complete report on the activities supported as part of the project from its inception to its completion.
ERIC Educational Resources Information Center
Sartory, Katharina; Jungermann, Anja-Kristin; Järvinen, Hanna
2017-01-01
External support by a local coordinating agency facilitates the work of school-to-school networks. This study provides an innovative theoretical framework to analyse how support provided by local education offices for school-to-school networks is perceived by the participating teachers. Based on a quantitative survey and qualitative interview data…
NASA Astrophysics Data System (ADS)
Doan, Bich-Thuy; Autret, Gwennhael; Mispelter, Joël; Méric, Philippe; Même, William; Montécot-Dubourg, Céline; Corrèze, Jean-Loup; Szeremeta, Frédéric; Gillet, Brigitte; Beloeil, Jean-Claude
2009-05-01
13C spectroscopy combined with the injection of 13C-labeled substrates is a powerful method for the study of brain metabolism in vivo. Since highly localized measurements are required in a heterogeneous organ such as the brain, it is of interest to augment the sensitivity of 13C spectroscopy by proton acquisition. Furthermore, as focal cerebral lesions are often encountered in animal models of disorders in which the two brain hemispheres are compared, we wished to develop a bi-voxel localized sequence for the simultaneous bilateral investigation of rat brain metabolism, with no need for external additional references. Two sequences were developed at 9.4 T: a bi-voxel 1H-( 13C) STEAM-POCE (Proton Observed Carbon Edited) sequence and a bi-voxel 1H-( 13C) PRESS-POCE adiabatically decoupled sequence with Hadamard encoding. Hadamard encoding allows both voxels to be recorded simultaneously, with the same acquisition time as that required for a single voxel. The method was validated in a biological investigation into the neuronal damage and the effect on the Tri Carboxylic Acid cycle in localized excitotoxic lesions. Following an excitotoxic quinolinate-induced localized lesion in the rat cortex and the infusion of U- 13C glucose, two 1H-( 13C) spectra of distinct (4 × 4 × 4 mm 3) voxels, one centred on the injured hemisphere and the other on the contralateral hemisphere, were recorded simultaneously. Two 1H bi-voxel spectra were also recorded and showed a significant decrease in N-acetyl aspartate, and an accumulation of lactate in the ipsilateral hemisphere. The 1H-( 13C) spectra could be recorded dynamically as a function of time, and showed a fall in the glutamate/glutamine ratio and the presence of a stable glutamine pool, with a permanent increase of lactate in the ipsilateral hemisphere. This bi-voxel 1H-( 13C) method can be used to investigate simultaneously both brain hemispheres, and to perform dynamic studies. We report here the neuronal damage and the effect on the Tri Carboxylic Acid cycle in localized excitotoxic lesions.
ERIC Educational Resources Information Center
Stephens, A. Lynn
2012-01-01
The purpose of this study is to investigate student interactions with simulations, and teacher support of those interactions, within naturalistic high school physics classroom settings. This study focuses on data from two lesson sequences that were conducted in several physics classrooms. The lesson sequences were conducted in a whole class…
Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan
2011-01-01
The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education.