sequence-based hybrid predictor: Topics by Science.gov

Sample records for sequence-based hybrid predictor

SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

PubMed

Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

2015-01-01

Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.
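The merging of a feature predictor and a template predictor can be pictured as a per-residue score combination. The sketch below is only an illustration under stated assumptions: the abstract does not say how SNBRFinder combines its two components, so the function name `hybrid_score`, the weight `w`, and the fallback to the feature score when no template is found are all hypothetical.

```python
from typing import List, Optional, Sequence


def hybrid_score(
    feature_scores: Sequence[float],
    template_scores: Optional[Sequence[float]],
    w: float = 0.5,
) -> List[float]:
    """Combine per-residue scores from two predictors (hypothetical scheme).

    feature_scores: one score per residue from a feature-based predictor.
    template_scores: one score per residue from a template-based predictor,
        or None when no usable template was found for the query.
    w: assumed weight given to the template score (not from the paper).
    """
    if template_scores is None:
        # No template available: rely on the feature predictor alone.
        return list(feature_scores)
    if len(feature_scores) != len(template_scores):
        raise ValueError("both predictors must score the same residues")
    return [(1.0 - w) * f + w * t for f, t in zip(feature_scores, template_scores)]
```

A residue would then be called binding when its combined score exceeds a threshold chosen on validation data.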
A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.

PubMed

Liu, Yu-Cheng; Yang, Meng-Han; Lin, Win-Li; Huang, Chien-Kang; Oyang, Yen-Jen

2009-12-03

Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. One category of predictors attempts to identify conformationally flexible regions through analysis of protein tertiary structures. Another category of predictors works completely based on analysis of the polypeptide sequences. As the availability of protein tertiary structures is generally limited, the design of predictors that work completely based on sequence information is crucial for advances of molecular biology research. In this article, we propose a novel approach to design a sequence-based predictor for identifying conformationally ambivalent regions in proteins. The novelty in the design stems from incorporating two classifiers based on two distinctive supervised learning algorithms that provide complementary prediction powers. Experimental results show that the overall performance delivered by the hybrid predictor proposed in this article is superior to the performance delivered by the existing predictors. Furthermore, the case study presented in this article demonstrates that the proposed hybrid predictor is capable of providing the biologists with valuable clues about the functional sites in a protein chain. The proposed hybrid predictor provides the users with two optional modes, namely, the high-sensitivity mode and the high-specificity mode. The experimental results with an independent testing data set show that the proposed hybrid predictor is capable of delivering sensitivity of 0.710 and specificity of 0.608 under the high-sensitivity mode, while delivering sensitivity of 0.451 and specificity of 0.787 under the high-specificity mode. 
Though experimental results show that the hybrid approach designed to exploit the complementary prediction powers of distinctive supervised learning algorithms works more effectively than conventional approaches, there remains considerable room for further improvement with respect to the achieved performance. In this respect, it is of interest to investigate the effects of exploiting additional physicochemical properties that are related to conformational ambivalence. Furthermore, it is of interest to investigate the effects of incorporating recently developed machine learning approaches, e.g. the random forest design and the multi-stage design. As conformational transition plays a key role in carrying out several essential types of biological functions, the design of more advanced predictors for identifying conformationally ambivalent regions in proteins deserves our continuous attention.
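The trade-off between the two modes can be illustrated with a small sketch. It assumes, purely for illustration, that the high-sensitivity mode flags a residue when either classifier does and the high-specificity mode only when both do; the abstract does not state how the modes are actually implemented, and the function names are hypothetical.

```python
from typing import List, Sequence


def combine(pred_a: Sequence[int], pred_b: Sequence[int], mode: str) -> List[int]:
    """Merge two binary per-residue predictions (assumed union/intersection rule)."""
    if mode == "high_sensitivity":
        return [int(a or b) for a, b in zip(pred_a, pred_b)]   # either classifier
    if mode == "high_specificity":
        return [int(a and b) for a, b in zip(pred_a, pred_b)]  # both classifiers
    raise ValueError("mode must be 'high_sensitivity' or 'high_specificity'")


def sensitivity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """TP / (TP + FN): fraction of true positives that were recovered."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn)


def specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """TN / (TN + FP): fraction of true negatives that were left unflagged."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tn / (tn + fp)
```

Under this assumed rule, the union can only raise sensitivity and the intersection can only raise specificity relative to either classifier alone, which mirrors the direction of the reported figures.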
Transcription profile of boar spermatozoa as revealed by RNA-sequencing

USDA-ARS's Scientific Manuscript database

High-throughput RNA sequencing (RNA-Seq) overcomes the limitations of the current hybridization-based techniques to detect the actual pool of RNA transcripts in spermatozoa. The application of this technology in livestock can speed the discovery of potential predictors of male fertility. As a first ...
Ensemble Linear Neighborhood Propagation for Predicting Subchloroplast Localization of Multi-Location Proteins.

PubMed

Wan, Shibiao; Mak, Man-Wai; Kung, Sun-Yuan

2016-12-02

In the postgenomic era, the number of unreviewed protein sequences is remarkably larger and grows tremendously faster than that of reviewed ones. However, existing methods for protein subchloroplast localization often ignore the information from these unlabeled proteins. This paper proposes a multi-label predictor based on ensemble linear neighborhood propagation (LNP), namely, LNP-Chlo, which leverages hybrid sequence-based feature information from both labeled and unlabeled proteins for predicting localization of both single- and multi-label chloroplast proteins. Experimental results on a stringent benchmark dataset and a novel independent dataset suggest that LNP-Chlo performs at least 6% (absolute) better than state-of-the-art predictors. This paper also demonstrates that ensemble LNP significantly outperforms LNP based on individual features. For readers' convenience, the online Web server LNP-Chlo is freely available at http://bioinfo.eie.polyu.edu.hk/LNPChloServer/ .
Beyond Genomic Prediction: Combining Different Types of omics Data Can Improve Prediction of Hybrid Performance in Maize.

PubMed

Schrag, Tobias A; Westhues, Matthias; Schipprack, Wolfgang; Seifert, Felix; Thiemann, Alexander; Scholten, Stefan; Melchinger, Albrecht E

2018-04-01

The ability to predict the agronomic performance of single-crosses with high precision is essential for selecting superior candidates for hybrid breeding. With recent technological advances, thousands of new parent lines, and, consequently, millions of new hybrid combinations are possible in each breeding cycle, yet only a few hundred can be produced and phenotyped in multi-environment yield trials. Well established prediction approaches such as best linear unbiased prediction (BLUP) using pedigree data and whole-genome prediction using genomic data are limited in capturing epistasis and interactions occurring within and among downstream biological strata such as transcriptome and metabolome. Because mRNA and small RNA (sRNA) sequences are involved in transcriptional, translational and post-translational processes, we expect them to provide information influencing several biological strata. However, using sRNA data of parent lines to predict hybrid performance has not yet been addressed. Here, we gathered genomic, transcriptomic (mRNA and sRNA) and metabolomic data of parent lines to evaluate the ability of the data to predict the performance of untested hybrids for important agronomic traits in grain maize. We found a considerable interaction for predictive ability between predictor and trait, with mRNA data being a superior predictor for grain yield and genomic data for grain dry matter content, while sRNA performed relatively poorly for both traits. Combining mRNA and genomic data as predictors resulted in high predictive abilities across both traits and combining other predictors improved prediction over that of the individual predictors alone. We conclude that downstream "omics" can complement genomics for hybrid prediction, and, thereby, contribute to more efficient selection of hybrid candidates. Copyright © 2018 by the Genetics Society of America.
Multiplex analysis of DNA

DOEpatents

Church, George M.; Kieffer-Higgins, Stephen

1992-01-01

This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.
HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins.

PubMed

Wan, Shibiao; Mak, Man-Wai; Kung, Sun-Yuan

2014-01-01

Protein subcellular localization prediction, as an essential step to elucidate the functions in vivo of proteins and identify drugs targets, has been extensively studied in previous decades. Instead of only determining subcellular localization of single-label proteins, recent studies have focused on predicting both single- and multi-location proteins. Computational methods based on Gene Ontology (GO) have been demonstrated to be superior to methods based on other features. However, existing GO-based methods focus on the occurrences of GO terms and disregard their relationships. This paper proposes a multi-label subcellular-localization predictor, namely HybridGO-Loc, that leverages not only the GO term occurrences but also the inter-term relationships. This is achieved by hybridizing the GO frequencies of occurrences and the semantic similarity between GO terms. Given a protein, a set of GO terms are retrieved by searching against the gene ontology database, using the accession numbers of homologous proteins obtained via BLAST search as the keys. The frequency of GO occurrences and semantic similarity (SS) between GO terms are used to formulate frequency vectors and semantic similarity vectors, respectively, which are subsequently hybridized to construct fusion vectors. An adaptive-decision based multi-label support vector machine (SVM) classifier is proposed to classify the fusion vectors. Experimental results based on recent benchmark datasets and a new dataset containing novel proteins show that the proposed hybrid-feature predictor significantly outperforms predictors based on individual GO features as well as other state-of-the-art predictors. For readers' convenience, the HybridGO-Loc server, which is for predicting virus or plant proteins, is available online at http://bioinfo.eie.polyu.edu.hk/HybridGoServer/.
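The construction of a fusion vector from GO term frequencies and semantic similarities can be sketched as follows. This is a minimal illustration, assuming that "hybridizing" means concatenating the two vectors and that each similarity entry is the best match between a vocabulary term and the protein's own terms; the function names and the similarity callback `sim` are hypothetical and stand in for whatever measure the authors used.

```python
from typing import Callable, List, Sequence


def frequency_vector(protein_terms: Sequence[str], vocabulary: Sequence[str]) -> List[float]:
    """Count how often each vocabulary GO term occurs among the protein's terms."""
    return [float(protein_terms.count(term)) for term in vocabulary]


def similarity_vector(
    protein_terms: Sequence[str],
    vocabulary: Sequence[str],
    sim: Callable[[str, str], float],
) -> List[float]:
    """For each vocabulary term, keep its highest similarity to any protein term."""
    return [
        max((sim(term, own) for own in protein_terms), default=0.0)
        for term in vocabulary
    ]


def fusion_vector(
    protein_terms: Sequence[str],
    vocabulary: Sequence[str],
    sim: Callable[[str, str], float],
) -> List[float]:
    """Concatenate frequency and similarity vectors (assumed hybridization rule)."""
    return frequency_vector(protein_terms, vocabulary) + similarity_vector(
        protein_terms, vocabulary, sim
    )
```

The resulting vector would be the input to a multi-label classifier such as the SVM described in the abstract.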
Predicting PDZ domain mediated protein interactions from structure

PubMed Central

2013-01-01

Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. 
We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW. PMID:23336252
Small RNA-based prediction of hybrid performance in maize.

PubMed

Seifert, Felix; Thiemann, Alexander; Schrag, Tobias A; Rybka, Dominika; Melchinger, Albrecht E; Frisch, Matthias; Scholten, Stefan

2018-05-21

Small RNA (sRNA) sequences are known to have a broad impact on gene regulation by various mechanisms. Their performance for the prediction of hybrid traits has not yet been analyzed. Our objective was to analyze the relation of parental sRNA expression with the performance of their hybrids, to develop an sRNA-based prediction approach, and to compare it to more common SNP and mRNA transcript based predictions using a factorial mating scheme of a maize hybrid breeding program. Correlation of genomic differences and messenger RNA (mRNA) or sRNA expression differences between parental lines with hybrid performance of their hybrids revealed that sRNAs showed an inverse relationship in contrast to the other two data types. We associated differences for SNPs, mRNA and sRNA expression between parental inbred lines with the performance of their hybrid combinations and developed two prediction approaches using distance measures based on associated markers. Cross-validations revealed parental differences in sRNA expression to be strong predictors for hybrid performance for grain yield in maize, comparable to genomic and mRNA data. The integration of both positively and negatively associated markers in the prediction approaches enhanced the prediction accuracy. The associated sRNAs belong predominantly to the canonical size classes of 22- and 24-nt that show specific genomic mapping characteristics. Expression profiles of sRNA are a promising alternative to SNPs or mRNA expression profiles for hybrid prediction, especially for plant species without reference genome or transcriptome information. The characteristics of the sRNAs we identified suggest that association studies based on breeding populations facilitate the identification of sRNAs involved in hybrid performance.
Predicting DNA hybridization kinetics from sequence

NASA Astrophysics Data System (ADS)

Zhang, Jinny X.; Fang, John Z.; Duan, Wei; Wu, Lucia R.; Zhang, Angela W.; Dalchau, Neil; Yordanov, Boyan; Petersen, Rasmus; Phillips, Andrew; Zhang, David Yu

2018-01-01

Hybridization is a key molecular process in biology and biotechnology, but so far there is no predictive model for accurately determining hybridization rate constants based on sequence information. Here, we report a weighted neighbour voting (WNV) prediction algorithm, in which the hybridization rate constant of an unknown sequence is predicted based on similar reactions with known rate constants. To construct this algorithm we first performed 210 fluorescence kinetics experiments to observe the hybridization kinetics of 100 different DNA target and probe pairs (36 nt sub-sequences of the CYCS and VEGF genes) at temperatures ranging from 28 to 55 °C. Automated feature selection and weighting optimization resulted in a final six-feature WNV model, which can predict hybridization rate constants of new sequences to within a factor of 3 with ∼91% accuracy, based on leave-one-out cross-validation. Accurate prediction of hybridization kinetics allows the design of efficient probe sequences for genomics research.
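The idea of weighted neighbour voting can be sketched in a few lines. The code below is an illustrative reading of the abstract, not the published model: the weighted Euclidean feature distance, the exponential vote kernel, the `bandwidth` parameter, and the function names are all assumptions. Rate constants are handled as log10 values so that "within a factor of 3" becomes a simple difference check.

```python
import math
from typing import List, Sequence, Tuple


def wnv_predict(
    query: Sequence[float],
    known: List[Tuple[Sequence[float], float]],
    feature_weights: Sequence[float],
    bandwidth: float = 1.0,
) -> float:
    """Predict log10(rate constant) as a distance-weighted vote of known reactions.

    known: list of (feature_vector, log10_rate_constant) pairs.
    feature_weights: assumed per-feature weights in the distance.
    """
    numerator = 0.0
    denominator = 0.0
    for features, log_k in known:
        distance = math.sqrt(
            sum(w * (q - f) ** 2 for w, q, f in zip(feature_weights, query, features))
        )
        vote = math.exp(-distance / bandwidth)  # closer reactions vote more strongly
        numerator += vote * log_k
        denominator += vote
    return numerator / denominator


def within_factor(pred_log_k: float, true_log_k: float, factor: float = 3.0) -> bool:
    """True when the predicted rate constant is within `factor` of the true one."""
    return abs(pred_log_k - true_log_k) <= math.log10(factor)
```

In a leave-one-out evaluation, each reaction would be predicted from all the others and the fraction passing `within_factor` reported as the accuracy.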
Predicting Functions of Proteins in Mouse Based on Weighted Protein-Protein Interaction Network and Protein Hybrid Properties

PubMed Central

Shi, Xiaohe; Lu, Wen-Cong; Cai, Yu-Dong; Chou, Kuo-Chen

2011-01-01

Background With the huge amount of uncharacterized protein sequences generated in the post-genomic age, it is highly desirable to develop effective computational methods for quickly and accurately predicting their functions. The information thus obtained would be very useful for both basic research and drug development in a timely manner. Methodology/Principal Findings Although many efforts have been made in this regard, most of them were based on either sequence similarity or protein-protein interaction (PPI) information. However, the former often fails to work if a query protein has no or very little sequence similarity to any function-known proteins, while the latter has a similar problem if the relevant PPI information is not available. In view of this, a new approach is proposed by hybridizing the PPI information and the biochemical/physicochemical features of protein sequences. The overall first-order success rates by the new predictor for the functions of mouse proteins on training set and test set were 69.1% and 70.2%, respectively, and the success rate covered by the results of the top-4 order from a total of 24 orders was 65.2%. Conclusions/Significance The results indicate that the new approach is quite promising and may open a new avenue for addressing this difficult and complicated problem. PMID:21283518
BiRen: predicting enhancers with a deep-learning-based model using the DNA sequence alone.

PubMed

Yang, Bite; Liu, Feng; Ren, Chao; Ouyang, Zhangyi; Xie, Ziwei; Bo, Xiaochen; Shu, Wenjie

2017-07-01

Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. The BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen. Contact: shuwj@bmi.ac.cn or boxc@bmi.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Automatic prediction of protein domains from sequence information using a hybrid learning system.

PubMed

Nagarajan, Niranjan; Yona, Golan

2004-06-12

We describe a novel method for detecting the domain structure of a protein from sequence information alone. The method is based on analyzing multiple sequence alignments that are derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence and are combined into a single predictor using a neural network. The output is further smoothed and post-processed using a probabilistic model to predict the most likely transition positions between domains. The method was assessed using the domain definitions in SCOP and CATH for proteins of known structure and was compared with several other existing methods. Our method performs well both in terms of accuracy and sensitivity. It improves significantly over the best methods available, even some of the semi-manual ones, while being fully automatic. Our method can also be used to suggest and verify domain partitions based on structural data. A few examples of predicted domain definitions and alternative partitions, as suggested by our method, are also discussed. An online domain-prediction server is available at http://biozon.org/tools/domains/
Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy

Phylogenetic analyses were done for the Shewanella strains isolated from the Baltic Sea (38 strains), the US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three outgroup strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of the 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of the Shewanella genome selected from a shared gene list of the Shewanella strains with whole genomes sequenced, based on their average nucleotide identity (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at the species and strain level within the Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WCGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an ideal alternative to conventional DNA-DNA hybridization methods and that it is superior to phylogenetic methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.
DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.

PubMed

Ma, Xin; Guo, Jing; Sun, Xiao

2016-01-01

DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains two types of novel sequence features, which reflect information about the conservation of physicochemical properties of the amino acids, and the binding propensity of DNA-binding residues and non-binding propensities of non-binding residues. The comparisons with each feature demonstrated that these two novel features contributed most to the improvement in predictive ability. Furthermore, to improve the prediction performance of the DNABP model, feature selection using the minimum redundancy maximum relevance (mRMR) method combined with incremental feature selection (IFS) was carried out during the model construction. The results showed that the DNABP model could achieve 86.90% accuracy, 83.76% sensitivity, 90.03% specificity and a Matthews correlation coefficient of 0.727. High prediction accuracy and performance comparisons with previous research suggested that DNABP could be a useful approach to identify DNA-binding proteins from sequence information. The DNABP web server system is freely available at http://www.cbi.seu.edu.cn/DNABP/.
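The four reported measures all derive from the same confusion matrix. As a reminder of how they relate, here is a minimal sketch using the standard definitions; the function name `binary_metrics` is hypothetical, and the counts passed in the usage line are made up for illustration rather than taken from the DNABP evaluation.

```python
import math
from typing import Dict


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> Dict[str, float]:
    """Accuracy, sensitivity, specificity and MCC from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)   # recall on the positive (binding) class
    specificity = tn / (tn + fp)   # recall on the negative (non-binding) class
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


# Illustrative counts only (not the paper's data).
example = binary_metrics(tp=8, fp=1, tn=9, fn=2)
```

Unlike accuracy, the Matthews correlation coefficient uses all four cells of the matrix, which is why it is often reported alongside sensitivity and specificity for imbalanced data.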
Detection of cystic fibrosis mutations in a GeneChip™ assay format

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miyada, C.G.; Cronin, M.T.; Kim, S.M.

1994-09-01

We are developing assays for the detection of cystic fibrosis mutations based on DNA hybridization. A DNA sample is amplified by PCR, labeled by incorporating a fluorescein-tagged dNTP, enzymatically treated to produce smaller fragments and hybridized to a series of short (13-16 bases) oligonucleotides synthesized on a glass surface via photolithography. The hybrids are detected by epifluorescence and mutations are identified by the specific pattern of hybridization. In a GeneChip assay, the chip surface is composed of a series of subarrays, each being specific for a particular mutation. Each subarray is further subdivided into a series of probes (40 total), half based on the mutant sequence and the remainder based on the wild-type sequence. For each of the subarrays, there is a redundancy in the number of probes that should hybridize to either a wild-type or a mutant target. The multiple probe strategy provides sequence information for a short five base region overlapping the mutation site. In addition, homozygous wild-type and mutant as well as heterozygous samples are each identified by a specific pattern of hybridization. The small size of each probe feature (250 x 250 µm²) permits the inclusion of additional probes required to generate sequence information by hybridization.
HYBRIDIZATION PROPERTIES OF DNA SEQUENCES DIRECTING THE SYNTHESIS OF MESSENGER RNA AND HETEROGENEOUS NUCLEAR RNA

PubMed Central

Greenberg, Jay R.; Perry, Robert P.

1971-01-01

The relationship of the DNA sequences from which polyribosomal messenger RNA (mRNA) and heterogeneous nuclear RNA (NRNA) of mouse L cells are transcribed was investigated by means of hybridization kinetics and thermal denaturation of the hybrids. Hybridization was performed in formamide solutions at DNA excess. Under these conditions most of the hybridizing mRNA and NRNA react at values of D₀t (DNA concentration multiplied by time) expected for RNA transcribed from the nonrepeated or rarely repeated fraction of the genome. However, a fraction of both mRNA and NRNA hybridize at values of D₀t about 10,000 times lower, and therefore must be transcribed from highly redundant DNA sequences. The fraction of NRNA hybridizing to highly repeated sequences is about 1.7 times greater than the corresponding fraction of mRNA. The hybrids formed by the rapidly reacting fractions of both NRNA and mRNA melt over a narrow temperature range with a midpoint about 11°C below that of native L cell DNA. This indicates that these hybrids consist of partially complementary sequences with approximately 11% mismatching of bases. Hybrids formed by the slowly reacting fraction of NRNA melt within 4°–6°C of native DNA, indicating very little, if any, mismatching of bases. Hybrids of the slowly reacting components of mRNA, formed under conditions of sufficiently low RNA input, have a high thermal stability, similar to that observed for hybrids of the slowly reacting NRNA component. However, when higher inputs of mRNA are used, hybrids are formed which have a strikingly lower thermal stability. This observation can be explained by assuming that there is sufficient similarity among the relatively rare DNA sequences coding for mRNA so that under hybridization conditions, in which these DNA sequences are not truly in excess, reversible hybrids exhibiting a considerable amount of mispairing are formed. 
The fact that a comparable phenomenon has not been observed for NRNA may mean that there is less similarity among the relatively rare DNA sequences coding for NRNA than there is among the rare sequences coding for mRNA. PMID:4999767
Molecular characterizations of somatic hybrids developed between Pleurotus florida and Lentinus squarrosulus through inter-simple sequence repeat markers and sequencing of ribosomal RNA-ITS gene.

PubMed

Mallick, Pijush; Chattaraj, Shruti; Sikdar, Samir Ranjan

2017-10-01

The 12 pfls somatic hybrids and 2 parents of Pleurotus florida and Lentinus squarrosulus were characterized by ISSR and sequencing of rRNA-ITS genes. Five ISSR primers were used and amplified a total of 54 reproducible fragments with 98.14% polymorphism among all the pfls hybrid populations and parental strains. UPGMA-based cluster exhibited a dendrogram with three major groups between the parents and pfls hybrids. Parent P. florida and L. squarrosulus showed different degrees of genetic distance with all the hybrid lines and they showed closeness to hybrid pfls 1m and pfls 1h, respectively. ITS1(F) and ITS4(R) amplified the rRNA-ITS gene with 611-867 bp sequence length. The nucleotide polymorphisms were found in the ITS1, ITS2 and 5.8S rRNA region with different number of bases. Based on rRNA-ITS sequence, UPGMA cluster exhibited three distinct groups between L. squarrosulus and pfls 1p, pfls 1m and pfls 1s, and pfls 1e and P. florida.
Seasonal drought predictability in Portugal using statistical-dynamical techniques

NASA Astrophysics Data System (ADS)

Ribeiro, A. F. S.; Pires, C. A. L.

2016-08-01

Atmospheric forecasting and predictability are important to promote adaptation and mitigation measures in order to minimize drought impacts. This study estimates hybrid (statistical-dynamical) long-range forecasts of the regional drought index SPI (3-months) over homogeneous regions from mainland Portugal, based on forecasts from the UKMO operational forecasting system, with lead-times up to 6 months. ERA-Interim reanalysis data is used for the purpose of building a set of SPI predictors integrating recent past information prior to the forecast launching. Then, the advantage of combining predictors with both dynamical and statistical background in the prediction of drought conditions at different lags is evaluated. A two-step hybridization procedure is performed, in which both forecasted and observed 500 hPa geopotential height fields are subjected to a PCA in order to use forecasted PCs and persistent PCs as predictors. A second hybridization step consists of a statistical/hybrid downscaling to the regional SPI, based on regression techniques, after the pre-selection of the statistically significant predictors. The SPI forecasts and the added value of combining dynamical and statistical methods are evaluated in cross-validation mode, using the R2 and binary event scores. Results are obtained for the four seasons and it was found that winter is the most predictable season, and that most of the predictive power is in the large-scale fields from past observations. The hybridization improves the downscaling based on the forecasted PCs, since they provide complementary information (though modest) beyond that of persistent PCs. These findings provide clues about the predictability of the SPI, particularly in Portugal, and may contribute to the predictability of crop yields and to guidance for the decision-making processes of users such as farmers.
Determination of protein folding kinetic types using sequence and predicted secondary structure and solvent accessibility.

PubMed

Zhang, Hua; Zhang, Tuo; Gao, Jianzhao; Ruan, Jishou; Shen, Shiyi; Kurgan, Lukasz

2012-01-01

Proteins fold through either a two-state (TS) process, with no visible intermediates, or a multi-state (MS) process, via at least one intermediate. We analyze sequence-derived factors that determine folding types by introducing a novel sequence-based folding type predictor called FOKIT. This method implements a logistic regression model with six input features which hybridize information concerning amino acid composition and predicted secondary structure and solvent accessibility. FOKIT provides predictions with an average Matthews correlation coefficient (MCC) between 0.58 and 0.91, measured using out-of-sample tests on four benchmark datasets. These results are shown to be competitive with or better than the results of four modern predictors. We also show that FOKIT outperforms these methods when predicting chains that share low similarity with the chains used to build the model, which is an important advantage given the limited number of annotated chains. We demonstrate that inclusion of solvent accessibility helps in discriminating the folding kinetic types and that three of the features constitute statistically significant markers that differentiate TS and MS folders. We found that increased content of exposed Trp and buried Leu is indicative of MS folding, which implies that the exposure/burial of certain hydrophobic residues may play an important role in the formation of the folding intermediates. Our conclusions are supported by two case studies.
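The abstract above reports accuracy as the Matthews correlation coefficient. As a minimal sketch (not the authors' code), MCC for a TS-versus-MS classifier can be computed from the four confusion counts. The function name, the choice of MS as the positive class, and the toy labels are illustrative assumptions.

```python
import math

def matthews_corrcoef(y_true, y_pred, positive="MS"):
    """MCC for binary labels; returns 0.0 when the denominator is zero."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

# Toy labels, invented for illustration only (not data from the study).
truth = ["TS", "TS", "MS", "MS", "TS", "MS"]
preds = ["TS", "MS", "MS", "MS", "TS", "TS"]
print(round(matthews_corrcoef(truth, preds), 3))
```

MCC ranges from -1 to 1 and stays informative when the two classes have unequal sizes, which is one reason it is commonly preferred over plain accuracy for small annotated datasets.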

Hybrid Smith predictor and phase lead based divergence compensation for hardware-in-the-loop contact simulation with measurement delay

NASA Astrophysics Data System (ADS)

Qi, Chenkun; Gao, Feng; Zhao, Xianchao; Wang, Qian; Ren, Anye

2018-06-01

Ground-based hardware-in-the-loop (HIL) simulation is a good approach for testing the contact dynamics of the spacecraft docking process in space. Unfortunately, due to the time delay in the system, the HIL contact simulation becomes divergent. The traditional first-order phase lead compensation approach still results in a small divergence for the pure time delay, while the serial Smith predictor and phase lead compensation approach proposed recently by the authors leads to over-compensation and an obvious convergence. In this study, a hybrid Smith predictor and phase lead compensation approach is proposed. The hybrid Smith predictor and phase lead compensation can achieve a higher simulation fidelity with only a little convergence. The phase angle of the compensator is analyzed and the stability condition of the HIL simulation system is given. The effectiveness of the proposed compensation approach is tested by simulations on an undamped elastic contact process.
A novel hybrid method of beta-turn identification in protein using binary logistic regression and neural network

PubMed Central

Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz

2012-01-01

From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was used, for the first time, as an initial stage to select significant sequence parameters for the identification of β-turns by means of a re-substitution test procedure. The sequence parameters consisted of 80 amino acid positional occurrences and 20 amino acid percentages in the sequence. Among these parameters, the most significant ones selected by the binary logistic regression model were the percentages of Gly and Ser and the occurrence of Asn in position i+2, respectively. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed with the parameters selected by binary logistic regression to build a hybrid predictor. The networks were trained and tested on a non-homologous dataset of 565 protein chains. Applying a nine-fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with the results of other β-turn prediction methods. In conclusion, this study shows that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks leads to the development of more precise models for identifying β-turns in proteins. PMID:27418910
Branch classification: A new mechanism for improving branch predictor performance

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chang, P.Y.; Hao, E.; Patt, Y.

There is wide agreement that one of the most significant impediments to the performance of current and future pipelined superscalar processors is the presence of conditional branches in the instruction stream. Speculative execution is one solution to the branch problem, but speculative work is discarded if a branch is mispredicted. For it to be effective, speculative execution requires a very accurate branch predictor; 95% accuracy is not good enough. This paper proposes branch classification, a methodology for building more accurate branch predictors. Branch classification allows an individual branch instruction to be associated with the branch predictor best suited to predict its direction. Using this approach, a hybrid branch predictor can be constructed such that each component branch predictor predicts those branches for which it is best suited. To demonstrate the usefulness of branch classification, an example classification scheme is given and a new hybrid predictor is built based on this scheme which achieves a higher prediction accuracy than any branch predictor previously reported in the literature.
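The abstract does not spell out the classification scheme, so the following is only a sketch of the general idea under stated assumptions: branches are classified by their taken rate in a profiling run, strongly biased branches are handed to a static predictor, and the rest go to a per-branch 2-bit saturating counter. The class names, the 0.9 bias threshold, and the toy profile are all hypothetical, and each profiled branch is assumed to have at least one recorded outcome.

```python
from collections import defaultdict

def classify(profile, bias_threshold=0.9):
    """Assign each branch address a predictor class from profiled outcomes.

    profile: {branch_pc: non-empty list of bool outcomes (True = taken)}.
    """
    classes = {}
    for pc, outcomes in profile.items():
        rate = sum(outcomes) / len(outcomes)
        if rate >= bias_threshold:
            classes[pc] = "static_taken"
        elif rate <= 1 - bias_threshold:
            classes[pc] = "static_not_taken"
        else:
            classes[pc] = "dynamic"
    return classes

class HybridPredictor:
    """Routes each branch to the component predictor chosen by its class."""

    def __init__(self, classes):
        self.classes = classes
        # 2-bit saturating counters (0..3), initialised to weakly taken.
        self.counters = defaultdict(lambda: 2)

    def predict(self, pc):
        kind = self.classes.get(pc, "dynamic")
        if kind == "static_taken":
            return True
        if kind == "static_not_taken":
            return False
        return self.counters[pc] >= 2

    def update(self, pc, taken):
        if self.classes.get(pc, "dynamic") != "dynamic":
            return  # static predictions never change
        c = self.counters[pc]
        self.counters[pc] = min(3, c + 1) if taken else max(0, c - 1)

def accuracy(predictor, trace):
    """Fraction of correct predictions over a list of (pc, taken) pairs."""
    hits = 0
    for pc, taken in trace:
        hits += predictor.predict(pc) == taken
        predictor.update(pc, taken)
    return hits / len(trace)

# Toy profile, invented for illustration: one biased and one alternating branch.
profile = {0x10: [True] * 19 + [False], 0x20: [True, False] * 10}
classes = classify(profile)
predictor = HybridPredictor(classes)
```

The design point is that a biased branch no longer occupies or disturbs dynamic predictor state, leaving that state for the branches that actually need history.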
Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines.

PubMed

Majid, Abdul; Ali, Safdar; Iqbal, Mubashar; Kausar, Nabeela

2014-03-01

This study proposes a novel prediction approach for human breast and colon cancers using different feature spaces. The proposed scheme consists of two stages: the preprocessor and the predictor. In the preprocessor stage, the mega-trend diffusion (MTD) technique is employed to increase the samples of the minority class, thereby balancing the dataset. In the predictor stage, the machine-learning approaches of K-nearest neighbor (KNN) and support vector machines (SVM) are used to develop hybrid MTD-SVM and MTD-KNN prediction models. The MTD-SVM model provided the best values of accuracy, G-mean and Matthews correlation coefficient of 96.71%, 96.70% and 71.98% for the cancer/non-cancer dataset, the breast/non-breast cancer dataset and the colon/non-colon cancer dataset, respectively. We found that hybrid MTD-SVM is the best with respect to prediction performance and computational cost. The MTD-KNN model achieved moderately better prediction than hybrid MTD-NB (Naïve Bayes) but at the expense of higher computing cost. The MTD-KNN model is faster than MTD-RF (random forest) but its prediction is not better than that of MTD-RF. To the best of our knowledge, the reported results are the best results, so far, for these datasets. The proposed scheme indicates that the developed models can be used as a tool for the prediction of cancer. This scheme may be useful for the study of any sequential information such as protein sequences or nucleic acid sequences. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
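The G-mean quoted above is the geometric mean of sensitivity and specificity. The sketch below, with hypothetical confusion counts that are not taken from the study, illustrates why it is reported alongside accuracy for imbalanced data: a classifier that misses most minority samples can still score a high accuracy.

```python
import math

def g_mean(tp, fn, tn, fp):
    """Geometric mean of sensitivity (recall on positives) and specificity."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return math.sqrt(sensitivity * specificity)

def accuracy(tp, fn, tn, fp):
    """Fraction of all samples classified correctly."""
    return (tp + tn) / (tp + fn + tn + fp)

# Hypothetical counts: 10 minority (positive) samples, 90 majority samples.
counts = dict(tp=2, fn=8, tn=90, fp=0)
print(accuracy(**counts), g_mean(**counts))
```

With these counts the accuracy is high because the majority class dominates, while the G-mean is pulled down by the low sensitivity on the minority class.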
Differential evolution-simulated annealing for multiple sequence alignment

NASA Astrophysics Data System (ADS)

Addawe, R. C.; Addawe, J. M.; Sueño, M. R. K.; Magadia, J. C.

2017-10-01

Multiple sequence alignments (MSAs) are used in the analysis of molecular evolution and sequence-structure relationships. In this paper, a hybrid algorithm, Differential Evolution - Simulated Annealing (DESA), is applied to optimize MSAs based on structural information, non-gaps percentage and totally conserved columns. DESA is a robust algorithm characterized by self-organization, mutation, crossover, and an SA-like selection scheme of the strategy parameters. Here, the MSA problem is treated as a multi-objective optimization problem for the hybrid evolutionary algorithm DESA. Thus, we name the algorithm DESA-MSA. Simulated sequences and alignments were generated to evaluate the accuracy and efficiency of DESA-MSA using different indel sizes, sequence lengths, deletion rates and insertion rates. The proposed hybrid algorithm obtained acceptable solutions, particularly for the MSA problem evaluated based on the three objectives.
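Two of the three objectives named above can be computed directly from an alignment. The sketch below assumes the plain reading of each term, which the abstract does not define: non-gaps percentage is the share of alignment cells that are not gap characters, and a totally conserved column is one where every row carries the same residue and no gap. The function names and the toy alignment are illustrative, and the structural-information objective is not covered.

```python
def non_gap_percentage(alignment, gap="-"):
    """Percentage of cells in the alignment that are not gap characters."""
    total = sum(len(row) for row in alignment)
    non_gaps = sum(ch != gap for row in alignment for ch in row)
    return 100.0 * non_gaps / total

def totally_conserved_columns(alignment, gap="-"):
    """Number of columns in which all rows share one residue and no gap."""
    count = 0
    for column in zip(*alignment):
        if gap not in column and len(set(column)) == 1:
            count += 1
    return count

# Toy alignment of three equal-length rows, invented for illustration.
alignment = ["ACG-T", "A-GCT", "ACGCT"]
print(non_gap_percentage(alignment), totally_conserved_columns(alignment))
```

An optimizer such as the one described would evaluate candidate alignments with scores of this kind; how the objectives are weighted or combined is not stated in the abstract.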
Hybridization and sequencing of nucleic acids using base pair mismatches

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Cadmium sulfide nanocluster-based electrochemical stripping detection of DNA hybridization.

PubMed

Zhu, Ningning; Zhang, Aiping; He, Pingang; Fang, Yuzhi

2003-03-01

A novel, sensitive electrochemical DNA hybridization detection assay, using cadmium sulfide (CdS) nanoclusters as the oligonucleotide labeling tag, is described. The assay relies on the hybridization of the target DNA with the CdS nanocluster oligonucleotide DNA probe, followed by the dissolution of the CdS nanoclusters anchored on the hybrids and the indirect determination of the dissolved cadmium ions by sensitive anodic stripping voltammetry (ASV) at a mercury-coated glassy carbon electrode (GCE). The results showed that only a complementary sequence could form a double-stranded dsDNA-CdS with the DNA probe and give an obvious electrochemical response. A three-base mismatch sequence and non-complementary sequence had negligible response. The combination of the large number of cadmium ions released from each dsDNA hybrid with the remarkable sensitivity of the electrochemical stripping analysis for cadmium at mercury-film GCE allows detection at levels as low as 0.2 pmol L⁻¹ of the complementary sequence of DNA.
Improved detection of genetic markers of antimicrobial resistance by hybridization probe-based melting curve analysis using primers to mask proximal mutations: examples include the influenza H275Y substitution.

PubMed

Whiley, David M; Jacob, Kevin; Nakos, Jennifer; Bletchly, Cheryl; Nimmo, Graeme R; Nissen, Michael D; Sloots, Theo P

2012-06-01

Numerous real-time PCR assays have been described for detection of the influenza A H275Y alteration. However, the performance of these methods can be undermined by sequence variation in the regions flanking the codon of interest. This is a problem encountered more broadly in microbial diagnostics. In this study, we developed a modification of hybridization probe-based melting curve analysis, whereby primers are used to mask proximal mutations in the sequence targets of hybridization probes, so as to limit the potential for sequence variation to interfere with typing. The approach was applied to the H275Y alteration of the influenza A (H1N1) 2009 strain, as well as a Neisseria gonorrhoeae mutation associated with antimicrobial resistance. Assay performances were assessed using influenza A and N. gonorrhoeae strains characterized by DNA sequencing. The modified hybridization probe-based approach proved successful in limiting the effects of proximal mutations, with the results of melting curve analyses being 100% consistent with the results of DNA sequencing for all influenza A and N. gonorrhoeae strains tested. Notably, these included influenza A and N. gonorrhoeae strains exhibiting additional mutations in hybridization probe targets. Of particular interest was that the H275Y assay correctly typed influenza A strains harbouring a T822C nucleotide substitution, previously shown to interfere with H275Y typing methods. Overall our modified hybridization probe-based approach provides a simple means of circumventing problems caused by sequence variation, and offers improved detection of the influenza A H275Y alteration and potentially other resistance mechanisms.
An evolution based biosensor receptor DNA sequence generation algorithm.

PubMed

Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

2010-01-01

A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.

PubMed

Pan, Xiaoyong; Shen, Hong-Bin

2017-02-28

RNAs play key roles in cells through their interactions with proteins known as RNA-binding proteins (RBPs), and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence and structure, their domain-specific features and formats have posed significant computational challenges. One current difficulty is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) that uses a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol transforms the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture the interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with the experimentally verified results, suggesting iDeep is a promising approach in real-world applications. The iDeep framework not only achieves better performance than the state-of-the-art predictors, but also easily captures interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.
Kotai Antibody Builder: automated high-resolution structural modeling of antibodies.

PubMed

Yamashita, Kazuo; Ikeda, Kazuyoshi; Amada, Karlou; Liang, Shide; Tsuchiya, Yuko; Nakamura, Haruki; Shirai, Hiroki; Standley, Daron M

2014-11-15

Kotai Antibody Builder is a Web service for tertiary structural modeling of antibody variable regions. It consists of three main steps: hybrid template selection by sequence alignment and canonical rules, 3D rendering of alignments and CDR-H3 loop modeling. For the last step, in addition to rule-based heuristics used to build the initial model, a refinement option is available that uses fragment assembly followed by knowledge-based scoring. Using targets from the Second Antibody Modeling Assessment, we demonstrate that Kotai Antibody Builder generates models with an overall accuracy equal to that of the best-performing semi-automated predictors using expert knowledge. Kotai Antibody Builder is available at http://kotaiab.org. Contact: standley@ifrec.osaka-u.ac.jp. © The Author 2014. Published by Oxford University Press. All rights reserved.
Horseradish peroxidase-labeled oligonucleotides and fluorescent tyramides for rapid detection of chromosome-specific repeat sequences.

PubMed

van Gijlswijk, R P; Wiegant, J; Vervenne, R; Lasan, R; Tanke, H J; Raap, A K

1996-01-01

We present a sensitive and rapid fluorescence in situ hybridization (FISH) strategy for detecting chromosome-specific repeat sequences. It uses horseradish peroxidase (HRP)-labeled oligonucleotide sequences in combination with fluorescent tyramide-based detection. After in situ hybridization, the HRP conjugated to the oligonucleotide probe is used to deposit fluorescently labeled tyramide molecules at the site of hybridization. The method features full chemical synthesis of probes, strong FISH signals, and short processing periods, as well as multicolor capabilities.
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
Hierarchical assembly of viral nanotemplates with encoded microparticles via nucleic acid hybridization.

PubMed

Tan, Wui Siew; Lewis, Christina L; Horelik, Nicholas E; Pregibon, Daniel C; Doyle, Patrick S; Yi, Hyunmin

2008-11-04

We demonstrate hierarchical assembly of tobacco mosaic virus (TMV)-based nanotemplates with hydrogel-based encoded microparticles via nucleic acid hybridization. TMV nanotemplates possess a highly defined structure and a genetically engineered high density thiol functionality. The encoded microparticles are produced in a high throughput microfluidic device via stop-flow lithography (SFL) and consist of spatially discrete regions containing encoded identity information, an internal control, and capture DNAs. For the hybridization-based assembly, partially disassembled TMVs were programmed with linker DNAs that contain sequences complementary to both the virus 5' end and a selected capture DNA. Fluorescence microscopy, atomic force microscopy (AFM), and confocal microscopy results clearly indicate facile assembly of TMV nanotemplates onto microparticles with high spatial and sequence selectivity. We anticipate that our hybridization-based assembly strategy could be employed to create multifunctional viral-synthetic hybrid materials in a rapid and high-throughput manner. Additionally, we believe that these viral-synthetic hybrid microparticles may find broad applications in high capacity, multiplexed target sensing.
GeneChip™ screening assay for cystic fibrosis mutations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cronn, M.T.; Miyada, C.G.; Fucini, R.V.

1994-09-01

GeneChip™ assays are based on high-density, carefully designed arrays of short oligonucleotide probes (13-16 bases) built directly on derivatized silica substrates. DNA target sequence analysis is achieved by hybridizing fluorescently labeled amplification products to these arrays. Fluorescent hybridization signals located within the probe array are translated into target sequence information using the known probe sequence at each array feature. The mutation screening assay for cystic fibrosis includes sets of oligonucleotide probes designed to detect numerous different mutations that have been described in 14 exons and one intron of the CFTR gene. Each mutation site is addressed by a sub-array of at least 40 probe sequences, half designed to detect the wild type gene sequence and half designed to detect the reported mutant sequence. Hybridization with homozygous mutant, homozygous wild type or heterozygous targets results in distinctive hybridization patterns within a sub-array, permitting specific discrimination of each mutation. The GeneChip probe arrays are very small (approximately 1 cm²). Their miniature size coupled with their high information content make GeneChip probe arrays a useful and practical means for providing CF mutation analysis in a clinical setting.
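The three-way discrimination described above (homozygous wild type, homozygous mutant, heterozygous) can be illustrated with a deliberately simple rule on the two halves of a sub-array. This is a hypothetical sketch, not the assay's actual analysis: the function name, the use of summed intensities, and the 0.25 and 0.75 cut-offs are all assumptions made for illustration.

```python
def call_genotype(wild_type_signals, mutant_signals, low=0.25, high=0.75):
    """Call a genotype from the fraction of signal on mutant-specific probes.

    wild_type_signals, mutant_signals: fluorescence intensities from the two
    halves of one sub-array. The thresholds are illustrative assumptions.
    """
    wt = sum(wild_type_signals)
    mut = sum(mutant_signals)
    total = wt + mut
    if total <= 0:
        return "no call"
    mutant_fraction = mut / total
    if mutant_fraction < low:
        return "homozygous wild type"
    if mutant_fraction > high:
        return "homozygous mutant"
    return "heterozygous"

# Invented intensities for a 40-probe sub-array (20 probes per half).
print(call_genotype([100.0] * 20, [5.0] * 20))
print(call_genotype([50.0] * 20, [55.0] * 20))
```

A real analysis would work with the full hybridization pattern across the sub-array rather than two sums, which is what the abstract means by distinctive patterns.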
Methods of DNA sequencing by hybridization based on optimizing concentration of matrix-bound oligonucleotide and device for carrying out same

DOEpatents

Khrapko, Konstantin R [Moscow, RU; Khorlin, Alexandr A [Moscow, RU; Ivanov, Igor B [Moskovskaya, RU; Ershov, Gennady M [Moscow, RU; Lysov, Jury P [Moscow, RU; Florentiev, Vladimir L [Moscow, RU; Mirzabekov, Andrei D [Moscow, RU

1996-09-03

A method for sequencing DNA by hybridization that includes the following steps: forming an array of oligonucleotides at such concentrations that either ensure the same dissociation temperature for all fully complementary duplexes or allow hybridization and washing of such duplexes to be conducted at the same temperature; hybridizing said oligonucleotide array with labeled test DNA; washing in duplex dissociation conditions; identifying single-base substitutions in the test DNA by analyzing the distribution of the dissociation temperatures; and reconstructing the DNA nucleotide sequence based on the above analysis. A device for carrying out the method comprises a solid substrate and a matrix rigidly bound to the substrate. The matrix contains the oligonucleotide array and consists of a multiplicity of gel portions. Each gel portion contains one oligonucleotide of desired length. The gel portions are separated from one another by interstices and have a thickness not exceeding 30 μm.
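The final reconstruction step can be illustrated in its simplest textbook form. The sketch below is not the patented procedure: it assumes an ideal, error-free set of hybridizing oligonucleotides of equal length k (the spectrum), a target in which no (k-1)-mer repeats, and it simply chains oligonucleotides by their (k-1)-base overlaps. The function names are hypothetical.

```python
def spectrum(sequence, k):
    """All k-mers present in the sequence, as an ideal hybridization result."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}

def reconstruct(kmers):
    """Rebuild a sequence by chaining k-mers on (k-1)-base overlaps.

    Raises ValueError when the spectrum is ambiguous under the assumptions
    stated in the lead-in (for example, when the target contains repeats).
    """
    kmers = set(kmers)
    suffixes = {p[1:] for p in kmers}
    starts = [p for p in kmers if p[:-1] not in suffixes]
    if len(starts) != 1:
        raise ValueError("no unique starting oligonucleotide")
    by_prefix = {}
    for p in kmers:
        by_prefix.setdefault(p[:-1], []).append(p)
    current = starts[0]
    sequence = current
    used = {current}
    while True:
        candidates = [p for p in by_prefix.get(current[1:], []) if p not in used]
        if not candidates:
            break
        if len(candidates) > 1:
            raise ValueError("ambiguous extension")
        current = candidates[0]
        used.add(current)
        sequence += current[-1]  # each step adds exactly one new base
    if used != kmers:
        raise ValueError("some oligonucleotides were not placed")
    return sequence

print(reconstruct(spectrum("ATGCGTAC", 3)))
```

Repeats and hybridization errors break these assumptions, which is why practical schemes need additional information such as the dissociation-temperature analysis the abstract describes.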
Hybrid-denovo: a de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags.

PubMed

Chen, Xianfeng; Johnson, Stephen; Jeraldo, Patricio; Wang, Junwen; Chia, Nicholas; Kocher, Jean-Pierre A; Chen, Jun

2018-03-01

Illumina paired-end sequencing has become increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significantly lower base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large number of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result from a single-end approach. To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage provided by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data.
Predictor-corrector framework for the sequential assembly of optical systems based on wavefront sensing.

PubMed

Schindlbeck, Christopher; Pape, Christian; Reithmeier, Eduard

2018-04-16

Alignment of optical components is crucial for the assembly of optical systems to ensure their full functionality. In this paper we present a novel predictor-corrector framework for the sequential assembly of serial optical systems. Therein, we use a hybrid optical simulation model that comprises virtual and identified component positions. The hybrid model is constantly adapted throughout the assembly process with the help of nonlinear identification techniques and wavefront measurements. This enables prediction of the future wavefront at the detector plane and therefore allows for taking corrective measures accordingly during the assembly process if a user-defined tolerance on the wavefront error is violated. We present a novel notation for the so-called hybrid model and outline the work flow of the presented predictor-corrector framework. A beam expander is assembled as demonstrator for experimental verification of the framework. The optical setup consists of a laser, two bi-convex spherical lenses each mounted to a five degree-of-freedom stage to misalign and correct components, and a Shack-Hartmann sensor for wavefront measurements.
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

1999-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.

Miniaturized reaction vessel system, method for performing site-specific biochemical reactions and affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

2000-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, A.D.; Lysov, Y.P.; Dubley, S.A.

1999-05-18

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between the cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting the extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to the extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from the array. 14 figs.
Confirmation of hybrid origin of Cyrtanthus based on the sequence analysis of internal transcribed spacer

USDA-ARS's Scientific Manuscript database

The objectives of this study were to create interspecific hybrids between Cyrtanthus elatus and C. sanguineus and to confirm the hybrid origin of the progeny based on morphological characters and using molecular markers. The tip of the leaves, the shape and size of cells, and stomata distribution i...
On-chip multiplexed solid-phase nucleic acid hybridization assay using spatial profiles of immobilized quantum dots and fluorescence resonance energy transfer.

PubMed

Noor, M Omair; Tavares, Anthony J; Krull, Ulrich J

2013-07-25

A microfluidic based solid-phase assay for the multiplexed detection of nucleic acid hybridization using quantum dot (QD) mediated fluorescence resonance energy transfer (FRET) is described herein. The glass surface of hybrid glass-polydimethylsiloxane (PDMS) microfluidic channels was chemically modified to assemble the biorecognition interface. Multiplexing was demonstrated using a detection system that was comprised of two colors of immobilized semi-conductor QDs and two different oligonucleotide probe sequences. Green-emitting and red-emitting QDs were paired with Cy3 and Alexa Fluor 647 (A647) labeled oligonucleotides, respectively. The QDs served as energy donors for the transduction of dye labeled oligonucleotide targets. The in-channel assembly of the biorecognition interface and the subsequent introduction of oligonucleotide targets was accomplished within minutes using a combination of electroosmotic flow and electrophoretic force. The concurrent quantification of femtomole quantities of two target sequences was possible by measuring the spatial coverage of FRET sensitized emission along the length of the channel. In previous reports, multiplexed QD-FRET hybridization assays that employed a ratiometric method for quantification had challenges associated with lower analytical sensitivity arising from both donor and acceptor dilution that resulted in reduced energy transfer pathways as compared to single-color hybridization assays. Herein, a spatial method for quantification that is based on in-channel QD-FRET profiles provided higher analytical sensitivity in the multiplexed assay format as compared to single-color hybridization assays. The selectivity of the multiplexed hybridization assays was demonstrated by discrimination between a fully-complementary sequence and a 3 base pair sequence at a contrast ratio of 8 to 1. Copyright © 2013 Elsevier B.V. All rights reserved.
MoRFPred-plus: Computational Identification of MoRFs in Protein Sequences using Physicochemical Properties and HMM profiles.

PubMed

Sharma, Ronesh; Bayarjargal, Maitsetseg; Tsunoda, Tatsuhiko; Patil, Ashwini; Sharma, Alok

2018-01-21

Intrinsically Disordered Proteins (IDPs) lack stable tertiary structure and they actively participate in performing various biological functions. These IDPs expose short binding regions called Molecular Recognition Features (MoRFs) that permit interaction with structured protein regions. Upon interaction they undergo a disorder-to-order transition as a result of which their functionality arises. Predicting these MoRFs in disordered protein sequences is a challenging task. In this study, we present MoRFpred-plus, an improved predictor over our previously proposed predictor to identify MoRFs in disordered protein sequences. Two independent propensity scores are computed by incorporating physicochemical properties and HMM profiles; these scores are combined to predict the final MoRF propensity score for a given residue. The first score reflects the characteristics of a query residue to be part of a MoRF region based on the composition and similarity of assumed MoRF and flank regions. The second score reflects the characteristics of a query residue to be part of a MoRF region based on the properties of flanks associated around the given residue in the query protein sequence. The propensity scores are processed and common averaging is applied to generate the final prediction score of MoRFpred-plus. Performance of the proposed predictor is compared with available MoRF predictors, MoRFchibi, MoRFpred, and ANCHOR. Using previously collected training and test sets used to evaluate the mentioned predictors, the proposed predictor outperforms these predictors and generates a lower false positive rate. In addition, MoRFpred-plus is a downloadable predictor, which makes it useful as it can be used as input to other computational tools. https://github.com/roneshsharma/MoRFpred-plus/wiki/MoRFpred-plus:-Download. Copyright © 2017 Elsevier Ltd. All rights reserved.
Automated hybridization/imaging device for fluorescent multiplex DNA sequencing

DOEpatents

Weiss, R.B.; Kimball, A.W.; Gesteland, R.F.; Ferguson, F.M.; Dunn, D.M.; Di Sera, L.J.; Cherry, J.L.

1995-11-28

A method is disclosed for automated multiplex sequencing of DNA with an integrated automated imaging hybridization chamber system. This system comprises a hybridization chamber device for mounting a membrane containing size-fractionated multiplex sequencing reaction products, apparatus for fluid delivery to the chamber device, imaging apparatus for light delivery to the membrane and image recording of fluorescence emanating from the membrane while in the chamber device, and programmable controller apparatus for controlling operation of the system. The multiplex reaction products are hybridized with a probe, then an enzyme (such as alkaline phosphatase) is bound to a binding moiety on the probe, and a fluorogenic substrate (such as a benzothiazole derivative) is introduced into the chamber device by the fluid delivery apparatus. The enzyme converts the fluorogenic substrate into a fluorescent product which, when illuminated in the chamber device with a beam of light from the imaging apparatus, excites fluorescence of the fluorescent product to produce a pattern of hybridization. The pattern of hybridization is imaged by a CCD camera component of the imaging apparatus to obtain a series of digital signals. These signals are converted by the controller apparatus into a string of nucleotides corresponding to the nucleotide sequence by an automated sequence reader. The method and apparatus are also applicable to other membrane-based applications such as colony and plaque hybridization and Southern, Northern, and Western blots. 9 figs.
Automated hybridization/imaging device for fluorescent multiplex DNA sequencing

DOEpatents

Weiss, Robert B.; Kimball, Alvin W.; Gesteland, Raymond F.; Ferguson, F. Mark; Dunn, Diane M.; Di Sera, Leonard J.; Cherry, Joshua L.

1995-01-01

A method is disclosed for automated multiplex sequencing of DNA with an integrated automated imaging hybridization chamber system. This system comprises a hybridization chamber device for mounting a membrane containing size-fractionated multiplex sequencing reaction products, apparatus for fluid delivery to the chamber device, imaging apparatus for light delivery to the membrane and image recording of fluorescence emanating from the membrane while in the chamber device, and programmable controller apparatus for controlling operation of the system. The multiplex reaction products are hybridized with a probe, then an enzyme (such as alkaline phosphatase) is bound to a binding moiety on the probe, and a fluorogenic substrate (such as a benzothiazole derivative) is introduced into the chamber device by the fluid delivery apparatus. The enzyme converts the fluorogenic substrate into a fluorescent product which, when illuminated in the chamber device with a beam of light from the imaging apparatus, excites fluorescence of the fluorescent product to produce a pattern of hybridization. The pattern of hybridization is imaged by a CCD camera component of the imaging apparatus to obtain a series of digital signals. These signals are converted by the controller apparatus into a string of nucleotides corresponding to the nucleotide sequence by an automated sequence reader. The method and apparatus are also applicable to other membrane-based applications such as colony and plaque hybridization and Southern, Northern, and Western blots.
Evaluation of targeted exome sequencing for 28 protein-based blood group systems, including the homologous gene systems, for blood group genotyping.

PubMed

Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A

2017-04-01

Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Bhageerath-H: A homology/ab initio hybrid server for predicting tertiary structures of monomeric soluble proteins

PubMed Central

2014-01-01

Background: The advent of the human genome sequencing project has led to a spurt in the number of protein sequences in the databanks. Success of structure based drug discovery severely hinges on the availability of structures. Despite significant progress in the area of experimental protein structure determination, the sequence-structure gap is continually widening. Data driven homology based computational methods have proved successful in predicting tertiary structures for sequences sharing medium to high sequence similarities. With dwindling similarities of query sequences, advanced homology/ab initio hybrid approaches are being explored to solve the structure prediction problem. Here we describe Bhageerath-H, a homology/ab initio hybrid software/server for predicting protein tertiary structures with advancing drug design attempts as one of the goals. Results: Bhageerath-H web-server was validated on 75 CASP10 targets which showed TM-scores ≥0.5 in 91% of the cases and Cα RMSDs ≤5Å from the native in 58% of the targets, which is well above the CASP10 water mark. Comparison with some leading servers demonstrated the uniqueness of the hybrid methodology in effectively sampling conformational space, scoring best decoys and refining low resolution models to high and medium resolution. Conclusion: Bhageerath-H methodology is web enabled for the scientific community as a freely accessible web server. The methodology is fielded in the on-going CASP11 experiment. PMID:25521245
Horizontal Transfer of Segments of the 16S rRNA Genes between Species of the Streptococcus anginosus Group

PubMed Central

Schouls, Leo M.; Schot, Corrie S.; Jacobs, Jan A.

2003-01-01

The nature of the variation in the 16S rRNA gene of members of the Streptococcus anginosus group was investigated by hybridization and DNA sequencing. A collection of 708 strains was analyzed by reverse line blot hybridization. This revealed the presence of distinct reaction patterns representing 11 different hybridization groups. The 16S rRNA genes of two strains of each hybridization group were sequenced to near-completion, and the sequence data confirmed the reverse line blot hybridization results. Closer inspection of the sequences revealed mosaic-like structures, strongly suggesting horizontal transfer of segments of the 16S rRNA gene between different species belonging to the Streptococcus anginosus group. Southern blot hybridization further showed that within a single strain all copies of the 16S rRNA gene had the same composition, indicating that the apparent mosaic structures were not PCR-induced artifacts. These findings indicate that the highly conserved rRNA genes are also subject to recombination and that these events may be fixed in the population. Such recombination may lead to the construction of incorrect phylogenetic trees based on the 16S rRNA genes. PMID:14645285
Hybridization-Induced Aggregation Technology for Practical Clinical Testing: KRAS Mutation Detection in Lung and Colorectal Tumors.

PubMed

Sloane, Hillary S; Landers, James P; Kelly, Kimberly A

2016-07-01

KRAS mutations have emerged as powerful predictors of response to targeted therapies in the treatment of lung and colorectal cancers; thus, prospective KRAS genotyping is essential for appropriate treatment stratification. Conventional mutation testing technologies are not ideal for routine clinical screening, as they often involve complex, time-consuming processes and/or costly instrumentation. In response, we recently introduced a unique analytical strategy for revealing KRAS mutations, based on the allele-specific hybridization-induced aggregation (HIA) of oligonucleotide probe-conjugated microbeads. Using simple, inexpensive instrumentation, this approach allows for the detection of any common KRAS mutation in <10 minutes after PCR. Here, we evaluate the clinical utility of the HIA method for mutation detection (HIAMD). In the analysis of 20 lung and colon tumor pathology specimens, we observed a 100% correlation between the KRAS mutation statuses determined by HIAMD and sequencing. In addition, we were able to detect KRAS mutations in a background of 75% wild-type DNA, a finding consistent with that reported for sequencing. With this, we show that HIAMD allows for the rapid and cost-effective detection of KRAS mutations, without compromising analytical performance. These results indicate the validity of HIAMD as a mutation-testing technology suitable for practical clinical testing. Further expansion of this platform may involve the detection of mutations in other key oncogenic pathways. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Cross-species bacterial artificial chromosome (BAC) library screening via overgo-based hybridization and BAC-contig mapping of a yield enhancement quantitative trait locus (QTL) yld1.1 in the Malaysian wild rice Oryza rufipogon.

PubMed

Song, Beng-Kah; Nadarajah, Kalaivani; Romanov, Michael N; Ratnam, Wickneswari

2005-01-01

The construction of BAC-contig physical maps is an important step towards a partial or ultimate genome sequence analysis. Here, we describe our initial efforts to apply an overgo approach to screen a BAC library of the Malaysian wild rice species, Oryza rufipogon. Overgo design is based on repetitive element masking and sequence uniqueness, and uses short probes (approximately 40 bp), making this method highly efficient and specific. Pairs of 24-bp oligos that contain an 8-bp overlap were developed from the publicly available genomic sequences of the cultivated rice, O. sativa, to generate 20 overgo probes for a 1-Mb region that encompasses a yield enhancement QTL yld1.1 in O. rufipogon. The advantages of a high similarity in melting temperature, hybridization kinetics and specific activities of overgos further enabled a pooling strategy for library screening by filter hybridization. Two pools of ten overgos each were hybridized to high-density filters representing the O. rufipogon genomic BAC library. These screening tests succeeded in providing 69 PCR-verified positive hits from a total of 23,040 BAC clones of the entire O. rufipogon library. A minimal tiling path of clones was generated to contribute to a fully covered BAC-contig map of the targeted 1-Mb region. The developed protocol for overgo design based on O. sativa sequences as a comparative genomic framework, and the pooled overgo hybridization screening technique are suitable means for high-resolution physical mapping and the identification of BAC candidates for sequencing.
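The overgo arithmetic in the abstract above (two 24-bp oligos sharing an 8-bp overlap, giving a probe of about 40 bp) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' design protocol: the function name design_overgo and the example target sequence are hypothetical, and the repeat masking and uniqueness checks mentioned in the abstract are omitted.

```python
# Minimal sketch: derive an overgo oligo pair from a 40-bp target.
# design_overgo and the target sequence are hypothetical illustrations.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def design_overgo(target, oligo_len=24, overlap=8):
    """Split a target into two oligos whose 3' ends anneal over `overlap` bases."""
    probe_len = 2 * oligo_len - overlap  # 2 * 24 - 8 = 40
    if len(target) != probe_len:
        raise ValueError(f"target must be {probe_len} bp long")
    forward = target[:oligo_len]
    # The second oligo is read from the opposite strand, covering the last 24 bp.
    reverse = reverse_complement(target[probe_len - oligo_len:])
    return forward, reverse


target = "ACGTTGCAAGGCTTACGATCGGATCCTAGGCATGCAATGC"  # made-up 40-bp sequence
forward, reverse = design_overgo(target)
print(forward)
print(reverse)
```

The last 8 bases of the forward oligo are the reverse complement of the last 8 bases of the reverse oligo, which is what lets the pair anneal and be filled in to the full 40-bp probe.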
Diagnostics based on nucleic acid sequence variant profiling: PCR, hybridization, and NGS approaches.

PubMed

Khodakov, Dmitriy; Wang, Chunyan; Zhang, David Yu

2016-10-01

Nucleic acid sequence variations have been implicated in many diseases, and reliable detection and quantitation of DNA/RNA biomarkers can inform effective therapeutic action, enabling precision medicine. Nucleic acid analysis technologies being translated into the clinic can broadly be classified into hybridization, PCR, and sequencing, as well as their combinations. Here we review the molecular mechanisms of popular commercial assays, and their progress in translation into in vitro diagnostics. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
PERMANENT GENETIC RESOURCES: Consensus primers of cyp73 genes discriminate willow species and hybrids (Salix, Salicaceae).

PubMed

Trung, Le Quang; Van Puyvelde, Karolien; Triest, Ludwig

2008-03-01

Consensus primers, based on exon sequences of the cyp73 gene family coding for cinnamate 4-hydroxylase (C4H) of the lignin biosynthesis pathway, were designed for the tetraploid willow species Salix alba and Salix fragilis. Diagnostic alleles at species level were observed among introns of three cyp73 genes and allowed unambiguous detection of the first generation and introgressed hybrids in populations. Progeny analysis of a female S. alba with a male introgressed hybrid confirmed the codominant inheritance of each intron. Sequences of the diagnostic alleles of both species were similar to those found in the hybrids. © 2007 The Authors.
A Deep Machine Learning Algorithm to Optimize the Forecast of Atmospherics

NASA Astrophysics Data System (ADS)

Russell, A. M.; Alliss, R. J.; Felton, B. D.

Space-based applications from imaging to optical communications are significantly impacted by the atmosphere. Specifically, the occurrence of clouds and optical turbulence can determine whether a mission is a success or a failure. In the case of space-based imaging applications, clouds produce atmospheric transmission losses that can make it impossible for an electro-optical platform to image its target. Hence, accurate predictions of negative atmospheric effects are a high priority in order to facilitate the efficient scheduling of resources. This study seeks to revolutionize our understanding of and our ability to predict such atmospheric events through the mining of data from a high-resolution Numerical Weather Prediction (NWP) model. Specifically, output from the Weather Research and Forecasting (WRF) model is mined using a Random Forest (RF) ensemble classification and regression approach in order to improve the prediction of low cloud cover over the Haleakala summit of the Hawaiian island of Maui. RF techniques have a number of advantages including the ability to capture non-linear associations between the predictors (in this case physical variables from WRF such as temperature, relative humidity, wind speed and pressure) and the predictand (clouds), which becomes critical when dealing with the complex non-linear occurrence of clouds. In addition, RF techniques are capable of representing complex spatial-temporal dynamics to some extent. Input predictors to the WRF-based RF model are strategically selected based on expert knowledge and a series of sensitivity tests. Ultimately, three types of WRF predictors are chosen: local surface predictors, regional 3D moisture predictors and regional inversion predictors. A suite of RF experiments is performed using these predictors in order to evaluate the performance of the hybrid RF-WRF technique. The RF model is trained and tuned on approximately half of the input dataset and evaluated on the other half. 
The RF approach is validated using in-situ observations of clouds. All of the hybrid RF-WRF experiments demonstrated here significantly outperform the base WRF local low cloud cover forecasts in terms of the probability of detection and the overall bias. In particular, RF experiments that use only regional three-dimensional moisture predictors from the WRF model produce the highest accuracy when compared to RF experiments that use local surface predictors only or regional inversion predictors only. Furthermore, adding multiple types of WRF predictors and additional WRF predictors to the RF algorithm does not necessarily add more value in the resulting forecasts, indicating that it is better to have a small set of meaningful predictors than to have a vast set of indiscriminately-chosen predictors. This work also reveals that the WRF-based RF approach is highly sensitive to the time period over which the algorithm is trained and evaluated. Future work will focus on developing a similar WRF-based RF model for high cloud prediction and expanding the algorithm to two dimensions horizontally.
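The two verification measures named in the abstract above can be illustrated with a small sketch. It assumes that "overall bias" refers to the standard frequency bias (number of forecast events divided by number of observed events), which the abstract does not state explicitly; the forecast and observation lists are made-up values, not data from the study.

```python
# Minimal sketch of binary forecast verification (cloud / no cloud).
# Assumption: "overall bias" is taken to mean frequency bias.


def contingency(forecast, observed):
    """Count hits, misses and false alarms for paired binary series."""
    pairs = list(zip(forecast, observed))
    hits = sum(1 for f, o in pairs if f and o)
    misses = sum(1 for f, o in pairs if not f and o)
    false_alarms = sum(1 for f, o in pairs if f and not o)
    return hits, misses, false_alarms


def probability_of_detection(hits, misses):
    """Fraction of observed events that were forecast."""
    return hits / (hits + misses)


def frequency_bias(hits, misses, false_alarms):
    """Ratio of forecast events to observed events (1.0 means unbiased)."""
    return (hits + false_alarms) / (hits + misses)


# Hypothetical data: 1 = low cloud present, 0 = clear.
forecast = [1, 1, 0, 0, 1, 0, 1, 0]
observed = [1, 0, 0, 1, 1, 0, 1, 1]

hits, misses, false_alarms = contingency(forecast, observed)
pod = probability_of_detection(hits, misses)
bias = frequency_bias(hits, misses, false_alarms)
print(hits, misses, false_alarms, pod, bias)
```

A bias below 1.0 indicates that the event is forecast less often than it is observed, and a bias above 1.0 indicates over-forecasting.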
High Specific Selectivity and Membrane-Active Mechanism of Synthetic Cationic Hybrid Antimicrobial Peptides Based on the Peptide FV7

PubMed Central

Tan, Tingting; Wu, Di; Li, Weizhong; Zheng, Xin; Li, Weifen; Shan, Anshan

2017-01-01

Hybrid peptides integrating different functional domains of peptides have many advantages, such as remarkable antimicrobial activity, lower hemolysis and ideal cell selectivity, compared with natural antimicrobial peptides. FV7 (FRIRVRV-NH2), a consensus amphiphilic sequence, was identified as being analogous to host defense peptides. In this study, we designed a series of hybrid peptides FV7-LL-37 (17–29) (FV-LL), FV7-magainin 2 (9–21) (FV-MA) and FV7-cecropin A (1–8) (FV-CE) by combining the FV7 sequence with the small functional sequences LL-37 (17–29) (LL), magainin 2 (9–21) (MA) and cecropin A (1–8) (CE), which all come from well-described natural peptides. The results demonstrated that the synthetic hybrid peptides, in particular FV-LL, had potent antibacterial activities over a wide range of Gram-negative and Gram-positive bacteria with lower hemolytic activity than other peptides. Furthermore, fluorescence spectroscopy indicated that the hybrid peptide FV-LL exhibited marked membrane destruction by inducing outer and inner bacterial membrane permeabilization, while scanning electron microscopy (SEM) and transmission electron microscopy (TEM) demonstrated that FV-LL damaged membrane integrity by disrupting the bacterial membrane. Biofilm formation inhibition assays also showed that FV-LL had similar anti-biofilm activity compared with the functional peptide sequence FV7. Synthetic cationic hybrid peptides based on FV7 could provide new models for combining different functional domains and demonstrate effective avenues to screen for novel antimicrobial agents. PMID:28178190
HIA: a genome mapper using hybrid index-based sequence alignment.

PubMed

Choi, Jongpill; Park, Kiejung; Cho, Seong Beom; Chung, Myungguen

2015-01-01

A number of alignment tools have been developed to align sequencing reads to the human reference genome. The scale of information from next-generation sequencing (NGS) experiments, however, is increasing rapidly. Recent studies based on NGS technology have routinely produced exome or whole-genome sequences from several hundreds or thousands of samples. To accommodate the increasing need of analyzing very large NGS data sets, it is necessary to develop faster, more sensitive and accurate mapping tools. HIA uses two indices, a hash table index and a suffix array index. The hash table performs direct lookup of a q-gram, and the suffix array performs very fast lookup of variable-length strings by exploiting binary search. We observed that combining hash table and suffix array (hybrid index) is much faster than the suffix array method for finding a substring in the reference sequence. Here, we define the matching region (MR) as a longest common substring between a reference and a read. We also define the candidate alignment regions (CARs) as a list of MRs that are close to each other. The hybrid index is used to find CARs between a reference and a read. We found that aligning only the unmatched regions in the CAR is much faster than aligning the whole CAR. In benchmark analysis, HIA outperformed the other aligners in mapping speed, without significant loss of mapping accuracy. Our experiments show that the hybrid of hash table and suffix array is useful in terms of speed for mapping NGS sequencing reads to the human reference genome sequence. In conclusion, our tool is appropriate for aligning massive data sets generated by NGS sequencing.
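One way a hash table of q-grams and a suffix array can cooperate is sketched below. The abstract does not say how HIA combines its two indices, so the scheme here is an assumption made for illustration: the hash table maps each q-gram to the block of the suffix array that starts with it, and binary search then runs only inside that block. All function names are hypothetical, and the naive suffix array construction is suitable for toy inputs only.

```python
# Minimal sketch of a hybrid index: q-gram hash table + suffix array.
# Assumption: the hash table narrows the suffix-array interval before
# binary search; this is not claimed to be HIA's actual implementation.


def build_suffix_array(ref):
    """Naive construction: sort suffix start positions lexicographically."""
    return sorted(range(len(ref)), key=lambda i: ref[i:])


def build_qgram_index(ref, sa, q):
    """Map each q-gram to the [lo, hi) suffix-array interval that starts with it."""
    index = {}
    for rank, pos in enumerate(sa):
        gram = ref[pos:pos + q]
        if len(gram) < q:
            continue  # suffix shorter than q has no complete q-gram
        lo, _ = index.get(gram, (rank, rank))
        index[gram] = (lo, rank + 1)
    return index


def find(ref, sa, index, q, pattern):
    """Return the sorted start positions of `pattern` in `ref`."""
    if len(pattern) >= q:
        if pattern[:q] not in index:
            return []
        lo, hi = index[pattern[:q]]  # direct lookup narrows the search
    else:
        lo, hi = 0, len(sa)  # pattern too short for the hash table
    m = len(pattern)
    left, right = lo, hi
    while left < right:  # first suffix whose m-prefix is >= pattern
        mid = (left + right) // 2
        if ref[sa[mid]:sa[mid] + m] < pattern:
            left = mid + 1
        else:
            right = mid
    start, right = left, hi
    while left < right:  # first suffix whose m-prefix is > pattern
        mid = (left + right) // 2
        if ref[sa[mid]:sa[mid] + m] <= pattern:
            left = mid + 1
        else:
            right = mid
    return sorted(sa[start:left])


ref = "GATTACAGATTACA"  # toy reference
q = 3
sa = build_suffix_array(ref)
index = build_qgram_index(ref, sa, q)
print(find(ref, sa, index, q, "TTAC"))
```

Because the suffixes are sorted, all suffixes sharing a q-gram prefix are contiguous, so a single interval per q-gram is enough; the binary search then costs time logarithmic in the size of that interval rather than in the size of the whole reference.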
Comparison of dkgB-linked intergenic sequence ribotyping to DNA microarray hybridization for assigning serotype to Salmonella enterica

PubMed Central

Guard, Jean; Sanchez-Ingunza, Roxana; Morales, Cesar; Stewart, Tod; Liljebjelke, Karen; Kessel, JoAnn; Ingram, Kim; Jones, Deana; Jackson, Charlene; Fedorka-Cray, Paula; Frye, Jonathan; Gast, Richard; Hinton, Arthur

2012-01-01

Two DNA-based methods were compared for the ability to assign serotype to 139 isolates of Salmonella enterica ssp. I. Intergenic sequence ribotyping (ISR) evaluated single nucleotide polymorphisms occurring in a 5S ribosomal gene region and flanking sequences bordering the gene dkgB. A DNA microarray hybridization method that assessed the presence and the absence of sets of genes was the second method. Serotype was assigned for 128 (92.1%) of submissions by the two DNA methods. ISR detected mixtures of serotypes within single colonies and it cost substantially less than Kauffmann–White serotyping and DNA microarray hybridization. Decreasing the cost of serotyping S. enterica while maintaining reliability may encourage routine testing and research. PMID:22998607
Method of Identifying a Base in a Nucleic Acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

1999-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

Identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2005-02-08

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
BEST: Improved Prediction of B-Cell Epitopes from Antigen Sequences

PubMed Central

Gao, Jianzhao; Faraggi, Eshel; Zhou, Yaoqi; Ruan, Jishou; Kurgan, Lukasz

2012-01-01

Accurate identification of immunogenic regions in a given antigen chain is a difficult and actively pursued problem. Although accurate predictors for T-cell epitopes are already in place, the prediction of the B-cell epitopes requires further research. We overview the available approaches for the prediction of B-cell epitopes and propose a novel and accurate sequence-based solution. Our BEST (B-cell Epitope prediction using Support vector machine Tool) method predicts epitopes from antigen sequences, in contrast to some methods that predict only from short sequence fragments, using a new architecture based on averaging selected scores generated from sliding 20-mers by a Support Vector Machine (SVM). The SVM predictor utilizes a comprehensive and custom designed set of inputs generated by combining information derived from the chain, sequence conservation, similarity to known (training) epitopes, and predicted secondary structure and relative solvent accessibility. Empirical evaluation on benchmark datasets demonstrates that BEST outperforms several modern sequence-based B-cell epitope predictors including ABCPred, the method by Chen et al. (2007), BCPred, COBEpro, BayesB, and CBTOPE, when considering the predictions from antigen chains and from the chain fragments. Our method obtains a cross-validated area under the receiver operating characteristic curve (AUC) for the fragment-based prediction at 0.81 and 0.85, depending on the dataset. The AUCs of BEST on the benchmark sets of full antigen chains equal 0.57 and 0.6, which is significantly and slightly better than the next best method we tested. We also present case studies to contrast the propensity profiles generated by BEST and several other methods. PMID:22761950
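The sliding-window averaging described in the abstract above can be illustrated with a minimal sketch. It assumes that every 20-residue window receives one score and that a residue's score is the plain mean over all windows covering it; the abstract speaks of averaging "selected" scores, and that selection rule is not given here, so it is not modelled. The names `per_residue_scores` and `toy_scorer` are hypothetical, and `toy_scorer` is only a stand-in for the trained SVM.

```python
from typing import Callable, List

WINDOW = 20  # window length stated in the abstract (sliding 20-mers)


def per_residue_scores(sequence: str,
                       score_window: Callable[[str], float],
                       window: int = WINDOW) -> List[float]:
    """Average, for each residue, the scores of all windows that cover it."""
    n = len(sequence)
    if n < window:
        raise ValueError("sequence is shorter than the window length")
    sums = [0.0] * n
    counts = [0] * n
    for start in range(n - window + 1):
        score = score_window(sequence[start:start + window])
        for i in range(start, start + window):
            sums[i] += score
            counts[i] += 1
    # Every residue is covered by at least one window, so counts are non-zero.
    return [total / count for total, count in zip(sums, counts)]


def toy_scorer(fragment: str) -> float:
    """Hypothetical placeholder for the SVM: fraction of charged/polar residues."""
    return sum(fragment.count(aa) for aa in "DEKRNQ") / len(fragment)


if __name__ == "__main__":
    seq = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"  # arbitrary example sequence
    for residue, score in zip(seq, per_residue_scores(seq, toy_scorer)):
        print(residue, round(score, 3))
```

Residues near the ends of the chain are covered by fewer windows, so their averages rest on fewer scores; how the published method treats chain termini is not stated in the abstract.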
A whole-genome, radiation hybrid map of wheat

USDA-ARS's Scientific Manuscript database

Generating a reference sequence of bread wheat (Triticum aestivum L.) is a challenging task because of its large, highly repetitive and allopolyploid genome. Ordering of BAC- and NGS-based contigs in ongoing wheat genome-sequencing projects primarily uses recombination and comparative genomics-base...
GeneChip Resequencing of the Smallpox Virus Genome Can Identify Novel Strains: a Biodefense Application

PubMed Central

Sulaiman, Irshad M.; Tang, Kevin; Osborne, John; Sammons, Scott; Wohlhueter, Robert M.

2007-01-01

We developed a set of seven resequencing GeneChips, based on the complete genome sequences of 24 strains of smallpox virus (variola virus), for rapid characterization of this human-pathogenic virus. Each GeneChip was designed to analyze a divergent segment of approximately 30,000 bases of the smallpox virus genome. This study includes the hybridization results of 14 smallpox virus strains. Of the 14 smallpox virus strains hybridized, only 7 had sequence information included in the design of the smallpox virus resequencing GeneChips; similar information for the remaining strains was not tiled as a reference in these GeneChips. By use of variola virus-specific primers and long-range PCR, 22 overlapping amplicons were amplified to cover nearly the complete genome and hybridized with the smallpox virus resequencing GeneChip set. These GeneChips were successful in generating nucleotide sequences for all 14 of the smallpox virus strains hybridized. Analysis of the data indicated that the GeneChip resequencing by hybridization was fast and reproducible and that the smallpox virus resequencing GeneChips could differentiate the 14 smallpox virus strains characterized. This study also suggests that high-density resequencing GeneChips have potential biodefense applications and may be used as an alternate tool for rapid identification of smallpox virus in the future. PMID:17182757
Genetic relationships among Hylocereus and Selenicereus vine cacti (Cactaceae): evidence from hybridization and cytological studies.

PubMed

Tel-Zur, Noemi; Abbo, Shahal; Bar-Zvi, Dudy; Mizrahi, Yosef

2004-10-01

Hylocereus and Selenicereus are native to tropical and sub-tropical America. Based on its taxonomic status and crossability relations it was postulated that H. megalanthus (syn. S. megalanthus) is an allotetraploid (2n = 4x = 44) derived from natural hybridization between two closely related diploid taxa. The present work aimed at elucidating the genetic relationships between species of the two genera. Crosses were performed and the putative hybrids were analysed by chromosome counts and morphological traits. The ploidy level of hybrids was confirmed by fluorescent in situ hybridization (FISH) of rDNA sites. Genomic in situ hybridization (GISH) was used in an attempt to identify the putative diploid genome donors of H. megalanthus and an artificial interploid hybrid. Reciprocal crosses among four diploid Hylocereus species (H. costaricensis, H. monacanthus (syn. H. polyrhizus), H. undatus and Hylocereus sp.) yielded viable diploid hybrids, with regular chromosome pairing. Reciprocal crosses between these Hylocereus spp. and H. megalanthus yielded viable triploid, pentaploid, hexaploid and aneuploid hybrids. Morphological and phenological traits confirm the hybrid origin. In situ detection of rDNA sites was in accord with the ploidy status of the species and hybrid studied. GISH results indicated that overall sequence composition of H. megalanthus is similar to that of H. ocamponis and S. grandiflorus. High sequence similarity was also found between the parental genomes of H. monacanthus and H. megalanthus in one triploid hybrid. The ease of obtaining partially fertile F1 hybrids and the relative sequence similarity (in GISH study) suggest close genetic relationships among the taxa analysed.
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1997-01-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1997-04-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.
Lateral flow nucleic acid biosensor for sensitive detection of microRNAs based on the dual amplification strategy of duplex-specific nuclease and hybridization chain reaction.

PubMed

Ying, Na; Ju, Chuanjing; Sun, Xiuwei; Li, Letian; Chang, Hongbiao; Song, Guangping; Li, Zhongyi; Wan, Jiayu; Dai, Enyong

2017-01-01

MicroRNAs (miRNAs) constitute novel biomarkers for various diseases. Accurate and quantitative analysis of miRNA expression is critical for biomedical research and clinical theranostics. In this study, a method was developed for sensitive and specific detection of miRNAs via dual signal amplification based on duplex specific nuclease (DSN) and hybridization chain reaction (HCR). A reporter probe (RP), comprising recognition sequence (3' end modified with biotin) for a target miRNA of miR-21 and capture sequence (5' end modified with Fam) for HCR product, was designed and synthesized. HCR was initiated by partial sequence of initiator probe (IP), the other part of which can hybridize with capture sequence of RP, and was assembled by hairpin probes modified with biotin (H1-bio and H2-bio). A miR-21 triggered cyclical DSN cleavage of RP, which was immobilized to a streptavidin (SA) coated magnetic bead (MB). The released Fam labeled capture sequence then hybridized with the HCR product to generate a detectable dsDNA. This polymer was then dropped on lateral flow strip and positive result was observed. The proposed method allowed quantitative sequence-specific detection of miR-21 (with a detection limit of 2.1 fM, S/N = 3) in a dynamic range from 100 fM to 100 pM, with an excellent ability to discriminate differences in miRNAs. The method showed acceptable testing recoveries for the determination of miRNAs in serum.
Label-free technology for the amplified detection of microRNA based on the allosteric hairpin DNA switch and hybridization chain reaction.

PubMed

Cai, Sheng; Cao, Zhijuan; Lau, Choiwan; Lu, Jianzhong

2014-11-21

By using the allosteric hairpin DNA switch, a novel assay for the detection of microRNA (miRNA) let-7a via a hybridization chain reaction (HCR) was introduced. Briefly, the hairpin DNA switch probe is a single-stranded DNA consisting of a streptavidin (SA) aptamer sequence, a target binding sequence and a certain sequence that acts as a trigger of the HCR. In the presence of target let-7a, the hairpin DNA switch would open and expose the stem region sequences, where a part of this sequence acts as initiator sequence strands for the HCR and triggers a cascade of hybridization events that yields nicked double helices analogous to alternating copolymers, another part is the SA aptamer sequence which activates its binding affinity to SA on SA-coated magnetic particles. The hybridization event could be sensitively detected via an instantaneous derivatization reaction between a special chemiluminescence (CL) reagent, 3,4,5-trimethoxylphenylglyoxal (TMPG) and the guanine nucleotides within the target, the hairpin DNA switch probe, and HCR helices to form an unstable CL intermediate for the generation of light. Our results show that the coupling of the hairpin DNA switch probe and the HCR for the amplified detection of let-7a achieves a better performance (e.g. wide linear response range: 0.1-1000 fmol, low detection limit: 0.1 fmol, and high specificity). Furthermore, this approach could be easily applied to the detection of let-7a in human lung cells, and extended to detect other types of miRNA and proteins such as PDGF based on aptamers. We believe such advancements will represent a significant step towards improved diagnostics and more personalized medical treatment.
Development of a Fluorescence Resonance Energy Transfer (FRET)-Based DNA Biosensor for Detection of Synthetic Oligonucleotide of Ganoderma boninense.

PubMed

Bakhori, Noremylia Mohd; Yusof, Nor Azah; Abdullah, Abdul Halim; Hussein, Mohd Zobir

2013-12-12

An optical DNA biosensor based on fluorescence resonance energy transfer (FRET) utilizing synthesized quantum dot (QD) has been developed for the detection of specific-sequence of DNA for Ganoderma boninense, an oil palm pathogen. Modified QD that contained carboxylic groups was conjugated with a single-stranded DNA probe (ssDNA) via amide-linkage. Hybridization of the target DNA with conjugated QD-ssDNA and reporter probe labeled with Cy5 allows for the detection of related synthetic DNA sequence of Ganoderma boninense gene based on FRET signals. Detection of FRET emission before and after hybridization was confirmed through the capability of the system to produce FRET at 680 nm for hybridized sandwich with complementary target DNA. No FRET emission was observed for non-complementary system. Hybridization time, temperature and effect of different concentration of target DNA were studied in order to optimize the developed system. The developed biosensor has shown high sensitivity with detection limit of 3.55 × 10⁻⁹ M. TEM results show that the particle size of QD varies in the range between 5 to 8 nm after ligand modification and conjugation with ssDNA. This approach is capable of providing a simple, rapid and sensitive method for detection of related synthetic DNA sequence of Ganoderma boninense.
Development of a Fluorescence Resonance Energy Transfer (FRET)-Based DNA Biosensor for Detection of Synthetic Oligonucleotide of Ganoderma boninense.

PubMed

Mohd Bakhori, Noremylia; Yusof, Nor Azah; Abdullah, Abdul Halim; Hussein, Mohd Zobir

2013-12-01

An optical DNA biosensor based on fluorescence resonance energy transfer (FRET) utilizing synthesized quantum dot (QD) has been developed for the detection of specific-sequence of DNA for Ganoderma boninense, an oil palm pathogen. Modified QD that contained carboxylic groups was conjugated with a single-stranded DNA probe (ssDNA) via amide-linkage. Hybridization of the target DNA with conjugated QD-ssDNA and reporter probe labeled with Cy5 allows for the detection of related synthetic DNA sequence of Ganoderma boninense gene based on FRET signals. Detection of FRET emission before and after hybridization was confirmed through the capability of the system to produce FRET at 680 nm for hybridized sandwich with complementary target DNA. No FRET emission was observed for non-complementary system. Hybridization time, temperature and effect of different concentration of target DNA were studied in order to optimize the developed system. The developed biosensor has shown high sensitivity with detection limit of 3.55 × 10⁻⁹ M. TEM results show that the particle size of QD varies in the range between 5 to 8 nm after ligand modification and conjugation with ssDNA. This approach is capable of providing a simple, rapid and sensitive method for detection of related synthetic DNA sequence of Ganoderma boninense.
Polymerase chain reaction-hybridization method using urease gene sequences for high-throughput Ureaplasma urealyticum and Ureaplasma parvum detection and differentiation.

PubMed

Xu, Chen; Zhang, Nan; Huo, Qianyu; Chen, Minghui; Wang, Rengfeng; Liu, Zhili; Li, Xue; Liu, Yunde; Bao, Huijing

2016-04-15

In this article, we discuss the polymerase chain reaction (PCR)-hybridization assay that we developed for high-throughput simultaneous detection and differentiation of Ureaplasma urealyticum and Ureaplasma parvum using one set of primers and two specific DNA probes based on urease gene nucleotide sequence differences. First, U. urealyticum and U. parvum DNA samples were specifically amplified using one set of biotin-labeled primers. Furthermore, amine-modified DNA probes, which can specifically react with U. urealyticum or U. parvum DNA, were covalently immobilized to a DNA-BIND plate surface. The plate was then incubated with the PCR products to facilitate sequence-specific DNA binding. Horseradish peroxidase-streptavidin conjugation and a colorimetric assay were used. Based on the results, the PCR-hybridization assay we developed can specifically differentiate U. urealyticum and U. parvum with high sensitivity (95%) compared with cultivation (72.5%). Hence, this study demonstrates a new method for high-throughput simultaneous differentiation and detection of U. urealyticum and U. parvum with high sensitivity. Based on these observations, the PCR-hybridization assay developed in this study is ideal for detecting and discriminating U. urealyticum and U. parvum in clinical applications. Copyright © 2016 Elsevier Inc. All rights reserved.
Tomato (Solanum lycopersicum) variety discrimination and hybridization analysis based on the 5S rRNA region.

PubMed

Sun, Yan-Lin; Kang, Ho-Min; Kim, Young-Sik; Baek, Jun-Pill; Zheng, Shi-Lin; Xiang, Jin-Jun; Hong, Soon-Kwan

2014-05-04

The tomato (Solanum lycopersicum) is a major vegetable crop worldwide. To satisfy popular demand, more than 500 tomato varieties have been bred. However, a clear variety identification has not been found. Thorough understanding of the phylogenetic relationship and hybridization information of tomato varieties is very important for further variety breeding. Thus, in this study, we collected 26 tomato varieties and attempted to distinguish them based on the 5S rRNA region, which is widely used in the determination of phylogenetic relations. Sequence analysis of the 5S rRNA region suggested that a large number of nucleotide variations exist among tomato varieties. These variable nucleotide sites were also informative regarding hybridization. Chromas sequencing of Yellow Mountain View and Seuwiteuking varieties indicated three and one variable nucleotide sites in the non-transcribed spacer (NTS) of the 5S rRNA region showing hybridization, respectively. Based on a phylogenetic tree constructed using the 5S rRNA sequences, we observed that 16 tomato varieties were divided into three groups at 95% similarity. Rubiking and Sseommeoking, Lang Selection Procedure and Seuwiteuking, and Acorn Gold and Yellow Mountain View exhibited very high identity with their partners. This work will aid variety authentication and provides a basis for further tomato variety breeding.
Probe kit for identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Distinguishing between weedy Amaranthus species based on intron one sequences from the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene

USDA-ARS's Scientific Manuscript database

Hybridization between Amaranthus species and the potential for herbicide resistance to be transferred by hybridization are of growing concern in the weed science community. It is important to confirm suspect hybrid populations early to develop an effective control strategy. With this in mind, a PC...
Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Priming Glycosyltransferase Genes of the Exopolysaccharide Locus in Strains of the Lactobacillus casei Group

PubMed Central

Provencher, Cathy; LaPointe, Gisèle; Sirois, Stéphane; Van Calsteren, Marie-Rose; Roy, Denis

2003-01-01

A primer design strategy named CODEHOP (consensus-degenerate hybrid oligonucleotide primer) for amplification of distantly related sequences was used to detect the priming glycosyltransferase (GT) gene in strains of the Lactobacillus casei group. Each hybrid primer consisted of a short 3′ degenerate core based on four highly conserved amino acids and a longer 5′ consensus clamp region based on six sequences of the priming GT gene products from exopolysaccharide (EPS)-producing bacteria. The hybrid primers were used to detect the priming GT gene of 44 commercial isolates and reference strains of Lactobacillus rhamnosus, L. casei, Lactobacillus zeae, and Streptococcus thermophilus. The priming GT gene was detected in the genome of both non-EPS-producing (EPS−) and EPS-producing (EPS+) strains of L. rhamnosus. The sequences of the cloned PCR products were similar to those of the priming GT gene of various gram-negative and gram-positive EPS+ bacteria. Specific primers designed from the L. rhamnosus RW-9595M GT gene were used to sequence the end of the priming GT gene in selected EPS+ strains of L. rhamnosus. Phylogenetic analysis revealed that Lactobacillus spp. form a distinctive group apart from other lactic acid bacteria for which GT genes have been characterized to date. Moreover, the sequences show a divergence existing among strains of L. rhamnosus with respect to the terminal region of the priming GT gene. Thus, the PCR approach with consensus-degenerate hybrid primers designed with CODEHOP is a practical approach for the detection of similar genes containing conserved motifs in different bacterial genomes. PMID:12788729
Detection of cystic fibrosis transmembrane conductance regulator ΔF508 gene mutation using a paper-based nucleic acid hybridization assay and a smartphone camera.

PubMed

Malhotra, Karan; Noor, M Omair; Krull, Ulrich J

2018-05-29

Diagnostic technology that makes use of paper platforms in conjunction with the ubiquitous availability of digital cameras in cellular telephones and personal assistive devices offers opportunities for development of bioassays that are cost effective and widely distributed. Assays that operate effectively in aqueous solution require further development for implementation in paper substrates, overcoming issues associated with surface interactions on a matrix that offers a large surface-to-volume ratio and constraints on convective mixing. This report presents and compares two related methods for determination of oligonucleotides that serve as indicators of cystic fibrosis, differentiating between the normal wild-type sequence, and a mutant-type sequence that has a 3-base replacement. The transduction strategy operates by selective hybridization of oligonucleotide probes that are conjugated to fluorescent quantum dots, where hybridization of target sequences causes a molecular fluorophore to approach the quantum dot and become emissive through fluorescence resonance energy transfer. Detection can rely on hybridization of a target that is labelled with Cy3 fluorophore, or in the presence of an unlabelled target when a sandwich assay format is implemented with a labelled reporter oligonucleotide. Selectivity to determine the presence of mismatched sequences involves appropriate selection of nucleotide sequences to set melt temperatures, in conjunction with control of stringency conditions using formamide as a chaotrope. It was determined that both direct and sandwich assays on paper substrates are able to distinguish between wild-type and mutant-type samples.
Screening and identification of male-specific DNA fragments in common carps Cyprinus carpio using suppression subtractive hybridization.

PubMed

Chen, J J; Du, Q Y; Yue, Y Y; Dang, B J; Chang, Z J

2010-08-01

In this study, a sex subtractive genomic DNA library was constructed using suppression subtractive hybridization (SSH) between male and female Cyprinus carpio. Twenty-two clones with distinguishable hybridization signals were selected and sequenced. The specific primers were designed based on the sequence data. Those primers were then used to amplify the sex-specific fragments from the genomic DNA of male and female carp. The amplified fragments from two clones showed specificity to males but not to females, which were named as Ccmf2 [387 base pairs (bp)] and Ccmf3 (183 bp), respectively. The sex-specific pattern was analysed in a total of 40 individuals from three other different C. carpio stocks and grass carp Ctenopharyngodon idella using Ccmf2 and Ccmf3 as dot-blotting probes. The results revealed that molecular diversity exists on the Y chromosome of C. carpio. No hybridization signals, however, were detected from individuals of C. idella, suggesting that the two sequences are specific to C. carpio. No significant homologous sequences of Ccmf2 and Ccmf3 were found in GenBank. Therefore, the results were interpreted as showing that Ccmf2 and Ccmf3 are two novel male-specific sequences, and both fragments could be used as markers to rapidly and accurately identify the genetic sex of part of C. carpio. This may provide a very efficient selective tool for practically breeding monosex female populations in aquacultural production.
A "signal on" protection-displacement-hybridization-based electrochemical hepatitis B virus gene sequence sensor with high sensitivity and peculiar adjustable specificity.

PubMed

Li, Fengqin; Xu, Yanmei; Yu, Xiang; Yu, Zhigang; He, Xunjun; Ji, Hongrui; Dong, Jinghao; Song, Yongbin; Yan, Hong; Zhang, Guiling

2016-08-15

One "signal on" electrochemical sensing strategy was constructed for the detection of a specific hepatitis B virus (HBV) gene sequence based on the protection-displacement-hybridization-based (PDHB) signaling mechanism. This sensing system is composed of three probes, one capturing probe (CP) and one assistant probe (AP) which are co-immobilized on the Au electrode surface, and one 3-methylene blue (MB) modified signaling probe (SP) free in the detection solution. One duplex is formed between AP and SP with the target, a specific HBV gene sequence, hybridizing with CP. This structure can drive the MB labels close to the electrode surface, thereby producing a large detection current. Two electrochemical testing techniques, alternating current voltammetry (ACV) and cyclic voltammetry (CV), were used for characterizing the sensor. Under the optimized conditions, the proposed sensor exhibits a high sensitivity with the detection limit of ∼5 fM for the target. When used for the discrimination of point mutation, the sensor also shows an outstanding ability and a peculiar high adjustability. Copyright © 2016 Elsevier B.V. All rights reserved.
A hybrid model based on neural networks for biomedical relation extraction.

PubMed

Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Zhang, Shaowu; Sun, Yuanyuan; Yang, Liang

2018-05-01

Biomedical relation extraction can automatically extract high-quality biomedical relations from biomedical texts, which is a vital step for the mining of biomedical knowledge hidden in the literature. Recurrent neural networks (RNNs) and convolutional neural networks (CNNs) are two major neural network models for biomedical relation extraction. Neural network-based methods for biomedical relation extraction typically focus on the sentence sequence and employ RNNs or CNNs to learn the latent features from sentence sequences separately. However, RNNs and CNNs have their own advantages for biomedical relation extraction. Combining RNNs and CNNs may improve biomedical relation extraction. In this paper, we present a hybrid model for the extraction of biomedical relations that combines RNNs and CNNs. First, the shortest dependency path (SDP) is generated based on the dependency graph of the candidate sentence. To make full use of the SDP, we divide the SDP into a dependency word sequence and a relation sequence. Then, RNNs and CNNs are employed to automatically learn the features from the sentence sequence and the dependency sequences, respectively. Finally, the output features of the RNNs and CNNs are combined to detect and extract biomedical relations. We evaluate our hybrid model using five public (protein-protein interaction) PPI corpora and a (drug-drug interaction) DDI corpus. The experimental results suggest that the advantages of RNNs and CNNs in biomedical relation extraction are complementary. Combining RNNs and CNNs can effectively boost biomedical relation extraction performance. Copyright © 2018 Elsevier Inc. All rights reserved.
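The shortest dependency path (SDP) step described in the abstract above, and its split into a word sequence and a relation sequence, can be sketched as follows. This is an illustrative reconstruction, not the authors' code: it assumes the dependency graph is given as (head, relation, dependent) triples, treats edges as undirected, and uses breadth-first search. The toy sentence, the relation labels, and the function name `shortest_dependency_path` are all hypothetical.

```python
from collections import deque
from typing import Dict, List, Optional, Tuple

# One dependency edge: (head word, relation label, dependent word).
Edge = Tuple[str, str, str]


def shortest_dependency_path(edges: List[Edge], source: str, target: str
                             ) -> Tuple[List[str], List[str]]:
    """Return (word sequence, relation sequence) along the shortest path."""
    graph: Dict[str, List[Tuple[str, str]]] = {}
    for head, rel, dep in edges:
        graph.setdefault(head, []).append((dep, rel))
        graph.setdefault(dep, []).append((head, rel))  # traverse both ways

    # Breadth-first search, remembering how each word was reached.
    parent: Dict[str, Optional[Tuple[str, str]]] = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            break
        for neighbour, rel in graph.get(node, []):
            if neighbour not in parent:
                parent[neighbour] = (node, rel)
                queue.append(neighbour)
    if target not in parent:
        raise ValueError("no dependency path between the two entities")

    # Walk back from the target to the source, then reverse both sequences.
    words, relations = [target], []
    node = target
    while parent[node] is not None:
        previous, rel = parent[node]
        words.append(previous)
        relations.append(rel)
        node = previous
    return words[::-1], relations[::-1]


if __name__ == "__main__":
    # Toy parse of "ProteinA interacts with ProteinB" (labels are illustrative).
    toy_edges = [
        ("interacts", "nsubj", "ProteinA"),
        ("interacts", "nmod", "ProteinB"),
        ("ProteinB", "case", "with"),
    ]
    print(shortest_dependency_path(toy_edges, "ProteinA", "ProteinB"))
```

In the model described, the two sequences would then be fed to the neural encoders; that part depends on the authors' architecture and is not sketched here.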

Functional PMS2 Hybrid Alleles Containing a Pseudogene-Specific Missense Variant Trace Back to a Single Ancient Intrachromosomal Recombination Event

PubMed Central

Ganster, Christina; Wernstedt, Annekatrin; Kehrer-Sawatzki, Hildegard; Messiaen, Ludwine; Schmidt, Konrad; Rahner, Nils; Heinimann, Karl; Fonatsch, Christa; Zschocke, Johannes; Wimmer, Katharina

2012-01-01

Sequence exchange between PMS2 and its pseudogene PMS2CL, embedded in an inverted duplication on chromosome 7p22, has been reported to be an ongoing process that leads to functional PMS2 hybrid alleles containing PMS2- and PMS2CL-specific sequence variants at the 5′-and the 3′-end, respectively. The frequency of PMS2 hybrid alleles, their biological significance, and the mechanisms underlying their formation are largely unknown. Here we show that overall hybrid alleles account for one-third of 384 PMS2 alleles analyzed in individuals of different ethnic backgrounds. Depending on the population, 14–60% of hybrid alleles carry PMS2CL-specific sequences in exons 13–15, the remainder only in exon 15. We show that exons 13–15 hybrid alleles, named H1 hybrid alleles, constitute different haplotypes but trace back to a single ancient intrachromosomal recombination event with crossover. Taking advantage of an ancestral sequence variant specific for all H1 alleles we developed a simple gDNA-based polymerase chain reaction (PCR) assay that can be used to identify H1-allele carriers with high sensitivity and specificity (100 and 99%, respectively). Because H1 hybrid alleles harbor missense variant p.N775S of so far unknown functional significance, we assessed the H1-carrier frequency in 164 colorectal cancer patients. So far, we found no indication that the variant plays a major role with regard to cancer susceptibility. PMID:20186689
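For readers less familiar with the two assay metrics quoted in the abstract above, the sketch below shows how sensitivity and specificity are conventionally computed from a confusion table. The counts in the example are invented purely for illustration and are not taken from the study; the function names are hypothetical.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of true carriers that the assay calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of non-carriers that the assay calls negative."""
    return true_neg / (true_neg + false_pos)


if __name__ == "__main__":
    # Illustrative counts only (NOT data from the paper).
    tp, fn, tn, fp = 50, 0, 99, 1
    print(f"sensitivity = {sensitivity(tp, fn):.0%}")
    print(f"specificity = {specificity(tn, fp):.0%}")
```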
Functional PMS2 hybrid alleles containing a pseudogene-specific missense variant trace back to a single ancient intrachromosomal recombination event.

PubMed

Ganster, Christina; Wernstedt, Annekatrin; Kehrer-Sawatzki, Hildegard; Messiaen, Ludwine; Schmidt, Konrad; Rahner, Nils; Heinimann, Karl; Fonatsch, Christa; Zschocke, Johannes; Wimmer, Katharina

2010-05-01

Sequence exchange between PMS2 and its pseudogene PMS2CL, embedded in an inverted duplication on chromosome 7p22, has been reported to be an ongoing process that leads to functional PMS2 hybrid alleles containing PMS2- and PMS2CL-specific sequence variants at the 5'-and the 3'-end, respectively. The frequency of PMS2 hybrid alleles, their biological significance, and the mechanisms underlying their formation are largely unknown. Here we show that overall hybrid alleles account for one-third of 384 PMS2 alleles analyzed in individuals of different ethnic backgrounds. Depending on the population, 14-60% of hybrid alleles carry PMS2CL-specific sequences in exons 13-15, the remainder only in exon 15. We show that exons 13-15 hybrid alleles, named H1 hybrid alleles, constitute different haplotypes but trace back to a single ancient intrachromosomal recombination event with crossover. Taking advantage of an ancestral sequence variant specific for all H1 alleles we developed a simple gDNA-based polymerase chain reaction (PCR) assay that can be used to identify H1-allele carriers with high sensitivity and specificity (100 and 99%, respectively). Because H1 hybrid alleles harbor missense variant p.N775S of so far unknown functional significance, we assessed the H1-carrier frequency in 164 colorectal cancer patients. So far, we found no indication that the variant plays a major role with regard to cancer susceptibility. (c) 2010 Wiley-Liss, Inc.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
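The idea of covering a user-defined sequence with segments that share a chosen hybridizing overlap can be illustrated with a small sketch. It is only an illustration under simplifying assumptions: the function names `tile_oligos` and `reassemble`, the example sequence, and the single-strand treatment (no alternating strands, no polymerase step, no temporal separation) are hypothetical and are not taken from the patent.

```python
def tile_oligos(target: str, length: int, overlap: int) -> list[str]:
    """Split `target` into segments of at most `length` bases in which
    consecutive segments share exactly `overlap` bases."""
    if not 0 < overlap < length:
        raise ValueError("overlap must be positive and smaller than length")
    step = length - overlap
    oligos = []
    start = 0
    while True:
        oligos.append(target[start:start + length])
        if start + length >= len(target):
            break  # the last segment reaches the end of the target
        start += step
    return oligos


def reassemble(oligos: list[str], overlap: int) -> str:
    """Join segments back together, checking every shared overlap."""
    sequence = oligos[0]
    for oligo in oligos[1:]:
        if sequence[-overlap:] != oligo[:overlap]:
            raise ValueError("adjacent segments do not share the expected overlap")
        sequence += oligo[overlap:]
    return sequence


# Hypothetical 24-base target, 10-base segments, 4-base overlap.
example_target = "ATGCGTACGTTAGCCTAGGATCCA"
example_oligos = tile_oligos(example_target, length=10, overlap=4)
```

Changing `length` between calls gives segment sets of length n, n+1, n+2 and so on, which loosely mirrors the embodiment with starting fragments of different lengths.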
Fluorescence Detection of KRAS2 mRNA Hybridization in Lung Cancer Cells with PNA-Peptides Containing an Internal Thiazole Orange

PubMed Central

2015-01-01

We previously developed reporter-peptide nucleic acid (PNA)-peptides for sequence-specific radioimaging and fluorescence imaging of particular mRNAs in cells and tumors. However, a direct test for PNA-peptide hybridization with RNA in the cytoplasm would be desirable. Thiazole orange (TO) dye at the 5′ end of a hybridization agent shows a strong increase in fluorescence quantum yield when stacked upon a 5′ terminal base pair, in solution and in cells. We hypothesized that hybridization agents with an internal TO could distinguish a single base mutation in RNA. Thus, we designed KRAS2 PNA-IGF1 tetrapeptide agents with an internal TO adjacent to the middle base of the 12th codon, a frequent site of cancer-initiating mutations. Our molecular dynamics calculations predicted a disordered bulge with weaker hybridization resulting from a single RNA mismatch. We observed that single-stranded PNA-IGF1 tetrapeptide agents with an internal TO showed low fluorescence, but fluorescence escalated 5–6-fold upon hybridization with KRAS2 RNA. Circular dichroism melting curves showed ∼10 °C higher Tm for fully complementary vs single base mismatch TO-PNA-peptide agent duplexes with KRAS2 RNA. Fluorescence measurements of treated human lung cancer cells similarly showed elevated cytoplasmic fluorescence intensity with fully complementary vs single base mismatch agents. Sequence-specific elevation of internal TO fluorescence is consistent with our hypothesis of detecting cytoplasmic PNA-peptide:RNA hybridization if a mutant agent encounters the corresponding mutant mRNA. PMID:25180641
Array-based detection of genetic alterations associated with disease

DOEpatents

Pinkel, Daniel; Albertson, Donna G.; Gray, Joe W.

2017-09-05

The present invention relates to DNA sequences from regions of copy number change on chromosome 20. The sequences can be used in hybridization methods for the identification of chromosomal abnormalities associated with various diseases.
Array-based detection of genetic alterations associated with disease

DOEpatents

Pinkel, Daniel; Albertson, Donna G.; Gray, Joe W.

2007-09-11

The present invention relates to DNA sequences from regions of copy number change on chromosome 20. The sequences can be used in hybridization methods for the identification of chromosomal abnormalities associated with various diseases.
The illusion of specific capture: surface and solution studies of suboptimal oligonucleotide hybridization

PubMed Central

2013-01-01

Background: Hybridization-based assays and capture systems depend on the specificity of hybridization between a probe and its intended target. A common guideline in the construction of DNA microarrays, for instance, is that avoiding complementary stretches of more than 15 nucleotides in a 50- or 60-mer probe will eliminate sequence-specific cross-hybridization reactions. Here we present a study of the behavior of partially matched oligonucleotide pairs with complementary stretches starting well below this threshold complementarity length – in silico, in solution, and at the microarray surface. The modeled behavior of pairs of oligonucleotide probes and their targets suggests that even a complementary stretch of sequence 12 nt in length would give rise to specific cross-hybridization. We designed a set of binding partners to a 50-mer oligonucleotide containing complementary stretches from 6 nt to 21 nt in length. Results: Solution melting experiments demonstrate that stable partial duplexes can form when only 12 bp of complementary sequence are present; surface hybridization experiments confirm that a signal close in magnitude to full-strength signal can be obtained from hybridization of a 12 bp duplex within a 50-mer oligonucleotide. Conclusions: Microarray and other molecular capture strategies that rely on a 15 nt lower complementarity bound for eliminating specific cross-hybridization may not be sufficiently conservative. PMID:23445545
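A screening rule of the kind discussed in this abstract needs the length of the longest complementary stretch between a probe and a candidate off-target. The sketch below computes it as the longest common substring between the probe and the reverse complement of the target. The function names and the example sequences are hypothetical, both strands are assumed to be written 5'→3' using only A, C, G and T, and the sketch performs no thermodynamic modeling.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_complementary_stretch(probe: str, target: str) -> int:
    """Length of the longest contiguous run over which `probe` can base-pair
    with `target` in antiparallel orientation."""
    rc = reverse_complement(target)
    best = 0
    prev = [0] * (len(rc) + 1)
    for base in probe:
        cur = [0] * (len(rc) + 1)
        for j, other in enumerate(rc, start=1):
            if base == other:
                # extend the run that ended at the previous position of both strings
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


# Hypothetical probe/target pair sharing one 12-nt complementary core.
core = "GACCTGACTGAC"
example_probe = "TTTTT" + core + "TTTTT"
example_target = "CCCCC" + reverse_complement(core) + "CCCCC"
stretch = longest_complementary_stretch(example_probe, example_target)
```

A design pipeline could compare `stretch` against whatever bound it uses; the abstract argues that a 15 nt bound may be too permissive because 12 nt stretches already produced stable duplexes.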
Genetic Relationships among Hylocereus and Selenicereus Vine Cacti (Cactaceae): Evidence from Hybridization and Cytological Studies

PubMed Central

TEL-ZUR, NOEMI; ABBO, SHAHAL; BAR-ZVI, DUDY; MIZRAHI, YOSEF

2004-01-01

• Background and Aims: Hylocereus and Selenicereus are native to tropical and sub-tropical America. Based on its taxonomic status and crossability relations it was postulated that H. megalanthus (syn. S. megalanthus) is an allotetraploid (2n = 4x = 44) derived from natural hybridization between two closely related diploid taxa. The present work aimed at elucidating the genetic relationships between species of the two genera. • Methods: Crosses were performed and the putative hybrids were analysed by chromosome counts and morphological traits. The ploidy level of hybrids was confirmed by fluorescent in situ hybridization (FISH) of rDNA sites. Genomic in situ hybridization (GISH) was used in an attempt to identify the putative diploid genome donors of H. megalanthus and an artificial interploid hybrid. • Key Results: Reciprocal crosses among four diploid Hylocereus species (H. costaricensis, H. monacanthus (syn. H. polyrhizus), H. undatus and Hylocereus sp.) yielded viable diploid hybrids, with regular chromosome pairing. Reciprocal crosses between these Hylocereus spp. and H. megalanthus yielded viable triploid, pentaploid, hexaploid and aneuploid hybrids. Morphological and phenological traits confirm the hybrid origin. In situ detection of rDNA sites was in accord with the ploidy status of the species and hybrid studied. GISH results indicated that overall sequence composition of H. megalanthus is similar to that of H. ocamponis and S. grandiflorus. High sequence similarity was also found between the parental genomes of H. monacanthus and H. megalanthus in one triploid hybrid. • Conclusions: The ease of obtaining partially fertile F1 hybrids and the relative sequence similarity (in GISH study) suggest close genetic relationships among the taxa analysed. PMID:15329334
"iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition.

PubMed

Iqbal, Muhammad; Hayat, Maqsood

2016-05-01

Gene splicing is a vital source of protein diversity. Precise removal of introns and joining of exons is a central task in eukaryotic gene expression, as exons are usually interrupted by introns. Identification of splicing sites through experimental techniques is a complicated and time-consuming task. With the avalanche of genome sequences generated in the post-genomic age, it remains a complicated and challenging task to develop an automatic, robust and reliable computational method for fast and effective identification of splicing sites. In this study, a hybrid model "iSS-Hyb-mRMR" is proposed for quick and accurate identification of splicing sites. Two sample representation methods, namely pseudo trinucleotide composition (PseTNC) and pseudo tetranucleotide composition (PseTetraNC), were used to extract numerical descriptors from DNA sequences. The hybrid model was developed by concatenating PseTNC and PseTetraNC. In order to select highly discriminative features, the minimum redundancy maximum relevance algorithm was applied to the hybrid feature space. The performance of these feature representation methods was tested using various classification algorithms including K-nearest neighbor, probabilistic neural network, general regression neural network, and fitting network. The jackknife test was used to evaluate performance on two benchmark datasets, S1 and S2. The proposed predictor achieved an accuracy of 93.26%, sensitivity of 88.77%, and specificity of 97.78% for S1, and an accuracy of 94.12%, sensitivity of 87.14%, and specificity of 98.64% for S2. The performance of the proposed model is higher than that of the existing methods in the literature so far, and the model may be useful for studying the mechanism of RNA splicing and in related research. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
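The concatenation of trinucleotide and tetranucleotide descriptors into one hybrid feature space can be sketched as follows. This is a deliberate simplification: it uses plain k-mer frequencies and omits the "pseudo" correlation components of PseTNC and PseTetraNC as well as the mRMR selection step. The function names and the example sequence are hypothetical.

```python
from itertools import product


def kmer_composition(seq: str, k: int) -> list[float]:
    """Normalized frequency of every possible k-mer over A, C, G, T,
    in lexicographic order (4**k values)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing ambiguous bases
            counts[window] += 1
    total = sum(counts.values())
    return [counts[m] / total if total else 0.0 for m in kmers]


def hybrid_features(seq: str) -> list[float]:
    """Concatenate 3-mer (64) and 4-mer (256) compositions into 320 features."""
    return kmer_composition(seq, 3) + kmer_composition(seq, 4)


# Hypothetical short sequence, used only to show the shape of the output.
example_features = hybrid_features("ACGTACGTAC")
```

A feature-selection step such as mRMR would then operate on these 320 columns across all training sequences before any classifier is fitted.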
Finite element analysis when orthogonal cutting of hybrid composite CFRP/Ti

NASA Astrophysics Data System (ADS)

Xu, Jinyang; El Mansori, Mohamed

2015-07-01

Hybrid composite, especially the CFRP/Ti stack, is usually considered an innovative structural configuration for manufacturing the key load-bearing components in the modern aerospace industry. This paper proposes an original FE model to simulate the total chip formation process dominating the hybrid cutting operation. The hybrid composite model was established based on three physical constituents, i.e., Ti constituent, interface and CFRP constituent. Different constitutive models and damage criteria were introduced to replicate the interrelated cutting behaviour of the stack material. The CFRP/Ti interface was modelled as a third phase through the concept of cohesive zone (CZ). Particular attention was paid to comparative studies of the influence of different cutting-sequence strategies on the machining responses induced in hybrid stack cutting. The numerical results emphasized the pivotal role of cutting-sequence strategy in the various machining-induced responses, including cutting-force generation, machined surface quality and induced interface damage.
Quantum-dot-based quantitative identification of pathogens in complex mixture

NASA Astrophysics Data System (ADS)

Lim, Sun Hee; Bestwater, Felix; Buchy, Philippe; Mardy, Sek; Yu, Alexey Dan Chin

2010-02-01

In the present study we describe sandwich design hybridization probes consisting of magnetic particles (MP) and quantum dots (QD) with target DNA, and their application in the detection of avian influenza virus (H5N1) sequences. Hybridization of 25-, 40-, and 100-mer target DNA with both probes was analyzed and quantified by flow cytometry and fluorescence microscopy on the scale of single particles. The following steps were used in the assay: (i) target selection by MP probes and (ii) target detection by QD probes. Hybridization efficiency between MP conjugated probes and target DNA hybrids was controlled by a fluorescent dye specific for nucleic acids. Fluorescence was detected by flow cytometry to distinguish differences in oligo sequences as short as 25-mer capturing in target DNA and by gel-electrophoresis in the case of QD probes. This report shows that effective manipulation and control of micro- and nanoparticles in hybridization assays is possible.
An application of hybrid downscaling model to forecast summer precipitation at stations in China

NASA Astrophysics Data System (ADS)

Liu, Ying; Fan, Ke

2014-06-01

A pattern prediction hybrid downscaling method was applied to predict summer (June-July-August) precipitation at 160 stations in China. The predicted precipitation from the downscaling scheme is available one month in advance. Four predictors were chosen to establish the hybrid downscaling scheme. The 500-hPa geopotential height (GH5) and 850-hPa specific humidity (q85) were taken from the skillfully predicted output of three DEMETER (Development of a European Multi-model Ensemble System for Seasonal to Interannual Prediction) general circulation models (GCMs). The 700-hPa geopotential height (GH7) and sea level pressure (SLP) were taken from reanalysis datasets. The hybrid downscaling scheme (HD-4P) has better prediction skill than a conventional statistical downscaling model (SD-2P), which contains two predictors derived from the output of GCMs, although both downscaling schemes were applied to improve the seasonal prediction of summer rainfall in comparison with the original output of the DEMETER GCMs. In particular, HD-4P downscaling predictions showed lower root mean square errors than those based on the SD-2P model. Furthermore, the HD-4P downscaling model reproduced the China summer precipitation anomaly centers in 1998 more accurately than the SD-2P model. A hybrid downscaling prediction should be effective in improving the prediction skill of summer rainfall at stations in China.
Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes.

PubMed

Fredlake, Christopher P; Hert, Daniel G; Kan, Cheuk-Wai; Chiesl, Thomas N; Root, Brian E; Forster, Ryan E; Barron, Annelise E

2008-01-15

To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require approximately 70 min to deliver approximately 650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered "hybrid" mechanism of DNA electromigration, in which DNA molecules alternate rapidly between reptating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs.
Hybridization properties of long nucleic acid probes for detection of variable target sequences, and development of a hybridization prediction algorithm

PubMed Central

Öhrmalm, Christina; Jobs, Magnus; Eriksson, Ronnie; Golbob, Sultan; Elfaitouri, Amal; Benachenhou, Farid; Strømme, Maria; Blomberg, Jonas

2010-01-01

One of the main problems in nucleic acid-based techniques for detection of infectious agents, such as influenza viruses, is that of nucleic acid sequence variation. DNA probes, 70-nt long, some including the nucleotide analog deoxyribose-Inosine (dInosine), were analyzed for hybridization tolerance to different amounts and distributions of mismatching bases, e.g. synonymous mutations, in target DNA. Microsphere-linked 70-mer probes were hybridized in 3M TMAC buffer to biotinylated single-stranded (ss) DNA for subsequent analysis in a Luminex® system. When mismatches interrupted contiguous matching stretches of 6 nt or longer, it had a strong impact on hybridization. Contiguous matching stretches are more important than the same number of matching nucleotides separated by mismatches into several regions. dInosine, but not 5-nitroindole, substitutions at mismatching positions stabilized hybridization remarkably well, comparable to N (4-fold) wobbles in the same positions. In contrast to shorter probes, 70-nt probes with judiciously placed dInosine substitutions and/or wobble positions were remarkably mismatch tolerant, with preserved specificity. An algorithm, NucZip, was constructed to model the nucleation and zipping phases of hybridization, integrating both local and distant binding contributions. It predicted hybridization more exactly than previous algorithms, and has the potential to guide the design of variation-tolerant yet specific probes. PMID:20864443
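The observation that contiguous matching stretches matter more than the raw number of matching nucleotides can be made concrete by listing the matching runs of an ungapped probe/target alignment. The sketch below is not NucZip and does not model nucleation or zipping; the function names, the toy 24-nt probe, and the mutation helper are hypothetical, and the target is assumed to be written in the same sense as the probe so that a match means identical characters.

```python
def matching_runs(probe: str, target: str) -> list[int]:
    """Lengths of the contiguous matching stretches in an ungapped alignment."""
    if len(probe) != len(target):
        raise ValueError("sequences must be aligned to the same length")
    runs, current = [], 0
    for p, t in zip(probe, target):
        if p == t:
            current += 1
        else:
            if current:
                runs.append(current)
            current = 0
    if current:
        runs.append(current)
    return runs


def mutate(seq: str, positions: list[int]) -> str:
    """Introduce a mismatch at each listed position (toy substitution table)."""
    swap = {"A": "C", "C": "A", "G": "T", "T": "G"}
    bases = list(seq)
    for pos in positions:
        bases[pos] = swap[bases[pos]]
    return "".join(bases)


toy_probe = "ACGT" * 6                            # 24 nt
clustered = mutate(toy_probe, [21, 22, 23])       # three mismatches at one end
distributed = mutate(toy_probe, [5, 11, 17])      # three mismatches spread out
clustered_runs = matching_runs(toy_probe, clustered)
distributed_runs = matching_runs(toy_probe, distributed)
```

Both toy targets carry three mismatches, yet the clustered one keeps a single 21-nt stretch while the distributed one is broken into stretches of 5 to 6 nt, which is the kind of difference the abstract reports as decisive for hybridization.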
Chromosome painting - principles, strategies and scope.

PubMed

Sharma, A K; Sharma, A

2001-01-01

Chromosome Painting is emerging as a powerful tool in the exact localization of different gene sequences of chromosomes at the microscopic level. It is principally based on molecular hybridization in situ with sequence specific probes on chromosomes. Different strategies have been adopted for the preparation of probes, hybridization and visualization. The impact of this method lies in identification of genes for desired characters in the chromosomes, including those of genetic disorders, in cancer research, in transgenesis and in studies on biodiversity and evolution.
Development of a Fluorescence Resonance Energy Transfer (FRET)-Based DNA Biosensor for Detection of Synthetic Oligonucleotide of Ganoderma boninense

PubMed Central

Mohd Bakhori, Noremylia; Yusof, Nor Azah; Abdullah, Abdul Halim; Hussein, Mohd Zobir

2013-01-01

An optical DNA biosensor based on fluorescence resonance energy transfer (FRET) utilizing synthesized quantum dot (QD) has been developed for the detection of a specific DNA sequence of Ganoderma boninense, an oil palm pathogen. Modified QD that contained carboxylic groups was conjugated with a single-stranded DNA probe (ssDNA) via amide-linkage. Hybridization of the target DNA with conjugated QD-ssDNA and reporter probe labeled with Cy5 allows for the detection of related synthetic DNA sequence of Ganoderma boninense gene based on FRET signals. Detection of FRET emission before and after hybridization was confirmed through the capability of the system to produce FRET at 680 nm for hybridized sandwich with complementary target DNA. No FRET emission was observed for the non-complementary system. Hybridization time, temperature and the effect of different concentrations of target DNA were studied in order to optimize the developed system. The developed biosensor has shown high sensitivity with a detection limit of 3.55 × 10⁻⁹ M. TEM results show that the particle size of QD ranges from 5 to 8 nm after ligand modification and conjugation with ssDNA. This approach is capable of providing a simple, rapid and sensitive method for detection of related synthetic DNA sequence of Ganoderma boninense. PMID:25587406
Predictors of transitions from single to multiple job holding: Results of a longitudinal study among employees aged 45-64 in the Netherlands.

PubMed

Bouwhuis, Stef; Geuskens, Goedele A; Boot, Cécile R L; Bongers, Paulien M; van der Beek, Allard J

2017-08-01

To construct prediction models for transitions to combination multiple job holding (MJH) (multiple jobs as an employee) and hybrid MJH (being an employee and self-employed), among employees aged 45-64. A total of 5187 employees in the Netherlands completed online questionnaires annually between 2010 and 2013. We applied logistic regression analyses with a backward elimination strategy to construct prediction models. Transitions to combination MJH and hybrid MJH were best predicted by a combination of factors including: demographics, health and mastery, work characteristics, work history, skills and knowledge, social factors, and financial factors. Not having a permanent contract and a poor household financial situation predicted both transitions. Some predictors only predicted combination MJH, e.g., working part-time, or hybrid MJH, e.g., work-home interference. A wide variety of factors predict combination MJH and/or hybrid MJH. The prediction model approach allowed for the identification of predictors that have not been previously studied. © 2017 Wiley Periodicals, Inc.
Preparation of genosensor for detection of specific DNA sequence of the hepatitis B virus

NASA Astrophysics Data System (ADS)

Honorato Castro, Ana C.; França, Erick G.; de Paula, Lucas F.; Soares, Marcia M. C. N.; Goulart, Luiz R.; Madurro, João M.; Brito-Madurro, Ana G.

2014-09-01

An electrochemical genosensor was constructed for detection of a specific DNA sequence of the hepatitis B virus, based on graphite electrodes modified with poly(4-aminophenol) and incorporating a specific oligonucleotide probe. The modified electrode containing the probe was evaluated by differential pulse voltammetry, before and after incubation with the complementary oligonucleotide target. Detection was performed by monitoring oxidizable DNA bases (direct detection) or using ethidium bromide as indicator of the hybridization process (indirect detection). The device showed a detection limit for the oligonucleotide target of 2.61 nmol L⁻¹. Indirect detection using ethidium bromide was promising in discriminating mismatches, which is a very desirable attribute for detection of disease-related point mutations. In addition, it was possible to observe differences between hybridized and non-hybridized surfaces by atomic force microscopy.

Reconstruction of the evolutionary history of Saccharomyces cerevisiae x S. kudriavzevii hybrids based on multilocus sequence analysis.

PubMed

Peris, David; Lopes, Christian A; Arias, Armando; Barrio, Eladio

2012-01-01

In recent years, interspecific hybridization and introgression are increasingly recognized as significant events in the evolution of Saccharomyces yeasts. These mechanisms have probably been involved in the origin of novel yeast genotypes and phenotypes, which in due course were to colonize and predominate in the new fermentative environments created by human manipulation. The particular conditions in which hybrids arose are still unknown, as well as the number of possible hybridization events that generated the whole set of natural hybrids described in the literature during recent years. In this study, we could infer at least six different hybridization events that originated a set of 26 S. cerevisiae x S. kudriavzevii hybrids isolated from both fermentative and non-fermentative environments. Different wine S. cerevisiae strains and European S. kudriavzevii strains were probably involved in the hybridization events according to gene sequence information, as well as from previous data on their genome composition and ploidy. Finally, we postulate that these hybrids may have originated after the introduction of vine growing and winemaking practices by the Romans to the present Northern vine-growing limits and spread during the expansion of improved viticulture and enology practices that occurred during the Late Middle Ages.
Transcriptional mapping of the ribosomal RNA region of mouse L-cell mitochondrial DNA.

PubMed Central

Nagley, P; Clayton, D A

1980-01-01

The map positions in mouse mitochondrial DNA of the two ribosomal RNA genes and adjacent genes coding several small transcripts have been determined precisely by application of a procedure in which DNA-RNA hybrids have been subjected to digestion by S1 nuclease under conditions of varying severity. Digestion of the DNA-RNA hybrids with S1 nuclease yielded a series of species which were shown to contain ribosomal RNA molecules together with adjacent transcripts hybridized conjointly to a continuous segment of mitochondrial DNA. There is one small transcript about 60 bases long whose gene adjoins the sequences coding the 5'-end of the small ribosomal RNA (950 bases) and which lies approximately 200 nucleotides from the D-loop origin of heavy strand mitochondrial DNA synthesis. An 80-base transcript lies between the small and large ribosomal RNA genes, and genes for two further short transcripts (each about 80 bases in length) abut the sequences coding the 3'-end of the large ribosomal RNA (approximately 1500 bases). The ability to isolate a discrete DNA-RNA hybrid species approximately 2700 base pairs in length containing all these transcripts suggests that there can be few nucleotides in this region of mouse mitochondrial DNA which are not represented as stable RNA species. PMID:6253898
Hybrid Organic-Inorganic Films Grown Using Molecular Layer Deposition

DTIC Science & Technology

2011-03-01

shown that zincone films based on DEZ and hydroquinone (HQ) have displayed some conductivity when alloyed with ZnO ALD films [35]. The schematic...11 Schematic showing the two-step reaction sequence for AB zincone MLD growth using diethylzinc (DEZ) and hydroquinone (HQ). The hybrid organic
An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

PubMed

Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

2011-01-01

cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.
TargetCrys: protein crystallization prediction by fusing multi-view features with two-layered SVM.

PubMed

Hu, Jun; Han, Ke; Li, Yang; Yang, Jing-Yu; Shen, Hong-Bin; Yu, Dong-Jun

2016-11-01

The accurate prediction of whether a protein will crystallize plays a crucial role in improving the success rate of protein crystallization projects. A common critical problem in the development of machine-learning-based protein crystallization predictors is how to effectively utilize protein features extracted from different views. In this study, we aimed to improve the efficiency of fusing multi-view protein features by proposing a new two-layered SVM (2L-SVM) which switches the feature-level fusion problem to a decision-level fusion problem: the SVMs in the 1st layer of the 2L-SVM are trained on each of the multi-view feature sets; then, the outputs of the 1st layer SVMs, which are the "intermediate" decisions made based on the respective feature sets, are further ensembled by a 2nd layer SVM. Based on the proposed 2L-SVM, we implemented a sequence-based protein crystallization predictor called TargetCrys. Experimental results on several benchmark datasets demonstrated the efficacy of the proposed 2L-SVM for fusing multi-view features. We also compared TargetCrys with existing sequence-based protein crystallization predictors and demonstrated that the proposed TargetCrys outperformed most of the existing predictors and is competitive with the state-of-the-art predictors. The TargetCrys webserver and datasets used in this study are freely available for academic use at: http://csbio.njust.edu.cn/bioinf/TargetCrys .
A graphical language for reliability model generation

NASA Technical Reports Server (NTRS)

Howell, Sandra V.; Bavuso, Salvatore J.; Haley, Pamela J.

1990-01-01

A graphical interface capability of the hybrid automated reliability predictor (HARP) is described. The graphics-oriented (GO) module provides the user with a graphical language for modeling system failure modes through the selection of various fault tree gates, including sequence dependency gates, or by a Markov chain. With this graphical input language, a fault tree becomes a convenient notation for describing a system. In accounting for any sequence dependencies, HARP converts the fault-tree notation to a complex stochastic process that is reduced to a Markov chain which it can then solve for system reliability. The graphics capability is available for use on an IBM-compatible PC, a Sun, and a VAX workstation. The GO module is written in the C programming language and uses the Graphical Kernel System (GKS) standard for graphics implementation. The PC, VAX, and Sun versions of the HARP GO module are currently in beta-testing.
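Why a sequence dependency pushes a fault tree toward a Markov chain can be shown with a two-component example in which the top event occurs only if component A fails before component B. The sketch below builds the four-state chain by hand and integrates it with a fixed-step Runge-Kutta scheme, then compares the result with the closed-form probability. It is not HARP's conversion or solution algorithm; the state layout, the function names, and the failure rates are assumptions chosen for illustration.

```python
import math


def derivative(p, rate_a, rate_b):
    """Kolmogorov forward equations for the states
    0: both up, 1: A failed first, 2: B failed first (top event can no
    longer occur), 3: A then B failed (top event)."""
    p0, p1, p2, p3 = p
    return (
        -(rate_a + rate_b) * p0,
        rate_a * p0 - rate_b * p1,
        rate_b * p0,
        rate_b * p1,
    )


def _advance(p, slope, h):
    return tuple(x + h * dx for x, dx in zip(p, slope))


def state_probabilities(rate_a, rate_b, t_end, steps=1000):
    """Integrate the chain from 'both up' to time t_end with classical RK4."""
    p = (1.0, 0.0, 0.0, 0.0)
    h = t_end / steps
    for _ in range(steps):
        k1 = derivative(p, rate_a, rate_b)
        k2 = derivative(_advance(p, k1, h / 2), rate_a, rate_b)
        k3 = derivative(_advance(p, k2, h / 2), rate_a, rate_b)
        k4 = derivative(_advance(p, k3, h), rate_a, rate_b)
        p = tuple(
            x + h / 6 * (a + 2 * b + 2 * c + d)
            for x, a, b, c, d in zip(p, k1, k2, k3, k4)
        )
    return p


def closed_form(rate_a, rate_b, t):
    """P(A fails before B and B has failed by time t), exponential lifetimes."""
    total = rate_a + rate_b
    return (1 - math.exp(-rate_b * t)) - rate_b / total * (1 - math.exp(-total * t))


# Hypothetical failure rates per hour and a 1000-hour mission time.
probabilities = state_probabilities(1e-3, 2e-3, 1000.0)
top_event_probability = probabilities[3]
```

An ordinary AND gate would need only the product of the two failure probabilities; the ordering requirement is what forces the extra states.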
Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2000-01-01

A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.
HiRel: Hybrid Automated Reliability Predictor (HARP) integrated reliability tool system, (version 7.0). Volume 2: HARP tutorial

NASA Technical Reports Server (NTRS)

Rothmann, Elizabeth; Dugan, Joanne Bechta; Trivedi, Kishor S.; Mittal, Nitin; Bavuso, Salvatore J.

1994-01-01

The Hybrid Automated Reliability Predictor (HARP) integrated Reliability (HiRel) tool system for reliability/availability prediction offers a toolbox of integrated reliability/availability programs that can be used to customize the user's application in a workstation or nonworkstation environment. The Hybrid Automated Reliability Predictor (HARP) tutorial provides insight into HARP modeling techniques and the interactive textual prompting input language via a step-by-step explanation and demonstration of HARP's fault occurrence/repair model and the fault/error handling models. Example applications are worked in their entirety and the HARP tabular output data are presented for each. Simple models are presented at first with each succeeding example demonstrating greater modeling power and complexity. This document is not intended to present the theoretical and mathematical basis for HARP.
Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees.

PubMed

Williams, Philip H; Eyles, Rod; Weiller, Georg

2012-01-01

MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require "read count" to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA:miRNA* duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.
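The sensitivity and specificity figures quoted in this abstract follow the usual definitions, which the short Python sketch below spells out. The function name and the toy labels are hypothetical and are not the study's data; 1 marks a true miRNA and 0 a non-miRNA.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)  # fraction of true miRNAs that were found
    specificity = tn / (tn + fp)  # fraction of non-miRNAs correctly rejected
    return sensitivity, specificity


if __name__ == "__main__":
    # Toy example: 4 positives (one missed) and 6 negatives (one false alarm)
    y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
    y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
    print(sensitivity_specificity(y_true, y_pred))
```

In a leave-one-out setting, each entry of `y_pred` would come from a model trained on all the other samples, so the two ratios summarize performance on data the model has not seen.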
De Novo Centromere Formation and Centromeric Sequence Expansion in Wheat and its Wide Hybrids.

PubMed

Guo, Xiang; Su, Handong; Shi, Qinghua; Fu, Shulan; Wang, Jing; Zhang, Xiangqi; Hu, Zanmin; Han, Fangpu

2016-04-01

Centromeres typically contain tandem repeat sequences, but centromere function does not necessarily depend on these sequences. We identified functional centromeres with significant quantitative changes in the centromeric retrotransposons of wheat (CRW) contents in wheat aneuploids (Triticum aestivum) and the offspring of wheat wide hybrids. The CRW signals were strongly reduced or essentially lost in some wheat ditelosomic lines and in the addition lines from the wide hybrids. The total loss of the CRW sequences but the presence of CENH3 in these lines suggests that the centromeres were formed de novo. In wheat and its wide hybrids, which carry large complex genomes or no sequenced genome, we performed CENH3-ChIP-dot-blot methods alone or in combination with CENH3-ChIP-seq and identified the ectopic genomic sequences present at the new centromeres. In addition, the transcription of the identified DNA sequences was remarkably increased at the new centromere, suggesting that the transcription of the corresponding sequences may be associated with de novo centromere formation. Stable alien chromosomes with two and three regions containing CRW sequences induced by centromere breakage were observed in the wheat-Th. elongatum hybrid derivatives, but only one was a functional centromere. In wheat-rye (Secale cereale) hybrids, the rye centromere-specific sequences spread along the chromosome arms and may have caused centromere expansion. Frequent and significant quantitative alterations in the centromere sequence via chromosomal rearrangement have been systematically described in wheat wide hybridizations, which may affect the retention or loss of the alien chromosomes in the hybrids. Thus, the centromere behavior in wide crosses likely has an important impact on the generation of biodiversity, which ultimately has implications for speciation.
De Novo Centromere Formation and Centromeric Sequence Expansion in Wheat and its Wide Hybrids

PubMed Central

Fu, Shulan; Wang, Jing; Zhang, Xiangqi; Hu, Zanmin; Han, Fangpu

2016-01-01

Centromeres typically contain tandem repeat sequences, but centromere function does not necessarily depend on these sequences. We identified functional centromeres with significant quantitative changes in the centromeric retrotransposons of wheat (CRW) contents in wheat aneuploids (Triticum aestivum) and the offspring of wheat wide hybrids. The CRW signals were strongly reduced or essentially lost in some wheat ditelosomic lines and in the addition lines from the wide hybrids. The total loss of the CRW sequences but the presence of CENH3 in these lines suggests that the centromeres were formed de novo. In wheat and its wide hybrids, which carry large complex genomes or no sequenced genome, we performed CENH3-ChIP-dot-blot methods alone or in combination with CENH3-ChIP-seq and identified the ectopic genomic sequences present at the new centromeres. In addition, the transcription of the identified DNA sequences was remarkably increased at the new centromere, suggesting that the transcription of the corresponding sequences may be associated with de novo centromere formation. Stable alien chromosomes with two and three regions containing CRW sequences induced by centromere breakage were observed in the wheat-Th. elongatum hybrid derivatives, but only one was a functional centromere. In wheat-rye (Secale cereale) hybrids, the rye centromere-specific sequences spread along the chromosome arms and may have caused centromere expansion. Frequent and significant quantitative alterations in the centromere sequence via chromosomal rearrangement have been systematically described in wheat wide hybridizations, which may affect the retention or loss of the alien chromosomes in the hybrids. Thus, the centromere behavior in wide crosses likely has an important impact on the generation of biodiversity, which ultimately has implications for speciation. PMID:27110907
Two molecular markers based on mitochondrial genomes for varieties identification of the northern snakehead (Channa argus) and blotched snakehead (Channa maculata) and their reciprocal hybrids.

PubMed

Xincheng, Zhang; Kunci, Chen; Xinping, Zhu; Jian, Zhao; Qing, Luo; Xiaoyou, Hong; Wei, Li; Fengfang, Xiao

2015-08-01

The northern snakehead (Channa argus) and blotched snakehead (Channa maculata) and their reciprocal hybrids have played important roles in the Chinese freshwater aquaculture industry, with an annual production in China exceeding 400 thousand tons. While these are popular aquaculture breeds in China, it is not easy to identify northern snakehead, blotched snakehead, and their hybrids. Thus, a method should be developed to identify these varieties. To distinguish between the reciprocal hybrids (C. argus ♀ × C. maculata ♂ and C. maculata ♀ × C. argus ♂), the mitochondrial genome sequences of northern snakehead and blotched snakehead and their reciprocal hybrids were compared. Following the alignment and analysis of mtDNA sequences of northern snakehead, blotched snakehead and their hybrids, two pairs of specific primers were designed based on identified differences ranging from 12S rRNA to 16S rRNA gene. The BY1 primers amplified the same bands in the blotched snakehead and the hybrid (C. maculata ♀ × C. argus ♂), while producing no products in northern snakehead and the hybrid (C. argus ♀ × C. maculata ♂). Amplification with WY1 yielded the opposite results. Then, 30 individuals per fish were randomized to verify the primers, and the results showed that the primers were specific for breeds, as intended. The specific primers can not only simply distinguish between two kinds of hybrids, but also rapidly identify the two parents. This study provides a method of molecular marker identification to identify reciprocal hybrids.
A Highly Sensitive Oligonucleotide Hybridization Assay for Klebsiella pneumoniae Carbapenemase with the Probes on a Gold Nanoparticles Modified Glassy Carbon Electrode.

PubMed

Pan, Hong-zhi; Yu, Hong-Wei; Wang, Na; Zhang, Ze; Wan, Guang-Cai; Liu, Hao; Guan, Xue; Chang, Dong

2015-01-01

To develop a new electrochemical DNA biosensor for determination of Klebsiella pneumoniae carbapenemase, a highly sensitive and selective electrochemical biosensor for DNA detection was constructed based on a glassy carbon electrode (GCE) modified with gold nanoparticles (Au-nano). The Au-nano/GCE was characterized by scanning electron microscopy, cyclic voltammetry, and electrochemical impedance spectroscopy. The hybridization detection was measured by differential pulse voltammetry using methylene blue as the hybridization indicator. The dynamic range of detection of the sensor for the target DNA sequences was from 1 × 10⁻¹¹ to 1 × 10⁻⁸ M, with an LOD of 1 × 10⁻¹² M. The DNA biosensor had excellent specificity for distinguishing complementary DNA sequence in the presence of non-complementary and mismatched DNA sequence. The Au-nano/GCE showed significant improvement in electrochemical characteristics, and this biosensor was successfully applied for determination of K. pneumoniae.
Fast and Non-Toxic In Situ Hybridization without Blocking of Repetitive Sequences

PubMed Central

Matthiesen, Steen H.; Hansen, Charles M.

2012-01-01

Formamide is the preferred solvent to lower the melting point and annealing temperature of nucleic acid strands in in situ hybridization (ISH). A key benefit of formamide is better preservation of morphology due to a lower incubation temperature. However, in fluorescence in situ hybridization (FISH) against unique DNA targets in tissue sections, an overnight hybridization is required to obtain sufficient signal intensity. Here, we identified alternative solvents and developed a new hybridization buffer that reduces the required hybridization time to one hour (IQFISH method). Remarkably, denaturation and blocking against repetitive DNA sequences to prevent non-specific binding are not required. Furthermore, the new hybridization buffer is less hazardous than formamide-containing buffers. The results demonstrate a significantly increased hybridization rate at a lowered denaturation and hybridization temperature for both DNA and PNA (peptide nucleic acid) probes. We anticipate that these formamide-substituting solvents will become the foundation for changes in the understanding and performance of denaturation and hybridization of nucleic acids. For example, the process time for tissue-based ISH for gene aberration tests in cancer diagnostics can be reduced from days to a few hours. Furthermore, the understanding of the interactions and duplex formation of nucleic acid strands may benefit from the properties of these solvents. PMID:22911704
Triplex in-situ hybridization

DOEpatents

Fresco, Jacques R.; Johnson, Marion D.

2002-01-01

Disclosed are methods for detecting in situ the presence of a target sequence in a substantially double-stranded nucleic acid segment, which comprises: a) contacting in situ under conditions suitable for hybridization a substantially double-stranded nucleic acid segment with a detectable third strand, said third strand being capable of hybridizing to at least a portion of the target sequence to form a triple-stranded structure, if said target sequence is present; and b) detecting whether hybridization between the third strand and the target sequence has occurred.
Sensitive detection of mercury and copper ions by fluorescent DNA/Ag nanoclusters in guanine-rich DNA hybridization

NASA Astrophysics Data System (ADS)

Peng, Jun; Ling, Jian; Zhang, Xiu-Qing; Bai, Hui-Ping; Zheng, Liyan; Cao, Qiu-E.; Ding, Zhong-Tao

2015-02-01

In this work, we designed a new fluorescent oligonucleotide-stabilized silver nanocluster (DNA/AgNCs) probe for sensitive detection of mercury and copper ions. This probe contains two tailored DNA sequences. One is a signal probe that contains a cytosine-rich sequence template for AgNCs synthesis and a link sequence at both ends. The other is a guanine-rich sequence for signal enhancement with a link sequence complementary to the link sequence of the signal probe. After hybridization, the fluorescence of the hybridized double-strand DNA/AgNCs is enhanced 200-fold, based on the fluorescence enhancement effect of DNA/AgNCs in proximity to a guanine-rich DNA sequence. The double-strand DNA/AgNCs probe is brighter and more stable than single-strand DNA/AgNCs and, more importantly, can be used as a novel fluorescent probe for detecting mercury and copper ions. Mercury and copper ions in the ranges of 6.0-160.0 and 6-240 nM, respectively, can be linearly detected, with detection limits of 2.1 and 3.4 nM, respectively. Our results indicate that the analytical parameters of the method for mercury and copper ion detection are much better than those obtained using single-strand DNA/AgNCs.
Structural studies of polypeptides: Mechanism of immunoglobin catalysis and helix propagation in hybrid sequence, disulfide containing peptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Storrs, Richard Wood

1992-08-01

Catalytic immunoglobin fragments were studied by nuclear magnetic resonance (NMR) spectroscopy to identify amino acid residues responsible for the catalytic activity. Small, hybrid sequence peptides were analyzed for helix propagation following covalent initiation and for activity related to the protein from which the helical sequence was derived. Hydrolysis of p-nitrophenyl carbonates and esters by specific immunoglobins is thought to involve charge complementarity. The pK of the transition state analog p-nitrophenyl phosphate bound to the immunoglobin fragment was determined by 31P-NMR to verify the juxtaposition of a positively charged amino acid to the binding/catalytic site. Optical studies of immunoglobin-mediated photoreversal of cis, syn cyclobutane thymine dimers implicated tryptophan as the photosensitizing chromophore. Research shows the chemical environment of a single tryptophan residue is altered upon binding of the thymine dimer. This tryptophan residue was localized to within 20 Å of the binding site through the use of a nitroxide paramagnetic species covalently attached to the thymine dimer. A hybrid sequence peptide was synthesized based on the bee venom peptide apamin in which the helical residues of apamin were replaced with those from the recognition helix of the bacteriophage 434 repressor protein. Oxidation of the disulfide bonds occurred uniformly in the proper 1-11, 3-15 orientation, stabilizing the 434 sequence in an α-helix. The glycine residue stopped helix propagation. Helix propagation in 2,2,2-trifluoroethanol mixtures was investigated in a second hybrid sequence peptide using the apamin-derived disulfide scaffold and the S-peptide sequence. The helix-stop signal previously observed was not observed in the NMR NOESY spectrum. Helical connectivities were seen throughout the S-peptide sequence. The apamin/S-peptide hybrid bound to the S-protein (residues 21-166 of ribonuclease A) and reconstituted enzymatic activity.
Structural studies of polypeptides: Mechanism of immunoglobin catalysis and helix propagation in hybrid sequence, disulfide containing peptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Storrs, R.W.

1992-08-01

Catalytic immunoglobin fragments were studied by nuclear magnetic resonance (NMR) spectroscopy to identify amino acid residues responsible for the catalytic activity. Small, hybrid sequence peptides were analyzed for helix propagation following covalent initiation and for activity related to the protein from which the helical sequence was derived. Hydrolysis of p-nitrophenyl carbonates and esters by specific immunoglobins is thought to involve charge complementarity. The pK of the transition state analog p-nitrophenyl phosphate bound to the immunoglobin fragment was determined by 31P-NMR to verify the juxtaposition of a positively charged amino acid to the binding/catalytic site. Optical studies of immunoglobin-mediated photoreversal of cis, syn cyclobutane thymine dimers implicated tryptophan as the photosensitizing chromophore. Research shows the chemical environment of a single tryptophan residue is altered upon binding of the thymine dimer. This tryptophan residue was localized to within 20 Å of the binding site through the use of a nitroxide paramagnetic species covalently attached to the thymine dimer. A hybrid sequence peptide was synthesized based on the bee venom peptide apamin in which the helical residues of apamin were replaced with those from the recognition helix of the bacteriophage 434 repressor protein. Oxidation of the disulfide bonds occurred uniformly in the proper 1-11, 3-15 orientation, stabilizing the 434 sequence in an α-helix. The glycine residue stopped helix propagation. Helix propagation in 2,2,2-trifluoroethanol mixtures was investigated in a second hybrid sequence peptide using the apamin-derived disulfide scaffold and the S-peptide sequence. The helix-stop signal previously observed was not observed in the NMR NOESY spectrum. Helical connectivities were seen throughout the S-peptide sequence. The apamin/S-peptide hybrid bound to the S-protein (residues 21-166 of ribonuclease A) and reconstituted enzymatic activity.
Synthesis and DNA interaction of a mixed proflavine-phenanthroline Tröger base.

PubMed

Baldeyrou, Brigitte; Tardy, Christelle; Bailly, Christian; Colson, Pierre; Houssier, Claude; Charmantray, Franck; Demeunynck, Martine

2002-04-01

We report the synthesis of an asymmetric Tröger base containing the two well-characterised DNA-binding chromophores, proflavine and phenanthroline. The mode of interaction of the hybrid molecule was investigated by circular and linear dichroism experiments and a biochemical assay using DNA topoisomerase I. The data are compatible with a model in which the proflavine moiety intercalates between DNA base pairs and the phenanthroline ring occupies the DNA groove. DNase I cleavage experiments were carried out to investigate the sequence preference of the hybrid ligand and a well-resolved footprint was detected at a site encompassing two adjacent 5'-GTC·5'-GAC triplets. The sequence preference of the asymmetric molecule is compared to that of the symmetric analogues.
MTBDRplus and MTBDRsl Assays: Absence of Wild-Type Probe Hybridization and Implications for Detection of Drug-Resistant Tuberculosis

PubMed Central

Georghiou, Sophia B.; Catanzaro, Donald; Rodrigues, Camilla; Crudu, Valeriu; Victor, Thomas C.; Garfein, Richard S.; Catanzaro, Antonino; Rodwell, Timothy C.

2016-01-01

Accurate identification of drug-resistant Mycobacterium tuberculosis is imperative for effective treatment and subsequent reduction in disease transmission. Line probe assays rapidly detect mutations associated with resistance and wild-type sequences associated with susceptibility. Examination of molecular-level performance is necessary for improved assay result interpretation and for continued diagnostic development. Using data collected from a large, multisite diagnostic study, probe hybridization results from line probe assays, MTBDRplus and MTBDRsl, were compared to those of sequencing, and the diagnostic performance of each individual mutation and wild-type probe was assessed. Line probe assay results classified as resistant due to the absence of wild-type probe hybridization were compared to those of sequencing to determine if novel mutations were inhibiting wild-type probe hybridization. The contribution of absent wild-type probe hybridization to the detection of drug resistance was assessed via comparison to a phenotypic reference standard. In our study, mutation probes demonstrated significantly higher specificities than wild-type probes and wild-type probes demonstrated marginally higher sensitivities than mutation probes, an ideal combination for detecting the presence of resistance conferring mutations while yielding the fewest number of false-positive results. The absence of wild-type probe hybridization without mutation probe hybridization was determined to be primarily the result of failure of mutation probe hybridization and not the result of novel or rare mutations. Compared to phenotypic culture-based drug susceptibility testing, the absence of wild-type probe hybridization without mutation probe hybridization significantly contributed to the detection of phenotypic rifampin and fluoroquinolone resistance with negligible increases in false-positive results. PMID:26763971

Molecular evidence of hybridization in sympatric populations of the Enantia jethys complex (Lepidoptera: Pieridae).

PubMed

Jasso-Martínez, Jovana M; Machkour-M'Rabet, Salima; Vila, Roger; Rodríguez-Arnaiz, Rosario; Castañeda-Sortibrán, América Nitxin

2018-01-01

Hybridization events are frequently demonstrated in natural butterfly populations. One interesting butterfly species complex is the Enantia jethys complex, which has been studied for over a century; many debates exist regarding the species composition of this complex. Currently, three species that live sympatrically in the Gulf slope of Mexico (Enantia jethys, E. mazai, and E. albania) are recognized in this complex (based on morphological and molecular studies). Where these species live in sympatry, some cases of interspecific mating have been observed, suggesting hybridization events. Considering this, we employed a multilocus approach (analyses of mitochondrial and nuclear sequences: COI, RpS5, and Wg; and nuclear dominant markers: inter-simple sequence repeats (ISSRs)) to study hybridization in sympatric populations from Veracruz, Mexico. Genetic diversity parameters were determined for all molecular markers, and species identification was assessed by different methods such as analyses of molecular variance (AMOVA), clustering, principal coordinate analysis (PCoA), gene flow, and PhiPT parameters. ISSR molecular markers were used for a more profound study of the hybridization process. Although species of the Enantia jethys complex have a low dispersal capacity, we observed high genetic diversity, probably reflecting a high density of individuals locally. ISSR markers provided evidence of a contemporary hybridization process, detecting a high number of hybrids (from 17% to 53%) with significant differences in genetic diversity. Furthermore, a directional pattern of hybridization was observed from E. albania to other species. Phylogenetic study through DNA sequencing confirmed the existence of three clades corresponding to the three species previously recognized by morphological and molecular studies. This study underlines the importance of assessing hybridization in evolutionary studies, by tracing the lineage separation process that leads to the origin of new species. Our research demonstrates that hybridization processes have a high occurrence in natural populations.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Diversity in the 18S SSU rRNA V4 hyper-variable region of Theileria spp. in Cape buffalo (Syncerus caffer) and cattle from southern Africa.

PubMed

Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T

2011-05-01

Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.
DNA interactions with a Methylene Blue redox indicator depend on the DNA length and are sequence specific.

PubMed

Farjami, Elaheh; Clima, Lilia; Gothelf, Kurt V; Ferapontova, Elena E

2010-06-01

A DNA molecular beacon approach was used for the analysis of interactions between DNA and Methylene Blue (MB) as a redox indicator of a hybridization event. DNA hairpin structures of different length and guanine (G) content were immobilized onto gold electrodes in their folded states through the alkanethiol linker at the 5'-end. Binding of MB to the folded hairpin DNA was electrochemically studied and compared with binding to the duplex structure formed by hybridization of the hairpin DNA to a complementary DNA strand. Variation of the electrochemical signal from the DNA-MB complex was shown to depend primarily on the DNA length and sequence used: the G-C base pairs were the preferential sites of MB binding in the duplex. For short DNA sequences (20 nt long), the increased electrochemical response from MB bound to the duplex structure was consistent with the increased amount of bound and electrochemically readable MB molecules (i.e. MB molecules that are available for the electron transfer (ET) reaction with the electrode). With longer DNA sequences, the balance between the amounts of the electrochemically readable MB molecules bound to the hairpin DNA and to the hybrid was opposite: a part of the MB molecules bound to the long-sequence DNA duplex seems to be electrochemically mute owing to the long ET distance. The increasing electrochemical response from MB bound to the short-length DNA hybrid contrasts with the decreasing signal from MB bound to the long-length DNA hybrid and allows an "off"-"on" genosensor development.
Molecular beacon sequence design algorithm.

PubMed

Monroe, W Todd; Haselton, Frederick R

2003-01-01

A method based on Web-based tools is presented to design optimally functioning molecular beacons. Molecular beacons, fluorogenic hybridization probes, are a powerful tool for the rapid and specific detection of a particular nucleic acid sequence. However, their synthesis costs can be considerable. Since molecular beacon performance is based on its sequence, it is imperative to rationally design an optimal sequence before synthesis. The algorithm presented here uses simple Microsoft Excel formulas and macros to rank candidate sequences. This analysis is carried out using mfold structural predictions along with other free Web-based tools. For smaller laboratories where molecular beacons are not the focus of research, the public domain algorithm described here may be usefully employed to aid in molecular beacon design.
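As a rough illustration of sequence-level checks that could precede a structure prediction, the Python sketch below assembles a hairpin beacon from a stem arm and a probe loop and ranks candidate stems. It is not the published algorithm, which uses Excel formulas and mfold predictions. The function names, the example sequences, and the ranking rule (closeness of stem GC fraction to a user-chosen target) are all assumptions made for this sketch.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def build_beacon(stem, loop):
    """Assemble 5'-stem + loop + reverse-complement stem-3' so the arms can pair."""
    return stem.upper() + loop.upper() + reverse_complement(stem)


def rank_stems(stems, target_gc):
    """Order candidate stems by how close their GC fraction is to target_gc.

    The target value is a hypothetical design parameter, not a published rule.
    """
    return sorted(stems, key=lambda s: abs(gc_fraction(s) - target_gc))


if __name__ == "__main__":
    loop = "ATTTGCATCCGA"  # hypothetical probe sequence
    for stem in rank_stems(["GCGAG", "ATATA", "CCGGC"], target_gc=0.8):
        print(stem, build_beacon(stem, loop))
```

A real design workflow would still need a folding prediction for each assembled beacon, since GC content alone says nothing about competing secondary structures in the loop.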
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, M.S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
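The abstract does not say how bases are determined from probe intensities, so the Python sketch below shows only one simple possibility and should not be read as the patented method. It assumes four probes per position (one per candidate base), calls the base whose probe is brightest, and reports `N` when the brightest probe does not exceed the runner-up by a hypothetical ratio threshold `min_ratio`.

```python
def call_base(intensities, min_ratio=1.2):
    """Call one base from a dict of probe intensities keyed by 'A', 'C', 'G', 'T'.

    Returns 'N' when the signal is absent or the top two probes are too close.
    """
    ranked = sorted(intensities.items(), key=lambda kv: kv[1], reverse=True)
    best_base, best = ranked[0]
    second = ranked[1][1]
    if best <= 0:
        return "N"  # no usable signal at this position
    if second > 0 and best / second < min_ratio:
        return "N"  # ambiguous: runner-up is nearly as bright
    return best_base


def call_sequence(positions, min_ratio=1.2):
    """Call a base at every position and join the calls into a string."""
    return "".join(call_base(p, min_ratio) for p in positions)


if __name__ == "__main__":
    # Hypothetical intensity readings for two positions
    positions = [
        {"A": 900, "C": 100, "G": 120, "T": 80},
        {"A": 500, "C": 480, "G": 50, "T": 40},
    ]
    print(call_sequence(positions))
```

Processing several experiments together, as the abstract mentions, could for instance mean averaging the intensity dictionaries across replicates before calling; that step is left out here.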
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

2004-05-11

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2003-08-19

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

PubMed Central

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-01-01

Background: Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results: We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. Conclusion: EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects. PMID:18402700
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

PubMed

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-04-10

Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids is then detected, its presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
DNA hybridization kinetics: zippering, internal displacement and sequence dependence.

PubMed

Ouldridge, Thomas E; Sulc, Petr; Romano, Flavio; Doye, Jonathan P K; Louis, Ard A

2013-10-01

Although the thermodynamics of DNA hybridization is generally well established, the kinetics of this classic transition is less well understood. Providing such understanding has new urgency because DNA nanotechnology often depends critically on binding rates. Here, we explore DNA oligomer hybridization kinetics using a coarse-grained model. Strand association proceeds through a complex set of intermediate states, with successful binding events initiated by a few metastable base-pairing interactions, followed by zippering of the remaining bonds. But despite reasonably strong interstrand interactions, initial contacts frequently dissociate because typical configurations in which they form differ from typical states of similar enthalpy in the double-stranded equilibrium ensemble. Initial contacts must be stabilized by two or three base pairs before full zippering is likely, resulting in negative effective activation enthalpies. Non-Arrhenius behavior arises because the number of base pairs required for nucleation increases with temperature. In addition, we observe two alternative pathways, pseudoknot and inchworm internal displacement, through which misaligned duplexes can rearrange to form duplexes. These pathways accelerate hybridization. Our results explain why experimentally observed association rates of GC-rich oligomers are higher than rates of AT-rich equivalents, and more generally demonstrate how association rates can be modulated by sequence choice.
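For reference, the link between a negative effective activation enthalpy and the temperature dependence of the association rate can be written in the standard Arrhenius form. The symbols $k$, $A$, $\Delta H^{\ddagger}$, $R$ and $T$ are generic notation assumed here and do not come from the paper, and the prefactor $A$ is taken to be independent of temperature.

```latex
k(T) = A \exp\!\left(-\frac{\Delta H^{\ddagger}}{R T}\right),
\qquad
\frac{\mathrm{d}\ln k}{\mathrm{d}T} = \frac{\Delta H^{\ddagger}}{R T^{2}}
```

With $\Delta H^{\ddagger} < 0$ the derivative is negative, so the rate falls as the temperature rises. Non-Arrhenius behavior then corresponds to $\Delta H^{\ddagger}$ itself changing with $T$, which is consistent with the abstract's statement that the number of base pairs needed for nucleation grows with temperature.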
An Engineered Kinetic Amplification Mechanism for Single Nucleotide Variant Discrimination by DNA Hybridization Probes.

PubMed

Chen, Sherry Xi; Seelig, Georg

2016-04-20

Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit, the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. In contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
Reconstruction of the Evolutionary History of Saccharomyces cerevisiae x S. kudriavzevii Hybrids Based on Multilocus Sequence Analysis

PubMed Central

Peris, David; Lopes, Christian A.; Arias, Armando; Barrio, Eladio

2012-01-01

In recent years, interspecific hybridization and introgression are increasingly recognized as significant events in the evolution of Saccharomyces yeasts. These mechanisms have probably been involved in the origin of novel yeast genotypes and phenotypes, which in due course were to colonize and predominate in the new fermentative environments created by human manipulation. The particular conditions in which hybrids arose are still unknown, as well as the number of possible hybridization events that generated the whole set of natural hybrids described in the literature during recent years. In this study, we could infer at least six different hybridization events that originated a set of 26 S. cerevisiae x S. kudriavzevii hybrids isolated from both fermentative and non-fermentative environments. Different wine S. cerevisiae strains and European S. kudriavzevii strains were probably involved in the hybridization events according to gene sequence information, as well as from previous data on their genome composition and ploidy. Finally, we postulate that these hybrids may have originated after the introduction of vine growing and winemaking practices by the Romans to the present Northern vine-growing limits and spread during the expansion of improved viticulture and enology practices that occurred during the Late Middle Ages. PMID:23049811
Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates

PubMed Central

Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

2017-01-01

The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708
A novel chaotic based image encryption using a hybrid model of deoxyribonucleic acid and cellular automata

NASA Astrophysics Data System (ADS)

Enayatifar, Rasul; Sadaei, Hossein Javedani; Abdullah, Abdul Hanan; Lee, Malrey; Isnin, Ismail Fauzi

2015-08-01

Many recent studies have addressed the security of digital images in order to protect such data while they are transmitted over the internet. This work proposes a new approach based on a hybrid model of the Tinkerbell chaotic map, deoxyribonucleic acid (DNA) and cellular automata (CA). DNA rules, the DNA sequence XOR operator and CA rules are used simultaneously to encrypt the plain-image pixels. To determine the rule number for the DNA sequence and for the CA, a two-dimensional Tinkerbell chaotic map is employed. Experimental results and computer simulations both confirm that the proposed scheme not only demonstrates outstanding encryption, but also resists various typical attacks.
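The two ingredients named above, DNA coding with a nucleotide XOR and a Tinkerbell chaotic map, can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' scheme: the 2-bit coding rule, the way map values are turned into key bytes, and all function names are hypothetical, the map parameters are commonly quoted textbook values, and the cellular automata stage is omitted. It is not suitable for real encryption.

```python
# Hypothetical 2-bit coding rule (one of several possible DNA rules): 00->A, 01->C, 10->G, 11->T.
BITS_TO_BASE = "ACGT"
BASE_TO_BITS = {base: i for i, base in enumerate(BITS_TO_BASE)}


def dna_encode(data: bytes) -> str:
    """Encode each byte as four nucleotides, most significant bit pair first."""
    return "".join(BITS_TO_BASE[(byte >> shift) & 0b11]
                   for byte in data for shift in (6, 4, 2, 0))


def dna_decode(seq: str) -> bytes:
    """Inverse of dna_encode: four nucleotides back to one byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        byte = 0
        for base in seq[i:i + 4]:
            byte = (byte << 2) | BASE_TO_BITS[base]
        out.append(byte)
    return bytes(out)


def dna_xor(a: str, b: str) -> str:
    """XOR two equal-length nucleotide strings via their 2-bit codes."""
    return "".join(BITS_TO_BASE[BASE_TO_BITS[x] ^ BASE_TO_BITS[y]] for x, y in zip(a, b))


def tinkerbell_keystream(n, x=-0.72, y=-0.64, a=0.9, b=-0.6013, c=2.0, d=0.5):
    """Derive n key bytes from the Tinkerbell map (parameter values are assumptions)."""
    key = bytearray()
    for _ in range(n):
        # Both coordinates are updated from the previous (x, y) pair.
        x, y = x * x - y * y + a * x + b * y, 2 * x * y + c * x + d * y
        key.append(int(abs(x) * 1e6) % 256)  # hypothetical quantization of the orbit
    return bytes(key)


def encrypt(pixels: bytes) -> str:
    key = tinkerbell_keystream(len(pixels))
    return dna_xor(dna_encode(pixels), dna_encode(key))


def decrypt(cipher: str) -> bytes:
    key = tinkerbell_keystream(len(cipher) // 4)
    return dna_decode(dna_xor(cipher, dna_encode(key)))


if __name__ == "__main__":
    cipher = encrypt(b"pixel")
    print(cipher, decrypt(cipher))
```

Because XOR is its own inverse, decryption regenerates the same keystream from the same initial conditions and applies the same operation; the initial values `x` and `y` play the role of the secret key in this sketch.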
Sequence Based Prediction of Antioxidant Proteins Using a Classifier Selection Strategy

PubMed Central

Zhang, Lina; Zhang, Chengjin; Gao, Rui; Yang, Runtao; Song, Qing

2016-01-01

Antioxidant proteins perform significant functions in maintaining oxidation/antioxidation balance and have therapeutic potential for some diseases. Accurate identification of antioxidant proteins could contribute to revealing physiological processes of oxidation/antioxidation balance and developing novel antioxidation-based drugs. In this study, an ensemble method is presented to predict antioxidant proteins with hybrid features, incorporating SSI (Secondary Structure Information), PSSM (Position Specific Scoring Matrix), RSA (Relative Solvent Accessibility), and CTD (Composition, Transition, Distribution). The prediction results of the ensemble predictor are determined by an average of prediction results of multiple base classifiers. Based on a classifier selection strategy, we obtain an optimal ensemble classifier composed of RF (Random Forest), SMO (Sequential Minimal Optimization), NNA (Nearest Neighbor Algorithm), and J48 with an accuracy of 0.925. A Relief combined with IFS (Incremental Feature Selection) method is adopted to obtain optimal features from hybrid features. With the optimal features, the ensemble method achieves improved performance with a sensitivity of 0.95, a specificity of 0.93, an accuracy of 0.94, and an MCC (Matthews Correlation Coefficient) of 0.880, far better than the existing method. To evaluate the prediction performance objectively, the proposed method is compared with existing methods on the same independent testing dataset. Encouragingly, our method performs better than previous studies. In addition, our method achieves more balanced performance with a sensitivity of 0.878 and a specificity of 0.860. These results suggest that the proposed ensemble method can be a potential candidate for antioxidant protein prediction. For public access, we developed a user-friendly web server for antioxidant protein identification that is freely accessible at http://antioxidant.weka.cc. PMID:27662651
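The averaging step and the reported metrics can be illustrated with a minimal sketch. It assumes that each base classifier outputs a probability of the positive class for every sample and that the averaged probability is thresholded at 0.5; the function names and the threshold are assumptions for illustration, and the numbers in the example are made up rather than taken from the study.

```python
import math


def ensemble_average(prob_lists, threshold=0.5):
    """Average per-sample probabilities from several base classifiers, then threshold."""
    n_classifiers = len(prob_lists)
    return [int(sum(ps) / n_classifiers >= threshold) for ps in zip(*prob_lists)]


def binary_metrics(y_true, y_pred):
    """Sensitivity, specificity, accuracy and Matthews correlation coefficient."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))

    def ratio(num, den):
        return num / den if den else 0.0  # avoid division by zero on degenerate inputs

    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "accuracy": ratio(tp + tn, len(y_true)),
        "mcc": ratio(tp * tn - fp * fn, denom),
    }


if __name__ == "__main__":
    y_true = [1, 1, 1, 0, 0, 0]
    clf_a = [0.9, 0.8, 0.4, 0.2, 0.6, 0.1]  # made-up probabilities
    clf_b = [0.7, 0.6, 0.2, 0.4, 0.6, 0.3]
    y_pred = ensemble_average([clf_a, clf_b])
    print(y_pred, binary_metrics(y_true, y_pred))
```

Averaging probabilities is only one way to combine base classifiers; majority voting over hard labels is a common alternative and would need only a change to `ensemble_average`.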
Analysis of BAC-end sequences (BESs) and development of BES-SSR markers for genetic mapping and hybrid purity assessment in pigeonpea (Cajanus spp.)

PubMed Central

2011-01-01

Background: Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite concerted research efforts directed to pigeonpea improvement, the stagnation of pigeonpea productivity during the last several decades may be attributed to the prevalence of various biotic and abiotic constraints, and the situation is exacerbated by the inadequate genomic resources available for undertaking any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports, for the first time, the large-scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. Results: A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries by using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transposons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising 239 loci was developed for this previously uncharacterized genome. Utility of the developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in a commercial hybrid breeding programme. Conclusion: In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea. PMID:21447154
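Scanning sequences for simple sequence repeats, as described in the Results, can be sketched with a regular expression. The motif length range, the minimum repeat count, and the function name below are assumptions chosen for illustration; the abstract does not state which tool or thresholds the study used.

```python
import re


def find_ssrs(seq, min_motif=2, max_motif=6, min_repeats=4):
    """Return (start, motif, repeat_count) for perfect tandem repeats in a DNA string."""
    # The lazy quantifier prefers the shortest motif; the backreference requires
    # at least (min_repeats - 1) further copies immediately after it.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_motif, max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


if __name__ == "__main__":
    print(find_ssrs("TTGAGAGAGAGACC"))
```

This sketch finds only perfect repeats and reports 0-based start positions; imperfect or compound microsatellites would need a more permissive search.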

Measuring DNA hybridization using fluorescent DNA-stabilized silver clusters to investigate mismatch effects on therapeutic oligonucleotides.

PubMed

de Bruin, Donny; Bossert, Nelli; Aartsma-Rus, Annemieke; Bouwmeester, Dirk

2018-04-06

Short nucleic acid oligomers have found a wide range of applications in experimental physics, biology and medicine, and show potential for the treatment of acquired and genetic diseases. These applications rely heavily on the predictability of hybridization through Watson-Crick base pairing to allow positioning on a nanometer scale, as well as binding to the target transcripts, but also off-target binding to transcripts with partial homology. These effects are of particular importance in the development of therapeutic oligonucleotides, where off-target effects caused by the binding of mismatched sequences need to be avoided. We employ a novel method of probing DNA hybridization using optically active DNA-stabilized silver clusters (Ag-DNA) to measure binding efficiencies through a change in fluorescence intensity. In this way we can determine their location-specific sensitivity to individual mismatches in the sequence. The results reveal a strong dependence of the hybridization on the location of the mismatch, whereby mismatches close to the edges and center show a relatively minor impact. In parallel, we propose a simple model for calculating the annealing ratios of mismatched DNA sequences, which supports our experimental results. The primary result shown in this work is a demonstration of a novel technique to measure DNA hybridization using fluorescent Ag-DNA. With this technique, we investigated the effect of mismatches on the hybridization efficiency, and found a significant dependence on the location of individual mismatches. These effects are strongly influenced by the length of the used oligonucleotides. The novel probe method based on fluorescent Ag-DNA functions as a reliable tool in measuring this behavior. As a secondary result, we formulated a simple model that is consistent with the experimental data.
HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization

PubMed Central

Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael

2017-01-01

Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution. PMID:28045981
HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization.

PubMed

Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael; Ambur, Ole Herman

2017-01-01

Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution.
Predicting DNA binding proteins using support vector machine with hybrid fractal features.

PubMed

Niu, Xiao-Hui; Hu, Xue-Hai; Shi, Feng; Xia, Jing-Bo

2014-02-21

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not yet fully resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, the CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and the jackknife test. Comparing the results of the numerical experiments, the group combining amino acid composition and fractal dimension (a 21-dimensional vector) gives the best result, with an average accuracy of 81.82% and an average Matthews correlation coefficient (MCC) of 0.6017. The resulting predictor is also compared with the existing method DNA-Prot and shows better performance.
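The 21-dimensional feature group can be pictured as the 20 amino acid frequencies plus one fractal dimension value. The sketch below shows only the composition part and simply appends a fractal dimension supplied by the caller; how that value is computed from the chaos game representation is not specified here, and the function names and residue ordering are assumptions for illustration.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, alphabetical one-letter codes


def amino_acid_composition(seq):
    """Fraction of each standard amino acid in the sequence (20 values)."""
    seq = seq.upper()
    counts = [seq.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)  # non-standard symbols are ignored
    return [c / total if total else 0.0 for c in counts]


def feature_vector(seq, fractal_dimension):
    """20 composition values followed by a caller-supplied fractal dimension."""
    return amino_acid_composition(seq) + [float(fractal_dimension)]


if __name__ == "__main__":
    vec = feature_vector("MKTAYIAKQR", fractal_dimension=1.5)  # 1.5 is a placeholder value
    print(len(vec), vec[:3])
```

A vector of this kind could then be passed to any support vector machine implementation for training and cross-validation.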
Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

PubMed Central

Shoyab, M.; Baluda, M. A.; Evans, R.

1974-01-01

DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
Different strategies for the detection of bioagents using electrochemical and photoelectrochemical genosensors

NASA Astrophysics Data System (ADS)

Voccia, Diego; Bettazi, Francesca; Palchetti, Ilaria

2015-10-01

In recent years, various kinds of biosensors for the detection of pathogens have been developed. A genosensor consists of an oligonucleotide with a specific base sequence, called the capture probe, immobilized on the surface of a chosen transducer. The complementary sequence (the analytical target, i.e. a specific sequence of the DNA/RNA of the pathogen) present in the sample is recognized and captured by the probe through the hybridization reaction. The evaluation of the extent of the hybridization allows one to confirm whether or not the sample contains the complementary sequence of the probe. Electrochemical transducers have received considerable attention in connection with the detection of DNA hybridization. Moreover, with the recent emergence of novel photoelectrochemically active species and new detection schemes, photoelectrochemistry has made substantial progress in its analytical performance for biosensing applications. In this paper, some examples of electrochemical genosensors for multiplexed pathogen detection are shown. Moreover, the preliminary experiments towards the development of a photoelectrochemical genosensor using a TiO2 nanocrystal-modified ITO electrode are discussed.
Toward a solid-phase nucleic acid hybridization assay within microfluidic channels using immobilized quantum dots as donors in fluorescence resonance energy transfer.

PubMed

Chen, Lu; Algar, W Russ; Tavares, Anthony J; Krull, Ulrich J

2011-01-01

The optical properties and surface area of quantum dots (QDs) have made them an attractive platform for the development of nucleic acid biosensors based on fluorescence resonance energy transfer (FRET). Solid-phase assays based on FRET using mixtures of immobilized QD-oligonucleotide conjugates (QD biosensors) have been developed. The typical challenges associated with solid-phase detection strategies include non-specific adsorption, slow kinetics of hybridization, and sample manipulation. The new work herein has considered the immobilization of QD biosensors onto the surfaces of microfluidic channels in order to address these challenges. Microfluidic flow can be used to dynamically control stringency by adjustment of the potential in an electrokinetic-based microfluidics environment. The shearing force, Joule heating, and the competition between electroosmotic and electrophoretic mobilities allow the optimization of hybridization conditions, convective delivery of target to the channel surface to speed hybridization, amelioration of adsorption, and regeneration of the sensing surface. Microfluidic flow can also be used to deliver (for immobilization) and remove QD biosensors. QDs that were conjugated with two different oligonucleotide sequences were used to demonstrate feasibility. One oligonucleotide sequence on the QD was available as a linker for immobilization via hybridization with complementary oligonucleotides located on a glass surface within a microfluidic channel. A second oligonucleotide sequence on the QD served as a probe to transduce hybridization with target nucleic acid in a sample solution. A Cy3 label on the target was excited by FRET using green-emitting CdSe/ZnS QD donors and provided an analytical signal to explore this detection strategy. The immobilized QDs could be removed under denaturing conditions by disrupting the duplex that was used as the surface linker and thus allowed a new layer of QD biosensors to be re-coated within the channel for re-use of the microfluidic chip.
Elman RNN based classification of proteins sequences on account of their mutual information.

PubMed

Mishra, Pooja; Nath Pandey, Paras

2012-10-21

In the present work we estimate residue correlation within protein sequences by using the mutual information (MI) of adjacent residues, based on structural and solvent accessibility properties of amino acids. The treatment of long-range correlation between nonadjacent residues is improved by constructing a mutual information vector (MIV) for a single protein sequence; in this way each protein sequence is associated with its corresponding MIVs. These MIVs are given to an Elman RNN to obtain the classification of protein sequences. The modeling power of MIV was shown to be significantly better, giving a new approach towards alignment-free classification of protein sequences. We also conclude that MIVs based on sequence structural and solvent accessibility properties are better predictors.
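The mutual information of adjacent residues can be estimated from pair counts, as in the minimal sketch below. It assumes a plug-in estimate from observed frequencies and an optional mapping of residues to property classes; the function name and the toy grouping in the example are hypothetical and are not the structural or solvent accessibility classes used in the paper.

```python
import math
from collections import Counter


def adjacent_mutual_information(seq, group_of=None):
    """Plug-in estimate (in bits) of the mutual information between neighbouring symbols."""
    # Optionally map each residue to a property class before counting.
    symbols = [group_of[r] for r in seq] if group_of else list(seq)
    pairs = list(zip(symbols, symbols[1:]))
    if not pairs:
        return 0.0
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)   # marginal of the first symbol in each pair
    right = Counter(b for _, b in pairs)  # marginal of the second symbol in each pair
    # sum over pairs of p(a,b) * log2( p(a,b) / (p(a) p(b)) ), written with raw counts
    return sum((c / n) * math.log2(c * n / (left[a] * right[b]))
               for (a, b), c in joint.items())


if __name__ == "__main__":
    toy_groups = {"A": "h", "V": "h", "L": "h", "K": "p", "D": "p", "S": "p"}  # hypothetical classes
    print(adjacent_mutual_information("AVLKDSAVLKDS", toy_groups))
```

Computing this value for several property groupings and collecting the results into one list would give a vector per sequence, in the spirit of the MIV described above.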
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1999-10-26

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2001-06-05

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Development of Thinopyrum ponticum-specific molecular markers and FISH probes based on SLAF-seq technology.

PubMed

Liu, Liqin; Luo, Qiaoling; Teng, Wan; Li, Bin; Li, Hongwei; Li, Yiwen; Li, Zhensheng; Zheng, Qi

2018-05-01

Based on SLAF-seq, 67 Thinopyrum ponticum-specific markers and eight Th. ponticum-specific FISH probes were developed, and these markers and probes could be used for detection of alien chromatin in a wheat background. Decaploid Thinopyrum ponticum (2n = 10x = 70) is a valuable gene reservoir for wheat improvement. Identification of Th. ponticum introgression would facilitate its transfer into diverse wheat genetic backgrounds and its practical utilization in wheat improvement. Based on specific-locus-amplified fragment sequencing (SLAF-seq) technology, 67 new Th. ponticum-specific molecular markers and eight Th. ponticum-specific fluorescence in situ hybridization (FISH) probes have been developed from a tiny wheat-Th. ponticum translocation line. These newly developed molecular markers allowed the detection of Th. ponticum DNA in a variety of materials specifically and steadily at high throughput. According to the hybridization signal pattern, the eight Th. ponticum-specific probes could be divided into two groups. The first group, comprising five dispersed repetitive sequence probes, could identify Th. ponticum chromatin more sensitively and accurately than genomic in situ hybridization (GISH), whereas the second group, comprising three tandem repetitive sequence probes, enabled the discrimination of Th. ponticum chromosomes together with another clone, pAs1, in the wheat-Th. ponticum partial amphiploid Xiaoyan 68.
A simple nucleic acid hybridization/latex agglutination assay for the rapid detection of polymerase chain reaction amplicons.

PubMed

Vollenhofer-Schrumpf, Sabine; Buresch, Ronald; Schinkinger, Manfred

2007-03-01

We have developed a new method for the detection of nucleic acid hybridization, based on a simple latex agglutination test that can be evaluated by the unaided eye. Nucleic acid, e.g., a polymerase chain reaction (PCR) product, is denatured and incubated with polystyrene beads carrying covalently bound complementary oligonucleotide sequences. Hybridization of the nucleic acids leads to aggregation of the latex particles, thereby verifying the presence of target sequence. The test is performed at room temperature, and results are available within 10 min. As a proof of principle, the hybridization/latex agglutination assay was applied to the detection of purified PCR fragments either specific for Salmonella spp. or a synthetic sequence, and to the detection of Salmonella enterica in artificially contaminated chicken samples. A few nanograms of purified PCR fragments were detectable. In artificially contaminated chicken samples, 3 colony-forming units (cfu)/25 g were detected in one of three replicates, and 30 cfu/25 g were detected in both of two replicates when samples for PCR were taken directly from primary enrichment, demonstrating the practical applicability of this test system. Even multiplex detection might be achievable. This novel kind of assay could be useful for a range of applications where hybridization of nucleic acids, e.g., PCR fragments, is to be detected.
Import of desired nucleic acid sequences using addressing motif of mitochondrial ribosomal 5S-rRNA for fluorescent in vivo hybridization of mitochondrial DNA and RNA.

PubMed

Zelenka, Jaroslav; Alán, Lukáš; Jabůrek, Martin; Ježek, Petr

2014-04-01

Based on the matrix-addressing sequence of mitochondrial ribosomal 5S-rRNA (termed MAM), which is naturally imported into mitochondria, we have constructed an import system for in vivo targeting of mitochondrial DNA (mtDNA) or mt-mRNA, in order to provide fluorescence hybridization of the desired sequences. Thus DNA oligonucleotides were constructed, containing the 5'-flanked T7 RNA polymerase promoter. After in vitro transcription and fluorescent labeling with Alexa Fluor® 488 or 647 dye, we obtained the fluorescent "L-ND5 probe" containing MAM and exemplar cargo, i.e., annealing sequence to a short portion of ND5 mRNA and to the light-strand mtDNA complementary to the heavy strand nd5 mt gene (5'-end 21 base pair sequence). For mitochondrial in vivo fluorescent hybridization, HepG2 cells were treated with dequalinium micelles, containing the fluorescent probes, bringing the probes proximally to the mitochondrial outer membrane and to the natural import system. A verification of import into the mitochondrial matrix of cultured HepG2 cells was provided by confocal microscopy colocalizations. Transfections using lipofectamine or probes without 5S-rRNA addressing MAM sequence or with MAM only were ineffective. Alternatively, the same DNA oligonucleotides with 5'-CACC overhang (substituting T7 promoter) were transcribed from the tetracycline-inducible pENTRH1/TO vector in human embryonic kidney T-REx®-293 cells, while mitochondrial matrix localization after import of the resulting unlabeled RNA was detected by PCR. The MAM-containing probe was then enriched by three orders of magnitude over the natural ND5 mRNA in the mitochondrial matrix. In conclusion, we present a proof-of-principle for mitochondrial in vivo hybridization and mitochondrial nucleic acid import.
The minimal amount of starting DNA for Agilent’s hybrid capture-based targeted massively parallel sequencing

PubMed Central

Chung, Jongsuk; Son, Dae-Soon; Jeon, Hyo-Jeong; Kim, Kyoung-Mee; Park, Gahee; Ryu, Gyu Ha; Park, Woong-Yang; Park, Donghyun

2016-01-01

Targeted capture massively parallel sequencing is increasingly being used in clinical settings, and as costs continue to decline, use of this technology may become routine in health care. However, a limited amount of tissue has often been a challenge in meeting quality requirements. To offer a practical guideline for the minimum amount of input DNA for targeted sequencing, we optimized and evaluated the performance of targeted sequencing depending on the input DNA amount. First, using various amounts of input DNA, we compared commercially available library construction kits and selected Agilent’s SureSelect-XT and KAPA Biosystems’ Hyper Prep kits as the kits most compatible with targeted deep sequencing using Agilent’s SureSelect custom capture. Then, we optimized the adapter ligation conditions of the Hyper Prep kit to improve library construction efficiency and adapted multiplexed hybrid selection to reduce the cost of sequencing. In this study, we systematically evaluated the performance of the optimized protocol depending on the amount of input DNA, ranging from 6.25 to 200 ng, suggesting the minimal input DNA amounts based on coverage depths required for specific applications. PMID:27220682
Integration of hybridization-based markers (overgos) into physical maps for comparative and evolutionary explorations in the genus Oryza and in Sorghum

PubMed Central

Hass-Jacobus, Barbara L; Futrell-Griggs, Montona; Abernathy, Brian; Westerman, Rick; Goicoechea, Jose-Luis; Stein, Joshua; Klein, Patricia; Hurwitz, Bonnie; Zhou, Bin; Rakhshan, Fariborz; Sanyal, Abhijit; Gill, Navdeep; Lin, Jer-Young; Walling, Jason G; Luo, Mei Zhong; Ammiraju, Jetty Siva S; Kudrna, Dave; Kim, Hye Ran; Ware, Doreen; Wing, Rod A; Miguel, Phillip San; Jackson, Scott A

2006-01-01

Background With the completion of the genome sequence for rice (Oryza sativa L.), the focus of rice genomics research has shifted to the comparison of the rice genome with genomes of other species for gene cloning, breeding, and evolutionary studies. The genus Oryza includes 23 species that shared a common ancestor 8–10 million years ago making this an ideal model for investigations into the processes underlying domestication, as many of the Oryza species are still undergoing domestication. This study integrates high-throughput, hybridization-based markers with BAC end sequence and fingerprint data to construct physical maps of rice chromosome 1 orthologues in two wild Oryza species. Similar studies were undertaken in Sorghum bicolor, a species which diverged from cultivated rice 40–50 million years ago. Results Overgo markers, in conjunction with fingerprint and BAC end sequence data, were used to build sequence-ready BAC contigs for two wild Oryza species. The markers drove contig merges to construct physical maps syntenic to rice chromosome 1 in the wild species and provided evidence for at least one rearrangement on chromosome 1 of the O. sativa versus Oryza officinalis comparative map. When rice overgos were aligned to available S. bicolor sequence, 29% of the overgos aligned with three or fewer mismatches; of these, 41% gave positive hybridization signals. Overgo hybridization patterns supported colinearity of loci in regions of sorghum chromosome 3 and rice chromosome 1 and suggested that a possible genomic inversion occurred in this syntenic region in one of the two genomes after the divergence of S. bicolor and O. sativa. Conclusion The results of this study emphasize the importance of identifying conserved sequences in the reference sequence when designing overgo probes in order for those probes to hybridize successfully in distantly related species. 
As interspecific markers, overgos can be used successfully to construct physical maps in species which diverged less than 8 million years ago, and can be used in a more limited fashion to examine colinearity among species which diverged as much as 40 million years ago. Additionally, overgos are able to provide evidence of genomic rearrangements in comparative physical mapping studies. PMID:16895597
Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine

NASA Astrophysics Data System (ADS)

Xing, Pengwei; Su, Ran; Guo, Fei; Wei, Leyi

2017-04-01

N6-methyladenosine (m6A) refers to methylation of the adenosine nucleotide at the nitrogen-6 position. It plays an important role in a series of biological processes, such as splicing events, mRNA export, nascent mRNA synthesis, nuclear translocation and translation. Numerous experiments have successfully characterized m6A sites within sequences since high-resolution mapping of m6A sites was established. However, with the explosive growth of genomic sequences, using experimental methods to identify m6A sites is time-consuming and expensive. Thus, it is highly desirable to develop fast and accurate computational identification methods. In this study, we propose a sequence-based predictor called RAM-NPPS for identifying m6A sites within RNA sequences, in which we present a novel feature representation algorithm based on multi-interval nucleotide pair position specificity, and use a support vector machine classifier to construct the prediction model. Comparison results show that our proposed method outperforms the state-of-the-art predictors on three benchmark datasets across the three species, indicating the effectiveness and robustness of our method. Moreover, an online webserver implementing the proposed predictor has been established at http://server.malab.cn/RAM-NPPS/. It is anticipated to be a useful prediction tool to help biologists reveal the mechanisms of m6A site functions.
Multiple sequence alignment using multi-objective based bacterial foraging optimization algorithm.

PubMed

Rani, R Ranjani; Ramyachitra, D

2016-12-01

Multiple sequence alignment (MSA) is a widespread approach in computational biology and bioinformatics. MSA deals with how sequences of nucleotides and amino acids are arranged into a possible alignment with a minimum number of gaps between them, which points to the functional, evolutionary and structural relationships among the sequences. Computing an MSA that is accurate and yields statistically significant alignments remains a challenging task. In this work, the Bacterial Foraging Optimization Algorithm was employed to align the biological sequences, which resulted in a non-dominated optimal solution. It employs multiple objectives: maximization of similarity, non-gap percentage and conserved blocks, and minimization of gap penalty. The BAliBASE 3.0 benchmark database was utilized to examine the proposed algorithm against other methods. In this paper, two algorithms have been proposed: Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC) and Bacterial Foraging Optimization Algorithm. It was found that Hybrid Genetic Algorithm with Artificial Bee Colony performed better than the existing optimization algorithms, but the conserved blocks were still not obtained using GA-ABC. BFO was then used for the alignment and the conserved blocks were obtained. The proposed Multi-Objective Bacterial Foraging Optimization Algorithm (MO-BFO) was compared with widely used MSA methods Clustal Omega, Kalign, MUSCLE, MAFFT, Genetic Algorithm (GA), Ant Colony Optimization (ACO), Artificial Bee Colony (ABC), Particle Swarm Optimization (PSO) and Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC). The final results show that the proposed MO-BFO algorithm yields better alignment than most widely used methods. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
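The objectives named in this abstract can be made concrete with a small scoring sketch. The code below evaluates a finished alignment on three simple measures: non-gap percentage, the number of fully conserved columns, and a sum-of-pairs score with a gap penalty. The function names, the scoring values (match 1, mismatch 0, gap -1), and the use of single conserved columns as a stand-in for "conserved blocks" are assumptions chosen for illustration; the paper's exact objective definitions are not given in the abstract.

```python
from itertools import combinations


def non_gap_percentage(alignment):
    """Percentage of alignment cells that hold a residue rather than a gap."""
    total = sum(len(row) for row in alignment)
    gaps = sum(row.count("-") for row in alignment)
    return 100.0 * (total - gaps) / total


def conserved_columns(alignment):
    """Count columns where every sequence shows the same residue and no gap."""
    return sum(
        1 for col in zip(*alignment) if "-" not in col and len(set(col)) == 1
    )


def sum_of_pairs(alignment, match=1, mismatch=0, gap=-1):
    """Sum-of-pairs similarity over all sequence pairs (assumed scoring values)."""
    score = 0
    for a, b in combinations(alignment, 2):
        for x, y in zip(a, b):
            if x == "-" and y == "-":
                continue  # gap aligned with gap carries no information
            if x == "-" or y == "-":
                score += gap
            elif x == y:
                score += match
            else:
                score += mismatch
    return score


example = ["AC-GT", "ACTGT", "A--GT"]
print(non_gap_percentage(example), conserved_columns(example), sum_of_pairs(example))
```

A multi-objective optimizer would treat such measures as separate objectives and search for alignments that no other candidate beats on all of them at once.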
Lanthanum-Based Metal-Organic Frameworks for Specific Detection of Sudan Virus RNA Conservative Sequences down to Single-Base Mismatch.

PubMed

Yang, Shui-Ping; Zhao, Wei; Hu, Pei-Pei; Wu, Ke-Yang; Jiang, Zhi-Hong; Bai, Li-Ping; Li, Min-Min; Chen, Jin-Xiang

2017-12-18

Reactions of La(NO3)3·6H2O with the polar, tritopic quaternized carboxylate ligands N-carboxymethyl-3,5-dicarboxylpyridinium bromide (H3CmdcpBr) and N-(4-carboxybenzyl)-3,5-dicarboxylpyridinium bromide (H3CbdcpBr) afford two water-stable metal-organic frameworks (MOFs) of {[La4(Cmdcp)6(H2O)9]}n (1, 3D) and {[La2(Cbdcp)3(H2O)10]}n (2, 2D). MOFs 1 and 2 absorb the carboxyfluorescein (FAM)-tagged probe DNA (P-DNA) and quench the fluorescence of FAM via a photoinduced electron transfer (PET) process. The nonemissive P-DNA@MOF hybrids thus formed in turn function as sensing platforms to distinguish conservative linear, single-stranded RNA sequences of Sudan virus with high selectivity and low detection limits of 112 and 67 pM, respectively (at a signal-to-noise ratio of 3). These hybrids also exhibit high specificity and discriminate down to single-base mismatch RNA sequences.
Advanced surface-enhanced Raman gene probe systems and methods thereof

DOEpatents

Vo-Dinh, Tuan

2001-01-01

The subject invention is a series of methods and systems for using the Surface-Enhanced Raman (SER)-labeled Gene Probe for hybridization, detection and identification of SER-labeled hybridized target oligonucleotide material comprising the steps of immobilizing SER-labeled hybridized target oligonucleotide material on a support means, wherein the SER-labeled hybridized target oligonucleotide material comprise a SER label attached either to a target oligonucleotide of unknown sequence or to a gene probe of known sequence complementary to the target oligonucleotide sequence, the SER label is unique for the target oligonucleotide strands of a particular sequence wherein the SER-labeled oligonucleotide is hybridized to its complementary oligonucleotide strand, then the support means having the SER-labeled hybridized target oligonucleotide material adsorbed thereon is SERS activated with a SERS activating means, then the support means is analyzed.
First Guatemalan record of natural hybridisation between Neotropical species of the Lady’s Slipper orchid (Orchidaceae, Cypripedioideae)

PubMed Central

Szlachetko, Dariusz L.; Kolanowska, Marta; Muller, Fred; Vannini, Jay; Rojek, Joanna

2017-01-01

The first natural hybrid in the section Irapeana of the orchid genus Cypripedium is described and illustrated based on Guatemalan material. A molecular evaluation of the discovery is provided. Specimens with intermediate flowers between C. irapeanum and C. dickinsonianum within ITS and Xdh sequences have the signal sequence of both these species. The analysis of plastid sequences indicated that the maternal line is C. irapeanum. Information about the ecology, embryology and conservation status of the novelty is given, together with a distribution map of its parental species, C. irapeanum and C. dickinsonianum. A discussion of the hybridization between Cypripedium species is presented. The potential hybrid zones between the representatives of Cypripedium section Irapeana which were estimated based on the results of ecological niche modeling analysis are located in the Maya Highlands (C. dickinsonianum and C. irapeanum) and the eastern part of Southern Sierra Madre (C. molle and C. irapeanum). Moreover, all three Cypripedium species could inhabit Cordillera Neovolcánica according to the obtained models; however, it should be noticed that this region is well-distanced from the edges of the known geographical range of C. molle. PMID:29302391

First Guatemalan record of natural hybridisation between Neotropical species of the Lady's Slipper orchid (Orchidaceae, Cypripedioideae).

PubMed

Szlachetko, Dariusz L; Kolanowska, Marta; Muller, Fred; Vannini, Jay; Rojek, Joanna; Górniak, Marcin

2017-01-01

The first natural hybrid in the section Irapeana of the orchid genus Cypripedium is described and illustrated based on Guatemalan material. A molecular evaluation of the discovery is provided. Specimens with intermediate flowers between C. irapeanum and C. dickinsonianum within ITS and Xdh sequences have the signal sequence of both these species. The analysis of plastid sequences indicated that the maternal line is C. irapeanum. Information about the ecology, embryology and conservation status of the novelty is given, together with a distribution map of its parental species, C. irapeanum and C. dickinsonianum. A discussion of the hybridization between Cypripedium species is presented. The potential hybrid zones between the representatives of Cypripedium section Irapeana which were estimated based on the results of ecological niche modeling analysis are located in the Maya Highlands (C. dickinsonianum and C. irapeanum) and the eastern part of Southern Sierra Madre (C. molle and C. irapeanum). Moreover, all three Cypripedium species could inhabit Cordillera Neovolcánica according to the obtained models; however, it should be noticed that this region is well-distanced from the edges of the known geographical range of C. molle.
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.

PubMed

Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani

2018-01-01

Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Six Nuclear Gene Phylogeny of Citrus (Rutaceae) Taking into Account Hybridization and Lineage Sorting

PubMed Central

Keremane, Manjunath L.; Lee, Richard F.; Maureira-Butler, Ivan J.; Roose, Mikeal L.

2013-01-01

Background Genus Citrus (Rutaceae) comprises many important cultivated species that generally hybridize easily. Phylogenetic study of a group showing extensive hybridization is challenging. Since the genus Citrus has diverged recently (4–12 Ma), incomplete lineage sorting of ancestral polymorphisms is also likely to cause discrepancies among genes in phylogenetic inferences. Incongruence of gene trees is observed and it is essential to unravel the processes that cause inconsistencies in order to understand the phylogenetic relationships among the species. Methodology and Principal Findings (1) We generated phylogenetic trees using haplotype sequences of six low copy nuclear genes. (2) Published simple sequence repeat data were re-analyzed to study population structure and the results were compared with the phylogenetic trees constructed using sequence data and coalescence simulations. (3) To distinguish between hybridization and incomplete lineage sorting, we developed and utilized a coalescence simulation approach. In other studies, species trees have been inferred despite the possibility of hybridization having occurred and used to generate null distributions of the effect of lineage sorting alone (by coalescent simulation). Since this is problematic, we instead generate these distributions directly from observed gene trees. Of the six trees generated, we used the most resolved three to detect hybrids. We found that 11 of 33 samples appear to be affected by historical hybridization. Analysis of the remaining three genes supported the conclusions from the hybrid detection test. Conclusions We have identified or confirmed probable hybrid origins for several Citrus cultivars using three different approaches–gene phylogenies, population structure analysis and coalescence simulation. Hybridization and incomplete lineage sorting were identified primarily based on differences among gene phylogenies with reference to null expectations via coalescence simulations. 
We conclude that identifying hybridization as a frequent cause of incongruence among gene trees is critical to correctly infer the phylogeny among species of Citrus. PMID:23874615
Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition.

PubMed

Tamura, Takeyuki; Akutsu, Tatsuya

2007-11-30

Subcellular location prediction of proteins is an important and well-studied problem in bioinformatics. This is a problem of predicting which part in a cell a given protein is transported to, where an amino acid sequence of the protein is given as an input. This problem is becoming more important since information on subcellular location is helpful for annotation of proteins and genes and the number of complete genomes is rapidly increasing. Since existing predictors are based on various heuristics, it is important to develop a simple method with high prediction accuracies. In this paper, we propose a novel and general predicting method by combining techniques for sequence alignment and feature vectors based on amino acid composition. We implemented this method with support vector machines on plant data sets extracted from the TargetP database. Through fivefold cross validation tests, the obtained overall accuracies and average MCC were 0.9096 and 0.8655 respectively. We also applied our method to other datasets including that of WoLF PSORT. Although there is a predictor which uses the information of gene ontology and yields higher accuracy than ours, our accuracies are higher than existing predictors which use only sequence information. Since such information as gene ontology can be obtained only for known proteins, our predictor is considered to be useful for subcellular location prediction of newly-discovered proteins. Furthermore, the idea of combination of alignment and amino acid frequency is novel and general so that it may be applied to other problems in bioinformatics. Our method for plant is also implemented as a web-system and available on http://sunflower.kuicr.kyoto-u.ac.jp/~tamura/slpfa.html.
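Two quantities mentioned in this abstract are easy to show in code: an amino acid composition feature vector and the Matthews correlation coefficient (MCC) used to report accuracy. The sketch below is a minimal illustration under stated assumptions; the function names and the 20-letter alphabet ordering are chosen for this example, and the alignment-of-block-sequences step of the published method is not reproduced.

```python
from math import sqrt

# Standard 20 amino acids in alphabetical one-letter order (ordering is an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(seq):
    """Return a 20-dimensional vector of residue frequencies for one protein."""
    seq = seq.upper()
    counts = [seq.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)  # non-standard letters are ignored
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [c / total for c in counts]


def matthews_corrcoef(tp, tn, fp, fn):
    """Binary MCC from confusion-matrix counts; 0.0 when the denominator is zero."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom
```

For a multi-class problem such as subcellular location, an "average MCC" is commonly obtained by computing the binary MCC for each class against the rest and averaging; whether the authors did exactly this is not stated in the abstract.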
Influence of Stacking Sequence and Notch Angle on the Charpy Impact Behavior of Hybrid Composites

NASA Astrophysics Data System (ADS)

Behnia, S.; Daghigh, V.; Nikbin, K.; Fereidoon, A.; Ghorbani, J.

2016-09-01

The low-velocity impact behavior of hybrid composite laminates was investigated. The epoxy matrix was reinforced with aramid, glass, basalt, and carbon fabrics using the hand lay-up technique. Different stacking sequences and notch angles were considered and tested using a Charpy impact testing machine to study the hybridization and notch angle effects on the impact response of the hybrid composites. The energy absorption capability of specimens with different stacking sequences and notch angles is compared and discussed. It is shown that the hybridization can enhance the mechanical performance of composite materials.
RAD sequencing yields a high success rate for westslope cutthroat and rainbow trout species-diagnostic SNP assays

USGS Publications Warehouse

Amish, Stephen J.; Hohenlohe, Paul A.; Painter, Sally; Leary, Robb F.; Muhlfeld, Clint C.; Allendorf, Fred W.; Luikart, Gordon

2012-01-01

Hybridization with introduced rainbow trout threatens most native westslope cutthroat trout populations. Understanding the genetic effects of hybridization and introgression requires a large set of high-throughput, diagnostic genetic markers to inform conservation and management. Recently, we identified several thousand candidate single-nucleotide polymorphism (SNP) markers based on RAD sequencing of 11 westslope cutthroat trout and 13 rainbow trout individuals. Here, we used flanking sequence for 56 of these candidate SNP markers to design high-throughput genotyping assays. We validated the assays on a total of 92 individuals from 22 populations and seven hatchery strains. Forty-six assays (82%) amplified consistently and allowed easy identification of westslope cutthroat and rainbow trout alleles as well as heterozygote controls. The 46 SNPs will provide high power for early detection of population admixture and improved identification of hybrid and nonhybridized individuals. This technique shows promise as a very low-cost, reliable and relatively rapid method for developing and testing SNP markers for nonmodel organisms with limited genomic resources.
Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae.

PubMed

Zubek, Julian; Tatjewski, Marcin; Boniecki, Adam; Mnich, Maciej; Basu, Subhadip; Plewczynski, Dariusz

2015-01-01

Accurate identification of protein-protein interactions (PPI) is the key step in understanding proteins' biological functions, which are typically context-dependent. Many existing PPI predictors rely on aggregated features from protein sequences, however only a few methods exploit local information about specific residue contacts. In this work we present a two-stage machine learning approach for prediction of protein-protein interactions. We start with the carefully filtered data on protein complexes available for Saccharomyces cerevisiae in the Protein Data Bank (PDB) database. First, we build linear descriptions of interacting and non-interacting sequence segment pairs based on their inter-residue distances. Secondly, we train machine learning classifiers to predict binary segment interactions for any two short sequence fragments. The final prediction of the protein-protein interaction is done using the 2D matrix representation of all-against-all possible interacting sequence segments of both analysed proteins. The level-I predictor achieves 0.88 AUC for micro-scale, i.e., residue-level prediction. The level-II predictor improves the results further by a more complex learning paradigm. We perform 30-fold macro-scale, i.e., protein-level cross-validation experiment. The level-II predictor using PSIPRED-predicted secondary structure reaches 0.70 precision, 0.68 recall, and 0.70 AUC, whereas other popular methods provide results below 0.6 threshold (recall, precision, AUC). Our results demonstrate that multi-scale sequence features aggregation procedure is able to improve the machine learning results by more than 10% as compared to other sequence representations. Prepared datasets and source code for our experimental pipeline are freely available for download from: http://zubekj.github.io/mlppi/ (open source Python implementation, OS independent).
Assessing genetic divergence in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism markers.

PubMed

Zhang, F; Ge, Y Y; Wang, W Y; Shen, X L; Yu, X Y

2012-12-03

Conventional hybridization and selection techniques have aided the development of new ornamental crop cultivars. However, little information is available on the genetic divergence of bromeliad hybrids. In the present study, we investigated the genetic variability in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism (SRAP) markers. The morphological analysis showed that the putative hybrids were intermediate between both parental species with respect to inflorescence characteristics. The 16 SRAP primer combinations yielded 265 bands, among which 154 (57.72%) were polymorphic. The genetic similarity averaged 0.59 and ranged from 0.21 to 0.87, indicating moderate genetic divergence among the hybrids. The unweighted pair group method with arithmetic average (UPGMA)-based cluster analysis distinguished the hybrids from their parents with a genetic distance coefficient of 0.54. The cophenetic correlation was 0.93, indicating a good fit between the dendrogram and the original distance matrix. The two-dimensional plot from the principal coordinate analysis showed that the hybrids were intermediately dispersed between both parents, corresponding to the results of the UPGMA cluster and the morphological analysis. These results suggest that SRAP markers could help to identify breeders, characterize F(1) hybrids of bromeliads at an early stage, and expedite genetic improvement of bromeliad cultivars.
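Band-based marker analyses of this kind usually start from a presence/absence matrix, with one row per plant and one column per band. The sketch below shows how a polymorphism percentage and a pairwise similarity could be derived from such a matrix. The function names are hypothetical, and the Jaccard coefficient is an assumed choice; the abstract does not state which similarity coefficient the authors used.

```python
def polymorphic_percentage(bands):
    """Percentage of bands (columns) that are not identical across all samples."""
    columns = list(zip(*bands))
    polymorphic = sum(1 for col in columns if len(set(col)) > 1)
    return 100.0 * polymorphic / len(columns)


def jaccard_similarity(a, b):
    """Shared bands divided by bands present in at least one of two samples."""
    shared = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return shared / either if either else 1.0


# Toy matrix: 3 samples scored for 4 bands (1 = band present, 0 = absent).
toy = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 0],
]
print(polymorphic_percentage(toy))
print(jaccard_similarity(toy[0], toy[1]))
```

Subtracting each pairwise similarity from 1 gives a distance matrix that a UPGMA (average-linkage) clustering routine can take as input.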
Production and molecular characterization of somatic hybrids between Pleurotus florida and Lentinula edodes.

PubMed

Mallick, Pijush; Sikdar, Samir Ranjan

2014-08-01

Nine inter-generic somatic hybrids, named pfle, were produced through PEG-mediated protoplast fusion between Pleurotus florida and Lentinula edodes using a double selection method. Hybridity of the newly developed strains was established on the basis of colony morphology, mycelial growth, hyphal traits, fruit-body productivity and inter-simple sequence repeat (ISSR) marker profiling. The hybrid population was assessed with different phenotypic variables by one-way analysis of variance. Principal component matrices were analyzed for the six phenotypic variables in a scatter plot showing maximum positive correlation between each variable for all strains examined. Six ISSR primers generated 66 reproducible fragments with 98.48% polymorphism. The dendrogram, created using the unweighted pair-group method with arithmetic averages (UPGMA) for clustering and Euclidean distance, exhibited three major groups among the parents and pfle hybrids. Although the P. florida parent remained in one group, it showed different degrees of genetic distance from all the hybrid lines belonging to the other two groups, while L. edodes was most distantly related to all the hybrid lines. An L. edodes-specific sequence-rich ISSR amplicon was recorded in all the hybrid lines and in L. edodes but not in P. florida. All the fruit body-generating pfle hybrid lines could produce basidiocarps on paddy straw in a sub-tropical climate and showed phenotypic resemblance to the P. florida parent.
Patterns of genetic diversity and candidate genes for ecological divergence in a homoploid hybrid sunflower, Helianthus anomalus

PubMed Central

SAPIR, YUVAL; MOODY, MICHAEL L.; BROUILLETTE, LARRY C.; DONOVAN, LISA A.; RIESEBERG, LOREN H.

2008-01-01

Natural hybridization accompanied by a shift in niche preference by hybrid genotypes can lead to hybrid speciation. Natural selection may cause the fixation of advantageous alleles in the ecologically diverged hybrids, and the loci experiencing selection should exhibit a reduction in allelic diversity relative to neutral loci. Here, we analyzed patterns of genetic diversity at 59 microsatellite loci associated with expressed sequence tags (ESTs) in a homoploid hybrid sunflower species, Helianthus anomalus. We used two indices, ln RV and ln RH, to compare variation and heterozygosity (respectively) at each locus between the hybrid species and its two parental species, H. annuus and H. petiolaris. Mean values of ln RV and ln RH were significantly lower than zero, which implies that H. anomalus experienced a population bottleneck during its recent evolutionary history. After correcting for the apparent bottleneck, we found six loci with a significant reduction in variation or with heterozygosity in the hybrid species, compared to one or both of the parental species. These loci should be viewed as a ranked list of candidate loci, pending further sequencing and functional analyses. Sequence data were generated for two of the candidate loci, but population genetics tests failed to detect deviations from neutral evolution at either locus. Nonetheless, a greater than eight-fold excess of nonsynonymous substitutions was found near a putative N-myristoylation motif at the second locus (HT998), and likelihood-based models indicated that the protein has been under selection in H. anomalus in the past and, perhaps, in one or both parental species. Finally, our data suggest that selective sweeps may have united populations of H. anomalus isolated by a mountain range, indicating that even low gene-flow species may be held together by the spread of advantageous alleles. PMID:17944850
Nucleotide sequence of the Varkud mitochondrial plasmid of Neurospora and synthesis of a hybrid transcript with a 5' leader derived from mitochondrial RNA.

PubMed

Akins, R A; Grant, D M; Stohl, L L; Bottorff, D A; Nargang, F E; Lambowitz, A M

1988-11-05

The Mauriceville and Varkud mitochondrial plasmids of Neurospora are closely related, closed circular DNAs (3.6 and 3.7 kb, respectively; 1 kb = 10(3) bases or base-pairs), whose characteristics suggest relationships to mitochondrial DNA introns and retrotransposons. Here, we characterized the structure of the Varkud plasmid, determined its complete nucleotide sequence and mapped its major transcripts. The Mauriceville and Varkud plasmids have more than 97% positional identity. Both plasmids contain a 710 amino acid open reading frame that encodes a reverse transcriptase-like protein. The amino acid sequence of this open reading frame is strongly conserved between the two plasmids (701/710 amino acids) as expected for a functionally important protein. Both plasmids have a 0.4 kb region that contains five PstI palindromes and a direct repeat of approximately 160 base-pairs. Comparison of sequences in this region suggests that the Varkud plasmid has diverged less from a common ancestor than has the Mauriceville plasmid. Two major transcripts of the Varkud plasmid were detected by Northern hybridization experiments: a full-length linear RNA of 3.7 kb and an additional prominent transcript of 4.9 kb, 1.2 kb longer than monomer plasmid. Remarkably, we find that the 4.9 kb transcript is a hybrid RNA consisting of the full-length 3.7 kb Varkud plasmid transcript plus a 5' leader of 1.2 kb that is derived from the 5' end of the mitochondrial small rRNA. This and other findings suggest that the Varkud plasmid, like certain RNA viruses, has a mechanism for joining heterologous RNAs to the 5' end of its major transcript, and that, under some circumstances, nucleotide sequences in mitochondria may be recombined at the RNA level.
Structure of the circumsporozoite protein gene in 18 strains of Plasmodium falciparum.

PubMed

Weber, J L; Hockmeyer, W T

1985-06-01

Using the cloned circumsporozoite (CS) protein gene of a Brazilian strain of Plasmodium falciparum as probe, we have analyzed the structure of the CS protein gene from 17 other Asian, African, Central and South American parasite strains by nucleic acid hybridization. Each strain appears to have one CS protein gene which hybridizes readily to the Brazilian strain probe. The 5' and 3' thirds of the genes are invariant in size in all 18 strains whereas the central third containing the 12 base pair tandem repeats varies in size over a range of about 100 base pairs. Several differences were found in the locations of Sau3A sites in the genes. The Sau3A sites are significant because each of the minority Asn-Val-Asp-Pro repeats in the cloned gene has a Sau3A site. DNA melting of hybrids revealed a high degree of homology between the sequences of the cloned gene and genes from an Asian strain and an African strain. A 14 base oligodeoxynucleotide with a sequence from the central repeat region hybridized to all strains tested. We conclude that the CS protein gene is highly conserved among strains of P. falciparum and that malaria vaccine development with the CS protein is unlikely to be complicated by strain variation.
Development of a DNA-Based Method for Distinguishing the Malaria Vectors, Anopheles Gambiae from Anopheles Arabiensis.

DTIC Science & Technology

1987-11-15

analysis. However, in our preliminary studies, hybridization with the Drosophila actin probe required such low stringency conditions that the signal to...rDNA genes and could therefore contain sequences which, under normal DNA hybridization conditions, behave in a species-specific manner. We therefore...pAGr23B) behave as species-specific probes under the conditions normally used for DNA hybridization. These sequences could be used to design specific
Development of a microcapillary column for detecting targeted messenger RNA molecules.

PubMed

Ohnishi, Michihiro

2006-03-24

A capillary column in a rapid-flow system has been developed for detecting targeted messenger RNA (mRNA) molecules. The column has a structure made of two beds-one bed of porous microbeads and one bed of microbeads with a polythymidine base sequence. The targeted eukaryotic mRNA molecules are detected by two-step hybridization (sandwich hybridization) composed of polyadenosine selection of mRNA molecules and formation of a probe-target (targeted mRNA) hybrid. The sandwich hybridization, which is accomplished within 1 h, was tested using synthetic polydeoxynucleotides. Ten picomoles of the targeted polydeoxynucleotide were detected.
Inferring Phylogenetic Relationships of Indian Citron (Citrus medica L.) based on rbcL and matK Sequences of Chloroplast DNA.

PubMed

Uchoi, Ajit; Malik, Surendra Kumar; Choudhary, Ravish; Kumar, Susheel; Rohini, M R; Pal, Digvender; Ercisli, Sezai; Chaudhury, Rekha

2016-06-01

Phylogenetic relationships of Indian Citron (Citrus medica L.) with other important Citrus species have been inferred through sequence analyses of rbcL and matK gene region of chloroplast DNA. The study was based on 23 accessions of Citrus genotypes representing 15 taxa of Indian Citrus, collected from wild, semi-wild, and domesticated stocks. The phylogeny was inferred using the maximum parsimony (MP) and neighbor-joining (NJ) methods. Both MP and NJ trees separated all the 23 accessions of Citrus into five distinct clusters. The chloroplast DNA (cpDNA) analysis based on rbcL and matK sequence data carried out in Indian taxa of Citrus was useful in differentiating all the true species and species/varieties of probable hybrid origin in distinct clusters or groups. Sequence analysis based on rbcL and matK gene provided unambiguous identification and disposition of true species like C. maxima, C. medica, C. reticulata, and related hybrids/cultivars. The separation of C. maxima, C. medica, and C. reticulata in distinct clusters or sub-clusters supports their distinctiveness as the basic species of edible Citrus. However, the cpDNA sequence analysis of rbcL and matK gene could not find any clear cut differentiation between subgenera Citrus and Papeda as proposed in Swingle's system of classification.
Progressive Dictionary Learning with Hierarchical Predictive Structure for Scalable Video Coding.

PubMed

Dai, Wenrui; Shen, Yangmei; Xiong, Hongkai; Jiang, Xiaoqian; Zou, Junni; Taubman, David

2017-04-12

Dictionary learning has emerged as a promising alternative to the conventional hybrid coding framework. However, the rigid structure of sequential training and prediction degrades its performance in scalable video coding. This paper proposes a progressive dictionary learning framework with hierarchical predictive structure for scalable video coding, especially in the low-bitrate region. For pyramidal layers, sparse representation based on a spatio-temporal dictionary is adopted to improve the coding efficiency of enhancement layers (ELs) with a guarantee of reconstruction performance. The overcomplete dictionary is trained to adaptively capture local structures along motion trajectories as well as exploit the correlations between neighboring layers of resolutions. Furthermore, progressive dictionary learning is developed to enable scalability in the temporal domain and restrict error propagation in a closed-loop predictor. Under the hierarchical predictive structure, online learning is leveraged to guarantee the training and prediction performance with an improved convergence rate. To accommodate the state-of-the-art scalable extension of H.264/AVC and the latest HEVC, standardized codec cores are utilized to encode the base and enhancement layers. Experimental results show that the proposed method outperforms the latest SHVC and HEVC simulcast over extensive test sequences with various resolutions.
Mapping neurofibromatosis 1 homologous loci by fluorescence in situ hybridization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Viskochil, D.; Breidenbach, H.H.; Cawthon, R.

Neurofibromatosis 1 maps to chromosome band 17q11.2 and the NF1 gene is comprised of 59 exons that span approximately 335 kb of genomic DNA. In order to further analyze the structure of NF1 from exons 2 through 27b, we isolated a number of cosmid and bacteriophage P-1 genomic clones using NF1-exon probes under high-stringency hybridization conditions. Using tagged, intron-based primers and DNA from various clones as a template, we PCR-amplified and sequenced individual NF1 exons. The exon sequences in PCR products from several genomic clones differed from the exon sequence derived from cloned NF1 cDNAs. Clones with variant sequences were mapped by fluorescence in situ hybridization under high-stringency conditions. Three clones mapped to chromosome band 15q11.2, one mapped to 14q11.2, one mapped to both 2q14.1-14.3 and 14q11.2, one mapped to 2q33-34, and one mapped to both 18q11.2 and 21q21. Even though some PCR-product sequences retained proper splice junctions and open reading frames, we have yet to identify cDNAs that correspond to the variant exon sequences. We are now sequencing clones that map to NF1-homologous loci in order to develop discriminating primer pairs for the exclusive amplification of NF1-specific sequences in our efforts to develop a comprehensive NF1 mutation screen using genomic DNA as template. The role NF1-homologous sequences may play in neurofibromatosis 1 is not clear.
Mouse Vk gene classification by nucleic acid sequence similarity.

PubMed

Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

1989-01-01

Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
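The family definition above (members sharing greater than 80% overall nucleic acid sequence similarity) can be illustrated with a small sketch. It is not the authors' procedure: it assumes the sequences are already aligned to equal length, scores similarity as plain percent identity, and uses single-linkage grouping, all of which are assumptions made for illustration. The gene names and toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    # Skip columns where both sequences carry a gap character.
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def group_into_families(seqs, threshold=80.0):
    """Single-linkage grouping: two genes are linked when identity exceeds threshold."""
    parent = {name: name for name in seqs}

    def find(name):
        # Union-find lookup with path halving.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(seqs, 2):
        if percent_identity(seqs[a], seqs[b]) > threshold:
            parent[find(a)] = find(b)

    families = {}
    for name in seqs:
        families.setdefault(find(name), set()).add(name)
    return sorted(families.values(), key=sorted)


# Hypothetical toy sequences, not real Vk genes.
toy = {"vk_a": "ACGTACGTAC", "vk_b": "ACGTACGTAA", "vk_c": "TTTTGGGGCC"}
print(group_into_families(toy))
```

Single linkage can chain distantly related genes into one family through intermediates, which is one way closely related families, like those the abstract describes, could end up merged under a fixed threshold.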
Estimating Genomic Distance from DNA Sequence Location in Cell Nuclei by a Random Walk Model

NASA Astrophysics Data System (ADS)

van den Engh, Ger; Sachs, Rainer; Trask, Barbara J.

1992-09-01

The folding of chromatin in interphase cell nuclei was studied by fluorescent in situ hybridization with pairs of unique DNA sequence probes. The sites of DNA sequences separated by 100 to 2000 kilobase pairs (kbp) are distributed in interphase chromatin according to a random walk model. This model provides the basis for calculating the spacing of sequences along the linear DNA molecule from interphase distance measurements. An interphase mapping strategy based on this model was tested with 13 probes from a 4-megabase pair (Mbp) region of chromosome 4 containing the Huntington disease locus. The results confirmed the locations of the probes and showed that the remaining gap in the published maps of this region is negligible in size. Interphase distance measurements should facilitate construction of chromosome maps with an average marker density of one per 100 kbp, approximately ten times greater than that achieved by hybridization to metaphase chromosomes.
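A minimal sketch of how a random walk model can turn interphase distance measurements into a genomic distance estimate. It relies only on the general property that the mean-square end-to-end distance of a random walk grows linearly with the number of steps, so that the mean-square physical distance is taken as c times the genomic separation. The fitting approach, the function names, and all numbers below are hypothetical and are not taken from the study.

```python
def fit_scaling_constant(calibration):
    """Least-squares slope through the origin for <r^2> = c * d.

    calibration: list of (genomic_separation_kbp, mean_square_distance_um2)
    for probe pairs whose genomic separation is already known.
    """
    numerator = sum(d * r2 for d, r2 in calibration)
    denominator = sum(d * d for d, _ in calibration)
    if denominator == 0:
        raise ValueError("need at least one pair with nonzero separation")
    return numerator / denominator


def estimate_separation_kbp(distances_um, c):
    """Estimate genomic separation from measured interphase distances (micrometres)."""
    if not distances_um:
        raise ValueError("no measurements")
    mean_square = sum(r * r for r in distances_um) / len(distances_um)
    return mean_square / c


# Hypothetical calibration pairs: (kbp, mean-square distance in um^2).
calibration = [(100, 0.5), (400, 2.0), (1000, 5.0)]
c = fit_scaling_constant(calibration)

# Hypothetical distance measurements for a probe pair of unknown separation.
print(estimate_separation_kbp([1.0, 2.0, 1.0, 2.0], c))
```

The estimate uses the mean of squared distances rather than the square of the mean distance, since the linear relationship holds for the former. The abstract reports the model only for separations of 100 to 2000 kbp, so a sketch like this should not be extrapolated beyond that range.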
Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tuskan, Gerald A; Gunter, Lee E; DiFazio, Stephen P

The 18S-28S rDNA and 5S rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 18S-28S rDNA sites and one 5S rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones selected from 2 linkage groups based on genome sequence assembly (LG-I and LG-VI) were localized on 2 chromosomes, as expected. BACs from LG-I hybridized to the longest chromosome in the complement. All BAC positions were found to be concordant with sequence assembly positions. BAC-FISH will be useful for delineating each of the Populus trichocarpa chromosomes and improving the sequence assembly of this model angiosperm tree species.

NEBNext Direct: A Novel, Rapid, Hybridization-Based Approach for the Capture and Library Conversion of Genomic Regions of Interest.

PubMed

Emerman, Amy B; Bowman, Sarah K; Barry, Andrew; Henig, Noa; Patel, Kruti M; Gardner, Andrew F; Hendrickson, Cynthia L

2017-07-05

Next-generation sequencing (NGS) is a powerful tool for genomic studies, translational research, and clinical diagnostics that enables the detection of single nucleotide polymorphisms, insertions and deletions, copy number variations, and other genetic variations. Target enrichment technologies improve the efficiency of NGS by only sequencing regions of interest, which reduces sequencing costs while increasing coverage of the selected targets. Here we present NEBNext Direct®, a hybridization-based, target-enrichment approach that addresses many of the shortcomings of traditional target-enrichment methods. This approach features a simple, 7-hr workflow that uses enzymatic removal of off-target sequences to achieve a high specificity for regions of interest. Additionally, unique molecular identifiers are incorporated for the identification and filtering of PCR duplicates. The same protocol can be used across a wide range of input amounts, input types, and panel sizes, enabling NEBNext Direct to be broadly applicable across a wide variety of research and diagnostic needs. © 2017 by John Wiley & Sons, Inc.
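The idea of using unique molecular identifiers (UMIs) to filter PCR duplicates can be sketched as follows. This is a generic illustration, not the NEBNext Direct pipeline: it assumes reads are duplicates when they share chromosome, start position, and an exactly matching UMI, and it keeps the read with the highest mean base quality. The `Read` type and its fields are hypothetical.

```python
from typing import NamedTuple


class Read(NamedTuple):
    # Hypothetical minimal read record for illustration only.
    chrom: str
    start: int
    umi: str
    mean_quality: float


def filter_pcr_duplicates(reads):
    """Keep one read per (chrom, start, UMI) group: the highest-quality one."""
    best = {}
    for read in reads:
        key = (read.chrom, read.start, read.umi)
        if key not in best or read.mean_quality > best[key].mean_quality:
            best[key] = read
    return list(best.values())


reads = [
    Read("chr1", 1000, "ACGTAC", 30.0),
    Read("chr1", 1000, "ACGTAC", 35.0),  # same molecule: PCR duplicate
    Read("chr1", 1000, "TTGCAA", 28.0),  # same position, different molecule
    Read("chr2", 500, "ACGTAC", 32.0),
]
for kept in filter_pcr_duplicates(reads):
    print(kept)
```

Exact UMI matching is the simplest choice. Tolerating sequencing errors within the UMI would require grouping UMIs that differ by a small edit distance, which this sketch does not attempt.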
A hybrid swarm population of Pinus densiflora x P. sylvestris hybrids inferred from sequence analysis of chloroplast DNA and morphological characters

USDA-ARS's Scientific Manuscript database

To confirm a hybrid swarm population of Pinus densiflora × P. sylvestris in Jilin, China and to study whether shoot apex morphology of 4-year old seedlings can be correlated with the sequence of a chloroplast DNA simple sequence repeat marker (cpDNA SSR), needles and seeds from P. densiflora, P. syl...
Combined subtraction hybridization and polymerase chain reaction amplification procedure for isolation of strain-specific Rhizobium DNA sequences.

PubMed Central

Bjourson, A J; Stone, C E; Cooper, J E

1992-01-01

A novel subtraction hybridization procedure, incorporating a combination of four separation strategies, was developed to isolate unique DNA sequences from a strain of Rhizobium leguminosarum bv. trifolii. Sau3A-digested DNA from this strain, i.e., the probe strain, was ligated to a linker and hybridized in solution with an excess of pooled subtracter DNA from seven other strains of the same biovar which had been restricted, ligated to a different, biotinylated, subtracter-specific linker, and amplified by polymerase chain reaction to incorporate dUTP. Subtracter DNA and subtracter-probe hybrids were removed by phenol-chloroform extraction of a streptavidin-biotin-DNA complex. NENSORB chromatography of the sequences remaining in the aqueous layer captured biotinylated subtracter DNA which may have escaped removal by phenol-chloroform treatment. Any traces of contaminating subtracter DNA were removed by digestion with uracil DNA glycosylase. Finally, remaining sequences were amplified by polymerase chain reaction with a probe strain-specific primer, labelled with 32P, and tested for specificity in dot blot hybridizations against total genomic target DNA from each strain in the subtracter pool. Two rounds of subtraction-amplification were sufficient to remove cross-hybridizing sequences and to give a probe which hybridized only with homologous target DNA. The method is applicable to the isolation of DNA and RNA sequences from both procaryotic and eucaryotic cells. PMID:1637166
Prediction of Disease Causing Non-Synonymous SNPs by the Artificial Neural Network Predictor NetDiseaseSNP

PubMed Central

Johansen, Morten Bo; Izarzugaza, Jose M. G.; Brunak, Søren; Petersen, Thomas Nordahl; Gupta, Ramneek

2013-01-01

We have developed a sequence conservation-based artificial neural network predictor called NetDiseaseSNP which classifies nsSNPs as disease-causing or neutral. Our method uses the excellent alignment generation algorithm of SIFT to identify related sequences and a combination of 31 features assessing sequence conservation and the predicted surface accessibility to produce a single score which can be used to rank nsSNPs based on their potential to cause disease. NetDiseaseSNP classifies successfully disease-causing and neutral mutations. In addition, we show that NetDiseaseSNP discriminates cancer driver and passenger mutations satisfactorily. Our method outperforms other state-of-the-art methods on several disease/neutral datasets as well as on cancer driver/passenger mutation datasets and can thus be used to pinpoint and prioritize plausible disease candidates among nsSNPs for further investigation. NetDiseaseSNP is publicly available as an online tool as well as a web service: http://www.cbs.dtu.dk/services/NetDiseaseSNP PMID:23935863
Sequencing of real-world samples using a microfabricated hybrid device having unconstrained straight separation channels.

PubMed

Liu, Shaorong; Elkin, Christopher; Kapur, Hitesh

2003-11-01

We describe a microfabricated hybrid device that consists of a microfabricated chip containing multiple twin-T injectors attached to an array of capillaries that serve as the separation channels. A new fabrication process was employed to create two differently sized round channels in a chip. Twin-T injectors were formed by the smaller round channels that match the bore of the separation capillaries and separation capillaries were incorporated to the injectors through the larger round channels that match the outer diameter of the capillaries. This allows for a minimum dead volume and provides a robust chip/capillary interface. This hybrid design takes full advantage, such as sample stacking and purification and uniform signal intensity profile, of the unique chip injection scheme for DNA sequencing while employing long straight capillaries for the separations. In essence, the separation channel length is optimized for both speed and resolution since it is unconstrained by chip size. To demonstrate the reliability and practicality of this hybrid device, we sequenced over 1000 real-world samples from Human Chromosome 5 and Ciona intestinalis, prepared at Joint Genome Institute. We achieved average Phred20 read of 675 bases in about 70 min with a success rate of 91%. For the similar type of samples on MegaBACE 1000, the average Phred20 read is about 550-600 bases in 120 min separation time with a success rate of about 80-90%.
The application of magnetic bead hybridization for the recovery and STR amplification of degraded and inhibited forensic DNA.

PubMed

Wang, Jing; McCord, Bruce

2011-06-01

A common problem in the analysis of forensic DNA evidence is the presence of environmentally degraded and inhibited DNA. Such samples produce a variety of interpretational problems such as allele imbalance, allele dropout and sequence-specific inhibition. In an attempt to develop methods to enhance the recovery of this type of evidence, magnetic bead hybridization has been applied to extract and preconcentrate DNA sequences containing short tandem repeat (STR) alleles of interest. In this work, genomic DNA was fragmented by heating, and sequences associated with STR alleles were selectively hybridized to allele-specific biotinylated probes. Each particular biotinylated probe-DNA complex was bound to streptavidin-coated magnetic beads, enabling enrichment of target DNA sequences. Experiments conducted using degraded DNA samples, as well as samples containing a large concentration of inhibitory substances, showed good specificity and recovery of missing alleles. Based on the favorable results obtained with these specific probes, this method should prove useful as a tool to improve the recovery of alleles from degraded and inhibited DNA samples. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Targeted genotyping-by-sequencing permits cost-effective identification and discrimination of pasture grass species and cultivars.

PubMed

Pembleton, Luke W; Drayton, Michelle C; Bain, Melissa; Baillie, Rebecca C; Inch, Courtney; Spangenberg, German C; Wang, Junping; Forster, John W; Cogan, Noel O I

2016-05-01

A targeted amplicon-based genotyping-by-sequencing approach has permitted cost-effective and accurate discrimination between ryegrass species (perennial, Italian and inter-species hybrid), and identification of cultivars based on bulked samples. Perennial ryegrass and Italian ryegrass are the most important temperate forage species for global agriculture, and are represented in the commercial pasture seed market by numerous cultivars each composed of multiple highly heterozygous individuals. Previous studies have identified difficulties in the use of morphophysiological criteria to discriminate between these two closely related taxa. Recently, a highly multiplexed single nucleotide polymorphism (SNP)-based genotyping assay has been developed that permits accurate differentiation between both species and cultivars of ryegrasses at the genetic level. This assay has since been further developed into an amplicon-based genotyping-by-sequencing (GBS) approach implemented on a second-generation sequencing platform, allowing accelerated throughput and ca. sixfold reduction in cost. Using the GBS approach, 63 cultivars of perennial, Italian and interspecific hybrid ryegrasses, as well as intergeneric Festulolium hybrids, were genotyped. The genetic relationships between cultivars were interpreted in terms of known breeding histories and indistinct species boundaries within the Lolium genus, as well as suitability of current cultivar registration methodologies. An example of applicability to quality assurance and control (QA/QC) of seed purity is also described. Rapid, low-cost genotypic assays provide new opportunities for breeders to more fully explore genetic diversity within breeding programs, allowing the combination of novel unique genetic backgrounds. Such tools also offer the potential to more accurately define cultivar identities, allowing protection of varieties in the commercial market and supporting processes of cultivar accreditation and quality assurance.
A microfabricated hybrid device for DNA sequencing.

PubMed

Liu, Shaorong

2003-11-01

We have created a hybrid device of a microfabricated round-channel twin-T injector incorporated with a separation capillary in order to extend the straight separation distance for high speed and long readlength DNA sequencing. Semicircular grooves on glass wafers are obtained using a photomask with a narrow line-width and a standard isotropic photolithographic etching process. Round channels are made when two etched wafers are face-to-face aligned and bonded. A two-mask fabrication process has been developed to make channels of two different diameters. The twin-T injector is formed by the smaller channels whose diameter matches the bore of the separation capillary, and the "usual" separation channel, now called the connection channel, is formed by the larger ones whose diameter matches the outer diameter of the separation capillary. The separation capillary is inserted through the connection channel all the way to the twin-T injector to allow the capillary bore flush with the twin-T injector channels. The total dead-volume of the connection is estimated to be approximately 5 pL. To demonstrate the efficiency of this hybrid device, we have performed four-color DNA sequencing on it. Using a 200 microm twin-T injector coupled with a separation capillary of 20 cm effective separation distance, we have obtained readlengths of 800 plus bases at an accuracy of 98.5% in 56 min, compared to about 650 bases in 100 min on a conventional 40 cm long capillary sequencing machine under similar conditions. At an increased separation field strength and using a diluted sieving matrix, the separation time has been reduced to 20 min with a readlength of 700 bases at 98.5% base-calling accuracy.
Pilot Evaluation of Adaptive Control in Motion-Based Flight Simulator

NASA Technical Reports Server (NTRS)

Kaneshige, John T.; Campbell, Stefan Forrest

2009-01-01

The objective of this work is to assess the strengths, weaknesses, and robustness characteristics of several MRAC (Model-Reference Adaptive Control) based adaptive control technologies garnering interest from the community as a whole. To facilitate this, a control study using piloted and unpiloted simulations to evaluate sensitivities and handling qualities was conducted. The adaptive control technologies under consideration were ALR (Adaptive Loop Recovery), BLS (Bounded Linear Stability), Hybrid Adaptive Control, L1, OCM (Optimal Control Modification), PMRAC (Predictor-based MRAC), and traditional MRAC.
Artificial mismatch hybridization

DOEpatents

Guo, Zhen; Smith, Lloyd M.

1998-01-01

An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including, applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

PubMed

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. 
Finally, there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5' noncoding portions of these glycolytic genes.
Synthesis and hybridization of a series of biotinylated oligonucleotides.

PubMed Central

Cook, A F; Vuocolo, E; Brakel, C L

1988-01-01

A series of oligonucleotides containing biotin-11-dUMP at various positions were synthesized and compared in quantitative, colorimetric hybridization-detection studies. A deoxyuridine phosphoramidite containing a protected allylamino sidearm was synthesized and used in standard, automated synthesis cycles to prepare oligonucleotides with allylamino residues at various positions within a standard 17-base sequence. Biotin substituents were subsequently attached to the allylamino sidearms by reaction with N-biotinyl-6-aminocaproic acid N-hydroxysuccinimide ester. These oligomers were hybridized to target DNA immobilized on microtiter wells (ELISA plates), and were detected with a streptavidin-biotinylated horseradish peroxidase complex using hydrogen peroxide as substrate and o-phenylenediamine as chromogen. We found that the sensitivity of detection of target DNA by biotin-labeled oligonucleotide probes was strongly dependent upon the position of the biotin label. Oligonucleotides containing biotin labels near or off the ends of the hybridizing sequence were more effective probes than oligonucleotides containing internal biotin labels. An additive effect of increasing numbers of biotin-dUMP residues was found for some labeling configurations. PMID:3375076
Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing

PubMed Central

Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.

2015-01-01

The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety-four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed, high-quality reads were mapped to the E. grandis reference sequence, and single nucleotide variants (SNVs) and insertions/deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method for developing genetic markers for family-based QTL and association analysis in Eucalyptus. PMID:25602379
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

PubMed

Mizianty, Marcin J; Kurgan, Lukasz

2009-12-13

Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences. The proposed MODular Approach to Structural class prediction (MODAS) method is unique as it allows for selection of any subset of the classes. MODAS is also the first to utilize a novel, custom-built feature-based sequence representation that combines evolutionary profiles and predicted secondary structure. The features quantify information relevant to the definition of the classes including conservation of residues and arrangement and number of helix/strand segments. Our comprehensive design considers 8 feature selection methods and 4 classifiers to develop Support Vector Machine-based classifiers that are tailored for each of the seven classes. Tests on 5 twilight-zone and 1 high-similarity benchmark datasets and comparison with over two dozens of modern competing predictors show that MODAS provides the best overall accuracy that ranges between 80% and 96.7% (83.5% for the twilight-zone datasets), depending on the dataset. This translates into 19% and 8% error rate reduction when compared against the best performing competing method on two largest datasets. The proposed predictor provides accurate predictions at 58% accuracy for membrane proteins class, which is not considered by majority of existing methods, in spite that this class accounts for only 2% of the data. Our predictive model is analyzed to demonstrate how and why the input features are associated with the corresponding classes. 
The improved predictions stem from the novel features that express collocation of the secondary structure segments in the protein sequence and that combine evolutionary and secondary structure information. Our work demonstrates that conservation and arrangement of the secondary structure segments predicted along the protein chain can successfully predict structural classes which are defined based on the spatial arrangement of the secondary structures. A web server is available at http://biomine.ece.ualberta.ca/MODAS/.
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences

PubMed Central

2009-01-01

Background: Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences. Results: The proposed MODular Approach to Structural class prediction (MODAS) method is unique as it allows for selection of any subset of the classes. MODAS is also the first to utilize a novel, custom-built feature-based sequence representation that combines evolutionary profiles and predicted secondary structure. The features quantify information relevant to the definition of the classes including conservation of residues and arrangement and number of helix/strand segments. Our comprehensive design considers 8 feature selection methods and 4 classifiers to develop Support Vector Machine-based classifiers that are tailored for each of the seven classes. Tests on 5 twilight-zone and 1 high-similarity benchmark datasets and comparison with over two dozens of modern competing predictors show that MODAS provides the best overall accuracy that ranges between 80% and 96.7% (83.5% for the twilight-zone datasets), depending on the dataset. This translates into 19% and 8% error rate reduction when compared against the best performing competing method on two largest datasets. The proposed predictor provides accurate predictions at 58% accuracy for membrane proteins class, which is not considered by majority of existing methods, in spite that this class accounts for only 2% of the data. Our predictive model is analyzed to demonstrate how and why the input features are associated with the corresponding classes.
Conclusions: The improved predictions stem from the novel features that express collocation of the secondary structure segments in the protein sequence and that combine evolutionary and secondary structure information. Our work demonstrates that conservation and arrangement of the secondary structure segments predicted along the protein chain can successfully predict structural classes which are defined based on the spatial arrangement of the secondary structures. A web server is available at http://biomine.ece.ualberta.ca/MODAS/. PMID:20003388
Molecular Evidence for Natural Hybridization between Cotoneaster dielsianus and C. glaucophyllus

PubMed Central

Li, Mingwan; Chen, Sufang; Zhou, Renchao; Fan, Qiang; Li, Feifei; Liao, Wenbo

2017-01-01

Hybridization accompanied by polyploidization and apomixis has been demonstrated as a driving force in the evolution and speciation of many plants. A good example to study the evolutionary process of hybridization associated with polyploidy and apomixis is the genus Cotoneaster (Rosaceae), which includes approximately 150 species, most of which are polyploid apomicts. In this study, we investigated all Cotoneaster taxa distributed in a small region of Malipo, Yunnan, China. Based on the morphological characteristics, four Cotoneaster taxa were identified and sampled: C. dielsianus, C. glaucophyllus, C. franchetii, and a putative hybrid. Flow cytometry analyses showed that C. glaucophyllus was diploid, while the other three taxa were tetraploid. A total of five low-copy nuclear genes and six chloroplast regions were sequenced to validate the status of the putative hybrid. Sequence analyses showed that C. dielsianus and C. glaucophyllus are distantly related and they could be well separated using a total of 50 fixed nucleotide substitutions and four fixed indels at the 11 investigated genes. All individuals of the putative hybrid harbored identical sequences: they showed chromatogram additivity for all fixed differences between C. dielsianus and C. glaucophyllus at the five nuclear genes, and were identical to C. glaucophyllus at the six chloroplast regions. Haplotype analysis revealed that C. dielsianus possessed nine haplotypes for the 11 genes, while C. glaucophyllus had ten, and there were no shared haplotypes between the two species. The putative hybrid harbored two haplotypes for each nuclear gene: one shared with C. dielsianus and the other with C. glaucophyllus. They possessed the same chloroplast haplotype as C. glaucophyllus. Our study provided convincing evidence for natural hybridization between C. dielsianus and C. glaucophyllus, and revealed that all hybrid individuals were derivatives of one initial F1 via apomixis. C. glaucophyllus served as the maternal parent at the initial hybridization event. We proposed that anthropological disturbance provided an opportunity for hybridization between C. dielsianus and C. glaucophyllus, and a tetraploid F1 successfully bred many identical progenies via apomixis. Under this situation, species integrity could be maintained for these Cotoneaster species, but attention should be paid to this new-born hybrid. PMID:28536587
Prediction of protein subcellular locations by GO-FunD-PseAA predictor.

PubMed

Chou, Kuo-Chen; Cai, Yu-Dong

2004-08-06

The localization of a protein in a cell is closely correlated with its biological function. With the explosion of protein sequences entering into DataBanks, it is highly desirable to develop an automated method that can quickly identify their subcellular location. This will expedite the annotation process, providing timely useful information for both basic research and industrial application. In view of this, a powerful predictor has been developed by hybridizing the gene ontology approach [Nat. Genet. 25 (2000) 25], functional domain composition approach [J. Biol. Chem. 277 (2002) 45765], and the pseudo-amino acid composition approach [Proteins Struct. Funct. Genet. 43 (2001) 246; Erratum: ibid. 44 (2001) 60]. As a showcase, the recently constructed dataset [Bioinformatics 19 (2003) 1656] was used for demonstration. The dataset contains 7589 proteins classified into 12 subcellular locations: chloroplast, cytoplasmic, cytoskeleton, endoplasmic reticulum, extracellular, Golgi apparatus, lysosomal, mitochondrial, nuclear, peroxisomal, plasma membrane, and vacuolar. The overall success rate of prediction obtained by the jackknife cross-validation was 92%. This is so far the highest success rate achieved on this dataset by following an objective and rigorous cross-validation procedure.
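The jackknife cross-validation named in this abstract is a leave-one-out procedure: each protein is held out in turn, the predictor is built from the remaining proteins, and the held-out protein is then classified. The Python sketch below illustrates only that evaluation loop. It assumes a toy nearest-neighbor rule as a stand-in for the GO-FunD-PseAA predictor, whose internals the abstract does not give, and the feature vectors, labels, and function names are hypothetical.

```python
def nearest_neighbor_label(query, training):
    """Stand-in classifier (an assumption, not the GO-FunD-PseAA method):
    return the label of the training sample closest to `query`."""
    closest = min(
        training,
        key=lambda item: sum((q - x) ** 2 for q, x in zip(query, item[0])),
    )
    return closest[1]


def jackknife_success_rate(samples, predict=nearest_neighbor_label):
    """Leave-one-out (jackknife) test over (features, label) pairs."""
    correct = 0
    for i, (features, label) in enumerate(samples):
        # Train on every sample except the one being tested.
        training = samples[:i] + samples[i + 1:]
        if predict(features, training) == label:
            correct += 1
    return correct / len(samples)


# Hypothetical two-feature samples with made-up location labels.
toy_samples = [
    ((0.0, 0.0), "nuclear"),
    ((0.1, 0.0), "nuclear"),
    ((1.0, 1.0), "cytoplasmic"),
    ((0.9, 1.0), "cytoplasmic"),
    ((0.4, 0.4), "cytoplasmic"),  # lies closer to the nuclear cluster
]
```

Because every sample is tested exactly once against a model that never saw it, the procedure has no random split to tune, which is presumably why the abstract calls it objective.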
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. 
Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2006-07-04

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2002-01-01

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Increased adolescent HIV testing with a hybrid mobile strategy in Uganda and Kenya.

PubMed

Kadede, Kevin; Ruel, Theodore; Kabami, Jane; Ssemmondo, Emmanuel; Sang, Norton; Kwarisiima, Dalsone; Bukusi, Elizabeth; Cohen, Craig R; Liegler, Teri; Clark, Tamara D; Charlebois, Edwin D; Petersen, Maya L; Kamya, Moses R; Havlir, Diane V; Chamie, Gabriel

2016-09-10

We sought to increase adolescent HIV testing across rural communities in east Africa and identify predictors of undiagnosed HIV. Hybrid mobile testing. We enumerated 116 326 adolescents (10-24 years) in 32 communities of Uganda and Kenya (NCT01864603): 98 694 (85%) reported stable (≥6 months of prior year) residence. In each community we performed hybrid testing: 2-week multidisease community health campaign that included HIV testing, followed by home-based testing of community health campaign nonparticipants. We measured adolescent HIV testing coverage and prevalence, and determined predictors of newly diagnosed HIV among HIV-infected adolescents using multivariable logistic regression. A total of 86 421 (88%) stable adolescents tested for HIV; coverage was 86, 90, and 88% in early (10-14), mid (15-17), and late (18-24) adolescents, respectively. Self-reported prior testing was 9, 26, and 55% in early, mid, and late adolescents tested, respectively. HIV prevalence among adolescents tested was 1.6 and 0.6% in Ugandan women and men, and 7.1 and 1.5% in Kenyan women and men, respectively. Prevalence increased in mid-adolescence for women and late adolescence for men. Among HIV-infected adolescents, 58% reported newly diagnosed HIV. In multivariate analysis of HIV-infected adolescents, predictors of newly diagnosed HIV included male sex [odds ratio (OR) = 1.97 (95% confidence interval (CI): 1.42-2.73)], Ugandan residence [OR = 2.63 (95% CI: 2.08-3.31)], and single status [OR = 1.62 (95% CI: 1.23-2.14) vs. married]. The SEARCH hybrid strategy tested 88% of stable adolescents for HIV, a substantial increase over the 28% reporting prior testing. The majority (57%) of HIV-infected adolescents were new diagnoses. Mobile HIV testing for adults should be leveraged to reach adolescents for HIV treatment and prevention.
Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.

PubMed

Ramos, Enrique; Levinson, Benjamin T; Chasnoff, Sara; Hughes, Andrew; Young, Andrew L; Thornton, Katherine; Li, Allie; Vallania, Francesco L M; Province, Michael; Druley, Todd E

2012-12-06

Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. 
These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.
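The abstract above specifies 7 base pair index sequences with a Hamming distance of ≥2, meaning any two indexes differ at two or more positions, so a single substitution error cannot turn one valid index into another. The Python sketch below checks that property for a candidate index set. The index sequences and function names are hypothetical illustrations, not the adapters used in the study.

```python
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("index sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def conflicting_pairs(indexes, min_distance=2):
    """Return index pairs that are closer than `min_distance`."""
    return [
        (a, b)
        for a, b in combinations(indexes, 2)
        if hamming(a, b) < min_distance
    ]


# Hypothetical 7-bp indexes, made up for this example.
candidate_indexes = ["ACGTACG", "ACGTTGG", "TTGTACG", "ACCAACG"]
```

An empty result from `conflicting_pairs` means the set satisfies the stated minimum distance; any returned pair would need one of its members redesigned.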
Comparative genomics of Lupinus angustifolius gene-rich regions: BAC library exploration, genetic mapping and cytogenetics

PubMed Central

2013-01-01

Background The narrow-leafed lupin, Lupinus angustifolius L., is a grain legume species with a relatively compact genome. The species has 2n = 40 chromosomes and its genome size is 960 Mbp/1C. During the last decade, L. angustifolius genomic studies have achieved several milestones, such as molecular-marker development, linkage maps, and bacterial artificial chromosome (BAC) libraries. Here, these resources were integratively used to identify and sequence two gene-rich regions (GRRs) of the genome. Results The genome was screened with a probe representing the sequence of a microsatellite fragment length polymorphism (MFLP) marker linked to Phomopsis stem blight resistance. BAC clones selected by hybridization were subjected to restriction fingerprinting and contig assembly, and 232 BAC-ends were sequenced and annotated. BAC fluorescence in situ hybridization (BAC-FISH) identified eight single-locus clones. Based on physical mapping, cytogenetic localization, and BAC-end annotation, five clones were chosen for sequencing. Within the sequences of clones that hybridized in FISH to a single-locus, two large GRRs were identified. The GRRs showed strong and conserved synteny to Glycine max duplicated genome regions, illustrated by both identical gene order and parallel orientation. In contrast, in the clones with dispersed FISH signals, more than one-third of sequences were transposable elements. Sequenced, single-locus clones were used to develop 12 genetic markers, increasing the number of L. angustifolius chromosomes linked to appropriate linkage groups by five pairs. Conclusions In general, probes originating from MFLP sequences can assist genome screening and gene discovery. However, such probes are not useful for positional cloning, because they tend to hybridize to numerous loci. GRRs identified in L. angustifolius contained a low number of interspersed repeats and had a high level of synteny to the genome of the model legume G. max. 
Our results showed that not only was the gene nucleotide sequence conserved between soybean and lupin GRRs, but the order and orientation of particular genes in syntenic blocks was homologous, as well. These findings will be valuable to the forthcoming sequencing of the lupin genome. PMID:23379841
Biochip-Based Detection of KRAS Mutation in Non-Small Cell Lung Cancer

PubMed Central

Kriegshäuser, Gernot; Fabjani, Gerhild; Ziegler, Barbara; Zöchbauer-Müller, Sabine; End, Adelheid; Zeillinger, Robert

2011-01-01

This study is aimed at evaluating the potential of a biochip assay to sensitively detect KRAS mutation in DNA from non-small cell lung cancer (NSCLC) tissue samples. The assay covers 10 mutations in codons 12 and 13 of the KRAS gene, and is based on mutant-enriched PCR followed by reverse-hybridization of biotinylated amplification products to an array of sequence-specific probes immobilized on the tip of a rectangular plastic stick (biochip). Biochip hybridization identified 17 (21%) samples to carry a KRAS mutation of which 16 (33%) were adenocarcinomas and 1 (3%) was a squamous cell carcinoma. All mutations were confirmed by DNA sequencing. Using 10 ng of starting DNA, the biochip assay demonstrated a detection limit of 1% mutant sequence in a background of wild-type DNA. Our results suggest that the biochip assay is a sensitive alternative to protocols currently in use for KRAS mutation testing on limited quantity samples. PMID:22272089
Optimizing the specificity of nucleic acid hybridization.

PubMed

Zhang, David Yu; Chen, Sherry Xi; Yin, Peng

2012-01-22

The specific hybridization of complementary sequences is an essential property of nucleic acids, enabling diverse biological and biotechnological reactions and functions. However, the specificity of nucleic acid hybridization is compromised for long strands, except near the melting temperature. Here, we analytically derived the thermodynamic properties of a hybridization probe that would enable near-optimal single-base discrimination and perform robustly across diverse temperature, salt and concentration conditions. We rationally designed 'toehold exchange' probes that approximate these properties, and comprehensively tested them against five different DNA targets and 55 spurious analogues with energetically representative single-base changes (replacements, deletions and insertions). These probes produced discrimination factors between 3 and 100+ (median, 26). Without retuning, our probes function robustly from 10 °C to 37 °C, from 1 mM Mg²⁺ to 47 mM Mg²⁺, and with nucleic acid concentrations from 1 nM to 5 µM. Experiments with RNA also showed effective single-base change discrimination.
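A rough way to see how a probe's thermodynamics translate into a discrimination factor is a two-state exchange model, sketched below in Python. This is an illustrative simplification and not the derivation from the paper: it assumes the reaction target + probe complex ⇌ target-bound complex + released protector strand, equal initial concentrations of target and probe complex, and no protector present at the start. The function names, the default temperature, and the free energy values a caller would pass in are all assumptions.

```python
import math

GAS_CONSTANT = 1.987e-3  # kcal/(mol*K)


def hybridization_yield(delta_g, temperature_k=298.15):
    """Equilibrium yield of X + PC <=> XC + P under the stated assumptions.

    With equal starting concentrations c of X and PC and no P or XC,
    K = x^2 / (c - x)^2, so the yield x / c = sqrt(K) / (1 + sqrt(K)).
    The concentration c cancels out of the result.
    """
    equilibrium_constant = math.exp(-delta_g / (GAS_CONSTANT * temperature_k))
    root = math.sqrt(equilibrium_constant)
    return root / (1.0 + root)


def discrimination_factor(delta_delta_g, delta_g_correct=0.0, temperature_k=298.15):
    """Ratio of yields for the correct target and a spurious target.

    `delta_delta_g` is the free energy penalty (kcal/mol) that the
    single-base change adds to the reaction with the spurious target.
    """
    correct = hybridization_yield(delta_g_correct, temperature_k)
    spurious = hybridization_yield(delta_g_correct + delta_delta_g, temperature_k)
    return correct / spurious
```

In this simplified model the yield does not depend on concentration, and a reaction free energy near zero gives 50% yield for the correct target while a mismatch penalty pushes the spurious yield down. That is in the spirit of the robustness the abstract describes, although the published analysis is more detailed.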
Estimation of relative effectiveness of phylogenetic programs by machine learning.

PubMed

Krivozubov, Mikhail; Goebels, Florian; Spirin, Sergei

2014-04-01

Reconstruction of phylogeny of a protein family from a sequence alignment can produce results of different quality. Our goal is to predict the quality of phylogeny reconstruction based on features that can be extracted from the input alignment. We used the Fitch-Margoliash (FM) method of phylogeny reconstruction and random forest as a predictor. For training and testing the predictor, alignments of orthologous series (OS) were used, for which the result of phylogeny reconstruction can be evaluated by comparison with trees of corresponding organisms. Our results show that the quality of phylogeny reconstruction can be predicted with more than 80% precision. Also, we tried to predict which phylogeny reconstruction method, FM or UPGMA, is better for a particular alignment. With the set of features used, among alignments for which the obtained predictor predicts a better performance of UPGMA, 56% really give a better result with UPGMA. Taking into account that UPGMA performs better for only 34% of the alignments in our testing set, this result shows that it is possible in principle to predict the better phylogeny reconstruction method based on features of a sequence alignment.
Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

PubMed

Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

2017-11-01

The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes.

PubMed

Chou, Kuo-Chen

2005-01-01

With protein sequences entering into databanks at an explosive pace, the early determination of the family or subfamily class for a newly found enzyme molecule becomes important because this is directly related to the detailed information about which specific target it acts on, as well as to its catalytic process and biological function. Unfortunately, it is both time-consuming and costly to do so by experiments alone. In a previous study, the covariant-discriminant algorithm was introduced to identify the 16 subfamily classes of oxidoreductases. Although the results were quite encouraging, the entire prediction process was based on the amino acid composition alone without including any sequence-order information. Therefore, it is worthy of further investigation. To incorporate the sequence-order effects into the predictor, the 'amphiphilic pseudo amino acid composition' is introduced to represent the statistical sample of a protein. The novel representation contains 20 + 2λ discrete numbers: the first 20 numbers are the components of the conventional amino acid composition; the next 2λ numbers are a set of correlation factors that reflect different hydrophobicity and hydrophilicity distribution patterns along a protein chain. Based on such a concept and formulation scheme, a new predictor is developed. It is shown by the self-consistency test, jackknife test and independent dataset tests that the success rates obtained by the new predictor are all significantly higher than those by the previous predictors. The significant enhancement in success rates also implies that the distribution of hydrophobicity and hydrophilicity of the amino acid residues along a protein chain plays a very important role in its structure and function.
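The 20 + 2λ representation described above can be sketched in a few lines of Python. This is a minimal sketch based on the amphiphilic pseudo amino acid composition as it is commonly described, not a reproduction of the author's implementation. The hydrophobicity and hydrophilicity scales are supplied by the caller (no specific published scale is assumed here), and the default values of `lam` and `weight` are arbitrary placeholders.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def standardize(scale):
    """Rescale a per-residue property to zero mean and unit variance
    over the 20 standard amino acids."""
    values = [scale[a] for a in AMINO_ACIDS]
    mean = sum(values) / len(values)
    std = (sum((v - mean) ** 2 for v in values) / len(values)) ** 0.5
    return {a: (scale[a] - mean) / std for a in AMINO_ACIDS}


def amphiphilic_pseaac(seq, hydrophobicity, hydrophilicity, lam=2, weight=0.05):
    """Return a vector of 20 + 2*lam numbers for a protein sequence."""
    n = len(seq)
    if lam >= n:
        raise ValueError("lam must be smaller than the sequence length")
    if any(residue not in AMINO_ACIDS for residue in seq):
        raise ValueError("sequence contains a non-standard residue")
    h1 = standardize(hydrophobicity)
    h2 = standardize(hydrophilicity)

    # First 20 components: conventional amino acid composition.
    counts = Counter(seq)
    freqs = [counts[a] / n for a in AMINO_ACIDS]

    # Next 2*lam components: sequence-order correlation factors, one
    # hydrophobicity and one hydrophilicity term for each lag k.
    taus = []
    for k in range(1, lam + 1):
        for h in (h1, h2):
            products = (h[seq[i]] * h[seq[i + k]] for i in range(n - k))
            taus.append(sum(products) / (n - k))

    denominator = sum(freqs) + weight * sum(taus)
    return [f / denominator for f in freqs] + [weight * t / denominator for t in taus]
```

The shared denominator makes the components sum to one, so the composition part and the correlation part stay on a comparable scale; the weight controls how much the sequence-order terms contribute.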
Universal fingerprinting chip server.

PubMed

Casique-Almazán, Janet; Larios-Serrato, Violeta; Olguín-Ruíz, Gabriela Edith; Sánchez-Vallejo, Carlos Javier; Maldonado-Rodríguez, Rogelio; Méndez-Tenorio, Alfonso

2012-01-01

The Virtual Hybridization approach predicts the most probable hybridization sites across a target nucleic acid of known sequence, including both perfect and mismatched pairings. Potential hybridization sites, having a user-defined minimum number of bases that are paired with the oligonucleotide probe, are first identified. Free energy values are then evaluated for each potential hybridization site, and a site whose calculated free energy is at least as negative as a user-defined free energy cut-off value is considered a site of high probability of hybridization. The Universal Fingerprinting Chip Applications Server contains the software for visualizing predicted hybridization patterns, which yields a simulated hybridization fingerprint that can be compared with experimentally derived fingerprints or with a virtual fingerprint arising from a different sample. The database is available for free at http://bioinformatica.homelinux.org/UFCVH/
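The two-stage filter described in this abstract (a minimum number of paired bases, then a free energy cut-off) can be sketched as follows. This is a minimal sketch under stated assumptions: the per-base energies in `TOY_PAIR_ENERGY` are invented placeholder values in arbitrary units, not the thermodynamic model used by the server, and the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Placeholder stabilities per paired target base (arbitrary units, assumed).
TOY_PAIR_ENERGY = {"A": -2.0, "T": -2.0, "G": -3.0, "C": -3.0}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def virtual_hybridization(target, probe, min_paired, energy_cutoff):
    """Return (start, paired_bases, energy) for each high-probability site."""
    # A probe pairs with the target stretch equal to its reverse complement.
    ideal_site = reverse_complement(probe)
    hits = []
    for start in range(len(target) - len(probe) + 1):
        window = target[start:start + len(probe)]
        paired = [b for b, s in zip(window, ideal_site) if b == s]
        if len(paired) < min_paired:
            continue  # stage 1: too few paired bases
        energy = sum(TOY_PAIR_ENERGY[b] for b in paired)
        if energy <= energy_cutoff:  # stage 2: at least as negative as cut-off
            hits.append((start, len(paired), energy))
    return hits


hits = virtual_hybridization("TTACGTACGGTT", "CCGTACGT",
                             min_paired=6, energy_cutoff=-15.0)
print(hits)
```

A realistic implementation would replace the per-base sum with a nearest-neighbor thermodynamic model and would penalize mismatches according to their position and type.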
Chip-based sequencing nucleic acids

DOEpatents

Beer, Neil Reginald

2014-08-26

A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
Detection and identification of Theileria infection in sika deer (Cervus nippon) in China.

PubMed

He, Lan; Khan, Muhanmad Kasib; Zhang, Wen-Jie; Zhang, Qing-Li; Zhou, Yan-Qin; Hu, Min; Zhao, Junlong

2012-06-01

The sika deer (Cervus nippon) is a first-grade state-protected animal in China and designated a threatened species by the World Conservation Union. To detect hemoparasite infection of sika deer, blood samples were collected from 24 animals in the Hubei Province Deer Center. Genomic DNA was extracted, and the V4 hypervariable region encoding 18S rRNA was analyzed by reverse line blot hybridization assay. PCR products hybridized with Babesia/Theileria genus-specific probes but failed to hybridize with any of the Babesia or Theileria species-specific probes, suggesting the presence of a novel, or variant, species. Here 18S rRNA and internal transcribed spacer (ITS) genes were amplified, cloned, and sequenced from 7 isolates. Alignment and BlastN of the cloned sequences revealed high similarities to the homologous 18S rRNA genes and ITS genes of Theileria cervi (AY735122), Theileria sp. CNY1A (AB012194), and Theileria sp. ex Yamaguchi (AF529272). Phylogenetic analysis based on the 18S rRNA gene and ITS sequences showed that all cloned sequences were grouped within the Theileria clade. Phylogeny based on the 18S rRNA gene divided the organisms into 2 groups. Group 1 was closest to Theileria sp. ex Yamaguchi (AF529272), and group 2 was distinct from all other identified Theileria and Babesia species. These results suggest the existence of Theileria sp. infection in sika deer in China. To our knowledge, this is the first report of cervine Theileria sp. in China.
iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties

PubMed Central

Feng, Peng-Mian; Ding, Chen; Zuo, Yong-Chun; Chou, Kuo-Chen

2012-01-01

Nucleosome positioning has important roles in key cellular processes. Although intensive efforts have been made in this area, the rules defining nucleosome positioning are still elusive and debated. In this study, we carried out a systematic comparison among the profiles of twelve DNA physicochemical features between the nucleosomal and linker sequences in the Saccharomyces cerevisiae genome. We found that nucleosomal sequences have some position-specific physicochemical features, which can be used for in-depth study of nucleosomes. Meanwhile, a new predictor, called iNuc-PhysChem, was developed for identification of nucleosomal sequences by incorporating these physicochemical properties into a 1788-D (dimensional) feature vector, which was further reduced to an 884-D vector via the IFS (incremental feature selection) procedure to optimize the feature set. It was observed by a cross-validation test on a benchmark dataset that the overall success rate achieved by iNuc-PhysChem was over 96% in identifying nucleosomal or linker sequences. As a web-server, iNuc-PhysChem is freely accessible to the public at http://lin.uestc.edu.cn/server/iNuc-PhysChem. For the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented just for the integrity in developing the predictor. Meanwhile, for those who prefer to run predictions on their own computers, the predictor's code can be easily downloaded from the web-server. It is anticipated that iNuc-PhysChem may become a useful high throughput tool for both basic research and drug design. PMID:23144709
SHORT-TERM SOLAR FLARE PREDICTION USING MULTIRESOLUTION PREDICTORS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yu Daren; Huang Xin; Hu Qinghua

2010-01-20

Multiresolution predictors of solar flares are constructed by a wavelet transform and sequential feature extraction method. Three predictors (the maximum horizontal gradient, the length of neutral line, and the number of singular points) are extracted from Solar and Heliospheric Observatory/Michelson Doppler Imager longitudinal magnetograms. A maximal overlap discrete wavelet transform is used to decompose the sequence of predictors into four frequency bands. In each band, four sequential features (the maximum, the mean, the standard deviation, and the root mean square) are extracted. The multiresolution predictors in the low-frequency band reflect trends in the evolution of newly emerging fluxes. The multiresolution predictors in the high-frequency band reflect the changing rates in emerging flux regions. The variation of emerging fluxes is decoupled by wavelet transform in different frequency bands. The information amount of these multiresolution predictors is evaluated by the information gain ratio. It is found that the multiresolution predictors in the lowest and highest frequency bands contain the most information. Based on these predictors, a C4.5 decision tree algorithm is used to build the short-term solar flare prediction model. It is found that the performance of the short-term solar flare prediction model based on the multiresolution predictors is greatly improved.
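The per-band feature extraction step named in this abstract (maximum, mean, standard deviation, and root mean square of each frequency band) can be sketched as below. The wavelet decomposition itself is not shown; the `bands` dictionary holds made-up numbers standing in for already-decomposed predictor sequences, and the use of the population (rather than sample) standard deviation is an assumption.

```python
import math


def sequential_features(band):
    """Return the four sequential features of one frequency band."""
    n = len(band)
    mean = sum(band) / n
    # Population standard deviation (dividing by n) is assumed here.
    std = math.sqrt(sum((x - mean) ** 2 for x in band) / n)
    rms = math.sqrt(sum(x * x for x in band) / n)
    return {"max": max(band), "mean": mean, "std": std, "rms": rms}


# Hypothetical band coefficients; real ones would come from the wavelet step.
bands = {
    "low": [1.0, 2.0, 3.0, 4.0],
    "high": [0.5, -0.5, 0.5, -0.5],
}
features = {name: sequential_features(values) for name, values in bands.items()}
print(features["low"])
```

With three predictors, four bands, and four features per band, a scheme like this would give 48 candidate inputs before any ranking by information gain ratio.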
A hybrid training approach for leaf area index estimation via Cubist and random forests machine-learning

NASA Astrophysics Data System (ADS)

Houborg, Rasmus; McCabe, Matthew F.

2018-01-01

With an increasing volume and dimensionality of Earth observation data, enhanced integration of machine-learning methodologies is needed to effectively analyze and utilize these information rich datasets. In machine-learning, a training dataset is required to establish explicit associations between a suite of explanatory 'predictor' variables and the target property. The specifics of this learning process can significantly influence model validity and portability, with a higher generalization level expected with an increasing number of observable conditions being reflected in the training dataset. Here we propose a hybrid training approach for leaf area index (LAI) estimation, which harnesses synergistic attributes of scattered in-situ measurements and systematically distributed physically based model inversion results to enhance the information content and spatial representativeness of the training data. To do this, a complementary training dataset of independent LAI was derived from a regularized model inversion of RapidEye surface reflectances and subsequently used to guide the development of LAI regression models via Cubist and random forests (RF) decision tree methods. The application of the hybrid training approach to a broad set of Landsat 8 vegetation index (VI) predictor variables resulted in significantly improved LAI prediction accuracies and spatial consistencies, relative to results relying on in-situ measurements alone for model training. In comparing the prediction capacity and portability of the two machine-learning algorithms, a pair of relatively simple multi-variate regression models established by Cubist performed best, with an overall relative mean absolute deviation (rMAD) of ∼11%, determined based on a stringent scene-specific cross-validation approach.
In comparison, the portability of RF regression models was less effective (i.e., an overall rMAD of ∼15%), which was attributed partly to model saturation at high LAI in association with inherent extrapolation and transferability limitations. Explanatory VIs formed from bands in the near-infrared (NIR) and shortwave infrared domains (e.g., NDWI) were associated with the highest predictive ability, whereas Cubist models relying entirely on VIs based on NIR and red band combinations (e.g., NDVI) were associated with comparatively high uncertainties (i.e., rMAD ∼ 21%). The most transferable and best performing models were based on combinations of several predictor variables, which included both NDWI- and NDVI-like variables. In this process, prior screening of input VIs based on an assessment of variable relevance served as an effective mechanism for optimizing prediction accuracies from both Cubist and RF. While this study demonstrated benefit in combining data mining operations with physically based constraints via a hybrid training approach, the concept of transferability and portability warrants further investigations in order to realize the full potential of emerging machine-learning techniques for regression purposes.
Control algorithms for aerobraking in the Martian atmosphere

NASA Technical Reports Server (NTRS)

Ward, Donald T.; Shipley, Buford W., Jr.

1991-01-01

The Analytic Predictor Corrector (APC) and Energy Controller (EC) atmospheric guidance concepts were adapted to control an interplanetary vehicle aerobraking in the Martian atmosphere. Changes are made to the APC to improve its robustness to density variations. These changes include adaptation of a new exit phase algorithm, an adaptive transition velocity to initiate the exit phase, refinement of the reference dynamic pressure calculation and two improved density estimation techniques. The modified controller with the hybrid density estimation technique is called the Mars Hybrid Predictor Corrector (MHPC), while the modified controller with a polynomial density estimator is called the Mars Predictor Corrector (MPC). A Lyapunov Steepest Descent Controller (LSDC) is adapted to control the vehicle. The LSDC lacked robustness, so a Lyapunov tracking exit phase algorithm is developed to guide the vehicle along a reference trajectory. This algorithm, when using the hybrid density estimation technique to define the reference path, is called the Lyapunov Hybrid Tracking Controller (LHTC). With the polynomial density estimator used to define the reference trajectory, the algorithm is called the Lyapunov Tracking Controller (LTC). These four new controllers are tested using a six degree of freedom computer simulation to evaluate their robustness. The MHPC, MPC, LHTC, and LTC show dramatic improvements in robustness over the APC and EC.
A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

PubMed

Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

2016-09-02

Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal, in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, along with the number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals.
These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genetic origin and composition of a natural hybrid poplar Populus × jrtyschensis from two distantly related species.

PubMed

Jiang, Dechun; Feng, Jianju; Dong, Miao; Wu, Guili; Mao, Kangshan; Liu, Jianquan

2016-04-18

The factors that contribute to and maintain hybrid zones between distinct species are highly variable, depending on hybrid origins, frequencies and fitness. In this study, we aimed to examine genetic origins, compositions and possible maintenance of Populus × jrtyschensis, an assumed natural hybrid between two distantly related species. This hybrid poplar occurs mainly on the floodplains along the river valleys between the overlapping distributions of the two putative parents. We collected 566 individuals from 45 typical populations of P. × jrtyschensis, P. nigra and P. laurifolia. We genotyped them based on the sequence variations of one maternally inherited chloroplast DNA (cpDNA) fragment and genetic polymorphisms at 20 SSR loci. We further sequenced eight nuclear genes for 168 individuals from 31 populations. Two groups of cpDNA haplotypes characteristic of P. nigra and P. laurifolia respectively were both recovered for P. × jrtyschensis. Genetic structures and coalescent tests of two sets of nuclear population genetic data suggested that P. × jrtyschensis originated from hybridizations between the two assumed parental species. All examined populations of P. × jrtyschensis comprise mainly F1 hybrids from interspecific hybridizations between P. nigra and P. laurifolia. In the habitats of P. × jrtyschensis, there are lower concentrations of soil nitrogen than in the habitats occupied by the other two species. Our extensive examination of the genetic composition of P. × jrtyschensis suggested that it is typical of F1-dominated hybrid zones. This finding plus the low concentration of soil nitrogen in the floodplain soils support the F1-dominated bounded hybrid superiority hypothesis of hybrid zone maintenance for this particular hybrid poplar.
Prediction of protein-protein interactions based on PseAA composition and hybrid feature selection.

PubMed

Liu, Liang; Cai, Yudong; Lu, Wencong; Feng, Kaiyan; Peng, Chunrong; Niu, Bing

2009-03-06

Based on pseudo amino acid (PseAA) composition and a novel hybrid feature selection frame, this paper presents a computational system to predict the PPIs (protein-protein interactions) using 8796 protein pairs. These pairs are coded by PseAA composition, resulting in 114 features. A hybrid feature selection system, mRMR-KNNs-wrapper, is applied to obtain an optimized feature set by excluding poorly performing and/or redundant features, resulting in 103 remaining features. Using the optimized 103-feature subset, a prediction model is trained and tested in the k-nearest neighbors (KNNs) learning system. This prediction model achieves an overall prediction accuracy of 76.18%, evaluated by a 10-fold cross-validation test, which is 1.46% higher than using the initial 114 features and 6.51% higher than using the 20 features coded by amino acid composition. The PPIs predictor, developed for this research, is available for public use at http://chemdata.shu.edu.cn/ppi.
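The evaluation step mentioned in this abstract, a k-nearest neighbors classifier scored by 10-fold cross-validation, might look roughly like the sketch below. It covers only that step: the mRMR-based feature selection is not reproduced, the fold assignment (every tenth sample) and k = 3 are assumptions, and the two-cluster dataset is synthetic.

```python
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to `query`."""
    order = sorted(
        range(len(train_x)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
    )
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def cross_validated_accuracy(xs, ys, k=3, folds=10):
    """Hold out every `folds`-th sample in turn and score the predictions."""
    n = len(xs)
    correct = 0
    for fold in range(folds):
        held_out = set(range(fold, n, folds))
        train_x = [xs[i] for i in range(n) if i not in held_out]
        train_y = [ys[i] for i in range(n) if i not in held_out]
        for i in held_out:
            correct += knn_predict(train_x, train_y, xs[i], k) == ys[i]
    return correct / n


# Synthetic, well-separated two-class data (not protein-pair features).
xs = [(0.1 * i, 0.0) for i in range(20)] + [(5 + 0.1 * i, 5.0) for i in range(20)]
ys = [0] * 20 + [1] * 20
accuracy = cross_validated_accuracy(xs, ys, k=3, folds=10)
print(accuracy)
```

A wrapper-style feature selector would call a function like `cross_validated_accuracy` repeatedly, once per candidate feature subset, and keep the subset with the best score.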
Examining Potential Predictors for Completion of the Gardasil Vaccine Sequence Based on Data Gathered at Clinics of Johns Hopkins Medical Institutions

ERIC Educational Resources Information Center

Barat, Christopher E.; Wright, Courtney; Chou, Betty

2011-01-01

This paper presents categorical data that were gathered at two urban clinics and two suburban clinics of Johns Hopkins in an effort to identify characteristics of young female patients who successfully complete the three-injection sequence of the Gardasil quadrivalent human papillomavirus vaccine (HPV4). Available categorical correlates included…

An electrochemical genosensor for Salmonella typhi on gold nanoparticles-mercaptosilane modified screen printed electrode.

PubMed

Das, Ritu; Sharma, Mukesh K; Rao, Vepa K; Bhattacharya, B K; Garg, Iti; Venkatesh, V; Upadhyay, Sanjay

2014-10-20

In this work, we fabricated a system comprising an integrated self-assembled layer of the organosilane 3-mercaptopropyltrimethoxy silane (MPTS) on a screen printed electrode (SPE) and electrochemically deposited gold nanoparticles for Salmonella typhi detection, employing the Vi gene as a molecular marker. A thiolated DNA probe was immobilized on a gold nanoparticle (AuNP) modified SPE for DNA hybridization assay using methylene blue as the redox (electroactive) hybridization indicator, and the signal was monitored by differential pulse voltammetry (DPV). The modified SPE was characterized by cyclic voltammetry (CV), electrochemical impedance spectroscopy (EIS), and atomic force microscopy (AFM). The DNA biosensor showed excellent performance with high sensitivity and good selectivity. The current response was linear with target sequence concentrations ranging from 1.0 × 10^-11 to 0.5 × 10^-8 M, and the detection limit was found to be 50 (± 2.1) pM. The DNA biosensor showed good discrimination ability for one-base, two-base and three-base mismatched sequences. The fabricated genosensor could also be regenerated easily and reused three to four times for further hybridization studies. Copyright © 2014 Elsevier B.V. All rights reserved.
Development of chemiluminescent probe hybridization, RT-PCR and nucleic acid cycle sequencing assays of Sabin type 3 isolates to identify base pair 472 Sabin type 3 mutants associated with vaccine associated paralytic poliomyelitis.

PubMed

Old, M O; Logan, L H; Maldonado, Y A

1997-11-01

Sabin type 3 polio vaccine virus is the most common cause of poliovaccine associated paralytic poliomyelitis. Vaccine associated paralytic poliomyelitis cases have been associated with Sabin type 3 revertants containing a single U to C substitution at bp 472 of Sabin type 3. A rapid method of identification of Sabin type 3 bp 472 mutants is described. An enterovirus group-specific probe for use in a chemiluminescent dot blot hybridization assay was developed to identify enterovirus positive viral lysates. A reverse transcription-polymerase chain reaction (RT-PCR) assay producing a 319 bp PCR product containing the Sabin type 3 bp 472 mutation site was then employed to identify Sabin type 3 isolates. Chemiluminescent nucleic acid cycle sequencing of the purified 319 bp PCR product was then employed to identify nucleic acid sequences at bp 472. The enterovirus group probe hybridization procedure and isolation of the Sabin type 3 PCR product were highly sensitive and specific; nucleic acid cycle sequencing corresponded to the known sequence of stock Sabin type 3 isolates. These methods will be used to identify the Sabin type 3 reversion rate from sequential stool samples of infants obtained after the first and second doses of oral poliovirus vaccine.
Method for nucleic acid hybridization using single-stranded DNA binding protein

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-01-01

Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500-fold (i.e., to a t1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min) in the absence of nucleotide triphosphates.
Application of hybrid clustering using parallel k-means algorithm and DIANA algorithm

NASA Astrophysics Data System (ADS)

Umam, Khoirul; Bustamam, Alhadi; Lestari, Dian

2017-03-01

DNA is one of the carriers of genetic information in living organisms. Encoding, sequencing, and clustering DNA sequences have become key, routine tasks in molecular biology, particularly in bioinformatics applications. There are two types of clustering: hierarchical clustering and partitioning clustering. In this paper, we combine the two types, K-Means (partitioning clustering) and DIANA (hierarchical clustering), and therefore call the approach hybrid clustering. Hybrid clustering using the Parallel K-Means algorithm and the DIANA algorithm is applied to cluster DNA sequences of Human Papillomavirus (HPV). The clustering process starts with collecting DNA sequences of HPV from NCBI (National Center for Biotechnology Information), followed by characteristics extraction from the DNA sequences. The extraction result is stored in matrix form, the matrix is normalized using Min-Max normalization, and genetic distance is calculated using Euclidean distance. The hybrid clustering is then applied using implementations of the Parallel K-Means algorithm and the DIANA algorithm. The aim of using hybrid clustering is to obtain better clustering results. To validate the resulting clusters and obtain the optimum number of clusters, we use the Davies-Bouldin Index (DBI). In this study, Parallel K-Means clustering groups the data into 5 clusters with a minimal DBI value of 0.8741, and hybrid clustering groups the data into 13 sub-clusters with minimal DBI values of 0.8216, 0.6845, 0.3331, 0.1994 and 0.3952. The DBI values of hybrid clustering are lower than the DBI value of Parallel K-Means clustering alone, which is performed only at the first stage. This means that hybrid clustering gives a better result for clustering HPV DNA sequences than Parallel K-Means clustering alone.
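Three of the building blocks named in this abstract (Min-Max normalization, Euclidean distance, and the Davies-Bouldin Index) can be sketched in a few lines. The K-Means and DIANA steps are not reproduced; the function names and the four-point dataset are illustrative assumptions, and the index is written in its usual form with mean distance to the centroid as the within-cluster scatter.

```python
import math


def min_max_normalize(rows):
    """Rescale every column of a feature matrix to the range [0, 1]."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def davies_bouldin(points, labels):
    """Davies-Bouldin Index; lower means tighter, better separated clusters.

    Requires at least two clusters with distinct centroids.
    """
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    centroids = {
        l: [sum(col) / len(ps) for col in zip(*ps)] for l, ps in clusters.items()
    }
    scatter = {
        l: sum(euclidean(p, centroids[l]) for p in ps) / len(ps)
        for l, ps in clusters.items()
    }
    total = 0.0
    for i in clusters:
        # Worst-case similarity of cluster i to any other cluster.
        total += max(
            (scatter[i] + scatter[j]) / euclidean(centroids[i], centroids[j])
            for j in clusters if j != i
        )
    return total / len(clusters)


raw = [[0.0, 0.0], [0.0, 2.0], [10.0, 0.0], [10.0, 2.0]]
labels = [0, 0, 1, 1]
normalized = min_max_normalize(raw)
print(davies_bouldin(raw, labels), davies_bouldin(normalized, labels))
```

The toy data also shows why normalization matters before distance-based validation: rescaling the columns changes the index for the same partition, so values are only comparable when computed on the same normalized matrix.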
Nanopore DNA sensors based on dendrimer-modified nanopipettes.

PubMed

Fu, Yaqin; Tokuhisa, Hideo; Baker, Lane A

2009-08-28

A dendrimer-modified nanopipette is used to detect hybridization of a specific DNA sequence through evaluation of the extent of rectification of ion currents observed in the measured current-voltage response.
Continuously tunable nucleic acid hybridization probes.

PubMed

Wu, Lucia R; Wang, Juexiao Sherry; Fang, John Z; Evans, Emily R; Pinto, Alessandro; Pekker, Irena; Boykin, Richard; Ngouenet, Celine; Webster, Philippa J; Beechem, Joseph; Zhang, David Yu

2015-12-01

In silico-designed nucleic acid probes and primers often do not achieve favorable specificity and sensitivity tradeoffs on the first try, and iterative empirical sequence-based optimization is needed, particularly in multiplexed assays. We present a novel, on-the-fly method of tuning probe affinity and selectivity by adjusting the stoichiometry of auxiliary species, which allows for independent and decoupled adjustment of the hybridization yield for different probes in multiplexed assays. Using this method, we achieved near-continuous tuning of probe effective free energy. To demonstrate our approach, we enforced uniform capture efficiency of 31 DNA molecules (GC content, 0-100%), maximized the signal difference for 11 pairs of single-nucleotide variants and performed tunable hybrid capture of mRNA from total RNA. Using the Nanostring nCounter platform, we applied stoichiometric tuning to simultaneously adjust yields for a 24-plex assay, and we show multiplexed quantitation of RNA sequences and variants from formalin-fixed, paraffin-embedded samples.
Thermodynamics of RNA structures by Wang–Landau sampling

PubMed Central

Lou, Feng; Clote, Peter

2010-01-01

Motivation: Thermodynamics-based dynamic programming RNA secondary structure algorithms have been of immense importance in molecular biology, where applications range from the detection of novel selenoproteins using expressed sequence tag (EST) data, to the determination of microRNA genes and their targets. Dynamic programming algorithms have been developed to compute the minimum free energy secondary structure and partition function of a given RNA sequence, the minimum free-energy and partition function for the hybridization of two RNA molecules, etc. However, the applicability of dynamic programming methods depends on disallowing certain types of interactions (pseudoknots, zig-zags, etc.), as their inclusion renders structure prediction a nondeterministic polynomial time (NP)-complete problem. Nevertheless, such interactions have been observed in X-ray structures. Results: A non-Boltzmannian Monte Carlo algorithm was designed by Wang and Landau to estimate the density of states for complex systems, such as the Ising model, that exhibit a phase transition. In this article, we apply the Wang-Landau (WL) method to compute the density of states for secondary structures of a given RNA sequence, and for hybridizations of two RNA sequences. Our method is shown to be much faster than existing software, such as RNAsubopt. From density of states, we compute the partition function over all secondary structures and over all pseudoknot-free hybridizations. The advantage of the WL method is that by adding a function to evaluate the free energy of arbitrary pseudoknotted structures and of arbitrary hybridizations, we can estimate thermodynamic parameters for situations known to be NP-complete. This extension to pseudoknots will be made in the sequel to this article; in contrast, the current article describes the WL algorithm applied to pseudoknot-free secondary structures and hybridizations.
Availability: The WL RNA hybridization web server is under construction at http://bioinformatics.bc.edu/clotelab/. Contact: clote@bc.edu PMID:20529917
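To make the Wang-Landau idea in this abstract concrete, the sketch below estimates a density of states for a deliberately simple toy system, not for RNA structures: n independent two-state sites whose "energy" is the number of sites switched on, so the exact density of states is the binomial coefficient C(n, E). The function name, the flatness threshold of 0.8, the batch size, and the stopping value of the modification factor are all illustrative assumptions.

```python
import math
import random


def wang_landau(n_sites=8, flatness=0.8, ln_f_final=1e-3, seed=0):
    """Estimate ln g(E) for E = 0..n_sites, normalised so that ln g(0) = 0."""
    rng = random.Random(seed)
    state = [0] * n_sites
    energy = 0
    ln_g = [0.0] * (n_sites + 1)
    ln_f = 1.0  # logarithm of the modification factor
    while ln_f > ln_f_final:
        hist = [0] * (n_sites + 1)
        while True:
            for _ in range(10000):
                site = rng.randrange(n_sites)
                new_energy = energy + (1 - 2 * state[site])
                # Accept the flip with probability min(1, g(E_old) / g(E_new)).
                diff = ln_g[energy] - ln_g[new_energy]
                if rng.random() < math.exp(min(0.0, diff)):
                    state[site] ^= 1
                    energy = new_energy
                # Penalise the current energy level whether or not we moved.
                ln_g[energy] += ln_f
                hist[energy] += 1
            if min(hist) >= flatness * sum(hist) / len(hist):
                break  # histogram is flat enough for this stage
        ln_f /= 2.0  # refine the modification factor
    return [value - ln_g[0] for value in ln_g]


estimate = wang_landau(n_sites=8, seed=0)
exact = [math.log(math.comb(8, e)) for e in range(9)]
for e in range(9):
    print(e, round(estimate[e], 2), round(exact[e], 2))
```

Once ln g(E) is available, a partition function at any temperature follows by summing g(E)·exp(-E/RT) over the energy levels, which is what makes a density-of-states estimate reusable across temperatures.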
ChIP-chip.

PubMed

Kim, Tae Hoon; Dekker, Job

2018-05-01

ChIP-chip can be used to analyze protein-DNA interactions in a region-wide and genome-wide manner. DNA microarrays contain PCR products or oligonucleotide probes that are designed to represent genomic sequences. Identification of genomic sites that interact with a specific protein is based on competitive hybridization of the ChIP-enriched DNA and the input DNA to DNA microarrays. The ChIP-chip protocol can be divided into two main sections: Amplification of ChIP DNA and hybridization of ChIP DNA to arrays. A large amount of DNA is required to hybridize to DNA arrays, and hybridization to a set of multiple commercial arrays that represent the entire human genome requires two rounds of PCR amplifications. The relative hybridization intensity of ChIP DNA and that of the input DNA is used to determine whether the probe sequence is a potential site of protein-DNA interaction. Resolution of actual genomic sites bound by the protein is dependent on the size of the chromatin and on the genomic distance between the probes on the array. As with expression profiling using gene chips, ChIP-chip experiments require multiple replicates for reliable statistical measure of protein-DNA interactions. © 2018 Cold Spring Harbor Laboratory Press.
Molecular cloning of cDNAs for the nerve-cell specific phosphoprotein, synapsin I.

PubMed Central

Kilimann, M W; DeGennaro, L J

1985-01-01

To provide access to synapsin I-specific DNA sequences, we have constructed cDNA clones complementary to synapsin I mRNA isolated from rat brain. Synapsin I mRNA was specifically enriched by immunoadsorption of polysomes prepared from the brains of 10-14 day old rats. Employing this enriched mRNA, a cDNA library was constructed in pBR322 and screened by differential colony hybridization with single-stranded cDNA probes made from synapsin I mRNA and total polysomal poly(A)+ RNA. This screening procedure proved to be highly selective. Five independent recombinant plasmids which exhibited distinctly stronger hybridization with the synapsin I probe were characterized further by restriction mapping. All of the cDNA inserts gave restriction enzyme digestion patterns which could be aligned. In addition, some of the cDNA inserts were shown to contain poly(dA) sequences. Final identification of synapsin I cDNA clones relied on the ability of the cDNA inserts to hybridize specifically to synapsin I mRNA. Several plasmids were tested by positive hybridization selection. They specifically selected synapsin I mRNA which was identified by in vitro translation and immunoprecipitation of the translation products. The established cDNA clones were used for a blot-hybridization analysis of synapsin I mRNA. A fragment (1600 bases) from the longest cDNA clone hybridized with two discrete RNA species 5800 and 4500 bases long, in polyadenylated RNA from rat brain and PC12 cells. No hybridization was detected to RNA from rat liver, skeletal muscle or cardiac muscle. PMID:3933975
Hybrid spread spectrum radio system

DOEpatents

Smith, Stephen F [London, TN]; Dress, William B [Camas, WA]

2010-02-09

Systems and methods are described for hybrid spread spectrum radio systems. A method includes receiving a hybrid spread spectrum signal, including fast frequency hopping demodulating and direct sequence demodulating a direct sequence spread spectrum signal, wherein multiple frequency hops occur within a single data-bit time and each bit is represented by chip transmissions at multiple frequencies.
Hybrid Capture-Based Tumor Sequencing and Copy Number Analysis to Confirm Origin of Metachronous Metastases in BRCA1-Mutant Cholangiocarcinoma Harboring a Novel YWHAZ-BRAF Fusion.

PubMed

Lim, Huat C; Montesion, Meagan; Botton, Thomas; Collisson, Eric A; Umetsu, Sarah E; Behr, Spencer C; Gordan, John D; Stephens, Phil J; Kelley, Robin K

2018-04-05

Biliary tract cancers such as cholangiocarcinoma represent a heterogeneous group of cancers that can be difficult to diagnose. Recent comprehensive genomic analyses in large cholangiocarcinoma cohorts have defined important molecular subgroups within cholangiocarcinoma that may relate to anatomic location and etiology [1-4] and may predict responsiveness to targeted therapies in development [5-7]. These emerging data highlight the potential for tumor genomics to inform diagnosis and treatment options in this challenging tumor type. We report the case of a patient with a germline BRCA1 mutation who presented with a cholangiocarcinoma driven by the novel YWHAZ-BRAF fusion. Hybrid capture-based DNA sequencing and copy number analysis performed as part of clinical care demonstrated that two later-occurring tumors were clonally derived from the primary cholangiocarcinoma rather than distinct new primaries, revealing an unusual pattern of late metachronous metastasis. We discuss the clinical significance of these genetic alterations and their relevance to therapeutic strategies. Hybrid capture-based next-generation DNA sequencing assays can provide diagnostic clarity in patients with unusual patterns of metastasis and recurrence in which the pathologic diagnosis is ambiguous. To our knowledge, this is the first reported case of a YWHAZ-BRAF fusion in pancreaticobiliary cancer, and a very rare case of cholangiocarcinoma in the setting of a germline BRCA1 mutation. The patient's BRCA1 mutation and YWHAZ-BRAF fusion constitute potential targets for future therapy. © AlphaMed Press 2018.
Independent assessment and improvement of wheat genome sequence assemblies using Fosill jumping libraries.

PubMed

Lu, Fu-Hao; McKenzie, Neil; Kettleborough, George; Heavens, Darren; Clark, Matthew D; Bevan, Michael W

2018-05-01

The accurate sequencing and assembly of very large, often polyploid, genomes remains a challenging task, limiting long-range sequence information and phased sequence variation for applications such as plant breeding. The 15-Gb hexaploid bread wheat (Triticum aestivum) genome has been particularly challenging to sequence, and several different approaches have recently generated long-range assemblies. Mapping and understanding the types of assembly errors are important for optimising future sequencing and assembly approaches and for comparative genomics. Here we use a Fosill 38-kb jumping library to assess medium and longer-range order of different publicly available wheat genome assemblies. Modifications to the Fosill protocol generated longer Illumina sequences and enabled comprehensive genome coverage. Analyses of two independent Bacterial Artificial Chromosome (BAC)-based chromosome-scale assemblies, two independent Illumina whole genome shotgun assemblies, and a hybrid Single Molecule Real Time (SMRT-PacBio) and short read (Illumina) assembly were carried out. We revealed a surprising scale and variety of discrepancies using Fosill mate-pair mapping and validated several of each class. In addition, Fosill mate-pairs were used to scaffold a whole genome Illumina assembly, leading to a 3-fold increase in N50 values. Our analyses, using an independent means to validate different wheat genome assemblies, show that whole genome shotgun assemblies based solely on Illumina sequences are significantly more accurate by all measures compared to BAC-based chromosome-scale assemblies and hybrid SMRT-Illumina approaches. Although current whole genome assemblies are reasonably accurate and useful, additional improvements will be needed to generate complete assemblies of wheat genomes using open-source, computationally efficient, and cost-effective methods.
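The N50 statistic mentioned in this abstract is the length of the shortest contig or scaffold in the smallest set of longest sequences that together cover at least half of the assembly. A minimal sketch follows; the lengths are made-up toy numbers, not values from the wheat assemblies.

```python
def n50(lengths):
    """Return the N50 of a list of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must not be empty")


# Hypothetical toy assembly before and after scaffolding (same total length).
contigs = [2, 3, 4, 5, 6]
scaffolds = [9, 11]
n50_before = n50(contigs)
n50_after = n50(scaffolds)
```

Joining contigs into longer scaffolds leaves the total length unchanged but raises N50, which is how mate-pair scaffolding can produce the kind of fold increase reported above.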
Solid phase sequencing of biopolymers

DOEpatents

Cantor, Charles; Koster, Hubert

2010-09-28

This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
Optimal protein library design using recombination or point mutations based on sequence-based scoring functions.

PubMed

Pantazes, Robert J; Saraf, Manish C; Maranas, Costas D

2007-08-01

In this paper, we introduce and test two new sequence-based protein scoring systems (i.e. S1, S2) for assessing the likelihood that a given protein hybrid will be functional. By binning together amino acids with similar properties (i.e. volume, hydrophobicity and charge) the scoring systems S1 and S2 allow for the quantification of the severity of mismatched interactions in the hybrids. The S2 scoring system is found to be able to significantly functionally enrich a cytochrome P450 library over other scoring methods. Given this scoring base, we subsequently constructed two separate optimization formulations (i.e. OPTCOMB and OPTOLIGO) for optimally designing protein combinatorial libraries involving recombination or mutations, respectively. Notably, two separate versions of OPTCOMB are generated (i.e. model M1, M2) with the latter allowing for position-dependent parental fragment skipping. Computational benchmarking results demonstrate the efficacy of models OPTCOMB and OPTOLIGO to generate high scoring libraries of a prespecified size.
Tilted pillar array fabrication by the combination of proton beam writing and soft lithography for microfluidic cell capture Part 2: Image sequence analysis based evaluation and biological application.

PubMed

Járvás, Gábor; Varga, Tamás; Szigeti, Márton; Hajba, László; Fürjes, Péter; Rajta, István; Guttman, András

2018-02-01

As a continuation of our previously published work, this paper presents a detailed evaluation of a microfabricated cell capture device utilizing a doubly tilted micropillar array. The device was fabricated using a novel hybrid technology based on the combination of proton beam writing and conventional lithography techniques. Tilted pillars offer unique flow characteristics and support enhanced fluidic interaction for improved immunoaffinity based cell capture. The performance of the microdevice was evaluated by an image sequence analysis based in-house developed single-cell tracking system. Individual cell tracking allowed in-depth analysis of the cell-chip surface interaction mechanism from hydrodynamic point of view. Simulation results were validated by using the hybrid device and the optimized surface functionalization procedure. Finally, the cell capture capability of this new generation microdevice was demonstrated by efficiently arresting cells from a HT29 cell-line suspension. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Biosensors for DNA sequence detection

NASA Technical Reports Server (NTRS)

Vercoutere, Wenonah; Akeson, Mark

2002-01-01

DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
Dynamic learning and context-dependence in sequential, attribute-based, stated-preference valuation questions

Treesearch

Thomas P. Holmes; Kevin J. Boyle

2005-01-01

A hybrid stated-preference model is presented that combines the referendum contingent valuation response format with an experimentally designed set of attributes. A sequence of valuation questions is asked to a random sample in a mailout mail-back format. Econometric analysis shows greater discrimination between alternatives in the final choice in the sequence, and the...
Insights into phylogeny, sex function and age of Fragaria based on whole chloroplast genome sequencing

Treesearch

Wambui Njunguna; Aaron Liston; Richard Cronn; Tia-Lynn Ashman; Nahla Bassil

2013-01-01

The cultivated strawberry is one of the youngest domesticated plants, developed in France in the 1700s from chance hybridization between two western hemisphere octoploid species. However, little is known about the evolution of the species that gave rise to this important fruit crop. Phylogenetic analysis of chloroplast genome sequences of 21 Fragaria...
Electronic hybridization detection in microarray format and DNA genotyping

NASA Astrophysics Data System (ADS)

Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

2014-02-01

We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device.
Electronic hybridization detection in microarray format and DNA genotyping

PubMed Central

Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

2014-01-01

We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device. PMID:24569823

Hybrid selection for sequencing pathogen genomes from clinical samples

PubMed Central

2011-01-01

We have adapted a solution hybrid selection protocol to enrich pathogen DNA in clinical samples dominated by human genetic material. Using mock mixtures of human and Plasmodium falciparum malaria parasite DNA as well as clinical samples from infected patients, we demonstrate an average of approximately 40-fold enrichment of parasite DNA after hybrid selection. This approach will enable efficient genome sequencing of pathogens from clinical samples, as well as sequencing of endosymbiotic organisms such as Wolbachia that live inside diverse metazoan phyla. PMID:21835008
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Sequence conservation on the Y chromosome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gibson, L.H.; Yang-Feng, L.; Lau, C.

The Y chromosome is present in all mammals and is considered to be essential to sex determination. Despite intense genomic research, only a few genes have been identified and mapped to this chromosome in humans. Several of them, such as SRY and ZFY, have been demonstrated to be conserved and Y-located in other mammals. In order to address the issue of sequence conservation on the Y chromosome, we performed fluorescence in situ hybridization (FISH) with DNA from a human Y cosmid library as a probe to study the Y chromosomes from other mammalian species. Total DNA from 3,000-4,500 cosmid pools was labeled with biotinylated-dUTP and hybridized to metaphase chromosomes. For human and primate preparations, human cot1 DNA was included in the hybridization mixture to suppress the hybridization from repeat sequences. FISH signals were detected on the Y chromosomes of human, gorilla, orangutan and baboon (Old World monkey) and were absent on those of squirrel monkey (New World monkey), Indian muntjac, wood lemming, Chinese hamster, rat and mouse. Since sequence analysis suggested that specific genes, e.g. SRY and ZFY, are conserved between these two groups, the lack of detectable hybridization in the latter group implies either that conservation of the human Y sequences is limited to the Y chromosomes of the great apes and Old World monkeys, or that the size of the syntenic segment is too small to be detected under the resolution of FISH, or that homologous sequences have undergone considerable divergence. Further studies with reduced hybridization stringency are currently being conducted. Our results provide some clues as to Y-sequence conservation across species and demonstrate the limitations of FISH across species with total DNA sequences from a particular chromosome.
Communicator Style as a Predictor of Cyberbullying in a Hybrid Learning Environment

ERIC Educational Resources Information Center

Dursun, Ozcan Ozgur; Akbulut, Yavuz

2012-01-01

This study aimed to describe the characteristics of undergraduate students in a hybrid learning environment with regard to their communicator styles and cyberbullying behaviors. Moreover, relationships between cyberbullying victimization and learners' perceived communicator styles were investigated. Cyberbullying victimization was measured through…
Arrays of probes for positional sequencing by hybridization

DOEpatents

Cantor, Charles R [Boston, MA]; Prezetakiewiczr, Marek [East Boston, MA]; Smith, Cassandra L [Boston, MA]; Sano, Takeshi [Waltham, MA]

2008-01-15

This invention is directed to methods and reagents useful for sequencing nucleic acid targets utilizing sequencing by hybridization technology comprising probes, arrays of probes and methods whereby sequence information is obtained rapidly and efficiently in discrete packages. That information can be used for the detection, identification, purification and complete or partial sequencing of a particular target nucleic acid. When coupled with a ligation step, these methods can be performed under a single set of hybridization conditions. The invention also relates to the replication of probe arrays and methods for making and replicating arrays of probes which are useful for the large scale manufacture of diagnostic aids used to screen biological samples for specific target sequences. Arrays created using PCR technology may comprise probes with 5'- and/or 3'-overhangs.
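The general idea behind sequencing by hybridization can be sketched as follows: an array reports which short probes (k-mers) bind the target, and the sequence is rebuilt by chaining k-mers that overlap by k-1 bases. The code below shows the classical textbook formulation with a toy sequence. It is not the positional method claimed in this patent, and the function names are assumptions for illustration.

```python
def spectrum_of(sequence, k):
    """All k-mers of a sequence, as an idealised array readout would report them."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def reconstruct(spectrum):
    """Return one sequence that uses every k-mer exactly once, or None.

    Plain backtracking over (k-1)-base overlaps; adequate for toy inputs only.
    """
    kmers = list(spectrum)

    def extend(path, remaining):
        if not remaining:
            return path[0] + "".join(kmer[-1] for kmer in path[1:])
        for i, kmer in enumerate(remaining):
            if path[-1][1:] == kmer[:-1]:          # overlap of length k-1
                rest = remaining[:i] + remaining[i + 1:]
                result = extend(path + [kmer], rest)
                if result is not None:
                    return result
        return None

    for i, start in enumerate(kmers):
        result = extend([start], kmers[:i] + kmers[i + 1:])
        if result is not None:
            return result
    return None


target = "ATGCAGT"                                 # hypothetical toy target
readout = sorted(spectrum_of(target, 3))           # probe order carries no information
rebuilt = reconstruct(readout)
```

When a (k-1)-mer repeats within the target, more than one reconstruction can fit the same spectrum. Classical sequencing by hybridization is known to suffer from this ambiguity, which is why additional positional information is useful.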
Complete chloroplast and ribosomal sequences for 30 accessions elucidate evolution of Oryza AA genome species

PubMed Central

Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Yu, Yeisoo; Yang, Kiwoung; Choi, Beom-Soon; Koh, Hee-Jong; Waminal, Nomar Espinosa; Choi, Hong-Il; Kim, Nam-Hoon; Jang, Woojong; Park, Hyun-Seung; Lee, Jonghoon; Lee, Hyun Oh; Joh, Ho Jun; Lee, Hyeon Ju; Park, Jee Young; Perumal, Sampath; Jayakodi, Murukarthick; Lee, Yun Sun; Kim, Backki; Copetti, Dario; Kim, Soonok; Kim, Sunggil; Lim, Ki-Byung; Kim, Young-Dong; Lee, Jungho; Cho, Kwang-Su; Park, Beom-Seok; Wing, Rod A.; Yang, Tae-Jin

2015-01-01

Cytoplasmic chloroplast (cp) genomes and nuclear ribosomal DNA (nR) are the primary sequences used to understand plant diversity and evolution. We introduce a high-throughput method to simultaneously obtain complete cp and nR sequences using Illumina platform whole-genome sequence. We applied the method to 30 rice specimens belonging to nine Oryza species. Concurrent phylogenomic analysis using cp and nR of several specimens of the same Oryza AA genome species provides insight into the evolution and domestication of cultivated rice, clarifying three ambiguous but important issues in the evolution of wild Oryza species. First, cp-based trees clearly classify each lineage but can be biased by inter-subspecies cross-hybridization events during speciation. Second, O. glumaepatula, a South American wild rice, includes two cytoplasm types, one of which is derived from a recent interspecies hybridization with O. longistaminata. Third, the Australian O. rufipogon-type rice is a perennial form of O. meridionalis. PMID:26506948
Adaptable gene-specific dye bias correction for two-channel DNA microarrays.

PubMed

Margaritis, Thanasis; Lijnzaad, Philip; van Leenen, Dik; Bouwmeester, Diane; Kemmeren, Patrick; van Hooff, Sander R; Holstege, Frank C P

2009-01-01

DNA microarray technology is a powerful tool for monitoring gene expression or for finding the location of DNA-bound proteins. DNA microarrays can suffer from gene-specific dye bias (GSDB), causing some probes to be affected more by the dye than by the sample. This results in large measurement errors, which vary considerably for different probes and also across different hybridizations. GSDB is not corrected by conventional normalization and has been difficult to address systematically because of its variance. We show that GSDB is influenced by label incorporation efficiency, explaining the variation of GSDB across different hybridizations. A correction method (Gene- And Slide-Specific Correction, GASSCO) is presented, whereby sequence-specific corrections are modulated by the overall bias of individual hybridizations. GASSCO outperforms earlier methods and works well on a variety of publicly available datasets covering a range of platforms, organisms and applications, including ChIP on chip. A sequence-based model is also presented, which predicts which probes will suffer most from GSDB, useful for microarray probe design and correction of individual hybridizations. Software implementing the method is publicly available.
Adaptable gene-specific dye bias correction for two-channel DNA microarrays

PubMed Central

Margaritis, Thanasis; Lijnzaad, Philip; van Leenen, Dik; Bouwmeester, Diane; Kemmeren, Patrick; van Hooff, Sander R; Holstege, Frank CP

2009-01-01

DNA microarray technology is a powerful tool for monitoring gene expression or for finding the location of DNA-bound proteins. DNA microarrays can suffer from gene-specific dye bias (GSDB), causing some probes to be affected more by the dye than by the sample. This results in large measurement errors, which vary considerably for different probes and also across different hybridizations. GSDB is not corrected by conventional normalization and has been difficult to address systematically because of its variance. We show that GSDB is influenced by label incorporation efficiency, explaining the variation of GSDB across different hybridizations. A correction method (Gene- And Slide-Specific Correction, GASSCO) is presented, whereby sequence-specific corrections are modulated by the overall bias of individual hybridizations. GASSCO outperforms earlier methods and works well on a variety of publicly available datasets covering a range of platforms, organisms and applications, including ChIP on chip. A sequence-based model is also presented, which predicts which probes will suffer most from GSDB, useful for microarray probe design and correction of individual hybridizations. Software implementing the method is publicly available. PMID:19401678
A low density microarray method for the identification of human papillomavirus type 18 variants.

PubMed

Meza-Menchaca, Thuluz; Williams, John; Rodríguez-Estrada, Rocío B; García-Bravo, Aracely; Ramos-Ligonio, Ángel; López-Monteon, Aracely; Zepeda, Rossana C

2013-09-26

We describe a novel microarray-based method for the screening of oncogenic human papillomavirus 18 (HPV-18) molecular variants. Because sequencing methodology may underestimate samples containing more than one variant, we designed a specific and sensitive stacking DNA hybridization assay. This technology can be used to discriminate between three possible phylogenetic branches of HPV-18. Probes were attached covalently on glass slides and hybridized with single-stranded DNA targets. Prior to hybridization with the probes, the target strands were pre-annealed with the three auxiliary contiguous oligonucleotides flanking the target sequences. Screening HPV-18 positive cell lines and cervical samples were used to evaluate the performance of this HPV DNA microarray. Our results demonstrate that the HPV-18 variants hybridized specifically to probes, with no detection of nonspecific signals. Specific probes successfully reveal detectable point mutations in these variants. The present DNA oligoarray system can be used as a reliable, sensitive and specific method for HPV-18 variant screening. Furthermore, this simple assay allows the use of inexpensive equipment, making it accessible in resource-poor settings.
A Low Density Microarray Method for the Identification of Human Papillomavirus Type 18 Variants

PubMed Central

Meza-Menchaca, Thuluz; Williams, John; Rodríguez-Estrada, Rocío B.; García-Bravo, Aracely; Ramos-Ligonio, Ángel; López-Monteon, Aracely; Zepeda, Rossana C.

2013-01-01

We describe a novel microarray-based method for the screening of oncogenic human papillomavirus 18 (HPV-18) molecular variants. Because sequencing methodology may underestimate samples containing more than one variant, we designed a specific and sensitive stacking DNA hybridization assay. This technology can be used to discriminate between three possible phylogenetic branches of HPV-18. Probes were attached covalently on glass slides and hybridized with single-stranded DNA targets. Prior to hybridization with the probes, the target strands were pre-annealed with the three auxiliary contiguous oligonucleotides flanking the target sequences. Screening HPV-18 positive cell lines and cervical samples were used to evaluate the performance of this HPV DNA microarray. Our results demonstrate that the HPV-18 variants hybridized specifically to probes, with no detection of nonspecific signals. Specific probes successfully reveal detectable point mutations in these variants. The present DNA oligoarray system can be used as a reliable, sensitive and specific method for HPV-18 variant screening. Furthermore, this simple assay allows the use of inexpensive equipment, making it accessible in resource-poor settings. PMID:24077317
Organization and variation analysis of 5S rDNA in different ploidy-level hybrids of red crucian carp × topmouth culter.

PubMed

He, Weiguo; Qin, Qinbo; Liu, Shaojun; Li, Tangluo; Wang, Jing; Xiao, Jun; Xie, Lihua; Zhang, Chun; Liu, Yun

2012-01-01

Through distant crossing, diploid, triploid and tetraploid hybrids of red crucian carp (Carassius auratus red var., RCC♀, Cyprininae, 2n = 100) × topmouth culter (Erythroculter ilishaeformis Bleeker, TC♂, Cultrinae, 2n = 48) were successfully produced. Diploid hybrids possessed 74 chromosomes with one set from RCC and one set from TC; triploid hybrids harbored 124 chromosomes with two sets from RCC and one set from TC; tetraploid hybrids had 148 chromosomes with two sets from RCC and two sets from TC. The 5S rDNA of the three different ploidy-level hybrids and their parents were sequenced and analyzed. There were three monomeric 5S rDNA classes (designated class I: 203 bp; class II: 340 bp; and class III: 477 bp) in RCC and two monomeric 5S rDNA classes (designated class IV: 188 bp, and class V: 286 bp) in TC. In the hybrid offspring, diploid hybrids inherited three 5S rDNA classes from their female parent (RCC) and only class IV from their male parent (TC). Triploid hybrids inherited class II and class III from their female parent (RCC) and class IV from their male parent (TC). Tetraploid hybrids gained class II and class III from their female parent (RCC), and generated a new 5S rDNA sequence (designated class I-N). The specific paternal 5S rDNA sequence of class V was not found in the hybrid offspring. Sequence analysis of 5S rDNA revealed the influence of hybridization and polyploidization on the organization and variation of 5S rDNA in fish. This is the first report on the coexistence in vertebrates of viable diploid, triploid and tetraploid hybrids produced by crossing parents with different chromosome numbers, and these new hybrids are novel specimens for studying the genomic variation in the first generation of interspecific hybrids, which has significance for evolution and fish genetics.
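The chromosome counts reported in this abstract follow directly from the parental set sizes, as the short arithmetic check below shows. The variable and function names are illustrative; the numbers are those given in the abstract.

```python
RCC_2N = 100   # red crucian carp, 2n = 100
TC_2N = 48     # topmouth culter, 2n = 48

RCC_SET = RCC_2N // 2   # one haploid RCC chromosome set: 50
TC_SET = TC_2N // 2     # one haploid TC chromosome set: 24


def hybrid_chromosomes(rcc_sets, tc_sets):
    """Chromosome number of a hybrid carrying the given numbers of parental sets."""
    return rcc_sets * RCC_SET + tc_sets * TC_SET


counts = {
    "diploid": hybrid_chromosomes(1, 1),     # 50 + 24 = 74
    "triploid": hybrid_chromosomes(2, 1),    # 100 + 24 = 124
    "tetraploid": hybrid_chromosomes(2, 2),  # 100 + 48 = 148
}
```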
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design

PubMed Central

Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven

2003-01-01

We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
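The "pool of related primers" in a degenerate core can be made concrete by enumerating every nucleotide sequence that encodes a short conserved peptide under the standard genetic code. This is a simplified sketch, not the CODEHOP program itself: the peptides are hypothetical, and the real designer's handling of core length, degeneracy limits and the 5′ clamp is not reproduced here.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Reverse lookup: amino acid -> list of codons that encode it.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def degenerate_core(peptide):
    """All nucleotide sequences that encode the given peptide."""
    choices = [CODONS_FOR[aa] for aa in peptide]
    return ["".join(codons) for codons in product(*choices)]


# Hypothetical conserved motifs, chosen only to illustrate pool sizes.
small_pool = degenerate_core("WDM")    # Trp and Met have one codon each, Asp has two
larger_pool = degenerate_core("FKDE")  # four residues with two codons each
```

The pool size is the product of the codon counts per residue, so conserved blocks rich in Trp or Met give small pools, while residues such as Leu, Ser or Arg (six codons each) inflate the pool quickly.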
Optimized Probe Masking for Comparative Transcriptomics of Closely Related Species

PubMed Central

Poeschl, Yvonne; Delker, Carolin; Trenner, Jana; Ullrich, Kristian Karsten; Quint, Marcel; Grosse, Ivo

2013-01-01

Microarrays are commonly applied to study the transcriptome of specific species. However, many available microarrays are restricted to model organisms, and the design of custom microarrays for other species is often not feasible. Hence, transcriptomics approaches of non-model organisms as well as comparative transcriptomics studies among two or more species often make use of cost-intensive RNAseq studies or, alternatively, hybridize transcripts of a query species to a microarray of a closely related species. When analyzing these cross-species microarray expression data, differences in the transcriptome of the query species can cause problems, such as the following: (i) lower hybridization accuracy of probes due to mismatches or deletions, (ii) probes binding multiple transcripts of different genes, and (iii) probes binding transcripts of non-orthologous genes. So far, methods for (i) exist, but these neglect (ii) and (iii). Here, we propose an approach for comparative transcriptomics addressing problems (i) to (iii), which retains only transcript-specific probes binding transcripts of orthologous genes. We apply this approach to an Arabidopsis lyrata expression data set measured on a microarray designed for Arabidopsis thaliana, and compare it to two alternative approaches, a sequence-based approach and a genomic DNA hybridization-based approach. We investigate the number of retained probe sets, and we validate the resulting expression responses by qRT-PCR. We find that the proposed approach combines the benefit of sequence-based stringency and accuracy while allowing the expression analysis of many more genes than the alternative sequence-based approach. As an added benefit, the proposed approach requires probes to detect transcripts of orthologous genes only, which provides a superior base for biological interpretation of the measured expression responses. PMID:24260119
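The filtering rule stated in this abstract (keep a probe only if it binds transcripts of a single gene in the query species, and that gene is the ortholog of the probe's intended target) can be sketched as a simple filter. The data layout, identifiers and function name below are hypothetical. The published method's alignment and mismatch criteria are not described in the abstract and are not modelled here.

```python
def mask_probes(probe_hits, probe_target, orthologs):
    """Return the probes that survive masking.

    probe_hits:   probe id -> query-species genes whose transcripts it binds
    probe_target: probe id -> reference-species gene the probe was designed for
    orthologs:    query-species gene -> orthologous reference-species gene
    """
    kept = []
    for probe, genes in probe_hits.items():
        unique_genes = set(genes)
        if len(unique_genes) != 1:
            continue                      # no hit at all, or binds several genes (ii)
        (gene,) = unique_genes
        if orthologs.get(gene) == probe_target[probe]:
            kept.append(probe)            # specific and orthologous
        # otherwise the probe binds a non-orthologous gene (iii) and is dropped
    return sorted(kept)


# Hypothetical toy data: reference genes "refA"/"refB", query genes "qryA"/"qryB"/"qryX".
probe_hits = {
    "p1": ["qryA"],            # specific, orthologous -> kept
    "p2": ["qryA", "qryB"],    # binds two genes -> masked
    "p3": ["qryX"],            # binds a non-orthologous gene -> masked
    "p4": [],                  # binds nothing -> masked
}
probe_target = {"p1": "refA", "p2": "refA", "p3": "refB", "p4": "refB"}
orthologs = {"qryA": "refA", "qryB": "refB"}

retained = mask_probes(probe_hits, probe_target, orthologs)
```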
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Ultrasensitive determination of DNA sequences by flow injection chemiluminescence using silver ions as labels.

PubMed

Zheng, Lichun; Liu, Xiuhui; Zhou, Min; Ma, Yongjun; Wu, Guofan; Lu, Xiaoquan

2014-10-27

We present a new strategy for ultrasensitive detection of DNA sequences based on a novel detection probe labeled with Ag(+) using metallothionein (MT) as a bridge. The assay relied on a sandwich-type DNA hybridization in which the DNA targets were first hybridized to the capture oligonucleotide probes immobilized on Fe3O4@Au composite magnetic nanoparticles (MNPs), and then the Ag(+)-modified detection probes were used to monitor the presence of the specific DNA targets. After being anchored on the hybrids, Ag(+) was released by acidic treatment and sensitively determined by a coupled flow injection-chemiluminescent reaction system (Ag(+)-Mn(2+)-K2S2O8-H3PO4-luminol) (FI-CL). The experimental results showed that the CL intensities increased linearly with the concentrations of DNA targets in the range from 10 to 500 pmol L(-1) with a detection limit of 3.3 pmol L(-1). The high sensitivity in this work may be ascribed to the high molar ratio of Ag(+)-MT, the sensitive determination of Ag(+) by the coupled FI-CL reaction system and the efficient magnetic separation based on Fe3O4@Au composite MNPs. Moreover, the proposed strategy exhibited excellent selectivity against mismatched DNA sequences and could be applied to the analysis of real samples. Copyright © 2014 Elsevier B.V. All rights reserved.
Manipulation of oligonucleotides immobilized on solid supports - DNA computations on surfaces

NASA Astrophysics Data System (ADS)

Liu, Qinghua

The manipulation of DNA oligonucleotides immobilized on various solid supports has been studied intensively, especially in the area of surface hybridization. Recently, surface-based biotechnology has been applied to the area of molecular computing. These surface-based methods have advantages with regard to ease of handling, facile purification, and less interference when compared to solution methodologies. This dissertation describes the investigation of molecular approaches to DNA computing. The feasibility of encoding a bit (0 or 1) of information for DNA-based computations at the single nucleotide level was studied, particularly with regard to the efficiency and specificity of hybridization discrimination. Both gold and glass surfaces, with addressed arrays of 32 oligonucleotides, were employed with similar hybridization results. Although single-base discrimination may be achieved in the system, it is at the cost of a severe decrease in the efficiency of hybridization to perfectly matched sequences. This compromises the utility of single nucleotide encoding for DNA computing applications in the absence of some additional mechanism for increasing specificity. Several methods are suggested including a multiple-base encoding strategy. The multiple-base encoding strategy was employed to develop a prototype DNA computer. The approach was demonstrated by solving a small example of the Satisfiability (SAT) problem, an NP-complete problem in Boolean logic. 16 distinct DNA oligonucleotides, encoding all candidate solutions to the 4-variable-4-clause-3-SAT problem, were immobilized on a gold surface in the non-addressed format. Four cycles of MARK (hybridization), DESTROY (enzymatic destruction) and UNMARK (denaturation) were performed, which identified and eliminated members of the set which were not solutions to the problem. 
Determination of the answer was accomplished in the READOUT (sequence identification) operation by PCR amplification of the remaining molecules and hybridization to an addressed array. Four answers were determined and the S/N ratio between correct and incorrect solutions ranged from 10 to 777, making discrimination between correct and incorrect solutions to the problem straightforward. Additionally, studies of enzymatic manipulations of DNA molecules on surfaces suggested the use of E. coli Exonuclease I (Exo I) and perhaps EarI in the DESTROY operation.
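The MARK, DESTROY and UNMARK cycle described above can be mimicked in software. The sketch below is an illustration only: the four clauses are a hypothetical 4-variable instance chosen for the example (the abstract does not list the clauses actually used), and it assumes that each cycle marks the strands satisfying one clause and destroys the unmarked ones.

```python
from itertools import product

# Hypothetical 4-variable, 4-clause instance (an assumption for illustration;
# the abstract does not give the clauses). Each literal is (variable, wanted value).
CLAUSES = [
    [("w", 1), ("x", 1), ("y", 1)],  # w OR x OR y
    [("w", 1), ("y", 0), ("z", 1)],  # w OR (NOT y) OR z
    [("x", 0), ("y", 1)],            # (NOT x) OR y
    [("w", 0), ("y", 0)],            # (NOT w) OR (NOT y)
]
VARIABLES = "wxyz"


def satisfies(strand, clause):
    """A strand is MARKed if it agrees with at least one literal of the clause."""
    return any(strand[var] == value for var, value in clause)


def surface_computation(clauses):
    # 2**4 = 16 strands, one per candidate assignment, all on one surface.
    surface = [dict(zip(VARIABLES, bits)) for bits in product((0, 1), repeat=4)]
    for clause in clauses:
        marked = [s for s in surface if satisfies(s, clause)]  # MARK
        surface = marked  # DESTROY: unmarked strands are removed
        # UNMARK: survivors return to the unmarked state for the next cycle.
    return surface  # READOUT: whatever remains satisfies every clause


survivors = surface_computation(CLAUSES)
for s in survivors:
    print("".join(str(s[v]) for v in VARIABLES))
```

One cycle per clause is enough because a strand that fails any single clause is removed and can never return, so the set on the surface only shrinks.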
Nature and distribution of feline sarcoma virus nucleotide sequences.

PubMed Central

Frankel, A E; Gilbert, J H; Porzig, K J; Scolnick, E M; Aaronson, S A

1979-01-01

The genomes of three independent isolates of feline sarcoma virus (FeSV) were compared by molecular hybridization techniques. Using complementary DNAs prepared from two strains, SM- and ST-FeSV, common complementary DNAs were selected by sequential hybridization to FeSV and feline leukemia virus RNAs. These DNAs were shown to be highly related among the three independent sarcoma virus isolates. FeSV-specific complementary DNAs were prepared by selection for hybridization by the homologous FeSV RNA and against hybridization by feline leukemia virus RNA. Sarcoma virus-specific sequences of SM-FeSV were shown to differ from those of either ST- or GA-FeSV strains, whereas ST-FeSV-specific DNA shared extensive sequence homology with GA-FeSV. By molecular hybridization, each set of FeSV-specific sequences was demonstrated to be present in normal cat cellular DNA in approximately one copy per haploid genome and was conserved throughout Felidae. In contrast, FeSV-common sequences were present in multiple DNA copies and were found only in Mediterranean cats. The present results are consistent with the concept that each FeSV strain has arisen by a mechanism involving recombination between feline leukemia virus and cat cellular DNA sequences, the latter represented within the cat genome in a manner analogous to that of a cellular gene. PMID:225544
DNA sequence similarity recognition by hybridization to short oligomers

DOEpatents

Milosavljevic, Aleksandar

1999-01-01

Methods are disclosed for the comparison of nucleic acid sequences. Data is generated by hybridizing sets of oligomers with target nucleic acids. The data thus generated is manipulated simultaneously with respect to both (i) matching between oligomers and (ii) matching between oligomers and putative reference sequences available in databases. Using data compression methods to manipulate this mutual information, sequences for the target can be constructed.
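As a rough illustration of comparing sequences through oligomer hybridization data, the sketch below builds an idealized binary signature (which probes match a sequence exactly) and ranks reference sequences by Jaccard overlap with the target's signature. This is a simplified stand-in, not the data compression method of the patent; the probe set, sequences and function names are all invented for the example, and reverse-complement strands and mismatch tolerance are ignored.

```python
def hybridization_signature(sequence, oligomers):
    """Idealized readout: an oligomer 'hybridizes' if it occurs exactly in the sequence."""
    return frozenset(o for o in oligomers if o in sequence)


def jaccard(a, b):
    """Overlap between two signatures, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def rank_references(target, references, oligomers):
    """Score every reference against the target, best match first."""
    target_sig = hybridization_signature(target, oligomers)
    scored = [
        (jaccard(target_sig, hybridization_signature(seq, oligomers)), name)
        for name, seq in references.items()
    ]
    return sorted(scored, reverse=True)


# Hypothetical probe set and sequences, invented for this example.
OLIGOMERS = ["ACGT", "CGTA", "GTAC", "TTGA", "GGCC", "AATT"]
REFERENCES = {
    "ref_similar": "TTACGTACGGAATTC",
    "ref_distant": "GGCCTTGAGGCCTTGA",
}
TARGET = "CCACGTACTTAATTG"

ranking = rank_references(TARGET, REFERENCES, OLIGOMERS)
for score, name in ranking:
    print(f"{name}: {score:.2f}")
```

Two different sequences can share an identical signature, so a small probe set gives evidence of similarity rather than proof of identity.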
Characterization of the complete mitochondrial genome of the hybrid Epinephelus moara♀ × Epinephelus lanceolatus♂, and phylogenetic analysis in subfamily epinephelinae

NASA Astrophysics Data System (ADS)

Gao, Fengtao; Wei, Min; Zhu, Ying; Guo, Hua; Chen, Songlin; Yang, Guanpin

2017-06-01

This study presents the complete mitochondrial genome of the hybrid Epinephelus moara♀× Epinephelus lanceolatus♂. The genome is 16886 bp in length, and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, a light-strand replication origin and a control region. Additionally, phylogenetic analysis based on the nucleotide sequences of 13 conserved protein-coding genes using the maximum likelihood method indicated that the mitochondrial genome is maternally inherited. This study presents genomic data for studying phylogenetic relationships and breeding of hybrid Epinephelinae.

Hybrids of Nucleic Acids and Carbon Nanotubes for Nanobiotechnology.

PubMed

Umemura, Kazuo

2015-03-12

Recent progress in the combination of nucleic acids and carbon nanotubes (CNTs) has been briefly reviewed here. Since discovering the hybridization phenomenon of DNA molecules and CNTs in 2003, a large amount of fundamental and applied research has been carried out. Among thousands of papers published since 2003, approximately 240 papers focused on biological applications were selected and categorized based on the types of nucleic acids used, but not the types of CNTs. This survey revealed that the hybridization phenomenon is strongly affected by various factors, such as DNA sequences, and for this reason, fundamental studies on the hybridization phenomenon are important. Additionally, many research groups have proposed numerous practical applications, such as nanobiosensors. The goal of this review is to provide perspective on biological applications using hybrids of nucleic acids and CNTs.
Marinospirillum insulare sp. nov., a novel halophilic helical bacterium isolated from kusaya gravy.

PubMed

Satomi, M; Kimura, B; Hayashi, M; Okuzumi, M; Fujii, T

2004-01-01

A novel species that belongs to the genus Marinospirillum is described on the basis of phenotypic characteristics, phylogenetic analysis of 16S rRNA and gyrB gene sequences and DNA-DNA hybridization. Four strains of helical, halophilic, Gram-negative, heterotrophic bacteria were isolated from kusaya gravy, which is fermented brine that is used for the production of traditional dried fish in the Izu Islands of Japan. All of the new isolates were motile by means of bipolar tuft flagella, of small cell size, coccoid-body-forming and aerophilic; it was concluded that they belong to the same bacterial species, based on DNA-DNA hybridization values (>70% DNA relatedness). DNA G+C contents of the new strains were 42-43 mol% and they had isoprenoid quinone Q-8 as the major component. Phylogenetic analysis of 16S rRNA gene sequences indicated that the new isolates were members of the genus Marinospirillum; sequence similarity of the new isolates to Marinospirillum minutulum, Marinospirillum megaterium and Marinospirillum alkaliphilum was 98.5, 98.2 and 95.2%, respectively. Phylogenetic analysis based on the gyrB gene indicated that the new isolates had enough phylogenetic distance from M. minutulum and M. megaterium to be regarded as different species, with 84.7 and 78.7% sequence similarity, respectively. DNA-DNA hybridization showed that the new isolates had <36% DNA relatedness to M. minutulum and M. megaterium, supporting the phylogenetic conclusion. Thus, a novel species is proposed: Marinospirillum insulare sp. nov. (type strain, KT=LMG 21802T=NBRC 100033T).
Chromosomal distribution of pTa-535, pTa-86, pTa-713, 35S rDNA repetitive sequences in interspecific hexaploid hybrids of common wheat (Triticum aestivum L.) and spelt (Triticum spelta L.)

PubMed Central

Duba, Adrian; Kwiatek, Michał; Wiśniewska, Halina; Wachowska, Urszula; Wiwart, Marian

2018-01-01

Fluorescent in situ hybridization (FISH) relies on fluorescent-labeled probes to detect specific DNA sequences in the genome, and it is widely used in cytogenetic analyses. The aim of this study was to determine the karyotype of T. aestivum and T. spelta hybrids and their parental components (three common wheat cultivars and five spelt breeding lines), to identify chromosomal aberrations in the evaluated wheat lines, and to analyze the distribution of polymorphisms of repetitive sequences in the examined hybrids. The FISH procedure was carried out with four DNA clones, pTa-86, pTa-535, pTa-713 and 35S rDNA, used as probes. The level of polymorphism observed between the investigated lines of common wheat, spelt and their hybrids was relatively low. However, differences were observed in the distribution of repetitive sequences on chromosomes 4A, 6A, 1B and 6B in selected hybrid genomes. The polymorphisms observed in common wheat and spelt hybrids carry valuable information for wheat breeders. The results of our study are also a valuable source of knowledge about genome organization and diversification in common wheat, spelt and their hybrids. The relevant information is essential for common wheat breeders, and it can contribute to breeding programs aimed at biodiversity preservation. PMID:29447228
Chromosomal distribution of pTa-535, pTa-86, pTa-713, 35S rDNA repetitive sequences in interspecific hexaploid hybrids of common wheat (Triticum aestivum L.) and spelt (Triticum spelta L.).

PubMed

Goriewa-Duba, Klaudia; Duba, Adrian; Kwiatek, Michał; Wiśniewska, Halina; Wachowska, Urszula; Wiwart, Marian

2018-01-01

Fluorescent in situ hybridization (FISH) relies on fluorescent-labeled probes to detect specific DNA sequences in the genome, and it is widely used in cytogenetic analyses. The aim of this study was to determine the karyotype of T. aestivum and T. spelta hybrids and their parental components (three common wheat cultivars and five spelt breeding lines), to identify chromosomal aberrations in the evaluated wheat lines, and to analyze the distribution of polymorphisms of repetitive sequences in the examined hybrids. The FISH procedure was carried out with four DNA clones, pTa-86, pTa-535, pTa-713 and 35S rDNA, used as probes. The level of polymorphism observed between the investigated lines of common wheat, spelt and their hybrids was relatively low. However, differences were observed in the distribution of repetitive sequences on chromosomes 4A, 6A, 1B and 6B in selected hybrid genomes. The polymorphisms observed in common wheat and spelt hybrids carry valuable information for wheat breeders. The results of our study are also a valuable source of knowledge about genome organization and diversification in common wheat, spelt and their hybrids. The relevant information is essential for common wheat breeders, and it can contribute to breeding programs aimed at biodiversity preservation.
Genetic relatedness of orbiviruses by RNA-RNA blot hybridization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bodkin, D.K.

1985-01-01

RNA-RNA blot hybridization was developed in order to identify type-specific genes among double-stranded (ds) RNA viruses, to assess the genetic relatedness of dsRNA viruses and to classify new strains. Viral dsRNA segments were electrophoresed through 10% polyacrylamide gels, transferred to membranes, and hybridized to (5'-32P)-pCp labeled genomic RNA from a related strain. Hybridization was performed at 52 °C, 50% formamide, 5X SSC. Under these conditions heterologous RNA species must share greater than or equal to 74% sequence homology in order to form stable dsRNA hybrids. Cognate genes of nine members of the Palyam serogroup of orbiviruses were identified and their sequence relatedness to the prototype, Palyam virus, was determined. Reciprocal blot hybridizations were performed using radiolabeled genomic RNA of all members of the Palyam serogroup. Unique and variant genes were identified by lack of cross-homology or by weak homology between segments. Since genes 2 and 6 exhibited the highest degree of sequence variability, response to the vertebrate immune system may be a major cause of sequence divergence among members of a single serogroup. Changuinola serogroup isolates were compared by dot-blot hybridization, while Colorado tick fever (CTF) serogroup isolates were compared by the RNA-RNA blot hybridization procedure described for reovirus and Palyam serogroup isolates. Preliminary blot hybridization data were also obtained on the relatedness of members of different Orbivirus serogroups.
Electrochemical detection of sequence-specific DNA based on formation of G-quadruplex-hemin through continuous hybridization chain reaction.

PubMed

Sun, Xiaofan; Chen, Haohan; Wang, Shuling; Zhang, Yiping; Tian, Yaping; Zhou, Nandi

2018-08-27

A highly sensitive method for detecting sequence-specific DNA was established based on the formation of G-quadruplex-hemin complex through continuous hybridization chain reaction (HCR). Taking HIV DNA sequence as an example, a capture probe complementary to part of HIV DNA was first self-assembled onto the surface of an Au electrode. Then a specially designed assistant probe with both terminals complementary to the target DNA and a G-quadruplex-forming sequence in the center was introduced into the detection solution. In the presence of both the target DNA and the assistant probe, the target DNA can be captured on the electrode surface and then a continuous HCR can be conducted due to the mutual recognition of the target DNA and the assistant probe, leading to the formation of a large number of G-quadruplexes on the electrode surface. With the help of hemin, a pronounced electrochemical signal can be observed in differential pulse voltammetry (DPV), due to the formation of G-quadruplex-hemin complex. The peak current is linearly related to the logarithm of the concentration of the target DNA in the range from 10 fM to 10 pM. The electrochemical sensor has high selectivity and clearly discriminates single-base mismatched and three-base mismatched sequences from the original HIV DNA sequence. Moreover, the established DNA sensor was challenged by detection of HIV DNA in human serum samples, which showed a low detection limit of 6.3 fM. Thus it has great potential for application in clinical diagnosis and environmental monitoring. Copyright © 2018 Elsevier B.V. All rights reserved.
Structural protein descriptors in 1-dimension and their sequence-based predictions.

PubMed

Kurgan, Lukasz; Disfani, Fatemeh Miri

2011-09-01

The last few decades have seen an increasing interest in the development and application of 1-dimensional (1D) descriptors of protein structure. These descriptors project 3D structural features onto 1D strings of residue-wise structural assignments. They cover a wide range of structural aspects including conformation of the backbone, burying depth/solvent exposure and flexibility of residues, and inter-chain residue-residue contacts. We perform a first-of-its-kind comprehensive comparative review of the existing 1D structural descriptors. We define, review and categorize ten structural descriptors and we also describe, summarize and contrast over eighty computational models that are used to predict these descriptors from the protein sequences. We show that the majority of the recent sequence-based predictors utilize machine learning models, with the most popular being neural networks, support vector machines, hidden Markov models, and support vector and linear regressions. These methods provide high-throughput predictions and most of them are accessible to a non-expert user via web servers and/or stand-alone software packages. We empirically evaluate several recent sequence-based predictors of secondary structure, disorder, and solvent accessibility descriptors using a benchmark set based on CASP8 targets. Our analysis shows that the secondary structure can be predicted with over 80% accuracy and segment overlap (SOV), disorder with over 0.9 AUC, 0.6 Matthews Correlation Coefficient (MCC), and 75% SOV, and relative solvent accessibility with PCC of 0.7 and MCC of 0.6 (0.86 when homology is used). We demonstrate that the secondary structure predicted from sequence without the use of homology modeling is as good as the structure extracted from the 3D folds predicted by top-performing template-based methods.
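For readers unfamiliar with the Matthews Correlation Coefficient quoted above, the following minimal sketch computes it from confusion-matrix counts using the standard definition. The counts are hypothetical and are not taken from the review; they only show how MCC and plain accuracy can diverge when one class is rare.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC from confusion-matrix counts; returns 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical residue-level counts for an imbalanced binary task.
counts = dict(tp=60, tn=900, fp=20, fn=20)

print(f"accuracy = {accuracy(**counts):.3f}")
print(f"MCC      = {matthews_corrcoef(**counts):.3f}")
```

With these made-up counts the accuracy is high because negatives dominate, while MCC is noticeably lower because it also weighs errors on the rare positive class.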
Multilocus phylogeny and phylogenomics of Eriochrysis P. Beauv. (Poaceae-Andropogoneae): Taxonomic implications and evidence of interspecific hybridization.

PubMed

Welker, Cassiano A D; Souza-Chies, Tatiana T; Longhi-Wagner, Hilda M; Peichoto, Myriam Carolina; McKain, Michael R; Kellogg, Elizabeth A

2016-06-01

Species delimitation is a vital issue concerning evolutionary biology and conservation of biodiversity. However, it is a challenging task for several reasons, including the low interspecies variability of markers currently used in phylogenetic reconstructions and the occurrence of reticulate evolution and polyploidy in many lineages of flowering plants. The first phylogeny of the grass genus Eriochrysis is presented here, focusing on the New World species, in order to examine its relationships to other genera of the subtribe Saccharinae/tribe Andropogoneae and to define the circumscriptions of its taxonomically complicated species. Molecular cloning and sequencing of five regions of four low-copy nuclear genes (apo1, d8, ep2-ex7 and ep2-ex8, kn1) were performed, as well as complete plastome sequencing. Trees were reconstructed using maximum parsimony, maximum likelihood, and Bayesian inference analyses. The present phylogenetic analyses indicate that Eriochrysis is monophyletic and the Old World E. pallida is sister to the New World species. Subtribe Saccharinae is polyphyletic, as is the genus Eulalia. Based on nuclear and plastome sequences plus morphology, we define the circumscriptions of the New World species of Eriochrysis: E. laxa is distinct from E. warmingiana, and E. villosa is distinct from E. cayennensis. Natural hybrids occur between E. laxa and E. villosa. The hybrids are probably tetraploids, based on the number of paralogues in the nuclear gene trees. This is the first record of a polyploid taxon in the genus Eriochrysis. Some incongruities between nuclear genes and plastome analyses were detected and are potentially caused by incomplete lineage sorting and/or ancient hybridization. The set of low-copy nuclear genes used in this study seems to be sufficient to resolve phylogenetic relationships and define the circumscriptions of other species complexes in the grass family and relatives, even in the presence of polyploidy and reticulate evolution. 
Complete plastome sequencing is also a promising tool for phylogenetic inference. Copyright © 2016 Elsevier Inc. All rights reserved.
Expanding probe repertoire and improving reproducibility in human genomic hybridization

PubMed Central

Dorman, Stephanie N.; Shirley, Ben C.; Knoll, Joan H. M.; Rogan, Peter K.

2013-01-01

Diagnostic DNA hybridization relies on probes composed of single copy (sc) genomic sequences. Sc sequences in probe design ensure high specificity and avoid cross-hybridization to other regions of the genome, which could lead to ambiguous results that are difficult to interpret. We examine how the distribution and composition of repetitive sequences in the genome affect sc probe performance. A divide and conquer algorithm was implemented to design sc probes. With this approach, sc probes can include divergent repetitive elements, which hybridize to unique genomic targets under higher stringency experimental conditions. Genome-wide custom probe sets were created for fluorescent in situ hybridization (FISH) and microarray genomic hybridization. The scFISH probes were developed for detection of copy number changes within small tumour suppressor genes and oncogenes. The microarrays demonstrated increased reproducibility by eliminating cross-hybridization to repetitive sequences adjacent to probe targets. The genome-wide microarrays exhibited lower median coefficients of variation (17.8%) for two HapMap family trios. The coefficients of variation of commercial probes within 300 nt of a repetitive element were 48.3% higher than those of the nearest custom probe. Furthermore, the custom microarray called a chromosome 15q11.2q13 deletion more consistently. This method for sc probe design increases probe coverage for FISH and lowers variability in genomic microarrays. PMID:23376933
Validation of two ribosomal RNA removal methods for microbial metatranscriptomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Shaomei; Wurtzel, Omri; Singh, Kanwar

2010-10-01

The predominance of rRNAs in the transcriptome is a major technical challenge in sequence-based analysis of cDNAs from microbial isolates and communities. Several approaches have been applied to deplete rRNAs from (meta)transcriptomes, but no systematic investigation of potential biases introduced by any of these approaches has been reported. Here we validated the effectiveness and fidelity of the two most commonly used approaches, subtractive hybridization and exonuclease digestion, as well as combinations of these treatments, on two synthetic five-microorganism metatranscriptomes using massively parallel sequencing. We found that the effectiveness of rRNA removal was a function of community composition and RNA integrity for these treatments. Subtractive hybridization alone introduced the least bias in relative transcript abundance, whereas exonuclease and in particular combined treatments greatly compromised mRNA abundance fidelity. Illumina sequencing itself also can compromise quantitative data analysis by introducing a G+C bias between runs.
Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning.

PubMed

Liu, Bin; Wang, Shanyi; Dong, Qiwen; Li, Shumin; Liu, Xuan

2016-04-20

DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next-generation sequencing techniques, the number of protein sequences is increasing at an unprecedented rate. Thus it is necessary to develop computational methods to identify DNA-binding proteins based only on protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into the SVM to discriminate the DNA-binding proteins from the non-DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and a Matthews correlation coefficient of 0.5 in a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset show that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA-binding protein identification.
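The step that turns a variable-length profile into a fixed-length vector can be sketched as follows. This is the generic auto-cross covariance transformation in its usual textbook form, not the iDNA-KACC code: the Kmer composition step is omitted, and the function name, the toy two-column profile and the choice of `max_lag` are assumptions made for illustration.

```python
def acc_transform(profile, max_lag):
    """Auto-cross covariance of an L x N profile, giving a fixed-length vector.

    The output has N * max_lag auto-covariance terms followed by
    N * (N - 1) * max_lag cross-covariance terms, whatever the sequence
    length L is (L must exceed max_lag).
    """
    length, n_cols = len(profile), len(profile[0])
    if length <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    means = [sum(row[i] for row in profile) / length for i in range(n_cols)]

    def covariance(i1, i2, lag):
        # Average product of deviations for column i1 at position j
        # and column i2 at position j + lag.
        total = sum(
            (profile[j][i1] - means[i1]) * (profile[j + lag][i2] - means[i2])
            for j in range(length - lag)
        )
        return total / (length - lag)

    features = []
    for lag in range(1, max_lag + 1):  # auto covariance: same column
        for i in range(n_cols):
            features.append(covariance(i, i, lag))
    for lag in range(1, max_lag + 1):  # cross covariance: different columns
        for i1 in range(n_cols):
            for i2 in range(n_cols):
                if i1 != i2:
                    features.append(covariance(i1, i2, lag))
    return features


# Toy 4-residue profile with 2 columns (a real profile would have 20).
toy_profile = [[1, 0], [0, 1], [1, 0], [0, 1]]
toy_features = acc_transform(toy_profile, max_lag=1)
print(toy_features)
```

Because the vector length depends only on the number of columns and on `max_lag`, proteins of different lengths map to vectors of the same size, which is what a classifier such as an SVM needs as input.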
Morphological and genetic evidence of contemporary intersectional hybridisation in Mediterranean Helichrysum (Asteraceae, Gnaphalieae).

PubMed

Galbany-Casals, M; Carnicero-Campmany, P; Blanco-Moreno, J M; Smissen, R D

2012-09-01

Hybridisation is considered an important evolutionary phenomenon in Gnaphalieae, but contemporary hybridisation has been little explored within the tribe. Here, hybridisation between Helichrysum orientale and Helichrysum stoechas is studied at two different localities in the islands of Crete and Rhodes (Greece). Using three different types of molecular data (AFLP, nrDNA ITS sequences and cpDNA ndhF sequences) and morphological data, the aim is to provide simultaneous and direct comparisons between molecular and morphological variation among the parental species and the studied hybrid populations. AFLP profiles, ITS sequences and morphological data support the existence of hybrids at the two localities studied, shown as morphological and genetic intermediates between the parental species. Chloroplast DNA sequences show that both parental species can act either as pollen donor or as maternal parent. Fertility of hybrids is demonstrated by the viability of seeds produced by hybrids from both localities, and the detection of a backcross specimen to H. orientale. Although there is general congruence of morphological and molecular data, the analysis of morphology and ITS sequences can fail to detect backcross hybrids. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
Predicting Human Protein Subcellular Locations by the Ensemble of Multiple Predictors via Protein-Protein Interaction Network with Edge Clustering Coefficients

PubMed Central

Du, Pufeng; Wang, Lusheng

2014-01-01

One of the fundamental tasks in biology is to identify the functions of all proteins to reveal the primary machinery of a cell. Knowledge of the subcellular locations of proteins will provide key hints to reveal their functions and to understand the intricate pathways that regulate biological processes at the cellular level. Protein subcellular location prediction has been extensively studied in the past two decades. Many methods have been developed based on protein primary sequences as well as protein-protein interaction networks. In this paper, we propose to use the protein-protein interaction network as an infrastructure to integrate existing sequence-based predictors. When predicting the subcellular locations of a given protein, not only the protein itself, but also all its interacting partners were considered. Unlike existing methods, our method requires neither comprehensive knowledge of the protein-protein interaction network nor the experimentally annotated subcellular locations of most proteins in the protein-protein interaction network. Besides, our method can be used as a framework to integrate multiple predictors. Our method achieved an absolute-true rate of 56% on the human proteome, which is higher than that of the state-of-the-art methods. PMID:24466278
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

PubMed

Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

2015-02-07

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. 
This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. In the long term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.
Optimizing the specificity of nucleic acid hybridization

PubMed Central

Zhang, David Yu; Chen, Sherry Xi; Yin, Peng

2014-01-01

The specific hybridization of complementary sequences is an essential property of nucleic acids, enabling diverse biological and biotechnological reactions and functions. However, the specificity of nucleic acid hybridization is compromised for long strands, except near the melting temperature. Here, we analytically derived the thermodynamic properties of a hybridization probe that would enable near-optimal single-base discrimination and perform robustly across diverse temperature, salt and concentration conditions. We rationally designed ‘toehold exchange’ probes that approximate these properties, and comprehensively tested them against five different DNA targets and 55 spurious analogues with energetically representative single-base changes (replacements, deletions and insertions). These probes produced discrimination factors between 3 and 100+ (median, 26). Without retuning, our probes function robustly from 10 °C to 37 °C, from 1 mM Mg2+ to 47 mM Mg2+, and with nucleic acid concentrations from 1 nM to 5 μM. Experiments with RNA also showed effective single-base change discrimination. PMID:22354435
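The link between reaction free energy and single-base discrimination can be illustrated with a simplified two-state exchange model, target + probe <-> product + protector. The sketch assumes equal starting concentrations of target and probe and no product initially; the free-energy values, including the 4 kcal/mol mismatch penalty, are hypothetical and not taken from the paper, and the function names are invented for the example.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def exchange_yield(delta_g, temp_c=25.0):
    """Equilibrium yield of target + probe <-> product + protector.

    Assumes equal starting concentrations c of target and probe and no
    product or protector at the start, so K = x**2 / (c - x)**2 and
    x / c = sqrt(K) / (1 + sqrt(K)), independent of c.
    """
    k_eq = math.exp(-delta_g / (R_KCAL * (temp_c + 273.15)))
    root = math.sqrt(k_eq)
    return root / (1.0 + root)


def discrimination_factor(delta_g_correct, ddg_mismatch, temp_c=25.0):
    """Ratio of yields for the correct target and a single-base variant."""
    correct = exchange_yield(delta_g_correct, temp_c)
    spurious = exchange_yield(delta_g_correct + ddg_mismatch, temp_c)
    return correct / spurious


# Hypothetical free energies in kcal/mol.
for dg in (0.0, -10.0):
    q = discrimination_factor(dg, ddg_mismatch=4.0)
    print(f"dG = {dg:+.1f} kcal/mol: yield = {exchange_yield(dg):.3f}, Q = {q:.1f}")
```

In this toy model a reaction tuned close to zero free energy gives up about half of the correct-target yield but separates the mismatch clearly, whereas a strongly favorable reaction binds both targets almost completely and the discrimination factor falls toward 1.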
Electrochemical DNA biosensor based on a glassy carbon electrode modified with gold nanoparticles and graphene for sensitive determination of Klebsiella pneumoniae carbapenemase.

PubMed

Pan, Hong-zhi; Yu, Hong-wei; Wang, Na; Zhang, Ze; Wan, Guang-cai; Liu, Hao; Guan, Xue; Chang, Dong

2015-11-20

We describe the fabrication of a sensitive electrochemical DNA biosensor for determination of Klebsiella pneumoniae carbapenemase (KPC). The highly sensitive and selective electrochemical biosensor for DNA detection was constructed based on a glassy carbon electrode (GCE) modified with gold nanoparticles (Au-NPs) and graphene (Gr). The Au-NPs/Gr/GCE was then characterized by scanning electron microscopy (SEM), cyclic voltammetry (CV) and electrochemical impedance spectroscopy (EIS). Hybridization was detected by differential pulse voltammetry (DPV) using methylene blue (MB) as the hybridization indicator. The dynamic range of the sensor for the target DNA sequences was from 1 × 10^-12 to 1 × 10^-7 mol/L, with a detection limit of 2 × 10^-13 mol/L. The DNA biosensor had excellent specificity for distinguishing the complementary DNA sequence in the presence of non-complementary and mismatched DNA sequences. The results demonstrated that the Au-NPs/Gr nanocomposite was a promising substrate for the development of high-performance electrocatalysts for determination of KPC. Copyright © 2015 Elsevier B.V. All rights reserved.
A core microbiome associated with the peritoneal tumors of pseudomyxoma peritonei

PubMed Central

2013-01-01

Background: Pseudomyxoma peritonei (PMP) is a malignancy characterized by dissemination of mucus-secreting cells throughout the peritoneum. This disease is associated with significant morbidity and mortality and despite effective treatment options for early-stage disease, patients with PMP often relapse. Thus, there is a need for additional treatment options to reduce relapse rate and increase long-term survival. A previous study identified the presence of both typed and non-culturable bacteria associated with PMP tissue and determined that increased bacterial density was associated with more severe disease. These findings highlighted the possible role for bacteria in PMP disease. Methods: To more clearly define the bacterial communities associated with PMP disease, we employed a sequence-based analysis to profile the bacterial populations found in PMP tumor and mucin tissue in 11 patients. Sequencing data were confirmed by in situ hybridization at multiple taxonomic depths and by culturing. A pilot clinical study was initiated to determine whether the addition of antibiotic therapy affected PMP patient outcome. Main results: We determined that the types of bacteria present are highly conserved in all PMP patients; the dominant phyla are the Proteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. A core set of taxon-specific sequences was found in all 11 patients; many of these sequences were classified into taxonomic groups that also contain known human pathogens. In situ hybridization directly confirmed the presence of bacteria in PMP at multiple taxonomic depths and supported our sequence-based analysis. Furthermore, culturing of PMP tissue samples allowed us to isolate 11 different bacterial strains from eight independent patients, and in vitro analysis of a subset of these isolates suggests that at least some of these strains may interact with the PMP-associated mucin MUC2. 
Finally, we provide evidence suggesting that targeting these bacteria with antibiotic treatment may increase the survival of PMP patients. Conclusions: Using 16S amplicon-based sequencing, direct in situ hybridization analysis and culturing methods, we have identified numerous bacterial taxa that are consistently present in all PMP patients tested. Combined with data from a pilot clinical study, these data support the hypothesis that adding antimicrobials to the standard PMP treatment could improve PMP patient survival. PMID:23844722
On the phylogenetic position of the scrub-birds (Passeriformes: Menurae: Atrichornithidae) of Australia

USGS Publications Warehouse

Chesser, R.T.; ten Have, J.

2007-01-01

Evolutionary relationships of the scrub-birds Atrichornis were investigated using complete sequences of the recombination-activating gene RAG-1 and the proto-oncogene c-mos for two individuals of the noisy scrub-bird Atrichornis clamosus. Phylogenetic analysis revealed that Atrichornis was sister to the genus Menura (the lyrebirds) and that these two genera (the Menurae) were sister to the rest of the oscine passerines. A sister relationship between Atrichornis and Menura supports the traditional view, based on morphology and DNA hybridization, that these taxa are closely related. Similarly, a sister relationship with the remaining oscine passerines agrees with the morphological distinctiveness of Atrichornis and Menura, although this result contradicts conclusions based on DNA hybridization studies. Although Atrichornis is very well known morphologically, previous conclusions regarding its relationships were hampered by a lack of comparative knowledge of other passerines, making concurrence of the sequence data of particular significance. © Dt. Ornithologen-Gesellschaft e.V. 2007.
Morphological characteristics and genetic diversity of Burmese long-tailed Macaques (Macaca fascicularis aurea).

PubMed

Bunlungsup, Srichan; Imai, Hiroo; Hamada, Yuzuru; Gumert, Michael D; San, Aye Mi; Malaivijitnond, Suchinda

2015-12-15

Macaca fascicularis aurea (Mfa) is the only macaque which has been recorded to use stone tools to access encased foods. They live in close contact with M. fascicularis fascicularis (Mff) in southwestern Thailand and the hybrids were reported [Fooden, 1995]. Although Mff and Mfa can be seen in the same habitat types, tool-use behavior has never been reported in Mff. Thus, comparing the morphological characteristics and genetics between Mfa and Mff should help elucidate not only the morphological differences and genetic divergence between these subspecies but also potentially the relationship between genetics and their tool use behavior. We surveyed Mfa and Mff in Myanmar and Thailand, ranging from 16° 58' to 7° 12' N. Fecal or blood samples were collected from eight, five, and four populations of Mfa, Mff, and Mff × Mfa morphological hybrids along with three individuals of captive Chinese M. mulatta (Mm), respectively, for mtDNA and Y-chromosome (TSPY and SRY genes) DNA sequence analyses. In addition, eight populations were captured and measured for 38 somatometric dimensions. Comparison of the somatic measurements revealed that Mfa had a statistically significantly shorter tail than Mff (P < 0.05). Based on the mtDNA sequences, Mfa was separated from the Mm/Mff clade. Within the Mfa clade, the mainland Myanmar population was separate from the Mergui Archipelago and Thailand Andaman seacoast populations. All the morphological hybrids had the Mff mtDNA haplotype. Based on the Y-chromosome sequences, the three major clades of Mm/Indochinese Mff, Sundaic Mff, and Mfa were constructed. The hybrid populations grouped either with the Mm/Indochinese Mff or with the Mfa. Regarding the genetic analysis, one subspecies hybrid population in Thailand (KRI) elicited tool use behavior, thus the potential role of genetics in tool use behavior is raised in addition to the environmental force, morphological suitability, and cognitive capability. Am. J. Primatol. 
© 2015 Wiley Periodicals, Inc.
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.

PubMed

Walia, Rasna R; Caragea, Cornelia; Lewis, Benjamin A; Towfic, Fadi; Terribilini, Michael; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

2012-05-10

RNA molecules play diverse functional and structural roles in cells. They function as messengers for transferring genetic information from DNA to proteins, as the primary genetic material in many viruses, as catalysts (ribozymes) important for protein synthesis and RNA processing, and as essential and ubiquitous regulators of gene expression in living organisms. Many of these functions depend on precisely orchestrated interactions between RNA molecules and specific proteins in cells. Understanding the molecular mechanisms by which proteins recognize and bind RNA is essential for comprehending the functional implications of these interactions, but the recognition 'code' that mediates interactions between proteins and RNA is not yet understood. Success in deciphering this code would dramatically impact the development of new therapeutic strategies for intervening in devastating diseases such as AIDS and cancer. Because of the high cost of experimental determination of protein-RNA interfaces, there is an increasing reliance on statistical machine learning methods for training predictors of RNA-binding residues in proteins. However, because of differences in the choice of datasets, performance measures, and data representations used, it has been difficult to obtain an accurate assessment of the current state of the art in protein-RNA interface prediction. We provide a review of published approaches for predicting RNA-binding residues in proteins and a systematic comparison and critical assessment of protein-RNA interface residue predictors trained using these approaches on three carefully curated non-redundant datasets. We directly compare two widely used machine learning algorithms (Naïve Bayes (NB) and Support Vector Machine (SVM)) using three different data representations in which features are encoded using either sequence- or structure-based windows. 
Our results show that (i) Sequence-based classifiers that use a position-specific scoring matrix (PSSM)-based representation (PSSMSeq) outperform those that use an amino acid identity based representation (IDSeq) or a smoothed PSSM (SmoPSSMSeq); (ii) Structure-based classifiers that use smoothed PSSM representation (SmoPSSMStr) outperform those that use PSSM (PSSMStr) as well as sequence identity based representation (IDStr). PSSMSeq classifiers, when tested on an independent test set of 44 proteins, achieve performance that is comparable to that of three state-of-the-art structure-based predictors (including those that exploit geometric features) in terms of Matthews Correlation Coefficient (MCC), although the structure-based methods achieve substantially higher Specificity (albeit at the expense of Sensitivity) compared to sequence-based methods. We also find that the expected performance of the classifiers on a residue level can be markedly different from that on a protein level. Our experiments show that the classifiers trained on three different non-redundant protein-RNA interface datasets achieve comparable cross-validation performance. However, we find that the results are significantly affected by differences in the distance threshold used to define interface residues. Our results demonstrate that protein-RNA interface residue predictors that use a PSSM-based encoding of sequence windows outperform classifiers that use other encodings of sequence windows. While structure-based methods that exploit geometric features can yield significant increases in the Specificity of protein-RNA interface residue predictions, such increases are offset by decreases in Sensitivity. 
These results underscore the importance of comparing alternative methods using rigorous statistical procedures, multiple performance measures, and datasets that are constructed based on several alternative definitions of interface residues and redundancy cutoffs as well as including evaluations on independent test sets into the comparisons.
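The performance measures named in this abstract can be computed from a residue-level confusion matrix. The sketch below is only an illustration and is not taken from the paper; the function name and the returned keys are hypothetical. Because some interface-prediction papers use "Specificity" for TP/(TP+FP), both that quantity (reported here as precision) and the true-negative rate are returned, and which of the two the abstract means is left as an assumption.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Summarize a binary confusion matrix (hypothetical helper).

    tp/fp/tn/fn are counts of residues predicted as interface or
    non-interface versus their true labels.
    """
    def ratio(num, den):
        # Return 0.0 instead of raising when a class is empty.
        return num / den if den else 0.0

    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": ratio(tp, tp + fn),          # recall on interface residues
        "precision": ratio(tp, tp + fp),            # sometimes called "specificity"
        "true_negative_rate": ratio(tn, tn + fp),   # conventional specificity
        "mcc": ratio(tp * tn - fp * fn, denom),     # Matthews Correlation Coefficient
    }

if __name__ == "__main__":
    print(binary_metrics(tp=40, fp=10, tn=40, fn=10))
```

Computing the same dictionary once over all residues and once per protein (then averaging) is one way to see the residue-level versus protein-level difference the abstract mentions.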

Nucleotide sequence composition and method for detection of Neisseria gonorrhoeae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lo, A.; Yang, H.L.

1990-02-13

This patent describes a composition of matter that is specific for Neisseria gonorrhoeae. It comprises at least one nucleotide sequence for which the ratio of the amount of the sequence which hybridizes to chromosomal DNA of Neisseria gonorrhoeae to the amount of the sequence which hybridizes to chromosomal DNA of Neisseria meningitidis is greater than about five, the ratio being obtained by a method described in the patent.
Graphical workstation capability for reliability modeling

NASA Technical Reports Server (NTRS)

Bavuso, Salvatore J.; Koppen, Sandra V.; Haley, Pamela J.

1992-01-01

In addition to computational capabilities, software tools for estimating the reliability of fault-tolerant digital computer systems must also provide a means of interfacing with the user. Described here is the new graphical interface capability of the hybrid automated reliability predictor (HARP), a software package that implements advanced reliability modeling techniques. The graphics oriented (GO) module provides the user with a graphical language for modeling system failure modes through the selection of various fault-tree gates, including sequence-dependency gates, or by a Markov chain. By using this graphical input language, a fault tree becomes a convenient notation for describing a system. In accounting for any sequence dependencies, HARP converts the fault-tree notation to a complex stochastic process that is reduced to a Markov chain, which it can then solve for system reliability. The graphics capability is available for use on an IBM-compatible PC, a Sun, and a VAX workstation. The GO module is written in the C programming language and uses the Graphical Kernel System (GKS) standard for graphics implementation. The PC, VAX, and Sun versions of the HARP GO module are currently in beta-testing stages.
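To make the step from a fault tree to a Markov chain concrete, the sketch below solves the simplest possible case: two redundant components with a constant failure rate, where the system fails only when both have failed (a single AND gate). This is not HARP's algorithm and models no sequence dependencies; the function names, the failure rate and the mission time are all assumptions chosen for illustration. The chain has states "2 working", "1 working" and "failed", and the numerical solution is checked against the closed form 2e^(-λt) - e^(-2λt).

```python
import math

def reliability_markov(lam, t, steps=1000):
    """Integrate the chain 2 -> 1 -> failed with classical RK4.

    lam is the per-component failure rate; the return value is the
    probability that the system has not failed by time t.
    """
    def deriv(p2, p1):
        # dP2/dt = -2*lam*P2 ; dP1/dt = 2*lam*P2 - lam*P1
        return -2.0 * lam * p2, 2.0 * lam * p2 - lam * p1

    p2, p1 = 1.0, 0.0          # start with both components working
    h = t / steps
    for _ in range(steps):
        a2, a1 = deriv(p2, p1)
        b2, b1 = deriv(p2 + 0.5 * h * a2, p1 + 0.5 * h * a1)
        c2, c1 = deriv(p2 + 0.5 * h * b2, p1 + 0.5 * h * b1)
        d2, d1 = deriv(p2 + h * c2, p1 + h * c1)
        p2 += h * (a2 + 2 * b2 + 2 * c2 + d2) / 6.0
        p1 += h * (a1 + 2 * b1 + 2 * c1 + d1) / 6.0
    return p2 + p1

def reliability_closed_form(lam, t):
    """Analytic reliability of the same two-component parallel system."""
    return 2.0 * math.exp(-lam * t) - math.exp(-2.0 * lam * t)

if __name__ == "__main__":
    lam, t = 1e-3, 1000.0      # assumed failure rate (per hour) and mission time
    print(reliability_markov(lam, t), reliability_closed_form(lam, t))
```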
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
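The trade-off described in this abstract can be illustrated with two small calculations. The sketch below is not from the paper: the function names are hypothetical, the codon counts are those of the standard genetic code, and the chance-match estimate uses the naive model of a random, equiprobable sequence, which the abstract itself says underestimates the true frequency in real gene banks.

```python
# Number of codons per amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}

def probe_mixture_size(peptide):
    """Count the oligonucleotides needed to cover every codon combination."""
    size = 1
    for residue in peptide.upper():
        size *= CODON_COUNTS[residue]
    return size

def expected_chance_matches(probe_length, library_bp):
    """Expected exact matches on one strand of a random, equiprobable sequence."""
    return library_bp * 0.25 ** probe_length

if __name__ == "__main__":
    print(probe_mixture_size("MWKNE"))          # low-degeneracy peptide
    print(probe_mixture_size("LRS"))            # highly degenerate peptide
    print(expected_chance_matches(17, 3e9))     # assumed 17-mer, 3 Gb library
```

Peptides rich in Met and Trp keep the mixture small, whereas Leu, Arg and Ser inflate it quickly, which is one reason a single longer probe can be preferable.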
Cloning the Gravity and Shear Stress Related Genes from MG-63 Cells by Subtracting Hybridization

NASA Astrophysics Data System (ADS)

Zhang, Shu; Dai, Zhong-quan; Wang, Bing; Cao, Xin-sheng; Li, Ying-hui; Sun, Xi-qing

2008-06-01

Background: The purpose of the present study was to clone the gravity and shear stress related genes from osteoblast-like human osteosarcoma MG-63 cells by subtractive hybridization. Method: MG-63 cells were divided into two groups (1G group and simulated microgravity group). After being cultured for 60 h in the two different gravitational environments, both groups of MG-63 cells were treated with 1.5 Pa fluid shear stress (FSS) for 60 min. The total RNA in the cells was isolated. The gravity and shear stress related genes were cloned by subtractive hybridization. Result: 200 clones were obtained. 30 positive clones were selected by PCR using vector primers and sequenced. The obtained sequences were analyzed by BLAST. Changes in 17 sequences were confirmed by RT-PCR; these genes are related to cell proliferation, cell differentiation, protein synthesis, signal transduction and apoptosis. 5 unknown genes related to gravity and shear stress were found. Conclusion: In this part of our study, our results indicate that simulated microgravity may change the activities of MG-63 cells by inducing functional alterations of specific genes.
Two-color, 30 second microwave-accelerated Metal-Enhanced Fluorescence DNA assays: a new Rapid Catch and Signal (RCS) technology.

PubMed

Dragan, Anatoliy I; Golberg, Karina; Elbaz, Amit; Marks, Robert; Zhang, Yongxia; Geddes, Chris D

2011-03-07

For analyses of DNA fragment sequences in solution we introduce a 2-color DNA assay, utilizing a combination of the Metal-Enhanced Fluorescence (MEF) effect and microwave-accelerated DNA hybridization. The assay is based on a new "Catch and Signal" technology, i.e. the simultaneous specific recognition of two target DNA sequences in one well by complementary anchor-ssDNAs, attached to silver island films (SiFs). It is shown that fluorescent labels (Alexa 488 and Alexa 594), covalently attached to ssDNA fragments, play the role of biosensor recognition probes, demonstrating strong response upon DNA hybridization, locating fluorophores in close proximity to silver NPs, which is ideal for MEF. Subsequently the emission dramatically increases, while the excited state lifetime decreases. It is also shown that 30s microwave irradiation of wells, containing DNA molecules, considerably (~1000-fold) speeds up the highly selective hybridization of DNA fragments at ambient temperature. The 2-color "Catch and Signal" DNA assay platform can radically expedite quantitative analysis of genome DNA sequences, creating a simple and fast bio-medical platform for nucleic acid analysis. Copyright © 2010 Elsevier B.V. All rights reserved.
AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries.

PubMed

Hsu, Arthur L; Kondrashova, Olga; Lunke, Sebastian; Love, Clare J; Meldrum, Cliff; Marquis-Nicholson, Renate; Corboy, Greg; Pham, Kym; Wakefield, Matthew; Waring, Paul M; Taylor, Graham R

2015-04-01

Conventional means of identifying variants in high-throughput sequencing align each read against a reference sequence, and then call variants at each position. Here, we demonstrate an orthogonal means of identifying sequence variation by grouping the reads as amplicons prior to any alignment. We used AmpliVar to make key-value hashes of sequence reads and group reads as individual amplicons using a table of flanking sequences. Low-abundance reads were removed according to a selectable threshold, and reads above this threshold were aligned as groups, rather than as individual reads, permitting the use of sensitive alignment tools. We show that this approach is more sensitive, more specific, and more computationally efficient than comparable methods for the analysis of amplicon-based high-throughput sequencing data. The method can be extended to enable alignment-free confirmation of variants seen in hybridization capture target-enrichment data. © 2015 WILEY PERIODICALS, INC.
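The grouping step described in this abstract can be sketched as follows. This is not AmpliVar's code: the function name, the data layout and the exact-match rule on flanking sequences are assumptions made for illustration. Reads are first collapsed into a key-value hash (sequence to count), low-abundance sequences are dropped, and the survivors are assigned to amplicons by their flanks before any alignment takes place.

```python
from collections import Counter, defaultdict

def group_reads_by_amplicon(reads, flanks, min_count=2):
    """Group identical reads into amplicons using flanking sequences.

    reads  : iterable of read sequences (str)
    flanks : {amplicon_name: (left_flank, right_flank)}  (hypothetical table)
    Returns {amplicon_name: {read_sequence: count}}.
    """
    counts = Counter(reads)                 # key-value hash: sequence -> abundance
    groups = defaultdict(dict)
    for seq, n in counts.items():
        if n < min_count:                   # selectable low-abundance threshold
            continue
        for name, (left, right) in flanks.items():
            # Assumption: a read spans the whole amplicon, flank to flank.
            if seq.startswith(left) and seq.endswith(right):
                groups[name][seq] = n
                break
    return dict(groups)

if __name__ == "__main__":
    reads = ["AAACCCGGG"] * 3 + ["AAACTCGGG"] * 2 + ["AAATTTGGG"] + ["TTTACGCCC"] * 2
    flanks = {"amp1": ("AAA", "GGG"), "amp2": ("TTT", "CCC")}
    print(group_reads_by_amplicon(reads, flanks))
```

Each surviving group holds only a handful of distinct sequences, so a slower, more sensitive aligner can be run once per distinct sequence rather than once per read.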
Competitive hybridization models

NASA Astrophysics Data System (ADS)

Cherepinsky, Vera; Hashmi, Ghazala; Mishra, Bud

2010-11-01

Microarray technology, in its simplest form, allows one to gather abundance data for target DNA molecules, associated with genomes or gene-expressions, and relies on hybridizing the target to many short probe oligonucleotides arrayed on a surface. While for such multiplexed reactions conditions are optimized to make the most of each individual probe-target interaction, subsequent analysis of these experiments is based on the implicit assumption that a given experiment yields the same result regardless of whether it was conducted in isolation or in parallel with many others. It has been discussed in the literature that this assumption is frequently false, and its validity depends on the types of probes and their interactions with each other. We present a detailed physical model of hybridization as a means of understanding probe interactions in a multiplexed reaction. Ultimately, the model can be derived from a system of ordinary differential equations (ODE’s) describing kinetic mass action with conservation-of-mass equations completing the system. We examine pairwise probe interactions in detail and present a model of “competition” between the probes for the target—especially, when the target is effectively in short supply. These effects are shown to be predictable from the affinity constants for each of the four probe sequences involved, namely, the match and mismatch sequences for both probes. These affinity constants are calculated from the thermodynamic parameters such as the free energy of hybridization, which are in turn computed according to the nearest neighbor (NN) model for each probe and target sequence. Simulations based on the competitive hybridization model explain the observed variability in the signal of a given probe when measured in parallel with different groupings of other probes or individually. 
The results of the simulations can be used for experiment design and pooling strategies, based on which probes have been shown to have a strong effect on each other’s signal in the in silico experiment. These results are aimed at better design of multiplexed reactions on arrays used in genotyping (e.g., HLA typing, SNP, or CNV detection, etc.) and mutation analysis (e.g., cystic fibrosis, cancer, autism, etc.).
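A much-reduced equilibrium version of the competition effect can be sketched as follows. It is not the authors' model, which tracks match and mismatch sequences for both probes through a system of ODEs; here each probe binds the target with a single affinity constant K = exp(-ΔG/RT), and mass conservation is solved for the free target concentration by bisection. The function names, the temperature and the concentrations are assumptions for illustration.

```python
import math

R_KCAL = 1.987e-3   # gas constant in kcal/(mol*K)

def affinity_constant(delta_g, temp_k=310.15):
    """Affinity constant from a hybridization free energy in kcal/mol."""
    return math.exp(-delta_g / (R_KCAL * temp_k))

def bound_fractions(k_list, probe_totals, target_total, iters=200):
    """Fraction of each probe occupied at equilibrium.

    Solves t + sum_i P_i*K_i*t/(1 + K_i*t) = T_total for the free target t
    by bisection; the left-hand side increases monotonically with t.
    """
    def total_target(t):
        return t + sum(p * k * t / (1.0 + k * t) for k, p in zip(k_list, probe_totals))

    lo, hi = 0.0, target_total
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if total_target(mid) > target_total:
            hi = mid
        else:
            lo = mid
    t = 0.5 * (lo + hi)
    return [k * t / (1.0 + k * t) for k in k_list]

if __name__ == "__main__":
    k = 1e9                                   # assumed affinity, 1/M
    print(bound_fractions([k], [1e-9], 1e-9))             # probe measured alone
    print(bound_fractions([k, k], [1e-9, 1e-9], 1e-9))    # same probe with a competitor
```

With the target in short supply, the occupancy of the first probe falls when an equally strong competitor is added, which is the qualitative effect the abstract describes for signals measured in different probe groupings.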
A Hybrid Parallel Strategy Based on String Graph Theory to Improve De Novo DNA Assembly on the TianHe-2 Supercomputer.

PubMed

Zhang, Feng; Liao, Xiangke; Peng, Shaoliang; Cui, Yingbo; Wang, Bingqiang; Zhu, Xiaoqian; Liu, Jie

2016-06-01

The de novo assembly of DNA sequences is increasingly important for biological research in the genomic era. More than a decade after the Human Genome Project, some challenges still exist and new solutions are being explored to improve de novo assembly of genomes. String graph assembler (SGA), based on string graph theory, is a new method/tool developed to address these challenges. In this paper, based on an in-depth analysis of SGA, we prove that SGA-based de novo sequence assembly is an NP-complete problem. According to our analysis, SGA outperforms other similar methods/tools in memory consumption, but costs much more time, of which 60-70 % is spent on index construction. Building on this analysis, we introduce a hybrid parallel optimization algorithm and implement it in the TianHe-2's parallel framework. Simulations are performed with different datasets. For small datasets the optimized solution is 3.06 times faster than before, and for medium-sized datasets it is 1.60 times faster. The results demonstrate an evident performance improvement, with linear scalability for parallel FM-index construction. These results thus contribute significantly to improving the efficiency of de novo assembly of DNA sequences.
Solid phase sequencing of double-stranded nucleic acids

DOEpatents

Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

2002-01-01

This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
OPAL: prediction of MoRF regions in intrinsically disordered protein sequences.

PubMed

Sharma, Ronesh; Raicar, Gaurav; Tsunoda, Tatsuhiko; Patil, Ashwini; Sharma, Alok

2018-06-01

Intrinsically disordered proteins lack stable 3-dimensional structure and play a crucial role in performing various biological functions. Key to their biological function are the molecular recognition features (MoRFs) located within long disordered regions. Computationally identifying these MoRFs from disordered protein sequences is a challenging task. In this study, we present a new MoRF predictor, OPAL, to identify MoRFs in disordered protein sequences. OPAL utilizes two independent sources of information computed using different component predictors. The scores are processed and combined using a common averaging method. The first score is computed using a component MoRF predictor which utilizes composition and sequence similarity of MoRF and non-MoRF regions to detect MoRFs. The second score is calculated using half-sphere exposure (HSE), solvent accessible surface area (ASA) and backbone angle information of the disordered protein sequence, using information from the amino acid properties of flanks surrounding the MoRFs to distinguish MoRF and non-MoRF residues. OPAL is evaluated using test sets that were previously used to evaluate the MoRF predictors MoRFpred, MoRFchibi and MoRFchibi-web. The results demonstrate that OPAL outperforms all the available MoRF predictors and is the most accurate predictor available for MoRF prediction. It is available at http://www.alok-ai-lab.com/tools/opal/. Contact: ashwini@hgc.jp or alok.sharma@griffith.edu.au. Supplementary data are available at Bioinformatics online.
Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

PubMed

Cech, Jennifer N; Peichel, Catherine L

2015-12-01

Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chromatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
Sequenced RAPD markers to detect hybridization in the barbary partridge (Alectoris barbara, Phasianidae).

PubMed

Barbanera, Filippo; Guerrini, Monica; Bertoncini, Franco; Cappelli, Fabio; Muzzeddu, Marco; Dini, Fernando

2011-01-01

In the Alectoris partridges (Phasianidae), hybridization occurs occasionally as a result of the natural breakdown of isolating mechanisms but more frequently as a result of human activity. No genetic record of hybridization is known for the barbary partridge (A. barbara). This species is distributed mostly in North Africa and, in Europe, on the island of Sardinia (Italy) and on Gibraltar. The risk of hybridization between barbary and red-legged partridge (A. rufa: Iberian Peninsula, France, Italy) is high in Sardinia and in Spain. We developed two random amplified polymorphic DNA (RAPD) markers to detect A. barbara × A. rufa hybrid partridges. We tested them on 125 experimental hybrids, sequenced the relative species-specific bands and found that the bands and their corresponding sequences were reliably transmitted through a number of generations (F1, F2, F3, BC1, BC2). Our markers represent a highly valuable tool for the preservation of the A. barbara genome from the pressing threat of A. rufa pollution. © 2010 Blackwell Publishing Ltd.
A Data-Driven Approach to Develop Physically Sound Predictors: Application to Depth-Averaged Velocities and Drag Coefficients on Vegetated Flows

NASA Astrophysics Data System (ADS)

Tinoco, R. O.; Goldstein, E. B.; Coco, G.

2016-12-01

We use a machine learning approach to seek accurate, physically sound predictors, to estimate two relevant flow parameters for open-channel vegetated flows: mean velocities and drag coefficients. A genetic programming algorithm is used to find a robust relationship between properties of the vegetation and flow parameters. We use data published from several laboratory experiments covering a broad range of conditions to obtain: a) in the case of mean flow, an equation that matches the accuracy of other predictors from recent literature while showing a less complex structure, and b) for drag coefficients, a predictor that relies on both single element and array parameters. We investigate different criteria for dataset size and data selection to evaluate their impact on the resulting predictor, as well as simple strategies to obtain only dimensionally consistent equations, and avoid the need for dimensional coefficients. The results show that a proper methodology can deliver physically sound models representative of the processes involved, such that genetic programming and machine learning techniques can be used as powerful tools to study complicated phenomena and develop not only purely empirical, but "hybrid" models, coupling results from machine learning methodologies into physics-based models.
A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

PubMed

Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

2018-01-01

To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd.
In situ hybridization in paracoccidioidomycosis.

PubMed

De Brito, T; Sandhu, G S; Kline, B C; Aleff, R A; Sandoval, M P; Santos, R T; Brandão, A A; Lacaz, C S

1999-06-01

In situ hybridization (ISH) was performed using oral biopsies from patients with paracoccidioidomycosis and guinea pig testes inoculated with a culture of Paracoccidioides brasiliensis isolated from soil, employing both a 14 base-pair specific oligoprobe (ACT CCC CCG TGG TC) and its complementary sequence. When combining ISH with the Gridley stain which detects fungal cell walls, about 2-3% of the fungal cells present in the tissues were labelled. When the complementary probe was used, labelling was higher, reaching the 3% level.
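As a small illustration of what a "complementary sequence" means for the 14-base oligoprobe quoted above, the sketch below computes the reverse complement of the probe. It assumes the usual convention of writing both strands 5' to 3'; the abstract does not say how the authors wrote their complementary probe, and the function name `reverse_complement` is illustrative only.

```python
# Watson-Crick pairing table for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (spaces are ignored)."""
    cleaned = seq.replace(" ", "").upper()
    # Complement every base, then reverse so the result also reads 5' -> 3'.
    return cleaned.translate(COMPLEMENT)[::-1]


probe = "ACT CCC CCG TGG TC"  # probe sequence as printed in the abstract
print(len(probe.replace(" ", "")), reverse_complement(probe))
```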
Definition of Eight Mulberry Species in the Genus Morus by Internal Transcribed Spacer-Based Phylogeny

PubMed Central

Zeng, Qiwei; Chen, Hongyu; Zhang, Chao; Han, Minjing; Li, Tian; Qi, Xiwu; Xiang, Zhonghuai; He, Ningjia

2015-01-01

Mulberry, belonging to the order Rosales, family Moraceae, and genus Morus, has received attention because of both its economic and medicinal value, as well as for its important ecological function. The genus Morus has a worldwide distribution; however, its taxonomy remains complex and disputed. Many studies have attempted to classify Morus species, resulting in varied numbers of designated Morus spp. To address this issue, we used information from internal transcribed spacer (ITS) genetic sequences to study the taxonomy of all the members of the generally accepted genus Morus. We found that intraspecific 5.8S rRNA sequences were identical but that interspecific 5.8S sequences were diverse. M. alba and M. notabilis showed the shortest (215 bp) and the longest (233 bp) ITS1 sequence length, respectively. With the completion of the mulberry genome, we could identify single nucleotide polymorphisms within the ITS locus in the M. notabilis genome. From reconstruction of a phylogenetic tree based on the complete ITS data, we propose that the Morus genus should be classified into eight species, including M. alba, M. nigra, M. notabilis, M. serrata, M. celtidifolia, M. insignis, M. rubra, and M. mesozygia. Furthermore, the classification of the ITS sequences of known interspecific hybrid clones into both paternal and maternal clades indicated that ITS variation was sufficient to distinguish interspecific hybrids in the genus Morus. PMID:26266951
Market surveillance on non-halal additives incorporated in surimi based products using polymerase chain reaction (PCR)-southern hybridization analysis

NASA Astrophysics Data System (ADS)

Aravindran, S.; Sahilah, A. M.; Aminah, A.

2014-09-01

Halal surveillance of ingredients incorporated in surimi-based products was carried out using polymerase chain reaction (PCR)-Southern hybridization on-chip analysis. The primers used in this technique targeted the cytochrome b (cyt b) gene sequence of mitochondrial DNA (mtDNA), which is able to differentiate 7 species (beef, chicken, duck, goat, buffalo, lamb and pork) on a single chip. Seventeen (n = 17*3) different brands of surimi-based product were purchased randomly from the Selangor local market in January 2013. Of the 17 brands, 3 (n = 3*3) were positive for chicken DNA, 1 (n = 1*3) was positive for goat DNA, and in the remaining 13 brands (n = 13*3) no species DNA was detected. The sensitivity of the PCR-Southern hybridization primers for detecting each meat species was 0.1 ng. The present study provides evidence that PCR-Southern hybridization analysis offers a reliable result, owing to its highly specific and sensitive properties, in detecting non-halal additives such as plasma protein incorporated in surimi-based products.
5-bp Classical Satellite DNA Loci from Chromosome-1 Instability in Cervical Neoplasia Detected by DNA Breakage Detection/Fluorescence in Situ Hybridization (DBD-FISH).

PubMed

Cortés-Gutiérrez, Elva I; Ortíz-Hernández, Brenda L; Dávila-Rodríguez, Martha I; Cerda-Flores, Ricardo M; Fernández, José Luis; López-Fernández, Carmen; Gosálvez, Jaime

2013-02-19

We aimed to evaluate the association between the progressive stages of cervical neoplasia and DNA damage in 5-bp classical satellite DNA sequences from chromosome-1 in cervical epithelium and in peripheral blood lymphocytes using DNA breakage detection/fluorescence in situ hybridization (DBD-FISH). A hospital-based unmatched case-control study was conducted in 2011 with a sample of 30 women grouped according to disease stage and selected according to histological diagnosis; 10 with low-grade squamous intraepithelial lesions (LG-SIL), 10 with high-grade SIL (HG-SIL), and 10 with no cervical lesions, from the Unidad Medica de Alta Especialidad of The Mexican Social Security Institute, IMSS, Mexico. Specific chromosome damage levels in 5-bp classical satellite DNA sequences from chromosome-1 were evaluated in cervical epithelium and peripheral blood lymphocytes using the DBD-FISH technique. Whole-genome DNA hybridization was used as a reference for the level of damage. Results of Kruskal-Wallis test showed a significant increase according to neoplastic development in both tissues. The instability of 5-bp classical satellite DNA sequences from chromosome-1 was evidenced using chromosome-orientation FISH. In conclusion, we suggest that the progression to malignant transformation involves an increase in the instability of 5-bp classical satellite DNA sequences from chromosome-1.
A new DPYD genotyping assay for improving the safety of 5-fluorouracil therapy.

PubMed

Sistonen, Johanna; Smith, Chingying; Fu, Yung-Kang; Largiadèr, Carlo R

2012-12-24

Chemotherapeutic use of 5-fluorouracil (5FU) is compromised by 10-20% of patients developing severe toxicity. Recently described genetic variation in dihydropyrimidine dehydrogenase (DPYD) has been shown to be a major predictor of 5FU toxicity. Here, we describe a new genotyping assay for routine clinical use that covers all the major DPYD risk variants. Genomic regions targeting DPYD risk variants (c.1129-5923C>G, c.1679T>G/A, c.1905+1G>A, c.2846A>T) and additional markers (c.234-123G>C, c.496A>G, c.775A>G) were amplified in a multiplex PCR reaction. The subsequent steps including allele-specific primer extension, hybridization of the primers to a microarray, scanning of the array, and data analysis were automated within the INFINITI® Analyzer (AutoGenomics). The assay was validated by analyzing 107 blood samples obtained from patients previously re-sequenced for the DPYD. The genotypes obtained with the developed assay were 100% concordant with the re-sequencing. The procedure is suitable for routine clinical use since the results are obtained within one day. For heterozygous risk variant carriers (~7% of Europeans), the treatment can be adjusted by 5FU dose reduction, whereas carriers of two risk alleles should be treated with an alternative therapy. The developed assay provides a novel tool to improve the safety of commonly used 5FU-based chemotherapies. Copyright © 2012 Elsevier B.V. All rights reserved.
Modular probes for enriching and detecting complex nucleic acid sequences

NASA Astrophysics Data System (ADS)

Wang, Juexiao Sherry; Yan, Yan Helen; Zhang, David Yu

2017-12-01

Complex DNA sequences are difficult to detect and profile, but are important contributors to human health and disease. Existing hybridization probes lack the capability to selectively bind and enrich hypervariable, long or repetitive sequences. Here, we present a generalized strategy for constructing modular hybridization probes (M-Probes) that overcomes these challenges. We demonstrate that M-Probes can tolerate sequence variations of up to 7 nt at prescribed positions while maintaining single nucleotide sensitivity at other positions. M-Probes are also shown to be capable of sequence-selectively binding a continuous DNA sequence of more than 500 nt. Furthermore, we show that M-Probes can detect genes with triplet repeats exceeding a programmed threshold. As a demonstration of this technology, we have developed a hybrid capture method to determine the exact triplet repeat expansion number in the Huntington's gene of genomic DNA using quantitative PCR.

Genetic analysis of tumorigenesis: XXXII. Localization of constitutionally amplified KRAS sequences to Chinese hamster chromosomes X and Y by in situ hybridization.

PubMed

Stenman, G; Anisowicz, A; Sager, R

1988-11-01

The KRAS gene is constitutionally amplified in the Chinese hamster. We have mapped the amplified sequences by in situ hybridization to two major sites on the X and Y chromosomes, Xq4 and Yp2. No autosomal site was detected despite a search under relaxed hybridization conditions. KRAS DNA is amplified about 50-fold compared to a human cell line known to have a diploid number of KRAS sequences, whereas mRNA expression is 5- to 10-fold lower than in normal human cells. While mRNA expression levels do not necessarily parallel gene copy number, the low expression level strongly suggests that the amplified sequences are transcriptionally silent. It is suggested that the amplified sequences arose from the original KRAS gene on chromosome 8 and that the KRAS sequences on the Y chromosome arose by X-Y recombination.
Hybrids of Nucleic Acids and Carbon Nanotubes for Nanobiotechnology

PubMed Central

Umemura, Kazuo

2015-01-01

Recent progress in the combination of nucleic acids and carbon nanotubes (CNTs) has been briefly reviewed here. Since discovering the hybridization phenomenon of DNA molecules and CNTs in 2003, a large amount of fundamental and applied research has been carried out. Among thousands of papers published since 2003, approximately 240 papers focused on biological applications were selected and categorized based on the types of nucleic acids used, but not the types of CNTs. This survey revealed that the hybridization phenomenon is strongly affected by various factors, such as DNA sequences, and for this reason, fundamental studies on the hybridization phenomenon are important. Additionally, many research groups have proposed numerous practical applications, such as nanobiosensors. The goal of this review is to provide perspective on biological applications using hybrids of nucleic acids and CNTs. PMID:28347014
Physico-chemical foundations underpinning microarray and next-generation sequencing experiments

PubMed Central

Harrison, Andrew; Binder, Hans; Buhot, Arnaud; Burden, Conrad J.; Carlon, Enrico; Gibas, Cynthia; Gamble, Lara J.; Halperin, Avraham; Hooyberghs, Jef; Kreil, David P.; Levicky, Rastislav; Noble, Peter A.; Ott, Albrecht; Pettitt, B. Montgomery; Tautz, Diethard; Pozhitkov, Alexander E.

2013-01-01

Hybridization of nucleic acids on solid surfaces is a key process involved in high-throughput technologies such as microarrays and, in some cases, next-generation sequencing (NGS). A physical understanding of the hybridization process helps to determine the accuracy of these technologies. The goal of a widespread research program is to develop reliable transformations between the raw signals reported by the technologies and individual molecular concentrations from an ensemble of nucleic acids. This research has inputs from many areas, from bioinformatics and biostatistics, to theoretical and experimental biochemistry and biophysics, to computer simulations. A group of leading researchers met in Ploen Germany in 2011 to discuss present knowledge and limitations of our physico-chemical understanding of high-throughput nucleic acid technologies. This meeting inspired us to write this summary, which provides an overview of the state-of-the-art approaches based on physico-chemical foundation to modeling of the nucleic acids hybridization process on solid surfaces. In addition, practical application of current knowledge is emphasized. PMID:23307556
Continuously Tunable Nucleic Acid Hybridization Probes

PubMed Central

Wu, Lucia R.; Wang, J. Sherry; Fang, John Z.; Reiser, Emily; Pinto, Alessandro; Pekker, Irena; Boykin, Richard; Ngouenet, Celine; Webster, Philippa J.; Beechem, Joseph; Zhang, David Yu

2015-01-01

In silico designed nucleic acid probes and primers often fail to achieve favorable specificity and sensitivity tradeoffs on the first try, and iterative empirical sequence-based optimization is needed, particularly in multiplexed assays. Here, we present a novel, on-the-fly method of tuning probe affinity and selectivity via the stoichiometry of auxiliary species, allowing independent and decoupled adjustment of hybridization yield for different probes in multiplexed assays. Using this method, we achieve near-continuous tuning of probe effective free energy (0.03 kcal·mol−1 granularity). As applications, we enforced uniform capture efficiency of 31 DNA molecules (GC content 0% – 100%), maximized signal difference for 11 pairs of single nucleotide variants, and performed tunable hybrid-capture of mRNA from total RNA. Using the Nanostring nCounter platform, we applied stoichiometric tuning to simultaneously adjust yields for a 24-plex assay, and we show multiplexed quantitation of RNA sequences and variants from formalin-fixed, paraffin-embedded samples (FFPE). PMID:26480474
Microbial identification by immunohybridization assay of artificial RNA labels

NASA Technical Reports Server (NTRS)

Kourentzi, Katerina D.; Fox, George E.; Willson, Richard C.

2002-01-01

Ribosomal RNA (rRNA) and engineered stable artificial RNAs (aRNAs) are frequently used to monitor bacteria in complex ecosystems. In this work, we describe a solid-phase immunocapture hybridization assay that can be used with low molecular weight RNA targets. A biotinylated DNA probe is efficiently hybridized in solution with the target RNA, and the DNA-RNA hybrids are captured on streptavidin-coated plates and quantified using a DNA-RNA heteroduplex-specific antibody conjugated to alkaline phosphatase. The assay was shown to be specific for both 5S rRNA and low molecular weight (LMW) artificial RNAs and highly sensitive, allowing detection of as little as 5.2 ng (0.15 pmol) in the case of 5S rRNA. Target RNAs were readily detected even in the presence of excess nontarget RNA. Detection using DNA probes as small as 17 bases targeting a repetitive artificial RNA sequence in an engineered RNA was more efficient than the detection of a unique sequence.
Sequencing small genomic targets with high efficiency and extreme accuracy

PubMed Central

Schmitt, Michael W.; Fox, Edward J.; Prindle, Marc J.; Reid-Bayliss, Kate S.; True, Lawrence D.; Radich, Jerald P.; Loeb, Lawrence A.

2015-01-01

The detection of minority variants in mixed samples demands methods for enrichment and accurate sequencing of small genomic intervals. We describe an efficient approach based on sequential rounds of hybridization with biotinylated oligonucleotides, enabling more than one-million fold enrichment of genomic regions of interest. In conjunction with error correcting double-stranded molecular tags, our approach enables the quantification of mutations in individual DNA molecules. PMID:25849638
Sequencing and Analyzing the "t" (1;7) Reciprocal Translocation Breakpoints Associated with a Case of Childhood-Onset Schizophrenia/Autistic Disorder

ERIC Educational Resources Information Center

Idol, Jacquelyn R.; Addington, Anjene M.; Long, Robert T.; Rapoport, Judith L.; Green, Eric D.

2008-01-01

We characterized a "t"(1;7)(p22;q21) reciprocal translocation in a patient with childhood-onset schizophrenia (COS) and autism using genome mapping and sequencing methods. Based on genomic maps of human chromosome 7 and fluorescence in situ hybridization (FISH) studies, we delimited the region of 7q21 harboring the translocation breakpoint to a…
Hybridization-based antibody cDNA recovery for the production of recombinant antibodies identified by repertoire sequencing.

PubMed

Valdés-Alemán, Javier; Téllez-Sosa, Juan; Ovilla-Muñoz, Marbella; Godoy-Lozano, Elizabeth; Velázquez-Ramírez, Daniel; Valdovinos-Torres, Humberto; Gómez-Barreto, Rosa E; Martinez-Barnetche, Jesús

2014-01-01

High-throughput sequencing of the antibody repertoire is enabling a thorough analysis of B cell diversity and clonal selection, which may improve the novel antibody discovery process. Theoretically, an adequate bioinformatic analysis could allow identification of candidate antigen-specific antibodies, requiring their recombinant production for experimental validation of their specificity. Gene synthesis is commonly used for the generation of recombinant antibodies identified in silico. Novel strategies that bypass gene synthesis could offer more accessible antibody identification and validation alternatives. We developed a hybridization-based recovery strategy that targets the complementarity-determining region 3 (CDRH3) for the enrichment of cDNA of candidate antigen-specific antibody sequences. Ten clonal groups of interest were identified through bioinformatic analysis of the heavy chain antibody repertoire of mice immunized with hen egg white lysozyme (HEL). cDNA from eight of the targeted clonal groups was recovered efficiently, leading to the generation of recombinant antibodies. One representative heavy chain sequence from each clonal group recovered was paired with previously reported anti-HEL light chains to generate full antibodies, later tested for HEL-binding capacity. The recovery process proposed represents a simple and scalable molecular strategy that could enhance antibody identification and specificity assessment, enabling a more cost-efficient generation of recombinant antibodies.
Long-range prediction of Indian summer monsoon rainfall using data mining and statistical approaches

NASA Astrophysics Data System (ADS)

H, Vathsala; Koolagudi, Shashidhar G.

2017-10-01

This paper presents a hybrid model to better predict Indian summer monsoon rainfall. The algorithm considers suitable techniques for processing dense datasets. The proposed three-step algorithm comprises closed itemset generation-based association rule mining for feature selection, cluster membership for dimensionality reduction, and simple logistic function for prediction. The application of predicting rainfall into flood, excess, normal, deficit, and drought based on 36 predictors consisting of land and ocean variables is presented. Results show good accuracy in the considered study period of 37 years (1969-2005).
Molecular characterization of an ependymin precursor from goldfish brain.

PubMed

Königstorfer, A; Sterrer, S; Eckerskorn, C; Lottspeich, F; Schmidt, R; Hoffmann, W

1989-01-01

Ependymins are thought to be implicated in fundamental processes involved in plasticity of the goldfish CNS. Gas-phase sequencing of purified ependymins beta and gamma revealed that they share the same N-terminal sequence. Each sequence displays microheterogeneities at several positions. Based on the protein sequences obtained, we constructed synthetic oligonucleotides and used them as hybridization probes for screening cDNA libraries of goldfish brain. In this article we describe the full-length sequence of a mRNA encoding a precursor of ependymins. A cleavable signal sequence characteristic of secretory proteins is located at the N-terminal end, followed directly by the ependymin sequence. Also, two potential N-glycosylation sites were detected. A computer search revealed that ependymins form a novel family of unique proteins.
Radiation hybrid maps of D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes

USDA-ARS's Scientific Manuscript database

The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high-resolution genome maps saturated with ordered markers to assist in anchoring and orienting BAC contigs/ sequence scaffolds for whole genome sequence assembly. Radiation hybrid (RH) mapping has proven to be an e...
FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.

PubMed

El-Manzalawy, Yasser; Abbas, Mostafa; Malluhi, Qutaibah; Honavar, Vasant

2016-01-01

A wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses are mediated by RNA-protein interactions. However, experimental determination of the structures of protein-RNA complexes is expensive and technically challenging. Hence, a number of computational tools have been developed for predicting protein-RNA interfaces. Some of the state-of-the-art protein-RNA interface predictors rely on position-specific scoring matrix (PSSM)-based encoding of the protein sequences. The computational effort needed for generating PSSMs severely limits the practical utility of protein-RNA interface prediction servers. In this work, we experiment with two approaches, random sampling and sequence similarity reduction, for extracting a representative reference database of protein sequences from more than 50 million protein sequences in UniRef100. Our results suggest that randomly sampled databases produce better PSSM profiles (in terms of the number of hits used to generate the profile and the distance of the generated profile to the corresponding profile generated using the entire UniRef100 data as well as the accuracy of the machine learning classifier trained using these profiles). Based on our results, we developed FastRNABindR, an improved version of RNABindR for predicting protein-RNA interface residues using PSSM profiles generated using 1% of the UniRef100 sequences sampled uniformly at random. To the best of our knowledge, FastRNABindR is the only protein-RNA interface residue prediction online server that requires generation of PSSM profiles for query sequences and accepts hundreds of protein sequences per submission. Our approach for determining the optimal BLAST database for a protein-RNA interface residue classification task has the potential of substantially speeding up, and hence increasing the practical utility of, other amino acid sequence based predictors of protein-protein and protein-DNA interfaces.
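The random-sampling idea described above (keeping roughly 1% of the reference sequences, chosen uniformly at random) can be sketched as follows. This is a minimal illustration and not the FastRNABindR pipeline: the helper names `read_fasta` and `sample_records`, the fixed seed, and the toy input are all assumptions, and a real run would stream a UniRef100 FASTA file and then build a BLAST database from the output.

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def sample_records(records, fraction, seed=0):
    """Keep each record independently with probability `fraction`."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    return [rec for rec in records if rng.random() < fraction]


# Toy input standing in for a large sequence database (hypothetical entries).
toy = [">seq1", "MKT", "AYI", ">seq2", "GSH", ">seq3", "MLV"]
records = list(read_fasta(toy))
subset = sample_records(records, fraction=0.01)
print(len(records), len(subset))
```

Independent per-record sampling keeps memory use constant when streaming a large file, at the cost of the sample size being only approximately 1% of the input.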
Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model

NASA Astrophysics Data System (ADS)

Yeganeh, B.; Motlagh, M. Shafie Pour; Rashidi, Y.; Kamalan, H.

2012-08-01

Due to the health impacts caused by exposure to air pollutants in urban areas, monitoring and forecasting of air quality parameters have become an important topic in atmospheric and environmental research. Knowledge of the dynamics and complexity of air pollutant behavior has made artificial intelligence models a useful tool for more accurate pollutant concentration prediction. This paper focuses on an innovative method of daily air pollution prediction using a combination of Support Vector Machine (SVM) as predictor and Partial Least Square (PLS) as a data selection tool based on the measured values of CO concentrations. The CO concentrations of Rey monitoring station in the south of Tehran, from Jan. 2007 to Feb. 2011, have been used to test the effectiveness of this method. The hourly CO concentrations have been predicted using the SVM and the hybrid PLS-SVM models. Similarly, daily CO concentrations have been predicted based on the aforementioned four years of measured data. Results demonstrated that both models have good prediction ability; however, the hybrid PLS-SVM has better accuracy. In the analysis presented in this paper, statistical estimators including relative mean errors, root mean squared errors and the mean absolute relative error have been employed to compare the performance of the models. It has been concluded that the errors decrease after size reduction and that coefficients of determination increase from 56-81% for the SVM model to 65-85% for the hybrid PLS-SVM model. It was also found that the hybrid PLS-SVM model required less computational time than the SVM model, as expected, supporting the more accurate and faster prediction ability of the hybrid PLS-SVM model.
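The error measures named in the abstract above can be illustrated with a short sketch. The definitions below are the common textbook forms of root mean squared error, mean absolute relative error, and the coefficient of determination; the abstract does not give the authors' exact formulas, so these are assumptions, and the sample values are invented for illustration.

```python
import math


def rmse(observed, predicted):
    """Root mean squared error."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def mean_absolute_relative_error(observed, predicted):
    """Mean of |observed - predicted| / |observed| (observed values must be nonzero)."""
    n = len(observed)
    return sum(abs(o - p) / abs(o) for o, p in zip(observed, predicted)) / n


def r_squared(observed, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Invented CO concentrations (arbitrary units) for two hypothetical models.
obs = [2.0, 4.0, 3.0, 5.0]
model_a = [2.5, 3.0, 3.5, 4.0]
model_b = [2.1, 3.8, 3.2, 4.9]
for name, pred in (("A", model_a), ("B", model_b)):
    print(name, rmse(obs, pred), mean_absolute_relative_error(obs, pred), r_squared(obs, pred))
```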
Directional genomic hybridization for chromosomal inversion discovery and detection.

PubMed

Ray, F Andrew; Zimmerman, Erin; Robinson, Bruce; Cornforth, Michael N; Bedford, Joel S; Goodwin, Edwin H; Bailey, Susan M

2013-04-01

Chromosomal rearrangements are a source of structural variation within the genome that figure prominently in human disease, where the importance of translocations and deletions is well recognized. In principle, inversions (reversals in the orientation of DNA sequences within a chromosome) should have similar detrimental potential. However, the study of inversions has been hampered by traditional approaches used for their detection, which are not particularly robust. Even with significant advances in whole genome approaches, changes in the absolute orientation of DNA remain difficult to detect routinely. Consequently, our understanding of inversions is still surprisingly limited, as is our appreciation for their frequency and involvement in human disease. Here, we introduce the directional genomic hybridization methodology of chromatid painting (a whole new way of looking at structural features of the genome) that can be employed with high resolution on a cell-by-cell basis, and demonstrate its basic capabilities for genome-wide discovery and targeted detection of inversions. Bioinformatics enabled development of sequence- and strand-specific directional probe sets, which when coupled with single-stranded hybridization, greatly improved the resolution and ease of inversion detection. We highlight examples of the far-ranging applicability of this cytogenomics-based approach, which include confirmation of the alignment of the human genome database and evidence that individuals themselves share similar sequence directionality, as well as use in comparative and evolutionary studies for any species whose genome has been sequenced. In addition to applications related to basic mechanistic studies, the information obtainable with strand-specific hybridization strategies may ultimately enable novel gene discovery, thereby benefitting the diagnosis and treatment of a variety of human disease states and disorders including cancer, autism, and idiopathic infertility.
From Environmental Sequences to Morphology: Observation and Characterisation of a Paulinellid Testate Amoeba (Micropyxidiella edaphonis gen. nov. sp. nov. Euglyphida, Paulinellidae) from Soil using Fluorescent in situ Hybridization.

PubMed

Tarnawski, Sonia-Estelle; Lara, Enrique

2015-05-01

High microbial diversity is revealed by environmental DNA surveys. However, nothing is known about the morphology and function of these potentially new organisms. In the course of an environmental soil diversity study, we found for the first time environmental sequences that reveal the presence of Paulinellidae (a mostly marine and marginally freshwater family of euglyphid testate amoebae) in samples of forest litter from different geographic origins. The new sequences form a basal, robust clade in the family. We used fluorescent in situ hybridization (FISH) to detect the organisms from which these sequences derived. We isolated the cells and documented them with light and scanning electron microscopy. Based on these observations, we described these organisms as Micropyxidiella edaphonis gen. nov. sp. nov. The organisms were very small testate amoebae (generally less than 10 μm) with an irregular proteinaceous test. This suggests an unknown diversity in testate amoebae, and calls for extending this type of investigation to other protist groups which are known only as environmental DNA sequences. Copyright © 2015 Elsevier GmbH. All rights reserved.
The use of molecular dynamics simulations to evaluate the DNA sequence-selectivity of G-A cross-linking PBD-duocarmycin dimers.

PubMed

Jackson, Paul J M; Rahman, Khondaker M; Thurston, David E

2017-01-01

The pyrrolobenzodiazepine (PBD) and duocarmycin families are DNA-interactive agents that covalently bond to guanine (G) and adenine (A) bases, respectively, and that have been joined together to create synthetic dimers capable of cross-linking G-G, A-A, and G-A bases. Three G-A alkylating dimers have been reported in publications to date, with defined DNA-binding sites proposed for two of them. In this study we have used molecular dynamics simulations to elucidate preferred DNA-binding sites for the three published molecular types. For the PBD-CPI dimer UTA-6026 (1), our simulations correctly predicted its favoured binding site (i.e., 5'-C(G)AATTA-3') as identified by DNA cleavage studies. However, for the PBD-CI molecule ('Compound 11', 3), we were unable to reconcile the results of our simulations with the reported preferred cross-linking sequence (5'-ATTTTCC(G)-3'). We found that the molecule is too short to span the five base pairs between the A and G bases as claimed, but should target instead a sequence such as 5'-ATTTC(G)-3' with two fewer base pairs between the reacting G and A residues. Our simulation results for this hybrid dimer are also in accord with the very low interstrand cross-linking and in vitro cytotoxicity activities reported for it. Although a preferred cross-linking sequence was not reported for the third hybrid dimer ('27eS', 2), our simulations predict that it should span two base pairs between covalently reacting G and A bases (e.g., 5'-GTAT(A)-3'). Copyright © 2016. Published by Elsevier Ltd.
Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection.

PubMed

Ma, Xin; Guo, Jing; Sun, Xiao

2015-01-01

The prediction of RNA-binding proteins is one of the most challenging problems in computational biology. Although some studies have investigated this problem, the accuracy of prediction is still not sufficient. In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR) method, followed by incremental feature selection (IFS). We incorporated conjoint triad features and three novel features: binding propensity (BP), nonbinding propensity (NBP), and evolutionary information combined with physicochemical properties (EIPP). The results showed that these novel features have important roles in improving the performance of the predictor. Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient). High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.
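For readers unfamiliar with the two performance figures quoted above, the sketch below shows how accuracy and the Matthews correlation coefficient (MCC) are computed from a binary confusion matrix. The formulas are the standard definitions; the confusion-matrix counts are invented for illustration and are not taken from the study.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient; returns 0.0 when it is undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts: binding proteins are the positive class.
tp, tn, fp, fn = 90, 80, 20, 10
print(accuracy(tp, tn, fp, fn), matthews_corrcoef(tp, tn, fp, fn))
```

Unlike accuracy, MCC stays informative when the two classes are of very different sizes, which is why both figures are often reported together.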
Statistical properties of filtered pseudorandom digital sequences formed from the sum of maximum-length sequences

NASA Technical Reports Server (NTRS)

Wallace, G. R.; Weathers, G. D.; Graf, E. R.

1973-01-01

The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.
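The construction of a hybrid-sum sequence described above, the modulo-two sum of several maximum-length sequences, can be sketched with two small linear feedback shift registers (LFSRs). The register lengths, tap positions, and seed states below are illustrative choices, not those of the report, and the filtering step and the statistical quality factor are not reproduced here.

```python
def lfsr_bits(taps, state, length):
    """Generate `length` output bits from a Fibonacci LFSR.

    taps: 1-indexed stages XORed together to form the feedback bit.
    state: initial register contents (stage 1 first); must not be all zeros.
    """
    reg = list(state)
    out = []
    for _ in range(length):
        out.append(reg[-1])          # output is taken from the last stage
        feedback = 0
        for t in taps:
            feedback ^= reg[t - 1]
        reg = [feedback] + reg[:-1]  # shift right, feedback enters stage 1
    return out


def hybrid_sum(sequences):
    """Bitwise modulo-two sum of equal-length bit sequences."""
    return [sum(bits) % 2 for bits in zip(*sequences)]


def period(bits):
    """Smallest shift p (up to half the length) under which the sequence repeats."""
    n = len(bits)
    for p in range(1, n // 2 + 1):
        if all(bits[i] == bits[i + p] for i in range(n - p)):
            return p
    return None


m3 = lfsr_bits(taps=(3, 2), state=[1, 0, 0], length=420)     # 3-stage register
m4 = lfsr_bits(taps=(4, 3), state=[1, 0, 0, 0], length=420)  # 4-stage register
hybrid = hybrid_sum([m3, m4])
print(period(m3), period(m4), period(hybrid))
```

With these taps the component sequences have periods 7 and 15, and because those periods are coprime the hybrid sum repeats only every 105 bits, so the script should print `7 15 105`.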
A New Adaptive Framework for Collaborative Filtering Prediction

PubMed Central

Almosallam, Ibrahim A.; Shang, Yi

2010-01-01

Collaborative filtering is one of the most successful techniques for recommendation systems and has been used in many commercial services provided by major companies including Amazon, TiVo and Netflix. In this paper we focus on memory-based collaborative filtering (CF). Existing CF techniques work well on dense data but poorly on sparse data. To address this weakness, we propose to use z-scores instead of explicit ratings and introduce a mechanism that adaptively combines global statistics with item-based values based on data density level. We present a new adaptive framework that encapsulates various CF algorithms and the relationships among them. An adaptive CF predictor is developed that can self adapt from user-based to item-based to hybrid methods based on the amount of available ratings. Our experimental results show that the new predictor consistently obtained more accurate predictions than existing CF methods, with the most significant improvement on sparse data sets. When applied to the Netflix Challenge data set, our method performed better than existing CF and singular value decomposition (SVD) methods and achieved 4.67% improvement over Netflix’s system. PMID:21572924
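The idea of predicting from z-scores rather than raw ratings can be sketched as follows. This is a deliberately simplified, hypothetical version: it averages the z-scores that other users gave an item and rescales by the target user's own mean and spread. The adaptive, density-dependent blending of global and item-based statistics described in the abstract is not implemented, and the data and names are invented for illustration.

```python
from statistics import mean, pstdev


def user_stats(ratings):
    """Per-user mean and population standard deviation of ratings."""
    stats = {}
    for user, items in ratings.items():
        values = list(items.values())
        sigma = pstdev(values) or 1.0  # guard against zero spread
        stats[user] = (mean(values), sigma)
    return stats


def predict(ratings, user, item):
    """Predict a rating as user mean + user sigma * mean item z-score."""
    stats = user_stats(ratings)
    z_scores = [
        (items[item] - stats[other][0]) / stats[other][1]
        for other, items in ratings.items()
        if other != user and item in items
    ]
    mu, sigma = stats[user]
    if not z_scores:
        return mu  # nobody rated the item: fall back to the user's mean
    return mu + sigma * mean(z_scores)


if __name__ == "__main__":
    # Hypothetical ratings on a 1-5 scale.
    ratings = {
        "ann": {"a": 5, "b": 3, "c": 4},
        "bob": {"a": 4, "b": 2, "c": 3, "d": 4},
        "cat": {"a": 2, "b": 1, "d": 2},
    }
    print(round(predict(ratings, "ann", "d"), 3))
```

Working in z-scores removes each user's personal rating scale, so a harsh rater's 2 and a generous rater's 4 can both count as "above that user's average".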
A New Adaptive Framework for Collaborative Filtering Prediction.

PubMed

Almosallam, Ibrahim A; Shang, Yi

2008-06-01

Collaborative filtering is one of the most successful techniques for recommendation systems and has been used in many commercial services provided by major companies including Amazon, TiVo and Netflix. In this paper we focus on memory-based collaborative filtering (CF). Existing CF techniques work well on dense data but poorly on sparse data. To address this weakness, we propose to use z-scores instead of explicit ratings and introduce a mechanism that adaptively combines global statistics with item-based values based on data density level. We present a new adaptive framework that encapsulates various CF algorithms and the relationships among them. An adaptive CF predictor is developed that can self adapt from user-based to item-based to hybrid methods based on the amount of available ratings. Our experimental results show that the new predictor consistently obtained more accurate predictions than existing CF methods, with the most significant improvement on sparse data sets. When applied to the Netflix Challenge data set, our method performed better than existing CF and singular value decomposition (SVD) methods and achieved 4.67% improvement over Netflix's system.

A novel type 1/2 hybrid IncC plasmid carrying fifteen antimicrobial resistance genes recovered from Proteus mirabilis in China.

PubMed

Lei, Chang-Wei; Kong, Ling-Han; Ma, Su-Zhen; Liu, Bi-Hui; Chen, Yan-Peng; Zhang, An-Yun; Wang, Hong-Ning

2017-09-01

IncC plasmids are of great concern as vehicles of the broad-spectrum cephalosporin and carbapenem resistance genes blaCMY and blaNDM. The aim of this study was to sequence and characterize a multidrug resistance (MDR) IncC plasmid (pPm14C18) recovered from Proteus mirabilis. pPm14C18 was identified in a CMY-2-producing P. mirabilis isolate from chicken in China in 2014, and could be transferred to Escherichia coli, conferring an MDR phenotype. Whole genome sequencing confirmed that pPm14C18 was a novel type 1/2 hybrid IncC plasmid 165,992 bp in size, containing fifteen antimicrobial resistance genes. It harboured a novel MDR mosaic region comprised of a hybrid Tn21 tnp-pDU mer, in which blaCTX-M-65, dfrA32 and ereA were reported for the first time in an IncC plasmid. Phylogenetic relationship reconstruction based on the nucleotide sequences of the 52 IncC backbones showed that all type 1 IncC plasmids clustered into one clade, which then merged with pPm14C18 and finally with the type 2 IncC plasmids and another type 1/2 hybrid IncC plasmid, pYR1. MDR IncC plasmids in P. mirabilis of animal origin might threaten public health and deserve more attention.
Exploration of the folding dynamics of human telomeric G-quadruplex with a hybrid atomistic structure-based model

NASA Astrophysics Data System (ADS)

Bian, Yunqiang; Ren, Weitong; Song, Feng; Yu, Jiafeng; Wang, Jihua

2018-05-01

Structure-based models or Gō-like models, which are built from one or multiple particular experimental structures, have been successfully applied to the folding of proteins and RNAs. Recently, a variant termed the hybrid atomistic model advances the description of backbone and side chain interactions of traditional structure-based models, by borrowing the description of local interactions from classical force fields. In this study, we assessed the validity of this model in the folding problem of human telomeric DNA G-quadruplex, where local dihedral terms play important roles. A two-state model was developed and a set of molecular dynamics simulations was conducted to study the folding dynamics of sequence Htel24, which was experimentally validated to adopt two different (3 + 1) hybrid G-quadruplex topologies in K+ solution. Consistent with the experimental observations, the hybrid-1 conformation was found to be more stable and the hybrid-2 conformation was kinetically more favored. The simulations revealed that the hybrid-2 conformation folded in a more cooperative manner, which may be the reason why it was kinetically more accessible. Moreover, by building a Markov state model, a two-quartet G-quadruplex state and a misfolded state were identified as competing states that complicate the folding process of Htel24. The simulations also showed that the transition between hybrid-1 and hybrid-2 conformations may proceed through an ensemble of hairpin structures. The hybrid atomistic structure-based model reproduced the kinetic partitioning folding dynamics of Htel24 between two different folds, and thus can be used to study the complex folding processes of other G-quadruplex structures.
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
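The step of "creating a composite of overlapping fragment sequences" can be illustrated with a small greedy overlap merge. The fragment reads below are hypothetical (the abstract does not list the actual fragments), and greedy merging is a generic textbook approach, not necessarily the procedure the authors used.

```python
def overlap(a, b):
    """Length of the longest suffix of `a` that is a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def assemble(fragments):
    """Greedily merge the pair of fragments with the largest overlap."""
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates
    # Drop fragments that are fully contained in another fragment.
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]
    while len(frags) > 1:
        best_k, best_i, best_j = 0, None, None
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            break  # remaining fragments do not overlap
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for idx, f in enumerate(frags)
                 if idx not in (best_i, best_j)] + [merged]
    return frags


if __name__ == "__main__":
    # Hypothetical short reads covering the let-7 seed-region sequence.
    reads = ["GGUA", "UGAG", "GAGG", "AGGU"]
    print(assemble(reads))
```

Greedy merging can fail on repetitive sequences, where several equally good overlaps exist; for a seven-base target with distinct overlaps it is sufficient.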
Towards development of new ornamental plants: status and progress in wide hybridization.

PubMed

Kuligowska, Katarzyna; Lütken, Henrik; Müller, Renate

2016-07-01

The present review provides insights into the key findings of the hybridization process, crucial factors affecting the adaptation of new technologies within wide hybridization of ornamental plants and presents perspectives of further development of this strategy. Wide hybridization is one of the oldest breeding techniques that contributed enormously to the development of modern plant cultivars. Within ornamental breeding, it represents the main source of genetic variation. During the long history of wide hybridization, a number of methods were implemented allowing the evolution from a conventional breeding tool into a modern methodology. Nowadays, the research on model plants and crop species increases our understanding of reproductive isolation among distant species and partly explains the background of the traditional approaches previously used for overcoming hybridization barriers. Characterization of parental plants and hybrids is performed using molecular and cytological techniques that strongly facilitate breeding processes. Molecular markers and sequencing technologies are used for the assessment of genetic relationships among plants, as the genetic distance is typically depicted as one of the most important factors influencing cross-compatibility in hybridization processes. Furthermore, molecular marker systems are frequently applied for verification of hybrid state of the progeny. The flow cytometry and genomic in situ hybridization are used in the assessment of hybridization partners and characterization of hybrid progeny in relation to genome stabilization as well as genome recombination and introgression. In the future, new research and technologies are likely to provide more detailed information about genes and pathways responsible for interspecific reproductive isolation. Ultimately, this knowledge will enable development of strategies for obtaining compatible lines for hybrid production. 
Recent development in sequencing technologies and availability of sequence data will also facilitate creation of new molecular markers that will advance marker-assisted selection in hybridization process.
Classifying short genomic fragments from novel lineages using composition and homology

PubMed Central

2011-01-01

Background: The assignment of taxonomic attributions to DNA fragments recovered directly from the environment is a vital step in metagenomic data analysis. Assignments can be made using rank-specific classifiers, which assign reads to taxonomic labels from a predetermined level such as named species or strain, or rank-flexible classifiers, which choose an appropriate taxonomic rank for each sequence in a data set. The choice of rank typically depends on the optimal model for a given sequence and on the breadth of taxonomic groups seen in a set of close-to-optimal models. Homology-based (e.g., LCA) and composition-based (e.g., PhyloPythia, TACOA) rank-flexible classifiers have been proposed, but there is at present no hybrid approach that utilizes both homology and composition. Results: We first develop a hybrid, rank-specific classifier based on BLAST and Naïve Bayes (NB) that has comparable accuracy and a faster running time than the current best approach, PhymmBL. By substituting LCA for BLAST or allowing the inclusion of suboptimal NB models, we obtain a rank-flexible classifier. This hybrid classifier outperforms established rank-flexible approaches on simulated metagenomic fragments of length 200 bp to 1000 bp and is able to assign taxonomic attributions to a subset of sequences with few misclassifications. We then demonstrate the performance of different classifiers on an enhanced biological phosphorus removal metagenome, illustrating the advantages of rank-flexible classifiers when representative genomes are absent from the set of reference genomes. Application to a glacier ice metagenome demonstrates that similar taxonomic profiles are obtained across a set of classifiers which are increasingly conservative in their classification. Conclusions: Our NB-based classification scheme is faster than the current best composition-based algorithm, Phymm, while providing equally accurate predictions. The rank-flexible variant of NB, which we term ε-NB, is complementary to LCA and can be combined with it to yield conservative prediction sets of very high confidence. The simple parameterization of LCA and ε-NB allows for tuning of the balance between more predictions and increased precision, allowing the user to account for the sensitivity of downstream analyses to misclassified or unclassified sequences. PMID:21827705
Fine-scale genetic mapping of a hybrid sterility factor between Drosophila simulans and D. mauritiana: the varied and elusive functions of "speciation genes".

PubMed

Araripe, Luciana O; Montenegro, Horácio; Lemos, Bernardo; Hartl, Daniel L

2010-12-14

Hybrid male sterility (HMS) is a common outcome of hybridization between closely related animal species. It arises because interactions between alleles that are functional within one species may be disrupted in hybrids. The identification of genes leading to hybrid sterility is of great interest for understanding the evolutionary process of speciation. In the current work we used marked P-element insertions as dominant markers to efficiently locate one genetic factor causing a severe reduction in fertility in hybrid males of Drosophila simulans and D. mauritiana. Our mapping effort identified a region of 9 kb on chromosome 3, containing three complete coding sequences and one partial coding sequence. Within this region, two annotated genes are suggested as candidates for the HMS factor, based on comparative molecular characterization and public-source information. Gene Taf1 is only partially contained in the region, yet shows high polymorphism with four fixed non-synonymous substitutions between the two species. Its molecular functions involve sequence-specific DNA binding and transcription factor activity. Gene agt is a small, intronless gene, whose molecular function is annotated as methylated-DNA-protein-cysteine S-methyltransferase activity. High polymorphism and one fixed non-synonymous substitution suggest this is a fast-evolving gene. The gene trees of both genes perfectly separate D. simulans and D. mauritiana into monophyletic groups. Analysis of gene expression using microarrays revealed trends that were similar to those previously found in comparisons between whole-genome hybrids and parental species. Identification and subsequent confirmation of the HMS candidate gene will add another case study toward understanding the evolutionary process of hybrid incompatibility.
On the complexity of the Saccharomyces bayanus taxon: hybridization and potential hybrid speciation.

PubMed

Pérez-Través, Laura; Lopes, Christian A; Querol, Amparo; Barrio, Eladio

2014-01-01

Although the genus Saccharomyces has been thoroughly studied, some species in the genus have not yet been accurately resolved; an example is S. bayanus, a taxon that includes genetically diverse lineages of pure and hybrid strains. This diversity makes the assignment and classification of strains belonging to this species unclear and controversial. They have been subdivided by some authors into two varieties (bayanus and uvarum), which have been raised to the species level by others. In this work, we evaluate the complexity of 46 different strains included in the S. bayanus taxon by means of PCR-RFLP analysis and by sequencing of 34 gene regions and one mitochondrial gene. Using the sequence data, and based on the S. bayanus var. bayanus reference strain NBRC 1948, a hypothetical pure S. bayanus was reconstructed for these genes that showed alleles with similarity values lower than 97% with the S. bayanus var. uvarum strain CBS 7001, and of 99-100% with the non-S. cerevisiae portion in S. pastorianus Weihenstephan 34/70 and with the new species S. eubayanus. Among the S. bayanus strains under study, different levels of homozygosity, hybridization and introgression were found; however, no pure S. bayanus var. bayanus strain was identified. These S. bayanus hybrids can be classified into two types: homozygous (type I) and heterozygous hybrids (type II), indicating that they originated through different hybridization processes. Therefore, a putative evolutionary scenario involving two different hybridization events between a S. bayanus var. uvarum and unknown European S. eubayanus-like strains can be postulated to explain the genomic diversity observed in our S. bayanus var. bayanus strains.
On the Complexity of the Saccharomyces bayanus Taxon: Hybridization and Potential Hybrid Speciation

PubMed Central

Pérez-Través, Laura; Lopes, Christian A.; Querol, Amparo; Barrio, Eladio

2014-01-01

Although the genus Saccharomyces has been thoroughly studied, some species in the genus have not yet been accurately resolved; an example is S. bayanus, a taxon that includes genetically diverse lineages of pure and hybrid strains. This diversity makes the assignment and classification of strains belonging to this species unclear and controversial. They have been subdivided by some authors into two varieties (bayanus and uvarum), which have been raised to the species level by others. In this work, we evaluate the complexity of 46 different strains included in the S. bayanus taxon by means of PCR-RFLP analysis and by sequencing of 34 gene regions and one mitochondrial gene. Using the sequence data, and based on the S. bayanus var. bayanus reference strain NBRC 1948, a hypothetical pure S. bayanus was reconstructed for these genes that showed alleles with similarity values lower than 97% with the S. bayanus var. uvarum strain CBS 7001, and of 99–100% with the non-S. cerevisiae portion in S. pastorianus Weihenstephan 34/70 and with the new species S. eubayanus. Among the S. bayanus strains under study, different levels of homozygosity, hybridization and introgression were found; however, no pure S. bayanus var. bayanus strain was identified. These S. bayanus hybrids can be classified into two types: homozygous (type I) and heterozygous hybrids (type II), indicating that they originated through different hybridization processes. Therefore, a putative evolutionary scenario involving two different hybridization events between a S. bayanus var. uvarum and unknown European S. eubayanus-like strains can be postulated to explain the genomic diversity observed in our S. bayanus var. bayanus strains. PMID:24705561
Downscaling of daily precipitation using a hybrid model of Artificial Neural Network, Wavelet, and Quantile Mapping in Gharehsoo River Basin, Iran

NASA Astrophysics Data System (ADS)

Taie Semiromi, M.; Koch, M.

2017-12-01

Linear/regression statistical downscaling methods are straightforward and widely used, and they can be applied to a single predictor-predictand pair or to spatial fields of predictors and predictands. Their greatest constraint is the requirement that the predictor and predictand values be normally distributed, which means that they cannot be used to predict the distribution of daily rainfall, because it is typically non-normal. To tackle this limitation, the current study introduces a newly developed hybrid technique that takes advantage of Artificial Neural Networks (ANNs), Wavelet analysis and Quantile Mapping (QM) for downscaling of daily precipitation at 10 rain-gauge stations located in the Gharehsoo River Basin, Iran. For the downscaling, the study makes use of the Second Generation Canadian Earth System Model (CanESM2) developed by the Canadian Centre for Climate Modeling and Analysis (CCCma). Climate projections are available for three representative concentration pathways (RCPs), namely RCP 2.6, RCP 4.5 and RCP 8.5, up to 2100. In this regard, 26 National Centers for Environmental Prediction (NCEP) reanalysis large-scale variables that have potential physical relationships with precipitation were selected as candidate predictors. Afterwards, predictor screening was conducted using correlation, partial correlation and explained variance between the predictors and the predictand (precipitation). Depending on the rain-gauge station, two or three predictors were selected, and their decomposed details (D) and approximation (A) obtained from discrete wavelet analysis were fed as inputs to the neural networks. After downscaling of daily precipitation, bias correction was conducted using quantile mapping. Of the complete time series available, i.e. 1978-2005, the first two thirds (1978-1996) were used for calibration of QM and the remainder (1997-2005) for validation. Results showed that the proposed hybrid method, supported by QM for bias correction, could simulate daily precipitation quite satisfactorily. Results also indicated that, under all RCPs, precipitation will decrease by roughly 12% by 2100. However, the decrease in precipitation will be smaller under RCP 8.5 than under RCP 4.5.
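The bias-correction step named in this abstract, quantile mapping, can be sketched in its simplest empirical form: a simulated value is located in the empirical distribution of simulated calibration values and replaced by the observed value at the same quantile. The function names and the numbers below are illustrative assumptions; the study's own QM variant, its wavelet-ANN inputs and its station data are not reproduced.

```python
from bisect import bisect_right


def quantile(sorted_vals, p):
    """Linearly interpolated quantile of a sorted list, 0 <= p <= 1."""
    pos = p * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac


def fit_quantile_mapping(model_cal, obs_cal):
    """Build a correction function from calibration-period samples."""
    model_sorted = sorted(model_cal)
    obs_sorted = sorted(obs_cal)

    def correct(x):
        # Empirical non-exceedance probability of x in the model sample.
        p = bisect_right(model_sorted, x) / len(model_sorted)
        # Observed value at the same probability.
        return quantile(obs_sorted, p)

    return correct


if __name__ == "__main__":
    # Hypothetical calibration data (mm/day): the model is too dry.
    model_cal = [0.0, 1.0, 2.0, 3.0, 4.0]
    obs_cal = [0.0, 2.0, 4.0, 6.0, 8.0]
    correct = fit_quantile_mapping(model_cal, obs_cal)
    print([correct(x) for x in (0.5, 2.0, 3.5)])
```

In a split like the one described above, `fit_quantile_mapping` would be called once on the calibration years and the returned function then applied unchanged to the validation years.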
Optimal control of a hybrid rhythmic-discrete task: the bouncing ball revisited.

PubMed

Ronsse, Renaud; Wei, Kunlin; Sternad, Dagmar

2010-05-01

Rhythmically bouncing a ball with a racket is a hybrid task that combines continuous rhythmic actuation of the racket with the control of discrete impact events between racket and ball. This study presents experimental data and a two-layered modeling framework that explicitly addresses the hybrid nature of control: a first discrete layer calculates the state to reach at impact and the second continuous layer smoothly drives the racket to this desired state, based on optimality principles. The testbed for this hybrid model is task performance at a range of increasingly slower tempos. When slowing the rhythm of the bouncing actions, the continuous cycles become separated into a sequence of discrete movements interspersed by dwell times and directed to achieve the desired impact. Analyses of human performance show increasing variability of performance measures with slower tempi, associated with a change in racket trajectories from approximately sinusoidal to less symmetrical velocity profiles. Matching results of model simulations give support to a hybrid control model based on optimality, and therefore suggest that optimality principles are applicable to the sensorimotor control of complex movements such as ball bouncing.
Use of repetitive DNA sequences to distinguish Mus musculus and Mus caroli cells by in situ hybridization.

PubMed

Siracusa, L D; Chapman, V M; Bennett, K L; Hastie, N D; Pietras, D F; Rossant, J

1983-02-01

Mammalian chimaeras have proved useful for investigating early steps in embryonic development. However, a complete clonal analysis of cell lineages has been limited by the lack of a marker which is ubiquitous and can distinguish parental cell types in situ. We have developed a cell marker system which fulfils these criteria. Chimaeric mice were successfully produced from two mouse species which possess sufficient genetic differences to allow unequivocal identification of parental cell types. DNA-DNA in situ hybridization with cloned, species-specific sequences was performed to distinguish the parental cell types. We have identified a cloned Mus musculus satellite DNA sequence which shows hybridization differences between Mus musculus and Mus caroli DNA. This clone was used as a probe in in situ hybridizations to bone marrow chromosomes from Mus musculus, Mus caroli, and an interspecific F1 hybrid. The clone could qualitatively distinguish Mus musculus from Mus caroli chromosomes after in situ hybridization, even when they were derived from the same F1 hybrid cell. Quantitation of this hybridization to interphase nuclei from bone marrow spreads indicates that the probe can successfully distinguish Mus musculus from Mus caroli cells and can determine the percentage contribution of Mus musculus in mixtures of bone marrow cells of these species and in chimaeric bone marrow cell preparations.
The complete mitochondrial genome of Haliotis laevigata (Gastropoda: Haliotidae) using MiSeq and HiSeq sequencing.

PubMed

Robinson, Nick A; Hall, Nathan E; Ross, Elizabeth M; Cooke, Ira R; Shiel, Brett P; Robinson, Andrew J; Strugnell, Jan M

2016-01-01

The mitochondrial genome of greenlip abalone, Haliotis laevigata, is reported. MiSeq and HiSeq reads from one individual were assembled to yield a single 16,545 bp contig. The sequence shares 92% identity with the mitochondrial genome of H. rubra, a closely related species that hybridizes with H. laevigata in the wild. The sequence will be useful for determining the maternal contribution to hybrid populations and for investigating population structure and stock-enhancement effectiveness.
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
Inhibition, Updating Working Memory, and Shifting Predict Reading Disability Symptoms in a Hybrid Model: Project KIDS.

PubMed

Daucourt, Mia C; Schatschneider, Christopher; Connor, Carol M; Al Otaiba, Stephanie; Hart, Sara A

2018-01-01

Recent achievement research suggests that executive function (EF), a set of regulatory processes that control both thought and action necessary for goal-directed behavior, is related to typical and atypical reading performance. This project examines the relation of EF, as measured by its components, Inhibition, Updating Working Memory, and Shifting, with a hybrid model of reading disability (RD). Our sample included 420 children who participated in a broader intervention project when they were in KG-third grade (age M = 6.63 years, SD = 1.04 years, range = 4.79-10.40 years). At the time their EF was assessed, using a parent-report Behavior Rating Inventory of Executive Function (BRIEF), they had a mean age of 13.21 years ( SD = 1.54 years; range = 10.47-16.63 years). The hybrid model of RD was operationalized as a composite consisting of four symptoms, and set so that any child could have any one, any two, any three, any four, or none of the symptoms included in the hybrid model. The four symptoms include low word reading achievement, unexpected low word reading achievement, poorer reading comprehension compared to listening comprehension, and dual-discrepancy response-to-intervention, requiring both low achievement and low growth in word reading. The results of our multilevel ordinal logistic regression analyses showed a significant relation between all three components of EF (Inhibition, Updating Working Memory, and Shifting) and the hybrid model of RD, and that the strength of EF's predictive power for RD classification was the highest when RD was modeled as having at least one or more symptoms. Importantly, the chances of being classified as having RD increased as EF performance worsened and decreased as EF performance improved. The question of whether any one EF component would emerge as a superior predictor was also examined and results showed that Inhibition, Updating Working Memory, and Shifting were equally valuable as predictors of the hybrid model of RD. 
In total, all EF components were significant and equally effective predictors of RD when RD was operationalized using the hybrid model.
Inhibition, Updating Working Memory, and Shifting Predict Reading Disability Symptoms in a Hybrid Model: Project KIDS

PubMed Central

Daucourt, Mia C.; Schatschneider, Christopher; Connor, Carol M.; Al Otaiba, Stephanie; Hart, Sara A.

2018-01-01

Recent achievement research suggests that executive function (EF), a set of regulatory processes that control both thought and action necessary for goal-directed behavior, is related to typical and atypical reading performance. This project examines the relation of EF, as measured by its components, Inhibition, Updating Working Memory, and Shifting, with a hybrid model of reading disability (RD). Our sample included 420 children who participated in a broader intervention project when they were in KG-third grade (age M = 6.63 years, SD = 1.04 years, range = 4.79–10.40 years). At the time their EF was assessed, using a parent-report Behavior Rating Inventory of Executive Function (BRIEF), they had a mean age of 13.21 years (SD = 1.54 years; range = 10.47–16.63 years). The hybrid model of RD was operationalized as a composite consisting of four symptoms, and set so that any child could have any one, any two, any three, any four, or none of the symptoms included in the hybrid model. The four symptoms include low word reading achievement, unexpected low word reading achievement, poorer reading comprehension compared to listening comprehension, and dual-discrepancy response-to-intervention, requiring both low achievement and low growth in word reading. The results of our multilevel ordinal logistic regression analyses showed a significant relation between all three components of EF (Inhibition, Updating Working Memory, and Shifting) and the hybrid model of RD, and that the strength of EF’s predictive power for RD classification was the highest when RD was modeled as having at least one or more symptoms. Importantly, the chances of being classified as having RD increased as EF performance worsened and decreased as EF performance improved. The question of whether any one EF component would emerge as a superior predictor was also examined and results showed that Inhibition, Updating Working Memory, and Shifting were equally valuable as predictors of the hybrid model of RD. 
In total, all EF components were significant and equally effective predictors of RD when RD was operationalized using the hybrid model. PMID:29662458
Evaluation and Selection of Best Priority Sequencing Rule in Job Shop Scheduling using Hybrid MCDM Technique

NASA Astrophysics Data System (ADS)

Kiran Kumar, Kalla; Nagaraju, Dega; Gayathri, S.; Narayanan, S.

2017-05-01

Priority sequencing rules determine the order in which jobs are processed at a workstation. Applying different priority rules in job shop scheduling yields different job orders, and considerable experimentation is normally needed before the best rule can be chosen. A comprehensive method of selecting the right rule is therefore essential from a managerial decision-making perspective. This paper considers seven different priority sequencing rules in job shop scheduling. For evaluation and selection of the best priority sequencing rule, a set of eight criteria is considered. The aim of this work is to demonstrate the methodology of evaluating and selecting the best priority sequencing rule by using a hybrid multi-criteria decision making (MCDM) technique, i.e., the analytical hierarchy process (AHP) combined with the technique for order preference by similarity to ideal solution (TOPSIS). The criteria weights are calculated using AHP, whereas the relative closeness values of all priority sequencing rules are computed with TOPSIS from data acquired on the shop floor of a manufacturing firm. Finally, from the findings of this work, the priority sequencing rules are ranked from most important to least important. The comprehensive methodology presented in this paper helps the management of a workstation choose the best priority sequencing rule among the available alternatives for processing jobs with maximum benefit.
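The TOPSIS half of the AHP-TOPSIS procedure can be sketched as below. The criteria weights are assumed to be given (in the paper they come from AHP, which is not implemented here), and the three rules, two criteria and all scores are hypothetical stand-ins for the seven rules and eight criteria of the study.

```python
from math import sqrt


def topsis(matrix, weights, benefit):
    """Relative closeness of each alternative to the ideal solution.

    matrix  -- rows are alternatives, columns are criteria scores
    weights -- one weight per criterion (assumed to sum to 1)
    benefit -- True where larger is better, False where smaller is better
    Assumes the alternatives are not all identical, otherwise the
    closeness ratio is undefined.
    """
    n_crit = len(weights)
    # Vector-normalise each column, then apply the criterion weight.
    norms = [sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
         for row in matrix]
    cols = list(zip(*v))
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(cols)]
    anti = [min(c) if benefit[j] else max(c) for j, c in enumerate(cols)]
    scores = []
    for row in v:
        d_plus = sqrt(sum((x - i) ** 2 for x, i in zip(row, ideal)))
        d_minus = sqrt(sum((x - a) ** 2 for x, a in zip(row, anti)))
        scores.append(d_minus / (d_plus + d_minus))
    return scores


if __name__ == "__main__":
    # Hypothetical rules scored on throughput (benefit) and tardiness (cost).
    rules = ["rule A", "rule B", "rule C"]
    scores = topsis([[9, 1], [5, 5], [1, 9]], [0.5, 0.5], [True, False])
    for name, s in sorted(zip(rules, scores), key=lambda t: -t[1]):
        print(name, round(s, 3))
```

Ranking by descending relative closeness gives the final ordering of the rules; changing the weights changes the ranking, which is why the weighting step matters as much as the distance computation.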
DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest.

PubMed

Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

2018-01-05

DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html.
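For readers unfamiliar with the two metrics quoted above, the following sketch shows how the Matthews correlation coefficient and accuracy are computed from a binary confusion matrix. The function name and the example counts are illustrative assumptions; they are not DHSpred's code or data.

```python
import math

def mcc_and_accuracy(tp, tn, fp, fn):
    """Return (MCC, accuracy) for a binary confusion matrix."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is reported as 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return mcc, accuracy

# Illustrative counts only: a perfect classifier and a coin-flip classifier.
perfect = mcc_and_accuracy(tp=50, tn=50, fp=0, fn=0)
chance = mcc_and_accuracy(tp=25, tn=25, fp=25, fn=25)
```

Unlike accuracy, MCC stays near zero for a classifier that merely predicts the majority class, which is presumably why both values are reported together.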
DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest

PubMed Central

Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

2018-01-01

DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html PMID:29416743
SVM-Based Prediction of Propeptide Cleavage Sites in Spider Toxins Identifies Toxin Innovation in an Australian Tarantula

PubMed Central

Wong, Emily S. W.; Hardy, Margaret C.; Wood, David; Bailey, Timothy; King, Glenn F.

2013-01-01

Spider neurotoxins are commonly used as pharmacological tools and are a popular source of novel compounds with therapeutic and agrochemical potential. Since venom peptides are inherently toxic, the host spider must employ strategies to avoid adverse effects prior to venom use. It is partly for this reason that most spider toxins encode a protective proregion that upon enzymatic cleavage is excised from the mature peptide. In order to identify the mature toxin sequence directly from toxin transcripts, without resorting to protein sequencing, the propeptide cleavage site in the toxin precursor must be predicted bioinformatically. We evaluated different machine learning strategies (support vector machines, hidden Markov model and decision tree) and developed an algorithm (SpiderP) for prediction of propeptide cleavage sites in spider toxins. Our strategy uses a support vector machine (SVM) framework that combines both local and global sequence information. Our method is superior or comparable to current tools for prediction of propeptide sequences in spider toxins. Evaluation of the SVM method on an independent test set of known toxin sequences yielded 96% sensitivity and 100% specificity. Furthermore, we sequenced five novel peptides (not used to train the final predictor) from the venom of the Australian tarantula Selenotypus plumipes to test the accuracy of the predictor and found 80% sensitivity and 99.6% 8-mer specificity. Finally, we used the predictor together with homology information to predict and characterize seven groups of novel toxins from the deeply sequenced venom gland transcriptome of S. plumipes, which revealed structural complexity and innovations in the evolution of the toxins. The precursor prediction tool (SpiderP) is freely available on ArachnoServer (http://www.arachnoserver.org/spiderP.html), a web portal to a comprehensive relational database of spider toxins. 
All training data, test data, and scripts used are available from the SpiderP website. PMID:23894279
Hybridization chain reaction-based instantaneous derivatization technology for chemiluminescence detection of specific DNA sequences.

PubMed

Wang, Xin; Lau, Choiwan; Kai, Masaaki; Lu, Jianzhong

2013-05-07

We propose here a new amplifying strategy that uses hybridization chain reaction (HCR) to detect specific sequences of DNA, where stable DNA monomers assemble on the magnetic beads only upon exposure to a target DNA. Briefly, in the HCR process, two complementary stable species of hairpins coexist in solution until the introduction of initiator reporter strands triggers a cascade of hybridization events that yield nicked double helices analogous to alternating copolymers. Moreover, a "sandwich-type" detection strategy is employed in our design. Magnetic beads, which are functionalized with capture DNA, are reacted with the target, and sandwiched with the above nicked double helices. Then, chemiluminescence (CL) detection proceeds via an instantaneous derivatization reaction between a specific CL reagent, 3,4,5-trimethoxylphenylglyoxal (TMPG), and the guanine nucleotides within the target DNA, reporter strands and DNA monomers for the generation of light. Our results clearly show that the amplification detection of specific sequences of DNA achieves a better performance (e.g. wide linear response range, low detection limit, and high specificity) as compared to the traditional sandwich type (capture/target/reporter) assays. Upon modification, the approach presented could be extended to detect other types of targets. We believe that this simple technique is promising for improving medical diagnosis and treatment.

Genetic variation and origin of parthenogenesis in the Aspidoscelis cozumela complex: evidence from mitochondrial genes.

PubMed

Manríquez-Morán, Norma L; Cruz, Fausto R Méndez-de la; Murphy, Robert W

2014-01-01

Parthenogenesis is a form of clonal reproduction. Eggs develop in the absence of sperm and offspring are genetically identical to their mother. Although common in invertebrates, it occurs in only a few species of squamate reptiles. Parthenogenetic reptiles have their origin in interspecific hybridization, and their populations are exclusively female. Because of its high mutation rate and maternal inheritance, mitochondrial DNA sequence data can evaluate the origin and evolution of all-female vertebrates. Partial sequences from two mitochondrial genes, Cytb and ND4, were analyzed to investigate questions about the origin of parthenogenesis in the Aspidoscelis cozumela complex, which includes A. cozumela, A. maslini and A. rodecki. Low levels of divergence were detected among parthenogenetic species, and between them and A. angusticeps, confirming it as the maternal species of the parthenoforms. A gene tree was constructed using sequences from three populations of A. angusticeps and nine of its unisexual daughter species. The phylogeny suggests that two independent hybridization events between A. angusticeps and A. deppii formed three unisexual species. One hybridization resulted in A. rodecki and the other formed A. maslini and A. cozumela. Although A. cozumela has the haplotype characteristic of A. maslini from Puerto Morelos, it is considered to be a different species based on karyological and morphological characteristics and its geographical isolation.
Yeast One-Hybrid Gγ Recruitment System for Identification of Protein Lipidation Motifs

PubMed Central

Fukuda, Nobuo; Doi, Motomichi; Honda, Shinya

2013-01-01

Fatty acids and isoprenoids can be covalently attached to a variety of proteins. These lipid modifications regulate protein structure, localization and function. Here, we describe a yeast one-hybrid approach based on the Gγ recruitment system that is useful for identifying sequence motifs that influence the lipid modifications which recruit proteins to the plasma membrane. Our approach facilitates the isolation of yeast cells expressing lipid-modified proteins via a simple growth selection assay utilizing G-protein signaling that induces diploid formation. In the current study, we selected the N-terminal sequence of Gα subunits as a model case to investigate dual lipid modification, i.e., myristoylation and palmitoylation, a modification that is widely conserved from yeast to higher eukaryotes. Our results suggest that both lipid modifications are required for restoration of G-protein signaling. Although we could not differentiate between myristoylation and palmitoylation, N-terminal positions 7 and 8 play a critical role. Moreover, we tested the preference for specific amino-acid residues at positions 7 and 8 using library-based screening. This new approach will be useful to explore protein-lipid associations and to determine the corresponding sequence motifs. PMID:23922919
Genome-Wide Spectra of Transcription Insertions and Deletions Reveal That Slippage Depends on RNA:DNA Hybrid Complementarity

PubMed Central

Traverse, Charles C.

2017-01-01

Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of Escherichia coli and Buchnera aphidicola, which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. PMID:28851848
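Since the abstract turns on whether indels fall inside homopolymeric runs, a small sketch of how such runs might be located in a sequence is given below. The function name and the minimum run length are assumptions made for illustration; the paper's own pipeline is not described in the abstract.

```python
from itertools import groupby

def homopolymer_runs(seq, min_len=3):
    """Return (start, base, length) for every run of identical bases
    that is at least min_len long; start is a 0-based index."""
    runs = []
    pos = 0
    for base, group in groupby(seq):
        length = len(list(group))
        if length >= min_len:
            runs.append((pos, base, length))
        pos += length
    return runs

# Toy sequence with one run of A and one run of T.
runs = homopolymer_runs("GCAAAAATTTTCG", min_len=4)
```

An observed insertion or deletion could then be classified as run-associated by checking whether its coordinate falls inside one of the returned intervals.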
Generation and characterization of tribenuron-methyl herbicide-resistant rapeseed (Brassica napus) for hybrid seed production using chemically induced male sterility.

PubMed

Li, Haitao; Li, Juanjuan; Zhao, Bo; Wang, Jing; Yi, Licong; Liu, Chao; Wu, Jiangsheng; King, Graham J; Liu, Kede

2015-01-01

We identified and molecularly analyzed four tribenuron-methyl-resistant mutants of Brassica napus, which would be very useful in hybrid production using a chemically induced male sterility system. Chemically induced male sterility (CIMS) systems dependent on chemical hybridization agents (CHAs) like tribenuron-methyl (TBM) represent an important approach for practical utilization of heterosis in rapeseed. However, when the female parents are sprayed with TBM to induce male sterility, the male parents must be protected with a shield to avoid injury to the stamens, which complicates the seed production protocol and increases the cost of hybrid seed production. Here we report the first proposed application of a herbicide-resistant cultivar in hybrid production, using a CIMS system based on identifying four TBM-resistant mutants in Brassica napus. Genetic analysis indicated that the TBM resistance was controlled by a single dominant nuclear gene. An in vitro enzyme activity assay for acetohydroxyacid synthase (AHAS) suggested that the herbicide resistance is caused by a gain-of-function mutation in a copy of the AHAS genes. Comparative sequencing of the mutant and wild-type BnaA.AHAS.a coding sequences identified a C-to-T transition at either position 535 or 536 from the translation start site, which resulted in a substitution of proline with serine or leucine at position 197 according to the Arabidopsis thaliana protein sequence. An allele-specific dCAPS marker developed from the C536T variation co-segregated with the herbicide resistance. Expression of BnaA.ahas3.a in transgenic A. thaliana plants conferred herbicide resistance, which confirmed that the P197 substitution in BnaA.AHAS.a was responsible for the herbicide resistance. Moreover, the TBM-resistant lines maintain normal male fertility under TBM treatment and can be of practical value in hybrid seed production using CIMS.
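The link between the two nucleotide positions and the two amino-acid substitutions can be checked with a short worked example. The sketch assumes that position 1 is the A of the start codon and uses the standard genetic code; the third base of the proline codon is not given in the abstract, so all four possibilities are tried.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def locate(cds_pos):
    """Map a 1-based coding-sequence position to (codon number, position in codon)."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1

def c_to_t_effects(codon_pos):
    """Amino acids produced by a C-to-T change at codon_pos (1 or 2) of a
    proline codon CCN, over every possible third base N."""
    effects = set()
    for third in BASES:
        codon = list("CC" + third)
        codon[codon_pos - 1] = "T"
        effects.add(CODON_TABLE["".join(codon)])
    return effects

first = locate(535)    # first base of the codon
second = locate(536)   # second base of the same codon
```

Under these assumptions both positions fall in the same codon of the gene's own reading frame (codon 179 by this arithmetic, reported as position 197 in the Arabidopsis numbering used in the abstract). A C-to-T change at the first base turns CCN into TCN (serine), and at the second base into CTN (leucine), matching the two substitutions described.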
A novel hybrid genetic algorithm to solve the make-to-order sequence-dependent flow-shop scheduling problem

NASA Astrophysics Data System (ADS)

Mirabi, Mohammad; Fatemi Ghomi, S. M. T.; Jolai, F.

2014-04-01

The flow-shop scheduling problem (FSP) deals with the scheduling of a set of n jobs that visit a set of m machines in the same order. As the FSP is NP-hard, no efficient algorithm is known that reaches the optimal solution of the problem. To minimize the holding, delay and setup costs of large permutation flow-shop scheduling problems with sequence-dependent setup times on each machine, this paper develops a novel hybrid genetic algorithm (HGA) with three genetic operators. The proposed HGA applies a modified approach to generate a pool of initial solutions, and also uses an improved heuristic called the iterated swap procedure to improve the initial solutions. We consider the make-to-order production approach, in which some sequences between jobs are treated as tabu based on the maximum allowable setup cost. The results are compared with those of some recently developed heuristics, and computational experiments show that the proposed HGA performs very competitively with respect to accuracy and efficiency of solution.
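To make the problem setting concrete, the sketch below evaluates one job permutation in a permutation flow shop with sequence-dependent setup times and checks the tabu condition on adjacent jobs. It is only an evaluation routine under stated assumptions, not the HGA itself: setups are assumed to start as soon as the machine is free, the first job needs no setup, and the data layout, function names and numbers are hypothetical.

```python
def makespan(perm, proc, setup):
    """Completion time of the last job on the last machine.

    proc[j][m]     : processing time of job j on machine m
    setup[m][i][j] : setup time on machine m when job j directly follows job i
    """
    n_mach = len(proc[0])
    prev_done = [0] * n_mach   # completion times of the previous job
    prev_job = None
    for job in perm:
        done = []
        for m in range(n_mach):
            s = setup[m][prev_job][job] if prev_job is not None else 0
            machine_ready = prev_done[m] + s        # machine free and set up
            job_ready = done[m - 1] if m > 0 else 0  # job left previous machine
            done.append(max(machine_ready, job_ready) + proc[job][m])
        prev_done, prev_job = done, job
    return prev_done[-1]

def is_tabu(perm, setup_cost, max_cost):
    """True if any adjacent pair of jobs exceeds the allowable setup cost."""
    return any(setup_cost[i][j] > max_cost for i, j in zip(perm, perm[1:]))

# Toy instance: 2 jobs, 2 machines.
proc = [[3, 2], [2, 4]]
setup = [[[0, 1], [2, 0]],   # machine 0
         [[0, 1], [3, 0]]]   # machine 1
forward = makespan([0, 1], proc, setup)
reverse = makespan([1, 0], proc, setup)
```

A genetic algorithm of the kind described would call an evaluation like this for each candidate permutation, with the paper's holding, delay and setup costs in place of the plain makespan used here, and would discard or penalize permutations flagged as tabu.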
Nucleic Acid Detection Methods

DOEpatents

Smith, Cassandra L.; Yaar, Ron; Szafranski, Przemyslaw; Cantor, Charles R.

1998-05-19

The invention relates to methods for rapidly determining the sequence and/or length of a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3'-hydroxyl groups are extended with a nucleic acid polymerase. Nuclease-cleaved heteroduplexes can be easily distinguished from nuclease-uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3'-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system which also adds or releases one label into solution. The resulting products can be analyzed using either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces allowing for multiple reactions to be conducted in parallel and, if desired, automated.
Analysis of simple sequence repeat (SSR) structure and sequence within Epichloë endophyte genomes reveals impacts on gene structure and insights into ancestral hybridization events.

PubMed

Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry

2017-01-01

Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Because these endophytes are known for the beneficial characteristics they endow upon their grass hosts, identifying endophyte species has been of great interest agronomically and scientifically. Simple sequence repeat loci and the variation in their repeat elements have been used to rapidly identify endophyte species and strains; however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
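As a rough illustration of what locating a simple sequence repeat in a sequence involves, the sketch below finds tandem repeats with a regular expression. The motif-length range and minimum repeat count are arbitrary assumptions, the toy sequence is invented, and nothing here reflects the actual structure of the B10 locus.

```python
import re

def find_ssrs(seq, min_motif=2, max_motif=6, min_repeats=4):
    """Return (start, motif, copies) for tandem repeats of short motifs.

    The lazy quantifier makes the regex prefer the shortest motif that
    satisfies the repeat count at each start position.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_motif, max_motif, min_repeats - 1)
    )
    return [
        (m.start(), m.group(1), len(m.group(0)) // len(m.group(1)))
        for m in pattern.finditer(seq.upper())
    ]

# Invented sequence containing five copies of the dinucleotide AC.
hits = find_ssrs("TTGACACACACACGG")
```

Comparing the motif and copy number returned for the same locus across strains is the kind of information an allele-sizing approach relies on. Note that long single-base runs can also be reported by this pattern as repeats of a two-base motif.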
Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives.

PubMed

Zhao, Min; Wang, Qingguo; Wang, Quan; Jia, Peilin; Zhao, Zhongming

2013-01-01

Copy number variation (CNV) is a prevalent form of critical genetic variation that leads to an abnormal number of copies of large genomic regions in a cell. Microarray-based comparative genome hybridization (arrayCGH) and genotyping arrays were the standard technologies for detecting large regions subject to copy number changes in genomes until recently, when high-resolution sequence data from next-generation sequencing (NGS) became available for analysis. During the last several years, NGS-based analysis has been widely applied to identify CNVs in both healthy and diseased individuals. Correspondingly, the strong demand for NGS-based CNV analyses has fuelled development of numerous computational methods and tools for CNV detection. In this article, we review the recent advances in computational methods pertaining to CNV detection using whole genome and whole exome sequencing data. Additionally, we discuss their strengths and weaknesses and suggest directions for future development.
Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives

PubMed Central

2013-01-01

Copy number variation (CNV) is a prevalent form of critical genetic variation that leads to an abnormal number of copies of large genomic regions in a cell. Microarray-based comparative genome hybridization (arrayCGH) and genotyping arrays were the standard technologies for detecting large regions subject to copy number changes in genomes until recently, when high-resolution sequence data from next-generation sequencing (NGS) became available for analysis. During the last several years, NGS-based analysis has been widely applied to identify CNVs in both healthy and diseased individuals. Correspondingly, the strong demand for NGS-based CNV analyses has fuelled development of numerous computational methods and tools for CNV detection. In this article, we review the recent advances in computational methods pertaining to CNV detection using whole genome and whole exome sequencing data. Additionally, we discuss their strengths and weaknesses and suggest directions for future development. PMID:24564169
Rapid detection of Cyprinid herpesvirus-3 (CyHV-3) using a gold nanoparticle-based hybridization assay.

PubMed

Saleh, Mona; El-Matbouli, Mansour

2015-06-01

Cyprinid herpesvirus-3 (CyHV-3) is a highly infectious pathogen that causes fatal disease in common and koi carp Cyprinus carpio L. CyHV-3 detection is usually based on virus propagation or amplification of the viral DNA using the PCR or LAMP techniques. However, due to the limited susceptibility of cells used for propagation, it is not always possible to successfully isolate CyHV-3 even from tissue samples that have high virus titres. All previously described detection methods including PCR-based assays are time consuming, laborious and require specialized equipment. To overcome these limitations, gold nanoparticles (AuNPs) have been explored for direct and sensitive detection of DNA. In this study, a label-free colorimetric nanodiagnostic method for direct detection of unamplified CyHV-3 DNA using gold nanoparticles is introduced. Under appropriate conditions, DNA probes hybridize with their complementary target sequences in the sample DNA, which results in aggregation of the gold nanoparticles and a concomitant colour change from red to blue, whereas test samples with non complementary DNA sequences remain red. In this study, gold nanoparticles were used to develop and evaluate a specific and sensitive hybridization assay for direct and rapid detection of the highly infectious pathogen termed Cyprinid herpesvirus-3. Copyright © 2015 Elsevier B.V. All rights reserved.
Comparing Charge Transport in Oligonucleotides: RNA:DNA Hybrids and DNA Duplexes.

PubMed

Li, Yuanhui; Artés, Juan M; Qi, Jianqing; Morelan, Ian A; Feldstein, Paul; Anantram, M P; Hihath, Joshua

2016-05-19

Understanding the electronic properties of oligonucleotide systems is important for applications in nanotechnology, biology, and sensing systems. Here the charge-transport properties of guanine-rich RNA:DNA hybrids are compared to double-stranded DNA (dsDNA) duplexes with identical sequences. The conductance of the RNA:DNA hybrids is ∼10 times higher than the equivalent dsDNA, and conformational differences are determined to be the primary reason for this difference. The conductance of the RNA:DNA hybrids is also found to decrease more rapidly than dsDNA when the length is increased. Ab initio electronic structure and Green's function-based density of states calculations demonstrate that these differences arise because the energy levels are more spatially distributed in the RNA:DNA hybrid but that the number of accessible hopping sites is smaller. Taken together, these results indicate that a simple hopping model that treats each individual guanine as a hopping site is insufficient to explain both a higher conductance and β value for RNA:DNA hybrids, and larger delocalization lengths must be considered.
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi

Sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. To investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcane cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared them to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570: four haplotypes originated from S. officinarum, two from S. spontaneum, and one was recombinant. Sequence divergence among the homologous haplotypes, including retrotransposon insertions and sequence variations, ranged from 18.2 % to 60.5 % with an average of 33.7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum haplotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicated that the brown rust resistance gene in R570 is from S. spontaneum. 
Species-specific InDel, sequence similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE PAGES

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi; ...

2016-06-10

Sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. To investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcane cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared them to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570: four haplotypes originated from S. officinarum, two from S. spontaneum, and one was recombinant. Sequence divergence among the homologous haplotypes, including retrotransposon insertions and sequence variations, ranged from 18.2 % to 60.5 % with an average of 33.7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum haplotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicated that the brown rust resistance gene in R570 is from S. spontaneum. 
Species-specific InDel, sequence similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.
Optimization of the hybridization-based method for purification of thermostable tRNAs in the presence of tetraalkylammonium salts

PubMed Central

Yokogawa, Takashi; Kitamura, Yusuke; Nakamura, Daigo; Ohno, Satoshi; Nishikawa, Kazuya

2010-01-01

We found that both tetramethylammonium chloride (TMA-Cl) and tetraethylammonium chloride (TEA-Cl), which are used as monovalent cations for northern hybridization, drastically destabilized the tertiary structures of tRNAs and enhanced the formation of tRNA•oligoDNA hybrids. These effects are of great advantage for the hybridization-based method for purification of specific tRNAs from unfractionated tRNA mixtures through the use of an immobilized oligoDNA complementary to the target tRNA. Replacement of NaCl by TMA-Cl or TEA-Cl in the hybridization buffer greatly improved the recovery of a specific tRNA, even from unfractionated tRNAs derived from a thermophile. Since TEA-Cl destabilized tRNAs more strongly than TMA-Cl, it was necessary to lower the hybridization temperature when using TEA-Cl, at the expense of the purity of the recovered tRNA. Therefore, we propose two alternative protocols, depending on the desired properties of the tRNA to be purified. When the total recovery of the tRNA is important, hybridization should be carried out in the presence of TEA-Cl. However, if the purity of the recovered tRNA is important, TMA-Cl should be used for the hybridization. In principle, this procedure for tRNA purification should be applicable to any small-size RNA whose gene sequence is already known. PMID:20040572
Genomics and introgression: discovery and mapping of thousands of species-diagnostic SNPs using RAD sequencing

USGS Publications Warehouse

Hand, Brian K.; Hether, Tyler D; Kovach, Ryan P.; Muhlfeld, Clint C.; Amish, Stephen J.; Boyer, Matthew C.; O’Rourke, Sean M.; Miller, Michael R.; Lowe, Winsor H.; Hohenlohe, Paul A.; Luikart, Gordon

2015-01-01

Invasive hybridization and introgression pose a serious threat to the persistence of many native species. Understanding the effects of hybridization on native populations (e.g., fitness consequences) requires numerous species-diagnostic loci distributed genome-wide. Here we used RAD sequencing to discover thousands of single-nucleotide polymorphisms (SNPs) that are diagnostic between rainbow trout (RBT, Oncorhynchus mykiss), the world’s most widely introduced fish, and native westslope cutthroat trout (WCT, O. clarkii lewisi) in the northern Rocky Mountains, USA. We advanced previous work that identified 4,914 species-diagnostic loci by using longer sequence reads (100 bp vs. 60 bp) and a larger set of individuals (n = 84). We sequenced RAD libraries for individuals from diverse sampling sources, including native populations of WCT and hatchery broodstocks of WCT and RBT. We also took advantage of a newly released reference genome assembly for RBT to align our RAD loci. In total, we discovered 16,788 putatively diagnostic SNPs, 10,267 of which we mapped to anchored chromosome locations on the RBT genome. A small portion of previously discovered putative diagnostic loci (325 of 4,914) were no longer diagnostic (i.e., fixed between species) based on our wider survey of non-hybridized RBT and WCT individuals. Our study suggests that RAD loci mapped to a draft genome assembly could provide the marker density required to identify genes and chromosomal regions influencing selection in admixed populations of conservation concern and evolutionary interest.
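The notion of a species-diagnostic locus used above can be sketched as a simple filter. The sketch assumes the strict definition in which each species is fixed for a different allele among the sampled individuals; the locus names, alleles and data layout are hypothetical and do not come from the study.

```python
def diagnostic_loci(genotypes_a, genotypes_b):
    """Return IDs of loci at which the two species are fixed for different alleles.

    Each argument maps a locus ID to the list of alleles observed across
    the sampled individuals of one species.
    """
    diagnostic = []
    for locus in sorted(genotypes_a.keys() & genotypes_b.keys()):
        alleles_a = set(genotypes_a[locus])
        alleles_b = set(genotypes_b[locus])
        if len(alleles_a) == 1 and len(alleles_b) == 1 and alleles_a != alleles_b:
            diagnostic.append(locus)
    return diagnostic

# Hypothetical allele observations for three loci.
rbt = {"loc1": ["A", "A", "A"], "loc2": ["C", "T"], "loc3": ["G", "G"]}
wct = {"loc1": ["G", "G"], "loc2": ["T", "T"], "loc3": ["G", "G"]}
found = diagnostic_loci(rbt, wct)

# Sampling one more individual that carries the other species' allele
# removes the locus from the diagnostic set.
wct_wider = dict(wct, loc1=["G", "G", "A"])
found_wider = diagnostic_loci(rbt, wct_wider)
```

The second call illustrates why a wider survey of non-hybridized individuals can cause previously reported loci to drop out of the diagnostic set.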
Improved hybrid optimization algorithm for 3D protein structure prediction.

PubMed

Zhou, Changjun; Hou, Caixia; Wei, Xiaopeng; Zhang, Qiang

2014-07-01

A new improved hybrid optimization algorithm - PGATS algorithm, which is based on the toy off-lattice model, is presented for dealing with three-dimensional protein structure prediction problems. The algorithm combines the particle swarm optimization (PSO), genetic algorithm (GA), and tabu search (TS) algorithms. We also adopt several improved strategies. A stochastic disturbance factor is added to the particle swarm optimization to improve the search ability; the crossover and mutation operations of the genetic algorithm are changed to a kind of random linear method; finally, the tabu search algorithm is improved by appending a mutation operator. Through the combination of a variety of strategies and algorithms, protein structure prediction (PSP) in a 3D off-lattice model is achieved. The PSP problem is an NP-hard problem, but it can be treated as a global optimization problem with multiple extrema and multiple parameters. This is the theoretical principle of the hybrid optimization algorithm that is proposed in this paper. The algorithm combines local search and global search, which overcomes the shortcomings of a single algorithm and gives full play to the advantages of each algorithm. The algorithm is verified on the commonly used standard sequences, namely Fibonacci sequences and real protein sequences. Experiments show that the proposed new method outperforms single algorithms on the accuracy of calculating the protein sequence energy value, which demonstrates that it is an effective way to predict the structure of proteins.
Genomic in situ hybridization in interspecific hybrids of scallops (Bivalvia, Pectinidae) and localization of the satellite DNA Cf303, and the vertebrate telomeric sequences (TTAGGG)n on chromosomes of scallop Chlamys farreri (Jones & Preston, 1904)

PubMed Central

Hu, Liping; Jiang, Liming; Bi, Ke; Liao, Huan; Yang, Zujing; Huang, Xiaoting; Bao, Zhenmin

2018-01-01

Mitotic chromosome preparations of the interspecific hybrids Chlamys farreri (Jones & Preston, 1904) × Patinopecten yessoensis (Jay, 1857), C. farreri × Argopecten irradians (Lamarck, 1819) and C. farreri × Mimachlamys nobilis (Reeve, 1852) were used to compare two different scallop genomes in a single slide. Although genomic in situ hybridization (GISH) using genomic DNA from each scallop species as probe painted mitotic chromosomes of the interspecific hybrids, the painting results were not uniform; instead it showed species-specific distribution patterns of fluorescent signals among the chromosomes. The most prominent GISH-bands were mainly located at centromeric or telomeric regions of scallop chromosomes. In order to illustrate the sequence constitution of the GISH-bands, the satellite Cf303 sequences of C. farreri and the vertebrate telomeric (TTAGGG)n sequences were used to map mitotic chromosomes of C. farreri by fluorescence in situ hybridization (FISH). The results indicated that the GISH-banding pattern presented by the chromosomes of C. farreri is mainly due to the distribution of the satellite Cf303 DNA, therefore suggesting that the GISH-banding patterns found in the other three scallops could also be the result of the chromosomal distribution of other species-specific satellite DNAs. PMID:29675138
Genomic in situ hybridization in interspecific hybrids of scallops (Bivalvia, Pectinidae) and localization of the satellite DNA Cf303, and the vertebrate telomeric sequences (TTAGGG)n on chromosomes of scallop Chlamys farreri (Jones & Preston, 1904).

PubMed

Hu, Liping; Jiang, Liming; Bi, Ke; Liao, Huan; Yang, Zujing; Huang, Xiaoting; Bao, Zhenmin

2018-01-01

Mitotic chromosome preparations of the interspecific hybrids Chlamys farreri (Jones & Preston, 1904) × Patinopecten yessoensis (Jay, 1857), C. farreri × Argopecten irradians (Lamarck, 1819) and C. farreri × Mimachlamys nobilis (Reeve, 1852) were used to compare two different scallop genomes in a single slide. Although genomic in situ hybridization (GISH) using genomic DNA from each scallop species as probe painted mitotic chromosomes of the interspecific hybrids, the painting results were not uniform; instead it showed species-specific distribution patterns of fluorescent signals among the chromosomes. The most prominent GISH-bands were mainly located at centromeric or telomeric regions of scallop chromosomes. In order to illustrate the sequence constitution of the GISH-bands, the satellite Cf303 sequences of C. farreri and the vertebrate telomeric (TTAGGG)n sequences were used to map mitotic chromosomes of C. farreri by fluorescence in situ hybridization (FISH). The results indicated that the GISH-banding pattern presented by the chromosomes of C. farreri is mainly due to the distribution of the satellite Cf303 DNA, therefore suggesting that the GISH-banding patterns found in the other three scallops could also be the result of the chromosomal distribution of other species-specific satellite DNAs.
Unveiling the Hybrid Genome Structure of Escherichia coli RR1 (HB101 RecA+)

PubMed Central

Jeong, Haeyoung; Sim, Young Mi; Kim, Hyun Ju; Lee, Sang Jun

2017-01-01

There have been extensive genome sequencing studies for Escherichia coli strains, particularly for pathogenic isolates, because fast determination of pathogenic potential and/or drug resistance and their propagation routes is crucial. For laboratory E. coli strains, however, genome sequence information is limited except for several well-known strains. We determined the complete genome sequence of laboratory E. coli strain RR1 (HB101 RecA+), which has long been used as a general cloning host. A hybrid genome sequence of K-12 MG1655 and B BL21(DE3) was constructed based on the initial mapping of Illumina HiSeq reads to each reference, and iterative rounds of read mapping, variant detection, and consensus extraction were carried out. Finally, PCR and Sanger sequencing-based finishing were applied to resolve non-single nucleotide variant regions with aberrant read depths and breakpoints, most of them resulting from prophages and insertion sequence transpositions that are not present in the reference genome sequence. We found that 96.9% of the RR1 genome is derived from K-12, and identified exact crossover junctions between K-12 and B genomic fragments. However, because RR1 has experienced a series of genetic manipulations since branching from the common ancestor, it has a set of mutations different from those found in K-12 MG1655. As well as identifying all known genotypes of RR1 on the basis of genomic context, we found novel mutations. Our results extend current knowledge of the genotype of RR1 and its relatives, and provide insights into the pedigree, genomic background, and physiology of common laboratory strains. PMID:28421066
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization

PubMed Central

Girard, Laurie D.; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G.

2014-01-01

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have a high complexity cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique “target sequence-independent” 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets.
This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets. PMID:25489607

probeBase—an online resource for rRNA-targeted oligonucleotide probes and primers: new features 2016

PubMed Central

Greuter, Daniel; Loy, Alexander; Horn, Matthias; Rattei, Thomas

2016-01-01

probeBase http://www.probebase.net is a manually maintained and curated database of rRNA-targeted oligonucleotide probes and primers. Contextual information and multiple options for evaluating in silico hybridization performance against the most recent rRNA sequence databases are provided for each oligonucleotide entry, which makes probeBase an important and frequently used resource for microbiology research and diagnostics. Here we present a major update of probeBase, which was last featured in the NAR Database Issue 2007. This update describes a complete remodeling of the database architecture and environment to accommodate computationally efficient access. Improved search functions, sequence match tools and data output now extend the opportunities for finding suitable hierarchical probe sets that target an organism or taxon at different taxonomic levels. To facilitate the identification of complementary probe sets for organisms represented by short rRNA sequence reads generated by amplicon sequencing or metagenomic analysis with next generation sequencing technologies such as Illumina and IonTorrent, we introduce a novel tool that recovers surrogate near full-length rRNA sequences for short query sequences and finds matching oligonucleotides in probeBase. PMID:26586809
Improvements on a privacy-protection algorithm for DNA sequences with generalization lattices.

PubMed

Li, Guang; Wang, Yadong; Su, Xiaohong

2012-10-01

When developing personal DNA databases, there must be an appropriate guarantee of anonymity, which means that the data cannot be related back to individuals. DNA lattice anonymization (DNALA) is a successful method for making personal DNA sequences anonymous. However, it uses time-consuming multiple sequence alignment and a low-accuracy greedy clustering algorithm. Furthermore, DNALA is not an online algorithm, and so it cannot quickly return results when the database is updated. This study improves the DNALA method. Specifically, we replaced the multiple sequence alignment in DNALA with global pairwise sequence alignment to save time, and we designed a hybrid clustering algorithm comprised of a maximum weight matching (MWM)-based algorithm and an online algorithm. The MWM-based algorithm is more accurate than the greedy algorithm in DNALA and has the same time complexity. The online algorithm can process data quickly when the database is updated. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Identification of antisense nucleic acid hybridization sites in mRNA molecules with self-quenching fluorescent reporter molecules

PubMed Central

Gifford, Lida K.; Opalinska, Joanna B.; Jordan, David; Pattanayak, Vikram; Greenham, Paul; Kalota, Anna; Robbins, Michelle; Vernovsky, Kathy; Rodriguez, Lesbeth C.; Do, Bao T.; Lu, Ponzy; Gewirtz, Alan M.

2005-01-01

We describe a physical mRNA mapping strategy employing fluorescent self-quenching reporter molecules (SQRMs) that facilitates the identification of mRNA sequence accessible for hybridization with antisense nucleic acids in vitro and in vivo, real time. SQRMs are 20–30 base oligodeoxynucleotides with 5–6 bp complementary ends to which a 5′ fluorophore and 3′ quenching group are attached. Alone, the SQRM complementary ends form a stem that holds the fluorophore and quencher in contact. When the SQRM forms base pairs with its target, the structure separates the fluorophore from the quencher. This event can be reported by fluorescence emission when the fluorophore is excited. The stem–loop of the SQRM suggests that SQRM be made to target natural stem–loop structures formed during mRNA synthesis. The general utility of this method is demonstrated by SQRM identification of targetable sequence within c-myb and bcl-6 mRNA. Corresponding antisense oligonucleotides reduce these gene products in cells. PMID:15718294
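The SQRM geometry described above (a 20–30 base oligodeoxynucleotide whose 5–6 terminal bases are mutually complementary) can be checked mechanically. The sketch below is only an illustration of that check: the function names and the example sequence are invented here, and real probe design would also consider thermodynamics, which this code ignores.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def stem_length(oligo, max_stem=6):
    """Largest k <= max_stem for which the first k bases pair with the last k bases."""
    for k in range(min(max_stem, len(oligo) // 2), 0, -1):
        if oligo[:k] == reverse_complement(oligo[-k:]):
            return k
    return 0


def looks_like_sqrm(oligo):
    """True if length is 20-30 bases and the complementary ends span 5-6 bases."""
    return 20 <= len(oligo) <= 30 and stem_length(oligo) in (5, 6)


# Hypothetical probe: 6-base stem, 14-base loop, 6-base stem (26 bases in total).
probe = "GCGAGC" + "ATTTACCGTTAGCA" + "GCTCGC"
print(len(probe), stem_length(probe), looks_like_sqrm(probe))
```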
Design and implementation of a hybrid MPI-CUDA model for the Smith-Waterman algorithm.

PubMed

Khaled, Heba; Faheem, Hossam El Deen Mostafa; El Gohary, Rania

2015-01-01

This paper provides a novel hybrid model for solving the multiple pair-wise sequence alignment problem combining message passing interface and CUDA, the parallel computing platform and programming model invented by NVIDIA. The proposed model targets homogeneous cluster nodes equipped with similar Graphical Processing Unit (GPU) cards. The model consists of the Master Node Dispatcher (MND) and the Worker GPU Nodes (WGN). The MND distributes the workload among the cluster working nodes and then aggregates the results. The WGN performs the multiple pair-wise sequence alignments using the Smith-Waterman algorithm. We also propose a modified implementation to the Smith-Waterman algorithm based on computing the alignment matrices row-wise. The experimental results demonstrate a considerable reduction in the running time by increasing the number of the working GPU nodes. The proposed model achieved a performance of about 12 Giga cell updates per second when we tested against the SWISS-PROT protein knowledge base running on four nodes.
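To make the row-wise idea concrete, here is a minimal single-threaded Python sketch of Smith-Waterman scoring that fills the alignment matrix one row at a time and keeps only the previous row in memory. The scoring values (match 2, mismatch -1, linear gap -2) and the function name are assumptions for illustration; the abstract does not state the authors' scoring scheme, and this is not their MPI-CUDA implementation.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of a and b, computed row by row.

    Only two rows of the (len(a)+1) x (len(b)+1) matrix are held at once.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTACGTT", "GGACGGG"))
```

Each pair of sequences requires len(a) × len(b) cell updates, which is the quantity counted by the "cell updates per second" figure quoted in the abstract.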
An effective hybrid immune algorithm for solving the distributed permutation flow-shop scheduling problem

NASA Astrophysics Data System (ADS)

Xu, Ye; Wang, Ling; Wang, Shengyao; Liu, Min

2014-09-01

In this article, an effective hybrid immune algorithm (HIA) is presented to solve the distributed permutation flow-shop scheduling problem (DPFSP). First, a decoding method is proposed to transfer a job permutation sequence to a feasible schedule considering both factory dispatching and job sequencing. Secondly, a local search with four search operators is presented based on the characteristics of the problem. Thirdly, a special crossover operator is designed for the DPFSP, and mutation and vaccination operators are also applied within the framework of the HIA to perform an immune search. The influence of parameter setting on the HIA is investigated based on the Taguchi method of design of experiment. Extensive numerical testing results based on 420 small-sized instances and 720 large-sized instances are provided. The effectiveness of the HIA is demonstrated by comparison with some existing heuristic algorithms and the variable neighbourhood descent methods. New best known solutions are obtained by the HIA for 17 out of 420 small-sized instances and 585 out of 720 large-sized instances.
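The decoding step, which turns one job permutation into a feasible schedule across several factories, can be sketched as follows. The rule used here (append each job, in permutation order, to the factory where it would finish earliest) is a common greedy choice and is an assumption; the abstract does not specify the authors' decoding rule. All names and the processing-time matrix are hypothetical.

```python
def append_job(machine_ready, proc_times):
    """Machine completion times after appending one job to a factory's sequence."""
    done = []
    t = 0  # completion time of this job on the previous machine
    for ready, p in zip(machine_ready, proc_times):
        t = max(t, ready) + p  # wait for both the job and the machine
        done.append(t)
    return done


def decode(permutation, proc, n_factories):
    """Greedy decoding: each job goes to the factory giving the earliest finish.

    proc[j][m] is the processing time of job j on machine m; every factory is
    assumed to contain the same machines in the same order.
    """
    n_machines = len(proc[0])
    ready = [[0] * n_machines for _ in range(n_factories)]
    sequences = [[] for _ in range(n_factories)]
    for job in permutation:
        candidates = [append_job(ready[f], proc[job]) for f in range(n_factories)]
        f_best = min(range(n_factories), key=lambda f: candidates[f][-1])
        ready[f_best] = candidates[f_best]
        sequences[f_best].append(job)
    makespan = max(r[-1] for r in ready)
    return sequences, makespan


proc = [[3, 2], [1, 4], [2, 2], [4, 1]]  # 4 jobs, 2 machines (toy data)
print(decode([0, 1, 2, 3], proc, n_factories=2))
```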
Development and application of a recombination-based library versus library high-throughput yeast two-hybrid (RLL-Y2H) screening system.

PubMed

Yang, Fang; Lei, Yingying; Zhou, Meiling; Yao, Qili; Han, Yichao; Wu, Xiang; Zhong, Wanshun; Zhu, Chenghang; Xu, Weize; Tao, Ran; Chen, Xi; Lin, Da; Rahman, Khaista; Tyagi, Rohit; Habib, Zeshan; Xiao, Shaobo; Wang, Dang; Yu, Yang; Chen, Huanchun; Fu, Zhenfang; Cao, Gang

2018-02-16

Protein-protein interaction (PPI) network maintains proper function of all organisms. Simple high-throughput technologies are desperately needed to delineate the landscape of PPI networks. While recent state-of-the-art yeast two-hybrid (Y2H) systems improved screening efficiency, either individual colony isolation, library preparation arrays, gene barcoding or massive sequencing are still required. Here, we developed a recombination-based 'library vs library' Y2H system (RLL-Y2H), by which multi-library screening can be accomplished in a single pool without any individual treatment. This system is based on the phiC31 integrase-mediated integration between bait and prey plasmids. The integrated fragments were digested by MmeI and subjected to deep sequencing to decode the interaction matrix. We applied this system to decipher the trans-kingdom interactome between Mycobacterium tuberculosis and host cells and further identified Rv2427c interfering with the phagosome-lysosome fusion. This concept can also be applied to other systems to screen protein-RNA and protein-DNA interactions and delineate signaling landscape in cells.
Cloud-based adaptive exon prediction for DNA analysis.

PubMed

Putluri, Srinivasareddy; Zia Ur Rahman, Md; Fathima, Shaik Yasmeen

2018-02-01

Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequencing storage costs can be minimised by use of a cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. Accurate identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques are found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using the variable normalised least mean square algorithm and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of the various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database.
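The three-base periodicity mentioned above can be illustrated with a short Python sketch. Note that this is not one of the adaptive (normalised least mean square) predictors from the abstract; it is the simpler, widely used Fourier measure, which sums the spectral power of the four base-indicator sequences at frequency 1/3. The function name and the normalisation by sequence length are choices made for this example.

```python
import cmath


def period3_power(seq):
    """Spectral power at period 3, summed over the A, C, G and T indicator sequences.

    Large values suggest codon-like structure; values near zero suggest none.
    """
    n = len(seq)
    total = 0.0
    for base in "ACGT":
        # DFT coefficient of the 0/1 indicator sequence for this base at frequency 1/3.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k / 3)
            for k, symbol in enumerate(seq)
            if symbol == base
        )
        total += abs(coeff) ** 2
    return total / n


print(period3_power("ATG" * 20))  # strongly periodic toy sequence
print(period3_power("A" * 60))    # no period-3 signal
```

In practice the measure would be evaluated in a sliding window along the sequence and thresholded, with the window length and threshold being tuning parameters not given in the abstract.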
Input selection and performance optimization of ANN-based streamflow forecasts in the drought-prone Murray Darling Basin region using IIS and MODWT algorithm

NASA Astrophysics Data System (ADS)

Prasad, Ramendra; Deo, Ravinesh C.; Li, Yan; Maraseni, Tek

2017-11-01

Forecasting streamflow is vital for strategically planning, utilizing and redistributing water resources. In this paper, a wavelet-hybrid artificial neural network (ANN) model integrated with iterative input selection (IIS) algorithm (IIS-W-ANN) is evaluated for its statistical preciseness in forecasting monthly streamflow, and it is then benchmarked against M5 Tree model. To develop hybrid IIS-W-ANN model, a global predictor matrix is constructed for three local hydrological sites (Richmond, Gwydir, and Darling River) in Australia's agricultural (Murray-Darling) Basin. Model inputs comprised of statistically significant lagged combination of streamflow water level, are supplemented by meteorological data (i.e., precipitation, maximum and minimum temperature, mean solar radiation, vapor pressure and evaporation) as the potential model inputs. To establish robust forecasting models, iterative input selection (IIS) algorithm is applied to screen the best data from the predictor matrix and is integrated with the non-decimated maximum overlap discrete wavelet transform (MODWT) applied on the IIS-selected variables. This resolved the frequencies contained in predictor data while constructing a wavelet-hybrid (i.e., IIS-W-ANN and IIS-W-M5 Tree) model. Forecasting ability of IIS-W-ANN is evaluated via correlation coefficient (r), Willmott's Index (WI), Nash-Sutcliffe Efficiency (ENS), root-mean-square-error (RMSE), and mean absolute error (MAE), including the percentage RMSE and MAE. While ANN models are seen to outperform M5 Tree executed for all hydrological sites, the IIS variable selector was efficient in determining the appropriate predictors, as stipulated by the better performance of the IIS coupled (ANN and M5 Tree) models relative to the models without IIS. When IIS-coupled models are integrated with MODWT, the wavelet-hybrid IIS-W-ANN and IIS-W-M5 Tree are seen to attain significantly accurate performance relative to their standalone counterparts. 
Importantly, IIS-W-ANN model accuracy outweighs IIS-ANN, as evidenced by a larger r and WI (by 7.5% and 3.8%, respectively) and a lower RMSE (by 21.3%). In comparison to the IIS-W-M5 Tree model, IIS-W-ANN model yielded larger values of WI = 0.936-0.979 and ENS = 0.770-0.920. Correspondingly, the errors (RMSE and MAE) ranged from 0.162-0.487 m and 0.139-0.390 m, respectively, with relative errors, RRMSE = (15.65-21.00) % and MAPE = (14.79-20.78) %. Distinct geographic signature is evident where the most and least accurately forecasted streamflow data is attained for the Gwydir and Darling River, respectively. Conclusively, this study advocates the efficacy of iterative input selection, allowing the proper screening of model predictors, and subsequently, its integration with MODWT resulting in enhanced performance of the models applied in streamflow forecasting.
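For reference, the evaluation measures named in this abstract can be computed as in the sketch below, which uses the standard textbook definitions of the correlation coefficient, Willmott's Index, Nash-Sutcliffe Efficiency, RMSE and MAE. The function name and the toy numbers are assumptions for illustration, and the percentage (relative) variants reported in the abstract are omitted.

```python
import math


def forecast_metrics(obs, pred):
    """Return r, WI, ENS, RMSE and MAE for paired observed and forecasted values."""
    n = len(obs)
    mean_o = sum(obs) / n
    mean_p = sum(pred) / n
    sse = sum((o - p) ** 2 for o, p in zip(obs, pred))
    var_o = sum((o - mean_o) ** 2 for o in obs)
    var_p = sum((p - mean_p) ** 2 for p in pred)
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    return {
        "r": cov / math.sqrt(var_o * var_p),
        # Willmott's index of agreement.
        "WI": 1 - sse / sum((abs(p - mean_o) + abs(o - mean_o)) ** 2
                            for o, p in zip(obs, pred)),
        # Nash-Sutcliffe efficiency.
        "ENS": 1 - sse / var_o,
        "RMSE": math.sqrt(sse / n),
        "MAE": sum(abs(o - p) for o, p in zip(obs, pred)) / n,
    }


# Toy example: a forecast with a constant bias of +1.
metrics = forecast_metrics([1.0, 2.0, 3.0, 4.0], [2.0, 3.0, 4.0, 5.0])
print(metrics)
```

The toy example shows why several measures are reported together: a constant bias leaves r at 1 while WI and ENS drop and RMSE and MAE become non-zero.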
Experimental evidence for the ancestry of allotetraploid Trifolium repens and creation of synthetic forms with value for plant breeding.

PubMed

Williams, Warren M; Ellison, Nicholas W; Ansari, Helal A; Verry, Isabelle M; Hussain, S Wajid

2012-04-24

White clover (Trifolium repens) is a ubiquitous weed of the temperate world that through use of improved cultivars has also become the most important legume of grazed pastures world-wide. It has long been suspected to be allotetraploid, but the diploid ancestral species have remained elusive. Putative diploid ancestors were indicated by DNA sequence phylogeny to be T. pallescens and T. occidentale. Here, we use further DNA evidence as well as a combination of molecular cytogenetics (FISH and GISH) and experimental hybridization to test the hypothesis that white clover originated as a hybrid between T. pallescens and T. occidentale. T. pallescens plants were identified with chloroplast trnL intron DNA sequences identical to those of white clover. Similarly, T. occidentale plants with nuclear ITS sequences identical to white clover were also identified. Reciprocal GISH experiments, alternately using labeled genomic DNA probes from each of the putative ancestral species on the same white clover cells, showed that half of the chromosomes hybridized with each probe. F1 hybrids were generated by embryo rescue and these showed strong interspecific chromosome pairing and produced a significant frequency of unreduced gametes, indicating the likely mode of polyploidization. The F1 hybrids are inter-fertile with white clover and function as synthetic white clovers, a valuable new resource for the re-incorporation of ancestral genomes into modern white clover for future plant breeding. Evidence from DNA sequence analyses, molecular cytogenetics, interspecific hybridization and breeding experiments supports the hypothesis that a diploid alpine species (T. pallescens) hybridized with a diploid coastal species (T. occidentale) to generate tetraploid T. repens. The coming together of these two narrowly adapted species (one alpine and the other maritime), along with allotetraploidy, has led to a transgressive hybrid with a broad adaptive range.
Genome evolution in alpine oat-like grasses through homoploid hybridization and polyploidy

PubMed Central

Winterfeld, Grit; Wölk, Alexandra; Röser, Martin

2016-01-01

Hybridization and polyploidization can radically impact genome organization from sequence level to chromosome structure. As a result, often in response to environmental change and species isolation, the development of novel traits can arise and will tend to result in the formation of homoploid or polyploid hybrid species. In this study we focus on evidence of hybridization and polyploidization by ascertaining the species parentage of the endemic alpine Helictotrichon parlatorei group. This group comprises five taxa; the diploids H. parlatorei, Helictotrichon setaceum subsp. setaceum and subsp. petzense, their putative hybrid Helictotrichon ×krischae and the hexaploid Helictotrichon sempervirens. For molecular analyses, cloned nuclear Topoisomerase VI genes of H. sempervirens and H. ×krischae were sequenced and compared with sequences of the diploids to estimate the evolutionary history in this group. In addition, detailed chromosome studies were carried out including fluorescence in situ hybridization (FISH) with 5S and 45S ribosomal and satellite DNA probes, and fluorochrome staining with chromomycin and DAPI. Two distinct types of Topoisomerase VI sequences were identified. One of them (SET) occurs in both subspecies of H. setaceum, the other (PAR) in H. parlatorei. Both types were found in H. ×krischae and H. sempervirens. Karyotypes of H. parlatorei and H. setaceum could be distinguished by chromosomes with a clearly differentiated banding pattern of ribosomal DNAs. Both patterns occurred in the hybrid H. ×krischae. Hexaploid H. sempervirens shares karyotype features with diploid H. parlatorei, but lacks the expected chromosome characteristics of H. setaceum, possibly an example of beginning diploidization after polyploidization. The geographic origin of the putative parental species and their hybrids and the possible biogeographical spread through the Alps are discussed. PMID:27255513
Toward a Predictive Model of Community College Student Success in Blended Classes

ERIC Educational Resources Information Center

Volchok, Edward

2018-01-01

This retrospective study evaluates early semester predictors of whether or not community college students will successfully complete blended or hybrid courses. These predictors are available to faculty by the fourth week of the semester. Success is defined as receiving a grade of C- or higher. Failure is defined as a grade below a C- or a…
DNA Sequencing by Capillary Electrophoresis

PubMed Central

Karger, Barry L.; Guttman, Andras

2009-01-01

Sequencing of human and other genomes has been at the center of interest in the biomedical field over the past several decades and is now leading toward an era of personalized medicine. During this time, DNA sequencing methods have evolved from the labor intensive slab gel electrophoresis, through automated multicapillary electrophoresis systems using fluorophore labeling with multispectral imaging, to the “next generation” technologies of cyclic array, hybridization based, nanopore and single molecule sequencing. Deciphering the genetic blueprint and follow-up confirmatory sequencing of Homo sapiens and other genomes was only possible by the advent of modern sequencing technologies that was a result of step by step advances with a contribution of academics, medical personnel and instrument companies. While next generation sequencing is moving ahead at break-neck speed, the multicapillary electrophoretic systems played an essential role in the sequencing of the Human Genome, the foundation of the field of genomics. In this prospective, we wish to overview the role of capillary electrophoresis in DNA sequencing based in part of several of our articles in this journal. PMID:19517496
Methylobacterium phyllosphaerae sp. nov., a pink-pigmented, facultative methylotroph from the phyllosphere of rice.

PubMed

Madhaiyan, Munusamy; Poonguzhali, Selvaraj; Kwon, Soon-Wo; Sa, Tong-Min

2009-01-01

A pink-pigmented, aerobic, facultatively methylotrophic bacterial strain, CBMB27T, isolated from leaf tissues of rice (Oryza sativa L. 'Dong-Jin'), was analysed using a polyphasic taxonomic approach. Comparative 16S rRNA gene sequence-based phylogenetic analysis placed the strain in a clade with the species Methylobacterium oryzae, Methylobacterium fujisawaense and Methylobacterium mesophilicum; strain CBMB27T showed sequence similarities of 98.3, 98.5 and 97.3 %, respectively, to the type strains of these three species. DNA-DNA hybridization experiments revealed low levels (<38 %) of DNA-DNA relatedness between strain CBMB27T and its closest relatives. The sequence of the 1-aminocyclopropane-1-carboxylate deaminase gene (acdS) in strain CBMB27T differed from those of close relatives. The major fatty acid of the isolate was C18:1ω7c and the G+C content of the genomic DNA was 66.8 mol%. Based on the results of 16S rRNA gene sequence analysis, DNA-DNA hybridization, and physiological and biochemical characterization, which enabled the isolate to be differentiated from all recognized species of the genus Methylobacterium, it was concluded that strain CBMB27T represents a novel species in the genus Methylobacterium for which the name Methylobacterium phyllosphaerae sp. nov. is proposed (type strain CBMB27T = LMG 24361T = KACC 11716T = DSM 19779T).
Sequence signatures of allosteric proteins towards rational design.

PubMed

Namboodiri, Saritha; Verma, Chandra; Dhar, Pawan K; Giuliani, Alessandro; Nair, Achuthsankar S

2010-12-01

Allostery is the phenomenon of changes in the structure and activity of proteins that appear as a consequence of ligand binding at sites other than the active site. Studying mechanistic basis of allostery leading to protein design with predetermined functional endpoints is an important unmet need of synthetic biology. Here, we screened the amino acid sequence landscape in search of sequence-signatures of allostery using Recurrence Quantitative Analysis (RQA) method. A characteristic vector, comprised of 10 features extracted from RQA was defined for amino acid sequences. Using Principal Component Analysis, four factors were found to be important determinants of allosteric behavior. Our sequence-based predictor method shows 82.6% accuracy, 85.7% sensitivity and 77.9% specificity with the current dataset. Further, we show that Laminarity-Mean-hydrophobicity representing repeated hydrophobic patches is the most crucial indicator of allostery. To our best knowledge this is the first report that describes sequence determinants of allostery based on hydrophobicity. As an outcome of these findings, we plan to explore possibility of inducing allostery in proteins.
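The accuracy, sensitivity and specificity figures quoted above follow from a binary confusion matrix in the usual way. The sketch below shows the definitions; the counts in the example are invented for illustration and are not the study's data.

```python
def classification_metrics(tp, fn, tn, fp):
    """Accuracy, sensitivity and specificity from confusion-matrix counts.

    tp/fn: allosteric proteins predicted correctly / missed;
    tn/fp: non-allosteric proteins predicted correctly / wrongly flagged.
    """
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }


# Hypothetical counts.
scores = classification_metrics(tp=6, fn=1, tn=7, fp=2)
print(scores)
```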
Hybridization experiments indicate incomplete reproductive isolating mechanism between Fasciola hepatica and Fasciola gigantica.

PubMed

Itagaki, T; Ichinomiya, M; Fukuda, K; Fusyuku, S; Carmona, C

2011-09-01

Experiments on hybridization between Fasciola hepatica and Fasciola gigantica were carried out to clarify whether a reproductive isolating mechanism appears between the two Fasciola species. Molecular evidence for hybridization was based on the DNA sequence of the internal transcribed spacer 1 (ITS1) region in nuclear ribosomal DNA, which differs between the species. The results suggested that there were not pre-mating but post-mating isolating mechanisms between the two species. However, viable adults of the hybrids F1 and F2 were produced from both parental F. hepatica and F. gigantica. The hybrids inherited phenotypic characteristics such as ratio of body length and width and infectivity to rats from parental Fasciola hepatica and F. gigantica. These findings suggest that reproductive isolation is incomplete between Fasciola hepatica and F. gigantica. Adults of the hybrids F1 and F2 were completely different in mode of reproduction from aspermic Fasciola forms that occur in Asia and seem to be offspring originated from hybridization between F. hepatica and F. gigantica and to reproduce parthenogenetically.
Effect of Backbone Design on Hybridization Thermodynamics of Oligo-nucleic Acids: A Coarse-Grained Molecular Dynamics Simulation Study

NASA Astrophysics Data System (ADS)

Ghobadi, Ahmadreza F.; Jayaraman, Arthi

DNA hybridization is the basis of various bio-nano technologies, such as DNA origami and assembly of DNA-functionalized nanoparticles. A hybridized double stranded (ds) DNA is formed when complementary nucleobases on hybridizing strands exhibit specific and directional hydrogen bonds through canonical Watson-Crick base-pairing interactions. In recent years, the need for cheaper alternatives and significant synthetic advances have driven design of DNA mimics with new backbone chemistries. However, a fundamental understanding of how these backbone modifications in the oligo-nucleic acids impact the hybridization and melting behavior of the duplex is still lacking. In this talk, we present our recent findings on the impact of varying backbone chemistry on hybridization of oligo-nucleic acid duplexes. We use coarse-grained molecular dynamics simulations to isolate the effect of strand flexibility, electrostatic interactions and nucleobase spacing on the melting curves for duplexes with various strand sequences and concentrations. Since conjugates of oligo-nucleic acids with polymers serve as building blocks for thermo-responsive polymer networks and gels, we also present the effect of such conjugation on hybridization thermodynamics and polymer conformation.
Reassociation and hybridization properties of DNAs from several species of fish

USGS Publications Warehouse

Gharrett, A.J.; Simon, R.C.; McIntyre, J.D.

1977-01-01

Reassociation and hybridization properties from spectrophotometric studies of DNAs from 10 species of fish indicate:
1. Great diversity in the amounts of repeated sequences in the genomes of different species - more specialized fish had less redundancy.
2. Large differences in the complexities of the DNAs - more specialized fish had less information.
3. Little homology between sequences of remotely related species but substantial homology between sequences of closely related species.
Ancestral chloroplast polymorphism and historical secondary contact in a broad hybrid zone of Aesculus (Sapindaceae).

PubMed

Modliszewski, Jennifer L; Thomas, David T; Fan, Chuanzhu; Crawford, Daniel J; Depamphilis, Claude W; Xiang, Qiu-Yun Jenny

2006-03-01

Knowledge regarding the origin and maintenance of hybrid zones is critical for understanding the evolutionary outcomes of natural hybridization. To evaluate the contribution of historical contact vs. long-distance gene flow in the formation of a broad hybrid zone in central and northern Georgia that involves Aesculus pavia, A. sylvatica, and A. flava, three cpDNA regions (matK, trnD-trnT, and trnH-trnK) were analyzed. The maternal inheritance of cpDNA in Aesculus was confirmed via sequencing of matK from progeny of controlled crosses. Restriction site analyses identified 21 unique haplotypes among 248 individuals representing 29 populations from parental species and hybrids. Haplotypes were sequenced for all cpDNA regions. Restriction site and sequence data were subjected to phylogeographic and population genetic analyses. Considerable cpDNA variation was detected in the hybrid zone, as well as ancestral cpDNA polymorphism; furthermore, the distribution of haplotypes indicates limited interpopulation gene flow via seeds. The genealogy and structure of genetic variation further support the historical presence of A. pavia in the Piedmont, although they are at present locally extinct. In conjunction with previous allozyme studies, the cpDNA data suggest that the hybrid zone originated through historical local gene flow, yet is maintained by periodic long-distance pollen dispersal.
Molecular Evidence for a Natural Primary Triple Hybrid in Plants Revealed from Direct Sequencing

PubMed Central

Kaplan, Zdenek; Fehrer, Judith

2007-01-01

Background and Aims Molecular evidence for natural primary hybrids composed of three different plant species is very rarely reported. An investigation was therefore carried out into the origin and a possible scenario for the rise of a sterile plant clone showing a combination of diagnostic morphological features of three separate, well-defined Potamogeton species. Methods The combination of sequences from maternally inherited cytoplasmic (rpl20-rps12) and biparentally inherited nuclear ribosomal DNA (ITS) was used to identify the exact identity of the putative triple hybrid. Key Results Direct sequencing showed ITS variants of three parental taxa, P. gramineus, P. lucens and P. perfoliatus, whereas chloroplast DNA identified P. perfoliatus as the female parent. A scenario for the rise of the triple hybrid through a fertile binary hybrid P. gramineus × P. lucens crossed with P. perfoliatus is described. Conclusions Even though the triple hybrid is sterile, it possesses an efficient strategy for its existence and became locally successful even in the parental environment, perhaps as a result of heterosis. The population investigated is the only one known of this hybrid, P. × torssanderi, worldwide. Isozyme analysis indicated the colony to be genetically uniform. The plants studied represented a single clone that seems to have persisted at this site for a long time. PMID:17478544
Dissecting whole-genome sequencing-based online tools for predicting resistance in Mycobacterium tuberculosis: can we use them for clinical decision guidance?

PubMed

Macedo, Rita; Nunes, Alexandra; Portugal, Isabel; Duarte, Sílvia; Vieira, Luís; Gomes, João Paulo

2018-05-01

Whole-genome sequencing (WGS)-based bioinformatics platforms for the rapid prediction of resistance will soon be implemented in the Tuberculosis (TB) laboratory, but their accuracy assessment still needs to be strengthened. Here, we fully-sequenced a total of 54 multidrug-resistant (MDR) and five susceptible TB strains and performed, for the first time, a simultaneous evaluation of the major four free online platforms (TB Profiler, PhyResSE, Mykrobe Predictor and TGS-TB). Overall, the sensitivity of resistance prediction ranged from 84.3% using Mykrobe predictor to 95.2% using TB profiler, while specificity was higher and homogeneous among platforms. TB profiler revealed the best performance robustness (sensitivity, specificity, PPV and NPV above 95%), followed by TGS-TB (all parameters above 90%). We also observed a few discrepancies between phenotype and genotype, where, in some cases, it was possible to pin-point some "candidate" mutations (e.g., in the rpsL promoter region) highlighting the need for their confirmation through mutagenesis assays and potential review of the anti-TB genetic databases. The rampant development of the bioinformatics algorithms and the tremendously reduced time-frame until the clinician may decide for a definitive and most effective treatment will certainly trigger the technological transition where WGS-based bioinformatics platforms could replace phenotypic drug susceptibility testing for TB. Copyright © 2018 Elsevier Ltd. All rights reserved.
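The four performance measures named above all derive from a 2×2 table of predicted versus phenotypic resistance. The following sketch shows the arithmetic; the function name and the counts in the usage example are hypothetical and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Return sensitivity, specificity, PPV and NPV from confusion-matrix counts.

    tp: resistant by phenotype and predicted resistant
    fp: susceptible by phenotype but predicted resistant
    tn: susceptible by phenotype and predicted susceptible
    fn: resistant by phenotype but predicted susceptible
    """
    def ratio(num, den):
        # Undefined ratios (empty denominator) are reported as NaN.
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for name, value in diagnostic_metrics(tp=90, fp=5, tn=45, fn=10).items():
        print(f"{name}: {value:.3f}")
```

Because PPV and NPV depend on how many resistant and susceptible strains are in the test set, values computed on a panel dominated by MDR strains may not carry over to a population with a different resistance prevalence.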

Becoming pure: identifying generational classes of admixed individuals within lesser and greater scaup populations.

PubMed

Lavretsky, Philip; Peters, Jeffrey L; Winker, Kevin; Bahn, Volker; Kulikova, Irina; Zhuravlev, Yuri N; Wilson, Robert E; Barger, Chris; Gurney, Kirsty; McCracken, Kevin G

2016-02-01

Estimating the frequency of hybridization is important to understand its evolutionary consequences and its effects on conservation efforts. In this study, we examined the extent of hybridization in two sister species of ducks that hybridize. We used mitochondrial control region sequences and 3589 double-digest restriction-associated DNA sequences (ddRADseq) to identify admixture between wild lesser scaup (Aythya affinis) and greater scaup (A. marila). Among 111 individuals, we found one introgressed mitochondrial DNA haplotype in lesser scaup and four in greater scaup. Likewise, based on the site-frequency spectrum from autosomal DNA, gene flow was asymmetrical, with higher rates from lesser into greater scaup. However, using ddRADseq nuclear DNA, all individuals were assigned to their respective species with >0.95 posterior assignment probability. To examine the power for detecting admixture, we simulated a breeding experiment in which empirical data were used to create F1 hybrids and nine generations (F2-F10) of backcrossing. F1 hybrids and F2, F3 and most F4 backcrosses were clearly distinguishable from pure individuals, but evidence of admixed histories was effectively lost after the fourth generation. Thus, we conclude that low interspecific assignment probabilities (0.011-0.043) for two lesser and nineteen greater scaup were consistent with admixed histories beyond the F3 generation. These results indicate that the propensity of these species to hybridize in the wild is low and largely asymmetric. When applied to species-specific cases, our approach offers powerful utility for examining concerns of hybridization in conservation efforts, especially for determining the generational time until admixed histories are effectively lost through backcrossing. © 2015 John Wiley & Sons Ltd.
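One way to see why admixture signal fades so quickly under backcrossing is to track the expected share of the genome inherited from the other species. The sketch below assumes simple neutral Mendelian inheritance with every backcross made to the same parental species, and it follows the abstract's labelling in which F1 is the initial hybrid and F2 onward are successive backcross generations; the function name is hypothetical, and the study's own simulations used empirical genotype data rather than this expectation.

```python
def expected_donor_ancestry(generation):
    """Expected fraction of donor-species ancestry in generation F<generation>.

    F1 carries one half; each further backcross to the recipient species
    halves the expectation (an idealized model, ignoring variance and selection).
    """
    if generation < 1:
        raise ValueError("generation must be >= 1 (F1 is the first hybrid)")
    return 0.5 ** generation


if __name__ == "__main__":
    for g in range(1, 11):
        print(f"F{g}: {expected_donor_ancestry(g):.4f}")
```

Under these assumptions the expected donor share falls below 5% by the fifth generation, which is at least in line with the reported loss of detectable admixture after the fourth generation.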
Genomic characterization and taxonomic position of a rhabdovirus from a hybrid snakehead.

PubMed

Zeng, Weiwei; Wang, Qing; Wang, Yingying; Liu, Cun; Liang, Hongru; Fang, Xiang; Wu, Shuqin

2014-09-01

A new rhabdovirus, tentatively designated as hybrid snakehead rhabdovirus C1207 (HSHRV-C1207), was first isolated from a moribund hybrid snakehead (Channa maculata×Channa argus) in China. We present the complete genome sequence of HSHRV-C1207 and a comprehensive sequence comparison between HSHRV-C1207 and other rhabdoviruses. Sequence alignment and phylogenetic analysis revealed that HSHRV-C1207 shared the highest degree of homology with Monopterus albus rhabdovirus and Siniperca chuatsi rhabdovirus. All three viruses clustered into a single group that was distinct from the recognized genera in the family Rhabdoviridae. Our analysis suggests that HSHRV-C1207, as well as MARV and SCRV, should be assigned to a new rhabdovirus genus.
Identification of the genomic locus for the human Rieske Fe-S Protein gene on Chromosome 19q12

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pennacchio, L.A.

1994-05-06

We have identified the chromosomal location of the human Rieske Iron-Sulfur Protein (UQCRFS1) gene. Mapping by hybridization to a panel of monochromosomal hybrid cell lines indicated that the gene was either on chromosome 19 or 22. By screening a human chromosome 19 specific genomic cosmid library with an oligonucleotide probe made from the published Rieske cDNA sequence, we identified a corresponding cosmid. Portions of this cosmid were sequenced directly. The exon, exon:intron junction, and flanking sequences verified that this cosmid contains the genomic locus. Fluorescent in situ hybridization (FISH) was performed to localize this cosmid to chromosome band 19q12.
SSHscreen and SSHdb, generic software for microarray based gene discovery: application to the stress response in cowpea

PubMed Central

2010-01-01

Background Suppression subtractive hybridization is a popular technique for gene discovery from non-model organisms without an annotated genome sequence, such as cowpea (Vigna unguiculata (L.) Walp). We aimed to use this method to enrich for genes expressed during drought stress in a drought tolerant cowpea line. However, current methods were inefficient in screening libraries and management of the sequence data, and thus there was a need to develop software tools to facilitate the process. Results Forward and reverse cDNA libraries enriched for cowpea drought response genes were screened on microarrays, and the R software package SSHscreen 2.0.1 was developed (i) to normalize the data effectively using spike-in control spot normalization, and (ii) to select clones for sequencing based on the calculation of enrichment ratios with associated statistics. Enrichment ratio 3 values for each clone showed that 62% of the forward library and 34% of the reverse library clones were significantly differentially expressed by drought stress (adjusted p value < 0.05). Enrichment ratio 2 calculations showed that > 88% of the clones in both libraries were derived from rare transcripts in the original tester samples, thus supporting the notion that suppression subtractive hybridization enriches for rare transcripts. A set of 118 clones were chosen for sequencing, and drought-induced cowpea genes were identified, the most interesting encoding a late embryogenesis abundant Lea5 protein, a glutathione S-transferase, a thaumatin, a universal stress protein, and a wound induced protein. A lipid transfer protein and several components of photosynthesis were down-regulated by the drought stress. Reverse transcriptase quantitative PCR confirmed the enrichment ratio values for the selected cowpea genes. SSHdb, a web-accessible database, was developed to manage the clone sequences and combine the SSHscreen data with sequence annotations derived from BLAST and Blast2GO. 
The self-BLAST function within SSHdb grouped redundant clones together and illustrated that the SSHscreen plots are a useful tool for choosing anonymous clones for sequencing, since redundant clones cluster together on the enrichment ratio plots. Conclusions We developed the SSHscreen-SSHdb software pipeline, which greatly facilitates gene discovery using suppression subtractive hybridization by improving the selection of clones for sequencing after screening the library on a small number of microarrays. Annotation of the sequence information and collaboration was further enhanced through a web-based SSHdb database, and we illustrated this through identification of drought responsive genes from cowpea, which can now be investigated in gene function studies. SSH is a popular and powerful gene discovery tool, and therefore this pipeline will have application for gene discovery in any biological system, particularly non-model organisms. SSHscreen 2.0.1 and a link to SSHdb are available from http://microarray.up.ac.za/SSHscreen. PMID:20359330
A Novel Botulinum Neurotoxin, Previously Reported as Serotype H, Has a Hybrid-Like Structure With Regions of Similarity to the Structures of Serotypes A and F and Is Neutralized With Serotype A Antitoxin

PubMed Central

Maslanka, Susan E.; Lúquez, Carolina; Dykes, Janet K.; Tepp, William H.; Pier, Christina L.; Pellett, Sabine; Raphael, Brian H.; Kalb, Suzanne R.; Barr, John R.; Rao, Agam; Johnson, Eric A.

2016-01-01

Botulism is a potentially fatal paralytic disease caused by the action of botulinum neurotoxin (BoNT) on nerve cells. There are 7 known serotypes (A–G) of BoNT and up to 40 genetic variants. Clostridium botulinum strain IBCA10-7060 was recently reported to produce BoNT serotype B (BoNT/B) and a novel BoNT, designated as BoNT/H. The BoNT gene (bont) sequence of BoNT/H was compared to known bont sequences. Genetic analysis suggested that BoNT/H has a hybrid-like structure containing regions of similarity to the structures of BoNT/A1 and BoNT/F5. This novel BoNT was serologically characterized by the mouse neutralization assay and a neuronal cell–based assay. The toxic effects of this hybrid-like BoNT were completely eliminated by existing serotype A antitoxins, including those contained in multivalent therapeutic antitoxin products that are the mainstay of human botulism treatment. PMID:26068781
Molecular systematics of European Hyalodaphnia: the role of contemporary hybridization in ancient species.

PubMed Central

Schwenk, K; Posada, D; Hebert, P D

2000-01-01

We examined phylogenetic relationships among Daphnia using mitochondrial DNA (mtDNA) sequences from the small subunit ribosomal RNA (12S), cytochrome c oxidase subunit I and nuclear DNA sequences from the first and second internal transcribed spacer representing 1612 base positions. Phylogenetic analyses using several species of the three main Daphnia subgenera, Ctenodaphnia, Hyalodaphnia and Daphnia, revealed that the Hyalodaphnia are a monophyletic sister group of the Daphnia. Most Hyalodaphnia species occur on one continent, whereas only three are found in North America and Europe. Endemicity of species is associated with variation in thermal tolerance and habitat differentiation. Although many species of the Hyalodaphnia are known to hybridize in nature, mtDNA divergence is relatively high (ca. 9%) compared to other hybridizing arthropods (ca. 3%). Reproductive isolation in Daphnia seems to evolve significantly slower than genetic isolation. We related these findings to what is known about the ecology and genetics of Daphnia in order to better understand the evolutionary diversification of lineages. The relationship of these data to phylogenetic patterns is discussed in the context of speciation processes in Daphnia. PMID:11052533
A CRISPR/molecular beacon hybrid system for live-cell genomic imaging.

PubMed

Wu, Xiaotian; Mao, Shiqi; Yang, Yantao; Rushdi, Muaz N; Krueger, Christopher J; Chen, Antony K

2018-04-30

The clustered regularly interspersed short palindromic repeat (CRISPR) gene-editing system has been repurposed for live-cell genomic imaging, but existing approaches rely on fluorescent protein reporters, making sensitive and continuous imaging difficult. Here, we present a fluorophore-based live-cell genomic imaging system that consists of a nuclease-deactivated mutant of the Cas9 protein (dCas9), a molecular beacon (MB), and an engineered single-guide RNA (sgRNA) harboring a unique MB target sequence (sgRNA-MTS), termed CRISPR/MB. Specifically, dCas9 and sgRNA-MTS are first co-expressed to target a specific locus in cells, followed by delivery of MBs that can then hybridize to MTS to illuminate the target locus. We demonstrated the feasibility of this approach for quantifying genomic loci, for monitoring chromatin dynamics, and for dual-color imaging when using two orthogonal MB/MTS pairs. With flexibility in selecting different combinations of fluorophore/quencher pairs and MB/MTS sequences, our CRISPR/MB hybrid system could be a promising platform for investigating chromatin activities.
Single-Step Conversion of Cells to Retrovirus Vector Producers with Herpes Simplex Virus–Epstein-Barr Virus Hybrid Amplicons

PubMed Central

Sena-Esteves, Miguel; Saeki, Yoshinaga; Camp, Sara M.; Chiocca, E. Antonio; Breakefield, Xandra O.

1999-01-01

We report here on the development and characterization of a novel herpes simplex virus type 1 (HSV-1) amplicon-based vector system which takes advantage of the host range and retention properties of HSV–Epstein-Barr virus (EBV) hybrid amplicons to efficiently convert cells to retrovirus vector producer cells after single-step transduction. The retrovirus genes gag-pol and env (GPE) and retroviral vector sequences were modified to minimize sequence overlap and cloned into an HSV-EBV hybrid amplicon. Retrovirus expression cassettes were used to generate the HSV-EBV-retrovirus hybrid vectors, HERE and HERA, which code for the ecotropic and the amphotropic envelopes, respectively. Retrovirus vector sequences encoding lacZ were cloned downstream from the GPE expression unit. Transfection of 293T/17 cells with amplicon plasmids yielded retrovirus titers between 10^6 and 10^7 transducing units/ml, while infection of the same cells with amplicon vectors generated maximum titers 1 order of magnitude lower. Retrovirus titers were dependent on the extent of transduction by amplicon vectors for the same cell line, but different cell lines displayed varying capacities to produce retrovirus vectors even at the same transduction efficiencies. Infection of human and dog primary gliomas with this system resulted in the production of retrovirus vectors for more than 1 week and the long-term retention and increase in transgene activity over time in these cell populations. Although the efficiency of this system still has to be determined in vivo, many applications are foreseeable for this approach to gene delivery. PMID:10559361
Independently segregating simple sequence repeats (SSR) alleles in polyploid sugarcane

USDA-ARS's Scientific Manuscript database

The complex nuclear genomic and flower structures of sugarcane cultivars (Saccharum hybrids spp., 2n = 10x = 100 – 130) render sugarcane a difficult subject for genetics research. Using a capillary electrophoresis- and fluorescence-labeling-based SSR genotyping platform, the segregation of a multi-a...
Efficient self-assembly of DNA-functionalized fluorophores and gold nanoparticles with DNA functionalized silicon surfaces: the effect of oligomer spacers

PubMed Central

Milton, James A.; Patole, Samson; Yin, Huabing; Xiao, Qiang; Brown, Tom; Melvin, Tracy

2013-01-01

Although strategies for the immobilization of DNA oligonucleotides onto surfaces for bioanalytical and top-down bio-inspired nanobiofabrication approaches are well developed, the effect of introducing spacer molecules between the surface and the DNA oligonucleotide for the hybridization of nanoparticle–DNA conjugates has not been previously assessed in a quantitative manner. The hybridization efficiency of DNA oligonucleotides end-labelled with gold nanoparticles (1.4 or 10 nm diameter) with DNA sequences conjugated to silicon surfaces via hexaethylene glycol phosphate diester oligomer spacers (0, 1, 2, 6 oligomers) was found to be independent of spacer length. To quantify both the density of DNA strands attached to the surfaces and hybridization with the surface-attached DNA, new methodologies have been developed. Firstly, a simple approach based on fluorescence has been developed for determination of the immobilization density of DNA oligonucleotides. Secondly, an approach using mass spectrometry has been created to establish (i) the mean number of DNA oligonucleotides attached to the gold nanoparticles and (ii) the hybridization density of nanoparticle–oligonucleotide conjugates with the silicon surface–attached complementary sequence. These methods and results will be useful for application with nanosensors, the self-assembly of nanoelectronic devices and the attachment of nanoparticles to biomolecules for single-molecule biophysical studies. PMID:23361467
Preparation and characterization of zinc oxide nanoparticles and their sensor applications for electrochemical monitoring of nucleic acid hybridization.

PubMed

Yumak, Tugrul; Kuralay, Filiz; Muti, Mihrican; Sinag, Ali; Erdem, Arzum; Abaci, Serdar

2011-09-01

In this study, ZnO nanoparticles (ZNP) of approximately 30 nm in size were synthesized by the hydrothermal method and characterized by X-ray diffraction (XRD), Brunauer-Emmett-Teller (BET) N2 adsorption analysis and transmission electron microscopy (TEM). ZnO nanoparticles enriched with poly(vinylferrocenium) (PVF+) modified single-use graphite electrodes were then developed for the electrochemical monitoring of nucleic acid hybridization related to the Hepatitis B Virus (HBV). Firstly, the surfaces of polymer modified and polymer-ZnO nanoparticle modified single-use pencil graphite electrodes (PGEs) were characterized using scanning electron microscopy (SEM). The electrochemical behavior of these electrodes was also investigated using differential pulse voltammetry (DPV) and electrochemical impedance spectroscopy (EIS). Subsequently, the polymer-ZnO nanoparticle modified PGEs were evaluated for the electrochemical detection of DNA based on the changes in the guanine oxidation signals. Various modifications in DNA oligonucleotides and probe concentrations were examined in order to optimize the electrochemical signals that were generated by means of nucleic acid hybridization. After the optimization studies, the sequence-selective DNA hybridization was investigated in the case of a complementary amino linked probe (target), or noncomplementary (NC) sequences, or target and mismatch (MM) mixture in the ratio of (1:1). Copyright © 2011 Elsevier B.V. All rights reserved.
Development of SSR Markers Linked to Low Hydrocyanic Acid Content in Sorghum-Sudan Grass Hybrid Based on BSA Method.

PubMed

Xiao-Xia, Yu; Zhi-Hua, Liu; Zhuo, Yu; Yue, Shi; Xiao-Yu, Li

2016-01-01

Sorghum-Sudan grass hybrid with high hydrocyanic acid content can cause hydrocyanic acid poisoning in livestock and limit the popularization of this forage crop. Molecular markers associated with low hydrocyanic acid content can speed up the process of identification of genotypes with low hydrocyanic acid content. In the present study, 11 polymorphic SSR primers were screened and used for bulked segregant analysis (BSA) and single marker analysis. Three SSR markers, Xtxp7230, Xtxp7375 and Bnlg667960, associated with low hydrocyanic acid content were rapidly identified by BSA. In single marker analysis, six markers, Xtxp7230, Xtxp7375, Bnlg667960, Xtxp67-11, Xtxp295-7 and Xtxp12-9, were linked to low hydrocyanic acid content and explained 7.6% to 41.2% of the phenotypic variation. The markers identified by BSA were also verified by single marker analysis. The three SSR marker bands were then cloned and sequenced for sequence homology analysis in NCBI. This is the first report on the development of molecular markers associated with low hydrocyanic acid content in sorghum-Sudan grass hybrid. These markers will be useful for genetic improvement of low hydrocyanic acid sorghum-Sudan grass hybrid by marker-assisted breeding.
Design of nucleic acid sequences for DNA computing based on a thermodynamic approach

PubMed Central

Tanaka, Fumiaki; Kameda, Atsushi; Yamamoto, Masahito; Ohuchi, Azuma

2005-01-01

We have developed an algorithm for designing multiple sequences of nucleic acids that have a uniform melting temperature between the sequence and its complement and that do not hybridize non-specifically with each other based on the minimum free energy (ΔGmin). Sequences that satisfy these constraints can be utilized in computations, various engineering applications such as microarrays, and nano-fabrications. Our algorithm is a random generate-and-test algorithm: it generates a candidate sequence randomly and tests whether the sequence satisfies the constraints. The novelty of our algorithm is that the filtering method uses a greedy search to calculate ΔGmin. This effectively excludes inappropriate sequences before ΔGmin is calculated, thereby reducing computation time drastically when compared with an algorithm without the filtering. Experimental results in silico showed the superiority of the greedy search over the traditional approach based on the Hamming distance. In addition, experimental results in vitro demonstrated that the experimental free energy (ΔGexp) of 126 sequences correlated better with ΔGmin (|R| = 0.90) than with the Hamming distance (|R| = 0.80). These results validate the rationality of a thermodynamic approach. We implemented our algorithm in a graphic user interface-based program written in Java. PMID:15701762
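The random generate-and-test loop described above can be sketched in a few lines. This is only a structural illustration: the melting temperature is estimated with the simple Wallace rule, and cross-hybridization is screened with a Hamming-distance check, which the abstract itself describes as the traditional approach that the authors' ΔGmin-based test outperformed. The function names, parameters and thresholds are all hypothetical, and the authors' greedy ΔGmin filter is not reproduced here.

```python
import random

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def wallace_tm(seq):
    """Wallace-rule estimate: 2 degrees C per A/T plus 4 degrees C per G/C."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def hamming(a, b):
    return sum(x != y for x, y in zip(a, b))


def design_sequences(n_seqs, length, tm_target, tm_tol, min_dist,
                     seed=0, max_tries=100_000):
    """Randomly generate candidates and keep those passing every test."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(max_tries):
        if len(accepted) == n_seqs:
            break
        cand = "".join(rng.choice("ACGT") for _ in range(length))
        # Test 1: uniform melting temperature (stand-in model).
        if abs(wallace_tm(cand) - tm_target) > tm_tol:
            continue
        rc = reverse_complement(cand)
        # Test 2: the candidate must not resemble its own complement.
        if hamming(cand, rc) < min_dist:
            continue
        # Test 3: it must differ from accepted sequences and their complements.
        if any(hamming(cand, s) < min_dist or hamming(rc, s) < min_dist
               for s in accepted):
            continue
        accepted.append(cand)
    return accepted


if __name__ == "__main__":
    for seq in design_sequences(5, 20, tm_target=60, tm_tol=4, min_dist=6):
        print(seq, wallace_tm(seq))
```

Ordering the tests from cheapest to most expensive mirrors the idea in the abstract: a fast filter discards most unsuitable candidates before a costly free-energy calculation would be run.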
Interspecific somatic hybrids between Cyclamen persicum and C. coum, two sexually incompatible species.

PubMed

Prange, Anika Nadja Sabine; Bartsch, Melanie; Meiners, Julia; Serek, Margrethe; Winkelmann, Traud

2012-04-01

By applying polyethylene glycol (PEG)-mediated protoplast fusion, the first somatic hybrids were obtained between Cyclamen persicum (2n = 2x = 48) and C. coum (2n = 2x = 30), two species that cannot be combined by cross breeding. Heterofusion was detected by double fluorescent staining with fluorescein diacetate and scopoletin. The highest heterofusion frequencies (of about 5%) resulted from a protocol using a protoplast density of 1 × 10^6/mL and 40% PEG. The DNA content of C. coum was estimated for the first time by propidium iodide staining to be 14.7 pg/2C and was 4.6 times higher than that of C. persicum. Among 200 in vitro plantlets regenerated from fusion experiments, most resembled the C. coum parent, whereas only 5 plants showed typical C. persicum phenotypes and 46 had a deviating morphology. By flow cytometry, six putative somatic hybrids were identified. A species-specific DNA marker was developed based on the sequence of the 5.8S gene in the ribosomal nuclear DNA and its flanking internal transcribed spacers ITS1 and ITS2. The hybrid status of only one plant could be verified by the species-specific DNA marker as well as sequencing of the amplification product. RAPD markers turned out to be less informative and applicable for hybrid identification, as no clear additivity of the parental marker bands was observed. Chromosome counting in root tips of four hybrids revealed the presence of the 30 C. coum chromosomes and 2-41 additional ones indicating elimination of C. persicum chromosomes. © Springer-Verlag 2011
Applications of Multiple Nuclear Genes to the Molecular Phylogeny, Population Genetics and Hybrid Identification in the Mangrove Genus Rhizophora.

PubMed

Chen, Yongmei; Hou, Yansong; Guo, Zixiao; Wang, Wenqing; Zhong, Cairong; Zhou, Renchao; Shi, Suhua

2015-01-01

The genus Rhizophora is one of the most important components of mangrove forests. It is an ideal system for studying biogeography, molecular evolution, population genetics, hybridization and conservation genetics of mangroves. However, there are no sufficient molecular markers to address these topics. Here, we developed 77 pairs of nuclear gene primers, which showed successful PCR amplifications across all five Rhizophora species and sequencing in R. apiculata. Here, we present three tentative applications using a subset of the developed nuclear genes to (I) reconstruct the phylogeny, (II) examine the genetic structure and (III) identify natural hybridization in Rhizophora. Phylogenetic analyses support the hypothesis that Rhizophora had disappeared in the Atlantic-East Pacific (AEP) region and was re-colonized from the IWP region approximately 12.7 Mya. Population genetics analyses in four natural populations of R. apiculata in Hainan, China, revealed extremely low genetic diversity, strong population differentiation and extensive admixture, suggesting that the Pleistocene glaciations, particularly the last glacial maximum, greatly influenced the population dynamics of R. apiculata in Hainan. We also verified the hybrid status of a morphologically intermediate individual between R. apiculata and R. stylosa in Hainan. Based on the sequences of five nuclear genes and one chloroplast intergenic spacer, this individual is likely to be an F1 hybrid, with R. stylosa as its maternal parent. The nuclear gene markers developed in this study should be of great value for characterizing the hybridization and introgression patterns in other cases of this genus and testing the role of natural selection using population genomics approaches.
Diploid hybrid origin of Ostryopsis intermedia (Betulaceae) in the Qinghai-Tibet Plateau triggered by Quaternary climate change.

PubMed

Liu, Bingbing; Abbott, Richard J; Lu, Zhiqiang; Tian, Bin; Liu, Jianquan

2014-06-01

Despite the well-known effects that Quaternary climate oscillations had on shaping intraspecific diversity, their role in driving homoploid hybrid speciation is less clear. Here, we examine their importance in the putative homoploid hybrid origin and evolution of Ostryopsis intermedia, a diploid species occurring in the Qinghai-Tibet Plateau (QTP), a biodiversity hotspot. We investigated interspecific relationships between this species and its only other congeners, O. davidiana and O. nobilis, based on four sets of nuclear and chloroplast population genetic data and tested alternative speciation hypotheses. All nuclear data distinguished the three species clearly and supported a close relationship between O. intermedia and the disjunctly distributed O. davidiana. Chloroplast DNA sequence variation identified two tentative lineages, which distinguished O. intermedia from O. davidiana; however, both were present in O. nobilis. Admixture analyses of genetic polymorphisms at 20 SSR loci and sequence variation at 11 nuclear loci and approximate Bayesian computation (ABC) tests supported the hypothesis that O. intermedia originated by homoploid hybrid speciation from O. davidiana and O. nobilis. We further estimated that O. davidiana and O. nobilis diverged 6-11 Ma, while O. intermedia originated 0.5-1.2 Ma when O. davidiana is believed to have migrated southward, contacted and hybridized with O. nobilis possibly during the largest Quaternary glaciation that occurred in this region. Our findings highlight the importance of Quaternary climate change in the QTP in causing hybrid speciation in this important biodiversity hotspot. © 2014 John Wiley & Sons Ltd.
Phytophthora ×stagnum nothosp. nov., a New Hybrid from Irrigation Reservoirs at Ornamental Plant Nurseries in Virginia

PubMed Central

Yang, Xiao; Richardson, Patricia A.; Hong, Chuanxue

2014-01-01

A novel Phytophthora species was frequently recovered from irrigation reservoirs at several ornamental plant production facilities in eastern Virginia. Initial sequencing of the internal transcribed spacer (ITS) region of this species generated unreadable sequences due to continual polymorphic positions. Cloning and sequencing the ITS region as well as sequencing the mitochondrially encoded cytochrome c oxidase 1 and beta-tubulin genes revealed that it is a hybrid between P. taxon PgChlamydo as its paternal parent and an unknown species genetically close to P. mississippiae as its maternal parent. This hybrid has some diagnostic morphological features of P. taxon PgChlamydo and P. mississippiae. It produces catenulate hyphal swellings, characteristic of P. mississippiae, and chlamydospores, typical of P. taxon PgChlamydo. It also produces both ornamented and relatively smooth-walled oogonia. Ornamented oogonia are another important diagnostic character of P. mississippiae. The relatively smooth-walled oogonia may be indicative of oogonial character of P. taxon PgChlamydo. The new hybrid is described here as Phytophthora ×stagnum. PMID:25072374
Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications.

PubMed

Cui, Chenghua; Shu, Wei; Li, Peining

2016-01-01

Fluorescence in situ hybridization (FISH) is a macromolecule recognition technology based on the complementary nature of DNA or DNA/RNA double strands. Selected DNA strands incorporated with fluorophore-coupled nucleotides can be used as probes to hybridize onto the complementary sequences in tested cells and tissues and then visualized through a fluorescence microscope or an imaging system. This technology was initially developed as a physical mapping tool to delineate genes within chromosomes. Its high analytical resolution to a single gene level and high sensitivity and specificity enabled an immediate application for genetic diagnosis of constitutional common aneuploidies, microdeletion/microduplication syndromes, and subtelomeric rearrangements. FISH tests using panels of gene-specific probes for somatic recurrent losses, gains, and translocations have been routinely applied for hematologic and solid tumors and are one of the fastest-growing areas in cancer diagnosis. FISH has also been used to detect infectious microbes and parasites like malaria in human blood cells. Recent advances in FISH technology involve various methods for improving probe labeling efficiency and the use of super-resolution imaging systems for direct visualization of intra-nuclear chromosomal organization and profiling of RNA transcription in single cells. Cas9-mediated FISH (CASFISH) allowed in situ labeling of repetitive sequences and single-copy sequences without the disruption of nuclear genomic organization in fixed or living cells. Oligopaint-FISH combined with super-resolution imaging enabled in situ visualization of chromosome haplotypes from differentially specified single-nucleotide polymorphism loci. Single molecule RNA FISH (smRNA-FISH) using combinatorial labeling or sequential barcoding by multiple rounds of hybridization has been applied to measure mRNA expression of multiple genes within single cells. Research applications of these single-molecule, single-cell DNA and RNA FISH techniques have visualized intra-nuclear genomic structure and sub-cellular transcriptional dynamics of many genes and revealed their functions in various biological processes.
Isolation and characterization of 5S rDNA sequences in catfishes genome (Heptapteridae and Pseudopimelodidae): perspectives for rDNA studies in fish by C0t method.

PubMed

Gouveia, Juceli Gonzalez; Wolf, Ivan Rodrigo; de Moraes-Manécolo, Vivian Patrícia Oliveira; Bardella, Vanessa Belline; Ferracin, Lara Munique; Giuliano-Caetano, Lucia; da Rosa, Renata; Dias, Ana Lúcia

2016-12-01

Sequences of 5S ribosomal RNA (rRNA) are extensively used in fish cytogenomic studies, since they have a flexible organization at the chromosomal level, showing inter- and intra-specific variation in number and position in karyotypes. Sequences from the genome of Imparfinis schubarti (Heptapteridae) were isolated, aiming to understand the organization of 5S rDNA families in the fish genome. The isolation of 5S rDNA from the genome of I. schubarti was carried out by reassociation kinetics (C0t) and PCR amplification. The obtained sequences were cloned for the construction of a micro-library. The obtained clones were sequenced and hybridized in I. schubarti and Microglanis cottoides (Pseudopimelodidae) for chromosome mapping. The sequences were also aligned and compared with those of other fish groups. Both methods were effective when using 5S rDNA for hybridization in the I. schubarti genome. However, the C0t method yielded a complete 5S rRNA gene, which also hybridized successfully in M. cottoides, whereas PCR recovered this gene only partially. The hybridization results and sequence analyses showed that intact 5S regions are more appropriate for use as probes, due to their conserved structure and motifs. This study contributes to a better understanding of the organization of multigene families in catfish genomes.
Construction of a small Mus musculus repetitive DNA library: identification of a new satellite sequence in Mus musculus.

PubMed Central

Pietras, D F; Bennett, K L; Siracusa, L D; Woodworth-Gutai, M; Chapman, V M; Gross, K W; Kane-Haas, C; Hastie, N D

1983-01-01

We report the construction of a small library of recombinant plasmids containing Mus musculus repetitive DNA inserts. The repetitive cloned fraction was derived from denatured genomic DNA by reassociation to a Cot value at which repetitive, but not unique, sequences have reannealed followed by exhaustive S1 nuclease treatment to degrade single stranded DNA. Initial characterizations of this library by colony filter hybridizations have led to the identification of a previously undetected M. musculus minor satellite as well as to clones containing M. musculus major satellite sequences. This new satellite is repeated 10-20 times less than the major satellite in the M. musculus genome. It has a repeat length of 130 nucleotides compared with the M. musculus major satellite with a repeat length of 234 nucleotides. Sequence analysis of the minor satellite has shown that it has a 29 base pair region with extensive homology to one of the major satellite repeating subunits. We also show by in situ hybridization that this minor satellite sequence is located at the centromeres and possibly the arms of at least half the M. musculus chromosomes. Sequences related to the minor satellite have been found in the DNA of a related Mus species, Mus spretus, and may represent the major satellite of that species. PMID:6314268

Strong transcription blockage mediated by R-loop formation within a G-rich homopurine–homopyrimidine sequence localized in the vicinity of the promoter

PubMed Central

Soo Shin, Jane Hae

2017-01-01

Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974
Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter.

PubMed

Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C

2017-06-20

Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Comparison of the Predictive Accuracy of DNA Array-Based Multigene Classifiers across cDNA Arrays and Affymetrix GeneChips

PubMed Central

Stec, James; Wang, Jing; Coombes, Kevin; Ayers, Mark; Hoersch, Sebastian; Gold, David L.; Ross, Jeffrey S; Hess, Kenneth R.; Tirrell, Stephen; Linette, Gerald; Hortobagyi, Gabriel N.; Symmans, W. Fraser; Pusztai, Lajos

2005-01-01

We examined how well differentially expressed genes and multigene outcome classifiers retain their class-discriminating values when tested on data generated by different transcriptional profiling platforms. RNA from 33 stage I-III breast cancers was hybridized to both Affymetrix GeneChip and Millennium Pharmaceuticals cDNA arrays. Only 30% of all corresponding gene expression measurements on the two platforms had Pearson correlation coefficient r ≥ 0.7 when UniGene was used to match probes. There was substantial variation in correlation between different Affymetrix probe sets matched to the same cDNA probe. When cDNA and Affymetrix probes were matched by Basic Local Alignment Search Tool (BLAST) sequence identity, the correlation increased substantially. We identified 182 genes in the Affymetrix and 45 in the cDNA data (including 17 common genes) that accurately separated 91% of cases in supervised hierarchical clustering in each data set. Cross-platform testing of these informative genes resulted in lower clustering accuracy of 45 and 79%, respectively. Several sets of accurate five-gene classifiers were developed on each platform using linear discriminant analysis. The best 100 classifiers showed an average misclassification error rate of 2% on the original data that rose to 19.5% when tested on data from the other platform. Random five-gene classifiers showed a misclassification error rate of 33%. We conclude that multigene predictors optimized for one platform lose accuracy when applied to data from another platform due to missing genes and sequence differences in probes that result in differing measurements for the same gene. PMID:16049308
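The cross-platform comparison described above rests on a per-gene Pearson correlation and a cutoff of r ≥ 0.7. A minimal sketch of that calculation follows; the function names and the toy expression values are hypothetical and made up for illustration, and the study's actual probe matching and normalization are not reproduced here.

```python
import math

def pearson_r(x, y):
    """Pearson correlation of two equal-length lists (assumes non-zero variance)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def fraction_concordant(platform_a, platform_b, threshold=0.7):
    """Fraction of genes present on both platforms whose r meets the threshold."""
    genes = platform_a.keys() & platform_b.keys()
    hits = sum(1 for g in genes
               if pearson_r(platform_a[g], platform_b[g]) >= threshold)
    return hits / len(genes)

# Toy values only (not data from the study): one gene agrees, one disagrees.
array_a = {"gene1": [1.0, 2.0, 3.0, 4.0], "gene2": [1.0, 2.0, 3.0, 4.0]}
array_b = {"gene1": [2.0, 4.0, 6.0, 8.0], "gene2": [4.0, 3.0, 2.0, 1.0]}
concordant = fraction_concordant(array_a, array_b)
```

Each dictionary maps a gene to its measurements across the same ordered set of samples, so the correlation is taken across samples for one gene at a time.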
Interspecific Plastome Recombination Reflects Ancient Reticulate Evolution in Picea (Pinaceae).

PubMed

Sullivan, Alexis R; Schiffthaler, Bastian; Thompson, Stacey Lee; Street, Nathaniel R; Wang, Xiao-Ru

2017-07-01

Plastid sequences are a cornerstone in plant systematic studies and key aspects of their evolution, such as uniparental inheritance and absent recombination, are often treated as axioms. While exceptions to these assumptions can profoundly influence evolutionary inference, detecting them can require extensive sampling, abundant sequence data, and detailed testing. Using advancements in high-throughput sequencing, we analyzed the whole plastomes of 65 accessions of Picea, a genus of ∼35 coniferous forest tree species, to test for deviations from canonical plastome evolution. Using complementary hypothesis and data-driven tests, we found evidence for chimeric plastomes generated by interspecific hybridization and recombination in the clade comprising Norway spruce (P. abies) and 10 other species. Support for interspecific recombination remained after controlling for sequence saturation, positive selection, and potential alignment artifacts. These results reconcile previous conflicting plastid-based phylogenies and strengthen the mounting evidence of reticulate evolution in Picea. Given the relatively high frequency of hybridization and biparental plastid inheritance in plants, we suggest interspecific plastome recombination may be more widespread than currently appreciated and could underlie reported cases of discordant plastid phylogenies. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes, coded for cholinesterase.
Application of Islanding Detection and Classification of Power Quality Disturbance in Hybrid Energy System

NASA Astrophysics Data System (ADS)

Sun, L. B.; Wu, Z. S.; Yang, K. K.

2018-04-01

Islanding and power quality (PQ) disturbances in hybrid energy systems become more serious with the application of renewable energy sources. In this paper, a novel method based on wavelet transform (WT) and a modified feed forward neural network (FNN) is proposed to detect islanding and classify PQ problems. First, the performance indices, i.e., the energy content and standard deviation (SD) of the transformed signal, are extracted from the negative sequence component of the voltage signal at the point of common coupling (PCC) using WT. Afterward, the WT indices are fed to train FNNs modified by Particle Swarm Optimization (PSO), which is a novel heuristic optimization method. Then, the results of simulation based on WT-PSOFNN are discussed in MATLAB/SIMULINK. Simulations on the hybrid power system show that the proposed method can significantly improve the accuracy of detecting and classifying different disturbances connected to multiple distributed generations.
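The two wavelet indices named above, energy content and standard deviation of the transformed signal, can be sketched as follows. The abstract does not state which wavelet family or decomposition level was used, so the single-level Haar transform here is an assumption chosen for brevity, and the function names are hypothetical.

```python
import math
import statistics

def haar_dwt(signal):
    """One level of the Haar discrete wavelet transform (even-length input).

    Assumption: Haar is used only because it is the simplest wavelet;
    the source does not say which family the authors chose.
    """
    scale = 1 / math.sqrt(2)
    pairs = [(signal[i], signal[i + 1]) for i in range(0, len(signal) - 1, 2)]
    approx = [(a + b) * scale for a, b in pairs]
    detail = [(a - b) * scale for a, b in pairs]
    return approx, detail

def wavelet_features(signal):
    """Energy and population standard deviation of the detail coefficients."""
    _, detail = haar_dwt(signal)
    energy = sum(d * d for d in detail)
    sd = statistics.pstdev(detail)
    return energy, sd

# A flat signal has no detail content; a step inside a sample pair does.
flat_energy, flat_sd = wavelet_features([1.0, 1.0, 1.0, 1.0])
step_energy, step_sd = wavelet_features([1.0, -1.0, 0.0, 0.0])
```

In a detector of the kind described, feature pairs like these would be computed on windows of the negative sequence voltage and passed to the classifier as inputs.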
Flow cytometric detection method for DNA samples

DOEpatents

Nasarabadi, Shanavaz [Livermore, CA; Langlois, Richard G [Livermore, CA; Venkateswaran, Kodumudi S [Round Rock, TX

2011-07-05

Disclosed herein are two methods for rapid multiplex analysis to determine the presence and identity of target DNA sequences within a DNA sample. Both methods use reporting DNA sequences, e.g., modified conventional TaqMan® probes, to combine multiplex PCR amplification with microsphere-based hybridization using flow cytometry means of detection. Real-time PCR detection can also be incorporated. The first method uses a cyanine dye, such as Cy3™, as the reporter linked to the 5' end of a reporting DNA sequence. The second method positions a reporter dye, e.g., FAM™, on the 3' end of the reporting DNA sequence and a quencher dye, e.g., TAMRA™, on the 5' end.
Flow cytometric detection method for DNA samples

DOEpatents

Nasarabadi, Shanavaz [Livermore, CA; Langlois, Richard G [Livermore, CA; Venkateswaran, Kodumudi S [Livermore, CA

2006-08-01

Disclosed herein are two methods for rapid multiplex analysis to determine the presence and identity of target DNA sequences within a DNA sample. Both methods use reporting DNA sequences, e.g., modified conventional TaqMan® probes, to combine multiplex PCR amplification with microsphere-based hybridization using flow cytometry means of detection. Real-time PCR detection can also be incorporated. The first method uses a cyanine dye, such as Cy3™, as the reporter linked to the 5' end of a reporting DNA sequence. The second method positions a reporter dye, e.g., FAM, on the 3' end of the reporting DNA sequence and a quencher dye, e.g., TAMRA, on the 5' end.
454 next generation-sequencing outperforms allele-specific PCR, Sanger sequencing, and pyrosequencing for routine KRAS mutation analysis of formalin-fixed, paraffin-embedded samples

PubMed Central

Altimari, Annalisa; de Biase, Dario; De Maglio, Giovanna; Gruppioni, Elisa; Capizzi, Elisa; Degiovanni, Alessio; D’Errico, Antonia; Pession, Annalisa; Pizzolitto, Stefano; Fiorentino, Michelangelo; Tallini, Giovanni

2013-01-01

Detection of KRAS mutations in archival pathology samples is critical for therapeutic appropriateness of anti-EGFR monoclonal antibodies in colorectal cancer. We compared the sensitivity, specificity, and accuracy of Sanger sequencing, ARMS-Scorpion (TheraScreen®) real-time polymerase chain reaction (PCR), pyrosequencing, chip array hybridization, and 454 next-generation sequencing to assess KRAS codon 12 and 13 mutations in 60 nonconsecutive selected cases of colorectal cancer. Twenty of the 60 cases were detected as wild-type KRAS by all methods with 100% specificity. Among the 40 mutated cases, 13 were discrepant with at least one method. The sensitivity was 85%, 90%, 93%, and 92%, and the accuracy was 90%, 93%, 95%, and 95% for Sanger sequencing, TheraScreen real-time PCR, pyrosequencing, and chip array hybridization, respectively. The main limitation of Sanger sequencing was its low analytical sensitivity, whereas TheraScreen real-time PCR, pyrosequencing, and chip array hybridization showed higher sensitivity but suffered from the limitations of predesigned assays. Concordance between the methods was k = 0.79 for Sanger sequencing and k > 0.85 for the other techniques. Tumor cell enrichment correlated significantly with the abundance of KRAS-mutated deoxyribonucleic acid (DNA), evaluated as ΔCt for TheraScreen real-time PCR (P = 0.03), percentage of mutation for pyrosequencing (P = 0.001), ratio for chip array hybridization (P = 0.003), and percentage of mutation for 454 next-generation sequencing (P = 0.004). Also, 454 next-generation sequencing showed the best cross correlation for quantification of mutation abundance compared with all the other methods (P < 0.001). Our comparison showed the superiority of next-generation sequencing over the other techniques in terms of sensitivity and specificity. Next-generation sequencing will replace Sanger sequencing as the reference technique for diagnostic detection of KRAS mutation in archival tumor tissues. PMID:23950653
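As a worked illustration of how the sensitivity, specificity, and accuracy figures in this abstract relate to one another, the sketch below recomputes them from a 2x2 confusion table. The counts used for Sanger sequencing (34 of 40 mutated cases detected, 20 of 20 wild-type cases correctly called) are back-calculated from the reported percentages and are an assumption, not counts quoted from the paper; the class and function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Confusion:
    tp: int  # mutated cases called mutated
    fn: int  # mutated cases missed
    tn: int  # wild-type cases called wild-type
    fp: int  # wild-type cases called mutated

def diagnostic_metrics(c):
    """Return (sensitivity, specificity, accuracy) for a confusion table."""
    sensitivity = c.tp / (c.tp + c.fn)
    specificity = c.tn / (c.tn + c.fp)
    accuracy = (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)
    return sensitivity, specificity, accuracy

# Assumed counts, inferred from 85% sensitivity on 40 mutated cases
# and 100% specificity on 20 wild-type cases.
sanger = Confusion(tp=34, fn=6, tn=20, fp=0)
sens, spec, acc = diagnostic_metrics(sanger)
```

With these assumed counts the accuracy works out to 54 of 60 cases, which is consistent with the 90% reported for Sanger sequencing.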
A multilevel ant colony optimization algorithm for classical and isothermic DNA sequencing by hybridization with multiplicity information available.

PubMed

Kwarciak, Kamil; Radom, Marcin; Formanowicz, Piotr

2016-04-01

Classical sequencing by hybridization takes into account only binary information about sequence composition: a given element from an oligonucleotide library either is or is not a part of the target sequence. However, DNA chip technology has advanced to the point where it can provide partial information about the multiplicity of each oligonucleotide in the analyzed sequence. Currently, it is not possible to obtain exact data of this type, but even partial information should be very useful. Two realistic multiplicity information models are considered in this paper. The first one, called "one and many", assumes that it is possible to determine whether a given oligonucleotide occurs in a reconstructed sequence once or more than once. According to the second model, called "one, two and many", a biochemical experiment can reveal whether a given oligonucleotide is present in an analyzed sequence once, twice or at least three times. An ant colony optimization algorithm has been implemented to verify the above models and to compare with existing algorithms for sequencing by hybridization which utilize the additional information. The proposed algorithm solves the problem with any kind of hybridization errors. Computational experiment results confirm that using even partial information about multiplicity leads to increased quality of reconstructed sequences. Moreover, they also show that the more precise model yields better solutions and that the ant colony optimization algorithm outperforms the existing ones. Test data sets and the proposed ant colony optimization algorithm are available at: http://bioserver.cs.put.poznan.pl/download/ACO4mSBH.zip. Copyright © 2016 Elsevier Ltd. All rights reserved.
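The two multiplicity models described above can be illustrated by building an error-free spectrum from a known sequence and capping the counts. This is only a sketch of the input data the models imply, not the ant colony algorithm itself; the function name, the model labels, and the convention of storing "many" as the cap value are assumptions made for illustration.

```python
from collections import Counter

def spectrum_with_multiplicity(sequence, k, model="one_and_many"):
    """Map each k-mer of `sequence` to a capped occurrence count.

    "one_and_many":     1 = once, 2 = more than once
    "one_two_and_many": 1 = once, 2 = twice, 3 = at least three times
    """
    caps = {"one_and_many": 2, "one_two_and_many": 3}
    cap = caps[model]
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    return {oligo: min(n, cap) for oligo, n in counts.items()}

target = "ACGACGACGT"  # toy sequence in which ACG occurs three times
coarse = spectrum_with_multiplicity(target, 3, "one_and_many")
fine = spectrum_with_multiplicity(target, 3, "one_two_and_many")
```

Under the coarser model ACG, CGA and GAC are indistinguishable (all "many"), while the finer model separates ACG (at least three) from CGA and GAC (twice), which is the extra information the abstract reports as improving reconstruction quality.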
Nucleic acid detection methods

DOEpatents

Smith, C.L.; Yaar, R.; Szafranski, P.; Cantor, C.R.

1998-05-19

The invention relates to methods for rapidly determining the sequence and/or length of a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3'-hydroxyl groups are extended with a nucleic acid polymerase. Nuclease-cleaved heteroduplexes can be easily distinguished from nuclease-uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3'-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system, which also adds or releases one label into solution. The resulting products can be analyzed in either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces, allowing for multiple reactions to be conducted in parallel and, if desired, automated. 18 figs.
SPHINX--an algorithm for taxonomic binning of metagenomic sequences.

PubMed

Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Singh, Nitin Kumar; Mande, Sharmila S

2011-01-01

Compared with composition-based binning algorithms, the binning accuracy and specificity of alignment-based binning algorithms are significantly higher. However, being alignment-based, the latter class of algorithms requires an enormous amount of time and computing resources for binning huge metagenomic datasets. The motivation was to develop a binning approach that can analyze metagenomic datasets as rapidly as composition-based approaches, but nevertheless has the accuracy and specificity of alignment-based algorithms. This article describes a hybrid binning approach (SPHINX) that achieves high binning efficiency by utilizing the principles of both 'composition'- and 'alignment'-based binning algorithms. Validation results with simulated sequence datasets indicate that SPHINX is able to analyze metagenomic sequences as rapidly as composition-based algorithms. Furthermore, the binning efficiency (in terms of accuracy and specificity of assignments) of SPHINX is observed to be comparable with results obtained using alignment-based algorithms. A web server for the SPHINX algorithm is available at http://metagenomics.atc.tcs.com/SPHINX/.
From protein sequence to dynamics and disorder with DynaMine.

PubMed

Cilia, Elisa; Pancsa, Rita; Tompa, Peter; Lenaerts, Tom; Vranken, Wim F

2013-01-01

Protein function and dynamics are closely related; however, accurate dynamics information is difficult to obtain. Here based on a carefully assembled data set derived from experimental data for proteins in solution, we quantify backbone dynamics properties on the amino-acid level and develop DynaMine--a fast, high-quality predictor of protein backbone dynamics. DynaMine uses only protein sequence information as input and shows great potential in distinguishing regions of different structural organization, such as folded domains, disordered linkers, molten globules and pre-structured binding motifs of different sizes. It also identifies disordered regions within proteins with an accuracy comparable to the most sophisticated existing predictors, without depending on prior disorder knowledge or three-dimensional structural information. DynaMine provides molecular biologists with an important new method that grasps the dynamical characteristics of any protein of interest, as we show here for human p53 and E1A from human adenovirus 5.
Model predictive control of an air suspension system with damping multi-mode switching damper based on hybrid model

NASA Astrophysics Data System (ADS)

Sun, Xiaoqiang; Yuan, Chaochun; Cai, Yingfeng; Wang, Shaohua; Chen, Long

2017-09-01

This paper presents the hybrid modeling and the model predictive control of an air suspension system with a multi-mode switching damper. Unlike a traditional damper with continuously adjustable damping, the new damper applied in this study to a vehicle semi-active air suspension has four discrete damping modes. It achieves the different damping modes simply by controlling the on-off statuses of two solenoid valves, which makes its damping adjustment more efficient and more reliable. However, since the damping mode switching induces different modes of operation, the air suspension system with the new damper poses a challenging hybrid control problem. To model both the continuous/discrete dynamics and the switching between different damping modes, the framework of mixed logical dynamical (MLD) systems is used to establish the system hybrid model. Based on the resulting hybrid dynamical model, the system control problem is recast as a model predictive control (MPC) problem, which allows us to optimize the switching sequences of the damping modes by taking into account the suspension performance requirements. Numerical simulation results demonstrate the efficacy of the proposed control method.
Genomic characterization reconfirms the taxonomic status of Lactobacillus parakefiri

PubMed Central

TANIZAWA, Yasuhiro; KOBAYASHI, Hisami; KAMINUMA, Eli; SAKAMOTO, Mitsuo; OHKUMA, Moriya; NAKAMURA, Yasukazu; ARITA, Masanori; TOHNO, Masanori

2017-01-01

Whole-genome sequencing was performed for Lactobacillus parakefiri JCM 8573T to confirm its hitherto controversial taxonomic position. Here, we report its first reliable reference genome. Genome-wide metrics, such as average nucleotide identity and digital DNA-DNA hybridization, and phylogenomic analysis based on multiple genes supported its taxonomic status as a distinct species in the genus Lactobacillus. The availability of a reliable genome sequence will aid future investigations on the industrial applications of L. parakefiri in functional foods such as kefir grains. PMID:28748134
Graphene/MoS(2) heterostructures for ultrasensitive detection of DNA hybridisation.

PubMed

Loan, Phan Thi Kim; Zhang, Wenjing; Lin, Cheng-Te; Wei, Kung-Hwa; Li, Lain-Jong; Chen, Chang-Hsiao

2014-07-23

The photoluminescence signals of a graphene/MoS2 heterostructural stacking film are sensitive to environmental charges, which allows the single-base sequence-selective detection of DNA hybridization with sensitivity to the level of aM. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Reference karyotype and cytomolecular map for loblolly pine (Pinus taeda L.)

Treesearch

M. Nurul Islam-faridi; C. Dana Nelson; Thomas L. Kubisiak

2007-01-01

A reference karyotype is presented for loblolly pine (Pinus taeda L., subgenus Pinus, section Pinus, subsection Australes), based on fluorescent in situ hybridization (FISH), using 18S-28S rDNA, 5S rDNA, and Arabidopsis-type telomere repeat sequence (A-type TRS). Well...
Asteroseismology of hybrid δ Scuti-γ Doradus pulsating stars

NASA Astrophysics Data System (ADS)

Sánchez Arias, J. P.; Córsico, A. H.; Althaus, L. G.

2017-01-01

Context. Hybrid δ Scuti-γ Doradus pulsating stars show acoustic (p) oscillation modes typical of δ Scuti variable stars, and gravity (g) pulsation modes characteristic of γ Doradus variable stars simultaneously excited. Observations from space missions such as MOST, CoRoT, and Kepler have revealed a large number of hybrid δ Scuti-γ Doradus pulsators, thus paving the way for an exciting new channel of asteroseismic studies. Aims: We perform detailed asteroseismological modelling of five hybrid δ Scuti-γ Doradus stars. Methods: A grid-based modeling approach was employed to sound the internal structure of the target stars using stellar models ranging from the zero-age main sequence to the terminal-age main sequence, varying parameters such as stellar mass, effective temperature, metallicity and core overshooting. Their adiabatic radial (ℓ = 0) and non-radial (ℓ = 1,2,3) p and g mode periods were computed. Two model-fitting procedures were used to search for asteroseismological models that best reproduce the observed pulsation spectra of each target star. Results: We derive the fundamental parameters and the evolutionary status of five hybrid δ Scuti-γ Doradus variable stars recently observed by the CoRoT and Kepler space missions: CoRoT 105733033, CoRoT 100866999, KIC 11145123, KIC 9244992, and HD 49434. The asteroseismological model for each star results from different criteria of model selection, in which we take full advantage of the richness of periods that characterises the pulsation spectra for this kind of star.
FMLRC: Hybrid long read error correction using an FM-index.

PubMed

Wang, Jeremy R; Holt, James; McMillan, Leonard; Jones, Corbin D

2018-02-09

Long read sequencing is changing the landscape of genomic research, especially de novo assembly. Despite the high error rate inherent to long read technologies, increased read lengths dramatically improve the continuity and accuracy of genome assemblies. However, the cost and throughput of these technologies limit their application to complex genomes. One solution is to decrease the cost and time to assemble novel genomes by leveraging "hybrid" assemblies that use long reads for scaffolding and short reads for accuracy. We describe a novel method leveraging a multi-string Burrows-Wheeler Transform with auxiliary FM-index to correct errors in long read sequences using a set of complementary short reads. We demonstrate that our method efficiently produces significantly more high quality corrected sequence than existing hybrid error-correction methods. We also show that our method produces more contiguous assemblies, in many cases, than existing state-of-the-art hybrid and long-read only de novo assembly methods. Our method accurately corrects long read sequence data using complementary short reads. We demonstrate higher total throughput of corrected long reads and a corresponding increase in contiguity of the resulting de novo assemblies. Its improved throughput and computational efficiency relative to existing methods will help make more economical use of emerging long read sequencing technologies.
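The abstract relies on a Burrows-Wheeler Transform with an FM-index to look up how often a substring occurs in the short-read data. The sketch below shows the basic query such an index supports, counting occurrences by backward search. It is a deliberately naive single-string version: the suffix array is built by sorting, the occurrence table is uncompressed, and the function names are hypothetical. It makes no claim about how FMLRC builds or queries its multi-string index.

```python
def build_fm_index(text):
    """Build a toy FM-index: first-column offsets and an occurrence table."""
    text += "$"  # sentinel, sorts before A, C, G, T
    suffix_array = sorted(range(len(text)), key=lambda i: text[i:])
    bwt = "".join(text[i - 1] for i in suffix_array)
    alphabet = sorted(set(text))
    # first_col[c]: number of characters in text that sort before c
    first_col, total = {}, 0
    for c in alphabet:
        first_col[c] = total
        total += text.count(c)
    # occ[c][i]: number of occurrences of c in bwt[:i]
    occ = {c: [0] * (len(bwt) + 1) for c in alphabet}
    for i, ch in enumerate(bwt):
        for c in alphabet:
            occ[c][i + 1] = occ[c][i] + (1 if ch == c else 0)
    return first_col, occ, len(bwt)

def count_occurrences(pattern, index):
    """Count occurrences of `pattern` by backward search over [lo, hi)."""
    first_col, occ, n = index
    lo, hi = 0, n
    for ch in reversed(pattern):
        if ch not in first_col:
            return 0
        lo = first_col[ch] + occ[ch][lo]
        hi = first_col[ch] + occ[ch][hi]
        if lo >= hi:
            return 0
    return hi - lo

index = build_fm_index("ACGTACGTTACG")
acg_count = count_occurrences("ACG", index)
```

Each character of the pattern narrows the suffix-array interval in constant time, so a k-mer lookup costs O(k) regardless of how large the indexed text is, which is what makes this kind of index attractive for repeated k-mer queries.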
Dissecting the hybridization of oligonucleotides to structured complementary sequences.

PubMed

Peracchi, Alessio

2016-06-01

When oligonucleotides hybridize to long target molecules, the process is slowed by the secondary structure in the targets. The phenomenon has been analyzed in several previous studies, but many details remain poorly understood. I used a spectrofluorometric strategy, focusing on the formation/breaking of individual base pairs, to study the kinetics of association between a DNA hairpin and >20 complementary oligonucleotides ('antisenses'). Hybridization rates differed by over three orders of magnitude. Association was toehold-mediated, both for antisenses binding to the target's ends and for those designed to interact with the loop. Binding of these latter, besides being consistently slower, was affected to variable, non-uniform extents by the asymmetric loop structure. Divalent metal ions accelerated hybridization, more pronouncedly when nucleation occurred at the loop. Incorporation of locked nucleic acid (LNA) residues in the antisenses substantially improved the kinetics only when LNAs participated in the earliest hybridization steps. The effects of individual LNAs placed along the antisense indicated that the reaction transition state occurred after invading at least the first base pair of the stem. The experimental approach helps dissect hybridization reactions involving structured nucleic acids. Toehold-dependent, nucleation-invasion models appear fully appropriate for describing such reactions. Estimating the stability of nucleation complexes formed at internal toeholds is the major hurdle for the quantitative prediction of hybridization rates. While analyzing the mechanisms of a fundamental biochemical process (hybridization), this work also provides suggestions for the improvement of technologies that rely on such a process. Copyright © 2016 Elsevier B.V. All rights reserved.

System for controlling a hybrid energy system

DOEpatents

Hoff, Brian D.; Akasam, Sivaprasad

2013-01-29

A method includes identifying a first operating sequence of a repeated operation of at least one non-traction load. The method also includes determining first and second parameters respectively indicative of a requested energy and output energy of the at least one non-traction load and comparing the determined first and second parameters at a plurality of time increments of the first operating sequence. The method also includes determining a third parameter of the hybrid energy system indicative of energy regenerated from the at least one non-traction load and monitoring the third parameter at the plurality of time increments of the first operating sequence. The method also includes determining at least one of an energy deficiency or an energy surplus associated with the non-traction load of the hybrid energy system and selectively adjusting energy stored within the storage device during at least a portion of a second operating sequence.
A high-resolution whole genome radiation hybrid map of human chromosome 17q22-q25.3 across the genes for GH and TK

DOE Office of Scientific and Technical Information (OSTI.GOV)

Foster, J.W.; Schafer, A.J.; Critcher, R.

1996-04-15

We have constructed a whole genome radiation hybrid (WG-RH) map across a region of human chromosome 17q, from growth hormone (GH) to thymidine kinase (TK). A panel of 128 WG-RH hybrid cell lines generated by X-irradiation and fusion has been tested for the retention of 39 sequence-tagged site (STS) markers by the polymerase chain reaction. This genome mapping technique has allowed the integration of existing VNTR and microsatellite markers with additional new markers and existing STS markers previously mapped to this region by other means. The WG-RH map includes eight expressed sequence tag (EST) and three anonymous markers developed for this study, together with 23 anonymous microsatellites and five existing ESTs. Analysis of these data resulted in a high-density comprehensive map across this region of the genome. A subset of these markers has been used to produce a framework map consisting of 20 loci ordered with odds greater than 1000:1. The markers are of sufficient density to build a YAC contig across this region based on marker content. We have developed sequence tags for both ends of a 2.1-Mb YAC and mapped these using the WG-RH panel, allowing a direct comparison of cRay6000 to physical distance. 31 refs., 3 figs., 2 tabs.
Methods for determining the genetic affinity of microorganisms and viruses

NASA Technical Reports Server (NTRS)

Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

2012-01-01

Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Target organisms that are represented in the original sequence database, and for which sufficient characteristic sequences exist, can be identified to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.
Fine Output Voltage Control Method considering Time-Delay of Digital Inverter System for X-ray Computed Tomography

NASA Astrophysics Data System (ADS)

Shibata, Junji; Kaneko, Kazuhide; Ohishi, Kiyoshi; Ando, Itaru; Ogawa, Mina; Takano, Hiroshi

This paper proposes a new output voltage control for an inverter system that has a time-delay and a nonlinear load. In next-generation medical X-ray computed tomography (X-ray CT) systems that use the contactless power transfer method, the feedback signal often contains a time-delay due to AD/DA conversion and error detection/correction time. When the PID controller of the inverter system is adversely affected by the time-delay, the response often exhibits overshoot and oscillation. In order to overcome this problem, this paper proposes a compensation method based on the Smith predictor for an inverter system having a time-delay and nonlinear loads, namely the diode bridge rectifier and the X-ray tube. The proposed compensation method consists of a hybrid Smith predictor system based on an equivalent analog circuit and a DSP. The experimental results confirm the validity of the proposed system.
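The effect of a Smith predictor on a delayed feedback loop can be illustrated with a small discrete-time simulation. This is a sketch under stated assumptions, not the authors' hybrid analog/DSP design: the first-order plant (a = 0.9, b = 0.1), the 10-step delay, the PI gains and the function name simulate are all hypothetical, and the internal model is assumed to match the plant exactly.

```python
def simulate(use_smith, steps=300, delay=10, a=0.9, b=0.1,
             kp=0.9, ki=0.1, setpoint=1.0):
    """Step response of y[k+1] = a*y[k] + b*u[k-delay] under PI control."""
    y = 0.0                      # measured plant output
    ym = 0.0                     # output of the delay-free internal model
    integral = 0.0               # running sum of the error
    u_hist = [0.0] * delay       # control inputs still "in transit"
    ym_hist = [0.0] * delay      # model outputs, used as the delayed model
    outputs = []
    for _ in range(steps):
        outputs.append(y)
        if use_smith:
            # Smith predictor: feed back the delay-free model output plus
            # the mismatch between the plant and the delayed model.
            feedback = y + ym - ym_hist[0]
        else:
            feedback = y
        error = setpoint - feedback
        integral += error
        u = kp * error + ki * integral
        # The plant only sees the input issued `delay` steps ago.
        u_delayed = u_hist.pop(0)
        u_hist.append(u)
        y = a * y + b * u_delayed
        # Advance the internal model (same dynamics, no delay).
        ym_hist.pop(0)
        ym_hist.append(ym)
        ym = a * ym + b * u
    return outputs


def overshoot(outputs, setpoint=1.0):
    """Largest excursion above the setpoint (0 if there is none)."""
    return max(0.0, max(outputs) - setpoint)


plain = simulate(use_smith=False)
smith = simulate(use_smith=True)
print(f"overshoot without predictor: {overshoot(plain):.2f}")
print(f"overshoot with predictor:    {overshoot(smith):.2f}")
```

With these assumed values the plain PI loop overshoots noticeably because the controller keeps integrating while the delayed output has not yet responded, whereas the predictor loop behaves like the delay-free loop followed by a pure delay. A mismatched model would reduce that benefit.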
Sensitive DNA detection and SNP discrimination using ultrabright SERS nanorattles and magnetic beads for malaria diagnostics.

PubMed

Ngo, Hoan T; Gandra, Naveen; Fales, Andrew M; Taylor, Steve M; Vo-Dinh, Tuan

2016-07-15

One of the major obstacles to implementing nucleic acid-based molecular diagnostics at the point-of-care (POC) and in resource-limited settings is the lack of sensitive and practical DNA detection methods that can be seamlessly integrated into portable platforms. Herein we present a sensitive yet simple DNA detection method using a surface-enhanced Raman scattering (SERS) nanoplatform: the ultrabright SERS nanorattle. The method, referred to as the nanorattle-based method, involves sandwich hybridization of magnetic beads that are loaded with capture probes, target sequences, and ultrabright SERS nanorattles that are loaded with reporter probes. Upon hybridization, a magnet was applied to concentrate the hybridization sandwiches at a detection spot for SERS measurements. The ultrabright SERS nanorattles, composed of a core and a shell with resonance Raman reporters loaded in the gap space between the core and the shell, serve as SERS tags for signal detection. Using this method, a specific DNA sequence of the malaria parasite Plasmodium falciparum could be detected with a detection limit of approximately 100 attomoles. Single nucleotide polymorphism (SNP) discrimination of wild type malaria DNA and mutant malaria DNA, which confers resistance to artemisinin drugs, was also demonstrated. These test models demonstrate the molecular diagnostic potential of the nanorattle-based method to both detect and genotype infectious pathogens. Furthermore, the method's simplicity makes it a suitable candidate for integration into portable platforms for POC applications and for use in resource-limited settings. Copyright © 2016. Published by Elsevier B.V.
Characterization of Prdm9 in equids and sterility in mules.

PubMed

Steiner, Cynthia C; Ryder, Oliver A

2013-01-01

Prdm9 (Meisetz) is the first speciation gene discovered in vertebrates conferring reproductive isolation. This locus encodes a meiosis-specific histone H3 methyltransferase that specifies meiotic recombination hotspots during gametogenesis. Allelic differences in Prdm9, characterized for a variable number of zinc finger (ZF) domains, have been associated with hybrid sterility in male house mice via spermatogenic failure at the pachytene stage. The mule, a classic example of hybrid sterility in mammals, also exhibits a similar spermatogenesis breakdown, making Prdm9 an interesting candidate to evaluate in equine hybrids. In this study, we characterized the Prdm9 gene in all species of equids by analyzing sequence variation of the ZF domains and estimating positive selection. We also evaluated the role of Prdm9 in hybrid sterility by assessing allelic differences of ZF domains in equine hybrids. We found remarkable variation in the sequence and number of ZF domains among equid species, ranging from five domains in the Tibetan kiang and Asiatic wild ass, to 14 in the Grevy's zebra. Positive selection was detected in all species at amino acid sites known to be associated with DNA-binding specificity of ZF domains in mice and humans. Equine hybrids, in particular a quartet pedigree composed of a fertile mule, showed a mosaic of sequences and number of ZF domains, suggesting that Prdm9 variation does not seem by itself to contribute to equine hybrid sterility.
Characterization of Prdm9 in Equids and Sterility in Mules

PubMed Central

Steiner, Cynthia C.; Ryder, Oliver A.

2013-01-01

Prdm9 (Meisetz) is the first speciation gene discovered in vertebrates conferring reproductive isolation. This locus encodes a meiosis-specific histone H3 methyltransferase that specifies meiotic recombination hotspots during gametogenesis. Allelic differences in Prdm9, characterized for a variable number of zinc finger (ZF) domains, have been associated with hybrid sterility in male house mice via spermatogenic failure at the pachytene stage. The mule, a classic example of hybrid sterility in mammals, also exhibits a similar spermatogenesis breakdown, making Prdm9 an interesting candidate to evaluate in equine hybrids. In this study, we characterized the Prdm9 gene in all species of equids by analyzing sequence variation of the ZF domains and estimating positive selection. We also evaluated the role of Prdm9 in hybrid sterility by assessing allelic differences of ZF domains in equine hybrids. We found remarkable variation in the sequence and number of ZF domains among equid species, ranging from five domains in the Tibetan kiang and Asiatic wild ass, to 14 in the Grevy’s zebra. Positive selection was detected in all species at amino acid sites known to be associated with DNA-binding specificity of ZF domains in mice and humans. Equine hybrids, in particular a quartet pedigree composed of a fertile mule, showed a mosaic of sequences and number of ZF domains, suggesting that Prdm9 variation does not seem by itself to contribute to equine hybrid sterility. PMID:23613924
Single-Molecule Counting of Point Mutations by Transient DNA Binding

NASA Astrophysics Data System (ADS)

Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan

2017-03-01

High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Isolation of genes from female sterile flowers in Medicago sativa.

PubMed

Capomaccio, Stefano; Barone, Pierluigi; Reale, Lara; Veronesi, Fabio; Rosellini, Daniele

2009-06-01

A better knowledge of female sporogenesis and gametogenesis could have several practical applications, from commercial hybrid seed production to gene containment in GM crops. With the purpose of isolating genes involved in the megasporogenesis process, the cDNA-AFLP technique was employed to isolate transcript-derived fragments (TDF) differentially expressed between female-fertile and female-sterile full-sib alfalfa plants. This female sterility trait involves female-specific arrest of sporogenesis at early prophase associated with ectopic, massive callose deposition within the nucellus. Ninety-six TDFs were generated and BLAST analyses revealed similarities with genes involved in different Gene Ontology categories. Three TDFs were selected based on their putative functions: showing high similarity to a soybean flower-expressed beta 1,3-glucanase, to an Arabidopsis thaliana MAPKKK, and to an A. thaliana eukaryotic initiation translation factor eIF4G III, respectively. The full length mRNA sequences were obtained. RT-PCR and in situ hybridizations were performed to confirm differential expression during flower development. The genomic organization of the three genes was assessed through sequencing and Southern experiments. Sequence polymorphisms were found between sterile and fertile plants. Our approach based on differential display and bulked segregant analysis was successful in isolating genes that were differentially expressed between fertile and sterile alfalfa plants.
DNA microdevice for electrochemical detection of Escherichia coli 0157:H7 molecular markers.

PubMed

Berganza, J; Olabarria, G; García, R; Verdoy, D; Rebollo, A; Arana, S

2007-04-15

An electrochemical DNA sensor based on the hybridization recognition of a single-stranded DNA (ssDNA) probe immobilized onto a gold electrode to its complementary ssDNA is presented. The DNA probe is bound to the gold electrode surface by using self-assembled monolayer (SAM) technology. An optimized mixed SAM with a blocking molecule preventing the nonspecific adsorption on the electrode surface has been prepared. In this paper, a DNA biosensor is designed by means of the immobilization of a single stranded DNA probe on an electrochemical transducer surface to recognize specifically the Escherichia coli (E. coli) O157:H7 complementary target DNA sequence via cyclic voltammetry experiments. The 21-mer DNA probe, including a C6 alkanethiol group at the 5' phosphate end, has been synthesized to form the SAM on the gold surface through the gold-sulfur bond. The goal of this paper has been to design, characterise and optimise an electrochemical DNA sensor. In order to investigate the oligonucleotide probe immobilization and the hybridization detection, experiments with different concentrations of DNA and mismatch sequences have been performed. This microdevice has demonstrated the suitability of oligonucleotide self-assembled monolayers (SAMs) on gold as an immobilization method. The DNA probes deposited on the gold surface have been functional and able to detect changes in the base sequence of a 21-mer oligonucleotide.
Sunflower Hybrid Breeding: From Markers to Genomic Selection

PubMed Central

Dimitrijevic, Aleksandra; Horn, Renate

2018-01-01

In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. 
Integrative approaches combining omic technologies (genomics, transcriptomics, proteomics, metabolomics and phenomics) using bioinformatic tools will facilitate the identification of target genes and markers for complex traits and will give a better insight into the mechanisms behind the traits. PMID:29387071
Hybrid genome assembly and annotation of Paenibacillus pasadenensis strain R16 reveals insights on endophytic life style and antifungal activity

PubMed Central

Passera, Alessandro; Marcolungo, Luca; Brasca, Milena; Quaglino, Fabio; Cantaloni, Chiara; Delledonne, Massimo

2018-01-01

Bacteria of the Paenibacillus genus are becoming important in many fields of science, including agriculture, for their positive effects on the health of plants. However, there is little information available on this genus compared to other bacteria (such as Bacillus or Pseudomonas), especially when considering genomic information. Sequencing the genomes of plant-beneficial bacteria is a crucial step to identify the genetic elements underlying the adaptation to life inside a plant host and, in particular, which of these features determine the differences between a helpful microorganism and a pathogenic one. In this study, we have characterized the genome of Paenibacillus pasadenensis, strain R16, recently investigated for its antifungal activities and plant-associated features. A hybrid assembly approach was used, integrating the very precise reads obtained by Illumina technology and long fragments acquired with Oxford Nanopore Technology (ONT) sequencing. De novo genome assembly based solely on Illumina reads generated a relatively fragmented assembly of 5.72 Mbp in 99 ungapped sequences with an N50 length of 544 Kbp; hybrid assembly, integrating Illumina and ONT reads, improved the assembly quality, generating a genome of 5.75 Mbp, organized in 6 contigs with an N50 length of 3.4 Mbp. Annotation of the latter genome identified 4987 coding sequences, of which 1610 are hypothetical proteins. Enrichment analysis identified pathways of particular interest for the endophyte biology, including the chitin-utilization pathway and the incomplete siderophore pathway which hints at siderophore parasitism. In addition, the analysis led to the identification of genes for the production of terpenes, for example farnesol, which was hypothesized as the main antifungal molecule produced by the strain. 
The functional analysis on the genome confirmed several plant-associated, plant-growth promotion, and biocontrol traits of strain R16, thus adding insights in the genetic bases of these complex features, and of the Paenibacillus genus in general. PMID:29351296
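The N50 lengths quoted in the abstract above summarize assembly contiguity. As a minimal sketch of how that statistic is commonly computed (the function name n50 and the toy contig lengths are illustrative assumptions, not data from the study):

```python
def n50(lengths):
    """Length L such that contigs of length >= L hold at least half of all bases."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy contig lengths (hypothetical values).
print(n50([2, 3, 4, 5, 6]))
```

A higher N50 at a similar total assembly size, as reported for the hybrid assembly, indicates that the same sequence is held in fewer, longer contigs.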
Identification of a novel interspecific hybrid yeast from a metagenomic spontaneously inoculated beer sample using Hi-C.

PubMed

Smukowski Heil, Caiti; Burton, Joshua N; Liachko, Ivan; Friedrich, Anne; Hanson, Noah A; Morris, Cody L; Schacherer, Joseph; Shendure, Jay; Thomas, James H; Dunham, Maitreya J

2018-01-01

Interspecific hybridization is a common mechanism enabling genetic diversification and adaptation; however, the detection of hybrid species has been quite difficult. The identification of microbial hybrids is made even more complicated, as most environmental microbes are resistant to culturing and must be studied in their native mixed communities. We have previously adapted the chromosome conformation capture method Hi-C to the assembly of genomes from mixed populations. Here, we show the method's application in assembling genomes directly from an uncultured, mixed population from a spontaneously inoculated beer sample. Our assembly method has enabled us to de-convolute four bacterial and four yeast genomes from this sample, including a putative yeast hybrid. Downstream isolation and analysis of this hybrid confirmed its genome to consist of Pichia membranifaciens and that of another related, but undescribed, yeast. Our work shows that Hi-C-based metagenomic methods can overcome the limitation of traditional sequencing methods in studying complex mixtures of genomes. Copyright © 2017 John Wiley & Sons, Ltd.
Evaluation of chronic lymphocytic leukemia by oligonucleotide-based microarray analysis uncovers novel aberrations not detected by FISH or cytogenetic analysis

PubMed Central

2011-01-01

Background Cytogenetic evaluation is a key component of the diagnosis and prognosis of chronic lymphocytic leukemia (CLL). We performed oligonucleotide-based comparative genomic hybridization microarray analysis on 34 samples with CLL and known abnormal karyotypes previously determined by cytogenetics and/or fluorescence in situ hybridization (FISH). Results Using a custom designed microarray that targets >1800 genes involved in hematologic disease and other malignancies, we identified additional cryptic aberrations and novel findings in 59% of cases. These included gains and losses of genes associated with cell cycle regulation, apoptosis and susceptibility loci on 3p21.31, 5q35.2q35.3, 10q23.31q23.33, 11q22.3, and 22q11.23. Conclusions Our results show that microarray analysis will detect known aberrations, including microscopic and cryptic alterations. In addition, novel genomic changes will be uncovered that may become important prognostic predictors or treatment targets for CLL in the future. PMID:22087757
Laser Desorption Mass Spectrometry for DNA Sequencing and Analysis

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Golovlev, V. V.; Isola, N. R.; Allman, S. L.

1998-03-01

Rapid DNA sequencing and/or analysis is critically important for biomedical research. In the past, gel electrophoresis has been the primary tool to achieve DNA analysis and sequencing. However, gel electrophoresis is a time-consuming and labor-intensive process. Recently, we have developed and used laser desorption mass spectrometry (LDMS) to achieve sequencing of ss-DNA longer than 100 nucleotides. With LDMS, we succeeded in sequencing DNA in seconds instead of the hours or days required by gel electrophoresis. In addition to sequencing, we also applied LDMS for the detection of DNA probes for hybridization. LDMS was also used to detect short tandem repeats for forensic applications. Clinical applications for disease diagnosis such as cystic fibrosis caused by base deletion and point mutation have also been demonstrated. Experimental details will be presented in the meeting.
Signal amplification of padlock probes by rolling circle replication.

PubMed Central

Banér, J; Nilsson, M; Mendel-Hartvig, M; Landegren, U

1998-01-01

Circularizing oligonucleotide probes (padlock probes) have the potential to detect sets of gene sequences with high specificity and excellent selectivity for sequence variants, but sensitivity of detection has been limiting. By using a rolling circle replication (RCR) mechanism, circularized but not unreacted probes can yield a powerful signal amplification. We demonstrate here that in order for the reaction to proceed efficiently, the probes must be released from the topological link that forms with target molecules upon hybridization and ligation. If the target strand has a nearby free 3' end, then the probe-target hybrids can be displaced by the polymerase used for replication. The displaced probe can then slip off the target strand and a rolling circle amplification is initiated. Alternatively, the target sequence itself can prime an RCR after its non-base paired 3' end has been removed by exonucleolytic activity. We found the Phi29 DNA polymerase to be superior to the Klenow fragment in displacing the target DNA strand, and it maintained the polymerization reaction for at least 12 h, yielding an extension product that represents several thousand-fold the length of the padlock probe. PMID:9801302
The construction and partial characterization of plasmids containing complementary DNA sequences to human calcitonin precursor polyprotein.

PubMed Central

Allison, J; Hall, L; MacIntyre, I; Craig, R K

1981-01-01

(1) Total poly(A)-containing RNA isolated from human thyroid medullary carcinoma tissue was shown to direct the synthesis in the wheat germ cell-free system of a major (Mr 21000) and several minor forms of human calcitonin precursor polyproteins. Evidence for processing of these precursor(s) by the wheat germ cell-free system is also presented. (2) A small complementary DNA (cDNA) plasmid library has been constructed in the PstI site of the plasmid pAT153, using total human thyroid medullary carcinoma poly(A)-containing RNA as the starting material. (3) Plasmids containing abundant cDNA sequences were selected by hybridization in situ, and two of these (phT-B3 and phT-B6) were characterized by hybridization-translation and restriction analysis. Each was shown to contain human calcitonin precursor polyprotein cDNA sequences. (4) RNA blotting techniques demonstrate that the human calcitonin precursor polyprotein is encoded within a mRNA containing 1000 bases. (5) The results demonstrate that human calcitonin is synthesized as a precursor polyprotein. PMID:6896146
A DNA microarray for identification of selected Korean birds based on mitochondrial cytochrome c oxidase I gene sequences.

PubMed

Chung, In-Hyuk; Yoo, Hye Sook; Eah, Jae-Yong; Yoon, Hyun-Kyu; Jung, Jin-Wook; Hwang, Seung Yong; Kim, Chang-Bae

2010-10-01

DNA barcoding with the gene encoding cytochrome c oxidase I (COI) in the mitochondrial genome has been proposed as a standard marker to identify and discover animal species. Some migratory wild birds are suspected of transmitting avian influenza and pose a threat to aircraft safety because of bird strikes. We have previously reported the COI gene sequences of 92 Korean bird species. In the present study, we developed a DNA microarray to identify 17 selected bird species on the basis of nucleotide diversity. We designed and synthesized 19 specific oligonucleotide probes; these probes were arrayed on a silylated glass slide. The length of the probes was 19-24 bps. The COI sequences amplified from the tissues of the selected birds were labeled with a fluorescent probe for microarray hybridization, and unique hybridization patterns were detected for each selected species. These patterns may be considered diagnostic patterns for species identification. This microarray system will provide a sensitive and a high-throughput method for identification of Korean birds.
[Comparative results of preimplantation genetic screening by array comparative genomic hybridization and new-generation sequencing].

PubMed

Aleksandrova, N V; Shubina, E S; Ekimov, A N; Kodyleva, T A; Mukosey, I S; Makarova, N P; Kulakova, E V; Levkov, L A; Barkov, I Yu; Trofimov, D Yu; Sukhikh, G T

2017-01-01

Aneuploidies as quantitative chromosome abnormalities are a main cause of failed development of morphologically normal embryos, implantation failures, and early reproductive losses. Preimplantation genetic screening (PGS) allows a preselection of embryos with a normal karyotype, thus increasing the implantation rate and reducing the frequency of early pregnancy loss after IVF. Modern PGS technologies are based on a genome-wide analysis of the embryo. The first pilot study in Russia was performed to assess the possibility of using semiconductor new-generation sequencing (NGS) as a PGS method. NGS data were collected for 38 biopsied embryos and compared with the data from array comparative genomic hybridization (array-CGH). The concordance between the NGS and array-CGH data was 94.8%. Two samples showed the karyotype 47,XXY by array-CGH and a normal karyotype by NGS. The discrepancies may be explained by loss of efficiency of array-CGH amplicon labeling.
Integrating DNA strand displacement circuitry to the nonlinear hybridization chain reaction.

PubMed

Zhang, Zhuo; Fan, Tsz Wing; Hsing, I-Ming

2017-02-23

Programmable and modular attributes of DNA molecules allow one to develop versatile sensing platforms that can be operated isothermally and enzyme-free. In this work, we present an approach to integrate upstream DNA strand displacement circuits that can be turned on by a sequence-specific microRNA analyte with a downstream nonlinear hybridization chain reaction for a cascading hyperbranched nucleic acid assembly. This system provides a two-step amplification strategy for highly sensitive detection of the miRNA analyte, conducive for multiplexed detection. Multiple miRNA analytes were tested with our integrated circuitry using the same downstream signal amplification setting, showing the decoupling of nonlinear self-assembly with the analyte sequence. Compared with the reported methods, our signal amplification approach provides an additional control module for higher-order DNA self-assembly and could be developed into a promising platform for the detection of critical nucleic-acid based biomarkers.

Estimating drought risk across Europe from reported drought impacts, hazard indicators and vulnerability factors

NASA Astrophysics Data System (ADS)

Blauhut, V.; Stahl, K.; Stagge, J. H.; Tallaksen, L. M.; De Stefano, L.; Vogt, J.

2015-12-01

Drought is one of the most costly natural hazards in Europe. Due to its complexity, drought risk, the combination of the natural hazard and societal vulnerability, is difficult to define and challenging to detect and predict, as the impacts of drought are very diverse, covering the breadth of socioeconomic and environmental systems. Pan-European maps of drought risk could inform the elaboration of guidelines and policies to address its documented severity and impact across borders. This work (1) tests the capability of commonly applied hazard indicators and vulnerability factors to predict annual drought impact occurrence for different sectors and macro regions in Europe and (2) combines information on past drought impacts, drought hazard indicators, and vulnerability factors into estimates of drought risk at the pan-European scale. This "hybrid approach" bridges the gap between traditional vulnerability assessment and probabilistic impact forecast in a statistical modelling framework. Multivariable logistic regression was applied to predict the likelihood of impact occurrence on an annual basis for particular impact categories and European macro regions. The results indicate sector- and macro-region-specific sensitivities of hazard indicators, with the Standardised Precipitation Evapotranspiration Index for a twelve-month aggregation period (SPEI-12) as the overall best hazard predictor. Vulnerability factors have only limited ability to predict drought impacts as single predictors, with information about land use and water resources as the best vulnerability-based predictors. (3) The application of the "hybrid approach" revealed strong regional (NUTS combo level) and sector-specific differences in drought risk across Europe. The majority of best predictor combinations rely on a combination of SPEI for shorter and longer aggregation periods, and a combination of information on land use and water resources.
The added value of integrating regional vulnerability information with drought risk prediction could be proven. Thus, the study contributes to the overall understanding of drivers of drought impacts, current practice of drought indicators selection for specific application, and drought risk assessment.
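As a rough illustration of how a multivariable logistic regression turns one hazard indicator and one vulnerability factor into an annual impact likelihood, the sketch below evaluates the logistic function for a single region and year. The function name, the choice of predictors, and all coefficient values are hypothetical placeholders and are not taken from the study.

```python
import math


def impact_likelihood(spei12, land_use_share,
                      intercept=-1.0, b_spei=-1.5, b_land=2.0):
    """Probability of drought impact occurrence in one year.

    All coefficients are illustrative assumptions, not fitted values.
    A negative b_spei means drier conditions (more negative SPEI-12)
    raise the predicted likelihood.
    """
    # Linear predictor combining hazard and vulnerability information.
    z = intercept + b_spei * spei12 + b_land * land_use_share
    # Logistic link maps the linear predictor to a probability in (0, 1).
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    for spei in (-2.0, -1.0, 0.0, 1.0):
        print(spei, round(impact_likelihood(spei, 0.5), 3))
```

In a real application the coefficients would be estimated separately for each impact category and macro region from reported impacts.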
Estimating drought risk across Europe from reported drought impacts, drought indices, and vulnerability factors

NASA Astrophysics Data System (ADS)

Blauhut, Veit; Stahl, Kerstin; Stagge, James Howard; Tallaksen, Lena M.; De Stefano, Lucia; Vogt, Jürgen

2016-07-01

Drought is one of the most costly natural hazards in Europe. Due to its complexity, drought risk, meant as the combination of the natural hazard and societal vulnerability, is difficult to define and challenging to detect and predict, as the impacts of drought are very diverse, covering the breadth of socioeconomic and environmental systems. Pan-European maps of drought risk could inform the elaboration of guidelines and policies to address its documented severity and impact across borders. This work tests the capability of commonly applied drought indices and vulnerability factors to predict annual drought impact occurrence for different sectors and macro regions in Europe and combines information on past drought impacts, drought indices, and vulnerability factors into estimates of drought risk at the pan-European scale. This hybrid approach bridges the gap between traditional vulnerability assessment and probabilistic impact prediction in a statistical modelling framework. Multivariable logistic regression was applied to predict the likelihood of impact occurrence on an annual basis for particular impact categories and European macro regions. The results indicate sector- and macro-region-specific sensitivities of drought indices, with the Standardized Precipitation Evapotranspiration Index (SPEI) for a 12-month accumulation period as the overall best hazard predictor. Vulnerability factors have only limited ability to predict drought impacts as single predictors, with information about land use and water resources being the best vulnerability-based predictors. The application of the hybrid approach revealed strong regional and sector-specific differences in drought risk across Europe. The majority of the best predictor combinations rely on a combination of SPEI for shorter and longer accumulation periods, and a combination of information on land use and water resources. 
The added value of integrating regional vulnerability information with drought risk prediction could be proven. Thus, the study contributes to the overall understanding of drivers of drought impacts, appropriateness of drought indices selection for specific applications, and drought risk assessment.
Oligonucleotide Array for Identification and Detection of Pythium Species

PubMed Central

Tambong, J. T.; de Cock, A. W. A. M.; Tinker, N. A.; Lévesque, C. A.

2006-01-01

A DNA array containing 172 oligonucleotides complementary to specific diagnostic regions of internal transcribed spacers (ITS) of more than 100 species was developed for identification and detection of Pythium species. All of the species studied, with the exception of Pythium ostracodes, exhibited a positive hybridization reaction with at least one corresponding species-specific oligonucleotide. Hybridization patterns were distinct for each species. The array hybridization patterns included cluster-specific oligonucleotides that facilitated the recognition of species, including new ones, belonging to groups such as those producing filamentous or globose sporangia. BLAST analyses against 500 publicly available Pythium sequences in GenBank confirmed that species-specific oligonucleotides were unique to all of the available strains of each species, of which there were numerous economically important ones. GenBank entries of newly described species that are not putative synonyms showed no homology to sequences of the spotted species-specific oligonucleotides, but most new species did match some of the cluster-specific oligonucleotides. Further verification of the specificity of the DNA array was done with 50 additional Pythium isolates obtained by soil dilution plating. The hybridization patterns obtained were consistent with the identification of these isolates based on morphology and ITS sequence analyses. In another blind test, total DNA of the same soil samples was amplified and hybridized on the array, and the results were compared to those of 130 Pythium isolates obtained by soil dilution plating and root baiting. The 13 species detected by the DNA array corresponded to the isolates obtained by a combination of soil dilution plating and baiting, except for one new species that was not represented on the array. We conclude that the reported DNA array is a reliable tool for identification and detection of the majority of Pythium species in environmental samples. 
Simultaneous detection and identification of multiple species of soilborne pathogens such as Pythium species could be a major step forward for epidemiological and ecological studies. PMID:16597974
"Hook"-calibration of GeneChip-microarrays: theory and algorithm.

PubMed

Binder, Hans; Preibisch, Stephan

2008-08-29

The improvement of microarray calibration methods is an essential prerequisite for quantitative expression analysis. This issue requires the formulation of an appropriate model describing the basic relationship between the probe intensity and the specific transcript concentration in a complex environment of competing interactions, the estimation of the magnitude of these effects and their correction using the intensity information of a given chip and, finally, the development of practicable algorithms which judge the quality of a particular hybridization and estimate the expression degree from the intensity values. We present the so-called hook-calibration method which co-processes the log-difference (delta) and -sum (sigma) of the perfect match (PM) and mismatch (MM) probe-intensities. The MM probes are utilized as an internal reference which is subjected to the same hybridization law as the PM, however with modified characteristics. After sequence-specific affinity correction the method fits the Langmuir-adsorption model to the smoothed delta-versus-sigma plot. The geometrical dimensions of this so-called hook-curve characterize the particular hybridization in terms of simple geometric parameters which provide information about the mean non-specific background intensity, the saturation value, the mean PM/MM-sensitivity gain and the fraction of absent probes. This graphical summary spans a metrics system for expression estimates in natural units such as the mean binding constants and the occupancy of the probe spots. The method is single-chip based, i.e. it separately uses the intensities for each selected chip. The hook-method corrects the raw intensities for the non-specific background hybridization in a sequence-specific manner, for the potential saturation of the probe-spots with bound transcripts and for the sequence-specific binding of specific transcripts. 
The obtained chip characteristics in combination with the sensitivity corrected probe-intensity values provide expression estimates scaled in natural units which are given by the binding constants of the particular hybridization.
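The two quantities the hook method works with can be sketched in a few lines. The example below computes the delta and sigma coordinates of one PM/MM probe pair and evaluates a plain Langmuir isotherm. The base-10 logarithm, the factor 1/2 in sigma, and all function and parameter names are assumptions made for illustration; the abstract does not specify them, and the sequence-specific affinity correction and the curve fit itself are not reproduced.

```python
import math


def hook_coordinates(pm, mm):
    """Return (sigma, delta) for one PM/MM probe pair.

    Assumed convention: sigma is the mean of the log10 intensities and
    delta is their difference.
    """
    log_pm, log_mm = math.log10(pm), math.log10(mm)
    sigma = 0.5 * (log_pm + log_mm)
    delta = log_pm - log_mm
    return sigma, delta


def langmuir_intensity(conc, k, i_max, background=0.0):
    """Langmuir isotherm: intensity saturates at i_max as conc grows.

    k is a binding constant; occupancy = k*conc / (1 + k*conc).
    """
    occupancy = k * conc / (1.0 + k * conc)
    return background + i_max * occupancy


if __name__ == "__main__":
    print(hook_coordinates(1000.0, 10.0))
    print(langmuir_intensity(1.0, 1.0, 100.0))
```

Plotting delta against sigma for all probe pairs of one chip would give the raw scatter that the method smooths before fitting.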
Improving transmembrane protein consensus topology prediction using inter-helical interaction.

PubMed

Wang, Han; Zhang, Chao; Shi, Xiaohu; Zhang, Li; Zhou, You

2012-11-01

Alpha helix transmembrane proteins (αTMPs) represent roughly 30% of all open reading frames (ORFs) in a typical genome and are involved in many critical biological processes. Because of their special physicochemical properties, they are hard to crystallize and high-resolution structures are hard to obtain experimentally; thus, sequence-based topology prediction is highly desirable for the study of transmembrane proteins (TMPs), both in structure prediction and function prediction. Various model-based topology prediction methods have been developed, but the accuracy of those individual predictors remains poor due to the limitations of the methods or the features they use. Thus, the consensus topology prediction method becomes practical for high-accuracy applications by combining the advantages of the individual predictors. Here, based on the observation that inter-helical interactions are commonly found within the transmembrane helixes (TMHs) and strongly indicate their existence, we present a novel consensus topology prediction method for αTMPs, CNTOP, which incorporates four leading individual topology predictors and further improves the prediction accuracy by using the predicted inter-helical interactions. The method achieved 87% prediction accuracy on a benchmark dataset and 78% accuracy on a non-redundant dataset composed of polytopic αTMPs. Our method achieves higher topology accuracy than any other individual or consensus predictor; at the same time, the TMHs are predicted more accurately in length and location, with both false positives (FPs) and false negatives (FNs) decreasing dramatically. CNTOP is available at: http://ccst.jlu.edu.cn/JCSB/cntop/CNTOP.html. Copyright © 2012 Elsevier B.V. All rights reserved.
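The general idea of a consensus predictor can be sketched as a per-residue majority vote over the outputs of several individual predictors. This is only a generic illustration and not the CNTOP algorithm: the refinement by predicted inter-helical interactions is omitted, and the label alphabet ('i' inside, 'M' membrane, 'o' outside) and the function name are assumptions.

```python
from collections import Counter


def consensus_topology(predictions):
    """Majority vote per residue over equal-length topology strings.

    Ties are resolved in favour of the tied label that occurs first in
    predictor order, since Counter.most_common keeps insertion order
    for equal counts.
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must have the same length")
    consensus = []
    for position in range(length):
        votes = Counter(p[position] for p in predictions)
        consensus.append(votes.most_common(1)[0][0])
    return "".join(consensus)


if __name__ == "__main__":
    outputs = ["iiMMMoo", "iiMMMoo", "iMMMMoo", "iiiMMoo"]
    print(consensus_topology(outputs))
```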
Cloud-based adaptive exon prediction for DNA analysis

PubMed Central

Putluri, Srinivasareddy; Fathima, Shaik Yasmeen

2018-01-01

Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequencing storage costs will be minimised by use of cloud services. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques have been found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database. PMID:29515813
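The three-base periodicity property can be made concrete with a small spectral measure. The sketch below maps a DNA string to four binary indicator sequences and sums their squared Fourier magnitudes at the frequency 1/3. It illustrates the property only; it is not one of the adaptive exon predictors described above, and the function name and the normalisation by sequence length are assumptions.

```python
import cmath


def period3_power(seq):
    """Spectral power at frequency 1/3, summed over A, C, G and T.

    Each base is turned into a 0/1 indicator sequence; a strong
    codon-like repeat concentrates power at this frequency.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    total = 0.0
    for base in "ACGT":
        # Fourier coefficient of the indicator sequence at frequency 1/3.
        coeff = sum(cmath.exp(-2j * cmath.pi * pos / 3)
                    for pos, b in enumerate(seq) if b == base)
        total += abs(coeff) ** 2
    return total / len(seq)


if __name__ == "__main__":
    print(period3_power("ATG" * 20))   # strongly period-3 sequence
    print(period3_power("A" * 60))     # no period-3 structure
```

In practice such a measure is evaluated in a sliding window along the sequence, so that peaks mark candidate exon regions.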
Zepto-molar electrochemical detection of Brucella genome based on gold nanoribbons covered by gold nanoblooms

NASA Astrophysics Data System (ADS)

Rahi, Amid; Sattarahmady, Naghmeh; Heli, Hossein

2015-12-01

Gold nanoribbons covered by gold nanoblooms were sonoelectrodeposited on a polycrystalline gold surface at -1800 mV (vs. AgCl) with the assistance of ultrasound and co-occurrence of the hydrogen evolution reaction. The nanostructure, as a transducer, was utilized to immobilize a Brucella-specific probe and fabricate a genosensor, and the process of immobilization and hybridization was detected by electrochemical methods, using methylene blue as a redox marker. The proposed method was assayed for detection of the complementary sequence, base-mismatched sequences (one-, two- and three-base mismatches), and a non-complementary sequence. The fabricated genosensor was evaluated for the assay of the bacteria in cultured and human samples without polymerase chain reaction (PCR). The genosensor could detect the complementary sequence with a calibration sensitivity of 0.40 μA dm³ mol⁻¹, a linear concentration range of 10 zmol dm⁻³ to 10 pmol dm⁻³, and a detection limit of 1.71 zmol dm⁻³.
Hybridization of Tamarix ramosissima and T. chinensis (saltcedars) with T. aphylla (athel) (Tamaricaceae) in the southwestern USA determined from DNA sequence data

USGS Publications Warehouse

Gaskin, John F.; Shafroth, Patrick B.

2005-01-01

Morphological intermediates between Tamarix ramosissima or T. chinensis (saltcedars) and T. aphylla (athel) were found recently in three locations in the southwestern USA, and were assumed to be hybrids or a previously unreported species. We sequenced chloroplast and nuclear DNA from putative parental and hybrid morphotypes and hybrid status of morphological intermediates was supported. Chloroplast data suggest that the seed source for these hybrids is T. aphylla. Invasive T. aphylla genotypes found in Australia match those found in the USA. Seed was collected from one of the hybrids, and a low percentage of it was viable. This hybrid combination has not been previously reported in the USA or the native ranges of the species. Although populations of this novel Tamarix hybrid appear to be uncommon at present, both parental species are considered invasive (saltcedars in North America; athel in Australia), and it is possible that more aggressive hybrid genotypes could be produced. Therefore, natural resource managers concerned with the potential spread of non-native species should be aware of the existence of these plants and monitor their future spread.
Isolation and Characterization of Burkholderia rinojensis sp. nov., a Non-Burkholderia cepacia Complex Soil Bacterium with Insecticidal and Miticidal Activities

PubMed Central

Fernandez, Lorena E.; Koivunen, Marja; Yang, April; Flor-Weiler, Lina; Marrone, Pamela G.

2013-01-01

Isolate A396, a bacterium isolated from a Japanese soil sample, demonstrated strong insecticidal and miticidal activities in laboratory bioassays. The isolate was characterized through biochemical methods, fatty acid methyl ester (FAME) analysis, sequencing of 16S rRNA, multilocus sequence typing and analysis, and DNA-DNA hybridization. FAME analysis matched A396 to Burkholderia cenocepacia, but this result was not confirmed by 16S rRNA or DNA-DNA hybridization. 16S rRNA sequencing indicated closest matches with B. glumae and B. plantarii. DNA-DNA hybridization experiments with B. plantarii, B. glumae, B. multivorans, and B. cenocepacia confirmed the low genetic similarity (11.5 to 37.4%) with known members of the genus. PCR-based screening showed that A396 lacks markers associated with members of the B. cepacia complex. Bioassay results indicated two mechanisms of action: through ingestion and contact. The isolate effectively controlled beet armyworms (Spodoptera exigua; BAW) and two-spotted spider mites (Tetranychus urticae; TSSM). In diet overlay bioassays with BAW, 1% to 4% (vol/vol) dilution of the whole-cell broth caused 97% to 100% mortality 4 days postexposure, and leaf disc treatment bioassays attained 75% ± 22% mortality 3 days postexposure. Contact bioassays led to 50% larval mortality, as well as discoloration, stunting, and failure to molt. TSSM mortality reached 93% in treated leaf discs. Activity was maintained in cell-free supernatants and after heat treatment (60°C for 2 h), indicating that a secondary metabolite or excreted thermostable enzyme might be responsible for the activity. Based on these results, we describe the novel species Burkholderia rinojensis, a good candidate for the development of a biocontrol product against insect and mite pests. PMID:24096416
A bimetallic nanocomposite modified genosensor for recognition and determination of thalassemia gene.

PubMed

Hamidi-Asl, Ezat; Raoof, Jahan Bakhsh; Naghizadeh, Nahid; Akhavan-Niaki, Haleh; Ojani, Reza; Banihashemi, Ali

2016-10-01

The main roles of DNA in the cells are to maintain and properly express genetic information. It is important to have analytical methods capable of fast and sensitive detection of DNA damage. DNA hybridization sensors are well suited for diagnostics and other purposes, including determination of bacteria and viruses. Beta thalassemias (βth) are due to mutations in the β-globin gene. In this study, an electrochemical biosensor which detects the sequences related to the β-globin gene in real samples amplified by polymerase chain reaction (PCR) is described for the first time. The biosensor relies on the immobilization of a 20-mer single-stranded oligonucleotide (probe) related to the βth sequence on a carbon paste electrode (CPE) modified by 15% silver (Ag) and platinum (Pt) nanoparticles to prepare the bimetallic nanocomposite electrode, and on hybridization of this oligonucleotide with its complementary sequence (target). The extent of hybridization between the probe and target sequences was shown by using linear sweep voltammetry (LSV) with methylene blue (MB) as hybridization indicator. The selectivity of the sensor was investigated using PCR samples containing non-complementary oligonucleotides. The detection limit of the biosensor was calculated to be about 470.0 pg/μL. Copyright © 2016 Elsevier B.V. All rights reserved.
Precise and selective sensing of DNA-DNA hybridization by graphene/Si-nanowires diode-type biosensors.

PubMed

Kim, Jungkil; Park, Shin-Young; Kim, Sung; Lee, Dae Hun; Kim, Ju Hwan; Kim, Jong Min; Kang, Hee; Han, Joong-Soo; Park, Jun Woo; Lee, Hosun; Choi, Suk-Ho

2016-08-18

Single-Si-nanowire (NW)-based DNA sensors have been recently developed, but their sensitivity is very limited because of high noise signals, originating from small source-drain current of the single Si NW. Here, we demonstrate that chemical-vapor-deposition-grown large-scale graphene/surface-modified vertical-Si-NW-arrays junctions can be utilized as diode-type biosensors for highly-sensitive and -selective detection of specific oligonucleotides. For this, a twenty-seven-base-long synthetic oligonucleotide, which is a fragment of human DENND2D promoter sequence, is first decorated as a probe on the surface of vertical Si-NW arrays, and then the complementary oligonucleotide is hybridized to the probe. This hybridization gives rise to a doping effect on the surface of Si NWs, resulting in the increase of the current in the biosensor. The current of the biosensor increases from 19 to 120% as the concentration of the target DNA varies from 0.1 to 500 nM. In contrast, such biosensing does not come into play by the use of the oligonucleotide with incompatible or mismatched sequences. Similar results are observed from photoluminescence microscopic images and spectra. The biosensors show very-uniform current changes with standard deviations ranging ~1 to ~10% by ten-times endurance tests. These results are very promising for their applications in accurate, selective, and stable biosensing.
RNABindRPlus: a predictor that combines machine learning and sequence homology-based methods to improve the reliability of predicted RNA-binding residues in proteins.

PubMed

Walia, Rasna R; Xue, Li C; Wilkins, Katherine; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

2014-01-01

Protein-RNA interactions are central to essential cellular processes such as protein synthesis and regulation of gene expression and play roles in human infectious and genetic diseases. Reliable identification of protein-RNA interfaces is critical for understanding the structural bases and functional implications of such interactions and for developing effective approaches to rational drug design. Sequence-based computational methods offer a viable, cost-effective way to identify putative RNA-binding residues in RNA-binding proteins. Here we report two novel approaches: (i) HomPRIP, a sequence homology-based method for predicting RNA-binding sites in proteins; (ii) RNABindRPlus, a new method that combines predictions from HomPRIP with those from an optimized Support Vector Machine (SVM) classifier trained on a benchmark dataset of 198 RNA-binding proteins. Although highly reliable, HomPRIP cannot make predictions for the unaligned parts of query proteins and its coverage is limited by the availability of close sequence homologs of the query protein with experimentally determined RNA-binding sites. RNABindRPlus overcomes these limitations. We compared the performance of HomPRIP and RNABindRPlus with that of several state-of-the-art predictors on two test sets, RB44 and RB111. On a subset of proteins for which homologs with experimentally determined interfaces could be reliably identified, HomPRIP outperformed all other methods achieving an MCC of 0.63 on RB44 and 0.83 on RB111. RNABindRPlus was able to predict RNA-binding residues of all proteins in both test sets, achieving an MCC of 0.55 and 0.37, respectively, and outperforming all other methods, including those that make use of structure-derived features of proteins. More importantly, RNABindRPlus outperforms all other methods for any choice of tradeoff between precision and recall. 
An important advantage of both HomPRIP and RNABindRPlus is that they rely on readily available sequence and sequence-derived features of RNA-binding proteins. A webserver implementation of both methods is freely available at http://einstein.cs.iastate.edu/RNABindRPlus/.
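The Matthews correlation coefficient (MCC) used to report these results can be computed directly from the four confusion-matrix counts of a per-residue binding prediction. The sketch below uses the standard definition; the function name and the convention of returning 0.0 when a marginal count is zero are assumptions.

```python
import math


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (no better than
    chance) to +1 (perfect prediction).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Undefined when any marginal total is zero; report 0.0 by convention.
        return 0.0
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Hypothetical counts of binding / non-binding residues.
    print(round(mcc(tp=40, fp=10, tn=140, fn=10), 3))
```

Because binding residues are usually a small minority of all residues, MCC is more informative than plain accuracy for this kind of imbalanced task.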
Genotypic and phenotypic evaluation of off-type grasses in hybrid Bermudagrass [Cynodon dactylon (L.) Pers. x C. transvaalensis Burtt-Davy] putting greens using genotyping-by-sequencing and morphological characterization.

PubMed

Reasor, Eric H; Brosnan, James T; Staton, Margaret E; Lane, Thomas; Trigiano, Robert N; Wadl, Phillip A; Conner, Joann A; Schwartz, Brian M

2018-01-01

Interspecific hybrid bermudagrass [Cynodon dactylon (L.) Pers. x C. transvaalensis Burtt-Davy] is one of the most widely used grasses on golf courses, with cultivars derived from 'Tifgreen' or 'Tifdwarf' particularly used for putting greens. Many bermudagrass cultivars established for putting greens can be genetically unstable and lead to the occurrence of undesirable off-type grasses that vary in phenotype. The objective of this research was to genetically and phenotypically differentiate off-type grasses and hybrid cultivars. Beginning in 2013, off-type and desirable hybrid bermudagrass samples were collected from golf course putting greens in the southeastern United States and genetically and phenotypically characterized using genotyping-by-sequencing and morphology. Genotyping-by-sequencing determined that 11% (5) of off-type and desirable samples from putting greens were genetically divergent from standard cultivars such as Champion, MiniVerde, Tifdwarf, TifEagle, and Tifgreen. In addition, genotyping-by-sequencing was unable to genetically distinguish all standard cultivars from one another due to their similar origin and clonal propagation; however, over 90,000 potentially informative nucleotide variants were identified among the triploid hybrid cultivars. Although few genetic differences were found in this research, samples harvested from golf course putting greens had variable morphology and were clustered into three distinct phenotypic groups. The majority of off-type grasses in hybrid bermudagrass putting greens were genetically similar with variable morphological traits. Off-type grasses within golf course putting greens have the potential to compromise putting surface functionality and aesthetics.
Performance and life cycle environmental benefits of recycling spent ion exchange brines by catalytic treatment of nitrate.

PubMed

Choe, Jong Kwon; Bergquist, Allison M; Jeong, Sangjo; Guest, Jeremy S; Werth, Charles J; Strathmann, Timothy J

2015-09-01

Salt used to make brines for regeneration of ion exchange (IX) resins is the dominant economic and environmental liability of IX treatment systems for nitrate-contaminated drinking water sources. To reduce salt usage, the applicability and environmental benefits of using a catalytic reduction technology to treat nitrate in spent IX brines and enable their reuse for IX resin regeneration were evaluated. Hybrid IX/catalyst systems were designed and life cycle assessment of process consumables was used to set performance targets for the catalyst reactor. Nitrate reduction was measured in a typical spent brine (i.e., 5000 mg/L NO3⁻ and 70,000 mg/L NaCl) using bimetallic Pd-In hydrogenation catalysts with variable Pd (0.2-2.5 wt%) and In (0.0125-0.25 wt%) loadings on pelletized activated carbon support (Pd-In/C). The highest activity of 50 mg NO3⁻/(min·g Pd) was obtained with a 0.5 wt% Pd-0.1 wt% In/C catalyst. Catalyst longevity was demonstrated by observing no decrease in catalyst activity over more than 60 days in a packed-bed reactor. Based on catalyst activity measured in batch and packed-bed reactors, environmental impacts of hybrid IX/catalyst systems were evaluated for both sequencing-batch and continuous-flow packed-bed reactor designs, and environmental impacts of the sequencing-batch hybrid system were found to be 38-81% of those of conventional IX. Major environmental impact contributors other than salt consumption include Pd metal, hydrogen (electron donor), and carbon dioxide (pH buffer). Sensitivity of environmental impacts of the sequencing-batch hybrid reactor system to sulfate and bicarbonate anions indicates that the hybrid system is more sustainable than conventional IX when influent water contains <80 mg/L sulfate (at any bicarbonate level up to 100 mg/L) or <20 mg/L bicarbonate (at any sulfate level up to 100 mg/L), assuming 15 brine reuse cycles. 
The study showed that hybrid IX/catalyst reactor systems have potential to reduce resource consumption and improve environmental impacts associated with treating nitrate-contaminated water sources. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular genetic analysis of the V kappa Ser group associated with two mouse light chain genetic markers. Complementary DNA cloning and southern hybridization analysis

PubMed Central

1985-01-01

Previous studies (21) have shown that two mouse kappa light (L) chain variable (V) region polymorphisms, the IB-peptide and Efla markers, reflect expression of a characteristic group of V kappa regions, called V kappa Ser, by some inbred strains and not others. Expression of V kappa Ser is controlled by a locus on chromosome 6, the chromosome that contains the kappa locus. To further characterize this V kappa group and begin to analyze the basis for its strain-specific expression, full-length complementary DNA (cDNA) copies were produced of L chain mRNA from the M75 myeloma that had been induced in the C.C58 strain of mice, and which produces a V kappa Ser L chain. The C.C58 strain is congenic with BALB/cAn, differing in the region of chromosome 6 that controls expression of the V kappa polymorphisms and the Lyt-2 and Lyt-3 T cell alloantigens. The complete nucleotide sequence of this cloned cDNA was determined and compared with the nucleotide sequences of the most closely related BALB/c myeloma L chains known. Results indicated significant differences throughout the variable region, but particularly toward the 5' portion of the sequence. A probe corresponding to 200 bp of the 5' end of the cloned V kappa Ser cDNA was used in Southern hybridizations of restriction digests of liver DNA from a number of inbred, recombinant, and recombinant inbred strains. Under stringent hybridization conditions, one strongly hybridizing fragment was observed in Bam HI, Hind III, and Eco RI digests, and based on the size of the fragments, strains could be organized into two groups. The presence of strongly hybridizing Bam HI, Hind III, and Eco RI fragments of 3.2, 2.8, and 2.1 kb, respectively, was found to correlate completely with expression by the strain of the IB-peptide and Efla markers. All nonexpressor strains yielded hybridizing fragments of 7.8, 8.4, and 2.8 kb, respectively. 
Possible explanations for strain-specific expression of V kappa Ser-associated phenotypic markers are discussed. PMID:3926938
Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes

PubMed Central

Fredlake, Christopher P.; Hert, Daniel G.; Kan, Cheuk-Wai; Chiesl, Thomas N.; Root, Brian E.; Forster, Ryan E.; Barron, Annelise E.

2008-01-01

To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require ≈70 min to deliver ≈650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered “hybrid” mechanism of DNA electromigration, in which DNA molecules alternate rapidly between reptating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs. PMID:18184818
pLoc_bal-mGpos: Predict subcellular localization of Gram-positive bacterial proteins by quasi-balancing training dataset and PseAAC.

PubMed

Xiao, Xuan; Cheng, Xiang; Chen, Genqiang; Mao, Qi; Chou, Kuo-Chen

2018-05-26

Knowledge of protein subcellular localization is vitally important for both basic research and drug development. With the avalanche of protein sequences emerging in the post-genomic age, it is highly desired to develop computational tools for timely and effectively identifying their subcellular localization purely based on the sequence information alone. Recently, a predictor called "pLoc-mGpos" was developed for identifying the subcellular localization of Gram-positive bacterial proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems in which some proteins, called "multiplex proteins", may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mGpos was trained by an extremely skewed dataset in which some subset (subcellular location) was over 11 times the size of the other subsets. Accordingly, it cannot avoid the bias consequence caused by such an uneven training dataset. To alleviate such bias consequence, we have developed a new and bias-reducing predictor called pLoc_bal-mGpos by quasi-balancing the training dataset. Rigorous target jackknife tests on exactly the same experiment-confirmed dataset have indicated that the proposed new predictor is remarkably superior to pLoc-mGpos, the existing state-of-the-art predictor in identifying the subcellular localization of Gram-positive bacterial proteins. To maximize the convenience for most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc_bal-mGpos/, by which users can easily get their desired results without the need to go through the detailed mathematics. Copyright © 2018 Elsevier Inc. All rights reserved.
Development of Prevotella intermedia-specific PCR primers based on the nucleotide sequences of a DNA probe Pig27.

PubMed

Kim, Min Jung; Hwang, Kyung Hwan; Lee, Young-Seok; Park, Jae-Yoon; Kook, Joong-Ki

2011-03-01

The aim of this study was to develop Prevotella intermedia-specific PCR primers based on the P. intermedia-specific DNA probe. The P. intermedia-specific DNA probe was screened by inverted dot blot hybridization and confirmed by Southern blot hybridization. The nucleotide sequences of the species-specific DNA probes were determined using a chain termination method. Southern blot analysis showed that the DNA probe, Pig27, detected only the genomic DNA of P. intermedia strains. PCR showed that the PCR primers, Pin-F1/Pin-R1, had species-specificity for P. intermedia. The detection limits of the PCR primer sets were 0.4 pg of the purified genomic DNA of P. intermedia ATCC 49046. These results suggest that the PCR primers, Pin-F1/Pin-R1, could be useful in the detection of P. intermedia as well as in the development of a PCR kit in epidemiological studies related to periodontal diseases. Crown Copyright © 2010. Published by Elsevier B.V. All rights reserved.
iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition

PubMed Central

Lin, Hao; Deng, En-Ze; Ding, Hui; Chen, Wei; Chou, Kuo-Chen

2014-01-01

The σ54 promoters are unique in prokaryotic genomes and responsible for transcribing carbon- and nitrogen-related genes. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the σ54 promoters. Here, a predictor called ‘iPro54-PseKNC’ was developed. In the predictor, the samples of DNA sequences were formulated by a novel feature vector called ‘pseudo k-tuple nucleotide composition’, which was further optimized by the incremental feature selection procedure. The performance of iPro54-PseKNC was examined by the rigorous jackknife cross-validation tests on a stringent benchmark data set. As a user-friendly web-server, iPro54-PseKNC is freely accessible at http://lin.uestc.edu.cn/server/iPro54-PseKNC. For the convenience of the vast majority of experimental scientists, a step-by-step protocol guide was provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented in this paper just for its integrity. Meanwhile, we also discovered through an in-depth statistical analysis that the distribution of distances between the transcription start sites and the translation initiation sites was governed by the gamma distribution, which may provide a fundamental physical principle for studying the σ54 promoters. PMID:25361964
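To make the idea of a k-tuple feature vector concrete, here is a minimal sketch of the plain k-tuple nucleotide composition, i.e. the normalized frequency of every overlapping k-mer in a DNA sequence. This is a simplification and an assumption on our part: the full pseudo k-tuple nucleotide composition defined in the paper contains additional components that the abstract does not describe, and they are not reproduced here. The function name and the example sequence are hypothetical.

```python
from collections import Counter
from itertools import product


def ktuple_composition(seq: str, k: int = 2) -> dict[str, float]:
    """Normalized frequency of every overlapping k-tuple over A, C, G, T."""
    seq = seq.upper()
    windows = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    # Skip windows that contain ambiguous bases such as N.
    valid = [w for w in windows if set(w) <= set("ACGT")]
    counts = Counter(valid)
    total = len(valid)
    # Emit all 4**k tuples in a fixed order so vectors are comparable.
    return {
        "".join(t): (counts["".join(t)] / total if total else 0.0)
        for t in product("ACGT", repeat=k)
    }


# Hypothetical input sequence, for illustration only.
features = ktuple_composition("TGGCACGATTTTGCA", k=2)
print(len(features), round(sum(features.values()), 6))
```

For k = 2 this gives a 16-dimensional vector whose entries sum to 1; a classifier would take such vectors as input.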
Evaluation and validation of de novo and hybrid assembly techniques to derive high quality genome sequences

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...

2014-06-14

Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as an additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation: all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.
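The summary statistics named in this abstract (number of contigs, N50) are simple to compute from a list of contig lengths. The sketch below uses the common definition of N50: the length L such that contigs of length at least L together cover at least half of the total assembly. The contig lengths are made-up illustration values, not data from the study.

```python
def n50(contig_lengths: list[int]) -> int:
    """Length L such that contigs of length >= L cover at least half the assembly."""
    if not contig_lengths:
        raise ValueError("need at least one contig")
    half = sum(contig_lengths) / 2
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running >= half:
            return length
    raise AssertionError("unreachable for non-negative lengths")


# Hypothetical contig lengths, for illustration only.
lengths = [80, 70, 50, 40, 30, 20]
print("contigs:", len(lengths), "N50:", n50(lengths))
```

Fewer contigs and a larger N50 generally indicate a more contiguous assembly, which is why the two numbers are usually reported together.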

Enrichment of individual KIR2DL4 sequences from genomic DNA using long-template PCR and allele-specific hybridization to magnetic bead-bound oligonucleotide probes.

PubMed

Roberts, C H; Turino, C; Madrigal, J A; Marsh, S G E

2007-06-01

DNA enrichment by allele-specific hybridization (DEASH) was used as a means to isolate individual alleles of the killer cell immunoglobulin-like receptor (KIR2DL4) gene from heterozygous genomic DNA. Using long-template polymerase chain reaction (LT-PCR), the complete KIR2DL4 gene was amplified from a cell line that had previously been characterized for its KIR gene content by PCR using sequence-specific primers (PCR-SSP). The whole gene amplicons were sequenced and we identified two heterozygous positions in accordance with the predictions of the PCR-SSP. The amplicons were then hybridized to allele-specific, biotinylated oligonucleotide probes and through binding to streptavidin-coated beads, the targeted alleles were enriched. A second PCR amplified only the exonic regions of the enriched allele, and these were then sequenced in full. We show DEASH to be capable of enriching single alleles from a heterozygous PCR product, and through sequencing the enriched DNA, we are able to produce complete coding sequences of the KIR2DL4 alleles in accordance with the typing predicted by PCR-SSP.
Nucleic Acid Sandwich Hybridization Assay with Quantum Dot-Induced Fluorescence Resonance Energy Transfer for Pathogen Detection

PubMed Central

Chou, Cheng-Chung; Huang, Yi-Han

2012-01-01

This paper reports a nucleic acid sandwich hybridization assay with a quantum dot (QD)-induced fluorescence resonance energy transfer (FRET) reporter system. Two label-free hemagglutinin H5 sequences (60-mer DNA and 630-nt cDNA fragment) of avian influenza viruses were used as the targets in this work. Two oligonucleotides (16 mers and 18 mers) that specifically recognize two separate but neighboring regions of the H5 sequences served as the capturing and reporter probes, respectively. The capturing probe was conjugated to QD655 (donor) in a molar ratio of 10:1 (probe-to-QD), and the reporter probe was labeled with Alexa Fluor 660 dye (acceptor) during synthesis. The sandwich hybridization assay was done in a 20 μL transparent, adhesive frame-confined microchamber on a disposable, temperature-adjustable indium tin oxide (ITO) glass slide. The FRET signal in response to the sandwich hybridization was monitored by a homemade optical sensor comprising a single 400 nm UV light-emitting diode (LED), optical fibers, and a miniature 16-bit spectrophotometer. The target with a concentration ranging from 0.5 nM to 1 μM was successfully correlated with both QD emission decrease at 653 nm and dye emission increase at 690 nm. To sum up, this work is beneficial for developing a portable QD-based nucleic acid sensor for on-site pathogen detection. PMID:23211753
Spatial segregation of spawning habitat limits hybridization between sympatric native Steelhead and Coastal Cutthroat Trout

USGS Publications Warehouse

Buehrens, T.W.; Glasgow, J.; Ostberg, Carl O.; Quinn, T.P.

2013-01-01

Native Coastal Cutthroat Trout Oncorhynchus clarkii clarkii and Coastal Steelhead O. mykiss irideus hybridize naturally in watersheds of the Pacific Northwest yet maintain species integrity. Partial reproductive isolation due to differences in spawning habitat may limit hybridization between these species, but this process is poorly understood. We used a riverscape approach to determine the spatial distribution of spawning habitats used by native Coastal Cutthroat Trout and Steelhead as evidenced by the distribution of recently emerged fry. Molecular genetic markers were used to classify individuals as pure species or hybrids, and individuals were assigned to age-classes based on length. Fish and physical habitat data were collected in a spatially continuous framework to assess the relationship between habitat and watershed features and the spatial distribution of parental species and hybrids. Sampling occurred in 35 reaches from tidewaters to headwaters in a small (20 km²) coastal watershed in Washington State. Cutthroat, Steelhead, and hybrid trout accounted for 35%, 42%, and 23% of the fish collected, respectively. Strong segregation of spawning areas between Coastal Cutthroat Trout and Steelhead was evidenced by the distribution of age-0 trout. Cutthroat Trout were located farther upstream and in smaller tributaries than Steelhead were. The best predictor of species occurrence at a site was the drainage area of the watershed that contributed to the site. This area was positively correlated with the occurrence of age-0 Steelhead and negatively with the presence of Cutthroat Trout, whereas hybrids were found in areas occupied by both parental species. A similar pattern was observed in older juveniles of both species but overlap was greater, suggesting substantial dispersal of trout after emergence. Our results offer support for spatial reproductive segregation as a factor limiting hybridization between Steelhead and Coastal Cutthroat Trout.
Characterisation of IS153, an IS3-family insertion sequence isolated from Lactobacillus sanfranciscensis and its use for strain differentiation.

PubMed

Ehrmann, M A; Vogel, R E

2001-11-01

An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as a segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out-of-phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frame shifting between orfA and orfB that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but apart from their highly AT-rich character, no sequence specificity has been identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. The resulting hybridization patterns were shown to differentiate between organisms at the strain level, in contrast to a probe targeted against the 16S rDNA. With a PCR-based approach, IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides, L. farciminis, L. sakei, L. salivarius and L. reuteri, as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.
Optimization process planning using hybrid genetic algorithm and intelligent search for job shop machining.

PubMed

Salehi, Mojtaba; Bahreininejad, Ardeshir

2011-08-01

Optimization of process planning is considered as the key technology for computer-aided process planning which is a rather complex and difficult procedure. A good process plan of a part is built up based on two elements: (1) the optimized sequence of the operations of the part; and (2) the optimized selection of the machine, cutting tool and Tool Access Direction (TAD) for each operation. In the present work, the process planning is divided into preliminary planning, and secondary/detailed planning. In the preliminary stage, based on the analysis of order and clustering constraints as a compulsive constraint aggregation in operation sequencing and using an intelligent searching strategy, the feasible sequences are generated. Then, in the detailed planning stage, using the genetic algorithm which prunes the initial feasible sequences, the optimized operation sequence and the optimized selection of the machine, cutting tool and TAD for each operation based on optimization constraints as an additive constraint aggregation are obtained. The main contribution of this work is the optimization of sequence of the operations of the part, and optimization of machine selection, cutting tool and TAD for each operation using the intelligent search and genetic algorithm simultaneously.
Optimization process planning using hybrid genetic algorithm and intelligent search for job shop machining

PubMed Central

Salehi, Mojtaba

2010-01-01

Optimization of process planning is considered as the key technology for computer-aided process planning which is a rather complex and difficult procedure. A good process plan of a part is built up based on two elements: (1) the optimized sequence of the operations of the part; and (2) the optimized selection of the machine, cutting tool and Tool Access Direction (TAD) for each operation. In the present work, the process planning is divided into preliminary planning, and secondary/detailed planning. In the preliminary stage, based on the analysis of order and clustering constraints as a compulsive constraint aggregation in operation sequencing and using an intelligent searching strategy, the feasible sequences are generated. Then, in the detailed planning stage, using the genetic algorithm which prunes the initial feasible sequences, the optimized operation sequence and the optimized selection of the machine, cutting tool and TAD for each operation based on optimization constraints as an additive constraint aggregation are obtained. The main contribution of this work is the optimization of sequence of the operations of the part, and optimization of machine selection, cutting tool and TAD for each operation using the intelligent search and genetic algorithm simultaneously. PMID:21845020
Nonlinear Prediction Model for Hydrologic Time Series Based on Wavelet Decomposition

NASA Astrophysics Data System (ADS)

Kwon, H.; Khalil, A.; Brown, C.; Lall, U.; Ahn, H.; Moon, Y.

2005-12-01

Traditionally, forecasting and characterization of hydrologic systems are performed using many techniques. Stochastic linear methods such as AR and ARIMA and nonlinear ones such as statistical learning theory based tools have been extensively used. The difficulty common to all methods is the determination of sufficient and necessary information and predictors for a successful prediction. Relationships between hydrologic variables are often highly nonlinear and interrelated across the temporal scale. A new hybrid approach is proposed for the simulation of hydrologic time series, combining the wavelet transform with a nonlinear model. The present model employs some merits of the wavelet transform and of the nonlinear time series model. The wavelet transform is adopted to decompose a hydrologic nonlinear process into a set of mono-component signals, which are simulated by the nonlinear model. The hybrid methodology is formulated in a manner to improve the accuracy of long-term forecasting. The proposed hybrid model yields much better results in terms of capturing and reproducing the time-frequency properties of the system at hand. Prediction results are promising when compared to traditional univariate time series models. An application demonstrating the plausibility of the proposed methodology is provided, and the results indicate that a wavelet-based time series model can simulate and forecast hydrologic variables reasonably well. This will ultimately serve the purpose of integrated water resources planning and management.
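The decomposition step described in this abstract can be illustrated with one level of a discrete wavelet transform. The abstract names neither the wavelet nor the nonlinear model, so the sketch below assumes the Haar wavelet purely because it is the simplest case; the function names and the input series are hypothetical. In a hybrid scheme of the kind described, each component would be modeled separately and the component forecasts recombined through the inverse transform.

```python
import math


def haar_step(signal: list[float]) -> tuple[list[float], list[float]]:
    """One level of the orthonormal Haar transform (even-length input)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    # Pairwise scaled sums give the smooth (approximation) component.
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    # Pairwise scaled differences give the detail component.
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx: list[float], detail: list[float]) -> list[float]:
    """Rebuild the signal from its approximation and detail components."""
    s = math.sqrt(2.0)
    out: list[float] = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out


# Hypothetical series standing in for a hydrologic record.
series = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
approx, detail = haar_step(series)
rebuilt = haar_inverse(approx, detail)
print(all(abs(x - y) < 1e-9 for x, y in zip(series, rebuilt)))
```

Because the transform is invertible, nothing is lost by forecasting the components instead of the raw series.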
Power prediction in mobile communication systems using an optimal neural-network structure.

PubMed

Gao, X M; Gao, X Z; Tanskanen, J A; Ovaska, S J

1997-01-01

Presents a novel neural-network-based predictor for received power level prediction in direct sequence code division multiple access (DS/CDMA) systems. The predictor consists of an adaptive linear element (Adaline) followed by a multilayer perceptron (MLP). An important but difficult problem in designing such a cascade predictor is to determine the complexity of the networks. We solve this problem by using the predictive minimum description length (PMDL) principle to select the optimal numbers of input and hidden nodes. This approach results in a predictor with both good noise attenuation and excellent generalization capability. The optimized neural networks are used for predictive filtering of very noisy Rayleigh fading signals with 1.8 GHz carrier frequency. Our results show that the optimal neural predictor can provide smoothed in-phase and quadrature signals with signal-to-noise ratio (SNR) gains of about 12 and 7 dB at the urban mobile speeds of 5 and 50 km/h, respectively. The corresponding power signal SNR gains are about 11 and 5 dB. Therefore, the neural predictor is well suited for power control applications where “delayless” noise attenuation and efficient reduction of fast fading are required.
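The first stage of the cascade described in this abstract, the Adaline, is a linear combiner trained with the Widrow-Hoff (LMS) rule. The sketch below shows only that stage, used as a one-step-ahead predictor; the MLP stage and the PMDL-based selection of node counts are not reproduced. The tap count, step size and the noise-free sinusoidal test signal are assumptions chosen for illustration, not values from the paper, which worked with noisy Rayleigh fading signals.

```python
import math


def lms_predict(samples: list[float], taps: int = 4, step: float = 0.1) -> list[float]:
    """One-step-ahead linear prediction with LMS weight updates.

    Returns the prediction error for each predicted sample.
    """
    weights = [0.0] * taps
    errors = []
    for n in range(taps, len(samples)):
        window = samples[n - taps:n]
        prediction = sum(w * x for w, x in zip(weights, window))
        error = samples[n] - prediction
        # Widrow-Hoff (LMS) update: move the weights along error * input.
        weights = [w + step * error * x for w, x in zip(weights, window)]
        errors.append(error)
    return errors


# Synthetic, noise-free test signal (an assumption for illustration).
signal = [math.sin(0.5 * n) for n in range(400)]
errors = lms_predict(signal)

# Mean squared prediction error at the start and at the end of the run.
early = sum(e * e for e in errors[:20]) / 20
late = sum(e * e for e in errors[-20:]) / 20
print(late < early)
```

As the weights adapt, the squared prediction error on this signal shrinks; on a real fading signal the residual error would be limited by noise.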
Microwell hybridization assay for detection of PCR products from Mycobacterium tuberculosis complex and the recombinant Mycobacterium smegmatis strain 1008 used as an internal control.

PubMed Central

Kox, L F; Noordhoek, G T; Kunakorn, M; Mulder, S; Sterrenburg, M; Kolk, A H

1996-01-01

A microwell hybridization assay was developed for the detection of the PCR products from both Mycobacterium tuberculosis complex bacteria and the recombinant Mycobacterium smegmatis strain 1008 that is used as an internal control to monitor inhibition in the PCR based on the M. tuberculosis complex-specific insertion sequence IS6110. The test is based on specific detection with digoxigenin-labeled oligonucleotide probes of biotinylated PCR products which are captured in a microtiter plate coated with streptavidin. The captured PCR products are hybridized separately with two probes, one specific for the PCR product from IS6110 from M. tuberculosis complex and the other specific for the PCR fragment from the modified IS6110 fragment from the recombinant M. smegmatis 1008. The microwell hybridization assay discriminates perfectly between the two types of amplicon. The amount of PCR product that can be detected by this assay is 10 times less than that which can be detected by agarose gel electrophoresis. The test can be performed in 2 h. It is much faster and less laborious than Southern blot hybridization. Furthermore, the interpretation of results is objective. The assay was used with 172 clinical samples in a routine microbiology laboratory, and the results were in complete agreement with those of agarose gel electrophoresis and Southern blot hybridization. PMID:8862568
Prediction of the optimum hybridization conditions of dot-blot-SNP analysis using estimated melting temperature of oligonucleotide probes.

PubMed

Shiokai, Sachiko; Kitashiba, Hiroyasu; Nishio, Takeshi

2010-08-01

Although the dot-blot-SNP technique is a simple cost-saving technique suitable for genotyping of many plant individuals, optimization of hybridization and washing conditions for each SNP marker requires much time and labor. For prediction of the optimum hybridization conditions for each probe, we compared Tm values estimated from nucleotide sequences using the DINAMelt web server, measured Tm values, and hybridization conditions yielding allele-specific signals. The estimated Tm values were comparable to the measured Tm values with small differences of less than 3 °C for most of the probes. There were differences of approximately 14 °C between the specific signal detection conditions and estimated Tm values. Change of one level of SSC concentrations of 0.1, 0.2, 0.5, and 1.0× SSC corresponded to a difference of approximately 5 °C in optimum signal detection temperature. Increasing the sensitivity of signal detection by shortening the exposure time to X-ray film changed the optimum hybridization condition for specific signal detection. Addition of competitive oligonucleotides to the hybridization mixture increased the suitable hybridization conditions by 1.8. Based on these results, optimum hybridization conditions for newly produced dot-blot-SNP markers will become predictable.
Generation of Leishmania Hybrids by Whole Genomic DNA Transformation

PubMed Central

Coelho, Adriano C.; Leprohon, Philippe; Ouellette, Marc

2012-01-01

Genetic exchange is a powerful tool to study gene function in microorganisms. Here, we tested the feasibility of generating Leishmania hybrids by electroporating genomic DNA of donor cells into recipient Leishmania parasites. The donor DNA was marked with a drug resistance marker facilitating the selection of DNA transfer into the recipient cells. The transferred DNA was integrated exclusively at homologous locus and was as large as 45 kb. The independent generation of L. infantum hybrids with L. major sequences was possible for several chromosomal regions. Interfering with the mismatch repair machinery by inactivating the MSH2 gene enabled an increased efficiency of recombination between divergent sequences, hence favouring the selection of hybrids between species. Hybrids were shown to acquire the phenotype derived from the donor cells, as demonstrated for the transfer of drug resistance genes from L. major into L. infantum. The described method is a first step allowing the generation of in vitro hybrids for testing gene functions in a natural genomic context in the parasite Leishmania. PMID:23029579
Chromosomal Distribution of Endogenous Jaagsiekte Sheep Retrovirus Proviral Sequences in the Sheep Genome

PubMed Central

Carlson, Jonathan; Lyon, Monique; Bishop, Jeanette; Vaiman, Anne; Cribiu, Edmond; Mornex, Jean-François; Brown, Susan; Knudson, Dennis; DeMartini, James; Leroux, Caroline

2003-01-01

A family of endogenous retroviruses (enJSRV) closely related to Jaagsiekte sheep retrovirus (JSRV) is ubiquitous in domestic and wild sheep and goats. Southern blot hybridization studies indicate that there is little active replication or movement of the enJSRV proviruses in these species. Two approaches were used to investigate the distribution of proviral loci in the sheep genome. Fluorescence in situ hybridization (FISH) to metaphase chromosome spreads using viral DNA probes was used to detect loci on chromosomes. Hybridization signals were reproducibly detected on seven sheep chromosomes and eight goat chromosomes in seven cell lines. In addition, a panel of 30 sheep-hamster hybrid cell lines, each of which carries one or more sheep chromosomes and which collectively contain the whole sheep genome, was examined for enJSRV sequences. DNA from each of the lines was used as a template for PCR with JSRV gag-specific primers. A PCR product was amplified from 27 of the hybrid lines, indicating that JSRV gag sequences are found on at least 15 of the 28 sheep chromosomes, including those identified by FISH. Thus, enJSRV proviruses are essentially randomly distributed among the chromosomes of sheep and goats. FISH and/or Southern blot hybridization on DNA from several of the sheep-hamster hybrid cell lines suggests that loci containing multiple copies of enJSRV are present on chromosomes 6 and 9. The origin and functional significance of these arrays is not known. PMID:12915578
Genetic Evidence of Hybridization between the Endangered Native Species Iguana delicatissima and the Invasive Iguana iguana (Reptilia, Iguanidae) in the Lesser Antilles: Management Implications.

PubMed

Vuillaume, Barbara; Valette, Victorien; Lepais, Olivier; Grandjean, Frédéric; Breuil, Michel

2015-01-01

The worldwide increase of hybridization in different groups is thought to have become more important with the loss of isolating barriers and the introduction of invasive species. This phenomenon could result in the extinction of endemic species. This study aims at investigating the hybridization dynamics between the endemic and threatened Lesser Antillean iguana (Iguana delicatissima) and the invasive common green iguana (Iguana iguana) in the Lesser Antilles, as well as assessing the impact of interspecific hybridization on the decline of I. delicatissima. A total of 59 I. delicatissima (5 localities), 47 I. iguana (12 localities) and 27 hybrids (5 localities), all identified based on morphological characters, were genotyped at 15 microsatellite markers. We also sequenced hybrids using ND4 mitochondrial loci to further investigate mitochondrial introgression. The genetic clustering of species and hybrid genetic assignment were performed using a comparative approach, through the implementation of a Discriminant Analysis of Principal Components (DAPC) based on statistics, as well as genetic clustering approaches based on the genetic models of several populations (Structure, NewHybrids and HIest), in order to get full characterization of hybridization patterns and introgression dynamics across the islands. The iguanas identified as hybrids in the wild, thanks to morphological analysis, were all genetically F1, F2, or backcrosses. A high proportion of individuals were also the result of a longer-term admixture. The absence of reproductive barriers between species leads to hybridization when species are in contact. Yet morphological and behavioral differences between species could explain why male I. iguana may dominate I. delicatissima, thus resulting in short-term species displacement and extinction by hybridization and recurrent introgression from I. iguana toward I. delicatissima. As a consequence, I. delicatissima gets eliminated through introgression, as observed in recent population history over several islands. These results have profound implications for species management of the endangered I. delicatissima and practical conservation recommendations are being discussed in the light of these findings.
Genetic Evidence of Hybridization between the Endangered Native Species Iguana delicatissima and the Invasive Iguana iguana (Reptilia, Iguanidae) in the Lesser Antilles: Management Implications

PubMed Central

Vuillaume, Barbara; Valette, Victorien; Lepais, Olivier; Grandjean, Frédéric; Breuil, Michel

2015-01-01

The worldwide increase of hybridization in different groups is thought to have become more important with the loss of isolating barriers and the introduction of invasive species. This phenomenon could result in the extinction of endemic species. This study aims at investigating the hybridization dynamics between the endemic and threatened Lesser Antillean iguana (Iguana delicatissima) and the invasive common green iguana (Iguana iguana) in the Lesser Antilles, as well as assessing the impact of interspecific hybridization on the decline of I. delicatissima. A total of 59 I. delicatissima (5 localities), 47 I. iguana (12 localities) and 27 hybrids (5 localities), all identified based on morphological characters, were genotyped at 15 microsatellite markers. We also sequenced hybrids using ND4 mitochondrial loci to further investigate mitochondrial introgression. The genetic clustering of species and hybrid genetic assignment were performed using a comparative approach, through the implementation of a Discriminant Analysis of Principal Components (DAPC) based on statistics, as well as genetic clustering approaches based on the genetic models of several populations (Structure, NewHybrids and HIest), in order to get full characterization of hybridization patterns and introgression dynamics across the islands. The iguanas identified as hybrids in the wild, thanks to morphological analysis, were all genetically F1, F2, or backcrosses. A high proportion of individuals were also the result of a longer-term admixture. The absence of reproductive barriers between species leads to hybridization when species are in contact. Yet morphological and behavioral differences between species could explain why male I. iguana may dominate I. delicatissima, thus resulting in short-term species displacement and extinction by hybridization and recurrent introgression from I. iguana toward I. delicatissima. As a consequence, I. delicatissima gets eliminated through introgression, as observed in recent population history over several islands. These results have profound implications for species management of the endangered I. delicatissima and practical conservation recommendations are being discussed in the light of these findings. PMID:26046351
The Evolution of Polymorphic Hybrid Incompatibilities in House Mice.

PubMed

Larson, Erica L; Vanderpool, Dan; Sarver, Brice A J; Callahan, Colin; Keeble, Sara; Provencio, Lorraine P; Kessler, Michael D; Stewart, Vanessa; Nordquist, Erin; Dean, Matthew D; Good, Jeffrey M

2018-04-24

Resolving the mechanistic and genetic bases of reproductive barriers between species is essential to understanding the evolutionary forces that shape speciation. Intrinsic hybrid incompatibilities are often treated as fixed between species, yet there can be considerable variation in the strength of reproductive isolation between populations. The extent and causes of this variation remain poorly understood in most systems. We investigated the genetic basis of variable hybrid male sterility (HMS) between two recently diverged subspecies of house mice, Mus musculus domesticus and M. m. musculus. We found that polymorphic HMS has a surprisingly complex genetic basis, with contributions from at least five autosomal loci segregating between two closely related wild-derived strains of M. m. musculus. One of the HMS-linked regions on Chromosome 4 also showed extensive introgression among inbred laboratory strains and transmission ratio distortion (TRD) in hybrid crosses. Using additional crosses and whole genome sequencing of sperm pools, we showed that TRD was limited to hybrid crosses and was not due to differences in sperm motility between M. m. musculus strains. Based on these results, we argue that TRD likely reflects additional incompatibilities that reduce hybrid embryonic viability. In some common inbred strains of mice, selection against deleterious interactions appears to have unexpectedly driven introgression at loci involved in epistatic hybrid incompatibilities. The highly variable genetic basis to F1 hybrid incompatibilities between closely related mouse lineages argues that a thorough dissection of reproductive isolation will require much more extensive sampling of natural variation than has been commonly utilized in mice and other model systems. Copyright © 2018, Genetics.
High-throughput analysis of the protein sequence-stability landscape using a quantitative "yeast surface two-hybrid" system and fragment reconstitution

PubMed Central

Dutta, Sanjib; Koide, Akiko; Koide, Shohei

2008-01-01

Stability evaluation of many mutants can lead to a better understanding of the sequence determinants of a structural motif and of factors governing protein stability and protein evolution. The traditional biophysical analysis of protein stability is low throughput, limiting our ability to widely explore the sequence space in a quantitative manner. In this study, we have developed a high-throughput library screening method for quantifying stability changes, which is based on protein fragment reconstitution and yeast surface display. Our method exploits the thermodynamic linkage between protein stability and fragment reconstitution and the ability of the yeast surface display technique to quantitatively evaluate protein-protein interactions. The method was applied to a fibronectin type III (FN3) domain. Characterization of fragment reconstitution was facilitated by the co-expression of two FN3 fragments, thus establishing a "yeast surface two-hybrid" method. Importantly, our method does not rely on competition between clones and thus eliminates a common limitation of high-throughput selection methods in which the most stable variants are predominantly recovered. Thus, it allows for the isolation of sequences that exhibit a desired level of stability. We identified over one hundred unique sequences for a β-bulge motif, which was significantly more informative than natural sequences of the FN3 family in revealing the sequence determinants for the β-bulge. Our method provides a powerful means to rapidly assess stability of many variants, to systematically assess contribution of different factors to protein stability and to enhance protein stability. PMID:18674545
Impact of point-mutations on the hybridization affinity of surface-bound DNA/DNA and RNA/DNA oligonucleotide-duplexes: Comparison of single base mismatches and base bulges

PubMed Central

Naiser, Thomas; Ehler, Oliver; Kayser, Jona; Mai, Timo; Michel, Wolfgang; Ott, Albrecht

2008-01-01

Background The high binding specificity of short 10 to 30 mer oligonucleotide probes enables single base mismatch (MM) discrimination and thus provides the basis for genotyping and resequencing microarray applications. Recent experiments indicate that the underlying principles governing DNA microarray hybridization – and in particular MM discrimination – are not completely understood. Microarrays usually address complex mixtures of DNA targets. In order to reduce the level of complexity and to study the problem of surface-based hybridization with point defects in more detail, we performed array based hybridization experiments in well controlled and simple situations. Results We performed microarray hybridization experiments with short 16 to 40 mer target and probe lengths (in situations without competitive hybridization) in order to systematically investigate the impact of point-mutations – varying defect type and position – on the oligonucleotide duplex binding affinity. The influence of single base bulges and single base MMs depends predominantly on position – it is largest in the middle of the strand. The position-dependent influence of base bulges is very similar to that of single base MMs; however, certain bulges give rise to an unexpectedly high binding affinity. Besides the defect (MM or bulge) type, which is the second contribution in importance to hybridization affinity, there is also a sequence dependence, which extends beyond the defect next-neighbor and which is difficult to quantify. Direct comparison between binding affinities of DNA/DNA and RNA/DNA duplexes shows that RNA/DNA purine-purine MMs are more discriminating than corresponding DNA/DNA MMs. In DNA/DNA MM discrimination the affected base pair (C·G vs. A·T) is the pertinent parameter. We attribute these differences to the different structures of the duplexes (A vs. B form).
Conclusion We have shown that DNA microarrays can resolve even subtle changes in hybridization affinity for simple target mixtures. We have further shown that the impact of point defects on oligonucleotide stability can be broken down to a hierarchy of effects. In order to explain our observations we propose DNA molecular dynamics – in the form of zipping of the oligonucleotide duplex – to play an important role. PMID:18477387
IN SITU DEMONSTRATION OF DNA HYBRIDIZING WITH CHROMOSOMAL AND NUCLEAR SAP RNA IN CHIRONOMUS TENTANS

PubMed Central

Lambert, B.; Wieslander, L.; Daneholt, B.; Egyházi, E.; Ringborg, U.

1972-01-01

Cytological hybridization combined with microdissection of Chironomus tentans salivary gland cells was used to locate DNA complementary to newly synthesized RNA from chromosomes and nuclear sap and from a single chromosomal puff, the Balbiani ring 2 (BR 2). Salivary glands were incubated with tritiated nucleosides. The labeled RNA was extracted from microdissected nuclei and hybridized to denatured squash preparations of salivary gland cells under conditions which primarily allow repeated sequences to interact. The bound RNA, resistant to ribonuclease treatment, was detected radioautographically. It was found that BR 2 RNA hybridizes specifically with the BR 2 region of chromosome IV. Nuclear sap RNA was fractionated into high and low molecular-weight RNA; the former hybridizes with the BR 2 region of chromosome IV, the latter in a diffuse distribution over the whole chromosome set. RNA from chromosome I hybridizes diffusely with all chromosomes. Nucleolar RNA hybridizes specifically with the nucleolar organizers, contained in chromosomes II and III. It is concluded that the BR 2 region of chromosome IV contains repeated DNA sequences and that nuclear sap contains BR 2 RNA. PMID:5025107
Cytogenetic and molecular markers for detecting Aegilops uniaristata chromosomes in a wheat background.

PubMed

Gong, Wenping; Li, Guangrong; Zhou, Jianping; Li, Genying; Liu, Cheng; Huang, Chengyan; Zhao, Zhendong; Yang, Zujun

2014-09-01

Aegilops uniaristata has many agronomically useful traits that can be used for wheat breeding. So far, a Triticum turgidum - Ae. uniaristata amphiploid and one set of Chinese Spring (CS) - Ae. uniaristata addition lines have been produced. To guide Ae. uniaristata chromatin transformation from these lines into cultivated wheat through chromosome engineering, reliable cytogenetic and molecular markers specific for Ae. uniaristata chromosomes need to be developed. Standard C-banding shows that C-bands mainly exist in the centromeric regions of Ae. uniaristata but rarely at the distal ends. Fluorescence in situ hybridization (FISH) using (GAA)8 as a probe showed that the hybridization signal of chromosomes 1N-7N are different, thus (GAA)8 can be used to identify all Ae. uniaristata chromosomes in wheat background simultaneously. Moreover, a total of 42 molecular markers specific for Ae. uniaristata chromosomes were developed by screening expressed sequence tag - sequence tagged site (EST-STS), expressed sequence tag - simple sequence repeat (EST-SSR), and PCR-based landmark unique gene (PLUG) primers. The markers were subsequently localized using the CS - Ae. uniaristata addition lines and different wheat cultivars as controls. The cytogenetic and molecular markers developed herein will be helpful for screening and identifying wheat - Ae. uniaristata progeny.
DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, Karin D.; Chu, Tun-Jen; Pitt, William G.

1992-01-01

A method for the multiplex sequencing of DNA is disclosed which comprises the electroblotting of specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferred and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through said amino groups contained on the surface thereof. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to said target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membranes may be reprobed numerous times.

DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, K.D.; Chu, T.J.; Pitt, W.G.

1992-05-12

A method for the multiplex sequencing of DNA is disclosed which comprises the electroblotting of specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferred and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through amino groups contained on the surface. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to the target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membranes may be reprobed numerous times.
Clustered regularly interspaced short palindromic repeats (CRISPRs) analysis of members of the Mycobacterium tuberculosis complex.

PubMed

Botelho, Ana; Canto, Ana; Leão, Célia; Cunha, Mónica V

2015-01-01

Typical CRISPR (clustered, regularly interspaced, short palindromic repeat) regions are constituted by short direct repeats (DRs), interspersed with similarly sized non-repetitive spacers, derived from transmissible genetic elements, acquired when the cell is challenged with foreign DNA. The analysis of the structure, in number and nature, of CRISPR spacers is a valuable tool for molecular typing since these loci are polymorphic among strains, giving rise to characteristic signatures. The existence of CRISPR structures in the genome of the members of Mycobacterium tuberculosis complex (MTBC) enabled the development of a genotyping method, based on the analysis of the presence or absence of 43 oligonucleotide spacers separated by conserved DRs. This method, called spoligotyping, consists of PCR amplification of the DR chromosomal region and recognition after hybridization of the spacers that are present. The workflow beneath this methodology implies that the PCR products are brought onto a membrane containing synthetic oligonucleotides that have complementary sequences to the spacer sequences. Lack of hybridization of the PCR products to a specific oligonucleotide sequence indicates absence of the corresponding spacer sequence in the examined strain. Spoligotyping gained great notoriety as a robust identification and typing tool for members of MTBC, enabling multiple epidemiological studies on human and animal tuberculosis.
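The presence/absence readout described in this abstract can be illustrated with a minimal Python sketch. It assumes each of the 43 spacers yields one hybridization signal value and that a single cutoff separates "present" from "absent"; the function names, the signal scale, and the threshold are hypothetical and are not taken from the record.

```python
N_SPACERS = 43  # number of spacers probed, as stated in the abstract


def spoligo_pattern(signals, threshold):
    """Convert 43 hybridization signals into a presence/absence string.

    '1' marks a spacer whose signal reaches the (hypothetical) threshold,
    '0' marks a spacer with no detectable hybridization.
    """
    if len(signals) != N_SPACERS:
        raise ValueError(f"expected {N_SPACERS} signals, got {len(signals)}")
    return "".join("1" if s >= threshold else "0" for s in signals)


def differing_spacers(pattern_a, pattern_b):
    """Return the 1-based spacer positions at which two patterns differ."""
    return [i + 1 for i, (a, b) in enumerate(zip(pattern_a, pattern_b)) if a != b]


if __name__ == "__main__":
    # Made-up signals: every spacer hybridizes except spacer 3.
    signals = [1.0] * N_SPACERS
    signals[2] = 0.0
    pattern = spoligo_pattern(signals, threshold=0.5)
    print(pattern)
    print(differing_spacers(pattern, "1" * N_SPACERS))
```

Comparing two such strings position by position is the simplest way to see which spacers distinguish two strains.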
A novel self-powered and sensitive label-free DNA biosensor in microbial fuel cell.

PubMed

Asghary, Maryam; Raoof, Jahan Bakhsh; Rahimnejad, Mostafa; Ojani, Reza

2016-08-15

In this work, a novel self-powered, sensitive, low-cost, and label-free DNA biosensor is reported by applying a two-chambered microbial fuel cell (MFC) as a power supply. A graphite electrode and an Au nanoparticles modified graphite electrode (AuNP/graphite electrode) were used as anode and cathode in the MFC system, respectively. The active biocatalyst in the anodic chamber was a mixed culture of microorganisms. The sensing element of the biosensor was fabricated by the well-known Au-thiol binding the ssDNA probe on the surface of an AuNP/graphite cathode. Electrons produced by microorganisms were transported from the anode to the cathode through an external circuit, which could be detected by the terminal multi-meter detector. The difference between power densities of the ssDNA probe modified cathode in the absence and presence of complementary sequence served as the detection signal of the DNA hybridization, with a detection limit of 3.1 nM. Thereafter, this biosensor was employed for diagnosis and determination of complementary sequence in a human serum sample. The hybridization specificity studies further revealed that the developed DNA biosensor could distinguish fully complementary sequences from one-base mismatched and non-complementary sequences. Copyright © 2016 Elsevier B.V. All rights reserved.
A Support Vector Machine based method to distinguish proteobacterial proteins from eukaryotic plant proteins

PubMed Central

2012-01-01

Background Members of the phylum Proteobacteria are most prominent among bacteria causing plant diseases that result in a diminution of the quantity and quality of food produced by agriculture. To ameliorate these losses, there is a need to identify infections in early stages. Recent developments in next generation nucleic acid sequencing and mass spectrometry open the door to screening plants by the sequences of their macromolecules. Such an approach requires the ability to recognize the organismal origin of unknown DNA or peptide fragments. There are many ways to approach this problem but none have emerged as the best protocol. Here we attempt a systematic way to determine organismal origins of peptides by using a machine learning algorithm. The algorithm that we implement is a Support Vector Machine (SVM). Result The amino acid compositions of proteobacterial proteins were found to be different from those of plant proteins. We developed an SVM model based on amino acid and dipeptide compositions to distinguish between a proteobacterial protein and a plant protein. The amino acid composition (AAC) based SVM model had an accuracy of 92.44% with 0.85 Matthews correlation coefficient (MCC) while the dipeptide composition (DC) based SVM model had a maximum accuracy of 94.67% and 0.89 MCC. We also developed SVM models based on a hybrid approach (AAC and DC), which gave a maximum accuracy 94.86% and a 0.90 MCC. The models were tested on unseen or untrained datasets to assess their validity. Conclusion The results indicate that the SVM based on the AAC and DC hybrid approach can be used to distinguish proteobacterial from plant protein sequences. PMID:23046503
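The feature sets named in this abstract, amino acid composition (AAC) and dipeptide composition (DC), and the reported Matthews correlation coefficient can be sketched in Python as follows. This is a minimal illustration under the usual definitions (20 residue fractions, 400 ordered-pair fractions, and the standard MCC formula); the function names are hypothetical, and the authors' actual preprocessing and SVM settings are not described in the record.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def aac(seq):
    """Amino acid composition: fraction of each of the 20 residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def dc(seq):
    """Dipeptide composition: fraction of each of the 400 ordered pairs."""
    seq = seq.upper()
    if len(seq) < 2:
        raise ValueError("need at least two residues")
    counts = {}
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        counts[pair] = counts.get(pair, 0) + 1
    total = len(seq) - 1
    return [counts.get(a + b, 0) / total for a, b in product(AMINO_ACIDS, repeat=2)]


def hybrid_features(seq):
    """Concatenate AAC and DC into one 420-dimensional vector."""
    return aac(seq) + dc(seq)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


if __name__ == "__main__":
    vec = hybrid_features("MKTAYIAKQR")  # arbitrary example peptide
    print(len(vec))
```

A vector of this kind could then be passed to any SVM implementation; the concatenation is one plausible reading of "hybrid approach (AAC and DC)", not a confirmed detail.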
HiRel: Hybrid Automated Reliability Predictor (HARP) integrated reliability tool system, (version 7.0). Volume 1: HARP introduction and user's guide

NASA Technical Reports Server (NTRS)

Bavuso, Salvatore J.; Rothmann, Elizabeth; Dugan, Joanne Bechta; Trivedi, Kishor S.; Mittal, Nitin; Boyd, Mark A.; Geist, Robert M.; Smotherman, Mark D.

1994-01-01

The Hybrid Automated Reliability Predictor (HARP) integrated Reliability (HiRel) tool system for reliability/availability prediction offers a toolbox of integrated reliability/availability programs that can be used to customize the user's application in a workstation or nonworkstation environment. HiRel consists of interactive graphical input/output programs and four reliability/availability modeling engines that provide analytical and simulative solutions to a wide host of reliable fault-tolerant system architectures and is also applicable to electronic systems in general. The tool system was designed to be compatible with most computing platforms and operating systems, and some programs have been beta tested within the aerospace community for over 8 years. Volume 1 provides an introduction to the HARP program. Comprehensive information on HARP mathematical models can be found in the references.
Genome-wide ancestry and divergence patterns from low-coverage sequencing data reveal a complex history of admixture in wild baboons

PubMed Central

Wall, Jeffrey D; Schlebusch, Stephen A; Alberts, Susan C; Cox, Laura A; Snyder-Mackler, Noah; Nevonen, Kimberly; Carbone, Lucia; Tung, Jenny

2017-01-01

Naturally occurring admixture has now been documented in every major primate lineage, suggesting its key role in primate evolutionary history. Active primate hybrid zones can provide valuable insight into this process. Here, we investigate the history of admixture in one of the best-studied natural primate hybrid zones, between yellow baboons (Papio cynocephalus) and anubis baboons (Papio anubis) in the Amboseli ecosystem of Kenya. We generated a new genome assembly for yellow baboon and low coverage genome-wide resequencing data from yellow baboons, anubis baboons, and known hybrids (n=44). Using a novel composite likelihood method for estimating local ancestry from low coverage data, we found high levels of genetic diversity and genetic differentiation between the parent taxa, and excellent agreement between genome-scale ancestry estimates and a priori pedigree, life history, and morphology-based estimates (r² = 0.899). However, even putatively unadmixed Amboseli yellow individuals carried a substantial proportion of anubis ancestry, presumably due to historical admixture. Further, the distribution of shared versus fixed differences between a putatively unadmixed Amboseli yellow baboon and an unadmixed anubis baboon, both sequenced at high coverage, is inconsistent with simple isolation-migration or equilibrium migration models. Our findings suggest a complex process of intermittent contact that has occurred multiple times in baboon evolutionary history, despite no obvious fitness costs to hybrids or major geographic or behavioral barriers. In combination with the extensive phenotypic data available for baboon hybrids, our results provide valuable context for understanding the history of admixture in primates, including in our own lineage. PMID:27145036
Modeling Hybridization Kinetics of Gene Probes in a DNA Biochip Using FEMLAB

PubMed Central

Munir, Ahsan; Waseem, Hassan; Williams, Maggie R.; Stedtfeld, Robert D.; Gulari, Erdogan; Tiedje, James M.; Hashsham, Syed A.

2017-01-01

Microfluidic DNA biochips capable of detecting specific DNA sequences are useful in medical diagnostics, drug discovery, food safety monitoring and agriculture. They are used as miniaturized platforms for analysis of nucleic acids-based biomarkers. Binding kinetics between immobilized single stranded DNA on the surface and its complementary strand present in the sample are of interest. To achieve optimal sensitivity with minimum sample size and rapid hybridization, ability to predict the kinetics of hybridization based on the thermodynamic characteristics of the probe is crucial. In this study, a computer aided numerical model for the design and optimization of a flow-through biochip was developed using a finite element technique packaged software tool (FEMLAB; package included in COMSOL Multiphysics) to simulate the transport of DNA through a microfluidic chamber to the reaction surface. The model accounts for fluid flow, convection and diffusion in the channel and on the reaction surface. Concentration, association rate constant, dissociation rate constant, recirculation flow rate, and temperature were key parameters affecting the rate of hybridization. The model predicted the kinetic profile and signal intensities of eighteen 20-mer probes targeting vancomycin resistance genes (VRGs). Predicted signal intensities and hybridization kinetics strongly correlated with experimental data in the biochip (R² = 0.8131). PMID:28555058
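The role of concentration and the association and dissociation rate constants can be illustrated with a much simpler model than the finite-element simulation described here. The Python sketch below assumes well-mixed, first-order Langmuir kinetics at the surface, dθ/dt = k_a·C·(1 − θ) − k_d·θ, and ignores flow, convection and diffusion entirely; it is a swapped-in textbook model, not the authors' FEMLAB model, and all parameter values are hypothetical.

```python
import math


def equilibrium_coverage(conc, k_a, k_d):
    """Fraction of probes hybridized at equilibrium (Langmuir isotherm)."""
    return k_a * conc / (k_a * conc + k_d)


def langmuir_coverage(t, conc, k_a, k_d):
    """Closed-form solution of d(theta)/dt = k_a*C*(1 - theta) - k_d*theta
    with theta(0) = 0 and constant bulk concentration C.

    t    : time in seconds
    conc : target concentration in mol/L
    k_a  : association rate constant in L/(mol*s)
    k_d  : dissociation rate constant in 1/s
    """
    rate = k_a * conc + k_d  # observed rate constant in 1/s
    return equilibrium_coverage(conc, k_a, k_d) * (1.0 - math.exp(-rate * t))


if __name__ == "__main__":
    # Hypothetical values chosen only to show the shape of the curve.
    k_a, k_d, conc = 1e5, 1e-4, 1e-8
    for t in (0, 600, 1800, 3600):
        print(t, round(langmuir_coverage(t, conc, k_a, k_d), 4))
```

In this simplified picture, raising the concentration or k_a speeds up the approach to equilibrium and raises the plateau, while a larger k_d lowers the plateau; transport limitations, which the abstract's model does account for, would slow the curve further.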
Modeling Hybridization Kinetics of Gene Probes in a DNA Biochip Using FEMLAB.

PubMed

Munir, Ahsan; Waseem, Hassan; Williams, Maggie R; Stedtfeld, Robert D; Gulari, Erdogan; Tiedje, James M; Hashsham, Syed A

2017-05-29

Microfluidic DNA biochips capable of detecting specific DNA sequences are useful in medical diagnostics, drug discovery, food safety monitoring and agriculture. They are used as miniaturized platforms for analysis of nucleic acids-based biomarkers. Binding kinetics between immobilized single stranded DNA on the surface and its complementary strand present in the sample are of interest. To achieve optimal sensitivity with minimum sample size and rapid hybridization, ability to predict the kinetics of hybridization based on the thermodynamic characteristics of the probe is crucial. In this study, a computer aided numerical model for the design and optimization of a flow-through biochip was developed using a finite element technique packaged software tool (FEMLAB; package included in COMSOL Multiphysics) to simulate the transport of DNA through a microfluidic chamber to the reaction surface. The model accounts for fluid flow, convection and diffusion in the channel and on the reaction surface. Concentration, association rate constant, dissociation rate constant, recirculation flow rate, and temperature were key parameters affecting the rate of hybridization. The model predicted the kinetic profile and signal intensities of eighteen 20-mer probes targeting vancomycin resistance genes (VRGs). Predicted signal intensities and hybridization kinetics strongly correlated with experimental data in the biochip (R² = 0.8131).
Lager Yeast Comes of Age

PubMed Central

2014-01-01

Alcoholic fermentations have accompanied human civilizations throughout our history. Lager yeasts have a several-century-long tradition of providing fresh beer with clean taste. The yeast strains used for lager beer fermentation have long been recognized as hybrids between two Saccharomyces species. We summarize the initial findings on this hybrid nature, the genomics/transcriptomics of lager yeasts, and established targets of strain improvements. Next-generation sequencing has provided fast access to yeast genomes. Its use in population genomics has uncovered many more hybridization events within Saccharomyces species, so that lager yeast hybrids are no longer the exception from the rule. These findings have led us to propose network evolution within Saccharomyces species. This “web of life” recognizes the ability of closely related species to exchange DNA and thus drain from a combined gene pool rather than be limited to a gene pool restricted by speciation. Within the domesticated lager yeasts, two groups, the Saaz and Frohberg groups, can be distinguished based on fermentation characteristics. Recent evidence suggests that these groups share an evolutionary history. We thus propose to refer to the Saaz group as Saccharomyces carlsbergensis and to the Frohberg group as Saccharomyces pastorianus based on their distinct genomes. New insight into the hybrid nature of lager yeast will provide novel directions for future strain improvement. PMID:25084862
HDOCK: a web server for protein–protein and protein–DNA/RNA docking based on a hybrid strategy

PubMed Central

Yan, Yumeng; Zhang, Di; Zhou, Pei; Li, Botong

2017-01-01

Protein–protein and protein–DNA/RNA interactions play a fundamental role in a variety of biological processes. Determining the complex structures of these interactions is valuable, in which molecular docking has played an important role. To automatically make use of the binding information from the PDB in docking, here we have presented HDOCK, a novel web server of our hybrid docking algorithm of template-based modeling and free docking, in which cases with misleading templates can be rescued by the free docking protocol. The server supports protein–protein and protein–DNA/RNA docking and accepts both sequence and structure inputs for proteins. The docking process is fast and consumes about 10–20 min for a docking run. Tested on the cases with weakly homologous complexes of <30% sequence identity from five docking benchmarks, the HDOCK pipeline tied with template-based modeling on the protein–protein and protein–DNA benchmarks and performed better than template-based modeling on the three protein–RNA benchmarks when the top 10 predictions were considered. The performance of HDOCK became better when more predictions were considered. Combining the results of HDOCK and template-based modeling by ranking first of the template-based model further improved the predictive power of the server. The HDOCK web server is available at http://hdock.phys.hust.edu.cn/. PMID:28521030
Rapid hybrid de novo assembly of a microbial genome using only short reads: Corynebacterium pseudotuberculosis I19 as a case study.

PubMed

Cerdeira, Louise Teixeira; Carneiro, Adriana Ribeiro; Ramos, Rommel Thiago Jucá; de Almeida, Sintia Silva; D'Afonseca, Vivian; Schneider, Maria Paula Cruz; Baumbach, Jan; Tauch, Andreas; McCulloch, John Anthony; Azevedo, Vasco Ariston Carvalho; Silva, Artur

2011-08-01

Due to the advent of the so-called Next-Generation Sequencing (NGS) technologies the amount of monetary and temporal resources for whole-genome sequencing has been reduced by several orders of magnitude. Sequence reads can be assembled either by anchoring them directly onto an available reference genome (classical reference assembly), or can be concatenated by overlap (de novo assembly). The latter strategy is preferable because it tends to maintain the architecture of the genome sequence; however, depending on the NGS platform used, the shortness of read lengths causes tremendous problems in the subsequent genome assembly phase, impeding closing of the entire genome sequence. To address the problem, we developed a multi-pronged hybrid de novo strategy combining De Bruijn graph and Overlap-Layout-Consensus methods, which was used to assemble from short reads the entire genome of Corynebacterium pseudotuberculosis strain I19, a bacterium with immense importance in veterinary medicine that causes Caseous Lymphadenitis in ruminants, principally ovines and caprines. Briefly, contigs were assembled de novo from the short reads and were only oriented using a reference genome by anchoring. Remaining gaps were closed using iterative anchoring of short reads by craning to gap flanks. Finally, we compare the genome sequence assembled using our hybrid strategy to a classical reference assembly using the same data as input and show that with the availability of a reference genome, it pays off to use the hybrid de novo strategy, rather than a classical reference assembly, because more genome sequences are preserved using the former. Copyright © 2011 Elsevier B.V. All rights reserved.
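The De Bruijn graph component mentioned in this abstract can be sketched in a few lines of Python. This is a generic textbook illustration, not the assemblers or parameters the authors used: nodes are (k−1)-mers, each k-mer in a read contributes one edge, and a contig is extended while the current node has exactly one successor. The function names, the toy reads, and the choice of k are hypothetical, and real assemblers additionally handle sequencing errors, reverse complements, and branching.

```python
from collections import defaultdict


def de_bruijn(reads, k):
    """Build a De Bruijn graph: each k-mer links its (k-1)-mer prefix
    to its (k-1)-mer suffix."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def extend_contig(graph, start):
    """Walk from `start` while the path is unambiguous (one successor),
    stopping at a branch, a dead end, or a node already visited."""
    contig = start
    node = start
    seen = {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:
            break
        contig += nxt[-1]
        seen.add(nxt)
        node = nxt
    return contig


if __name__ == "__main__":
    reads = ["ATGGCG", "GGCGTG", "CGTGCA"]  # toy overlapping reads
    graph = de_bruijn(reads, k=4)
    print(extend_contig(graph, "ATG"))
```

Short reads limit how large k can be, which is why repeats longer than k − 1 create branches that stop the walk and leave gaps of the kind the abstract closes by iterative anchoring.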
Prediction of BP reactivity to talking using hybrid soft computing approaches.

PubMed

Kaur, Gurmanik; Arora, Ajat Shatru; Jain, Vijender Kumar

2014-01-01

High blood pressure (BP) is associated with an increased risk of cardiovascular diseases. Therefore, optimal precision in measurement of BP is appropriate in clinical and research studies. In this work, anthropometric characteristics including age, height, weight, body mass index (BMI), and arm circumference (AC) were used as independent predictor variables for the prediction of BP reactivity to talking. Principal component analysis (PCA) was fused with artificial neural network (ANN), adaptive neurofuzzy inference system (ANFIS), and least square-support vector machine (LS-SVM) model to remove the multicollinearity effect among anthropometric predictor variables. The statistical tests in terms of coefficient of determination (R²), root mean square error (RMSE), and mean absolute percentage error (MAPE) revealed that PCA based LS-SVM (PCA-LS-SVM) model produced a more efficient prediction of BP reactivity as compared to other models. This assessment presents the importance and advantages posed by PCA fused prediction models for prediction of biological variables.
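The three evaluation statistics named in this abstract have standard definitions, sketched below in plain Python. The function names and the sample numbers are hypothetical; the record does not give the study's data, so nothing here reproduces its results.

```python
from math import sqrt


def rmse(actual, predicted):
    """Root mean square error."""
    n = len(actual)
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent.
    Assumes no actual value is zero."""
    n = len(actual)
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / n


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot.
    Assumes the actual values are not all identical."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Made-up readings, only to show the calls.
    actual = [120.0, 130.0, 125.0, 140.0]
    predicted = [118.0, 133.0, 126.0, 137.0]
    print(rmse(actual, predicted), mape(actual, predicted), r_squared(actual, predicted))
```

A model is preferred when it gives a higher R² together with lower RMSE and MAPE on the same held-out data.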
Interfacial transduction of nucleic acid hybridization using immobilized quantum dots as donors in fluorescence resonance energy transfer.

PubMed

Algar, W Russ; Krull, Ulrich J

2009-01-06

Fluorescence resonance energy transfer (FRET) using immobilized quantum dots (QDs) as energy donors was explored as a transduction method for the detection of nucleic acid hybridization at an interface. This research was motivated by the success of the QD-FRET-based transduction of nucleic acid hybridization in solution-phase assays. This new work represents a fundamental step toward the assembly of a biosensor, where immobilization of the selective chemistry on a surface is desired. After immobilizing QD-probe oligonucleotide conjugates on optical fibers, a demonstration of the retention of selectivity was achieved by the introduction of acceptor (Cy3)-labeled single-stranded target oligonucleotides. Hybridization generated the proximity required for FRET, and the resulting fluorescence spectra provided an analytical signal proportional to the amount of target. This research provides an important framework for the future development of nucleic acid biosensors based on QDs and FRET. The most important findings of this work are that (1) a QD-FRET solid-phase hybridization assay is viable and (2) a passivating layer of denatured bovine serum albumin alleviates nonspecific adsorption, ultimately resulting in (3) the potential for a reusable assay format and mismatch discrimination. In this, the first incarnation of a solid-phase QD-FRET hybridization assay, the limit of detection was found to be 5 nM, and the dynamic range was almost 2 orders of magnitude. Selective discrimination of the target was shown using a three-base-pairs mismatch from a fully complementary sequence. Despite a gradual loss of signal, reuse of the optical fibers over multiple cycles of hybridization and dehybridization was possible. Directions for further improvement of the analytical performance by optimizing the design of the QD-probe oligonucleotide interface are identified.
Direct Spectroscopic Study of Reconstituted Transcription Complexes Reveals That Intrinsic Termination Is Driven Primarily by Thermodynamic Destabilization of the Nucleic Acid Framework

PubMed Central

Datta, Kausiki; von Hippel, Peter H.

2008-01-01

Changes in near UV circular dichroism (CD) and fluorescence spectra of site-specifically placed pairs of 2-aminopurine residues have been used to probe the roles of the RNA hairpin and the RNA-DNA hybrid in controlling intrinsic termination of transcription. Functional transcription complexes were assembled directly by mixing preformed nucleic acid scaffolds of defined sequence with T7 RNA polymerase (RNAP). Scaffolds containing RNA hairpins immediately upstream of a GC-rich hybrid formed complexes of reduced stability, whereas the same hairpins adjacent to a hybrid of rU-dA base pairs triggered complex dissociation and transcript release. 2-Aminopurine probes at the upstream ends of the hairpin stems show that the hairpins open on RNAP binding and that stem re-formation begins after one or two RNA bases on the downstream side of the stem have emerged from the RNAP exit tunnel. Hairpins directly adjacent to the RNA-DNA hybrid weaken RNAP binding, decrease elongation efficiency, and disrupt the upstream end of the hybrid as well as interfere with the movement of the template base at the RNAP active site. Probing the edges of the DNA transcription bubble demonstrates that termination hairpins prevent translocation of the RNAP, suggesting that they transiently “lock” the polymerase to the nucleic acid scaffold and, thus, hold the RNA-DNA hybrid “in frame.” At intrinsic terminators the weak rU-dA hybrid and the adjacent termination hairpin combine to destabilize the elongation complex sufficiently to permit significant transcript release, whereas hairpin-dependent pausing provides time for the process to go to completion. PMID:18070878
Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus.

PubMed

Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh; Kim, Bum-Joon

2017-01-01

Recent multilocus sequence typing (MLST) and genome-based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may have been laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed completely identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated.
Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus

PubMed Central

Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh

2017-01-01

Recent multilocus sequence typing (MLST) and genome-based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may have been laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed completely identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated. PMID:28604829
mLASSO-Hum: A LASSO-based interpretable human-protein subcellular localization predictor.

PubMed

Wan, Shibiao; Mak, Man-Wai; Kung, Sun-Yuan

2015-10-07

Knowing the subcellular compartments of human proteins is essential to shed light on the mechanisms of a broad range of human diseases. In computational methods for protein subcellular localization, knowledge-based methods (especially gene ontology (GO) based methods) are known to perform better than sequence-based methods. However, existing GO-based predictors often lack interpretability and suffer from overfitting due to the high dimensionality of feature vectors. To address these problems, this paper proposes an interpretable multi-label predictor, namely mLASSO-Hum, which can yield sparse and interpretable solutions for large-scale prediction of human protein subcellular localization. By using the one-vs-rest LASSO-based classifiers, 87 out of more than 8000 GO terms are found to play more significant roles in determining the subcellular localization. Based on these 87 essential GO terms, we can decide not only where a protein resides within a cell, but also why it is located there. To further exploit information from the remaining GO terms, a method based on the GO hierarchical information derived from the depth distance of GO terms is proposed. Experimental results show that mLASSO-Hum performs significantly better than state-of-the-art predictors. We also found that in addition to the GO terms from the cellular component category, GO terms from the other two categories also play important roles in the final classification decisions. For readers' convenience, the mLASSO-Hum server is available online at http://bioinfo.eie.polyu.edu.hk/mLASSOHumServer/. Copyright © 2015 Elsevier Ltd. All rights reserved.
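
The sparsity argument in this abstract (a small subset of GO terms receiving non-zero weights) can be illustrated with a minimal one-vs-rest LASSO sketch. This is not the mLASSO-Hum implementation: the coordinate-descent solver, the function names, the toy feature matrix, the omission of an intercept, and the penalty value `alpha = 0.1` are all assumptions made for illustration. Each column stands in for a hypothetical GO-term indicator.

```python
def soft_threshold(a, t):
    """Soft-thresholding operator used by the LASSO coordinate update."""
    if a > t:
        return a - t
    if a < -t:
        return a + t
    return 0.0


def lasso_fit(X, y, alpha, n_iter=200):
    """Minimise (1/2n)*||y - Xw||^2 + alpha*||w||_1 by coordinate descent.

    No intercept is fitted (a simplification for this sketch).
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # equals y - Xw while w is all zeros
    for _ in range(n_iter):
        for j in range(p):
            z = sum(X[i][j] ** 2 for i in range(n)) / n
            if z == 0.0:
                continue  # an all-zero column can never be selected
            # Correlation of feature j with the residual that excludes j.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_w = soft_threshold(rho, alpha) / z
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_w
    return w


def one_vs_rest_lasso(X, label_sets, classes, alpha):
    """Fit one LASSO model per class; label_sets[i] is the set of labels of sample i."""
    return {
        c: lasso_fit(X, [1.0 if c in labels else 0.0 for labels in label_sets], alpha)
        for c in classes
    }


# Toy data: 6 proteins x 4 hypothetical GO-term indicators.
X = [
    [1.0, 0.0, 1.0, 0.0],
    [1.0, 0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0, 0.0],
]
label_sets = [{"nucleus"}] * 3 + [{"cytoplasm"}] * 3
models = one_vs_rest_lasso(X, label_sets, ["nucleus", "cytoplasm"], alpha=0.1)
selected = {c: [j for j, wj in enumerate(w) if abs(wj) > 1e-12] for c, w in models.items()}

if __name__ == "__main__":
    print(selected)
```

On this toy input each classifier keeps a single informative column and sets the two uninformative ones to exactly zero, which is the kind of term selection that makes the weights readable as "why" a label was assigned.
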
Design and Manufacturing of a Novel Shear Thickening Fluid Composite (STFC) with Enhanced out-of-Plane Properties and Damage Suppression

NASA Astrophysics Data System (ADS)

Pinto, F.; Meo, M.

2017-06-01

The ability to absorb a large amount of energy during an impact event without generating critical damage represents a key feature of new generation composite systems. Indeed, the intrinsic layered nature of composite materials allows the embodiment of specific hybrid plies within the stacking sequence that can be exploited to increase impact resistance and damping of the entire structure without dramatic weight increase. This work is based on the development of an impact-resistant hybrid composite obtained by including a thin layer of non-Newtonian silica-based fluid in a carbon fibre reinforced polymer (CFRP) laminate. This hybrid phase is able to respond to an external solicitation by activating an order-disorder transition that thickens the fluid, increasing its viscosity and hence dissipating the impact energy without any critical failure. Several Shear Thickening Fluids (STFs) were manufactured by changing the dimensions of the particles that constitute the disperse phase and their concentrations in the continuous phase. The dynamic viscosity of the different STFs was evaluated via rheometric tests, observing both shear thinning and shear thickening effects depending on the concentration of silica particles. The solutions were then embedded as an active layer within the stacking sequence to manufacture the hybrid CFRP laminates with different embedded STFs. Free vibration tests were carried out in order to assess the damping properties of the different laminates, while low velocity impact tests were used to evaluate their impact properties. Results indicate that the non-Newtonian fluid is able to absorb up to 45% of the energy during an impact event at 2.5 m/s, depending on the concentration and dimensions of the particles. These results were confirmed via C-Scan analyses to assess the extent of the internal delamination.
GFam: a platform for automatic annotation of gene families.

PubMed

Sasidharan, Rajkumar; Nepusz, Tamás; Swarbreck, David; Huala, Eva; Paccanaro, Alberto

2012-10-01

We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families and derive meaningful functional labels, and offers a seamless approach to propagate functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources, followed by a sequence-based connected component analysis of un-annotated sequence regions, to derive a consensus domain architecture for each sequence and subsequently generate families based on common architectures. For the proteome of Arabidopsis, our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points relative to the best single-constituent database within InterPro. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam's capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/.
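
The pipeline described in this abstract (greedy chaining of domain hits, then grouping sequences by shared architecture) can be sketched as follows. This is a simplified illustration, not GFam's code: the longest-hit-first ordering, the no-overlap rule, the function names, and the toy proteins are assumptions, and the connected component analysis of un-annotated regions is not implemented here (the sketch only reports where those regions lie).

```python
from collections import defaultdict


def greedy_architecture(hits):
    """Pick non-overlapping domain hits greedily, longest first (assumed criterion).

    hits -- list of (start, end, domain_id) with 1-based inclusive coordinates.
    Returns the chosen hits sorted by start position.
    """
    chosen = []
    for start, end, dom in sorted(hits, key=lambda h: (-(h[1] - h[0] + 1), h[0])):
        # Keep the hit only if it overlaps none of the hits already chosen.
        if all(end < s or start > e for s, e, _ in chosen):
            chosen.append((start, end, dom))
    return sorted(chosen)


def unannotated_regions(length, chosen, min_len=1):
    """Return (start, end) gaps not covered by the chosen hits (sorted by start)."""
    gaps, pos = [], 1
    for start, end, _ in chosen:
        if start - pos >= min_len:
            gaps.append((pos, start - 1))
        pos = end + 1
    if length - pos + 1 >= min_len:
        gaps.append((pos, length))
    return gaps


def families_by_architecture(proteins):
    """Group protein names by their ordered tuple of chosen domain ids."""
    families = defaultdict(list)
    for name, (length, hits) in proteins.items():
        arch = tuple(dom for _, _, dom in greedy_architecture(hits))
        families[arch].append(name)
    return dict(families)


# Hypothetical proteins: name -> (sequence length, domain hits).
proteins = {
    "protA": (300, [(10, 120, "D1"), (100, 150, "D2"), (200, 280, "D3")]),
    "protB": (250, [(5, 110, "D1"), (150, 240, "D3")]),
    "protC": (180, [(20, 90, "D2")]),
}

if __name__ == "__main__":
    print(families_by_architecture(proteins))
    length, hits = proteins["protA"]
    print(unannotated_regions(length, greedy_architecture(hits)))
```

In this toy input the D2 hit on protA is dropped because it overlaps the longer D1 hit, so protA and protB end up in the same D1-D3 family. The reported gaps are the regions where a follow-up analysis of un-annotated sequence would apply.
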
Consequences of Normalizing Transcriptomic and Genomic Libraries of Plant Genomes Using a Duplex-Specific Nuclease and Tetramethylammonium Chloride

PubMed Central

Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

2013-01-01

Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088
