assist large sequencing: Topics by Science.gov

Sample records for assist large sequencing

Sequencing consolidates molecular markers with plant breeding practice.

PubMed

Yang, Huaan; Li, Chengdao; Lam, Hon-Ming; Clements, Jonathan; Yan, Guijun; Zhao, Shancen

2015-05-01

Plenty of molecular markers have been developed by contemporary sequencing technologies, whereas few of them are successfully applied in breeding, thus we present a review on how sequencing can facilitate marker-assisted selection in plant breeding. The growing global population and shrinking arable land area require efficient plant breeding. Novel strategies assisted by certain markers have proven effective for genetic gains. Fortunately, cutting-edge sequencing technologies bring us a deluge of genomes and genetic variations, enlightening the potential of marker development. However, a large gap still exists between the potential of molecular markers and actual plant breeding practices. In this review, we discuss marker-assisted breeding from a historical perspective, describe the road from crop sequencing to breeding, and highlight how sequencing facilitates the application of markers in breeding practice.
Radiation hybrid maps of D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes

USDA-ARS?s Scientific Manuscript database

The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high-resolution genome maps saturated with ordered markers to assist in anchoring and orienting BAC contigs/ sequence scaffolds for whole genome sequence assembly. Radiation hybrid (RH) mapping has proven to be an e...
PLAN-IT: Knowledge-Based Mission Sequencing

NASA Astrophysics Data System (ADS)

Biefeld, Eric W.

1987-02-01

Mission sequencing consumes a large amount of time and manpower during a space exploration effort. Plan-It is a knowledge-based approach to assist in mission sequencing. Plan-It uses a combined frame and blackboard architecture. This paper reports on the approach implemented by Plan-It and the current applications of Plan-It for sequencing at NASA.
Fluorescence in situ hybridization and optical mapping to correct scaffold arrangement in the tomato genome

USDA-ARS?s Scientific Manuscript database

Modern biological analyses are often assisted by recent technologies making the sequencing of complex genomes both technically possible and feasible. We recently sequenced the tomato genome that, like many eukaryotic genomes, is large and complex. Current sequencing technologies allow the developmen...
FOUNTAIN: A JAVA open-source package to assist large sequencing projects

PubMed Central

Buerstedde, Jean-Marie; Prill, Florian

2001-01-01

Background Better automation, lower cost per reaction and a heightened interest in comparative genomics has led to a dramatic increase in DNA sequencing activities. Although the large sequencing projects of specialized centers are supported by in-house bioinformatics groups, many smaller laboratories face difficulties managing the appropriate processing and storage of their sequencing output. The challenges include documentation of clones, templates and sequencing reactions, and the storage, annotation and analysis of the large number of generated sequences. Results We describe here a new program, named FOUNTAIN, for the management of large sequencing projects . FOUNTAIN uses the JAVA computer language and data storage in a relational database. Starting with a collection of sequencing objects (clones), the program generates and stores information related to the different stages of the sequencing project using a web browser interface for user input. The generated sequences are subsequently imported and annotated based on BLAST searches against the public databases. In addition, simple algorithms to cluster sequences and determine putative polymorphic positions are implemented. Conclusions A simple, but flexible and scalable software package is presented to facilitate data generation and storage for large sequencing projects. Open source and largely platform and database independent, we wish FOUNTAIN to be improved and extended in a community effort. PMID:11591214
Genotyping by sequencing for SNP-based linkage analysis and identification of QTLs linked to fruit quality traits in Japanese plum (Prunus salicina Lindl.)

USDA-ARS?s Scientific Manuscript database

Marker-assisted selection (MAS) in stone fruit (Prunus species) breeding is currently difficult to achieve due to the polygenic nature of themost relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for finemapp...
Spectra library assisted de novo peptide sequencing for HCD and ETD spectra pairs.

PubMed

Yan, Yan; Zhang, Kaizhong

2016-12-23

De novo peptide sequencing via tandem mass spectrometry (MS/MS) has been developed rapidly in recent years. With the use of spectra pairs from the same peptide under different fragmentation modes, performance of de novo sequencing is greatly improved. Currently, with large amount of spectra sequenced everyday, spectra libraries containing tens of thousands of annotated experimental MS/MS spectra become available. These libraries provide information of the spectra properties, thus have the potential to be used with de novo sequencing to improve its performance. In this study, an improved de novo sequencing method assisted with spectra library is proposed. It uses spectra libraries as training datasets and introduces significant scores of the features used in our previous de novo sequencing method for HCD and ETD spectra pairs. Two pairs of HCD and ETD spectral datasets were used to test the performance of the proposed method and our previous method. The results show that this proposed method achieves better sequencing accuracy with higher ranked correct sequences and less computational time. This paper proposed an advanced de novo sequencing method for HCD and ETD spectra pair and used information from spectra libraries and significant improved previous similar methods.
Machine learning applications in genetics and genomics.

PubMed

Libbrecht, Maxwell W; Noble, William Stafford

2015-06-01

The field of machine learning, which aims to develop computer algorithms that improve with experience, holds promise to enable computers to assist humans in the analysis of large, complex data sets. Here, we provide an overview of machine learning applications for the analysis of genome sequencing data sets, including the annotation of sequence elements and epigenetic, proteomic or metabolomic data. We present considerations and recurrent challenges in the application of supervised, semi-supervised and unsupervised machine learning methods, as well as of generative and discriminative modelling approaches. We provide general guidelines to assist in the selection of these machine learning methods and their practical application for the analysis of genetic and genomic data sets.
Mining on scorpion venom biodiversity.

PubMed

Rodríguez de la Vega, Ricardo C; Schwartz, Elisabeth F; Possani, Lourival D

2010-12-15

Scorpion venoms are complex mixtures of dozens or even hundreds of distinct proteins, many of which are inter-genome active elements. Fifty years after the first scorpion toxin sequences were determined, chromatography-assisted purification followed by automated protein sequencing or gene cloning, on a case-by-case basis, accumulated nearly 250 amino acid sequences of scorpion venom components. A vast majority of the available sequences correspond to proteins adopting a common three-dimensional fold, whose ion channel modulating functions have been firmly established or could be confidently inferred. However, the actual molecular diversity contained in scorpion venoms -as revealed by bioassay-driven purification, some unexpected activities of "canonical" neurotoxins and even serendipitous discoveries- is much larger than those "canonical" toxin types. In the last few years mining into the molecular diversity contained in scorpion has been assisted by high-throughput Mass Spectrometry techniques and large-scale DNA sequencing, collectively accounting for the more than twofold increase in the number of known sequences of scorpion venom components (now reaching 500 unique sequences). This review, from a comparative perspective, deals with recent data obtained by proteomic and transcriptomic studies on scorpion venoms and venom glands. Altogether, these studies reveal a large contribution of non canonical venom components, which would account for more than half of the total protein diversity of any scorpion venom. On top of aiding at the better understanding of scorpion venom biology, whether in the context of venom function or within the venom gland itself, these "novel" venom components certainly are an interesting source of bioactive proteins, whose characterization is worth pursuing. Copyright © 2009 Elsevier Ltd. All rights reserved.
BACCardI--a tool for the validation of genomic assemblies, assisting genome finishing and intergenome comparison.

PubMed

Bartels, Daniela; Kespohl, Sebastian; Albaum, Stefan; Drüke, Tanja; Goesmann, Alexander; Herold, Julia; Kaiser, Olaf; Pühler, Alfred; Pfeiffer, Friedhelm; Raddatz, Günter; Stoye, Jens; Meyer, Folker; Schuster, Stephan C

2005-04-01

We provide the graphical tool BACCardI for the construction of virtual clone maps from standard assembler output files or BLAST based sequence comparisons. This new tool has been applied to numerous genome projects to solve various problems including (a) validation of whole genome shotgun assemblies, (b) support for contig ordering in the finishing phase of a genome project, and (c) intergenome comparison between related strains when only one of the strains has been sequenced and a large insert library is available for the other. The BACCardI software can seamlessly interact with various sequence assembly packages. Genomic assemblies generated from sequence information need to be validated by independent methods such as physical maps. The time-consuming task of building physical maps can be circumvented by virtual clone maps derived from read pair information of large insert libraries.
Rational design of DNA sequences for nanotechnology, microarrays and molecular computers using Eulerian graphs.

PubMed

Pancoska, Petr; Moravek, Zdenek; Moll, Ute M

2004-01-01

Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.
Trajectory design for a rendezvous mission to Earth's Trojan asteroid 2010 TK7

NASA Astrophysics Data System (ADS)

Lei, Hanlun; Xu, Bo; Zhang, Lei

2017-12-01

In this paper a rendezvous mission to the Earth's Trojan asteroid 2010 TK7 is proposed, and preliminary transfer trajectories are designed. Due to the high inclination (∼ 20.9°) of the target asteroid relative to the ecliptic plane, direct transfers usually require large amounts of fuel consumption, which is beyond the capacity of current technology. As gravity assist technique could effectively change the inclination of spacecraft's trajectory, it is adopted to reduce the launch energy and rendezvous velocity maneuver. In practical computation, impulsive and low-thrust, gravity-assisted trajectories are considered. Among all the trajectories computed, the low-thrust gravity-assisted trajectory with Venus-Earth-Venus (V-E-V) swingby sequence performs the best in terms of propellant mass. For a spacecraft with initial mass of 800 kg , propellant mass of the best trajectory is 36.74 kg . Numerical results indicate that both the impulsive and low-thrust, gravity-assisted trajectories corresponding to V-E-V sequence could satisfy mission constraints, and can be applied to practical rendezvous mission.
Computer-Vision-Assisted Palm Rehabilitation With Supervised Learning.

PubMed

Vamsikrishna, K M; Dogra, Debi Prosad; Desarkar, Maunendra Sankar

2016-05-01

Physical rehabilitation supported by the computer-assisted-interface is gaining popularity among health-care fraternity. In this paper, we have proposed a computer-vision-assisted contactless methodology to facilitate palm and finger rehabilitation. Leap motion controller has been interfaced with a computing device to record parameters describing 3-D movements of the palm of a user undergoing rehabilitation. We have proposed an interface using Unity3D development platform. Our interface is capable of analyzing intermediate steps of rehabilitation without the help of an expert, and it can provide online feedback to the user. Isolated gestures are classified using linear discriminant analysis (DA) and support vector machines (SVM). Finally, a set of discrete hidden Markov models (HMM) have been used to classify gesture sequence performed during rehabilitation. Experimental validation using a large number of samples collected from healthy volunteers reveals that DA and SVM perform similarly while applied on isolated gesture recognition. We have compared the results of HMM-based sequence classification with CRF-based techniques. Our results confirm that both HMM and CRF perform quite similarly when tested on gesture sequences. The proposed system can be used for home-based palm or finger rehabilitation in the absence of experts.
Achievements and prospects of genomics-assisted breeding in three legume crops of the semi-arid tropics.

PubMed

Varshney, Rajeev K; Mohan, S Murali; Gaur, Pooran M; Gangarao, N V P R; Pandey, Manish K; Bohra, Abhishek; Sawargaonkar, Shrikant L; Chitikineni, Annapurna; Kimurto, Paul K; Janila, Pasupuleti; Saxena, K B; Fikre, Asnake; Sharma, Mamta; Rathore, Abhishek; Pratap, Aditya; Tripathi, Shailesh; Datta, Subhojit; Chaturvedi, S K; Mallikarjuna, Nalini; Anuradha, G; Babbar, Anita; Choudhary, Arbind K; Mhase, M B; Bharadwaj, Ch; Mannur, D M; Harer, P N; Guo, Baozhu; Liang, Xuanqiang; Nadarajan, N; Gowda, C L L

2013-12-01

Advances in next-generation sequencing and genotyping technologies have enabled generation of large-scale genomic resources such as molecular markers, transcript reads and BAC-end sequences (BESs) in chickpea, pigeonpea and groundnut, three major legume crops of the semi-arid tropics. Comprehensive transcriptome assemblies and genome sequences have either been developed or underway in these crops. Based on these resources, dense genetic maps, QTL maps as well as physical maps for these legume species have also been developed. As a result, these crops have graduated from 'orphan' or 'less-studied' crops to 'genomic resources rich' crops. This article summarizes the above-mentioned advances in genomics and genomics-assisted breeding applications in the form of marker-assisted selection (MAS) for hybrid purity assessment in pigeonpea; marker-assisted backcrossing (MABC) for introgressing QTL region for drought-tolerance related traits, Fusarium wilt (FW) resistance and Ascochyta blight (AB) resistance in chickpea; late leaf spot (LLS), leaf rust and nematode resistance in groundnut. We critically present the case of use of other modern breeding approaches like marker-assisted recurrent selection (MARS) and genomic selection (GS) to utilize the full potential of genomics-assisted breeding for developing superior cultivars with enhanced tolerance to various environmental stresses. In addition, this article recommends the use of advanced-backcross (AB-backcross) breeding and development of specialized populations such as multi-parents advanced generation intercross (MAGIC) for creating new variations that will help in developing superior lines with broadened genetic base. In summary, we propose the use of integrated genomics and breeding approach in these legume crops to enhance crop productivity in marginal environments ensuring food security in developing countries. Copyright © 2012 Elsevier Inc. All rights reserved.
Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

PubMed Central

Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

2013-01-01

Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
LookSeq: a browser-based viewer for deep sequencing data.

PubMed

Manske, Heinrich Magnus; Kwiatkowski, Dominic P

2009-11-01

Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an overview of a genomic region to fine details such as heterogeneity within the sample. A specific problem, particularly if the sample is heterogeneous, is how to depict information about structural variation. LookSeq provides a simple graphical representation of paired sequence reads that is more revealing about potential insertions and deletions than are conventional methods.
Electrochemical detection of sequence-specific DNA based on formation of G-quadruplex-hemin through continuous hybridization chain reaction.

PubMed

Sun, Xiaofan; Chen, Haohan; Wang, Shuling; Zhang, Yiping; Tian, Yaping; Zhou, Nandi

2018-08-27

A high-sensitive detection of sequence-specific DNA was established based on the formation of G-quadruplex-hemin complex through continuous hybridization chain reaction (HCR). Taking HIV DNA sequence as an example, a capture probe complementary to part of HIV DNA was firstly self-assembled onto the surface of Au electrode. Then a specially designed assistant probe with both terminals complementary to the target DNA and a G-quadruplex-forming sequence in the center was introduced into the detection solution. In the presence of both the target DNA and the assistant probe, the target DNA can be captured on the electrode surface and then a continuous HCR can be conducted due to the mutual recognition of the target DNA and the assistant probe, leading to the formation of a large number of G-quadruplex on the electrode surface. With the help of hemin, a pronounced electrochemical signal can be observed in differential pulse voltammetry (DPV), due to the formation of G-quadruplex-hemin complex. The peak current is linearly related with the logarithm of the concentration of the target DNA in the range from 10 fM to 10 pM. The electrochemical sensor has high selectivity to clearly discriminate single-base mismatched and three-base mismatched sequences from the original HIV DNA sequence. Moreover, the established DNA sensor was challenged by detection of HIV DNA in human serum samples, which showed the low detection limit of 6.3 fM. Thus it has great application prospect in the field of clinical diagnosis and environmental monitoring. Copyright © 2018 Elsevier B.V. All rights reserved.
SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters.

PubMed

Rattei, Thomas; Tischler, Patrick; Götz, Stefan; Jehl, Marc-André; Hoser, Jonathan; Arnold, Roland; Conesa, Ana; Mewes, Hans-Werner

2010-01-01

The prediction of protein function as well as the reconstruction of evolutionary genesis employing sequence comparison at large is still the most powerful tool in sequence analysis. Due to the exponential growth of the number of known protein sequences and the subsequent quadratic growth of the similarity matrix, the computation of the Similarity Matrix of Proteins (SIMAP) becomes a computational intensive task. The SIMAP database provides a comprehensive and up-to-date pre-calculation of the protein sequence similarity matrix, sequence-based features and sequence clusters. As of September 2009, SIMAP covers 48 million proteins and more than 23 million non-redundant sequences. Novel features of SIMAP include the expansion of the sequence space by including databases such as ENSEMBL as well as the integration of metagenomes based on their consistent processing and annotation. Furthermore, protein function predictions by Blast2GO are pre-calculated for all sequences in SIMAP and the data access and query functions have been improved. SIMAP assists biologists to query the up-to-date sequence space systematically and facilitates large-scale downstream projects in computational biology. Access to SIMAP is freely provided through the web portal for individuals (http://mips.gsf.de/simap/) and for programmatic access through DAS (http://webclu.bio.wzw.tum.de/das/) and Web-Service (http://mips.gsf.de/webservices/services/SimapService2.0?wsdl).
Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing.

PubMed

Zhao, Shanrong; Prenger, Kurt; Smith, Lance; Messina, Thomas; Fan, Hongtao; Jaeger, Edward; Stephens, Susan

2013-06-27

Technical improvements have decreased sequencing costs and, as a result, the size and number of genomic datasets have increased rapidly. Because of the lower cost, large amounts of sequence data are now being produced by small to midsize research groups. Crossbow is a software tool that can detect single nucleotide polymorphisms (SNPs) in whole-genome sequencing (WGS) data from a single subject; however, Crossbow has a number of limitations when applied to multiple subjects from large-scale WGS projects. The data storage and CPU resources that are required for large-scale whole genome sequencing data analyses are too large for many core facilities and individual laboratories to provide. To help meet these challenges, we have developed Rainbow, a cloud-based software package that can assist in the automation of large-scale WGS data analyses. Here, we evaluated the performance of Rainbow by analyzing 44 different whole-genome-sequenced subjects. Rainbow has the capacity to process genomic data from more than 500 subjects in two weeks using cloud computing provided by the Amazon Web Service. The time includes the import and export of the data using Amazon Import/Export service. The average cost of processing a single sample in the cloud was less than 120 US dollars. Compared with Crossbow, the main improvements incorporated into Rainbow include the ability: (1) to handle BAM as well as FASTQ input files; (2) to split large sequence files for better load balance downstream; (3) to log the running metrics in data processing and monitoring multiple Amazon Elastic Compute Cloud (EC2) instances; and (4) to merge SOAPsnp outputs for multiple individuals into a single file to facilitate downstream genome-wide association studies. Rainbow is a scalable, cost-effective, and open-source tool for large-scale WGS data analysis. For human WGS data sequenced by either the Illumina HiSeq 2000 or HiSeq 2500 platforms, Rainbow can be used straight out of the box. Rainbow is available for third-party implementation and use, and can be downloaded from http://s3.amazonaws.com/jnj_rainbow/index.html.
Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing

PubMed Central

2013-01-01

Background Technical improvements have decreased sequencing costs and, as a result, the size and number of genomic datasets have increased rapidly. Because of the lower cost, large amounts of sequence data are now being produced by small to midsize research groups. Crossbow is a software tool that can detect single nucleotide polymorphisms (SNPs) in whole-genome sequencing (WGS) data from a single subject; however, Crossbow has a number of limitations when applied to multiple subjects from large-scale WGS projects. The data storage and CPU resources that are required for large-scale whole genome sequencing data analyses are too large for many core facilities and individual laboratories to provide. To help meet these challenges, we have developed Rainbow, a cloud-based software package that can assist in the automation of large-scale WGS data analyses. Results Here, we evaluated the performance of Rainbow by analyzing 44 different whole-genome-sequenced subjects. Rainbow has the capacity to process genomic data from more than 500 subjects in two weeks using cloud computing provided by the Amazon Web Service. The time includes the import and export of the data using Amazon Import/Export service. The average cost of processing a single sample in the cloud was less than 120 US dollars. Compared with Crossbow, the main improvements incorporated into Rainbow include the ability: (1) to handle BAM as well as FASTQ input files; (2) to split large sequence files for better load balance downstream; (3) to log the running metrics in data processing and monitoring multiple Amazon Elastic Compute Cloud (EC2) instances; and (4) to merge SOAPsnp outputs for multiple individuals into a single file to facilitate downstream genome-wide association studies. Conclusions Rainbow is a scalable, cost-effective, and open-source tool for large-scale WGS data analysis. For human WGS data sequenced by either the Illumina HiSeq 2000 or HiSeq 2500 platforms, Rainbow can be used straight out of the box. Rainbow is available for third-party implementation and use, and can be downloaded from http://s3.amazonaws.com/jnj_rainbow/index.html. PMID:23802613

Characterization of the Kenaf (Hibiscus cannabinus) Global Transcriptome Using Illumina Paired-End Sequencing and Development of EST-SSR Markers

PubMed Central

Li, Hui; Li, Defang; Chen, Anguo; Tang, Huijuan; Li, Jianjun; Huang, Siqi

2016-01-01

Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies. PMID:26960153
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

PubMed Central

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Gold nanoparticles for high-throughput genotyping of long-range haplotypes

NASA Astrophysics Data System (ADS)

Chen, Peng; Pan, Dun; Fan, Chunhai; Chen, Jianhua; Huang, Ke; Wang, Dongfang; Zhang, Honglu; Li, You; Feng, Guoyin; Liang, Peiji; He, Lin; Shi, Yongyong

2011-10-01

Completion of the Human Genome Project and the HapMap Project has led to increasing demands for mapping complex traits in humans to understand the aetiology of diseases. Identifying variations in the DNA sequence, which affect how we develop disease and respond to pathogens and drugs, is important for this purpose, but it is difficult to identify these variations in large sample sets. Here we show that through a combination of capillary sequencing and polymerase chain reaction assisted by gold nanoparticles, it is possible to identify several DNA variations that are associated with age-related macular degeneration and psoriasis on significant regions of human genomic DNA. Our method is accurate and promising for large-scale and high-throughput genetic analysis of susceptibility towards disease and drug resistance.
Sequence stratigraphy and sedimentary study on Mishrif formation of Fauqi Oilfield of Missan in south east Iraq

NASA Astrophysics Data System (ADS)

Sang, Hua; Lin, Changsong; Jiang, Yiming

2017-05-01

The reservoir of Mishrif formation has a large scale distribution of marine facies carbonate sediments in great thickness in central and south east Iraq. Rudist reef and shoal facies limestones of the Mishrif Formation (Late Cenomanian - Middle Turonian) form a great potential reservoir rocks at oilfields and structures of Iraq. Facies modelling was applied to predict the relationship between facies distribution and reservoir characteristics to construct a predictive geologic model which will assist future exploration and development in south east Iraq. Microfacies analysis and electrofacies identification and correlations indicate that the limestone of the Mishrif Formation were mainly deposited in open platform setting. Sequence stratigraphic analyses of the Mishrif Formation indicate 3 third order depositional sequences.
Design of a sensitive aptasensor based on magnetic microbeads-assisted strand displacement amplification and target recycling.

PubMed

Li, Ying; Ji, Xiaoting; Song, Weiling; Guo, Yingshu

2013-04-03

A cross-circular amplification system for sensitive detection of adenosine triphosphate (ATP) in cancer cells was developed based on aptamer-target interaction, magnetic microbeads (MBs)-assisted strand displacement amplification and target recycling. Here we described a new recognition probe possessing two parts, the ATP aptamer and the extension part. The recognition probe was firstly immobilized on the surface of MBs and hybridized with its complementary sequence to form a duplex. When combined with ATP, the probe changed its conformation, revealing the extension part in single-strand form, which further served as a toehold for subsequent target recycling. The released complementary sequence of the probe acted as the catalyst of the MB-assisted strand displacement reaction. Incorporated with target recycling, a large amount of biotin-tagged MB complexes were formed to stimulate the generation of chemiluminescence (CL) signal in the presence of luminol and H2O2 by incorporating with streptavidin-HRP, reaching a detection limit of ATP as low as 6.1×10(-10)M. Moreover, sample assays of ATP in Ramos Burkitt's lymphoma B cells were performed, which confirmed the reliability and practicality of the protocol. Copyright © 2013 Elsevier B.V. All rights reserved.
Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase).

PubMed

Odronitz, Florian; Kollmar, Martin

2006-11-29

Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. Up to date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows to conveniently browse the database and to compile tabular and graphical summaries of its content. We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein.
Bioinformatics by Example: From Sequence to Target

NASA Astrophysics Data System (ADS)

Kossida, Sophia; Tahri, Nadia; Daizadeh, Iraj

2002-12-01

With the completion of the human genome, and the imminent completion of other large-scale sequencing and structure-determination projects, computer-assisted bioscience is aimed to become the new paradigm for conducting basic and applied research. The presence of these additional bioinformatics tools stirs great anxiety for experimental researchers (as well as for pedagogues), since they are now faced with a wider and deeper knowledge of differing disciplines (biology, chemistry, physics, mathematics, and computer science). This review targets those individuals who are interested in using computational methods in their teaching or research. By analyzing a real-life, pharmaceutical, multicomponent, target-based example the reader will experience this fascinating new discipline.
On plate graphite supported sample processing for simultaneous lipid and protein identification by matrix assisted laser desorption ionization mass spectrometry.

PubMed

Calvano, Cosima Damiana; van der Werf, Inez Dorothé; Sabbatini, Luigia; Palmisano, Francesco

2015-05-01

The simultaneous identification of lipids and proteins by matrix assisted laser desorption ionization-mass spectrometry (MALDI-MS) after direct on-plate processing of micro-samples supported on colloidal graphite is demonstrated. Taking advantages of large surface area and thermal conductivity, graphite provided an ideal substrate for on-plate proteolysis and lipid extraction. Indeed proteins could be efficiently digested on-plate within 15 min, providing sequence coverages comparable to those obtained by conventional in-solution overnight digestion. Interestingly, detection of hydrophilic phosphorylated peptides could be easily achieved without any further enrichment step. Furthermore, lipids could be simultaneously extracted/identified without any additional treatment/processing step as demonstrated for model complex samples such as milk and egg. The present approach is simple, efficient, of large applicability and offers great promise for protein and lipid identification in very small samples. Copyright © 2015 Elsevier B.V. All rights reserved.
Mississippi Curriculum Framework for Medical Assisting Technology Programs (CIP: 51.0801--Medical Assistant). Postsecondary Programs.

ERIC Educational Resources Information Center

Mississippi Research and Curriculum Unit for Vocational and Technical Education, State College.

This document, which is intended for use by community and junior colleges throughout Mississippi, contains curriculum frameworks for the course sequences in the medical assisting technology program. Presented in the introductory section are a description of the program and suggested course sequence. Section I lists baseline competencies, and…
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Mississippi Curriculum Framework for Dental Assisting Technology Programs (Program CIP: 51.0601--Dental Assistant). Postsecondary Programs.

ERIC Educational Resources Information Center

Mississippi Research and Curriculum Unit for Vocational and Technical Education, State College.

This document, which is intended for use by community and junior colleges throughout Mississippi, contains curriculum frameworks for the course sequences in the dental assisting technology program. Presented in the introductory section are a description of the program and suggested course sequence. Section I lists baseline competencies. Section II…
Mississippi Curriculum Framework for Health Care Assistant (Program CIP: 51.1614--Nursing Assistant/Aide). Postsecondary Programs.

ERIC Educational Resources Information Center

Mississippi Research and Curriculum Unit for Vocational and Technical Education, State College.

This document, which is intended for use by community and junior colleges throughout Mississippi, contains curriculum frameworks for the course sequences in the health care assistant program. Presented in the introductory section are a description of the program and suggested course sequence. Section I lists baseline competencies for the nurse…
Mississippi Curriculum Framework for Physical Therapist Assistant (CIP: 51.0806--Physical Therapy Assistant). Postsecondary Programs.

ERIC Educational Resources Information Center

Mississippi Research and Curriculum Unit for Vocational and Technical Education, State College.

This document, which is intended for use by community and junior colleges throughout Mississippi, contains curriculum frameworks for the course sequences in the physical therapy assistant program. Presented in the introductory section are a description of the program and suggested course sequence. Section I lists baseline competencies, and section…
BASiNET-BiologicAl Sequences NETwork: a case study on coding and non-coding RNAs identification.

PubMed

Ito, Eric Augusto; Katahira, Isaque; Vicente, Fábio Fernandes da Rocha; Pereira, Luiz Filipe Protasio; Lopes, Fabrício Martins

2018-06-05

With the emergence of Next Generation Sequencing (NGS) technologies, a large volume of sequence data in particular de novo sequencing was rapidly produced at relatively low costs. In this context, computational tools are increasingly important to assist in the identification of relevant information to understand the functioning of organisms. This work introduces BASiNET, an alignment-free tool for classifying biological sequences based on the feature extraction from complex network measurements. The method initially transform the sequences and represents them as complex networks. Then it extracts topological measures and constructs a feature vector that is used to classify the sequences. The method was evaluated in the classification of coding and non-coding RNAs of 13 species and compared to the CNCI, PLEK and CPC2 methods. BASiNET outperformed all compared methods in all adopted organisms and datasets. BASiNET have classified sequences in all organisms with high accuracy and low standard deviation, showing that the method is robust and non-biased by the organism. The proposed methodology is implemented in open source in R language and freely available for download at https://cran.r-project.org/package=BASiNET.
Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase)

PubMed Central

Odronitz, Florian; Kollmar, Martin

2006-01-01

Background Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. Up to date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Description Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows to conveniently browse the database and to compile tabular and graphical summaries of its content. Conclusion We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein. PMID:17134497
Application of Genomic Technologies to the Breeding of Trees

PubMed Central

Badenes, Maria L.; Fernández i Martí, Angel; Ríos, Gabino; Rubio-Cabetas, María J.

2016-01-01

The recent introduction of next generation sequencing (NGS) technologies represents a major revolution in providing new tools for identifying the genes and/or genomic intervals controlling important traits for selection in breeding programs. In perennial fruit trees with long generation times and large sizes of adult plants, the impact of these techniques is even more important. High-throughput DNA sequencing technologies have provided complete annotated sequences in many important tree species. Most of the high-throughput genotyping platforms described are being used for studies of genetic diversity and population structure. Dissection of complex traits became possible through the availability of genome sequences along with phenotypic variation data, which allow to elucidate the causative genetic differences that give rise to observed phenotypic variation. Association mapping facilitates the association between genetic markers and phenotype in unstructured and complex populations, identifying molecular markers for assisted selection and breeding. Also, genomic data provide in silico identification and characterization of genes and gene families related to important traits, enabling new tools for molecular marker assisted selection in tree breeding. Deep sequencing of transcriptomes is also a powerful tool for the analysis of precise expression levels of each gene in a sample. It consists in quantifying short cDNA reads, obtained by NGS technologies, in order to compare the entire transcriptomes between genotypes and environmental conditions. The miRNAs are non-coding short RNAs involved in the regulation of different physiological processes, which can be identified by high-throughput sequencing of RNA libraries obtained by reverse transcription of purified short RNAs, and by in silico comparison with known miRNAs from other species. All together, NGS techniques and their applications have increased the resources for plant breeding in tree species, closing the former gap of genetic tools between trees and annual species. PMID:27895664
Application of Genomic Technologies to the Breeding of Trees.

PubMed

Badenes, Maria L; Fernández I Martí, Angel; Ríos, Gabino; Rubio-Cabetas, María J

2016-01-01

The recent introduction of next generation sequencing (NGS) technologies represents a major revolution in providing new tools for identifying the genes and/or genomic intervals controlling important traits for selection in breeding programs. In perennial fruit trees with long generation times and large sizes of adult plants, the impact of these techniques is even more important. High-throughput DNA sequencing technologies have provided complete annotated sequences in many important tree species. Most of the high-throughput genotyping platforms described are being used for studies of genetic diversity and population structure. Dissection of complex traits became possible through the availability of genome sequences along with phenotypic variation data, which allow to elucidate the causative genetic differences that give rise to observed phenotypic variation. Association mapping facilitates the association between genetic markers and phenotype in unstructured and complex populations, identifying molecular markers for assisted selection and breeding. Also, genomic data provide in silico identification and characterization of genes and gene families related to important traits, enabling new tools for molecular marker assisted selection in tree breeding. Deep sequencing of transcriptomes is also a powerful tool for the analysis of precise expression levels of each gene in a sample. It consists in quantifying short cDNA reads, obtained by NGS technologies, in order to compare the entire transcriptomes between genotypes and environmental conditions. The miRNAs are non-coding short RNAs involved in the regulation of different physiological processes, which can be identified by high-throughput sequencing of RNA libraries obtained by reverse transcription of purified short RNAs, and by in silico comparison with known miRNAs from other species. All together, NGS techniques and their applications have increased the resources for plant breeding in tree species, closing the former gap of genetic tools between trees and annual species.
A physics department's role in preparing physics teachers: The Colorado learning assistant model

NASA Astrophysics Data System (ADS)

Otero, Valerie; Pollock, Steven; Finkelstein, Noah

2010-11-01

In response to substantial evidence that many U.S. students are inadequately prepared in science and mathematics, we have developed an effective and adaptable model that improves the education of all students in introductory physics and increases the numbers of talented physics majors becoming certified to teach physics. We report on the Colorado Learning Assistant model and discuss its effectiveness at a large research university. Since its inception in 2003, we have increased the pool of well-qualified K-12 physics teachers by a factor of approximately three, engaged scientists significantly in the recruiting and preparation of future teachers, and improved the introductory physics sequence so that students' learning gains are typically double the traditional average.
Real time simulation of computer-assisted sequencing of terminal area operations

NASA Technical Reports Server (NTRS)

Dear, R. G.

1981-01-01

A simulation was developed to investigate the utilization of computer assisted decision making for the task of sequencing and scheduling aircraft in a high density terminal area. The simulation incorporates a decision methodology termed Constrained Position Shifting. This methodology accounts for aircraft velocity profiles, routes, and weight classes in dynamically sequencing and scheduling arriving aircraft. A sample demonstration of Constrained Position Shifting is presented where six aircraft types (including both light and heavy aircraft) are sequenced to land at Denver's Stapleton International Airport. A graphical display is utilized and Constrained Position Shifting with a maximum shift of four positions (rearward or forward) is compared to first come, first serve with respect to arrival at the runway. The implementation of computer assisted sequencing and scheduling methodologies is investigated. A time based control concept will be required and design considerations for such a system are discussed.
DNA sequence chromatogram browsing using JAVA and CORBA.

PubMed

Parsons, J D; Buehler, E; Hillier, L

1999-03-01

DNA sequence chromatograms (traces) are the primary data source for all large-scale genomic and expressed sequence tags (ESTs) sequencing projects. Access to the sequencing trace assists many later analyses, for example contig assembly and polymorphism detection, but obtaining and using traces is problematic. Traces are not collected and published centrally, they are much larger than the base calls derived from them, and viewing them requires the interactivity of a local graphical client with local data. To provide efficient global access to DNA traces, we developed a client/server system based on flexible Java components integrated into other applications including an applet for use in a WWW browser and a stand-alone trace viewer. Client/server interaction is facilitated by CORBA middleware which provides a well-defined interface, a naming service, and location independence. [The software is packaged as a Jar file available from the following URL: http://www.ebi.ac.uk/jparsons. Links to working examples of the trace viewers can be found at http://corba.ebi.ac.uk/EST. All the Washington University mouse EST traces are available for browsing at the same URL.

Self-assembly assisted polymerization (SAAP): approaching long multi-block copolymers with an ordered chain sequence and controllable block length.

PubMed

Wu, Chi; Xie, Zuowei; Zhang, Guangzhao; Zi, Guofu; Tu, Yingfeng; Yang, Yali; Cai, Ping; Nie, Ting

2002-12-07

A combination of polymer physics and synthetic chemistry has enabled us to develop self-assembly assisted polymerization (SAAP), leading to the preparation of long multi-block copolymers with an ordered chain sequence and controllable block lengths.
A De-Novo Genome Analysis Pipeline (DeNoGAP) for large-scale comparative prokaryotic genomics studies.

PubMed

Thakur, Shalabh; Guttman, David S

2016-06-30

Comparative analysis of whole genome sequence data from closely related prokaryotic species or strains is becoming an increasingly important and accessible approach for addressing both fundamental and applied biological questions. While there are number of excellent tools developed for performing this task, most scale poorly when faced with hundreds of genome sequences, and many require extensive manual curation. We have developed a de-novo genome analysis pipeline (DeNoGAP) for the automated, iterative and high-throughput analysis of data from comparative genomics projects involving hundreds of whole genome sequences. The pipeline is designed to perform reference-assisted and de novo gene prediction, homolog protein family assignment, ortholog prediction, functional annotation, and pan-genome analysis using a range of proven tools and databases. While most existing methods scale quadratically with the number of genomes since they rely on pairwise comparisons among predicted protein sequences, DeNoGAP scales linearly since the homology assignment is based on iteratively refined hidden Markov models. This iterative clustering strategy enables DeNoGAP to handle a very large number of genomes using minimal computational resources. Moreover, the modular structure of the pipeline permits easy updates as new analysis programs become available. DeNoGAP integrates bioinformatics tools and databases for comparative analysis of a large number of genomes. The pipeline offers tools and algorithms for annotation and analysis of completed and draft genome sequences. The pipeline is developed using Perl, BioPerl and SQLite on Ubuntu Linux version 12.04 LTS. Currently, the software package accompanies script for automated installation of necessary external programs on Ubuntu Linux; however, the pipeline should be also compatible with other Linux and Unix systems after necessary external programs are installed. DeNoGAP is freely available at https://sourceforge.net/projects/denogap/ .
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

PubMed

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

PubMed Central

Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Large-Scale SNP Discovery and Genotyping for Constructing a High-Density Genetic Map of Tea Plant Using Specific-Locus Amplified Fragment Sequencing (SLAF-seq)

PubMed Central

Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang

2015-01-01

Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
TANGLE: Two-Level Support Vector Regression Approach for Protein Backbone Torsion Angle Prediction from Primary Sequences

PubMed Central

Song, Jiangning; Tan, Hao; Wang, Mingjun; Webb, Geoffrey I.; Akutsu, Tatsuya

2012-01-01

Protein backbone torsion angles (Phi) and (Psi) involve two rotation angles rotating around the Cα-N bond (Phi) and the Cα-C bond (Psi). Due to the planarity of the linked rigid peptide bonds, these two angles can essentially determine the backbone geometry of proteins. Accordingly, the accurate prediction of protein backbone torsion angle from sequence information can assist the prediction of protein structures. In this study, we develop a new approach called TANGLE (Torsion ANGLE predictor) to predict the protein backbone torsion angles from amino acid sequences. TANGLE uses a two-level support vector regression approach to perform real-value torsion angle prediction using a variety of features derived from amino acid sequences, including the evolutionary profiles in the form of position-specific scoring matrices, predicted secondary structure, solvent accessibility and natively disordered region as well as other global sequence features. When evaluated based on a large benchmark dataset of 1,526 non-homologous proteins, the mean absolute errors (MAEs) of the Phi and Psi angle prediction are 27.8° and 44.6°, respectively, which are 1% and 3% respectively lower than that using one of the state-of-the-art prediction tools ANGLOR. Moreover, the prediction of TANGLE is significantly better than a random predictor that was built on the amino acid-specific basis, with the p-value<1.46e-147 and 7.97e-150, respectively by the Wilcoxon signed rank test. As a complementary approach to the current torsion angle prediction algorithms, TANGLE should prove useful in predicting protein structural properties and assisting protein fold recognition by applying the predicted torsion angles as useful restraints. TANGLE is freely accessible at http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/TANGLE/. PMID:22319565
Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

PubMed

Janicki, Mateusz; Rooke, Rebecca; Yang, Guojun

2011-08-01

A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Utility of Combining Whole Genome Sequencing with Traditional Investigational Methods To Solve Foodborne Outbreaks of Salmonella Infections Associated with Chicken: A New Tool for Tackling This Challenging Food Vehicle.

PubMed

Crowe, Samuel J; Green, Alice; Hernandez, Kimberly; Peralta, Vi; Bottichio, Lyndsay; Defibaugh-Chavez, Stephanie; Douris, Aphrodite; Gieraltowski, Laura; Hise, Kelley; La-Pham, Karen; Neil, Karen P; Simmons, Mustafa; Tillman, Glenn; Tolar, Beth; Wagner, Darlene; Wasilenko, Jamie; Holt, Kristin; Trees, Eija; Wise, Matthew E

2017-04-01

High consumption rates and a multitude of brands make multistate foodborne outbreaks of Salmonella infections associated with chicken challenging to investigate, but whole genome sequencing is a powerful tool that can be used to assist investigators. Whole genome sequencing of pathogens isolated from clinical, environmental, and food samples is increasingly being used in multistate foodborne outbreak investigations to determine with unprecedented resolution how closely related these isolates are to one another genetically. In 2014, federal and state health officials investigated an outbreak of 146 Salmonella Heidelberg infections in 24 states. A follow-up analysis was conducted after the conclusion of the investigation in which 27 clinical and 24 food isolates from the outbreak underwent whole genome sequencing. These isolates formed seven clades, the largest of which contained clinical isolates from a subcluster of case patients who attended a catered party. One isolate from a chicken processed by a large producer was closely related genetically (zero to three single-nucleotide polymorphism differences) to the clinical isolates from these subcluster case patients. Chicken from this large producer was also present in the kitchen of the caterer on the day before the event, thus providing additional evidence that the chicken from this producer was the outbreak source. This investigation highlights how whole genome sequencing can be used with epidemiologic and traceback evidence to identify chicken sources of foodborne outbreaks.
Utility of Combining Whole Genome Sequencing with Traditional Investigational Methods To Solve Foodborne Outbreaks of Salmonella Infections Associated with Chicken: A New Tool for Tackling This Challenging Food Vehicle

PubMed Central

Crowe, Samuel J.; Green, Alice; Hernandez, Kimberly; Peralta, Vi; Bottichio, Lyndsay; Defibaugh-Chavez, Stephanie; Douris, Aphrodite; Gieraltowski, Laura; Hise, Kelley; La-Pham, Karen; Neil, Karen P.; Simmons, Mustafa; Tillman, Glenn; Tolar, Beth; Wagner, Darlene; Wasilenko, Jamie; Holt, Kristin; Trees, Eija; Wise, Matthew E.

2017-01-01

High consumption rates and a multitude of brands make multistate foodborne outbreaks of Salmonella infections associated with chicken challenging to investigate, but whole genome sequencing is a powerful tool that can be used to assist investigators. Whole genome sequencing of pathogens isolated from clinical, environmental, and food samples is increasingly being used in multistate foodborne outbreak investigations to determine with unprecedented resolution how closely related these isolates are to one another genetically. In 2014, federal and state health officials investigated an outbreak of 146 Salmonella Heidelberg infections in 24 states. A follow-up analysis was conducted after the conclusion of the investigation in which 27 clinical and 24 food isolates from the outbreak underwent whole genome sequencing. These isolates formed seven clades, the largest of which contained clinical isolates from a subcluster of case patients who attended a catered party. One isolate from a chicken processed by a large producer was closely related genetically (zero to three single-nucleotide polymorphism differences) to the clinical isolates from these subcluster case patients. Chicken from this large producer was also present in the kitchen of the caterer on the day before the event, thus providing additional evidence that the chicken from this producer was the outbreak source. This investigation highlights how whole genome sequencing can be used with epidemiologic and traceback evidence to identify chicken sources of foodborne outbreaks. PMID:28294686
De novo sequencing analysis of the Rosa roxburghii fruit transcriptome reveals putative ascorbate biosynthetic genes and EST-SSR markers.

PubMed

Yan, Xiuqin; Zhang, Xue; Lu, Min; He, Yong; An, Huaming

2015-04-25

Rosa roxburghii Tratt. is a well-known ornamental rose species native to China. In addition, the fruits of this species are valued for their nutritional and medicinal characteristics, especially their high ascorbic acid (AsA) levels. Nevertheless, AsA biosynthesis in R. roxburghii fruit has not been explored in detail because of a lack of genomic resources for this species. High-throughput transcriptomic sequencing generating large volumes of transcript sequence data can aid in gene discovery and molecular marker development. In this study, we generated more than 53 million clean reads using Illumina paired-end sequencing technology. De novo assembly yielded 106,590 unigenes, with an average length of 343 bp. On the basis of sequence similarity to known proteins, 9301 and 2393 unigenes were classified into Gene Ontology and Clusters of Orthologous Group categories, respectively. There were 7480 unigenes assigned to 124 pathways in the Kyoto Encyclopedia of Gene and Genome pathway database. BLASTx searches identified 498 unique putative transcripts encoding various transcription factors, some known to regulate fruit development. qRT-PCR validated the expressions of most of the genes encoding the main enzymes involved in ascorbate biosynthesis. In addition, 9131 potential simple sequence repeat (SSR) loci were identified among the unigenes. One hundred and two primer pairs were synthesized and 71 pairs produced an amplification product during initial screening. Among the amplified products, 30 were polymorphic in the 16 R. roxburghii germplasms tested. Our study was the first to produce a large volume of transcriptome data from R. roxburghii. The resulting sequence collection is a valuable resource for gene discovery and marker-assisted selective breeding in this rose species. Copyright © 2015 Elsevier B.V. All rights reserved.
44 CFR 206.191 - Duplication of benefits.

Code of Federal Regulations, 2014 CFR

2014-10-01

... disaster relief agencies establish and follow policies and procedures to prevent and remedy duplication... disaster relief agencies and organizations provide assistance. The specific sequence, in accordance with..., DEPARTMENT OF HOMELAND SECURITY DISASTER ASSISTANCE FEDERAL DISASTER ASSISTANCE Other Individual Assistance...
44 CFR 206.191 - Duplication of benefits.

Code of Federal Regulations, 2013 CFR

2013-10-01

... disaster relief agencies establish and follow policies and procedures to prevent and remedy duplication... disaster relief agencies and organizations provide assistance. The specific sequence, in accordance with..., DEPARTMENT OF HOMELAND SECURITY DISASTER ASSISTANCE FEDERAL DISASTER ASSISTANCE Other Individual Assistance...
44 CFR 206.191 - Duplication of benefits.

Code of Federal Regulations, 2012 CFR

2012-10-01

... disaster relief agencies establish and follow policies and procedures to prevent and remedy duplication... disaster relief agencies and organizations provide assistance. The specific sequence, in accordance with..., DEPARTMENT OF HOMELAND SECURITY DISASTER ASSISTANCE FEDERAL DISASTER ASSISTANCE Other Individual Assistance...
44 CFR 206.191 - Duplication of benefits.

Code of Federal Regulations, 2011 CFR

2011-10-01

... disaster relief agencies establish and follow policies and procedures to prevent and remedy duplication... disaster relief agencies and organizations provide assistance. The specific sequence, in accordance with..., DEPARTMENT OF HOMELAND SECURITY DISASTER ASSISTANCE FEDERAL DISASTER ASSISTANCE Other Individual Assistance...
Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

PubMed

Reiz, Bela; Li, Liang

2010-09-01

Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein. 2010 American Society for Mass Spectrometry. Published by Elsevier Inc. All rights reserved.
Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects

PubMed Central

Pandey, Manish K.; Roorkiwal, Manish; Singh, Vikas K.; Ramalingam, Abirami; Kudapa, Himabindu; Thudi, Mahendar; Chitikineni, Anu; Rathore, Abhishek; Varshney, Rajeev K.

2016-01-01

Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries. PMID:27199998
Image-based computer-assisted diagnosis system for benign paroxysmal positional vertigo

NASA Astrophysics Data System (ADS)

Kohigashi, Satoru; Nakamae, Koji; Fujioka, Hiromu

2005-04-01

We develop the image based computer assisted diagnosis system for benign paroxysmal positional vertigo (BPPV) that consists of the balance control system simulator, the 3D eye movement simulator, and the extraction method of nystagmus response directly from an eye movement image sequence. In the system, the causes and conditions of BPPV are estimated by searching the database for record matching with the nystagmus response for the observed eye image sequence of the patient with BPPV. The database includes the nystagmus responses for simulated eye movement sequences. The eye movement velocity is obtained by using the balance control system simulator that allows us to simulate BPPV under various conditions such as canalithiasis, cupulolithiasis, number of otoconia, otoconium size, and so on. Then the eye movement image sequence is displayed on the CRT by the 3D eye movement simulator. The nystagmus responses are extracted from the image sequence by the proposed method and are stored in the database. In order to enhance the diagnosis accuracy, the nystagmus response for a newly simulated sequence is matched with that for the observed sequence. From the matched simulation conditions, the causes and conditions of BPPV are estimated. We apply our image based computer assisted diagnosis system to two real eye movement image sequences for patients with BPPV to show its validity.
Pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of Plasmodium vivax in human patients.

PubMed

Merino, Emilio F; Fernandez-Becerra, Carmen; Madeira, Alda M B N; Machado, Ariane L; Durham, Alan; Gruber, Arthur; Hall, Neil; del Portillo, Hernando A

2003-07-21

Plasmodium vivax is the most widely distributed human malaria, responsible for 70-80 million clinical cases each year and large socio-economical burdens for countries such as Brazil where it is the most prevalent species. Unfortunately, due to the impossibility of growing this parasite in continuous in vitro culture, research on P. vivax remains largely neglected. A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatedly processed to remove contaminants and low quality reads. A total of 806 sequences with an average length of 586 bp met such criteria and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10(-30) was used to define a significant database match. ESTs were manually assigned a gene ontology (GO) terminology A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO terminology was assigned to 164 of them. These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC-content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and to create a gene index of this malaria parasite.
Conditional Probabilities of Large Earthquake Sequences in California from the Physics-based Rupture Simulator RSQSim

NASA Astrophysics Data System (ADS)

Gilchrist, J. J.; Jordan, T. H.; Shaw, B. E.; Milner, K. R.; Richards-Dinger, K. B.; Dieterich, J. H.

2017-12-01

Within the SCEC Collaboratory for Interseismic Simulation and Modeling (CISM), we are developing physics-based forecasting models for earthquake ruptures in California. We employ the 3D boundary element code RSQSim (Rate-State Earthquake Simulator of Dieterich & Richards-Dinger, 2010) to generate synthetic catalogs with tens of millions of events that span up to a million years each. This code models rupture nucleation by rate- and state-dependent friction and Coulomb stress transfer in complex, fully interacting fault systems. The Uniform California Earthquake Rupture Forecast Version 3 (UCERF3) fault and deformation models are used to specify the fault geometry and long-term slip rates. We have employed the Blue Waters supercomputer to generate long catalogs of simulated California seismicity from which we calculate the forecasting statistics for large events. We have performed probabilistic seismic hazard analysis with RSQSim catalogs that were calibrated with system-wide parameters and found a remarkably good agreement with UCERF3 (Milner et al., this meeting). We build on this analysis, comparing the conditional probabilities of sequences of large events from RSQSim and UCERF3. In making these comparisons, we consider the epistemic uncertainties associated with the RSQSim parameters (e.g., rate- and state-frictional parameters), as well as the effects of model-tuning (e.g., adjusting the RSQSim parameters to match UCERF3 recurrence rates). The comparisons illustrate how physics-based rupture simulators might assist forecasters in understanding the short-term hazards of large aftershocks and multi-event sequences associated with complex, multi-fault ruptures.
Techniques for Large-Scale Bacterial Genome Manipulation and Characterization of the Mutants with Respect to In Silico Metabolic Reconstructions.

PubMed

diCenzo, George C; Finan, Turlough M

2018-01-01

The rate at which all genes within a bacterial genome can be identified far exceeds the ability to characterize these genes. To assist in associating genes with cellular functions, a large-scale bacterial genome deletion approach can be employed to rapidly screen tens to thousands of genes for desired phenotypes. Here, we provide a detailed protocol for the generation of deletions of large segments of bacterial genomes that relies on the activity of a site-specific recombinase. In this procedure, two recombinase recognition target sequences are introduced into known positions of a bacterial genome through single cross-over plasmid integration. Subsequent expression of the site-specific recombinase mediates recombination between the two target sequences, resulting in the excision of the intervening region and its loss from the genome. We further illustrate how this deletion system can be readily adapted to function as a large-scale in vivo cloning procedure, in which the region excised from the genome is captured as a replicative plasmid. We next provide a procedure for the metabolic analysis of bacterial large-scale genome deletion mutants using the Biolog Phenotype MicroArray™ system. Finally, a pipeline is described, and a sample Matlab script is provided, for the integration of the obtained data with a draft metabolic reconstruction for the refinement of the reactions and gene-protein-reaction relationships in a metabolic reconstruction.

De Novo Transcriptome Sequencing Analysis of cDNA Library and Large-Scale Unigene Assembly in Japanese Red Pine (Pinus densiflora)

PubMed Central

Liu, Le; Zhang, Shijie; Lian, Chunlan

2015-01-01

Japanese red pine (Pinus densiflora) is extensively cultivated in Japan, Korea, China, and Russia and is harvested for timber, pulpwood, garden, and paper markets. However, genetic information and molecular markers were very scarce for this species. In this study, over 51 million sequencing clean reads from P. densiflora mRNA were produced using Illumina paired-end sequencing technology. It yielded 83,913 unigenes with a mean length of 751 bp, of which 54,530 (64.98%) unigenes showed similarity to sequences in the NCBI database. Among which the best matches in the NCBI Nr database were Picea sitchensis (41.60%), Amborella trichopoda (9.83%), and Pinus taeda (4.15%). A total of 1953 putative microsatellites were identified in 1784 unigenes using MISA (MicroSAtellite) software, of which the tri-nucleotide repeats were most abundant (50.18%) and 629 EST-SSR (expressed sequence tag- simple sequence repeats) primer pairs were successfully designed. Among 20 EST-SSR primer pairs randomly chosen, 17 markers yielded amplification products of the expected size in P. densiflora. Our results will provide a valuable resource for gene-function analysis, germplasm identification, molecular marker-assisted breeding and resistance-related gene(s) mapping for pine for P. densiflora. PMID:26690126
Asymptotic response of observables from divergent weak-coupling expansions: a fractional-calculus-assisted Padé technique.

PubMed

Dhatt, Sharmistha; Bhattacharyya, Kamal

2012-08-01

Appropriate constructions of Padé approximants are believed to provide reasonable estimates of the asymptotic (large-coupling) amplitude and exponent of an observable, given its weak-coupling expansion to some desired order. In many instances, however, sequences of such approximants are seen to converge very poorly. We outline here a strategy that exploits the idea of fractional calculus to considerably improve the convergence behavior. Pilot calculations on the ground-state perturbative energy series of quartic, sextic, and octic anharmonic oscillators reveal clearly the worth of our endeavor.
Evolutionary and comparative analyses of the soybean genome

PubMed Central

Cannon, Steven B.; Shoemaker, Randy C.

2012-01-01

The soybean genome assembly has been available since the end of 2008. Significant features of the genome include large, gene-poor, repeat-dense pericentromeric regions, spanning roughly 57% of the genome sequence; a relatively large genome size of ~1.15 billion bases; remnants of a genome duplication that occurred ~13 million years ago (Mya); and fainter remnants of older polyploidies that occurred ~58 Mya and >130 Mya. The genome sequence has been used to identify the genetic basis for numerous traits, including disease resistance, nutritional characteristics, and developmental features. The genome sequence has provided a scaffold for placement of many genomic feature elements, both from within soybean and from related species. These may be accessed at several websites, including http://www.phytozome.net, http://soybase.org, http://comparative-legumes.org, and http://www.legumebase.brc.miyazaki-u.ac.jp. The taxonomic position of soybean in the Phaseoleae tribe of the legumes means that there are approximately two dozen other beans and relatives that have undergone independent domestication, and which may have traits that will be useful for transfer to soybean. Methods of translating information between species in the Phaseoleae range from design of markers for marker assisted selection, to transformation with Agrobacterium or with other experimental transformation methods. PMID:23136483
Outcomes following prehospital airway management in severe traumatic brain injury.

PubMed

Sobuwa, Simpiwe; Hartzenberg, Henry B; Geduld, Heike; Uys, Corrie

2013-07-29

Prevention of hypoxia and thus secondary brain injury in traumatic brain injury (TBI) is critical. However there is controversy regarding the role of endotracheal intubation in the prehospital management of TBI. To describe the outcome of TBI with various airway management methods employed in the prehospital setting in the Cape Town Metropole. The study was a cohort descriptive observational analysis of 124 consecutively injured adult patients who were admitted for severe TBI (Glasgow Coma Score ≤8) to Groote Schuur and Tygerberg hospitals between 1 January 2009 and 31 August 2011. Patients were categorised by their method of airway management: rapid sequence intubation (RSI), sedation-assisted intubation, failed intubation, basic airway management, and intubated without drugs. Good outcomes were defined by a Glasgow Outcome Score of 4 - 5. There was a statistically significant association between airway management and outcome (p=0.013). Patients who underwent basic airway management had a higher proportion of a good outcome (72.9%) than patients who were intubated in the prehospital setting. A good outcome was observed with 61.8% and 38.4% of patients who experienced sedation-assisted intubation and RSI, respectively. Patients intubated without drugs had the poorest outcome (88%), followed by rapid sequence intubation (61.5%) and by the sedation assisted group (38.2%). Prehospital intubation did not demonstrate improved outcomes over basic airway management in patients with severe TBI. A large prospective, randomised trial is warranted to yield some insight into how these airway interventions influence outcome in severe TBI.
Strange Beta: Chaotic Variations for Indoor Rock Climbing Route Setting

NASA Astrophysics Data System (ADS)

Phillips, Caleb; Bradley, Elizabeth

2011-04-01

In this paper we apply chaotic systems to the task of sequence variation for the purpose of aiding humans in setting indoor rock climbing routes. This work expands on prior work where similar variations were used to assist in dance choreography and music composition. We present a formalization for transcription of rock climbing problems and a variation generator that is tuned for this domain and addresses some confounding problems, including a new approach to automatic selection of initial conditions. We analyze our system with a large blinded study in a commercial climbing gym in cooperation with experienced climbers and expert route setters. Our results show that our system is capable of assisting a human setter in producing routes that are at least as good as, and in some cases better than, those produced traditionally.
An adaptive filter method for spacecraft using gravity assist

NASA Astrophysics Data System (ADS)

Ning, Xiaolin; Huang, Panpan; Fang, Jiancheng; Liu, Gang; Ge, Shuzhi Sam

2015-04-01

Celestial navigation (CeleNav) has been successfully used during gravity assist (GA) flyby for orbit determination in many deep space missions. Due to spacecraft attitude errors, ephemeris errors, the camera center-finding bias, and the frequency of the images before and after the GA flyby, the statistics of measurement noise cannot be accurately determined, and yet have time-varying characteristics, which may introduce large estimation error and even cause filter divergence. In this paper, an unscented Kalman filter (UKF) with adaptive measurement noise covariance, called ARUKF, is proposed to deal with this problem. ARUKF scales the measurement noise covariance according to the changes in innovation and residual sequences. Simulations demonstrate that ARUKF is robust to the inaccurate initial measurement noise covariance matrix and time-varying measurement noise. The impact factors in the ARUKF are also investigated.
A new method for optimization of low-thrust gravity-assist sequences

NASA Astrophysics Data System (ADS)

Maiwald, V.

2017-09-01

Recently missions like Hayabusa and Dawn have shown the relevance and benefits of low-thrust spacecraft concerning the exploration of our solar system. In general, the efficiency of low-thrust propulsion is one means of improving mission payload mass. At the same time, gravity-assist maneuvers can serve as mission enablers, as they have the capability to provide "free energy." A combination of both, gravity-assist and low-thrust propulsion, has the potential to generally improve mission performance, i.e. planning and optimization of gravity-assist sequences for low-thrust missions is a desirable asset. Currently no established methods exist to include the gravity-assist partners as optimization variable for low-thrust missions. The present paper explains how gravity-assists are planned and optimized, including the gravity-assist partners, for high-thrust missions and discusses the possibility to transfer the established method, based on the Tisserand Criterion, to low-thrust missions. It is shown how the Tisserand Criterion needs to be adapted using a correction term for the low-thrust situation. It is explained why this necessary correction term excludes an a priori evaluation of sequences and therefore their planning and an alternate approach is proposed. Preliminary results of this method, by application of a Differential Evolution optimization algorithm, are presented and discussed, showing that the method is valid but can be improved. Two constraints on the search space are briefly presented for that aim.
Application of whole genome re-sequencing data in the development of diagnostic DNA markers tightly linked to a disease-resistance locus for marker-assisted selection in lupin (Lupinus angustifolius).

PubMed

Yang, Huaan; Jian, Jianbo; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark W; Tan, Cong; Li, Chengdao

2015-09-02

Molecular marker-assisted breeding provides an efficient tool to develop improved crop varieties. A major challenge for the broad application of markers in marker-assisted selection is that the marker phenotypes must match plant phenotypes in a wide range of breeding germplasm. In this study, we used the legume crop species Lupinus angustifolius (lupin) to demonstrate the utility of whole genome sequencing and re-sequencing on the development of diagnostic markers for molecular plant breeding. Nine lupin cultivars released in Australia from 1973 to 2007 were subjected to whole genome re-sequencing. The re-sequencing data together with the reference genome sequence data were used in marker development, which revealed 180,596 to 795,735 SNP markers from pairwise comparisons among the cultivars. A total of 207,887 markers were anchored on the lupin genetic linkage map. Marker mining obtained an average of 387 SNP markers and 87 InDel markers for each of the 24 genome sequence assembly scaffolds bearing markers linked to 11 genes of agronomic interest. Using the R gene PhtjR conferring resistance to phomopsis stem blight disease as a test case, we discovered 17 candidate diagnostic markers by genotyping and selecting markers on a genetic linkage map. A further 243 candidate diagnostic markers were discovered by marker mining on a scaffold bearing non-diagnostic markers linked to the PhtjR gene. Nine out from the ten tested candidate diagnostic markers were confirmed as truly diagnostic on a broad range of commercial cultivars. Markers developed using these strategies meet the requirements for broad application in molecular plant breeding. We demonstrated that low-cost genome sequencing and re-sequencing data were sufficient and very effective in the development of diagnostic markers for marker-assisted selection. The strategies used in this study may be applied to any trait or plant species. Whole genome sequencing and re-sequencing provides a powerful tool to overcome current limitations in molecular plant breeding, which will enable plant breeders to precisely pyramid favourable genes to develop super crop varieties to meet future food demands.
Signatures of selection among sex-determining alleles of the honey bee.

PubMed

Hasselmann, Martin; Beye, Martin

2004-04-06

Patterns of DNA polymorphisms are a primary tool for dissecting signatures of selection; however, the underlying selective forces are poorly understood for most genes. A classical example of diversifying selection is the complementary sex-determining locus that is found in the very large insect order Hymenoptera (bees, wasps, ants, and sawflies). The gene responsible for sex determination, the complementary sex determiner (csd), has been most recently identified in the honey bee. Females are heterozygous at this locus. Males result when there is only one functional allele present, as a result of either homozygosity (fertilized eggs) or, more commonly, hemizygosity (unfertilized eggs). The homozygotes, diploid males, do not reproduce and have zero fitness, which implies positive selection in favor of rare alleles. Large differences in csd cDNA sequences within and between four populations were found that fall into two major groups, types I and II. Type I consists of several allelic lineages that were maintained over an extended period, an indication of balancing selection. Diversifying selection has operated on several confined parts of the protein, as shown by an excess of nonsynonymous differences. Elevated sequence differences indicate another selected part near a repeat region. These findings have general implications about the understanding of both the function of the multiallelic mechanism and the adaptive processes on the level of nucleotide sequences. Moreover, the first csd sequence data are a notable basis for the avoidance of diploid males in bee selection programs by allele-assisted breeding.
Top-Down-Assisted Bottom-Up Method for Homologous Protein Sequencing: Hemoglobin from 33 Bird Species

NASA Astrophysics Data System (ADS)

Song, Yang; Laskay, Ünige A.; Vilcins, Inger-Marie E.; Barbour, Alan G.; Wysocki, Vicki H.

2015-11-01

Ticks are vectors for disease transmission because they are indiscriminant in their feeding on multiple vertebrate hosts, transmitting pathogens between their hosts. Identifying the hosts on which ticks have fed is important for disease prevention and intervention. We have previously shown that hemoglobin (Hb) remnants from a host on which a tick fed can be used to reveal the host's identity. For the present research, blood was collected from 33 bird species that are common in the U.S. as hosts for ticks but that have unknown Hb sequences. A top-down-assisted bottom-up mass spectrometry approach with a customized searching database, based on variability in known bird hemoglobin sequences, has been devised to facilitate fast and complete sequencing of hemoglobin from birds with unknown sequences. These hemoglobin sequences will be added to a hemoglobin database and used for tick host identification. The general approach has the potential to sequence any set of homologous proteins completely in a rapid manner.
Merlin: Computer-Aided Oligonucleotide Design for Large Scale Genome Engineering with MAGE.

PubMed

Quintin, Michael; Ma, Natalie J; Ahmed, Samir; Bhatia, Swapnil; Lewis, Aaron; Isaacs, Farren J; Densmore, Douglas

2016-06-17

Genome engineering technologies now enable precise manipulation of organism genotype, but can be limited in scalability by their design requirements. Here we describe Merlin ( http://merlincad.org ), an open-source web-based tool to assist biologists in designing experiments using multiplex automated genome engineering (MAGE). Merlin provides methods to generate pools of single-stranded DNA oligonucleotides (oligos) for MAGE experiments by performing free energy calculation and BLAST scoring on a sliding window spanning the targeted site. These oligos are designed not only to improve recombination efficiency, but also to minimize off-target interactions. The application further assists experiment planning by reporting predicted allelic replacement rates after multiple MAGE cycles, and enables rapid result validation by generating primer sequences for multiplexed allele-specific colony PCR. Here we describe the Merlin oligo and primer design procedures and validate their functionality compared to OptMAGE by eliminating seven AvrII restriction sites from the Escherichia coli genome.
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization

DOE PAGES

Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.; ...

2017-08-29

The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.

The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Large-scale transcriptome characterization and mass discovery of SNPs in globe artichoke and its related taxa.

PubMed

Scaglione, Davide; Lanteri, Sergio; Acquadro, Alberto; Lai, Zhao; Knapp, Steven J; Rieseberg, Loren; Portis, Ezio

2012-10-01

Cynara cardunculus (2n = 2× = 34) is a member of the Asteraceae family that contributes significantly to the agricultural economy of the Mediterranean basin. The species includes two cultivated varieties, globe artichoke and cardoon, which are grown mainly for food. Cynara cardunculus is an orphan crop species whose genome/transcriptome has been relatively unexplored, especially in comparison to other Asteraceae crops. Hence, there is a significant need to improve its genomic resources through the identification of novel genes and sequence-based markers, to design new breeding schemes aimed at increasing quality and crop productivity. We report the outcome of cDNA sequencing and assembly for eleven accessions of C. cardunculus. Sequencing of three mapping parental genotypes using Roche 454-Titanium technology generated 1.7 × 10⁶ reads, which were assembled into 38,726 reference transcripts covering 32 Mbp. Putative enzyme-encoding genes were annotated using the KEGG-database. Transcription factors and candidate resistance genes were surveyed as well. Paired-end sequencing was done for cDNA libraries of eight other representative C. cardunculus accessions on an Illumina Genome Analyzer IIx, generating 46 × 10⁶ reads. Alignment of the IGA and 454 reads to reference transcripts led to the identification of 195,400 SNPs with a Bayesian probability exceeding 95%; a validation rate of 90% was obtained by Sanger-sequencing of a subset of contigs. These results demonstrate that the integration of data from different NGS platforms enables large-scale transcriptome characterization, along with massive SNP discovery. This information will contribute to the dissection of key agricultural traits in C. cardunculus and facilitate the implementation of marker-assisted selection programs. © 2012 The Authors. Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements)

PubMed Central

Giresi, Paul G.; Lieb, Jason D.

2009-01-01

The binding of sequence-specific regulatory factors and the recruitment of chromatin remodeling activities cause nucleosomes to be evicted from chromatin in eukaryotic cells. Traditionally, these active sites have been identified experimentally through their sensitivity to nucleases. Here we describe the details of a simple procedure for the genome-wide isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). We also provide protocols for different methods of detecting FAIRE-enriched DNA, including use of PCR, DNA microarrays, and next-generation sequencing. FAIRE works on all eukaryotic chromatin tested to date. To perform FAIRE, chromatin is crosslinked with formaldehyde, sheared by sonication, and phenol-chloroform extracted. Most genomic DNA is crosslinked to nucleosomes and is sequestered to the interphase, whereas DNA recovered in the aqueous phase corresponds to nucleosome-depleted regions of the genome. The isolated regions are largely coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, enhancers, insulators, and active promoters. Given its speed and simplicity, FAIRE has utility in establishing chromatin profiles of diverse cell types in health and disease, isolating DNA regulatory elements en masse for further characterization, and as a screening assay for the effects of small molecules on chromatin organization. PMID:19303047
A taxonomic review of viruses infecting crustaceans with an emphasis on wild hosts.

PubMed

Bateman, K S; Stentiford, G D

2017-07-01

Numerous infections by viral pathogens have been described from wild and cultured crustacean hosts, yet relatively few of these pathogens have been formally characterised and classified. To date viruses have generally been tentatively assigned to families based upon morphological and developmental characteristics and their location of infection within the host cell. Often nucleotide sequence information is unavailable. Some of these viral infections have caused well-documented devastating consequences on the global crustacean farming industry whilst their effects on wild populations remain largely unstudied. This paper provides an up to date review of all known viruses described infecting crustacean hosts. Full characterisation and harmonisation of these descriptions utilising specifications proposed by the International Committee on Taxonomy of Viruses (ICTV) is required to synonymise numerous examples of differential naming or abbreviation of naming, of the same virus in some cases. Development and application of techniques such as viral purification and high throughput sequencing of viral genomes will assist with these full descriptions and, provide appropriate diagnostic targets for surveillance of known and novel relatives. This review also highlights the importance of comparative study with viruses infecting insects and other arthropods to assist this process. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Indel PDB: a database of structural insertions and deletions derived from sequence alignments of closely related proteins.

PubMed

Hsing, Michael; Cherkasov, Artem

2008-06-25

Insertions and deletions (indels) represent a common type of sequence variations, which are less studied and pose many important biological questions. Recent research has shown that the presence of sizable indels in protein sequences may be indicative of protein essentiality and their role in protein interaction networks. Examples of utilization of indels for structure-based drug design have also been recently demonstrated. Nonetheless many structural and functional characteristics of indels remain less researched or unknown. We have created a web-based resource, Indel PDB, representing a structural database of insertions/deletions identified from the sequence alignments of highly similar proteins found in the Protein Data Bank (PDB). Indel PDB utilized large amounts of available structural information to characterize 1-, 2- and 3-dimensional features of indel sites. Indel PDB contains 117,266 non-redundant indel sites extracted from 11,294 indel-containing proteins. Unlike loop databases, Indel PDB features more indel sequences with secondary structures including alpha-helices and beta-sheets in addition to loops. The insertion fragments have been characterized by their sequences, lengths, locations, secondary structure composition, solvent accessibility, protein domain association and three dimensional structures. By utilizing the data available in Indel PDB, we have studied and presented here several sequence and structural features of indels. We anticipate that Indel PDB will not only enable future functional studies of indels, but will also assist protein modeling efforts and identification of indel-directed drug binding sites.
Population structure of pigs determined by single nucleotide polymorphisms observed in assembled expressed sequence tags.

PubMed

Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi

2012-01-01

We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values(,) the clustering, based on the Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.
Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

PubMed

Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

2017-01-01

Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.
Integration of targeted sequencing and NIPT into clinical practice in a Chinese family with maple syrup urine disease.

PubMed

You, Yanqin; Sun, Yan; Li, Xuchao; Li, Yali; Wei, Xiaoming; Chen, Fang; Ge, Huijuan; Lan, Zhangzhang; Zhu, Qian; Tang, Ying; Wang, Shujuan; Gao, Ya; Jiang, Fuman; Song, Jiaping; Shi, Quan; Zhu, Xuan; Mu, Feng; Dong, Wei; Gao, Vince; Jiang, Hui; Yi, Xin; Wang, Wei; Gao, Zhiying

2014-08-01

This article demonstrates a prominent noninvasive prenatal approach to assist the clinical diagnosis of a single-gene disorder disease, maple syrup urine disease, using targeted sequencing knowledge from the affected family. The method reported here combines novel mutant discovery in known genes by targeted massively parallel sequencing with noninvasive prenatal testing. By applying this new strategy, we successfully revealed novel mutations in the gene BCKDHA (Ex2_4dup and c.392A>G) in this Chinese family and developed a prenatal haplotype-assisted approach to noninvasively detect the genotype of the fetus (transmitted from both parents). This is the first report of integration of targeted sequencing and noninvasive prenatal testing into clinical practice. Our study has demonstrated that this massively parallel sequencing-based strategy can potentially be used for single-gene disorder diagnosis in the future.

Identification of multiple mRNA and DNA sequences from small tissue samples isolated by laser-assisted microdissection.

PubMed

Bernsen, M R; Dijkman, H B; de Vries, E; Figdor, C G; Ruiter, D J; Adema, G J; van Muijen, G N

1998-10-01

Molecular analysis of small tissue samples has become increasingly important in biomedical studies. Using a laser dissection microscope and modified nucleic acid isolation protocols, we demonstrate that multiple mRNA as well as DNA sequences can be identified from a single-cell sample. In addition, we show that the specificity of procurement of tissue samples is not compromised by smear contamination resulting from scraping of the microtome knife during sectioning of lesions. The procedures described herein thus allow for efficient RT-PCR or PCR analysis of multiple nucleic acid sequences from small tissue samples obtained by laser-assisted microdissection.
[Prospects of molecular breeding in medical plants].

PubMed

Ma, Xiao-Jun; Mo, Chang-Ming

2017-06-01

The molecular-assisted breeding, transgenic breeding and molecular designing breeding are three development directions of plant molecular breeding. Base on these three development directions, this paper summarizes developing status and new tendency of research field of genetic linkage mapping, QTL mapping, association mapping, molecular-assisted selections, pollen-mediated transformations, agrobacterium-mediated transformations, particle gun-mediated transformations, genome editing technologies, whole-genome sequencing, transcriptome sequencing, proteome sequencing and varietal molecular designing. The objective and existing problem of medical plant molecular breeding were discussed the prospect of these three molecular breeding technologies application on medical plant molecular breeding was outlooked. Copyright© by the Chinese Pharmaceutical Association.
Low-Latency Telerobotic Sample Return and Biomolecular Sequencing for Deep Space Gateway

NASA Astrophysics Data System (ADS)

Lupisella, M.; Bleacher, J.; Lewis, R.; Dworkin, J.; Wright, M.; Burton, A.; Rubins, K.; Wallace, S.; Stahl, S.; John, K.; Archer, D.; Niles, P.; Regberg, A.; Smith, D.; Race, M.; Chiu, C.; Russell, J.; Rampe, E.; Bywaters, K.

2018-02-01

Low-latency telerobotics, crew-assisted sample return, and biomolecular sequencing can be used to acquire and analyze lunar farside and/or Apollo landing site samples. Sequencing can also be used to monitor and study Deep Space Gateway environment and crew health.
Cross-Species Functionality of Pararetroviral Elements Driving Ribosome Shunting

PubMed Central

Pooggin, Mikhail M.; Fütterer, Johannes; Hohn, Thomas

2008-01-01

Background Cauliflower mosaic virus (CaMV) and Rice tungro bacilliform virus (RTBV) belong to distinct genera of pararetroviruses infecting dicot and monocot plants, respectively. In both viruses, polycistronic translation of pregenomic (pg) RNA is initiated by shunting ribosomes that bypass a large region of the pgRNA leader with several short (s)ORFs and a stable stem-loop structure. The shunt requires translation of a 5′-proximal sORF terminating near the stem. In CaMV, mutations knocking out this sORF nearly abolish shunting and virus viability. Methodology/Principal Findings Here we show that two distant regions of the CaMV leader that form a minimal shunt configuration comprising the sORF, a bottom part of the stem, and a shunt landing sequence can be replaced by heterologous sequences that form a structurally similar configuration in RTBV without any dramatic effect on shunt-mediated translation and CaMV infectivity. The CaMV-RTBV chimeric leader sequence was largely stable over five viral passages in turnip plants: a few alterations that did eventually occur in the virus progenies are indicative of fine tuning of the chimeric sequence during adaptation to a new host. Conclusions/Significance Our findings demonstrate cross-species functionality of pararetroviral cis-elements driving ribosome shunting and evolutionary conservation of the shunt mechanism. We are grateful to Matthias Müller and Sandra Pauli for technical assistance. This work was initiated at Friedrich Miescher Institute (Basel, Switzerland). We thank Prof. Thomas Boller for hosting the group at the Institute of Botany. PMID:18286203
Whole Genome Sequencing of Greater Amberjack (Seriola dumerili) for SNP Identification on Aligned Scaffolds and Genome Structural Variation Analysis Using Parallel Resequencing

PubMed Central

Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori

2018-01-01

Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397
A "signal on" protection-displacement-hybridization-based electrochemical hepatitis B virus gene sequence sensor with high sensitivity and peculiar adjustable specificity.

PubMed

Li, Fengqin; Xu, Yanmei; Yu, Xiang; Yu, Zhigang; He, Xunjun; Ji, Hongrui; Dong, Jinghao; Song, Yongbin; Yan, Hong; Zhang, Guiling

2016-08-15

One "signal on" electrochemical sensing strategy was constructed for the detection of a specific hepatitis B virus (HBV) gene sequence based on the protection-displacement-hybridization-based (PDHB) signaling mechanism. This sensing system is composed of three probes, one capturing probe (CP) and one assistant probe (AP) which are co-immobilized on the Au electrode surface, and one 3-methylene blue (MB) modified signaling probe (SP) free in the detection solution. One duplex are formed between AP and SP with the target, a specific HBV gene sequence, hybridizing with CP. This structure can drive the MB labels close to the electrode surface, thereby producing a large detection current. Two electrochemical testing techniques, alternating current voltammetry (ACV) and cyclic voltammetry (CV), were used for characterizing the sensor. Under the optimized conditions, the proposed sensor exhibits a high sensitivity with the detection limit of ∼5fM for the target. When used for the discrimination of point mutation, the sensor also features an outstanding ability and its peculiar high adjustability. Copyright © 2016 Elsevier B.V. All rights reserved.
An Efficient Approach for the Development of Locus Specific Primers in Bread Wheat (Triticum aestivum L.) and Its Application to Re-Sequencing of Genes Involved in Frost Tolerance

PubMed Central

Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank

2015-01-01

Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to developlocus specific primers suitable to be used in marker assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv); testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
A Deep Insight Into the Sialotranscriptome of the Chagas Disease Vector, Panstrongylus megistus (Hemiptera: Heteroptera)

PubMed Central

Ribeiro, José M. C.; Schwarz, Alexandra; Francischetti, Ivo M. B.

2015-01-01

Saliva of blood-sucking arthropods contains a complex cocktail of pharmacologically active compounds that assists feeding by counteracting their hosts’ hemostatic and inflammatory reactions. Panstrongylus megistus (Burmeister) is an important vector of Chagas disease in South America, but despite its importance there is only one salivary protein sequence publicly deposited in GenBank. In the present work, we used Illumina technology to disclose and publicly deposit 3,703 coding sequences obtained from the assembly of >70 million reads. These sequences should assist proteomic experiments aimed at identifying pharmacologically active proteins and immunological markers of vector exposure. A supplemental file of the transcriptome and deducted protein sequences can be obtained from http://exon.niaid.nih.gov/transcriptome/P_megistus/Pmeg-web.xlsx. PMID:26334808
[DNA marker-assisted selection of medicinal plants (Ⅰ) .Breeding research of disease-resistant cultivars of Panax notoginseng].

PubMed

Dong, Lin-Lin; Chen, Zhong-Jian; Wang, Yong; Wei, Fu-Gang; Zhang, Lian-Juan; Xu, Jiang; Wei, Guang-Fei; Wang, Rui; Yang, Juan; Liu, Wei-Lin; Li, Xi-Wen; Yu, Yu-Qi; Chen, Shi-Lin

2017-01-01

DNA marker-assisted selection of medicinal plants is based on the DNA polymorphism, selects the DNA sequences related to the phenotypes such as high yields, superior quality, stress-resistance and so on according to the technologies of molecular hybridization, polymerase chain reaction and high-throughput sequencing, and assists the breeding of new cultivars. This study bred the first disease-resistant cultivar of notoginseng "Miaoxiang Kangqi 1" using the technology of DNA marker-assisted selection of medicinal plants and systematic breeding. The disease-resistant cultivar of notoginseng contained 12 special SNPs based on the analysis of Restriction-site Associated DNA Sequencing (RAD-Seq). Among the SNP (record_519688) was related to the root rot-resistant characteristics, which indicated this SNP could serve as genetic markers of disease-resistant cultivars and assist the systematic breeding. Compared to the conventional cultivated cultivars, the incidence rate of root-rot and rust-rot in notoginseng seedlings decreased by 83.6% and 71.8%, respectively. The incidence rate of root-rot respectively declined by 43.6% and 62.9% in notoginseng cultivation for 2 and 3 years compared with those of the conventional cultivated cultivars. Additionally, the potential disease-resistant groups were screened based on the relative SNP, and this model enlarged the target groups and advanced the breeding efficiency. DNA marker-assisted selection of medicinal plants accelerated the breeding and promotion of new cultivars, and guaranteed the healthy development of Chinese medicinal materials industry. Copyright© by the Chinese Pharmaceutical Association.
SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing.

PubMed

Sato, Yukuto; Kojima, Kaname; Nariai, Naoki; Yamaguchi-Kabata, Yumi; Kawai, Yosuke; Takahashi, Mamoru; Mimori, Takahiro; Nagasaki, Masao

2014-08-08

Next-generation sequencers (NGSs) have become one of the main tools for current biology. To obtain useful insights from the NGS data, it is essential to control low-quality portions of the data affected by technical errors such as air bubbles in sequencing fluidics. We develop a software SUGAR (subtile-based GUI-assisted refiner) which can handle ultra-high-throughput data with user-friendly graphical user interface (GUI) and interactive analysis capability. The SUGAR generates high-resolution quality heatmaps of the flowcell, enabling users to find possible signals of technical errors during the sequencing. The sequencing data generated from the error-affected regions of a flowcell can be selectively removed by automated analysis or GUI-assisted operations implemented in the SUGAR. The automated data-cleaning function based on sequence read quality (Phred) scores was applied to a public whole human genome sequencing data and we proved the overall mapping quality was improved. The detailed data evaluation and cleaning enabled by SUGAR would reduce technical problems in sequence read mapping, improving subsequent variant analysis that require high-quality sequence data and mapping results. Therefore, the software will be especially useful to control the quality of variant calls to the low population cells, e.g., cancers, in a sample with technical errors of sequencing procedures.
Protein Annotators' Assistant: A Novel Application of Information Retrieval Techniques.

ERIC Educational Resources Information Center

Wise, Michael J.

2000-01-01

Protein Annotators' Assistant (PAA) is a software system which assists protein annotators in assigning functions to newly sequenced proteins. PAA employs a number of information retrieval techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the…
Complete Genome Sequence of Zucchini Yellow Mosaic Virus Strain Kurdistan, Iran.

PubMed

Maghamnia, Hamid Reza; Hajizadeh, Mohammad; Azizi, Abdolbaset

2018-03-01

The complete genome sequence of Zucchini yellow mosaic virus strain Kurdistan (ZYMV-Kurdistan) infecting squash from Iran was determined from 13 overlapping fragments. Excluding the poly (A) tail, ZYMV-Kurdistan genome consisted of 9593 nucleotides (nt), with 138 and 211 nt at the 5' and 3' non-translated regions, respectively. It contained two open-reading frames (ORFs), the large ORF encoding a polyprotein of 3080 amino acids (aa) and the small overlapping ORF encoding a P3N-PIPO protein of 74 aa. This isolate had six unique aa differences compared to other ZYMV isolates and shared 79.6-98.8% identities with other ZYMV genome sequences at the nt level and 90.1-99% identities at the aa level. A phylogenetic tree of ZYMV complete genomic sequences showed that Iranian and Central European isolates are closely related and form a phylogenetically homogenous group. All values in the ratio of substitution rates at non-synonymous and synonymous sites ( d N / d S ) were below 1, suggestive of strong negative selection forces during ZYMV protein history. This is the first report of complete genome sequence information of the most prevalent virus in the west of Iran. This study helps our understanding of the genetic diversity of ZYMV isolates infecting cucurbit plants in Iran, virus evolution and epidemiology and can assist in designing better diagnostic tools.
Two Related Parametric Integrals

ERIC Educational Resources Information Center

Dana-Picard, T.

2007-01-01

Two related sequences of definite integrals are considered. By mixing hand-work, computer algebra system assistance and websurfing, fine connections can be studied between integrals and a couple of interesting sequences of integers. (Contains 4 tables.)
A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

PubMed Central

Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P

2008-01-01

Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to compliment existing morphological marker-assisted breeding in blackberry. PMID:18570660
AmphiBase: A new genomic resource for non-model amphibian species.

PubMed

Kwon, Taejoon

2017-01-01

More than five thousand genes annotated in the recently published Xenopus laevis and Xenopus tropicalis genomes do not have a candidate orthologous counterpart in other vertebrate species. To determine whether these sequences represent genuine amphibian-specific genes or annotation errors, it is necessary to analyze them alongside sequences from other amphibian species. However, due to large genome sizes and an abundance of repeat sequences, there are limited numbers of gene sequences available from amphibian species other than Xenopus. AmphiBase is a new genomic resource covering non-model amphibian species, based on public domain transcriptome data and computational methods developed during the X. laevis genome project. Here, I review the current status of AmphiBase, including amphibian species with available transcriptome data or biological samples, and describe the challenges of building a comprehensive amphibian genomic resource in the absence of genomes. This mini-review will be informative for researchers interested in functional genomic experiments using amphibian model organisms, such as Xenopus and axolotl, and will assist in interpretation of results implicating "orphan genes." Additionally, this study highlights an opportunity for researchers working on non-model amphibian species to collaborate in their future efforts and develop amphibian genomic resources as a community. © 2017 Wiley Periodicals, Inc.
Genomics-assisted breeding in fruit trees.

PubMed

Iwata, Hiroyoshi; Minamikawa, Mai F; Kajiya-Kanegae, Hiromi; Ishimori, Motoyuki; Hayashi, Takeshi

2016-01-01

Recent advancements in genomic analysis technologies have opened up new avenues to promote the efficiency of plant breeding. Novel genomics-based approaches for plant breeding and genetics research, such as genome-wide association studies (GWAS) and genomic selection (GS), are useful, especially in fruit tree breeding. The breeding of fruit trees is hindered by their long generation time, large plant size, long juvenile phase, and the necessity to wait for the physiological maturity of the plant to assess the marketable product (fruit). In this article, we describe the potential of genomics-assisted breeding, which uses these novel genomics-based approaches, to break through these barriers in conventional fruit tree breeding. We first introduce the molecular marker systems and whole-genome sequence data that are available for fruit tree breeding. Next we introduce the statistical methods for biparental linkage and quantitative trait locus (QTL) mapping as well as GWAS and GS. We then review QTL mapping, GWAS, and GS studies conducted on fruit trees. We also review novel technologies for rapid generation advancement. Finally, we note the future prospects of genomics-assisted fruit tree breeding and problems that need to be overcome in the breeding.
Genomics-assisted breeding in fruit trees

PubMed Central

Iwata, Hiroyoshi; Minamikawa, Mai F.; Kajiya-Kanegae, Hiromi; Ishimori, Motoyuki; Hayashi, Takeshi

2016-01-01

Recent advancements in genomic analysis technologies have opened up new avenues to promote the efficiency of plant breeding. Novel genomics-based approaches for plant breeding and genetics research, such as genome-wide association studies (GWAS) and genomic selection (GS), are useful, especially in fruit tree breeding. The breeding of fruit trees is hindered by their long generation time, large plant size, long juvenile phase, and the necessity to wait for the physiological maturity of the plant to assess the marketable product (fruit). In this article, we describe the potential of genomics-assisted breeding, which uses these novel genomics-based approaches, to break through these barriers in conventional fruit tree breeding. We first introduce the molecular marker systems and whole-genome sequence data that are available for fruit tree breeding. Next we introduce the statistical methods for biparental linkage and quantitative trait locus (QTL) mapping as well as GWAS and GS. We then review QTL mapping, GWAS, and GS studies conducted on fruit trees. We also review novel technologies for rapid generation advancement. Finally, we note the future prospects of genomics-assisted fruit tree breeding and problems that need to be overcome in the breeding. PMID:27069395
MIG-seq: an effective PCR-based method for genome-wide single-nucleotide polymorphism genotyping using the next-generation sequencing platform

PubMed Central

Suyama, Yoshihisa; Matsuki, Yu

2015-01-01

Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities. PMID:26593239
Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications

PubMed Central

Del Medico, Luca; Christen, Heinz; Christen, Beat

2017-01-01

Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174
Ancient orphan crop joins modern era: gene-based SNP discovery and mapping in lentil.

PubMed

Sharpe, Andrew G; Ramsay, Larissa; Sanderson, Lacey-Anne; Fedoruk, Michael J; Clarke, Wayne E; Li, Rong; Kagale, Sateesh; Vijayan, Perumal; Vandenberg, Albert; Bett, Kirstin E

2013-03-18

The genus Lens comprises a range of closely related species within the galegoid clade of the Papilionoideae family. The clade includes other important crops (e.g. chickpea and pea) as well as a sequenced model legume (Medicago truncatula). Lentil is a global food crop increasing in importance in the Indian sub-continent and elsewhere due to its nutritional value and quick cooking time. Despite this importance there has been a dearth of genetic and genomic resources for the crop and this has limited the application of marker-assisted selection strategies in breeding. We describe here the development of a deep and diverse transcriptome resource for lentil using next generation sequencing technology. The generation of data in multiple cultivated (L. culinaris) and wild (L. ervoides) genotypes together with the utilization of a bioinformatics workflow enabled the identification of a large collection of SNPs and the subsequent development of a genotyping platform that was used to establish the first comprehensive genetic map of the L. culinaris genome. Extensive collinearity with M. truncatula was evident on the basis of sequence homology between mapped markers and the model genome and large translocations and inversions relative to M. truncatula were identified. An estimate for the time divergence of L. culinaris from L. ervoides and of both from M. truncatula was also calculated. The availability of the genomic and derived molecular marker resources presented here will help change lentil breeding strategies and lead to increased genetic gain in the future.

SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters.

PubMed

Wang, Chunlin; Lefkowitz, Elliot J

2004-10-28

Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. Used together, QS-search and DS-BLAST provide a flexible solution to adapt sequential similarity searching applications in high performance computing environments. Their ease of use and their ability to wrap a variety of database search programs provide an analytical architecture to assist both the seasoned bioinformaticist and the wet-bench biologist.
SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters

PubMed Central

Wang, Chunlin; Lefkowitz, Elliot J

2004-01-01

Background Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. Results We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. Conclusions Used together, QS-search and DS-BLAST provide a flexible solution to adapt sequential similarity searching applications in high performance computing environments. Their ease of use and their ability to wrap a variety of database search programs provide an analytical architecture to assist both the seasoned bioinformaticist and the wet-bench biologist. PMID:15511296
A clone-free, single molecule map of the domestic cow (Bos taurus) genome.

PubMed

Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C

2015-08-28

The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
Assisted reproduction treatment and epigenetic inheritance

PubMed Central

van Montfoort, A.P.A.; Hanssen, L.L.P.; de Sutter, P.; Viville, S.; Geraedts, J.P.M.; de Boer, P.

2012-01-01

BACKGROUND The subject of epigenetic risk of assisted reproduction treatment (ART), initiated by reports on an increase of children with the Beckwith–Wiedemann imprinting disorder, is very topical. Hence, there is a growing literature, including mouse studies. METHODS In order to gain information on transgenerational epigenetic inheritance and epigenetic effects induced by ART, literature databases were searched for papers on this topic using relevant keywords. RESULTS At the level of genomic imprinting involving CpG methylation, ART-induced epigenetic defects are convincingly observed in mice, especially for placenta, and seem more frequent than in humans. Data generally provide a warning as to the use of ovulation induction and in vitro culture. In human sperm from compromised spermatogenesis, sequence-specific DNA hypomethylation is observed repeatedly. Transmittance of sperm and oocyte DNA methylation defects is possible but, as deduced from the limited data available, largely prevented by selection of gametes for ART and/or non-viability of the resulting embryos. Some evidence indicates that subfertility itself is a risk factor for imprinting diseases. As in mouse, physiological effects from ART are observed in humans. In the human, indications for a broader target for changes in CpG methylation than imprinted DNA sequences alone have been found. In the mouse, a broader range of CpG sequences has not yet been studied. Also, a multigeneration study of systematic ART on epigenetic parameters is lacking. CONCLUSIONS The field of epigenetic inheritance within the lifespan of an individual and between generations (via mitosis and meiosis, respectively) is growing, driven by the expansion of chromatin research. ART can induce epigenetic variation that might be transmitted to the next generation. PMID:22267841
RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome.

PubMed

Wenger, Yvan; Galliot, Brigitte

2013-03-25

Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.
RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

PubMed Central

2013-01-01

Background Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. Results To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48’909 unique sequences including splice variants, representing approximately 24’450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10’597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11’270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. Conclusions We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events. PMID:23530871
Iraq: Recent Developments in Reconstruction Assistance

DTIC Science & Technology

2005-05-12

in Reconstruction Assistance Summary Large-scale reconstruction assistance programs are being undertaken by the United States following the war with... assistance programs , the Coalition Provisional Authority (CPA), dissolved, and sovereignty was returned to Iraq. Security Council Resolution 1546 of June...Assessment.pdf]. Iraq: Recent Developments in Reconstruction Assistance Large-scale reconstruction assistance programs are being undertaken by the United
Development of Genomic Microsatellite Markers in Carthamus tinctorius L. (Safflower) Using Next Generation Sequencing and Assessment of Their Cross-Species Transferability and Utility for Diversity Analysis

PubMed Central

Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra

2015-01-01

Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743
Computer-assisted segmentation of white matter lesions in 3D MR images using support vector machine.

PubMed

Lao, Zhiqiang; Shen, Dinggang; Liu, Dengfeng; Jawad, Abbas F; Melhem, Elias R; Launer, Lenore J; Bryan, R Nick; Davatzikos, Christos

2008-03-01

Brain lesions, especially white matter lesions (WMLs), are associated with cardiac and vascular disease, but also with normal aging. Quantitative analysis of WML in large clinical trials is becoming more and more important. In this article, we present a computer-assisted WML segmentation method, based on local features extracted from multiparametric magnetic resonance imaging (MRI) sequences (ie, T1-weighted, T2-weighted, proton density-weighted, and fluid attenuation inversion recovery MRI scans). A support vector machine classifier is first trained on expert-defined WMLs, and is then used to classify new scans. Postprocessing analysis further reduces false positives by using anatomic knowledge and measures of distance from the training set. Cross-validation on a population of 35 patients from three different imaging sites with WMLs of varying sizes, shapes, and locations tests the robustness and accuracy of the proposed segmentation method, compared with the manual segmentation results from two experienced neuroradiologists.
Using FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) to isolate active regulatory DNA

PubMed Central

Simon, Jeremy M.; Giresi, Paul G.; Davis, Ian J.; Lieb, Jason D.

2013-01-01

Eviction or destabilization of nucleosomes from chromatin is a hallmark of functional regulatory elements of the eukaryotic genome. Historically identified by nuclease hypersensitivity, these regulatory elements are typically bound by transcription factors or other regulatory proteins. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) is an alternative approach to identify these genomic regions and has proven successful in a multitude of eukaryotic cell and tissue types. Cells or dissociated tissues are crosslinked briefly with formaldehyde, lysed, and sonicated. Sheared chromatin is subjected to phenol-chloroform extraction and the isolated DNA, typically encompassing 1–3% of the human genome, is purified. We provide guidelines for quantitative analysis by PCR, microarrays, or next-generation sequencing. Regulatory elements enriched by FAIRE display high concordance with those identified by nuclease hypersensitivity or ChIP, and the entire procedure can be completed in three days. FAIRE exhibits low technical variability, which allows its use in large-scale studies of chromatin from normal or diseased tissues. PMID:22262007
Comparisons between conventional, ultrasound-assisted and microwave-assisted methods for extraction of anthraquinones from Heterophyllaea pustulata Hook f. (Rubiaceae).

PubMed

Barrera Vázquez, M F; Comini, L R; Martini, R E; Núñez Montoya, S C; Bottini, S; Cabrera, J L

2014-03-01

This work reports a comparative study about extraction methods used to obtain anthraquinones (AQs) from stems and leaves of Heterophyllae pustulata Hook (Rubiáceae). One of the conventional procedures used to extract these metabolites from a vegetable matrix is by successive Soxhlet extractions with solvents of increasing polarity: starting with hexane to eliminate chlorophylls and fatty components, following by benzene and finally ethyl acetate. However, this technique shows a low extraction yield of total AQs, and consumes large quantities of solvent and time. Ultrasound-assisted extraction (UAE) and microwave-assisted extraction (MAE) have been investigated as alternative methods to extract these compounds, using the same sequence of solvents. It was found that UAE increases the extraction yield of total AQs and reduces the time and amount of solvent used. Nevertheless, the combination UAE with benzene, plus MAE with ethyl acetate at a constant power of 900 W showed the best results. A higher yield of total AQs was obtained in less time and using the same amount of solvent that UAE. The optimal conditions for this latter procedure were UAE with benzene at 50 °C during 60 min, followed by MAE at 900 W during 15 min using ethyl acetate as extraction solvent. Copyright © 2013 Elsevier B.V. All rights reserved.
An Efficient Strategy Combining SSR Markers- and Advanced QTL-seq-driven QTL Mapping Unravels Candidate Genes Regulating Grain Weight in Rice

PubMed Central

Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.

2016-01-01

Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
Draft Genome Sequence of “Cohnella kolymensis” B-2846

PubMed Central

Kudryashova, Ekaterina B.; Ariskina, Elena V.

2016-01-01

A draft genome sequence of “Cohnella kolymensis” strain B-2846 was derived using IonTorrent sequencing technology. The size of the assembly and G+C content were in agreement with those of other species of this genus. Characterization of the genome of a novel species of Cohnella will assist in bacterial systematics. PMID:26769947
The gradient boosting algorithm and random boosting for genome-assisted evaluation in large data sets.

PubMed

González-Recio, O; Jiménez-Montero, J A; Alenda, R

2013-01-01

In the next few years, with the advent of high-density single nucleotide polymorphism (SNP) arrays and genome sequencing, genomic evaluation methods will need to deal with a large number of genetic variants and an increasing sample size. The boosting algorithm is a machine-learning technique that may alleviate the drawbacks of dealing with such large data sets. This algorithm combines different predictors in a sequential manner with some shrinkage on them; each predictor is applied consecutively to the residuals from the committee formed by the previous ones to form a final prediction based on a subset of covariates. Here, a detailed description is provided and examples using a toy data set are included. A modification of the algorithm called "random boosting" was proposed to increase predictive ability and decrease computation time of genome-assisted evaluation in large data sets. Random boosting uses a random selection of markers to add a subsequent weak learner to the predictive model. These modifications were applied to a real data set composed of 1,797 bulls genotyped for 39,714 SNP. Deregressed proofs of 4 yield traits and 1 type trait from January 2009 routine evaluations were used as dependent variables. A 2-fold cross-validation scenario was implemented. Sires born before 2005 were used as a training sample (1,576 and 1,562 for production and type traits, respectively), whereas younger sires were used as a testing sample to evaluate predictive ability of the algorithm on yet-to-be-observed phenotypes. Comparison with the original algorithm was provided. The predictive ability of the algorithm was measured as Pearson correlations between observed and predicted responses. Further, estimated bias was computed as the average difference between observed and predicted phenotypes. The results showed that the modification of the original boosting algorithm could be run in 1% of the time used with the original algorithm and with negligible differences in accuracy and bias. This modification may be used to speed the calculus of genome-assisted evaluation in large data sets such us those obtained from consortiums. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Multiomics tools for the diagnosis and treatment of rare neurological disease.

PubMed

Crowther, L M; Poms, M; Plecko, Barbara

2018-05-01

Conventional workup of rare neurological disease is frequently hampered by diagnostic delay or lack of diagnosis. While biomarkers have been established for many neurometabolic disorders, improved methods are required for diagnosis of previously unidentified or underreported causes of rare neurological disease. This would result in a higher diagnostic yield and increased patient numbers required for interventional studies. Recent studies using next-generation sequencing and metabolomics have led to identification of novel disease-causing genes and biomarkers. This combined approach can assist in overcoming challenges associated with analyzing and interpreting the large amount of data obtained from each technique. In particular, metabolomics can support the pathogenicity of sequence variants in genes encoding enzymes or transporters involved in metabolic pathways. Moreover, metabolomics can show the broader perturbation caused by inborn errors of metabolism and identify a metabolic fingerprint of metabolic disorders. As such, using "omics" has great potential to meet the current needs for improved diagnosis and elucidation of rare neurological disease.
Reverse transcriptase genes are highly abundant and transcriptionally active in marine plankton assemblages

PubMed Central

Lescot, Magali; Hingamp, Pascal; Kojima, Kenji K; Villar, Emilie; Romac, Sarah; Veluchamy, Alaguraj; Boccara, Martine; Jaillon, Olivier; Iudicone, Daniele; Bowler, Chris; Wincker, Patrick; Claverie, Jean-Michel; Ogata, Hiroyuki

2016-01-01

Genes encoding reverse transcriptases (RTs) are found in most eukaryotes, often as a component of retrotransposons, as well as in retroviruses and in prokaryotic retroelements. We investigated the abundance, classification and transcriptional status of RTs based on Tara Oceans marine metagenomes and metatranscriptomes encompassing a wide organism size range. Our analyses revealed that RTs predominate large-size fraction metagenomes (>5 μm), where they reached a maximum of 13.5% of the total gene abundance. Metagenomic RTs were widely distributed across the phylogeny of known RTs, but many belonged to previously uncharacterized clades. Metatranscriptomic RTs showed distinct abundance patterns across samples compared with metagenomic RTs. The relative abundances of viral and bacterial RTs among identified RT sequences were higher in metatranscriptomes than in metagenomes and these sequences were detected in all metatranscriptome size fractions. Overall, these observations suggest an active proliferation of various RT-assisted elements, which could be involved in genome evolution or adaptive processes of plankton assemblage. PMID:26613339
Mass spectrometric survey of peptides in cephalopods with an emphasis on the FMRFamide-related peptides.

PubMed

Sweedler, J V; Li, L; Floyd, P; Gilly, W

2000-12-01

A matrix-assisted laser desorption/ionization (MALDI) mass spectrometric (MS) survey of the major peptides in the stellar, fin and pallial nerves and the posterior chromatophore lobe of the cephalopods Sepia officinalis, Loligo opalescens and Dosidicus gigas has been performed. Although a large number of putative peptides are distinct among the three species, several molecular masses are conserved. In addition to peptides, characterization of the lipid content of the nerves is reported, and these lipid peaks account for many of the lower molecular masses observed. One conserved set of peaks corresponds to the FMRFamide-related peptides (FRPs). The Loligo opalescens FMRFa gene has been sequenced. It encodes a 331 amino acid residue prohormone that is processed into 14 FRPs, which are both predicted by the nucleotide sequence and confirmed by MALDI MS. The FRPs predicted by this gene (FMRFa, FLRFa/FIRFa and ALSGDAFLRFa) are observed in all three species, indicating that members of this peptide family are highly conserved across cephalopods.
Technological advancements and their importance for nematode identification

NASA Astrophysics Data System (ADS)

Ahmed, Mohammed; Sapp, Melanie; Prior, Thomas; Karssen, Gerrit; Back, Matthew Alan

2016-06-01

Nematodes represent a species-rich and morphologically diverse group of metazoans known to inhabit both aquatic and terrestrial environments. Their role as biological indicators and as key players in nutrient cycling has been well documented. Some plant-parasitic species are also known to cause significant losses to crop production. In spite of this, there still exists a huge gap in our knowledge of their diversity due to the enormity of time and expertise often involved in characterising species using phenotypic features. Molecular methodology provides useful means of complementing the limited number of reliable diagnostic characters available for morphology-based identification. We discuss herein some of the limitations of traditional taxonomy and how molecular methodologies, especially the use of high-throughput sequencing, have assisted in carrying out large-scale nematode community studies and characterisation of phytonematodes through rapid identification of multiple taxa. We also provide brief descriptions of some the current and almost-outdated high-throughput sequencing platforms and their applications in both plant nematology and soil ecology.
Complete chloroplast genome sequence of green foxtail (Setaria viridis), a promising model system for C4 photosynthesis.

PubMed

Wang, Shuo; Gao, Li-Zhi

2016-09-01

The complete chloroplast genome of green foxtail (Setaria viridis), a promising model system for C4 photosynthesis, is first reported in this study. The genome harbors a large single copy (LSC) region of 81 016 bp and a small single copy (SSC) region of 12 456 bp separated by a pair of inverted repeat (IRa and IRb) regions of 22 315 bp. GC content is 38.92%. The proportion of coding sequence is 57.97%, comprising of 111 (19 duplicated in IR regions) unique genes, 71 of which are protein-coding genes, four are rRNA genes, and 36 are tRNA genes. Phylogenetic analysis indicated that S. viridis was clustered with its cultivated species S. italica in the tribe Paniceae of the family Poaceae. This newly determined chloroplast genome will provide valuable genetic resources to assist future studies on C4 photosynthesis in grasses.
Towards AI-powered personalization in MOOC learning

NASA Astrophysics Data System (ADS)

Yu, Han; Miao, Chunyan; Leung, Cyril; White, Timothy John

2017-12-01

Massive Open Online Courses (MOOCs) represent a form of large-scale learning that is changing the landscape of higher education. In this paper, we offer a perspective on how advances in artificial intelligence (AI) may enhance learning and research on MOOCs. We focus on emerging AI techniques including how knowledge representation tools can enable students to adjust the sequence of learning to fit their own needs; how optimization techniques can efficiently match community teaching assistants to MOOC mediation tasks to offer personal attention to learners; and how virtual learning companions with human traits such as curiosity and emotions can enhance learning experience on a large scale. These new capabilities will also bring opportunities for educational researchers to analyse students' learning skills and uncover points along learning paths where students with different backgrounds may require different help. Ethical considerations related to the application of AI in MOOC education research are also discussed.

GAP Final Technical Report 12-14-04

DOE Office of Scientific and Technical Information (OSTI.GOV)

Andrew J. Bordner, PhD, Senior Research Scientist

2004-12-14

The Genomics Annotation Platform (GAP) was designed to develop new tools for high throughput functional annotation and characterization of protein sequences and structures resulting from genomics and structural proteomics, benchmarking and application of those tools. Furthermore, this platform integrated the genomic scale sequence and structural analysis and prediction tools with the advanced structure prediction and bioinformatics environment of ICM. The development of GAP was primarily oriented towards the annotation of new biomolecular structures using both structural and sequence data. Even though the amount of protein X-ray crystal data is growing exponentially, the volume of sequence data is growing even moremore » rapidly. This trend was exploited by leveraging the wealth of sequence data to provide functional annotation for protein structures. The additional information provided by GAP is expected to assist the majority of the commercial users of ICM, who are involved in drug discovery, in identifying promising drug targets as well in devising strategies for the rational design of therapeutics directed at the protein of interest. The GAP also provided valuable tools for biochemistry education, and structural genomics centers. In addition, GAP incorporates many novel prediction and analysis methods not available in other molecular modeling packages. This development led to signing the first Molsoft agreement in the structural genomics annotation area with the University of oxford Structural Genomics Center. This commercial agreement validated the Molsoft efforts under the GAP project and provided the basis for further development of the large scale functional annotation platform.« less
Detection of cystic fibrosis transmembrane conductance regulator ΔF508 gene mutation using a paper-based nucleic acid hybridization assay and a smartphone camera.

PubMed

Malhotra, Karan; Noor, M Omair; Krull, Ulrich J

2018-05-29

Diagnostic technology that makes use of paper platforms in conjunction with the ubiquitous availability of digital cameras in cellular telephones and personal assistive devices offers opportunities for development of bioassays that are cost effective and widely distributed. Assays that operate effectively in aqueous solution require further development for implementation in paper substrates, overcoming issues associated with surface interactions on a matrix that offers a large surface-to-volume ratio and constraints on convective mixing. This report presents and compares two related methods for determination of oligonucleotides that serve as indicators of cystic fibrosis, differentiating between the normal wild-type sequence, and a mutant-type sequence that has a 3-base replacement. The transduction strategy operates by selective hybridization of oligonucleotide probes that are conjugated to fluorescent quantum dots, where hybridization of target sequences causes a molecular fluorophore to approach the quantum dot and become emissive through fluorescence resonance energy transfer. Detection can rely on hybridization of a target that is labelled with Cy3 fluorophore, or in the presence of an unlabelled target when a sandwich assay format is implemented with a labelled reporter oligonucleotide. Selectivity to determine the presence of mismatched sequences involves appropriate selection of nucleotide sequences to set melt temperatures, in conjunction with control of stringency conditions using formamide as a chaotrope. It was determined that both direct and sandwich assays on paper substrates are able to distinguish between wild-type and mutant-type samples.
NGSPanPipe: A Pipeline for Pan-genome Identification in Microbial Strains from Experimental Reads.

PubMed

Kulsum, Umay; Kapil, Arti; Singh, Harpreet; Kaur, Punit

2018-01-01

Recent advancements in sequencing technologies have decreased both time span and cost for sequencing the whole bacterial genome. High-throughput Next-Generation Sequencing (NGS) technology has led to the generation of enormous data concerning microbial populations publically available across various repositories. As a consequence, it has become possible to study and compare the genomes of different bacterial strains within a species or genus in terms of evolution, ecology and diversity. Studying the pan-genome provides insights into deciphering microevolution, global composition and diversity in virulence and pathogenesis of a species. It can also assist in identifying drug targets and proposing vaccine candidates. The effective analysis of these large genome datasets necessitates the development of robust tools. Current methods to develop pan-genome do not support direct input of raw reads from the sequencer machine but require preprocessing of reads as an assembled protein/gene sequence file or the binary matrix of orthologous genes/proteins. We have designed an easy-to-use integrated pipeline, NGSPanPipe, which can directly identify the pan-genome from short reads. The output from the pipeline is compatible with other pan-genome analysis tools. We evaluated our pipeline with other methods for developing pan-genome, i.e. reference-based assembly and de novo assembly using simulated reads of Mycobacterium tuberculosis. The single script pipeline (pipeline.pl) is applicable for all bacterial strains. It integrates multiple in-house Perl scripts and is freely accessible from https://github.com/Biomedinformatics/NGSPanPipe .
Magnetically assisted remote-controlled endovascular catheter for interventional MR imaging: in vitro navigation at 1.5 T versus X-ray fluoroscopy.

PubMed

Losey, Aaron D; Lillaney, Prasheel; Martin, Alastair J; Cooke, Daniel L; Wilson, Mark W; Thorne, Bradford R H; Sincic, Ryan S; Arenson, Ronald L; Saeed, Maythem; Hetts, Steven W

2014-06-01

To compare in vitro navigation of a magnetically assisted remote-controlled (MARC) catheter under real-time magnetic resonance (MR) imaging with manual navigation under MR imaging and standard x-ray guidance in endovascular catheterization procedures in an abdominal aortic phantom. The 2-mm-diameter custom clinical-grade microcatheter prototype with a solenoid coil at the distal tip was deflected with a foot pedal actuator used to deliver 300 mA of positive or negative current. Investigators navigated the catheter into branch vessels in a custom cryogel abdominal aortic phantom. This was repeated under MR imaging guidance without magnetic assistance and under conventional x-ray fluoroscopy. MR experiments were performed at 1.5 T by using a balanced steady-state free precession sequence. The mean procedure times and percentage success data were determined and analyzed with a linear mixed-effects regression analysis. The catheter was clearly visible under real-time MR imaging. One hundred ninety-two (80%) of 240 turns were successfully completed with magnetically assisted guidance versus 144 (60%) of 240 turns with nonassisted guidance (P < .001) and 119 (74%) of 160 turns with standard x-ray guidance (P = .028). Overall mean procedure time was shorter with magnetically assisted than with nonassisted guidance under MR imaging (37 seconds ± 6 [standard error of the mean] vs 55 seconds ± 3, P < .001), and time was comparable between magnetically assisted and standard x-ray guidance (37 seconds ± 6 vs 44 seconds ± 3, P = .045). When stratified by angle of branch vessel, magnetic assistance was faster than nonassisted MR guidance at turns of 45°, 60°, and 75°. In this study, a MARC catheter for endovascular navigation under real-time MR imaging guidance was developed and tested. For catheterization of branch vessels arising at large angles, magnetically assisted catheterization was faster than manual catheterization under MR imaging guidance and was comparable to standard x-ray guidance.
Monitoring and Surveillance of Marine Invasive Species in Californian Waters by DNA Barcoding: Methodological and Analytical Solutions

NASA Astrophysics Data System (ADS)

Campbell, T. L.; Geller, J. B.; Heller, P.; Ruiz, G.; Chang, A.; McCann, L.; Ceballos, L.; Marraffini, M.; Ashton, G.; Larson, K.; Havard, S.; Meagher, K.; Wheelock, M.; Drake, C.; Rhett, G.

2016-02-01

The Ballast Water Management Act, the Marine Invasive Species Act, and the Coastal Ecosystem Protection Act require the California Department of Fish and Wildlife to monitor and evaluate the extent of biological invasions in the state's marine and estuarine waters. This has been performed statewide, using a variety of methodologies. Conventional sample collection and processing is laborious, slow and costly, and may require considerable taxonomic expertise requiring detailed time-consuming microscopic study of multiple specimens. These factors limit the volume of biomass that can be searched for introduced species. New technologies continue to reduce the cost and increase the throughput of genetic analyses, which become efficient alternatives to traditional morphological analysis for identification, monitoring and surveillance of marine invasive species. Using next-generation sequencing of mitochondrial Cytochrome c oxidase subunit I (COI) and nuclear large subunit ribosomal RNA (LSU), we analyzed over 15,000 individual marine invertebrates collected in Californian waters. We have created sequence databases of California native and non-native species to assist in molecular identification and surveillance in North American waters. Metagenetics, the next-generation sequencing of environmental samples with comparison to DNA sequence databases, is a faster and cost-effective alternative to individual sample analysis. We have sequenced from biomass collected from whole settlement plates and plankton in California harbors, and used our introduced species database to create species lists. We can combine these species lists for individual marinas with collected environmental data, such as temperature, salinity, and dissolved oxygen to understand the ecology of marine invasions. Here we discuss high throughput sampling, sequencing, and COASTLINE, our data analysis answer to challenges working with hundreds of millions of sequencing reads from tens of thousands of specimens.
Analysis of the transcriptome of Panax notoginseng root uncovers putative triterpene saponin-biosynthetic genes and genetic markers

PubMed Central

2011-01-01

Background Panax notoginseng (Burk) F.H. Chen is important medicinal plant of the Araliacease family. Triterpene saponins are the bioactive constituents in P. notoginseng. However, available genomic information regarding this plant is limited. Moreover, details of triterpene saponin biosynthesis in the Panax species are largely unknown. Results Using the 454 pyrosequencing technology, a one-quarter GS FLX titanium run resulted in 188,185 reads with an average length of 410 bases for P. notoginseng root. These reads were processed and assembled by 454 GS De Novo Assembler software into 30,852 unique sequences. A total of 70.2% of unique sequences were annotated by Basic Local Alignment Search Tool (BLAST) similarity searches against public sequence databases. The Kyoto Encyclopedia of Genes and Genomes (KEGG) assignment discovered 41 unique sequences representing 11 genes involved in triterpene saponin backbone biosynthesis in the 454-EST dataset. In particular, the transcript encoding dammarenediol synthase (DS), which is the first committed enzyme in the biosynthetic pathway of major triterpene saponins, is highly expressed in the root of four-year-old P. notoginseng. It is worth emphasizing that the candidate cytochrome P450 (Pn02132 and Pn00158) and UDP-glycosyltransferase (Pn00082) gene most likely to be involved in hydroxylation or glycosylation of aglycones for triterpene saponin biosynthesis were discovered from 174 cytochrome P450s and 242 glycosyltransferases by phylogenetic analysis, respectively. Putative transcription factors were detected in 906 unique sequences, including Myb, homeobox, WRKY, basic helix-loop-helix (bHLH), and other family proteins. Additionally, a total of 2,772 simple sequence repeat (SSR) were identified from 2,361 unique sequences, of which, di-nucleotide motifs were the most abundant motif. Conclusion This study is the first to present a large-scale EST dataset for P. notoginseng root acquired by next-generation sequencing (NGS) technology. The candidate genes involved in triterpene saponin biosynthesis, including the putative CYP450s and UGTs, were obtained in this study. Additionally, the identification of SSRs provided plenty of genetic makers for molecular breeding and genetics applications in this species. These data will provide information on gene discovery, transcriptional regulation and marker-assisted selection for P. notoginseng. The dataset establishes an important foundation for the study with the purpose of ensuring adequate drug resources for this species. PMID:22369100
Instructional Leadership Responsibilities of Assistant Principals in Large Texas High Schools

ERIC Educational Resources Information Center

Howard-Schwind, Michelle

2010-01-01

The purpose of this study was to determine the extent secondary assistant principals in large Texas high schools demonstrate behaviors consistent with what the literature describes as instructional leadership. Three hundred seventy principals and assistant principals of large Texas high schools participated in this study. The Principal…
Iraq: Recent Developments in Reconstruction Assistance

DTIC Science & Technology

2005-03-23

Developments in Reconstruction Assistance Summary Large-scale reconstruction assistance programs are being undertaken by the United States following the war...currently the case. The House approved the measure on March 16. On June 28, 2004, the entity implementing assistance programs , the Coalition Provisional...Large-scale reconstruction assistance programs are being undertaken by the United States in Iraq. This report describes recent developments in this
PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences.

PubMed

Avvaru, Akshay Kumar; Sowpati, Divya Tej; Mishra, Rakesh Kumar

2018-03-15

Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. tej@ccmb.res.in. Supplementary data are available at Bioinformatics online.
Comparative Analysis of the First Complete Enterococcus faecium Genome

PubMed Central

Lam, Margaret M. C.; Seemann, Torsten; Bulach, Dieter M.; Gladman, Simon L.; Chen, Honglei; Haring, Volker; Moore, Robert J.; Ballard, Susan; Grayson, M. Lindsay; Johnson, Paul D. R.; Howden, Benjamin P.

2012-01-01

Vancomycin-resistant enterococci (VRE) are one of the leading causes of nosocomial infections in health care facilities around the globe. In particular, infections caused by vancomycin-resistant Enterococcus faecium are becoming increasingly common. Comparative and functional genomic studies of E. faecium isolates have so far been limited owing to the lack of a fully assembled E. faecium genome sequence. Here we address this issue and report the complete 3.0-Mb genome sequence of the multilocus sequence type 17 vancomycin-resistant Enterococcus faecium strain Aus0004, isolated from the bloodstream of a patient in Melbourne, Australia, in 1998. The genome comprises a 2.9-Mb circular chromosome and three circular plasmids. The chromosome harbors putative E. faecium virulence factors such as enterococcal surface protein, hemolysin, and collagen-binding adhesin. Aus0004 has a very large accessory genome (38%) that includes three prophage and two genomic islands absent among 22 other E. faecium genomes. One of the prophage was present as inverted 50-kb repeats that appear to have facilitated a 683-kb chromosomal inversion across the replication terminus, resulting in a striking replichore imbalance. Other distinctive features include 76 insertion sequence elements and a single chromosomal copy of Tn1549 containing the vanB vancomycin resistance element. A complete E. faecium genome will be a useful resource to assist our understanding of this emerging nosocomial pathogen. PMID:22366422
VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data

PubMed Central

Jia, Peilin; Zhao, Zhongming

2014-01-01

A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data. PMID:24516372
VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data.

PubMed

Jia, Peilin; Zhao, Zhongming

2014-02-01

A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.
Curriculum Guidelines for Aspects of Oral Pathology for Dental Assisting Education.

ERIC Educational Resources Information Center

Journal of Dental Education, 1987

1987-01-01

Guidelines for structuring an oral pathology curriculum for dental assistants include: a definition of oral pathology; the scope of instruction and relationships with other fields; recommendations for prerequisites; core content in various subfields; specific behavioral objectives; and suggestions for sequencing, faculty, and facilities. (MSE)
Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L.

PubMed Central

2012-01-01

Background In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Results Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker-assisted selection in the Australian national lupin breeding program. Conclusions We demonstrated that more than 30 molecular markers linked to a target gene of agronomic trait of interest can be identified from a small portion (1/8) of one sequencing run on HiSeq2000 by applying NGS based RAD sequencing in marker development. The markers developed by the strategy described in this study are all co-dominant SNP markers, which can readily be converted into high throughput multiplex format or low-cost, simple PCR-based markers desirable for large scale marker implementation in plant breeding programs. The high density and closely linked molecular markers associated with a target trait help to overcome a major bottleneck for implementation of molecular markers on a wide range of germplasm in breeding programs. We conclude that application of NGS based RAD sequencing as DNA fingerprinting is a very rapid and cost-effective strategy for marker development in molecular plant breeding. The strategy does not require any prior genome knowledge or molecular information for the species under investigation, and it is applicable to other plant species. PMID:22805587
Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L.

PubMed

Yang, Huaan; Tao, Ye; Zheng, Zequn; Li, Chengdao; Sweetingham, Mark W; Howieson, John G

2012-07-17

In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker-assisted selection in the Australian national lupin breeding program. We demonstrated that more than 30 molecular markers linked to a target gene of agronomic trait of interest can be identified from a small portion (1/8) of one sequencing run on HiSeq2000 by applying NGS based RAD sequencing in marker development. The markers developed by the strategy described in this study are all co-dominant SNP markers, which can readily be converted into high throughput multiplex format or low-cost, simple PCR-based markers desirable for large scale marker implementation in plant breeding programs. The high density and closely linked molecular markers associated with a target trait help to overcome a major bottleneck for implementation of molecular markers on a wide range of germplasm in breeding programs. We conclude that application of NGS based RAD sequencing as DNA fingerprinting is a very rapid and cost-effective strategy for marker development in molecular plant breeding. The strategy does not require any prior genome knowledge or molecular information for the species under investigation, and it is applicable to other plant species.
Sequencing Computer-Assisted Learning of Transformations of Trigonometric Functions

ERIC Educational Resources Information Center

Ross, John A.; Bruce, Catherine D.; Sibbald, Timothy M.

2011-01-01

Studies incorporating technology into the teaching of trigonometry, although sparse, have demonstrated positive effects on student achievement. The optimal sequence for integrating technology with teacher-led mathematics instruction has not been determined. Our research investigated whether technology has a greater impact on student achievement…
Molecular epidemiology of the endemic multiresistance plasmid pSI54/04 of Salmonella Infantis in broiler and human population in Hungary.

PubMed

Szmolka, Ama; Szabó, Móni; Kiss, János; Pászti, Judit; Adrián, Erzsébet; Olasz, Ferenc; Nagy, Béla

2018-05-01

Salmonella Infantis (SI) became endemic in Hungary where the PFGE cluster B, characterized by a large multiresistance (MDR) plasmid emerged among broilers leading to an increased occurrence in humans. We hypothesized that this plasmid (pSI54/04) assisted dissemination of SI. Indeed, Nal-Sul-Tet phenotypes carrying pSI54/04 occurred increasingly between 2011 and 2013 among SI isolates from broilers and humans. Characterization of pSI54/04 based on genome sequence data of the MDR strain SI54/04 indicated a size of ∼277 kb and a high sequence similarity with the megaplasmid pESI of SI predominant in Israel. Molecular characterization of 78 representative broiler and human isolates detected the prototype plasmid pSI54/04 and its variants together with novel plasmid associations within the emerging cluster B. To test in vitro and in vivo pathogenicity of pSI54/04 we produced plasmidic transconjugant of the plasmid-free pre-emergent strain SI69/94. This parental strain and its transconjugant have been tested on chicken embryo fibroblasts (CEFs) and in orally infected day old chicks. The uptake of pSI54/04 did not increase the pathogenicity of the strain SI69/94 in these systems. Thus, dissemination of SI in poultry could be assisted by antimicrobial resistance rather than by virulence modules of the endemic plasmid pSI54/04 in Hungary. Copyright © 2017 Elsevier Ltd. All rights reserved.
Human-Assisted Machine Information Exploitation: a crowdsourced investigation of information-based problem solving

NASA Astrophysics Data System (ADS)

Kase, Sue E.; Vanni, Michelle; Caylor, Justine; Hoye, Jeff

2017-05-01

The Human-Assisted Machine Information Exploitation (HAMIE) investigation utilizes large-scale online data collection for developing models of information-based problem solving (IBPS) behavior in a simulated time-critical operational environment. These types of environments are characteristic of intelligence workflow processes conducted during human-geo-political unrest situations when the ability to make the best decision at the right time ensures strategic overmatch. The project takes a systems approach to Human Information Interaction (HII) by harnessing the expertise of crowds to model the interaction of the information consumer and the information required to solve a problem at different levels of system restrictiveness and decisional guidance. The design variables derived from Decision Support Systems (DSS) research represent the experimental conditions in this online single-player against-the-clock game where the player, acting in the role of an intelligence analyst, is tasked with a Commander's Critical Information Requirement (CCIR) in an information overload scenario. The player performs a sequence of three information processing tasks (annotation, relation identification, and link diagram formation) with the assistance of `HAMIE the robot' who offers varying levels of information understanding dependent on question complexity. We provide preliminary results from a pilot study conducted with Amazon Mechanical Turk (AMT) participants on the Volunteer Science scientific research platform.
ESAP plus: a web-based server for EST-SSR marker development.

PubMed

Ponyared, Piyarat; Ponsawat, Jiradej; Tongsima, Sissades; Seresangtakul, Pusadee; Akkasaeng, Chutipong; Tantisuwichwong, Nathpapat

2016-12-22

Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not users friendly and must be tightly integrated with reference genomic databases. A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and Java scripts. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can download all the results through the web interface. ESAP Plus is a comprehensive and convenient web-based bioinformatic tool for SSR marker development. ESAP Plus offers all necessary EST-SSR development processes with various adjustable options that users can easily use to identify SSR markers from a large EST collection. With familiar web interface, users can upload the raw EST using the data submission page and visualize/download the corresponding EST-SSR information from within ESAP Plus. ESAP Plus can handle considerably large EST datasets. This EST-SSR discovery tool can be accessed directly from: http://gbp.kku.ac.th/esap_plus/ .
Long-range barcode labeling-sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Feng; Zhang, Tao; Singh, Kanwar K.

Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.

Nurse's Assistant: Task Analyses. Competency-Based Education.

ERIC Educational Resources Information Center

Henrico County Public Schools, Glen Allen, VA. Virginia Vocational Curriculum Center.

These task analyses are designed to be used in combination with the "Health Occupations Education Service Area Resource" in order to implement competency-based education in the nurse's assistant program in Virginia. The task analysis document contains the task inventory, suggested task sequence lists, and content outlines for Nursing…
A Literacy Task to Assist Reader Awareness in Children's Informational Writing

ERIC Educational Resources Information Center

Holliway, David

2010-01-01

That our writing can be misunderstood by our readers is a conceptual difficulty for developing writers. This paper outlines a perspective-taking process that assists elementary students in composing referentially detailed descriptions. Through a procedural sequence that includes drafting, feedback, readers' perspective task, revision and drafting…
An Interactive Graphics Program for Assistance in Learning Convolution.

ERIC Educational Resources Information Center

Frederick, Dean K.; Waag, Gary L.

1980-01-01

A program has been written for the interactive computer graphics facility at Rensselaer Polytechnic Institute that is designed to assist the user in learning the mathematical technique of convolving two functions. Because convolution can be represented graphically by a sequence of steps involving folding, shifting, multiplying, and integration, it…
Project UPSTART. Final Report, October 1, 1983-September 30, 1984.

ERIC Educational Resources Information Center

Frain, Joan

Project UPSTART, during this fourth year of outreach, offered assistance in replicating its developed Sequenced Neuro-Sensorimotor Program (SNSP) for severely multihandicapped infants, pre-schoolers, young adults and their families. Future replication sites were identified. Programs received outreach assistance in the areas of staff training,…
Create a Better Flow through Sequencing Resident Assistant Training

ERIC Educational Resources Information Center

Whitney, Rich; Early, Sherry; Whisler, Travis

2016-01-01

Resident assistant training happens every year for the approximate 10,000 RAs who work on campuses across the country. These training programs can include classes, pre-service summer weeks, and ongoing training throughout the year. Following educational and training models such as CAS, assessment, Bloom's taxonomy, adventure programming, and…
An insight into the sialotranscriptome of the seed-feeding bug, Oncopeltus fasciatus.

PubMed

Francischetti, Ivo M B; Lopes, Angela H; Dias, Felipe A; Pham, Van M; Ribeiro, José M C

2007-09-01

The salivary transcriptome of the seed-feeding hemipteran, Oncopeltus fasciatus (milkweed bug), is described following assembly of 1025 expressed sequence tags (ESTs) into 305 clusters of related sequences. Inspection of these sequences reveals abundance of low complexity, putative secreted products rich in the amino acids (aa) glycine, serine or threonine, which might function as silk or mucins and assist food canal lubrication and sealing of the feeding site around the mouthparts. Several protease inhibitors were found, including abundant expression of cystatin transcripts that may inhibit cysteine proteases common in seeds that might injure the insect or induce plant apoptosis. Serine proteases and lipases are described that might assist digestion and liquefaction of seed proteins and oils. Finally, several novel putative proteins are described with no known function that might affect plant physiology or act as antimicrobials.
Base-resolution detection of N 4-methylcytosine in genomic DNA using 4mC-Tet-assisted-bisulfite-sequencing

DOE PAGES

Yu, Miao; Ji, Lexiang; Neumann, Drexel A.; ...

2015-07-15

Restriction-modification (R-M) systems pose a major barrier to DNA transformation and genetic engineering of bacterial species. Systematic identification of DNA methylation in R-M systems, including N 6-methyladenine (6mA), 5-methylcytosine (5mC) and N 4-methylcytosine (4mC), will enable strategies to make these species genetically tractable. Although single-molecule, real time (SMRT) sequencing technology is capable of detecting 4mC directly for any bacterial species regardless of whether an assembled genome exists or not, it is not as scalable to profiling hundreds to thousands of samples compared with the commonly used next-generation sequencing technologies. Here, we present 4mC-Tet-assisted bisulfite-sequencing (4mC-TAB-seq), a next-generation sequencing method thatmore » rapidly and cost efficiently reveals the genome-wide locations of 4mC for bacterial species with an available assembled reference genome. In 4mC-TAB-seq, both cytosines and 5mCs are read out as thymines, whereas only 4mCs are read out as cytosines, revealing their specific positions throughout the genome. We applied 4mC-TAB-seq to study the methylation of a member of the hyperthermophilc genus, Caldicellulosiruptor, in which 4mC-related restriction is a major barrier to DNA transformation from other species. Lastly, in combination with MethylC-seq, both 4mC- and 5mC-containing motifs are identified which can assist in rapid and efficient genetic engineering of these bacteria in the future.« less
Broad Search Solar Electric Propulsion Trajectories to Saturn with Gravity Assists

NASA Technical Reports Server (NTRS)

Lam, Try; Landau, Damon; Strange, Nathan

2009-01-01

Solar electric propulsion (SEP) trajectories to Saturn using multiple gravity assists are explored for the joint NASA and ESA Titan Saturn System Mission study. Results show that these new trajectories enable greater performance compared to chemical propulsion with similar gravity assists or SEP without gravity assists. This paper describes the method used in finding these interplanetary trajectories and examines variations in the performance for different SEP systems, flight times, and flyby sequences. The benefits of the SEP trajectories for a mission to Saturn are also discussed.
Pigeonpea genomics initiative (PGI): an international effort to improve crop productivity of pigeonpea (Cajanus cajan L.)

PubMed Central

Penmetsa, R. V.; Dutta, S.; Kulwal, P. L.; Saxena, R. K.; Datta, S.; Sharma, T. R.; Rosen, B.; Carrasquilla-Garcia, N.; Farmer, A. D.; Dubey, A.; Saxena, K. B.; Gao, J.; Fakrudin, B.; Singh, M. N.; Singh, B. P.; Wanjari, K. B.; Yuan, M.; Srivastava, R. K.; Kilian, A.; Upadhyaya, H. D.; Mallikarjuna, N.; Town, C. D.; Bruening, G. E.; He, G.; May, G. D.; McCombie, R.; Jackson, S. A.; Singh, N. K.; Cook, D. R.

2009-01-01

Pigeonpea (Cajanus cajan), an important food legume crop in the semi-arid regions of the world and the second most important pulse crop in India, has an average crop productivity of 780 kg/ha. The relatively low crop yields may be attributed to non-availability of improved cultivars, poor crop husbandry and exposure to a number of biotic and abiotic stresses in pigeonpea growing regions. Narrow genetic diversity in cultivated germplasm has further hampered the effective utilization of conventional breeding as well as development and utilization of genomic tools, resulting in pigeonpea being often referred to as an ‘orphan crop legume’. To enable genomics-assisted breeding in this crop, the pigeonpea genomics initiative (PGI) was initiated in late 2006 with funding from Indian Council of Agricultural Research under the umbrella of Indo-US agricultural knowledge initiative, which was further expanded with financial support from the US National Science Foundation’s Plant Genome Research Program and the Generation Challenge Program. As a result of the PGI, the last 3 years have witnessed significant progress in development of both genetic as well as genomic resources in this crop through effective collaborations and coordination of genomics activities across several institutes and countries. For instance, 25 mapping populations segregating for a number of biotic and abiotic stresses have been developed or are under development. An 11X-genome coverage bacterial artificial chromosome (BAC) library comprising of 69,120 clones have been developed of which 50,000 clones were end sequenced to generate 87,590 BAC-end sequences (BESs). About 10,000 expressed sequence tags (ESTs) from Sanger sequencing and ca. 2 million short ESTs by 454/FLX sequencing have been generated. A variety of molecular markers have been developed from BESs, microsatellite or simple sequence repeat (SSR)-enriched libraries and mining of ESTs and genomic amplicon sequencing. Of about 21,000 SSRs identified, 6,698 SSRs are under analysis along with 670 orthologous genes using a GoldenGate SNP (single nucleotide polymorphism) genotyping platform, with large scale SNP discovery using Solexa, a next generation sequencing technology, is in progress. Similarly a diversity array technology array comprising of ca. 15,000 features has been developed. In addition, >600 unique nucleotide binding site (NBS) domain containing members of the NBS-leucine rich repeat disease resistance homologs were cloned in pigeonpea; 960 BACs containing these sequences were identified by filter hybridization, BES physical maps developed using high information content fingerprinting. To enrich the genomic resources further, sequenced soybean genome is being analyzed to establish the anchor points between pigeonpea and soybean genomes. In addition, Solexa sequencing is being used to explore the feasibility of generating whole genome sequence. In summary, the collaborative efforts of several research groups under the umbrella of PGI are making significant progress in improving molecular tools in pigeonpea and should significantly benefit pigeonpea genetics and breeding. As these efforts come to fruition, and expanded (depending on funding), pigeonpea would move from an ‘orphan legume crop’ to one where genomics-assisted breeding approaches for a sustainable crop improvement are routine. PMID:20976284
Identification and mapping of conserved ortholog set(COS) II sequences of cacao and their conversion to SNP markers for marker-assisted selection in Theobroma cocoa and comparative genomics studies

USDA-ARS?s Scientific Manuscript database

Theobroma cacao is a tree cultivated in the tropics around the world for its seeds that are the source of both chocolate and cocoa butter. The cacao genome sequencing project initiated as a collaboration between USDA, Mars, Inc. and IBM has generated a great deal of transcriptome and genome sequenc...
Draft Genome Sequences of Six Lactobacillus pentosus Strains Isolated from Brines of Traditionally Fermented Spanish-Style Green Table Olives.

PubMed

Calero-Delgado, Beatriz; Martín-Platero, Antonio M; Pérez-Pulido, Antonio J; Benítez-Cabello, Antonio; Casimiro-Soriguer, Carlos S; Martínez-Bueno, Manuel; Arroyo-López, Francisco Noé; Rodríguez-Gómez, Francisco; Bautista-Gallego, Joaquín; Garrido-Fernández, Antonio; Jiménez-Díaz, Rufino

2018-05-03

Here, we report the genome sequences of six Lactobacillus pentosus strains isolated from traditional noninoculated Spanish-style green table olive brines. The total genome sizes varied between 3.77 and 4.039 Mbp. These genome sequences will assist in revealing the genes responsible for both technological and probiotic properties of these strains. Copyright © 2018 Calero-Delgado et al.
RNase H-assisted RNA-primed rolling circle amplification for targeted RNA sequence detection.

PubMed

Takahashi, Hirokazu; Ohkawachi, Masahiko; Horio, Kyohei; Kobori, Toshiro; Aki, Tsunehiro; Matsumura, Yukihiko; Nakashimada, Yutaka; Okamura, Yoshiko

2018-05-17

RNA-primed rolling circle amplification (RPRCA) is a useful laboratory method for RNA detection; however, the detection of RNA is limited by the lack of information on 3'-terminal sequences. We uncovered that conventional RPRCA using pre-circularized probes could potentially detect the internal sequence of target RNA molecules in combination with RNase H. However, the specificity for mRNA detection was low, presumably due to non-specific hybridization of non-target RNA with the circular probe. To overcome this technical problem, we developed a method for detecting a sequence of interest in target RNA molecules via RNase H-assisted RPRCA using padlocked probes. When padlock probes are hybridized to the target RNA molecule, they are converted to the circular form by SplintR ligase. Subsequently, RNase H creates nick sites only in the hybridized RNA sequence, and single-stranded DNA is finally synthesized from the nick site by phi29 DNA polymerase. This method could specifically detect at least 10 fmol of the target RNA molecule without reverse transcription. Moreover, this method detected GFP mRNA present in 10 ng of total RNA isolated from Escherichia coli without background DNA amplification. Therefore, this method can potentially detect almost all types of RNA molecules without reverse transcription and reveal full-length sequence information.
Sequence for the Training of Eye-Hand Coordination Needed for the Organization of Handwriting Tasks

ERIC Educational Resources Information Center

Trester, Mary Fran

1971-01-01

Suggested is a sequence of 11 class activities, progressing from gross to fine motor skills, to assist the development of skills required to perform handwriting tasks successfully, for use particularly with children who lack fine motor control and eye-hand coordination. (KW)
Achievements and prospects of genomics-assisted breeding in three legume crops of the semi-arid tropics

USDA-ARS?s Scientific Manuscript database

Advances in sequencing and genotyping technologies have enabled generation of several thousand markers including SSRs, SNPs, DArTs, hundreds of thousands transcript reads and BAC-end sequences in chickpea, pigeonpea and groundnut, three major legume crops of the semi-arid tropics. Comprehensive tran...
Whole genome sequencing of elite rice cultivars as a comprehensive information resource for marker assisted selection

USDA-ARS?s Scientific Manuscript database

Current advances in sequencing technologies and bioinformatics allow to determine a nearly complete genomic background of rice, a staple food for the poor people. Consequently, comprehensive databases of variation among thousands of varieties is currently being assembled and released. Proper analysi...
Accelerating public sector rice breeding with high-density KASP markers derived from whole genome sequencing of indica rice.

PubMed

Steele, Katherine A; Quinton-Tulloch, Mark J; Amgai, Resham B; Dhakal, Rajeev; Khatiwada, Shambhu P; Vyas, Darshna; Heine, Martin; Witcombe, John R

2018-01-01

Few public sector rice breeders have the capacity to use NGS-derived markers in their breeding programmes despite rapidly expanding repositories of rice genome sequence data. They rely on > 18,000 mapped microsatellites (SSRs) for marker-assisted selection (MAS) using gel analysis. Lack of knowledge about target SNP and InDel variant loci has hampered the uptake by many breeders of Kompetitive allele-specific PCR (KASP), a proprietary technology of LGC genomics that can distinguish alleles at variant loci. KASP is a cost-effective single-step genotyping technology, cheaper than SSRs and more flexible than genotyping by sequencing (GBS) or array-based genotyping when used in selection programmes. Before this study, there were 2015 rice KASP marker loci in the public domain, mainly identified by array-based screening, leaving large proportions of the rice genome with no KASP coverage. Here we have addressed the urgent need for a wide choice of appropriate rice KASP assays and demonstrated that NGS can detect many more KASP to give full genome coverage. Through re-sequencing of nine indica rice breeding lines or released varieties, this study has identified 2.5 million variant sites. Stringent filtering of variants generated 1.3 million potential KASP assay designs, including 92,500 potential functional markers. This strategy delivers a 650-fold increase in potential selectable KASP markers at a density of 3.1 per 1 kb in the indica crosses analysed and 377,178 polymorphic KASP design sites on average per cross. This knowledge is available to breeders and has been utilised to improve the efficiency of public sector breeding in Nepal, enabling identification of polymorphic KASP at any region or quantitative trait loci in relevant crosses. Validation of 39 new KASP was carried out by genotyping progeny from a range of crosses to show that they detected segregating alleles. The new KASP have replaced SSRs to aid trait selection during marker-assisted backcrossing in these crosses, where target traits include rice blast and BLB resistance loci. Furthermore, we provide the software for plant breeders to generate KASP designs from their own datasets.
Sialotranscriptomics of Rhipicephalus zambeziensis reveals intricate expression profiles of secretory proteins and suggests tight temporal transcriptional regulation during blood-feeding.

PubMed

de Castro, Minique Hilda; de Klerk, Daniel; Pienaar, Ronel; Rees, D Jasper G; Mans, Ben J

2017-08-10

Ticks secrete a diverse mixture of secretory proteins into the host to evade its immune response and facilitate blood-feeding, making secretory proteins attractive targets for the production of recombinant anti-tick vaccines. The largely neglected tick species, Rhipicephalus zambeziensis, is an efficient vector of Theileria parva in southern Africa but its available sequence information is limited. Next generation sequencing has advanced sequence availability for ticks in recent years and has assisted the characterisation of secretory proteins. This study focused on the de novo assembly and annotation of the salivary gland transcriptome of R. zambeziensis and the temporal expression of secretory protein transcripts in female and male ticks, before the onset of feeding and during early and late feeding. The sialotranscriptome of R. zambeziensis yielded 23,631 transcripts from which 13,584 non-redundant proteins were predicted. Eighty-six percent of these contained a predicted start and stop codon and were estimated to be putatively full-length proteins. A fifth (2569) of the predicted proteins were annotated as putative secretory proteins and explained 52% of the expression in the transcriptome. Expression analyses revealed that 2832 transcripts were differentially expressed among feeding time points and 1209 between the tick sexes. The expression analyses further indicated that 57% of the annotated secretory protein transcripts were differentially expressed. Dynamic expression profiles of secretory protein transcripts were observed during feeding of female ticks. Whereby a number of transcripts were upregulated during early feeding, presumably for feeding site establishment and then during late feeding, 52% of these were downregulated, indicating that transcripts were required at specific feeding stages. This suggested that secretory proteins are under stringent transcriptional regulation that fine-tunes their expression in salivary glands during feeding. No open reading frames were predicted for 7947 transcripts. This class represented 17% of the differentially expressed transcripts, suggesting a potential transcriptional regulatory function of long non-coding RNA in tick blood-feeding. The assembled sialotranscriptome greatly expands the sequence availability of R. zambeziensis, assists in our understanding of the transcription of secretory proteins during blood-feeding and will be a valuable resource for future vaccine candidate selection.
Whole-Genome Sequences of Thirteen Isolates of Borrelia burgdorferi

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schutzer S. E.; Dunn J.; Fraser-Liggett, C. M.

2011-02-01

Borrelia burgdorferi is a causative agent of Lyme disease in North America and Eurasia. The first complete genome sequence of B. burgdorferi strain 31, available for more than a decade, has assisted research on the pathogenesis of Lyme disease. Because a single genome sequence is not sufficient to understand the relationship between genotypic and geographic variation and disease phenotype, we determined the whole-genome sequences of 13 additional B. burgdorferi isolates that span the range of natural variation. These sequences should allow improved understanding of pathogenesis and provide a foundation for novel detection, diagnosis, and prevention strategies.
HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing.

PubMed

Wan, Shixiang; Zou, Quan

2017-01-01

Multiple sequence alignment (MSA) plays a key role in biological sequence analyses, especially in phylogenetic tree construction. Extreme increase in next-generation sequencing results in shortage of efficient ultra-large biological sequence alignment approaches for coping with different sequence types. Distributed and parallel computing represents a crucial technique for accelerating ultra-large (e.g. files more than 1 GB) sequence analyses. Based on HAlign and Spark distributed computing system, we implement a highly cost-efficient and time-efficient HAlign-II tool to address ultra-large multiple biological sequence alignment and phylogenetic tree construction. The experiments in the DNA and protein large scale data sets, which are more than 1GB files, showed that HAlign II could save time and space. It outperformed the current software tools. HAlign-II can efficiently carry out MSA and construct phylogenetic trees with ultra-large numbers of biological sequences. HAlign-II shows extremely high memory efficiency and scales well with increases in computing resource. THAlign-II provides a user-friendly web server based on our distributed computing infrastructure. HAlign-II with open-source codes and datasets was established at http://lab.malab.cn/soft/halign.
A next-generation marker genotyping platform (AmpSeq) in heterozygous crops: A case study for marker assisted selection in grapevine

USDA-ARS?s Scientific Manuscript database

Marker assisted selection (MAS) has become widely used in perennial crop breeding programs to accelerate and enhance cultivar development via selection during the juvenile phase and parental selection prior to crossing. Next generation sequencing (NGS) has been widely used for whole genome molecular...

A next-generation marker genotyping platform (AmpSeq) in heterozygous crops: a case study for marker assisted selection in grapevine

USDA-ARS?s Scientific Manuscript database

Marker assisted selection (MAS) is often employed in crop breeding programs to accelerate and enhance cultivar development, via selection during the juvenile phase and parental selection prior to crossing. Next generation sequencing (NGS) and its derivative technologies have been used for genome-wid...
NHEXAS PHASE I ARIZONA STUDY--STANDARD OPERATING PROCEDURE FOR LABORATORY ASSISTANT TRAINING PLAN--GENERAL (UA-T-6.0)

EPA Science Inventory

The purpose of this SOP is to describe the training sequence of incoming student laboratory assistants. The procedure is designed to provide them with an overview of the project in terms of project goals, structure, and laboratory needs. This overview familiarizes the student l...
NHEXAS PHASE I ARIZONA STUDY--STANDARD OPERATING PROCEDURE FOR STUDENT DATA ASSISTANT TRAINING PLAN (UA-T-5.0)

EPA Science Inventory

The purpose of this SOP is to describe the training sequence for incoming student data assistants (students). The procedure is designed to provide them with an overview of the study in terms of study goals, structure and project data needs. This overview familiarizes the studen...
Construction of a high-density genetic map for grape using specific length amplified fragment (SLAF) sequencing

PubMed Central

Guo, Yinshan; Xing, Huiyang; Zhao, Yuhui; Liu, Zhendong; Li, Kun; Guo, Xiuwu

2017-01-01

Genetic maps are important tools in plant genomics and breeding. We report a large-scale discovery of single nucleotide polymorphisms (SNPs) using the specific length amplified fragment sequencing (SLAF-seq) technique for the construction of high-density genetic maps for two elite wine grape cultivars, ‘Chardonnay’ and ‘Beibinghong’, and their 130 F1 plants. A total of 372.53 M paired-end reads were obtained after preprocessing. The average sequencing depth was 33.81 for ‘Chardonnay’ (the female parent), 48.20 for ‘Beibinghong’ (the male parent), and 12.66 for the F1 offspring. We detected 202,349 high-quality SLAFs of which 144,972 were polymorphic; 10,042 SNPs were used to construct a genetic map that spanned 1,969.95 cM, with an average genetic distance of 0.23 cM between adjacent markers. This genetic map contains the largest molecular marker number of the grape maps so far reported. We thus demonstrate that SLAF-seq is a promising strategy for the construction of high-density genetic maps; the map that we report here is a good potential resource for QTL mapping of genes linked to major economic and agronomic traits, map-based cloning, and marker-assisted selection of grape. PMID:28746364
Identification of Giardia species and Giardia duodenalis assemblages by sequence analysis of the 5.8S rDNA gene and internal transcribed spacers.

PubMed

Cacciò, Simone M; Beck, Relja; Almeida, Andre; Bajer, Anna; Pozio, Edoardo

2010-05-01

PCR assays have been developed mainly to assist investigations into the epidemiology of Giardia duodenalis, the only species in the Giardia genus having zoonotic potential. However, a reliable identification of all species is of practical importance, particularly when water samples and samples from wild animals are investigated. The aim of the present work was to genotype Giardia species and G. duodenalis assemblages using as a target the region spanning the 5.8S gene and the 2 flanking internal transcribed spacers (ITS1 and ITS2) of the ribosomal gene. Primers were designed to match strongly conserved regions in the 3' end of the small subunit and in the 5' end of the large subunit ribosomal genes. The corresponding region (about 310 bp) was amplified from 49 isolates of both human and animal origin, representing all G. duodenalis assemblages as well as G. muris and G. microti. Sequence comparison and phylogenetic analysis showed that G. ardeae, G. muris, G. microti as well as the 7 G. duodenalis assemblages can be easily distinguished. Since the major subgroups within the zoonotic assemblages A and B can be identified by sequence analysis, this assay is also informative for molecular epidemiological studies.
Optical mapping and its potential for large-scale sequencing projects.

PubMed

Aston, C; Mishra, B; Schwartz, D C

1999-07-01

Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.
[MALDI-TOF mass spectrometry in the investigation of large high-molecular biological compounds].

PubMed

Porubl'ova, L V; Rebriiev, A V; Hromovyĭ, T Iu; Minia, I I; Obolens'ka, M Iu

2009-01-01

MALDI-TOF (Matrix-Assisted Laser Desorption/Ionization Time-of-Flight) mass spectrometry has become, in the recent years, a tool of choice for analyses of biological polymers. The wide mass range, high accuracy, informativity and sensitivity make it a superior method for analysis of all kinds of high-molecular biological compounds including proteins, nucleic acids and lipids. MALDI-TOF-MS is particularly suitable for the identification of proteins by mass fingerprint or microsequencing. Therefore it has become an important technique of proteomics. Furthermore, the method allows making a detailed analysis of post-translational protein modifications, protein-protein and protein-nucleic acid interactions. Recently, the method was also successfully applied to nucleic acid sequencing as well as screening for mutations.
A new trajectory concept for exploring the earth's geomagnetic tail

NASA Technical Reports Server (NTRS)

Farquhar, R. W.; Dunham, D. W.

1981-01-01

An innovative trajectory technique for a magnetotail mapping mission is described which can control the apsidal rotation of an elliptical earth orbit and keep its apogee segment inside the tail region. The required apsidal rotation rate of approximately 1 deg/day is achieved by using the moon to carry out a prescribed sequence of gravity-assist maneuvers. Apogee distances are alternately raised and lowered by the lunar-swingby maneuvers; several categories of the 'sun-synchronous' swingby trajectories are identified. The strength and flexibility of the new trajectory concept is demonstrated by using real-world simulations showing that a large variety of trajectory shapes can be used to explore the earth's geomagnetic tail between 60 and 250 R sub E.
Intragenic sequences in the trophectoderm harbour the greatest proportion of methylation errors in day 17 bovine conceptuses generated using assisted reproductive technologies.

PubMed

O'Doherty, Alan M; McGettigan, Paul; Irwin, Rachelle E; Magee, David A; Gagne, Dominic; Fournier, Eric; Al-Naib, Abdullah; Sirard, Marc-André; Walsh, Colum P; Robert, Claude; Fair, Trudee

2018-06-05

Assisted reproductive technologies (ART) are widely used to treat fertility issues in humans and for the production of embryos in mammalian livestock. The use of these techniques, however, is not without consequence as they are often associated with inauspicious pre- and postnatal outcomes including premature birth, intrauterine growth restriction and increased incidence of epigenetic disorders in human and large offspring syndrome in cattle. Here, global DNA methylation profiles in the trophectoderm and embryonic discs of in vitro produced (IVP), superovulation-derived (SOV) and unstimulated, synchronised control day 17 bovine conceptuses (herein referred to as AI) were interrogated using the EmbryoGENE DNA Methylation Array (EDMA). Pyrosequencing was used to validate four loci identified as differentially methylated on the array and to assess the differentially methylated regions (DMRs) of six imprinted genes in these conceptuses. The impact of embryo-production induced DNA methylation aberrations was determined using Ingenuity Pathway Analysis, shedding light on the potential functional consequences of these differences. Of the total number of differentially methylated loci identified (3140) 77.3 and 22.7% were attributable to SOV and IVP, respectively. Differential methylation was most prominent at intragenic sequences within the trophectoderm of IVP and SOV-derived conceptuses, almost a third (30.8%) of the differentially methylated loci mapped to intragenic regions. Very few differentially methylated loci were detected in embryonic discs (ED); 0.16 and 4.9% of the differentially methylated loci were located in the ED of SOV-derived and IVP conceptuses, respectively. The overall effects of SOV and IVP on the direction of methylation changes were associated with increased methylation; 70.6% of the differentially methylated loci in SOV-derived conceptuses and 57.9% of the loci in IVP-derived conceptuses were more methylated compared to AI-conceptuses. Ontology analysis of probes associated with intragenic sequences suggests enrichment for terms associated with cancer, cell morphology and growth. By examining (1) the effects of superovulation and (2) the effects of an in vitro system (oocyte maturation, fertilisation and embryo culture) we have identified that the assisted reproduction process of superovulation alone has the largest impact on the DNA methylome of subsequent embryos.
Road Maps for Learning: A Bird's Eye View

ERIC Educational Resources Information Center

Dunne, Timothy T.

2011-01-01

The notion of the road map, advocated by Black, Wilson, and Yao (2011), and the associated minutiae of the construct map have several powerful features. At one level these notions assist the teacher to select and embody a suitable sequence of constructs within a specified curriculum. Whatever disparate sequenced pathways individual learners may…
MGA trajectory planning with an ACO-inspired algorithm

NASA Astrophysics Data System (ADS)

Ceriotti, Matteo; Vasile, Massimiliano

2010-11-01

Given a set of celestial bodies, the problem of finding an optimal sequence of swing-bys, deep space manoeuvres (DSM) and transfer arcs connecting the elements of the set is combinatorial in nature. The number of possible paths grows exponentially with the number of celestial bodies. Therefore, the design of an optimal multiple gravity assist (MGA) trajectory is a NP-hard mixed combinatorial-continuous problem. Its automated solution would greatly improve the design of future space missions, allowing the assessment of a large number of alternative mission options in a short time. This work proposes to formulate the complete automated design of a multiple gravity assist trajectory as an autonomous planning and scheduling problem. The resulting scheduled plan will provide the optimal planetary sequence and a good estimation of the set of associated optimal trajectories. The trajectory model consists of a sequence of celestial bodies connected by two-dimensional transfer arcs containing one DSM. For each transfer arc, the position of the planet and the spacecraft, at the time of arrival, are matched by varying the pericentre of the preceding swing-by, or the magnitude of the launch excess velocity, for the first arc. For each departure date, this model generates a full tree of possible transfers from the departure to the destination planet. Each leaf of the tree represents a planetary encounter and a possible way to reach that planet. An algorithm inspired by ant colony optimization (ACO) is devised to explore the space of possible plans. The ants explore the tree from departure to destination adding one node at the time: every time an ant is at a node, a probability function is used to select a feasible direction. This approach to automatic trajectory planning is applied to the design of optimal transfers to Saturn and among the Galilean moons of Jupiter. Solutions are compared to those found through more traditional genetic-algorithm techniques.
A New Targeted CFTR Mutation Panel Based on Next-Generation Sequencing Technology.

PubMed

Lucarelli, Marco; Porcaro, Luigi; Biffignandi, Alice; Costantino, Lucy; Giannone, Valentina; Alberti, Luisella; Bruno, Sabina Maria; Corbetta, Carlo; Torresani, Erminio; Colombo, Carla; Seia, Manuela

2017-09-01

Searching for mutations in the cystic fibrosis transmembrane conductance regulator gene (CFTR) is a key step in the diagnosis of and neonatal and carrier screening for cystic fibrosis (CF), and it has implications for prognosis and personalized therapy. The large number of mutations and genetic and phenotypic variability make this search a complex task. Herein, we developed, validated, and tested a laboratory assay for an extended search for mutations in CFTR using a next-generation sequencing-based method, with a panel of 188 CFTR mutations customized for the Italian population. Overall, 1426 dried blood spots from neonatal screening, 402 genomic DNA samples from various origins, and 1138 genomic DNA samples from patients with CF were analyzed. The assay showed excellent analytical and diagnostic operative characteristics. We identified and experimentally validated 159 (of 188) CFTR mutations. The assay achieved detection rates of 95.0% and 95.6% in two large-scale case series of CF patients from central and northern Italy, respectively. These detection rates are among the highest reported so far with a genetic test for CF based on a mutation panel. This assay appears to be well suited for diagnostics, neonatal and carrier screening, and assisted reproduction, and it represents a considerable advantage in CF genetic counseling. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
An Expressed Sequence Tag (EST)-enriched genetic map of turbot (Scophthalmus maximus): a useful framework for comparative genomics across model and farmed teleosts

PubMed Central

2012-01-01

Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps represent essential genomic screening tools allowing to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone to develop marker-assisted selection (MAS) programs in aquaculture. Expressed sequenced tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will turn out to be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered at 24 linkage groups. Linkage and comparative genomics evidences suggested additional linkage group fusions toward the consolidation of turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and aided to assign unallocated markers to specific turbot linkage groups. Conclusions The new gene-enriched high-resolution turbot map represents a useful genomic tool for QTL identification, positional cloning strategies, and future genome assembling. This map showed large synteny conservation against model teleost genomes. Comparative genomics and data mining from landmarks will provide straightforward access to candidate genes, which will be the basis for genetic breeding programs and evolutionary studies in this species. PMID:22747677
Genomics-assisted breeding for boosting crop improvement in pigeonpea (Cajanus cajan)

PubMed Central

Pazhamala, Lekha; Saxena, Rachit K.; Singh, Vikas K.; Sameerkumar, C. V.; Kumar, Vinay; Sinha, Pallavi; Patel, Kishan; Obala, Jimmy; Kaoneka, Seleman R.; Tongoona, P.; Shimelis, Hussein A.; Gangarao, N. V. P. R.; Odeny, Damaris; Rathore, Abhishek; Dharmaraj, P. S.; Yamini, K. N.; Varshney, Rajeev K.

2015-01-01

Pigeonpea is an important pulse crop grown predominantly in the tropical and sub-tropical regions of the world. Although pigeonpea growing area has considerably increased, yield has remained stagnant for the last six decades mainly due to the exposure of the crop to various biotic and abiotic constraints. In addition, low level of genetic variability and limited genomic resources have been serious impediments to pigeonpea crop improvement through modern breeding approaches. In recent years, however, due to the availability of next generation sequencing and high-throughput genotyping technologies, the scenario has changed tremendously. The reduced sequencing costs resulting in the decoding of the pigeonpea genome has led to the development of various genomic resources including molecular markers, transcript sequences and comprehensive genetic maps. Mapping of some important traits including resistance to Fusarium wilt and sterility mosaic disease, fertility restoration, determinacy with other agronomically important traits have paved the way for applying genomics-assisted breeding (GAB) through marker assisted selection as well as genomic selection (GS). This would accelerate the development and improvement of both varieties and hybrids in pigeonpea. Particularly for hybrid breeding programme, mitochondrial genomes of cytoplasmic male sterile (CMS) lines, maintainers and hybrids have been sequenced to identify genes responsible for cytoplasmic male sterility. Furthermore, several diagnostic molecular markers have been developed to assess the purity of commercial hybrids. In summary, pigeonpea has become a genomic resources-rich crop and efforts have already been initiated to integrate these resources in pigeonpea breeding. PMID:25741349
Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database.

PubMed

Thompson, Bryony A; Spurdle, Amanda B; Plazzer, John-Paul; Greenblatt, Marc S; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P; Farrington, Susan M; Frayling, Ian M; Frebourg, Thierry; Goldgar, David E; Heinen, Christopher D; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J; Sijmons, Rolf; Tavtigian, Sean V; Tops, Carli M; Weber, Thomas; Wijnen, Juul; Woods, Michael O; Macrae, Finlay; Genuardi, Maurizio

2014-02-01

The clinical classification of hereditary sequence variants identified in disease-related genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch syndrome-associated genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist in variant classification and was recognized through microattribution. The scheme was refined by multidisciplinary expert committee review of the clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants that were not obviously protein truncating from nomenclature. This large-scale endeavor will facilitate the consistent management of families suspected to have Lynch syndrome and demonstrates the value of multidisciplinary collaboration in the curation and classification of variants in public locus-specific databases.
KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis.

PubMed

Hastreiter, Maximilian; Jeske, Tim; Hoser, Jonathan; Kluge, Michael; Ahomaa, Kaarin; Friedl, Marie-Sophie; Kopetzky, Sebastian J; Quell, Jan-Dominik; Mewes, H Werner; Küffner, Robert

2017-05-15

Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processing of NGS data, and corresponding solutions become substantially more reproducible and easier to maintain. Here, we present a documented, linux-based, toolbox of 42 processing modules that are combined to construct workflows facilitating a variety of tasks such as DNAseq and RNAseq analysis. We also describe important technical extensions. The high throughput executor (HTE) helps to increase the reliability and to reduce manual interventions when processing complex datasets. We also provide a dedicated binary manager that assists users in obtaining the modules' executables and keeping them up to date. As basis for this actively developed toolbox we use the workflow management software KNIME. See http://ibisngs.github.io/knime4ngs for nodes and user manual (GPLv3 license). robert.kueffner@helmholtz-muenchen.de. Supplementary data are available at Bioinformatics online.
Molecular Architecture of Full-length TRF1 Favors Its Interaction with DNA.

PubMed

Boskovic, Jasminka; Martinez-Gago, Jaime; Mendez-Pertuz, Marinela; Buscato, Alberto; Martinez-Torrecuadrada, Jorge Luis; Blasco, Maria A

2016-10-07

Telomeres are specific DNA-protein structures found at both ends of eukaryotic chromosomes that protect the genome from degradation and from being recognized as double-stranded breaks. In vertebrates, telomeres are composed of tandem repeats of the TTAGGG sequence that are bound by a six-subunit complex called shelterin. Molecular mechanisms of telomere functions remain unknown in large part due to lack of structural data on shelterins, shelterin complex, and its interaction with the telomeric DNA repeats. TRF1 is one of the best studied shelterin components; however, the molecular architecture of the full-length protein remains unknown. We have used single-particle electron microscopy to elucidate the structure of TRF1 and its interaction with telomeric DNA sequence. Our results demonstrate that full-length TRF1 presents a molecular architecture that assists its interaction with telometic DNA and at the same time makes TRFH domains accessible to other TRF1 binding partners. Furthermore, our studies suggest hypothetical models on how other proteins as TIN2 and tankyrase contribute to regulate TRF1 function. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular Architecture of Full-length TRF1 Favors Its Interaction with DNA*

PubMed Central

Boskovic, Jasminka; Martinez-Gago, Jaime; Mendez-Pertuz, Marinela; Buscato, Alberto; Martinez-Torrecuadrada, Jorge Luis; Blasco, Maria A.

2016-01-01

Telomeres are specific DNA-protein structures found at both ends of eukaryotic chromosomes that protect the genome from degradation and from being recognized as double-stranded breaks. In vertebrates, telomeres are composed of tandem repeats of the TTAGGG sequence that are bound by a six-subunit complex called shelterin. Molecular mechanisms of telomere functions remain unknown in large part due to lack of structural data on shelterins, shelterin complex, and its interaction with the telomeric DNA repeats. TRF1 is one of the best studied shelterin components; however, the molecular architecture of the full-length protein remains unknown. We have used single-particle electron microscopy to elucidate the structure of TRF1 and its interaction with telomeric DNA sequence. Our results demonstrate that full-length TRF1 presents a molecular architecture that assists its interaction with telometic DNA and at the same time makes TRFH domains accessible to other TRF1 binding partners. Furthermore, our studies suggest hypothetical models on how other proteins as TIN2 and tankyrase contribute to regulate TRF1 function. PMID:27563064
Advances in QTL Mapping in Pigs

PubMed Central

Rothschild, Max F.; Hu, Zhi-liang; Jiang, Zhihua

2007-01-01

Over the past 15 years advances in the porcine genetic linkage map and discovery of useful candidate genes have led to valuable gene and trait information being discovered. Early use of exotic breed crosses and now commercial breed crosses for quantitative trait loci (QTL) scans and candidate gene analyses have led to 110 publications which have identified 1,675 QTL. Additionally, these studies continue to identify genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. A well developed QTL database called PigQTLdb is now as a valuable tool for summarizing and pinpointing in silico regions of interest to researchers. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve traits of economic performance. The long awaited sequencing efforts are also now beginning to provide sequence available for both comparative genomics and large scale single nucleotide polymorphism (SNP) association studies. While these advances are all positive, development of useful new trait families and measurement of new or underlying traits still limits future discoveries. A review of these developments is presented. PMID:17384738
Application of a five-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants lodged on the InSiGHT locus-specific database

PubMed Central

Plazzer, John-Paul; Greenblatt, Marc S.; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T.; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P.; Farrington, Susan M.; Frayling, Ian M.; Frebourg, Thierry; Goldgar, David E.; Heinen, Christopher D.; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J.; Sijmons, Rolf; Tavtigian, Sean V.; Tops, Carli M.; Weber, Thomas; Wijnen, Juul; Woods, Michael O.; Macrae, Finlay; Genuardi, Maurizio

2015-01-01

Clinical classification of sequence variants identified in hereditary disease genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch Syndrome genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist variant classification, and recognized by microattribution. The scheme was refined by multidisciplinary expert committee review of clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants not obviously protein-truncating from nomenclature. This large-scale endeavor will facilitate consistent management of suspected Lynch Syndrome families, and demonstrates the value of multidisciplinary collaboration for curation and classification of variants in public locus-specific databases. PMID:24362816

De novo assembly and characterization of bark transcriptome using Illumina sequencing and development of EST-SSR markers in rubber tree (Hevea brasiliensis Muell. Arg.)

PubMed Central

2012-01-01

Background In rubber tree, bark is one of important agricultural and biological organs. However, the molecular mechanism involved in the bark formation and development in rubber tree remains largely unknown, which is at least partially due to lack of bark transcriptomic and genomic information. Therefore, it is necessary to carried out high-throughput transcriptome sequencing of rubber tree bark to generate enormous transcript sequences for the functional characterization and molecular marker development. Results In this study, more than 30 million sequencing reads were generated using Illumina paired-end sequencing technology. In total, 22,756 unigenes with an average length of 485 bp were obtained with de novo assembly. The similarity search indicated that 16,520 and 12,558 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 6,867 and 5,559 unigenes were separately assigned to Gene Ontology (GO) and Clusters of Orthologous Group (COG). When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 12,097 unigenes were assigned to 5 main categories including 123 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (9,043, 74.75%), suggesting the active metabolic processes in rubber tree bark. In addition, a total of 39,257 EST-SSRs were identified from 22,756 unigenes, and the characterizations of EST-SSRs were further analyzed in rubber tree. 110 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among 13 Hevea germplasms, PCR success rate and polymorphism rate of 110 markers were separately 96.36% and 55.45% in this study. Conclusion By assembling and analyzing de novo transcriptome sequencing data, we reported the comprehensive functional characterization of rubber tree bark. This research generated a substantial fraction of rubber tree transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation, and microarrays development in rubber tree. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding in rubber tree. Moreover, this study also supported that transcriptome analysis based on Illumina paired-end sequencing is a powerful tool for transcriptome characterization and molecular marker development in non-model species, especially those with large and complex genomes. PMID:22607098
Comparative genomics of Lupinus angustifolius gene-rich regions: BAC library exploration, genetic mapping and cytogenetics

PubMed Central

2013-01-01

Background The narrow-leafed lupin, Lupinus angustifolius L., is a grain legume species with a relatively compact genome. The species has 2n = 40 chromosomes and its genome size is 960 Mbp/1C. During the last decade, L. angustifolius genomic studies have achieved several milestones, such as molecular-marker development, linkage maps, and bacterial artificial chromosome (BAC) libraries. Here, these resources were integratively used to identify and sequence two gene-rich regions (GRRs) of the genome. Results The genome was screened with a probe representing the sequence of a microsatellite fragment length polymorphism (MFLP) marker linked to Phomopsis stem blight resistance. BAC clones selected by hybridization were subjected to restriction fingerprinting and contig assembly, and 232 BAC-ends were sequenced and annotated. BAC fluorescence in situ hybridization (BAC-FISH) identified eight single-locus clones. Based on physical mapping, cytogenetic localization, and BAC-end annotation, five clones were chosen for sequencing. Within the sequences of clones that hybridized in FISH to a single-locus, two large GRRs were identified. The GRRs showed strong and conserved synteny to Glycine max duplicated genome regions, illustrated by both identical gene order and parallel orientation. In contrast, in the clones with dispersed FISH signals, more than one-third of sequences were transposable elements. Sequenced, single-locus clones were used to develop 12 genetic markers, increasing the number of L. angustifolius chromosomes linked to appropriate linkage groups by five pairs. Conclusions In general, probes originating from MFLP sequences can assist genome screening and gene discovery. However, such probes are not useful for positional cloning, because they tend to hybridize to numerous loci. GRRs identified in L. angustifolius contained a low number of interspersed repeats and had a high level of synteny to the genome of the model legume G. max. Our results showed that not only was the gene nucleotide sequence conserved between soybean and lupin GRRs, but the order and orientation of particular genes in syntenic blocks was homologous, as well. These findings will be valuable to the forthcoming sequencing of the lupin genome. PMID:23379841
Variability in Reading Ability Gains as a Function of Computer-Assisted Instruction Method of Presentation

ERIC Educational Resources Information Center

Johnson, Erin Phinney; Perry, Justin; Shamir, Haya

2010-01-01

This study examines the effects on early reading skills of three different methods of presenting material with computer-assisted instruction (CAI): (1) learner-controlled picture menu, which allows the student to choose activities, (2) linear sequencer, which progresses the students through lessons at a pre-specified pace, and (3) mastery-based…
U.S.-MEXICO BORDER PROGRAM ARIZONA BORDER STUDY--STANDARD OPERATING PROCEDURE FOR LABORATORY ASSISTANT TRAINING PLAN--GENERAL (UA-T-6.0)

EPA Science Inventory

The purpose of this SOP is to describe the training sequence of incoming student laboratory assistants. The procedure is designed to provide them with an overview of the project in terms of project goals, structure, and laboratory needs. This overview familiarizes the student l...
U.S.-MEXICO BORDER PROGRAM ARIZONA BORDER STUDY--STANDARD OPERATING PROCEDURE FOR GENERAL TRAINING OF STUDENT DATA ASSISTANTS (UA-T-5.0)

EPA Science Inventory

The purpose of this SOP is to describe the training sequence for incoming student data assistants (students). The procedure is designed to provide them with an overview of the study in terms of study goals, structure and project data needs. This overview familiarizes the studen...
Genome and transcriptome sequencing characterises the gene space of Macadamia integrifolia (Proteaceae).

PubMed

Nock, Catherine J; Baten, Abdul; Barkla, Bronwyn J; Furtado, Agnelo; Henry, Robert J; King, Graham J

2016-11-17

The large Gondwanan plant family Proteaceae is an early-diverging eudicot lineage renowned for its morphological, taxonomic and ecological diversity. Macadamia is the most economically important Proteaceae crop and represents an ancient rainforest-restricted lineage. The family is a focus for studies of adaptive radiation due to remarkable species diversification in Mediterranean-climate biodiversity hotspots, and numerous evolutionary transitions between biomes. Despite a long history of research, comparative analyses in the Proteaceae and macadamia breeding programs are restricted by a paucity of genetic information. To address this, we sequenced the genome and transcriptome of the widely grown Macadamia integrifolia cultivar 741. Over 95 gigabases of DNA and RNA-seq sequence data were de novo assembled and annotated. The draft assembly has a total length of 518 Mb and spans approximately 79% of the estimated genome size. Following annotation, 35,337 protein-coding genes were predicted of which over 90% were expressed in at least one of the leaf, shoot or flower tissues examined. Gene family comparisons with five other eudicot species revealed 13,689 clusters containing macadamia genes and 1005 macadamia-specific clusters, and provides evidence for linage-specific expansion of gene families involved in pathogen recognition, plant defense and monoterpene synthesis. Cyanogenesis is an important defense strategy in the Proteaceae, and a detailed analysis of macadamia gene homologues potentially involved in cyanogenic glycoside biosynthesis revealed several highly expressed candidate genes. The gene space of macadamia provides a foundation for comparative genomics, gene discovery and the acceleration of molecular-assisted breeding. This study presents the first available genomic resources for the large basal eudicot family Proteaceae, access to most macadamia genes and opportunities to uncover the genetic basis of traits of importance for adaptation and crop improvement.
Magnetically Assisted Remote-controlled Endovascular Catheter for Interventional MR Imaging: In Vitro Navigation at 1.5 T versus X-ray Fluoroscopy

PubMed Central

Losey, Aaron D.; Lillaney, Prasheel; Martin, Alastair J.; Cooke, Daniel L.; Wilson, Mark W.; Thorne, Bradford R. H.; Sincic, Ryan S.; Arenson, Ronald L.; Saeed, Maythem

2014-01-01

Purpose To compare in vitro navigation of a magnetically assisted remote-controlled (MARC) catheter under real-time magnetic resonance (MR) imaging with manual navigation under MR imaging and standard x-ray guidance in endovascular catheterization procedures in an abdominal aortic phantom. Materials and Methods The 2-mm-diameter custom clinical-grade microcatheter prototype with a solenoid coil at the distal tip was deflected with a foot pedal actuator used to deliver 300 mA of positive or negative current. Investigators navigated the catheter into branch vessels in a custom cryogel abdominal aortic phantom. This was repeated under MR imaging guidance without magnetic assistance and under conventional x-ray fluoroscopy. MR experiments were performed at 1.5 T by using a balanced steady-state free precession sequence. The mean procedure times and percentage success data were determined and analyzed with a linear mixed-effects regression analysis. Results The catheter was clearly visible under real-time MR imaging. One hundred ninety-two (80%) of 240 turns were successfully completed with magnetically assisted guidance versus 144 (60%) of 240 turns with nonassisted guidance (P < .001) and 119 (74%) of 160 turns with standard x-ray guidance (P = .028). Overall mean procedure time was shorter with magnetically assisted than with nonassisted guidance under MR imaging (37 seconds ± 6 [standard error of the mean] vs 55 seconds ± 3, P < .001), and time was comparable between magnetically assisted and standard x-ray guidance (37 seconds ± 6 vs 44 seconds ± 3, P = .045). When stratified by angle of branch vessel, magnetic assistance was faster than nonassisted MR guidance at turns of 45°, 60°, and 75°. Conclusion In this study, a MARC catheter for endovascular navigation under real-time MR imaging guidance was developed and tested. For catheterization of branch vessels arising at large angles, magnetically assisted catheterization was faster than manual catheterization under MR imaging guidance and was comparable to standard x-ray guidance. © RSNA, 2014 Online supplemental material is available for this article. PMID:24533872
Automatic detection of pelvic lymph nodes using multiple MR sequences

NASA Astrophysics Data System (ADS)

Yan, Michelle; Lu, Yue; Lu, Renzhi; Requardt, Martin; Moeller, Thomas; Takahashi, Satoru; Barentsz, Jelle

2007-03-01

A system for automatic detection of pelvic lymph nodes is developed by incorporating complementary information extracted from multiple MR sequences. A single MR sequence lacks sufficient diagnostic information for lymph node localization and staging. Correct diagnosis often requires input from multiple complementary sequences which makes manual detection of lymph nodes very labor intensive. Small lymph nodes are often missed even by highly-trained radiologists. The proposed system is aimed at assisting radiologists in finding lymph nodes faster and more accurately. To the best of our knowledge, this is the first such system reported in the literature. A 3-dimensional (3D) MR angiography (MRA) image is employed for extracting blood vessels that serve as a guide in searching for pelvic lymph nodes. Segmentation, shape and location analysis of potential lymph nodes are then performed using a high resolution 3D T1-weighted VIBE (T1-vibe) MR sequence acquired by Siemens 3T scanner. An optional contrast-agent enhanced MR image, such as post ferumoxtran-10 T2*-weighted MEDIC sequence, can also be incorporated to further improve detection accuracy of malignant nodes. The system outputs a list of potential lymph node locations that are overlaid onto the corresponding MR sequences and presents them to users with associated confidence levels as well as their sizes and lengths in each axis. Preliminary studies demonstrates the feasibility of automatic lymph node detection and scenarios in which this system may be used to assist radiologists in diagnosis and reporting.
Quantitative mutant analysis of viral quasispecies by chip-based matrix-assisted laser desorption/ ionization time-of-flight mass spectrometry

PubMed Central

Amexis, Georgios; Oeth, Paul; Abel, Kenneth; Ivshina, Anna; Pelloquin, Francois; Cantor, Charles R.; Braun, Andreas; Chumakov, Konstantin

2001-01-01

RNA viruses exist as quasispecies, heterogeneous and dynamic mixtures of mutants having one or more consensus sequences. An adequate description of the genomic structure of such viral populations must include the consensus sequence(s) plus a quantitative assessment of sequence heterogeneities. For example, in quality control of live attenuated viral vaccines, the presence of even small quantities of mutants or revertants may indicate incomplete or unstable attenuation that may influence vaccine safety. Previously, we demonstrated the monitoring of oral poliovirus vaccine with the use of mutant analysis by PCR and restriction enzyme cleavage (MAPREC). In this report, we investigate genetic variation in live attenuated mumps virus vaccine by using both MAPREC and a platform (DNA MassArray) based on matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry. Mumps vaccines prepared from the Jeryl Lynn strain typically contain at least two distinct viral substrains, JL1 and JL2, which have been characterized by full length sequencing. We report the development of assays for characterizing sequence variants in these substrains and demonstrate their use in quantitative analysis of substrains and sequence variations in mixed virus cultures and mumps vaccines. The results obtained from both the MAPREC and MALDI-TOF methods showed excellent correlation. This suggests the potential utility of MALDI-TOF for routine quality control of live viral vaccines and for assessment of genetic stability and quantitative monitoring of genetic changes in other RNA viruses of clinical interest. PMID:11593021
Clostridium sphenoides Chronic Osteomyelitis Diagnosed Via Matrix-Assisted Laser Desorption Ionization Time of Flight Mass Spectrometry, Conflicting With 16S rRNA Sequencing but Confirmed by Whole Genome Sequencing.

PubMed

Perkins, Matthew J; Snesrud, Erik; McGann, Patrick; Duplessis, Christopher A

2017-01-01

We report a case of successful treatment of chronic osteomyelitis (emanating from contaminated soil exposure) caused by Clostridium sphenoides, an organism infrequently identified as a cause of human infection and more saliently osteomyelitis (only 1 reported case in the literature). Additional impetus for reporting this case resides in the insights gained regarding pathogen identification exploiting sophisticated molecular platforms coupled to traditional microbial culture-based methods. The fastidious nature of cultivating anaerobic organisms required initial attempts at 16S rRNA sequencing to identify a Clostridium species (Clostridium celerecrescens). However, on exploiting matrix-assisted laser desorption ionization time of flight (MALDI TOF) technology, C. sphenoides was identified, and confirmed on whole genome sequencing. The discrepancies noted in the varying platforms require vigilance to seek complementary testing for conflicting results. Although highly accurate, the MALDI TOF and 16S rRNA sequencing platforms are not immune to false identification particularly in differentiating closely related organisms. More germane, whole genome sequencing should be entertained when conflicting results are obtained from MALDI TOF and 16S rRNA sequencing. Precise species and/or strain level identification can be clinically relevant as antimicrobial sensitivity profiles may be discrepant between closely related species influencing clinical outcomes. Thus, it is incumbent on us to strive to acquire the correct species characterization when resources allow to dictate optimal treatment. Reprint & Copyright © 2017 Association of Military Surgeons of the U.S.
Kinetic Competition between Elongation Rate and Binding of NELF Controls Promoter Proximal Pausing

PubMed Central

Li, Jian; Liu, Yingyun; Rhee, Ho Sung; Ghosh, Saikat Kumar B.; Bai, Lu; Pugh, B. Franklin; Gilmour, David S.

2013-01-01

Summary Pausing of RNA polymerase II (Pol II) 20-60 bp downstream of transcription start sites is a major checkpoint during transcription in animal cells. Mechanisms that control pausing are largely unknown. We developed permanganate-ChIP-seq to evaluate the state of Pol II at promoters throughout the Drosophila genome, and a biochemical system that reconstitutes promoter-proximal pausing to define pausing mechanisms. Stable open complexes of Pol II are largely absent from the transcription start sites of most mRNA genes, but are present at snRNA genes and the highly transcribed heat shock genes following their induction. The location of the pause is influenced by the timing between when NELF loads onto Pol II and how fast Pol II escapes the promoter region. Our biochemical analysis reveals that the sequence-specific transcription factor, GAF, orchestrates efficient pausing by recruiting NELF to promoters before transcription initiation and by assisting in loading NELF onto Pol II after initiation. PMID:23746353
An electrooculogram-based binary saccade sequence classification (BSSC) technique for augmentative communication and control.

PubMed

Keegan, Johnalan; Burke, Edward; Condron, James

2009-01-01

In the field of assistive technology, the electrooculogram (EOG) can be used as a channel of communication and the basis of a man-machine interface. For many people with severe motor disabilities, simple actions such as changing the TV channel require assistance. This paper describes a method of detecting saccadic eye movements and the use of a saccade sequence classification algorithm to facilitate communication and control. Saccades are fast eye movements that occurs when a person's gaze jumps from one fixation point to another. The classification is based on pre-defined sequences of saccades, guided by a static visual template (e.g. a page or poster). The template, consisting of a table of symbols each having a clearly identifiable fixation point, is situated within view of the user. To execute a particular command, the user moves his or her gaze through a pre-defined path of eye movements. This results in a well-formed sequence of saccades which are translated into a command if a match is found in a library of predefined sequences. A coordinate transformation algorithm is applied to each candidate sequence of recorded saccades to mitigate the effect of changes in the user's position and orientation relative to the visual template. Upon recognition of a saccade sequence from the library, its associated command is executed. A preliminary experiment in which two subjects were instructed to perform a series of command sequences consisting of 8 different commands are presented in the final sections. The system is also shown to be extensible to facilitate convenient text entry via an alphabetic visual template.
Techniques for automatic large scale change analysis of temporal multispectral imagery

NASA Astrophysics Data System (ADS)

Mercovich, Ryan A.

Change detection in remotely sensed imagery is a multi-faceted problem with a wide variety of desired solutions. Automatic change detection and analysis to assist in the coverage of large areas at high resolution is a popular area of research in the remote sensing community. Beyond basic change detection, the analysis of change is essential to provide results that positively impact an image analyst's job when examining potentially changed areas. Present change detection algorithms are geared toward low resolution imagery, and require analyst input to provide anything more than a simple pixel level map of the magnitude of change that has occurred. One major problem with this approach is that change occurs in such large volume at small spatial scales that a simple change map is no longer useful. This research strives to create an algorithm based on a set of metrics that performs a large area search for change in high resolution multispectral image sequences and utilizes a variety of methods to identify different types of change. Rather than simply mapping the magnitude of any change in the scene, the goal of this research is to create a useful display of the different types of change in the image. The techniques presented in this dissertation are used to interpret large area images and provide useful information to an analyst about small regions that have undergone specific types of change while retaining image context to make further manual interpretation easier. This analyst cueing to reduce information overload in a large area search environment will have an impact in the areas of disaster recovery, search and rescue situations, and land use surveys among others. By utilizing a feature based approach founded on applying existing statistical methods and new and existing topological methods to high resolution temporal multispectral imagery, a novel change detection methodology is produced that can automatically provide useful information about the change occurring in large area and high resolution image sequences. The change detection and analysis algorithm developed could be adapted to many potential image change scenarios to perform automatic large scale analysis of change.
Integrated sequencing of exome and mRNA of large-sized single cells.

PubMed

Wang, Lily Yan; Guo, Jiajie; Cao, Wei; Zhang, Meng; He, Jiankui; Li, Zhoufang

2018-01-10

Current approaches of single cell DNA-RNA integrated sequencing are difficult to call SNPs, because a large amount of DNA and RNA is lost during DNA-RNA separation. Here, we performed simultaneous single-cell exome and transcriptome sequencing on individual mouse oocytes. Using microinjection, we kept the nuclei intact to avoid DNA loss, while retaining the cytoplasm inside the cell membrane, to maximize the amount of DNA and RNA captured from the single cell. We then conducted exome-sequencing on the isolated nuclei and mRNA-sequencing on the enucleated cytoplasm. For single oocytes, exome-seq can cover up to 92% of exome region with an average sequencing depth of 10+, while mRNA-sequencing reveals more than 10,000 expressed genes in enucleated cytoplasm, with similar performance for intact oocytes. This approach provides unprecedented opportunities to study DNA-RNA regulation, such as RNA editing at single nucleotide level in oocytes. In future, this method can also be applied to other large cells, including neurons, large dendritic cells and large tumour cells for integrated exome and transcriptome sequencing.
Developing genome-wide microsatellite markers of bamboo and their applications on molecular marker assisted taxonomy for accessions in the genus Phyllostachys.

PubMed

Zhao, Hansheng; Yang, Li; Peng, Zhenhua; Sun, Huayu; Yue, Xianghua; Lou, Yongfeng; Dong, Lili; Wang, Lili; Gao, Zhimin

2015-01-26

Morphology-based taxonomy via exiguously reproductive organ has severely limitation on bamboo taxonomy, mainly owing to infrequent and unpredictable flowering events of bamboo. Here, we present the first genome-wide analysis and application of microsatellites based on the genome of moso bamboo (Phyllostachys edulis) to assist bamboo taxonomy. Of identified 127,593 microsatellite repeat-motifs, the primers of 1,451 microsatellites were designed and 1,098 markers were physically mapped on the genome of moso bamboo. A total of 917 markers were successfully validated in 9 accessions with ~39.8% polymorphic potential. Retrieved from validated microsatellite markers, 23 markers were selected for polymorphic analysis among 78 accessions and 64 alleles were detected with an average of 2.78 alleles per primers. The cluster result indicated the majority of the accessions were consistent with their current taxonomic classification, confirming the suitability and effectiveness of the developed microsatellite markers. The variations of microsatellite marker in different species were confirmed by sequencing and in silico comparative genome mapping were investigated. Lastly, a bamboo microsatellites database (http://www.bamboogdb.org/ssr) was implemented to browse and search large information of bamboo microsatellites. Consequently, our results of microsatellite marker development are valuable for assisting bamboo taxonomy and investigating genomic studies in bamboo and related grass species.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Deep Sequencing to Identify the Causes of Viral Encephalitis

PubMed Central

Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.

2014-01-01

Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Streptomyces griseus streptomycin phosphotransferase: expression of its gene in Escherichia coli and sequence homology with other antibiotic phosphotransferases and with eukaryotic protein kinases.

PubMed

Lim, C K; Smith, M C; Petty, J; Baumberg, S; Wootton, J C

1989-12-01

The aphD gene of Streptomyces griseus, encoding a streptomycin 6-phosphotransferase (SPH), was sub-cloned in the pBR322-based expression vector pRK9 (which contains the Serratia marcescens trp promoter) with selection for expression of streptomycin resistance in Escherichia coli. Two hybrid plasmids, pCKL631 and pCKL711, were isolated which conferred resistance. Both contained a approximately 2 kbp fragment already suspected to include aphD. The properties of in vitro deletion derivatives of these plasmids were consistent with the presumed location of aphD. In vitro deletion of a sequence including most of the trp promoter largely, but not quite completely, abolished the ability of the plasmid to confer streptomycin resistance, confirming that expression was indeed principally from the trp promoter. A polypeptide of approximately 34.5 kDa was present in minicells containing plasmids that conferred streptomycin resistance, but was absent when the plasmids contained in vitro deletions removing streptomycin resistance. Part of the fragment was sequenced and an open reading frame corresponding to aphD identified. A computer-assisted comparison of the deduced SPH sequence with those of other antibiotic phosphotransferases suggested a common structure A-B-C-D-E, where B and D were conserved between all sequences compared while A, C and E divided between the streptomycin and hygromycin B phosphotransferases on one hand and kanamycin/neomycin ones on the other. A composite sequence data base was searched for homologues to consensus matrices constructed from five approximately 12-residue subsequences within blocks B and D. For one subsequence, corresponding to the N-terminal portion of block D, those sequences from the database that yielded the highest homology scores comprised almost entirely either antibiotic phosphotransferases or eukaryotic protein kinases. Possible evolutionary implications of this homology, previously described by other groups, are discussed.
Generation of Aptamers from A Primer-Free Randomized ssDNA Library Using Magnetic-Assisted Rapid Aptamer Selection

NASA Astrophysics Data System (ADS)

Tsao, Shih-Ming; Lai, Ji-Ching; Horng, Horng-Er; Liu, Tu-Chen; Hong, Chin-Yih

2017-04-01

Aptamers are oligonucleotides that can bind to specific target molecules. Most aptamers are generated using random libraries in the standard systematic evolution of ligands by exponential enrichment (SELEX). Each random library contains oligonucleotides with a randomized central region and two fixed primer regions at both ends. The fixed primer regions are necessary for amplifying target-bound sequences by PCR. However, these extra-sequences may cause non-specific bindings, which potentially interfere with good binding for random sequences. The Magnetic-Assisted Rapid Aptamer Selection (MARAS) is a newly developed protocol for generating single-strand DNA aptamers. No repeat selection cycle is required in the protocol. This study proposes and demonstrates a method to isolate aptamers for C-reactive proteins (CRP) from a randomized ssDNA library containing no fixed sequences at 5‧ and 3‧ termini using the MARAS platform. Furthermore, the isolated primer-free aptamer was sequenced and binding affinity for CRP was analyzed. The specificity of the obtained aptamer was validated using blind serum samples. The result was consistent with monoclonal antibody-based nephelometry analysis, which indicated that a primer-free aptamer has high specificity toward targets. MARAS is a feasible platform for efficiently generating primer-free aptamers for clinical diagnoses.
The Aftermath of Remedial Math: Investigating the Low Rate of Certificate Completion among Remedial Math Students

ERIC Educational Resources Information Center

Bahr, Peter Riley

2013-01-01

Nationally, a majority of community college students require remedial assistance with mathematics, but comparatively few students who begin the remedial math sequence ultimately complete it and achieve college-level math competency. The academic outcomes of students who begin the sequence but do not complete it are disproportionately unfavorable:…

Genetic variation patterns of American chestnut populations at EST-SSRs

Treesearch

Oliver Gailing; C. Dana Nelson

2017-01-01

The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...
Characterization of wood decay enzymes by MALDI-MS for post-translational modification and gene identification.

Treesearch

Theodorus H. de Koker; Philip J. Kersten

2002-01-01

The recent sequencing of the Phanerochaete chrysosporium genome presents many opportunities, including the possibility of rapidly correlating specific wood decay proteins of the fungus with the corresponding gene sequences. Here we compare mass fragments of trypsin digests, determined by MALDI-MS (Matrix Assisted Laser Desorption Ionization-Mass Spectrometry), with...
Student Activity Ideas for the Technology Sequence Systems and Foundation Courses.

ERIC Educational Resources Information Center

New York State Education Dept., Albany.

This publication provides single-page outlines of brief ideas for high school student activities in each of the System and Foundation Courses of the New York State technology sequence. The idea outlines are provided as a resource to assist teachers in the development of student learning activities. The six courses for which ideas are presented are…
Sialoendoscopically assisted open sialolithectomy for removal of large submandibular hilar calculi.

PubMed

Su, Yu-xiong; Liao, Gui-qing; Zheng, Guang-sen; Liu, Hai-chao; Liang, Yu-jie; Ou, De-ming

2010-01-01

The management of large hilar calculi is a technically challenging issue during sialoendoscopic surgery. The aim of the present study was to evaluate the clinical efficacy of sialoendoscopically assisted open sialolithectomy for the removal of large submandibular hilar calculi to avoid sialoadenectomy. The present study was undertaken among patients with sialolithiasis scheduled for sialoendoscopic surgery from August 2005 to October 2008. When we failed to remove large submandibular hilar stones intraductally, we performed sialoendoscopically assisted open sialolithectomy. The clinical characteristics, pre- and intraoperative data, and outcomes were documented in a prospective fashion. Of 78 consecutive patients with submandibular sialolithiasis, 18 were treated with sialoendoscopically assisted open sialolithectomy immediately after failure of intraductal removal of calculi by sialoendoscopy. For 17 patients, large hilar sialoliths were successfully removed using this surgical technique. The surgery failed in 1 patient with multiple sialoliths, and the procedure was converted to open sialoadenectomy. Temporary numbness of the tongue for 1 week postoperatively was documented in 3 patients. The patients were followed up for a median period of 18 months without any symptoms or signs of recurrence. Our results suggest that sialoendoscopically assisted open sialolithectomy is an effective and safe surgical technique to remove large submandibular hilar calculi.
The Use of a Software-Assisted Method to Estimate Fetal Weight at and Near Term Using Magnetic Resonance Imaging.

PubMed

Kadji, Caroline; De Groof, Maxime; Camus, Margaux F; De Angelis, Riccardo; Fellas, Stéphanie; Klass, Magdalena; Cecotti, Vera; Dütemeyer, Vivien; Barakat, Elie; Cannie, Mieke M; Jani, Jacques C

2017-01-01

The aim of this study was to apply a semi-automated calculation method of fetal body volume and, thus, of magnetic resonance-estimated fetal weight (MR-EFW) prior to planned delivery and to evaluate whether the technique of measurement could be simplified while remaining accurate. MR-EFW was calculated using a semi-automated method at 38.6 weeks of gestation in 36 patients and compared to the picture archiving and communication system (PACS). Per patient, 8 sequences were acquired with a slice thickness of 4-8 mm and an intersection gap of 0, 4, 8, 12, 16, or 20 mm. The median absolute relative errors for MR-EFW and the time of planimetric measurements were calculated for all 8 sequences and for each method (assisted vs. PACS), and the difference between the methods was calculated. The median delivery weight was 3,280 g. The overall median relative error for all 288 MR-EFW calculations was 2.4% using the semi-automated method and 2.2% for the PACS method. Measurements did not differ between the 8 sequences using the assisted method (p = 0.313) or the PACS (p = 0.118), while the time of planimetric measurement decreased significantly with a larger gap (p < 0.001) and in the assisted method compared to the PACS method (p < 0.01). Our simplified MR-EFW measurement showed a dramatic decrease in time of planimetric measurement without a decrease in the accuracy of weight estimates. © 2017 S. Karger AG, Basel.
Sequencing RNA by a combination of exonuclease digestion and uridine specific chemical cleavage using MALDI-TOF.

PubMed Central

Tolson, D A; Nicholson, N H

1998-01-01

The determination of DNA sequences by partial exonuclease digestion followed by Matrix-Assisted Laser Desorption Time of Flight Mass Spectrometry (MALDI-TOF) is a well established method. When the same procedure is applied to RNA, difficulties arise due to the small (1 Da) mass difference between the nucleotides U and C, which makes unambiguous assignment difficult using a MALDI-TOF instrument. Here we report our experiences with sequence specific endonucleases and chemical methods followed by MALDI-TOF to resolve these sequence ambiguities. We have found chemical methods superior to endonucleases both in terms of correct specificity and extent of sequence coverage. This methodology can be used in combination with exonuclease digestion to rapidly assign RNA sequences. PMID:9421498
Understanding Selective Downregulation of c-Myc Expression through Inhibition of General Transcription Regulators in Multiple Myeloma

DTIC Science & Technology

2015-06-01

Love, and S. Gupta at the Whitehead Genome Core for assistance with genome sequencing . This research was supported by NIH K08 HL105678, The Wat...efficient alignment of short DNA sequences to the human genome . Genome Bioi. 10, R25. LeRoy, G., Rickards, B., and Flint, S.J. (2008). The double...of the beginning. Nature reviews. Cancer 12, 818-834, doi:10.1038/nrc3410 (2012). 12 Kool, M. et al. Genome sequencing of SHH medulloblastoma
Comparison of Human and Guinea Pig Acetylcholinesterase Sequences and Rates of Oxime-Assisted Reactivation

DTIC Science & Technology

2010-01-01

of appropriate animal model systems. For OP poisoning, the guinea pig (Cavia porcellus) is a commonly used animal model because guinea pigs more...endogenous bioscavenger in vivo. Although guinea pigs historically have been used to test OP poisoning therapies, it has been found recently that guinea pig AChE...transcribed mRNA encoding guinea pig AChE, amplified the resulting cDNA, and sequenced this product. The nucleotide and deduced amino acid sequences of
Developing an e-learning resource for nurse airway assistants in the emergency department.

PubMed

Hersey, Peter; McAleer, Sean

2017-02-23

The aims of this project were to determine the required competencies for a nurse in the emergency department assisting with a rapid sequence induction of anaesthesia (RSI), and to produce a relevant e-learning resource. A three-round multidisciplinary Delphi process produced the following competencies: ability to describe the steps and sequence of events of an RSI, familiarity with the equipment used during an RSI, ability to recognise and help manage problems occurring during an RSI, ability to prepare for an RSI, ability to apply cricoid pressure, and understanding the modification of an RSI in special circumstances. An interactive e-learning package was produced and made available online. Twelve emergency department nurses took part in an evaluation of the e-learning package. All either agreed or strongly agreed that they had increased their knowledge and found the learning useful, and 11 out of 12 nurses reported being somewhat or very confident in the role of airway assistant following completion of the learning.
TEMPO-Assisted Free Radical-Initiated Peptide Sequencing Mass Spectrometry (FRIPS MS) in Q-TOF and Orbitrap Mass Spectrometers: Single-Step Peptide Backbone Dissociations in Positive Ion Mode

NASA Astrophysics Data System (ADS)

Jang, Inae; Lee, Sun Young; Hwangbo, Song; Kang, Dukjin; Lee, Hookeun; Kim, Hugh I.; Moon, Bongjin; Oh, Han Bin

2017-01-01

The present study demonstrates that one-step peptide backbone fragmentations can be achieved using the TEMPO [2-(2,2,6,6-tetramethyl piperidine-1-oxyl)]-assisted free radical-initiated peptide sequencing (FRIPS) mass spectrometry in a hybrid quadrupole time-of-flight (Q-TOF) mass spectrometer and a Q-Exactive Orbitrap instrument in positive ion mode, in contrast to two-step peptide fragmentation in an ion-trap mass spectrometer (reference Anal. Chem. 85, 7044-7051 (30)). In the hybrid Q-TOF and Q-Exactive instruments, higher collisional energies can be applied to the target peptides, compared with the low collisional energies applied by the ion-trap instrument. The higher energy deposition and the additional multiple collisions in the collision cell in both instruments appear to result in one-step peptide backbone dissociations in positive ion mode. This new finding clearly demonstrates that the TEMPO-assisted FRIPS approach is a very useful tool in peptide mass spectrometry research.
Electrospray-assisted laser desorption/ionization and tandem mass spectrometry of peptides and proteins.

PubMed

Peng, Ivory X; Shiea, Jentaie; Ogorzalek Loo, Rachel R; Loo, Joseph A

2007-01-01

We have constructed an electrospray-assisted laser desorption/ionization (ELDI) source which utilizes a nitrogen laser pulse to desorb intact molecules from matrix-containing sample solution droplets, followed by electrospray ionization (ESI) post-ionization. The ELDI source is coupled to a quadrupole ion trap mass spectrometer and allows sampling under ambient conditions. Preliminary data showed that ELDI produces ESI-like multiply charged peptides and proteins up to 29 kDa carbonic anhydrase and 66 kDa bovine albumin from single-protein solutions, as well as from complex digest mixtures. The generated multiply charged polypeptides enable efficient tandem mass spectrometric (MS/MS)-based peptide sequencing. ELDI-MS/MS of protein digests and small intact proteins was performed both by collisionally activated dissociation (CAD) and by nozzle-skimmer dissociation (NSD). ELDI-MS/MS may be a useful tool for protein sequencing analysis and top-down proteomics study, and may complement matrix-assisted laser desorption/ionization (MALDI)-based measurements. Copyright (c) 2007 John Wiley & Sons, Ltd.
Enabling large-scale next-generation sequence assembly with Blacklight

PubMed Central

Couger, M. Brian; Pipes, Lenore; Squina, Fabio; Prade, Rolf; Siepel, Adam; Palermo, Robert; Katze, Michael G.; Mason, Christopher E.; Blood, Philip D.

2014-01-01

Summary A variety of extremely challenging biological sequence analyses were conducted on the XSEDE large shared memory resource Blacklight, using current bioinformatics tools and encompassing a wide range of scientific applications. These include genomic sequence assembly, very large metagenomic sequence assembly, transcriptome assembly, and sequencing error correction. The data sets used in these analyses included uncategorized fungal species, reference microbial data, very large soil and human gut microbiome sequence data, and primate transcriptomes, composed of both short-read and long-read sequence data. A new parallel command execution program was developed on the Blacklight resource to handle some of these analyses. These results, initially reported previously at XSEDE13 and expanded here, represent significant advances for their respective scientific communities. The breadth and depth of the results achieved demonstrate the ease of use, versatility, and unique capabilities of the Blacklight XSEDE resource for scientific analysis of genomic and transcriptomic sequence data, and the power of these resources, together with XSEDE support, in meeting the most challenging scientific problems. PMID:25294974
Rapid Threat Organism Recognition Pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

2013-05-07

The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on qualitymore » and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.« less
Carbohydrate-binding module 74 is a novel starch-binding domain associated with large and multidomain α-amylase enzymes.

PubMed

Valk, Vincent; Lammerts van Bueren, Alicia; van der Kaaij, Rachel M; Dijkhuizen, Lubbert

2016-06-01

Microbacterium aurum B8.A is a bacterium that originates from a potato starch-processing plant and employs a GH13 α-amylase (MaAmyA) enzyme that forms pores in potato starch granules. MaAmyA is a large and multi-modular protein that contains a novel domain at its C terminus (Domain 2). Deletion of Domain 2 from MaAmyA did not affect its ability to degrade starch granules but resulted in a strong reduction in granular pore size. Here, we separately expressed and purified this Domain 2 in Escherichia coli and determined its likely function in starch pore formation. Domain 2 independently binds amylose, amylopectin, and granular starch but does not have any detectable catalytic (hydrolytic or oxidizing) activity on α-glucan substrates. Therefore, we propose that this novel starch-binding domain is a new carbohydrate-binding module (CBM), the first representative of family CBM74 that assists MaAmyA in efficient pore formation in starch granules. Protein sequence-based BLAST searches revealed that CBM74 occurs widespread, but in bacteria only, and is often associated with large and multi-domain α-amylases containing family CBM25 or CBM26 domains. CBM74 may specifically function in binding to granular starches to enhance the capability of α-amylase enzymes to degrade resistant starches (RSs). Interestingly, the majority of family CBM74 representatives are found in α-amylases originating from human gut-associated Bifidobacteria, where they may assist in resistant starch degradation. The CBM74 domain thus may have a strong impact on the efficiency of RS digestion in the mammalian gastrointestinal tract. © 2016 The Authors. The FEBS Journal published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.
Matrix-assisted laser desorption/ionization-time of flight mass spectrometry identification of large colony beta-hemolytic streptococci containing Lancefield groups A, C, and G.

PubMed

Jensen, Christian Salgård; Dam-Nielsen, Casper; Arpi, Magnus

2015-08-01

The aim of this study was to investigate whether large colony beta-hemolytic streptococci containing Lancefield groups A, C, and G can be adequately identified using matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-ToF). Previous studies show varying results, with an identification rate from below 50% to 100%. Large colony beta-hemolytic streptococci containing Lancefield groups A, C, and G isolated from blood cultures between January 1, 2007 and May 1, 2012 were included in the study. Isolates were identified to the species level using a combination of phenotypic characteristics and 16s rRNA sequencing. The isolates were subjected to MALDI-ToF analysis. We used a two-stage approach starting with the direct method. If no valid result was obtained we proceeded to an extraction protocol. Scores above 2 were considered valid identification at the species level. A total of 97 Streptococcus pyogenes, 133 Streptococcus dysgalactiae, and 2 Streptococcus canis isolates were tested; 94%, 66%, and 100% of S. pyogenes, S. dysgalactiae, and S. canis, respectively, were correctly identified by MALDI-ToF. In most instances when the isolates were not identified by MALDI-ToF this was because MALDI-ToF was unable to differentiate between S. pyogenes and S. dysgalactiae. By removing two S. pyogenes reference spectra from the MALDI-ToF database the proportion of correctly identified isolates increased to 96% overall. MALDI-ToF is a promising method for discriminating between S. dysgalactiae, S. canis, and S. equi, although more strains need to be tested to clarify this.
Assessment of subchondral bone marrow lesions in knee osteoarthritis by MRI: a comparison of fluid sensitive and contrast enhanced sequences.

PubMed

Nielsen, Flemming K; Egund, Niels; Jørgensen, Anette; Peters, David A; Jurik, Anne Grethe

2016-11-16

Bone marrow lesions (BMLs) in knee osteoarthritis (OA) can be assessed using fluid sensitive and contrast enhanced sequences. The association between BMLs and symptoms has been investigated in several studies but only using fluid sensitive sequences. Our aims were to assess BMLs by contrast enhanced MRI sequences in comparison with a fluid sensitive STIR sequence using two different segmentation methods and to analyze the association between the MR findings and disability and pain. Twenty-two patients (mean age 61 years, range 41-79 years) with medial femoro-tibial knee OA obtained MRI and filled out a WOMAC questionnaire at baseline and follow-up (median interval of 334 days). STIR, dynamic contrast enhanced-MRI (DCE-MRI) and fat saturated T1 post-contrast (T1 CE FS) MRI sequences were obtained. All STIR and T1 CE FS sequences were assessed independently by two readers for STIR-BMLs and contrast enhancing areas of BMLs (CEA-BMLs) using manual segmentation and computer assisted segmentation, and the measurements were compared. DCE-MRIs were assessed for the relative distribution of voxels with an inflammatory enhancement pattern, N voxel , in the bone marrow. All findings were compared to WOMAC scores, including pain and overall symptoms, and changes from baseline to follow-up were analyzed. The average volume of CEA-BML was smaller than the STIR-BML volume by manual segmentation. The opposite was found for computer assisted segmentation where the average CEA-BML volume was larger than the STIR-BML volume. The contradictory finding by computer assisted segmentation was partly caused by a number of outliers with an apparent generally increased signal intensity in the anterior parts of the femoral condyle and tibial plateau causing an overestimation of the CEA-BML volume. Both CEA-BML, STIR-BML and N voxel were significantly correlated with symptoms and to a similar degree. A significant reduction in total WOMAC score was seen at follow-up, but no significant changes were observed for either CEA-BML, STIR-BML or N voxel . Neither the degree nor the volume of contrast enhancement in BMLs seems to add any clinical information compared to BMLs visualized by fluid sensitive sequences. Manual segmentation may be needed to obtain valid CEA-BML measurements.
Manager's assistant systems for space system planning

NASA Technical Reports Server (NTRS)

Bewley, William L.; Burnard, Robert; Edwards, Gary E.; Shoop, James

1992-01-01

This paper describes a class of knowledge-based 'assistant' systems for space system planning. Derived from technology produced for the DARPA/USAF Pilot's Associate program, these assistant systems help the human planner by doing the bookkeeping to maintain plan data and executing the procedures and heuristics currently used by the human planner to define, assess, diagnose, and revise plans. Intelligent systems for Space Station Freedom assembly sequence planning and Advanced Launch System modeling will be presented as examples. Ongoing NASA-funded work on a framework supporting the development of such tools will also be described.
Characterizing protein conformations by correlation analysis of coarse-grained contact matrices.

PubMed

Lindsay, Richard J; Siess, Jan; Lohry, David P; McGee, Trevor S; Ritchie, Jordan S; Johnson, Quentin R; Shen, Tongye

2018-01-14

We have developed a method to capture the essential conformational dynamics of folded biopolymers using statistical analysis of coarse-grained segment-segment contacts. Previously, the residue-residue contact analysis of simulation trajectories was successfully applied to the detection of conformational switching motions in biomolecular complexes. However, the application to large protein systems (larger than 1000 amino acid residues) is challenging using the description of residue contacts. Also, the residue-based method cannot be used to compare proteins with different sequences. To expand the scope of the method, we have tested several coarse-graining schemes that group a collection of consecutive residues into a segment. The definition of these segments may be derived from structural and sequence information, while the interaction strength of the coarse-grained segment-segment contacts is a function of the residue-residue contacts. We then perform covariance calculations on these coarse-grained contact matrices. We monitored how well the principal components of the contact matrices is preserved using various rendering functions. The new method was demonstrated to assist the reduction of the degrees of freedom for describing the conformation space, and it potentially allows for the analysis of a system that is approximately tenfold larger compared with the corresponding residue contact-based method. This method can also render a family of similar proteins into the same conformational space, and thus can be used to compare the structures of proteins with different sequences.
Characterizing protein conformations by correlation analysis of coarse-grained contact matrices

NASA Astrophysics Data System (ADS)

Lindsay, Richard J.; Siess, Jan; Lohry, David P.; McGee, Trevor S.; Ritchie, Jordan S.; Johnson, Quentin R.; Shen, Tongye

2018-01-01

We have developed a method to capture the essential conformational dynamics of folded biopolymers using statistical analysis of coarse-grained segment-segment contacts. Previously, the residue-residue contact analysis of simulation trajectories was successfully applied to the detection of conformational switching motions in biomolecular complexes. However, the application to large protein systems (larger than 1000 amino acid residues) is challenging using the description of residue contacts. Also, the residue-based method cannot be used to compare proteins with different sequences. To expand the scope of the method, we have tested several coarse-graining schemes that group a collection of consecutive residues into a segment. The definition of these segments may be derived from structural and sequence information, while the interaction strength of the coarse-grained segment-segment contacts is a function of the residue-residue contacts. We then perform covariance calculations on these coarse-grained contact matrices. We monitored how well the principal components of the contact matrices is preserved using various rendering functions. The new method was demonstrated to assist the reduction of the degrees of freedom for describing the conformation space, and it potentially allows for the analysis of a system that is approximately tenfold larger compared with the corresponding residue contact-based method. This method can also render a family of similar proteins into the same conformational space, and thus can be used to compare the structures of proteins with different sequences.
High-Throughput Development of SSR Markers from Pea (Pisum sativum L.) Based on Next Generation Sequencing of a Purified Chinese Commercial Variety

PubMed Central

Zhang, Xiaoyan; Hu, Jinguo; Bao, Shiying; Hao, Junjie; Li, Ling; He, Yuhua; Jiang, Junye; Wang, Fang; Tian, Shufang; Zong, Xuxiao

2015-01-01

Pea (Pisum sativum L.) is an important food legume globally, and is the plant species that J.G. Mendel used to lay the foundation of modern genetics. However, genomics resources of pea are limited comparing to other crop species. Application of marker assisted selection (MAS) in pea breeding has lagged behind many other crops. Development of a large number of novel and reliable SSR (simple sequence repeat) or microsatellite markers will help both basic and applied genomics research of this crop. The Illumina HiSeq 2500 System was used to uncover 8,899 putative SSR containing sequences, and 3,275 non-redundant primers were designed to amplify these SSRs. Among the 1,644 SSRs that were randomly selected for primer validation, 841 yielded reliable amplifications of detectable polymorphisms among 24 genotypes of cultivated pea (Pisum sativum L.) and wild relatives (P. fulvum Sm.) originated from diverse geographical locations. The dataset indicated that the allele number per locus ranged from 2 to 10, and that the polymorphism information content (PIC) ranged from 0.08 to 0.82 with an average of 0.38. These 1,644 novel SSR markers were also tested for polymorphism between genotypes G0003973 and G0005527. Finally, 33 polymorphic SSR markers were anchored on the genetic linkage map of G0003973 × G0005527 F2 population. PMID:26440522

Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology.

PubMed

Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui

2017-01-01

Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
mzStudio: A Dynamic Digital Canvas for User-Driven Interrogation of Mass Spectrometry Data.

PubMed

Ficarro, Scott B; Alexander, William M; Marto, Jarrod A

2017-08-01

Although not yet truly 'comprehensive', modern mass spectrometry-based experiments can generate quantitative data for a meaningful fraction of the human proteome. Importantly for large-scale protein expression analysis, robust data pipelines are in place for identification of un-modified peptide sequences and aggregation of these data to protein-level quantification. However, interoperable software tools that enable scientists to computationally explore and document novel hypotheses for peptide sequence, modification status, or fragmentation behavior are not well-developed. Here, we introduce mzStudio, an open-source Python module built on our multiplierz project. This desktop application provides a highly-interactive graphical user interface (GUI) through which scientists can examine and annotate spectral features, re-search existing PSMs to test different modifications or new spectral matching algorithms, share results with colleagues, integrate other domain-specific software tools, and finally create publication-quality graphics. mzStudio leverages our common application programming interface (mzAPI) for access to native data files from multiple instrument platforms, including ion trap, quadrupole time-of-flight, Orbitrap, matrix-assisted laser desorption ionization, and triple quadrupole mass spectrometers and is compatible with several popular search engines including Mascot, Proteome Discoverer, X!Tandem, and Comet. The mzStudio toolkit enables researchers to create a digital provenance of data analytics and other evidence that support specific peptide sequence assignments.
Teaching Algebra-Based Concepts to Students with Learning Disabilities: The Effects of Preteaching Using a Gradual Instructional Sequence

ERIC Educational Resources Information Center

Watt, Sarah Jean

2013-01-01

Research to identify validated instructional approaches to teach math to students with LD and those at-risk for failure in both core and supplemental instructional settings is necessary to assist teachers in closing the achievement gaps that exist across the country. The concrete-to-representational-to-abstract instructional sequence (CRA) has…
Bromine isotopic signature facilitates de novo sequencing of peptides in free-radical-initiated peptide sequencing (FRIPS) mass spectrometry.

PubMed

Nam, Jungjoo; Kwon, Hyuksu; Jang, Inae; Jeon, Aeran; Moon, Jingyu; Lee, Sun Young; Kang, Dukjin; Han, Sang Yun; Moon, Bongjin; Oh, Han Bin

2015-02-01

We recently showed that free-radical-initiated peptide sequencing mass spectrometry (FRIPS MS) assisted by the remarkable thermochemical stability of (2,2,6,6-tetramethyl-piperidin-1-yl)oxyl (TEMPO) is another attractive radical-driven peptide fragmentation MS tool. Facile homolytic cleavage of the bond between the benzylic carbon and the oxygen of the TEMPO moiety in o-TEMPO-Bz-C(O)-peptide and the high reactivity of the benzylic radical species generated in •Bz-C(O)-peptide are key elements leading to extensive radical-driven peptide backbone fragmentation. In the present study, we demonstrate that the incorporation of bromine into the benzene ring, i.e. o-TEMPO-Bz(Br)-C(O)-peptide, allows unambiguous distinction of the N-terminal peptide fragments from the C-terminal fragments through the unique bromine doublet isotopic signature. Furthermore, bromine substitution does not alter the overall radical-driven peptide backbone dissociation pathways of o-TEMPO-Bz-C(O)-peptide. From a practical perspective, the presence of the bromine isotopic signature in the N-terminal peptide fragments in TEMPO-assisted FRIPS MS represents a useful and cost-effective opportunity for de novo peptide sequencing. Copyright © 2015 John Wiley & Sons, Ltd.
High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology

PubMed Central

Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M

2007-01-01

Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Introduction to Student Assistance Programs August, 1988. Alcohol and Drug Defense Program Substance Abuse Intervention 10-Hour Training Module.

ERIC Educational Resources Information Center

North Carolina State Dept. of Public Instruction, Raleigh.

A complete outline is presented of a 2-day, 10-hour training program for establishing a student assistance program dealing with the problems of alcohol and drug abuse. The sessions are presented in the following sequence: (1) registration and introductions; (2) presentation of the problem; (3) clarification of expectations and establishment of a…
Microwave-assisted domino and multicomponent reactions with cyclic acylketenes: expeditious syntheses of oxazinones and oxazindiones.

PubMed

Presset, Marc; Coquerel, Yoann; Rodriguez, Jean

2009-12-17

The microwave-assisted Wolff rearrangement of cyclic 2-diazo-1,3-diketones in the presence of aldehydes and primary amines provides a straightforward access to functionalized bi- and pentacyclic oxazinones following an unprecedented three-component domino reaction. Alternatively, in the presence of acyl azides, an efficient Curtius/Wolff/hetero-Diels-Alder sequence allows the direct synthesis of oxazindiones.
Rapid Identification of Cryptococcus neoformans and Cryptococcus gattii by Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry ▿

PubMed Central

McTaggart, Lisa R.; Lei, Eric; Richardson, Susan E.; Hoang, Linda; Fothergill, Annette; Zhang, Sean X.

2011-01-01

Compared to DNA sequence analysis, matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) correctly identified 100% of Cryptococcus species, distinguishing the notable pathogens Cryptococcus neoformans and C. gattii. Identification was greatly enhanced by supplementing a commercial spectral library with additional entries to account for subspecies variability. PMID:21653762
Changes in solvation during DNA binding and cleavage are critical to altered specificity of the EcoRI endonuclease

PubMed Central

Robinson, Clifford R.; Sligar, Stephen G.

1998-01-01

Restriction endonucleases such as EcoRI bind and cleave DNA with great specificity and represent a paradigm for protein–DNA interactions and molecular recognition. Using osmotic pressure to induce water release, we demonstrate the participation of bound waters in the sequence discrimination of substrate DNA by EcoRI. Changes in solvation can play a critical role in directing sequence-specific DNA binding by EcoRI and are also crucial in assisting site discrimination during catalysis. By measuring the volume change for complex formation, we show that at the cognate sequence (GAATTC) EcoRI binding releases about 70 fewer water molecules than binding at an alternate DNA sequence (TAATTC), which differs by a single base pair. EcoRI complexation with nonspecific DNA releases substantially less water than either of these specific complexes. In cognate substrates (GAATTC) kcat decreases as osmotic pressure is increased, indicating the binding of about 30 water molecules accompanies the cleavage reaction. For the alternate substrate (TAATTC), release of about 40 water molecules accompanies the reaction, indicated by a dramatic acceleration of the rate when osmotic pressure is raised. These large differences in solvation effects demonstrate that water molecules can be key players in the molecular recognition process during both association and catalytic phases of the EcoRI reaction, acting to change the specificity of the enzyme. For both the protein–DNA complex and the transition state, there may be substantial conformational differences between cognate and alternate sites, accompanied by significant alterations in hydration and solvent accessibility. PMID:9482860
Breeding-assisted genomics.

PubMed

Poland, Jesse

2015-04-01

The revolution of inexpensive sequencing has ushered in an unprecedented age of genomics. The promise of using this technology to accelerate plant breeding is being realized with a vision of genomics-assisted breeding that will lead to rapid genetic gain for expensive and difficult traits. The reality is now that robust phenotypic data is an increasing limiting resource to complement the current wealth of genomic information. While genomics has been hailed as the discipline to fundamentally change the scope of plant breeding, a more symbiotic relationship is likely to emerge. In the context of developing and evaluating large populations needed for functional genomics, none excel in this area more than plant breeders. While genetic studies have long relied on dedicated, well-structured populations, the resources dedicated to these populations in the context of readily available, inexpensive genotyping is making this philosophy less tractable relative to directly focusing functional genomics on material in breeding programs. Through shifting effort for basic genomic studies from dedicated structured populations, to capturing the entire scope of genetic determinants in breeding lines, we can move towards not only furthering our understanding of functional genomics in plants, but also rapidly improving crops for increased food security, availability and nutrition. Copyright © 2015 Elsevier Ltd. All rights reserved.
Knowledge management in secondary pharmaceutical manufacturing by mining of data historians-A proof-of-concept study.

PubMed

Meneghetti, Natascia; Facco, Pierantonio; Bezzo, Fabrizio; Himawan, Chrismono; Zomer, Simeone; Barolo, Massimiliano

2016-05-30

In this proof-of-concept study, a methodology is proposed to systematically analyze large data historians of secondary pharmaceutical manufacturing systems using data mining techniques. The objective is to develop an approach enabling to automatically retrieve operation-relevant information that can assist the management in the periodic review of a manufactory system. The proposed methodology allows one to automatically perform three tasks: the identification of single batches within the entire data-sequence of the historical dataset, the identification of distinct operating phases within each batch, and the characterization of a batch with respect to an assigned multivariate set of operating characteristics. The approach is tested on a six-month dataset of a commercial-scale granulation/drying system, where several millions of data entries are recorded. The quality of results and the generality of the approach indicate that there is a strong potential for extending the method to even larger historical datasets and to different operations, thus making it an advanced PAT tool that can assist the implementation of continual improvement paradigms within a quality-by-design framework. Copyright © 2016 Elsevier B.V. All rights reserved.
Using relational databases for improved sequence similarity searching and large-scale genomic analyses.

PubMed

Mackey, Aaron J; Pearson, William R

2004-10-01

Relational databases are designed to integrate diverse types of information and manage large sets of search results, greatly simplifying genome-scale analyses. Relational databases are essential for management and analysis of large-scale sequence analyses, and can also be used to improve the statistical significance of similarity searches by focusing on subsets of sequence libraries most likely to contain homologs. This unit describes using relational databases to improve the efficiency of sequence similarity searching and to demonstrate various large-scale genomic analyses of homology-related data. This unit describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. These include basic use of the database to generate a novel sequence library subset, how to extend and use seqdb_demo for the storage of sequence similarity search results and making use of various kinds of stored search results to address aspects of comparative genomic analysis.
Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index.

PubMed

Pandey, Prashant; Almodaresi, Fatemeh; Bender, Michael A; Ferdman, Michael; Johnson, Rob; Patro, Rob

2018-06-18

Sequence-level searches on large collections of RNA sequencing experiments, such as the NCBI Sequence Read Archive (SRA), would enable one to ask many questions about the expression or variation of a given transcript in a population. Existing approaches, such as the sequence Bloom tree, suffer from fundamental limitations of the Bloom filter, resulting in slow build and query times, less-than-optimal space usage, and potentially large numbers of false-positives. This paper introduces Mantis, a space-efficient system that uses new data structures to index thousands of raw-read experiments and facilitates large-scale sequence searches. In our evaluation, index construction with Mantis is 6× faster and yields a 20% smaller index than the state-of-the-art split sequence Bloom tree (SSBT). For queries, Mantis is 6-108× faster than SSBT and has no false-positives or -negatives. For example, Mantis was able to search for all 200,400 known human transcripts in an index of 2,652 RNA sequencing experiments in 82 min; SSBT took close to 4 days. Copyright © 2018 Elsevier Inc. All rights reserved.
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.

PubMed

Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin

2015-02-03

Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
Complete sequence determination of a novel reptile iridovirus isolated from soft-shelled turtle and evolutionary analysis of Iridoviridae

PubMed Central

Huang, Youhua; Huang, Xiaohong; Liu, Hong; Gong, Jie; Ouyang, Zhengliang; Cui, Huachun; Cao, Jianhao; Zhao, Yingtao; Wang, Xiujie; Jiang, Yulin; Qin, Qiwei

2009-01-01

Background Soft-shelled turtle iridovirus (STIV) is the causative agent of severe systemic diseases in cultured soft-shelled turtles (Trionyx sinensis). To our knowledge, the only molecular information available on STIV mainly concerns the highly conserved STIV major capsid protein. The complete sequence of the STIV genome is not yet available. Therefore, determining the genome sequence of STIV and providing a detailed bioinformatic analysis of its genome content and evolution status will facilitate further understanding of the taxonomic elements of STIV and the molecular mechanisms of reptile iridovirus pathogenesis. Results We determined the complete nucleotide sequence of the STIV genome using 454 Life Science sequencing technology. The STIV genome is 105 890 bp in length with a base composition of 55.1% G+C. Computer assisted analysis revealed that the STIV genome contains 105 potential open reading frames (ORFs), which encode polypeptides ranging from 40 to 1,294 amino acids and 20 microRNA candidates. Among the putative proteins, 20 share homology with the ancestral proteins of the nuclear and cytoplasmic large DNA viruses (NCLDVs). Comparative genomic analysis showed that STIV has the highest degree of sequence conservation and a colinear arrangement of genes with frog virus 3 (FV3), followed by Tiger frog virus (TFV), Ambystoma tigrinum virus (ATV), Singapore grouper iridovirus (SGIV), Grouper iridovirus (GIV) and other iridovirus isolates. Phylogenetic analysis based on conserved core genes and complete genome sequence of STIV with other virus genomes was performed. Moreover, analysis of the gene gain-and-loss events in the family Iridoviridae suggested that the genes encoded by iridoviruses have evolved for favoring adaptation to different natural host species. Conclusion This study has provided the complete genome sequence of STIV. Phylogenetic analysis suggested that STIV and FV3 are strains of the same viral species belonging to the Ranavirus genus in the Iridoviridae family. Given virus-host co-evolution and the phylogenetic relationship among vertebrates from fish to reptiles, we propose that iridovirus might transmit between reptiles and amphibians and that STIV and FV3 are strains of the same viral species in the Ranavirus genus. PMID:19439104
Complete Taiwanese Macaque (Macaca cyclopis) Mitochondrial Genome: Reference-Assisted de novo Assembly with Multiple k-mer Strategy.

PubMed

Huang, Yu-Feng; Midha, Mohit; Chen, Tzu-Han; Wang, Yu-Tai; Smith, David Glenn; Pei, Kurtis Jai-Chyi; Chiu, Kuo Ping

2015-01-01

The Taiwanese (Formosan) macaque (Macaca cyclopis) is the only nonhuman primate endemic to Taiwan. This primate species is valuable for evolutionary studies and as subjects in medical research. However, only partial fragments of the mitochondrial genome (mitogenome) of this primate species have been sequenced, not mentioning its nuclear genome. We employed next-generation sequencing to generate 2 x 90 bp paired-end reads, followed by reference-assisted de novo assembly with multiple k-mer strategy to characterize the M. cyclopis mitogenome. We compared the assembled mitogenome with that of other macaque species for phylogenetic analysis. Our results show that, the M. cyclopis mitogenome consists of 16,563 nucleotides encoding for 13 protein-coding genes, 2 ribosomal RNAs and 22 transfer RNAs. Phylogenetic analysis indicates that M. cyclopis is most closely related to M. mulatta lasiota (Chinese rhesus macaque), supporting the notion of Asia-continental origin of M. cyclopis proposed in previous studies based on partial mitochondrial sequences. Our work presents a novel approach for assembling a mitogenome that utilizes the capabilities of de novo genome assembly with assistance of a reference genome. The availability of the complete Taiwanese macaque mitogenome will facilitate the study of primate evolution and the characterization of genetic variations for the potential usage of this species as a non-human primate model for medical research.
Functional Genomics Analysis of Singapore Grouper Iridovirus: Complete Sequence Determination and Proteomic Analysis

PubMed Central

Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong

2004-01-01

Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645
Molecular mapping and marker development for the Triticum dicoccoides-derived stripe rust resistance gene YrSM139-1B in bread wheat cv. Shaanmai 139.

PubMed

Zhang, Hong; Zhang, Lu; Wang, Changyou; Wang, Yajuan; Zhou, Xinli; Lv, Shikai; Liu, Xinlun; Kang, Zhensheng; Ji, Wanquan

2016-02-01

YrSM139-1B maybe a new gene for effective resistance to stripe rust and useful flanking markers for marker-assisted selection were developed. Stripe rust, caused by Puccinia striiformis f. sp. tritici, is an important foliar disease of wheat. Two dominant stripe rust resistant genes YrSM139-1B and YrSM139-2D were pyramided in bread wheat cultivar Shaanmai 139; one from wild emmer and the other from Thinopyrum intermedium. Three near-isogenic F7:8 line pairs (contrasting RILs), N122-1013R/S, N122-185R/S, and N122-1812R/S, independently derived from different F2 plants and differing at the YrSM139-1B locus were generated from the cross Shaanmai 139 × Hu 901-19 through marker-assisted selection. A large F2:3 population from cross N122-1013R × N122-1013S tested for stripe rust response and subjected to analysis with markers in the 1BS10-0.5 bin region using SSR expressed sequence tags (EST) and site-specific sequence markers developed from the 90 K Illumina iSelect SNP array. Five EST-STS markers and four allele-specific PCR markers were mapped to the YrSM139-1B region. The 30.5 cM genetic map for YrSM139-1B consisted of nine markers, two of which were closer to YrSM139-1B than Xgwm273, which was used in producing the contrasting RIL pairs. Race response data and allelism tests showed that YrSM139-1B is different from Yr10, Yr15, and Yr24/26/CH42.
National Weatherization Assistance Program Impact Evaluation: Energy Impacts for Large Multifamily Buildings

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blasnik, Michael; Dalhoff, Greg; Carroll, David

This report estimates energy savings, energy cost savings, and cost effectiveness attributable to weatherizing large multifamily buildings under the auspices of the Department of Energy's Weatherization Assistance Program during Program Year 2008.
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

PubMed Central

2009-01-01

Background Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416

Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

PubMed

Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

2009-08-06

Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.
PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

PubMed

Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

2016-01-01

PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels.The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.
Colorimetric detection of genetically modified organisms based on exonuclease III-assisted target recycling and hemin/G-quadruplex DNAzyme amplification.

PubMed

Zhang, Decai; Wang, Weijia; Dong, Qian; Huang, Yunxiu; Wen, Dongmei; Mu, Yuejing; Yuan, Yong

2017-12-21

An isothermal colorimetric method is described for amplified detection of the CaMV 35S promoter sequence in genetically modified organism (GMO). It is based on (a) target DNA-triggered unlabeled molecular beacon (UMB) termini binding, and (b) exonuclease III (Exo III)-assisted target recycling, and (c) hemin/G-quadruplex (DNAzyme) based signal amplification. The specific binding of target to the G-quadruplex sequence-locked UMB triggers the digestion of Exo III. This, in turn, releases an active G-quadruplex segment and target DNA for successive hybridization and cleavage. The Exo III impellent recycling of targets produces numerous G-quadruplex sequences. These further associate with hemin to form DNAzymes and hence will catalyze H 2 O 2 -mediated oxidation of the chromogenic enzyme substrate ABTS 2- causing the formation of a green colored product. This finding enables a sensitive colorimetric determination of GMO DNA (at an analytical wavelength of 420 nm) at concentrations as low as 0.23 nM. By taking advantage of isothermal incubation, this method does not require sophisticated equipment or complicated syntheses. Analyses can be performed within 90 min. The method also discriminates single base mismatches. In our perception, it has a wide scope in that it may be applied to the detection of many other GMOs. Graphical abstract An isothermal and sensitive colorimetric method is described for amplified detection of CaMV 35S promoter sequence in genetically modified organism (GMO). It is based on target DNA-triggered molecular beacon (UMB) termini-binding and exonuclease III assisted target recycling, and on hemin/G-quadruplex (DNAzyme) signal amplification.
Iraq: Reconstruction Assistance

DTIC Science & Technology

2007-06-25

Iraq: Reconstruction Assistance Summary A large-scale assistance program has been undertaken by the United States in Iraq since mid-2003. To date...28, 2004, the entity implementing assistance programs , the Coalition Provisional Authority (CPA), dissolved, and sovereignty was returned to Iraq. U.N...10 U.S. Assistance Policy and Program Structure . . . . . . . . . . . . . . . . . . . . . . . . . . 11 U.S. Reconstruction
Whole-Genome Sequences of Listeria monocytogenes Sequence Type 6 Isolates Associated with a Large Foodborne Outbreak in South Africa, 2017 to 2018

PubMed Central

Tau, Nomsa; Smouse, Shannon L.; Mtshali, Phillip S.; Mnyameni, Florah; Khumalo, Zamantungwa T. H.; Ismail, Arshad; Govender, Nevashan; Thomas, Juno

2018-01-01

ABSTRACT We report whole-genome sequences for 10 Listeria monocytogenes sequence type 6 isolates associated with a large listeriosis outbreak in South Africa, which occurred over the period of 2017 to 2018. The possibility of listeriosis spreading beyond South Africa’s borders as a result of exported contaminated food products prompted us to make the genome sequences publicly available. PMID:29930052
Ordered shotgun sequencing of a 135 kb Xq25 YAC containing ANT2 and four possible genes, including three confirmed by EST matches.

PubMed Central

Chen, C N; Su, Y; Baybayan, P; Siruno, A; Nagaraja, R; Mazzarella, R; Schlessinger, D; Chen, E

1996-01-01

Ordered shotgun sequencing (OSS) has been successfully carried out with an Xq25 YAC substrate. yWXD703 DNA was subcloned into lambda phage and sequences of insert ends of the lambda subclones were used to generate a map to select a minimum tiling path of clones to be completely sequenced. The sequence of 135 038 nt contains the entire ANT2 cDNA as well as four other candidates suggested by computer-assisted analyses. One of the putative genes is homologous to a gene implicated in Graves' disease and it, ANT2 and two others are confirmed by EST matches. The results suggest that OSS can be applied to YACs in accord with earlier simulations and further indicate that the sequence of the YAC accurately reflects the sequence of uncloned human DNA. PMID:8918809
GAAP: Genome-organization-framework-Assisted Assembly Pipeline for prokaryotic genomes.

PubMed

Yuan, Lina; Yu, Yang; Zhu, Yanmin; Li, Yulai; Li, Changqing; Li, Rujiao; Ma, Qin; Siu, Gilman Kit-Hang; Yu, Jun; Jiang, Taijiao; Xiao, Jingfa; Kang, Yu

2017-01-25

Next-generation sequencing (NGS) technologies have greatly promoted the genomic study of prokaryotes. However, highly fragmented assemblies due to short reads from NGS are still a limiting factor in gaining insights into the genome biology. Reference-assisted tools are promising in genome assembly, but tend to result in false assembly when the assigned reference has extensive rearrangements. Herein, we present GAAP, a genome assembly pipeline for scaffolding based on core-gene-defined Genome Organizational Framework (cGOF) described in our previous study. Instead of assigning references, we use the multiple-reference-derived cGOFs as indexes to assist in order and orientation of the scaffolds and build a skeleton structure, and then use read pairs to extend scaffolds, called local scaffolding, and distinguish between true and chimeric adjacencies in the scaffolds. In our performance tests using both empirical and simulated data of 15 genomes in six species with diverse genome size, complexity, and all three categories of cGOFs, GAAP outcompetes or achieves comparable results when compared to three other reference-assisted programs, AlignGraph, Ragout and MeDuSa. GAAP uses both cGOF and pair-end reads to create assemblies in genomic scale, and performs better than the currently available reference-assisted assembly tools as it recovers more assemblies and makes fewer false locations, especially for species with extensive rearranged genomes. Our method is a promising solution for reconstruction of genome sequence from short reads of NGS.
Oxidation-assisted graphene heteroepitaxy on copper foil.

PubMed

Reckinger, Nicolas; Tang, Xiaohui; Joucken, Frédéric; Lajaunie, Luc; Arenal, Raul; Dubois, Emmanuel; Hackens, Benoît; Henrard, Luc; Colomer, Jean-François

2016-11-10

We propose an innovative, easy-to-implement approach to synthesize aligned large-area single-crystalline graphene flakes by chemical vapor deposition on copper foil. This method doubly takes advantage of residual oxygen present in the gas phase. First, by slightly oxidizing the copper surface, we induce grain boundary pinning in copper and, in consequence, the freezing of the thermal recrystallization process. Subsequent reduction of copper under hydrogen suddenly unlocks the delayed reconstruction, favoring the growth of centimeter-sized copper (111) grains through the mechanism of abnormal grain growth. Second, the oxidation of the copper surface also drastically reduces the nucleation density of graphene. This oxidation/reduction sequence leads to the synthesis of aligned millimeter-sized monolayer graphene domains in epitaxial registry with copper (111). The as-grown graphene flakes are demonstrated to be both single-crystalline and of high quality.
Development and validation of a mixed-tissue oligonucleotide DNA microarray for Atlantic bluefin tuna, Thunnus thynnus (Linnaeus, 1758).

PubMed

Trumbić, Željka; Bekaert, Michaël; Taggart, John B; Bron, James E; Gharbi, Karim; Mladineo, Ivona

2015-11-25

The largest of the tuna species, Atlantic bluefin tuna (Thunnus thynnus), inhabits the North Atlantic Ocean and the Mediterranean Sea and is considered to be an endangered species, largely a consequence of overfishing. T. thynnus aquaculture, referred to as fattening or farming, is a capture based activity dependent on yearly renewal from the wild. Thus, the development of aquaculture practices independent of wild resources can provide an important contribution towards ensuring security and sustainability of this species in the longer-term. The development of such practices is today greatly assisted by large scale transcriptomic studies. We have used pyrosequencing technology to sequence a mixed-tissue normalised cDNA library, derived from adult T. thynnus. A total of 976,904 raw sequence reads were assembled into 33,105 unique transcripts having a mean length of 893 bases and an N50 of 870. Of these, 33.4% showed similarity to known proteins or gene transcripts and 86.6% of them were matched to the congeneric Pacific bluefin tuna (Thunnus orientalis) genome, compared to 70.3% for the more distantly related Nile tilapia (Oreochromis niloticus) genome. Transcript sequences were used to develop a novel 15 K Agilent oligonucleotide DNA microarray for T. thynnus and comparative tissue gene expression profiles were inferred for gill, heart, liver, ovaries and testes. Functional contrasts were strongest between gills and ovaries. Gills were particularly associated with immune system, signal transduction and cell communication, while ovaries displayed signatures of glycan biosynthesis, nucleotide metabolism, transcription, translation, replication and repair. Sequence data generated from a novel mixed-tissue T. thynnus cDNA library provide an important transcriptomic resource that can be further employed for study of various aspects of T. thynnus ecology and genomics, with strong applications in aquaculture. Tissue-specific gene expression profiles inferred through the use of novel oligo-microarray can serve in the design of new and more focused transcriptomic studies for future research of tuna physiology and assessment of the welfare in a production environment.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

PubMed

Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

PubMed Central

Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Whole-Genome Sequences of Listeria monocytogenes Sequence Type 6 Isolates Associated with a Large Foodborne Outbreak in South Africa, 2017 to 2018.

PubMed

Allam, Mushal; Tau, Nomsa; Smouse, Shannon L; Mtshali, Phillip S; Mnyameni, Florah; Khumalo, Zamantungwa T H; Ismail, Arshad; Govender, Nevashan; Thomas, Juno; Smith, Anthony M

2018-06-21

We report whole-genome sequences for 10 Listeria monocytogenes sequence type 6 isolates associated with a large listeriosis outbreak in South Africa, which occurred over the period of 2017 to 2018. The possibility of listeriosis spreading beyond South Africa's borders as a result of exported contaminated food products prompted us to make the genome sequences publicly available. Copyright © 2018 Allam et al.
Ultrasensitive detection of nucleic acids and proteins using quartz crystal microbalance and surface plasmon resonance sensors based on target-triggering multiple signal amplification strategy.

PubMed

Sun, Wenbo; Song, Weiling; Guo, Xiaoyan; Wang, Zonghua

2017-07-25

In this study, quartz crystal microbalance (QCM) and surface plasmon resonance (SPR) sensors were combined with template enhanced hybridization processes (TEHP), rolling circle amplification (RCA) and biocatalytic precipitation (BCP) for ultrasensitive detection of DNA and protein. The DNA complementary to the aptamer was released by the specific binding of the aptamer to the target protein and then hybridized with the capture probe and the assistant DNA to form a ternary "Y" junction structure. The initiation chain was generated by the template-enhanced hybridization process which leaded to the rolling circle amplification reaction, and a large number of repeating unit sequences were formed. Hybridized with the enzyme-labeled probes, the biocatalytic precipitation reaction was further carried out, resulting in a large amount of insoluble precipitates and amplifying the detection signal. Under the optimum conditions, detection limits as low as 43 aM for target DNA and 53 aM for lysozyme were achieved. In addition, this method also showed good selectivity and sensitivity in human serum. Copyright © 2017 Elsevier B.V. All rights reserved.
Biotechnology and apple breeding in Japan

PubMed Central

Igarashi, Megumi; Hatsuyama, Yoshimichi; Harada, Takeo; Fukasawa-Akada, Tomoko

2016-01-01

Apple is a fruit crop of significant economic importance, and breeders world wide continue to develop novel cultivars with improved characteristics. The lengthy juvenile period and the large field space required to grow apple populations have imposed major limitations on breeding. Various molecular biological techniques have been employed to make apple breeding easier. Transgenic technology has facilitated the development of apples with resistance to fungal or bacterial diseases, improved fruit quality, or root stocks with better rooting or dwarfing ability. DNA markers for disease resistance (scab, powdery mildew, fire-blight, Alternaria blotch) and fruit skin color have also been developed, and marker-assisted selection (MAS) has been employed in breeding programs. In the last decade, genomic sequences and chromosome maps of various cultivars have become available, allowing the development of large SNP arrays, enabling efficient QTL mapping and genomic selection (GS). In recent years, new technologies for genetic improvement, such as trans-grafting, virus vectors, and genome-editing, have emerged. Using these techniques, no foreign genes are present in the final product, and some of them show considerable promise for application to apple breeding. PMID:27069388
Biotechnology and apple breeding in Japan.

PubMed

Igarashi, Megumi; Hatsuyama, Yoshimichi; Harada, Takeo; Fukasawa-Akada, Tomoko

2016-01-01

Apple is a fruit crop of significant economic importance, and breeders world wide continue to develop novel cultivars with improved characteristics. The lengthy juvenile period and the large field space required to grow apple populations have imposed major limitations on breeding. Various molecular biological techniques have been employed to make apple breeding easier. Transgenic technology has facilitated the development of apples with resistance to fungal or bacterial diseases, improved fruit quality, or root stocks with better rooting or dwarfing ability. DNA markers for disease resistance (scab, powdery mildew, fire-blight, Alternaria blotch) and fruit skin color have also been developed, and marker-assisted selection (MAS) has been employed in breeding programs. In the last decade, genomic sequences and chromosome maps of various cultivars have become available, allowing the development of large SNP arrays, enabling efficient QTL mapping and genomic selection (GS). In recent years, new technologies for genetic improvement, such as trans-grafting, virus vectors, and genome-editing, have emerged. Using these techniques, no foreign genes are present in the final product, and some of them show considerable promise for application to apple breeding.
Several Families of Sequences with Low Correlation and Large Linear Span

NASA Astrophysics Data System (ADS)

Zeng, Fanxin; Zhang, Zhenyu

In DS-CDMA systems and DS-UWB radios, low correlation of spreading sequences can greatly help to minimize multiple access interference (MAI) and large linear span of spreading sequences can reduce their predictability. In this letter, new sequence sets with low correlation and large linear span are proposed. Based on the construction Trm1[Trnm(αbt+γiαdt)]r for generating p-ary sequences of period pn-1, where n=2m, d=upm±v, b=u±v, γi∈GF(pn), and p is an arbitrary prime number, several methods to choose the parameter d are provided. The obtained sequences with family size pn are of four-valued, five-valued, six-valued or seven-valued correlation and the maximum nontrivial correlation value is (u+v-1)pm-1. The simulation by a computer shows that the linear span of the new sequences is larger than that of the sequences with Niho-type and Welch-type decimations, and similar to that of [10].
Advances in DNA sequencing technologies for high resolution HLA typing.

PubMed

Cereb, Nezih; Kim, Hwa Ran; Ryu, Jaejun; Yang, Soo Young

2015-12-01

This communication describes our experience in large-scale G group-level high resolution HLA typing using three different DNA sequencing platforms - ABI 3730 xl, Illumina MiSeq and PacBio RS II. Recent advances in DNA sequencing technologies, so-called next generation sequencing (NGS), have brought breakthroughs in deciphering the genetic information in all living species at a large scale and at an affordable level. The NGS DNA indexing system allows sequencing multiple genes for large number of individuals in a single run. Our laboratory has adopted and used these technologies for HLA molecular testing services. We found that each sequencing technology has its own strengths and weaknesses, and their sequencing performances complement each other. HLA genes are highly complex and genotyping them is quite challenging. Using these three sequencing platforms, we were able to meet all requirements for G group-level high resolution and high volume HLA typing. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Large-aperture plasma-assisted deposition of inertial confinement fusion laser coatings.

PubMed

Oliver, James B; Kupinski, Pete; Rigatti, Amy L; Schmid, Ansgar W; Lambropoulos, John C; Papernov, Semyon; Kozlov, Alexei; Spaulding, John; Sadowski, Daniel; Chrzan, Z Roman; Hand, Robert D; Gibson, Desmond R; Brinkley, Ian; Placido, Frank

2011-03-20

Plasma-assisted electron-beam evaporation leads to changes in the crystallinity, density, and stresses of thin films. A dual-source plasma system provides stress control of large-aperture, high-fluence coatings used in vacuum for substrates 1m in aperture.
13 CFR 125.3 - Subcontracting assistance.

Code of Federal Regulations, 2011 CFR

2011-01-01

... other resources and tools; (4) Counseling small business concerns on how to market themselves to large... 13 Business Credit and Assistance 1 2011-01-01 2011-01-01 false Subcontracting assistance. 125.3 Section 125.3 Business Credit and Assistance SMALL BUSINESS ADMINISTRATION GOVERNMENT CONTRACTING PROGRAMS...
Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

PubMed

Kawaguchi, Risa; Kiryu, Hisanori

2016-05-06

RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets

PubMed Central

2013-01-01

Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software tools. Conclusions FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes. PMID:23441864
Using random forests for assistance in the curation of G-protein coupled receptor databases.

PubMed

Shkurin, Aleksei; Vellido, Alfredo

2017-08-18

Biology is experiencing a gradual but fast transformation from a laboratory-centred science towards a data-centred one. As such, it requires robust data engineering and the use of quantitative data analysis methods as part of database curation. This paper focuses on G protein-coupled receptors, a large and heterogeneous super-family of cell membrane proteins of interest to biology in general. One of its families, Class C, is of particular interest to pharmacology and drug design. This family is quite heterogeneous on its own, and the discrimination of its several sub-families is a challenging problem. In the absence of known crystal structure, such discrimination must rely on their primary amino acid sequences. We are interested not as much in achieving maximum sub-family discrimination accuracy using quantitative methods, but in exploring sequence misclassification behavior. Specifically, we are interested in isolating those sequences showing consistent misclassification, that is, sequences that are very often misclassified and almost always to the same wrong sub-family. Random forests are used for this analysis due to their ensemble nature, which makes them naturally suited to gauge the consistency of misclassification. This consistency is here defined through the voting scheme of their base tree classifiers. Detailed consistency results for the random forest ensemble classification were obtained for all receptors and for all data transformations of their unaligned primary sequences. Shortlists of the most consistently misclassified receptors for each subfamily and transformation, as well as an overall shortlist including those cases that were consistently misclassified across transformations, were obtained. The latter should be referred to experts for further investigation as a data curation task. The automatic discrimination of the Class C sub-families of G protein-coupled receptors from their unaligned primary sequences shows clear limits. This study has investigated in some detail the consistency of their misclassification using random forest ensemble classifiers. Different sub-families have been shown to display very different discrimination consistency behaviors. The individual identification of consistently misclassified sequences should provide a tool for quality control to GPCR database curators.
SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets.

PubMed

Yu, Qiang; Wei, Dingbang; Huo, Hongwei

2018-06-18

Given a set of t n-length DNA sequences, q satisfying 0 < q ≤ 1, and l and d satisfying 0 ≤ d < l < n, the quorum planted motif search (qPMS) finds l-length strings that occur in at least qt input sequences with up to d mismatches and is mainly used to locate transcription factor binding sites in DNA sequences. Existing qPMS algorithms have been able to efficiently process small standard datasets (e.g., t = 20 and n = 600), but they are too time consuming to process large DNA datasets, such as ChIP-seq datasets that contain thousands of sequences or more. We analyze the effects of t and q on the time performance of qPMS algorithms and find that a large t or a small q causes a longer computation time. Based on this information, we improve the time performance of existing qPMS algorithms by selecting a sample sequence set D' with a small t and a large q from the large input dataset D and then executing qPMS algorithms on D'. A sample sequence selection algorithm named SamSelect is proposed. The experimental results on both simulated and real data show (1) that SamSelect can select D' efficiently and (2) that the qPMS algorithms executed on D' can find implanted or real motifs in a significantly shorter time than when executed on D. We improve the ability of existing qPMS algorithms to process large DNA datasets from the perspective of selecting high-quality sample sequence sets so that the qPMS algorithms can find motifs in a short time in the selected sample sequence set D', rather than take an unfeasibly long time to search the original sequence set D. Our motif discovery method is an approximate algorithm.
Pms2 Suppresses Large Expansions of the (GAA·TTC)n Sequence in Neuronal Tissues

PubMed Central

Bourn, Rebecka L.; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A.; Bidichandani, Sanjay I.

2012-01-01

Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)n sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)n sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)n sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)n sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)n sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway. PMID:23071719
Pms2 suppresses large expansions of the (GAA·TTC)n sequence in neuronal tissues.

PubMed

Bourn, Rebecka L; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A; Bidichandani, Sanjay I

2012-01-01

Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)(n) sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)(n) sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)(n) sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)(n) sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)(n) sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway.
Using complementary DNA from MyoD-transduced fibroblasts to sequence large muscle genes.

PubMed

Waddell, Leigh B; Monnier, Nicole; Cooper, Sandra T; North, Kathryn N; Clarke, Nigel F

2011-08-01

Large muscle genes are often sequenced using complementary DNA (cDNA) made from muscle messenger RNA (mRNA) to reduce the cost and workload associated with sequencing from genomic DNA. Two potential barriers are the availability of a frozen muscle biopsy, and difficulties in detecting nonsense mutations due to nonsense-mediated mRNA decay (NMD). We present patient examples showing that use of MyoD-transduced fibroblasts as a source of muscle-specific mRNA overcomes these potential difficulties in sequencing large muscle-related genes. Copyright © 2011 Wiley Periodicals, Inc.
A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules

PubMed Central

Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried

2015-01-01

A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis. PMID:26355961
Computational Functional Analysis of Lipid Metabolic Enzymes.

PubMed

Bagnato, Carolina; Have, Arjen Ten; Prados, María B; Beligni, María V

2017-01-01

The computational analysis of enzymes that participate in lipid metabolism has both common and unique challenges when compared to the whole protein universe. Some of the hurdles that interfere with the functional annotation of lipid metabolic enzymes that are common to other pathways include the definition of proper starting datasets, the construction of reliable multiple sequence alignments, the definition of appropriate evolutionary models, and the reconstruction of phylogenetic trees with high statistical support, particularly for large datasets. Most enzymes that take part in lipid metabolism belong to complex superfamilies with many members that are not involved in lipid metabolism. In addition, some enzymes that do not have sequence similarity catalyze similar or even identical reactions. Some of the challenges that, albeit not unique, are more specific to lipid metabolism refer to the high compartmentalization of the routes, the catalysis in hydrophobic environments and, related to this, the function near or in biological membranes.In this work, we provide guidelines intended to assist in the proper functional annotation of lipid metabolic enzymes, based on previous experiences related to the phospholipase D superfamily and the annotation of the triglyceride synthesis pathway in algae. We describe a pipeline that starts with the definition of an initial set of sequences to be used in similarity-based searches and ends in the reconstruction of phylogenies. We also mention the main issues that have to be taken into consideration when using tools to analyze subcellular localization, hydrophobicity patterns, or presence of transmembrane domains in lipid metabolic enzymes.
The Interaction Properties of the Human Rab GTPase Family – A Comparative Analysis Reveals Determinants of Molecular Binding Selectivity

PubMed Central

Stein, Matthias; Pilli, Manohar; Bernauer, Sabine; Habermann, Bianca H.; Zerial, Marino; Wade, Rebecca C.

2012-01-01

Background Rab GTPases constitute the largest subfamily of the Ras protein superfamily. Rab proteins regulate organelle biogenesis and transport, and display distinct binding preferences for effector and activator proteins, many of which have not been elucidated yet. The underlying molecular recognition motifs, binding partner preferences and selectivities are not well understood. Methodology/Principal Findings Comparative analysis of the amino acid sequences and the three-dimensional electrostatic and hydrophobic molecular interaction fields of 62 human Rab proteins revealed a wide range of binding properties with large differences between some Rab proteins. This analysis assists the functional annotation of Rab proteins 12, 14, 26, 37 and 41 and provided an explanation for the shared function of Rab3 and 27. Rab7a and 7b have very different electrostatic potentials, indicating that they may bind to different effector proteins and thus, exert different functions. The subfamily V Rab GTPases which are associated with endosome differ subtly in the interaction properties of their switch regions, and this may explain exchange factor specificity and exchange kinetics. Conclusions/Significance We have analysed conservation of sequence and of molecular interaction fields to cluster and annotate the human Rab proteins. The analysis of three dimensional molecular interaction fields provides detailed insight that is not available from a sequence-based approach alone. Based on our results, we predict novel functions for some Rab proteins and provide insights into their divergent functions and the determinants of their binding partner selectivity. PMID:22523562
A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules.

PubMed

Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried

2015-01-01

A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis.
CLAST: CUDA implemented large-scale alignment search tool.

PubMed

Yano, Masahiro; Mori, Hiroshi; Akiyama, Yutaka; Yamada, Takuji; Kurokawa, Ken

2014-12-11

Metagenomics is a powerful methodology to study microbial communities, but it is highly dependent on nucleotide sequence similarity searching against sequence databases. Metagenomic analyses with next-generation sequencing technologies produce enormous numbers of reads from microbial communities, and many reads are derived from microbes whose genomes have not yet been sequenced, limiting the usefulness of existing sequence similarity search tools. Therefore, there is a clear need for a sequence similarity search tool that can rapidly detect weak similarity in large datasets. We developed a tool, which we named CLAST (CUDA implemented large-scale alignment search tool), that enables analyses of millions of reads and thousands of reference genome sequences, and runs on NVIDIA Fermi architecture graphics processing units. CLAST has four main advantages over existing alignment tools. First, CLAST was capable of identifying sequence similarities ~80.8 times faster than BLAST and 9.6 times faster than BLAT. Second, CLAST executes global alignment as the default (local alignment is also an option), enabling CLAST to assign reads to taxonomic and functional groups based on evolutionarily distant nucleotide sequences with high accuracy. Third, CLAST does not need a preprocessed sequence database like Burrows-Wheeler Transform-based tools, and this enables CLAST to incorporate large, frequently updated sequence databases. Fourth, CLAST requires <2 GB of main memory, making it possible to run CLAST on a standard desktop computer or server node. CLAST achieved very high speed (similar to the Burrows-Wheeler Transform-based Bowtie 2 for long reads) and sensitivity (equal to BLAST, BLAT, and FR-HIT) without the need for extensive database preprocessing or a specialized computing platform. Our results demonstrate that CLAST has the potential to be one of the most powerful and realistic approaches to analyze the massive amount of sequence data from next-generation sequencing technologies.
Automated detection of records in biological sequence databases that are inconsistent with the literature.

PubMed

Bouadjenek, Mohamed Reda; Verspoor, Karin; Zobel, Justin

2017-07-01

We investigate and analyse the data quality of nucleotide sequence databases with the objective of automatic detection of data anomalies and suspicious records. Specifically, we demonstrate that the published literature associated with each data record can be used to automatically evaluate its quality, by cross-checking the consistency of the key content of the database record with the referenced publications. Focusing on GenBank, we describe a set of quality indicators based on the relevance paradigm of information retrieval (IR). Then, we use these quality indicators to train an anomaly detection algorithm to classify records as "confident" or "suspicious". Our experiments on the PubMed Central collection show assessing the coherence between the literature and database records, through our algorithms, is an effective mechanism for assisting curators to perform data cleansing. Although fewer than 0.25% of the records in our data set are known to be faulty, we would expect that there are many more in GenBank that have not yet been identified. By automated comparison with literature they can be identified with a precision of up to 10% and a recall of up to 30%, while strongly outperforming several baselines. While these results leave substantial room for improvement, they reflect both the very imbalanced nature of the data, and the limited explicitly labelled data that is available. Overall, the obtained results show promise for the development of a new kind of approach to detecting low-quality and suspicious sequence records based on literature analysis and consistency. From a practical point of view, this will greatly help curators in identifying inconsistent records in large-scale sequence databases by highlighting records that are likely to be inconsistent with the literature. Copyright © 2017 Elsevier Inc. All rights reserved.
URPD: a specific product primer design tool

PubMed Central

2012-01-01

Background Polymerase chain reaction (PCR) plays an important role in molecular biology. Primer design fundamentally determines its results. Here, we present a currently available software that is not located in analyzing large sequence but used for a rather straight-forward way of visualizing the primer design process for infrequent users. Findings URPD (yoUR Primer Design), a web-based specific product primer design tool, combines the NCBI Reference Sequences (RefSeq), UCSC In-Silico PCR, memetic algorithm (MA) and genetic algorithm (GA) primer design methods to obtain specific primer sets. A friendly user interface is accomplished by built-in parameter settings. The incorporated smooth pipeline operations effectively guide both occasional and advanced users. URPD contains an automated process, which produces feasible primer pairs that satisfy the specific needs of the experimental design with practical PCR amplifications. Visual virtual gel electrophoresis and in silico PCR provide a simulated PCR environment. The comparison of Practical gel electrophoresis comparison to virtual gel electrophoresis facilitates and verifies the PCR experiment. Wet-laboratory validation proved that the system provides feasible primers. Conclusions URPD is a user-friendly tool that provides specific primer design results. The pipeline design path makes it easy to operate for beginners. URPD also provides a high throughput primer design function. Moreover, the advanced parameter settings assist sophisticated researchers in performing experiential PCR. Several novel functions, such as a nucleotide accession number template sequence input, local and global specificity estimation, primer pair redesign, user-interactive sequence scale selection, and virtual and practical PCR gel electrophoresis discrepancies have been developed and integrated into URPD. The URPD program is implemented in JAVA and freely available at http://bio.kuas.edu.tw/urpd/. PMID:22713312
GENETICS IN ENDOCRINOLOGY: Genetic counseling for congenital hypogonadotropic hypogonadism and Kallmann syndrome: new challenges in the era of oligogenism and next-generation sequencing.

PubMed

Maione, Luigi; Dwyer, Andrew A; Francou, Bruno; Guiochon-Mantel, Anne; Binart, Nadine; Bouligand, Jérôme; Young, Jacques

2018-03-01

Congenital hypogonadotropic hypogonadism (CHH) and Kallmann syndrome (KS) are rare, related diseases that prevent normal pubertal development and cause infertility in affected men and women. However, the infertility carries a good prognosis as increasing numbers of patients with CHH/KS are now able to have children through medically assisted procreation. These are genetic diseases that can be transmitted to patients' offspring. Importantly, patients and their families should be informed of this risk and given genetic counseling. CHH and KS are phenotypically and genetically heterogeneous diseases in which the risk of transmission largely depends on the gene(s) responsible(s). Inheritance may be classically Mendelian yet more complex; oligogenic modes of transmission have also been described. The prevalence of oligogenicity has risen dramatically since the advent of massively parallel next-generation sequencing (NGS) in which tens, hundreds or thousands of genes are sequenced at the same time. NGS is medically and economically more efficient and more rapid than traditional Sanger sequencing and is increasingly being used in medical practice. Thus, it seems plausible that oligogenic forms of CHH/KS will be increasingly identified making genetic counseling even more complex. In this context, the main challenge will be to differentiate true oligogenism from situations when several rare variants that do not have a clear phenotypic effect are identified by chance. This review aims to summarize the genetics of CHH/KS and to discuss the challenges of oligogenic transmission and also its role in incomplete penetrance and variable expressivity in a perspective of genetic counseling. © 2018 European Society of Endocrinology.
Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database

PubMed Central

Scaglione, Davide; Acquadro, Alberto; Portis, Ezio; Taylor, Christopher A; Lanteri, Sergio; Knapp, Steven J

2009-01-01

Background The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genomic analysis, and we have exploited recently acquired EST (expressed sequence tag) sequence data (Composite Genome Project - CGP) to develop an extensive set of microsatellite markers. Results A unigene assembly was created from over 36,000 globe artichoke EST sequences, containing 6,621 contigs and 12,434 singletons. Over 12,000 of these unigenes were functionally assigned on the basis of homology with Arabidopsis thaliana reference proteins. A total of 4,219 perfect repeats, located within 3,308 unigenes was identified and the gene ontology (GO) analysis highlighted some GO term's enrichments among different classes of microsatellites with respect to their position. Sufficient flanking sequence was available to enable the design of primers to amplify 2,311 of these microsatellites, and a set of 300 was tested against a DNA panel derived from 28 C. cardunculus genotypes. Consistent amplification and polymorphism was obtained from 236 of these assays. Their polymorphic information content (PIC) ranged from 0.04 to 0.90 (mean 0.66). Between 176 and 198 of the assays were informative in at least one of the three available mapping populations. Conclusion EST-based microsatellites have provided a large set of de novo genetic markers, which show significant amounts of polymorphism both between and within the three taxa of C. cardunculus. They are thus well suited as assays for phylogenetic analysis, the construction of genetic maps, marker-assisted breeding, transcript mapping and other genomic applications in the species. PMID:19785740
URPD: a specific product primer design tool.

PubMed

Chuang, Li-Yeh; Cheng, Yu-Huei; Yang, Cheng-Hong

2012-06-19

Polymerase chain reaction (PCR) plays an important role in molecular biology. Primer design fundamentally determines its results. Here, we present a currently available software that is not located in analyzing large sequence but used for a rather straight-forward way of visualizing the primer design process for infrequent users. URPD (yoUR Primer Design), a web-based specific product primer design tool, combines the NCBI Reference Sequences (RefSeq), UCSC In-Silico PCR, memetic algorithm (MA) and genetic algorithm (GA) primer design methods to obtain specific primer sets. A friendly user interface is accomplished by built-in parameter settings. The incorporated smooth pipeline operations effectively guide both occasional and advanced users. URPD contains an automated process, which produces feasible primer pairs that satisfy the specific needs of the experimental design with practical PCR amplifications. Visual virtual gel electrophoresis and in silico PCR provide a simulated PCR environment. The comparison of Practical gel electrophoresis comparison to virtual gel electrophoresis facilitates and verifies the PCR experiment. Wet-laboratory validation proved that the system provides feasible primers. URPD is a user-friendly tool that provides specific primer design results. The pipeline design path makes it easy to operate for beginners. URPD also provides a high throughput primer design function. Moreover, the advanced parameter settings assist sophisticated researchers in performing experiential PCR. Several novel functions, such as a nucleotide accession number template sequence input, local and global specificity estimation, primer pair redesign, user-interactive sequence scale selection, and virtual and practical PCR gel electrophoresis discrepancies have been developed and integrated into URPD. The URPD program is implemented in JAVA and freely available at http://bio.kuas.edu.tw/urpd/.
The Complete Plastid Genome Sequence of Madagascar Periwinkle Catharanthus roseus (L.) G. Don: Plastid Genome Evolution, Molecular Marker Identification, and Phylogenetic Implications in Asterids

PubMed Central

Ku, Chuan; Chung, Wan-Chia; Chen, Ling-Ling; Kuo, Chih-Horng

2013-01-01

The Madagascar periwinkle ( Catharanthus roseus in the family Apocynaceae) is an important medicinal plant and is the source of several widely marketed chemotherapeutic drugs. It is also commonly grown for its ornamental values and, due to ease of infection and distinctiveness of symptoms, is often used as the host for studies on phytoplasmas, an important group of uncultivated plant pathogens. To gain insights into the characteristics of apocynaceous plastid genomes (plastomes), we used a reference-assisted approach to assemble the complete plastome of C . roseus , which could be applied to other C . roseus -related studies. The C . roseus plastome is the second completely sequenced plastome in the asterid order Gentianales. We performed comparative analyses with two other representative sequences in the same order, including the complete plastome of Coffea arabica (from the basal Gentianales family Rubiaceae) and the nearly complete plastome of Asclepias syriaca (Apocynaceae). The results demonstrated considerable variations in gene content and plastome organization within Apocynaceae, including the presence/absence of three essential genes (i.e., accD, clpP, and ycf1) and large size changes in non-coding regions (e.g., rps2-rpoC2 and IRb-ndhF). To find plastome markers of potential utility for Catharanthus breeding and phylogenetic analyses, we identified 41 C . roseus -specific simple sequence repeats. Furthermore, five intergenic regions with high divergence between C . roseus and three other euasterids I taxa were identified as candidate markers. To resolve the euasterids I interordinal relationships, 82 plastome genes were used for phylogenetic inference. With the addition of representatives from Apocynaceae and sampling of most other asterid orders, a sister relationship between Gentianales and Solanales is supported. PMID:23825699
The 2010 Decennial Census: Background and Issues

DTIC Science & Technology

2009-04-27

the hearing impaired, is to be available, as are Braille and large-print questionnaire guides. This report will be updated as legislative or other...intends to provide telephone assistance, including assistance for the hearing impaired, as well as Braille and large-print questionnaire guides.59
Biomineralization of Schlumbergerella floresiana, a significant carbonate-producing benthic foraminifer.

PubMed

Sabbatini, A; Bédouet, L; Marie, A; Bartolini, A; Landemarre, L; Weber, M X; Gusti Ngurah Kade Mahardika, I; Berland, S; Zito, F; Vénec-Peyré, M-T

2014-07-01

Most foraminifera that produce a shell are efficient biomineralizers. We analyzed the calcitic shell of the large tropical benthic foraminifer Schlumbergerella floresiana. We found a suite of macromolecules containing many charged and polar amino acids and glycine that are also abundant in biomineralization proteins of other phyla. As neither genomic nor transcriptomic data are available for foraminiferal biomineralization yet, de novo-generated sequences, obtained from organic matrices submitted to ms blast database search, led to the characterization of 156 peptides. Very few homologous proteins were matched in the proteomic database, implying that the peptides are derived from unknown proteins present in the foraminiferal organic matrices. The amino acid distribution of these peptides was queried against the uniprot database and the mollusk uniprot database for comparison. The mollusks compose a well-studied phylum that yield a large variety of biomineralization proteins. These results showed that proteins extracted from S. floresiana shells contained sequences enriched with glycine, alanine, and proline, making a set of residues that provided a signature unique to foraminifera. Three of the de novo peptides exhibited sequence similarities to peptides found in proteins such as pre-collagen-P and a group of P-type ATPases including a calcium-transporting ATPase. Surprisingly, the peptide that was most similar to the collagen-like protein was a glycine-rich peptide reported from the test and spine proteome of sea urchin. The molecules, identified by matrix-assisted laser desorption ionization-time of flight mass spectrometry analyses, included acid-soluble N-glycoproteins with its sugar moieties represented by high-mannose-type glycans and carbohydrates. Describing the nature of the proteins, and associated molecules in the skeletal structure of living foraminifera, can elucidate the biomineralization mechanisms of these major carbonate producers in marine ecosystems. As fossil foraminifera provide important paleoenvironmental and paleoclimatic information, a better understanding of biomineralization in these organisms will have far-reaching impacts. © 2014 John Wiley & Sons Ltd.
Detection of Low-Copy-Number Genomic DNA Sequences in Individual Bacterial Cells by Using Peptide Nucleic Acid-Assisted Rolling-Circle Amplification and Fluorescence In Situ Hybridization▿ †

PubMed Central

Smolina, Irina; Lee, Charles; Frank-Kamenetskii, Maxim

2007-01-01

An approach is proposed for in situ detection of short signature DNA sequences present in single copies per bacterial genome. The site is locally opened by peptide nucleic acids, and a circular oligonucleotide is assembled. The amplicon generated by rolling circle amplification is detected by hybridization with fluorescently labeled decorator probes. PMID:17293504

Assistance dogs provide a useful behavioral model to enrich communicative skills of assistance robots.

PubMed

Gácsi, Márta; Szakadát, Sára; Miklósi, Adám

2013-01-01

These studies are part of a project aiming to reveal relevant aspects of human-dog interactions, which could serve as a model to design successful human-robot interactions. Presently there are no successfully commercialized assistance robots, however, assistance dogs work efficiently as partners for persons with disabilities. In Study 1, we analyzed the cooperation of 32 assistance dog-owner dyads performing a carrying task. We revealed typical behavior sequences and also differences depending on the dyads' experiences and on whether the owner was a wheelchair user. In Study 2, we investigated dogs' responses to unforeseen difficulties during a retrieving task in two contexts. Dogs displayed specific communicative and displacement behaviors, and a strong commitment to execute the insoluble task. Questionnaire data from Study 3 confirmed that these behaviors could successfully attenuate owners' disappointment. Although owners anticipated the technical competence of future assistance robots to be moderate/high, they could not imagine robots as emotional companions, which negatively affected their acceptance ratings of future robotic assistants. We propose that assistance dogs' cooperative behaviors and problem solving strategies should inspire the development of the relevant functions and social behaviors of assistance robots with limited manual and verbal skills.
DataHub knowledge based assistance for science visualization and analysis using large distributed databases

NASA Technical Reports Server (NTRS)

Handley, Thomas H., Jr.; Collins, Donald J.; Doyle, Richard J.; Jacobson, Allan S.

1991-01-01

Viewgraphs on DataHub knowledge based assistance for science visualization and analysis using large distributed databases. Topics covered include: DataHub functional architecture; data representation; logical access methods; preliminary software architecture; LinkWinds; data knowledge issues; expert systems; and data management.
Fetal head circumference, operative delivery, and fetal outcomes: a multi-ethnic population-based cohort study

PubMed Central

2013-01-01

Background Operative delivery procedures, such as primary cesarean section, vacuum-assisted, and forceps-assisted vaginal delivery increase maternal and fetal morbidity, and the cost of care. We evaluated whether large fetal head circumference (FHC) independently increases risk of such interventions, as well as fetal distress or low Apgar score, in anatomically normal infants. Methods We conducted a population-based retrospective cohort study using Washington State birth certificate data. We included singleton, term infants born to nulliparous mothers from 2003–2009. We compared mode of delivery and fetal outcomes in 10,750 large-FHC (37-41 cm) infants relative to 10,750 average-FHC (34 cm) infants, frequency matched by birth-year. Results Large-FHC infants were nearly twice as likely to be delivered by primary cesarean section as average-FHC infants (unadjusted relative risk [RR] 1.84, 95% confidence interval [CI]: 1.77, 1.92). The RR for primary cesarean section associated with large-FHC was largest for mothers aged 19 years or less (RR 2.28; 95% CI: 1.99, 2.61), and smallest for mothers aged 35 years or greater (RR 1.51; 95% CI: 1.37, 1.66) [test of homogeneity, p < 0.001]. Large-FHC infants were at increased risk of vacuum-assisted vaginal delivery (RR 1.55; 95% CI: 1.43, 1.69), and forceps-assisted vaginal delivery (RR 1.61; 95% CI: 1.32, 1.97). There was no difference in risk of fetal distress (RR 0.97; 95% CI: 0.89, 1.07) for large-FHC versus average-FHC infants. Risk estimates were unaffected by adjustment for potential confounders. Conclusions Nulliparous mothers of large-FHC infants are at increased risk of primary cesarean section, vacuum-assisted and forceps-assisted vaginal delivery relative to mothers of average-FHC infants. Maternal age modifies the association between FHC and primary cesarean section. PMID:23651454
Grandparent Education for Assisted Living Facilities

ERIC Educational Resources Information Center

Strom, Robert D.; Strom, Paris S.

2017-01-01

The assisted living population is forecast to increase at a rapid rate. Quality of life for residents should be improved by giving greater attention to their cognitive, emotional, and social needs. A university lifespan development team provided a grandparent education course at a large assisted living facility with the assistance of 20 resident…
A Computer-Assisted Laboratory Sequence for Petroleum Geology.

ERIC Educational Resources Information Center

Lumsden, David N.

1979-01-01

Describes a competitive oil-play game for petroleum geology students. It is accompanied by a computer program written in interactive Fortran. The program, however, is not essential, but useful for adding more interest. (SA)
Microcomputer-Assisted Mathematics: From Simple Interest to e.

ERIC Educational Resources Information Center

Kimberling, Clark

1985-01-01

The progression from simple interest to compound interest leads naturally and quickly to the number e, involving mathematical discovery learning through writing programs. Several programs are given, with suggestions for a teaching sequence. (MNS)
2017 Valparaíso earthquake sequence and the megathrust patchwork of central Chile

NASA Astrophysics Data System (ADS)

Nealy, Jennifer L.; Herman, Matthew W.; Moore, Ginevra L.; Hayes, Gavin P.; Benz, Harley M.; Bergman, Eric A.; Barrientos, Sergio E.

2017-09-01

In April 2017, a sequence of earthquakes offshore Valparaíso, Chile, raised concerns of a potential megathrust earthquake in the near future. The largest event in the 2017 sequence was a M6.9 on 24 April, seemingly colocated with the last great-sized earthquake in the region—a M8.0 in March 1985. The history of large earthquakes in this region shows significant variation in rupture size and extent, typically highlighted by a juxtaposition of large ruptures interspersed with smaller magnitude sequences. We show that the 2017 sequence ruptured an area between the two main slip patches during the 1985 earthquake, rerupturing a patch that had previously slipped during the October 1973 M6.5 earthquake sequence. A significant gap in historic ruptures exists directly to the south of the 2017 sequence, with large enough moment deficit to host a great-sized earthquake in the near future, if it is locked.
2017 Valparaíso earthquake sequence and the megathrust patchwork of central Chile

USGS Publications Warehouse

Nealy, Jennifer; Herman, Matthew W.; Moore, Ginevra; Hayes, Gavin; Benz, Harley M.; Bergman, Eric A.; Barrientos, Sergio E

2017-01-01

In April 2017, a sequence of earthquakes offshore Valparaíso, Chile, raised concerns of a potential megathrust earthquake in the near future. The largest event in the 2017 sequence was a M6.9 on 24 April, seemingly colocated with the last great-sized earthquake in the region—a M8.0 in March 1985. The history of large earthquakes in this region shows significant variation in rupture size and extent, typically highlighted by a juxtaposition of large ruptures interspersed with smaller magnitude sequences. We show that the 2017 sequence ruptured an area between the two main slip patches during the 1985 earthquake, rerupturing a patch that had previously slipped during the October 1973 M6.5 earthquake sequence. A significant gap in historic ruptures exists directly to the south of the 2017 sequence, with large enough moment deficit to host a great-sized earthquake in the near future, if it is locked.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

NASA Astrophysics Data System (ADS)

McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides.

PubMed

McMillen, Chelsea L; Wright, Patience M; Cassady, Carolyn J

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Subgenome-specific assembly of vitamin E biosynthesis genes and expression patterns during seed development provide insight into the evolution of oat genome.

PubMed

Gutierrez-Gonzalez, Juan J; Garvin, David F

2016-11-01

Vitamin E is essential for humans and thus must be a component of a healthy diet. Among the cereal grains, hexaploid oats (Avena sativa L.) have high vitamin E content. To date, no gene sequences in the vitamin E biosynthesis pathway have been reported for oats. Using deep sequencing and orthology-guided assembly, coding sequences of genes for each step in vitamin E synthesis in oats were reconstructed, including resolution of the sequences of homeologs. Three homeologs, presumably representing each of the three oat subgenomes, were identified for the main steps of the pathway. Partial sequences, likely representing pseudogenes, were recovered in some instances as well. Pairwise comparisons among homeologs revealed that two of the three putative subgenome-specific homeologs are almost identical for each gene. Synonymous substitution rates indicate the time of divergence of the two more similar subgenomes from the distinct one at 7.9-8.7 MYA, and a divergence between the similar subgenomes from a common ancestor 1.1 MYA. A new proposed evolutionary model for hexaploid oat formation is discussed. Homeolog-specific gene expression was quantified during oat seed development and compared with vitamin E accumulation. Homeolog expression largely appears to be similar for most of genes; however, for some genes, homoeolog-specific transcriptional bias was observed. The expression of HPPD, as well as certain homoeologs of VTE2 and VTE4, is highly correlated with seed vitamin E accumulation. Our findings expand our understanding of oat genome evolution and will assist efforts to modify vitamin E content and composition in oats. Published 2016. This article is a U.S. Government work and is in the public domain in the USA. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Construction of a high-density genetic map by specific locus amplified fragment sequencing (SLAF-seq) and its application to Quantitative Trait Loci (QTL) analysis for boll weight in upland cotton (Gossypium hirsutum.).

PubMed

Zhang, Zhen; Shang, Haihong; Shi, Yuzhen; Huang, Long; Li, Junwen; Ge, Qun; Gong, Juwu; Liu, Aiying; Chen, Tingting; Wang, Dan; Wang, Yanling; Palanga, Koffi Kibalou; Muhammad, Jamshed; Li, Weijie; Lu, Quanwei; Deng, Xiaoying; Tan, Yunna; Song, Weiwu; Cai, Juan; Li, Pengtao; Rashid, Harun or; Gong, Wankui; Yuan, Youlu

2016-04-11

Upland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping. In this research, a recombinant inbred lines population developed from two upland cotton cultivars 0-153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15-16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis. This research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.
Kinked silicon nanowires-enabled interweaving electrode configuration for lithium-ion batteries.

PubMed

Sandu, Georgiana; Coulombier, Michael; Kumar, Vishank; Kassa, Hailu G; Avram, Ionel; Ye, Ran; Stopin, Antoine; Bonifazi, Davide; Gohy, Jean-François; Leclère, Philippe; Gonze, Xavier; Pardoen, Thomas; Vlad, Alexandru; Melinte, Sorin

2018-06-28

A tri-dimensional interweaving kinked silicon nanowires (k-SiNWs) assembly, with a Ni current collector co-integrated, is evaluated as electrode configuration for lithium ion batteries. The large-scale fabrication of k-SiNWs is based on a procedure for continuous metal assisted chemical etching of Si, supported by a chemical peeling step that enables the reuse of the Si substrate. The kinks are triggered by a simple, repetitive etch-quench sequence in a HF and H 2 O 2 -based etchant. We find that the inter-locking frameworks of k-SiNWs and multi-walled carbon nanotubes exhibit beneficial mechanical properties with a foam-like behavior amplified by the kinks and a suitable porosity for a minimal electrode deformation upon Li insertion. In addition, ionic liquid electrolyte systems associated with the integrated Ni current collector repress the detrimental effects related to the Si-Li alloying reaction, enabling high cycling stability with 80% capacity retention (1695 mAh/g Si ) after 100 cycles. Areal capacities of 2.42 mAh/cm 2 (1276 mAh/g electrode ) can be achieved at the maximum evaluated thickness (corresponding to 1.3 mg Si /cm 2 ). This work emphasizes the versatility of the metal assisted chemical etching for the synthesis of advanced Si nanostructures for high performance lithium ion battery electrodes.
Identification of Medically Relevant Species of Arthroconidial Yeasts by Use of Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry

PubMed Central

Kolecka, Anna; Khayhan, Kantarawee; Groenewald, Marizeth; Theelen, Bart; Arabatzis, Michael; Velegraki, Aristea; Kostrzewa, Markus; Mares, Mihai; Taj-Aldeen, Saad J.

2013-01-01

Matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) was used for an extensive identification study of arthroconidial yeasts, using 85 reference strains from the CBS-KNAW yeast collection and 134 clinical isolates collected from medical centers in Qatar, Greece, and Romania. The test set included 72 strains of ascomycetous yeasts (Galactomyces, Geotrichum, Saprochaete, and Magnusiomyces spp.) and 147 strains of basidiomycetous yeasts (Trichosporon and Guehomyces spp.). With minimal preparation time, MALDI-TOF MS proved to be an excellent diagnostic tool that provided reliable identification of most (98%) of the tested strains to the species level, with good discriminatory power. The majority of strains were correctly identified at the species level with good scores (>2.0) and seven of the tested strains with log score values between 1.7 and 2.0. The MALDI-TOF MS results obtained were consistent with validated internal transcribed spacer (ITS) and/or large subunit (LSU) ribosomal DNA sequencing results. Expanding the mass spectrum database by increasing the number of reference strains for closely related species, including those of nonclinical origin, should enhance the usefulness of MALDI-TOF MS-based diagnostic analysis of these arthroconidial fungi in medical and other laboratories. PMID:23678074
Long-term efficacy of microbiology-driven periodontal laser-assisted therapy.

PubMed

Martelli, F S; Fanti, E; Rosati, C; Martelli, M; Bacci, G; Martelli, M L; Medico, E

2016-03-01

Periodontitis represents a highly prevalent health problem, causing severe functional impairment, reduced quality of life and increased risk of systemic disorders, including respiratory, cardiovascular and osteoarticular diseases, diabetes and fertility problems. It is a typical example of a multifactorial disease, where a polymicrobial infection inducing chronic inflammation of periodontal tissues is favoured by environmental factors, life style and genetic background. Since periodontal pathogens can colonise poorly vascularised niches, antiseptics and antibiotics are typically associated with local treatments to manage the defects, with unstable outcomes especially in early-onset cases. Here, the results of a retrospective study are reported, evaluating the efficacy of a protocol (Periodontal Biological Laser-Assisted Therapy, Perioblast™) by which microbial profiling of periodontal pockets is used to determine the extent and duration of local neodymium-doped yttrium aluminium garnet (Nd:YAG) laser irradiation plus conventional treatment. The protocol was applied multicentrically on 2683 patients, and found to produce a significant and enduring improvement of all clinical and bacteriological parameters, even in aggressive cases. Microbiome sequencing of selected pockets revealed major population shifts after treatment, as well as strains potentially associated with periodontitis in the absence of known pathogens. This study, conducted for the first time on such a large series, clearly demonstrates long-term efficacy of microbiology-driven non-invasive treatment of periodontal disease.
Teaching Assistant Unionization: Origins and Implications.

ERIC Educational Resources Information Center

Rogow, Robert; Birch, Daniel R.

1984-01-01

Teaching assistants are unionized at most large Ontario and British Columbia universities. The institutions tend to be urban, have large graduate enrollments, and have faced greater budgetary concerns at the time of unionization. Canada's public policy and labor relations decisions have favored unionization more than the United States's have. (MSE)
Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

PubMed

Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

2009-07-14

In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.
Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in the evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Identification of a high-efficiency baculovirus DNA replication origin that functions in insect and mammalian cells.

PubMed

Wu, Yueh-Lung; Wu, Carol-P; Huang, Yu-Hui; Huang, Sheng-Ping; Lo, Huei-Ru; Chang, Hao-Shuo; Lin, Pi-Hsiu; Wu, Ming-Cheng; Chang, Chia-Jung; Chao, Yu-Chan

2014-11-01

The p143 gene from Autographa californica multinucleocapsid nucleopolyhedrovirus (AcMNPV) has been found to increase the expression of luciferase, which is driven by the polyhedrin gene promoter, in a plasmid with virus coinfection. Further study indicated that this is due to the presence of a replication origin (ori) in the coding region of this gene. Transient DNA replication assays showed that a specific fragment of the p143 coding sequence, p143-3, underwent virus-dependent DNA replication in Spodoptera frugiperda IPLB-Sf-21 (Sf-21) cells. Deletion analysis of the p143-3 fragment showed that subfragment p143-3.2a contained the essential sequence of this putative ori. Sequence analysis of this region revealed a unique distribution of imperfect palindromes with high AT contents. No sequence homology or similarity between p143-3.2a and any other known ori was detected, suggesting that it is a novel baculovirus ori. Further study showed that the p143-3.2a ori can replicate more efficiently in infected Sf-21 cells than baculovirus homologous regions (hrs), the major baculovirus ori, or non-hr oris during virus replication. Previously, hr on its own was unable to replicate in mammalian cells, and for mammalian viral oris, viral proteins are generally required for their proper replication in host cells. However, the p143-3.2a ori was, surprisingly, found to function as an efficient ori in mammalian cells without the need for any viral proteins. We conclude that p143 contains a unique sequence that can function as an ori to enhance gene expression in not only insect cells but also mammalian cells. Baculovirus DNA replication relies on both hr and non-hr oris; however, so far very little is known about the latter oris. Here we have identified a new non-hr ori, the p143 ori, which resides in the coding region of p143. By developing a novel DNA replication-enhanced reporter system, we have identified and located the core region required for the p143 ori. This ori contains a large number of imperfect inverted repeats and is the most active ori in the viral genome during virus infection in insect cells. We also found that it is a unique ori that can replicate in mammalian cells without the assistance of baculovirus gene products. The identification of this ori should contribute to a better understanding of baculovirus DNA replication. Also, this ori is very useful in assisting with gene expression in mammalian cells. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Analysing the performance of personal computers based on Intel microprocessors for sequence aligning bioinformatics applications.

PubMed

Nair, Pradeep S; John, Eugene B

2007-01-01

Aligning specific sequences against a very large number of other sequences is a central aspect of bioinformatics. With the widespread availability of personal computers in biology laboratories, sequence alignment is now often performed locally. This makes it necessary to analyse the performance of personal computers for sequence aligning bioinformatics benchmarks. In this paper, we analyse the performance of a personal computer for the popular BLAST and FASTA sequence alignment suites. Results indicate that these benchmarks have a large number of recurring operations and use memory operations extensively. It seems that the performance can be improved with a bigger L1-cache.

Preparation of Gd(OH){sub 3} large single crystals by solid KOH assisted hydrothermal method and their luminescent and magnetic properties

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Hai; Zhang, Youjin, E-mail: zyj@ustc.edu.cn; Zhou, Maozhong

Highlights: • Gd(OH){sub 3} large single crystals were prepared by solid KOH assisted hydrothermal method. • The possible growth mechanism of Gd(OH){sub 3} large single crystals was proposed. • The Gd(OH){sub 3} samples emitted a strong narrow-band ultraviolet B (NB-UVB) light. • The Gd(OH){sub 3} samples showed good paramagnetic properties. - Abstract: Large single crystals of gadolinium hydroxide [Gd(OH){sub 3}] in the length of several millimeters were successfully prepared by using solid KOH assisted hydrothermal method. Gd(OH){sub 3} samples were characterized by X-ray diffraction (XRD), 4-circle single-crystal diffraction, Fourier transform infrared (FTIR) spectroscopy and X-ray photoelectron spectroscopy (XPS). FESEM imagemore » shows hexagonal prism morphology for the Gd(OH){sub 3} large crystals. The possible growth mechanism of Gd(OH){sub 3} large single crystals was proposed. The photoluminescence and magnetic properties of Gd(OH){sub 3} species were investigated.« less
Endoscope-Assisted Microsurgical Removal of an Intraventricular Ependymal Cyst That Manifested with Tremor.

PubMed

Kutlay, Murat; Yavan, Ibrahim; Kural, Cahit; Ozer, Ilker; Daneyemez, Mehmet K; Izci, Yusuf

2016-06-01

Intraventricular ependymal cysts (ECs) are rare, histologically benign neuroepithelial cysts. Most of these cysts are clinically silent and discovered incidentally. Rarely, they become symptomatic, leading to obstruction of the cerebrospinal fluid circulation. ECs located inside the ventricles may manifest with signs of increased intracranial pressure. A 32-year-old woman presented with a 6-year history of tremor affecting her left hand. In the last month, she had been experiencing headache as well, and the tremor of the left hand was affecting her quality of life. The patient demonstrated a fine resting and intention tremor of the left hand and a voice tremor. Magnetic resonance imaging revealed a large cystic, nonenhancing lesion within the right lateral ventricle. The fluid within the cyst was isointense to cerebrospinal fluid on all sequences. Because of the rapid progression of her symptoms and no response to medication, surgical decompression of the cyst was considered. The cyst was removed by an endoscope-assisted microsurgical technique. Her postoperative course was uneventful. A marked reduction in her tremor was noted in the immediate postoperative period. Histopathologic diagnosis was of an EC. During the follow-up period, the patient's tremor, although still present, had improved dramatically. At 6 months postoperatively, she could hold a drinking glass without spilling. This is a unique case of an intraventricular EC that manifested with tremor, which improved by endoscope-assisted microsurgical removal of the cyst. This case also supports the important role of endoscopic surgery in the treatment of intraventricular cystic lesions. Copyright © 2016 Elsevier Inc. All rights reserved.
Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

PubMed

Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

2016-01-01

Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare-variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct index). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods are not feasible for most research laboratories.The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels up to 4 base pairs (bp) with high sensitivity and specificity (up to 1 haploid allele in a pool as large as 500 individuals). Step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses are described in this chapter.
K-means-clustering-based fiber nonlinearity equalization techniques for 64-QAM coherent optical communication system.

PubMed

Zhang, Junfeng; Chen, Wei; Gao, Mingyi; Shen, Gangxiang

2017-10-30

In this work, we proposed two k-means-clustering-based algorithms to mitigate the fiber nonlinearity for 64-quadrature amplitude modulation (64-QAM) signal, the training-sequence assisted k-means algorithm and the blind k-means algorithm. We experimentally demonstrated the proposed k-means-clustering-based fiber nonlinearity mitigation techniques in 75-Gb/s 64-QAM coherent optical communication system. The proposed algorithms have reduced clustering complexity and low data redundancy and they are able to quickly find appropriate initial centroids and select correctly the centroids of the clusters to obtain the global optimal solutions for large k value. We measured the bit-error-ratio (BER) performance of 64-QAM signal with different launched powers into the 50-km single mode fiber and the proposed techniques can greatly mitigate the signal impairments caused by the amplified spontaneous emission noise and the fiber Kerr nonlinearity and improve the BER performance.
A behavioural framework for the evolution of feeding in predatory aquatic mammals

PubMed Central

Fitzgerald, Erich M. G.; Evans, Alistair R.

2017-01-01

Extant aquatic mammals are a key component of aquatic ecosystems. Their morphology, ecological role and behaviour are, to a large extent, shaped by their feeding ecology. Nevertheless, the nature of this crucial aspect of their biology is often oversimplified and, consequently, misinterpreted. Here, we introduce a new framework that categorizes the feeding cycle of predatory aquatic mammals into four distinct functional stages (prey capture, manipulation and processing, water removal and swallowing), and details the feeding behaviours that can be employed at each stage. Based on this comprehensive scheme, we propose that the feeding strategies of living aquatic mammals form an evolutionary sequence that recalls the land-to-water transition of their ancestors. Our new conception helps to explain and predict the origin of particular feeding styles, such as baleen-assisted filter feeding in whales and raptorial ‘pierce’ feeding in pinnipeds, and informs the structure of present and past ecosystems. PMID:28250183
Cloud computing for genomic data analysis and collaboration.

PubMed

Langmead, Ben; Nellore, Abhinav

2018-04-01

Next-generation sequencing has made major strides in the past decade. Studies based on large sequencing data sets are growing in number, and public archives for raw sequencing data have been doubling in size every 18 months. Leveraging these data requires researchers to use large-scale computational resources. Cloud computing, a model whereby users rent computers and storage from large data centres, is a solution that is gaining traction in genomics research. Here, we describe how cloud computing is used in genomics for research and large-scale collaborations, and argue that its elasticity, reproducibility and privacy features make it ideally suited for the large-scale reanalysis of publicly available archived data, including privacy-protected data.
Design Principles for Computer-Assisted Instruction in Histology Education: An Exploratory Study

ERIC Educational Resources Information Center

Deniz, Hasan; Cakir, Hasan

2006-01-01

The purpose of this paper is to describe the development process and the key components of a computer-assisted histology material. Computer-assisted histology material is designed to supplement traditional histology education in a large Midwestern university. Usability information of the computer-assisted instruction (CAI) material was obtained…
Iraq: Recent Developments in Reconstruction Assistance

DTIC Science & Technology

2005-01-27

Developments in Reconstruction Assistance Summary Large-scale reconstruction assistance programs are being undertaken by the United States following the war...in grant aid and as much as $13.3 billion in possible loans. On June 28, 2004, the entity implementing assistance programs , the Coalition Provisional... programs are being undertaken by the United States in Iraq. This report describes recent developments in this assistance effort. The report will be updated
Iraq: Recent Developments in Reconstruction Assistance

DTIC Science & Technology

2004-12-20

Developments in Reconstruction Assistance Summary Large-scale reconstruction assistance programs are being undertaken by the United States following the war...in grant aid and as much as $13.3 billion in possible loans. On June 28, 2004, the entity implementing assistance programs , the Coalition... programs are being undertaken by the United States in Iraq. This report describes recent developments in this assistance effort. The report will be
Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.

PubMed

Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R

2017-07-01

The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.
2D and 3D Modeling of the Stratigraphic Sequences at the Adriatic and Rhone Continental Margins

DTIC Science & Technology

2005-09-30

Grenerczy, D. Medak, S. Stein, and J. C. Weber (Eds.). The Adria Microplate : GPS Geodesy, Tectonics , and Hazards. Kluwer Academic Publisher, pp. 93-116... tectonics , and their influences on sequence architecture. John Swenson, with assistance from Chris Paola, Juan Fedele, myself and others have jointly...exploration of the margin’s response to variations in sea level, sediment supply, tectonic subsidence, and wave climate over longer timescales. I am
Computationally assisted screening and design of cell-interactive peptides by a cell-based assay using peptide arrays and a fuzzy neural network algorithm.

PubMed

Kaga, Chiaki; Okochi, Mina; Tomita, Yasuyuki; Kato, Ryuji; Honda, Hiroyuki

2008-03-01

We developed a method of effective peptide screening that combines experiments and computational analysis. The method is based on the concept that screening efficiency can be enhanced from even limited data by use of a model derived from computational analysis that serves as a guide to screening and combining the model with subsequent repeated experiments. Here we focus on cell-adhesion peptides as a model application of this peptide-screening strategy. Cell-adhesion peptides were screened by use of a cell-based assay of a peptide array. Starting with the screening data obtained from a limited, random 5-mer library (643 sequences), a rule regarding structural characteristics of cell-adhesion peptides was extracted by fuzzy neural network (FNN) analysis. According to this rule, peptides with unfavored residues in certain positions that led to inefficient binding were eliminated from the random sequences. In the restricted, second random library (273 sequences), the yield of cell-adhesion peptides having an adhesion rate more than 1.5-fold to that of the basal array support was significantly high (31%) compared with the unrestricted random library (20%). In the restricted third library (50 sequences), the yield of cell-adhesion peptides increased to 84%. We conclude that a repeated cycle of experiments screening limited numbers of peptides can be assisted by the rule-extracting feature of FNN.
Video enhancement workbench: an operational real-time video image processing system

NASA Astrophysics Data System (ADS)

Yool, Stephen R.; Van Vactor, David L.; Smedley, Kirk G.

1993-01-01

Video image sequences can be exploited in real-time, giving analysts rapid access to information for military or criminal investigations. Video-rate dynamic range adjustment subdues fluctuations in image intensity, thereby assisting discrimination of small or low- contrast objects. Contrast-regulated unsharp masking enhances differentially shadowed or otherwise low-contrast image regions. Real-time removal of localized hotspots, when combined with automatic histogram equalization, may enhance resolution of objects directly adjacent. In video imagery corrupted by zero-mean noise, real-time frame averaging can assist resolution and location of small or low-contrast objects. To maximize analyst efficiency, lengthy video sequences can be screened automatically for low-frequency, high-magnitude events. Combined zoom, roam, and automatic dynamic range adjustment permit rapid analysis of facial features captured by video cameras recording crimes in progress. When trying to resolve small objects in murky seawater, stereo video places the moving imagery in an optimal setting for human interpretation.
Identification of anaerobic bacteria by Bruker Biotyper matrix-assisted laser desorption ionization-time of flight mass spectrometry with on-plate formic acid preparation.

PubMed

Schmitt, Bryan H; Cunningham, Scott A; Dailey, Aaron L; Gustafson, Daniel R; Patel, Robin

2013-03-01

Identification of anaerobic bacteria using phenotypic methods is often time-consuming; methods such as 16S rRNA gene sequencing are costly and may not be readily available. We evaluated 253 clinical isolates of anaerobic bacteria using the Bruker MALDI Biotyper (Bruker Daltonics, Billerica, MA) matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) system with a user-supplemented database and an on-plate formic acid-based preparation method and compared results to those of conventional identification using biochemical testing or 16S rRNA gene sequencing. A total of 179 (70.8%) and 232 (91.7%) isolates were correctly identified to the species and genus levels, respectively, using manufacturer-recommended score cutoffs. MALDI-TOF MS offers a rapid, inexpensive method for identification of anaerobic bacteria.
The ASLOTS concept: An interactive, adaptive decision support concept for Final Approach Spacing of Aircraft (FASA). FAA-NASA Joint University Program

NASA Technical Reports Server (NTRS)

Simpson, Robert W.

1993-01-01

This presentation outlines a concept for an adaptive, interactive decision support system to assist controllers at a busy airport in achieving efficient use of multiple runways. The concept is being implemented as a computer code called FASA (Final Approach Spacing for Aircraft), and will be tested and demonstrated in ATCSIM, a high fidelity simulation of terminal area airspace and airport surface operations. Objectives are: (1) to provide automated cues to assist controllers in the sequencing and spacing of landing and takeoff aircraft; (2) to provide the controller with a limited ability to modify the sequence and spacings between aircraft, and to insert takeoffs and missed approach aircraft in the landing flows; (3) to increase spacing accuracy using more complex and precise separation criteria while reducing controller workload; and (4) achieve higher operational takeoff and landing rates on multiple runways in poor visibility.
BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in Upland cotton

USDA-ARS?s Scientific Manuscript database

New and emerging next generation sequencing technologies have reduced sequencing costs, but there is room for additional approaches that can be applied to complex polyploid plant genomes. Large (about 2.5GB) and highly repetitive tetraploid genome of G. hirsutum is still cost-intensive with traditi...
Identification of Non-diphtheriae Corynebacterium by Use of Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry

PubMed Central

Alatoom, Adnan A.; Cazanave, Charles J.; Cunningham, Scott A.; Ihde, Sherry M.

2012-01-01

We evaluated the Bruker Biotyper matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) mass spectrometry for identification of 92 clinical isolates of Corynebacterium species in comparison to identification using rpoB or 16S rRNA gene sequencing. Eighty isolates (87%) yielded a score of ≥1.700, and all of these were correctly identified to the species level with the exception of Corynebacterium aurimucosum being misidentified as the closely related Corynebacterium minutissimum. PMID:22075579
QTL mapping for downy mildew resistance in cucumber via bulked segregant analysis using next-generation sequencing and conventional methods.

PubMed

Win, Khin Thanda; Vegas, Juan; Zhang, Chunying; Song, Kihwan; Lee, Sanghyeob

2017-01-01

QTL mapping using NGS-assisted BSA was successfully applied to an F 2 population for downy mildew resistance in cucumber. QTLs detected by NGS-assisted BSA were confirmed by conventional QTL analysis. Downy mildew (DM), caused by Pseudoperonospora cubensis, is one of the most destructive foliar diseases in cucumber. QTL mapping is a fundamental approach for understanding the genetic inheritance of DM resistance in cucumber. Recently, many studies have reported that a combination of bulked segregant analysis (BSA) and next-generation sequencing (NGS) can be a rapid and cost-effective way of mapping QTLs. In this study, we applied NGS-assisted BSA to QTL mapping of DM resistance in cucumber and confirmed the results by conventional QTL analysis. By sequencing two DNA pools each consisting of ten individuals showing high resistance and susceptibility to DM from a F 2 population, we identified single nucleotide polymorphisms (SNPs) between the two pools. We employed a statistical method for QTL mapping based on these SNPs. Five QTLs, dm2.2, dm4.1, dm5.1, dm5.2, and dm6.1, were detected and dm2.2 showed the largest effect on DM resistance. Conventional QTL analysis using the F 2 confirmed dm2.2 (R 2 = 10.8-24 %) and dm5.2 (R 2 = 14-27.2 %) as major QTLs and dm4.1 (R 2 = 8 %) as two minor QTLs, but could not detect dm5.1 and dm6.1. A new QTL on chromosome 2, dm2.1 (R 2 = 28.2 %) was detected by the conventional QTL method using an F 3 population. This study demonstrated the effectiveness of NGS-assisted BSA for mapping QTLs conferring DM resistance in cucumber and revealed the unique genetic inheritance of DM resistance in this population through two distinct major QTLs on chromosome 2 that mainly harbor DM resistance.
SU-C-17A-02: Sirius MRI Markers for Prostate Post-Implant Assessment: MR Protocol Development

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lim, T; Wang, J; Kudchadker, R

Purpose: Currently, CT is used to visualize prostate brachytherapy sources, at the expense of accurate structure contouring. MRI is superior to CT for anatomical delineation, but the sources appear as voids on MRI images. Previously we have developed Sirius MRI markers (C4 Imaging) to replace spacers to assist source localization on MRI images. Here we develop an MRI pulse sequence protocol that enhances the signal of these markers to enable MRI-only post-implant prostate dosimetric analysis. Methods: To simulate a clinical scenario, a CIRS multi-modality prostate phantom was implanted with 66 markers and 86 sources. The implanted phantom was imaged onmore » both 1.5T and 3.0T GE scanners under various conditions, different pulse sequences (2D fast spin echo [FSE], 3D balanced steadystate free precession [bSSFP] and 3D fast spoiled gradient echo [FSPGR]), as well as varying amount of padding to simulate various patient sizes and associated signal fall-off from the surface coil elements. Standard FSE sequences from the current clinical protocols were also evaluated. Marker visibility, marker size, intra-marker distance, total scan time and artifacts were evaluated for various combinations of echo time, repetition time, flip angle, number of excitations, bandwidth, slice thickness and spacing, fieldof- view, frequency/phase encoding steps and frequency direction. Results: We have developed a 3D FSPGR pulse sequence that enhances marker signal and ensures the integrity of the marker shape while maintaining reasonable scan time. For patients contraindicated for 3.0T, we have also developed a similar sequence for 1.5T scanners. Signal fall-off with distance from prostate to coil can be compensated mainly by decreasing bandwidth. The markers are not visible using standard FSE sequences. FSPGR sequences are more robust for consistent marker visualization as compared to bSSFP sequences. Conclusion: The developed MRI pulse sequence protocol for Sirius MRI markers assists source localization to enable MRIonly post-implant prostate dosimetric analysis. S.J. Frank is a co-founder of C4 Imaging (manufactures the MRI markers)« less
Computational Prediction of the Immunomodulatory Potential of RNA Sequences.

PubMed

Nagpal, Gandharva; Chaudhary, Kumardeep; Dhanda, Sandeep Kumar; Raghava, Gajendra Pal Singh

2017-01-01

Advances in the knowledge of various roles played by non-coding RNAs have stimulated the application of RNA molecules as therapeutics. Among these molecules, miRNA, siRNA, and CRISPR-Cas9 associated gRNA have been identified as the most potent RNA molecule classes with diverse therapeutic applications. One of the major limitations of RNA-based therapeutics is immunotoxicity of RNA molecules as it may induce the innate immune system. In contrast, RNA molecules that are potent immunostimulators are strong candidates for use in vaccine adjuvants. Thus, it is important to understand the immunotoxic or immunostimulatory potential of these RNA molecules. The experimental techniques for determining immunostimulatory potential of siRNAs are time- and resource-consuming. To overcome this limitation, recently our group has developed a web-based server "imRNA" for predicting the immunomodulatory potential of RNA sequences. This server integrates a number of modules that allow users to perform various tasks including (1) generation of RNA analogs with reduced immunotoxicity, (2) identification of highly immunostimulatory regions in RNA sequence, and (3) virtual screening. This server may also assist users in the identification of minimum mutations required in a given RNA sequence to minimize its immunomodulatory potential that is required for designing RNA-based therapeutics. Besides, the server can be used for designing RNA-based vaccine adjuvants as it may assist users in the identification of mutations required for increasing immunomodulatory potential of a given RNA sequence. In summary, this chapter describes major applications of the "imRNA" server in designing RNA-based therapeutics and vaccine adjuvants (http://www.imtech.res.in/raghava/imrna/).

MALDI-TOF Mass Spectrometry Is a Fast and Reliable Platform for Identification and Ecological Studies of Species from Family Rhizobiaceae

PubMed Central

Ferreira, Laura; Sánchez-Juanes, Fernando; García-Fraile, Paula; Rivas, Raúl; Mateos, Pedro F.; Martínez-Molina, Eustoquio; González-Buitrago, José Manuel; Velázquez, Encarna

2011-01-01

Family Rhizobiaceae includes fast growing bacteria currently arranged into three genera, Rhizobium, Ensifer and Shinella, that contain pathogenic, symbiotic and saprophytic species. The identification of these species is not possible on the basis of physiological or biochemical traits and should be based on sequencing of several genes. Therefore alternative methods are necessary for rapid and reliable identification of members from family Rhizobiaceae. In this work we evaluated the suitability of Matrix-Assisted Laser Desorption Ionization-Time-of-Flight Mass Spectrometry (MALDI-TOF MS) for this purpose. Firstly, we evaluated the capability of this methodology to differentiate among species of family Rhizobiaceae including those closely related and then we extended the database of MALDI Biotyper 2.0 including the type strains of 56 species from genera Rhizobium, Ensifer and Shinella. Secondly, we evaluated the identification potential of this methodology by using several strains isolated from different sources previously identified on the basis of their rrs, recA and atpD gene sequences. The 100% of these strains were correctly identified showing that MALDI-TOF MS is an excellent tool for identification of fast growing rhizobia applicable to large populations of isolates in ecological and taxonomic studies. PMID:21655291
Discovery of Novel Antimicrobial Peptides from Varanus komodoensis (Komodo Dragon) by Large-Scale Analyses and De-Novo-Assisted Sequencing Using Electron-Transfer Dissociation Mass Spectrometry.

PubMed

Bishop, Barney M; Juba, Melanie L; Russo, Paul S; Devine, Megan; Barksdale, Stephanie M; Scott, Shaylyn; Settlage, Robert; Michalak, Pawel; Gupta, Kajal; Vliet, Kent; Schnur, Joel M; van Hoek, Monique L

2017-04-07

Komodo dragons are the largest living lizards and are the apex predators in their environs. They endure numerous strains of pathogenic bacteria in their saliva and recover from wounds inflicted by other dragons, reflecting the inherent robustness of their innate immune defense. We have employed a custom bioprospecting approach combining partial de novo peptide sequencing with transcriptome assembly to identify cationic antimicrobial peptides from Komodo dragon plasma. Through these analyses, we identified 48 novel potential cationic antimicrobial peptides. All but one of the identified peptides were derived from histone proteins. The antimicrobial effectiveness of eight of these peptides was evaluated against Pseudomonas aeruginosa (ATCC 9027) and Staphylococcus aureus (ATCC 25923), with seven peptides exhibiting antimicrobial activity against both microbes and one only showing significant potency against P. aeruginosa. This study demonstrates the power and promise of our bioprospecting approach to cationic antimicrobial peptide discovery, and it reveals the presence of a plethora of novel histone-derived antimicrobial peptides in the plasma of the Komodo dragon. These findings may have broader implications regarding the role that intact histones and histone-derived peptides play in defending the host from infection. Data are available via ProteomeXChange with identifier PXD005043.
Role of the Pepino mosaic virus 3'-untranslated region elements in negative-strand RNA synthesis in vitro.

PubMed

Osman, Toba A M; Olsthoorn, René C L; Livieratos, Ioannis C

2014-09-22

Pepino mosaic virus (PepMV) is a mechanically-transmitted positive-strand RNA potexvirus, with a 6410 nt long single-stranded (ss) RNA genome flanked by a 5'-methylguanosine cap and a 3' poly-A tail. Computer-assisted folding of the 64 nt long PepMV 3'-untranslated region (UTR) resulted in the prediction of three stem-loop structures (hp1, hp2, and hp3 in the 3'-5' direction). The importance of these structures and/or sequences for promotion of negative-strand RNA synthesis and binding to the RNA dependent RNA polymerase (RdRp) was tested in vitro using a specific RdRp assay. Hp1, which is highly variable among different PepMV isolates, appeared dispensable for negative-strand synthesis. Hp2, which is characterized by a large U-rich loop, tolerated base-pair changes in its stem as long as they maintained the stem integrity but was very sensitive to changes in the U-rich loop. Hp3, which harbours the conserved potexvirus ACUUAA hexamer motif, was essential for template activity. Template-RNA polymerase binding competition experiments showed that the ACUUAA sequence represents a high-affinity RdRp binding element. Copyright © 2014 Elsevier B.V. All rights reserved.
Whole genome sequencing of turbot (Scophthalmus maximus; Pleuronectiformes): a fish adapted to demersal life

PubMed Central

Figueras, Antonio; Robledo, Diego; Corvelo, André; Hermida, Miguel; Pereiro, Patricia; Rubiolo, Juan A.; Gómez-Garrido, Jèssica; Carreté, Laia; Bello, Xabier; Gut, Marta; Gut, Ivo Glynne; Marcet-Houben, Marina; Forn-Cuní, Gabriel; Galán, Beatriz; García, José Luis; Abal-Fabeiro, José Luis; Pardo, Belen G.; Taboada, Xoana; Fernández, Carlos; Vlasova, Anna; Hermoso-Pulido, Antonio; Guigó, Roderic; Álvarez-Dios, José Antonio; Gómez-Tato, Antonio; Viñas, Ana; Maside, Xulio; Gabaldón, Toni; Novoa, Beatriz; Bouza, Carmen; Alioto, Tyler; Martínez, Paulino

2016-01-01

The turbot is a flatfish (Pleuronectiformes) with increasing commercial value, which has prompted active genomic research aimed at more efficient selection. Here we present the sequence and annotation of the turbot genome, which represents a milestone for both boosting breeding programmes and ascertaining the origin and diversification of flatfish. We compare the turbot genome with model fish genomes to investigate teleost chromosome evolution. We observe a conserved macrosyntenic pattern within Percomorpha and identify large syntenic blocks within the turbot genome related to the teleost genome duplication. We identify gene family expansions and positive selection of genes associated with vision and metabolism of membrane lipids, which suggests adaptation to demersal lifestyle and to cold temperatures, respectively. Our data indicate a quick evolution and diversification of flatfish to adapt to benthic life and provide clues for understanding their controversial origin. Moreover, we investigate the genomic architecture of growth, sex determination and disease resistance, key traits for understanding local adaptation and boosting turbot production, by mapping candidate genes and previously reported quantitative trait loci. The genomic architecture of these productive traits has allowed the identification of candidate genes and enriched pathways that may represent useful information for future marker-assisted selection in turbot. PMID:26951068
Whole genome sequencing of turbot (Scophthalmus maximus; Pleuronectiformes): a fish adapted to demersal life.

PubMed

Figueras, Antonio; Robledo, Diego; Corvelo, André; Hermida, Miguel; Pereiro, Patricia; Rubiolo, Juan A; Gómez-Garrido, Jèssica; Carreté, Laia; Bello, Xabier; Gut, Marta; Gut, Ivo Glynne; Marcet-Houben, Marina; Forn-Cuní, Gabriel; Galán, Beatriz; García, José Luis; Abal-Fabeiro, José Luis; Pardo, Belen G; Taboada, Xoana; Fernández, Carlos; Vlasova, Anna; Hermoso-Pulido, Antonio; Guigó, Roderic; Álvarez-Dios, José Antonio; Gómez-Tato, Antonio; Viñas, Ana; Maside, Xulio; Gabaldón, Toni; Novoa, Beatriz; Bouza, Carmen; Alioto, Tyler; Martínez, Paulino

2016-06-01

The turbot is a flatfish (Pleuronectiformes) with increasing commercial value, which has prompted active genomic research aimed at more efficient selection. Here we present the sequence and annotation of the turbot genome, which represents a milestone for both boosting breeding programmes and ascertaining the origin and diversification of flatfish. We compare the turbot genome with model fish genomes to investigate teleost chromosome evolution. We observe a conserved macrosyntenic pattern within Percomorpha and identify large syntenic blocks within the turbot genome related to the teleost genome duplication. We identify gene family expansions and positive selection of genes associated with vision and metabolism of membrane lipids, which suggests adaptation to demersal lifestyle and to cold temperatures, respectively. Our data indicate a quick evolution and diversification of flatfish to adapt to benthic life and provide clues for understanding their controversial origin. Moreover, we investigate the genomic architecture of growth, sex determination and disease resistance, key traits for understanding local adaptation and boosting turbot production, by mapping candidate genes and previously reported quantitative trait loci. The genomic architecture of these productive traits has allowed the identification of candidate genes and enriched pathways that may represent useful information for future marker-assisted selection in turbot. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest.

PubMed

Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

2018-01-05

DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html.
Microbial genomic island discovery, visualization and analysis.

PubMed

Bertelli, Claire; Tilley, Keith E; Brinkman, Fiona S L

2018-06-03

Horizontal gene transfer (also called lateral gene transfer) is a major mechanism for microbial genome evolution, enabling rapid adaptation and survival in specific niches. Genomic islands (GIs), commonly defined as clusters of bacterial or archaeal genes of probable horizontal origin, are of particular medical, environmental and/or industrial interest, as they disproportionately encode virulence factors and some antimicrobial resistance genes and may harbor entire metabolic pathways that confer a specific adaptation (solvent resistance, symbiosis properties, etc). As large-scale analyses of microbial genomes increases, such as for genomic epidemiology investigations of infectious disease outbreaks in public health, there is increased appreciation of the need to accurately predict and track GIs. Over the past decade, numerous computational tools have been developed to tackle the challenges inherent in accurate GI prediction. We review here the main types of GI prediction methods and discuss their advantages and limitations for a routine analysis of microbial genomes in this era of rapid whole-genome sequencing. An assessment is provided of 20 GI prediction software methods that use sequence-composition bias to identify the GIs, using a reference GI data set from 104 genomes obtained using an independent comparative genomics approach. Finally, we present guidelines to assist researchers in effectively identifying these key genomic regions.
DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest

PubMed Central

Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

2018-01-01

DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html PMID:29416743
Using PCR-based detection and genotyping to trace Streptococcus salivarius meningitis outbreak strain to oral flora of radiology physician assistant.

PubMed

Srinivasan, Velusamy; Gertz, Robert E; Shewmaker, Patricia L; Patrick, Sarah; Chitnis, Amit S; O'Connell, Heather; Benowitz, Isaac; Patel, Priti; Guh, Alice Y; Noble-Wang, Judith; Turabelidze, George; Beall, Bernard

2012-01-01

We recently investigated three cases of bacterial meningitis that were reported from a midwestern radiology clinic where facemasks were not worn during spinal injection of contrast agent during myelography procedures. Using pulsed field gel electrophoresis we linked a case strain of S. salivarius to an oral specimen of a radiology physician assistant (RPA). We also used a real-time PCR assay to detect S. salivarius DNA within a culture-negative cerebrospinal fluid (CSF) specimen. Here we extend this investigation through using a nested PCR/sequencing strategy to link the culture-negative CSF specimen to the case strain. We also provide validation of the real-time PCR assay used, demonstrating that it is not solely specific for Streptococcus salivarius, but is also highly sensitive for detection of the closely related oral species Streptococcus vestibularis. Through using multilocus sequence typing and 16S rDNA sequencing we further strengthen the link between the CSF case isolate and the RPA carriage isolate. We also demonstrate that the newly characterized strains from this study are distinct from previously characterized S. salivarius strains associated with carriage and meningitis.
High-resolution linkage map and chromosome-scale genome assembly for cassava (Manihot esculenta Crantz) from 10 populations.

PubMed

2014-12-11

Cassava (Manihot esculenta Crantz) is a major staple crop in Africa, Asia, and South America, and its starchy roots provide nourishment for 800 million people worldwide. Although native to South America, cassava was brought to Africa 400-500 years ago and is now widely cultivated across sub-Saharan Africa, but it is subject to biotic and abiotic stresses. To assist in the rapid identification of markers for pathogen resistance and crop traits, and to accelerate breeding programs, we generated a framework map for M. esculenta Crantz from reduced representation sequencing [genotyping-by-sequencing (GBS)]. The composite 2412-cM map integrates 10 biparental maps (comprising 3480 meioses) and organizes 22,403 genetic markers on 18 chromosomes, in agreement with the observed karyotype. We used the map to anchor 71.9% of the draft genome assembly and 90.7% of the predicted protein-coding genes. The chromosome-anchored genome sequence will be useful for breeding improvement by assisting in the rapid identification of markers linked to important traits, and in providing a framework for genomic selection-enhanced breeding of this important crop. Copyright © 2015 International Cassava Genetic Map Consortium (ICGMC).
High-resolution linkage map and chromosome-scale genome assembly for cassava ( Manihot esculenta Crantz) from 10 populations

DOE PAGES

Lyons, Jessica

2014-12-11

Cassava Manihot esculenta Crantz) is a major staple crop in Africa, Asia, and South America, and its starchy roots provide nourishment for 800 million people worldwide. Although native to South America, cassava was brought to Africa 400–500 years ago and is now widely cultivated across sub-Saharan Africa, but it is subject to biotic and abiotic stresses. To assist in the rapid identification of markers for pathogen resistance and crop traits, and to accelerate breeding programs, we generated a framework map for M. esculent Crantz from reduced representation sequencing [genotyping-by-sequencing (GBS)]. The composite 2412-cM map integrates 10 biparental maps (comprising 3480more » meioses) and organizes 22,403 genetic markers on 18 chromosomes, in agreement with the observed karyotype. Here, we used the map to anchor 71.9% of the draft genome assembly and 90.7% of the predicted protein-coding genes. The chromosome-anchored genome sequence will be useful for breeding improvement by assisting in the rapid identification of markers linked to important traits, and in providing a framework for genomic selectionenhanced breeding of this important crop.« less
High-resolution linkage map and chromosome-scale genome assembly for cassava ( Manihot esculenta Crantz) from 10 populations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lyons, Jessica

Cassava Manihot esculenta Crantz) is a major staple crop in Africa, Asia, and South America, and its starchy roots provide nourishment for 800 million people worldwide. Although native to South America, cassava was brought to Africa 400–500 years ago and is now widely cultivated across sub-Saharan Africa, but it is subject to biotic and abiotic stresses. To assist in the rapid identification of markers for pathogen resistance and crop traits, and to accelerate breeding programs, we generated a framework map for M. esculent Crantz from reduced representation sequencing [genotyping-by-sequencing (GBS)]. The composite 2412-cM map integrates 10 biparental maps (comprising 3480more » meioses) and organizes 22,403 genetic markers on 18 chromosomes, in agreement with the observed karyotype. Here, we used the map to anchor 71.9% of the draft genome assembly and 90.7% of the predicted protein-coding genes. The chromosome-anchored genome sequence will be useful for breeding improvement by assisting in the rapid identification of markers linked to important traits, and in providing a framework for genomic selectionenhanced breeding of this important crop.« less
Next Generation Sequencing Technologies: The Doorway to the Unexplored Genomics of Non-Model Plants

PubMed Central

Unamba, Chibuikem I. N.; Nag, Akshay; Sharma, Ram K.

2015-01-01

Non-model plants i.e., the species which have one or all of the characters such as long life cycle, difficulty to grow in the laboratory or poor fecundity, have been schemed out of sequencing projects earlier, due to high running cost of Sanger sequencing. Consequently, the information about their genomics and key biological processes are inadequate. However, the advent of fast and cost effective next generation sequencing (NGS) platforms in the recent past has enabled the unearthing of certain characteristic gene structures unique to these species. It has also aided in gaining insight about mechanisms underlying processes of gene expression and secondary metabolism as well as facilitated development of genomic resources for diversity characterization, evolutionary analysis and marker assisted breeding even without prior availability of genomic sequence information. In this review we explore how different Next Gen Sequencing platforms, as well as recent advances in NGS based high throughput genotyping technologies are rewarding efforts on de-novo whole genome/transcriptome sequencing, development of genome wide sequence based markers resources for improvement of non-model crops that are less costly than phenotyping. PMID:26734016
Performance and microbial community dynamics of electricity-assisted sequencing batch reactor (SBR) for treatment of saline petrochemical wastewater.

PubMed

Liu, Jiaxin; Shi, Shengnan; Ji, Xiangyu; Jiang, Bei; Xue, Lanlan; Li, Meidi; Tan, Liang

2017-07-01

High-salinity wastewater is often difficult to treat by common biological technologies due to salinity stress on the bacterial community. Electricity-assisted anaerobic technologies have significantly enhanced the treatment performance by alleviating the impact of salinity stress on the bacterial community, but electricity-assisted aerobic technologies have less been reported. Herein, a novel bio-electrochemistry system has been designed and operated in which a pair of stainless iron mesh-graphite plate electrodes were installed into a sequencing batch reactor (SBR, designated as S1) to strengthen the performance of saline petrochemical wastewater under aerobic conditions. The removal efficiency of phenol and chemical oxygen demand (COD) in S1 were 94.1 and 91.2%, respectively, on day 45, which was clearly higher than the removal efficiency of a single SBR (S2) and an electrochemical reactor (S3), indicating that a coupling effect existed between the electrochemical process and biodegradation. A certain amount of salinity (≤8000 mg/L) could enhance the treatment performance in S1 but weaken that in S2. Illumina sequencing revealed that microbial communities in S1 on days 45 and 91 were richer and more diverse than in S2, which suggests that electrical stimulation could enhance the diversity and richness of the microbial community, and reduce the negative effect of salinity on the microorganisms and enrich some salt-adapted microorganisms, thus improve the ability of S1 to respond to salinity stress. This novel bio-electrochemistry system was shown to be an alternative technology for the high saline petrochemical wastewater.
Active learning reduces annotation time for clinical concept extraction.

PubMed

Kholghi, Mahnoosh; Sitbon, Laurianne; Zuccon, Guido; Nguyen, Anthony

2017-10-01

To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation. There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models. When annotating from scratch, active learning reduced the annotation time up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in 20% further reduction of the annotation time when compared to de novo annotation. The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches as demonstrated by high correlation between time rate and concept annotation rate. Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations. Copyright © 2017 Elsevier B.V. All rights reserved.
Species-Level Identification of Actinomyces Isolates Causing Invasive Infections: Multiyear Comparison of Vitek MS (Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry) to Partial Sequencing of the 16S rRNA Gene.

PubMed

Lynch, T; Gregson, D; Church, D L

2016-03-01

Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization-time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genera. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Species-Level Identification of Actinomyces Isolates Causing Invasive Infections: Multiyear Comparison of Vitek MS (Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry) to Partial Sequencing of the 16S rRNA Gene

PubMed Central

Gregson, D.; Church, D. L.

2016-01-01

Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization–time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genera. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized. PMID:26739153
MSLICE Sequencing

NASA Technical Reports Server (NTRS)

Crockett, Thomas M.; Joswig, Joseph C.; Shams, Khawaja S.; Norris, Jeffrey S.; Morris, John R.

2011-01-01

MSLICE Sequencing is a graphical tool for writing sequences and integrating them into RML files, as well as for producing SCMF files for uplink. When operated in a testbed environment, it also supports uplinking these SCMF files to the testbed via Chill. This software features a free-form textural sequence editor featuring syntax coloring, automatic content assistance (including command and argument completion proposals), complete with types, value ranges, unites, and descriptions from the command dictionary that appear as they are typed. The sequence editor also has a "field mode" that allows tabbing between arguments and displays type/range/units/description for each argument as it is edited. Color-coded error and warning annotations on problematic tokens are included, as well as indications of problems that are not visible in the current scroll range. "Quick Fix" suggestions are made for resolving problems, and all the features afforded by modern source editors are also included such as copy/cut/paste, undo/redo, and a sophisticated find-and-replace system optionally using regular expressions. The software offers a full XML editor for RML files, which features syntax coloring, content assistance and problem annotations as above. There is a form-based, "detail view" that allows structured editing of command arguments and sequence parameters when preferred. The "project view" shows the user s "workspace" as a tree of "resources" (projects, folders, and files) that can subsequently be opened in editors by double-clicking. Files can be added, deleted, dragged-dropped/copied-pasted between folders or projects, and these operations are undoable and redoable. A "problems view" contains a tabular list of all problems in the current workspace. Double-clicking on any row in the table opens an editor for the appropriate sequence, scrolling to the specific line with the problem, and highlighting the problematic characters. From there, one can invoke "quick fix" as described above to resolve the issue. Once resolved, saving the file causes the problem to be removed from the problem view.
Complete genome sequence of lymphocystis disease virus isolated from China.

PubMed

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-07-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China

PubMed Central

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-01-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775

Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

PubMed

Spielmann, A; Stutz, E

1983-10-25

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
Geranyl diphosphate synthase large subunit, and methods of use

DOEpatents

Croteau, Rodney B.; Burke, Charles C.; Wildung, Mark R.

2001-10-16

A cDNA encoding geranyl diphosphate synthase large subunit from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase large subunit). In another aspect, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase large subunit. In yet another aspect, the present invention provides isolated, recombinant geranyl diphosphate synthase protein comprising an isolated, recombinant geranyl diphosphate synthase large subunit protein and an isolated, recombinant geranyl diphosphate synthase small subunit protein. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase.
GWFASTA: server for FASTA search in eukaryotic and microbial genomes.

PubMed

Issac, Biju; Raghava, G P S

2002-09-01

Similarity searches are a powerful method for solving important biological problems such as database scanning, evolutionary studies, gene prediction, and protein structure prediction. FASTA is a widely used sequence comparison tool for rapid database scanning. Here we describe the GWFASTA server that was developed to assist the FASTA user in similarity searches against partially and/or completely sequenced genomes. GWFASTA consists of more than 60 microbial genomes, eight eukaryote genomes, and proteomes of annotatedgenomes. Infact, it provides the maximum number of databases for similarity searching from a single platform. GWFASTA allows the submission of more than one sequence as a single query for a FASTA search. It also provides integrated post-processing of FASTA output, including compositional analysis of proteins, multiple sequences alignment, and phylogenetic analysis. Furthermore, it summarizes the search results organism-wise for prokaryotes and chromosome-wise for eukaryotes. Thus, the integration of different tools for sequence analyses makes GWFASTA a powerful toolfor biologists.
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.

PubMed

Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R

1999-12-16

The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
New Impetus for Several General Assistance Programs.

ERIC Educational Resources Information Center

Reeder, Rick

1999-01-01

Describes 1999 federal funding to large general-assistance programs affecting small towns and rural areas (including Housing and Urban Development, federal disaster relief, rural extension activities, and Bureau of Indian Affairs assistance programs); increased funding for Empowerment Zones/Enterprise Communities; reauthorization of the Economic…
Computation and Theory in Large-Scale Optimization

DTIC Science & Technology

1993-01-13

Sang Jin Lee. Research Assistant. - Laura Morley, Research Assistant. - Yonca A. Ozge , Research Assistant. - Stephen M. Robinson. Professor. - Hichem...other participants. M.N. Azadez. S.J. Lee. Y.A. Ozge . and H. Sellami are continuing students in the doctoral program (in Industrial Engineering except
Distant plant homologues: don't throw out the baby.

PubMed

Gardiner, John; Overall, Robyn; Marc, Jan

2012-03-01

Plants and metazoans share many similarities in terms of conserved proteins. Antibodies have been used extensively to detect remote homologues, many of which are yet to be identified conclusively. Genome sequencing and the creation of novel sequence or structure comparison programs have assisted greatly in the identification of distant protein homologues. The continuing development of new software algorithms and the combining of bioinformatics with proteomics offer hope that remaining homologues will be soon identified. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Use of low-coverage, large-insert, short-read data for rapid and accurate generation of enhanced-quality draft Pseudomonas genome sequences.

PubMed

O'Brien, Heath E; Gong, Yunchen; Fung, Pauline; Wang, Pauline W; Guttman, David S

2011-01-01

Next-generation genomic technology has both greatly accelerated the pace of genome research as well as increased our reliance on draft genome sequences. While groups such as the Genomics Standards Consortium have made strong efforts to promote genome standards there is a still a general lack of uniformity among published draft genomes, leading to challenges for downstream comparative analyses. This lack of uniformity is a particular problem when using standard draft genomes that frequently have large numbers of low-quality sequencing tracts. Here we present a proposal for an "enhanced-quality draft" genome that identifies at least 95% of the coding sequences, thereby effectively providing a full accounting of the genic component of the genome. Enhanced-quality draft genomes are easily attainable through a combination of small- and large-insert next-generation, paired-end sequencing. We illustrate the generation of an enhanced-quality draft genome by re-sequencing the plant pathogenic bacterium Pseudomonas syringae pv. phaseolicola 1448A (Pph 1448A), which has a published, closed genome sequence of 5.93 Mbp. We use a combination of Illumina paired-end and mate-pair sequencing, and surprisingly find that de novo assemblies with 100x paired-end coverage and mate-pair sequencing with as low as low as 2-5x coverage are substantially better than assemblies based on higher coverage. The rapid and low-cost generation of large numbers of enhanced-quality draft genome sequences will be of particular value for microbial diagnostics and biosecurity, which rely on precise discrimination of potentially dangerous clones from closely related benign strains.
Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes

PubMed Central

Liu, Kuan-Liang; Porras-Alfaro, Andrea; Eichorst, Stephanie A.

2012-01-01

Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naïve Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naïve Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naïve Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project (http://rdp.cme.msu.edu/classifier/classifier.jsp). PMID:22194300
Using a Dialogue System Based on Dialogue Maps for Computer Assisted Second Language Learning

ERIC Educational Resources Information Center

Choi, Sung-Kwon; Kwon, Oh-Woog; Kim, Young-Kil; Lee, Yunkeun

2016-01-01

In order to use dialogue systems for computer assisted second-language learning systems, one of the difficult issues in such systems is how to construct large-scale dialogue knowledge that matches the dialogue modelling of a dialogue system. This paper describes how we have accomplished the short-term construction of large-scale and…
The Papillomavirus Episteme: a major update to the papillomavirus sequence database.

PubMed

Van Doorslaer, Koenraad; Li, Zhiwen; Xirasagar, Sandhya; Maes, Piet; Kaminsky, David; Liou, David; Sun, Qiang; Kaur, Ramandeep; Huyen, Yentram; McBride, Alison A

2017-01-04

The Papillomavirus Episteme (PaVE) is a database of curated papillomavirus genomic sequences, accompanied by web-based sequence analysis tools. This update describes the addition of major new features. The papillomavirus genomes within PaVE have been further annotated, and now includes the major spliced mRNA transcripts. Viral genes and transcripts can be visualized on both linear and circular genome browsers. Evolutionary relationships among PaVE reference protein sequences can be analysed using multiple sequence alignments and phylogenetic trees. To assist in viral discovery, PaVE offers a typing tool; a simplified algorithm to determine whether a newly sequenced virus is novel. PaVE also now contains an image library containing gross clinical and histopathological images of papillomavirus infected lesions. Database URL: https://pave.niaid.nih.gov/. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
The novel primers for mammal species identification-based mitochondrial cytochrome b sequence: implication for reserved wild animals in Thailand and endangered mammal species in Southeast Asia.

PubMed

Muangkram, Yuttamol; Wajjwalku, Worawidh; Amano, Akira; Sukmak, Manakorn

2018-01-01

We presented the powerful techniques for species identification using the short amplicon of mitochondrial cytochrome b gene sequence. Two faecal samples and one single hair sample of the Asian tapir were tested using the new cytochrome b primers. The results showed a high sequence similarity with the mainland Asian tapir group. The comparative sequence analysis of the reserved wild mammals in Thailand and the other endangered mammal species from Southeast Asia comprehensibly verified the potential of our novel primers. The forward and reverse primers were 94.2 and 93.2%, respectively, by the average value of the sequence identity among 77 species sequences, and the overall mean distance was 35.9%. This development technique could provide rapid, simple, and reliable tools for species confirmation. Especially, it could recognize the problematic biological specimens contained less DNA material from illegal products and assist with wildlife crime investigation of threatened species and related forensic casework.
Development of a Genetic Map for Onion (Allium cepa L.) Using Reference-Free Genotyping-by-Sequencing and SNP Assays

PubMed Central

Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl

2017-01-01

Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping

PubMed Central

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-01-01

The American cranberry (Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. PMID:28250016
Developing Rice with High Yield under Phosphorus Deficiency: Pup1 Sequence to Application1[W][OA

PubMed Central

Chin, Joong Hyoun; Gamuyao, Rico; Dalid, Cheryl; Bustamam, Masdiar; Prasetiyono, Joko; Moeljopawiro, Sugiono; Wissuwa, Matthias; Heuer, Sigrid

2011-01-01

The major quantitative trait locus (QTL) Phosphorus uptake1 (Pup1) confers tolerance of phosphorus deficiency in soil and is currently one of the most promising QTLs for the development of tolerant rice (Oryza sativa) varieties. To facilitate targeted introgression of Pup1 into intolerant varieties, the gene models predicted in the Pup1 region in the donor variety Kasalath were used to develop gene-based molecular markers that are evenly distributed over the fine-mapped 278-kb QTL region. To validate the gene models and optimize the markers, gene expression analyses and partial allelic sequencing were conducted. The markers were tested in more than 80 diverse rice accessions revealing three main groups with different Pup1 allele constitution. Accessions with tolerant (group I) and intolerant (group III) Pup1 alleles were distinguished from genotypes with Kasalath alleles at some of the analyzed loci (partial Pup1; group II). A germplasm survey additionally confirmed earlier data showing that Pup1 is largely absent from irrigated rice varieties but conserved in varieties and breeding lines adapted to drought-prone environments. A core set of Pup1 markers has been defined, and sequence polymorphisms suitable for single-nucleotide polymorphism marker development for high-throughput genotyping were identified. Following a marker-assisted backcrossing approach, Pup1 was introgressed into two irrigated rice varieties and three Indonesian upland varieties. First phenotypic evaluations of the introgression lines suggest that Pup1 is effective in different genetic backgrounds and environments and that it has the potential to significantly enhance grain yield under field conditions. PMID:21602323
Spectral comparison of directly imaged, young substellar companions using integral field spectroscopy - construction of an empiric log g sequence

NASA Astrophysics Data System (ADS)

Schmidt, T.; Neuhaüser, R.; Seifahrt, A.

2010-10-01

About 15 substellar companions with large separations (>∼50 AU) to their young primary stars and brown dwarfs are confirmed by both common proper motion and late-M / early-L type spectra. The origin and early evolution of these objects is still under debate. While often these substellar companions are regarded as brown dwarfs, they could possibly also be massive planets, the mass estimates are very uncertain so far. They are companions to primary stars or brown dwarfs in young associations and star forming regions like the TW Hya association, Upper Scorpius, Taurus, Beta Pic moving group, TucHor association, Lupus, Ophiuchus, and Chamaeleon, hence their ages and distances are well known, in contrast to free-floating brown dwarfs. An empirical classification is not possible, because a spectral sequence that is taking the lower gravity into account, is not existing. This problem leads to an apparent mismatch between spectra of old field type objects and young low-mass companions at the same effective temperature, hampering a determination of temperature and surface gravity independent from models. Now that about 15 such substellar candidates are found in associations of different ages, 1 - 35 Myrs, it is possible to study their spectra in comparison to each other using the advantage of light concentration by an adaptive optics system with their primary as guide star. Therefore we have begun the construction of an empirical log g sequence from beginning to observe all these substellar companions homogeneously using the AO-assisted integral field spectrograph SINFONI at VLT (ESO).
Negative pressure wound therapy-assisted dermatotraction for the closure of large open wounds in a patient with non-clostridial gas gangrene.

PubMed

Ishida, Kenichiro; Noborio, Mitsuhiro; Nishimura, Tetsuro; Ieki, Yohei; Shimahara, Yumiko; Sogabe, Taku; Ehara, Naoki; Saoyama, Yuki; Sadamitsu, Daikai

2016-04-01

A 53-year-old woman developed septic shock associated with non-clostridial gas gangrene. She presented to the emergency department with two large open wounds on both thighs and in her sacral region. Non-enhanced computed tomography showed air density in contact with the right iliopsoas, which extended to the posterior compartment of the thigh. We made repeated efforts at surgical debridement of the wound with resection of necrotic tissues. Using negative pressure wound therapy-assisted dermatotraction, the pus pockets and the wound dehiscence decreased in size. Using this method we were successful in achieving delayed closure without skin grafts. Negative pressure wound therapy can be an effective treatment for large and infected open contoured wounds. Negative pressure wound therapy-assisted dermatotraction might be beneficial for poorly healing, large, open wounds in patients in poor condition and with insufficient reserve to tolerate reconstructive surgery.
Phenotypic Variability of Osteogenesis Imperfecta Type V Caused by an IFITM5 Mutation

PubMed Central

Shapiro, Jay R; Lietman, Caressa; Grover, Monica; Lu, James T; Nagamani, Sandesh CS; Dawson, Brian C; Baldridge, Dustin M; Bainbridge, Matthew N; Cohn, Dan H; Blazo, Maria; Roberts, Timothy T; Brennen, Feng-Shu; Wu, Yimei; Gibbs, Richard A; Melvin, Pamela; Campeau, Philippe M; Lee, Brendan H

2013-01-01

In a large cohort of osteogenesis imperfecta type V (OI type V) patients (17 individuals from 12 families), we identified the same mutation in the 5′ untranslated region (5′UTR) of the interferon-induced transmembrane protein 5 (IFITM5) gene by whole exome and Sanger sequencing (IFITM5 c.–14C > T) and provide a detailed description of their phenotype. This mutation leads to the creation of a novel start codon adding five residues to IFITM5 and was recently reported in several other OI type V families. The variability of the phenotype was quite large even within families. Whereas some patients presented with the typical calcification of the forearm interosseous membrane, radial head dislocation and hyperplastic callus (HPC) formation following fractures, others had only some of the typical OI type V findings. Thirteen had calcification of interosseous membranes, 14 had radial head dislocations, 10 had HPC, 9 had long bone bowing, 11 could ambulate without assistance, and 1 had mild unilateral mixed hearing loss. The bone mineral density varied greatly, even within families. Our study thus highlights the phenotypic variability of OI type V caused by the IFITM5 mutation. PMID:23408678
Ensembl 2004.

PubMed

Birney, E; Andrews, D; Bevan, P; Caccamo, M; Cameron, G; Chen, Y; Clarke, L; Coates, G; Cox, T; Cuff, J; Curwen, V; Cutts, T; Down, T; Durbin, R; Eyras, E; Fernandez-Suarez, X M; Gane, P; Gibbins, B; Gilbert, J; Hammond, M; Hotz, H; Iyer, V; Kahari, A; Jekosch, K; Kasprzyk, A; Keefe, D; Keenan, S; Lehvaslaiho, H; McVicker, G; Melsopp, C; Meidl, P; Mongin, E; Pettett, R; Potter, S; Proctor, G; Rae, M; Searle, S; Slater, G; Smedley, D; Smith, J; Spooner, W; Stabenau, A; Stalker, J; Storey, R; Ureta-Vidal, A; Woodwark, C; Clamp, M; Hubbard, T

2004-01-01

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organize biology around the sequences of large genomes. It is a comprehensive and integrated source of annotation of large genome sequences, available via interactive website, web services or flat files. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. The facilities of the system range from sequence analysis to data storage and visualization and installations exist around the world both in companies and at academic sites. With a total of nine genome sequences available from Ensembl and more genomes to follow, recent developments have focused mainly on closer integration between genomes and external data.
Sma3s: a three-step modular annotator for large sequence datasets.

PubMed

Muñoz-Mérida, Antonio; Viguera, Enrique; Claros, M Gonzalo; Trelles, Oswaldo; Pérez-Pulido, Antonio J

2014-08-01

Automatic sequence annotation is an essential component of modern 'omics' studies, which aim to extract information from large collections of sequence data. Most existing tools use sequence homology to establish evolutionary relationships and assign putative functions to sequences. However, it can be difficult to define a similarity threshold that achieves sufficient coverage without sacrificing annotation quality. Defining the correct configuration is critical and can be challenging for non-specialist users. Thus, the development of robust automatic annotation techniques that generate high-quality annotations without needing expert knowledge would be very valuable for the research community. We present Sma3s, a tool for automatically annotating very large collections of biological sequences from any kind of gene library or genome. Sma3s is composed of three modules that progressively annotate query sequences using either: (i) very similar homologues, (ii) orthologous sequences or (iii) terms enriched in groups of homologous sequences. We trained the system using several random sets of known sequences, demonstrating average sensitivity and specificity values of ~85%. In conclusion, Sma3s is a versatile tool for high-throughput annotation of a wide variety of sequence datasets that outperforms the accuracy of other well-established annotation algorithms, and it can enrich existing database annotations and uncover previously hidden features. Importantly, Sma3s has already been used in the functional annotation of two published transcriptomes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Exploration of Genetic and Genomic Resources for Abiotic and Biotic Stress Tolerance in Pearl Millet

PubMed Central

Shivhare, Radha; Lata, Charu

2017-01-01

Pearl millet is one of the most important small-grained C4 Panicoid crops with a large genome size (∼2352 Mb), short life cycle and outbreeding nature. It is highly resilient to areas with scanty rain and high temperature. Pearl millet is a nutritionally superior staple crop for people inhabiting hot, drought-prone arid and semi-arid regions of South Asia and Africa where it is widely grown and used for food, hay, silage, bird feed, building material, and fuel. Having excellent nutrient composition and exceptional buffering capacity against variable climatic conditions and pathogen attack makes pearl millet a wonderful model crop for stress tolerance studies. Pearl millet germplasm show a large range of genotypic and phenotypic variations including tolerance to abiotic and biotic stresses. Conventional breeding for enhancing abiotic and biotic stress resistance in pearl millet have met with considerable success, however, in last few years various novel approaches including functional genomics and molecular breeding have been attempted in this crop for augmenting yield under adverse environmental conditions, and there is still a lot of scope for further improvement using genomic tools. Discovery and use of various DNA-based markers such as EST-SSRs, DArT, CISP, and SSCP-SNP in pearl millet not only help in determining population structure and genetic diversity but also prove to be important for developing strategies for crop improvement at a faster rate and greater precision. Molecular marker-based genetic linkage maps and identification of genomic regions determining yield under abiotic stresses particularly terminal drought have paved way for marker-assisted selection and breeding of pearl millet cultivars. Reference collections and marker-assisted backcrossing have also been used to improve biotic stress resistance in pearl millet specifically to downy mildew. Whole genome sequencing of pearl millet genome will give new insights for processing of functional genes and assist in crop improvement programs through molecular breeding approaches. This review thus summarizes the exploration of pearl millet genetic and genomic resources for improving abiotic and biotic stress resistance and development of cultivars superior in stress tolerance. PMID:28167949
Biochemical and Molecular Characterization of a Serine Keratinase from Brevibacillus brevis US575 with Promising Keratin-Biodegradation and Hide-Dehairing Activities

PubMed Central

Jaouadi, Nadia Zaraî; Rekik, Hatem; Badis, Abdelmalek; Trabelsi, Sahar; Belhoul, Mouna; Yahiaoui, Amina Benkiar; Aicha, Houda Ben; Toumi, Abdessatar; Bejar, Samir; Jaouadi, Bassem

2013-01-01

Dehairing is one of the highly polluting operations in the leather industry. The conventional lime-sulfide process used for dehairing produces large amounts of sulfide, which poses serious toxicity and disposal problems. This operation also involves hair destruction, a process that leads to increased chemical oxygen demand (COD), biological oxygen demand (BOD), and total suspended solid (TSS) loads in the effluent. With these concerns in mind, enzyme-assisted dehairing has often been proposed as an alternative method. The main enzyme preparations so far used involved keratinases. The present paper reports on the purification of an extracellular keratinase (KERUS) newly isolated from Brevibacillus brevis strain US575. Matrix assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF/MS) analysis revealed that the purified enzyme was a monomer with a molecular mass of 29121.11 Da. The sequence of the 27 N-terminal residues of KERUS showed high homology with those of Bacillus keratinases. Optimal activity was achieved at pH 8 and 40°C. Its thermoactivity and thermostability were upgraded in the presence of 5 mM Ca2+. The enzyme was completely inhibited by phenylmethanesulfonyl fluoride (PMSF) and diiodopropyl fluorophosphates (DFP), which suggests that it belongs to the serine protease family. KERUS displayed higher levels of hydrolysis, substrate specificity, and catalytic efficiency than NUE 12 MG and KOROPON® MK EG keratinases. The enzyme also exhibited powerful keratinolytic activity that made it able to accomplish the entire feather-biodegradation process on its own. The kerUS gene encoding KERUS was cloned, sequenced, and expressed in Escherichia coli. The biochemical properties of the extracellular purified recombinant enzyme (rKERUS) were similar to those of native KERUS. Overall, the findings provide strong support for the potential candidacy of this enzyme as an effective and eco-friendly alternative to the conventional chemicals used for the dehairing of rabbit, goat, sheep and bovine hides in the leather processing industry. PMID:24146914
MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics.

PubMed

Helaers, Raphaël; Milinkovitch, Michel C

2010-07-15

The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.
MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics

PubMed Central

2010-01-01

Background The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Results Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. Conclusions The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org. PMID:20633263
Compressing DNA sequence databases with coil.

PubMed

White, W Timothy J; Hendy, Michael D

2008-05-20

Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression - an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression - the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.
Compressing DNA sequence databases with coil

PubMed Central

White, W Timothy J; Hendy, Michael D

2008-01-01

Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work. PMID:18489794
Development of a EST dataset and characterization of EST-SSRs in a traditional Chinese medicinal plant, Epimedium sagittatum (Sieb. Et Zucc.) Maxim

PubMed Central

2010-01-01

Background Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploition, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium is less-studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum. Results cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising of 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranuleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9). Conclusion A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. A total of 2,810 EST-SSRs is identified from EST dataset and ~1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species. PMID:20141623
A symmetry model for genetic coding via a wallpaper group composed of the traditional four bases and an imaginary base E: towards category theory-like systematization of molecular/genetic biology.

PubMed

Sawamura, Jitsuki; Morishita, Shigeru; Ishigooka, Jun

2014-05-07

Previously, we suggested prototypal models that describe some clinical states based on group postulates. Here, we demonstrate a group/category theory-like model for molecular/genetic biology as an alternative application of our previous model. Specifically, we focus on deoxyribonucleic acid (DNA) base sequences. We construct a wallpaper pattern based on a five-letter cruciform motif with letters C, A, T, G, and E. Whereas the first four letters represent the standard DNA bases, the fifth is introduced for ease in formulating group operations that reproduce insertions and deletions of DNA base sequences. A basic group Z5 = {r, u, d, l, n} of operations is defined for the wallpaper pattern, with which a sequence of points can be generated corresponding to changes of a base in a DNA sequence by following the orbit of a point of the pattern under operations in group Z5. Other manipulations of DNA sequence can be treated using a vector-like notation 'Dj' corresponding to a DNA sequence but based on the five-letter base set; also, 'Dj's are expressed graphically. Insertions and deletions of a series of letters 'E' are admitted to assist in describing DNA recombination. Likewise, a vector-like notation Rj can be constructed for sequences of ribonucleic acid (RNA). The wallpaper group B = {Z5×∞, ●} (an ∞-fold Cartesian product of Z5) acts on Dj (or Rj) yielding changes to Dj (or Rj) denoted by 'Dj◦B(j→k) = Dk' (or 'Rj◦B(j→k) = Rk'). Based on the operations of this group, two types of groups-a modulo 5 linear group and a rotational group over the Gaussian plane, acting on the five bases-are linked as parts of the wallpaper group for broader applications. As a result, changes, insertions/deletions and DNA (RNA) recombination (partial/total conversion) are described. As an exploratory study, a notation for the canonical "central dogma" via a category theory-like way is presented for future developments. Despite the large incompleteness of our methodology, there is fertile ground to consider a symmetry model for genetic coding based on our specific wallpaper group. A more integrated formulation containing "central dogma" for future molecular/genetic biology remains to be explored.
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag.

PubMed

Zhang, Mengjie; Martin, Jody L; Kumar, Mohit; Khairallah, Ramzi J; de Tombe, Pieter P

2015-11-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1' cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. Copyright © 2015 the American Physiological Society.
Prototheca miyajii sp. nov., isolated from a patient with systemic protothecosis.

PubMed

Masuda, Michiaki; Hirose, Noriyuki; Ishikawa, Tomohiro; Ikawa, Yoshiya; Nishimura, Kazuko

2016-03-01

Species of the genus Prototheca are achlorophyllous algae and ubiquitous in nature, and so far, six species have been listed in this genus: Prototheca wickerhamii, Prototheca zopfii, Prototheca blaschkeae, Prototheca cutis, Prototheca stagnora and Prototheca ulmea. A strain of the genus Prototheca, IFM 53848T, was isolated in Japan from a patient with systemic protothecosis and had been designated P. wickerhamii. Our previous study, by using PCR analysis, revealed that its SSU rRNA gene (rDNA) was distinctively larger than that of P. wickerhamii and other species of the genus Prototheca. In this study, molecular analysis showed that the exceptionally large SSU rDNA of IFM 53848T contains four group I introns. The morphology of IFM 53848T was indistinguishable from those of P. wickerhamii or P. cutis, and phylogenetic analyses, based on the sequences of the SSU rDNA exons and the D1/D2 region of the large subunit rDNA, indicated that IFM 53848T was closely related to P. cutis. On the other hand, unlike P. cutis, IFM 53848T failed to assimilate fructose or lysine and grew well at higher temperatures of up to 42 °C. In addition, the nucleotide sequence of the ribosomal internal transcribed spacer and the matrix assisted laser desorption ionization time-of-flight mass spectrometry profile of IFM 53848T were clearly distinct from those of P. cutis. The results strongly suggest that IFM 53848T represents a novel species, and so the seventh member of the genus Prototheca, which we have named Prototheca miyajii sp. nov. The unique characteristics of the strain may provide useful insights into the systematic taxonomy of the genus Prototheca.
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag

PubMed Central

Zhang, Mengjie; Martin, Jody L.; Kumar, Mohit; de Tombe, Pieter P.

2015-01-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1′ cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. PMID:26386113
Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

PubMed Central

Spielmann, A; Stutz, E

1983-01-01

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2. PMID:6314279
Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

PubMed

Wang, Penghao; Wilson, Susan R

2013-01-01

Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
Rotation history effects on soybean plants and rhizosphere microbiome

USDA-ARS?s Scientific Manuscript database

Benefits of diversified cropping systems stem from the interactions between soil characteristics, crop growth patterns and physiology, and other organisms. In order to assist in the understanding and implementation of diversified rotation sequences, a long-term experiment was established to evaluate...
THE APPLICATION OF MASS SPECTROMETRY TO THE STUDY OF MICROORGANISMS

EPA Science Inventory

The purpose of this research project is to use state-of-the-art mass spectrometric techniques, such as electrospray ionization (ESI) and matrix assisted laser desorption ionization (MALDI) mass spectrometry (MS), to provide "protein mass fingerprinting" and protein sequencing i...
An Evaluation of Micro PLATO Fortran 77 Instruction.

ERIC Educational Resources Information Center

Funk, Kenneth; And Others

1986-01-01

Evaluated the use of computer assisted instruction in teaching Fortran 77 in the College of Engineering at Oregon State University. Also investigated the effect of such factors as mathematics and computer programming background on student performance in an introductory programming course sequence. (JN)
Marker-assisted backcross approach for important agronomic traits of sorghum

USDA-ARS?s Scientific Manuscript database

Sequencing technologies are useful for identification of thousands of single nucleotide polymorphisms (SNPs) in a cost effective manner. QTL mapping, association mapping and Mutmap approaches provide opportunities for use of such SNPs to associate and identify genes that control important agronomic ...
Application of population sequencing (POPSEQ) for ordering and inputting genotyping-by-sequencing markers in hexaploid wheat

USDA-ARS?s Scientific Manuscript database

The advancement of next-generation sequencing technologies in conjunction with new bioinformatics tools enabled fine-tuning of sequence-based high resolution mapping strategies for complex genomes. Although genotyping-by-sequencing (GBS) provides a large number of markers, its application for assoc...
A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA

USDA-ARS?s Scientific Manuscript database

A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...
De novo sequencing and characterization of floral transcriptome in two species of buckwheat (Fagopyrum)

PubMed Central

2011-01-01

Background Transcriptome sequencing data has become an integral component of modern genetics, genomics and evolutionary biology. However, despite advances in the technologies of DNA sequencing, such data are lacking for many groups of living organisms, in particular, many plant taxa. We present here the results of transcriptome sequencing for two closely related plant species. These species, Fagopyrum esculentum and F. tataricum, belong to the order Caryophyllales - a large group of flowering plants with uncertain evolutionary relationships. F. esculentum (common buckwheat) is also an important food crop. Despite these practical and evolutionary considerations Fagopyrum species have not been the subject of large-scale sequencing projects. Results Normalized cDNA corresponding to genes expressed in flowers and inflorescences of F. esculentum and F. tataricum was sequenced using the 454 pyrosequencing technology. This resulted in 267 (for F. esculentum) and 229 (F. tataricum) thousands of reads with average length of 341-349 nucleotides. De novo assembly of the reads produced about 25 thousands of contigs for each species, with 7.5-8.2× coverage. Comparative analysis of two transcriptomes demonstrated their overall similarity but also revealed genes that are presumably differentially expressed. Among them are retrotransposon genes and genes involved in sugar biosynthesis and metabolism. Thirteen single-copy genes were used for phylogenetic analysis; the resulting trees are largely consistent with those inferred from multigenic plastid datasets. The sister relationships of the Caryophyllales and asterids now gained high support from nuclear gene sequences. Conclusions 454 transcriptome sequencing and de novo assembly was performed for two congeneric flowering plant species, F. esculentum and F. tataricum. As a result, a large set of cDNA sequences that represent orthologs of known plant genes as well as potential new genes was generated. PMID:21232141

Prediction of protein tertiary structure from sequences using a very large back-propagation neural network

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liu, X.; Wilcox, G.L.

1993-12-31

We have implemented large scale back-propagation neural networks on a 544 node Connection Machine, CM-5, using the C language in MIMD mode. The program running on 512 processors performs backpropagation learning at 0.53 Gflops, which provides 76 million connection updates per second. We have applied the network to the prediction of protein tertiary structure from sequence information alone. A neural network with one hidden layer and 40 million connections is trained to learn the relationship between sequence and tertiary structure. The trained network yields predicted structures of some proteins on which it has not been trained given only their sequences.more » Presentation of the Fourier transform of the sequences accentuates periodicity in the sequence and yields good generalization with greatly increased training efficiency. Training simulations with a large, heterologous set of protein structures (111 proteins from CM-5 time) to solutions with under 2% RMS residual error within the training set (random responses give an RMS error of about 20%). Presentation of 15 sequences of related proteins in a testing set of 24 proteins yields predicted structures with less than 8% RMS residual error, indicating good apparent generalization.« less
Translational genomics for plant breeding with the genome sequence explosion.

PubMed

Kang, Yang Jae; Lee, Taeyoung; Lee, Jayern; Shim, Sangrea; Jeong, Haneul; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

2016-04-01

The use of next-generation sequencers and advanced genotyping technologies has propelled the field of plant genomics in model crops and plants and enhanced the discovery of hidden bridges between genotypes and phenotypes. The newly generated reference sequences of unstudied minor plants can be annotated by the knowledge of model plants via translational genomics approaches. Here, we reviewed the strategies of translational genomics and suggested perspectives on the current databases of genomic resources and the database structures of translated information on the new genome. As a draft picture of phenotypic annotation, translational genomics on newly sequenced plants will provide valuable assistance for breeders and researchers who are interested in genetic studies. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
An "off-on" electrochemiluminescent biosensor based on DNAzyme-assisted target recycling and rolling circle amplifications for ultrasensitive detection of microRNA.

PubMed

Zhang, Pu; Wu, Xiaoyan; Yuan, Ruo; Chai, Yaqin

2015-03-17

In this study, an off-on switching of a dual amplified electrochemiluminescence (ECL) biosensor based on Pb(2+)-induced DNAzyme-assisted target recycling and rolling circle amplification (RCA) was constructed for microRNA (miRNA) detection. First, the primer probe with assistant probe and miRNA formed Y junction which was cleaved with the addition of Pb(2+) to release miRNA. Subsequently, the released miRNA could initiate the next recycling process, leading to the generation of numerous intermediate DNA sequences (S2). Afterward, bare glassy carbon electrode (GCE) was immersed into HAuCl4 solution to electrodeposit a Au nanoparticle layer (depAu), followed by the assembly of a hairpin probe (HP). Then, dopamine (DA)-modified DNA sequence (S1) was employed to hybridize with HP, which switching off the sensing system. This is the first work that employs DA to quench luminol ECL signal, possessing the biosensor ultralow background signal. Afterward, S2 produced by the target recycling process was loaded onto the prepared electrode to displace S1 and served as an initiator for RCA. With rational design, numerous repeated DNA sequences coupling with hemin to form hemin/G-quadruplex were generated, which could exhibit strongly catalytic toward H2O2, thus amplified the ECL signal and switched the ON state of the sensing system. The liner range for miRNA detection was from 1.0 fM to 100 pM with a low detection limit down to 0.3 fM. Moreover, with the high sensitivity and specificity induced by the dual signal amplification, the proposed miRNA biosensor holds great potential for analysis of other interesting tumor markers.
Improving transmission efficiency of large sequence alignment/map (SAM) files.

PubMed

Sakib, Muhammad Nazmus; Tang, Jijun; Zheng, W Jim; Huang, Chin-Tser

2011-01-01

Research in bioinformatics primarily involves collection and analysis of a large volume of genomic data. Naturally, it demands efficient storage and transfer of this huge amount of data. In recent years, some research has been done to find efficient compression algorithms to reduce the size of various sequencing data. One way to improve the transmission time of large files is to apply a maximum lossless compression on them. In this paper, we present SAMZIP, a specialized encoding scheme, for sequence alignment data in SAM (Sequence Alignment/Map) format, which improves the compression ratio of existing compression tools available. In order to achieve this, we exploit the prior knowledge of the file format and specifications. Our experimental results show that our encoding scheme improves compression ratio, thereby reducing overall transmission time significantly.
RNA sequencing: current and prospective uses in metabolic research.

PubMed

Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay

2014-10-01

Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with an exon level resolution from any tissue in any species without any a priori knowledge of which genes that are being expressed, splice patterns or their nucleotide sequence. In addition, RNA sequencing is a more sensitive technique compared with microarrays with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regards to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large data amounts produced, which together with the complexity of the data can make a researcher spend far more time on analysis than performing the actual experiment. © 2014 Society for Endocrinology.
Using SQL Databases for Sequence Similarity Searching and Analysis.

PubMed

Pearson, William R; Mackey, Aaron J

2017-09-13

Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.
Laser assisted microdissection, an efficient technique to understand tissue specific gene expression patterns and functional genomics in plants.

PubMed

Gautam, Vibhav; Sarkar, Ananda K

2015-04-01

Laser assisted microdissection (LAM) is an advanced technology used to perform tissue or cell-specific expression profiling of genes and proteins, owing to its ability to isolate the desired tissue or cell type from a heterogeneous population. Due to the specificity and high efficiency acquired during its pioneering use in medical science, the LAM technique has quickly been adopted for use in many biological researches. Today, it has become a potent tool to address a wide range of questions in diverse field of plant biology. Beginning with comparative transcriptome analysis of different tissues such as reproductive parts, meristems, lateral organs, roots etc., LAM has also been extensively used in plant-pathogen interaction studies, proteomics, and metabolomics. In combination with next generation sequencing and proteomics analysis, LAM has opened up promising opportunities in the area of large scale functional studies in plants. Ever since the advent of this technique, significant improvements have been achieved in term of its instrumentation and method, which has made LAM a more efficient tool applicable in wider research areas. Here, we discuss the advancement of LAM technique with special emphasis on its methodology and highlight its scope in modern research areas of plant biology. Although we put emphasis on use of LAM in transcriptome studies, which is mostly used, we also discuss its recent application and scope in proteome and metabolome studies.
Genomics-based precision breeding approaches to improve drought tolerance in rice.

PubMed

Swamy, B P Mallikarjuna; Kumar, Arvind

2013-12-01

Rice (Oryza sativa L.), the major staple food crop of the world, faces a severe threat from widespread drought. The development of drought-tolerant rice varieties is considered a feasible option to counteract drought stress. The screening of rice germplasm under drought and its characterization at the morphological, genetic, and molecular levels revealed the existence of genetic variation for drought tolerance within the rice gene pool. The improvements made in managed drought screening and selection for grain yield under drought have significantly contributed to progress in drought breeding programs. The availability of rice genome sequence information, genome-wide molecular markers, and low-cost genotyping platforms now makes it possible to routinely apply marker-assisted breeding approaches to improve grain yield under drought. Grain yield QTLs with a large and consistent effect under drought have been indentified and successfully pyramided in popular rice mega-varieties. Various rice functional genomics resources, databases, tools, and recent advances in "-omics" are facilitating the characterization of genes and pathways involved in drought tolerance, providing the basis for candidate gene identification and allele mining. The transgenic approach is successful in generating drought tolerance in rice under controlled conditions, but field-level testing is necessary. Genomics-assisted drought breeding approaches hold great promise, but a well-planned integration with standardized phenotyping is highly essential to exploit their full potential. Copyright © 2013 Elsevier Inc. All rights reserved.
Effect of polymorphisms in the CSN3 (κ-casein) gene on milk production traits in Chinese Holstein Cattle.

PubMed

Alim, M A; Dong, T; Xie, Y; Wu, X P; Zhang, Yi; Zhang, Shengli; Sun, D X

2014-11-01

This study was designed to evaluate significant associations between single nucleotide polymorphisms (SNPs) and milk composition and milk production traits in Chinese Holstein cows. Six SNPs were identified in the κ-casein gene using pooled DNA sequencing. The identified SNPs were genotyped by Matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) methods from 507 individuals. Out of six, we identified three non-synonymous SNPs (g.10888T>C, g.10924C>A and g.10944A>G) that changed in the protein product. SIFT (Sorting_Intolerant_From_Tolerant) prediction score (0.01) demonstrated that protein changed Isoleucine > Threonine (g.10888T>C) will affect the phenotypes. Significant associations between identified SNPs and three yield traits (milk, protein and fat) and two composition traits (fat and protein percentages) were found whereas it did not reach significance for fat percentage in haplotypes association. Importantly, the significant SNPs in our results showed a large proportion of the phenotypic variation of milk protein yield and concentration. Our results suggest that CSN3 is an important candidate gene that influences milk production traits, and identified polymorphisms and haplotypes could be used as a genetic marker in programs of marker-assisted selection for the genetic improvement of milk production traits in dairy cattle.
A cascade signal amplification strategy for surface enhanced Raman spectroscopy detection of thrombin based on DNAzyme assistant DNA recycling and rolling circle amplification.

PubMed

Gao, Fenglei; Du, Lili; Tang, Daoquan; Lu, Yao; Zhang, Yanzhuo; Zhang, Lixian

2015-04-15

A sensitive protocol for surface enhanced Raman spectroscopy (SERS) detection of thrombin is designed with R6G-Ag NPs as a signal tag by combining DNAzyme assistant DNA recycling and rolling circle amplification (RCA). Molecular beacon (MB) as recognition probe immobilizes on the glass slides and performs the amplification procedure. After thrombin-induced structure-switching DNA hairpins of probe 1, the DNAzyme is liberated from the caged structure, which hybridizes with the MB for cleavage of the MB in the presence of cofactor Zn(2+) and initiates the DNA recycling process, leading to the cleavage of a large number of MB and the generation of numerous primers for triggering RCA reaction. The long amplified RCA product which contained hundreds of tandem-repeat sequences, which can bind with oligonucleotide functionalized Ag NPs reporters. The attached signal tags can be easily read out by SERS. Because of the cascade signal amplification, these newly designed protocols provides a sensitive SERS detection of thrombin down to the femolar level (2.3fM) with a linear range of 5 orders of magnitude (from 10(-14) to 10(-9)M) and have high selectivity toward its target protein. The proposed method is expected to be a good clinical tool for the diagnosis of a thrombotic disease. Copyright © 2014 Elsevier B.V. All rights reserved.
Estimation of pairwise sequence similarity of mammalian enhancers with word neighbourhood counts.

PubMed

Göke, Jonathan; Schulz, Marcel H; Lasserre, Julia; Vingron, Martin

2012-03-01

The identity of cells and tissues is to a large degree governed by transcriptional regulation. A major part is accomplished by the combinatorial binding of transcription factors at regulatory sequences, such as enhancers. Even though binding of transcription factors is sequence-specific, estimating the sequence similarity of two functionally similar enhancers is very difficult. However, a similarity measure for regulatory sequences is crucial to detect and understand functional similarities between two enhancers and will facilitate large-scale analyses like clustering, prediction and classification of genome-wide datasets. We present the standardized alignment-free sequence similarity measure N2, a flexible framework that is defined for word neighbourhoods. We explore the usefulness of adding reverse complement words as well as words including mismatches into the neighbourhood. On simulated enhancer sequences as well as functional enhancers in mouse development, N2 is shown to outperform previous alignment-free measures. N2 is flexible, faster than competing methods and less susceptible to single sequence noise and the occurrence of repetitive sequences. Experiments on the mouse enhancers reveal that enhancers active in different tissues can be separated by pairwise comparison using N2. N2 represents an improvement over previous alignment-free similarity measures without compromising speed, which makes it a good candidate for large-scale sequence comparison of regulatory sequences. The software is part of the open-source C++ library SeqAn (www.seqan.de) and a compiled version can be downloaded at http://www.seqan.de/projects/alf.html. Supplementary data are available at Bioinformatics online.
Candida guilliermondii and Other Species of Candida Misidentified as Candida famata: Assessment by Vitek 2, DNA Sequencing Analysis, and Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry in Two Global Antifungal Surveillance Programs

PubMed Central

Woosley, Leah N.; Diekema, Daniel J.; Jones, Ronald N.; Pfaller, Michael A.

2013-01-01

Candida famata (teleomorph Debaryomyces hansenii) has been described as a medically relevant yeast, and this species has been included in many commercial identification systems that are currently used in clinical laboratories. Among 53 strains collected during the SENTRY and ARTEMIS surveillance programs and previously identified as C. famata (includes all submitted strains with this identification) by a variety of commercial methods (Vitek, MicroScan, API, and AuxaColor), DNA sequencing methods demonstrated that 19 strains were C. guilliermondii, 14 were C. parapsilosis, 5 were C. lusitaniae, 4 were C. albicans, and 3 were C. tropicalis, and five isolates belonged to other Candida species (two C. fermentati and one each C. intermedia, C. pelliculosa, and Pichia fabianni). Additionally, three misidentified C. famata strains were correctly identified as Kodomaea ohmeri, Debaryomyces nepalensis, and Debaryomyces fabryi using intergenic transcribed spacer (ITS) and/or intergenic spacer (IGS) sequencing. The Vitek 2 system identified three isolates with high confidence to be C. famata and another 15 with low confidence between C. famata and C. guilliermondii or C. parapsilosis, displaying only 56.6% agreement with DNA sequencing results. Matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) results displayed 81.1% agreement with DNA sequencing. One strain each of C. metapsilosis, C. fermentati, and C. intermedia demonstrated a low score for identification (<2.0) in the MALDI Biotyper. K. ohmeri, D. nepalensis, and D. fabryi identified by DNA sequencing in this study were not in the current database for the MALDI Biotyper. These results suggest that the occurrence of C. famata in fungal infections is much lower than previously appreciated and that commercial systems do not produce accurate identifications except for the newly introduced MALDI-TOF instruments. PMID:23100350
A family of LRR sequences in the vicinity of the Co-2 locus for anthracnose resistance in Phaseolus vulgaris and its potential use in marker-assisted selection.

PubMed

Geffroy, V; Creusot, F; Falquet, J; Sévignac, M; Adam-Blondon, A F; Bannerot, H; Gepts, P; Dron, M

1998-03-01

Molecular markers offer new opportunities for breeding for disease resistance. Resistance gene pyramiding in a single cultivar, as a strategy for durable resistance, can be facilitated by marker-assisted selection (MAS). A RAPD marker, ROH20(450), linked to the Mesoamerican Co-2 anthracnose resistance gene, was previously transformed into a SCAR marker, SCH20. In the present paper we have further characterized the relevance of the SCH20 SCAR marker in different genetic backgrounds. Since this SCAR marker was found to be useful mainly in the Andean gene pool, we identified a new PCR-based marker (SCAreoli) for indirect scoring of the presence of the Co-2 gene. The SCAreoli SCAR marker is polymorphic in the Mesoamerican as well as in the Andean gene pool and should be useful in MAS. We also report that PvH20, the cloned sequence corresponding to the 450-bp RAPD marker ROH20(450), contains six imperfect leucine-rich repeats, and reveals a family of related sequences in the vicinity of the Co-2 locus. These results are discussed in the context of the recent cloning of some plant resistance genes.
Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

PubMed

Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

2016-06-04

Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
A taxonomy for mechanical ventilation: 10 fundamental maxims.

PubMed

Chatburn, Robert L; El-Khatib, Mohamad; Mireles-Cabodevila, Eduardo

2014-11-01

The American Association for Respiratory Care has declared a benchmark for competency in mechanical ventilation that includes the ability to "apply to practice all ventilation modes currently available on all invasive and noninvasive mechanical ventilators." This level of competency presupposes the ability to identify, classify, compare, and contrast all modes of ventilation. Unfortunately, current educational paradigms do not supply the tools to achieve such goals. To fill this gap, we expand and refine a previously described taxonomy for classifying modes of ventilation and explain how it can be understood in terms of 10 fundamental constructs of ventilator technology: (1) defining a breath, (2) defining an assisted breath, (3) specifying the means of assisting breaths based on control variables specified by the equation of motion, (4) classifying breaths in terms of how inspiration is started and stopped, (5) identifying ventilator-initiated versus patient-initiated start and stop events, (6) defining spontaneous and mandatory breaths, (7) defining breath sequences (8), combining control variables and breath sequences into ventilatory patterns, (9) describing targeting schemes, and (10) constructing a formal taxonomy for modes of ventilation composed of control variable, breath sequence, and targeting schemes. Having established the theoretical basis of the taxonomy, we demonstrate a step-by-step procedure to classify any mode on any mechanical ventilator. Copyright © 2014 by Daedalus Enterprises.
Core-SINE blocks comprise a large fraction of monotreme genomes; implications for vertebrate chromosome evolution.

PubMed

Kirby, Patrick J; Greaves, Ian K; Koina, Edda; Waters, Paul D; Marshall Graves, Jennifer A

2007-01-01

The genomes of the egg-laying platypus and echidna are of particular interest because monotremes are the most basal mammal group. The chromosomal distribution of an ancient family of short interspersed repeats (SINEs), the core-SINEs, was investigated to better understand monotreme genome organization and evolution. Previous studies have identified the core-SINE as the predominant SINE in the platypus genome, and in this study we quantified, characterized and localized subfamilies. Dot blot analysis suggested that a very large fraction (32% of the platypus and 16% of the echidna genome) is composed of Mon core-SINEs. Core-SINE-specific primers were used to amplify PCR products from platypus and echidna genomic DNA. Sequence analysis suggests a common consensus sequence Mon 1-B, shared by platypus and echidna, as well as platypus-specific Mon 1-C and echidna specific Mon 1-D consensus sequences. FISH mapping of the Mon core-SINE products to platypus metaphase spreads demonstrates that the Mon-1C subfamily is responsible for the striking Mon core-SINE accumulation in the distal regions of the six large autosomal pairs and the largest X chromosome. This unusual distribution highlights the dichotomy between the seven large chromosome pairs and the 19 smaller pairs in the monotreme karyotype, which has some similarity to the macro- and micro-chromosomes of birds and reptiles, and suggests that accumulation of repetitive sequences may have enlarged small chromosomes in an ancestral vertebrate. In the forthcoming sequence of the platypus genome there are still large gaps, and the extensive Mon core-SINE accumulation on the distal regions of the six large autosomal pairs may provide one explanation for this missing sequence.
Analysis of 16S libraries of mouse gastrointestinal microflora reveals a large new group of mouse intestinal bacteria.

PubMed

Salzman, Nita H; de Jong, Hendrik; Paterson, Yvonne; Harmsen, Hermie J M; Welling, Gjalt W; Bos, Nicolaas A

2002-11-01

Total genomic DNA from samples of intact mouse small intestine, large intestine, caecum and faeces was used as template for PCR amplification of 16S rRNA gene sequences with conserved bacterial primers. Phylogenetic analysis of the amplification products revealed 40 unique 16S rDNA sequences. Of these sequences, 25% (10/40) corresponded to described intestinal organisms of the mouse, including Lactobacillus spp., Helicobacter spp., segmented filamentous bacteria and members of the altered Schaedler flora (ASF360, ASF361, ASF502 and ASF519); 75% (30/40) represented novel sequences. A large number (11/40) of the novel sequences revealed a new operational taxonomic unit (OTU) belonging to the Cytophaga-Flavobacter-Bacteroides phylum, which the authors named 'mouse intestinal bacteria'. 16S rRNA probes were developed for this new OTU. Upon analysis of the novel sequences, eight were found to cluster within the Eubacterium rectale-Clostridium coccoides group and three clustered within the Bacteroides group. One of the novel sequences was distantly related to Verrucomicrobium spinosum and one was distantly related to Bacillus mycoides. Oligonucleotide probes specific for the 16S rRNA of these novel clones were generated. Using a combination of four previously described and four newly designed probes, approximately 80% of bacteria recovered from the murine large intestine and 71% of bacteria recovered from the murine caecum could be identified by fluorescence in situ hybridization (FISH).
Evaluation of 16S Rrna amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
Evaluation of 16S rRNA amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
A combined strategy involving Sanger and 454 pyrosequencing increases genomic resources to aid in the management of reproduction, disease control and genetic selection in the turbot (Scophthalmus maximus).

PubMed

Ribas, Laia; Pardo, Belén G; Fernández, Carlos; Alvarez-Diós, José Antonio; Gómez-Tato, Antonio; Quiroga, María Isabel; Planas, Josep V; Sitjà-Bobadilla, Ariadna; Martínez, Paulino; Piferrer, Francesc

2013-03-15

Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database ("Turbot 2 database") was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences ("Turbot 3 database"), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50-90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs.

A combined strategy involving Sanger and 454 pyrosequencing increases genomic resources to aid in the management of reproduction, disease control and genetic selection in the turbot (Scophthalmus maximus)

PubMed Central

2013-01-01

Background Genomic resources for plant and animal species that are under exploitation primarily for human consumption are increasingly important, among other things, for understanding physiological processes and for establishing adequate genetic selection programs. Current available techniques for high-throughput sequencing have been implemented in a number of species, including fish, to obtain a proper description of the transcriptome. The objective of this study was to generate a comprehensive transcriptomic database in turbot, a highly priced farmed fish species in Europe, with potential expansion to other areas of the world, for which there are unsolved production bottlenecks, to understand better reproductive- and immune-related functions. This information is essential to implement marker assisted selection programs useful for the turbot industry. Results Expressed sequence tags were generated by Sanger sequencing of cDNA libraries from different immune-related tissues after several parasitic challenges. The resulting database (“Turbot 2 database”) was enlarged with sequences generated from a 454 sequencing run of brain-hypophysis-gonadal axis-derived RNA obtained from turbot at different development stages. The assembly of Sanger and 454 sequences generated 52,427 consensus sequences (“Turbot 3 database”), of which 23,661 were successfully annotated. A total of 1,410 sequences were confirmed to be related to reproduction and key genes involved in sex differentiation and maturation were identified for the first time in turbot (AR, AMH, SRY-related genes, CYP19A, ZPGs, STAR FSHR, etc.). Similarly, 2,241 sequences were related to the immune system and several novel key immune genes were identified (BCL, TRAF, NCK, CD28 and TOLLIP, among others). The number of genes of many relevant reproduction- and immune-related pathways present in the database was 50–90% of the total gene count of each pathway. In addition, 1,237 microsatellites and 7,362 single nucleotide polymorphisms (SNPs) were also compiled. Further, 2,976 putative natural antisense transcripts (NATs) including microRNAs were also identified. Conclusions The combined sequencing strategies employed here significantly increased the turbot genomic resources available, including 34,400 novel sequences. The generated database contains a larger number of genes relevant for reproduction- and immune-associated studies, with an excellent coverage of most genes present in many relevant physiological pathways. This database also allowed the identification of many microsatellites and SNP markers that will be very useful for population and genome screening and a valuable aid in marker assisted selection programs. PMID:23497389
Appraising Reading Achievement.

ERIC Educational Resources Information Center

Ediger, Marlow

To determine quality sequence in pupil progress, evaluation approaches need to be used which guide the teacher to assist learners to attain optimally. Teachers must use a variety of procedures to appraise student achievement in reading, because no one approach is adequate. Appraisal approaches might include: (1) observation and subsequent…
Plagiarism Detection: A Comparison of Teaching Assistants and a Software Tool in Identifying Cheating in a Psychology Course

ERIC Educational Resources Information Center

Seifried, Eva; Lenhard, Wolfgang; Spinath, Birgit

2015-01-01

Essays that are assigned as homework in large classes are prone to cheating via unauthorized collaboration. In this study, we compared the ability of a software tool based on Latent Semantic Analysis (LSA) and student teaching assistants to detect plagiarism in a large group of students. To do so, we took two approaches: the first approach was…
On the way toward systems biology of Aspergillus fumigatus infection.

PubMed

Albrecht, Daniela; Kniemeyer, Olaf; Mech, Franziska; Gunzer, Matthias; Brakhage, Axel; Guthke, Reinhard

2011-06-01

Pathogenicity of Aspergillus fumigatus is multifactorial. Thus, global studies are essential for the understanding of the infection process. Therefore, a data warehouse was established where genome sequence, transcriptome and proteome data are stored. These data are analyzed for the elucidation of virulence determinants. The data analysis workflow starts with pre-processing including imputing of missing values and normalization. Last step is the identification of differentially expressed genes/proteins as interesting candidates for further analysis, in particular for functional categorization and correlation studies. Sequence data and other prior knowledge extracted from databases are integrated to support the inference of gene regulatory networks associated with pathogenicity. This knowledge-assisted data analysis aims at establishing mathematical models with predictive strength to assist further experimental work. Recently, first steps were done to extend the integrative data analysis and computational modeling by evaluating spatio-temporal data (movies) that monitor interactions of A. fumigatus morphotypes (e.g. conidia) with host immune cells. Copyright © 2011 Elsevier GmbH. All rights reserved.
A High-Resolution InDel (Insertion–Deletion) Markers-Anchored Consensus Genetic Map Identifies Major QTLs Governing Pod Number and Seed Yield in Chickpea

PubMed Central

Srivastava, Rishi; Singh, Mohar; Bajaj, Deepak; Parida, Swarup K.

2016-01-01

Development and large-scale genotyping of user-friendly informative genome/gene-derived InDel markers in natural and mapping populations is vital for accelerating genomics-assisted breeding applications of chickpea with minimal resource expenses. The present investigation employed a high-throughput whole genome next-generation resequencing strategy in low and high pod number parental accessions and homozygous individuals constituting the bulks from each of two inter-specific mapping populations [(Pusa 1103 × ILWC 46) and (Pusa 256 × ILWC 46)] to develop non-erroneous InDel markers at a genome-wide scale. Comparing these high-quality genomic sequences, 82,360 InDel markers with reference to kabuli genome and 13,891 InDel markers exhibiting differentiation between low and high pod number parental accessions and bulks of aforementioned mapping populations were developed. These informative markers were structurally and functionally annotated in diverse coding and non-coding sequence components of genome/genes of kabuli chickpea. The functional significance of regulatory and coding (frameshift and large-effect mutations) InDel markers for establishing marker-trait linkages through association/genetic mapping was apparent. The markers detected a greater amplification (97%) and intra-specific polymorphic potential (58–87%) among a diverse panel of cultivated desi, kabuli, and wild accessions even by using a simpler cost-efficient agarose gel-based assay implicating their utility in large-scale genetic analysis especially in domesticated chickpea with narrow genetic base. Two high-density inter-specific genetic linkage maps generated using aforesaid mapping populations were integrated to construct a consensus 1479 InDel markers-anchored high-resolution (inter-marker distance: 0.66 cM) genetic map for efficient molecular mapping of major QTLs governing pod number and seed yield per plant in chickpea. Utilizing these high-density genetic maps as anchors, three major genomic regions harboring each of pod number and seed yield robust QTLs (15–28% phenotypic variation explained) were identified on chromosomes 2, 4, and 6. The integration of genetic and physical maps at these QTLs mapped on chromosomes scaled-down the long major QTL intervals into high-resolution short pod number and seed yield robust QTL physical intervals (0.89–2.94 Mb) which were essentially got validated in multiple genetic backgrounds of two chickpea mapping populations. The genome-wide InDel markers including natural allelic variants and genomic loci/genes delineated at major six especially in one colocalized novel congruent robust pod number and seed yield robust QTLs mapped on a high-density consensus genetic map were found most promising in chickpea. These functionally relevant molecular tags can drive marker-assisted genetic enhancement to develop high-yielding cultivars with increased seed/pod number and yield in chickpea. PMID:27695461
Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk.

PubMed

Moumene, M; Drissi, F; Croce, O; Djebbari, B; Robert, C; Angelakis, E; Benouareth, D E; Raoult, D; Merhej, V

2016-03-01

We describe using a polyphasic approach that combines proteomic by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF) analysis, genomic data and phenotypic characterization the features of Lactococcus garvieae strain M14 newly isolated from the fermented milk (known as raib) of an Algerian cow. The 2 188 835 bp containing genome sequence displays a metabolic capacity to form acid fermentation that is very useful for industrial applications and encodes for two bacteriocins responsible for its eventual bioprotective properties.
Putting engineering back into protein engineering: bioinformatic approaches to catalyst design.

PubMed

Gustafsson, Claes; Govindarajan, Sridhar; Minshull, Jeremy

2003-08-01

Complex multivariate engineering problems are commonplace and not unique to protein engineering. Mathematical and data-mining tools developed in other fields of engineering have now been applied to analyze sequence-activity relationships of peptides and proteins and to assist in the design of proteins and peptides with specified properties. Decreasing costs of DNA sequencing in conjunction with methods to quickly synthesize statistically representative sets of proteins allow modern heuristic statistics to be applied to protein engineering. This provides an alternative approach to expensive assays or unreliable high-throughput surrogate screens.
Human renin 5'-flanking DNA to nucleotide-2750.

PubMed

Smith, D L; Jeyapalan, S; Lang, J A; Guo, X H; Sigmund, C D; Morris, B J

1995-01-01

Renin is one of the most important factors in blood pressure and electrolyte regulation in mammals and the renin locus has been implicated in hypertension. To assist studies of promoter control we therefore determined the 5'-flanking sequence of the human gene (REN) to residue -2750 relative to the transcription start site (+1). Sites of homology to consensus sequences for binding of trans-acting factors involved in transcriptional control of other genes were identified, and functionality for two of these (a CRE and Pit-1 site) have so far been demonstrated.
ReSeqTools: an integrated toolkit for large-scale next-generation sequencing based resequencing analysis.

PubMed

He, W; Zhao, S; Liu, X; Dong, S; Lv, J; Liu, D; Wang, J; Meng, Z

2013-12-04

Large-scale next-generation sequencing (NGS)-based resequencing detects sequence variations, constructs evolutionary histories, and identifies phenotype-related genotypes. However, NGS-based resequencing studies generate extraordinarily large amounts of data, making computations difficult. Effective use and analysis of these data for NGS-based resequencing studies remains a difficult task for individual researchers. Here, we introduce ReSeqTools, a full-featured toolkit for NGS (Illumina sequencing)-based resequencing analysis, which processes raw data, interprets mapping results, and identifies and annotates sequence variations. ReSeqTools provides abundant scalable functions for routine resequencing analysis in different modules to facilitate customization of the analysis pipeline. ReSeqTools is designed to use compressed data files as input or output to save storage space and facilitates faster and more computationally efficient large-scale resequencing studies in a user-friendly manner. It offers abundant practical functions and generates useful statistics during the analysis pipeline, which significantly simplifies resequencing analysis. Its integrated algorithms and abundant sub-functions provide a solid foundation for special demands in resequencing projects. Users can combine these functions to construct their own pipelines for other purposes.
Mobile Learning as Alternative to Assistive Technology Devices for Special Needs Students

ERIC Educational Resources Information Center

Ismaili, Jalal; Ibrahimi, El Houcine Ouazzani

2017-01-01

Assistive Technology (AT) revolutionized the process of learning for special needs students during the past three decades. Thanks to this technology, accessibility and educational inclusion became attainable more than any time in the history of special education. Meanwhile, assistive technology devices remain unreachable for a large number of…
Bilateral simultaneous robot-assisted pyelolithotomy for large (>6 cm) kidney stones: technique and review of literature.

PubMed

Rajiv, Yadav; Kumar, Abhay; Poonam, Yadav

2015-09-01

With wide availability and demonstrable efficacy of endourological techniques, open surgery for renal stone disease has largely been replaced in contemporary urological practice. However, with increasing experience of laparoscopy and robotic surgery in urology, the principle of open renal surgery is being revisited. In certain situations, laparoscopic or robotic pyelolithomy may be an excellent minimally invasive alternative to percutaneous nephrolithomy with its unique advantages. We present a case of bilateral large kidney stones managed with bilateral simultaneous robot-assisted laparoscopic pyelolithotomy with excellent results.
PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

PubMed Central

Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

2008-01-01

Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at . PMID:18366802
A General Conditional Large Deviation Principle

DOE PAGES

La Cour, Brian R.; Schieve, William C.

2015-07-18

Given a sequence of Borel probability measures on a Hausdorff space which satisfy a large deviation principle (LDP), we consider the corresponding sequence of measures formed by conditioning on a set B. If the large deviation rate function I is good and effectively continuous, and the conditioning set has the property that (1)more » $$\\overline{B°}$$=$$\\overline{B}$$ and (2) I(x)<∞ for all xε$$\\overline{B}$$, then the sequence of conditional measures satisfies a LDP with the good, effectively continuous rate function I B, where I B(x)=I(x)-inf I(B) if xε$$\\overline{B}$$ and I B(x)=∞ otherwise.« less
Sunflower Hybrid Breeding: From Markers to Genomic Selection

PubMed Central

Dimitrijevic, Aleksandra; Horn, Renate

2018-01-01

In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. Integrative approaches combining omic technologies (genomics, transcriptomics, proteomics, metabolomics and phenomics) using bioinformatic tools will facilitate the identification of target genes and markers for complex traits and will give a better insight into the mechanisms behind the traits. PMID:29387071
Genotyping by Sequencing for SNP-Based Linkage Analysis and Identification of QTLs Linked to Fruit Quality Traits in Japanese Plum (Prunus salicina Lindl.).

PubMed

Salazar, Juan A; Pacheco, Igor; Shinya, Paulina; Zapata, Patricio; Silva, Claudia; Aradhya, Mallikarjuna; Velasco, Dianne; Ruiz, David; Martínez-Gómez, Pedro; Infante, Rodrigo

2017-01-01

Marker-assisted selection (MAS) in stone fruit ( Prunus species) breeding is currently difficult to achieve due to the polygenic nature of the most relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for fine mapping using Single Nucleotide Polymorphisms (SNPs) from a reference genome. In this study, GBS was used to genotype 272 seedlings of three F1 Japanese plum ( Prunus salicina Lindl) progenies derived from crossing "98-99" (as a common female parent) with "Angeleno," "September King," and "September Queen" as male parents. Raw sequences were aligned to the Peach genome v1, and 42,909 filtered SNPs were obtained after sequence alignment. In addition, 153 seedlings from the "98-99" × "Angeleno" cross were used to develop a genetic map for each parent. A total of 981 SNPs were mapped (479 for "98-99" and 502 for "Angeleno"), covering a genetic distance of 688.8 and 647.03 cM, respectively. Fifty five seedlings from this progeny were phenotyped for different fruit quality traits including ripening time, fruit weight, fruit shape, chlorophyll index, skin color, flesh color, over color, firmness, and soluble solids content in the years 2015 and 2016. Linkage-based QTL analysis allowed the identification of genomic regions significantly associated with ripening time (LG4 of both parents and both phenotyping years), fruit skin color (LG3 and LG4 of both parents and both years), chlorophyll degradation index (LG3 of both parents in 2015) and fruit weight (LG7 of both parents in 2016). These results represent a promising situation for GBS in the identification of SNP variants associated to fruit quality traits, potentially applicable in breeding programs through MAS, in a highly heterozygous crop species such as Japanese plum.
Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

PubMed Central

Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

2012-01-01

Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421
Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences.

PubMed

Traini, Alessandra; Iorizzo, Massimo; Mann, Harpartap; Bradeen, James M; Carputo, Domenico; Frusciante, Luigi; Chiusano, Maria Luisa

2013-01-01

Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT) markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.
Finding similar nucleotide sequences using network BLAST searches.

PubMed

Ladunga, Istvan

2009-06-01

The Basic Local Alignment Search Tool (BLAST) is a keystone of bioinformatics due to its performance and user-friendliness. Beginner and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNA, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez, PUBMED, structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge.
Simple sequence repeat marker loci discovery using SSR primer.

PubMed

Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

2004-06-12

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Negative pressure wound therapy‐assisted dermatotraction for the closure of large open wounds in a patient with non‐clostridial gas gangrene

PubMed Central

Noborio, Mitsuhiro; Nishimura, Tetsuro; Ieki, Yohei; Shimahara, Yumiko; Sogabe, Taku; Ehara, Naoki; Saoyama, Yuki; Sadamitsu, Daikai

2015-01-01

Case A 53‐year‐old woman developed septic shock associated with non‐clostridial gas gangrene. She presented to the emergency department with two large open wounds on both thighs and in her sacral region. Non‐enhanced computed tomography showed air density in contact with the right iliopsoas, which extended to the posterior compartment of the thigh. We made repeated efforts at surgical debridement of the wound with resection of necrotic tissues. Outcome Using negative pressure wound therapy‐assisted dermatotraction, the pus pockets and the wound dehiscence decreased in size. Using this method we were successful in achieving delayed closure without skin grafts. Conclusion Negative pressure wound therapy can be an effective treatment for large and infected open contoured wounds. Negative pressure wound therapy‐assisted dermatotraction might be beneficial for poorly healing, large, open wounds in patients in poor condition and with insufficient reserve to tolerate reconstructive surgery. PMID:29123764

A Novel Partial Sequence Alignment Tool for Finding Large Deletions

PubMed Central

Aruk, Taner; Ustek, Duran; Kursun, Olcay

2012-01-01

Finding large deletions in genome sequences has become increasingly more useful in bioinformatics, such as in clinical research and diagnosis. Although there are a number of publically available next generation sequencing mapping and sequence alignment programs, these software packages do not correctly align fragments containing deletions larger than one kb. We present a fast alignment software package, BinaryPartialAlign, that can be used by wet lab scientists to find long structural variations in their experiments. For BinaryPartialAlign, we make use of the Smith-Waterman (SW) algorithm with a binary-search-based approach for alignment with large gaps that we called partial alignment. BinaryPartialAlign implementation is compared with other straight-forward applications of SW. Simulation results on mtDNA fragments demonstrate the effectiveness (runtime and accuracy) of the proposed method. PMID:22566777
Rotation sequence to report humerothoracic kinematics during 3D motion involving large horizontal component: application to the tennis forehand drive.

PubMed

Creveaux, Thomas; Sevrez, Violaine; Dumas, Raphaël; Chèze, Laurence; Rogowski, Isabelle

2018-03-01

The aim of this study was to examine the respective aptitudes of three rotation sequences (Y t X f 'Y h '', Z t X f 'Y h '', and X t Z f 'Y h '') to effectively describe the orientation of the humerus relative to the thorax during a movement involving a large horizontal abduction/adduction component: the tennis forehand drive. An optoelectronic system was used to record the movements of eight elite male players, each performing ten forehand drives. The occurrences of gimbal lock, phase angle discontinuity and incoherency in the time course of the three angles defining humerothoracic rotation were examined for each rotation sequence. Our results demonstrated that no single sequence effectively describes humerothoracic motion without discontinuities throughout the forehand motion. The humerothoracic joint angles can nevertheless be described without singularities when considering the backswing/forward-swing and the follow-through phases separately. Our findings stress that the sequence choice may have implications for the report and interpretation of 3D joint kinematics during large shoulder range of motion. Consequently, the use of Euler/Cardan angles to represent 3D orientation of the humerothoracic joint in sport tasks requires the evaluation of the rotation sequence regarding singularity occurrence before analysing the kinematic data, especially when the task involves a large shoulder range of motion in the horizontal plane.
Use of sequence-independent-single-primer-amplification (SISPA) for whole genome sequencing using illumina MiSeq platform for avian influenza virus, Newcastle disease virus, and infectious bronchitis virus

USDA-ARS?s Scientific Manuscript database

Over the past decade, Next Generation Sequencing (NGS) technologies, also called deep sequencing, have continued to evolve, increasing capacity and lower the cost necessary for large genome sequencing projects. The one of the advantage of NGS platforms is the possibility to sequence the samples with...
New Sequences with Low Correlation and Large Family Size

NASA Astrophysics Data System (ADS)

Zeng, Fanxin

In direct-sequence code-division multiple-access (DS-CDMA) communication systems and direct-sequence ultra wideband (DS-UWB) radios, sequences with low correlation and large family size are important for reducing multiple access interference (MAI) and accepting more active users, respectively. In this paper, a new collection of families of sequences of length pn-1, which includes three constructions, is proposed. The maximum number of cyclically distinct families without GMW sequences in each construction is φ(pn-1)/n·φ(pm-1)/m, where p is a prime number, n is an even number, and n=2m, and these sequences can be binary or polyphase depending upon choice of the parameter p. In Construction I, there are pn distinct sequences within each family and the new sequences have at most d+2 nontrivial periodic correlation {-pm-1, -1, pm-1, 2pm-1,…,dpm-1}. In Construction II, the new sequences have large family size p2n and possibly take the nontrivial correlation values in {-pm-1, -1, pm-1, 2pm-1,…,(3d-4)pm-1}. In Construction III, the new sequences possess the largest family size p(d-1)n and have at most 2d correlation levels {-pm-1, -1,pm-1, 2pm-1,…,(2d-2)pm-1}. Three constructions are near-optimal with respect to the Welch bound because the values of their Welch-Ratios are moderate, WR_??_d, WR_??_3d-4 and WR_??_2d-2, respectively. Each family in Constructions I, II and III contains a GMW sequence. In addition, Helleseth sequences and Niho sequences are special cases in Constructions I and III, and their restriction conditions to the integers m and n, pm≠2 (mod 3) and n≅0 (mod 4), respectively, are removed in our sequences. Our sequences in Construction III include the sequences with Niho type decimation 3·2m-2, too. Finally, some open questions are pointed out and an example that illustrates the performance of these sequences is given.
Southwest Border Education Assistance. Hearing Before the Subcommittee on Regional and Community Development of the Committee on Environment and Public Works, United States Senate, 95th Congress, 2nd Session on S. 2997, A Bill to Provide Financial Assistance for School Construction to Local Educational Agencies Educating Large Numbers of Immigrant Children Born in Mexico (May 16, 1978).

ERIC Educational Resources Information Center

Congress of the U.S., Washington, DC. Senate Committee on Environment and Public Works.

A hearing was held to consider S.2997, a bill which would provide financial assistance for school construction to local educational agencies educating large numbers of immigrant children born in Mexico. In opening remarks, Senator Lloyd Bentsen, Texas, explained that 58,000 Mexicans immigrated to the US in 1977; towns along the American border,…
Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

NASA Astrophysics Data System (ADS)

Chen, Ellson Y.

1997-05-01

So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.
Human Genome Sequencing in Health and Disease

PubMed Central

Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

2013-01-01

Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320
Lessons learnt on the analysis of large sequence data in animal genomics.

PubMed

Biscarini, F; Cozzi, P; Orozco-Ter Wengel, P

2018-04-06

The 'omics revolution has made a large amount of sequence data available to researchers and the industry. This has had a profound impact in the field of bioinformatics, stimulating unprecedented advancements in this discipline. Mostly, this is usually looked at from the perspective of human 'omics, in particular human genomics. Plant and animal genomics, however, have also been deeply influenced by next-generation sequencing technologies, with several genomics applications now popular among researchers and the breeding industry. Genomics tends to generate huge amounts of data, and genomic sequence data account for an increasing proportion of big data in biological sciences, due largely to decreasing sequencing and genotyping costs and to large-scale sequencing and resequencing projects. The analysis of big data poses a challenge to scientists, as data gathering currently takes place at a faster pace than does data processing and analysis, and the associated computational burden is increasingly taxing, making even simple manipulation, visualization and transferring of data a cumbersome operation. The time consumed by the processing and analysing of huge data sets may be at the expense of data quality assessment and critical interpretation. Additionally, when analysing lots of data, something is likely to go awry-the software may crash or stop-and it can be very frustrating to track the error. We herein review the most relevant issues related to tackling these challenges and problems, from the perspective of animal genomics, and provide researchers that lack extensive computing experience with guidelines that will help when processing large genomic data sets. © 2018 Stichting International Foundation for Animal Genetics.
Statistical properties of filtered pseudorandom digital sequences formed from the sum of maximum-length sequences

NASA Technical Reports Server (NTRS)

Wallace, G. R.; Weathers, G. D.; Graf, E. R.

1973-01-01

The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.
Unexpected series of regular frequency spacing of δ Scuti stars in the non-asymptotic regime - I. The methodology

DOE PAGES

Paparo, M.; Benko, J. M.; Hareter, M.; ...

2016-05-11

In this study, a sequence search method was developed to search the regular frequency spacing in δ Scuti stars through visual inspection and an algorithmic search. We searched for sequences of quasi-equally spaced frequencies, containing at least four members per sequence, in 90 δ Scuti stars observed by CoRoT. We found an unexpectedly large number of independent series of regular frequency spacing in 77 δ Scuti stars (from one to eight sequences) in the non-asymptotic regime. We introduce the sequence search method presenting the sequences and echelle diagram of CoRoT 102675756 and the structure of the algorithmic search. Four sequencesmore » (echelle ridges) were found in the 5–21 d –1 region where the pairs of the sequences are shifted (between 0.5 and 0.59 d –1) by twice the value of the estimated rotational splitting frequency (0.269 d –1). The general conclusions for the whole sample are also presented in this paper. The statistics of the spacings derived by the sequence search method, by FT (Fourier transform of the frequencies), and the statistics of the shifts are also compared. In many stars more than one almost equally valid spacing appeared. The model frequencies of FG Vir and their rotationally split components were used to formulate the possible explanation that one spacing is the large separation while the other is the sum of the large separation and the rotational frequency. In CoRoT 102675756, the two spacings (2.249 and 1.977 d –1) are in better agreement with the sum of a possible 1.710 d –1 large separation and two or one times, respectively, the value of the rotational frequency.« less
Unexpected series of regular frequency spacing of δ Scuti stars in the non-asymptotic regime - I. The methodology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paparo, M.; Benko, J. M.; Hareter, M.

In this study, a sequence search method was developed to search the regular frequency spacing in δ Scuti stars through visual inspection and an algorithmic search. We searched for sequences of quasi-equally spaced frequencies, containing at least four members per sequence, in 90 δ Scuti stars observed by CoRoT. We found an unexpectedly large number of independent series of regular frequency spacing in 77 δ Scuti stars (from one to eight sequences) in the non-asymptotic regime. We introduce the sequence search method presenting the sequences and echelle diagram of CoRoT 102675756 and the structure of the algorithmic search. Four sequencesmore » (echelle ridges) were found in the 5–21 d –1 region where the pairs of the sequences are shifted (between 0.5 and 0.59 d –1) by twice the value of the estimated rotational splitting frequency (0.269 d –1). The general conclusions for the whole sample are also presented in this paper. The statistics of the spacings derived by the sequence search method, by FT (Fourier transform of the frequencies), and the statistics of the shifts are also compared. In many stars more than one almost equally valid spacing appeared. The model frequencies of FG Vir and their rotationally split components were used to formulate the possible explanation that one spacing is the large separation while the other is the sum of the large separation and the rotational frequency. In CoRoT 102675756, the two spacings (2.249 and 1.977 d –1) are in better agreement with the sum of a possible 1.710 d –1 large separation and two or one times, respectively, the value of the rotational frequency.« less
Immunological and biochemical characterization of processing products from the neurotensin/neuromedin N precursor in the rat medullary thyroid carcinoma 6-23 cell line.

PubMed Central

Bidard, J N; de Nadai, F; Rovere, C; Moinier, D; Laur, J; Martinez, J; Cuber, J C; Kitabgi, P

1993-01-01

Neurotensin (NT) and neuromedin N (NN) are two related biologically active peptides that are encoded in the same precursor molecule. In the rat, the precursor consists of a 169-residue polypeptide starting with an N-terminal signal peptide and containing in its C-terminal region one copy each of NT and NN. NN precedes NT and is separated from it by a Lys-Arg sequence. Two other Lys-Arg sequences flank the N-terminus of NN and the C-terminus of NT. A fourth Lys-Arg sequence occurs near the middle of the precursor and is followed by an NN-like sequence. Finally, an Arg-Arg pair is present within the NT moiety. The four Lys-Arg doublets represent putative processing sites in the precursor molecule. The present study was designed to investigate the post-translational processing of the NT/NN precursor in the rat medullary thyroid carcinoma (rMTC) 6-23 cell line, which synthesizes large amounts of NT upon dexamethasone treatment. Five region-specific antisera recognizing the free N- or C-termini of sequences adjacent to the basic doublets were produced, characterized and used for immunoblotting and radioimmunoassay studies in combination with gel filtration, reverse-phase h.p.l.c. and trypsin digestion of rMTC 6-23 cell extracts. Because two of the antigenic sequences, i.e. NN and the NN-like sequence, start with a lysine residue that is essential for recognition by their respective antisera, a micromethod by which trypsin specifically cleaves at arginine residues was developed. The results show that dexamethasone-treated rMTC 6-23 cells produced comparable amounts of NT, NN and a peptide corresponding to a large N-terminal precursor fragment lacking the NN and NT moieties. This large fragment was purified. N-Terminal sequencing revealed that it started at residue Ser23 of the prepro-NT/NN sequence, and thus established the Cys22-Ser23 bond as the cleavage site of the signal peptide. Two other large N-terminal fragments bearing respectively the NN and NT sequences at their C-termini were present in lower amounts. The NN-like sequence was internal to all the large fragments. There was no evidence for the presence of peptides with the NN-like sequence at their N-termini. This shows that, in rMTC 6-23 cells, the precursor is readily processed at the three Lys-Arg doublets that flank and separate the NT and NN sequences. In contrast, the Lys-Arg doublet that precedes the NN-like sequence is not processed in this system.(ABSTRACT TRUNCATED AT 400 WORDS) Images Figure 3 PMID:8471039
44 CFR 206.191 - Duplication of benefits.

Code of Federal Regulations, 2010 CFR

2010-10-01

... to section 408 of the Stafford Act. (iii) Small Business Administration and Farmers Home Administration disaster loans; (iv) Other Needs assistance, pursuant to section 408 of the Stafford Act or its... resources it must consider before it does so. (4) If following the delivery sequence concept would adversely...
A Primer on Adventure Education in the Camp Setting.

ERIC Educational Resources Information Center

Nei, Eric

2003-01-01

Basic concepts of experiential learning theory are presented to assist camp directors in choosing knowledgeable staff and developing successful adventure programs. These concepts include assessment of learner (camper) readiness, activity sequencing, learning cycle, comfort zone, activity framing, task goals versus process goals, and five stages of…
Fatty Acid Methyl Ester (FAME) analyses for characterization and detection of grapevine pathogens

USDA-ARS?s Scientific Manuscript database

Grapevines can become infected by a variety of devastating pathogens, including the bacterium Xylella fastidiosa and canker fungi. Multiple strains of Xylella fastidiosa exist, each causing different diseases on various hosts. Although sequence-based genotyping can assist in distinguishing these str...
Trichoderma asperellum reconsidered: two cryptic species

USDA-ARS?s Scientific Manuscript database

Analysis of a world-wide collection of strains of Trichoderma asperellum using multilocus genealogies of four genomic regions (tef1, rbp2, act, ITS1, 2, 5.8s), sequence polymorphism-derived (SPD) markers, matrix-assisted laser desorption/ionisation–time of flight mass spectrometry (MALDI-TOF MS) of ...
Introduction to Systematic Instruction. (SCAT Project, Title VI-G).

ERIC Educational Resources Information Center

Idaho State Dept. of Education, Boise.

Developed by the staff of SCAT (Support, Competency-Assistance and Training), the document provides information on systematic instructional procedures for teachers of exceptional children. Briefly addressed are the seven components of systematic instruction: initial assessment, establishment of long term goals, sequencing of short term objectives,…
Genomic tools and and prospects for new breeding techniques in flower bulb crops

USDA-ARS?s Scientific Manuscript database

For many of the new breeding techniques, sequence information is of the utmost importance. In addition to current breeding techniques, such as marker-assisted selection (MAS) and genetic modification (GM), new breeding techniques such as zinc finger nucleases, oligonucleotide-mediated mutagenesis, R...
Direct deposition of silver nanoplates on quartz surface by sequence pre-treatment hydroxylation and silanisation.

PubMed

Abu Bakar, Norhayati; Mat Salleh, Muhamad; Ali Umar, Akrajas; Shapter, Joseph George

2017-01-01

Silver nanoparticles deposited on quartz substrates are widely used as SERS substrates. The nanoparticles can be deposited directly from colloidal solution by dipping technique. However, the adhesion of the particles on the quartz surface is very poor. Normally the substrate is pre-treated with hydroxylation or silanisation process. In this paper, we have demonstrated that the application of the sequence pre-treatment hydroxylation and silanisation have improved the density of silver nanoplates desposited on the quartz surface. •Sequence hydroxylation and silanisation pre-treatment assists the deposition of the nanoplate on the surface.•Various immersion times of the quartz surface into the colloidal nanoplates determined size distributions and density surface of the nanoplates on the surface.
Cry-Bt identifier: a biological database for PCR detection of Cry genes present in transgenic plants.

PubMed

Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil

2009-10-23

We describe the development of a user friendly tool that would assist in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitable designed primers for PCR identification of these genes. The tool designed based on relational database model enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.

Whole-genome CNV analysis: advances in computational approaches.

PubMed

Pirooznia, Mehdi; Goes, Fernando S; Zandi, Peter P

2015-01-01

Accumulating evidence indicates that DNA copy number variation (CNV) is likely to make a significant contribution to human diversity and also play an important role in disease susceptibility. Recent advances in genome sequencing technologies have enabled the characterization of a variety of genomic features, including CNVs. This has led to the development of several bioinformatics approaches to detect CNVs from next-generation sequencing data. Here, we review recent advances in CNV detection from whole genome sequencing. We discuss the informatics approaches and current computational tools that have been developed as well as their strengths and limitations. This review will assist researchers and analysts in choosing the most suitable tools for CNV analysis as well as provide suggestions for new directions in future development.
Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data.

PubMed

Parker, Nicolas J; Parker, Andrew G

2008-04-18

The advent of pyrophosphate sequencing makes large volumes of sequencing data available at a lower cost than previously possible. However, the short read lengths are difficult to assemble and the large dataset is difficult to handle. During the sequencing of a virus from the tsetse fly, Glossina pallidipes, we found the need for tools to search quickly a set of reads for near exact text matches. A set of tools is provided to search a large data set of pyrophosphate sequence reads under a "live" CD version of Linux on a standard PC that can be used by anyone without prior knowledge of Linux and without having to install a Linux setup on the computer. The tools permit short lengths of de novo assembly, checking of existing assembled sequences, selection and display of reads from the data set and gathering counts of sequences in the reads. Demonstrations are given of the use of the tools to help with checking an assembly against the fragment data set; investigating homopolymer lengths, repeat regions and polymorphisms; and resolving inserted bases caused by incomplete chain extension. The additional information contained in a pyrophosphate sequencing data set beyond a basic assembly is difficult to access due to a lack of tools. The set of simple tools presented here would allow anyone with basic computer skills and a standard PC to access this information.
Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leung, Elo; Huang, Amy; Cadag, Eithon

In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resultingmore » functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequencebased genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.« less
Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

DOE PAGES

Leung, Elo; Huang, Amy; Cadag, Eithon; ...

2016-01-20

In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resultingmore » functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequencebased genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.« less
15 CFR 700.5 - Special priorities assistance.

Code of Federal Regulations, 2011 CFR

2011-01-01

... DEFENSE PRIORITIES AND ALLOCATIONS SYSTEM Overview § 700.5 Special priorities assistance. (a) The DPAS is designed to be largely self-executing. However, from time-to-time production or delivery problems will...
15 CFR 700.5 - Special priorities assistance.

Code of Federal Regulations, 2010 CFR

2010-01-01

... DEFENSE PRIORITIES AND ALLOCATIONS SYSTEM Overview § 700.5 Special priorities assistance. (a) The DPAS is designed to be largely self-executing. However, from time-to-time production or delivery problems will...
15 CFR 700.5 - Special priorities assistance.

Code of Federal Regulations, 2012 CFR

2012-01-01

... DEFENSE PRIORITIES AND ALLOCATIONS SYSTEM Overview § 700.5 Special priorities assistance. (a) The DPAS is designed to be largely self-executing. However, from time-to-time production or delivery problems will...
15 CFR 700.5 - Special priorities assistance.

Code of Federal Regulations, 2014 CFR

2014-01-01

... DEFENSE PRIORITIES AND ALLOCATIONS SYSTEM Overview § 700.5 Special priorities assistance. (a) The DPAS is designed to be largely self-executing. However, from time-to-time production or delivery problems will...
15 CFR 700.5 - Special priorities assistance.

Code of Federal Regulations, 2013 CFR

2013-01-01

... DEFENSE PRIORITIES AND ALLOCATIONS SYSTEM Overview § 700.5 Special priorities assistance. (a) The DPAS is designed to be largely self-executing. However, from time-to-time production or delivery problems will...
Business Development Corporation, Inc.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jasek, S.

1995-12-31

Business Development Corporation, Inc., is a company specializing in opportunity seeking and business development activities in the {open_quotes}new{close_quotes} post communist Central and Eastern Europe, with particular emphasis on the Republics of Poland and Slovakia. The company currently focuses its expertise on strategic investing and business development between Central Europe and the United States of America. In Poland and Slovakia, the company specializes in developing large scale energy and environmental {open_quotes}infrastructure{close_quotes} development projects on the federal, state, and local level. In addition, the company assists large state owned industries in the transformation and privatization process. Business Development Corporation has assisted andmore » continues to assist in projects of national importance. The staff of experts advise numerous large Polish and Slovak companies, most owned or in the process of privatization, on matters of restructuring, finance, capital structure, strategic parternships or investors, mergers, acquisitions and joint ventures with U.S. based firms. The company also assists and advises on a variety of environmental and energy matters in the public and private sector.« less
Comparison of two methods for purification of enterocin B, a bacteriocin produced by Enterococcus faecium W3.

PubMed

Dündar, Halil; Atakay, Mehmet; Çelikbıçak, Ömür; Salih, Bekir; Bozoğlu, Faruk

2015-01-01

This study aimed to compare two different approaches for the purification of enterocin B from Enterococcus faecium strain W3 based on the observation that the bacteriocin was found both in cell associated form and in culture supernatant. The first approach employed ammonium sulfate precipitation, cation-exchange chromatography, and sequential reverse-phase high-performance liquid chromatography. The latter approach exploited a pH-mediated cell adsorption-desorption method to extract cell-bound bacteriocin, and one run of reverse-phase chromatography. The first method resulted in purification of enterocin B with a recovery of 4% of the initial bacteriocin activity found in culture supernatant. MALDI-TOF MS analysis and de novo peptide sequencing of the purified bacteriocin confirmed that the active peptide was enterocin B. The second method achieved the purification of enterocin B with a higher recovery (16%) and enabled us to achieve pure bacteriocin within a shorter period of time by avoiding time consuming purification protocols. The purity and identity of the active peptide were confirmed again by matrix-assisted laser desorption/ionization time-of flight (MALDI-TOF) mass spectrometry (MS) analysis. Although both approaches were satisfactory to obtain a sufficient amount of enterocin B for use in MS and amino acid sequence analysis, the latter was proved to be applicable in large-scale and rapid purification of enterocin B.
Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

PubMed

Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

2013-09-23

Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation.

PubMed

Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie

2012-04-02

There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation

PubMed Central

2012-01-01

Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. Conclusions These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry. PMID:22471859
A Pilot Study of a Picture- and Audio-Assisted Self-Interviewing Method (PIASI) for the Study of Sensitive Questions on HIV in the Field

ERIC Educational Resources Information Center

Aarnio, Pauliina; Kulmala, Teija

2016-01-01

Self-interview methods such as audio computer-assisted self-interviewing (ACASI) are used to improve the accuracy of interview data on sensitive topics in large trials. Small field studies on sensitive topics would benefit from methodological alternatives. In a study on male involvement in antenatal HIV testing in a largely illiterate population…
Analysis of complete mitochondrial genome sequences increases phylogenetic resolution of bears (Ursidae), a mammalian family that experienced rapid speciation.

PubMed

Yu, Li; Li, Yi-Wei; Ryder, Oliver A; Zhang, Ya-Ping

2007-10-24

Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt) gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. It remains to be seen if the reliability of mt genome analysis will hold up in studies of other difficult phylogenetic issues. Although the whole mitochondrial DNA sequence based phylogeny is robust, it remains in conflict with phylogenetic relationships suggested by analysis of limited nuclear-encoded data, a situation that will require gathering more nuclear DNA sequence information.
Mining candidate genes associated with powdery mildew resistance in cucumber via super-BSA by specific length amplified fragment (SLAF) sequencing.

PubMed

Zhang, Peng; Zhu, Yuqiang; Wang, Lili; Chen, Liping; Zhou, Shengjun

2015-12-14

Powdery mildew (PM) is the most common fungal disease of cucumber and other cucurbit crops, while breeding the PM-resistant materials is the effective way to defense this disease, and the recent development of modern genetics and genomics make us aware of that studying the resistance genes is the essential way to breed the PM high-resistance plant. With the ever increasing throughput of next-generation sequencing (NGS), the development of specific length amplified fragment sequencing (SLAF-seq) as a high-resolution strategy for large-scale de novo SNP discovery is gradually applied for functional gene mining. Here we combined the bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with PM resistance in cucumber. A segregating population comprising 251 F2 individuals was developed using H136 (female parent) as susceptible parent and BK2 (male parent) as resistance donor. After PMR test, total genomic DNA was prepared from each plant. Systemic genomic analysis of the GC content, repeat sequence, etc. was carried out by prediction software SLAF_Predict to establish condition to ensure the uniformity and density of the molecular markers. After samples were gel purified, SLAFs were generated at Biomarker Technologies Corporation in Beijing. Based on SLAF tags and the PMR test result, the hot region were annotated. A total of 73,100 high-quality SLAF tags with an average depth of 99.11× were sequenced. Among these, 5,355 polymorphic tags were identified with a polymorphism rate of 7.34 %, including 7.09 % SNPs and other polymorphism types. Finally, 140 associated SLAFs were identified, and two main Hot Regions were detected on chromosome 1 and 6, which contained five genes invovled in defense response, toxin metabolism, cell stress response, and injury response in cucumber. Associated markers identified by super-BSA in this study, could not only speed up the study of the PMR genes, but also provide a feasible solution for breeding the marker-assisted PMR cucumber. Moreover, this study could also be extended to any other species with reference genome.
Structural characterization of copia-type retrotransposons leads to insights into the marker development in a biofuel crop, Jatropha curcas L.

PubMed Central

2013-01-01

Background Recently, Jatropha curcas L. has attracted worldwide attention for its potential as a source of biodiesel. However, most DNA markers have demonstrated high levels of genetic similarity among and within jatropha populations around the globe. Despite promising features of copia-type retrotransposons as ideal genetic tools for gene tagging, mutagenesis, and marker-assisted selection, they have not been characterized in the jatropha genome yet. Here, we examined the diversity, evolution, and genome-wide organization of copia-type retrotransposons in the Asian, African, and Mesoamerican accessions of jatropha, then introduced a retrotransposon-based marker for this biofuel crop. Results In total, 157 PCR fragments that were amplified using the degenerate primers for the reverse transcriptase (RT) domain of copia-type retroelements were sequenced and aligned to construct the neighbor-joining tree. Phylogenetic analysis demonstrated that isolated copia RT sequences were classified into ten families, which were then grouped into three lineages. An in-depth study of the jatropha genome for the RT sequences of each family led to the characterization of full consensus sequences of the jatropha copia-type families. Estimated copy numbers of target sequences were largely different among families, as was presence of genes within 5 kb flanking regions for each family. Five copia-type families were as appealing candidates for the development of DNA marker systems. A candidate marker from family Jc7 was particularly capable of detecting genetic variation among different jatropha accessions. Fluorescence in situ hybridization (FISH) to metaphase chromosomes reveals that copia-type retrotransposons are scattered across chromosomes mainly located in the distal part regions. Conclusion This is the first report on genome-wide analysis and the cytogenetic mapping of copia-type retrotransposons of jatropha, leading to the discovery of families bearing high potential as DNA markers. Distinct dynamics of individual copia-type families, feasibility of a retrotransposon-based insertion polymorphism marker system in examining genetic variability, and approaches for the development of breeding strategies in jatropha using copia-type retrotransposons are discussed. PMID:24020916
Analysis of complete mitochondrial genome sequences increases phylogenetic resolution of bears (Ursidae), a mammalian family that experienced rapid speciation

PubMed Central

Yu, Li; Li, Yi-Wei; Ryder, Oliver A; Zhang, Ya-Ping

2007-01-01

Background Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt) gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. Results This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Conclusion Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. It remains to be seen if the reliability of mt genome analysis will hold up in studies of other difficult phylogenetic issues. Although the whole mitochondrial DNA sequence based phylogeny is robust, it remains in conflict with phylogenetic relationships suggested by analysis of limited nuclear-encoded data, a situation that will require gathering more nuclear DNA sequence information. PMID:17956639
Earthquake forecasting during the complex Amatrice-Norcia seismic sequence

PubMed Central

Marzocchi, Warner; Taroni, Matteo; Falcone, Giuseppe

2017-01-01

Earthquake forecasting is the ultimate challenge for seismologists, because it condenses the scientific knowledge about the earthquake occurrence process, and it is an essential component of any sound risk mitigation planning. It is commonly assumed that, in the short term, trustworthy earthquake forecasts are possible only for typical aftershock sequences, where the largest shock is followed by many smaller earthquakes that decay with time according to the Omori power law. We show that the current Italian operational earthquake forecasting system issued statistically reliable and skillful space-time-magnitude forecasts of the largest earthquakes during the complex 2016–2017 Amatrice-Norcia sequence, which is characterized by several bursts of seismicity and a significant deviation from the Omori law. This capability to deliver statistically reliable forecasts is an essential component of any program to assist public decision-makers and citizens in the challenging risk management of complex seismic sequences. PMID:28924610

Earthquake forecasting during the complex Amatrice-Norcia seismic sequence.

PubMed

Marzocchi, Warner; Taroni, Matteo; Falcone, Giuseppe

2017-09-01

Earthquake forecasting is the ultimate challenge for seismologists, because it condenses the scientific knowledge about the earthquake occurrence process, and it is an essential component of any sound risk mitigation planning. It is commonly assumed that, in the short term, trustworthy earthquake forecasts are possible only for typical aftershock sequences, where the largest shock is followed by many smaller earthquakes that decay with time according to the Omori power law. We show that the current Italian operational earthquake forecasting system issued statistically reliable and skillful space-time-magnitude forecasts of the largest earthquakes during the complex 2016-2017 Amatrice-Norcia sequence, which is characterized by several bursts of seismicity and a significant deviation from the Omori law. This capability to deliver statistically reliable forecasts is an essential component of any program to assist public decision-makers and citizens in the challenging risk management of complex seismic sequences.
Gesteme-free context-aware adaptation of robot behavior in human-robot cooperation.

PubMed

Nessi, Federico; Beretta, Elisa; Gatti, Cecilia; Ferrigno, Giancarlo; De Momi, Elena

2016-11-01

Cooperative robotics is receiving greater acceptance because the typical advantages provided by manipulators are combined with an intuitive usage. In particular, hands-on robotics may benefit from the adaptation of the assistant behavior with respect to the activity currently performed by the user. A fast and reliable classification of human activities is required, as well as strategies to smoothly modify the control of the manipulator. In this scenario, gesteme-based motion classification is inadequate because it needs the observation of a wide signal percentage and the definition of a rich vocabulary. In this work, a system able to recognize the user's current activity without a vocabulary of gestemes, and to accordingly adapt the manipulator's dynamic behavior is presented. An underlying stochastic model fits variations in the user's guidance forces and the resulting trajectories of the manipulator's end-effector with a set of Gaussian distribution. The high-level switching between these distributions is captured with hidden Markov models. The dynamic of the KUKA light-weight robot, a torque-controlled manipulator, is modified with respect to the classified activity using sigmoidal-shaped functions. The presented system is validated over a pool of 12 näive users in a scenario that addresses surgical targeting tasks on soft tissue. The robot's assistance is adapted in order to obtain a stiff behavior during activities that require critical accuracy constraint, and higher compliance during wide movements. Both the ability to provide the correct classification at each moment (sample accuracy) and the capability of correctly identify the correct sequence of activity (sequence accuracy) were evaluated. The proposed classifier is fast and accurate in all the experiments conducted (80% sample accuracy after the observation of ∼450ms of signal). Moreover, the ability of recognize the correct sequence of activities, without unwanted transitions is guaranteed (sequence accuracy ∼90% when computed far away from user desired transitions). Finally, the proposed activity-based adaptation of the robot's dynamic does not lead to a not smooth behavior (high smoothness, i.e. normalized jerk score <0.01). The provided system is able to dynamic assist the operator during cooperation in the presented scenario. Copyright © 2016 Elsevier B.V. All rights reserved.
Building the Scientific Modeling Assistant: An interactive environment for specialized software design

NASA Technical Reports Server (NTRS)

Keller, Richard M.

1991-01-01

The construction of scientific software models is an integral part of doing science, both within NASA and within the scientific community at large. Typically, model-building is a time-intensive and painstaking process, involving the design of very large, complex computer programs. Despite the considerable expenditure of resources involved, completed scientific models cannot easily be distributed and shared with the larger scientific community due to the low-level, idiosyncratic nature of the implemented code. To address this problem, we have initiated a research project aimed at constructing a software tool called the Scientific Modeling Assistant. This tool provides automated assistance to the scientist in developing, using, and sharing software models. We describe the Scientific Modeling Assistant, and also touch on some human-machine interaction issues relevant to building a successful tool of this type.
High-resolution phylogenetic microbial community profiling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

2014-03-17

The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less
Tumor Heterogeneity, Single-Cell Sequencing, and Drug Resistance.

PubMed

Schmidt, Felix; Efferth, Thomas

2016-06-16

Tumor heterogeneity has been compared with Darwinian evolution and survival of the fittest. The evolutionary ecosystem of tumors consisting of heterogeneous tumor cell populations represents a considerable challenge to tumor therapy, since all genetically and phenotypically different subpopulations have to be efficiently killed by therapy. Otherwise, even small surviving subpopulations may cause repopulation and refractory tumors. Single-cell sequencing allows for a better understanding of the genomic principles of tumor heterogeneity and represents the basis for more successful tumor treatments. The isolation and sequencing of single tumor cells still represents a considerable technical challenge and consists of three major steps: (1) single cell isolation (e.g., by laser-capture microdissection), fluorescence-activated cell sorting, micromanipulation, whole genome amplification (e.g., with the help of Phi29 DNA polymerase), and transcriptome-wide next generation sequencing technologies (e.g., 454 pyrosequencing, Illumina sequencing, and other systems). Data demonstrating the feasibility of single-cell sequencing for monitoring the emergence of drug-resistant cell clones in patient samples are discussed herein. It is envisioned that single-cell sequencing will be a valuable asset to assist the design of regimens for personalized tumor therapies based on tumor subpopulation-specific genetic alterations in individual patients.
Differential processing of pro-neurotensin/neuromedin N and relationship to pro-hormone convertases.

PubMed

Kitabgi, Patrick

2006-10-01

Neurotensin (NT) is synthesized as part of a larger precursor that also contains neuromedin N (NN), a six amino acid neurotensin-like peptide. NT and NN are located in the C-terminal region of the precursor (pro-NT/NN) where they are flanked and separated by three Lys-Arg sequences. A fourth dibasic sequence is present in the middle of the precursor. Dibasics are the consensus sites recognized and cleaved by endoproteases that belong to the recently identified family of pro-protein convertases (PCs). In tissues that express pro-NT/NN, the three C-terminal Lys-Arg sites are differentially processed, whereas the middle dibasic is poorly cleaved. Pro-NT/NN processing gives rise mainly to NT and NN in the brain, to NT and a large peptide ending with the NN sequence at its C-terminus (large NN) in the gut and to NT, large NN and a large peptide ending with the NT sequence (large NT) in the adrenals. Recent evidence indicates that PC1, PC2 and PC5-A are the pro-hormone convertases responsible for the processing patterns observed in the gut, brain and adrenals, respectively. As NT, NN, large NT and large NN are all endowed with biological activity, the evidence reviewed here supports the idea that post-translational processing of pro-NT/NN in tissues may generate biological diversity.
Genome Mapping and Molecular Breeding of Tomato

PubMed Central

Foolad, Majid R.

2007-01-01

The cultivated tomato, Lycopersicon esculentum, is the second most consumed vegetable worldwide and a well-studied crop species in terms of genetics, genomics, and breeding. It is one of the earliest crop plants for which a genetic linkage map was constructed, and currently there are several molecular maps based on crosses between the cultivated and various wild species of tomato. The high-density molecular map, developed based on an L. esculentum × L. pennellii cross, includes more than 2200 markers with an average marker distance of less than 1 cM and an average of 750 kbp per cM. Different types of molecular markers such as RFLPs, AFLPs, SSRs, CAPS, RGAs, ESTs, and COSs have been developed and mapped onto the 12 tomato chromosomes. Markers have been used extensively for identification and mapping of genes and QTLs for many biologically and agriculturally important traits and occasionally for germplasm screening, fingerprinting, and marker-assisted breeding. The utility of MAS in tomato breeding has been restricted largely due to limited marker polymorphism within the cultivated species and economical reasons. Also, when used, MAS has been employed mainly for improving simply-inherited traits and not much for improving complex traits. The latter has been due to unavailability of reliable PCR-based markers and problems with linkage drag. Efforts are being made to develop high-throughput markers with greater resolution, including SNPs. The expanding tomato EST database, which currently includes ∼214 000 sequences, the new microarray DNA chips, and the ongoing sequencing project are expected to aid development of more practical markers. Several BAC libraries have been developed that facilitate map-based cloning of genes and QTLs. Sequencing of the euchromatic portions of the tomato genome is paving the way for comparative and functional analysis of important genes and QTLs. PMID:18364989
Differential Distribution of Type II CRISPR-Cas Systems in Agricultural and Nonagricultural Campylobacter coli and Campylobacter jejuni Isolates Correlates with Lack of Shared Environments

PubMed Central

Pearson, Bruce M.; Louwen, Rogier; van Baarlen, Peter; van Vliet, Arnoud H.M.

2015-01-01

CRISPR (clustered regularly interspaced palindromic repeats)-Cas (CRISPR-associated) systems are sequence-specific adaptive defenses against phages and plasmids which are widespread in prokaryotes. Here we have studied whether phylogenetic relatedness or sharing of environmental niches affects the distribution and dissemination of Type II CRISPR-Cas systems, first in 132 bacterial genomes from 15 phylogenetic classes, ranging from Proteobacteria to Actinobacteria. There was clustering of distinct Type II CRISPR-Cas systems in phylogenetically distinct genera with varying G+C%, which share environmental niches. The distribution of CRISPR-Cas within a genus was studied using a large collection of genome sequences of the closely related Campylobacter species Campylobacter jejuni (N = 3,746) and Campylobacter coli (N = 486). The Cas gene cas9 and CRISPR-repeat are almost universally present in C. jejuni genomes (98.0% positive) but relatively rare in C. coli genomes (9.6% positive). Campylobacter jejuni and agricultural C. coli isolates share the C. jejuni CRISPR-Cas system, which is closely related to, but distinct from the C. coli CRISPR-Cas system found in C. coli isolates from nonagricultural sources. Analysis of the genomic position of CRISPR-Cas insertion suggests that the C. jejuni-type CRISPR-Cas has been transferred to agricultural C. coli. Conversely, the absence of the C. coli-type CRISPR-Cas in agricultural C. coli isolates may be due to these isolates not sharing the same environmental niche, and may be affected by farm hygiene and biosecurity practices in the agricultural sector. Finally, many CRISPR spacer alleles were linked with specific multilocus sequence types, suggesting that these can assist molecular epidemiology applications for C. jejuni and C. coli. PMID:26338188
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping.

PubMed

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-04-03

The American cranberry ( Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. Copyright © 2017 Schlautman et al.
Metagenomic Analysis of the Microbiota from the Crop of an Invasive Snail Reveals a Rich Reservoir of Novel Genes

PubMed Central

Cardoso, Alexander M.; Cavalcante, Janaína J. V.; Cantão, Maurício E.; Thompson, Claudia E.; Flatschart, Roberto B.; Glogauer, Arnaldo; Scapin, Sandra M. N.; Sade, Youssef B.; Beltrão, Paulo J. M. S. I.; Gerber, Alexandra L.; Martins, Orlando B.; Garcia, Eloi S.; de Souza, Wanderley; Vasconcelos, Ana Tereza R.

2012-01-01

The shortage of petroleum reserves and the increase in CO2 emissions have raised global concerns and highlighted the importance of adopting sustainable energy sources. Second-generation ethanol made from lignocellulosic materials is considered to be one of the most promising fuels for vehicles. The giant snail Achatina fulica is an agricultural pest whose biotechnological potential has been largely untested. Here, the composition of the microbial population within the crop of this invasive land snail, as well as key genes involved in various biochemical pathways, have been explored for the first time. In a high-throughput approach, 318 Mbp of 454-Titanium shotgun metagenomic sequencing data were obtained. The predominant bacterial phylum found was Proteobacteria, followed by Bacteroidetes and Firmicutes. Viruses, Fungi, and Archaea were present to lesser extents. The functional analysis reveals a variety of microbial genes that could assist the host in the degradation of recalcitrant lignocellulose, detoxification of xenobiotics, and synthesis of essential amino acids and vitamins, contributing to the adaptability and wide-ranging diet of this snail. More than 2,700 genes encoding glycoside hydrolase (GH) domains and carbohydrate-binding modules were detected. When we compared GH profiles, we found an abundance of sequences coding for oligosaccharide-degrading enzymes (36%), very similar to those from wallabies and giant pandas, as well as many novel cellulase and hemicellulase coding sequences, which points to this model as a remarkable potential source of enzymes for the biofuel industry. Furthermore, this work is a major step toward the understanding of the unique genetic profile of the land snail holobiont. PMID:23133637
Identification of Fur, Aconitase, and Other Proteins Expressed by Mycobacterium tuberculosis under Conditions of Low and High Concentrations of Iron by Combined Two-Dimensional Gel Electrophoresis and Mass Spectrometry

PubMed Central

Wong, Diane K.; Lee, Bai-Yu; Horwitz, Marcus A.; Gibson, Bradford W.

1999-01-01

Iron plays a critical role in the pathophysiology of Mycobacterium tuberculosis. To gain a better understanding of iron regulation by this organism, we have used two-dimensional (2-D) gel electrophoresis, mass spectrometry, and database searching to study protein expression in M. tuberculosis under conditions of high and low iron concentration. Proteins in cellular extracts from M. tuberculosis Erdman strain grown under low-iron (1 μM) and high-iron (70 μM) conditions were separated by 2-D polyacrylamide gel electrophoresis, which allowed high-resolution separation of several hundred proteins, as visualized by Coomassie staining. The expression of at least 15 proteins was induced, and the expression of at least 12 proteins was decreased under low-iron conditions. In-gel trypsin digestion was performed on these differentially expressed proteins, and the digestion mixtures were analyzed by matrix-assisted laser desorption ionization time-of-flight mass spectrometry to determine the molecular masses of the resulting tryptic peptides. Partial sequence data on some of the peptides were obtained by using after source decay and/or collision-induced dissociation. The fragmentation data were used to search computerized peptide mass and protein sequence databases for known proteins. Ten iron-regulated proteins were identified, including Fur and aconitase proteins, both of which are known to be regulated by iron in other bacterial systems. Our study shows that, where large protein sequence databases are available from genomic studies, the combined use of 2-D gel electrophoresis, mass spectrometry, and database searching to analyze proteins expressed under defined environmental conditions is a powerful tool for identifying expressed proteins and their physiologic relevance. PMID:9864233
Pearls and Pitfalls in Evaluating a Student Assistance Program: A Five-Year Case Study

ERIC Educational Resources Information Center

Wilburn, Sharon T.; Wilburn, Kenneth T.; Weaver, Dax M.; Bowles, Kathy

2007-01-01

This article presents data from a five-year evaluation-research case study of a large urban schools district's internal Student Assistance Program (SAP). The district employed specially trained and licensed school-based counselors to implement an internal SAP expanded to include tertiary prevention, and modeled after an employee assistance program…
The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding.

PubMed

Shirasawa, Kenta; Isuzugawa, Kanji; Ikenaga, Mitsunobu; Saito, Yutaro; Yamamoto, Toshiya; Hirakawa, Hideki; Isobe, Sachiko

2017-10-01

We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site-associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)). © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Simulations Using Random-Generated DNA and RNA Sequences

ERIC Educational Resources Information Center

Bryce, C. F. A.

1977-01-01

Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism

USDA-ARS?s Scientific Manuscript database

Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
cis-Apa: a practical linker for the microwave-assisted preparation of cyclic pseudopeptides via RCM cyclative cleavage.

PubMed

Baron, Alice; Verdié, Pascal; Martinez, Jean; Lamaty, Frédéric

2011-02-04

A new linker cis-5-aminopent-3-enoic acid (cis-Apa) was prepared for the synthesis of cyclic pseudopeptides by cyclization-cleavage by using ring-closing methatesis (RCM). We developed a new synthetic pathway for the preparation of the cis-Apa linker that was tested in the cyclization-cleavage process of different RGD peptide sequences. Different macrocyclic peptidomimetics were prepared by using this integrated microwave-assisted method, showing that the readily available cis-Apa amino acid is well adapted as a linker in the cyclization-cleavage process.
Species Identification of Clinical Prevotella Isolates by Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry

PubMed Central

Soetens, Oriane; De Bel, Annelies; Echahidi, Fedoua; Vancutsem, Ellen; Vandoorslaer, Kristof; Piérard, Denis

2012-01-01

The performance of matrix-assisted laser desorption–ionization time of flight mass spectrometry (MALDI-TOF MS) for species identification of Prevotella was evaluated and compared with 16S rRNA gene sequencing. Using a Bruker database, 62.7% of the 102 clinical isolates were identified to the species level and 73.5% to the genus level. Extension of the commercial database improved these figures to, respectively, 83.3% and 89.2%. MALDI-TOF MS identification of Prevotella is reliable but needs a more extensive database. PMID:22301022
PumpKin: A tool to find principal pathways in plasma chemical models

NASA Astrophysics Data System (ADS)

Markosyan, A. H.; Luque, A.; Gordillo-Vázquez, F. J.; Ebert, U.

2014-10-01

PumpKin is a software package to find all principal pathways, i.e. the dominant reaction sequences, in chemical reaction systems. Although many tools are available to integrate numerically arbitrarily complex chemical reaction systems, few tools exist in order to analyze the results and interpret them in relatively simple terms. In particular, due to the large disparity in the lifetimes of the interacting components, it is often useful to group reactions into pathways that recycle the fastest species. This allows a researcher to focus on the slow chemical dynamics, eliminating the shortest timescales. Based on the algorithm described by Lehmann (2004), PumpKin automates the process of finding such pathways, allowing the user to analyze complex kinetics and to understand the consumption and production of a certain species of interest. We designed PumpKin with an emphasis on plasma chemical systems but it can also be applied to atmospheric modeling and to industrial applications such as plasma medicine and plasma-assisted combustion.
Puerto Rico water resources planning model program description

USGS Publications Warehouse

Moody, D.W.; Maddock, Thomas; Karlinger, M.R.; Lloyd, J.J.

1973-01-01

Because the use of the Mathematical Programming System -Extended (MPSX) to solve large linear and mixed integer programs requires the preparation of many input data cards, a matrix generator program to produce the MPSX input data from a much more limited set of data may expedite the use of the mixed integer programming optimization technique. The Model Definition and Control Program (MODCQP) is intended to assist a planner in preparing MPSX input data for the Puerto Rico Water Resources Planning Model. The model utilizes a mixed-integer mathematical program to identify a minimum present cost set of water resources projects (diversions, reservoirs, ground-water fields, desalinization plants, water treatment plants, and inter-basin transfers of water) which will meet a set of future water demands and to determine their sequence of construction. While MODCOP was specifically written to generate MPSX input data for the planning model described in this report, the program can be easily modified to reflect changes in the model's mathematical structure.
CRISPRcompar: a website to compare clustered regularly interspaced short palindromic repeats.

PubMed

Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

2008-07-01

Clustered regularly interspaced short palindromic repeat (CRISPR) elements are a particular family of tandem repeats present in prokaryotic genomes, in almost all archaea and in about half of bacteria, and which participate in a mechanism of acquired resistance against phages. They consist in a succession of direct repeats (DR) of 24-47 bp separated by similar sized unique sequences (spacers). In the large majority of cases, the direct repeats are highly conserved, while the number and nature of the spacers are often quite diverse, even among strains of a same species. Furthermore, the acquisition of new units (DR + spacer) was shown to happen almost exclusively on one side of the locus. Therefore, the CRISPR presents an interesting genetic marker for comparative and evolutionary analysis of closely related bacterial strains. CRISPRcompar is a web service created to assist biologists in the CRISPR typing process. Two tools facilitates the in silico investigation: CRISPRcomparison and CRISPRtionary. This website is freely accessible at http://crispr.u-psud.fr/CRISPRcompar/.

The Accuracy of Molecular Processes

NASA Astrophysics Data System (ADS)

Stavans, Joel

Recombination is arguably one of the most fundamental mechanisms driving genetic diversity during evolution. Recombination takes place in one way or another from viruses such as HIV and polio, to bacteria, and finally to man. In both prokaryotes and eukaryotes, homologous recombination is assisted by enzymes, recombinases, that promote the exchange of strands between two segments of DNA, thereby creating new genetic combinations. In bacteria, homologous recombination takes place as a pathway for the repair of DNA lesions and also during horizontal or lateral gene transfer processes, in which cells take in exogenous pieces of DNA. This allows bacteria to evolve rapidly by acquiring large sequences of DNA, a process which would take too long by gene duplications and single mutations. I will survey recent results on the fidelity of homologous recombination as catalyzed by the bacterial recombinase RecA. These results show discrimination up to the level of single base mismatches, during the initial stages of the recombination process. A cascaded kinetic proofreading process is proposed to explain this high discrimination. Kinetic proofreading ideas are also reviewed.
GIS-based planning system for managing the flow of construction and demolition waste in Brazil.

PubMed

Paz, Diogo Henrique Fernandes da; Lafayette, Kalinny Patrícia Vaz; Sobral, Maria do Carmo

2018-05-01

The objective of this article was to plan a network for municipal management of construction and demolition waste in Brazil with the assistance of a geographic information system, using the city of Recife as a case study. The methodology was carried out in three stages. The first was to map the illegal construction and demolition of waste disposal points across Recife and classify the waste according to its recyclability. In sequence, a method for indicating suitable areas for installation of voluntary delivery points, for small waste generators, are presented. Finally, a method for indicating suitable areas for the installation of trans-shipment and waste sorting areas, developed for large generators, is presented. The results show that a geographic information system is an essential tool in the planning of municipal construction and demolition waste management, in order to facilitate the spatial analysis and control the generation, sorting, collection, transportation, and final destination of construction and demolition waste, increasing the rate of recovery and recycling of materials.
Gemi: PCR Primers Prediction from Multiple Alignments

PubMed Central

Sobhy, Haitham; Colson, Philippe

2012-01-01

Designing primers and probes for polymerase chain reaction (PCR) is a preliminary and critical step that requires the identification of highly conserved regions in a given set of sequences. This task can be challenging if the targeted sequences display a high level of diversity, as frequently encountered in microbiologic studies. We developed Gemi, an automated, fast, and easy-to-use bioinformatics tool with a user-friendly interface to design primers and probes based on multiple aligned sequences. This tool can be used for the purpose of real-time and conventional PCR and can deal efficiently with large sets of sequences of a large size. PMID:23316117
Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

PubMed

Szymanski, Maciej; Karlowski, Wojciech M

2016-01-01

In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.
Pair-barcode high-throughput sequencing for large-scale multiplexed sample analysis

PubMed Central

2012-01-01

Background The multiplexing becomes the major limitation of the next-generation sequencing (NGS) in application to low complexity samples. Physical space segregation allows limited multiplexing, while the existing barcode approach only permits simultaneously analysis of up to several dozen samples. Results Here we introduce pair-barcode sequencing (PBS), an economic and flexible barcoding technique that permits parallel analysis of large-scale multiplexed samples. In two pilot runs using SOLiD sequencer (Applied Biosystems Inc.), 32 independent pair-barcoded miRNA libraries were simultaneously discovered by the combination of 4 unique forward barcodes and 8 unique reverse barcodes. Over 174,000,000 reads were generated and about 64% of them are assigned to both of the barcodes. After mapping all reads to pre-miRNAs in miRBase, different miRNA expression patterns are captured from the two clinical groups. The strong correlation using different barcode pairs and the high consistency of miRNA expression in two independent runs demonstrates that PBS approach is valid. Conclusions By employing PBS approach in NGS, large-scale multiplexed pooled samples could be practically analyzed in parallel so that high-throughput sequencing economically meets the requirements of samples which are low sequencing throughput demand. PMID:22276739
Pair-barcode high-throughput sequencing for large-scale multiplexed sample analysis.

PubMed

Tu, Jing; Ge, Qinyu; Wang, Shengqin; Wang, Lei; Sun, Beili; Yang, Qi; Bai, Yunfei; Lu, Zuhong

2012-01-25

The multiplexing becomes the major limitation of the next-generation sequencing (NGS) in application to low complexity samples. Physical space segregation allows limited multiplexing, while the existing barcode approach only permits simultaneously analysis of up to several dozen samples. Here we introduce pair-barcode sequencing (PBS), an economic and flexible barcoding technique that permits parallel analysis of large-scale multiplexed samples. In two pilot runs using SOLiD sequencer (Applied Biosystems Inc.), 32 independent pair-barcoded miRNA libraries were simultaneously discovered by the combination of 4 unique forward barcodes and 8 unique reverse barcodes. Over 174,000,000 reads were generated and about 64% of them are assigned to both of the barcodes. After mapping all reads to pre-miRNAs in miRBase, different miRNA expression patterns are captured from the two clinical groups. The strong correlation using different barcode pairs and the high consistency of miRNA expression in two independent runs demonstrates that PBS approach is valid. By employing PBS approach in NGS, large-scale multiplexed pooled samples could be practically analyzed in parallel so that high-throughput sequencing economically meets the requirements of samples which are low sequencing throughput demand.
Sequences Of Amino Acids For Human Serum Albumin

NASA Technical Reports Server (NTRS)

Carter, Daniel C.

1992-01-01

Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.
PLATO Instruction for Elementary Accounting.

ERIC Educational Resources Information Center

McKeown, James C.

A progress report of a study using computer assisted instruction (CAI) materials for an elementary course in accounting principles is presented. The study was based on the following objectives: (1) improvement of instruction in the elementary accounting sequence, and (2) help for transfer students from two-year institutions. The materials under…
Intelligent Web-Based Learning System with Personalized Learning Path Guidance

ERIC Educational Resources Information Center

Chen, C. M.

2008-01-01

Personalized curriculum sequencing is an important research issue for web-based learning systems because no fixed learning paths will be appropriate for all learners. Therefore, many researchers focused on developing e-learning systems with personalized learning mechanisms to assist on-line web-based learning and adaptively provide learning paths…
Effects on Learning Logographic Character Formation in Computer-Assisted Handwriting Instruction

ERIC Educational Resources Information Center

Tsai, Chen-hui; Kuo, Chin-Hwa; Horng, Wen-Bing; Chen, Chun-Wen

2012-01-01

This paper reports on a study that investigates how different learning methods might affect the learning process of character handwriting among beginning college learners of Chinese, as measured by tests of recognition, approximate production, precise production, and awareness of conventional stroke sequence. Two methodologies were examined during…
Industrial Arts Curriculum Guide for Electricity/Electronics.

ERIC Educational Resources Information Center

Connecticut State Dept. of Education, Hartford. Div. of Vocational Education.

This curriculum provides a behaviorally written guide that offers a possible list of objectives to assist in establishing or revising an electrical/electronics curriculum. Teachers may choose specific objectives to suit age group and educational level or expertise. Introductory material describes the scope and sequence of an Industrial Arts…
Developing Compressed Beginning and Intermediate Algebra Courses

ERIC Educational Resources Information Center

Walker, Sylvia E.

2017-01-01

The purpose of this project was two-fold. First, it would provide an opportunity for students to complete the developmental math course sequence more quickly, thereby enabling students to proceed to a college-level mathematics course sooner. To accomplish this, the classroom was designed with computer-assisted homework courses that blended…
Development of the first consensus genetic map of intermediate wheatgrass (Thinopyrum intermedium) using genotyping-by-sequencing

USDA-ARS?s Scientific Manuscript database

Intermediate wheatgrass (Thinopyrum intermedium) has been identified as a candidate for domestication and improvement as a perennial grain, forage, and biofuel crop by several active breeding programs. To accelerate this process using genomics-assisted breeding, efficient genotyping methods and gen...
48 CFR 3.303 - Reporting suspected antitrust violations.

Code of Federal Regulations, 2010 CFR

2010-10-01

... turn in sequence as low bidder, or so that certain competitors bid low only on some sizes of contracts and high on other sizes; (5) Division of the market, so that certain competitors bid low only for..., Attention: Assistant Attorney General, Antitrust Division, and shall include— (1) A brief statement...
Metagenomic and near full-length 16S rRNA sequence data in support of the phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
UNEXPECTED SERIES OF REGULAR FREQUENCY SPACING OF δ SCUTI STARS IN THE NON-ASYMPTOTIC REGIME. I. THE METHODOLOGY

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paparó, M.; Benkő, J. M.; Hareter, M.

A sequence search method was developed to search the regular frequency spacing in δ Scuti stars through visual inspection and an algorithmic search. We searched for sequences of quasi-equally spaced frequencies, containing at least four members per sequence, in 90 δ Scuti stars observed by CoRoT . We found an unexpectedly large number of independent series of regular frequency spacing in 77 δ Scuti stars (from one to eight sequences) in the non-asymptotic regime. We introduce the sequence search method presenting the sequences and echelle diagram of CoRoT 102675756 and the structure of the algorithmic search. Four sequences (echelle ridges)more » were found in the 5–21 d{sup −1} region where the pairs of the sequences are shifted (between 0.5 and 0.59 d{sup −1}) by twice the value of the estimated rotational splitting frequency (0.269 d{sup −1}). The general conclusions for the whole sample are also presented in this paper. The statistics of the spacings derived by the sequence search method, by FT (Fourier transform of the frequencies), and the statistics of the shifts are also compared. In many stars more than one almost equally valid spacing appeared. The model frequencies of FG Vir and their rotationally split components were used to formulate the possible explanation that one spacing is the large separation while the other is the sum of the large separation and the rotational frequency. In CoRoT 102675756, the two spacings (2.249 and 1.977 d{sup −1}) are in better agreement with the sum of a possible 1.710 d{sup −1} large separation and two or one times, respectively, the value of the rotational frequency.« less
Beyond Linear Sequence Comparisons: The use of genome-levelcharacters for phylogenetic reconstruction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Boore, Jeffrey L.

2004-11-27

Although the phylogenetic relationships of many organisms have been convincingly resolved by the comparisons of nucleotide or amino acid sequences, others have remained equivocal despite great effort. Now that large-scale genome sequencing projects are sampling many lineages, it is becoming feasible to compare large data sets of genome-level features and to develop this as a tool for phylogenetic reconstruction that has advantages over conventional sequence comparisons. Although it is unlikely that these will address a large number of evolutionary branch points across the broad tree of life due to the infeasibility of such sampling, they have great potential for convincinglymore » resolving many critical, contested relationships for which no other data seems promising. However, it is important that we recognize potential pitfalls, establish reasonable standards for acceptance, and employ rigorous methodology to guard against a return to earlier days of scenario-driven evolutionary reconstructions.« less
The Use of Weighted Graphs for Large-Scale Genome Analysis

PubMed Central

Zhou, Fang; Toivonen, Hannu; King, Ross D.

2014-01-01

There is an acute need for better tools to extract knowledge from the growing flood of sequence data. For example, thousands of complete genomes have been sequenced, and their metabolic networks inferred. Such data should enable a better understanding of evolution. However, most existing network analysis methods are based on pair-wise comparisons, and these do not scale to thousands of genomes. Here we propose the use of weighted graphs as a data structure to enable large-scale phylogenetic analysis of networks. We have developed three types of weighted graph for enzymes: taxonomic (these summarize phylogenetic importance), isoenzymatic (these summarize enzymatic variety/redundancy), and sequence-similarity (these summarize sequence conservation); and we applied these types of weighted graph to survey prokaryotic metabolism. To demonstrate the utility of this approach we have compared and contrasted the large-scale evolution of metabolism in Archaea and Eubacteria. Our results provide evidence for limits to the contingency of evolution. PMID:24619061
Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

PubMed Central

2011-01-01

Background Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. Conclusion The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns. PMID:21599934
Computer aided identification of a Hevein-like antimicrobial peptide of bell pepper leaves for biotechnological use.

PubMed

Games, Patrícia Dias; daSilva, Elói Quintas Gonçalves; Barbosa, Meire de Oliveira; Almeida-Souza, Hebréia Oliveira; Fontes, Patrícia Pereira; deMagalhães, Marcos Jorge; Pereira, Paulo Roberto Gomes; Prates, Maura Vianna; Franco, Gloria Regina; Faria-Campos, Alessandra; Campos, Sérgio Vale Aguiar; Baracat-Pereira, Maria Cristina

2016-12-15

Antimicrobial peptides from plants present mechanisms of action that are different from those of conventional defense agents. They are under-explored but have a potential as commercial antimicrobials. Bell pepper leaves ('Magali R') are discarded after harvesting the fruit and are sources of bioactive peptides. This work reports the isolation by peptidomics tools, and the identification and partially characterization by computational tools of an antimicrobial peptide from bell pepper leaves, and evidences the usefulness of records and the in silico analysis for the study of plant peptides aiming biotechnological uses. Aqueous extracts from leaves were enriched in peptide by salt fractionation and ultrafiltration. An antimicrobial peptide was isolated by tandem chromatographic procedures. Mass spectrometry, automated peptide sequencing and bioinformatics tools were used alternately for identification and partial characterization of the Hevein-like peptide, named HEV-CANN. The computational tools that assisted to the identification of the peptide included BlastP, PSI-Blast, ClustalOmega, PeptideCutter, and ProtParam; conventional protein databases (DB) as Mascot, Protein-DB, GenBank-DB, RefSeq, Swiss-Prot, and UniProtKB; specific for peptides DB as Amper, APD2, CAMP, LAMPs, and PhytAMP; other tools included in ExPASy for Proteomics; The Bioactive Peptide Databases, and The Pepper Genome Database. The HEV-CANN sequence presented 40 amino acid residues, 4258.8 Da, theoretical pI-value of 8.78, and four disulfide bonds. It was stable, and it has inhibited the growth of phytopathogenic bacteria and a fungus. HEV-CANN presented a chitin-binding domain in their sequence. There was a high identity and a positive alignment of HEV-CANN sequence in various databases, but there was not a complete identity, suggesting that HEV-CANN may be produced by ribosomal synthesis, which is in accordance with its constitutive nature. Computational tools for proteomics and databases are not adjusted for short sequences, which hampered HEV-CANN identification. The adjustment of statistical tests in large databases for proteins is an alternative to promote the significant identification of peptides. The development of specific DB for plant antimicrobial peptides, with information about peptide sequences, functional genomic data, structural motifs and domains of molecules, functional domains, and peptide-biomolecule interactions are valuable and necessary.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.