Polymorphism in the Eruption Sequence of Primary Dentition: A Cross-sectional Study
Bhojraj, Nandlal; Narayanappa
2017-01-01
Introduction Primary teeth have shown wide variations in their eruption time among different populations. Population-specific eruption ages are provided as means with standard deviations or as median ages with their percentile ranges. These alone are insufficient for predicting tooth eruption sequence because they provide no information on the frequency of sequence variation within pairs of teeth. Norms of polymorphic variation in the eruption sequence can be more useful. Aim This study aims at providing norms for the sequence polymorphism in primary teeth among the children of the Mysore population. Materials and Methods A cross-sectional study was designed with 1392 children, recruited from December 2015 to June 2016 by simple random sampling. Each tooth was recorded as present or absent. Across all possible intra-quadrant tooth pairs, cases of present-present, absent-absent, present-absent, and absent-present were counted and computed as percentages. Results Sequence polymorphisms were more common in 82-84 pairs of teeth. Significant polymorphic reverse sequence was observed in 52-54 (9%), 82-84 (35%) in males and 82-84 (18%) in females. There was no polymorphism in the maxillary arch in females. Conclusion The present study provides baseline data values for sequence variation in primary teeth eruption. To the best of the investigators' knowledge, there are no previous studies describing the sequence polymorphism in primary teeth in the Indian population. The results of this study help in the assessment of eruption sequence problems in paediatric dentistry and in the evaluation and prediction of tooth eruption sequence in the individual child. PMID:28658912
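The pairwise tabulation described in the Materials and Methods of the record above can be sketched as follows. This is a minimal illustration and not the authors' procedure: the function name `tabulate_pair`, the one-dictionary-per-child record layout, and the toy data are all assumptions made for the example; teeth are labelled with the two-digit codes used in the abstract (e.g. 82, 84).

```python
from collections import Counter

STATES = ["present-present", "present-absent", "absent-present", "absent-absent"]

def tabulate_pair(children, tooth_a, tooth_b):
    """Return the percentage of children in each presence/absence state for one tooth pair.

    children: list of dicts mapping a tooth code to True (present) or False (absent).
    This record layout is hypothetical and chosen only for illustration.
    """
    counts = Counter()
    for record in children:
        a = "present" if record[tooth_a] else "absent"
        b = "present" if record[tooth_b] else "absent"
        counts[f"{a}-{b}"] += 1
    total = len(children)
    return {state: 100.0 * counts[state] / total for state in STATES}

# Toy data (hypothetical, not taken from the study).
toy_children = [
    {82: True, 84: True},
    {82: True, 84: False},
    {82: False, 84: True},
    {82: False, 84: False},
]
pair_percentages = tabulate_pair(toy_children, 82, 84)
```

In a tabulation of this kind, only the two discordant states carry ordering information: a child with one tooth of the pair present and the other absent shows which of the two erupted first.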
Okamoto, Hidehiko; Stracke, Henning; Lagemann, Lothar; Pantev, Christo
2010-01-01
The capability of involuntarily tracking certain sound signals during the simultaneous presence of noise is essential in human daily life. Previous studies have demonstrated that top-down auditory focused attention can enhance excitatory and inhibitory neural activity, resulting in sharpening of frequency tuning of auditory neurons. In the present study, we investigated bottom-up driven involuntary neural processing of sound signals in noisy environments by means of magnetoencephalography. We contrasted two sound signal sequencing conditions: "constant sequencing" versus "random sequencing." Based on a pool of 16 different frequencies, either identical (constant sequencing) or pseudorandomly chosen (random sequencing) test frequencies were presented blockwise together with band-eliminated noises to nonattending subjects. The results demonstrated that the auditory evoked fields elicited in the constant sequencing condition were significantly enhanced compared with the random sequencing condition. However, the enhancement was not significantly different between different band-eliminated noise conditions. Thus the present study confirms that by constant sound signal sequencing under nonattentive listening the neural activity in human auditory cortex can be enhanced, but not sharpened. Our results indicate that bottom-up driven involuntary neural processing may mainly amplify excitatory neural networks, but may not effectively enhance inhibitory neural circuits.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.
Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
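One of the gap-sequence properties named in the record above, GC content, is simple to compute. The sketch below is illustrative only and does not reproduce the authors' pipeline: the function name `gc_content` and the choice to ignore ambiguous bases such as N are assumptions.

```python
def gc_content(sequence):
    """Return the fraction of G and C among unambiguous bases (A, C, G, T).

    Ambiguous symbols such as N are ignored; this is an assumption made for
    the example, not a rule stated in the abstract.
    """
    seq = sequence.upper()
    unambiguous = sum(seq.count(base) for base in "ACGT")
    if unambiguous == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / unambiguous

# Hypothetical gap sequence, for illustration only.
example_gc = gc_content("GGCCAATT")
```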
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.
Sakai, Ryo; Aerts, Jan
2014-01-01
The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Setoh, Yin Xiang; Amarilla, Alberto A; Peng, Nias Y; Slonchak, Andrii; Periasamy, Parthiban; Figueiredo, Luiz T M; Aquino, Victor H; Khromykh, Alexander A
2018-01-01
Rocio virus (ROCV) is an arbovirus belonging to the genus Flavivirus, family Flaviviridae. We present an updated sequence of ROCV strain SPH 34675 (GenBank: AY632542.4), the only available full genome sequence prior to this study. Using next-generation sequencing of the entire genome, we reveal substantial sequence variation from the prototype sequence, with 30 nucleotide differences amounting to 14 amino acid changes, as well as significant changes to predicted 3'UTR RNA structures. Our results present an updated and corrected sequence of a potential emerging human-virulent flavivirus uniquely indigenous to Brazil (GenBank: MF461639).
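Counting nucleotide differences between a prototype and an updated sequence, as reported in the record above, amounts to a position-by-position comparison once the two sequences are aligned. The sketch below assumes gap-free sequences of equal length; the function name `count_differences` and the toy sequences are hypothetical and unrelated to the ROCV genome.

```python
def count_differences(seq_a, seq_b):
    """Count positions at which two aligned, equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(seq_a.upper(), seq_b.upper()) if x != y)

# Toy sequences (hypothetical).
n_differences = count_differences("ATGGCCATT", "ATGGTCATC")
```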
Jongsma, Marijtje L A; Gerrits, Niels J H M; van Rijn, Clementina M; Quiroga, Rodrigo Quian; Maes, Joseph H R
2012-07-01
The aim of this study was to track recall performance and event-related potentials (ERPs) across multiple trials in a digit-learning task. When a sequence is practiced by repetition, the number of errors typically decreases and a learning curve emerges. Until now, almost all ERP learning and memory research has focused on effects after a single presentation and, therefore, fails to capture the dynamic changes that characterize a learning process. However, the current study used a free-recall task in which a sequence of ten auditory digits was presented repeatedly. Auditory sequences of ten digits were presented in a logical order (control sequences) or in a random order (experimental sequences). Each sequence was presented six times. Participants had to reproduce the sequence after each presentation. EEG recordings were made at the time of the digit presentations. Recall performance for the control sequences was close to asymptote right after the first learning trial, whereas performance for the experimental sequences initially displayed primacy and recency effects. However, these latter effects gradually disappeared over the six repetitions, resulting in near-asymptotic recall performance for all digits. The performance improvement for the middle items of the list was accompanied by an increase in P300 amplitude, implying a close correspondence between this ERP component and the behavioral data. These results, which were discussed in the framework of theories on the functional significance of the P300 amplitude, add to the scarce empirical data on the dynamics of ERP responses in the process of intentional learning. Copyright © 2011 Elsevier B.V. All rights reserved.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.
This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...
2017-07-18
This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies
Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.
2017-01-01
This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Sharifdini, Meysam; Heidari, Zahra; Hesari, Zahra; Vatandoost, Sajad; Kia, Eshrat Beigom
2017-06-01
The present study was performed to analyze molecularly the phylogenetic positions of human-infecting Trichostrongylus species in Mazandaran Province, Iran, which is an endemic area for trichostrongyliasis. DNA from 7 Trichostrongylus-infected stool samples was extracted using an in-house (IH) method. PCR amplification of the ITS2-rDNA region was performed, and products were sequenced. Phylogenetic analysis of the nucleotide sequence data was performed using MEGA 5.0 software. Six out of 7 isolates had high similarity with Trichostrongylus colubriformis, while the other one showed high homology with Trichostrongylus axei registered in GenBank reference sequences. Intra-specific variations within isolates of T. colubriformis and T. axei amounted to 0-1.8% and 0-0.6%, respectively. Trichostrongylus species obtained in the present study were in a cluster with the relevant reference sequences from previous studies. BLAST analysis indicated that there was 100% homology among all 6 ITS2 sequences of T. colubriformis in the present study and most previously registered sequences of T. colubriformis from human, sheep, and goat isolates from Iran and also human isolates from Laos, Thailand, and France. The ITS2 sequence of T. axei exhibited 99.4% homology with the human isolate of T. axei from Thailand, sheep isolates from New Zealand and Iran, and a cattle isolate from the USA.
ERIC Educational Resources Information Center
Gounard, Beverley Roberts
This paper summarizes two studies which examine children's free recall of letter sequences in an auditory presentation. In both studies, sequences of six or eight letters were presented to 80 third-grade and 80 eighth-grade pupils, at the rate of one item every other second or four items per second. In the first study, where recall was either…
TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders
Tarr, D Ellen K; Alexander, Helen M
2009-01-01
Background Plant resistance (R) gene products recognize pathogen effector molecules. Many R genes code for proteins containing nucleotide binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. NBS-LRR proteins can be divided into two groups, TIR-NBS-LRR and non-TIR-NBS-LRR, based on the structure of the N-terminal domain. Although both classes are clearly present in gymnosperms and eudicots, only non-TIR sequences have been found consistently in monocots. Since most studies in monocots have been limited to agriculturally important grasses, it is difficult to draw conclusions. The purpose of our study was to look for evidence of these sequences in additional monocot orders. Findings Using degenerate PCR, we amplified NBS sequences from four monocot species (C. blanda, D. marginata, S. trifasciata, and Spathiphyllum sp.), a gymnosperm (C. revoluta) and a eudicot (C. canephora). We successfully amplified TIR-NBS-LRR sequences from dicot and gymnosperm DNA, but not from monocot DNA. Using databases, we obtained NBS sequences from additional monocots, magnoliids and basal angiosperms. TIR-type sequences were not present in monocot or magnoliid sequences, but were present in the basal angiosperms. Phylogenetic analysis supported a single TIR clade and multiple non-TIR clades. Conclusion We were unable to find monocot TIR-NBS-LRR sequences by PCR amplification or database searches. In contrast to previous studies, our results represent five monocot orders (Poales, Zingiberales, Arecales, Asparagales, and Alismatales). Our results establish the presence of TIR-NBS-LRR sequences in basal angiosperms and suggest that although these sequences were present in early land plants, they have been reduced significantly in monocots and magnoliids. PMID:19785756
ERIC Educational Resources Information Center
Gaubatz, Julie
2013-01-01
Studies of high-school science course sequences have been limited primarily to a small number of site-specific investigations comparing traditional science sequences (e.g., Biology-Chemistry-Physics: BCP) to various Physics First-influenced sequences (Physics-Chemistry-Biology: PCB). The present study summarizes a five-year program evaluation…
Rueda, P; Morón, G; Sarraseca, J; Leclerc, C; Casal, J I
2004-03-01
We have previously developed an antigen-delivery system based on hybrid recombinant porcine parvovirus-like particles (PPV-VLPs) formed by the self-assembly of the VP2 protein of PPV carrying a foreign epitope at its N terminus. In this study, different constructs were made containing a CD8(+) T-cell epitope of chicken ovalbumin (OVA) to analyse the influence of the sequence inserted into VP2 on the correct processing of VLPs by antigen-presenting cells. We analysed the presentation of the OVA epitope inserted without flanking sequences or with either different natural flanking sequences or with the natural flanking sequences of a CD8(+) T-cell epitope from the lymphocytic choriomeningitis virus nucleoprotein, and as a dimer with or without linker sequences. All constructs were studied in terms of level of expression, assembly of VLPs and ability to deliver the inserted epitope into the MHC I pathway. The presentation of the OVA epitope was considerably improved by insertion of short natural flanking sequences, which indicated the relevance of the flanking sequences on the processing of PPV-VLPs. Only PPV-VLPs carrying two copies of the OVA epitope linked by two glycines were able to be properly processed, suggesting that the introduction of flexible residues between the two consecutive OVA epitopes may be necessary for the correct presentation of these dimers by PPV-VLPs. These results provide information to improve the insertion of epitopes into PPV-VLPs to facilitate their processing and presentation by MHC class I molecules.
ERIC Educational Resources Information Center
Wiles, Clyde A.
The study's purpose was to investigate the differential effects on the achievement of second-grade students that could be attributed to three instructional sequences for the learning of the addition and subtraction algorithms. One sequence presented the addition algorithm first (AS), the second presented the subtraction algorithm first (SA), and…
Recombinative Generalization: An Exploratory Study in Musical Reading
Perez, William Ferreira; de Rose, Julio C
2010-01-01
The present study aimed to extend the findings of recombinative generalization research in alphabetical reading and spelling to the context of musical reading. One participant was taught to respond discriminatively to six two-note sequences, choosing the corresponding notation on the staff in the presence of each sequence. When novel three- and four-note sequences were presented, she selected the corresponding notation. These results suggest the generality of previous research to the context of musical teaching. PMID:22477462
The Processing on Different Types of English Formulaic Sequences
ERIC Educational Resources Information Center
Qian, Li
2015-01-01
Formulaic sequences are found to be processed faster than their matched novel phrases in previous studies. Given the variety of formulaic types, few studies have compared processing on different types of formulaic sequences. The present study explored the processing among idioms, speech formulae and written formulae. It has been found that in…
Picture or Text First? Explaining Sequence Effects When Learning with Pictures and Text
ERIC Educational Resources Information Center
Eitel, Alexander; Scheiter, Katharina
2015-01-01
The present article reviews 42 studies investigating the role of sequencing of text and pictures for learning outcomes. Whereas several of the reviewed studies revealed better learning outcomes from presenting the picture before the text rather than after it, other studies demonstrated the opposite effect. Against the backdrop of theories on…
Learning of goal-relevant and -irrelevant complex visual sequences in human V1.
Rosenthal, Clive R; Mallik, Indira; Caballero-Gaudes, Cesar; Sereno, Martin I; Soto, David
2018-06-12
Learning and memory are supported by a network involving the medial temporal lobe and linked neocortical regions. Emerging evidence indicates that primary visual cortex (i.e., V1) may contribute to recognition memory, but this has been tested only with a single visuospatial sequence as the target memorandum. The present study used functional magnetic resonance imaging to investigate whether human V1 can support the learning of multiple, concurrent complex visual sequences involving discontinuous (second-order) associations. Two peripheral, goal-irrelevant but structured sequences of orientated gratings appeared simultaneously in fixed locations of the right and left visual fields alongside a central, goal-relevant sequence that was in the focus of spatial attention. Pseudorandom sequences were introduced at multiple intervals during the presentation of the three structured visual sequences to provide an online measure of sequence-specific knowledge at each retinotopic location. We found that a network involving the precuneus and V1 was involved in learning the structured sequence presented at central fixation, whereas right V1 was modulated by repeated exposure to the concurrent structured sequence presented in the left visual field. The same result was not found in left V1. These results indicate for the first time that human V1 can support the learning of multiple concurrent sequences involving complex discontinuous inter-item associations, even peripheral sequences that are goal-irrelevant. Copyright © 2018. Published by Elsevier Inc.
A Paradox within the Time Value of Money: A Critical Thinking Exercise for Finance Students
ERIC Educational Resources Information Center
Delaney, Charles J.; Rich, Steven P.; Rose, John T.
2016-01-01
This study presents a paradox within the time value of money (TVM), namely, that the interest-principal sequence embedded in the payment stream of an amortized loan is exactly the opposite of the interest-principal sequence implicit in the present value of a matching annuity. We examine this inverse sequence, both mathematically and intuitively,…
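The inverse sequence named in the record above can be illustrated numerically under standard textbook definitions. The sketch below is one reading of the abstract, not the authors' derivation: it assumes a level-payment loan whose opening balance equals the present value of the matching ordinary annuity, and the function names, payment, rate, and term are hypothetical.

```python
def amortization_principal(payment, rate, periods):
    """Principal repaid in each period of a level-payment amortized loan.

    The opening balance is set to the present value of the payment stream,
    so the loan is fully repaid after `periods` payments.
    """
    balance = payment * (1 - (1 + rate) ** -periods) / rate
    principal_parts = []
    for _ in range(periods):
        interest = balance * rate          # interest accrued on the outstanding balance
        principal = payment - interest     # remainder of the payment reduces the balance
        principal_parts.append(principal)
        balance -= principal
    return principal_parts

def annuity_present_values(payment, rate, periods):
    """Present value of each payment of an ordinary annuity (paid at period end)."""
    return [payment / (1 + rate) ** k for k in range(1, periods + 1)]

# Hypothetical figures: 5 payments of 100 at 5% per period.
loan_principal = amortization_principal(100.0, 0.05, 5)
annuity_pv = annuity_present_values(100.0, 0.05, 5)
```

Under these assumptions, the principal repaid in period k of the loan equals the present value of the annuity payment due in period n - k + 1, so the two lists hold the same numbers in opposite order: the loan's principal portions grow over time while the annuity's discounted values shrink.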
ERIC Educational Resources Information Center
Chevalier, Nicolas; James, Tiffany D.; Wiebe, Sandra A.; Nelson, Jennifer Mize; Espy, Kimberly Andrews
2014-01-01
The present study addressed whether developmental improvement in working memory span task performance relies upon a growing ability to proactively plan response sequences during childhood. Two hundred thirteen children completed a working memory span task in which they used a touchscreen to reproduce orally presented sequences of animal names.…
ERIC Educational Resources Information Center
Du, Wenchong; Kelly, Steve W.
2013-01-01
The present study examines implicit sequence learning in adult dyslexics with a focus on comparing sequence transitions with different statistical complexities. Learning of a 12-item deterministic sequence was assessed in 12 dyslexic and 12 non-dyslexic university students. Both groups showed equivalent standard reaction time increments when the…
Porter, Danielle P.; Daeumer, Martin; Thielen, Alexander; Chang, Silvia; Martin, Ross; Cohen, Cal; Miller, Michael D.; White, Kirsten L.
2015-01-01
At Week 96 of the Single-Tablet Regimen (STaR) study, more treatment-naïve subjects that received rilpivirine/emtricitabine/tenofovir DF (RPV/FTC/TDF) developed resistance mutations compared to those treated with efavirenz (EFV)/FTC/TDF by population sequencing. Furthermore, more RPV/FTC/TDF-treated subjects with baseline HIV-1 RNA >100,000 copies/mL developed resistance compared to subjects with baseline HIV-1 RNA ≤100,000 copies/mL. Here, deep sequencing was utilized to assess the presence of pre-existing low-frequency variants in subjects with and without resistance development in the STaR study. Deep sequencing (Illumina MiSeq) was performed on baseline and virologic failure samples for all subjects analyzed for resistance by population sequencing during the clinical study (n = 33), as well as baseline samples from control subjects with virologic response (n = 118). Primary NRTI or NNRTI drug resistance mutations present at low frequency (≥2% to 20%) were detected in 6.6% of baseline samples by deep sequencing, all of which occurred in control subjects. Deep sequencing results were generally consistent with population sequencing but detected additional primary NNRTI and NRTI resistance mutations at virologic failure in seven samples. HIV-1 drug resistance mutations emerging while on RPV/FTC/TDF or EFV/FTC/TDF treatment were not present at low frequency at baseline in the STaR study. PMID:26690199
Porter, Danielle P; Daeumer, Martin; Thielen, Alexander; Chang, Silvia; Martin, Ross; Cohen, Cal; Miller, Michael D; White, Kirsten L
2015-12-07
At Week 96 of the Single-Tablet Regimen (STaR) study, more treatment-naïve subjects that received rilpivirine/emtricitabine/tenofovir DF (RPV/FTC/TDF) developed resistance mutations compared to those treated with efavirenz (EFV)/FTC/TDF by population sequencing. Furthermore, more RPV/FTC/TDF-treated subjects with baseline HIV-1 RNA >100,000 copies/mL developed resistance compared to subjects with baseline HIV-1 RNA ≤100,000 copies/mL. Here, deep sequencing was utilized to assess the presence of pre-existing low-frequency variants in subjects with and without resistance development in the STaR study. Deep sequencing (Illumina MiSeq) was performed on baseline and virologic failure samples for all subjects analyzed for resistance by population sequencing during the clinical study (n = 33), as well as baseline samples from control subjects with virologic response (n = 118). Primary NRTI or NNRTI drug resistance mutations present at low frequency (≥2% to 20%) were detected in 6.6% of baseline samples by deep sequencing, all of which occurred in control subjects. Deep sequencing results were generally consistent with population sequencing but detected additional primary NNRTI and NRTI resistance mutations at virologic failure in seven samples. HIV-1 drug resistance mutations emerging while on RPV/FTC/TDF or EFV/FTC/TDF treatment were not present at low frequency at baseline in the STaR study.
Skilled memory in expert figure skaters.
Deakin, J M; Allard, F
1991-01-01
The present studies extend skilled-memory theory to a domain involving the performance of motor sequences. Skilled figure skaters were better able than their less skilled counterparts to perform short skating sequences that were choreographed, rather than randomly constructed. Expert skaters encoded sequences for performance very differently from the way in which they encoded sequences that were verbally presented for verbal recall. Tasks interpolated between sequence and recall showed no significant influence on recall accuracy, implicating long-term memory in skating memory. There was little evidence for the use of retrieval structures when skaters learned the brief sequences used throughout these studies. Finally, expert skaters were able to judge the similarity of two skating elements faster than less skilled skaters, indicating a faster access to semantic memory for experts. The data indicate that skaters show many of the same skilled-memory characteristics as have been described in other skill domains involving memorization, such as digit span and memory for dinner orders.
Behera, Bijay Kumar; Kumari, Kavita; Baisvar, Vishwamitra Singh; Rout, Ajaya Kumar; Pakrashi, Sudip; Paria, Prasenjet; Jena, J K
2017-01-01
In the present study, the complete mitochondrial genome sequence of Labeo gonius is reported, obtained using a PGM sequencer (Ion Torrent). The complete mitogenome of L. gonius, 16 614 bp in length, was obtained by de novo assembly of genomic reads using the Torrent Mapping Alignment Program (TMAP). The mitogenome of L. gonius comprises 13 protein-coding genes, 22 tRNAs, 2 rRNA genes, and a D-loop control region, with gene order and organization similar to those of most other fish mitogenomes in the NCBI databases. The mitogenome in the present study has 99% similarity to the complete mitogenome sequence of Labeo fimbriatus, as reported earlier. The phylogenetic analysis of Cypriniformes showed that their mitogenomes are closely related to each other. The complete mitogenome sequence of L. gonius would be helpful in understanding the population genetics, phylogenetics, and evolution of Indian carps.
Repair Sequences in Dysarthric Conversational Speech: A Study in Interactional Phonetics
ERIC Educational Resources Information Center
Rutter, Ben
2009-01-01
This paper presents some findings from a case study of repair sequences in conversations between a dysarthric speaker, Chris, and her interactional partners. It adopts the methodology of interactional phonetics, where turn design, sequence organization, and variation in phonetic parameters are analysed in unison. The analysis focused on the use of…
Uribe-Convers, Simon; Duke, Justin R.; Moore, Michael J.; Tank, David C.
2014-01-01
• Premise of the study: We present an alternative approach for molecular systematic studies that combines long PCR and next-generation sequencing. Our approach can be used to generate templates from any DNA source for next-generation sequencing. Here we test our approach by amplifying complete chloroplast genomes, and we present a set of 58 potentially universal primers for angiosperms to do so. Additionally, this approach is likely to be particularly useful for nuclear and mitochondrial regions. • Methods and Results: Chloroplast genomes of 30 species across angiosperms were amplified to test our approach. Amplification success varied depending on whether PCR conditions were optimized for a given taxon. To further test our approach, some amplicons were sequenced on an Illumina HiSeq 2000. • Conclusions: Although here we tested this approach by sequencing plastomes, long PCR amplicons could be generated using DNA from any genome, expanding the possibilities of this approach for molecular systematic studies. PMID:25202592
Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.
2015-01-01
This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Genome Sequence of Lactobacillus plantarum Strain UCMA 3037.
Naz, Saima; Tareb, Raouf; Bernardeau, Marion; Vaisse, Melissa; Lucchetti-Miganeh, Celine; Rechenmann, Mathias; Vernoux, Jean-Paul
2013-05-23
Nucleic acid of the strain Lactobacillus plantarum UCMA 3037, isolated from raw milk camembert cheese in our laboratory, was sequenced. We present its draft genome sequence with the aim of studying its functional properties and relationship to the cheese ecosystem.
ERIC Educational Resources Information Center
Penrod, Becky; Gardella, Laura; Fernand, Jonathan
2012-01-01
Few studies have examined the effects of the high-probability instructional sequence in the treatment of food selectivity, and results of these studies have been mixed (e.g., Dawson et al., 2003; Patel et al., 2007). The present study extended previous research on the high-probability instructional sequence by combining this procedure with…
Pre-Attentive Auditory Processing of Lexicality
ERIC Educational Resources Information Center
Jacobsen, Thomas; Horvath, Janos; Schroger, Erich; Lattner, Sonja; Widmann, Andreas; Winkler, Istvan
2004-01-01
The effects of lexicality on auditory change detection based on auditory sensory memory representations were investigated by presenting oddball sequences of repeatedly presented stimuli, while participants ignored the auditory stimuli. In a cross-linguistic study of Hungarian and German participants, stimulus sequences were composed of words that…
Incidental Sequence Learning across the Lifespan
ERIC Educational Resources Information Center
Weiermann, Brigitte; Meier, Beat
2012-01-01
The purpose of the present study was to investigate incidental sequence learning across the lifespan. We tested 50 children (aged 7-16), 50 young adults (aged 20-30), and 50 older adults (aged >65) with a sequence learning paradigm that involved both a task and a response sequence. After several blocks of practice, all age groups slowed down…
Characterization of Austrian koi herpesvirus samples based on the ORF40 region.
Marek, A; Schachner, O; Bilic, I; Hess, M
2010-02-17
Using a PCR that amplifies a region of the thymidine kinase (TK) gene, an epidemic spread of koi herpesvirus (KHV) was determined in koi carps in Austria in 2007. A total of 15 virus samples from different locations in Austria were analyzed to determine their genetic relatedness following PCR and nucleic acid sequencing of the open reading frame 40 (ORF40) region of the KHV genome. ORF40-specific PCR amplification products that were obtained from tissue samples shared 100% nucleotide sequence identity with the published sequence of the Japanese strain of KHV. The ORF40 sequence of one isolate from the UK that was included in the present study was 100% identical with the published sequence of an Israeli strain of KHV. This is the first study that used a larger number of samples and a PCR method, which allowed distinguishing all 3 strains of KHV. The present investigation provides information on the epidemiology of KHV infections in Europe and describes a useful molecular tool for epidemiological studies.
Bounds on the cross-correlation functions of state m-sequences
NASA Astrophysics Data System (ADS)
Woodcock, C. F.; Davies, Phillip A.; Shaar, Ahmed A.
1987-03-01
Lower and upper bounds on the peaks of the periodic Hamming cross-correlation function for state m-sequences, which are often used in frequency-hopped spread-spectrum systems, are derived. The state position mapped (SPM) sequences of the state m-sequences are described. The use of SPM sequences for OR-channel code division multiplexing is studied. The relation between the Hamming cross-correlation function and the correlation function of SPM sequence is examined. Numerical results which support the theoretical data are presented.
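The quantity bounded in the abstract above can be illustrated with a minimal sketch. It assumes the usual definition of the periodic Hamming cross-correlation, in which two hopping sequences of equal period are compared slot by slot at every cyclic shift and coincidences are counted. The function name and the toy sequences are hypothetical and are not taken from the paper.

```python
def hamming_cross_correlation(x, y):
    """Return [H(0), ..., H(L-1)], the number of coincidences between x and
    each cyclic shift of y, for two sequences sharing the period L."""
    if len(x) != len(y):
        raise ValueError("sequences must share the same period")
    period = len(x)
    return [
        sum(1 for i in range(period) if x[i] == y[(i + shift) % period])
        for shift in range(period)
    ]


# Toy hopping patterns over frequency slots 0..3 (illustrative values only).
x = [0, 1, 2, 3, 1, 0, 2]
y = [1, 2, 3, 0, 0, 2, 1]

profile = hamming_cross_correlation(x, y)
peak = max(profile)  # the peak value is what lower and upper bounds constrain
```

A low peak means few simultaneous hits on the same frequency slot, which is the property that matters when several users share a frequency-hopped channel.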
Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico
2018-02-01
To show how to generate a consensus sequence from massively parallel sequencing data obtained from routine HIV anti-retroviral resistance studies that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap value of 88% (IQR 83.5-95.5). Association increased to 36/62 sequences, with a median bootstrap value of 94% (IQR 85.5-98), using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated and a median bootstrap value of 99% (IQR 98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at a 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
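The effect of the threshold can be sketched as follows. This is not the Mesquite procedure used by the authors, whose internals the abstract does not describe; it is a minimal illustration that assumes gap-free aligned reads containing only A, C, G and T, keeps every base whose frequency in a column reaches the threshold, and writes the kept set as a standard IUPAC nucleotide code. The function name and example reads are hypothetical.

```python
from collections import Counter

# Standard IUPAC nucleotide ambiguity codes, keyed by the set of bases.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def threshold_consensus(reads, threshold):
    """Build a consensus from equal-length aligned reads, keeping at each
    position every base whose frequency is at least `threshold`."""
    length = len(reads[0])
    if any(len(read) != length for read in reads):
        raise ValueError("reads must be aligned to the same length")
    consensus = []
    for column in zip(*reads):
        counts = Counter(column)
        kept = frozenset(
            base for base, n in counts.items() if n / len(column) >= threshold
        )
        # Fall back to N when no base (or an unexpected symbol) qualifies.
        consensus.append(IUPAC.get(kept, "N"))
    return "".join(consensus)


# 20 toy reads: the second position is C in 85% of reads and T in 15%.
reads = ["ACGT"] * 17 + ["ATGT"] * 3
low = threshold_consensus(reads, 0.10)   # minority T is retained as Y
high = threshold_consensus(reads, 0.20)  # minority T falls below threshold
```

A higher threshold removes minority variants from the consensus, so it carries fewer ambiguity codes and more closely resembles what population (Sanger) sequencing would report, which is consistent with the stronger NGS-Sanger association reported at 20%.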
Proteome Studies of Filamentous Fungi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baker, Scott E.; Panisko, Ellen A.
2011-04-20
The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide breadth of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, non-gel based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of different variations on the general method and technologies for identifying peptides in a given sample. We present a method that can serve as a “baseline” for proteomic studies of fungi.
Proteome studies of filamentous fungi.
Baker, Scott E; Panisko, Ellen A
2011-01-01
The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, nongel-based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of variations on the general methods and technologies for identifying peptides in a given sample. We present a method that can serve as a "baseline" for proteomic studies of fungi.
Function-Based Algorithms for Biological Sequences
ERIC Educational Resources Information Center
Mohanty, Pragyan Sheela P.
2015-01-01
Two problems at two different abstraction levels of computational biology are studied. At the molecular level, efficient pattern matching algorithms in DNA sequences are presented. For gene order data, an efficient data structure is presented capable of storing all gene re-orderings in a systematic manner. A common characteristic of presented…
Congenital amusia: a short-term memory deficit for non-verbal, but not verbal sounds.
Tillmann, Barbara; Schulze, Katrin; Foxton, Jessica M
2009-12-01
Congenital amusia refers to a lifelong disorder of music processing and is linked to pitch-processing deficits. The present study investigated congenital amusics' short-term memory for tones, musical timbres and words. Sequences of five events (tones, timbres or words) were presented in pairs and participants had to indicate whether the sequences were the same or different. The performance of congenital amusics confirmed a memory deficit for tone sequences, but showed normal performance for word sequences. For timbre sequences, amusics' memory performance was impaired in comparison to matched controls. Overall timbre performance was found to be correlated with melodic contour processing (as assessed by the Montreal Battery of Evaluation of Amusia). The present findings show that amusics' deficits extend to non-verbal sound material other than pitch, in this case timbre, while not affecting memory for verbal material. This is in line with previous suggestions about the domain-specificity of congenital amusia.
Cantalupo, Paul G.; Katz, Joshua P.
2015-01-01
ABSTRACT We searched The Cancer Genome Atlas (TCGA) database for viruses by comparing non-human reads present in transcriptome sequencing (RNA-Seq) and whole-exome sequencing (WXS) data to viral sequence databases. Human papillomavirus 18 (HPV18) is an etiologic agent of cervical cancer, and as expected, we found robust expression of HPV18 genes in cervical cancer samples. In agreement with previous studies, we also found HPV18 transcripts in non-cervical cancer samples, including those from the colon, rectum, and normal kidney. However, in each of these cases, HPV18 gene expression was low, and single-nucleotide variants and positions of genomic alignments matched the integrated portion of HPV18 present in HeLa cells. Chimeric reads that match a known virus-cell junction of HPV18 integrated in HeLa cells were also present in some samples. We hypothesize that HPV18 sequences in these non-cervical samples are due to nucleic acid contamination from HeLa cells. This finding highlights the problems that contamination presents in computational virus detection pipelines. IMPORTANCE Viruses associated with cancer can be detected by searching tumor sequence databases. Several studies involving searches of the TCGA database have reported the presence of HPV18, a known cause of cervical cancer, in a small number of additional cancers, including those of the rectum, kidney, and colon. We have determined that the sequences related to HPV18 in non-cervical samples are due to nucleic acid contamination from HeLa cells. To our knowledge, this is the first report of the misidentification of viruses in next-generation sequencing data of tumors due to contamination with a cancer cell line. These results raise awareness of the difficulty of accurately identifying viruses in human sequence databases. PMID:25631090
Learning of Sensory Sequences in Cerebellar Patients
ERIC Educational Resources Information Center
Frings, Markus; Boenisch, Raoul; Gerwig, Marcus; Diener, Hans-Christoph; Timmann, Dagmar
2004-01-01
A possible role of the cerebellum in detecting and recognizing event sequences has been proposed. The present study sought to determine whether patients with cerebellar lesions are impaired in the acquisition and discrimination of sequences of sensory stimuli of different modalities. A group of 26 cerebellar patients and 26 controls matched for…
Wu, Fengnian; Jiang, Hongyan; Beattie, G Andrew C; Holford, Paul; Chen, Jianchi; Wallis, Christopher M; Zheng, Zheng; Deng, Xiaoling; Cen, Yijing
2018-04-24
Diaphorina citri (Asian citrus psyllid; ACP) transmits 'Candidatus Liberibacter asiaticus' associated with citrus Huanglongbing (HLB). ACP has been reported in 11 provinces/regions in China, yet its population diversity remains unclear. In this study, we evaluated ACP population diversity in China using representative whole mitochondrial genome (mitogenome) sequences. Additional mitogenome sequences outside China were also acquired and evaluated. The sizes of the 27 ACP mitogenome sequences ranged from 14 986 to 15 030 bp. Along with three previously published mitogenome sequences, the 30 sequences formed three major mitochondrial groups (MGs): MG1, present in southwestern China and occurring at elevations above 1000 m; MG2, present in southeastern China and Southeast Asia (Cambodia, Indonesia, Malaysia, and Vietnam) and occurring at elevations below 180 m; and MG3, present in the USA and Pakistan. Single nucleotide polymorphisms in five genes (cox2, atp8, nad3, nad1 and rrnL) contributed most to ACP diversity. Among these genes, rrnL had the most variation. Mitogenome sequence analyses revealed two major phylogenetic groups of ACP present in China as well as a possible unique group currently present in Pakistan and the USA. The information could have significant implications for current ACP control and HLB management. © 2018 Society of Chemical Industry.
NASA Astrophysics Data System (ADS)
Zhang, Xiao-Yong; Wang, Guang-Hua; Xu, Xin-Ya; Nong, Xu-Hua; Wang, Jie; Amin, Muhammad; Qi, Shu-Hua
2016-10-01
The present study investigated the fungal diversity in four different deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing of the nuclear ribosomal internal transcribed spacer-1 (ITS1). A total of 40,297 fungal ITS1 sequences clustered into 420 operational taxonomic units (OTUs) with 97% sequence similarity and 170 taxa were recovered from these sediments. Most ITS1 sequences (78%) belonged to the phylum Ascomycota, followed by Basidiomycota (17.3%), Zygomycota (1.5%) and Chytridiomycota (0.8%), and a small proportion (2.4%) belonged to unassigned fungal phyla. Compared with previous studies on the fungal diversity of deep-sea sediments using culture-dependent approaches and clone library analysis, the present results suggest that Illumina sequencing has dramatically accelerated the discovery of fungal communities in deep-sea sediments. Furthermore, our results revealed that Sordariomycetes was the most diverse and abundant fungal class in this study, challenging the traditional view that the diversity of Sordariomycetes phylotypes is low in deep-sea environments. In addition, more than 12 taxa, accounting for 21.5% of sequences, have rarely been reported as deep-sea fungi, suggesting that the deep-sea sediments from Okinawa Trough harbor a plethora of fungal communities different from those of other deep-sea environments. To our knowledge, this study is the first exploration of the fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing.
Chrobak, Adrian Andrzej; Siuda-Krzywicka, Katarzyna; Siwek, Grzegorz Przemysław; Tereszko, Anna; Janeczko, Weronika; Starowicz-Filip, Anna; Siwek, Marcin; Dudek, Dominika
2017-10-03
Impairment of implicit motor sequence learning was shown in schizophrenia (SZ) and, most recently, in bipolar disorder (BD), and was connected to cerebellar abnormalities. The goal of this study was to compare implicit motor sequence learning in BD and SZ. We examined 33 patients with BD, 33 patients with SZ and 31 healthy controls using an ambidextrous Serial Reaction Time Task (SRTT), which allows exploring asymmetries in performance depending on the hand used. BD and SZ patients presented impaired implicit motor sequence learning, although the pattern of their impairments was different. While BD patients showed no signs of implicit motor sequence learning for both hands, the SZ group presented some features of motor learning when performing with the right, but not with the left hand. To the best of our knowledge this is the first study comparing implicit motor sequence learning in BD and SZ. We show that both diseases share impairments in this domain; however, in the case of SZ this impairment depends on the hand performing the SRTT. We propose that implicit motor sequence learning impairments constitute an overlapping symptom in BD and SZ and suggest further neuroimaging studies to verify cerebellar underpinnings as its cause. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Rau, M. A.; Aleven, V.; Rummel, N.; Pardos, Z.
2014-01-01
Providing learners with multiple representations of learning content has been shown to enhance learning outcomes. When multiple representations are presented across consecutive problems, we have to decide in what sequence to present them. Prior research has demonstrated that interleaving "tasks types" (as opposed to blocking them) can…
Liu, Guo-Hua; Li, Chun; Li, Jia-Yuan; Zhou, Dong-Hui; Xiong, Rong-Chuan; Lin, Rui-Qing; Zou, Feng-Cai; Zhu, Xing-Quan
2012-01-01
Sparganosis, caused by the plerocercoid larvae of members of the genus Spirometra, can cause significant public health problems and considerable economic losses. In the present study, the complete mitochondrial DNA (mtDNA) sequence of Spirometra erinaceieuropaei from China was determined, characterized and compared with that of S. erinaceieuropaei from Japan. The gene arrangement in the mt genome sequences of S. erinaceieuropaei from China and Japan is identical. The identity of the mt genomes was 99.1% between S. erinaceieuropaei from China and Japan, and the complete mtDNA sequence of S. erinaceieuropaei from China is slightly shorter (2 bp) than that from Japan. Phylogenetic analysis of S. erinaceieuropaei with other representative cestodes using two different computational algorithms [Bayesian inference (BI) and maximum likelihood (ML)] based on concatenated amino acid sequences of 12 protein-coding genes revealed that S. erinaceieuropaei is closely related to Diphyllobothrium spp., supporting classification based on morphological features. The present study determined the complete mtDNA sequence of S. erinaceieuropaei from China, which provides novel genetic markers for studying the population genetics and molecular epidemiology of S. erinaceieuropaei in humans and animals. PMID:22553464
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (coding sequence) is the portion of an mRNA sequence that is composed of a number of exon sequence segments. Constructing the CDS is important for in-depth genetic analysis such as genotyping. A program in the MATLAB environment is presented that processes batches of sample sequences into code segments under the guidance of reference exon models, and splices the code segments from the same sample source into a CDS according to the exon order in a queue file. This program is useful in transcriptional polymorphism detection and gene function studies.
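The splicing step described above can be sketched in a few lines. The original program is written in MATLAB and its interface is not given in the abstract, so the Python function below, its name, and the exon labels are all hypothetical; it only shows the idea of joining already-extracted exon segments from one sample in the order listed by a queue.

```python
def splice_cds(exon_segments, exon_order):
    """Concatenate one sample's exon segments into a CDS.

    exon_segments: dict mapping an exon label to its nucleotide sequence.
    exon_order: list of exon labels in transcript order (the "queue").
    """
    missing = [name for name in exon_order if name not in exon_segments]
    if missing:
        raise KeyError(f"no segment found for exons: {missing}")
    return "".join(exon_segments[name].upper() for name in exon_order)


# Toy segments for a single sample; the storage order is irrelevant because
# the queue alone decides the splicing order.
segments = {"exon2": "gga", "exon1": "ATG", "exon3": "TAA"}
cds = splice_cds(segments, ["exon1", "exon2", "exon3"])
```

Failing loudly on a missing exon, rather than silently skipping it, avoids producing a frame-shifted CDS that would mislead later polymorphism calls.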
1998-12-01
Type II restriction enzymes, such as Eco R1 endonuclease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence-specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how these proteins discriminate between different DNA sequences, especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals
Protein Crystal Eco R1 Endonulease-DNA Complex
NASA Technical Reports Server (NTRS)
1998-01-01
Type II restriction enzymes, such as Eco R1 endonuclease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence-specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how these proteins discriminate between different DNA sequences, especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals
Virus Identification in Unknown Tropical Febrile Illness Cases Using Deep Sequencing
Balmaseda, Angel; Harris, Eva; DeRisi, Joseph L.
2012-01-01
Dengue virus is an emerging infectious agent that infects an estimated 50–100 million people annually worldwide, yet current diagnostic practices cannot detect an etiologic pathogen in ∼40% of dengue-like illnesses. Metagenomic approaches to pathogen detection, such as viral microarrays and deep sequencing, are promising tools to address emerging and non-diagnosable disease challenges. In this study, we used the Virochip microarray and deep sequencing to characterize the spectrum of viruses present in human sera from 123 Nicaraguan patients presenting with dengue-like symptoms but testing negative for dengue virus. We utilized a barcoding strategy to simultaneously deep sequence multiple serum specimens, generating on average over 1 million reads per sample. We then implemented a stepwise bioinformatic filtering pipeline to remove the majority of human and low-quality sequences to improve the speed and accuracy of subsequent unbiased database searches. By deep sequencing, we were able to detect virus sequence in 37% (45/123) of previously negative cases. These included 13 cases with Human Herpesvirus 6 sequences. Other samples contained sequences with similarity to sequences from viruses in the Herpesviridae, Flaviviridae, Circoviridae, Anelloviridae, Asfarviridae, and Parvoviridae families. In some cases, the putative viral sequences were virtually identical to known viruses, and in others they diverged, suggesting that they may derive from novel viruses. These results demonstrate the utility of unbiased metagenomic approaches in the detection of known and divergent viruses in the study of tropical febrile illness. PMID:22347512
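The barcoding strategy mentioned above lets reads from pooled specimens be assigned back to their source sample before any filtering or database search. The sketch below is only an illustration of that assignment step: it assumes each read begins with an exact, unambiguous barcode, and the function name, barcodes, and sample labels are hypothetical rather than those of the study.

```python
def demultiplex(reads, barcodes):
    """Assign reads to samples by their leading barcode.

    reads: iterable of read sequences (strings).
    barcodes: dict mapping a barcode sequence to a sample label.
    Returns (reads per sample with the barcode trimmed, unassigned reads).
    """
    by_sample = {sample: [] for sample in barcodes.values()}
    unassigned = []
    for read in reads:
        for barcode, sample in barcodes.items():
            if read.startswith(barcode):
                by_sample[sample].append(read[len(barcode):])
                break
        else:
            # No barcode matched; keep the read aside instead of guessing.
            unassigned.append(read)
    return by_sample, unassigned


barcodes = {"ACGT": "serum_01", "TGCA": "serum_02"}
pooled = ["ACGTGGGG", "TGCAAAAC", "ACGTCCCC", "GGGGGGGG"]
by_sample, unassigned = demultiplex(pooled, barcodes)
```

Real pipelines usually tolerate a small number of barcode mismatches and read quality scores alongside the bases; both are omitted here to keep the example short.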
ERIC Educational Resources Information Center
Blanco-López, Ángel; Franco-Mariscal, Antonio Joaquín; España-Ramos, Enrique
2016-01-01
We present a case study to illustrate the design and implementation of a teaching sequence about oral and dental health and hygiene. This teaching sequence was aimed at year 10 students (age 15-16) and sought to develop their scientific competences. In line with the PISA assessment framework for science and the tenets of a context-based approach…
Sanz, Yolanda
2017-01-01
Abstract The miniaturized and portable DNA sequencer MinION™ has demonstrated great potential in different analyses such as genome-wide sequencing, pathogen outbreak detection and surveillance, human genome variability, and microbial diversity. In this study, we tested the ability of the MinION™ platform to perform long amplicon sequencing in order to design new approaches to study microbial diversity using a multi-locus approach. After compiling a robust database by parsing and extracting the rrn bacterial region from more than 67000 complete or draft bacterial genomes, we demonstrated that the data obtained during sequencing of the long amplicon in the MinION™ device using R9 and R9.4 chemistries were sufficient to study 2 mock microbial communities in a multiplex manner and to almost completely reconstruct the microbial diversity contained in the HM782D and D6305 mock communities. Although nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, we presented a novel approach consisting of multi-locus and long amplicon sequencing using the MinION™ MkIb DNA sequencer and R9 and R9.4 chemistries that help to overcome the main disadvantage of this portable sequencing platform. Furthermore, the nanopore sequencing library, constructed with the last releases of pore chemistry (R9.4) and sequencing kit (SQK-LSK108), permitted the retrieval of the higher level of 1D read accuracy sufficient to characterize the microbial species present in each mock community analysed. Improvements in nanopore chemistry, such as minimizing base-calling errors and new library protocols able to produce rapid 1D libraries, will provide more reliable information in the near future. Such data will be useful for more comprehensive and faster specific detection of microbial species and strains in complex ecosystems. PMID:28605506
Verwey, Willem B
2015-05-01
Research has provided many indications that highly practiced 6-key sequences are carried out in a chunking mode in which key-specific stimuli past the first are largely ignored. When in such sequences a deviating stimulus occasionally occurs at an unpredictable location, participants fall back to responding to individual stimuli (Verwey & Abrahamse, 2012). The observation that in such a situation execution still benefits from prior practice has been attributed to the possibility to operate in an associative mode. To better understand the contribution to the execution of keying sequences of motor chunks, associative sequence knowledge and also of explicit sequence knowledge, the present study tested three alternative accounts for the earlier finding of an execution rate increase at the end of 6-key sequences performed in the associative mode. The results provide evidence that the earlier observed execution rate increase can be attributed to the use of explicit sequence knowledge. In the present experiment this benefit was limited to sequences that are executed at the moderately fast rates of the associative mode, and occurred at both the earlier and final elements of the sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
Hysteretic energy prediction method for mainshock-aftershock sequences
NASA Astrophysics Data System (ADS)
Zhai, Changhai; Ji, Duofa; Wen, Weiping; Li, Cuihua; Lei, Weidong; Xie, Lili
2018-04-01
Structures located in seismically active regions may be subjected to mainshock-aftershock (MSAS) sequences. Strong aftershocks significantly affect the hysteretic energy demand of structures. The hysteretic energy, E_{H,seq}, is normalized by mass m and expressed in terms of the equivalent velocity, V_{D,seq}, to quantitatively investigate aftershock effects on the hysteretic energy of structures. The equivalent velocity, V_{D,seq}, is computed by analyzing the response time-history of an inelastic single-degree-of-freedom (SDOF) system with a varying vibration period subjected to 309 MSAS sequences. The present study selected two kinds of MSAS sequences, with one aftershock and two aftershocks, respectively. The aftershocks are scaled to maintain different relative intensities. The variation of the equivalent velocity, V_{D,seq}, is studied for consideration of the ductility values, site conditions, relative intensities, number of aftershocks, hysteretic models, and damping ratios. The MSAS sequence with one aftershock exhibited a 10% to 30% hysteretic energy increase, whereas the MSAS sequence with two aftershocks presented a 20% to 40% hysteretic energy increase. Finally, a hysteretic energy prediction equation is proposed as a function of the vibration period, ductility value, and damping ratio to estimate hysteretic energy for mainshock-aftershock sequences.
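The abstract states only that the hysteretic energy is normalized by mass and expressed as an equivalent velocity; it does not give the formula. Assuming the conventional energy-equivalent velocity definition used in energy-based seismic design (an assumption here, not a statement of the authors' exact expression), the relation between the hysteretic energy, the mass m, and the equivalent velocity would read:

```latex
V_{D,\mathrm{seq}} = \sqrt{\frac{2\,E_{H,\mathrm{seq}}}{m}}
```

Under that assumption the velocity scales with the square root of the energy, so the reported 10% to 30% increase in hysteretic energy would correspond to roughly a 5% to 14% increase in equivalent velocity.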
Visual Sequence Learning in Infancy: Domain-General and Domain-Specific Associations with Language
ERIC Educational Resources Information Center
Shafto, Carissa L.; Conway, Christopher M.; Field, Suzanne L.; Houston, Derek M.
2012-01-01
Research suggests that nonlinguistic sequence learning abilities are an important contributor to language development (Conway, Bauernschmidt, Huang, & Pisoni, 2010). The current study investigated visual sequence learning (VSL) as a possible predictor of vocabulary development in infants. Fifty-eight 8.5-month-old infants were presented with a…
Event-Related Potential Correlates of Declarative and Non-Declarative Sequence Knowledge
ERIC Educational Resources Information Center
Ferdinand, Nicola K.; Runger, Dennis; Frensch, Peter A.; Mecklinger, Axel
2010-01-01
The goal of the present study was to demonstrate that declarative and non-declarative knowledge acquired in an incidental sequence learning task contributes differentially to memory retrieval and leads to dissociable ERP signatures in a recognition memory task. For this purpose, participants performed a sequence learning task and were classified…
Sawle, Lucas; Ghosh, Kingshuk
2015-08-28
A general formalism to compute configurational properties of proteins and other heteropolymers with an arbitrary sequence of charges and non-uniform excluded volume interaction is presented. A variational approach is utilized to predict average distance between any two monomers in the chain. The presented analytical model, for the first time, explicitly incorporates the role of sequence charge distribution to determine relative sizes between two sequences that vary not only in total charge composition but also in charge decoration (even when charge composition is fixed). Furthermore, the formalism is general enough to allow variation in excluded volume interactions between two monomers. Model predictions are benchmarked against the all-atom Monte Carlo studies of Das and Pappu [Proc. Natl. Acad. Sci. U. S. A. 110, 13392 (2013)] for 30 different synthetic sequences of polyampholytes. These sequences possess an equal number of glutamic acid (E) and lysine (K) residues but differ in the patterning within the sequence. Without any fit parameter, the model captures the strong sequence dependence of the simulated values of the radius of gyration with a correlation coefficient of R² = 0.9. The model is then applied to real proteins to compare the unfolded state dimensions of 540 orthologous pairs of thermophilic and mesophilic proteins. The excluded volume parameters are assumed similar under denatured conditions, and only electrostatic effects encoded in the sequence are accounted for. With these assumptions, thermophilic proteins are found, with high statistical significance, to have a more compact disordered ensemble compared to their mesophilic counterparts. The method presented here, due to its analytical nature, is capable of making such high throughput analysis of multiple proteins and will have broad applications in proteomic studies as well as in other heteropolymeric systems.
Peng, Xian; Yuan, Han; Chen, Wufan; Ding, Lei
2017-01-01
Continuous loop averaging deconvolution (CLAD) is one of the proven methods for recovering transient auditory evoked potentials (AEPs) in rapid stimulation paradigms, which requires an elaborate stimulus sequence design to attenuate impacts from noise in data. The present study aimed to develop a new metric for gauging a CLAD sequence in terms of the noise gain factor (NGF), which has been proposed previously but is less effective in the presence of pink (1/f) noise. We derived the new metric by explicitly introducing the 1/f model into the proposed time-continuous sequence. We selected several representative CLAD sequences to test their noise property on typical EEG recordings, as well as on five real CLAD electroencephalogram (EEG) recordings to retrieve the middle latency responses. We also demonstrated the merit of the new metric in generating and quantifying optimized sequences using a classic genetic algorithm. The new metric shows evident improvements in measuring actual noise gains at different frequencies, and better performance than the original NGF in various aspects. The new metric is a generalized NGF measurement that can better quantify the performance of a CLAD sequence, and provide a more efficient means of generating CLAD sequences when combined with optimization algorithms. The present study can facilitate the specific application of the CLAD paradigm with desired sequences in the clinic. PMID:28414803
Genetic Analyses in Small-for-Gestational-Age Newborns.
Stalman, Susanne E; Solanky, Nita; Ishida, Miho; Alemán-Charlet, Cristina; Abu-Amero, Sayeda; Alders, Marielle; Alvizi, Lucas; Baird, William; Demetriou, Charalambos; Henneman, Peter; James, Chela; Knegt, Lia C; Leon, Lydia J; Mannens, Marcel M A M; Mul, Adi N; Nibbering, Nicole A; Peskett, Emma; Rezwan, Faisal I; Ris-Stalpers, Carrie; van der Post, Joris A M; Kamp, Gerdine A; Plötz, Frans B; Wit, Jan M; Stanier, Philip; Moore, Gudrun E; Hennekam, Raoul C
2018-03-01
Small for gestational age (SGA) can be the result of fetal growth restriction, which is associated with perinatal morbidity and mortality. Mechanisms that control prenatal growth are poorly understood. The aim of the current study was to gain more insight into prenatal growth failure and determine an effective diagnostic approach in SGA newborns. We hypothesized that one or more copy number variations (CNVs) and disturbed methylation and sequence variants may be present in genes associated with fetal growth. A prospective cohort study of subjects with a low birth weight for gestational age. The study was conducted at an academic pediatric research institute. A total of 21 SGA newborns with a mean birth weight below the first centile and a control cohort of 24 appropriate-for-gestational-age newborns were studied. Array comparative genomic hybridization, genome-wide methylation studies, and exome sequencing were performed. The numbers of CNVs, methylation disturbances, and sequence variants. The genetic analyses demonstrated three CNVs, one systematically disturbed methylation pattern, and one sequence variant explaining SGA. Additional methylation disturbances and sequence variants were present in 20 patients. In 19 patients, multiple abnormalities were found. Our results confirm the influence of a large number of mechanisms explaining dysregulation of fetal growth. We concluded that CNVs, methylation disturbances, and sequence variants all contribute to prenatal growth failure. These genetic workups can be an effective diagnostic approach in SGA newborns.
Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S
2015-09-01
The aim of the present study was to determine whether genetic variability exists among populations of Culex quinquefasciatus, which would be a valuable tool in the management of mosquito control programmes. In the present study, populations of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito population collected at different locations is considered an important issue towards their influence and nature of vector potential of these mosquitoes.
Equine infectious anemia virus in naturally infected horses from the Brazilian Pantanal.
Cursino, Andreia Elisa; Vilela, Ana Paula Pessoa; Franco-Luiz, Ana Paula Moreira; de Oliveira, Jaquelline Germano; Nogueira, Márcia Furlan; Júnior, João Pessoa Araújo; de Aguiar, Daniel Moura; Kroon, Erna Geessien
2018-05-11
Equine infectious anemia (EIA) has a worldwide distribution, and is widespread in Brazil. The Brazilian Pantanal presents with high prevalence, compromising equine performance and indirectly the livestock industry, since the horses are used for cattle management. Although EIA is routinely diagnosed by the agar gel immunodiffusion test (AGID), this serological assay has some limitations, so PCR-based detection methods have the potential to overcome these limitations and act as complementary tests to those currently used. Considering the limited number of equine infectious anemia virus (EIAV) sequences which are available in public databases and the great genome variability, studies of EIAV detection and molecular characterization remain important. In this study we detected EIAV proviral DNA from 23 peripheral blood mononuclear cell (PBMC) samples of naturally infected horses from the Brazilian Pantanal using a semi-nested PCR (sn-PCR). The serological profile of the animals was also evaluated by AGID and ELISA for gp90 and p26. Furthermore, the EIAV PCR amplified DNA was sequenced and phylogenetically analyzed. Here we describe the first EIAV sequences of the 5' LTR of the tat gene in naturally infected horses from Brazil, which presented with 91% similarity to EIAV reference sequences. The Brazilian EIAV sequences also presented variable nucleotide similarities among themselves, ranging from 93.5% to 100%. Phylogenetic analysis showed that Brazilian EIAV sequences grouped in a separate clade relative to other reference sequences. Thus this molecular detection and characterization may provide information about EIAV circulation in Brazilian territories and improve phylogenetic inferences.
Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J
2008-09-01
Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cell stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological difference, where the ST 7 strains have a larger mitogenic effect.
Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J
2008-01-01
Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cell stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological difference, where the ST 7 strains have a larger mitogenic effect. PMID:18803762
First-order and higher order sequence learning in specific language impairment.
Clark, Gillian M; Lum, Jarrad A G
2017-02-01
A core claim of the procedural deficit hypothesis of specific language impairment (SLI) is that the disorder is associated with poor implicit sequence learning. This study investigated whether implicit sequence learning problems in SLI are present for first-order conditional (FOC) and higher order conditional (HOC) sequences. Twenty-five children with SLI and 27 age-matched, nonlanguage-impaired children completed 2 serial reaction time tasks. On 1 version, the sequence to be implicitly learnt comprised a FOC sequence and on the other a HOC sequence. Results showed that the SLI group learned the HOC sequence (ηp² = .285, p = .005) but not the FOC sequence (ηp² = .099, p = .118). The control group learned both sequences (FOC ηp² = .497, HOC ηp² = .465, ps < .001). The SLI group's difficulty learning the FOC sequence is consistent with the procedural deficit hypothesis. However, the study provides new evidence that multiple mechanisms may underpin the learning of FOC and HOC sequences. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
DNA Multiple Sequence Alignment Guided by Protein Domains: The MSA-PAD 2.0 Method.
Balech, Bachir; Monaco, Alfonso; Perniola, Michele; Santamaria, Monica; Donvito, Giacinto; Vicario, Saverio; Maggi, Giorgio; Pesole, Graziano
2018-01-01
Multiple sequence alignment (MSA) is a fundamental component in many DNA sequence analyses including metagenomics studies and phylogeny inference. When guided by protein profiles, DNA multiple alignments assume a higher precision and robustness. Here we present details of the use of the upgraded version of MSA-PAD (2.0), which is a DNA multiple sequence alignment framework able to align DNA sequences coding for single/multiple protein domains guided by PFAM or user-defined annotations. MSA-PAD has two alignment strategies, called "Gene" and "Genome," accounting for coding domains order and genomic rearrangements, respectively. Novel options were added to the present version, where the MSA can be guided by protein profiles provided by the user. This allows MSA-PAD 2.0 to run faster and to add custom protein profiles sometimes not present in PFAM database according to the user's interest. MSA-PAD 2.0 is currently freely available as a Web application at https://recasgateway.cloud.ba.infn.it/ .
ERIC Educational Resources Information Center
Gabay, Yafit; Schiff, Rachel; Vakil, Eli
2012-01-01
Motor sequence learning has been studied extensively in Developmental dyslexia (DD). The purpose of the present research was to examine procedural learning of letter names and motor sequences in individuals with DD and control groups. Both groups completed the Serial Search Task which enabled the assessment of learning of letter names and motor…
The Role of RT Carry-Over for Congruence Sequence Effects in Masked Priming
ERIC Educational Resources Information Center
Huber-Huber, Christoph; Ansorge, Ulrich
2017-01-01
The present study disentangles 2 sources of the congruence sequence effect with masked primes: congruence and response time of the previous trial (reaction time [RT] carry-over). Using arrows as primes and targets and a metacontrast masking procedure we found congruence as well as congruence sequence effects. In addition, congruence sequence…
The Effects of Explicit Instruction of Formulaic Sequences on Second-Language Writers
ERIC Educational Resources Information Center
Colovic-Markovic, Jelena
2012-01-01
The present study investigated the effects of the explicit teaching of formulaic sequences (i.e., academic and topic-induced) on L2 writing. The research examined separately the effects of the treatment on the students' abilities to produce the target formulaic sequences in controlled (i.e., C-tests) and uncontrolled situations (i.e.,…
Guidelines for Grades 9-12 Mathematics Curriculum. Toward Meeting Present and Future Needs.
ERIC Educational Resources Information Center
Peterson, Wayne, Ed.
Three sequences of coursework are detailed in the curriculum development guidelines provided in this document. The 4-year sequence, structured around problem-solving, applications, and the acquisition of theory, is designed for the college-bound student who plans to enter a mathematics-based field of study. The 3-year sequence is designed for…
ERIC Educational Resources Information Center
Dawkins, Paul Christian
2012-01-01
This study presents how the introduction of a metaphor for sequence convergence constituted an experientially real context in which an undergraduate real analysis student developed a property-based definition of sequence convergence. I use elements from Zandieh and Rasmussen's (2010) Defining as a Mathematical Activity framework to trace the…
Shamsi, Shokoofeh; Ghadam, Masoumeh; Suthar, Jaydipbhai; Ebrahimzadeh Mousavi, Hoseinali; Soltani, Mehdi; Mirzargar, Saeed
2016-11-07
Despite several reports on the presence of the potentially zoonotic nematodes among edible fishes in the Persian Gulf, there is still no study on the specific identification of these parasites or their genetic characterisation. In the present study, a total of 600 fish belonging to five popular species of fish in the region, including Otolithes ruber, Psettodes erumei, Saurida tumbil, Scomberomorus commerson and Sphyraena jello were examined for infection with nematode parasites. Detailed microscopy of nematodes found in the present study followed by characterisation of the first and second internal transcribed spacers (ITS-1 and ITS-2, respectively) showed that they belong to five distinct taxa that could be potentially zoonotic. Anisakis type I was found in four species of fish, had identical ITS sequences as Anisakis typica previously reported in Australian waters and was different from those reported in the Nearctic. Hysterothylacium type VI in the present study was morphologically similar to those previously described from Australasian waters and ITS sequences were identical among Australian specimens and those found in the present study. Another Hysterothylacium larval type was also found in the present study which had identical ITS sequences and similar morphology to those previously reported and identified as H. amoyense in China Sea. Since no ITS sequence data from a well identified adult H. amoyense with an identifiable museum voucher number is yet available and due to some other issues discussed in the article we suggest assignment of this larval type from the China Sea and the Persian Gulf to H. amoyense is doubtful until future studies on a well identified male specimen of H. amoyense or other species reveals the specific identity of this larval type. We propose to refer to this larval type as Hysterothylacium larval type XV. 
In the present study we also describe a new species, Hysterothylacium persicum and discuss how to differentiate it from closely related species. We also found some adult females with distinct morphology and ITS sequence but due to lack of male specimens they have been referred as Hysterothylacium sp. in this paper. They had the same ITS sequence data as Hysterothylacium larval type VI. This study shows the presence of a relatively broad diversity of potentially zoonotic nematodes in edible fish of the Persian Gulf. Therefore educational campaigns for public and local health practitioners are suggested to protect consumers from becoming infected with these parasites. Copyright © 2016 Elsevier B.V. All rights reserved.
Experimental and analytical study of high velocity impact on Kevlar/Epoxy composite plates
NASA Astrophysics Data System (ADS)
Sikarwar, Rahul S.; Velmurugan, Raman; Madhu, Velmuri
2012-12-01
In the present study, the impact behavior of Kevlar/Epoxy composite plates has been investigated experimentally for different thicknesses and lay-up sequences and compared with analytical results. The effect of thickness and lay-up sequence on energy absorbing capacity has been studied for high velocity impact. Four lay-up sequences and four thickness values have been considered. Initial velocities and residual velocities are measured experimentally to calculate the energy absorbing capacity of laminates. Residual velocity of projectile and energy absorbed by laminates are calculated analytically. The results obtained from analytical study are found to be in good agreement with experimental results. It is observed from the study that 0/90 lay-up sequence is most effective for impact resistance. Delamination area is maximum on the back side of the plate for all thickness values and lay-up sequences. The delamination area on the back is maximum for 0/90/45/-45 laminates compared to other lay-up sequences.
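The abstract above states that energy absorbing capacity is calculated from measured initial and residual velocities. A minimal sketch of that calculation follows, assuming the usual kinetic-energy balance for a rigid projectile, E = 0.5 * m * (v_i^2 - v_r^2). The function name, the projectile mass, and the velocities in the usage note are hypothetical and are not taken from the paper.

```python
def absorbed_energy(mass_kg, v_initial, v_residual):
    """Energy (J) absorbed by a laminate, from a kinetic-energy balance.

    Assumption (not stated in the abstract): the projectile is rigid and
    all lost kinetic energy is absorbed by the plate.
    """
    if mass_kg <= 0:
        raise ValueError("projectile mass must be positive")
    if v_initial < 0 or v_residual < 0:
        raise ValueError("speeds must be non-negative")
    if v_residual > v_initial:
        raise ValueError("residual speed cannot exceed initial speed")
    # Difference between kinetic energy before and after perforation.
    return 0.5 * mass_kg * (v_initial ** 2 - v_residual ** 2)
```

For a hypothetical 10 g projectile striking at 300 m/s and exiting at 200 m/s, `absorbed_energy(0.010, 300.0, 200.0)` gives about 250 J. A residual velocity of zero corresponds to a projectile stopped by the plate.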
Impact of Next Generation Sequencing Techniques in Food Microbiology
Mayo, Baltasar; Rachid, Caio T. C. C; Alegría, Ángel; Leite, Analy M. O; Peixoto, Raquel S; Delgado, Susana
2014-01-01
Understanding the Maxam-Gilbert and Sanger sequencing as the first generation, in recent years there has been an explosion of newly-developed sequencing strategies, which are usually referred to as next generation sequencing (NGS) techniques. NGS techniques have high-throughputs and produce thousands or even millions of sequences at the same time. These sequences allow for the accurate identification of microbial taxa, including uncultivable organisms and those present in small numbers. In specific applications, NGS provides a complete inventory of all microbial operons and genes present or being expressed under different study conditions. NGS techniques are revolutionizing the field of microbial ecology and have recently been used to examine several food ecosystems. After a short introduction to the most common NGS systems and platforms, this review addresses how NGS techniques have been employed in the study of food microbiota and food fermentations, and discusses their limits and perspectives. The most important findings are reviewed, including those made in the study of the microbiota of milk, fermented dairy products, and plant-, meat- and fish-derived fermented foods. The knowledge that can be gained on microbial diversity, population structure and population dynamics via the use of these technologies could be vital in improving the monitoring and manipulation of foods and fermented food products. They should also improve their safety. PMID:25132799
ERIC Educational Resources Information Center
Boreham, N. C.; And Others
1985-01-01
This study investigated the effects of two sequences of instruction--theory-to-application and application-to-theory--on medical students' cognitive preferences in preclinical science teaching. Results indicate that presenting an example of the clinical application of biochemical theory before presenting the theory itself increased students'…
Introduction to Semiconductor Physics in Secondary Education: Evaluation of a Teaching Sequence
ERIC Educational Resources Information Center
Garcia-Carmona, Antonio; Criado, Ana Maria
2009-01-01
The present article presents a didactic proposal oriented to teaching notions of semiconductor physics in secondary education. The methods and the results of a pilot study designed to analyse the effectiveness of a teaching sequence on the topic are also described. The subjects were 60 students, aged 14-15 years, of a secondary school in Seville,…
Nandi, Sukdeb; Anbazhagan, Rajendra; Kumar, Manoj
2010-01-01
Canine parvovirus 2 (CPV-2) is one of the most important viruses that causes haemorrhagic gastroenteritis and myocarditis of dogs worldwide. The picture has been complicated further due to the emergence of new mutants of CPV, namely: CPV-2a, CPV-2b and CPV-2c. In this study, the molecular characterisation of strains present in the CPV vaccines available on the Indian market was performed using polymerase chain reaction and DNA sequencing. The VP1/VP2 genes of two vaccine strains and a field strain (Bhopal) were sequenced and the nucleotide and the deduced amino acid sequences were compared. The results indicated that the isolate belonged to CPV type 2b and the strains in the vaccines belonged to type CPV-2. From the study, it is inferred that the CPV strain used in commercially available vaccine preparation differed from the strains present in CPV infection in dogs in India.
Modahl, Cassandra M.; Mackessy, Stephen P.
2016-01-01
Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. 
This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
HIV-1 low copy viral sequencing-A prototype assay.
Mellberg, Tomas; Krabbe, Jon; Gisslén, Magnus; Svennerholm, Bo
2016-01-01
In HIV-1 patients with low viral burden, sequencing is often problematic, yet important. This study presents a sensitive, sub-type independent system for sequencing of low level viremia. Sequencing data from 32 HIV-1 infected patients with low level viremia were collected longitudinally. A combination of ViroSeq® HIV-1 Genotyping System and an in-house nesting protocol was used. Eight sub-types were represented. The success-rate of amplification of both PR and RT in the same sample was 100% in samples with viral loads above 100 copies/ml. Below 100 copies/ml, this study managed to amplify both regions in 7/13 (54%) samples. The assays were able to amplify either PR or RT in all sub-types included but one sub-type A specimen. In conclusion, this study presents a promising, simple assay to increase the ability to perform HIV-1 resistance testing at low level viremia. This is a prototype assay and the method needs further testing to evaluate clinical performance.
NASA Astrophysics Data System (ADS)
De Santis, A.
2017-12-01
The SAFE (Swarm for Earthquake study) project (funded by the European Space Agency in the framework "STSE Swarm+Innovation", 2014-2016) aimed at applying the new approach of geosystemics to the analysis of Swarm satellite (ESA) electromagnetic data for investigating the preparatory phase of earthquakes. We present in this talk the case study of the most recent seismic sequence in Italy. First a M6 earthquake on 24 August 2016 and then a M6.5 earthquake on 30 October 2016 struck almost the same region of Central Italy, causing about 300 deaths in total (mostly on 24 August), with a revival of other significant seismicity in January 2017. Analysing both geophysical and climatological satellite and ground data preceding the major earthquakes of the sequence, we present results that confirm a complex solid earth-atmosphere coupling in the preparation phase of the whole sequence.
Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi
2014-01-01
A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
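The abstract above relies on oligonucleotide composition (for example tetranucleotide frequencies) as the input to an alignment-free clustering. The sketch below illustrates only that input step: turning a sequence fragment into a normalized k-mer frequency vector. It is not the BLSOM method itself; the function name, the choice to skip windows containing ambiguous bases, and the example sequence are assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(seq, k=4):
    """Return {k-mer: relative frequency} over all 4**k DNA k-mers.

    Windows containing characters other than A, C, G, T are skipped
    (an assumption; the abstract does not say how such bases are handled).
    """
    seq = seq.upper()
    alphabet = set("ACGT")
    counts = Counter()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if set(window) <= alphabet:
            counts[window] += 1
    total = sum(counts.values())
    all_kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    if total == 0:
        return {kmer: 0.0 for kmer in all_kmers}
    # Fixed key order, so vectors from different fragments are comparable.
    return {kmer: counts[kmer] / total for kmer in all_kmers}
```

Calling `kmer_frequencies(fragment, k=4)` on each fragment yields 256-dimensional vectors that a clustering method of the reader's choice could consume; `k=5` gives the 1024-dimensional pentanucleotide case.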
Archaeon and archaeal virus diversity classification via sequence entropy and fractal dimension
NASA Astrophysics Data System (ADS)
Tremberger, George, Jr.; Gallardo, Victor; Espinoza, Carola; Holden, Todd; Gadura, N.; Cheung, E.; Schneider, P.; Lieberman, D.; Cheung, T.
2010-09-01
Archaea are important potential candidates in astrobiology as their metabolism includes solar, inorganic and organic energy sources. Archaeal viruses would also be expected to be present in a sustainable archaeal exobiological community. Genetic sequence Shannon entropy and fractal dimension can be used to establish a two-dimensional measure for classification and phylogenetic study of these organisms. A sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. Archaeal 16S and 23S ribosomal RNA sequences were studied. Outliers in the 16S rRNA fractal dimension and entropy plot were found to be halophilic archaea. Positive correlation (R-square ~ 0.75, N = 18) was observed between fractal dimension and entropy across the studied species. The 16S ribosomal RNA sequence entropy correlates with the 23S ribosomal RNA sequence entropy across species with R-square 0.93, N = 18. Entropy values correspond positively with branch lengths of a published phylogeny. The studied archaeal virus sequences have high fractal dimensions of 2.02 or more. A comparison of selected extremophile sequences with archaeal sequences from the Humboldt Marine Ecosystem database (Wood-Hull Oceanography Institute, MIT) suggests the presence of continuous sequence expression as inferred from distributions of entropy and fractal dimension, consistent with the diversity expected in an exobiological archaeal community.
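One of the two measures named in the abstract above is the Shannon entropy of a genetic sequence. A minimal sketch follows, assuming the entropy is taken over single-nucleotide symbol frequencies; the abstract does not state the word size or any windowing, so that choice and the function name are assumptions. The fractal dimension calculation is not sketched here because the abstract does not specify the estimator.

```python
import math
from collections import Counter


def shannon_entropy(seq):
    """Shannon entropy in bits per symbol of a nucleotide sequence.

    Assumption: entropy is computed from single-symbol frequencies.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    # H = -sum(p * log2(p)) over the symbols that actually occur.
    return -sum((c / n) * math.log2(c / n) for c in counts.values())
```

With a four-letter alphabet the value ranges from 0 bits (a homopolymer) to 2 bits (all four nucleotides equally frequent), which gives a sense of the scale on which rRNA sequences would be compared.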
Mouse mammary tumor virus-like gene sequences are present in lung patient specimens
2011-01-01
Background Previous studies have reported on the presence of Murine Mammary Tumor Virus (MMTV)-like gene sequences in human cancer tissue specimens. Here, we search for MMTV-like gene sequences in lung disease specimens, including carcinomas, from a Mexican population. This study was based on our previous study reporting that the INER51 lung cancer cell line, from a pleural effusion of a Mexican patient, contains MMTV-like env gene sequences. Results The MMTV-like env gene sequences have been detected in three out of 18 specimens studied, by PCR using a specific set of MMTV-like primers. The three identified MMTV-like gene sequences, which were assigned as INER6, HZ101, and HZ14, were 99%, 98%, and 97% homologous, respectively, as compared to GenBank sequence accession number AY161347. The INER6 and HZ-101 samples were isolated from lung cancer specimens, and the HZ-14 was isolated from an acute inflammatory lung infiltrate sample. Two of the env sequences exhibited disruption of the reading frame due to mutations. Conclusion In summary, we identified the presence of MMTV-like gene sequences in 2 out of 11 (18%) of the lung carcinomas and 1 out of 7 (14%) of the acute inflammatory lung infiltrate specimens studied from a Mexican population. PMID:21943279
No effects of transcranial DLPFC stimulation on implicit task sequence learning and consolidation.
Savic, Branislav; Cazzoli, Dario; Müri, René; Meier, Beat
2017-08-29
Neurostimulation of the dorsolateral prefrontal cortex (DLPFC) can modulate performance in cognitive tasks. In a recent study, however, transcranial direct current stimulation (tDCS) of the DLPFC did not affect implicit task sequence learning and consolidation in a paradigm that involved bimanual responses. Because bimanual performance increases the coupling between homologous cortical areas of the hemispheres and left and right DLPFC were stimulated separately the null findings may have been due to the bimanual setup. The aim of the present study was to test the effect of neuro-stimulation on sequence learning in a uni-manual setup. For this purpose two experiments were conducted. In Experiment 1, the DLPFC was stimulated with tDCS. In Experiment 2 the DLPFC was stimulated with transcranial magnetic stimulation (TMS). In both experiments, consolidation was measured 24 hours later. The results showed that sequence learning was present in all conditions and sessions, but it was not influenced by stimulation. Likewise, consolidation of sequence learning was robust across sessions, but it was not influenced by stimulation. These results replicate and extend previous findings. They indicate that established tDCS and TMS protocols on the DLPFC do not influence implicit task sequence learning and consolidation.
NASA Astrophysics Data System (ADS)
Mananga, Eugene S.; Reid, Alicia E.
2013-01-01
This paper presents a study of finite pulse widths for the BABA pulse sequence using the Floquet-Magnus expansion (FME) approach. In the FME scheme, the first order ? is identical to its counterparts in average Hamiltonian theory (AHT) and Floquet theory (FT). However, the timing part in the FME approach is introduced via the ? function not present in other schemes. This function provides an easy way for evaluating the spin evolution during the time in between' through the Magnus expansion of the operator connected to the timing part of the evolution. The evaluation of ? is particularly useful for the analysis of the non-stroboscopic evolution. Here, the importance of the boundary conditions, which provide a natural choice of ? , is ignored. This work uses the ? function to compare the efficiency of the BABA pulse sequence with ? and the BABA pulse sequence with finite pulses. Calculations of ? and ? are presented.
Zhao, Zhong-Hui; Bian, Qing-Qing; Ren, Wan-Xin; Cheng, Wen-Yu; Jia, Yan-Qing; Fang, Yan-Qin; Zhao, Guang-Hui
2014-06-01
The present study examined the variations in three mitochondrial (mt) DNA sequences, namely cytochrome b (cytb), cytochrome c oxidase subunit 3 (cox3) and NADH dehydrogenase subunit 5 (nad5), among Baylisascaris schroederi isolates from the Qinling subspecies of the giant panda in Shaanxi province, northwestern China. No differences in length were detected in the three mt fragments from different isolates. The intra-specific sequence variations within all B. schroederi samples were 0-2.6% for pcytb, 0-1.8% for pcox3 and 0-2.1% for pnad5, while the inter-specific sequence differences among members of the genus Baylisascaris were 8.2-15.2%, 6.2-15.9% and 8.4-16.0% for pcytb, pcox3 and pnad5, respectively. A phylogenetic analysis of the combined sequences of pcytb, pcox3 and pnad5 showed that all B. schroederi samples in the present study were located in two large clusters, with one cluster containing samples from giant pandas in Sichuan province. These findings provide basic information for further study of molecular epidemiology and control of B. schroederi infection in the Qinling subspecies of the giant panda and throughout China.
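The intra- and inter-specific figures in the abstract above are pairwise sequence differences expressed as percentages. A minimal sketch of such a calculation for two already aligned sequences follows. It assumes a simple uncorrected p-distance in which gap columns are ignored; the abstract does not say which distance measure or software was used, and the function name and example sequences are hypothetical.

```python
def percent_difference(seq_a, seq_b):
    """Uncorrected pairwise difference (%) between two aligned sequences.

    Assumptions: both sequences come from the same alignment (equal
    length) and columns with a gap ('-') in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gap columns
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared
```

For example, two 10-base fragments that differ at a single position give `percent_difference("ACGTACGTAC", "ACGTACGTAT")`, which is 10.0. Taking the minimum and maximum over all isolate pairs would produce a range of the kind reported above.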
Mesoscopic modeling of DNA denaturation rates: Sequence dependence and experimental comparison
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dahlen, Oda, E-mail: oda.dahlen@ntnu.no; Erp, Titus S. van, E-mail: titus.van.erp@ntnu.no
Using rare event simulation techniques, we calculated DNA denaturation rate constants for a range of sequences and temperatures for the Peyrard-Bishop-Dauxois (PBD) model with two different parameter sets. We studied a larger variety of sequences compared to previous studies that only consider DNA homopolymers and DNA sequences containing an equal amount of weak AT- and strong GC-base pairs. Our results show that, contrary to previous findings, an even distribution of the strong GC-base pairs does not always result in the fastest possible denaturation. In addition, we applied an adaptation of the PBD model to study hairpin denaturation for which experimental data are available. This is the first quantitative study in which dynamical results from the mesoscopic PBD model have been compared with experiments. Our results show that present parameterized models, although giving good results regarding thermodynamic properties, overestimate denaturation rates by orders of magnitude. We believe that our dynamical approach is, therefore, an important tool for verifying DNA models and for developing next generation models that have higher predictive power than present ones.
Study of infectious diseases in archaeological bone material - A dataset.
Pucu, Elisa; Cascardo, Paula; Chame, Marcia; Felice, Gisele; Guidon, Niéde; Cleonice Vergne, Maria; Campos, Guadalupe; Roberto Machado-Silva, José; Leles, Daniela
2017-08-01
Bones of human and ground sloth remains were analyzed for presence of Trypanosoma cruzi by conventional PCR using primers TC, TC1 and TC2. Sequence results amplified a fragment with the same product size as the primers (300 and 350 bp). Amplified PCR product was sequenced and analyzed on GenBank, using Blast. Although these sequences did not match with these parasites they showed high amplification with species of bacteria. This article presents the methodology used and the alignment of the sequences. The display of this dataset will allow further analysis of our results and discussion presented in the manuscript "Finding the unexpected: a critical view on molecular diagnosis of infectious diseases in archaeological samples" (Pucu et al. 2017) [1].
Ferles, Christos; Beaufort, William-Scott; Ferle, Vanessa
2017-01-01
The present study devises mapping methodologies and projection techniques that visualize and demonstrate biological sequence data clustering results. The Sequence Data Density Display (SDDD) and Sequence Likelihood Projection (SLP) visualizations represent the input symbolical sequences in a lower-dimensional space in such a way that the clusters and relations of data elements are depicted graphically. Both operate in combination/synergy with the Self-Organizing Hidden Markov Model Map (SOHMMM). The resulting unified framework is in position to analyze automatically and directly raw sequence data. This analysis is carried out with little, or even complete absence of, prior information/domain knowledge.
Determination of a mutational spectrum
Thilly, William G.; Keohavong, Phouthone
1991-01-01
A method of resolving (physically separating) mutant DNA from nonmutant DNA and a method of defining or establishing a mutational spectrum or profile of alterations present in nucleic acid sequences from a sample to be analyzed, such as a tissue or body fluid. The present method is based on the fact that it is possible, through the use of DGGE, to separate nucleic acid sequences which differ by only a single base change and on the ability to detect the separate mutant molecules. The present invention, in another aspect, relates to a method for determining a mutational spectrum in a DNA sequence of interest present in a population of cells. The method of the present invention is useful as a diagnostic or analytical tool in forensic science in assessing environmental and/or occupational exposures to potentially genetically toxic materials (also referred to as potential mutagens); in biotechnology, particularly in the study of the relationship between the amino acid sequence of enzymes and other biologically-active proteins or protein-containing substances and their respective functions; and in determining the effects of drugs, cosmetics and other chemicals for which toxicity data must be obtained.
Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro
2018-01-01
Abstract Background The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. Findings In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Conclusions Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca. The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family. PMID:29659812
Buti, Matteo; Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro; Sargent, Daniel James
2018-04-01
The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca. The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family.
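The N50 size quoted above is a standard assembly contiguity statistic: the length of the contig at which the running total of contig lengths, taken from longest to shortest, first reaches half of the assembly size. A minimal sketch follows; the function name `n50` and the toy lengths are assumptions for illustration, not the study's data or tooling.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    Contigs are visited from longest to shortest; the N50 is the length of
    the contig at which the cumulative length first reaches half the total.
    """
    if not lengths:
        raise ValueError("at least one contig length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


if __name__ == "__main__":
    # Toy contig lengths in base pairs.
    print(n50([2, 2, 2, 3, 3, 4, 8, 8]))
```

A higher N50 for the same total size means the assembly is made of fewer, longer pieces.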
ERIC Educational Resources Information Center
Guajardo, Gustavo
2017-01-01
Spanish generally shows a Sequence of Tense (SOT) phenomenon in subjunctive clauses: the tense of the embedded clause (present or past) must agree with the tense of the matrix clause. It has been reported, however, that one kind of violation sometimes occurs, in which a present tense subjunctive clause is embedded under a past tense matrix clause…
Some special values of vertices of trees on the suborbital graphs
NASA Astrophysics Data System (ADS)
Değer, A. H.; Akbaba, Ü.
2018-01-01
In the present study, the action of a congruence subgroup of SL(2, Z) on ℚ̂ is examined. From this action and its properties, vertices of paths of minimal length on the suborbital graph F_{u,N} give rise to some special sequence values, that are alternate sequences such as identity, Fibonacci and Lucas sequences. These types of vertices also give rise to special continued fractions, hence from recurrence relations for continued fractions, values of these vertices and values of special sequences were associated.
Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R
2014-08-16
Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in the reference genome. However, no study has investigated the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from a large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. 
In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human diversity. 76% of micSeqs were confirmed by a comparative genomics approach. Fourteen micSeqs are expressed in human brain or contain TF binding regions. Some micSeqs are primate-specific, conserved and may play a role in the evolution of primates.
Whole-exome sequencing identifies USH2A mutations in a pseudo-dominant Usher syndrome family.
Zheng, Sui-Lian; Zhang, Hong-Liang; Lin, Zhen-Lang; Kang, Qian-Yan
2015-10-01
Usher syndrome (USH) is an autosomal recessive (AR) multi-sensory degenerative disorder leading to deaf-blindness. USH is clinically subdivided into three subclasses, and 10 genes have been identified thus far. Clinical and genetic heterogeneities in USH make a precise diagnosis difficult. A dominant‑like USH family in successive generations was identified, and the present study aimed to determine the genetic predisposition of this family. Whole‑exome sequencing was performed in two affected patients and an unaffected relative. Systematic data were analyzed by bioinformatic analysis to remove the candidate mutations via step‑wise filtering. Direct Sanger sequencing and co‑segregation analysis were performed in the pedigree. One novel and two known mutations in the USH2A gene were identified, and were further confirmed by direct sequencing and co‑segregation analysis. The affected mother carried compound mutations in the USH2A gene, while the unaffected father carried a heterozygous mutation. The present study demonstrates that whole‑exome sequencing is a robust approach for the molecular diagnosis of disorders with high levels of genetic heterogeneity.
Behera, Bijay Kumar; Baisvar, Vishwamitra Singh; Kumari, Kavita; Rout, Ajaya Kumar; Pakrashi, Sudip; Paria, Prasenjet; Rao, A R; Rai, Anil
2017-03-01
In the present study, the complete mitochondrial genome sequence of Anabas testudineus is reported using a PGM sequencer (Ion Torrent, Life Technologies, La Jolla, CA). The complete mitogenome of the climbing perch, A. testudineus, which is 16 603 bp in length, was obtained by de novo assembly of genomic reads using the Torrent Mapping Alignment Program (TMAP). The mitogenome of A. testudineus is composed of 13 protein-coding genes, two rRNAs, and 22 tRNAs. Here, 20 tRNA genes showed the typical cloverleaf model, with the D-loop as the control region, and the gene order and organization are closely similar to those of Osphronemidae and most other Perciformes fish mitogenomes in the NCBI databases. The mitogenome in the present study has 99% similarity to the complete mitogenome sequence of the earlier reported A. testudineus. The phylogenetic analysis of Anabantidae depicted that their mitogenomes are closely related to each other. The complete mitogenome sequence of A. testudineus would be helpful in understanding the population genetics, phylogenetics, and evolution of Anabantidae.
Nowrousian, Minou; Stajich, Jason E.; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D.; Pöggeler, Stefanie; Read, Nick D.; Seiler, Stephan; Smith, Kristina M.; Zickler, Denise; Kück, Ulrich; Freitag, Michael
2010-01-01
Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology. PMID:20386741
Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael
2010-04-08
Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.
Impact of Lateral Transfers on the Genomes of Lepidoptera
Drezen, Jean-Michel; Josse, Thibaut; Bézier, Annie; Gauthier, Jérémy; Huguet, Elisabeth
2017-01-01
Transfer of DNA sequences between species regardless of their evolutionary distance is very common in bacteria, but evidence that horizontal gene transfer (HGT) also occurs in multicellular organisms has been accumulating in the past few years. The actual extent of this phenomenon is underestimated due to frequent sequence filtering of “alien” DNA before genome assembly. However, recent studies based on genome sequencing have revealed, and experimentally verified, the presence of foreign DNA sequences in the genetic material of several species of Lepidoptera. Large DNA viruses, such as baculoviruses and the symbiotic viruses of parasitic wasps (bracoviruses), have the potential to mediate these transfers in Lepidoptera. In particular, using ultra-deep sequencing, newly integrated transposons have been identified within baculovirus genomes. Bacterial genes have also been acquired by genomes of Lepidoptera, as in other insects and nematodes. In addition, insertions of bracovirus sequences were present in the genomes of certain moth and butterfly lineages, that were likely corresponding to rearrangements of ancient integrations. The viral genes present in these sequences, sometimes of hymenopteran origin, have been co-opted by lepidopteran species to confer some protection against pathogens. PMID:29120392
Carvalho, Paulo F.; Goldstone, Robert L.
2015-01-01
Inductive category learning takes place across time. As such, it is not surprising that the sequence in which information is studied has an impact on what is learned and how efficient learning is. In this paper we review research on different learning sequences and how they affect learning. We analyze different aspects of interleaved (frequent alternation between categories during study) and blocked study (infrequent alternation between categories during study) that might explain how and when one sequence of study results in improved learning. While these different sequences of study differ in the amount of temporal spacing and temporal juxtaposition between items of different categories, these aspects do not seem to account for the majority of the results available in the literature. However, differences in the type of category being studied and the duration of the retention interval between study and test may play an important role. We conclude that there is no single aspect that is able to account for all the evidence available. Understanding learning as a process of sequential comparisons in time and how different sequences fundamentally alter the statistics of this experience offers a promising framework for understanding sequencing effects in category learning. We use this framework to present novel predictions and hypotheses for future research on sequencing effects in inductive category learning. PMID:25983699
OncoNEM: inferring tumor evolution from single-cell sequencing data.
Ross, Edith M; Markowetz, Florian
2016-04-15
Single-cell sequencing promises a high-resolution view of genetic heterogeneity and clonal evolution in cancer. However, methods to infer tumor evolution from single-cell sequencing data lag behind methods developed for bulk-sequencing data. Here, we present OncoNEM, a probabilistic method for inferring intra-tumor evolutionary lineage trees from somatic single nucleotide variants of single cells. OncoNEM identifies homogeneous cellular subpopulations and infers their genotypes as well as a tree describing their evolutionary relationships. In simulation studies, we assess OncoNEM's robustness and benchmark its performance against competing methods. Finally, we show its applicability in case studies of muscle-invasive bladder cancer and essential thrombocythemia.
Manríquez, René A; Vera, Tamara; Villalba, Melina V; Mancilla, Alejandra; Vakharia, Vikram N; Yañez, Alejandro J; Cárcamo, Juan G
2017-01-31
The infectious pancreatic necrosis virus (IPNV) causes significant economic losses in Chilean salmon farming. For effective sanitary management, the IPNV strains present in Chile need to be fully studied, characterized, and constantly updated at the molecular level. In this study, 36 Chilean IPNV isolates collected over 6 years (2006-2011) from Salmo salar, Oncorhynchus mykiss, and Oncorhynchus kisutch were genotypically characterized. Salmonid samples were obtained from freshwater, estuary, and seawater sources from central, southern, and the extreme-south of Chile (35° to 53°S). Sequence analysis of the VP2 gene classified 10 IPNV isolates as genogroup 1 and 26 as genogroup 5. Analyses indicated a preferential, but not obligate, relationship between genogroup 5 isolates and S. salar infection. Fifteen genogroup 5 and nine genogroup 1 isolates presented VP2 gene residues associated with high virulence (i.e. Thr, Ala, and Thr at positions 217, 221, and 247, respectively). Four genogroup 5 isolates presented an oddly long VP5 deduced amino acid sequence (29.6 kDa). Analysis of the VP2 amino acid motifs associated with clinical and subclinical infections identified the clinical fingerprint in only genogroup 5 isolates; in contrast, the genogroup 1 isolates presented sequences predominantly associated with the subclinical fingerprint. Predictive analysis of VP5 showed an absence of transmembrane domains and plasma membrane tropism signals. WebLogo analysis of the VP5 BH domains revealed high identities with the marine birnavirus Y-6 and Japanese IPNV strain E1-S. Sequence analysis for putative 25 kDa proteins, coded by the ORF between VP2 and VP4, exhibited three putative nuclear localization sequences and signals of mitochondrial tropism in two isolates. This study provides important advances in updating the characterizations of IPNV strains present in Chile. 
The results from this study will help in identifying epidemiological links and generating specific biotechnological tools for controlling IPNV outbreaks in Chilean salmon farming.
2D-dynamic representation of DNA sequences as a graphical tool in bioinformatics
NASA Astrophysics Data System (ADS)
Bielińska-Wąż, D.; Wąż, P.
2016-10-01
2D-dynamic representation of DNA sequences is briefly reviewed. Some new examples of 2D-dynamic graphs which are the graphical tool of the method are shown. Using the examples of the complete genome sequences of the Zika virus it is shown that the present method can be applied for the study of the evolution of viral genomes.
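As a rough illustration of a 2D-dynamic graph, the sketch below walks through a DNA sequence one base at a time, takes a unit step in the plane for each base, and accumulates a unit "mass" at every point visited. The specific base-to-step assignment in `STEPS`, the function names, and the handling of ambiguous symbols are assumptions made for this example and may differ from the authors' definitions.

```python
from collections import Counter

# Assumed unit steps for each base (illustrative choice, not taken from the abstract).
STEPS = {"A": (-1, 0), "G": (1, 0), "C": (0, 1), "T": (0, -1)}


def dynamic_graph(sequence: str) -> Counter:
    """Map a DNA sequence to a set of 2D points with accumulated masses."""
    x = y = 0
    masses = Counter()
    for base in sequence.upper():
        if base not in STEPS:
            continue  # skip ambiguous symbols such as N
        dx, dy = STEPS[base]
        x, y = x + dx, y + dy
        masses[(x, y)] += 1  # each visit adds a unit mass at the end point
    return masses


def center_of_mass(masses: Counter):
    """Mass-weighted mean position of the graph, a simple numerical descriptor."""
    total = sum(masses.values())
    if total == 0:
        raise ValueError("graph has no points")
    cx = sum(x * m for (x, _), m in masses.items()) / total
    cy = sum(y * m for (_, y), m in masses.items()) / total
    return cx, cy


if __name__ == "__main__":
    graph = dynamic_graph("AGACCT")
    print(dict(graph), center_of_mass(graph))
```

Descriptors such as the center of mass can then be compared between genomes, which is the kind of comparison the abstract applies to Zika virus sequences.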
Attentional awakening: gradual modulation of temporal attention in rapid serial visual presentation.
Ariga, Atsunori; Yokosawa, Kazuhiko
2008-03-01
Orienting attention to a point in time facilitates processing of an item within rapidly changing surroundings. We used a one-target RSVP task to look for differences in accuracy in reporting a target related to when the target temporally appeared in the sequence. The results show that observers correctly report a target early in the sequence less frequently than later in the sequence. Previous RSVP studies predicted equivalently accurate performances for one target wherever it appeared in the sequence. We named this new phenomenon attentional awakening, which reflects a gradual modulation of temporal attention in a rapid sequence.
Wooley, John C.; Godzik, Adam; Friedberg, Iddo
2010-01-01
Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics. PMID:20195499
Spontaneous Spatial Mapping of Learned Sequence in Chimpanzees: Evidence for a SNARC-Like Effect
Adachi, Ikuma
2014-01-01
In the last couple of decades, there has been a growing number of reports on space-based representation of numbers and serial order in humans. In the present study, to explore evolutionary origins of such representations, we examined whether our closest evolutionary relatives, chimpanzees, map an acquired sequence onto space in a similar way to humans. The subjects had been trained to perform a number sequence task in which they touched a sequence of “small” to “large” Arabic numerals presented in random locations on the monitor. This task was presented in sessions that also included test trials consisting of only two numerals (1 and 9) horizontally arranged. On half of the trials 1 was located to the left of 9, whereas on the other half 1 was to the right of 9. The chimpanzees' performance was systematically influenced by the spatial arrangement of the stimuli; specifically, they responded quicker when 1 was on the left and 9 on the right compared to the other way around. This result suggests that chimpanzees, like humans, spontaneously map a learned sequence onto space. PMID:24643044
Xiao, Fanshu; Yu, Yuhe; Li, Jinjin; Juneau, Philippe; Yan, Qingyun
2018-05-25
The 16S rRNA gene has been one of the most commonly used molecular markers for estimating bacterial diversity during the past decades. However, there is no consensus on the sequencing depth (from thousands to millions of sequences per sample), and the clustering methods used to generate OTUs may also differ among studies. These inconsistent premises make effective comparisons among studies difficult or unreliable. This study aims to examine the sequencing depth and clustering method needed to ensure stable diversity patterns when studying fish gut microbiota. A dataset of 42 gut microbiota samples from Siniperca chuatsi (a carnivorous fish) was used to test how sequencing depth and clustering may affect the alpha and beta diversity patterns of fish intestinal microbiota. Interestingly, we found that the sequencing depth (resampling 1000-11,000 sequences per sample) and the clustering methods (UPARSE and UCLUST) did not bias the estimates of the diversity patterns during fish development from larva to adult. Although we acknowledge that a suitable sequencing depth may differ case by case, our finding indicates that shallow sequencing, such as 1000 sequences per sample, may be enough to reflect the general diversity patterns of fish gut microbiota. However, we have shown in the present study that strict pre-processing of the original sequences is required to ensure reliable results. This study provides evidence to help make a sound scientific choice of sequencing depth and clustering method for future studies on fish gut microbiota patterns while reducing analysis costs as much as possible.
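Resampling every sample to a common depth before comparing diversity can be sketched as follows. This is a minimal illustration that assumes random subsampling without replacement and uses the Shannon index as the alpha-diversity measure; the names `rarefy` and `shannon` and the toy OTU table are hypothetical, and the study's actual pipeline is not described at this level of detail in the abstract.

```python
import math
import random
from collections import Counter


def rarefy(otu_counts, depth, seed=0):
    """Subsample an OTU count table to `depth` sequences without replacement."""
    pool = [otu for otu, n in otu_counts.items() for _ in range(n)]
    if depth > len(pool):
        raise ValueError("depth exceeds the number of sequences in the sample")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    return Counter(rng.sample(pool, depth))


def shannon(counts):
    """Shannon diversity index (natural log) of an OTU count table."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample contains no sequences")
    return -sum((n / total) * math.log(n / total) for n in counts.values() if n > 0)


if __name__ == "__main__":
    # Toy sample: OTU identifier -> number of sequences.
    sample = {"otu1": 500, "otu2": 300, "otu3": 200}
    for depth in (100, 500, 1000):
        print(depth, round(shannon(rarefy(sample, depth)), 3))
```

If the index changes little across depths, as the abstract reports for 1000 to 11,000 sequences per sample, shallow sequencing may be sufficient for that comparison.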
Coordinate cytokine regulatory sequences
Frazer, Kelly A.; Rubin, Edward M.; Loots, Gabriela G.
2005-05-10
The present invention provides CNS sequences that regulate the cytokine gene expression, expression cassettes and vectors comprising or lacking the CNS sequences, host cells and non-human transgenic animals comprising the CNS sequences or lacking the CNS sequences. The present invention also provides methods for identifying compounds that modulate the functions of CNS sequences as well as methods for diagnosing defects in the CNS sequences of patients.
Approaches for in silico finishing of microbial genome sequences
Kremer, Frederico Schmitt; McBride, Alan John Alexander; Pinto, Luciano da Silva
2017-01-01
The introduction of next-generation sequencing (NGS) had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as “drafts”, incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases) tools that are available to facilitate genome finishing. PMID:28898352
Pérez-Oseguera, Ángeles; Castro-Jaimes, Semiramis; Salgado-Camargo, Abraham David; Silva-Sanchez, Jesus; Garza-González, Elvira; Castillo-Ramírez, Santiago
2017-01-01
In this study, we present the complete genome sequence of a blaOXA-58-producing Acinetobacter baumannii strain, sampled from a Mexican hospital and not related to the international clones. PMID:28883144
Curriculum Sequencing and the Acquisition of Clock-Reading Skills among Chinese and Flemish Children
ERIC Educational Resources Information Center
Burny, Elise; Valcke, Martin; Desoete, Annemie; Van Luit, Johannes E. Hans
2013-01-01
The present study addresses the impact of the curriculum on primary school children's acquisition of clock-reading knowledge from analog and digital clocks. Focusing on Chinese and Flemish children's clock-reading knowledge, the study is about whether the differences in sequencing of learning and instruction opportunities--as defined by the…
NASA Astrophysics Data System (ADS)
Mumladze, Tea; Wang, Haijun; Graham, Gerhard
2017-04-01
The seismic network that forms the International Monitoring System (IMS) of the Comprehensive Nuclear-Test-Ban Treaty Organization (CTBTO) will ultimately consist of 170 seismic stations (50 primary and 120 auxiliary) in 76 countries around the world. The network is still under development, but more than 80% of it is currently in operation. The objective of seismic monitoring is to detect and locate underground nuclear explosions. However, the data from the IMS can also be widely used for scientific and civil purposes. In this study we present the results of data analysis of the 2016 seismic sequence in Central Italy. Several hundred earthquakes in this sequence were recorded by the seismic stations of the IMS. All events were accurately located by the analysts of the International Data Centre (IDC) of the CTBTO. We present the epicentral and magnitude distribution, station recordings and teleseismic phases as obtained from the Reviewed Event Bulletin (REB). We also present a comparison of the database of the IDC with the databases of the European-Mediterranean Seismological Centre (EMSC) and the U.S. Geological Survey (USGS). The present work shows that IMS data can be used for earthquake sequence analyses and can play an important role in seismological research.
Genetic characterization of Babesia and Theileria parasites in water buffaloes in Sri Lanka.
Sivakumar, Thillaiampalam; Tattiyapong, Muncharee; Fukushi, Shintaro; Hayashida, Kyoko; Kothalawala, Hemal; Silva, Seekkuge Susil Priyantha; Vimalakumar, Singarayar Caniciyas; Kanagaratnam, Ratnam; Meewewa, Asela Sanjeewa; Suthaharan, Kalpana; Puvirajan, Thamotharampillai; de Silva, Weligodage Kumarawansa; Igarashi, Ikuo; Yokoyama, Naoaki
2014-02-24
Water buffaloes are thought to be the reservoir hosts for several hemoprotozoan parasites that infect cattle. In the present study, we surveyed Sri Lankan bred water buffaloes for infections with Babesia bovis, Babesia bigemina, Theileria annulata, and Theileria orientalis using parasite-specific PCR assays. When 320 blood-derived DNA samples from water buffaloes reared in three different districts (Polonnaruwa, Mannar, and Mullaitivu) of Sri Lanka were PCR screened, B. bovis, B. bigemina, and T. orientalis were detected. While T. orientalis was the predominant parasite (82.5%), low PCR-positive rates were observed for B. bovis (1.9%) and B. bigemina (1.6%). Amplicons of the gene sequences of the Rhoptry Associated Protein-1 (RAP-1) of B. bovis, the Apical Membrane Antigen-1 (AMA-1) of B. bigemina, and the Major Piroplasm Surface Protein (MPSP) of T. orientalis were compared with those characterized previously in Sri Lankan cattle. While the B. bigemina AMA-1 sequences from water buffaloes shared high identity values with those from cattle, B. bovis RAP-1 sequences from water buffaloes diverged genetically from those of cattle. For T. orientalis, none of the MPSP sequence types reported previously in Sri Lankan cattle (types 1, 3, 5, and 7) were detected in the water buffaloes, and the MPSP sequences analyzed in the present study belonged to types N1 or N2. In summary, in addition to reporting the first PCR-based survey of Babesia and Theileria parasites in water buffaloes in Sri Lanka, the present study found that the predominant variants of water buffalo-derived B. bovis RAP-1 and T. orientalis MPSP sequences were different from those previously described from cattle in this country. Copyright © 2013 Elsevier B.V. All rights reserved.
Bacterial Community Analysis of Drinking Water Biofilms in Southern Sweden
Lührig, Katharina; Canbäck, Björn; Paul, Catherine J.; Johansson, Tomas; Persson, Kenneth M.; Rådström, Peter
2015-01-01
Next-generation sequencing of the V1–V2 and V3 variable regions of the 16S rRNA gene generated a total of 674,116 reads that described six distinct bacterial biofilm communities from both water meters and pipes. A high degree of reproducibility was demonstrated for the experimental and analytical work-flow by analyzing the communities present in parallel water meters, the rare occurrence of biological replicates within a working drinking water distribution system. The communities observed in water meters from households that did not complain about their drinking water were defined by sequences representing Proteobacteria (82–87%), with 22–40% of all sequences being classified as Sphingomonadaceae. However, a water meter biofilm community from a household with consumer reports of red water and flowing water containing elevated levels of iron and manganese had fewer sequences representing Proteobacteria (44%); only 0.6% of all sequences were classified as Sphingomonadaceae; and, in contrast to the other water meter communities, markedly more sequences represented Nitrospira and Pedomicrobium. The biofilm communities in pipes were distinct from those in water meters, and contained sequences that were identified as Mycobacterium, Nocardia, Desulfovibrio, and Sulfuricurvum. The approach employed in the present study resolved the bacterial diversity present in these biofilm communities as well as the differences that occurred in biofilms within a single distribution system, and suggests that next-generation sequencing of 16S rRNA amplicons can show changes in bacterial biofilm communities associated with different water qualities. PMID:25739379
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mays, S.E.; Poloski, J.P.; Sullivan, W.H.
1982-07-01
This report describes a risk study of the Browns Ferry, Unit 1, nuclear plant. The study is one of four such studies sponsored by the NRC Office of Research, Division of Risk Assessment, as part of its Interim Reliability Evaluation Program (IREP), Phase II. This report is contained in four volumes: a main report and three appendixes. Appendix C generally describes the methods used to estimate accident sequence frequency values. Information is presented concerning the approach, example collection, failure data, candidate dominant sequences, uncertainty analysis, and sensitivity analysis.
Ali, M A; Al-Hemaid, F M; Lee, J; Hatamleh, A A; Gyulai, G; Rahman, M O
2015-10-02
The present study explored the systematic inventory of Echinops L. (Asteraceae) of Saudi Arabia, with special reference to the molecular typing of Echinops abuzinadianus Chaudhary, an endemic species to Saudi Arabia, based on the internal transcribed spacer (ITS) sequences (ITS1-5.8S-ITS2) of nuclear ribosomal DNA. A sequence similarity search using BLAST and a phylogenetic analysis of the ITS sequence of E. abuzinadianus revealed a high level of sequence similarity with E. glaberrimus DC. (section Ritropsis). The novel primary sequence and the secondary structure of ITS2 of E. abuzinadianus could potentially be used for molecular genotyping.
Matrix Transformations between Certain Sequence Spaces over the Non-Newtonian Complex Field
Efe, Hakan
2014-01-01
In some cases, the most general linear operator between two sequence spaces is given by an infinite matrix. So the theory of matrix transformations has always been of great interest in the study of sequence spaces. In the present paper, we introduce the matrix transformations in sequence spaces over the field ℂ* and characterize some classes of infinite matrices with respect to the non-Newtonian calculus. Also we give the necessary and sufficient conditions on an infinite matrix transforming one of the classical sets over ℂ* to another one. Furthermore, the concept for sequence-to-sequence and series-to-series methods of summability is given with some illustrated examples. PMID:25110740
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC
NASA Astrophysics Data System (ADS)
Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.
2000-02-01
Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Diversity of Babesia bovis merozoite surface antigen genes in the Philippines.
Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Ybanez, Adrian Patalinghug; Ybanez, Rochelle Haidee Daclan; Perez, Zandro Obligado; Guswanto, Azirwan; Igarashi, Ikuo; Yokoyama, Naoaki
2014-02-01
Babesia bovis is the causative agent of fatal babesiosis in cattle. In the present study, we investigated the genetic diversity of B. bovis among Philippine cattle, based on the genes that encode merozoite surface antigens (MSAs). Forty-one B. bovis-positive blood DNA samples from cattle were used to amplify the msa-1, msa-2b, and msa-2c genes. In phylogenetic analyses, the msa-1, msa-2b, and msa-2c gene sequences generated from Philippine B. bovis-positive DNA samples were found in six, three, and four different clades, respectively. All of the msa-1 and most of the msa-2b sequences were found in clades that were formed only by Philippine msa sequences in the respective phylograms. While all the msa-1 sequences from the Philippines showed similarity to those formed by Australian msa-1 sequences, the msa-2b sequences showed similarity to either Australian or Mexican msa-2b sequences. In contrast, msa-2c sequences from the Philippines were distributed across all the clades of the phylogram, although one clade was formed exclusively by Philippine msa-2c sequences. Similarities among the deduced amino acid sequences of MSA-1, MSA-2b, and MSA-2c from the Philippines were 62.2-100, 73.1-100, and 67.3-100%, respectively. The present findings demonstrate that B. bovis populations are genetically diverse in the Philippines. This information will provide a good foundation for the future design and implementation of improved immunological preventive methodologies against bovine babesiosis in the Philippines. The study has also generated a set of data that will be useful for further understanding of the global genetic diversity of this important parasite. © 2013.
Das Bhowmik, Aneek; Gupta, Neerja; Dalal, Ashwin; Kabra, Madhulika
In the present study we report on genetic analysis in a patient with developmental delay, truncal obesity and vision problem, to find the causative mutation. Whole exome sequencing was performed on genomic DNA extracted from whole blood of the patient which revealed a homozygous nonsense variant (c.2816T>A) in exon 8 of ALMS1 gene that results in a stop codon and premature truncation at codon 939 (p.L939Ter) of the protein. The mutation was confirmed by Sanger sequencing. Exome sequencing was helpful in establishing diagnosis of Alstrom syndrome in this patient. This case highlights the utility of exome sequencing in clinical practice. Copyright © 2016 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
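The relation between the coding position c.2816 and codon 939 reported above is simple integer arithmetic, sketched below. The helper name `codon_of` is hypothetical. The calculation places the substitution at the second base of codon 939, which is consistent with a leucine codon (TTA or TTG in the standard genetic code) becoming a stop codon (TAA or TAG) after a T>A change.

```python
def codon_of(cds_position):
    """Map a 1-based coding-sequence position to (codon number, position within codon)."""
    if cds_position < 1:
        raise ValueError("coding positions are 1-based")
    return (cds_position - 1) // 3 + 1, (cds_position - 1) % 3 + 1


codon, offset = codon_of(2816)
print(codon, offset)  # 939 2
```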
Microbial community structure in three deep-sea carbonate crusts.
Heijs, S K; Aloisi, G; Bouloubassi, I; Pancost, R D; Pierre, C; Sinninghe Damsté, J S; Gottschal, J C; van Elsas, J D; Forney, L J
2006-10-01
Carbonate crusts in marine environments can act as sinks for carbon dioxide. Therefore, understanding carbonate crust formation could be important for understanding global warming. In the present study, the microbial communities of three carbonate crust samples from deep-sea mud volcanoes in the eastern Mediterranean were characterized by sequencing 16S ribosomal RNA (rRNA) genes amplified from DNA directly retrieved from the samples. In combination with the mineralogical composition of the crusts and lipid analyses, sequence data were used to assess the possible role of prokaryotes in crust formation. Collectively, the obtained data showed the presence of highly diverse communities, which were distinct in each of the carbonate crusts studied. Bacterial 16S rRNA gene sequences were found in all crusts and the majority was classified as alpha-, gamma-, and delta- Proteobacteria. Interestingly, sequences of Proteobacteria related to Halomonas and Halovibrio sp., which can play an active role in carbonate mineral formation, were present in all crusts. Archaeal 16S rRNA gene sequences were retrieved from two of the crusts studied. Several of those were closely related to archaeal sequences of organisms that have previously been linked to the anaerobic oxidation of methane (AOM). However, the majority of archaeal sequences were not related to sequences of organisms known to be involved in AOM. In combination with the strongly negative delta 13C values of archaeal lipids, these results open the possibility that organisms with a role in AOM may be more diverse within the Archaea than previously suggested. Different communities found in the crusts could carry out similar processes that might play a role in carbonate crust formation.
Mining of Microbial Genomes for the Novel Sources of Nitrilases.
Sharma, Nikhil; Thakur, Neerja; Raj, Tilak; Savitri; Bhalla, Tek Chand
2017-01-01
Next-generation DNA sequencing (NGS) has made it feasible to sequence large numbers of microbial genomes, and advancements in computational biology have opened enormous opportunities to mine genome sequence data for novel genes and enzymes or their sources. In the present communication, in silico mining of microbial genomes was carried out to find novel sources of nitrilases. The sequences selected were analyzed for homology and considered for designing motifs. The manually designed motifs based on amino acid sequences of nitrilases were used to screen 2000 microbial genomes (translated to proteomes). This resulted in the identification of one hundred thirty-eight putative/hypothetical sequences which could potentially code for nitrilase activity. In vitro validation of nine predicted sources of nitrilases was done for nitrile/cyanide hydrolyzing activity. Of the nine predicted nitrilases, those from Gluconacetobacter diazotrophicus, Sphingopyxis alaskensis, Saccharomonospora viridis, and Shimwellia blattae were specific for aliphatic nitriles, whereas nitrilases from Geodermatophilus obscurus, Nocardiopsis dassonvillei, Runella slithyformis, and Streptomyces albus possessed activity for aromatic nitriles. Flavobacterium indicum was specific towards potassium cyanide (KCN), which revealed the presence of a nitrilase homolog, that is, cyanide dihydratase, with no activity for aliphatic, aromatic, or aryl nitriles. The present study reports novel sources of nitrilases and cyanide dihydratase which were not reported hitherto by in silico or in vitro studies.
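Motif-based screening of translated genomes, as described in this abstract, can be sketched as a regular-expression search over protein sequences. The motif below is a hypothetical placeholder and is not one of the manually designed nitrilase motifs from the study; the protein identifiers and sequences are also made up.

```python
import re

# Hypothetical motif written as a regular expression; NOT one of the
# manually designed nitrilase motifs from the study.
MOTIF = re.compile(r"G[LIV].CWE")


def screen(proteome, motif=MOTIF):
    """Return {protein_id: [match start positions]} for proteins containing the motif."""
    hits = {}
    for protein_id, seq in proteome.items():
        positions = [m.start() for m in motif.finditer(seq)]
        if positions:
            hits[protein_id] = positions
    return hits


# Toy proteome with made-up identifiers and sequences.
proteome = {
    "prot_A": "MKTAGLVCWEHRS",
    "prot_B": "MSSPQRTLLKD",
    "prot_C": "GIACWEAAGVTCWE",
}

print(screen(proteome))  # {'prot_A': [4], 'prot_C': [0, 8]}
```

Proteins returned by such a screen are only candidates. As in the abstract, activity has to be confirmed in vitro.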
Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai
2017-11-23
The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
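One generic way to set a threshold for reliable mutation detection is to ask how many variant reads are needed before sequencing error alone becomes an unlikely explanation. The sketch below uses a plain binomial tail for this. It is an assumption made for illustration and is not the statistical criterion used in VirMut; the error rate, depth, and significance level are hypothetical values.

```python
import math


def binom_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return 1.0 - sum(math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


def min_reliable_count(depth, error_rate, alpha=1e-3):
    """Smallest variant-read count whose probability under pure sequencing error is below alpha."""
    for k in range(depth + 1):
        if binom_tail(depth, k, error_rate) < alpha:
            return k
    return None


# Hypothetical values: 1000 reads covering a site, 1% per-base error rate.
print(min_reliable_count(1000, 0.01))
```

A variant supported by fewer reads than this count cannot be separated from error at the chosen level. A real pipeline would also correct for the number of sites tested.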
Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing
Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.
2015-01-01
The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety-four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed, high-quality reads were mapped to the E. grandis reference sequence, and single nucleotide variants (SNVs) and insertions/deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method for development of genetic markers for family-based QTL and association analysis in Eucalyptus. PMID:25602379
A study of entropy/clarity of genetic sequences using metric spaces and fuzzy sets.
Georgiou, D N; Karakasidis, T E; Nieto, Juan J; Torres, A
2010-11-07
The study of genetic sequences is of great importance in biology and medicine. Sequence analysis and taxonomy are two major fields of application of bioinformatics. In the present paper we extend the notion of entropy and clarity to the use of different metrics and apply them in the case of the Fuzzy Polynucleotide Space (FPS). Applications of these notions to selected polynucleotides and complete genomes, both in the I(12×k) space and using their representation in FPS, are presented. Our results show that the values of fuzzy entropy/clarity are indicative of the degree of complexity necessary for the description of the polynucleotides in the FPS, although in the latter case the interpretation is slightly different than in the case of the I(12×k) hypercube. Fuzzy entropy/clarity along with the use of appropriate metrics can contribute to sequence analysis and taxonomy. Copyright © 2010 Elsevier Ltd. All rights reserved.
Joseph, Sneha; Poriya, Paresh; Kundu, Rahul
2016-11-01
The present study reports the phylogenetic relationship of six zoanthid species belonging to three genera, Isaurus, Palythoa, and Zoanthus, identified using systematic computational analysis of mtDNA gene sequences. All six species are recorded for the first time from the coasts of the Kathiawar Peninsula, India. Genus Isaurus is represented by Isaurus tuberculatus; genus Zoanthus by Zoanthus kuroshio and Zoanthus sansibaricus; and genus Palythoa by Palythoa tuberculosa, P. sp. JVK-2006 and Palythoa heliodiscus. Among the species observed along the coastline, sequence similarity ranged from a minimum of 96% to a maximum of 99%, corresponding to an interspecific divergence of 1-4%; intraspecific divergence was negligible. These results not only highlight the efficiency of the COI gene region in species identification but also demonstrate the genetic variability of zoanthids along the Saurashtra coastline of the west coast of India.
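The interspecific divergence of 1-4% quoted above is the complement of pairwise sequence similarity. A minimal sketch of an uncorrected pairwise distance (p-distance) between two aligned sequences is given below. The function name and the two aligned fragments are hypothetical and are not real zoanthid COI data, and the study may have used a different distance model.

```python
def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned, equal-length sequences.

    Sites where either sequence has a gap ('-') or an ambiguous base ('N') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue
        compared += 1
        differing += a != b
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Made-up aligned fragments of 25 bases that differ at one site.
a = "ATGGCATTAGGCTTAACCGATCGAT"
b = "ATGGCATTAGGTTTAACCGATCGAT"
d = p_distance(a, b)
print(f"divergence = {d:.1%}, similarity = {1 - d:.1%}")
```

One difference in 25 compared sites gives 4% divergence, that is, 96% similarity, which corresponds to the upper end of the range reported in the abstract.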
Premzl, Marko
2015-01-01
Using eutherian comparative genomic analysis protocol and public genomic sequence data sets, the present work attempted to update and revise two gene data sets. The most comprehensive third party annotation gene data sets of eutherian adenohypophysis cystine-knot genes (128 complete coding sequences), and d-dopachrome tautomerases and macrophage migration inhibitory factor genes (30 complete coding sequences) were annotated. For example, the present study first described primate-specific cystine-knot Prometheus genes, as well as differential gene expansions of D-dopachrome tautomerase genes. Furthermore, new frameworks of future experiments of two eutherian gene data sets were proposed. PMID:25941635
Whole genome sequencing data and de novo draft assemblies for 66 teleost species
Malmstrøm, Martin; Matschiner, Michael; Tørresen, Ole K.; Jakobsen, Kjetill S.; Jentoft, Sissel
2017-01-01
Teleost fishes comprise more than half of all vertebrate species, yet genomic data are only available for 0.2% of their diversity. Here, we present whole genome sequencing data for 66 new species of teleosts, vastly expanding the availability of genomic data for this important vertebrate group. We report on de novo assemblies based on low-coverage (9–39×) sequencing and present detailed methodology for all analyses. To facilitate further utilization of this data set, we present statistical analyses of the gene space completeness and verify the expected phylogenetic position of the sequenced genomes in a large mitogenomic context. We further present a nuclear marker set used for phylogenetic inference and evaluate each gene tree in relation to the species tree to test for homogeneity in the phylogenetic signal. Collectively, these analyses illustrate the robustness of this highly diverse data set and enable extensive reuse of the selected phylogenetic markers and the genomic data in general. This data set covers all major teleost lineages and provides unprecedented opportunities for comparative studies of teleosts. PMID:28094797
Accounting for rate-dependent category boundary shifts in speech perception.
Bosker, Hans Rutger
2017-01-01
The perception of temporal contrasts in speech is known to be influenced by the speech rate in the surrounding context. This rate-dependent perception is suggested to involve general auditory processes because it is also elicited by nonspeech contexts, such as pure tone sequences. Two general auditory mechanisms have been proposed to underlie rate-dependent perception: durational contrast and neural entrainment. This study compares the predictions of these two accounts of rate-dependent speech perception by means of four experiments, in which participants heard tone sequences followed by Dutch target words ambiguous between /ɑs/ "ash" and /a:s/ "bait". Tone sequences varied in the duration of tones (short vs. long) and in the presentation rate of the tones (fast vs. slow). Results show that the duration of preceding tones did not influence target perception in any of the experiments, thus challenging durational contrast as explanatory mechanism behind rate-dependent perception. Instead, the presentation rate consistently elicited a category boundary shift, with faster presentation rates inducing more /a:s/ responses, but only if the tone sequence was isochronous. Therefore, this study proposes an alternative, neurobiologically plausible account of rate-dependent perception involving neural entrainment of endogenous oscillations to the rate of a rhythmic stimulus.
Ramos, Rommel Thiago Jucá; Carneiro, Adriana Ribeiro; Soares, Siomar de Castro; dos Santos, Anderson Rodrigues; Almeida, Sintia; Guimarães, Luis; Figueira, Flávia; Barbosa, Eudes; Tauch, Andreas; Azevedo, Vasco; Silva, Artur
2013-03-01
New sequencing platforms have enabled rapid decoding of complete prokaryotic genomes at relatively low cost. The Ion Torrent platform is an example of these technologies, characterized by lower coverage, generating challenges for the genome assembly. One particular problem is the lack of genomes that enable reference-based assembly, such as the one used in the present study, Corynebacterium pseudotuberculosis biovar equi, which causes high economic losses in the US equine industry. The quality treatment strategy incorporated into the assembly pipeline enabled a 16-fold greater use of the sequencing data obtained compared with traditional quality filter approaches. Data preprocessing prior to the de novo assembly enabled the use of known methodologies in the next-generation sequencing data assembly. Moreover, manual curation was proved to be essential for ensuring a quality assembly, which was validated by comparative genomics with other species of the genus Corynebacterium. The present study presents a modus operandi that enables a greater and better use of data obtained from semiconductor sequencing for obtaining the complete genome from a prokaryotic microorganism, C. pseudotuberculosis, which is not a traditional biological model such as Escherichia coli. © 2012 The Authors. Published by Society for Applied Microbiology and Blackwell Publishing Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Whole-exome sequencing identified a variant in EFTUD2 gene in establishing a genetic diagnosis.
Rengasamy Venugopalan, S; Farrow, E G; Lypka, M
2017-06-01
Craniofacial anomalies are complex and have an overlapping phenotype. Mandibulofacial Dysostosis and Oculo-Auriculo-Vertebral Spectrum are conditions that share common craniofacial phenotype and present a challenge in arriving at a diagnosis. In this report, we present a case of female proband who was given a differential diagnosis of Treacher Collins syndrome or Hemifacial Microsomia without certainty. Prior genetic testing reported negative for 22q deletion and FGFR screenings. The objective of this study was to demonstrate the critical role of whole-exome sequencing in establishing a genetic diagnosis of the proband. The participants were 14½-year-old affected female proband/parent trio. Proband/parent trio were enrolled in the study. Surgical tissue sample from the proband and parental blood samples were collected and prepared for whole-exome sequencing. Illumina HiSeq 2500 instrument was used for sequencing (125 nucleotide reads/84X coverage). Analyses of variants were performed using custom-developed software, RUNES and VIKING. Variant analyses following whole-exome sequencing identified a heterozygous de novo pathogenic variant, c.259C>T (p.Gln87*), in EFTUD2 (NM_004247.3) gene in the proband. Previous studies have reported that the variants in EFTUD2 gene were associated with Mandibulofacial Dysostosis with Microcephaly. Patients with facial asymmetry, micrognathia, choanal atresia and microcephaly should be analyzed for variants in EFTUD2 gene. Next-generation sequencing techniques, such as whole-exome sequencing offer great promise to improve the understanding of etiologies of sporadic genetic diseases. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Wang, Zhe; Deng, Qiong; Zhou, Tong; Yang, Hao; Gu, Zemao
2018-07-01
Although high diversity of parasitic ciliates has been reported in China, little is known about the species from high altitude areas, especially in Tibet. To investigate the species of parasitic ciliates in Tibet, a project was initiated in the Chabalang wetland in 2013. Two Trichodina species, namely, Trichodina sp. and T. reticulata Hirschmann & Partsch, 1955, were isolated from gills of an invasive fish, Micropercops swinhonis for the first time. In the present study, we provided the morphological, morphometrical, and molecular characterizations of the two species and conducted the phylogenetic analyses of mobilids based on the small subunit ribosomal RNA gene (SSU rDNA) sequences. Both morphological characters and morphometric data of the T. reticulata agreed well with previous studies. Although two partial SSU rDNA sequences were obtained in the present study, only the sequence of T. reticulata population in the present study was thought to be reliable. The other sequence may not belong to the other species. Thus, we regarded the other species isolated in the present study as Trichodina sp. to avoid the wrong or confused species identification. Morphologically, Trichodina sp. is distinguished mainly by its large body shape with a broad adhesive disk, robust and obliquely quadrilateral blades, and well-developed rays. T. reticulata is mainly characterized with the 8-12 spherical or elliptical granules in the central zone of adhesive disk. Phylogenetic analyses consistently showed the two ectoparasites clustered with freshwater species of the genus Trichodina within the order Mobilida. Our study extended the host range of T. reticulata and supplemented the molecular data. Also, results reveal that invasion of exotic fish may cause a potential threat to native fish by introducing or dispersing parasitic ciliates.
Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan
2014-04-23
Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.
Shah, Dheeraj; Singh, Meenakshi; Gupta, Piyush; Faridi, M M A
2014-03-01
The aim of the present study was to evaluate whether the order of complementary feeding in relation to breast-feeding affects breast milk, semisolid, or total energy intake in infants. The present study was designed as a randomized crossover trial. The study was conducted in a tertiary care hospital. The study participants were 25 healthy infants between the ages of 7 and 11 months who were exclusively breast-fed for at least 6 months and were now receiving complementary foods for at least 1 month in addition to breast-feeding. Infants were randomized to follow a sequence of either complementary feeding before breast-feeding (sequence A) or complementary feeding after breast-feeding (sequence B) for the first day (24 hours) of the study period using simple randomization. For the next day, the sequence was reversed for each child. All babies received 3 actively fed complementary food meals per day (morning, afternoon, and evening). A semisolid study diet was prepared in the hospital by cooking rice and pulse with oil using a standard method, ensuring the energy density of at least 0.6 kcal/g. The infants were allowed ad libitum breast-feeding during the observation period. Semisolid intake was directly measured and breast milk intake was quantified by test weighing method. Energy intake from complementary foods was calculated from the product of energy density of the diet served on that day and the total amount consumed. The total energy intake and energy intake from breast milk and complementary foods between the 2 sequences were compared. The mean (standard deviation) energy intake from breast milk during 12 hours of daytime by following sequence A (complementary feeding before breast-feeding) was 132.0 (67.4) kcal in comparison with 135.9 (56.2) kcal in sequence B, which was not statistically different (P = 0.83). The mean (standard deviation) energy consumed from semisolids in sequences A and B was also comparable (88.6 [75.5] kcal vs. 85.5 [89.7] kcal; P = 0.58). The total energy intake during daytime in sequence A was 220.6 (96.2) kcal in comparison with 221.5 (94.0) kcal in sequence B, which was also comparable (P = 0.97). The results related to energy intake through breast milk and total energy intake were not different when insensible losses during feeding were adjusted in both groups. Altering the sequence of complementary feeding in relation to breast-feeding does not affect total energy intake.
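The energy calculation described in this abstract (energy intake from complementary food = energy density of the diet × amount consumed) can be illustrated with a minimal Python sketch. The function name and the 150 g intake are hypothetical and chosen only for illustration; the 0.6 kcal/g figure is the minimum energy density stated in the abstract.

```python
def energy_from_semisolids(energy_density_kcal_per_g: float,
                           amount_consumed_g: float) -> float:
    """Energy intake (kcal) = energy density (kcal/g) x amount consumed (g)."""
    if energy_density_kcal_per_g < 0 or amount_consumed_g < 0:
        raise ValueError("energy density and amount consumed must be non-negative")
    return energy_density_kcal_per_g * amount_consumed_g


# Hypothetical example: an infant eats 150 g of a diet at 0.6 kcal/g.
example_kcal = energy_from_semisolids(0.6, 150.0)
print(f"{example_kcal:.1f} kcal")
```

Total daytime energy intake would then be this value plus the energy from breast milk, which the study estimated separately by test weighing.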
RefSeq microbial genomes database: new representation and annotation strategy.
Tatusova, Tatiana; Ciufo, Stacy; Fedorov, Boris; O'Neill, Kathleen; Tolstoy, Igor
2014-01-01
The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.nlm.nih.gov/genome. Next-generation sequencing has enabled researchers to perform genomic sequencing at rates that were unimaginable in the past. Microbial genomes can now be sequenced in a matter of hours, which has led to a significant increase in the number of assembled genomes deposited in the public archives. This huge increase in DNA sequence data presents new challenges for the annotation, analysis and visualization bioinformatics tools. New strategies have been developed for the annotation and representation of reference genomes and sequence variations derived from population studies and clinical outbreaks.
Trudeau, Natacha; Morford, Jill P; Sutton, Ann
2010-06-01
Graphic symbols are often used to represent words in Augmentative and Alternative Communication systems. Previous findings suggest that different processes operate when using graphic symbols and when using speech. This study assessed the ability of native speakers of French with no communication disorders from four age groups to interpret graphic-symbol sequences of varying length and canonicity. Results reveal that, as they get older, participants show an increase in their capacity to interpret graphic-symbol sequences. Constituent order played an important role in the interpretation of the sequences. However, the specific word-order strategies used varied depending on the age group and the type of sequence presented.
Karaboga, D; Aslan, S
2016-04-27
The great majority of biological sequences share significant similarity with other sequences as a result of evolutionary processes, and identifying these sequence similarities is one of the most challenging problems in bioinformatics. In this paper, we present a discrete artificial bee colony (ABC) algorithm, which is inspired by the intelligent foraging behavior of real honey bees, for the detection of highly conserved residue patterns or motifs within sequences. Experimental studies on three different data sets showed that the proposed discrete model, by adhering to the fundamental scheme of the ABC algorithm, produced competitive or better results than other metaheuristic motif discovery techniques.
Cicek, Mustafa; Mutlu, Ozal; Erdemir, Aysegul; Ozkan, Ebru; Saricay, Yunus; Turgut-Balik, Dilek
2013-06-01
One of the most important steps in structure-based drug design studies is obtaining the protein in active form after cloning the target gene. In one of our previous studies, it was determined that an internal Shine-Dalgarno-like sequence present just before the third methionine at the N-terminus of the wild type lactate dehydrogenase enzyme of Plasmodium falciparum prevents the translation of full length protein. Inspection of the same region in P. vivax LDH, which was overproduced as an active enzyme, indicated that the codon preference in the same region was slightly different from the codon preference of wild type PfLDH. In this study, the 5'-GGAGGC-3' sequence of P. vivax that codes for two glycine residues just before the third methionine was exchanged to 5'-GGAGGA-3', mimicking P. falciparum LDH, to test the possible effects of having an internal SD-like sequence when expressing a eukaryotic protein in a prokaryotic system. The exchange was made by site-directed mutagenesis. Results indicated that having two glycine residues with an internal SD-like sequence (GGAGGA) just before the third methionine abolishes the enzyme activity due to the preference of the prokaryotic system used for the expression. This study emphasizes the need for awareness when using a prokaryotic system to overproduce a eukaryotic protein.
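To make the point about synonymous codons concrete, the following Python sketch checks that both hexamers named in the abstract (GGAGGC and GGAGGA) encode Gly-Gly under the standard genetic code, and scans a coding sequence for the internal SD-like motif. This is an illustrative sketch, not the authors' method; the function names and the toy sequences are hypothetical and are not the real LDH genes.

```python
from typing import List

# All four GGN codons encode glycine in the standard genetic code.
GLYCINE_CODONS = {"GGA", "GGC", "GGG", "GGT"}

# Internal Shine-Dalgarno-like hexamer named in the abstract.
SD_LIKE_MOTIF = "GGAGGA"


def encodes_gly_gly(hexamer: str) -> bool:
    """True if the six nucleotides read as two consecutive glycine codons."""
    hexamer = hexamer.upper()
    return (len(hexamer) == 6
            and hexamer[:3] in GLYCINE_CODONS
            and hexamer[3:] in GLYCINE_CODONS)


def find_sd_like_sites(coding_sequence: str, motif: str = SD_LIKE_MOTIF) -> List[int]:
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    seq = coding_sequence.upper()
    width = len(motif)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == motif]


# Toy sequences (hypothetical): the same Gly-Gly pair written two ways.
without_motif = "ATGAAAGGAGGCATGTTT"
with_motif = "ATGAAAGGAGGAATGTTT"

print(encodes_gly_gly("GGAGGC"), encodes_gly_gly("GGAGGA"))
print(find_sd_like_sites(without_motif), find_sd_like_sites(with_motif))
```

A scan like this only flags candidate sites; whether a given motif actually interferes with translation depends on the expression host and sequence context, as the abstract reports experimentally.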
Ayyagari, Vijaya Sai; Sreerama, Krupanidhi
2017-08-01
Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe, causing significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and has spread to different parts of the world through introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country, posing a serious threat to crops and also to human health. With the objective of evaluating the genetic diversity of this snail, we sampled it from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. In addition, we studied the phylogenetic relationships of the isolates sequenced in the present study in relation to other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with their conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out to study the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.
Spatial serial order processing in schizophrenia.
Fraser, David; Park, Sohee; Clark, Gina; Yohanna, Daniel; Houk, James C
2004-10-01
The aim of this study was to examine serial order processing deficits in 21 schizophrenia patients and 16 age- and education-matched healthy controls. In a spatial serial order working memory task, one to four spatial targets were presented in a randomized sequence. Subjects were required to remember the locations and the order in which the targets were presented. Patients showed a marked deficit in ability to remember the sequences compared with controls. Increasing the number of targets within a sequence resulted in poorer memory performance for both control and schizophrenia subjects, but the effect was much more pronounced in the patients. Targets presented at the end of a long sequence were more vulnerable to memory error in schizophrenia patients. Performance deficits were not attributable to motor errors, but to errors in target choice. The results support the idea that the memory errors seen in schizophrenia patients may be due to saturating the working memory network at relatively low levels of memory load.
Molecular taxonomic techniques such as DNA barcoding offer interesting new capabilities for studying community biodiversity for applications like biological monitoring. Beyond DNA barcoding, new DNA sequencing technologies (i.e. Next-Generation Sequencing) present even greater po...
Next generation sequencing applications for microRNA biomarker discovery in toxicological studies
Next Generation Sequencing (NGS) technology will be reviewed for its base pair resolution, wide dynamic range, and insights into the genome and transcriptome, with special focus upon the biomarker potential of microRNAs (miRNAs). The first part of this presentation reviews commo...
Mali, Brahim; Grohme, Markus A; Förster, Frank; Dandekar, Thomas; Schnölzer, Martina; Reuter, Dirk; Wełnicz, Weronika; Schill, Ralph O; Frohme, Marcus
2010-03-12
The phenomenon of desiccation tolerance, also called anhydrobiosis, involves the ability of an organism to survive the loss of almost all cellular water without sustaining irreversible damage. Although there are several physiological, morphological and ecological studies on tardigrades, only limited DNA sequence information is available. Therefore, we explored the transcriptome in the active and anhydrobiotic state of the tardigrade Milnesium tardigradum which has extraordinary tolerance to desiccation and freezing. In this study, we present the first overview of the transcriptome of M. tardigradum and its response to desiccation and discuss potential parallels to stress responses in other organisms. We sequenced a total of 9984 expressed sequence tags (ESTs) from two cDNA libraries from the eutardigrade M. tardigradum in its active and inactive, anhydrobiotic (tun) stage. Assembly of these ESTs resulted in 3283 putative unique transcripts, whereof approximately 50% showed significant sequence similarity to known genes. The resulting unigenes were functionally annotated using the Gene Ontology (GO) vocabulary. A GO term enrichment analysis revealed several GOs that were significantly underrepresented in the inactive stage. Furthermore we compared the putative unigenes of M. tardigradum with ESTs from two other eutardigrade species that are available from public sequence databases, namely Richtersius coronifer and Hypsibius dujardini. The processed sequences of the three tardigrade species revealed similar functional content and the M. tardigradum dataset contained additional sequences from tardigrades not present in the other two. This study describes novel sequence data from the tardigrade M. tardigradum, which significantly contributes to the available tardigrade sequence data and will help to establish this extraordinary tardigrade as a model for studying anhydrobiosis. Functional comparison of active and anhydrobiotic tardigrades revealed a differential distribution of Gene Ontology terms associated with chromatin structure and the translation machinery, which are underrepresented in the inactive animals. These findings imply a widespread metabolic response of the animals on dehydration. The collective tardigrade transcriptome data will serve as a reference for further studies and support the identification and characterization of genes involved in the anhydrobiotic response. PMID:20226016
Synchronized tapping facilitates learning sound sequences as indexed by the P300
Kamiyama, Keiko S.; Okanoya, Kazuo
2014-01-01
The purpose of the present study was to determine whether and how single finger tapping in synchrony with sound sequences contributed to the auditory processing of them. The participants learned two unfamiliar sound sequences via different methods. In the tapping condition, they learned an auditory sequence while they tapped in synchrony with each sound onset. In the no tapping condition, they learned another sequence while they kept pressing a key until the sequence ended. After these learning sessions, we presented the two melodies again and recorded event-related potentials (ERPs). During the ERP recordings, 10% of the tones within each melody deviated from the original tones. An analysis of the grand average ERPs showed that deviant stimuli elicited a significant P300 in the tapping but not in the no-tapping condition. In addition, the significance of the P300 effect in the tapping condition increased as the participants showed highly synchronized tapping behavior during the learning sessions. These results indicated that single finger tapping promoted the conscious detection and evaluation of deviants within the learned sequences. The effect was related to individuals’ musical ability to coordinate their finger movements along with external auditory events. PMID:25400564
Sequence and Structure Dependent DNA-DNA Interactions
NASA Astrophysics Data System (ADS)
Kopchick, Benjamin; Qiu, Xiangyun
Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention, and "random" DNA sequences are typically used in biophysical studies. However, ~50% of the human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present a series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in the genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate on the causes in terms of the differences in hydration shell and DNA surface structures.
Infants learn better from left to right: a directional bias in infants' sequence learning.
Bulf, Hermann; de Hevia, Maria Dolores; Gariboldi, Valeria; Macchi Cassia, Viola
2017-05-26
A wealth of studies show that human adults map ordered information onto a directional spatial continuum. We asked whether mapping ordinal information into a directional space constitutes an early predisposition, already functional prior to the acquisition of symbolic knowledge and language. While it is known that preverbal infants represent numerical order along a left-to-right spatial continuum, no studies have investigated yet whether infants, like adults, organize any kind of ordinal information onto a directional space. We investigated whether 7-month-olds' ability to learn high-order rule-like patterns from visual sequences of geometric shapes was affected by the spatial orientation of the sequences (left-to-right vs. right-to-left). Results showed that infants readily learn rule-like patterns when visual sequences were presented from left to right, but not when presented from right to left. This result provides evidence that spatial orientation critically determines preverbal infants' ability to perceive and learn ordered information in visual sequences, opening to the idea that a left-to-right spatially organized mental representation of ordered dimensions might be rooted in biologically-determined constraints on human brain development.
Detection of Plasmodium sp. in capybara.
dos Santos, Leonilda Correia; Curotto, Sandra Mara Rotter; de Moraes, Wanderlei; Cubas, Zalmir Silvino; Costa-Nascimento, Maria de Jesus; de Barros Filho, Ivan Roque; Biondo, Alexander Welker; Kirchgatter, Karin
2009-07-07
In the present study, we microscopically and molecularly surveyed blood samples from 11 captive capybaras (Hydrochaeris hydrochaeris) from the Sanctuary Zoo for Plasmodium sp. infection. One animal was positive on blood smear by light microscopy. Polymerase chain reaction was then carried out using a nested genus-specific protocol, which uses oligonucleotides from conserved sequences flanking a variable sequence region in the small subunit ribosomal RNA (ssrRNA) of all Plasmodium organisms. This revealed three positive animals. Products from two samples were purified and sequenced. The results showed less than 1% divergence between the two capybara sequences. When compared with GenBank sequences, a 55% similarity was obtained to Toxoplasma gondii and a higher similarity (73-77.2%) was found to ssrRNAs from Plasmodium species that infect reptiles, birds, rodents, and humans. The most similar Plasmodium sequence was from Plasmodium mexicanum, which infects lizards of North America, where around 78% identity was found. This work is the first report of Plasmodium in capybaras, and due to the low similarity with other Plasmodium species, we suggest it is a new species, which could in the future be named "Plasmodium hydrochaeri".
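The similarity and divergence figures quoted in this abstract are pairwise comparisons of aligned sequences. As a rough illustration of how such a figure can be computed, the Python sketch below calculates percent identity over the aligned, gap-free columns of two pre-aligned sequences. This is an assumption-laden sketch: the abstract does not state which identity definition or software was used, conventions for handling gaps differ between tools, and the function name and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither aligned sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns (one of several possible conventions)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


# Toy alignment (hypothetical): 9 of 10 compared columns match.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
divergence = 100.0 - identity
print(f"identity {identity:.1f}%, divergence {divergence:.1f}%")
```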
Biomarker Discovery and Mechanistic Studies of Prostate Cancer using Targeted Proteomic Approaches
2012-07-01
basigin in Drosophila) tightly regulates cytoskeleton rearrangement in Drosophila melanogaster [23]. Based on the present results and the existing...from OligoEngine according to the manufacturer’s instruction. Plasmids were amplified in DH5α cell and confirmed by sequencing. Subconfluent cell...electrophoresis and the results are shown in Figure 1 (Panel C). The RT-PCR products were cloned and subjected to DNA sequencing. The sequencing
$D_\infty$-differential $A_\infty$-algebras and spectral sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapin, S V
2002-02-28
In the present paper the construction of a $D_\infty$-differential $A_\infty$-(co)algebra is introduced and basic homotopy properties of this construction are studied. The connection between $D_\infty$-differential $A_\infty$-(co)algebras and spectral sequences is established, which enables us to construct the structure of an $A_\infty$-coalgebra on the Milnor coalgebra directly from the differentials of the Adams spectral sequence.
Bengtsson, Johan; Eriksson, K Martin; Hartmann, Martin; Wang, Zheng; Shenoy, Belle Damodara; Grelet, Gwen-Aëlle; Abarenkov, Kessy; Petri, Anna; Rosenblad, Magnus Alm; Nilsson, R Henrik
2011-10-01
The ribosomal small subunit (SSU) rRNA gene has emerged as an important genetic marker for taxonomic identification in environmental sequencing datasets. In addition to being present in the nucleus of eukaryotes and the core genome of prokaryotes, the gene is also found in the mitochondria of eukaryotes and in the chloroplasts of photosynthetic eukaryotes. These three sets of genes are conceptually paralogous and should in most situations not be aligned and analyzed jointly. To identify the origin of SSU sequences in complex sequence datasets has hitherto been a time-consuming and largely manual undertaking. However, the present study introduces Metaxa ( http://microbiology.se/software/metaxa/ ), an automated software tool to extract full-length and partial SSU sequences from larger sequence datasets and assign them to an archaeal, bacterial, nuclear eukaryote, mitochondrial, or chloroplast origin. Using data from reference databases and from full-length organelle and organism genomes, we show that Metaxa detects and scores SSU sequences for origin with very low proportions of false positives and negatives. We believe that this tool will be useful in microbial and evolutionary ecology as well as in metagenomics.
SEQATOMS: a web tool for identifying missing regions in PDB in sequence context.
Brandt, Bernd W; Heringa, Jaap; Leunissen, Jack A M
2008-07-01
With over 46 000 proteins, the Protein Data Bank (PDB) is the most important database with structural information of biological macromolecules. PDB files contain sequence and coordinate information. Residues present in the sequence can be absent from the coordinate section, which means their position in space is unknown. Similarity searches are routinely carried out against sequences taken from PDB SEQRES. However, these searches make no distinction between residues that have a known position in the 3D protein structure and those that do not. We present a FASTA sequence database that is produced by combining the sequence and coordinate information. All residues absent from the PDB coordinate section are masked with lower-case letters, thereby providing a view of these residues in the context of the entire protein sequence, which facilitates inspecting 'missing' regions. We also provide a masked version of the CATH domain database. A user-friendly BLAST interface is available for similarity searching. In contrast to standard (stand-alone) BLAST output, which only contains upper-case letters, our output retains the lower-case letters of the masked regions. Thus, our server can be used to perform BLAST searching case-sensitively. Here, we have applied it to the study of missing regions in their sequence context. SEQATOMS is available at http://www.bioinformatics.nl/tools/seqatoms/.
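The masking idea described in this abstract (residues present in the sequence but absent from the coordinate section are written in lower case) can be sketched in a few lines of Python. This is only an illustration of the principle, not the SEQATOMS implementation: the function name, the toy sequence, and the set of observed positions are hypothetical, and a real pipeline would first have to align SEQRES records with the residues in the coordinate section.

```python
from typing import Set


def mask_missing_residues(seqres: str, observed_positions: Set[int]) -> str:
    """Upper-case residues with known coordinates, lower-case all others.

    `observed_positions` holds 0-based indices into `seqres` for residues
    that appear in the coordinate section.
    """
    return "".join(
        residue.upper() if index in observed_positions else residue.lower()
        for index, residue in enumerate(seqres)
    )


# Toy example (hypothetical): the two terminal residues on each side lack coordinates.
masked = mask_missing_residues("MKTAYIAK", {2, 3, 4, 5})
print(masked)  # mkTAYIak
```

Keeping the full sequence and changing only the case preserves residue numbering and context, which is what lets a case-aware search report hits that fall in unresolved regions.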
IMRT sequencing for a six-bank multi-leaf system.
Topolnjak, R; van der Heide, U A; Lagendijk, J J W
2005-05-07
In this study, we present a sequencer for delivering step-and-shoot IMRT using a six-bank multi-leaf system. Such a system was proposed earlier and combines a high-resolution field-shaping ability with a large field size. It consists of three layers of two opposing leaf banks with 1 cm leaves. The layers are rotated relative to each other at 60 degrees. A low-resolution mode of sequencing is achieved by using one layer of leaves as primary MLC, while the other two are used to improve back-up collimation. For high-resolution sequencing, an algorithm is presented that creates segments shaped by all six banks. Compared to a hypothetical mini-MLC with 0.4 cm leaves, a similar performance can be achieved, but a trade-off has to be made between accuracy and the number of segments.
Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.
Müller, Romy; Roberts, Charlotte A; Brown, Terence A
2014-04-22
The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.
Compressing DNA sequence databases with coil
White, W Timothy J; Hendy, Michael D
2008-01-01
Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work. PMID:18489794
The full mitochondrial genome sequence of Raillietina tetragona from chicken (Cestoda: Davaineidae).
Liang, Jian-Ying; Lin, Rui-Qing
2016-11-01
In the present study, the complete mitochondrial DNA (mtDNA) sequence of Raillietina tetragona was determined, and its gene content and genome organization were compared with those of other tapeworms. The complete mt genome sequence of R. tetragona is 14,444 bp in length. It contains 12 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and two non-coding regions. All genes are transcribed in the same direction and have a nucleotide composition high in A and T. The A + T content of the complete mt genome of R. tetragona is 71.4%. The R. tetragona mt genome sequence provides a novel mtDNA marker for studying the molecular epidemiology and population genetics of Raillietina and has implications for the molecular diagnosis of chicken cestodosis caused by Raillietina.
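A figure such as the reported A + T content can be computed from a sequence with a few lines of code. The sketch below is an illustration only: the function name `at_content` is hypothetical, and the choice to ignore ambiguous bases such as N is an assumption, not something stated in the abstract.

```python
def at_content(seq: str) -> float:
    """Percentage of A and T among unambiguous bases (A, C, G, T) in seq."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    at = sum(1 for base in counted if base in "AT")
    return 100.0 * at / len(counted)


# Toy sequence, not taken from the R. tetragona genome.
print(f"{at_content('AATTGC'):.1f}% A + T")
```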
Palau, Montserrat; Boujida, Nadia; Manresa, Àngels; Miñana-Galbis, David
2018-04-19
The complete genome sequence of the halophilic strain Marinobacter flavimaris LMG 23834 T is presented here. The genomic information of this type strain will be useful for taxonomic purposes and for its potential use in bioremediation studies. Copyright © 2018 Palau et al.
Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong
2015-03-01
The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under the accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumoniae and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
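The reported ORF length, protein length and molecular weight can be checked against each other with simple arithmetic. The sketch below assumes that the 567 bp ORF length includes the stop codon, which is consistent with the figures in the abstract but is not stated there; the function name `orf_protein_length` is hypothetical.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The stop codon is part of the ORF but does not encode a residue.
    return codons - 1 if includes_stop else codons


residues = orf_protein_length(567)        # 189 codons, one of them the stop codon
mean_residue_mass = 20.4 * 1000 / 188     # Da per residue, from the reported 20.4 kDa

print(residues, round(mean_residue_mass, 1))
```

A mean residue mass a little under 110 Da is typical for proteins, so the three reported values are mutually consistent under the stated assumption.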
DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG
2015-01-01
The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under the accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumoniae and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Danley, Patrick D; Mullen, Sean P; Liu, Fenglong; Nene, Vishvanath; Quackenbush, John; Shaw, Kerry L
2007-01-01
Background As the developmental costs of genomic tools decline, genomic approaches to non-model systems are becoming more feasible. Many of these systems may lack advanced genetic tools but are extremely valuable models in other biological fields. Here we report the development of expressed sequence tags (ESTs) in an orthopteroid insect, a model for the study of neurobiology, speciation, and evolution. Results We report the sequencing of 14,502 ESTs from clones derived from a nerve cord cDNA library, and the subsequent construction of a Gene Index from these sequences, from the Hawaiian trigonidiine cricket Laupala kohalensis. The Gene Index contains 8607 unique sequences comprising 2575 tentative consensus (TC) sequences and 6032 singletons. For each of the unique sequences, an attempt was made to assign a provisional annotation and to categorize its function using a Gene Ontology-based classification through a sequence-based comparison to known proteins. In addition, a set of unique 70 base pair oligomers that can be used for DNA microarrays was developed. All Gene Index information is posted at the DFCI Gene Indices web page. Conclusion Orthopterans are models used to understand the neurophysiological basis of complex motor patterns such as flight and stridulation. The sequences presented in the cricket Gene Index will provide neurophysiologists with many genetic tools that have been largely absent in this field. The cricket Gene Index is one of only two gene indices to be developed in an evolutionary model system. Species within the genus Laupala have speciated recently, rapidly, and extensively. Therefore, the genes identified in the cricket Gene Index can be used to study the genomics of speciation. Furthermore, this gene index represents a significant EST resource for basal insects. As such, this resource is a valuable comparative tool for the understanding of invertebrate molecular evolution.
The sequences presented here will provide much needed genomic resources for three distinct but overlapping fields of inquiry: neurobiology, speciation, and molecular evolution. PMID:17459168
Bankov, Katrin; Döring, Claudia; Schneider, Markus; Hartmann, Sylvia; Winkelmann, Ria; Albert, Joerg G; Bechstein, Wolf Otto; Zeuzem, Stefan; Hansmann, Martin Leo; Peveling-Oberhag, Jan; Walter, Dirk
2018-04-30
Definite diagnosis and therapeutic management of cholangiocarcinoma (CCA) remains a challenge. The aim of the current study was to investigate feasibility and potential impact on clinical management of targeted sequencing of intraductal biopsies. Intraductal biopsies with suspicious findings from 16 patients with CCA in later clinical course were analyzed with targeted sequencing including tumor and control benign tissue (n = 55 samples). A CCA-specific sequencing panel containing 41 genes was designed and a dual strand targeted enrichment was applied. Sequencing was successfully performed for all samples. In total, 79 mutations were identified and a mean of 1.7 mutations per tumor sample (range 0-4) as well as 2.3 per biopsy (0-6) were detected and potentially therapeutically relevant genes were identified in 6/16 cases. In 14/18 (78%) biopsies with dysplasia or inconclusive findings at least one mutation was detected. The majority of mutations were found in both surgical specimen and biopsy (68%), while 28% were only present in biopsies in contrast to 4% being only present in the surgical tumor specimen. Targeted sequencing from intraductal biopsies is feasible and potentially improves the diagnostic yield. A profound genetic heterogeneity in biliary dysplasia needs to be considered in clinical management and warrants further investigation. The current study is the first to demonstrate the feasibility of sequencing of intraductal biopsies which holds the potential to impact diagnostic and therapeutical management of patients with biliary dysplasia and neoplasia.
2014-01-01
Background DNA repeats, such as transposable elements, minisatellites and palindromic sequences, are abundant in sequences and have been shown to have significant and functional roles in the evolution of the host genomes. In a previous study, we introduced the concept of a repeat DNA module, a flexible motif present in at least two occurrences in the sequences. This concept was embedded into ModuleOrganizer, a tool allowing the detection of repeat modules in a set of sequences. However, its implementation remains difficult for larger sequences. Results Here we present Visual ModuleOrganizer, a Java graphical interface that enables a new and optimized version of the ModuleOrganizer tool. To implement this version, it was recoded in C++ with compressed suffix tree data structures. This leads to less memory usage (at least a 120-fold decrease on average) and reduces the computation time of the module detection process in large sequences at least fourfold. The Visual ModuleOrganizer interface allows users to easily choose ModuleOrganizer parameters and to graphically display the results. Moreover, Visual ModuleOrganizer dynamically handles graphical results through four main parameters: gene annotations, overlapping modules with known annotations, location of the module in a minimal number of sequences, and the minimal length of the modules. As a case study, the analysis of FoldBack4 sequences clearly demonstrated that our tools can be extended to comparative and evolutionary analyses of any repeat sequence elements in a set of genomic sequences. With the increasing number of sequences available in public databases, it is now possible to perform comparative analyses of repeated DNA modules in a graphic and friendly manner within a reasonable time period. Availability The Visual ModuleOrganizer interface and the new version of the ModuleOrganizer tool are freely available at: http://lcb.cnrs-mrs.fr/spip.php?rubrique313. PMID:24678954
Van Neste, Christophe; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip
2016-01-01
It is difficult to predict if and when massively parallel sequencing of forensic STR loci will replace capillary electrophoresis as the new standard technology in forensic genetics. The main benefits of sequencing are increased multiplexing scales and SNP detection. There is not yet a consensus on how sequenced profiles should be reported. We present the Forensic Loci Allele Database (FLAD) service, made freely available on http://forensic.ugent.be/FLAD/. It offers permanent identifiers for sequenced forensic alleles (STR or SNP) and their microvariants for use in forensic allele nomenclature. Analogous to Genbank, its aim is to provide permanent identifiers for forensically relevant allele sequences. Researchers that are developing forensic sequencing kits or are performing population studies, can register on http://forensic.ugent.be/FLAD/ and add loci and allele sequences with a short and simple application interface (API). Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Lim, Hassol; Park, Young-Mi; Lee, Jong-Keuk; Taek Lim, Hyun
2016-10-01
To present an efficient and successful application of a single-exome sequencing study in a family clinically diagnosed with X-linked retinitis pigmentosa. Exome sequencing study based on clinical examination data. An 8-year-old proband and his family. The proband and his family members underwent comprehensive ophthalmologic examinations. Exome sequencing was undertaken in the proband using Agilent SureSelect Human All Exon Kit and Illumina HiSeq 2000 platform. Bioinformatic analysis used Illumina pipeline with Burrows-Wheeler Aligner-Genome Analysis Toolkit (BWA-GATK), followed by ANNOVAR to perform variant functional annotation. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation. Analysis of exome sequence data identified a novel frameshift mutation in RP2 gene resulting in a premature stop codon (c.665delC, p.Pro222fsTer237). Sanger sequencing revealed this mutation co-segregated with the disease phenotype in the child's family. We identified a novel causative mutation in RP2 from a single proband's exome sequence data analysis. This study highlights the effectiveness of the whole-exome sequencing in the genetic diagnosis of X-linked retinitis pigmentosa, over the conventional sequencing methods. Even using a single exome, exome sequencing technology would be able to pinpoint pathogenic variant(s) for X-linked retinitis pigmentosa, when properly applied with aid of adequate variant filtering strategy. Copyright © 2016 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.
Lange, Nicholas D; Thomas, Rick P; Buttaccio, Daniel R; Illingworth, David A; Davelaar, Eddy J
2013-02-01
Although temporal dynamics are inherent aspects of diagnostic tasks, few studies have investigated how various aspects of time course influence hypothesis generation. An experiment is reported that demonstrates that working memory dynamics operating during serial data acquisition bias hypothesis generation. The presentation rate (and order) of a sequence of serially presented symptoms was manipulated to be either fast (180 ms per symptom) or slow (1,500 ms per symptom) in a simulated medical diagnosis task. When the presentation rate was slow, participants chose the disease hypothesis consistent with the symptoms appearing later in the sequence. When the presentation rate was fast, however, participants chose the disease hypothesis consistent with the symptoms appearing earlier in the sequence, therefore representing a novel primacy effect. We predicted and account for this effect through competitive working memory dynamics governing information acquisition and the contribution of maintained information to the retrieval of hypotheses from long-term memory.
Nielsen, Tue Kjærgaard; Hansen, Lars Hestbjerg
2015-01-01
Sphingomonas sp. SRS2 was the first described pure strain that is capable of mineralizing the phenylurea herbicide isoproturon and some of its related compounds. This strain has been studied thoroughly and shows potential for bioremediation purposes. We present the draft genome sequence of this bacterium, which will aid future studies. PMID:26021936
Liu, Fei; Wu, Xiao-Li; Liu, Ying; Chen, Da-Xia; Zhang, De-Li; Yang, Da-Jian
2016-02-01
Isaria farinosa is a pathogen of the host of Ophiocordyceps sinensis. The present study reviewed progress in the molecular biology of I. farinosa through a bibliometric analysis of the literature and of the sequences (including gene sequences) of I. farinosa deposited in NCBI. The results indicated that countries differed in the number of papers published and in the kinds and numbers of sequences (including gene sequences) deposited. China had published the most papers and had deposited the most sequences (including gene sequences). America had deposited the most functional genes. Studies of the pathogen focused mainly on biological control, and molecular studies concentrated on phylogenetic classification. In recent years, some protease genes and chitinase genes have been studied. With its growing impact on the health of O. sinensis, and as the whole sequence and more of the pharmacological activities of I. farinosa become publicly known, research on the molecular biology of I. farinosa is expected to become deeper and wider. Copyright© by the Chinese Pharmaceutical Association.
PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics
2012-01-01
Background The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource. Description With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. 
Conclusion As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop. PMID:22712730
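The read and base counts given in the Description imply a mean read length for each sequencing technology. The short sketch below derives those values; only the counts come from the abstract, while the dictionary layout and the function name `mean_read_length` are illustrative.

```python
# (reads, bases) per technology, as reported in the abstract.
datasets = {
    "Sanger": (187_636, 103_685_419),
    "Roche 454": (1_165_168, 333_862_593),
    "Illumina": (57_135_995, 4_073_740_115),
}


def mean_read_length(reads: int, bases: int) -> float:
    """Average number of bases per read."""
    return bases / reads


mean_lengths = {name: mean_read_length(r, b) for name, (r, b) in datasets.items()}
total_reads = sum(r for r, _ in datasets.values())

for name, length in mean_lengths.items():
    print(f"{name}: {length:.1f} bases per read")
print(f"total reads integrated: {total_reads:,}")
```

The derived means show that the assembly combines reads of very different lengths, from several hundred bases for Sanger reads to well under one hundred for Illumina reads.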
Hahnke, Sarah; Abendroth, Christian; Langer, Thomas; Codoñer, Francisco M; Ramm, Patrice; Porcar, Manuel; Luschnig, Olaf; Klocke, Michael
2018-04-05
A new Ruminococcaceae bacterium, strain HV4-5-B5C, participating in the anaerobic digestion of grass, was isolated from a mesophilic two-stage laboratory-scale leach bed biogas system. The draft annotated genome sequence presented in this study and 16S rRNA gene sequence analysis indicated the affiliation of HV4-5-B5C with the family Ruminococcaceae outside recently described genera. Copyright © 2018 Hahnke et al.
Cankar, Katarina; Chauvensy-Ancel, Valérie; Fortabat, Marie-Noelle; Gruden, Kristina; Kobilinsky, André; Zel, Jana; Bertheau, Yves
2008-05-15
Detection of nonauthorized genetically modified organisms (GMOs) has always presented an analytical challenge because the complete sequence data needed to detect them are generally unavailable although sequence similarity to known GMOs can be expected. A new approach, differential quantitative polymerase chain reaction (PCR), for detection of nonauthorized GMOs is presented here. This method is based on the presence of several common elements (e.g., promoter, genes of interest) in different GMOs. A statistical model was developed to study the difference between the number of molecules of such a common sequence and the number of molecules identifying the approved GMO (as determined by border-fragment-based PCR) and the donor organism of the common sequence. When this difference differs statistically from zero, the presence of a nonauthorized GMO can be inferred. The interest and scope of such an approach were tested on a case study of different proportions of genetically modified maize events, with the P35S promoter as the Cauliflower Mosaic Virus common sequence. The presence of a nonauthorized GMO was successfully detected in the mixtures analyzed and in the presence of Cauliflower Mosaic Virus (donor organism of P35S promoter). This method could be easily transposed to other common GMO sequences and other species and is applicable to other detection areas such as microbiology.
Weber, Stefanie; Büscher, Anja K; Hagmann, Henning; Liebau, Max C; Heberle, Christian; Ludwig, Michael; Rath, Sabine; Alberer, Martin; Beissert, Antje; Zenker, Martin; Hoyer, Peter F; Konrad, Martin; Klein, Hanns-Georg; Hoefele, Julia
2016-01-01
Steroid-resistant nephrotic syndrome (SRNS) is a severe cause of progressive renal disease. Genetic forms of SRNS can present with autosomal recessive or autosomal dominant inheritance. Recent studies have identified mutations in multiple podocyte genes responsible for SRNS. Improved sequencing methods (next-generation sequencing, NGS) now promise rapid mutational testing of SRNS genes. In the present study, a simultaneous screening of ten SRNS genes in 37 SRNS patients was performed by NGS. In 38 % of the patients, causative mutations in one SRNS gene were found. In 22 % of the patients, in addition to these mutations, a secondary variant in a different gene was identified. This high incidence of accumulating sequence variants was unexpected but, although they might have modifier effects, the pathogenic potential of these additional sequence variants seems unclear so far. The example of molecular diagnostics by NGS in SRNS patients shows that these new sequencing technologies might provide further insight into molecular pathogenicity in genetic disorders but will also generate results, which will be difficult to interpret and complicate genetic counseling. Although NGS promises more frequent identification of disease-causing mutations, the identification of causative mutations, the interpretation of incidental findings and possible pitfalls might pose problems, which hopefully will decrease by further experience and elucidation of molecular interactions.
Diversity of phytases in the rumen.
Nakashima, Brenda A; McAllister, Tim A; Sharma, Ranjana; Selinger, L Brent
2007-01-01
Examples of a new class of phytase related to protein tyrosine phosphatases (PTP) were recently isolated from several anaerobic bacteria from the rumen of cattle. In this study, the diversity of PTP-like phytase gene sequences in the rumen was surveyed by using the polymerase chain reaction (PCR). Two sets of degenerate primers were used to amplify sequences from rumen fluid total community DNA and genomic DNA from nine bacterial isolates. Four novel PTP-like phytase sequences were retrieved from rumen fluid, whereas all nine of the anaerobic bacterial isolates investigated in this work contained PTP-like phytase sequences. One isolate, Selenomonas lacticifex, contained two distinct PTP-like phytase sequences, suggesting that multiple phytate hydrolyzing enzymes are present in this bacterium. The degenerate primer and PCR conditions described here, as well as novel sequences obtained in this study, will provide a valuable resource for future studies on this new class of phytase. The observed diversity of microbial phytases in the rumen may account for the ability of ruminants to derive a significant proportion of their phosphorus requirements from phytate.
Helicobacter pylori Heat Shock Protein A: Serologic Responses and Genetic Diversity
Ng, Enders K. W.; Thompson, Stuart A.; Pérez-Pérez, Guillermo I.; Kansau, Imad; van der Ende, Arie; Labigne, Agnès; Sung, Joseph J. Y.; Chung, S. C. Sydney; Blaser, Martin J.
1999-01-01
Helicobacter pylori synthesizes an unusual GroES homolog, heat shock protein A (HspA). The present study was aimed at an assessment of the serological response to HspA in a group of Chinese patients with defined gastroduodenal pathologies and determination of whether diversity is present in the nucleotide sequences encoding HspA in isolates from these patients. Serum samples collected from 154 patients who had an upper gastrointestinal pathology and the presence of H. pylori defined by biopsy were tested for an immunoglobulin G (IgG) serologic response to H. pylori HspA by an enzyme-linked immunosorbent assay. HspA-encoding nucleotide sequences in H. pylori isolates from 14 patients (7 seropositive and 7 seronegative for HspA) were analyzed by PCR and direct sequencing of the PCR products. The sequencing results were compared to those of 48 isolates from other parts of the world. Of the 154 known H. pylori-positive patients, 54 (35.1%) were seropositive for HspA. The A domain (GroES homology) of HspA was highly conserved in the 14 isolates tested. Although the B domain (metal-binding site unique to H. pylori) resembled that in the known major variant, particular amino acid substitutions allowed definition of an HspA variant associated with isolates from East Asia. There were no associations between patient characteristics and HspA seropositivity or amino acid sequences. We confirmed in this study that the clinical outcomes of H. pylori infection are not related to HspA antigenicity or to sequence variation. However, B-domain sequence variation may be a marker for the study of the genetic diversity of H. pylori strains of different geographic origins. PMID:10225839
Not all (possibly) “random” sequences are created equal
Pincus, Steve; Kalman, Rudolf E.
1997-01-01
The need to assess the randomness of a single sequence, especially a finite sequence, is ubiquitous, yet is unaddressed by axiomatic probability theory. Here, we assess randomness via approximate entropy (ApEn), a computable measure of sequential irregularity, applicable to single sequences of both (even very short) finite and infinite length. We indicate the novelty and facility of the multidimensional viewpoint taken by ApEn, in contrast to classical measures. Furthermore and notably, for finite length, finite state sequences, one can identify maximally irregular sequences, and then apply ApEn to quantify the extent to which given sequences differ from maximal irregularity, via a set of deficit (defm) functions. The utility of these defm functions which we show allows one to considerably refine the notions of probabilistic independence and normality, is featured in several studies, including (i) digits of e, π, √2, and √3, both in base 2 and in base 10, and (ii) sequences given by fractional parts of multiples of irrationals. We prove companion analytic results, which also feature in a discussion of the role and validity of the almost sure properties from axiomatic probability theory insofar as they apply to specified sequences and sets of sequences (in the physical world). We conclude by relating the present results and perspective to both previous and subsequent studies. PMID:11038612
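Approximate entropy can be computed directly from its standard definition, ApEn(m, r, N) = Φ^m(r) − Φ^(m+1)(r), where Φ^m(r) is the mean log-frequency with which length-m blocks recur within tolerance r. The sketch below is a straightforward quadratic-time implementation for short sequences, not the authors' code. The default parameters (m = 1, and r = 0.5 so that binary symbols must match exactly) and the function names are assumptions chosen for illustration.

```python
import math
import random


def _phi(u, m, r):
    """Mean log-frequency of length-m blocks that match within tolerance r."""
    n = len(u) - m + 1
    blocks = [u[i:i + m] for i in range(n)]
    total = 0.0
    for a in blocks:
        # Max-norm distance between blocks; each block matches itself, so count >= 1.
        count = sum(1 for b in blocks
                    if max(abs(x - y) for x, y in zip(a, b)) <= r)
        total += math.log(count / n)
    return total / n


def approximate_entropy(u, m=1, r=0.5):
    """ApEn(m, r, N) of a numeric sequence u of length N."""
    if m < 1 or len(u) < m + 1:
        raise ValueError("need m >= 1 and at least m + 1 samples")
    return _phi(u, m, r) - _phi(u, m + 1, r)


constant = [1] * 20                       # perfectly regular: ApEn is exactly 0
alternating = [0, 1] * 10                 # regular with period 2: ApEn close to 0
rng = random.Random(0)
irregular = [rng.randint(0, 1) for _ in range(200)]   # pseudorandom bits

for name, seq in [("constant", constant), ("alternating", alternating),
                  ("irregular", irregular)]:
    print(name, round(approximate_entropy(seq), 3))
```

Regular sequences give values at or near zero, while irregular binary sequences approach the maximum of ln 2 for this choice of m; the gap from that maximum is the kind of shortfall the deficit functions in the abstract are designed to quantify.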
Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.
Liu, Yaling; Zhang, Pengfei; Song, Meiling; Hou, Junling; Qing, Mei; Wang, Wenquan; Liu, Chunsheng
2015-01-01
Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSRs). Sixty-four of 69 randomly designed and synthesized SSR primer pairs were successfully amplified, and 33 primer pairs were polymorphic in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice. PMID:26571372
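Screening assembled unigenes for simple sequence repeats can be sketched with a regular-expression scan. This is not the tool or the settings used in the study, which the abstract does not name; the minimum repeat counts (6 for dinucleotide, 5 for trinucleotide and 4 for tetranucleotide motifs) and the function names are assumptions chosen for illustration.

```python
import re


def _is_primitive(motif: str) -> bool:
    """True if motif is not itself a repetition of a shorter unit (e.g. ATAT, AA)."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k)
                   for k in range(1, n))


def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for each perfect SSR, sorted by start."""
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}   # unit length -> minimum repeats (assumed)
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One unit followed by at least (min_rep - 1) exact copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if _is_primitive(motif):
                hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Toy sequence with one dinucleotide and one trinucleotide repeat.
toy = "GGC" + "AT" * 6 + "CCG" + "GAA" * 5 + "TT"
print(find_ssrs(toy))
```

Primers would then be designed in the flanking sequence on either side of each reported repeat, which is the step the amplification results in the abstract refer to.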
Echinococcus granulosus Sensu Stricto in Dogs and Jackals from Caspian Sea Region, Northern Iran
GHOLAMI, Shirzad; JAHANDAR, Hefzallah; ABASTABAR, Mahdi; PAGHEH, Abdolsatar; MOBEDI, Iraj; SHARBATKHORI, Mitra
2016-01-01
Background: The aim of the present study was to genotype Echinococcus granulosus isolates from dogs and jackals in Mazandaran Province, northern Iran, using a partial sequence of the mitochondrial cytochrome c oxidase subunit 1 gene (cox1). Methods: E. granulosus isolates (n = 15) were collected from 42 stray dogs and 16 jackals found south of the Caspian Sea in northern Iran. After morphological study, the isolates were genetically characterized using consensus sequences (366 bp) of the cox1 gene. Phylogenetic analysis of cox1 nucleotide sequence data was performed using a Bayesian Inference approach. Results: Four different sequences were observed among the isolates. Two genotypes [G1 (66.7%) and G3 (33.3%)] were identified among the isolates. The G1 sequences indicated three sequence profiles. One profile (Maz1) had 100% homology with the reference sequence (AN: KP339045). Two other profiles, designated Maz2 and Maz3, had 99% homology with the G1 genotype (ANs: KP339046 and KP339047). A G3 sequence designated Maz4 showed 100% homology with a G3 reference sequence (AN: KP339048). Conclusion: The occurrence of the G1 genotype of E. granulosus sensu stricto as a frequent genotype in dogs is emphasized. This study established the first molecular characterization of E. granulosus in the province. PMID:28096852
2011-01-01
Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing of cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch.
Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039
Vakil, Eli; Bloch, Ayala; Cohen, Haggar
2017-03-01
The serial reaction time (SRT) task has generated a very large amount of research. Nevertheless the debate continues as to the exact cognitive processes underlying implicit sequence learning. Thus, the first goal of this study is to elucidate the underlying cognitive processes enabling sequence acquisition. We therefore compared reaction time (RT) in sequence learning in a standard manual activated (MA) to that in an ocular activated (OA) version of the task, within a single experimental setting. The second goal is to use eye movement measures to compare anticipation, as an additional indication of sequence learning, between the two versions of the SRT. Performance of the group given the MA version of the task (n = 29) was compared with that of the group given the OA version (n = 30). The results showed that although overall, RT was faster for the OA group, the rate of sequence learning was similar to that of the MA group performing the standard version of the SRT. Because the stimulus-response association is automatic and exists prior to training in the OA task, the decreased reaction time in this version of the task reflects a purer measure of the sequence learning that occurs in the SRT task. The results of this study show that eye tracking anticipation can be measured directly and can serve as a direct measure of sequence learning. Finally, using the OA version of the SRT to study sequence learning presents a significant methodological contribution by making sequence learning studies possible among populations that struggle to perform manual responses.
The number of reduced alignments between two DNA sequences
2014-01-01
Background In this study we consider DNA sequences as mathematical strings. Total and reduced alignments between two DNA sequences have been considered in the literature to measure their similarity. Results for explicit representations of some alignments have been already obtained. Results We present exact, explicit and computable formulas for the number of different possible alignments between two DNA sequences and a new formula for a class of reduced alignments. Conclusions A unified approach for a wide class of alignments between two DNA sequences has been provided. The formula is computable and, if complemented by software development, will provide a deeper insight into the theory of sequence alignment and give rise to new comparison methods. AMS Subject Classification Primary 92B05, 33C20, secondary 39A14, 65Q30 PMID:24684679
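As a rough illustration of what counting alignments involves, the sketch below uses one common textbook convention, which is an assumption here and may differ from the total and reduced alignment classes defined in the paper: each alignment column is either a pair of residues, a gap in the first sequence, or a gap in the second. Under that convention the count depends only on the two lengths m and n and follows the recurrence f(m, n) = f(m-1, n) + f(m, n-1) + f(m-1, n-1) with f(m, 0) = f(0, n) = 1 (the Delannoy numbers). The function name `count_alignments` is hypothetical.

```python
def count_alignments(m: int, n: int) -> int:
    """Count gapped alignments of two sequences of lengths m and n.

    Assumed convention (not taken from the paper): each column is a residue
    pair, a gap in sequence 1, or a gap in sequence 2.
    """
    if m < 0 or n < 0:
        raise ValueError("lengths must be non-negative")
    # row[j] holds f(i, j) for the current i; f(0, j) = 1 for every j.
    row = [1] * (n + 1)
    for _ in range(m):
        new_row = [1]  # f(i, 0) = 1
        for j in range(1, n + 1):
            # gap in sequence 2 + gap in sequence 1 + residue pair
            new_row.append(row[j] + new_row[j - 1] + row[j - 1])
        row = new_row
    return row[n]


if __name__ == "__main__":
    for length in (1, 2, 3, 10):
        print(length, count_alignments(length, length))
```

Even under this simple convention the count grows very quickly with sequence length, which is one reason closed-form and restricted ("reduced") counts are of interest.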
Nowrousian, Minou; Würtz, Christian; Pöggeler, Stefanie; Kück, Ulrich
2004-03-01
One of the most challenging parts of large scale sequencing projects is the identification of functional elements encoded in a genome. Recently, studies of genomes of up to six different Saccharomyces species have demonstrated that a comparative analysis of genome sequences from closely related species is a powerful approach to identify open reading frames and other functional regions within genomes [Science 301 (2003) 71, Nature 423 (2003) 241]. Here, we present a comparison of selected sequences from Sordaria macrospora to their corresponding Neurospora crassa orthologous regions. Our analysis indicates that due to the high degree of sequence similarity and conservation of overall genomic organization, S. macrospora sequence information can be used to simplify the annotation of the N. crassa genome.
Charbit-Henrion, Fabienne; Parlato, Marianna; Hanein, Sylvain; Duclaux-Loras, Rémi; Nowak, Jan; Begue, Bernadette; Rakotobe, Sabine; Bruneau, Julie; Fourrage, Cécile; Alibeu, Olivier; Rieux-Laucat, Frédéric; Lévy, Eva; Stolzenberg, Marie-Claude; Mazerolles, Fabienne; Latour, Sylvain; Lenoir, Christelle; Fischer, Alain; Picard, Capucine; Aloi, Marina; Amil Dias, Jorge; Ben Hariz, Mongi; Bourrier, Anne; Breuer, Christian; Breton, Anne; Bronski, Jiri; Buderus, Stephan; Cananzi, Mara; Coopman, Stéphanie; Crémilleux, Clara; Dabadie, Alain; Dumant-Forest, Clémentine; Egritas Gurkan, Odul; Fabre, Alexandre; Fischer, Aude; German Diaz, Marta; Gonzalez-Lama, Yago; Goulet, Olivier; Guariso, Graziella; Gurcan, Neslihan; Homan, Matjaz; Hugot, Jean-Pierre; Jeziorski, Eric; Karanika, Evi; Lachaux, Alain; Lewindon, Peter; Lima, Rosa; Magro, Fernando; Major, Janos; Malamut, Georgia; Mas, Emmanuel; Mattyus, Istvan; Mearin, Luisa M; Melek, Jan; Navas-Lopez, Victor Manuel; Paerregaard, Anders; Pelatan, Cecile; Pigneur, Bénédicte; Pinto Pais, Isabel; Rebeuh, Julie; Romano, Claudio; Siala, Nadia; Strisciuglio, Caterina; Tempia-Caliera, Michela; Tounian, Patrick; Turner, Dan; Urbonas, Vaidotas; Willot, Stéphanie; Ruemmele, Frank M; Cerf-Bensussan, Nadine
2018-05-18
An expanding number of monogenic defects have been identified as causative of severe forms of very early-onset inflammatory bowel diseases (VEO-IBD). The present study aimed at defining how next-generation sequencing (NGS) methods can be used to improve identification of known molecular diagnosis and adapt treatment. 207 children were recruited in 45 Paediatric centres through an international collaborative network (ESPGHAN GENIUS working group) with a clinical presentation of severe VEO-IBD (n=185) or an anamnesis suggestive of a monogenic disorder (n=22). Patients were divided at inclusion into three phenotypic subsets: predominantly small bowel inflammation, colitis with perianal lesions, and colitis only. Methods to obtain molecular diagnosis included functional tests followed by specific Sanger sequencing, custom-made targeted NGS, and in selected cases whole exome sequencing (WES) of parents-child trios. Genetic findings were validated clinically and/or functionally. Molecular diagnosis was achieved in 66/207 children (32%): 61% with small bowel inflammation, 39% with colitis and perianal lesions and 18% with colitis only. Targeted NGS pinpointed gene mutations causative of atypical presentations and identified large exonic copy number variations previously missed by WES. Our results lead us to propose an optimised diagnostic strategy to identify known monogenic causes of severe IBD.
Viral metagenomic analysis of feces of wild small carnivores
2014-01-01
Background Recent studies have clearly demonstrated the enormous virus diversity that exists among wild animals. This exemplifies the required expansion of our knowledge of the virus diversity present in wildlife, as well as the potential transmission of these viruses to domestic animals or humans. Methods In the present study we evaluated the viral diversity of fecal samples (n = 42) collected from 10 different species of wild small carnivores inhabiting the northern part of Spain using random PCR in combination with next-generation sequencing. Samples were collected from American mink (Neovison vison), European mink (Mustela lutreola), European polecat (Mustela putorius), European pine marten (Martes martes), stone marten (Martes foina), Eurasian otter (Lutra lutra) and Eurasian badger (Meles meles) of the family of Mustelidae; common genet (Genetta genetta) of the family of Viverridae; red fox (Vulpes vulpes) of the family of Canidae and European wild cat (Felis silvestris) of the family of Felidae. Results A number of sequences of possible novel viruses or virus variants were detected, including a theilovirus, phleboviruses, an amdovirus, a kobuvirus and picobirnaviruses. Conclusions Using random PCR in combination with next generation sequencing, sequences of various novel viruses or virus variants were detected in fecal samples collected from Spanish carnivores. Detected novel viruses highlight the viral diversity that is present in fecal material of wild carnivores. PMID:24886057
Klingeman, Dawn M.; Utturkar, Sagar; Lu, Tse -Yuan S.; ...
2015-11-12
Draft genome sequences for four Actinobacteria from the genus Streptomyces are presented. Streptomyces is a metabolically diverse genus that is abundant in soils and has been reported in association with plants. The strains described in this study were isolated from the Populus trichocarpa endosphere and rhizosphere.
Draft genome sequence of the oleaginous yeast Cryptococcus curvatus ATCC 20509
DOE Office of Scientific and Technical Information (OSTI.GOV)
Close, Dan; Ojumu, John O.
Cryptococcus curvatus ATCC 20509 is a commonly used nonmodel oleaginous yeast capable of converting a variety of carbon sources into fatty acids. In addition, we present the draft genome sequence of this popular organism to provide a means for more in-depth studies of its fatty acid production potential.
ERIC Educational Resources Information Center
Arafeh, Sousan
2016-01-01
Best practice in curriculum development and implementation requires that discipline-based standards or requirements embody both curricular and programme scopes and sequences. Ensuring these are present and aligned in course/programme content, activities and assessments to support student success requires formalised and systematised review and…
Targeted enrichment strategies for next-generation plant biology
Richard Cronn; Brian J. Knaus; Aaron Liston; Peter J. Maughan; Matthew Parks; John V. Syring; Joshua Udall
2012-01-01
The dramatic advances offered by modern DNA sequencers continue to redefine the limits of what can be accomplished in comparative plant biology. Even with recent achievements, however, plant genomes present obstacles that can make it difficult to execute large-scale population and phylogenetic studies on next-generation sequencing platforms. Factors like large genome...
ERIC Educational Resources Information Center
Furey, William M.; Marcotte, Amanda M.; Hintze, John M.; Shackett, Caroline M.
2016-01-01
The study presents a critical analysis of written expression curriculum-based measurement (WE-CBM) metrics derived from 3- and 10-min test lengths. Criterion validity and classification accuracy were examined for Total Words Written (TWW), Correct Writing Sequences (CWS), Percent Correct Writing Sequences (%CWS), and Correct Minus Incorrect…
Judgments Relative to Patterns: How Temporal Sequence Patterns Affect Judgments and Memory
ERIC Educational Resources Information Center
Kusev, Petko; Ayton, Peter; van Schaik, Paul; Tsaneva-Atanasova, Krasimira; Stewart, Neil; Chater, Nick
2011-01-01
Six experiments studied relative frequency judgment and recall of sequentially presented items drawn from 2 distinct categories (i.e., city and animal). The experiments show that judged frequencies of categories of sequentially encountered stimuli are affected by certain properties of the sequence configuration. We found (a) a "first-run…
Draft genome sequence of the oleaginous yeast Cryptococcus curvatus ATCC 20509
Close, Dan; Ojumu, John O.
2016-11-03
Cryptococcus curvatus ATCC 20509 is a commonly used nonmodel oleaginous yeast capable of converting a variety of carbon sources into fatty acids. In addition, we present the draft genome sequence of this popular organism to provide a means for more in-depth studies of its fatty acid production potential.
USDA-ARS's Scientific Manuscript database
Resistance gene analogs (RGAs) were searched bioinformatically in the sugar beet (Beta vulgaris L.) genome as potential candidates for improving resistance against different diseases. In the present study, Ion Torrent sequencing technology was used to identify mutations in 21 RGAs. The DNA samples o...
Speech Sequence Skill Learning in Adults Who Stutter
ERIC Educational Resources Information Center
Bauerly, Kim R.; De Nil, Luc F.
2011-01-01
The present study compared the ability of 12 people who stutter (PWS) and 12 people who do not stutter (PNS) to consolidate a novel sequential speech task. Participants practiced 100 repetitions of a single, monosyllabic, nonsense word sequence during an initial practice session and returned 24-h later to perform an additional 50 repetitions.…
Dissecting the relationship between protein structure and sequence variation
NASA Astrophysics Data System (ADS)
Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team
2015-03-01
Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0.1 to 0.8. Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~10-30% of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous backbone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
Gupta, S K; Gopalakrishna, T
2010-07-01
Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.
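The kind of SSR scan described above can be sketched with a regular-expression search. This is only an illustrative sketch, not the authors' pipeline: the minimum repeat counts (6 for dinucleotides, 5 for tri- and tetranucleotides, 4 for pentanucleotides) are assumed from the lower bounds of the copy-number ranges reported in the abstract, and the names `MIN_REPEATS` and `find_ssrs` are hypothetical.

```python
import re

# Assumed thresholds: motif length -> minimum number of tandem copies.
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 4}


def find_ssrs(seq: str):
    """Return (start, motif, copies) for perfect tandem repeats in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # One captured motif followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "AAA" or "ATAT"), so each repeat is reported once
            # under its shortest motif or not at all for homopolymers.
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return hits


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "T"
    for start, motif, copies in find_ssrs(example):
        print(start, motif, copies)
```

Dedicated SSR tools also handle compound and imperfect repeats; the sketch covers perfect repeats only.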
Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.
2016-01-01
Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulated annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction using a Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
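To make the 3-periodicity SNR concrete, the sketch below computes it for the baseline 4D binary (indicator) representation mentioned above, not for the GCC mapping, whose optimized values are not given in the abstract. The SNR definition used here, spectral power at frequency N/3 divided by the mean power over all non-zero frequencies, is an assumption, and the function name `periodicity3_snr` is hypothetical.

```python
import cmath


def periodicity3_snr(seq: str) -> float:
    """3-periodicity SNR of a DNA sequence under the 4D binary mapping.

    Assumed definition: power at k = N/3 divided by the mean power over
    k = 1 .. N-1, with power summed over the A, C, G and T indicators.
    """
    seq = seq.upper()
    n = len(seq)
    if n < 3 or n % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")

    def power(k: int) -> float:
        total = 0.0
        for base in "ACGT":
            # DFT coefficient of the 0/1 indicator sequence for this base.
            coeff = sum(
                cmath.exp(-2j * cmath.pi * k * pos / n)
                for pos, ch in enumerate(seq)
                if ch == base
            )
            total += abs(coeff) ** 2
        return total

    spectrum = [power(k) for k in range(1, n)]  # skip the DC term k = 0
    mean_power = sum(spectrum) / len(spectrum)
    if mean_power < 1e-12:  # flat spectrum, e.g. a homopolymer
        return 0.0
    return spectrum[n // 3 - 1] / mean_power


if __name__ == "__main__":
    print(periodicity3_snr("ATG" * 30))
```

This direct DFT costs O(N^2) per sequence, which is fine for short windows; an FFT over sliding windows would be the usual choice for an STFT-style scan along a genome.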
Matrix metalloproteinases outside vertebrates.
Marino-Puertas, Laura; Goulas, Theodoros; Gomis-Rüth, F Xavier
2017-11-01
The matrix metalloproteinase (MMP) family belongs to the metzincin clan of zinc-dependent metallopeptidases. Due to their enormous implications in physiology and disease, MMPs have mainly been studied in vertebrates. They are engaged in extracellular protein processing and degradation, and present extensive paralogy, with 23 forms in humans. One characteristic of MMPs is a ~165-residue catalytic domain (CD), which has been structurally studied for 14 MMPs from human, mouse, rat, pig and the oral-microbiome bacterium Tannerella forsythia. These studies revealed close overall coincidence and characteristic structural features, which distinguish MMPs from other metzincins and give rise to a sequence pattern for their identification. Here, we reviewed the literature available on MMPs outside vertebrates and performed database searches for potential MMP CDs in invertebrates, plants, fungi, viruses, protists, archaea and bacteria. These and previous results revealed that MMPs are widely present in several copies in Eumetazoa and higher plants (Tracheophyta), but have just token presence in eukaryotic algae. A few dozen sequences were found in Ascomycota (within fungi) and in double-stranded DNA viruses infecting invertebrates (within viruses). In contrast, a few hundred sequences were found in archaea and >1000 in bacteria, with several copies for some species. Most of the archaeal and bacterial phyla containing potential MMPs are present in human oral and gut microbiomes. Overall, MMP-like sequences are present across all kingdoms of life, but their asymmetric distribution contradicts the vertical descent model from a eubacterial or archaeal ancestor. This article is part of a Special Issue entitled: Matrix Metalloproteinases edited by Rafael Fridman. Copyright © 2017 Elsevier B.V. All rights reserved.
Levels of Integration in Cognitive Control and Sequence Processing in the Prefrontal Cortex
Bahlmann, Jörg; Korb, Franziska M.; Gratton, Caterina; Friederici, Angela D.
2012-01-01
Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex. PMID:22952762
Hardwicke, Joseph T; Richards, Helen; Cafferky, Louise; Underwood, Imogen; ter Horst, Britt; Slator, Rona
2016-03-01
Pierre Robin sequence results from a cascade of events that occur during embryologic development and frequently presents with cleft palate. Some studies have shown speech outcomes to be worse in patients with Pierre Robin sequence after cleft palate repair. A cohort of Pierre Robin sequence patients who all required an airway intervention and nasogastric feeding in the neonatal period were identified and speech outcomes assessed at 5 years of age. A cleft- and sex-matched non-Pierre Robin sequence, cleft palate-only comparison group was also identified from the same institution and study period. A total of 24 patients with Pierre Robin sequence who required airway and nutritional support in the neonatal period were matched for age, sex, and cleft type to a group of 24 non-Pierre Robin sequence cleft patients. There was no significant difference in the incidence of oronasal fistula between the groups. Secondary surgery for velopharyngeal incompetence was significantly more frequent (p = 0.017) in the Pierre Robin sequence group, who also had significantly greater nasality (p = 0.031) and cleft speech characteristic (p = 0.023) scores. The authors hypothesize that other factors may exist in Pierre Robin sequence that may lead to poor speech outcomes. The authors would suggest counseling parents of children with Pierre Robin sequence who have required a neonatal airway intervention that speech development may be poorer than in other children with cleft palate, and that these children will have a significantly higher incidence of secondary speech surgery. Risk, II.
Daikoku, Tatsuya; Yatomi, Yutaka; Yumoto, Masato
2017-01-27
Previous neural studies have supported the hypothesis that statistical learning mechanisms are used broadly across different domains such as language and music. However, these studies have only investigated a single aspect of statistical learning at a time, such as recognizing word boundaries or learning word order patterns. In this study, we neurally investigated how the two levels of statistical learning for recognizing word boundaries and word ordering could be reflected in neuromagnetic responses and how acquired statistical knowledge is reorganised when the syntactic rules are revised. Neuromagnetic responses to the Japanese-vowel sequence (a, e, i, o, and u), presented every 0.45 s, were recorded from 14 right-handed Japanese participants. The vowel order was constrained by a Markov stochastic model such that five nonsense words (aue, eao, iea, oiu, and uoi) were chained with an either-or rule: the probability of the forthcoming word was statistically defined (80% for one word; 20% for the other word) by the most recent two words. All of the word transition probabilities (80% and 20%) were switched in the middle of the sequence. In the first and second quarters of the sequence, the neuromagnetic responses to the words that appeared with higher transitional probability were significantly reduced compared with those that appeared with a lower transitional probability. After switching the word transition probabilities, the response reduction was replicated in the last quarter of the sequence. The responses to the final vowels in the words were significantly reduced compared with those to the initial vowels in the last quarter of the sequence. The results suggest that both within-word and between-word statistical learning are reflected in neural responses. The present study supports the hypothesis that listeners learn larger structures such as phrases first, and they subsequently extract smaller structures, such as words, from the learned phrases.
The present study provides the first neurophysiological evidence that the correction of statistical knowledge requires more time than the acquisition of new statistical knowledge. Copyright © 2016 Elsevier Ltd. All rights reserved.
Baeßler, Bettina; Schaarschmidt, Frank; Stehning, Christian; Schnackenburg, Bernhard; Maintz, David; Bunck, Alexander C
2015-11-01
Previous studies showed that myocardial T2 relaxation times measured by cardiac T2-mapping vary significantly depending on sequence and field strength. Therefore, a systematic comparison of different T2-mapping sequences and the establishment of dedicated T2 reference values is mandatory for diagnostic decision-making. Phantom experiments using gel probes with a range of different T1 and T2 times were performed on a clinical 1.5T and 3T scanner. In addition, 30 healthy volunteers were examined at 1.5 and 3T in immediate succession. In each examination, three different T2-mapping sequences were performed at three short-axis slices: Multi Echo Spin Echo (MESE), T2-prepared balanced SSFP (T2prep), and Gradient Spin Echo with and without fat saturation (GraSEFS/GraSE). Segmented T2-Maps were generated according to the AHA 16-segment model and statistical analysis was performed. Significant intra-individual differences between mean T2 times were observed for all sequences. In general, T2prep resulted in lowest and GraSE in highest T2 times. A significant variation with field strength was observed for mean T2 in phantom as well as in vivo, with higher T2 values at 1.5T compared to 3T, regardless of the sequence used. Segmental T2 values for each sequence at 1.5 and 3T are presented. Despite a careful selection of sequence parameters and volunteers, significant variations of the measured T2 values were observed between field strengths, MR sequences and myocardial segments. Therefore, we present segmental T2 values for each sequence at 1.5 and 3T with the inherent potential to serve as reference values for future studies. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its gene
1989-12-01
B-chain. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate, represents a significant...We have completed the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two...of the three peptides were blocked at the amino-terminus by pyroglutamate. Earlier structural studies on crotoxin and related crotalid dimeric
Nielsen, Tue Kjærgaard; Sørensen, Sebastian R; Hansen, Lars Hestbjerg
2015-05-28
Sphingomonas sp. SRS2 was the first described pure strain that is capable of mineralizing the phenylurea herbicide isoproturon and some of its related compounds. This strain has been studied thoroughly and shows potential for bioremediation purposes. We present the draft genome sequence of this bacterium, which will aid future studies. Copyright © 2015 Nielsen et al.
Does tonality boost short-term memory in congenital amusia?
Albouy, Philippe; Schulze, Katrin; Caclin, Anne; Tillmann, Barbara
2013-11-06
Congenital amusia is a neuro-developmental disorder of music perception and production. Recent findings have demonstrated that this deficit is linked to an impaired short-term memory for tone sequences. As it has been shown before that non-musicians' implicit knowledge of musical regularities can improve short-term memory for tone information, the present study investigated whether this type of implicit knowledge could also influence amusics' short-term memory performance. Congenital amusics and their matched controls, who were non-musicians, had to indicate whether sequences of five tones, presented in pairs, were the same or different; half of the pairs respected musical regularities (tonal sequences) and the other half did not (atonal sequences). As previously reported for non-musician participants, the control participants showed better performance (as measured with d') for tonal sequences than for atonal ones. While this improvement was not observed in amusics, both control and amusic participants showed faster response times for tonal sequences than for atonal sequences. These findings suggest that some implicit processing of tonal structures is potentially preserved in congenital amusia. This observation is encouraging as it strengthens the prospect of exploiting implicit knowledge to help reduce pitch perception and memory deficits in amusia. © 2013 Elsevier B.V. All rights reserved.
Musical Scales in Tone Sequences Improve Temporal Accuracy.
Li, Min S; Di Luca, Massimiliano
2018-01-01
Predicting the time of stimulus onset is a key component in perception. Previous investigations of perceived timing have focused on the effect of stimulus properties such as rhythm and temporal irregularity, but the influence of non-temporal properties and their role in predicting stimulus timing has not been exhaustively considered. The present study aims to understand how a non-temporal pattern in a sequence of regularly timed stimuli could improve or bias the detection of temporal deviations. We presented interspersed sequences of 3, 4, 5, and 6 auditory tones where only the timing of the last stimulus could slightly deviate from isochrony. Participants reported whether the last tone was 'earlier' or 'later' relative to the expected regular timing. In two conditions, the tones composing the sequence were either organized into musical scales or they were random tones. In one experiment, all sequences ended with the same tone; in the other experiment, each sequence ended with a different tone. Results indicate higher discriminability of anisochrony with musical scales and with longer sequences, irrespective of the knowledge of the final tone. Such an outcome suggests that the predictability of non-temporal properties, as enabled by the musical scale pattern, can be a factor in determining the sensitivity of time judgments.
The utility of Next Generation Sequencing for molecular diagnostics in Rett syndrome.
Vidal, Silvia; Brandi, Núria; Pacheco, Paola; Gerotina, Edgar; Blasco, Laura; Trotta, Jean-Rémi; Derdak, Sophia; Del Mar O'Callaghan, Maria; Garcia-Cazorla, Àngels; Pineda, Mercè; Armstrong, Judith
2017-09-25
Rett syndrome (RTT) is an early-onset neurodevelopmental disorder that almost exclusively affects girls and is totally disabling. Three genes have been identified that cause RTT: MECP2, CDKL5 and FOXG1. However, the etiology in some RTT patients remains unknown. Recently, next generation sequencing (NGS) has promoted genetic diagnoses because of the speed and affordability of the method. To evaluate the usefulness of NGS in genetic diagnosis, we present the genetic study of RTT-like patients using different techniques based on this technology. We studied 1577 patients with RTT-like clinical diagnoses and reviewed patients who were previously studied and thought to have RTT genes by Sanger sequencing. Genetically, 477 of the 1577 patients with an RTT-like suspicion have been diagnosed. Positive results were found in 30% by Sanger sequencing, 23% with a custom panel, 24% with a commercial panel and 32% with whole exome sequencing. A genetic study using NGS allows the study of a larger number of genes associated with RTT-like symptoms simultaneously, providing genetic study of a wider group of patients as well as significantly reducing the response time and cost of the study.
An Exploration of Rhythmic Grouping of Speech Sequences by French- and German-Learning Infants
Abboub, Nawal; Boll-Avetisyan, Natalie; Bhatara, Anjali; Höhle, Barbara; Nazzi, Thierry
2016-01-01
Rhythm in music and speech can be characterized by a constellation of several acoustic cues. Individually, these cues have different effects on rhythmic perception: sequences of sounds alternating in duration are perceived as short-long pairs (weak-strong/iambic pattern), whereas sequences of sounds alternating in intensity or pitch are perceived as loud-soft, or high-low pairs (strong-weak/trochaic pattern). This perceptual bias, called the Iambic-Trochaic Law (ITL), has been claimed to be a universal property of the auditory system applying in both the music and the language domains. Recent studies have shown that language experience can modulate the effects of the ITL on rhythmic perception of both speech and non-speech sequences in adults, and of non-speech sequences in 7.5-month-old infants. The goal of the present study was to explore whether language experience also modulates infants’ grouping of speech. To do so, we presented sequences of syllables to monolingual French- and German-learning 7.5-month-olds. Using the Headturn Preference Procedure (HPP), we examined whether they were able to perceive a rhythmic structure in sequences of syllables that alternated in duration, pitch, or intensity. Our findings show that both French- and German-learning infants perceived a rhythmic structure when it was cued by duration or pitch but not intensity. Our findings also show differences in how these infants use duration and pitch cues to group syllable sequences, suggesting that pitch cues were the easier ones to use. Moreover, performance did not differ across languages, failing to reveal early language effects on rhythmic perception. These results contribute to our understanding of the origin of rhythmic perception and perceptual mechanisms shared across music and speech, which may bootstrap language acquisition. PMID:27378887
Dong, Jiajia; Vicente, Natallia; Chintauan-Marquier, Ioana C; Ramadi, Cahyo; Dettai, Agnès; Robillard, Tony
2017-05-15
In the present study, we report the high-coverage complete mitochondrial genome (mitogenome) of the cricket Cardiodactylus muiri Otte, 2007. The mitogenome was sequenced using a long-PCR approach on an Ion Torrent Personal Genome Machine (PGM) for next generation sequencing technology. The total length of the amplified mitogenome is 16,328 bp, representing 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes and one noncoding region (D-loop region). The new sets of long-PCR primers reported here are invaluable resources for future comparative evolutionary genomic studies in Orthopteran insects. The new mitogenome sequence is compared with published cricket mitogenomes. In the taxonomic part, we present new records for the species and describe life-history traits, habitat and male calling song of the species; based on observation of new material, the species Cardiodactylus buru Gorochov & Robillard, 2014 is synonymized under C. muiri.
Quick, Josh; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2018-01-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples without isolation remains challenging for viruses such as Zika, where metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence complete genomes comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimised library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved starting with clinical samples in 1-2 days following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. PMID:28538739
Bidard, J N; de Nadai, F; Rovere, C; Moinier, D; Laur, J; Martinez, J; Cuber, J C; Kitabgi, P
1993-01-01
Neurotensin (NT) and neuromedin N (NN) are two related biologically active peptides that are encoded in the same precursor molecule. In the rat, the precursor consists of a 169-residue polypeptide starting with an N-terminal signal peptide and containing in its C-terminal region one copy each of NT and NN. NN precedes NT and is separated from it by a Lys-Arg sequence. Two other Lys-Arg sequences flank the N-terminus of NN and the C-terminus of NT. A fourth Lys-Arg sequence occurs near the middle of the precursor and is followed by an NN-like sequence. Finally, an Arg-Arg pair is present within the NT moiety. The four Lys-Arg doublets represent putative processing sites in the precursor molecule. The present study was designed to investigate the post-translational processing of the NT/NN precursor in the rat medullary thyroid carcinoma (rMTC) 6-23 cell line, which synthesizes large amounts of NT upon dexamethasone treatment. Five region-specific antisera recognizing the free N- or C-termini of sequences adjacent to the basic doublets were produced, characterized and used for immunoblotting and radioimmunoassay studies in combination with gel filtration, reverse-phase h.p.l.c. and trypsin digestion of rMTC 6-23 cell extracts. Because two of the antigenic sequences, i.e. NN and the NN-like sequence, start with a lysine residue that is essential for recognition by their respective antisera, a micromethod by which trypsin specifically cleaves at arginine residues was developed. The results show that dexamethasone-treated rMTC 6-23 cells produced comparable amounts of NT, NN and a peptide corresponding to a large N-terminal precursor fragment lacking the NN and NT moieties. This large fragment was purified. N-Terminal sequencing revealed that it started at residue Ser23 of the prepro-NT/NN sequence, and thus established the Cys22-Ser23 bond as the cleavage site of the signal peptide. 
Two other large N-terminal fragments bearing respectively the NN and NT sequences at their C-termini were present in lower amounts. The NN-like sequence was internal to all the large fragments. There was no evidence for the presence of peptides with the NN-like sequence at their N-termini. This shows that, in rMTC 6-23 cells, the precursor is readily processed at the three Lys-Arg doublets that flank and separate the NT and NN sequences. In contrast, the Lys-Arg doublet that precedes the NN-like sequence is not processed in this system. (ABSTRACT TRUNCATED AT 400 WORDS) PMID:8471039
Asha, Srinivasan; Sreekumar, Sweda; Soniya, E V
2016-01-01
Analysis of high-throughput small RNA deep sequencing data, in combination with black pepper transcriptome sequences, revealed microRNA-mediated gene regulation in black pepper (Piper nigrum L.). Black pepper is an important spice crop and its berries are used worldwide as a natural food additive that contributes unique flavour to foods. In the present study, to characterize microRNAs from black pepper, we generated a small RNA library from black pepper leaf and sequenced it by Illumina high-throughput sequencing technology. MicroRNAs belonging to a total of 303 conserved miRNA families were identified from the sRNAome data. Subsequent analysis of the recently sequenced black pepper transcriptome confirmed precursor sequences of 50 conserved miRNAs and four potential novel miRNA candidates. Stem-loop qRT-PCR experiments demonstrated differential expression of eight conserved miRNAs in black pepper. Computational analysis of targets of the miRNAs showed 223 potential black pepper unigene targets that encode diverse transcription factors and enzymes involved in plant development, disease resistance, metabolic and signalling pathways. RLM-RACE experiments further mapped miRNA-mediated cleavage at five of the mRNA targets. In addition, miRNA isoforms corresponding to 18 miRNA families were also identified from black pepper. This study presents the first large-scale identification of microRNAs from black pepper and provides the foundation for future studies of miRNA-mediated gene regulation of stress responses and diverse metabolic processes in black pepper.
Cellulases and coding sequences
Li, Xin-Liang; Ljungdahl, Lars G.; Chen, Huizhong
2001-02-20
The present invention provides three fungal cellulases, their coding sequences, recombinant DNA molecules comprising the cellulase coding sequences, recombinant host cells and methods for producing same. The present cellulases are from Orpinomyces PC-2.
Mananga, Eugene S.; Reid, Alicia E.
2013-01-01
This paper presents the study of finite pulse widths for the BABA pulse sequence using the Floquet-Magnus expansion (FME) approach. In the FME scheme, the first order F1 is identical to its counterparts in average Hamiltonian theory (AHT) and Floquet theory (FT). However, the timing part in the FME approach is introduced via the Λ1(t) function not present in other schemes. This function provides an easy way for evaluating the spin evolution during “the time in between” through the Magnus expansion of the operator connected to the timing part of the evolution. The evaluation of Λ1(t) is useful especially for the analysis of the non-stroboscopic evolution. Here, the importance of the boundary conditions, which provides a natural choice of Λ1(0), is ignored. This work uses the Λ1(t) function to compare the efficiency of the BABA pulse sequence with δ-pulses and the BABA pulse sequence with finite pulses. Calculations of Λ1(t) and F1 are presented. PMID:25792763
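For orientation, the first-order FME terms named in the abstract are commonly written as below in the FME literature. The notation is assumed here rather than taken from the abstract: H(t) is a Hamiltonian periodic with period T, and the propagator is taken in the form U(t) = exp(-iΛ(t)) exp(-itF) exp(iΛ(0)).

```latex
\begin{align}
F_1 &= \frac{1}{T}\int_0^{T} H(\tau)\,\mathrm{d}\tau, \\
\Lambda_1(t) &= \int_0^{t} H(\tau)\,\mathrm{d}\tau \;-\; t\,F_1 \;+\; \Lambda_1(0).
\end{align}
```

Setting t = T gives Λ1(T) = Λ1(0), so at stroboscopic points only F1 contributes, which is consistent with F1 matching the first-order AHT term. Between those points Λ1(t) carries the non-stroboscopic evolution.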
Zhang, Bo; Wu, Wen-Qiang; Liu, Na-Nv; Duan, Xiao-Lei; Li, Ming; Dou, Shuo-Xing; Hou, Xi-Miao; Xi, Xu-Guang
2016-01-01
Alternative DNA structures that deviate from B-form double-stranded DNA such as G-quadruplex (G4) DNA can be formed by G-rich sequences that are widely distributed throughout the human genome. We have previously shown that Pif1p not only unfolds G4, but also unwinds the downstream duplex DNA in a G4-stimulated manner. In the present study, we further characterized the G4-stimulated duplex DNA unwinding phenomenon by means of single-molecule fluorescence resonance energy transfer. It was found that Pif1p did not unwind the partial duplex DNA immediately after unfolding the upstream G4 structure, but rather, it would dwell at the ss/dsDNA junction with a ‘waiting time’. Further studies revealed that the waiting time was in fact related to a protein dimerization process that was sensitive to ssDNA sequence and would become rapid if the sequence is G-rich. Furthermore, we identified that the G-rich sequence, as the G4 structure, equally stimulates duplex DNA unwinding. The present work sheds new light on the molecular mechanism by which G4-unwinding helicase Pif1p resolves physiological G4/duplex DNA structures in cells. PMID:27471032
Coxiella Detection in Ticks from Wildlife and Livestock in Malaysia
Khoo, Jing-Jing; Lim, Fang-Shiang; Chen, Fezshin; Phoon, Wai-Hong; Khor, Chee-Sieng; Pike, Brian L.; Chang, Li-Yen
2016-01-01
Recent studies have shown that ticks harbor Coxiella-like bacteria, which are potentially tick-specific endosymbionts. We recently described the detection of Coxiella-like bacteria and possibly Coxiella burnetii in ticks collected from rural areas in Malaysia. In the present study, we collected ticks, including Haemaphysalis bispinosa, Haemaphysalis hystricis, Dermacentor compactus, Dermacentor steini, and Amblyomma sp. from wildlife and domesticated goats from four different locations in Malaysia. Coxiella 16S rRNA genomic sequences were detected by PCR in 89% of ticks tested. Similarity analysis and phylogenetic analyses of the 16S rRNA and rpoB partial sequences were performed for 10 representative samples selected based on the tick species, sex, and location. The findings here suggested the presence of C. burnetii in two samples, one each from D. steini and H. hystricis. The sequences of both samples clustered with published C. burnetii sequences. The remaining eight tick samples were shown to harbor 16S rRNA sequences of Coxiella-like bacteria, which clustered phylogenetically according to the respective tick host species. The findings presented here add to the growing evidence of the association between Coxiella-like bacteria and ticks across species and geographical boundaries. The importance of C. burnetii found in ticks in Malaysia warrants further investigation. PMID:27763821
Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun
2015-01-01
Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population with high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.
Investigating the long-term course of schizophrenia by sequence analysis.
An der Heiden, Wolfram; Häfner, Heinz
2015-08-30
In the present study we set out to explore the long-term clinical course of schizophrenia in a holistic manner by adopting sequence analysis. Our aim was to identify course types of illness by means of cluster analysis. The study was based on course and outcome data for 107 patients followed up over 134 months after first admission in the ABC Schizophrenia Study. Focusing on the main syndromes (positive, negative, depressive and unspecific symptoms) and their combinations we looked for similarities in individual illness courses using the 'optimal matching' method. A cluster analysis performed on the resulting similarity matrix yielded two main groups (an 'improving' and a 'chronic' group), which comprised a total of six different types of illness course. The course types differed in both quantitative (frequency of syndromes and syndrome combinations) and qualitative terms (clinical presentation, sequence of syndromes). Cluster membership was only rarely, but clearly associated with sociodemographic characteristics, treatment data and other illness variables. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
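The 'optimal matching' step can be illustrated with a small sketch. It scores the dissimilarity of two illness courses as the cheapest series of insertions, deletions and substitutions that turns one state sequence into the other. The state codes, the cost values and the function name are assumptions made for illustration; the abstract does not report the costs or coding used in the study.

```python
def optimal_matching_distance(seq_a, seq_b, indel_cost=1.0, sub_cost=2.0):
    """Edit distance between two state sequences (dynamic programming).

    Costs are illustrative assumptions, not values from the study.
    """
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    dist = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        dist[i][0] = i * indel_cost          # delete everything in seq_a
    for j in range(1, cols):
        dist[0][j] = j * indel_cost          # insert everything in seq_b
    for i in range(1, rows):
        for j in range(1, cols):
            same = seq_a[i - 1] == seq_b[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + indel_cost,                    # deletion
                dist[i][j - 1] + indel_cost,                    # insertion
                dist[i - 1][j - 1] + (0.0 if same else sub_cost),  # match/substitution
            )
    return dist[-1][-1]


# Hypothetical monthly states: P = positive, N = negative,
# D = depressive, U = unspecific, "-" = symptom-free.
courses = ["PPN--", "PNN--", "PPPPP"]
distance_matrix = [[optimal_matching_distance(a, b) for b in courses]
                   for a in courses]
```

A matrix like `distance_matrix` is the kind of input a hierarchical cluster analysis would take to group patients into course types.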
Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers
Pabinger, Stephan; Ernst, Karina; Pulverer, Walter; Kallmeyer, Rainer; Valdes, Ana M.; Metrustry, Sarah; Katic, Denis; Nuzzo, Angelo; Kriegner, Albert; Vierlinger, Klemens; Weinhaeusel, Andreas
2016-01-01
Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage. 
TABSAT is freely available under a GNU General Public License version 3.0 (GPLv3) at https://github.com/tadkeys/tabsat/ and http://demo.platomics.com/. PMID:27467908
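The comparison between sequencing-derived methylation and array values can be sketched as follows: a per-CpG methylation level is the fraction of reads that remained unconverted (methylated), and agreement with the array is summarized by a Pearson correlation. This is not TABSAT code. The function names and all numbers are hypothetical and serve only to make the calculation concrete.

```python
import math


def methylation_level(methylated_reads, unmethylated_reads):
    """Fraction of reads supporting methylation at one CpG site."""
    total = methylated_reads + unmethylated_reads
    if total == 0:
        raise ValueError("CpG site has no coverage")
    return methylated_reads / total


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long value lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical (methylated, unmethylated) read counts at four CpG sites
# and hypothetical array beta values for the same sites.
read_counts = [(270, 12), (30, 250), (140, 142), (200, 80)]
sequencing_levels = [methylation_level(m, u) for m, u in read_counts]
array_betas = [0.93, 0.12, 0.48, 0.70]
r = pearson_r(sequencing_levels, array_betas)
```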
Stopping decisions: information order effects on nonfocal evaluations.
Yu, Michael; Gonzalez, Cleotilde
2013-08-01
We investigated how the order in which information is presented affects when a person decides to stop performing a task. A stopping decision is a decision to stop performing a task on the basis of a sequence of cues. Previous order-effects models do not account for how these contexts limit available working memory for making such decisions. Participants decided how long to perform a task known as the Work Hazard Game that began by rewarding points but later cost points if work continued after an unannounced "emergency." An additive sequence of cues indicated the probability of an emergency. Study 1 involved a three-group design with cue sequences that indicated the same risk at each decision point but whose final cue presented a high, medium, or low probability. Study 2 had a 2 × 2 design with high or low final cues and an easy or a challenging task. In Study 1, participants stopped sooner when the most recent cue presented a high rather than low probability (p = .09), despite the same emergency risk. In Study 2, participants stopped sooner when the most recent cue presented a high rather than low probability for the challenging task but not for the easy task (p = .08). Stopping decisions appear sensitive to the most recent cue observed while experiencing task load. Participants responded to the same risks differently only on the basis of a change in presentation. Findings may be relevant for research and training for hazardous jobs, such as subsurface coal mining, fishing, and trucking.
Sakai, Kazuko; Takeda, Masayuki; Okamoto, Isamu; Nakagawa, Kazuhiko; Nishio, Kazuto
2015-01-01
Hepatocyte growth factor (HGF) expression is a poor prognostic factor in various types of cancer. Expression levels of HGF have been reported to be regulated by shorter poly(dA) sequences in the promoter region. In the present study, the poly(dA) mononucleotide tract in various types of human cancer cell lines was examined and compared with the HGF expression levels in those cells. Short deoxyadenosine repeat sequences were detected in five of the 55 cell lines used in the present study. The H69, IM95, CCK-81, Sui73 and H28 cells exhibited a truncated poly(dA) sequence in which the number of poly(dA) repeats was reduced by ≥5 bp. Two of the cell lines exhibited high HGF expression, determined by reverse transcription quantitative polymerase chain reaction and enzyme-linked immunosorbent assay. The CCK-81, Sui73 and H28 cells with shorter poly(dA) sequences exhibited low HGF expression. Two possible causes of the suppressed HGF expression in the CCK-81, Sui73 and H28 cells were examined: methylation and single-nucleotide polymorphisms in the HGF gene. Exposure to 5-Aza-dC, an inhibitor of DNA methyltransferase 1, induced an increased expression of HGF in the CCK-81 cells, but not in the other cells. Single-nucleotide polymorphism (SNP) rs72525097 in intron 1 was detected in the Sui73 and H28 cells. Taken together, truncation of poly(dA) in the HGF promoter was found in various types of cancer, including lung, stomach, colorectal and pancreatic cancer and mesothelioma. The present study proposes methylation and an SNP in intron 1 of HGF as mechanisms that negatively regulate HGF expression in cancer cells with short poly(dA).
Short-term effects of processing musical syntax: an ERP study.
Koelsch, Stefan; Jentschke, Sebastian
2008-05-30
We investigated influences of short-term experience on music-syntactic processing, using a chord-sequence paradigm in which sequences ended on a harmony that was syntactically either regular or irregular. In contrast to previous studies (in which block durations were rather short), chord sequences were presented to participants for around 2 h while they were watching a silent movie with subtitles. Results showed that the music-syntactically irregular chord functions elicited an early right anterior negativity (ERAN), and that the ERAN amplitude significantly declined over the course of the experiment. The ERAN has previously been suggested to reflect the processing of music-syntactic irregularities, and the present data show that the cognitive representations of musical regularities are influenced by the repeated presentation of unexpected, irregular harmonies. Because harmonies were task-irrelevant, the data suggest that cognitive representations of musical regularities can change implicitly, i.e., even when listeners do not attend to the harmonies, and when they are presumably oblivious of the changes of such representations. Although the ERAN amplitude was significantly reduced, it was still present towards the end of the experiment at the right anterior electrodes, indicating that cognitive representations of basic music-syntactic regularities cannot easily be erased.
2017-01-01
We present a sensor that exploits the phenomenon of upconversion luminescence to detect the presence of specific sequences of small oligonucleotides such as miRNAs among others. The sensor is based on NaYF4:Yb,Er@SiO2 nanoparticles functionalized with ssDNA that contain azide groups on the 3′ ends. In the presence of a target sequence, interstrand ligation is possible via the click-reaction between one azide of the upconversion probe and a DBCO-ssDNA-biotin probe present in the solution. As a result of this specific and selective process, biotin is covalently attached to the surface of the upconversion nanoparticles. The presence of biotin on the surface of the nanoparticles allows their selective capture on a streptavidin-coated support, giving a luminescent signal proportional to the amount of target strands present in the test samples. With the aim of studying the analytical properties of the sensor, total RNA samples were extracted from healthy mosquitoes and were spiked with a specific target sequence at different concentrations. The result of these experiments revealed that the sensor was able to detect 10⁻¹⁷ moles per well (100 fM) of the target sequence in mixtures containing 100 ng of total RNA per well. A similar limit of detection was found for spiked human serum samples, demonstrating the suitability of the sensor for detecting specific sequences of small oligonucleotides under real conditions. In contrast, in the presence of noncomplementary sequences or sequences having mismatches, the luminescent signal was negligible or conspicuously reduced. PMID:28332400
Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar
2013-06-01
In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the Stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency are much greater in the Nostocales than in the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
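Nucleotide diversity, one of the quantities compared between Nostocales and Stigonematales, is conventionally the average number of differences per site over all pairs of aligned sequences. The sketch below shows that calculation on made-up sequences. The abstract does not say which software or estimator was used, so the function and data are illustrative assumptions.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Average pairwise differences per site for equal-length sequences."""
    if len(aligned_seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(aligned_seqs[0])
    if length == 0 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be aligned and non-empty")
    pairs = list(combinations(aligned_seqs, 2))
    # Count mismatching positions over every unordered pair.
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / (len(pairs) * length)


# Hypothetical short nifH fragments (not real data).
fragments = ["ATGC", "ATGA", "ATGC"]
pi = nucleotide_diversity(fragments)
```

Here two of the three pairs differ at one of four sites, so the value is 2 / (3 × 4). Gaps and ambiguous bases would need explicit handling in real alignments.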
Wittevrongel, Benjamin; Van Wolputte, Elia; Van Hulle, Marc M
2017-11-08
When encoding visual targets using various lagged versions of a pseudorandom binary sequence of luminance changes, the EEG signal recorded over the viewer's occipital pole exhibits so-called code-modulated visual evoked potentials (cVEPs), the phase lags of which can be tied to these targets. The cVEP paradigm has enjoyed interest in the brain-computer interfacing (BCI) community for the reported high information transfer rates (ITR, in bits/min). In this study, we introduce a novel decoding algorithm based on spatiotemporal beamforming, and show that this algorithm is able to accurately identify the gazed target. Especially for a small number of repetitions of the coding sequence, our beamforming approach significantly outperforms an optimised support vector machine (SVM)-based classifier, which is considered state-of-the-art in cVEP-based BCI. In addition to the traditional 60 Hz stimulus presentation rate for the coding sequence, we also explore the 120 Hz rate, and show that the latter enables faster communication, with a maximal median ITR of 172.87 bits/min. Finally, we also report on a transition effect in the EEG signal following the onset of the stimulus sequence, and recommend to exclude the first 150 ms of the trials from decoding when relying on a single presentation of the stimulus sequence.
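The information transfer rate quoted in the abstract above is a figure in bits/min. The abstract does not say how it was computed; a minimal sketch, assuming the widely used Wolpaw definition of ITR, is given below. The function name and the example values (number of targets, accuracy, trial duration) are hypothetical and are not taken from the study.

```python
import math


def itr_bits_per_min(n_targets: int, accuracy: float, trial_seconds: float) -> float:
    """Wolpaw-style information transfer rate in bits/min.

    Assumption: this is the common textbook ITR definition, not
    necessarily the exact computation used in the cited study.
    """
    if n_targets < 2:
        raise ValueError("need at least two selectable targets")
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must lie in [0, 1]")
    if trial_seconds <= 0:
        raise ValueError("trial duration must be positive")

    p = accuracy
    bits = math.log2(n_targets)
    if 0.0 < p < 1.0:
        # Penalty for errors, assumed uniformly spread over the wrong targets.
        bits += p * math.log2(p) + (1.0 - p) * math.log2((1.0 - p) / (n_targets - 1))
    elif p == 0.0:
        bits += math.log2(1.0 / (n_targets - 1))
    # p == 1.0 leaves the full log2(N) bits per selection.
    return bits * 60.0 / trial_seconds


if __name__ == "__main__":
    # Hypothetical numbers for illustration only.
    print(itr_bits_per_min(n_targets=32, accuracy=0.95, trial_seconds=1.5))
```

Because the trial duration sits in the denominator, any shortening of the time per selection raises the rate as long as accuracy holds, which is why the number of repetitions of the coding sequence matters for this kind of measure.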
Liu, Laura; Chen, Ho-Min; Tsai, Shawn; Chang, Tsong-Chi; Tsai, Tzu-Hsun; Yang, Chung-May; Chao, An-Ning; Chen, Kuan-Jen; Kao, Ling-Yuh; Yeung, Ling; Yeh, Lung-Kun; Hwang, Yih-Shiou; Wu, Wei-Chi; Lai, Chi-Chun
2015-01-01
Purpose To investigate the clinical characteristics of X-linked retinoschisis (XLRS) and identify genetic mutations in Taiwanese patients with XLRS. Methods This study included 23 affected males from 16 families with XLRS. Fundus photography, spectral domain optical coherence tomography (SD-OCT), fundus autofluorescence (FAF), and full-field electroretinograms (ERGs) were performed. The coding regions of the RS1 gene that encodes retinoschisin were sequenced. Results The median age at diagnosis was 18 years (range 4–58 years). The best-corrected visual acuity ranged from no light perception to 20/25. The typical spoke-wheel pattern in the macula was present in 61% of the patients (14/23) while peripheral retinoschisis was present in 43% of the patients (10/23). Four eyes presented with vitreous hemorrhage, and two eyes presented with leukocoria that mimics Coats’ disease. Macular schisis was identified with SD-OCT in 82% of the eyes (31/38) while foveal atrophy was present in 18% of the eyes (7/38). Concentric area of high intensity was the most common FAF abnormality observed. Seven out of 12 patients (58%) showed electronegative ERG findings. Sequencing of the RS1 gene identified nine mutations, six of which were novel. The mutations are all located in exons 4–6, including six missense mutations, two nonsense mutations, and one deletion-caused frameshift mutation. Conclusions XLRS is a clinically heterogeneous disease with profound phenotypic inter- and intrafamilial variability. Genetic sequencing is valuable as it allows a definite diagnosis of XLRS to be made without the classical clinical features and ERG findings. This study showed the variety of clinical features of XLRS and reported novel mutations. PMID:25999676
Amelogenin Evolution and Tetrapod Enamel Structure
Diekwisch, Thomas G.H.; Jin, Tianquan; Wang, Xinping; Ito, Yoshihiro; Schmidt, Marcella; Druzinsky, Robert; Yamane, Akira; Luan, Xianghong
2009-01-01
Amelogenins are the major proteins involved in tooth enamel formation. In the present study we have cloned and sequenced four novel amelogenins from three amphibian species in order to analyze similarities and differences between mammalian and non-mammalian amelogenins. The newly sequenced amphibian amelogenin sequences were from a Red-eyed tree frog (Litoria chloris) and a Mexican axolotl (Ambystoma mexicanum). We identified two amelogenin isoforms in the Eastern Red-backed Salamander (Plethodon cinereus). Sequence comparisons confirmed that non-mammalian amelogenins are overall shorter than their mammalian counterparts, contain less proline and less glutamine, and feature shorter polyproline tripeptide repeat stretches than mammalian amelogenins. We propose that unique sequence parameters of mammalian amelogenins might be a pre-requisite for complex mammalian enamel prism architecture. PMID:19828974
Illustrative case studies in the return of exome and genome sequencing results
Amendola, Laura M; Lautenbach, Denise; Scollon, Sarah; Bernhardt, Barbara; Biswas, Sawona; East, Kelly; Everett, Jessica; Gilmore, Marian J; Himes, Patricia; Raymond, Victoria M; Wynn, Julia; Hart, Ragan; Jarvik, Gail P
2015-01-01
Whole genome and exome sequencing tests are increasingly being ordered in clinical practice, creating a need for research exploring the return of results from these tests. A goal of the Clinical Sequencing and Exploratory Research (CSER) consortium is to gain experience with this process to develop best practice recommendations for offering exome and genome testing and returning results. Genetic counselors in the CSER consortium have an integral role in the return of results from these genomic sequencing tests and have gained valuable insight. We present seven emerging themes related to return of exome and genome sequencing results accompanied by case descriptions illustrating important lessons learned, counseling challenges specific to these tests and considerations for future research and practice. PMID:26478737
Nucleic acid arrays and methods of synthesis
Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles
2001-01-01
The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Hernández-Martínez, Miguel Ángel; Escalante, Ananías A.; Arévalo-Herrera, Myriam; Herrera, Sócrates
2011-01-01
Circumsporozoite (CS) protein is a malaria antigen involved in sporozoite invasion of hepatocytes, and thus considered to have good vaccine potential. We evaluated the polymorphism of the Plasmodium vivax CS gene in 24 parasite isolates collected from malaria-endemic areas of Colombia. We sequenced 27 alleles, most of which (25/27) corresponded to the VK247 genotype and the remainder to the VK210 type. All VK247 alleles presented a mutation (Gly → Asn) at position 28 in the N-terminal region, whereas the C-terminal presented three insertions: the ANKKAGDAG, which is common in all VK247 isolates; 12 alleles presented the insertion GAGGQAAGGNAANKKAGDAG; and 5 alleles presented the insertion GGNAGGNA. Both repeat regions were polymorphic in gene sequence and size. Sequences coding for B-, T-CD4+, and T-CD8+ cell epitopes were found to be conserved. This study confirms the high polymorphism of the repeat domain and the highly conserved nature of the flanking regions. PMID:21292878
Verstappen, Koen M; Huijbregts, Loes; Spaninks, Mirlin; Wagenaar, Jaap A; Fluit, Ad C; Duim, Birgitta
2017-01-01
Staphylococcus pseudintermedius is an opportunistic pathogen in dogs and cats and occasionally causes infections in humans. S. pseudintermedius is often resistant to multiple classes of antimicrobials. It requires reliable detection so that it is not misidentified as S. aureus. Phenotypic and currently used molecular-based diagnostic assays lack specificity or are labour-intensive, using multiplex PCR or nucleic acid sequencing. The aim of this study was to identify a specific target for real-time PCR by comparing whole genome sequences of S. pseudintermedius and non-pseudintermedius. Genome sequences were downloaded from public repositories and supplemented by isolates that were sequenced in this study. A Perl script was written that analysed 300-nt fragments from a reference genome sequence of S. pseudintermedius and checked if this sequence was present in other S. pseudintermedius genomes (n = 74) and non-pseudintermedius genomes (n = 138). Six sequences specific for S. pseudintermedius were identified (sequence length between 300-500 nt). One sequence, which was located in the spsJ gene, was used to develop primers and a probe. The real-time PCR showed 100% specificity when testing for S. pseudintermedius isolates (n = 54), and eight other staphylococcal species (n = 43). In conclusion, a novel approach by comparing whole genome sequences identified a sequence that is specific for S. pseudintermedius and provided a real-time PCR target for rapid and reliable detection of S. pseudintermedius.
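The fragment screen described in the abstract above (cut the reference genome into fixed-length fragments, keep those found in every target genome and in no non-target genome) can be sketched as follows. This is an illustration in Python, not the authors' Perl script. It assumes exact substring matching on both strands, whereas the original tool may well have tolerated mismatches, and every function and parameter name here is hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def fragments(seq: str, size: int, step: int) -> Iterator[Tuple[int, str]]:
    """Yield (start, fragment) pairs of fixed length along the reference."""
    for start in range(0, len(seq) - size + 1, step):
        yield start, seq[start:start + size]


def occurs_in(fragment: str, genome: str) -> bool:
    """Exact match on either strand (a simplifying assumption)."""
    return fragment in genome or reverse_complement(fragment) in genome


def specific_fragments(
    reference: str,
    targets: Sequence[str],
    non_targets: Sequence[str],
    size: int = 300,
    step: int = 300,
) -> List[int]:
    """Start positions of reference fragments present in all target
    genomes and absent from all non-target genomes."""
    hits = []
    for start, frag in fragments(reference, size, step):
        in_all_targets = all(occurs_in(frag, g) for g in targets)
        in_any_other = any(occurs_in(frag, g) for g in non_targets)
        if in_all_targets and not in_any_other:
            hits.append(start)
    return hits
```

On toy data with 4-nt fragments, `specific_fragments("ACCAGATTCTCC", ["TTACCATTGATTTT", "GGAATCGGTGGTGG"], ["CCGATTCC"], size=4, step=4)` keeps only the fragment at position 0: the second fragment also occurs in the non-target sequence, and the third is missing from the targets. Real genomes would call for an indexed or alignment-based search rather than plain substring tests.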
Costa, Marina C; Esteves, Francisco; Antunes, Francisco; Matos, Olga
2006-12-01
The aim of the present study was to evaluate the genetic variation of Pneumocystis jirovecii dihydrofolate reductase (DHFR) gene in an immunocompromised Portuguese population and to investigate the possible association between DHFR genotypes and P. jirovecii pneumonia (PcP) prophylaxis with co-trimoxazole. One hundred and thirty-eight P. jirovecii isolates were submitted to DHFR genetic characterization by PCR and sequencing. In the studied population, 72.7% of the patients presented sequences identical to the wild-type sequence of the P. jirovecii DHFR gene and 27.3% presented point substitutions. A total of nine substitution sites were identified; four synonymous substitutions at nucleotide positions 201, 272, 312 and 381 were detected in 31 patients. Five non-synonymous substitutions were observed, leading to the DHFR mutations Leu-13-->Ser, Asn-23-->Ser, Ser-31-->Phe, Met-52-->Leu and Ala-67-->Val. With the exception of the polymorphism at position 312 and the mutation at codon 52, all polymorphisms were reported in this study for the first time. Our results suggest that DHFR gene polymorphisms are frequent in the Portuguese immunocompromised population but do not seem to be associated with PcP prophylaxis failure (P = 0.748 and P = 0.730).
Analysis of sequence repeats of proteins in the PDB.
Mary Rajathei, David; Selvaraj, Samuel
2013-12-01
Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that the identity of the sequence repeats falls in the range of 20-40% and that the superimposed structures of most of the sequence repeats maintain similar overall folding. Analysis of sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Restricted transfer of learning between unimanual and bimanual finger sequences
Bai, Wenjun
2016-01-01
When training bimanual skills, such as playing piano, people sometimes practice each hand separately and at a later stage combine the movements of the two hands. This poses the critical question of whether motor skills can be acquired by separately practicing each subcomponent or should be trained as a whole. In the present study, we addressed this question by training human subjects for 4 days in a unimanual or bimanual version of the discrete sequence production task. Both groups were then tested on trained and untrained sequences on both unimanual and bimanual versions of the task. Surprisingly, we found no evidence of transfer from trained unimanual to bimanual or from trained bimanual to unimanual sequences. In half the participants, we also investigated whether cuing the sequences on the left and right hand with unique letters would change transfer. With these cues, untrained sequences that shared some components with the trained sequences were performed more quickly than sequences that did not. However, the amount of this transfer was limited to ∼10% of the overall sequence-specific learning gains. These results suggest that unimanual and bimanual sequences are learned in separate representations. Making participants aware of the interrelationship between sequences can induce some transferrable component, although the main component of the skill remains unique to unimanual or bimanual execution. NEW & NOTEWORTHY Studies in reaching movement demonstrated that approximately half of motor learning can transfer across unimanual and bimanual contexts, suggesting that neural representations for unimanual and bimanual movements are fairly overlapping at the level of elementary movement. In this study, we show that little or no transfer occurred across unimanual and bimanual sequential finger movements. This result suggests that bimanual sequences are represented at a level of the motor hierarchy that integrates movements of both hands. PMID:27974447
Properties of a U1 RNA enhancer-like sequence.
Ciliberto, G; Palla, F; Tebb, G; Mattaj, I W; Philipson, L
1987-01-01
The properties of a X.laevis U1B snRNA gene enhancer have been studied by microinjection in Xenopus oocytes. The enhancer-like sequence, defined as a short DNA stretch that is able to activate transcription in an orientation independent manner, is interchangeable between different U snRNA genes. The enhancer sequence alone does not, however, efficiently activate transcription from an SV40 pol II promoter but regains its activity when combined with the U-gene specific proximal sequence element. DNase I protection experiments show that the X.laevis U1B enhancer can interact specifically with a nuclear factor present in mammalian cells. Images PMID:3031597
Chromatin and RNAi factors protect the C. elegans germline against repetitive sequences
Robert, Valérie J.P.; Sijen, Titia; van Wolfswinkel, Josien; Plasterk, Ronald H.A.
2005-01-01
Protection of genomes against invasion by repetitive sequences, such as transposons, viruses, and repetitive transgenes, involves strong and selective silencing of these sequences. During silencing of repetitive transgenes, a trans effect (“cosuppression”) occurs that results in silencing of cognate endogenous genes. Here we report RNA interference (RNAi) screens performed to catalog genes required for cosuppression in the Caenorhabditis elegans germline. We find factors with a putative role in chromatin remodeling and factors involved in RNAi. Together with molecular data also presented in this study, these results suggest that in C. elegans repetitive sequences trigger transcriptional gene silencing using RNAi and chromatin factors. PMID:15774721
PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.
Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy
2015-05-01
We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.
A communal catalogue reveals Earth's multiscale microbial diversity.
Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob
2017-11-23
Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
NASA Technical Reports Server (NTRS)
Wu, T.; Orgel, L. E.
1992-01-01
We have used [32P]-labeled hairpin oligonucleotides to study template-directed synthesis on templates containing one or more A or T residues within a run of C residues. When nucleoside-5'-phosphoro(2-methyl)imidazolides are used as substrates, isolated A and T residues function efficiently in facilitating the incorporation of U and A, respectively. The reactions are regiospecific, producing mainly 3'-5'-phosphodiester bonds. Pairs of consecutive non-C residues are copied much less efficiently. Limited synthesis of CA and AC sequences on templates containing TG and GT sequences was observed along with some synthesis of the AA sequences on templates containing TT sequences. The other dimer sequences investigated, AA, AG, GA, TA, and AT, could not be copied. If A is absent from the reaction mixture, misincorporation of G residues is a significant reaction on templates containing an isolated T residue or two consecutive T residues. However, if both A and G are present, A is incorporated to a much greater extent than G. We believe that wobble-pairing between T and G is responsible for misincorporation when only G is present.
Chevalier, Nicolas; James, Tiffany D; Wiebe, Sandra A; Nelson, Jennifer Mize; Espy, Kimberly Andrews
2014-07-01
The present study addressed whether developmental improvement in working memory span task performance relies upon a growing ability to proactively plan response sequences during childhood. Two hundred thirteen children completed a working memory span task in which they used a touchscreen to reproduce orally presented sequences of animal names. Children were assessed longitudinally at 7 time points between 3 and 10 years of age. Twenty-one young adults also completed the same task. Proactive response sequence planning was assessed by comparing recall durations for the 1st item (preparatory interval) and subsequent items. At preschool age, the preparatory interval was generally shorter than subsequent item recall durations, whereas it was systematically longer during elementary school and in adults. Although children mostly approached the task reactively at preschool, they proactively planned response sequences with increasing efficiency from age 7 on, like adults. These findings clarify the nature of the changes in executive control that support working memory performance with age. (PsycINFO Database Record (c) 2014 APA, all rights reserved).
Data compression of discrete sequence: A tree based approach using dynamic programming
NASA Technical Reports Server (NTRS)
Shivaram, Gurusrasad; Seetharaman, Guna; Rao, T. R. N.
1994-01-01
A dynamic programming based approach for data compression of a 1D sequence is presented. The compression of an input sequence of size N to that of a smaller size k is achieved by dividing the input sequence into k subsequences and replacing the subsequences by their respective average values. The partitioning of the input sequence is carried out with the intention of reducing the mean squared error in the reconstructed sequence. The complexity involved in finding the partitions which would result in such an optimal compressed sequence is reduced by using the dynamic programming approach, which is presented.
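The partitioning problem stated in the abstract above (split N samples into k contiguous runs, replace each run by its mean, minimise the squared reconstruction error) admits a standard dynamic programming solution. The sketch below is a generic O(k·N²) formulation and is not claimed to be the tree based algorithm of the report. The function name and return layout are hypothetical.

```python
from typing import List, Sequence, Tuple


def compress_sequence(values: Sequence[float], k: int) -> Tuple[List[float], List[int], float]:
    """Split `values` into k contiguous segments minimising squared error.

    Returns (segment means, segment start indices, mean squared error).
    """
    n = len(values)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(values)")

    # Prefix sums of x and x^2 give the error of any segment in O(1).
    s = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, v in enumerate(values):
        s[i + 1] = s[i] + v
        s2[i + 1] = s2[i] + v * v

    def sse(i: int, j: int) -> float:
        """Squared error of replacing values[i:j] by its mean (j > i)."""
        total = s[j] - s[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # cost[m][j]: best error covering the first j samples with m segments.
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):  # the last segment is values[i:j]
                c = cost[m - 1][i] + sse(i, j)
                if c < cost[m][j]:
                    cost[m][j] = c
                    back[m][j] = i

    # Walk the back-pointers to recover the segment boundaries.
    starts: List[int] = []
    j = n
    for m in range(k, 0, -1):
        j = back[m][j]
        starts.append(j)
    starts.reverse()
    ends = starts[1:] + [n]
    means = [(s[e] - s[b]) / (e - b) for b, e in zip(starts, ends)]
    return means, starts, cost[k][n] / n
```

For example, `compress_sequence([1, 1, 1, 5, 5, 9, 9, 9], 3)` places the boundaries at indices 0, 3 and 5 and reconstructs the input without error, because each run is constant. The prefix sums are what keep the inner loop cheap: without them every candidate segment would need its own pass over the data.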
An Investigation of the Role of Sequencing in Children's Reading Comprehension
ERIC Educational Resources Information Center
Gouldthorp, Bethanie; Katsipis, Lia; Mueller, Cara
2018-01-01
To date, little is known about the high-level language skills and cognitive processes underlying reading comprehension in children. The present study aimed to investigate whether children with high, compared with low, reading comprehension differ in their sequencing skill, which was defined as the ability to identify and recall the temporal order…
Timing of Visual Bodily Behavior in Repair Sequences: Evidence from Three Languages
ERIC Educational Resources Information Center
Floyd, Simeon; Manrique, Elizabeth; Rossi, Giovanni; Torreira, Francisco
2016-01-01
This article expands the study of other-initiated repair in conversation--when one party signals a problem with producing or perceiving another's turn at talk--into the domain of visual bodily behavior. It presents one primary cross-linguistic finding about the timing of visual bodily behavior in repair sequences: if the party who initiates repair…
da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall’Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves
2016-01-01
Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. PMID:27198027
Real-time film recording from stroke-written CRT's
NASA Technical Reports Server (NTRS)
Hunt, R.; Grunwald, A. J.
1980-01-01
Real-time simulation studies often require motion-picture recording of events directly from stroke written cathode-ray tubes (CRT's). Difficulty presented is prevention of "flicker," which results from lack of synchronization between display sequence on CRT and shutter motion of camera. Programmable method has been devised for phasing display sequence to shutter motion, ensuring flicker-free recordings.
Formative Research on the Simplifying Conditions Method (SCM) for Task Analysis and Sequencing.
ERIC Educational Resources Information Center
Kim, YoungHwan; Reigluth, Charles M.
The Simplifying Conditions Method (SCM) is a set of guidelines for task analysis and sequencing of instructional content under the Elaboration Theory (ET). This article introduces the fundamentals of SCM and presents the findings from a formative research study on SCM. It was conducted in two distinct phases: design and instruction. In the first…
On How "n" and "i" Turned out to Become Indices in Mathematical Sequences and Formulae
ERIC Educational Resources Information Center
Schubring, Gert
2011-01-01
The evolution of indexing sequences and series has been little studied so far, although this topic presents difficulties for students. This symbolisation signified, however, an important means for operating more generally with finite and infinite series. This paper investigates the first steps during the eighteenth century and then the different…
USDA-ARS's Scientific Manuscript database
The increase in the consumption of fresh produce in the United States has correlated with a rise in the number of reported foodborne illnesses. To identify potential risk factors associated with post-harvest practices, the present study employed multilocus sequence typing (MLST) for the genotypic c...
Motor Interference Does Not Impair the Memory Consolidation of Imagined Movements
ERIC Educational Resources Information Center
Debarnot, Ursula; Maley, Laura; De Rossi, Danilo; Guillot, Aymeric
2010-01-01
The present study aimed to investigate whether an interference task might impact the sleep-dependent consolidation process of a mentally learned sequence of movements. Thirty-two participants were subjected to a first training session through motor imagery (MI) or physical practice (PP) of a finger sequence learning task. After 2 h, half of the…
The Development of Long-Term Lexical Representations through Hebb Repetition Learning
ERIC Educational Resources Information Center
Szmalec, Arnaud; Page, Mike P. A.; Duyck, Wouter
2012-01-01
This study clarifies the involvement of short- and long-term memory in novel word-form learning, using the Hebb repetition paradigm. In Experiment 1, participants recalled sequences of visually presented syllables (e.g., "la"-"va"-"bu"-"sa"-"fa"-"ra"-"re"-"si"-"di"), with one particular (Hebb) sequence repeated on every third trial. Crucially,…
Draft Genome of Janthinobacterium sp. RA13 Isolated from Lake Washington Sediment
McTaggart, Tami L.; Shapiro, Nicole; Woyke, Tanja
2015-01-01
Sequencing the genome of Janthinobacterium sp. RA13 from Lake Washington sediment is announced. From the genome content, a versatile life-style is predicted, but not bona fide methylotrophy. With the availability of its genomic sequence, Janthinobacterium sp. RA13 presents a prospective model for studying microbial communities in lake sediments. PMID:25676775
Molecular basis of length polymorphism in the human zeta-globin gene complex.
Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J
1983-01-01
The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Hodzic, Jasin; Gurbeta, Lejla; Omanovic-Miklicanin, Enisa; Badnjevic, Almir
2017-01-01
Introduction: Major advancements in DNA sequencing methods introduced in the first decade of the new millennium initiated a rapid expansion of sequencing studies, which yielded a tremendous amount of DNA sequence data, including whole sequenced genomes of various species, including plants. A set of novel sequencing platforms, often collectively named “next-generation sequencing” (NGS), completely transformed the life sciences by allowing extensive throughput while greatly reducing the necessary time, labor and cost of any sequencing endeavor. Purpose: The purpose of this paper is to present an overview of the NGS platforms used to produce the current compendium of published draft genomes of various plants, namely the Roche/454, ABI/SOLiD, and Solexa/Illumina, and to determine the most frequently used platform for the whole genome sequencing of plants in light of genotyping of the immortelle plant. Materials and methods: 45 papers were selected (with 47 presented plant genome draft sequences), and the sequencing techniques and NGS platforms (Roche/454, ABI/SOLiD and Illumina/Solexa) utilized in the selected papers were determined. Subsequently, the frequency of usage of each platform or combination of platforms was calculated. Results: Illumina/Solexa platforms were used either as the sole sequencing tool (40.42% of published genomes) or in combination with other platforms (an additional 48.94% of published genomes), followed by Roche/454 platforms, used in combination with the traditional Sanger sequencing method (10.64%) and never as a sole tool. ABI/SOLiD was only used in combination with Illumina/Solexa and Roche/454, in 4.25% of publications. Conclusions: Illumina/Solexa platforms are by far the most preferred by researchers, most probably due to the most affordable sequencing costs. Taking into consideration the current economic situation in the Balkans region, Illumina/Solexa is the best (if not the only) platform choice if the sequencing of the immortelle plant (Helichrysum arenarium) is to be performed by the researchers in this region. PMID:28974852
Abdel-Shafi, Iman R; Shoieb, Eman Y; Attia, Samar S; Rubio, José M; Ta-Tang, Thuy-Huong; El-Badry, Ayman A
2017-03-01
Lymphatic filariasis (LF) is a serious vector-borne health problem, and Wuchereria bancrofti (W.b) is the major cause of LF worldwide and is focally endemic in Egypt. Identification of filarial infection using traditional morphologic and immunological criteria can be difficult and lead to misdiagnosis. The aim of the present study was molecular detection of W.b in residents in endemic areas in Egypt, sequence variance analysis, and phylogenetic analysis of W.b DNA. Collected blood samples from residents in filariasis endemic areas in five governorates were subjected to semi-nested PCR targeting repeated DNA sequence, for detection of W.b DNA. PCR products were sequenced; subsequently, a phylogenetic analysis of the obtained sequences was performed. Out of 300 blood samples, W.b DNA was identified in 48 (16%). Sequencing analysis confirmed PCR results identifying only W.b species. Sequence alignment and phylogenetic analysis indicated genetically distinct clusters of W.b among the study population. Study results demonstrated that the semi-nested PCR proved to be an effective diagnostic tool for accurate and rapid detection of W.b infections in nano-epidemics and is applicable for samples collected in the daytime as well as the night time. PCR product sequencing and phylogenetic analysis revealed three different nucleotide sequence variants. Further genetic studies of W.b in Egypt and other endemic areas are needed to distinguish related strains and the various ecological as well as drug effects exerted on them to support W.b elimination.
Analysis on the use of Multi-Sequence MRI Series for Segmentation of Abdominal Organs
NASA Astrophysics Data System (ADS)
Selver, M. A.; Selvi, E.; Kavur, E.; Dicle, O.
2015-01-01
Segmentation of abdominal organs from MRI data sets is a challenging task due to various limitations and artefacts. During the routine clinical practice, radiologists use multiple MR sequences in order to analyze different anatomical properties. These sequences have different characteristics in terms of acquisition parameters (such as contrast mechanisms and pulse sequence designs) and image properties (such as pixel spacing, slice thicknesses and dynamic range). For a complete understanding of the data, computational techniques should combine the information coming from these various MRI sequences. These sequences are not acquired in parallel but in a sequential manner (one after another). Therefore, patient movements and respiratory motions change the position and shape of the abdominal organs. In this study, the amount of these effects is measured using three different symmetric surface distance metrics performed to three dimensional data acquired from various MRI sequences. The results are compared to intra and inter observer differences and discussions on using multiple MRI sequences for segmentation and the necessities for registration are presented.
Chakalov, Ivan; Draganova, Rossitza; Wollbrink, Andreas; Preissl, Hubert; Pantev, Christo
2012-06-20
The aim of the present study was to identify a specific neuronal correlate underlying the pre-attentive auditory stream segregation of subsequent sound patterns alternating in spectral or temporal cues. Fifteen participants with normal hearing were presented with series of two consecutive ABA auditory tone-triplet sequences, the initial triplets being the Adaptation sequence and the subsequent triplets being the Test sequence. In the first experiment, the frequency separation (delta-f) between A and B tones in the sequences was varied by 2, 4 and 10 semitones. In the second experiment, a constant delta-f of 6 semitones was maintained but the Inter-Stimulus Intervals (ISIs) between A and B tones were varied. Auditory evoked magnetic fields (AEFs) were recorded using magnetoencephalography (MEG). Participants watched a muted video of their choice and ignored the auditory stimuli. In a subsequent behavioral study both MEG experiments were replicated to provide information about the participants' perceptual state. MEG measurements showed a significant increase in the amplitude of the B-tone related P1 component of the AEFs as delta-f increased. This effect was seen predominantly in the left hemisphere. A significant increase in the amplitude of the N1 component was only obtained for a Test sequence delta-f of 10 semitones with a prior Adaptation sequence of 2 semitones. This effect was more pronounced in the right hemisphere. The additional behavioral data indicated an increased probability of two-stream perception for delta-f = 4 and delta-f = 10 semitones with a preceding Adaptation sequence of 2 semitones. However, neither the neural activity nor the perception of the successive streaming sequences was modulated when the ISIs were varied. Our MEG experiment demonstrated differences in the behavior of P1 and N1 components during the automatic segregation of sounds when induced by an initial Adaptation sequence.
The P1 component appeared enhanced in all Test-conditions and thus demonstrates the preceding context effect, whereas N1 was specifically modulated only by large delta-f Test sequences induced by a preceding small delta-f Adaptation sequence. These results suggest that P1 and N1 components represent at least partially-different systems that underlie the neural representation of auditory streaming.
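To make the delta-f values concrete, the short sketch below converts a separation in semitones into a frequency ratio. It assumes the usual equal-tempered definition of a semitone (a ratio of 2^(1/12)); the A-tone frequency of 500 Hz is a hypothetical value chosen for illustration and is not taken from the study.

```python
def semitone_ratio(semitones):
    """Frequency ratio for a separation of `semitones` equal-tempered semitones."""
    return 2.0 ** (semitones / 12.0)

def b_tone_frequency(a_tone_hz, delta_f_semitones):
    """Frequency of the B tone lying delta_f semitones above the A tone."""
    return a_tone_hz * semitone_ratio(delta_f_semitones)

A_TONE_HZ = 500.0  # hypothetical A-tone frequency, not from the abstract

for delta_f in (2, 4, 6, 10):  # separations mentioned in the abstract
    print(f"delta-f = {delta_f:2d} semitones -> B tone at "
          f"{b_tone_frequency(A_TONE_HZ, delta_f):.1f} Hz")
```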
Molecular analysis of the human faecal archaea in a southern Indian population.
Rani, Sandya B; Balamurugan, Ramadass; Ramakrishna, Balakrishnan S
2017-03-01
Archaea are an important constituent of the human gut microbiota, but there is no information on human gut archaea in an Indian population. In this study, faecal samples were obtained from different age groups (neonatal babies, preschool children, school-going children, adolescents, adults and elderly) of a southern Indian population, and from a tribal population also resident in southern India. 16S rRNA gene sequences specific to Archaea were amplified from pooled faecal DNA in each group, sequenced, and aligned against the NCBI database. Of the 806 adequate sequences in the study, most aligned with 22 known sequences. There were 9 novel sequences in the present study. All sequences were deposited in the GenBank nucleotide sequence database with the following accession numbers: KF607113 - KF607918. Methanobrevibacter was the most prevalent genus among all the age groups, accounting for 98% in neonates, 96% in post-weaning, and 100% each in the preschool, school and adult populations. In the elderly, Methanobrevibacter accounted for 96%, and in tribal adults, 99% of the clones belonged to the genus Methanobrevibacter. Other genera detected included Caldisphaera, Halobaculum, Methanosphaera and Thermogymnomonas. Methanobrevibacter smithii predominated in all age groups, accounting for 749 (92.9%) of the 806 sequences. Archaea can be found in the faeces of southern Indian residents immediately after birth. Methanobrevibacter smithii was the dominant faecal archaeon in all age groups, with other genera being found at the extremes of age.
Borges, Juliana N; Cunha, Luiz F G; Miranda, Daniele F; Monteiro-Neto, Cassiano; Santos, Cláudia P
2015-12-01
Pseudoterranova larvae parasitizing cutlassfish Trichiurus lepturus and bluefish Pomatomus saltatrix from the Southwest Atlantic coast of Brazil were studied in this work by morphological, ultrastructural and molecular approaches. Genetic analyses were performed for the ITS2 intergenic region specific for Pseudoterranova decipiens, the partial 28S (LSU) of ribosomal DNA and the mtDNA cox-1 region. We obtained results for the 28S region and mtDNA cox-1, which were amplified using the polymerase chain reaction and sequenced to evaluate the phylogenetic relationships between sequences of this study and sequences from GenBank. The morphological profile indicated that all nine specimens collected from both fish were L3 larvae of Pseudoterranova sp. The genetic profile confirmed the generic level, but due to the absence of similar sequences for adult parasites on GenBank for the regions amplified, it was not possible to identify them to the species level. The sequences obtained presented 89% similarity with Pseudoterranova decipiens (28S sequences) and Contracaecum osculatum B (mtDNA cox-1). The low similarity, together with the fact that amplification with the specific primer for P. decipiens did not occur, leads us to conclude that our sequences do not belong to the P. decipiens complex.
Efficient use of unlabeled data for protein sequence classification: a comparative study.
Kuksa, Pavel; Huang, Pai-Hsi; Pavlovic, Vladimir
2009-04-29
Recent studies in computational primary protein sequence analysis have leveraged the power of unlabeled data. For example, predictive models based on string kernels trained on sequences known to belong to particular folds or superfamilies, the so-called labeled data set, can attain significantly improved accuracy if this data is supplemented with protein sequences that lack any class tags, the unlabeled data. In this study, we present a principled and biologically motivated computational framework that more effectively exploits the unlabeled data by only using the sequence regions that are more likely to be biologically relevant for better prediction accuracy. As over-represented sequences in large uncurated databases may bias the estimation of computational models that rely on unlabeled data, we also propose a method to remove this bias and improve performance of the resulting classifiers. Combined with state-of-the-art string kernels, our proposed computational framework achieves very accurate semi-supervised protein remote fold and homology detection on three large unlabeled databases. It outperforms current state-of-the-art methods and exhibits a significant reduction in running time. The unlabeled sequences used under the semi-supervised setting resemble unpolished gemstones; when used as-is, they may carry unnecessary features and hence compromise the classification accuracy, but once cut and polished, they improve the accuracy of the classifiers considerably.
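For readers unfamiliar with string kernels, the sketch below shows the k-mer spectrum kernel, one of the simplest members of that family. It is offered only as an illustration of the building block; the abstract does not say which kernels the authors use, and the semi-supervised region selection and bias removal they describe are not reproduced here. The example peptides are hypothetical.

```python
from collections import Counter

def kmer_counts(sequence, k):
    """Count every contiguous substring of length k in the sequence."""
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))

def spectrum_kernel(seq_a, seq_b, k=3):
    """Inner product of the two k-mer count vectors.

    Sequences that share many k-mers get a large value, so the kernel
    acts as an alignment-free similarity measure.
    """
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    return sum(n * counts_b[kmer] for kmer, n in counts_a.items())

# Hypothetical short peptides, for illustration only.
labeled = "MKVLAAGIV"
unlabeled = "MKVLSAGIV"
print(spectrum_kernel(labeled, unlabeled, k=3))
```

A kernel matrix built from such values over labeled and unlabeled sequences can then be passed to any kernel classifier.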
Influence of age on adaptability of human mastication.
Peyron, Marie-Agnès; Blanc, Olivier; Lund, James P; Woda, Alain
2004-08-01
The objective of this work was to study the influence of age on the ability of subjects to adapt mastication to changes in the hardness of foods. The study was carried out on 67 volunteers aged from 25 to 75 yr (29 males, 38 females) who had complete healthy dentitions. Surface electromyograms of the left and right masseter and temporalis muscles were recorded simultaneously with jaw movements using an electromagnetic transducer. Each volunteer was asked to chew and swallow four visco-elastic model foods of different hardness, each presented three times in random order. The number of masticatory cycles, their frequency, and the sum of all electromyographic (EMG) activity in all four muscles were calculated for each masticatory sequence. Multiple linear regression analyses were used to assess the effects of hardness, age, and gender. Hardness was associated with an increase in the mean number of cycles and mean summed EMG activity per sequence. It also increased mean vertical amplitude. Mean vertical amplitude and mean summed EMG activity per sequence were higher in males. These adaptations were present at all ages. Age was associated with an increase of 0.3 cycles per sequence per year of life and with a progressive increase in mean summed EMG activity per sequence. Cycle and opening duration early in the sequence also fell with age. We concluded that although the number of cycles needed to chew a standard piece of food increases progressively with age, the capacity to adapt to changes in the hardness of food is maintained.
Influence of motion on face recognition.
Bonfiglio, Natale S; Manfredi, Valentina; Pessa, Eliano
2012-02-01
The influence of motion information and temporal associations on recognition of non-familiar faces was investigated using two groups which performed a face recognition task. One group was presented with regular temporal sequences of face views designed to produce the impression of motion of the face rotating in depth, the other group with random sequences of the same views. In one condition, participants viewed the sequences of the views in rapid succession with a negligible interstimulus interval (ISI). This condition was characterized by three different presentation times. In another condition, participants were presented a sequence with a 1-sec. ISI between the views. Regular sequences of views with a negligible ISI and a shorter presentation time were hypothesized to give rise to better recognition, related to a stronger impression of face rotation. Analysis of data from 45 participants showed that a shorter presentation time was associated with significantly better accuracy on the recognition task; however, differences between performances associated with regular and random sequences were not significant.
Single-molecule analysis of DNA cross-links using nanopore technology
NASA Astrophysics Data System (ADS)
Wolna, Anna H.
The alpha-hemolysin (alpha-HL) protein ion channel is a potential next-generation sequencing platform that has been extensively used to study nucleic acids at a single-molecule level. After applying a potential across a lipid bilayer, the imbedded alpha-HL allows monitoring of the duration and current levels of DNA translocation and immobilization. Because this method does not require DNA amplification prior to sequencing, all the DNA damage present in the cell at any given time will be present during the sequencing experiment. The goal of this research is to determine if these damage sites give distinguishable current levels beyond those observed for the canonical nucleobases. Because DNA cross-links are one of the most prevalent types of DNA damage occurring in vivo, the blockage current levels were determined for thymine-dimers, guanine(C8)-thymine(N3) cross-links and platinum adducts. All of these cross-links give a different blockage current level compared to the undamaged strands when immobilized in the ion channel, and they all can easily translocate across the alpha-HL channel. Additionally, the alpha-HL nanopore technique presents a unique opportunity to study the effects of DNA cross-links, such as thymine-dimers, on the secondary structure of DNA G-quadruplexes folded from the human telomere sequence. Using this single-molecule nanopore technique we can detect subtle structural differences that cannot be easily addressed using conventional methods. The human telomere plays crucial roles in maintaining genome stability. In the presence of suitable cations, the repetitive 5'-TTAGGG human telomere sequence can fold into G-quadruplexes that adopt the hybrid fold in vivo. The telomere sequence is hypersensitive to UV-induced thymine-dimer (T=T) formation, and yet the presence of thymine dimers does not cause telomere shortening. 
The potential structural disruption and thermodynamic stability of the T=T-containing natural telomere sequences were studied to understand how this damage is tolerated in telomeric DNA. The alpha-HL experiments determined that T=Ts disrupt double-chain reversal loop formation but are well tolerated in edgewise and diagonal loops of the hybrid G-quadruplexes. These studies demonstrated the power of the alpha-HL ion channel to analyze DNA modifications and secondary structures at a single-molecule level.
Bacterial Landscape of Bloodstream Infections in Neutropenic Patients via High Throughput Sequencing
Gyarmati, Peter; Kalin, Mats; Öhrmalm, Lars; Giske, Christian G.
2015-01-01
Background Bloodstream infection (BSI) is a common and potentially life-threatening complication in patients with hematological malignancies and therapy-induced neutropenia. Administration of broad spectrum antibiotics has substantially decreased the mortality rate in febrile neutropenia, but bacterial infection is documented in only one-third or fewer of the cases. BSI is typically diagnosed by blood culture; however, this method can detect only culturable pathogens. Methods In the present study, a total of 130 blood samples from hematological patients receiving dose-intensive antitumoural treatment were subjected to 16S rRNA PCR and 62 of them were cultured. PCR positive samples were processed to high throughput sequencing by amplifying the V1-V3 regions of the 16S rRNA gene to obtain a full spectrum of bacteria present in BSI. Results Five phyla and 30 genera were identified with sequencing compared to 2 phyla and 4 genera with culture. The largest proportion of bacteria detected by sequencing belonged to Proteobacteria (55.2%), Firmicutes (33.4%) and Actinobacteria (8.6%), while Fusobacteria (0.4%) and Bacteroidetes (0.1%) were also detected. Ninety-eight percent of the bacteria identified by sequencing were opportunistic human pathogens and 65% belonged to the normal human microbiota. Conclusions The present study indicates that BSIs in neutropenic hosts contain a much broader diversity of bacteria, likely with host origin, than previously realized. The elevated ratio of Proteobacteria in BSI corroborates the results found in other systemic inflammatory diseases, such as inflammatory bowel disease or mucosal infections. This knowledge may become of value for tailoring antimicrobial drug administration. PMID:26270467
Analysis of Sequence Data Under Multivariate Trait-Dependent Sampling.
Tao, Ran; Zeng, Donglin; Franceschini, Nora; North, Kari E; Boerwinkle, Eric; Lin, Dan-Yu
2015-06-01
High-throughput DNA sequencing allows for the genotyping of common and rare variants for genetic association studies. At the present time and for the foreseeable future, it is not economically feasible to sequence all individuals in a large cohort. A cost-effective strategy is to sequence those individuals with extreme values of a quantitative trait. We consider the design under which the sampling depends on multiple quantitative traits. Under such trait-dependent sampling, standard linear regression analysis can result in bias of parameter estimation, inflation of type I error, and loss of power. We construct a likelihood function that properly reflects the sampling mechanism and utilizes all available data. We implement a computationally efficient EM algorithm and establish the theoretical properties of the resulting maximum likelihood estimators. Our methods can be used to perform separate inference on each trait or simultaneous inference on multiple traits. We pay special attention to gene-level association tests for rare variants. We demonstrate the superiority of the proposed methods over standard linear regression through extensive simulation studies. We provide applications to the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study and the National Heart, Lung, and Blood Institute Exome Sequencing Project.
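The bias that motivates this work can be seen in a small simulation. The sketch below is not the authors' likelihood or EM method; it only illustrates, under assumed parameter values (effect size, allele frequency, tail fraction, all hypothetical), that ordinary least squares applied to individuals selected for extreme trait values overestimates the genetic effect for the single-trait case.

```python
import random

def ols_slope(x, y):
    """Ordinary least-squares slope of y regressed on x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx

def simulate_extreme_sampling(n=20000, beta=0.3, maf=0.3, tail=0.1, seed=1):
    """Compare the OLS slope in the full cohort with the slope among
    individuals in the lower and upper `tail` fraction of the trait."""
    rng = random.Random(seed)
    # Additive genotype coded 0/1/2 (number of minor alleles).
    genotypes = [(rng.random() < maf) + (rng.random() < maf) for _ in range(n)]
    traits = [beta * g + rng.gauss(0.0, 1.0) for g in genotypes]
    order = sorted(range(n), key=lambda i: traits[i])
    m = int(tail * n)
    chosen = order[:m] + order[-m:]  # only these would be sequenced
    full = ols_slope(genotypes, traits)
    extreme = ols_slope([genotypes[i] for i in chosen],
                        [traits[i] for i in chosen])
    return full, extreme

full_slope, extreme_slope = simulate_extreme_sampling()
print(f"full cohort slope:     {full_slope:.3f}")
print(f"extreme sample slope:  {extreme_slope:.3f}")
```

The inflated slope in the selected sample is the kind of distortion that a likelihood reflecting the sampling mechanism is designed to correct.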
Design of association studies with pooled or un-pooled next-generation sequencing data.
Kim, Su Yeon; Li, Yingrui; Guo, Yiran; Li, Ruiqiang; Holmkvist, Johan; Hansen, Torben; Pedersen, Oluf; Wang, Jun; Nielsen, Rasmus
2010-07-01
Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (minor allele frequency, MAF < 5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a shallower depth with a larger pool size achieves higher power than sequencing a small number of individuals at a higher depth with a smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing. (c) 2010 Wiley-Liss, Inc.
Bilingual Control: Sequential Memory in Language Switching
ERIC Educational Resources Information Center
Declerck, Mathieu; Philipp, Andrea M.; Koch, Iring
2013-01-01
To investigate bilingual language control, prior language switching studies presented visual objects, which had to be named in different languages, typically indicated by a visual cue. The present study examined language switching of predictable responses by introducing a novel sequence-based language switching paradigm. In 4 experiments,…
Hölzemer, Angelique; Thobakgale, Christina F; Jimenez Cruz, Camilo A; Garcia-Beltran, Wilfredo F; Carlson, Jonathan M; van Teijlingen, Nienke H; Mann, Jaclyn K; Jaggernath, Manjeetha; Kang, Seung-gu; Körner, Christian; Chung, Amy W; Schafer, Jamie L; Evans, David T; Alter, Galit; Walker, Bruce D; Goulder, Philip J; Carrington, Mary; Hartmann, Pia; Pertel, Thomas; Zhou, Ruhong; Ndung'u, Thumbi; Altfeld, Marcus
2015-11-01
Viruses can evade immune surveillance, but the underlying mechanisms are insufficiently understood. Here, we sought to understand the mechanisms by which natural killer (NK) cells recognize HIV-1-infected cells and how this virus can evade NK-cell-mediated immune pressure. Two sequence mutations in p24 Gag associated with the presence of specific KIR/HLA combined genotypes were identified in HIV-1 clade C viruses from a large cohort of infected, untreated individuals in South Africa (n = 392), suggesting viral escape from KIR+ NK cells through sequence variations within HLA class I-presented epitopes. One sequence polymorphism at position 303 of p24 Gag (TGag303V), selected for in infected individuals with both KIR2DL3 and HLA-C*03:04, enabled significantly better binding of the inhibitory KIR2DL3 receptor to HLA-C*03:04-expressing cells presenting this variant epitope compared to the wild-type epitope (wild-type mean 18.01 ± 10.45 standard deviation [SD] and variant mean 44.67 ± 14.42 SD, p = 0.002). Furthermore, activation of primary KIR2DL3+ NK cells from healthy donors in response to HLA-C*03:04+ target cells presenting the variant epitope was significantly reduced in comparison to cells presenting the wild-type sequence (wild-type mean 0.78 ± 0.07 standard error of the mean [SEM] and variant mean 0.63 ± 0.07 SEM, p = 0.012). Structural modeling and surface plasmon resonance of KIR/peptide/HLA interactions in the context of the different viral sequence variants studied supported these results. Future studies will be needed to assess processing and antigen presentation of the investigated HIV-1 epitope in natural infection, and the consequences for viral control. These data provide novel insights into how viruses can evade NK cell immunity through the selection of mutations in HLA-presented epitopes that enhance binding to inhibitory NK cell receptors. 
Better understanding of the mechanisms by which HIV-1 evades NK-cell-mediated immune pressure and the functional validation of a structural modeling approach will facilitate the development of novel targeted immune interventions to harness the antiviral activities of NK cells.
Lung Sliding Identification Is Less Accurate in the Left Hemithorax.
Piette, Eric; Daoust, Raoul; Lambert, Jean; Denault, André
2017-02-01
The aim of our study was to compare the accuracy of lung sliding identification for the left and right hemithoraxes, using prerecorded short US sequences, in a group of physicians with mixed clinical and US training. A total of 140 US sequences of a complete respiratory cycle were recorded in the operating room. Each sequence was divided in two, yielding 140 sequences of present lung sliding and 140 sequences of absent lung sliding. Of these 280 sequences, 40 were randomly repeated to assess intraobserver variability, for a total of 320 sequences. Descriptive data, the mean accuracy of each participant, as well as the rate of correct answers for each of the original 280 sequences were tabulated and compared for different subgroups of clinical and US training. A video with examples of present and absent lung sliding and a lung pulse was shown before testing. Two sessions were planned to facilitate the participation of 75 clinicians. In the first group, the rate of accurate lung sliding identification was lower in the left hemithorax than in the right (67.0% [interquartile range (IQR), 43.0-83.0] versus 80.0% [IQR, 57.0-95.0]; P < .001). In the second group, the rate of accurate lung sliding identification was also lower in the left hemithorax than in the right (76.3% [IQR, 42.9-90.9] versus 88.7% [IQR, 63.1-96.9]; P = .001). Mean accuracy rates were 67.5% (95% confidence interval, 65.7-69.4) in the first group and 73.1% (95% confidence interval, 70.7-75.5) in the second (P < .001). Lung sliding identification seems less accurate in the left hemithorax when using a short US examination. This study was done on recorded US sequences and should be repeated in a live clinical situation to confirm our results. © 2016 by the American Institute of Ultrasound in Medicine.
CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data
Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver
2011-01-01
State of the art (DNA) sequencing methods applied in “Omics” studies grant insight into the ‘blueprints’ of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaboration. However, the context in which these studies are conducted often gets lost, because experimental data, as well as information about the environment are rarely submitted along with the sequence data. If these contextual or metadata are missing, key opportunities of comparison and analysis across studies and habitats are hampered or even impossible. To address this problem, the Genomic Standards Consortium (GSC) promotes checklists and standards to better describe our sequence data collection and to promote the capturing, exchange and integration of sequence data with contextual data. In a recent community effort the GSC has developed a series of recommendations for contextual data that should be submitted along with sequence data. To support the scientific community to significantly enhance the quality and quantity of contextual data in the public sequence data repositories, specialized software tools are needed. In this work we present CDinFusion, a web-based tool to integrate contextual and sequence data in (Multi)FASTA format prior to submission. The tool is open source and available under the Lesser GNU Public License 3. A public installation is hosted and maintained at the Max Planck Institute for Marine Microbiology at http://www.megx.net/cdinfusion. The tool may also be installed locally using the open source code available at http://code.google.com/p/cdinfusion. PMID:21935468
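The general idea of merging contextual data into (Multi)FASTA records can be sketched in a few lines. The snippet below is not CDinFusion's code or output format; the bracketed key=value layout of the header and the field names are assumptions made purely for illustration.

```python
def annotate_fasta(fasta_text, contextual_data):
    """Append contextual key=value fields to every FASTA header line.

    `contextual_data` is a dict of metadata shared by all records;
    sequence lines are passed through unchanged.
    """
    fields = " ".join(f"[{key}={value}]" for key, value in contextual_data.items())
    annotated = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            annotated.append(f"{line} {fields}")
        else:
            annotated.append(line)
    return "\n".join(annotated)

# Hypothetical records and hypothetical contextual fields.
records = ">seq1\nACGTACGT\n>seq2\nTTGACCA"
context = {"lat_lon": "54.18 N 7.90 E", "depth": "10 m", "collection_date": "2010-05-01"}
print(annotate_fasta(records, context))
```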
Draft genome of the living fossil Ginkgo biloba.
Guan, Rui; Zhao, Yunpeng; Zhang, He; Fan, Guangyi; Liu, Xin; Zhou, Wenbin; Shi, Chengcheng; Wang, Jiahao; Liu, Weiqing; Liang, Xinming; Fu, Yuanyuan; Ma, Kailong; Zhao, Lijun; Zhang, Fumin; Lu, Zuhong; Lee, Simon Ming-Yuen; Xu, Xun; Wang, Jian; Yang, Huanming; Fu, Chengxin; Ge, Song; Chen, Wenbin
2016-11-21
Ginkgo biloba L. (Ginkgoaceae) is one of the most distinctive plants. It possesses a suite of fascinating characteristics including a large genome, outstanding resistance/tolerance to abiotic and biotic stresses, and dioecious reproduction, making it an ideal model species for biological studies. However, the lack of a high-quality genome sequence has been an impediment to our understanding of its biology and evolution. The 10.61 Gb genome sequence containing 41,840 annotated genes was assembled in the present study. Repetitive sequences account for 76.58% of the assembled sequence, and long terminal repeat retrotransposons (LTR-RTs) are particularly prevalent. The diversity and abundance of LTR-RTs is due to their gradual accumulation and a remarkable amplification between 16 and 24 million years ago, and they contribute to the long introns and large genome. Whole genome duplication (WGD) may have occurred twice, with an ancient WGD consistent with that shown to occur in other seed plants, and a more recent event specific to ginkgo. Abundant gene clusters from tandem duplication were also evident, and enrichment of expanded gene families indicates a remarkable array of chemical and antibacterial defense pathways. The ginkgo genome consists mainly of LTR-RTs resulting from ancient gradual accumulation and two WGD events. The multiple defense mechanisms underlying the characteristic resilience of ginkgo are fostered by a remarkable enrichment in ancient duplicated and ginkgo-specific gene clusters. The present study sheds light on sequencing large genomes, and opens an avenue for further genetic and evolutionary research.
The accuracy of ultrashort echo time MRI sequences for medical additive manufacturing.
van Eijnatten, Maureen; Rijkhorst, Erik-Jan; Hofman, Mark; Forouzanfar, Tymour; Wolff, Jan
2016-01-01
Additively manufactured bone models, implants and drill guides are becoming increasingly popular amongst maxillofacial surgeons and dentists. To date, such constructs are commonly manufactured using CT technology, which involves ionizing radiation. Recently, ultrashort echo time (UTE) MRI sequences have been developed that allow radiation-free imaging of facial bones. The aim of the present study was to assess the feasibility of UTE MRI sequences for medical additive manufacturing (AM). Three morphologically different dry human mandibles were scanned using a CT and an MRI scanner. Additionally, optical scans of all three mandibles were made to acquire a "gold standard". All CT and MRI scans were converted into Standard Tessellation Language (STL) models and geometrically compared with the gold standard. To quantify the accuracy of the AM process, the CT, MRI and gold-standard STL models of one of the mandibles were additively manufactured, optically scanned and compared with the original gold-standard STL model. Geometric differences between all three CT-derived STL models and the gold standard were <1.0 mm. All three MRI-derived STL models generally presented deviations <1.5 mm in the symphyseal and mandibular area. The AM process introduced minor deviations of <0.5 mm. This study demonstrates that MRI using UTE sequences is a feasible alternative to CT in generating STL models of the mandible and would therefore be suitable for surgical planning and AM. Further in vivo studies are necessary to assess the usability of UTE MRI sequences in clinical settings.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].
Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y
2017-08-01
To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) using the Ion Torrent PGM™ platform and to study differences in mtDNA sequence between tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costal cartilage, nail, skeletal muscle and oral epithelium. Amplification of the whole genome sequence of mtDNA was performed with 4 primer pairs. Libraries were constructed with the Ion Shear™ Plus Reagents kit and the Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using the Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions in the HVⅠ region. The whole genome sequence of mtDNA was amplified successfully from all samples. The six unrelated individuals belonged to 6 different haplotypes. Different tissues within one individual showed heteroplasmy differences. The heteroplasmy positions and the mutation positions in the HVⅠ region were verified by Sanger sequencing. A consistency check by the Kappa method showed that the mtDNA sequence results were highly consistent across tissues. The method used in the present study for sequencing the whole genome of human mtDNA can detect heteroplasmy differences between tissues, and its results show good consistency. The results provide guidance for further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine
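The abstract refers to a consistency check by "the Kappa method" without further detail. As a minimal sketch, assuming this means Cohen's kappa computed between the base calls of two tissues at the same mtDNA positions, the statistic could be calculated as follows; the base calls are hypothetical.

```python
from collections import Counter

def cohen_kappa(calls_a, calls_b):
    """Cohen's kappa: agreement between two call sets, corrected for chance."""
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("call lists must be non-empty and of equal length")
    n = len(calls_a)
    observed = sum(a == b for a, b in zip(calls_a, calls_b)) / n
    freq_a, freq_b = Counter(calls_a), Counter(calls_b)
    # Agreement expected if the two call sets were independent.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both sets contain a single identical category
    return (observed - expected) / (1.0 - expected)

# Hypothetical base calls at six mtDNA positions in two tissues.
blood_calls = list("ACGTAC")
hair_calls = list("ACGTAT")
print(f"kappa = {cohen_kappa(blood_calls, hair_calls):.3f}")
```

Values close to 1 indicate near-perfect agreement between tissues, and values near 0 indicate agreement no better than chance.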
Adamiak, Paul; Vanderkooi, Otto G; Kellner, James D; Schryvers, Anthony B; Bettinger, Julie A; Alcantara, Joenel
2014-06-03
Multi-locus sequence typing (MLST) is a portable, broadly applicable method for classifying bacterial isolates at an intra-species level. This methodology provides clinical and scientific investigators with a standardized means of monitoring evolution within bacterial populations. MLST uses the DNA sequences from a set of genes such that each unique combination of sequences defines an isolate's sequence type. In order to reliably determine the sequence of a typing gene, matching sequence reads for both strands of the gene must be obtained. This study assesses the ability of both the standard, and an alternative set of, Streptococcus pneumoniae MLST primers to completely sequence, in both directions, the required typing alleles. The results demonstrated that for five (aroE, recP, spi, xpt, ddl) of the seven S. pneumoniae typing alleles, the standard primers were unable to obtain the complete forward and reverse sequences. This is due to the standard primers annealing too closely to the target regions, and current sequencing technology failing to sequence the bases that are too close to the primer. The alternative primer set described here, which includes a combination of primers proposed by the CDC and several designed as part of this study, addresses this limitation by annealing to highly conserved segments further from the target region. This primer set was subsequently employed to sequence type 105 S. pneumoniae isolates collected by the Canadian Immunization Monitoring Program ACTive (IMPACT) over a period of 18 years. The inability of several of the standard S. pneumoniae MLST primers to fully sequence the required region was consistently observed and is the result of a shift in sequencing technology occurring after the original primers were designed. 
The results presented here introduce clear documentation describing this phenomenon into the literature, and provide additional guidance, through the introduction of a widely validated set of alternative primers, to research groups seeking to undertake S. pneumoniae MLST based studies.
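The typing rule described above (each unique combination of allele sequences defines a sequence type) can be sketched as follows. This is a minimal illustration, not the study's pipeline: the function name `assign_sequence_types` is hypothetical, the allele and sequence-type numbers are local counters rather than identifiers from any curated MLST database, and the locus list adds gdh and gki (from the standard S. pneumoniae scheme) to the five loci named in the abstract.

```python
from typing import Dict

# Seven S. pneumoniae MLST loci: five are named in the abstract; gdh and gki
# complete the standard scheme (an addition not stated in the abstract).
LOCI = ("aroE", "gdh", "gki", "recP", "spi", "xpt", "ddl")


def assign_sequence_types(isolates: Dict[str, Dict[str, str]]) -> Dict[str, int]:
    """Map each isolate to a locally numbered sequence type (hypothetical IDs).

    `isolates` maps an isolate name to a dict of locus -> allele sequence.
    Identical allele sequences at a locus share an allele number, and
    identical seven-number profiles share a sequence type.
    """
    allele_ids = {locus: {} for locus in LOCI}  # per-locus sequence -> allele number
    profile_ids = {}                            # allele profile -> sequence type
    result = {}
    for name, seqs in isolates.items():
        profile = []
        for locus in LOCI:
            table = allele_ids[locus]
            seq = seqs[locus].upper()
            # Reuse the allele number if the sequence was seen, else take the next one.
            profile.append(table.setdefault(seq, len(table) + 1))
        key = tuple(profile)
        result[name] = profile_ids.setdefault(key, len(profile_ids) + 1)
    return result
```

Under these assumptions, two isolates that agree at all seven loci receive the same number, and a single differing allele yields a new sequence type.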
Detection of nucleic acid sequences by invader-directed cleavage
Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.
Farmery, James H R; Smith, Mike L; Lynch, Andy G
2018-01-22
Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously unimaginable scale. To this end, a number of approaches for estimating telomere length from whole-genome sequencing data have been proposed. Here we present Telomerecat, a novel approach to the estimation of telomere length. Previous methods have been dependent on the number of telomeres present in a cell being known, which may be problematic when analysing aneuploid cancer data and non-human samples. Telomerecat is designed to be agnostic to the number of telomeres present, making it suited for the purpose of estimating telomere length in cancer studies. Telomerecat also accounts for interstitial telomeric reads and presents a novel approach to dealing with sequencing errors. We show that Telomerecat performs well at telomere length estimation when compared to leading experimental and computational methods. Furthermore, we show that it detects expected patterns in longitudinal data, repeated measurements, and cross-species comparisons. We also apply the method to cancer cell data, uncovering an interesting relationship with the underlying telomerase genotype.
Maurya, Anand Prakash; Das Talukdar, Anupam; Chanda, Debadatta Dhar; Chakravarty, Atanu; Bhattacharjee, Amitabha
2016-01-01
The present study aimed to investigate the genetic context, association with IS26 and horizontal transmission of SHV-148 among Escherichia coli in a tertiary referral hospital in India. Phenotypic characterisation of extended-spectrum beta-lactamases (ESBLs) was carried out as per CLSI criteria. Molecular characterisation of blaSHV and integron was carried out by polymerase chain reaction (PCR) assay and confirmed by sequencing. Linkage of IS26 with blaSHV-148 was established by PCR. Purified products were cloned on pGEM-T vector and sequenced. Strain typing was performed by pulsed field gel electrophoresis with Xba I digestion. Transferability experiments and antimicrobial susceptibility testing were performed. A total of 33 isolates showed the presence of SHV-148 variant by sequencing and all were Class 1 integron borne. PCR and sequencing results suggested that all blaSHV-148 showed linkage with IS26 and were present in the upstream portion of the gene cassette and were also horizontally transferable through F type of Inc group. Susceptibility results suggest that tigecycline was most effective. The present study is the first report of SHV-148-mediated extended-spectrum cephalosporin resistance from India. Association of the resistance gene with IS26 and Class 1 integron and carriage within an IncF plasmid signify a potential mobilising unit for horizontal transfer.
Unique Trichomonas vaginalis gene sequences identified in multinational regions of Northwest China.
Liu, Jun; Feng, Meng; Wang, Xiaolan; Fu, Yongfeng; Ma, Cailing; Cheng, Xunjia
2017-07-24
Trichomonas vaginalis (T. vaginalis) is a flagellated protozoan parasite that infects humans worldwide. This study determined the sequence of the 18S ribosomal RNA gene of T. vaginalis infecting both females and males in Xinjiang, China. Samples from 73 females and 28 males were collected and confirmed for infection with T. vaginalis. A total of 110 sequences were identified when the T. vaginalis 18S ribosomal RNA gene was sequenced. These sequences were used to prepare a phylogenetic network. The rooted network comprised three large clades and several independent branches. Most of the Xinjiang sequences were in one group. Preliminary results suggest that Xinjiang T. vaginalis isolates might be genetically unique, as indicated by the sequence of their 18S ribosomal RNA gene. The low migration rate of local people in this province may contribute to the genetic conservation of T. vaginalis. The unique genetic feature of our isolates may suggest a different clinical presentation of trichomoniasis, including metronidazole susceptibility, T. vaginalis virus or Mycoplasma co-infection characteristics. The transmission and evolution of Xinjiang T. vaginalis is of interest and should be studied further. More attention should be given to T. vaginalis infection in both females and males in Xinjiang.
Jose, Jency; Jalali, S K; Shivalingaswamy, T M; Kumar, N K Krishna; Bhatnagar, R; Bandyopadhyay, A
2013-06-01
A PCR-based method for detection of viral DNA in nucleopolyhedroviruses (NPVs) of three lepidopterans, Spodoptera litura, Amsacta albistriga and Helicoverpa armigera, was developed by employing the late expression factor-8 (lef-8) gene of the three NPVs using specific primers. Amplicons of 689, 699 and 665 bp were amplified, respectively, and the nucleotide sequences were submitted to GenBank and accession numbers were obtained. The sequences of the lef-8 gene of S. litura NPV and H. armigera NPV matched those of their respective references in the GenBank database, thereby confirming their identity; the sequence of A. albistriga NPV, however, was the first such sequence submitted to the GenBank database. Sequence similarity analysis of the three NPV lef-8 genes sequenced in the present study revealed no significant similarity between them, although A. albistriga NPV and S. litura NPV were found to be closely related. CLUSTAL alignment of the sequences generated revealed general relatedness among the NPV lef-8 genes. The study confirmed that the lef-8 gene can be used for quick and correct discriminatory identification of insect viruses.
Patiño, Liliana Catherine; Beau, Isabelle; Carlosama, Carolina; Buitrago, July Constanza; González, Ronald; Suárez, Carlos Fernando; Patarroyo, Manuel Alfonso; Delemer, Brigitte; Young, Jacques; Binart, Nadine; Laissue, Paul
2017-07-01
Is it possible to identify new mutations potentially associated with non-syndromic primary ovarian insufficiency (POI) via whole-exome sequencing (WES)? WES is an efficient tool to study genetic causes of POI as we have identified new mutations, some of which lead to protein destabilization potentially contributing to the disease etiology. POI is a frequently occurring complex pathology leading to infertility. Mutations in only a few candidate genes, mainly identified by Sanger sequencing, have been definitively related to the pathogenesis of the disease. This is a retrospective cohort study performed on 69 women affected by POI. WES and an innovative bioinformatics analysis were used on non-synonymous sequence variants in a subset of 420 selected POI candidate genes. Mutations in BMPR1B and GREM1 were modeled by using fragment molecular orbital analysis. Fifty-five coding variants in 49 genes potentially related to POI were identified in 33 out of 69 patients (48%). These genes participate in key biological processes in the ovary, such as meiosis, follicular development, granulosa cell differentiation/proliferation and ovulation. The presence of at least two mutations in distinct genes in 42% of the patients argued in favor of a polygenic nature of POI. It is possible that regulatory regions, not analyzed in the present study, carry further variants related to POI. WES and the in silico analyses presented here represent an efficient approach for mapping variants associated with POI etiology. Sequence variants presented here represent potential future genetic biomarkers. This study was supported by the Universidad del Rosario and Colciencias (Grants CS/CIGGUR-ABN062-2016 and 672-2014). Colciencias supported Liliana Catherine Patiño's work (Fellowship: 617, 2013). The authors declare no conflict of interest. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
DYZ1 arrays show sequence variation between the monozygotic males
2014-01-01
Background Monozygotic twins (MZT) are an important resource for genetic studies in the context of normal and diseased genomes. In the present study we used DYZ1, a satellite fraction present in the form of tandem arrays on the long arm of the human Y chromosome, as a tool to uncover sequence variations between the monozygotic males. Results We detected copy number variation and frequent insertions and deletions within the sequences of DYZ1 arrays amongst all three sets of twins used in the present study. MZT1b showed loss of 35 bp compared to that in 1a, whereas 2a showed loss of 31 bp compared to that in 2b. Similarly, 3b showed a 10 bp insertion compared to that in 3a. MZT1a germline DNA showed loss of 5 bp and 1b blood DNA showed loss of 26 bp compared to that of 1a blood and 1b germline DNA, respectively. Of the 69 restriction sites detected in DYZ1 arrays, MboII, BsrI, TspEI and TaqI enzymes showed frequent loss and/or gain amongst all 3 pairs studied. The MZT1 pair showed loss/gain of VspI, BsrDI, AgsI, PleI, TspDTI, TspEI, TfiI and TaqI restriction sites in both blood and germline DNA. All three sets of MZT showed differences in the number of DYZ1 copies. FISH signals reflected somatic mosaicism of the DYZ1 copies across the cells. Conclusions DYZ1 showed both sequence and copy number variation between the MZT males. Sequence variation was also noticed between germline and blood DNA samples of the same individual, as observed in at least one set of samples. The result suggests that DYZ1 faithfully records all the genetic changes occurring after twinning, which may be ascribed to environmental factors. PMID:24495361
Types of diaphragmatic motion during hepatic angiography.
Katsuda, T; Kuroda, C; Fujita, M
1997-01-01
To determine the types and causes of diaphragmatic motion during hepatic angiography, the authors used transarterial cut-film portography (TAP) to study movement of the diaphragm during breath-holding. Thirty-three TAP sequences were studied, and the patients' diaphragmatic motions were classified into four categories according to the distance their diaphragms moved. Results showed that the diaphragm was stationary in 33% of the TAP studies, while perpetual motion occurred in 15% of the studies, early-phase motion occurred in 12% and late-phase motion occurred in 40%. Ten sequences showed diaphragmatic motion of more than 10 mm, with eight sequences showing caudal motion and two showing cranial motion. This article discusses the cause of diaphragmatic motion during breath-holding for hepatic angiography and presents suggestions to reduce motion artifacts during the exam.
Sequential associative memory with nonuniformity of the layer sizes.
Teramae, Jun-Nosuke; Fukai, Tomoki
2007-01-01
Sequence retrieval has a fundamental importance in information processing by the brain, and has extensively been studied in neural network models. Most previous sequential associative memory models embedded sequences of memory patterns that have nearly equal sizes. It was recently shown that local cortical networks display many diverse yet repeatable precise temporal sequences of neuronal activities, termed "neuronal avalanches." Interestingly, these avalanches displayed size and lifetime distributions that obey power laws. Inspired by these experimental findings, here we consider an associative memory model of binary neurons that stores sequences of memory patterns with highly variable sizes. Our analysis includes the case where the statistics of these size variations obey the above-mentioned power laws. We study the retrieval dynamics of such memory systems by analytically deriving the equations that govern the time evolution of macroscopic order parameters. We calculate the critical sequence length beyond which the network cannot retrieve memory sequences correctly. As an application of the analysis, we show how the present variability in sequential memory patterns degrades the power-law lifetime distribution of retrieved neural activities.
NASA Astrophysics Data System (ADS)
Mananga, Eugene Stephane
2018-01-01
The utility of the average Hamiltonian theory and its antecedent the Magnus expansion is presented. We assessed the concept of convergence of the Magnus expansion in quadrupolar spectroscopy of spin-1 via the square of the magnitude of the average Hamiltonian. We investigated this approach for two specific modified composite pulse sequences: COM-Im and COM-IVm. It is demonstrated that the size of the square of the magnitude of the zero-order average Hamiltonian obtained on the appropriate basis is a viable approach to study the convergence of the Magnus expansion. The approach turns out to be efficient in studying pulse sequences in general and can be very useful to investigate coherent averaging in the development of high resolution NMR techniques in solids. This approach allows the two modified composite pulse sequences COM-Im and COM-IVm to be compared theoretically. We also compare theoretically the current modified composite sequences (COM-Im and COM-IVm) to the recently published modified composite pulse sequences (MCOM-I, MCOM-IV, MCOM-I_d, MCOM-IV_d).
Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA
Kim, Minji; Kreig, Alex; Lee, Chun-Ying; Rube, H. Tomas; Calvert, Jacob; Song, Jun S.; Myong, Sua
2016-01-01
Abstract G-quadruplex (GQ) is a four-stranded DNA structure that can be formed in guanine-rich sequences. GQ structures have been proposed to regulate diverse biological processes including transcription, replication, translation and telomere maintenance. Recent studies have demonstrated the existence of GQ DNA in live mammalian cells and a significant number of potential GQ forming sequences in the human genome. We present a systematic and quantitative analysis of GQ folding propensity on a large set of 438 GQ forming sequences in double-stranded DNA by integrating fluorescence measurement, single-molecule imaging and computational modeling. We find that short minimum loop length and the thymine base are two main factors that lead to high GQ folding propensity. Linear and Gaussian process regression models further validate that the GQ folding potential can be predicted with high accuracy based on the loop length distribution and the nucleotide content of the loop sequences. Our study provides important new parameters that can inform the evaluation and classification of putative GQ sequences in the human genome. PMID:27095201
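The two sequence features highlighted above (minimum loop length and thymine content of the loops) can be extracted with a short sketch. The regular expression is a commonly used pattern for putative quadruplex motifs (four runs of three or more guanines separated by loops of 1 to 7 nucleotides) and is an assumption here, not the authors' definition. The function name `putative_gq_features` is hypothetical, and the code computes descriptive features only, not the regression models from the study.

```python
import re

# Assumed motif: four G-runs (>= 3 G) separated by three loops of 1-7 nt.
# Loops are matched lazily so that the shortest admissible loops are reported.
GQ_PATTERN = re.compile(
    r"G{3,}([ACGT]{1,7}?)G{3,}([ACGT]{1,7}?)G{3,}([ACGT]{1,7}?)G{3,}"
)


def putative_gq_features(seq):
    """Return loop-based features for each non-overlapping putative GQ motif."""
    hits = []
    for match in GQ_PATTERN.finditer(seq.upper()):
        loops = match.groups()
        loop_bases = "".join(loops)
        hits.append({
            "start": match.start(),                          # 0-based motif start
            "loop_lengths": tuple(len(loop) for loop in loops),
            "min_loop": min(len(loop) for loop in loops),
            "thymine_fraction": loop_bases.count("T") / len(loop_bases),
        })
    return hits
```

For the human telomeric repeat `GGGTTAGGGTTAGGGTTAGGG` this reports three loops of length 3; features of this kind could then serve as inputs to a regression model, which is not attempted here.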
Identification of Y-Chromosome Sequences in Turner Syndrome.
Silva-Grecco, Roseane Lopes da; Trovó-Marqui, Alessandra Bernadete; Sousa, Tiago Alves de; Croce, Lilian Da; Balarin, Marly Aparecida Spadotto
2016-05-01
To investigate the presence of Y-chromosome sequences and determine their frequency in patients with Turner syndrome. The study included 23 patients with Turner syndrome from Brazil, who gave written informed consent for participating in the study. Cytogenetic analyses were performed in peripheral blood lymphocytes, with 100 metaphases per patient. Genomic DNA was also extracted from peripheral blood lymphocytes, and gene sequences DYZ1, DYZ3, ZFY and SRY were amplified by Polymerase Chain Reaction. The cytogenetic analysis showed a 45,X karyotype in 9 patients (39.2 %) and a mosaic pattern in 14 (60.8 %). In 8.7 % (2 out of 23) of the patients, Y-chromosome sequences were found. This prevalence is very similar to those reported previously. The initial karyotype analysis of these patients did not reveal Y-chromosome material, but they were found positive for Y-specific sequences in the lymphocyte DNA analysis. The PCR technique showed that 2 (8.7 %) of the patients with Turner syndrome had Y-chromosome sequences, both presenting marker chromosomes on cytogenetic analysis.
Criado, A; Martinez, J; Buling, A; Barba, J C; Merino, S; Jefferies, R; Irwin, P J
2006-12-20
As a continuation of our studies on molecular epizootiology of piroplasmosis in Spain and other countries, we present in this contribution the finding of new hosts for some piroplasms, as well as information on their 18S rRNA gene sequences. Genetic data were complemented with sequences of apocytochrome b gene (whenever possible). The following conclusions were drawn from these molecular studies: Theileria annulata is capable of infecting dogs, since it was diagnosed in a symptomatic animal. According to cytochrome b sequences, isolates from cows and dog present slight differences. The same isolates showed, however, identical sequence in the 18S rRNA gene. This exemplifies well the usefulness of the mitochondrial gene for examining infra-specific variation. Babesia bovis is an occasional parasite of equines, since it was detected in two symptomatic horses. We found evidence of genetic polymorphism occurring in the 18S rRNA gene of Spanish T. equi-like and B. ovis isolates. B. bennetti from Spanish seagull is loosely related to B. ovis, and might represent a genetically distinct branch of babesids. A partial sequence of a cytochrome b pseudogene was obtained for the first time in Babesia canis rossi from South Africa. The pseudogene is distantly related to B. bigemina cytochrome b gene. These new findings confirm the ability of some piroplasms to infect multiple hosts, as well as the existence of relatively wide genetic polymorphism with respect to the cytochrome b gene. On the other hand, the existence of mtDNA-like pseudogenes of possible nuclear location in piroplasms is interesting due to their possible impact on molecular phylogeny studies.
Inaugural Genomics Automation Congress and the coming deluge of sequencing data.
Creighton, Chad J
2010-10-01
Presentations at Select Biosciences's first 'Genomics Automation Congress' (Boston, MA, USA) in 2010 focused on next-generation sequencing and the platforms and methodology around them. The meeting provided an overview of sequencing technologies, both new and emerging. Speakers shared their recent work on applying sequencing to profile cells for various levels of biomolecular complexity, including DNA sequences, DNA copy, DNA methylation, mRNA and microRNA. With sequencing time and costs continuing to drop dramatically, a virtual explosion of very large sequencing datasets is at hand, which will probably present challenges and opportunities for high-level data analysis and interpretation, as well as for information technology infrastructure.
Advances in high throughput DNA sequence data compression.
Sardaraz, Muhammad; Tahir, Muhammad; Ikram, Ataul Aziz
2016-06-01
Advances in high throughput sequencing technologies and reduction in cost of sequencing have led to exponential growth in high throughput DNA sequence data. This growth has posed challenges such as storage, retrieval, and transmission of sequencing data. Data compression is used to cope with these challenges. Various methods have been developed to compress genomic and sequencing data. In this article, we present a comprehensive review of compression methods for genome and reads compression. Algorithms are categorized as referential or reference free. Experimental results and comparative analysis of various methods for data compression are presented. Finally, key challenges and research directions in DNA sequence data compression are highlighted.
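As a baseline for the reference-free category mentioned above, the sketch below packs each base into two bits (four bases per byte). It is an illustrative assumption, not one of the reviewed algorithms: the names `pack` and `unpack` are hypothetical, and only the four unambiguous bases are handled, whereas practical tools must also deal with ambiguity codes, read headers and quality scores.

```python
# Fixed 2-bit code per base; only unambiguous bases are supported in this sketch.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"


def pack(seq: str) -> bytes:
    """Pack a DNA string into bytes, four bases per byte, first base in the high bits."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data: bytes, length: int) -> str:
    """Recover the first `length` bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 3])
    return "".join(bases[:length])
```

The original length has to be stored alongside the packed bytes because the final byte may be padded. The encoding gives a fixed fourfold reduction relative to one byte per base, which is the kind of baseline that more elaborate methods aim to improve on.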
Principles of Quantitative MR Imaging with Illustrated Review of Applicable Modular Pulse Diagrams.
Mills, Andrew F; Sakai, Osamu; Anderson, Stephan W; Jara, Hernan
2017-01-01
Continued improvements in diagnostic accuracy using magnetic resonance (MR) imaging will require development of methods for tissue analysis that complement traditional qualitative MR imaging studies. Quantitative MR imaging is based on measurement and interpretation of tissue-specific parameters independent of experimental design, compared with qualitative MR imaging, which relies on interpretation of tissue contrast that results from experimental pulse sequence parameters. Quantitative MR imaging represents a natural next step in the evolution of MR imaging practice, since quantitative MR imaging data can be acquired using currently available qualitative imaging pulse sequences without modifications to imaging equipment. The article presents a review of the basic physical concepts used in MR imaging and how quantitative MR imaging is distinct from qualitative MR imaging. Subsequently, the article reviews the hierarchical organization of major applicable pulse sequences used in this article, with the sequences organized into conventional, hybrid, and multispectral sequences capable of calculating the main tissue parameters of T1, T2, and proton density. While this new concept offers the potential for improved diagnostic accuracy and workflow, awareness of this extension to qualitative imaging is generally low. This article reviews the basic physical concepts in MR imaging, describes commonly measured tissue parameters in quantitative MR imaging, and presents the major available pulse sequences used for quantitative MR imaging, with a focus on the hierarchical organization of these sequences. © RSNA, 2017.
Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.
2013-01-01
Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
2010-01-01
Background Little genomic or transcriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was performed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
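The abstract does not say which online tool or thresholds were used for SSR detection, so the following is only a generic sketch of the idea. It assumes motifs of 2 to 6 nucleotides repeated at least four times in tandem; both thresholds and the function name `find_ssrs` are illustrative choices, not the study's settings.

```python
import re

# Assumed definition: a 2-6 nt motif followed by at least three further tandem copies.
# The lazy quantifier prefers the shortest motif that explains the repeat.
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{3,}")


def find_ssrs(seq):
    """Return (start, motif, copy_number) for each non-overlapping tandem repeat."""
    results = []
    for match in SSR_PATTERN.finditer(seq.upper()):
        motif = match.group(1)
        copies = len(match.group(0)) // len(motif)
        results.append((match.start(), motif, copies))
    return results
```

A caveat of this simple pattern is that homopolymer runs are reported as dinucleotide repeats (for example `AA` repeated), so a dedicated tool would normally treat mononucleotide repeats separately.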
Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun
2012-01-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050
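The outlier idea described above (sequences whose prediction scores fall in the low tail of a group's fitted normal distribution) can be sketched with a plain z-score rule. The abstract does not name the three statistical methods that were compared, so the z-score test, the cutoff of -2.0 and the function name `low_score_outliers` are all assumptions chosen for illustration.

```python
from statistics import mean, stdev


def low_score_outliers(scores, z_cutoff=-2.0):
    """Flag sequences whose score lies far below the mean of their orthologous group.

    `scores` maps a sequence identifier to its prediction score within one
    group. A normal distribution is fitted by sample mean and standard
    deviation; sequences with a z-score below `z_cutoff` (an assumed
    threshold) are returned in sorted order.
    """
    values = list(scores.values())
    if len(values) < 2:
        return []          # a spread cannot be estimated from a single score
    mu = mean(values)
    sigma = stdev(values)
    if sigma == 0:
        return []          # all scores identical, nothing to flag
    return sorted(name for name, s in scores.items() if (s - mu) / sigma < z_cutoff)
```

Because the mean and standard deviation are themselves pulled by the outlier, a z-score rule is conservative in small groups, which is one reason combining several outlier tests can be attractive.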
Poirier, Simon; Coeuret, Gwendoline; Champomier-Vergès, Marie-Christine; Chaillou, Stéphane
2018-06-14
In this study, we present the draft genome sequences of nine strains from various psychrotrophic species identified in meat products and being recognized as important emerging food spoilers. Many of these species have only one or few strains being sequenced, and this work will contribute to the improvement of the overall genomic knowledge about them. Copyright © 2018 Poirier et al.
Current state-of-art of STR sequencing in forensic genetics.
Alonso, Antonio; Barrio, Pedro A; Müller, Petra; Köcher, Steffi; Berger, Burkhard; Martin, Pablo; Bodner, Martin; Willuweit, Sascha; Parson, Walther; Roewer, Lutz; Budowle, Bruce
2018-05-11
The current state of validation and implementation strategies of MPS technology for the analysis of STR markers for forensic genetics use is described, covering the topics of the current catalogue of commercial MPS-STR panels, leading MPS-platforms, and MPS-STR data analysis tools. In addition, the developmental and internal validation studies carried out to date to evaluate reliability, sensitivity, mixture analysis, concordance, and the ability to analyze challenged samples are summarized. The results of various MPS-STR population studies that showed a large number of new STR sequence variants that increase the power of discrimination in several forensically-relevant loci are also presented. Finally, various initiatives developed by several international projects and standardization (or guidelines) groups to facilitate application of MPS technology for STR marker analyses are discussed in regard to promoting a standard STR sequence nomenclature, performing population studies to detect sequence variants, and developing a universal system to translate sequence variants into a simple STR nomenclature (numbers and letters) compatible with national STR databases. This article is protected by copyright. All rights reserved.
NASA Astrophysics Data System (ADS)
Yang, Hong
Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology offered technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibilities to directly study genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny, conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.
Skill-dependent proximal-to-distal sequence in team-handball throwing.
Wagner, Herbert; Pfusterschmied, Jürgen; Von Duvillard, Serge P; Müller, Erich
2012-01-01
The importance of proximal-to-distal sequencing in human performance throwing has been reported previously. However, a comprehensive comparison of the proximal-to-distal sequence in team-handball throwing in athletes with different training experience and competition is lacking. Therefore, the aim of the study was to compare the ball velocity and proximal-to-distal sequence in the team-handball standing throw with run-up of players of different skill (less experienced, experienced, and elite). Twenty-four male team-handball players (n = 8 for each group) performed five standing throws with run-up with maximal ball velocity and accuracy. Kinematics and ball trajectories were recorded with a Vicon motion capture system and joint movements were calculated. A specific proximal-to-distal sequence, where elbow flexion occurred before shoulder internal rotation, was found in all three groups. These results are in line with previous studies in team-handball. Furthermore, the results of the present study suggest that in the team-handball standing throw with run-up, increased playing experience is associated with an increase in ball velocity as well as a delayed start to trunk flexion.
Probabilistic topic modeling for the analysis and classification of genomic sequences
2015-01-01
Background Studies on genomic sequences for classification and taxonomic identification have a leading role in the biomedical field and in the analysis of biodiversity. These studies focus on the so-called barcode genes, each representing a well-defined region of the whole genome. Recently, alignment-free techniques have been gaining importance because they are able to overcome the drawbacks of sequence alignment techniques. In this paper a new alignment-free method for DNA sequence clustering and classification is proposed. The method is based on k-mer representation and text mining techniques. Methods The presented method is based on Probabilistic Topic Modeling, a statistical technique originally proposed for text documents. Probabilistic topic models are able to find in a document corpus the topics (recurrent themes) characterizing classes of documents. This technique, applied to DNA sequences representing the documents, exploits the frequency of fixed-length k-mers and builds a generative model for a training group of sequences. This generative model, obtained through the Latent Dirichlet Allocation (LDA) algorithm, is then used to classify a large set of genomic sequences. Results and conclusions We performed classification of over 7000 16S DNA barcode sequences taken from the Ribosomal Database Project (RDP) repository, training probabilistic topic models. The proposed method is compared to the RDP tool and the Support Vector Machine (SVM) classification algorithm in an extensive set of trials using both complete sequences and short sequence snippets (from 400 bp to 25 bp). Our method reaches very similar results to the RDP classifier and SVM for complete sequences. The most interesting results are obtained when short sequence snippets are considered. In these conditions the proposed method outperforms RDP and SVM with ultra short sequences and it exhibits a smooth decrease of performance, at every taxonomic level, when the sequence length is decreased. PMID:25916734
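The k-mer representation described in this abstract can be illustrated with a minimal sketch. It treats each DNA sequence as a document whose words are overlapping fixed-length k-mers and builds the count matrix that a topic model such as LDA would take as input. The function names and the toy sequences are assumptions made for illustration; they are not taken from the paper, and the LDA fitting step itself is not shown.

```python
from collections import Counter


def kmer_counts(sequence, k):
    """Count overlapping k-mers in one DNA sequence (the 'words' of a document)."""
    seq = sequence.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def count_matrix(sequences, k):
    """Build a document-term matrix: one row per sequence, one column per k-mer."""
    counts = [kmer_counts(s, k) for s in sequences]
    # Vocabulary is every k-mer seen in any sequence, in sorted order.
    vocabulary = sorted(set().union(*counts))
    rows = [[c[kmer] for kmer in vocabulary] for c in counts]
    return vocabulary, rows


# Hypothetical toy input, not data from the study.
vocab, rows = count_matrix(["ACGTAC", "ACGACG"], k=3)
```

Each row of `rows` is the bag-of-k-mers vector for one sequence; shorter snippets simply yield sparser rows over the same vocabulary.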
Conflict Background Triggered Congruency Sequence Effects in Graphic Judgment Task
Zhao, Liang; Wang, Yonghui
2013-01-01
Congruency sequence effects refer to the reduction of congruency effects following an incongruent trial relative to following a congruent trial. The conflict monitoring account, one of the most influential explanations of this effect, assumes that the sequential modulations are evoked by response conflict. The present study aimed at exploring the congruency sequence effects in the absence of response conflict. We found that congruency sequence effects occurred in a graphic judgment task in which the conflict stimuli acted as irrelevant information. The findings reveal that processing task-irrelevant conflict stimulus features could also induce sequential modulations of interference. The results do not support the conflict monitoring interpretation and favor a feature integration account in which the congruency sequence effects are attributed to the repetitions of stimulus and response features. PMID:23372766
First report of bacterial community from a Bat Guano using Illumina next-generation sequencing.
De Mandal, Surajit; Zothansanga; Panda, Amritha Kumari; Bisht, Satpal Singh; Senthil Kumar, Nachimuthu
2015-06-01
The V4 hypervariable region of 16S rDNA was analyzed to identify the bacterial communities present in Bat Guano from the unexplored cave - Pnahkyndeng, Meghalaya, Northeast India. The metagenome comprised 585,434 raw Illumina sequences with a 59.59% G+C content. A total of 416,490 preprocessed reads were clustered into 1282 OTUs (operational taxonomical units) comprising 18 bacterial phyla. The taxonomic profile showed that the guano bacterial community is dominated by Chloroflexi, Actinobacteria and Crenarchaeota, which account for 70.73% of all sequence reads and 43.83% of all OTUs. Metagenome sequence data are available at NCBI under the accession no. SRP051094. This study is the first to characterize the Bat Guano bacterial community using a next-generation sequencing approach.
First report of bacterial community from a Bat Guano using Illumina next-generation sequencing
De Mandal, Surajit; Zothansanga; Panda, Amritha Kumari; Bisht, Satpal Singh; Senthil Kumar, Nachimuthu
2015-01-01
The V4 hypervariable region of 16S rDNA was analyzed to identify the bacterial communities present in Bat Guano from the unexplored cave - Pnahkyndeng, Meghalaya, Northeast India. The metagenome comprised 585,434 raw Illumina sequences with a 59.59% G+C content. A total of 416,490 preprocessed reads were clustered into 1282 OTUs (operational taxonomical units) comprising 18 bacterial phyla. The taxonomic profile showed that the guano bacterial community is dominated by Chloroflexi, Actinobacteria and Crenarchaeota, which account for 70.73% of all sequence reads and 43.83% of all OTUs. Metagenome sequence data are available at NCBI under the accession no. SRP051094. This study is the first to characterize the Bat Guano bacterial community using a next-generation sequencing approach. PMID:26484190
The Past, Present, and Future of Human Centromere Genomics
Aldrup-MacDonald, Megan E.; Sullivan, Beth A.
2014-01-01
The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function. PMID:24683489
Hagger, Martin S; Chatzisarantis, Nikos L D; Harris, Jemma
2006-02-01
The present study tested a motivational sequence in which global-level psychological need satisfaction from self-determination theory influenced intentions and behavior directly and indirectly through contextual-level motivation and situational-level decision-making constructs from the theory of planned behavior. Two samples of university students (N = 511) completed measures of global-level psychological need satisfaction, contextual-level autonomous motivation, and situational-level attitudes, subjective norms, perceived behavioral control, intentions, and behavior in two behavioral contexts: exercise and dieting. A structural equation model supported the proposed sequence in both samples. The indirect effect was present for exercise behavior, whereas both direct and indirect effects were found for dieting behavior. Findings independently supported the component theories and provided a comprehensive integrated explanation of volitional behavior.
Daikoku, Tatsuya; Takahashi, Yuji; Futagami, Hiroko; Tarumoto, Nagayoshi; Yasuda, Hideki
2017-02-01
In real-world auditory environments, humans are exposed to overlapping auditory information, such as human voices and musical instruments, even during routine physical activities such as walking and cycling. The present study investigated how concurrent physical exercise affects incidental and intentional learning of overlapping auditory streams, and whether physical fitness modulates learning performance. Participants were divided into lower- and higher-fitness groups of 11 participants each, based on their VO2 max values. They were presented with two simultaneous auditory sequences, each with a distinct statistical regularity (i.e. statistical learning), while pedaling a bike and while sitting on a bike at rest. In experiment 1, they were instructed to attend to one of the two sequences and ignore the other. In experiment 2, they were instructed to attend to both sequences. After exposure to the sequences, learning effects were evaluated with a familiarity test. In experiment 1, statistical learning of the ignored sequence during concurrent pedaling could be higher in participants with high physical fitness than in those with low physical fitness, whereas for the attended sequence there was no significant difference in statistical learning between the high- and low-fitness groups. Furthermore, there was no significant effect of physical fitness on learning while resting. In experiment 2, participants with both high and low physical fitness could perform intentional statistical learning of two simultaneous sequences in both the exercise and rest sessions. Improvement in physical fitness might facilitate incidental but not intentional statistical learning of simultaneous auditory sequences during concurrent physical exercise.
A complete, multi-level conformational clustering of antibody complementarity-determining regions
Nikoloudis, Dimitris; Pitts, Jim E.
2014-01-01
Classification of antibody complementarity-determining region (CDR) conformations is an important step that drives antibody modelling and engineering, prediction from sequence, directed mutagenesis and induced-fit studies, and allows inferences on sequence-to-structure relations. Most of the previous work performed conformational clustering on a reduced set of structures or after application of various structure pre-filtering criteria. In this study, it was judged that a clustering of every available CDR conformation would produce a complete and redundant repertoire, increase the number of sequence examples and allow better decisions on structure validity in the future. In order to cope with the potential increase in data noise, a first-level statistical clustering was performed using structure superposition Root-Mean-Square Deviation (RMSD) as a distance-criterion, coupled with second- and third-level clustering that employed Ramachandran regions for a deeper qualitative classification. The classification of a total of 12,712 CDR conformations is thus presented, along with rich annotation and cluster descriptions, and the results are compared to previous major studies. The present repertoire has procured an improved image of our current CDR Knowledge-Base, with a novel nesting of conformational sensitivity and specificity that can serve as a systematic framework for improved prediction from sequence as well as a number of future studies that would aid in knowledge-based antibody engineering such as humanisation. PMID:25071986
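The RMSD distance criterion mentioned in this abstract can be illustrated with a short sketch. It assumes the two conformations have already been superposed and contain the same atoms in the same order; the superposition step (for example, a Kabsch alignment) is not shown, and the function name and coordinates are hypothetical rather than taken from the study.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-superposed coordinate sets.

    Each argument is a sequence of (x, y, z) tuples with matching atom order.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2
        for point_a, point_b in zip(coords_a, coords_b)
        for xa, xb in zip(point_a, point_b)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical two-atom example: every atom is displaced by 1 unit along z.
loop_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
loop_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
distance = rmsd(loop_a, loop_b)
```

A pairwise matrix of such distances is the usual input to a distance-based clustering step; the later Ramachandran-region levels described in the abstract are a separate, qualitative classification.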
da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall'Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves; Gonçalves, Evonnildo Costa
2016-05-19
Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. Copyright © 2016 da Silva et al.
ERIC Educational Resources Information Center
Campbell, Una C.; Winsauer, Peter J.; Stevenson, Michael W.; Moerschbaecher, Joseph M.
2004-01-01
The present study investigated the effects of positive and negative GABA[subscript A] modulators under three different baselines of repeated acquisition in squirrel monkeys in which the monkeys acquired a three-response sequence on three keys under a second-order fixed-ratio (FR) schedule of food reinforcement. In two of these baselines, the…
Alexandraki, Voula; Kazou, Maria; Pot, Bruno; Tsakalidou, Effie; Papadimitriou, Konstantinos
2017-08-24
Lactobacillus delbrueckii subsp. bulgaricus is widely used in the production of yogurt and cheese. In this study, we present the complete genome sequence of L. delbrueckii subsp. bulgaricus ACA-DC 87 isolated from traditional Greek yogurt. Whole-genome analysis may reveal desirable technological traits of the strain for dairy fermentations. Copyright © 2017 Alexandraki et al.
Amin, Oumed Gerjis M; Jackwood, Daral J
2014-10-01
The present study was undertaken to characterize field isolates of infectious bursal disease virus (IBDV). The identification was done using reverse transcription-polymerase chain reaction (RT-PCR) and partial sequencing of the VP2 gene. Pooled bursal samples were collected from commercial broiler farms located in the Kurdistan Regional Government (KRG) of Iraq. The genetic material of the IBDV was detected in 10 out of 29 field samples. Sequences of the hypervariable VP2 region were determined for 10 of these viruses. Molecular analysis of the VP2 gene of five IBDVs showed amino acid sequences consistent with the very virulent (vv) IBDV. Two samples were identified as classic vaccine viruses, and three samples were classic vaccine viruses that appear to have mutated during replication in the field. Phylogenetic analysis showed that all five field IBDV strains of the present study were closely related to each other. On the basis of nucleotide sequencing and phylogenetic analysis, it is very likely that IBD-causing viruses in this part of Iraq are of the very virulent type. These IBDVs appear to be evolving relative to their type strains.
Clayton, Stephen; Prigmore, Elena; Langley, Elizabeth; Yang, Fengtang; Maguire, Sean; Fu, Beiyuan; Rajan, Diana; Sheppard, Olivia; Scott, Carol; Hauser, Heidi; Stephens, Philip J.; Stebbings, Lucy A.; Ng, Bee Ling; Fitzgerald, Tomas; Quail, Michael A.; Banerjee, Ruby; Rothkamm, Kai; Tybulewicz, Victor L. J.; Fisher, Elizabeth M. C.; Carter, Nigel P.
2013-01-01
Down syndrome (DS) is caused by trisomy of chromosome 21 (Hsa21) and presents a complex phenotype that arises from abnormal dosage of genes on this chromosome. However, the individual dosage-sensitive genes underlying each phenotype remain largely unknown. To help dissect genotype – phenotype correlations in this complex syndrome, the first fully transchromosomic mouse model, the Tc1 mouse, which carries a copy of human chromosome 21 was produced in 2005. The Tc1 strain is trisomic for the majority of genes that cause phenotypes associated with DS, and this freely available mouse strain has become used widely to study DS, the effects of gene dosage abnormalities, and the effect on the basic biology of cells when a mouse carries a freely segregating human chromosome. Tc1 mice were created by a process that included irradiation microcell-mediated chromosome transfer of Hsa21 into recipient mouse embryonic stem cells. Here, the combination of next generation sequencing, array-CGH and fluorescence in situ hybridization technologies has enabled us to identify unsuspected rearrangements of Hsa21 in this mouse model; revealing one deletion, six duplications and more than 25 de novo structural rearrangements. Our study is not only essential for informing functional studies of the Tc1 mouse but also (1) presents for the first time a detailed sequence analysis of the effects of gamma radiation on an entire human chromosome, which gives some mechanistic insight into the effects of radiation damage on DNA, and (2) overcomes specific technical difficulties of assaying a human chromosome on a mouse background where highly conserved sequences may confound the analysis. Sequence data generated in this study is deposited in the ENA database, Study Accession number: ERP000439. PMID:23596509
Lichenase and coding sequences
Li, Xin-Liang; Ljungdahl, Lars G.; Chen, Huizhong
2000-08-15
The present invention provides a fungal lichenase, i.e., an endo-1,3-1,4-β-D-glucanohydrolase, its coding sequence, recombinant DNA molecules comprising the lichenase coding sequences, recombinant host cells and methods for producing same. The present lichenase is from Orpinomyces PC-2.
antaRNA: ant colony-based RNA sequence design.
Kleinkauf, Robert; Mann, Martin; Backofen, Rolf
2015-10-01
RNA sequence design has been studied at least as long as the classical folding problem. Whereas for the latter the functional fold of an RNA molecule is to be found, inverse folding tries to identify RNA sequences that fold into a function-specific target structure. In combination with RNA-based biotechnology and synthetic biology, reliable RNA sequence design becomes a crucial step to generate novel biochemical components. In this article, the computational tool antaRNA is presented. It is capable of compiling RNA sequences for a given structure that comply in addition with an adjustable full range objective GC-content distribution, specific sequence constraints and additional fuzzy structure constraints. antaRNA applies ant colony optimization meta-heuristics and its superior performance is shown on biological datasets. http://www.bioinf.uni-freiburg.de/Software/antaRNA CONTACT: backofen@informatik.uni-freiburg.de Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
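The GC-content objective mentioned in this abstract can be made concrete with a small sketch that measures the G+C fraction of a candidate sequence and compares it with a target value. This is not antaRNA's implementation or interface; the function names, the tolerance parameter and the example sequences are assumptions chosen for illustration.

```python
def gc_content(sequence):
    """Return the fraction of G and C nucleotides in an RNA or DNA sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_gc_target(sequence, target, tolerance=0.05):
    """Check whether the GC fraction lies within `tolerance` of `target`.

    The tolerance is a hypothetical parameter, not one defined by antaRNA.
    """
    return abs(gc_content(sequence) - target) <= tolerance


# Hypothetical candidates checked against a 50% GC target.
balanced = meets_gc_target("GCAU", target=0.5)
gc_rich = meets_gc_target("GGCC", target=0.5)
```

In a design loop, a check of this kind would act as one term of the objective alongside the structure and sequence constraints that the abstract lists.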
Gene Deletion in Barley Mediated by LTR-retrotransposon BARE
Shang, Yi; Yang, Fei; Schulman, Alan H.; Zhu, Jinghuan; Jia, Yong; Wang, Junmei; Zhang, Xiao-Qi; Jia, Qiaojun; Hua, Wei; Yang, Jianming; Li, Chengdao
2017-01-01
A poly-row branched spike (prbs) barley mutant was obtained from soaking a two-rowed barley inflorescence in a solution of maize genomic DNA. Positional cloning and sequencing demonstrated that the prbs mutant resulted from a 28 kb deletion including the inflorescence architecture gene HvRA2. Sequence annotation revealed that the HvRA2 gene is flanked by two LTR (long terminal repeat) retrotransposons (BARE) sharing 89% sequence identity. A recombination between the integrase (IN) gene regions of the two BARE copies resulted in the formation of an intact BARE and loss of HvRA2. No maize DNA was detected in the recombination region although the flanking sequences of HvRA2 gene showed over 73% of sequence identity with repetitive sequences on 10 maize chromosomes. It is still unknown whether the interaction of retrotransposons between barley and maize has resulted in the recombination observed in the present study. PMID:28252053
NASA Technical Reports Server (NTRS)
Nakayama, S.; Kretsinger, R. H.
1993-01-01
In the first report in this series we presented dendrograms based on 152 individual proteins of the EF-hand family. In the second we used sequences from 228 proteins, containing 835 domains, and showed that eight of the 29 subfamilies are congruent and that the EF-hand domains of the remaining 21 subfamilies have diverse evolutionary histories. In this study we have computed dendrograms within and among the EF-hand subfamilies using the encoding DNA sequences. In most instances the dendrograms based on protein and on DNA sequences are very similar. Significant differences between protein and DNA trees for calmodulin remain unexplained. In our fourth report we evaluate the sequences and the distribution of introns within the EF-hand family and conclude that exon shuffling did not play a significant role in its evolution.
NASA Astrophysics Data System (ADS)
Giblin, M. F.; Sieckman, G. L.; Owen, N. K.; Hoffman, T. J.; Forte, L. R.; Volkert, W. A.
2005-12-01
The human Escherichia coli heat-stable enterotoxin (STh, amino acid sequence N1SSNYCCELCCNPACTGCY19) binds specifically to the guanylate cyclase C (GC-C) receptor, which is present in high density on the apical surface of normal intestinal epithelial cells as well as on the surface of human colon cancer cells. In the current study, two STh analogs were synthesized and evaluated in vitro and in vivo. Both analogs shared identical 6-19 core sequences, and had N-terminal pendant DOTA moieties. The analogs differed in the identity of a 6 amino acid peptide sequence intervening between DOTA and the 6-19 core. In one analog, the peptide was an RGD-containing sequence found in human fibronectin (GRGDSP), while in the other this peptide sequence was randomly scrambled (GRDSGP). The results indicated that the presence of the human fibronectin sequence in the hybrid peptide did not affect tumor localization in vivo.
Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids
NASA Astrophysics Data System (ADS)
Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant
2014-03-01
Tunneling microscopy and spectroscopy have been used extensively in the physical surface sciences to study quantum tunneling, to measure the electronic local density of states of nanomaterials, and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecules of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique "electronic fingerprints" for all nucleotides on DNA and RNA using Q-Seq, along with their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition, we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.
Rhythm sensitivity in macaque monkeys
Selezneva, Elena; Deike, Susann; Knyazeva, Stanislava; Scheich, Henning; Brechmann, André; Brosch, Michael
2013-01-01
This study provides evidence that monkeys are rhythm sensitive. We composed isochronous tone sequences consisting of repeating triplets of two short tones and one long tone which humans perceive as repeating triplets of two weak and one strong beat. This regular sequence was compared to an irregular sequence with the same number of randomly arranged short and long tones with no such beat structure. To search for indication of rhythm sensitivity we employed an oddball paradigm in which occasional duration deviants were introduced in the sequences. In a pilot study on humans we showed that subjects more easily detected these deviants when they occurred in a regular sequence. In the monkeys we searched for spontaneous behaviors the animals executed concomitant with the deviants. We found that monkeys more frequently exhibited changes of gaze and facial expressions to the deviants when they occurred in the regular sequence compared to the irregular sequence. In addition we recorded neuronal firing and local field potentials from 175 sites of the primary auditory cortex during sequence presentation. We found that both types of neuronal signals differentiated regular from irregular sequences. Both signals were stronger in regular sequences and occurred after the onset of the long tones, i.e., at the position of the strong beat. Local field potential responses were also significantly larger for the durational deviants in regular sequences, yet in a later time window. We speculate that these temporal pattern-selective mechanisms with a focus on strong beats and their deviants underlie the perception of rhythm in the chosen sequences. PMID:24046732
Method to amplify variable sequences without imposing primer sequences
Bradbury, Andrew M.; Zeytun, Ahmet
2006-11-14
The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.
Detection of a novel herpesvirus from bats in the Philippines.
Sano, Kaori; Okazaki, Sachiko; Taniguchi, Satoshi; Masangkay, Joseph S; Puentespina, Roberto; Eres, Eduardo; Cosico, Edison; Quibod, Niña; Kondo, Taisuke; Shimoda, Hiroshi; Hatta, Yuuki; Mitomo, Shumpei; Oba, Mami; Katayama, Yukie; Sassa, Yukiko; Furuya, Tetsuya; Nagai, Makoto; Une, Yumi; Maeda, Ken; Kyuwa, Shigeru; Yoshikawa, Yasuhiro; Akashi, Hiroomi; Omatsu, Tsutomu; Mizutani, Tetsuya
2015-08-01
Bats are natural hosts of many zoonotic viruses. Monitoring bat viruses is important to detect novel bat-borne infectious diseases. In this study, next generation sequencing techniques and conventional PCR were used to analyze intestine, lung, and blood clot samples collected from wild bats captured at three locations in the Davao region of the Philippines in 2012. Different viral genes belonging to the Retroviridae and Herpesviridae families were identified using next generation sequencing. The existence of herpesvirus in the samples was confirmed by PCR using herpesvirus consensus primers. The resulting PCR amplicons were 166 bp long. Further phylogenetic analysis identified that the virus from which this nucleotide sequence was obtained belonged to the Gammaherpesvirinae subfamily. PCR using primers specific to the nucleotide sequence obtained revealed that the infection rate among the captured bats was 30%. In this study, we present the partial genome of a novel gammaherpesvirus detected from wild bats. Our observations also indicate that this herpesvirus may be widely distributed in bat populations in the Davao region.
Moser, Aline; Wüthrich, Daniel; Bruggmann, Rémy; Eugster-Meier, Elisabeth; Meile, Leo; Irmler, Stefan
2017-01-01
The advent of massive parallel sequencing technologies has opened up possibilities for the study of the bacterial diversity of ecosystems without the need for enrichment or single strain isolation. By exploiting 78 genome data-sets from Lactobacillus helveticus strains, we found that the slpH locus that encodes a putative surface layer protein displays sufficient genetic heterogeneity to be a suitable target for strain typing. Based on high-throughput slpH gene sequencing and the detection of single-base DNA sequence variations, we established a culture-independent method to assess the biodiversity of the L. helveticus strains present in fermented dairy food. When we applied the method to study the L. helveticus strain composition in 15 natural whey cultures (NWCs) that were collected at different production facilities for Gruyère, a protected designation of origin (PDO) cheese, we detected a total of 10 sequence types (STs). In addition, we monitored the development of a three-strain mix in raclette cheese for 17 weeks. PMID:28775722
Genetic analysis of duck circovirus in Pekin ducks from South Korea.
Cha, S-Y; Kang, M; Cho, J-G; Jang, H-K
2013-11-01
The genetic organization of the 24 duck circovirus (DuCV) strains detected in commercial Pekin ducks from South Korea between 2011 and 2012 is described in this study. Multiple sequence alignment and phylogenetic analyses were performed on the 24 viral genome sequences as well as on 45 genome sequences available from the GenBank database. Phylogenetic analyses based on the genomic and open reading frame 2/cap sequences demonstrated that all DuCV strains belonged to genotype 1 and were designated in a subcluster under genotype 1. Analysis of the capsid protein amino acid sequences of the 24 Korean DuCV strains showed 10 substitutions compared with that of other genotype 1 strains. Our analysis showed that genotype 1 is predominant and circulating in South Korea. These present results serve as incentive to add more data to the DuCV database and provide insight to conduct further intensive study on the geographic relationships among these virus strains.
Nucleotide and amino acid variations of tannase gene from different Aspergillus strains.
Borrego-Terrazas, J A; Lara-Victoriano, F; Flores-Gallegos, A C; Veana, F; Aguilar, C N; Rodríguez-Herrera, R
2014-08-01
Tannase is an enzyme that catalyses the hydrolysis of ester bonds present in tannins. Most of the scientific reports about this biocatalysis focus on aspects related to tannase production and its recovery; on the other hand, reports assessing the molecular aspects of the tannase gene or protein are scarce. In the present study, a tannase gene fragment from several Aspergillus strains isolated from the Mexican semidesert was sequenced and compared with tannase amino acid sequences reported in NCBI database using bioinformatics tools. The genetic relationship among the different tannase sequences was also determined. A conserved region of 7 amino acids was found with the conserved motif GXSXG common to esterases, in which the active-site serine residue is located. In addition, in Aspergillus niger strains GH1 and PSH, we found an extra codon in the tannase sequences encoding glycine. The tannase gene belonging to semidesert fungal strains followed a neutral evolution path with the formation of 10 haplotypes, of which A. niger GH1 and PSH haplotypes are the oldest.
Pelnena, Dita; Burnyte, Birute; Jankevics, Eriks; Lace, Baiba; Dagyte, Evelina; Grigalioniene, Kristina; Utkus, Algirdas; Krumina, Zita; Rozentale, Jolanta; Adomaitiene, Irina; Stavusis, Janis; Pliss, Liana; Inashkina, Inna
2017-12-12
The most common mitochondrial disorder in children is Leigh syndrome, which is a progressive and genetically heterogeneous neurodegenerative disorder caused by mutations in nuclear genes or mitochondrial DNA (mtDNA). In the present study, a novel and robust method of complete mtDNA sequencing, which allows amplification of the whole mitochondrial genome, was tested. Complete mtDNA sequencing was performed in a cohort of patients with suspected mitochondrial mutations. Patients from Latvia and Lithuania (n = 92 and n = 57, respectively) referred by clinical geneticists were included. The de novo point mutations m.9185T>C and m.13513G>A, respectively, were detected in two patients with lactic acidosis and neurodegenerative lesions. In one patient with neurodegenerative lesions, the mutation m.9185T>C was identified. These mutations are associated with Leigh syndrome. The present data suggest that full-length mtDNA sequencing is recommended as a supplement to nuclear gene testing and enzymatic assays to enhance mitochondrial disease diagnostics.
Exome Sequencing in the Clinical Diagnosis of Sporadic or Familial Cerebellar Ataxia
Fogel, Brent L.; Lee, Hane; Deignan, Joshua L.; Strom, Samuel P.; Kantarci, Sibel; Wang, Xizhe; Quintero-Rivera, Fabiola; Vilain, Eric; Grody, Wayne W.; Perlman, Susan; Geschwind, Daniel H.; Nelson, Stanley F.
2015-01-01
IMPORTANCE Cerebellar ataxias are a diverse collection of neurologic disorders with causes ranging from common acquired etiologies to rare genetic conditions. Numerous genetic disorders have been associated with chronic progressive ataxia and this consequently presents a diagnostic challenge for the clinician regarding how to approach and prioritize genetic testing in patients with such clinically heterogeneous phenotypes. Additionally, while the value of genetic testing in early-onset and/or familial cases seems clear, many patients with ataxia present sporadically with adult onset of symptoms and the contribution of genetic variation to the phenotype of these patients has not yet been established. OBJECTIVE To investigate the contribution of genetic disease in a population of patients with predominantly adult- and sporadic-onset cerebellar ataxia. DESIGN, SETTING, AND PARTICIPANTS We examined a consecutive series of 76 patients presenting to a tertiary referral center for evaluation of chronic progressive cerebellar ataxia. MAIN OUTCOMES AND MEASURES Next-generation exome sequencing coupled with comprehensive bioinformatic analysis, phenotypic analysis, and clinical correlation. RESULTS We identified clinically relevant genetic information in more than 60% of patients studied (n = 46), including diagnostic pathogenic gene variants in 21% (n = 16), a notable yield given the diverse genetics and clinical heterogeneity of the cerebellar ataxias. CONCLUSIONS AND RELEVANCE This study demonstrated that clinical exome sequencing in patients with adult-onset and sporadic presentations of ataxia is a high-yield test, providing a definitive diagnosis in more than one-fifth of patients and suggesting a potential diagnosis in more than one-third to guide additional phenotyping and diagnostic evaluation. Therefore, clinical exome sequencing is an appropriate consideration in the routine genetic evaluation of all patients presenting with chronic progressive cerebellar ataxia. 
PMID:25133958
Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan
2016-07-01
This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.
APE1 incision activity at abasic sites in tandem repeat sequences.
Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M
2014-05-29
Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
Viral quasispecies inference from 454 pyrosequencing
2013-01-01
Background Many potentially life-threatening infectious viruses are highly mutable in nature. Characterizing the fittest variants within a quasispecies from infected patients is expected to allow unprecedented opportunities to investigate the relationship between quasispecies diversity and disease epidemiology. The advent of next-generation sequencing technologies has allowed the study of virus diversity with high-throughput sequencing, although these methods come with higher rates of errors which can artificially increase diversity. Results Here we introduce a novel computational approach that incorporates base quality scores from next-generation sequencers for reconstructing viral genome sequences that simultaneously infers the number of variants within a quasispecies that are present. Comparisons on simulated and clinical data on dengue virus suggest that the novel approach provides a more accurate inference of the underlying number of variants within the quasispecies, which is vital for clinical efforts in mapping the within-host viral diversity. Sequence alignments generated by our approach are also found to exhibit lower rates of error. Conclusions The ability to infer the viral quasispecies colony that is present within a human host provides the potential for a more accurate classification of the viral phenotype. Understanding the genomics of viruses will be relevant not just to studying how to control or even eradicate these viral infectious diseases, but also in learning about the innate protection in the human host against the viruses. PMID:24308284
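To illustrate how base quality scores can enter such an analysis, the sketch below converts Phred scores to error probabilities (the standard relation P = 10^(-Q/10)) and sums the probability of a correct call per base at one alignment column. This is only an assumed, simplified weighting scheme; the abstract does not describe the authors' actual model, and the function names are hypothetical.

```python
from collections import defaultdict


def phred_to_error(q):
    """Phred score Q -> probability that the base call is wrong: 10**(-Q/10)."""
    return 10 ** (-q / 10)


def weighted_base_support(column):
    """Sum, per base, the probability that each observed call is correct.

    column: list of (base, phred_score) pairs seen at one alignment position.
    """
    support = defaultdict(float)
    for base, q in column:
        support[base] += 1.0 - phred_to_error(q)
    return dict(support)


# Hypothetical column: two confident 'A' calls and one low-quality 'G' call.
column = [("A", 30), ("A", 30), ("G", 3)]
print(weighted_base_support(column))
```

Under this weighting the low-quality G contributes about half a read of support, so a sequencing error is less likely to be counted as a distinct variant.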
PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences
Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong
2015-01-01
Abstract We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate—slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory. PMID:25549288
Gonzaga-Jauregui, Claudia; Mir, Sabina; Penney, Samantha; Jhangiani, Shalini; Midgen, Craig; Finegold, Milton; Muzny, Donna M.; Wang, Min; Bacino, Carlos A.; Gibbs, Richard A.; Lupski, James R.; Kellermayer, Richard; Hanchard, Neil A.
2014-01-01
Severe congenital hypertriglyceridemia (HTG) is a rare disorder caused by mutations in genes affecting lipoprotein lipase (LPL) activity. Here we report a 5-week-old Hispanic girl with severe HTG (12,031 mg/dL, normal limit 150 mg/dL) who presented with the unusual combination of lower gastrointestinal bleeding and milky plasma. Initial colonoscopy was consistent with colitis, which resolved with reduction of triglycerides. After negative sequencing of the LPL gene, whole-exome sequencing revealed novel compound heterozygous mutations in GPIHBP1. Our study broadens the phenotype of GPIHBP1-associated HTG, reinforces the effectiveness of whole-exome sequencing in Mendelian diagnoses, and implicates triglycerides in gastrointestinal mucosal injury. PMID:24614124
Gonzaga-Jauregui, Claudia; Mir, Sabina; Penney, Samantha; Jhangiani, Shalini; Midgen, Craig; Finegold, Milton; Muzny, Donna M; Wang, Min; Bacino, Carlos A; Gibbs, Richard A; Lupski, James R; Kellermayer, Richard; Hanchard, Neil A
2014-07-01
Severe congenital hypertriglyceridemia (HTG) is a rare disorder caused by mutations in genes affecting lipoprotein lipase (LPL) activity. Here we report a 5-week-old Hispanic girl with severe HTG (12,031 mg/dL, normal limit 150 mg/dL) who presented with the unusual combination of lower gastrointestinal bleeding and milky plasma. Initial colonoscopy was consistent with colitis, which resolved with reduction of triglycerides. After negative sequencing of the LPL gene, whole-exome sequencing revealed novel compound heterozygous mutations in GPIHBP1. Our study broadens the phenotype of GPIHBP1-associated HTG, reinforces the effectiveness of whole-exome sequencing in Mendelian diagnoses, and implicates triglycerides in gastrointestinal mucosal injury.
Nasr Esfahani, Bahram; Moghim, Sharareh; Ghasemian Safaei, Hajieh; Moghoofei, Mohsen; Sedighi, Mansour; Hadifar, Shima
2016-01-01
Background Taxonomic and phylogenetic studies of Mycobacterium species have been based around the 16S rRNA gene for many years. However, due to the high strain similarity between species in the Mycobacterium genus (94.3% - 100%), defining a valid phylogenetic tree is difficult; consequently, its use in estimating the boundaries between species is limited. The sequence of the rpoB gene makes it an appropriate gene for phylogenetic analysis, especially in bacteria with limited variation. Objectives In the present study, a 360-bp sequence of rpoB was used for precise classification of Mycobacterium strains isolated in Isfahan, Iran. Materials and Methods From February to October 2013, 57 clinical and environmental isolates were collected, subcultured, and identified by phenotypic methods. After DNA extraction, a 360-bp fragment was PCR-amplified and sequenced. The phylogenetic tree was constructed based on consensus sequence data, using MEGA5 software. Results Slow and fast-growing groups of the Mycobacterium strains were clearly differentiated based on the constructed tree of 56 common Mycobacterium isolates. Each species with a unique title in the tree was identified; in total, 13 nodes with a bootstrap value of over 50% were supported. Among the slow-growing group was Mycobacterium kansasii, with M. tuberculosis in a cluster with a bootstrap value of 98% and M. gordonae in another cluster with a bootstrap value of 90%. In the fast-growing group, one cluster with a bootstrap value of 89% was defined, including all fast-growing members present in this study. Conclusions The results suggest that application of the rpoB gene sequence alone is sufficient for taxonomic categorization and definition of a new Mycobacterium species, due to its high resolution power and proper variation in its sequence (85% - 100%); the resulting tree has high validity. PMID:27284397
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y
2015-10-22
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.
2015-01-01
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340
Learning of grammar-like visual sequences by adults with and without language-learning disabilities.
Aguilar, Jessica M; Plante, Elena
2014-08-01
Two studies examined learning of grammar-like visual sequences to determine whether a general deficit in statistical learning characterizes this population. Furthermore, we tested the hypothesis that difficulty in sustaining attention during the learning task might account for differences in statistical learning. In Study 1, adults with normal language (NL) or language-learning disability (LLD) were familiarized with the visual artificial grammar and then tested using items that conformed or deviated from the grammar. In Study 2, a 2nd sample of adults with NL and LLD were presented auditory word pairs with weak semantic associations (e.g., groom + clean) along with the visual learning task. Participants were instructed to attend to visual sequences and to ignore the auditory stimuli. Incidental encoding of these words would indicate reduced attention to the primary task. In Studies 1 and 2, both groups demonstrated learning and generalization of the artificial grammar. In Study 2, neither the NL nor the LLD group appeared to encode the words presented during the learning phase. The results argue against a general deficit in statistical learning for individuals with LLD and demonstrate that both NL and LLD learners can ignore extraneous auditory stimuli during visual learning.
Conservation and variability of West Nile virus proteins.
Koo, Qi Ying; Khan, Asif M; Jung, Keun-Ok; Ramdas, Shweta; Miotto, Olivo; Tan, Tin Wee; Brusic, Vladimir; Salmon, Jerome; August, J Thomas
2009-01-01
West Nile virus (WNV) has emerged globally as an increasingly important pathogen for humans and domestic animals. Studies of the evolutionary diversity of the virus over its known history will help to elucidate conserved sites, and characterize their correspondence to other pathogens and their relevance to the immune system. We describe a large-scale analysis of the entire WNV proteome, aimed at identifying and characterizing evolutionarily conserved amino acid sequences. This study, which used 2,746 WNV protein sequences collected from the NCBI GenPept database, focused on analysis of peptides of length 9 amino acids or more, which are immunologically relevant as potential T-cell epitopes. Entropy-based analysis of the diversity of WNV sequences revealed the presence of numerous evolutionarily stable nonamer positions across the proteome (entropy value of ≤ 1). The representation (frequency) of nonamers variant to the predominant peptide at these stable positions was, generally, low (≤ 10% of the WNV sequences analyzed). Eighty-eight fragments of length 9-29 amino acids, representing approximately 34% of the WNV polyprotein length, were identified to be identical and evolutionarily stable in all analyzed WNV sequences. Of the 88 completely conserved sequences, 67 are also present in other flaviviruses, and several have been associated with the functional and structural properties of viral proteins. Immunoinformatic analysis revealed that the majority (78/88) of conserved sequences are potentially immunogenic, while 44 contained experimentally confirmed human T-cell epitopes. This study identified a comprehensive catalogue of completely conserved WNV sequences, many of which are shared by other flaviviruses, and the majority of which are potential epitopes.
The complete conservation of these immunologically relevant sequences through the entire recorded WNV history suggests they will be valuable as components of peptide-specific vaccines or other therapeutic applications, for sequence-specific diagnosis of a wide-range of Flavivirus infections, and for studies of homologous sequences among other flaviviruses.
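The entropy-based screen for stable nonamer positions described in this abstract can be sketched as follows. This assumes plain Shannon entropy in bits over the nonamers observed at each start position of a gap-free alignment; the abstract does not state the exact estimator or any corrections the authors applied, and `nonamer_entropy` and the toy alignment are hypothetical.

```python
import math
from collections import Counter


def nonamer_entropy(aligned, k=9):
    """Shannon entropy (bits) of the k-mers found at each start position.

    aligned: equal-length protein sequences from a gap-free alignment.
    Returns one entropy value per k-mer start position.
    """
    length = len(aligned[0])
    entropies = []
    for start in range(length - k + 1):
        counts = Counter(seq[start:start + k] for seq in aligned)
        total = sum(counts.values())
        # H = sum p * log2(1/p); a fully conserved position gives 0.0.
        h = sum((c / total) * math.log2(total / c) for c in counts.values())
        entropies.append(h)
    return entropies


# Hypothetical toy alignment: position 0 is conserved, position 1 is split 50/50.
toy = ["ACDEFGHIKL", "ACDEFGHIKL", "ACDEFGHIKM", "ACDEFGHIKM"]
print(nonamer_entropy(toy))  # [0.0, 1.0]
```

Positions whose value falls at or below a chosen cutoff (the abstract uses ≤ 1) would then be flagged as evolutionarily stable.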
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Saksida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2017-06-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
Genetic diversity of merozoite surface antigens in Babesia bovis detected from Sri Lankan cattle.
Sivakumar, Thillaiampalam; Okubo, Kazuhiro; Igarashi, Ikuo; de Silva, Weligodage Kumarawansa; Kothalawala, Hemal; Silva, Seekkuge Susil Priyantha; Vimalakumar, Singarayar Caniciyas; Meewewa, Asela Sanjeewa; Yokoyama, Naoaki
2013-10-01
Babesia bovis, the causative agent of severe bovine babesiosis, is endemic in Sri Lanka. The live attenuated vaccine (K-strain), which was introduced in the early 1990s, has been used to immunize cattle populations in endemic areas of the country. The present study was undertaken to determine the genetic diversity of merozoite surface antigens (MSAs) in B. bovis isolates from Sri Lankan cattle, and to compare the gene sequences obtained from such isolates against those of the K-strain. Forty-four bovine blood samples isolated from different geographical regions of Sri Lanka and judged to be B. bovis-positive by PCR screening were used to amplify MSAs (MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b), AMA-1, and 12D3 genes from parasite DNA. Although the AMA-1 and 12D3 gene sequences were highly conserved among the Sri Lankan isolates, the MSA gene sequences from the same isolates were highly diverse. Sri Lankan MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b sequences clustered within 5, 2, 4, 1, and 9 different clades in the gene phylograms, respectively, while the minimum similarity values among the deduced amino acid sequences of these genes were 36.8%, 68.7%, 80.3%, 100%, and 68.3%, respectively. In the phylograms, none of the Sri Lankan sequences fell within clades containing the respective K-strain sequences. Additionally, the similarity values for MSA-1 and MSA-2c were 40-61.8% and 90.9-93.2% between the Sri Lankan isolates and the K-strain, respectively, while the K-strain MSA-2a/b sequence shared 64.5-69.8%, 69.3%, and 70.5-80.3% similarities with the Sri Lankan MSA-2a1, MSA-2a2, and MSA-2b sequences, respectively. The present study has shown that genetic diversity among MSAs of Sri Lankan B. bovis isolates is very high, and that the sequences of field isolates diverged genetically from the K-strain. Copyright © 2013 Elsevier B.V. All rights reserved.
Statistical learning of music- and language-like sequences and tolerance for spectral shifts.
Daikoku, Tatsuya; Yatomi, Yutaka; Yumoto, Masato
2015-02-01
In our previous study (Daikoku, Yatomi, & Yumoto, 2014), we demonstrated that the N1m response could be a marker for the statistical learning process of pitch sequence, in which each tone was ordered by a Markov stochastic model. The aim of the present study was to investigate how the statistical learning of music- and language-like auditory sequences is reflected in the N1m responses based on the assumption that both language and music share domain generality. By using vowel sounds generated by a formant synthesizer, we devised music- and language-like auditory sequences in which higher-ordered transitional rules were embedded according to a Markov stochastic model by controlling fundamental (F0) and/or formant frequencies (F1-F2). In each sequence, F0 and/or F1-F2 were spectrally shifted in the last one-third of the tone sequence. Neuromagnetic responses to the tone sequences were recorded from 14 right-handed normal volunteers. In the music- and language-like sequences with pitch change, the N1m responses to the tones that appeared with higher transitional probability were significantly decreased compared with the responses to the tones that appeared with lower transitional probability within the first two-thirds of each sequence. Moreover, the amplitude difference was even retained within the last one-third of the sequence after the spectral shifts. However, in the language-like sequence without pitch change, no significant difference could be detected. The pitch change may facilitate the statistical learning in language and music. Statistically acquired knowledge may be appropriated to process altered auditory sequences with spectral shifts. The relative processing of spectral sequences may be a domain-general auditory mechanism that is innate to humans. Copyright © 2014 Elsevier Inc. All rights reserved.
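The transitional probabilities referred to in this abstract can be estimated from a tone sequence by counting adjacent pairs. The sketch below assumes a first-order Markov model for brevity, whereas the study embedded higher-order rules; the function name and the toy sequence are hypothetical.

```python
from collections import Counter, defaultdict


def transition_probabilities(sequence):
    """Estimate first-order P(next | current) from adjacent pairs in a sequence."""
    pair_counts = defaultdict(Counter)
    for current, following in zip(sequence, sequence[1:]):
        pair_counts[current][following] += 1
    return {
        current: {nxt: n / sum(counts.values()) for nxt, n in counts.items()}
        for current, counts in pair_counts.items()
    }


# Hypothetical tone sequence: after 'A', 'B' is the frequent continuation
# and 'C' is the rare one.
tones = list("ABABABAC")
print(transition_probabilities(tones))
# {'A': {'B': 0.75, 'C': 0.25}, 'B': {'A': 1.0}}
```

In the terms used above, 'B' following 'A' is a high-transitional-probability tone and 'C' following 'A' is a low-probability one.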
Locating Sequence on FPC Maps and Selecting a Minimal Tiling Path
Engler, Friedrich W.; Hatfield, James; Nelson, William; Soderlund, Carol A.
2003-01-01
This study discusses three software tools: the first two aid in integrating sequence with an FPC physical map, and the third automatically selects a minimal tiling path given genomic draft sequence and BAC end sequences. The first tool, FSD (FPC Simulated Digest), takes a sequenced clone and adds it back to the map based on a fingerprint generated by an in silico digest of the clone. This allows verification of sequenced clone positions and the integration of sequenced clones that were not originally part of the FPC map. The second tool, BSS (Blast Some Sequence), takes a query sequence and positions it on the map based on sequence associated with the clones in the map. BSS has multiple uses as follows: (1) When the query is a file of marker sequences, they can be added as electronic markers. (2) When the query is draft sequence, the results of BSS can be used to close gaps in a sequenced clone or the physical map. (3) When the query is a sequenced clone and the target is BAC end sequences, one may select the next clone for sequencing using both sequence comparison results and map location. (4) When the query is whole-genome draft sequence and the target is BAC end sequences, the results can be used to select many clones for a minimal tiling path at once. The third tool, pickMTP, automates the majority of this last usage of BSS. Results are presented using the rice FPC map, BAC end sequences, and whole-genome shotgun from Syngenta. PMID:12915486
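The idea of a fingerprint generated by an in silico digest can be sketched as below. This is not the FSD implementation: it assumes a linear sequence, a single enzyme (HindIII, recognition site AAGCTT, cutting after the first A), and a fingerprint defined simply as the sorted list of fragment lengths. The function name and the toy clone are hypothetical.

```python
def in_silico_digest(seq, site="AAGCTT", cut_offset=1):
    """Return sorted fragment lengths after cutting a linear sequence at every site.

    site: recognition sequence (HindIII assumed by default).
    cut_offset: number of bases into the site at which the strand is cut.
    """
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return sorted(right - left for left, right in zip(bounds, bounds[1:]))


# Hypothetical 23-base toy clone containing two HindIII sites.
clone = "GGGG" + "AAGCTT" + "CCCCC" + "AAGCTT" + "TT"
print(in_silico_digest(clone))  # [5, 7, 11]
```

A simulated fingerprint of this kind could then be compared with the experimentally measured band sizes of clones already in the map to find where the sequenced clone belongs.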
Xavier, Crislaine; Cabral-de-Mello, Diogo Cavalcanti; de Moura, Rita Cássia
2014-12-01
Cytogenetic studies of the Neotropical beetle genus Dichotomius (Scarabaeinae, Coleoptera) have shown dynamism for centromeric constitutive heterochromatin sequences. In the present work we studied the chromosomes and isolated repetitive sequences of Dichotomius schiffleri aiming to contribute to the understanding of coleopteran genome/chromosomal organization. Dichotomius schiffleri presented a conserved karyotype and heterochromatin distribution in comparison to other species of the genus with 2n = 18, biarmed chromosomes, and pericentromeric C-positive blocks. Similarly to heterochromatin distributional patterns, the highly and moderately repetitive DNA fraction (C0t-1 DNA) was detected in pericentromeric areas, contrasting with the euchromatic mapping of an isolated TE (named DsmarMITE). After structural analyses, the DsmarMITE was classified as a non-autonomous element of the type miniature inverted-repeat transposable element (MITE) with terminal inverted repeats similar to Mariner elements of insects from different orders. The euchromatic distribution for DsmarMITE indicates that it does not play a part in the dynamics of constitutive heterochromatin sequences.
Coupling molecules and morphology to discover new clades of ciliates.
NASA Astrophysics Data System (ADS)
Grattepanche, J. D.; Maurer-Alcalá, X. X.; Tucker, S. J.; McManus, G. B.; Katz, L. A.
2016-02-01
In a previous study using high-throughput sequencing (Grattepanche et al submitted, oral presentation?), we observed two clades of spirotrich ciliates mainly present in marine deep water along the New England coast. These clades, clusters X1 and X2, are characterized by several deletions in their SSU-rDNA and have been observed elsewhere, as both identical and similar sequences have been deposited in GenBank from other environmental studies, but they lack morphological description. In order to link molecules (SSU-rDNA sequence) to morphology, we sampled below the photic zone (between 60 and 400 m depth) off the New England coast (Northeast Atlantic) in a transect crossing the continental shelf. We designed an oligonucleotide probe specific for choreotrich and oligotrich ciliates and another specific to clusters X1 and X2 to describe these clades through a combination of Fluorescence In Situ Hybridization (FISH) and light microscopy. Our aim is to increase our knowledge of the morphology of these `unknown' clades of ciliates, which will allow for future ecological studies.
Nonparametric Combinatorial Sequence Models
NASA Astrophysics Data System (ADS)
Wauthier, Fabian L.; Jordan, Michael I.; Jojic, Nebojsa
This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences. If multiple such groups exist, complex interactions can emerge between them. Sequences of this kind arise frequently in biology but methodologies for analyzing them are still being developed. This paper presents a nonparametric prior on sequences which allows combinatorial structures to emerge and which induces a posterior distribution over factorized sequence representations. We carry out experiments on three sequence datasets which indicate that combinatorial structures are indeed present and that combinatorial sequence models can more succinctly describe them than simpler mixture models. We conclude with an application to MHC binding prediction which highlights the utility of the posterior distribution induced by the prior. By integrating out the posterior our method compares favorably to leading binding predictors.
Efficient use of unlabeled data for protein sequence classification: a comparative study
Kuksa, Pavel; Huang, Pai-Hsi; Pavlovic, Vladimir
2009-01-01
Background Recent studies in computational primary protein sequence analysis have leveraged the power of unlabeled data. For example, predictive models based on string kernels trained on sequences known to belong to particular folds or superfamilies, the so-called labeled data set, can attain significantly improved accuracy if this data is supplemented with protein sequences that lack any class tags–the unlabeled data. In this study, we present a principled and biologically motivated computational framework that more effectively exploits the unlabeled data by only using the sequence regions that are more likely to be biologically relevant for better prediction accuracy. As overly-represented sequences in large uncurated databases may bias the estimation of computational models that rely on unlabeled data, we also propose a method to remove this bias and improve performance of the resulting classifiers. Results Combined with state-of-the-art string kernels, our proposed computational framework achieves very accurate semi-supervised protein remote fold and homology detection on three large unlabeled databases. It outperforms current state-of-the-art methods and exhibits significant reduction in running time. Conclusion The unlabeled sequences used under the semi-supervised setting resemble the unpolished gemstones; when used as-is, they may carry unnecessary features and hence compromise the classification accuracy but once cut and polished, they improve the accuracy of the classifiers considerably. PMID:19426450
2013-01-01
Background With high quantity and quality data production and low cost, next generation sequencing has the potential to provide new opportunities for plant phylogeographic studies on single and multiple species. Here we present an approach for in silico chloroplast DNA assembly and single nucleotide polymorphism detection from short-read shotgun sequencing. The approach is simple and effective and can be implemented using standard bioinformatic tools. Results The chloroplast genome of Toona ciliata (Meliaceae), 159,514 base pairs long, was assembled from shotgun sequencing on the Illumina platform using de novo assembly of contigs. To evaluate its practicality, value and quality, we compared the short read assembly with an assembly completed using 454 data obtained after chloroplast DNA isolation. Sanger sequence verifications indicated that the Illumina dataset outperformed the longer read 454 data. Pooling of several individuals during preparation of the shotgun library enabled detection of informative chloroplast SNP markers. Following validation, we used the identified SNPs for a preliminary phylogeographic study of T. ciliata in Australia and to confirm low diversity across the distribution. Conclusions Our approach provides a simple method for construction of whole chloroplast genomes from shotgun sequencing of whole genomic DNA using short-read data and no available closely related reference genome (e.g. from the same species or genus). The high coverage of Illumina sequence data also renders this method appropriate for multiplexing and SNP discovery and therefore a useful approach for landscape level studies of evolutionary ecology. PMID:23497206
Pillai, Roshni; Yathiraj, Asha
2017-09-01
The study evaluated whether there exists a difference/relation in the way four different memory skills (memory score, sequencing score, memory span, & sequencing span) are processed through the auditory modality, visual modality and combined modalities. Four memory skills were evaluated on 30 typically developing children aged 7 years and 8 years across three modality conditions (auditory, visual, & auditory-visual). Analogous auditory and visual stimuli were presented to evaluate the three modality conditions across the two age groups. The children obtained significantly higher memory scores through the auditory modality compared to the visual modality. Likewise, their memory scores were significantly higher through the auditory-visual modality condition than through the visual modality. However, no effect of modality was observed on the sequencing scores as well as for the memory and the sequencing span. A good agreement was seen between the different modality conditions that were studied (auditory, visual, & auditory-visual) for the different memory skills measures (memory scores, sequencing scores, memory span, & sequencing span). A relatively lower agreement was noted only between the auditory and visual modalities as well as between the visual and auditory-visual modality conditions for the memory scores, measured using Bland-Altman plots. The study highlights the efficacy of using analogous stimuli to assess the auditory, visual as well as combined modalities. The study supports the view that the performance of children on different memory skills was better through the auditory modality compared to the visual modality. Copyright © 2017 Elsevier B.V. All rights reserved.
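The agreement analysis mentioned above rests on Bland-Altman statistics. As a minimal sketch, assuming the conventional definition (mean difference as bias, with limits of agreement at bias ± 1.96 standard deviations of the differences), the calculation looks as follows. The scores are invented for illustration and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(scores_a, scores_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Uses the conventional 95% limits of agreement: bias +/- 1.96 * SD
    of the pairwise differences (sample standard deviation).
    """
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical memory scores for four children in two modality conditions.
auditory = [10, 12, 14, 16]
visual = [9, 12, 13, 16]
bias, lower, upper = bland_altman(auditory, visual)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

Narrow limits around a bias near zero would indicate good agreement between two modality conditions; wider limits indicate relatively lower agreement.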
Identification of a µ opiate receptor signaling mechanism in human placenta.
Mantione, Kirk J; Angert, Robert M; Cadet, Patrick; Kream, Richard M; Stefano, George B
2010-11-01
Previous studies report that genes in the morphine biosynthetic pathway have been found in placental tissue. Prior researchers have shown that kappa opioid receptors are present in human placenta. We determined if a µ opiate receptor was present and which subtype was expressed in human placenta. We also sought to demonstrate a functional µ opiate receptor in human placenta. Polymerase chain reactions as well as DNA sequencing were performed to identify the µ opiate receptor subtypes present in human placenta. The functionality of the receptor was demonstrated by real time amperometric measurements of morphine induced NO release. The µ4 opiate receptor sequence was present as well as the µ1 opioid receptor transcript. The addition of morphine to placental tissue resulted in immediate nitric oxide release and this effect was blocked by naloxone. In the present study, an intact morphine signaling system has been demonstrated in human placenta. Morphine signaling in human placenta probably functions to regulate the immune, vascular, and endocrine functions of this organ via NO.
McMeel, O M; Hoey, E M; Ferguson, A
2001-01-01
The cDNA nucleotide sequences of the lactate dehydrogenase alleles LDH-C1*90 and *100 of brown trout (Salmo trutta) were found to differ at position 308 where an A is present in the *100 allele but a G is present in the *90 allele. This base substitution results in an amino acid change from aspartic acid at position 82 in the LDH-C1 100 allozyme to a glycine in the 90 allozyme. Since aspartic acid has a net negative charge whilst glycine is uncharged, this is consistent with the electrophoretic observation that the LDH-C1 100 allozyme has a more anodal mobility relative to the LDH-C1 90 allozyme. Based on alignment of the cDNA sequence with the mouse genomic sequence, a local primer set was designed, incorporating the variable position, and was found to give very good amplification with brown trout genomic DNA. Sequencing of this fragment confirmed the difference in both homozygous and heterozygous individuals. Digestion of the polymerase chain reaction products with BslI, a restriction enzyme specific for the site difference, gave one, two and three fragments for the two homozygotes and the heterozygote, respectively, following electrophoretic separation. This provides a DNA-based means of routine screening of the highly informative LDH-C1* polymorphism in brown trout population genetic studies. Primer sets presented could be used to sequence cDNA of other LDH* genes of brown trout and other species.
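The band patterns described for the BslI digest (one, two, and three fragments) follow directly from whether zero, both, or one of the two alleles carries the restriction site. A minimal sketch of that logic is given below. The amplicon length and cut position are hypothetical, and the abstract does not state which allele (*90 or *100) contains the site, so the genotype labels are deliberately generic.

```python
def expected_bands(amplicon_length: int, cut_position: int, genotype: str) -> list[int]:
    """Return expected fragment sizes (largest first) after digesting a PCR product.

    genotype is one of "site_absent", "site_present", or "heterozygote";
    amplicon_length and cut_position are illustrative values in base pairs.
    """
    uncut = [amplicon_length]
    cut = [cut_position, amplicon_length - cut_position]
    if genotype == "site_absent":
        bands = uncut          # neither allele is cut: one band
    elif genotype == "site_present":
        bands = cut            # both alleles are cut: two bands
    elif genotype == "heterozygote":
        bands = uncut + cut    # one allele of each kind: three bands
    else:
        raise ValueError(f"unknown genotype: {genotype}")
    return sorted(bands, reverse=True)


for g in ("site_absent", "site_present", "heterozygote"):
    print(g, expected_bands(400, 150, g))
```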
Jun, Goo; Wing, Mary Kate; Abecasis, Gonçalo R; Kang, Hyun Min
2015-06-01
The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies. © 2015 Jun et al.; Published by Cold Spring Harbor Laboratory Press.
Typing Clostridium difficile strains based on tandem repeat sequences
2009-01-01
Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Möbius, Petra; Hölzer, Martin; Felder, Marius; Nordsiek, Gabriele; Groth, Marco; Köhler, Heike; Reichwald, Kathrin; Platzer, Matthias; Marz, Manja
2015-01-01
Mycobacterium avium (M. a.) subsp. paratuberculosis (MAP)—the etiologic agent of Johne’s disease—affects cattle, sheep, and other ruminants worldwide. To decipher phenotypic differences among sheep and cattle strains (belonging to MAP-S [Type-I/III], respectively, MAP-C [Type-II]), comparative genome analysis needs data from diverse isolates originating from different geographic regions of the world. This study presents the so far best assembled genome of a MAP-S-strain: Sheep isolate JIII-386 from Germany. One newly sequenced cattle isolate (JII-1961, Germany), four published MAP strains of MAP-C and MAP-S from the United States and Australia, and M. a. subsp. hominissuis (MAH) strain 104 were used for assembly improvement and comparisons. All genomes were annotated by BacProt and results compared with NCBI (National Center for Biotechnology Information) annotation. Corresponding protein-coding sequences (CDSs) were detected, but also CDSs that were exclusively determined by either NCBI or BacProt. A new Shine–Dalgarno sequence motif (5′-AGCTGG-3′) was extracted. Novel CDSs including PE-PGRS family protein genes and about 80 noncoding RNAs exhibiting high sequence conservation are presented. Previously found genetic differences between MAP-types are partially revised. Four of ten assumed MAP-S-specific large sequence polymorphism regions (LSPSs) are still present in MAP-C strains; new LSPSs were identified. Independently of the regional origin of the strains, the number of individual CDSs and single nucleotide variants confirms the strong similarity of MAP-C strains and shows higher diversity among MAP-S strains. This study gives ambiguous results regarding the hypothesis that MAP-S is the evolutionary intermediate between MAH and MAP-C, but it clearly shows a higher similarity of MAP to MAH than to Mycobacterium intracellulare. PMID:26384038
Sequence analysis of sub-genotype D hepatitis B surface antigens isolated from Jeddah, Saudi Arabia.
El Hadad, Sahar; Alakilli, Saleha; Rabah, Samar; Sabir, Jamal
2018-05-01
Little is known about the prevalence of HBV genotypes/sub-genotypes in Jeddah province, although the hepatitis B virus (HBV) was identified as the most predominant type of hepatitis in Saudi Arabia. To characterize HBV genotypes/sub-genotypes, serum samples from 15 patients with chronic HBV were collected and subjected to HBsAg gene amplification and sequence analysis. Phylogenetic analysis of the HBsAg gene sequences revealed that 11 (48%) isolates belonged to HBV/D while 4 (18%) were associated with HBV/C. Notably, a HBV/D sub-genotype phylogenetic tree identified that eight current isolates (72%) belonged to HBV/D1, whereas three isolates (28%) appeared to be more closely related to HBV/D5, although they formed a novel cluster supported by a branch with 99% bootstrap value. Isolates belonging to D1 were grouped in one branch and seemed to be more closely related to various strains isolated from different countries. For further determination of whether the three current isolates belonged to HBV/D5 or represented a novel sub-genotype, HBV/DA, whole HBV genome sequences would be required. In the present study, we verified that HBV/D1 is the most prevalent HBV sub-genotype in Jeddah, and identified novel variant mutations suggesting that an additional sub-genotype designated HBV/DA should be proposed. Overall, the results of the present HBsAg sequence analyses provide us with insights regarding the nucleotide differences between the present HBsAg /D isolates identified in the populace of Jeddah, Saudi Arabia and those previously isolated worldwide. Additional studies with large numbers of subjects in other areas might lead to the discovery of the specific HBV strain genotypes or even additional new sub-genotypes that are circulating in Saudi Arabia.
Saravanan, Konda Mani; Dunker, A Keith; Krishnaswamy, Sankaran
2017-12-27
More than 60 prediction methods for intrinsically disordered proteins (IDPs) have been developed over the years, many of which are accessible on the World Wide Web. Nearly all of these predictors give balanced accuracies in the ~65%-~80% range. Since predictors are not perfect, further studies are required to uncover the role of amino acid residues in native IDP as compared to predicted IDP regions. In the present work, we make use of sequences of 100% predicted IDP regions, false positive disorder predictions, and experimentally determined IDP regions to distinguish the characteristics of native versus predicted IDP regions. A higher occurrence of asparagine is observed in sequences of native IDP regions but not in sequences of false positive predictions of IDP regions. The occurrences of certain combinations of amino acids at the pentapeptide level provide a distinguishing feature in the IDPs with respect to globular proteins. The distinguishing features presented in this paper provide insights into the sequence fingerprints of amino acid residues in experimentally determined as compared to predicted IDP regions. These observations and additional work along these lines should enable the development of improvements in the accuracy of disorder prediction algorithms.
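For readers unfamiliar with the metric, balanced accuracy is commonly defined as the mean of sensitivity and specificity. The sketch below assumes that definition (the abstract does not spell it out) and uses invented per-residue counts.

```python
def balanced_accuracy(tp: int, fn: int, tn: int, fp: int) -> float:
    """Mean of sensitivity (disordered residues found) and specificity
    (ordered residues correctly rejected)."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical counts: 100 disordered and 100 ordered residues.
score = balanced_accuracy(tp=80, fn=20, tn=60, fp=40)
print(f"balanced accuracy = {score:.2f}")
```

Because each class contributes equally, this measure is not inflated when ordered residues greatly outnumber disordered ones.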
Ceuppens, Siele; De Coninck, Dieter; Bottledoorn, Nadine; Van Nieuwerburgh, Filip; Uyttendaele, Mieke
2017-09-18
16S rRNA (gene) amplicon sequencing is increasingly applied to food samples for assessing microbial diversity and may, as an unintended advantage, also enable simultaneous detection of any human pathogens without a priori definition. In the present study, high-throughput next-generation sequencing (NGS) of the V1-V2-V3 regions of the 16S rRNA gene was applied to identify the bacteria present on fresh basil leaves. However, results were strongly impacted by variations in the bioinformatics analysis pipelines (MEGAN, SILVAngs, QIIME and MG-RAST), including the database choice (Greengenes, RDP and M5RNA) and the annotation algorithm (best hit, representative hit and lowest common ancestor). The use of pipelines with default parameters will lead to discrepancies. The estimate of microbial diversity of fresh basil using 16S rRNA (gene) amplicon sequencing is thus indicative but subject to biases. Salmonella enterica was detected at low frequencies, between 0.1% and 0.4% of bacterial sequences, corresponding with 37 to 166 reads. However, this result was dependent upon the pipeline used: Salmonella was detected by MEGAN, SILVAngs and MG-RAST, but not by QIIME. Confirmation of Salmonella sequences by real-time PCR was unsuccessful. It was shown that the taxonomic resolution obtained from the short (500bp) sequence reads of the 16S rRNA gene containing the hypervariable regions V1-V3 does not allow Salmonella to be distinguished from closely related enterobacterial species. In conclusion, 16S amplicon sequencing, which is becoming a standard method in microbial ecology studies of foods, requires expertise in both bioinformatics and microbiology for analysis of results. It is a powerful tool to estimate bacterial diversity but amenable to biases. Limitations concerning taxonomic resolution for some bacterial species or its inability to detect sub-dominant (pathogenic) species should be acknowledged in order to avoid overinterpretation of results. Copyright © 2017 Elsevier B.V. All rights reserved.
Fiete, Dorothy; Mi, Yiling; Beranek, Mary; Baenziger, Nancy L; Baenziger, Jacques U
2017-05-01
Expanded access to DNA sequencing now fosters ready detection of site-specific human genome alterations whose actual significance requires in-depth functional study to rule in or out disease-causing mutations. This is a particular concern for genomic sequence differences in glycosyltransferases, whose implications are often difficult to assess. A recent whole-exome sequencing study identifies (c.229 C > T) in the GalNAc-4-ST1 glycosyltransferase (CHST8) as a disease-causing missense R77W mutation yielding the genodermatosis peeling skin syndrome (PSS) when homozygous. Cabral et al. (Genomics. 2012;99:202-208) cite this sequence change as reducing keratinocyte GalNAc-4-ST1 activity, thus decreasing glycosaminoglycan sulfation, as the mechanism for this blistering disorder. Such an identification could point toward potential clinical and/or prenatal diagnosis of a harmful medical condition. However, GalNAc-4-ST1 has minimal activity toward glycosaminoglycans, instead modifying terminal β1,4-linked GalNAc on N- and O-linked oligosaccharides on specific glycoproteins. We find expression, processing and catalytic activity of GalNAc-4-ST1 completely equivalent between wild type and (R77W) sulfotransferases. Moreover, keratinocytes have little or no GalNAc-4-ST1 mRNA, indicating that they do not express GalNAc-4-ST1. In addition, loss-of-function of GalNAc-4-ST1 primarily presents as reproductive system aberrations rather than skin effects. These findings, an allele frequency of 0.004357, and a 10-fold difference in prevalence of CHST8 (c.229 C > T, R77W) across different ethnic groups, suggest that this sequence represents a "passenger" distributed polymorphism, a simple sequence variant form of the enzyme having normal activity, rather than a "driver" disease-causing mutation that accounts for PSS. This study presents an example for guiding biomedical research initiatives, as well as medical and personal/family perspectives, regarding newly-identified genomic sequence differences. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Weeraratne, Thilini C; Surendran, Sinnathambi N; Reimer, Lisa J; Wondji, Charles S; Perera, M Devika B; Walton, Catherine; Parakrama Karunaratne, S H P
2017-06-02
Genus Anopheles is a major mosquito group of interest in Sri Lanka as it includes vectors of malaria and its members exist as species complexes. Taxonomy of the group is mainly based on morphological features, which are not conclusive and can be easily erased while handling the specimens. A combined effort, using morphology and DNA barcoding (using the markers cytochrome c oxidase subunit I (COI) gene and internal transcribed spacer 2 (ITS2) region), was made during the present study to recognize anophelines collected from eight districts of Sri Lanka for the first time. Cytochrome c oxidase subunit I and ITS2 regions of morphologically identified anopheline mosquitoes from Sri Lanka were sequenced. These sequences together with GenBank sequences were used in phylogenetic tree construction and molecular characterization of mosquitoes. According to morphological identification, the field-collected adult mosquitoes belonged to 15 species, i.e., Anopheles aconitus, Anopheles annularis, Anopheles barbirostris, Anopheles culicifacies, Anopheles jamesii, Anopheles karwari, Anopheles maculatus, Anopheles nigerrimus, Anopheles pallidus, Anopheles peditaeniatus, Anopheles pseudojamesi, Anopheles subpictus, Anopheles tessellatus, Anopheles vagus, and Anopheles varuna. However, analysis of 123 COI sequences (445 bp) (16 clades supported by strong bootstrap value in the neighbour joining tree and inter-specific distances of >3%) showed that there are 16 distinct species. Identity of the morphologically identified species, except An. subpictus, was comparable with the DNA barcoding results. COI sequence analysis showed that morphologically identified An. subpictus is composed of two genetic entities: An. subpictus species A and species B (inter-specific K2P distance 0.128). All the four haplotypes of An. culicifacies discovered during the present study belonged to a single species. ITS2 sequences (542 bp) were obtained for all the species except for An. barbirostris, An. subpictus species B, An. tessellatus, and An. varuna. Each of these sequences was represented by a single species-specific haplotype. The present study reflects the importance and feasibility of COI and ITS2 genetic markers in identifying anophelines and their sibling species, and the significance of integrated systematic approach in mosquito taxonomy. Wide distribution of malaria vectors in the country perhaps indicates the potential for re-emergence of malaria in the country.
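The K2P (Kimura two-parameter) distance quoted for the two An. subpictus entities can be computed from a pairwise alignment using the standard formula d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions. The sketch below is a minimal pairwise version; the function name and the toy sequences are illustrative and are not data from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura two-parameter distance between two aligned DNA sequences.

    Positions where either sequence has a gap or ambiguous base are skipped.
    """
    transitions = transversions = compared = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy alignment: one transition (A -> G) across 10 sites.
d = k2p_distance("ACGTACGTAC", "GCGTACGTAC")
print(f"K2P distance = {d:.4f}")
```

A threshold such as the >3% inter-specific distance mentioned above would then be applied to values like `d` computed across all sequence pairs.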
There and Back Again: Learning from the History of a Freshman Seminar Sequence
ERIC Educational Resources Information Center
deLusé, Stephanie R.
2014-01-01
The evolution of The Human Event, a course sequence at Barrett, The Honors College at Arizona State University, provides a case study of using a program's history to understand its present and improve its future. While Barrett is situated at a public university with 76,000 students, and is now a large college in itself with 4,803 honors students,…
Oberto, Jacques; Gaudin, Marie; Cossu, Matteo; Gorlas, Aurore; Slesarev, Alexeï; Marguet, Evelyne; Forterre, Patrick
2014-03-27
Thermococcus nautili 30-1 (formerly Thermococcus nautilus), an anaerobic hyperthermophilic marine archaeon, was isolated in 1999 from a deep-sea hydrothermal vent during the Amistad campaign. Here, we present the complete sequence of T. nautili, which is able to produce membrane vesicles containing plasmid DNA. This property makes T. nautili a model organism to study horizontal gene transfer.
Rodriguez-Anaya, Libia Zulema; Gonzalez-Galaviz, Jose Reyes; Casillas-Hernandez, Ramón; Lares-Villa, Fernando; Estrada, Karel
2016-01-01
The first genome sequence of a Mexican white spot syndrome virus is presented here. White spot syndrome is a shrimp pandemic virus that has devastated production in Mexico for more than 10 years. The availability of this genome will greatly aid epidemiological studies worldwide, contributing to the molecular diagnostic and disease prevention in shrimp farming. PMID:26966222
ERIC Educational Resources Information Center
Tremblay, Sebastien; Saint-Aubin, Jean
2009-01-01
In the present study, the authors offer a window onto the mechanisms that drive the Hebb repetition effect through the analysis of eye movement and recall performance. In a spatial serial recall task in which sequences of dots are to be remembered in order, when one particular series is repeated every 4 trials, memory performance markedly improves…
ERIC Educational Resources Information Center
Savinainen, Antti; Mäkynen, Asko; Nieminen, Pasi; Viiri, Jouni
2017-01-01
This paper presents a research-based teaching-learning sequence (TLS) that focuses on the notion of interaction in teaching Newton's third law (N3 law) which is, as earlier studies have shown, a challenging topic for students to learn. The TLS made systematic use of a visual representation tool--an interaction diagram (ID)--highlighting…
Dunitz, Madison I.; James, Pamela M.; Jospin, Guillaume; Coil, David A.; Chandler, James Angus
2014-01-01
Here we present the draft genome of Tatumella sp. strain UCD-D_suzukii, the first member of this genus to be sequenced. The genome contains 3,602,931 bp in 72 scaffolds. This strain was isolated from Drosophila suzukii larvae as part of a larger project to study the microbiota of D. suzukii. PMID:24762940
ERIC Educational Resources Information Center
Besson, Ugo; Borghi, Lidia; De Ambrosis, Anna; Mascheretti, Paolo
2010-01-01
We have developed a teaching-learning sequence (TLS) on friction based on a preliminary study involving three dimensions: an analysis of didactic research on the topic, an overview of usual approaches, and a critical analysis of the subject, considered also in its historical development. We found that mostly the usual presentations do not take…
Case Study Projects for College Mathematics Courses Based on a Particular Function of Two Variables
ERIC Educational Resources Information Center
Shi, Y.
2007-01-01
Based on a sequence of number pairs, a recent paper (Mauch, E. and Shi, Y., 2005, Using a sequence of number pairs as an example in teaching mathematics, "Mathematics and Computer Education," 39(3), 198-205) presented some interesting examples that can be used in teaching high school and college mathematics classes such as algebra, geometry,…
Certain topological properties and duals of the domain of a triangle matrix in a sequence space
NASA Astrophysics Data System (ADS)
Altay, Bilâl; Basar, Feyzi
2007-12-01
The matrix domain of the particular limitation methods Cesàro, Riesz, difference, summation and Euler were studied by several authors. In the present paper, certain topological properties and [beta]- and [gamma]-duals of the domain of a triangle matrix in a sequence space have been examined as an application of the characterization of the related matrix classes.
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798).
Dimitrova, Daniela; Engelbrecht, Kathleen C; Putonti, Catherine; Koenig, David W; Wolfe, Alan J
2017-07-06
Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid. Copyright © 2017 Dimitrova et al.
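Summary statistics such as genome size and G+C content are straightforward to derive from an assembly. As a minimal sketch, assuming the contigs are already loaded as plain strings (the two short contigs below are invented, not the actual assembly):

```python
def assembly_stats(contigs: list[str]) -> tuple[int, float]:
    """Return (total length in bp, G+C content in percent) for a set of contigs.

    G+C content is computed over unambiguous bases (A, C, G, T) only.
    """
    total_length = sum(len(c) for c in contigs)
    joined = "".join(contigs).upper()
    gc = joined.count("G") + joined.count("C")
    acgt = gc + joined.count("A") + joined.count("T")
    return total_length, 100 * gc / acgt


# Hypothetical two-contig assembly.
size, gc_percent = assembly_stats(["GGCCAATT", "GCGCATAT"])
print(f"{size} bp, G+C = {gc_percent:.2f}%")
```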
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798)
Dimitrova, Daniela; Engelbrecht, Kathleen C.; Koenig, David W.; Wolfe, Alan J.
2017-01-01
Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid. PMID:28684574
Taste and Temperature in Swallowing Transit Time after Stroke
Cola, Paula C.; Gatto, Ana R.; da Silva, Roberta G.; Spadotto, André A.; Ribeiro, Priscila W.; Schelp, Arthur O.; Carvalho, Lidia R.; Henry, Maria A.C.A.
2012-01-01
Background Oropharyngeal dysphagia is common in individuals after stroke. Taste and temperature are used in dysphagia rehabilitation. The influence of stimuli, such as taste and temperature, on swallowing biomechanics has been investigated in both healthy individuals and in individuals with neurological disease. However, some questions still remain unanswered, such as how the sequence of offered stimuli influences the pharyngeal response. The goal of the present study was to determine the influence of the sequence of stimuli, sour taste and cold temperature, on pharyngeal transit time during deglutition in individuals after stroke. Methods The study included 60 individuals with unilateral ischemic stroke, 29 males and 31 females, aged 41–88 years (mean age: 66.2 years) examined 0–50 days after ictus (median: 6 days), with mild to moderate oropharyngeal dysphagia. Exclusion criteria were hemorrhagic stroke patients, patients with decreased level of consciousness, and clinically unstable patients, as confirmed by medical evaluation. The individuals were divided into two groups of 30 individuals each. Group 1 received a nonrandomized sequence of stimuli (i.e. natural, cold, sour, and sour-cold) and group 2 received a randomized sequence of stimuli. A videofluoroscopic swallowing study was performed to analyze the pharyngeal transit time. Four different stimuli (natural, cold, sour, and sour-cold) were offered. The images were digitalized and specific software was used to measure the pharyngeal transit time. Since the values did not present regular distribution and uniform variances, nonparametric tests were performed. Results Individuals in group 1 presented a significantly shorter pharyngeal transit time with the sour-cold stimulus than with the other stimuli. Individuals in group 2 did not show a significant difference in pharyngeal transit time between stimuli. 
Conclusions The results showed that the sequence of offered stimuli influences the pharyngeal transit time in a different way in individuals after stroke and suggest that, when the sour-cold stimulus is offered in a randomized sequence, it can influence the response to the other stimuli in stroke patients. Hence, the sour-cold stimulus could be used as a therapeutic aid in dysphagic stroke patients. PMID:23139681
Rapid and Easy Protocol for Quantification of Next-Generation Sequencing Libraries.
Hawkins, Steve F C; Guest, Paul C
2018-01-01
The emergence of next-generation sequencing (NGS) over the last 10 years has increased the efficiency of DNA sequencing in terms of speed, ease, and price. However, the exact quantification of an NGS library is crucial in order to obtain good data on sequencing platforms developed by the current market leader Illumina. Different approaches for DNA quantification are currently available, and the most commonly used are based on analysis of the physical properties of the DNA through spectrophotometric or fluorometric methods. Although these methods are technically simple, they do not allow exact quantification as can be achieved using a real-time quantitative PCR (qPCR) approach. A qPCR protocol for DNA quantification with applications in NGS library preparation studies is presented here. This can be applied in various fields of study such as medical disorders resulting from nutritional programming disturbances.
He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang
2015-01-01
Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains; it was cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to its small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium (LD) at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.
Desmottes, Lise; Meulemans, Thierry; Maillart, Christelle
2016-01-01
According to the Procedural Deficit Hypothesis (PDH), difficulties in the procedural memory system may contribute to the language difficulties encountered by children with Specific Language Impairment (SLI). Most studies investigating the PDH have used the sequence learning paradigm; however these studies have principally focused on initial sequence learning in a single practice session. The present study sought to extend these investigations by assessing the consolidation stage and longer-term retention of implicit sequence-specific knowledge in 42 children with or without SLI. Both groups of children completed a serial reaction time task and were tested 24h and one week after practice. Results showed that children with SLI succeeded as well as children with typical development (TD) in the early acquisition stage of the sequence learning task. However, as training blocks progressed, only TD children improved their sequence knowledge while children with SLI did not appear to evolve any more. Moreover, children with SLI showed a lack of the consolidation gains in sequence knowledge displayed by the TD children. Overall, these results were in line with the predictions of the PDH and suggest that later learning stages in procedural memory are impaired in SLI. Copyright © 2015 Elsevier Ltd. All rights reserved.
Borst, Gregoire; Niven, Elaine; Logie, Robert H
2012-04-01
Visual mental imagery and working memory are often assumed to play similar roles in high-order functions, but little is known of their functional relationship. In this study, we investigated whether similar cognitive processes are involved in the generation of visual mental images, in short-term retention of those mental images, and in short-term retention of visual information. Participants encoded and recalled visually or aurally presented sequences of letters under two interference conditions: spatial tapping or irrelevant visual input (IVI). In Experiment 1, spatial tapping selectively interfered with the retention of sequences of letters when participants generated visual mental images from aural presentation of the letter names and when the letters were presented visually. In Experiment 2, encoding of the sequences was disrupted by both interference tasks. However, in Experiment 3, IVI interfered with the generation of the mental images, but not with their retention, whereas spatial tapping was more disruptive during retention than during encoding. Results suggest that the temporary retention of visual mental images and of visual information may be supported by the same visual short-term memory store but that this store is not involved in image generation.
Exome sequencing establishes a gelsolin mutation as the cause of inherited bulbar-onset neuropathy.
Caress, James B; Johnson, Janel O; Abramzon, Yevgeniya A; Hawkins, Gregory A; Gibbs, J Raphael; Sullivan, Elizabeth A; Chahal, Chamanpreet S; Traynor, Bryan J
2017-11-01
Progressive bulbar motor neuropathy is primarily caused by bulbar-onset ALS. Hereditary amyloidosis type IV also presents with a bulbar neuropathy that mimics motor neuron disease. The disease is prevalent in Finland only and is not commonly included in the differential diagnosis of ALS. We studied 18 members of a family in which some had bulbar motor neuropathy, and we performed exome sequencing. Five affected family members were found to have a D187Y substitution in the GSN gene known to cause hereditary amyloidosis type IV. This American family presented with progressive bulbar neuropathy due to a gelsolin mutation not found in Finland. Hereditary amyloidosis type IV presents with bulbar motor neuropathy and not with peripheral neuropathy as occurs with common forms of amyloidosis. This report demonstrates the power of exome sequencing to determine the cause of rare hereditary diseases with incomplete or atypical phenotypes. Muscle Nerve 56: 1001-1005, 2017. © 2016 Wiley Periodicals, Inc.
Chen, Jin-Jin; Zhao, Qing-Sheng; Liu, Yi-Lan; Zha, Sheng-Hua; Zhao, Bing
2015-09-01
Maca (Lepidium meyenii) is an herbaceous plant that grows in high plateaus and has been used as both food and folk medicine for centuries because of its benefits to human health. In the present study, ITS (internal transcribed spacer) sequences of forty-three maca samples, collected from different regions or vendors, were amplified and analyzed. The ITS sequences of nineteen potential adulterants of maca were also collected and analyzed. The results indicated that the ITS sequence of maca was consistent in all samples and unique when compared with its adulterants. Therefore, this DNA-barcoding approach based on the ITS sequence can be used for the molecular identification of maca and its adulterants. Copyright © 2015 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.
One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly.
Koren, Sergey; Phillippy, Adam M
2015-02-01
Like a jigsaw puzzle with large pieces, a genome sequenced with long reads is easier to assemble. However, recent sequencing technologies have favored lowering per-base cost at the expense of read length. This has dramatically reduced sequencing cost, but resulted in fragmented assemblies, which negatively affect downstream analyses and hinder the creation of finished (gapless, high-quality) genomes. In contrast, emerging long-read sequencing technologies can now produce reads tens of kilobases in length, enabling the automated finishing of microbial genomes for under $1000. This promises to improve the quality of reference databases and facilitate new studies of chromosomal structure and variation. We present an overview of these new technologies and the methods used to assemble long reads into complete genomes. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.
Jensen, Peter D; Zhang, Yuanji; Wiggins, B Elizabeth; Petrick, Jay S; Zhu, Jin; Kerstetter, Randall A; Heck, Gregory R; Ivashuta, Sergey I
2013-01-01
Long double-stranded RNAs (long dsRNAs) are precursors for the effector molecules of sequence-specific RNA-based gene silencing in eukaryotes. Plant cells can contain numerous endogenous long dsRNAs. This study demonstrates that such endogenous long dsRNAs in plants have sequence complementarity to human genes. Many of these complementary long dsRNAs have perfect sequence complementarity of at least 21 nucleotides to human genes; enough complementarity to potentially trigger gene silencing in targeted human cells if delivered in functional form. However, the number and diversity of long dsRNA molecules in plant tissue from crops such as lettuce, tomato, corn, soy and rice with complementarity to human genes that have a long history of safe consumption supports a conclusion that long dsRNAs do not present a significant dietary risk.
Seqenv: linking sequences to environments through text mining.
Sinclair, Lucas; Ijaz, Umer Z; Jensen, Lars Juhl; Coolen, Marco J L; Gubry-Rangin, Cecile; Chroňáková, Alica; Oulas, Anastasis; Pavloudi, Christina; Schnetzer, Julia; Weimann, Aaron; Ijaz, Ali; Eiler, Alexander; Quince, Christopher; Pafilis, Evangelos
2016-01-01
Understanding the distribution of taxa and associated traits across different environments is one of the central questions in microbial ecology. High-throughput sequencing (HTS) studies are presently generating huge volumes of data to address this biogeographical topic. However, these studies are often focused on specific environment types or processes, leading to the production of individual, unconnected datasets. The large amounts of legacy sequence data with associated metadata that exist can be harnessed to better place the genetic information found in these surveys into a wider environmental context. Here we introduce a software program, seqenv, to carry out precisely such a task. It automatically performs similarity searches of short sequences against the "nt" nucleotide database provided by NCBI and, out of every hit, extracts, if it is available, the textual metadata field. After collecting all the isolation sources from all the search results, we run a text mining algorithm to identify and parse words that are associated with the Environmental Ontology (EnvO) controlled vocabulary. This, in turn, enables us to determine both in which environments individual sequences or taxa have previously been observed and, by weighted summation of those results, to summarize complete samples. We present two demonstrative applications of seqenv to a survey of ammonia oxidizing archaea as well as to a plankton paleome dataset from the Black Sea. These demonstrate the ability of the tool to reveal novel patterns in HTS and its utility in the fields of environmental source tracking, paleontology, and studies of microbial biogeography. To install seqenv, go to: https://github.com/xapple/seqenv.
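The weighted-summation step mentioned in this abstract can be illustrated with a minimal sketch. This is not seqenv's own code: the function name, the input layout and the normalisation rule are assumptions made for illustration only. Each sequence is assumed to carry counts of EnvO terms found among its database hits, and each sequence is assumed to have a relative abundance in the sample.

```python
from collections import defaultdict


def summarize_sample(seq_terms, seq_abundance):
    """Combine per-sequence environment-term counts into one sample profile.

    seq_terms:     {sequence_id: {envo_term: count}}   (hypothetical input)
    seq_abundance: {sequence_id: relative abundance}   (hypothetical input)
    """
    profile = defaultdict(float)
    for seq_id, terms in seq_terms.items():
        total = sum(terms.values())
        if total == 0:
            continue  # no environment terms were found for this sequence
        for term, count in terms.items():
            # Normalise within the sequence, then weight by its abundance.
            profile[term] += seq_abundance.get(seq_id, 0.0) * count / total
    return dict(profile)


# Toy data, invented for the example.
seq_terms = {
    "otu1": {"soil": 3, "sediment": 1},
    "otu2": {"sea water": 2},
}
seq_abundance = {"otu1": 0.5, "otu2": 0.5}
profile = summarize_sample(seq_terms, seq_abundance)
```

Normalising within each sequence first keeps a sequence with many database hits from dominating the sample profile; whether the published tool weights its results this way is not stated in the abstract.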
The accuracy of ultrashort echo time MRI sequences for medical additive manufacturing
Rijkhorst, Erik-Jan; Hofman, Mark; Forouzanfar, Tymour; Wolff, Jan
2016-01-01
Objectives: Additively manufactured bone models, implants and drill guides are becoming increasingly popular amongst maxillofacial surgeons and dentists. To date, such constructs are commonly manufactured using CT technology that induces ionizing radiation. Recently, ultrashort echo time (UTE) MRI sequences have been developed that allow radiation-free imaging of facial bones. The aim of the present study was to assess the feasibility of UTE MRI sequences for medical additive manufacturing (AM). Methods: Three morphologically different dry human mandibles were scanned using a CT and MRI scanner. Additionally, optical scans of all three mandibles were made to acquire a “gold standard”. All CT and MRI scans were converted into Standard Tessellation Language (STL) models and geometrically compared with the gold standard. To quantify the accuracy of the AM process, the CT, MRI and gold-standard STL models of one of the mandibles were additively manufactured, optically scanned and compared with the original gold-standard STL model. Results: Geometric differences between all three CT-derived STL models and the gold standard were <1.0 mm. All three MRI-derived STL models generally presented deviations <1.5 mm in the symphyseal and mandibular area. The AM process introduced minor deviations of <0.5 mm. Conclusions: This study demonstrates that MRI using UTE sequences is a feasible alternative to CT in generating STL models of the mandible and would therefore be suitable for surgical planning and AM. Further in vivo studies are necessary to assess the usability of UTE MRI sequences in clinical settings. PMID:26943179
Sequence Effect on the Formation of DNA Minidumbbells.
Liu, Yuan; Lam, Sik Lok
2017-11-16
The DNA minidumbbell (MDB) is a recently identified non-B structure. The reported MDBs contain two TTTA, CCTG, or CTTG type II loops. At present, the knowledge and understanding of the sequence criteria for MDB formation are still limited. In this study, we performed a systematic high-resolution nuclear magnetic resonance (NMR) and native gel study to investigate the effect of sequence variations in tandem repeats on the formation of MDBs. Our NMR results reveal the importance of hydrogen bonds, base-base stacking, and hydrophobic interactions from each of the participating residues. We conclude that in the MDBs formed by tandem repeats, C-G loop-closing base pairs are more stabilizing than T-A loop-closing base pairs, and thymine residues in both the second and third loop positions are more stabilizing than cytosine residues. The results from this study enrich our knowledge on the sequence criteria for the formation of MDBs, paving a path for better exploring their potential roles in biological systems and DNA nanotechnology.
Dolz, Roser; Pujols, Joan; Ordóñez, German; Porta, Ramon; Majó, Natàlia
2008-04-25
An in-depth molecular study of infectious bronchitis viruses (IBV), with particular interest in the evolutionary aspects of IBV in Spain, was carried out based on the S1 gene molecular characterization of twenty-six Spanish strains isolated over a fourteen-year period. Four genotypes were identified based on S1 gene sequence analyses and phylogenetic studies. A drastic virus population shift was demonstrated over time, and the novel Italy 02 serotype was shown to have displaced the previously predominant serotype 4/91 in the field. Detailed analyses of the synonymous to non-synonymous ratio of the S1 gene sequences of this new serotype Italy 02 suggested that positive selection pressures might have contributed to the successful establishment of the Italy 02 serotype in our country. In addition, differences in the fitness abilities of new emergent genotypes were indicated. Furthermore, intergenic sequence (IG)-like motifs within S1 gene sequences of IBV isolates were suggested to enhance the recombination abilities of certain serotypes.
Restricted transfer of learning between unimanual and bimanual finger sequences.
Yokoi, Atsushi; Bai, Wenjun; Diedrichsen, Jörn
2017-03-01
When training bimanual skills, such as playing piano, people sometimes practice each hand separately and at a later stage combine the movements of the two hands. This poses the critical question of whether motor skills can be acquired by separately practicing each subcomponent or should be trained as a whole. In the present study, we addressed this question by training human subjects for 4 days in a unimanual or bimanual version of the discrete sequence production task. Both groups were then tested on trained and untrained sequences on both unimanual and bimanual versions of the task. Surprisingly, we found no evidence of transfer from trained unimanual to bimanual or from trained bimanual to unimanual sequences. In half the participants, we also investigated whether cuing the sequences on the left and right hand with unique letters would change transfer. With these cues, untrained sequences that shared some components with the trained sequences were performed more quickly than sequences that did not. However, the amount of this transfer was limited to ∼10% of the overall sequence-specific learning gains. These results suggest that unimanual and bimanual sequences are learned in separate representations. Making participants aware of the interrelationship between sequences can induce some transferrable component, although the main component of the skill remains unique to unimanual or bimanual execution. NEW & NOTEWORTHY Studies in reaching movement demonstrated that approximately half of motor learning can transfer across unimanual and bimanual contexts, suggesting that neural representations for unimanual and bimanual movements are fairly overlapping at the level of elementary movement. In this study, we show that little or no transfer occurred across unimanual and bimanual sequential finger movements. This result suggests that bimanual sequences are represented at a level of the motor hierarchy that integrates movements of both hands. 
Copyright © 2017 the American Physiological Society.
Pagnuco, Inti Anabela; Revuelta, María Victoria; Bondino, Hernán Gabriel; Brun, Marcel; Ten Have, Arjen
2018-01-01
Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific. HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained. HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: https://github.com/BBCMdP/HMMERCTTER.
Pagnuco, Inti Anabela; Revuelta, María Victoria; Bondino, Hernán Gabriel; Brun, Marcel
2018-01-01
Background Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific. Results HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained. Conclusions HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: https://github.com/BBCMdP/HMMERCTTER. PMID:29579071
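The idea of a cut-off that gives 100% precision and recall on the training sequences can be sketched as follows. This is an illustrative reading of the abstract, not HMMERCTTER's implementation: the function names, the toy scores and the choice of placing the threshold at the lowest member score are all assumptions.

```python
def inclusion_cutoff(member_scores, nonmember_scores):
    """Return a score cut-off that separates members from non-members
    perfectly, or None if the two score ranges overlap."""
    lowest_member = min(member_scores)
    highest_other = max(nonmember_scores) if nonmember_scores else float("-inf")
    if lowest_member > highest_other:
        # Assumption: use the lowest member score as the inclusion cut-off.
        return lowest_member
    return None


def precision_recall(member_scores, nonmember_scores, cutoff):
    """Precision and recall when everything scoring >= cutoff is accepted."""
    tp = sum(s >= cutoff for s in member_scores)
    fp = sum(s >= cutoff for s in nonmember_scores)
    fn = len(member_scores) - tp
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy profile scores, invented for the example.
members = [310.2, 287.5, 295.0]
others = [120.4, 180.9]
cutoff = inclusion_cutoff(members, others)
pr = precision_recall(members, others, cutoff)
overlapping = inclusion_cutoff([200.0, 150.0], [160.0])
```

A cluster whose member scores overlap with non-member scores yields no valid cut-off in this sketch, which mirrors the requirement that a cluster is only accepted when its own profile detects it with 100% precision and recall.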
Keller, A; Danner, N; Grimmer, G; Ankenbrand, M; von der Ohe, K; von der Ohe, W; Rost, S; Härtel, S; Steffan-Dewenter, I
2015-03-01
The identification of pollen plays an important role in ecology, palaeo-climatology, honey quality control and other areas. Currently, expert knowledge and reference collections are essential to identify pollen origin through light microscopy. Pollen identification through molecular sequencing and DNA barcoding has been proposed as an alternative approach, but the assessment of mixed pollen samples originating from multiple plant species is still a tedious and error-prone task. Next-generation sequencing has been proposed to avoid this hindrance. In this study we assessed mixed pollen probes through next-generation sequencing of amplicons from the highly variable, species-specific internal transcribed spacer 2 region of nuclear ribosomal DNA. Further, we developed a bioinformatic workflow to analyse these high-throughput data with a newly created reference database. To evaluate the feasibility, we compared results from classical identification based on light microscopy from the same samples with our sequencing results. We assessed in total 16 mixed pollen samples, 14 originated from honeybee colonies and two from solitary bee nests. The sequencing technique resulted in higher taxon richness (deeper assignments and more identified taxa) compared to light microscopy. Abundance estimations from sequencing data were significantly correlated with counted abundances through light microscopy. Simulation analyses of taxon specificity and sensitivity indicate that 96% of taxa present in the database are correctly identifiable at the genus level and 70% at the species level. Next-generation sequencing thus presents a useful and efficient workflow to identify pollen at the genus and species level without requiring specialised palynological expert knowledge. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.
Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju
2015-01-01
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
Motion detection and compensation in infrared retinal image sequences.
Scharcanski, J; Schardosim, L R; Santos, D; Stuchi, A
2013-01-01
Infrared image data captured by non-mydriatic digital retinography systems are often used in the diagnosis and treatment of diabetic macular edema (DME). Infrared illumination is less aggressive to the patient's retina, and retinal studies can be carried out without pupil dilation. However, sequences of infrared eye fundus images of static scenes tend to present pixel intensity fluctuations in time, and noise and background illumination changes pose a challenge to most motion detection methods proposed in the literature. In this paper, we present a retinal motion detection method that is adaptive to background noise and illumination changes. Our experimental results indicate that this method is suitable for detecting retinal motion in infrared image sequences and for compensating the detected motion, which is relevant in retinal laser treatment systems for DME. Copyright © 2013 Elsevier Ltd. All rights reserved.
Memory for tonal pitches: a music-length effect hypothesis.
Akiva-Kabiri, Lilach; Vecchi, Tomaso; Granot, Roni; Basso, Demis; Schön, Daniele
2009-07-01
One of the most studied effects of verbal working memory (WM) is the influence of the length of the words that compose the list to be remembered. This work aims to investigate the nature of musical WM by replicating the word length effect in the musical domain. Length and rate of presentation were manipulated in a recognition task of tone sequences. Results showed significant effects for both factors (length and presentation rate) as well as their interaction, suggesting the existence of different strategies (e.g., chunking and rehearsal) for the immediate memory of musical information, depending upon the length of the sequences.
Detecting Nano-Scale Vibrations in Rotating Devices by Using Advanced Computational Methods
del Toro, Raúl M.; Haber, Rodolfo E.; Schmittdiel, Michael C.
2010-01-01
This paper presents a computational method for detecting vibrations related to eccentricity in ultra precision rotation devices used for nano-scale manufacturing. The vibration is indirectly measured via a frequency domain analysis of the signal from a piezoelectric sensor attached to the stationary component of the rotating device. The algorithm searches for particular harmonic sequences associated with the eccentricity of the device rotation axis. The detected sequence is quantified and serves as input to a regression model that estimates the eccentricity. A case study presents the application of the computational algorithm during precision manufacturing processes. PMID:22399918
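The frequency-domain search for harmonics of the rotation frequency can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the authors' algorithm: the rotation frequency is assumed to be known, the signal is synthetic, and all names are hypothetical. The regression step that maps harmonic amplitudes to eccentricity is not shown.

```python
import cmath
import math


def amplitude_at(signal, sample_rate, freq_hz):
    """Single-frequency DFT: amplitude of the component at freq_hz."""
    n = len(signal)
    acc = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * i / sample_rate)
        for i, x in enumerate(signal)
    )
    return 2.0 * abs(acc) / n


def harmonic_amplitudes(signal, sample_rate, rotation_hz, n_harmonics=3):
    """Amplitudes at the first n_harmonics multiples of the rotation frequency."""
    return [
        amplitude_at(signal, sample_rate, k * rotation_hz)
        for k in range(1, n_harmonics + 1)
    ]


# Synthetic one-second signal: a 50 Hz fundamental plus a weaker 100 Hz harmonic.
sample_rate = 1000.0
signal = [
    math.sin(2 * math.pi * 50 * i / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 100 * i / sample_rate)
    for i in range(1000)
]
amps = harmonic_amplitudes(signal, sample_rate, rotation_hz=50.0)
```

The resulting list of harmonic amplitudes is the kind of feature vector that could serve as input to a regression model; a real sensor signal would also need windowing and noise handling.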
Dengue Virus Type 4 Phylogenetics in Brazil 2011: Looking beyond the Veil
de Souza, Renato Pereira; Rocco, Iray M.; Maeda, Adriana Y.; Spenassatto, Carine; Bisordi, Ivani; Suzuki, Akemi; Silveira, Vivian R.; Silva, Sarai J. S.; Azevedo, Roberta M.; Tolentino, Fernanda M.; Assis, Jaqueline C.; Bassi, Margarida G.; Dambrós, Bibiana P.; Tumioto, Gabriela L.; Gregianini, Tatiana S.; Souza, Luiza Terezinha M.; Timenetsky, Maria do Carmo S. T.; Santos, Cecília L. S.
2011-01-01
Dengue Fever and Dengue Hemorrhagic Fever are diseases affecting approximately 100 million people/year and are a major concern in developing countries. In the present study, the phylogenetic relationships of six strains from the first autochthonous cases of DENV-4 infection, which occurred in Sao Paulo State, Parana State and Rio Grande do Sul State, Brazil, in 2011, were studied. Nucleotide sequences of the envelope gene were determined and compared with sequences representative of the genotypes I, II, III and Sylvatic for DENV-4 retrieved from GenBank. We employed a Bayesian phylogenetic approach to reconstruct the phylogenetic relationships of Brazilian DENV-4 and we estimated evolutionary rates and dates of divergence for DENV-4 found in Brazil in 2011. All samples sequenced in this study were located in Genotype II. The studied strains are monophyletic and our data suggest that they have been evolving separately for at least 4 to 6 years. Our data suggest that the virus might have been present in the region for some time, without being noticed by Health Surveillance Services due to a low level of circulation and a higher prevalence of DENV-1 and DENV-2. PMID:22216365
First detection of canine parvovirus type 2b from diarrheic dogs in Himachal Pradesh.
Sharma, Shalini; Dhar, Prasenjit; Thakur, Aneesh; Sharma, Vivek; Sharma, Mandeep
2016-09-01
The present study was conducted to detect the presence of canine parvovirus (CPV) among diarrheic dogs in Himachal Pradesh and to identify the most prevalent antigenic variant of CPV based on molecular typing and sequence analysis of the VP2 gene. A total of 102 fecal samples were collected from clinical cases of diarrhea or hemorrhagic gastroenteritis from CPV vaccinated or non-vaccinated dogs. Samples were tested using CPV-specific polymerase chain reaction (PCR) targeting the VP2 gene, multiplex PCR for detection of CPV-2a and CPV-2b antigenic variants, and a PCR for the detection of CPV-2c. A CPV-2b isolate was cultured on Madin-Darby canine kidney (MDCK) cell lines and sequenced using the VP2 structural protein gene. Multiple alignment and phylogenetic analysis were done using ClustalW and MEGA6, and the phylogeny was inferred using the Neighbor-Joining method. No sample was found positive for the original CPV strain usually present in the vaccine. However, about 50% (52 out of 102) of the samples were found to be positive with the CPV-2ab PCR assay that detects newer variants of CPV circulating in the field. In addition, the multiplex PCR assay that identifies both CPV-2ab and CPV-2b revealed that CPV-2b was the major antigenic variant present in the affected dogs. A PCR-positive isolate of CPV-2b was adapted to grow in MDCK cells and produced a characteristic cytopathic effect after the 5th passage. Multiple sequence alignment showed that the VP2 structural gene of the CPV-2b isolate (Accession number HG004610) used in the study was similar to other sequenced isolates in the NCBI sequence database, with 98-99% homology. This study reports the first detection of CPV-2b in dogs with hemorrhagic gastroenteritis in Himachal Pradesh and the absence of other antigenic types of CPV. Further, the CPV-specific PCR assay can be used for rapid confirmation of circulating virus strains under field conditions.
Aguilera-Mendoza, Longendri; Marrero-Ponce, Yovani; Tellez-Ibarra, Roberto; Llorente-Quesada, Monica T; Salgado, Jesús; Barigye, Stephen J; Liu, Jun
2015-08-01
The large variety of antimicrobial peptide (AMP) databases developed to date are characterized by a substantial overlap of data and similarity of sequences. Our goals are to analyze the levels of redundancy for all available AMP databases and use this information to build a new non-redundant sequence database. For this purpose, a new software tool is introduced. A comparative study of 25 AMP databases reveals the overlap and diversity among them and the internal diversity within each database. The overlap analysis shows that only one database (Peptaibol) contains exclusive data, not present in any other, whereas all sequences in the LAMP_Patent database are included in CAMP_Patent. However, the majority of databases have their own set of unique sequences, as well as some overlap with other databases. The complete set of non-duplicate sequences comprises 16 990 cases, which is almost half of the total number of reported peptides. On the other hand, the diversity analysis identifies the most and least diverse databases and proves that all databases exhibit some level of redundancy. Finally, we present a new parallel-free software, named Dover Analyzer, developed to compute the overlap and diversity between any number of databases and compile a set of non-redundant sequences. These results are useful for selecting or building a suitable representative set of AMPs, according to specific needs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
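The overlap analysis described in this abstract comes down to set operations on sequences. The sketch below is not the Dover Analyzer software: it assumes exact string identity after upper-casing as the duplicate criterion, and the function name, database names and peptide strings are invented for the example.

```python
def overlap_report(databases):
    """Compute the non-redundant union and the database-exclusive sequences.

    databases: {name: iterable of peptide sequences}   (hypothetical input)
    """
    # Normalise case and whitespace so trivially different copies collapse.
    sets = {
        name: {s.strip().upper() for s in seqs}
        for name, seqs in databases.items()
    }
    non_redundant = set().union(*sets.values())
    exclusive = {}
    for name, seqs in sets.items():
        others = set().union(*(s for n, s in sets.items() if n != name))
        exclusive[name] = seqs - others  # sequences found only in this database
    return non_redundant, exclusive


# Toy databases with invented peptide strings.
databases = {
    "db_a": ["GIGKFLHSAKKF", "KWKLFKKIEK"],
    "db_b": ["kwklfkkiek", "FLPLIGRVLSGIL"],
}
non_redundant, exclusive = overlap_report(databases)
```

A database whose exclusive set is empty is fully contained in the others, which is the situation the abstract reports for LAMP_Patent relative to CAMP_Patent. A diversity analysis would additionally need a sequence-similarity measure, which this sketch leaves out.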
NASA Astrophysics Data System (ADS)
Awasthi, Amit; Hothi, Navjot; Kaur, Prabhjot; Singh, Nirankar; Chakraborty, Monojit; Bansal, Sangeeta
2017-12-01
Ambient air consists of different gases in definite proportions, and this composition affects the earth's climate and its ecological system. Various anthropogenic activities change this composition, which ultimately has an impact on the health of living beings. The human respiratory system is one of the sensitive systems essential for survival, and it is readily and directly affected by changes in the atmospheric composition of the external environment. Many studies have been conducted to quantify the effects of atmospheric pollution on human health using different approaches. This article presents different scenarios of studies conducted to evaluate the effects on different human groups. Differences between studies conducted using spirometry and survey methods are presented in order to determine a better sequence for these two methodologies. Many studies have measured respiratory status by evaluating respiratory symptoms and hospital admissions. Only a limited number of studies used repeated spirometry on the same subjects over a long duration to nullify the error arising from decreased effort by those subjects during the manoeuvres of pulmonary function tests. The present study reveals the importance of methodological sequencing for obtaining more authentic and reliable results. It suggests that the impacts of deteriorating atmospheric composition on human health can be studied more significantly if spirometry is done after survey analysis. The article also proposes that the efficiency and authenticity of surveys involving health impacts will increase if patients' medical data are saved in hospitals in a proper format.
Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop
2012-01-01
Humulus lupulus, commonly known as hops, is a member of the family Moraceae. Many projects currently underway are leading to the accumulation of voluminous genomic and expressed sequence tag (EST) sequences in public databases. The genetically characterized domains in these databases are limited due to the non-availability of reliable molecular markers. A large amount of EST sequence data is available for hops. In the present study, simple sequence repeat (SSR) markers extracted from EST data are used as molecular markers for genetic characterization. A total of 25,495 EST sequences were examined and assembled to get full-length sequences. The maximum frequency was shown by mononucleotide SSR motifs, i.e. 60.44% in contigs and 62.16% in singletons, whereas the minimum frequencies were observed for hexanucleotide SSRs in contigs (0.09%) and pentanucleotide SSRs in singletons (0.12%). The most frequent trinucleotide motifs code for glutamic acid (GAA), while AT/TA was the most frequent repeat among dinucleotide SSRs. Flanking primer pairs were designed in silico for the SSR-containing sequences. Functional categorization of the SSR-containing sequences was done through gene ontology terms such as biological process, cellular component and molecular function. PMID:22368382
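As a rough illustration of how perfect SSR motifs of one to six nucleotides can be located in an assembled sequence, the following Python sketch uses regular expressions with back-references. The minimum repeat counts are illustrative assumptions, not the thresholds used in the study, and the function name `find_ssrs` is hypothetical.

```python
import re

# Minimum repeat counts per motif length: illustrative assumptions only.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT").
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits

example = find_ssrs("GG" + "AT" * 7 + "CC" + "GAA" * 5 + "T")
```

Counting the hits per motif length over all contigs and singletons would give frequency distributions of the kind reported in the abstract.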
Bai, Yu; Iwasaki, Yuki; Kanaya, Shigehiko; Zhao, Yue; Ikemura, Toshimichi
2014-01-01
With the remarkable increase in genomic sequence data from a wide range of species, novel tools are needed for comprehensive analyses of big sequence data. The Self-Organizing Map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition, on one map. By modifying the conventional SOM, we previously developed the Batch-Learning SOM (BLSOM), which allows classification of sequence fragments according to species, depending solely on oligonucleotide composition. In the present study, we introduce the oligonucleotide BLSOM used for characterization of vertebrate genome sequences. We first analyzed pentanucleotide compositions in 100 kb sequences derived from a wide range of vertebrate genomes and then the compositions in the human and mouse genomes, in order to investigate an efficient method for detecting differences between closely related genomes. BLSOM can recognize the species-specific key combination of oligonucleotide frequencies in each genome, which is called a "genome signature," and the regions specifically enriched in transcription-factor-binding sequences. Because its classification and visualization power is very high, BLSOM is an efficient and powerful tool for extracting a wide range of information from massive amounts of genomic sequences (i.e., big sequence data).
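The input to such a map is an oligonucleotide composition vector per sequence fragment. A minimal Python sketch of computing pentanucleotide frequencies is given below; it covers only the feature extraction step, not the BLSOM itself, and the function name `kmer_frequencies` is an assumption made for illustration.

```python
from collections import Counter
from itertools import product

def kmer_frequencies(seq, k=5):
    """Relative frequency of every k-mer over A/C/G/T in seq (overlapping windows)."""
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= set("ACGT")  # skip windows containing N or other symbols
    )
    total = sum(counts.values())
    # Always return all 4**k entries so that vectors from different fragments align.
    return {
        "".join(p): (counts["".join(p)] / total if total else 0.0)
        for p in product("ACGT", repeat=k)
    }

freqs = kmer_frequencies("ACGTACGTAC")
```

For k = 5 each fragment yields a 1024-dimensional vector; whether complementary pentanucleotides are merged before clustering is a design choice this sketch leaves open.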
Dissection of the Octoploid Strawberry Genome by Deep Sequencing of the Genomes of Fragaria Species
Hirakawa, Hideki; Shirasawa, Kenta; Kosugi, Shunichi; Tashiro, Kosuke; Nakayama, Shinobu; Yamada, Manabu; Kohara, Mistuyo; Watanabe, Akiko; Kishida, Yoshie; Fujishiro, Tsunakazu; Tsuruoka, Hisano; Minami, Chiharu; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yanagi, Tomohiro; Guoxin, Qin; Maeda, Fumi; Ishikawa, Masami; Kuhara, Satoru; Sato, Shusei; Tabata, Satoshi; Isobe, Sachiko N.
2014-01-01
Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis, by de novo whole-genome sequencing on Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb, with an N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence, to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species. PMID:24282021
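The N50 length quoted for the assembly is a standard summary statistic: the contig length at which the contigs of that length or longer cover at least half of the total assembly. A minimal Python sketch, with a hypothetical function name and toy contig lengths, is:

```python
def n50(lengths):
    """Length L such that contigs of length >= L cover at least half the assembly."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input

toy_n50 = n50([2, 3, 4, 5, 6])
```

Here the toy assembly has a total length of 20, and the two longest contigs (6 and 5) together reach 11, so the N50 is 5.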
The LAM-PCR Method to Sequence LV Integration Sites.
Wang, Wei; Bartholomae, Cynthia C; Gabriel, Richard; Deichmann, Annette; Schmidt, Manfred
2016-01-01
Integrating viral gene transfer vectors are commonly used gene delivery tools in clinical gene therapy trials providing stable integration and continuous gene expression of the transgene in the treated host cell. However, integration of the reverse-transcribed vector DNA into the host genome is a potentially mutagenic event that may directly contribute to unwanted side effects. A comprehensive and accurate analysis of the integration site (IS) repertoire is indispensable to study clonality in transduced cells obtained from patients undergoing gene therapy and to identify potential in vivo selection of affected cell clones. To date, next-generation sequencing (NGS) of vector-genome junctions allows sophisticated studies on the integration repertoire in vitro and in vivo. We have explored the use of the Illumina MiSeq Personal Sequencer platform to sequence vector ISs amplified by non-restrictive linear amplification-mediated PCR (nrLAM-PCR) and LAM-PCR. MiSeq-based high-quality IS sequence retrieval is accomplished by the introduction of a double-barcode strategy that substantially minimizes the frequency of IS sequence collisions compared to the conventionally used single-barcode protocol. Here, we present an updated protocol of (nr)LAM-PCR for the analysis of lentiviral IS using a double-barcode system and followed by deep sequencing using the MiSeq device.
Scannell, Devin R.; Zill, Oliver A.; Rokas, Antonis; Payen, Celia; Dunham, Maitreya J.; Eisen, Michael B.; Rine, Jasper; Johnston, Mark; Hittinger, Chris Todd
2011-01-01
High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802T and ZP 591), and S. mikatae (IFO 1815T), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org. PMID:22384314
Unusual RNA plant virus integration in the soybean genome leads to the production of small RNAs.
da Fonseca, Guilherme Cordenonsi; de Oliveira, Luiz Felipe Valter; de Morais, Guilherme Loss; Abdelnor, Ricardo Vilela; Nepomuceno, Alexandre Lima; Waterhouse, Peter M; Farinelli, Laurent; Margis, Rogerio
2016-05-01
Horizontal gene transfer (HGT) is known to be a major force in genome evolution. The acquisition of genes from viruses by eukaryotic genomes is a well-studied example of HGT, including rare cases of non-retroviral RNA virus integration. The present study describes the integration of cucumber mosaic virus RNA-1 into the soybean genome. After an initial metatranscriptomic analysis of small RNAs derived from soybean, de novo assembly resulted in a 3029-nt contig homologous to RNA-1. The integration of this sequence in the soybean genome was confirmed by DNA deep sequencing. The locus where the integration occurred harbors the full RNA-1 sequence, followed by the partial sequence of an endogenous mRNA and another RNA-1 sequence as an inverted repeat, allowing the formation of a hairpin structure. This region recombined into a retrotransposon located inside an exon of a soybean gene. The nucleotide similarity of the integrated sequence to other cucumber mosaic virus sequences indicates that the integration event occurred recently. We describe a rare event of non-retroviral RNA virus integration in soybean that leads to the production of a double-stranded RNA in a fashion similar to virus-resistant RNAi plants. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU.
Zhang, Jing; Wang, Hao; Feng, Wu-Chun
2017-01-01
BLAST, short for Basic Local Alignment Search Tool, is a ubiquitous tool used in the life sciences for pairwise sequence search. However, with the advent of next-generation sequencing (NGS), whether at the outset or downstream from NGS, the exponential growth of sequence databases is outstripping our ability to analyze the data. While recent studies have utilized the graphics processing unit (GPU) to speedup the BLAST algorithm for searching protein sequences (i.e., BLASTP), these studies use coarse-grained parallelism, where one sequence alignment is mapped to only one thread. Such an approach does not efficiently utilize the capabilities of a GPU, particularly due to the irregularity of BLASTP in both execution paths and memory-access patterns. To address the above shortcomings, we present a fine-grained approach to parallelize BLASTP, where each individual phase of sequence search is mapped to many threads on a GPU. This approach, which we refer to as cuBLASTP, reorders data-access patterns and reduces divergent branches of the most time-consuming phases (i.e., hit detection and ungapped extension). In addition, cuBLASTP optimizes the remaining phases (i.e., gapped extension and alignment with trace back) on a multicore CPU and overlaps their execution with the phases running on the GPU.
Nagano, Daisuke; Sivakumar, Thillaiampalam; De De Macedo, Alane Caine Costa; Inpankaew, Tawin; Alhassan, Andy; Igarashi, Ikuo; Yokoyama, Naoaki
2013-11-01
In the present study, we screened blood DNA samples obtained from cattle bred in Brazil (n=164) and Ghana (n=80) for Babesia bovis using a diagnostic PCR assay and found prevalences of 14.6% and 46.3%, respectively. Subsequently, the genetic diversity of B. bovis in Thailand, Brazil and Ghana was analyzed, based on the DNA sequence of merozoite surface antigen-1 (MSA-1). In Thailand, MSA-1 sequences were relatively conserved and found in a single clade of the phylogram, while Brazilian MSA-1 sequences showed high genetic diversity and were dispersed across three different clades. In contrast, the sequences from Ghanaian samples were detected in two different clades, one of which contained only a single Ghanaian sequence. The identities among the MSA-1 sequences from Thailand, Brazil and Ghana were 99.0-100%, 57.5-99.4% and 60.3-100%, respectively, while the similarities among the deduced MSA-1 amino acid sequences within the respective countries were 98.4-100%, 59.4-99.7% and 58.7-100%, respectively. These observations suggested that the genetic diversity of B. bovis based on MSA-1 sequences was higher in Brazil and Ghana than in Thailand. The current data highlight the importance of conducting extensive studies on the genetic diversity of B. bovis before designing immune control strategies in each surveyed country.
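Identity ranges such as those reported for the MSA-1 sequences are derived from pairwise comparisons of aligned sequences. The sketch below shows one common way to compute them in Python; ignoring columns with a gap in either sequence is an assumption, since the abstract does not state the convention used, and the function names are hypothetical.

```python
def percent_identity(a, b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        matches += (x == y)
    return 100.0 * matches / compared if compared else 0.0

def identity_range(aligned):
    """(min, max) pairwise identity for a list of two or more aligned sequences."""
    values = [
        percent_identity(aligned[i], aligned[j])
        for i in range(len(aligned))
        for j in range(i + 1, len(aligned))
    ]
    return min(values), max(values)

toy_range = identity_range(["ACGT", "ACGA", "ACGT"])
```

A wide range, as reported for the Brazilian and Ghanaian sequences, then simply reflects at least one highly divergent pair within the country's sample.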
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites.
Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying
2012-10-01
To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites
Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying
2012-01-01
To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi’an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi’an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%–99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites. PMID:23024043
Ahmed, Nisar; Riaz, Adeel; Zubair, Zahra; Saqib, Muhammad; Ijaz, Sehrish; Nawaz-Ul-Rehman, Muhammad Shah; Al-Qahtani, Ahmed; Mubin, Muhammad
2018-03-15
Canine parvovirus (CPV) infection in dogs is highly contagious and has a high mortality rate. The present study was undertaken for a detailed genetic analysis of a partial VP2 gene sequence (630 bp) isolated from rectal swab samples of infected domestic and stray dogs from all areas of district Faisalabad. Monitoring of viruses is important, as the continuous prevalence of viral infection might be associated with the emergence of new virulent strains. In the present study, 40 rectal swab samples were collected from diarrheic dogs from different areas of district Faisalabad, Pakistan, in 2014-15 and screened for the presence of CPV by immunochromatography. Most of these dogs were stray dogs showing symptoms of diarrhea. Viral DNA was isolated and the partial VP2 gene was amplified by PCR using the gene-specific primer pair Hfor/Hrev. Amplified fragments were cloned in pTZ57R/T (Fermentas) and completely sequenced. Sequences were analyzed and assembled with the Lasergene DNA analysis package (v8; DNAStar Inc., Madison, WI, USA). Immunochromatography showed that 33/40 (82%) of the dogs were positive for CPV. We were able to amplify a 630 bp fragment from 25 samples. In these 25 samples, CPV-2a sequences were detected, showing the amino acid substitution Ser297Ala and the presence of the amino acid 426-Asn in the partial VP2 protein. Interestingly, BLAST analysis showed the presence of feline panleukopenia virus (FPV) sequences in 3 samples that were already positive for new CPV-2a, with 99% sequence homology to other FPV sequences present in GenBank. Phylogenetic analysis showed clustering of the partial CPV VP2 gene with viruses from China, India, Japan and Uruguay, identifying a new variant, whereas the 3 FPV sequences showed an immediate ancestral relationship with viruses from Portugal, South Africa and the USA. An interesting observation was that the CPV sequences cluster away from the commercial vaccine strains.
This work provides a better understanding, at the molecular level, of the CPV prevailing in Pakistan. The detection of FPV could be a case of real co-infection or a case of dual presence due to ingestion of contaminated food.
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, the minor rDNA family codes for 5S rRNA; it is arranged in tandem arrays and comprises a highly conserved 120 bp coding sequence with a variable non-transcribed spacer (NTS). Initially, 5S rDNA repeats were considered to have evolved by the process of concerted evolution. However, some recent reports, including ones on teleost fishes, suggest that the evolution of 5S rDNA repeats does not fit the concerted evolution model and that the evolution of the 5S rDNA family may instead be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, the nucleotide sequence and molecular organization of 5S rDNA in five species of the genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS), and networks created by a neighbor-net algorithm for each type of sequence (I, II, III and IV) did not show clear clustering in a species-specific manner. The stable secondary structure was predicted, and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also showed the presence of two putative pseudogenes in Channa marulius. The present study supports the view that 5S rDNA repeats in the genus Channa evolved under the process of birth-and-death.
Phylogenetic diversity in the genus Bacillus as seen by 16S rRNA sequencing studies
NASA Technical Reports Server (NTRS)
Rossler, D.; Ludwig, W.; Schleifer, K. H.; Lin, C.; McGill, T. J.; Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.
1991-01-01
Comparative sequence analysis of 16S ribosomal (r)RNAs or DNAs of Bacillus alvei, B. laterosporus, B. macerans, B. macquariensis, B. polymyxa and B. stearothermophilus revealed the phylogenetic diversity of the genus Bacillus. Based on the presently available data set of 16S rRNA sequences from bacilli and relatives, at least four major "Bacillus clusters" can be defined: a "Bacillus subtilis cluster" including B. stearothermophilus, a "B. brevis cluster" including B. laterosporus, a "B. alvei cluster" including B. macerans, B. macquariensis and B. polymyxa, and a "B. cycloheptanicus branch".
Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies
Lasitschka, Bärbel; Jones, David; Northcott, Paul; Hutter, Barbara; Jäger, Natalie; Kool, Marcel; Taylor, Michael; Lichter, Peter; Pfister, Stefan; Wolf, Stephan; Brors, Benedikt; Eils, Roland
2013-01-01
The emergence of high-throughput, next-generation sequencing technologies has dramatically altered the way we assess genomes in population genetics and in cancer genomics. Currently, there are four commonly used whole-genome sequencing platforms on the market: Illumina’s HiSeq2000, Life Technologies’ SOLiD 4 and its completely redesigned 5500xl SOLiD, and Complete Genomics’ technology. A number of earlier studies have compared a subset of those sequencing platforms or compared those platforms with Sanger sequencing, which is prohibitively expensive for whole genome studies. Here we present a detailed comparison of the performance of all currently available whole genome sequencing platforms, especially regarding their ability to call SNVs and to evenly cover the genome and specific genomic regions. Unlike earlier studies, we base our comparison on four different samples, allowing us to assess the between-sample variation of the platforms. We find a pronounced GC bias in GC-rich regions for Life Technologies’ platforms, with Complete Genomics performing best here, while we see the least bias in GC-poor regions for HiSeq2000 and 5500xl. HiSeq2000 gives the most uniform coverage and displays the least sample-to-sample variation. In contrast, Complete Genomics exhibits by far the smallest fraction of bases not covered, while the SOLiD platforms reveal remarkable shortcomings, especially in covering CpG islands. When comparing the performance of the four platforms for calling SNPs, HiSeq2000 and Complete Genomics achieve the highest sensitivity, while the SOLiD platforms show the lowest false positive rate. Finally, we find that integrating sequencing data from different platforms offers the potential to combine the strengths of different technologies. In summary, our results detail the strengths and weaknesses of all four whole-genome sequencing platforms. They indicate application areas that call for a specific sequencing platform and rule out others. This helps to identify the proper sequencing platform for whole genome studies with different application scopes. PMID:23776689
Smith, Rick W A; Monroe, Cara; Bolnick, Deborah A
2015-01-01
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
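The quantification step described above rests on a simple relationship: after bisulfite conversion, an unmethylated cytosine is read as T, while a methylated cytosine is still read as C. A minimal Python sketch of turning C and T signal at one CpG position into a percentage, and of summarizing replicate variability, follows. The function names and the toy numbers are assumptions made for illustration, not data from the study.

```python
from statistics import pstdev

def percent_methylation(c_signal, t_signal):
    """Percent methylation at one CpG from bisulfite-converted sequence signal.

    Unmethylated C is converted and read as T; methylated C remains C.
    """
    total = c_signal + t_signal
    if total <= 0:
        raise ValueError("no signal at this position")
    return 100.0 * c_signal / total

def replicate_spread(measurements):
    """Population standard deviation of repeated percent-methylation estimates."""
    return pstdev(measurements)

# Hypothetical replicate (C, T) signals for one sample.
replicates = [percent_methylation(c, t) for c, t in [(30, 10), (28, 12), (32, 8)]]
spread = replicate_spread(replicates)
```

Comparing such a spread between samples of low and high DNA concentration is one way to express the precision effect the abstract describes.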
Explaining the harmonic sequence paradox.
Schmidt, Ulrich; Zimper, Alexander
2012-05-01
According to the harmonic sequence paradox, an expected utility decision maker's willingness to pay for a gamble whose expected payoffs evolve according to the harmonic series is finite if and only if his marginal utility of additional income becomes zero for rather low payoff levels. Since the assumption of zero marginal utility is implausible for finite payoff levels, expected utility theory, as well as its standard generalizations such as cumulative prospect theory, is apparently unable to explain a finite willingness to pay. This paper first presents an experimental study of the harmonic sequence paradox. Additionally, it demonstrates that the theoretical argument of the harmonic sequence paradox only applies to time-patient decision makers, whereas the paradox is easily avoided if time-impatience is introduced. ©2011 The British Psychological Society.
Structured prediction models for RNN based sequence labeling in clinical text.
Jagannatha, Abhyuday N; Yu, Hong
2016-11-01
Sequence labeling is a widely used method for named entity recognition and information extraction from unstructured natural language data. In the clinical domain, one major application of sequence labeling involves extraction of medical entities such as medication, indication, and side-effects from Electronic Health Record narratives. Sequence labeling in this domain presents its own set of challenges and objectives. In this work we experimented with various CRF-based structured learning models with Recurrent Neural Networks. We extend the previously studied LSTM-CRF models with explicit modeling of pairwise potentials. We also propose an approximate version of skip-chain CRF inference with RNN potentials. We use these methodologies for structured prediction in order to improve the exact phrase detection of various medical entities.
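To make the role of pairwise potentials concrete, the following Python sketch shows plain Viterbi decoding for a linear-chain model, in which per-token label scores (as an RNN might produce) are combined with label-to-label transition scores. It is a generic textbook decoder under assumed toy scores, not the authors' LSTM-CRF or skip-chain inference.

```python
def viterbi(emissions, transitions):
    """Best label sequence for a linear-chain model.

    emissions[t][y]   : score of label y at position t (e.g. from an RNN)
    transitions[p][y] : pairwise score for moving from label p to label y
    Returns (best_path, best_score).
    """
    n_labels = len(emissions[0])
    score = list(emissions[0])
    backptr = []
    for emit in emissions[1:]:
        new_score, ptr = [], []
        for y in range(n_labels):
            # Best previous label for ending at label y in this position.
            best_prev = max(range(n_labels), key=lambda p: score[p] + transitions[p][y])
            ptr.append(best_prev)
            new_score.append(score[best_prev] + transitions[best_prev][y] + emit[y])
        score = new_score
        backptr.append(ptr)
    best = max(range(n_labels), key=lambda y: score[y])
    path = [best]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    return path[::-1], score[best]

# Toy scores (hypothetical): label 0 = outside an entity, label 1 = inside.
emissions = [[3.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
independent = viterbi(emissions, [[0, 0], [0, 0]])
with_pairwise = viterbi(emissions, [[0, -5], [-5, 0]])
```

With zero transition scores each token is labeled independently; adding a penalty for switching labels changes the decoded sequence, which is the kind of effect structured models exploit to recover exact phrase boundaries.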
Structured prediction models for RNN based sequence labeling in clinical text
Jagannatha, Abhyuday N; Yu, Hong
2016-01-01
Sequence labeling is a widely used method for named entity recognition and information extraction from unstructured natural language data. In the clinical domain, one major application of sequence labeling involves extraction of medical entities such as medication, indication, and side-effects from Electronic Health Record narratives. Sequence labeling in this domain presents its own set of challenges and objectives. In this work we experimented with various CRF-based structured learning models with Recurrent Neural Networks. We extend the previously studied LSTM-CRF models with explicit modeling of pairwise potentials. We also propose an approximate version of skip-chain CRF inference with RNN potentials. We use these methodologies for structured prediction in order to improve the exact phrase detection of various medical entities. PMID:28004040
Registration methods for nonblind watermark detection in digital cinema applications
NASA Astrophysics Data System (ADS)
Nguyen, Philippe; Balter, Raphaele; Montfort, Nicolas; Baudry, Severine
2003-06-01
Digital watermarking may be used to enforce copyright protection of digital cinema by embedding a unique identifier (fingerprint) in each projected movie. By identifying the source of illegal copies, watermarking will thus incite movie theatre managers to enforce copyright protection, in particular by preventing people from coming in with a handy cam. We propose here a non-blind method to improve watermark detection on very impaired sequences. We first present a study of the picture impairments caused by projection on a screen followed by acquisition with a handy cam. We show that images undergo geometric deformations, which are fully described by a projective geometry model. The sequence also undergoes spatial and temporal luminance variation. Based on this study and on the impairment models which follow from it, we propose a method to match the retrieved sequence to the original one. First, temporal registration is performed by comparing the average luminance variation of both sequences. To compensate for geometric transformations, we use paired points from both sequences, obtained by applying a feature point detector. Matching the feature points then makes it possible to retrieve the geometric transform parameters. Tests show that watermark retrieval on rectified sequences is greatly improved.
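The temporal registration step, comparing average luminance variation between the original and the captured sequence, can be sketched in Python as follows. The sketch assumes both sequences have the same frame rate and searches only for an integer frame offset; the function names and the toy luminance profile are hypothetical, and the authors' actual procedure may differ.

```python
def mean_luminance(frames):
    """Average pixel value of each frame; a frame is a list of rows of numbers."""
    return [sum(map(sum, f)) / sum(len(row) for row in f) for f in frames]

def best_offset(reference, captured):
    """Offset into `reference` at which the captured luminance profile fits best.

    Profiles are compared as frame-to-frame differences, so a constant
    brightness shift introduced by the capture does not affect the match.
    """
    def diffs(x):
        return [b - a for a, b in zip(x, x[1:])]

    ref, cap = diffs(reference), diffs(captured)
    best, best_err = 0, float("inf")
    for off in range(len(ref) - len(cap) + 1):
        err = sum((r - c) ** 2 for r, c in zip(ref[off:], cap))
        if err < best_err:
            best, best_err = off, err
    return best

# Toy profile: the "capture" is frames 3..7 of the original, uniformly brighter.
reference = [10, 12, 11, 15, 20, 18, 17, 25, 30, 28]
captured = [v + 40 for v in reference[3:8]]
offset = best_offset(reference, captured)
```

Once the sequences are aligned in time, the geometric step described in the abstract would estimate the projective transform from matched feature points, which this sketch does not attempt.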
Scholz, Christian F. P.; Poulsen, Knud
2012-01-01
The close phylogenetic relationship of the important pathogen Streptococcus pneumoniae and several species of commensal streptococci, particularly Streptococcus mitis and Streptococcus pseudopneumoniae, and the recently demonstrated sharing of genes and phenotypic traits previously considered specific for S. pneumoniae hamper the exact identification of S. pneumoniae. Based on sequence analysis of 16S rRNA genes of a collection of 634 streptococcal strains, identified by multilocus sequence analysis, we detected a cytosine at position 203 present in all 440 strains of S. pneumoniae but replaced by an adenosine residue in all strains representing other species of mitis group streptococci. The S. pneumoniae-specific sequence signature could be demonstrated by sequence analysis or indirectly by restriction endonuclease digestion of a PCR amplicon covering the site. The S. pneumoniae-specific signature offers an inexpensive means for validation of the identity of clinical isolates and should be used as an integrated marker in the annotation procedure employed in 16S rRNA-based molecular studies of complex human microbiotas. This may avoid frequent misidentifications such as those we demonstrate to have occurred in previous reports and in reference sequence databases. PMID:22442329
Shu, Fan-Fan; Lv, Rui-Qing; Zhang, Yi-Fang; Duan, Gang; Wu, Ding-Yu; Li, Bi-Feng; Yang, Jian-Fa; Zou, Feng-Cai
2012-08-01
On mainland China, liver flukes of Fasciola spp. (Digenea: Fasciolidae) can cause serious acute and chronic morbidity in numerous species of mammals such as sheep, goats, cattle, and humans. The objective of the present study was to examine the taxonomic identity of Fasciola species in Yunnan province by sequences of the first and second internal transcribed spacers (ITS-1 and ITS-2) of nuclear ribosomal DNA (rDNA). The ITS rDNA was amplified from 10 samples representing Fasciola species in cattle from 2 geographical locations in Yunnan Province, by polymerase chain reaction (PCR), and the products were sequenced directly. The lengths of the ITS-1 and ITS-2 sequences were 422 and 361-362 base pairs, respectively, for all samples sequenced. Using ITS sequences, 2 Fasciola species were revealed, namely Fasciola hepatica and Fasciola gigantica. This is the first demonstration of F. gigantica in cattle in Yunnan Province, China using a molecular approach; our findings have implications for studying the population genetic characterization of the Chinese Fasciola species and for the prevention and control of Fasciola spp. in this province.
Impact of sequencing depth on the characterization of the microbiome and resistome.
Zaheer, Rahat; Noyes, Noelle; Ortega Polo, Rodrigo; Cook, Shaun R; Marinier, Eric; Van Domselaar, Gary; Belk, Keith E; Morley, Paul S; McAllister, Tim A
2018-04-12
Developments in high-throughput next generation sequencing (NGS) technology have rapidly advanced the understanding of overall microbial ecology as well as occurrence and diversity of specific genes within diverse environments. In the present study, we compared the ability of varying sequencing depths to generate meaningful information about the taxonomic structure and prevalence of antimicrobial resistance genes (ARGs) in the bovine fecal microbial community. Metagenomic sequencing was conducted on eight composite fecal samples originating from four beef cattle feedlots. Metagenomic DNA was sequenced to various depths, D1, D0.5 and D0.25, with average sample read counts of 117, 59 and 26 million, respectively. A comparative analysis of the relative abundance of reads aligning to different phyla and antimicrobial classes indicated that the relative proportions of read assignments remained fairly constant regardless of depth. However, the number of reads being assigned to ARGs as well as to microbial taxa increased significantly with increasing depth. We found a depth of D0.5 was suitable to describe the microbiome and resistome of cattle fecal samples. This study helps define a balance between cost and required sequencing depth to acquire meaningful results.
Cicuendez, Marta; Castaño-León, Ana; Ramos, Ana; Hilario, Amaya; Gómez, Pedro A; Lagares, Alfonso
To compare the capability of different sequences on conventional magnetic resonance (MR) studies to identify traumatic axonal injury (TAI) in traumatic brain injury (TBI) patients. We retrospectively analyzed 264 TBI patients in whom an MR study had been performed in the first 60 days after trauma. All clinical variables related to prognosis were registered, as well as the data from the initial computed tomography. The MR imaging protocol consisted of a 3-plane localizer sequence T1-weighted and T2-weighted fast spin-echo, FLAIR and gradient-echo images (GRET2*). TAI lesions were classified according to Gentry and Firsching classifications. We calculated weighted kappa coefficients and the area under the ROC curve for each MR sequence. A multivariable analysis was performed to correlate MR findings in each sequence with the final outcome of the patients. TAI lesions were adequately visualized on T2, FLAIR and GRET2* sequences in more than 80% of the studies. Subcortical TAI lesions were well on FLAIR and GRET2* sequences visualized hemorrhagic TAI lesions. We saw that these MR sequences had a high inter-rater agreement for TAI diagnosis (0.8). The T2 sequence presented the highest area under the ROC curve for the Gentry (0.68, 95% CI: 0.61-0.76, p<0.001, Nagelkerke R² 0.26) and Firsching classifications (0.64, 95% CI: 0.57-0.72, p<0.001, Nagelkerke R² 0.19), followed by FLAIR and GRET2* sequences. Both classifications determined by each of these sequences were associated with poor outcome after a multivariable analysis adjusted for prognostic factors (p<0.02). We recommend performing a conventional MR study in the subacute phase, including T2, FLAIR and GRET2* sequences, to visualize TAI lesions. These MR findings added prognostic information in TBI patients. Copyright © 2017 Sociedad Española de Neurocirugía. Publicado por Elsevier España, S.L.U. All rights reserved.
Identifying Group-Specific Sequences for Microbial Communities Using Long k-mer Sequence Signatures
Wang, Ying; Fu, Lei; Ren, Jie; Yu, Zhaoxia; Chen, Ting; Sun, Fengzhu
2018-01-01
Comparing metagenomic samples is crucial for understanding microbial communities. For different groups of microbial communities, such as human gut metagenomic samples from patients with a certain disease and healthy controls, identifying group-specific sequences offers essential information for potential biomarker discovery. A sequence that is present, or rich, in one group, but absent, or scarce, in another group is considered “group-specific” in our study. Our main purpose is to discover group-specific sequence regions between control and case groups as disease-associated markers. We developed a long k-mer (k ≥ 30 bps)-based computational pipeline to detect group-specific sequences at strain resolution free from reference sequences, sequence alignments, and metagenome-wide de novo assembly. We called our method MetaGO: Group-specific oligonucleotide analysis for metagenomic samples. An open-source pipeline on Apache Spark was developed with parallel computing. We applied MetaGO to one simulated and three real metagenomic datasets to evaluate the discriminative capability of identified group-specific markers. In the simulated dataset, 99.11% of group-specific logical 40-mers covered 98.89% disease-specific regions from the disease-associated strain. In addition, 97.90% of group-specific numerical 40-mers covered 99.61 and 96.39% of differentially abundant genome and regions between two groups, respectively. For a large-scale metagenomic liver cirrhosis (LC)-associated dataset, we identified 37,647 group-specific 40-mer features. Any one of the features can predict disease status of the training samples with the average of sensitivity and specificity higher than 0.8. The random forests classification using the top 10 group-specific features yielded a higher AUC (from ∼0.8 to ∼0.9) than that of previous studies. All group-specific 40-mers were present in LC patients, but not healthy controls. 
All the assembled 11 LC-specific sequences can be mapped to two strains of Veillonella parvula: UTDB1-3 and DSM2008. The experiments on the other two real datasets related to Inflammatory Bowel Disease and Type 2 Diabetes in Women consistently demonstrated that MetaGO achieved better prediction accuracy with fewer features compared to previous studies. The experiments showed that MetaGO is a powerful tool for identifying group-specific k-mers, which would be clinically applicable for disease prediction. MetaGO is available at https://github.com/VVsmileyx/MetaGO. PMID:29774017
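The idea of a "logical" group-specific k-mer (present in one group, absent in the other) can be sketched as follows. This is a deliberately strict simplification for illustration: it keeps k-mers found in every case sample and in no control sample, whereas the abstract also allows "rich" versus "scarce" and does not spell out its filtering criteria. The function names are hypothetical and this is not the MetaGO implementation.

```python
def kmers(seq, k):
    """Return the set of all k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def sample_kmers(reads, k):
    """Return the set of k-mers observed in any read of one sample."""
    found = set()
    for read in reads:
        found |= kmers(read, k)
    return found


def group_specific_kmers(case_samples, control_samples, k):
    """k-mers present in every case sample and absent from every control.

    Each sample is a list of read strings. Strict presence/absence is an
    assumption of this sketch, not the published method.
    """
    if not case_samples:
        return set()
    shared = set.intersection(*(sample_kmers(s, k) for s in case_samples))
    for sample in control_samples:
        shared -= sample_kmers(sample, k)
    return shared
```

With real metagenomes and k of 30 or more, the k-mer sets become very large, which is consistent with the abstract's mention of a parallel Apache Spark pipeline; the in-memory sets here are only suitable for toy inputs.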
Pang, Jiaohui; Cheng, Qiqun; Sun, Dandan; Zhang, Heng; Jin, Shaofei
2016-09-01
Yellowfin tuna (Thunnus albacares) is one of the most economically important fishes in the world. In the present study, we determined the complete mitochondrial DNA sequence and organization of T. albacares. The entire mitochondrial genome is a circular molecule of 16,528 bp in length, which encodes 37 genes in all. These genes comprise 13 protein-coding genes (ATP6 and 8, COI-III, Cytb, ND1-6 and ND4L), 22 transfer RNA genes (tRNAs), and 2 ribosomal RNA genes (12S and 16S rRNAs). The complete mitochondrial genome sequence of T. albacares can provide basic information for studies on the molecular taxonomy and conservation genetics of teleost fishes.
Stark width regularities within spectral series of the lithium isoelectronic sequence
NASA Astrophysics Data System (ADS)
Tapalaga, Irinel; Trklja, Nora; Dojčinović, Ivan P.; Purić, Jagoš
2018-03-01
Stark width regularities within spectral series of the lithium isoelectronic sequence have been studied in an approach that includes both neutrals and ions. The influence of environmental conditions and certain atomic parameters on the Stark widths of spectral lines has been investigated. This study gives a simple model for the calculation of Stark broadening data for spectral lines within the lithium isoelectronic sequence. The proposed model requires fewer parameters than any other model. The obtained relations were used for predictions of Stark widths for transitions that have not yet been measured or calculated. In the framework of the present research, three algorithms for fast data processing have been made and they enable quality control and provide verification of the theoretically calculated results.
Jia, Yi; Huan, Jun; Buhr, Vincent; Zhang, Jintao; Carayannopoulos, Leonidas N
2009-01-01
Background Automatic identification of structure fingerprints from a group of diverse protein structures is challenging, especially for proteins whose divergent amino acid sequences may fall into the "twilight-" or "midnight-" zones where pair-wise sequence identities to known sequences fall below 25% and sequence-based functional annotations often fail. Results Here we report a novel graph database mining method and demonstrate its application to protein structure pattern identification and structure classification. The biologic motivation of our study is to recognize common structure patterns in "immunoevasins", proteins mediating virus evasion of host immune defense. Our experimental study, using both viral and non-viral proteins, demonstrates the efficiency and efficacy of the proposed method. Conclusion We present a theoretical framework, offer a practical software implementation for incorporating prior domain knowledge, such as substitution matrices as studied here, and devise an efficient algorithm to identify approximately matched frequent subgraphs. By doing so, we significantly expand the analytical power of sophisticated data mining algorithms in dealing with large volumes of complicated and noisy protein structure data. Without loss of generality, the choice of appropriate compatibility matrices allows our method to be easily employed in domains where subgraph labels have some uncertainty. PMID:19208148
Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo
2018-06-01
In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs), with 25 of them being represented by a single strain, indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among the seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study shows that multilocus sequence typing is a valuable tool to assess the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor evolutionary changes.
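The general MLST bookkeeping behind counts such as "27 sequence types among 35 strains" can be sketched in a few lines: each distinct sequence at a locus receives an allele number, and each distinct combination of allele numbers across loci receives a sequence type (ST). The numbering below is arbitrary (order of first appearance) and does not correspond to any curated MLST database; the function name and input layout are assumptions made for illustration.

```python
def assign_sequence_types(strains, loci):
    """Map each strain name to a sequence type (ST) number.

    strains: dict of strain name -> dict of locus name -> sequence string.
    loci: ordered list of locus names that define the allelic profile.
    Allele and ST numbers are assigned in order of first appearance.
    """
    allele_ids = {locus: {} for locus in loci}
    profile_to_st = {}
    result = {}
    for name, seqs in strains.items():
        # An allelic profile is the tuple of allele numbers across all loci.
        profile = tuple(
            allele_ids[locus].setdefault(seqs[locus], len(allele_ids[locus]) + 1)
            for locus in loci
        )
        result[name] = profile_to_st.setdefault(profile, len(profile_to_st) + 1)
    return result
```

Two strains share an ST only if their sequences are identical at every locus, which is why a single polymorphic site in one housekeeping gene is enough to create a new ST.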
Genetic Identification of Orientobilharzia turkestanicum from Sheep Isolates in Iran.
Tabaripour, Reza; Youssefi, Mohammad Reza; Tabaripour, Rabeeh
2015-01-01
Adult worms of Orientobilharzia turkestanicum live in the portal veins or intestinal veins of cattle, sheep, goats and many other mammals, causing orientobilharziasis. Orientobilharziasis causes significant economic losses to the livestock industry of Iran. However, there is limited information about genotypes of O. turkestanicum in Iran. In this study, 30 isolates of O. turkestanicum obtained from sheep were characterized by sequencing the mitochondrial cytochrome c oxidase subunit 1 (cox1) and nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1) genes. The mitochondrial cox1 and nad1 DNA were amplified by polymerase chain reaction (PCR), then sequenced and compared with sequences of O. turkestanicum and of other members of the Schistosomatidae available in GenBank. Phylogenetic relationships between them were reconstructed using the maximum parsimony method. Phylogenetic analyses done in the present study placed O. turkestanicum within the Schistosoma genus and indicated that O. turkestanicum was phylogenetically closer to the African schistosome group than to the Asian schistosome group. Comparison of nad1 and cox1 sequences of O. turkestanicum obtained in this study with corresponding sequences available in GenBank revealed some sequence variations and provided evidence for the presence of microvariants in Iran.
Sill, Orriana C; Smith, David M
2012-08-01
In recent years, many animal models of memory have focused on one or more of the various components of episodic memory. For example, the odor sequence memory task requires subjects to remember individual items and events (the odors) and the temporal aspects of the experience (the sequence of odor presentation). The well-known spatial context coding function of the hippocampus, as exemplified by place cell firing, may reflect the "where" component of episodic memory. In the present study, we added a contextual component to the odor sequence memory task by training rats to choose the earlier odor in one context and the later odor in another context and we compared the effects of temporary hippocampal lesions on performance of the original single context task and the new dual context task. Temporary lesions significantly impaired the single context task, although performance remained significantly above chance levels. In contrast, performance dropped all the way to chance when temporary lesions were used in the dual context task. These results demonstrate that rats can learn a dual context version of the odor sequence learning task that requires the use of contextual information along with the requirement to remember the "what" and "when" components of the odor sequence. Moreover, the addition of the contextual component made the task fully dependent on the hippocampus.
Phylogenetic Position of a Copper Age Sheep (Ovis aries) Mitochondrial DNA
Olivieri, Cristina; Ermini, Luca; Rizzi, Ermanno; Corti, Giorgio; Luciani, Stefania; Marota, Isolina; De Bellis, Gianluca; Rollo, Franco
2012-01-01
Background Sheep (Ovis aries) were domesticated in the Fertile Crescent region about 9,000-8,000 years ago. Currently, few mitochondrial (mt) DNA studies are available on archaeological sheep. In particular, no data on archaeological European sheep are available. Methodology/Principal Findings Here we describe the first portion of mtDNA sequence of a Copper Age European sheep. DNA was extracted from hair shafts which were part of the clothes of the so-called Tyrolean Iceman or Ötzi (5,350 - 5,100 years before present). Mitochondrial DNA (a total of 2,429 base pairs, encompassing a portion of the control region, tRNAPhe, a portion of the 12S rRNA gene, and the whole cytochrome B gene) was sequenced using a mixed sequencing procedure based on PCR amplification and 454 sequencing of pooled amplification products. We have compared the sequence with the corresponding sequence of 334 extant lineages. Conclusions/Significance A phylogenetic network based on a new cladistic notation for the mitochondrial diversity of domestic sheep shows that the Ötzi's sheep falls within haplogroup B, thus demonstrating that sheep belonging to this haplogroup were already present in the Alps more than 5,000 years ago. On the other hand, the lineage of the Ötzi's sheep is defined by two transitions (16147, and 16440) which, assembled together, define a motif that has not yet been identified in modern sheep populations. PMID:22457789
The role of RT carry-over for congruence sequence effects in masked priming.
Huber-Huber, Christoph; Ansorge, Ulrich
2017-05-01
The present study disentangles 2 sources of the congruence sequence effect with masked primes: congruence and response time of the previous trial (reaction time [RT] carry-over). Using arrows as primes and targets and a metacontrast masking procedure, we found congruence as well as congruence sequence effects. In addition, congruence sequence effects decreased when RT carry-over was accounted for in a mixed model analysis, suggesting that RT carry-over contributes to congruence sequence effects in masked priming. Crucially, effects of previous trial congruence were not cancelled out completely, indicating that RT carry-over and previous trial congruence are 2 sources feeding into the congruence sequence effect. A secondary task requiring response speed judgments demonstrated general awareness of response speed (Experiment 1), but removing this secondary task (Experiment 2) showed that RT carry-over effects were also present in single-task conditions. During (dual-task) prime-awareness test parts of both experiments, however, RT carry-over failed to modulate congruence effects, suggesting that some task sets of the participants can prevent the effect. The basic RT carry-over effects are consistent with the conflict adaptation account, with the adaptation to the statistics of the environment (ASE) model, and possibly with the temporal learning explanation. Additionally considering the task-dependence of RT carry-over, the results are most compatible with the conflict adaptation account. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Mucosal and Cutaneous Human Papillomaviruses Detected in Raw Sewages
La Rosa, Giuseppina; Fratini, Marta; Accardi, Luisa; D'Oro, Graziana; Della Libera, Simonetta; Muscillo, Michele; Di Bonito, Paola
2013-01-01
Epitheliotropic viruses can find their way into sewage. The aim of the present study was to investigate the occurrence, distribution, and genetic diversity of Human Papillomaviruses (HPVs) in urban wastewaters. Sewage samples were collected from treatment plants distributed throughout Italy. The DNA extracted from these samples was analyzed by PCR using five PV-specific sets of primers targeting the L1 (GP5/GP6, MY09/MY11, FAP59/64, SKF/SKR) and E1 regions (PM-A/PM-B), according to the protocols previously validated for the detection of mucosal and cutaneous HPV genotypes. PCR products underwent sequencing analysis and the sequences were aligned to reference genomes from the Papillomavirus Episteme database. Phylogenetic analysis was then performed to assess the genetic relationships among the different sequences and between the sequences of the samples and those of the prototype strains. A broad spectrum of sequences related to mucosal and cutaneous HPV types was detected in 81% of the sewage samples analyzed. Surprisingly, sequences related to the anogenital HPV6 and 11 were detected in 19% of the samples, and sequences related to the “high risk” oncogenic HPV16 were identified in two samples. Sequences related to HPV9, HPV20, HPV25, HPV76, HPV80, HPV104, HPV110, HPV111, HPV120 and HPV145 beta Papillomaviruses were detected in 76% of the samples. In addition, similarity searches and phylogenetic analysis of some sequences suggest that they could belong to putative new genotypes of the beta genus. In this study, for the first time, the presence of HPV viruses strongly related to human cancer is reported in sewage samples. Our data increases the knowledge of HPV genomic diversity and suggests that virological analysis of urban sewage can provide key information useful in supporting epidemiological studies. PMID:23341898
Eby, Joshua C; Turner, Lauren; Nguyen, Bryan; Kang, June; Neville, Carly; Temple, Louise
2016-09-15
The number of cases of pertussis has increased in the United States despite vaccination. We present the genome of an isolate of Bordetella pertussis from a vaccinated patient from Virginia. The genome was sequenced by long-read methodology and compared to that of a clinical isolate used for laboratory studies, D420. Copyright © 2016 Eby et al.
Complete mitochondrial genome sequence of Melipona scutellaris, a Brazilian stingless bee.
Pereira, Ulisses de Padua; Bonetti, Ana Maria; Goulart, Luiz Ricardo; Santos, Anderson Rodrigues Dos; Oliveira, Guilherme Correa de; Cuadros-Orellana, Sara; Ueira-Vieira, Carlos
2016-09-01
Melipona scutellaris is a Brazilian stingless bee species and a highly important native pollinator besides its use in rational rearing for honey production. In this study, we present the whole mitochondrial DNA sequence of M. scutellaris from a haploid male. The mitogenome has a size of 14,862 bp and harbors 13 protein-coding genes (PCGs), 2 rRNA genes and 21 tRNA genes.
Draft Whole-Genome Sequence of Bacillus altitudinis Strain B-388, a Producer of Extracellular RNase.
Shah Mahmud, Raihan; Ulyanova, Vera; Malanin, Sergey; Dudkina, Elena; Vershinina, Valentina; Ilinskaya, Olga
2015-01-29
Here, we present a draft genome sequence of Bacillus altitudinis strain B-388, including a putative plasmid. The strain was isolated from the intestine of Indian meal moth, a common pest of stored grains, and it is characterized by the production of extracellular RNase, similar to binase, which is of interest for comparative studies and biotechnology. Copyright © 2015 Shah Mahmud et al.
Kheiri, Ahmed; Keedwell, Ed
2017-01-01
Operations research is a well-established field that uses computational systems to support decisions in business and public life. Good solutions to operations research problems can make a large difference to the efficient running of businesses and organisations and so the field often searches for new methods to improve these solutions. The high school timetabling problem is an example of an operations research problem and is a challenging task which requires assigning events and resources to time slots subject to a set of constraints. In this article, a new sequence-based selection hyper-heuristic is presented that produces excellent results on a suite of high school timetabling problems. In this study, we present an easy-to-implement, easy-to-maintain, and effective sequence-based selection hyper-heuristic to solve high school timetabling problems using a benchmark of unified real-world instances collected from different countries. We show that with sequence-based methods, it is possible to discover new best known solutions for a number of the problems in the timetabling domain. Through this investigation, the usefulness of sequence-based selection hyper-heuristics has been demonstrated and the capability of these methods has been shown to exceed the state of the art.
SIMPLEX: Cloud-Enabled Pipeline for the Comprehensive Analysis of Exome Sequencing Data
Fischer, Maria; Snajder, Rene; Pabinger, Stephan; Dander, Andreas; Schossig, Anna; Zschocke, Johannes; Trajanoski, Zlatko; Stocker, Gernot
2012-01-01
In recent studies, exome sequencing has proven to be a successful screening tool for the identification of candidate genes causing rare genetic diseases. Although underlying targeted sequencing methods are well established, necessary data handling and focused, structured analysis still remain demanding tasks. Here, we present a cloud-enabled autonomous analysis pipeline, which comprises the complete exome analysis workflow. The pipeline combines several in-house developed and published applications to perform the following steps: (a) initial quality control, (b) intelligent data filtering and pre-processing, (c) sequence alignment to a reference genome, (d) SNP and DIP detection, (e) functional annotation of variants using different approaches, and (f) detailed report generation during various stages of the workflow. The pipeline connects the selected analysis steps, exposes all available parameters for customized usage, performs required data handling, and distributes computationally expensive tasks either on a dedicated high-performance computing infrastructure or on the Amazon cloud environment (EC2). The presented application has already been used in several research projects including studies to elucidate the role of rare genetic diseases. The pipeline is continuously tested and is publicly available under the GPL as a VirtualBox or Cloud image at http://simplex.i-med.ac.at; additional supplementary data is provided at http://www.icbi.at/exome. PMID:22870267
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies
Santiago-Rodriguez, Tasha M.; Luciani, Stefania; Toranzos, Gary A.; Marota, Isolina; Giuffra, Valentina; Cano, Raul J.
2017-01-01
Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypothesis that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights into ancient dietary habits, and the preserved putative human gut resistome prior to the antibiotic therapy era. PMID:29112136
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies.
Santiago-Rodriguez, Tasha M; Fornaciari, Gino; Luciani, Stefania; Toranzos, Gary A; Marota, Isolina; Giuffra, Valentina; Cano, Raul J
2017-11-07
Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypothesis that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights into ancient dietary habits, and the preserved putative human gut resistome prior to the antibiotic therapy era.
Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying
2014-01-01
A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it on four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php. PMID:24892935
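To make the goal of taxonomically diverse sampling concrete, the sketch below picks sequences round-robin across taxa, so that over-represented species cannot dominate a small subsample. This is a generic illustration of the objective stated in the abstract, not the AST algorithm, whose details are not given here; the function name and the (identifier, taxon) input layout are assumptions.

```python
from collections import defaultdict


def sample_diverse(records, n):
    """Pick up to n sequence ids, cycling over taxa to spread the sample.

    records: iterable of (sequence_id, taxon) pairs.
    Taxa are visited in order of first appearance; within a taxon,
    sequences are taken in input order.
    """
    by_taxon = defaultdict(list)
    for seq_id, taxon in records:
        by_taxon[taxon].append(seq_id)
    queues = list(by_taxon.values())
    picked = []
    depth = 0
    # Take the first sequence of every taxon, then the second, and so on.
    while len(picked) < n and any(depth < len(q) for q in queues):
        for q in queues:
            if depth < len(q) and len(picked) < n:
                picked.append(q[depth])
        depth += 1
    return picked
```

A flat taxon label per sequence ignores the hierarchy of the taxonomy; a fuller treatment would weight choices by taxonomic distance, which this sketch does not attempt.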
msgbsR: An R package for analysing methylation-sensitive restriction enzyme sequencing data.
Mayne, Benjamin T; Leemaqz, Shalem Y; Buckberry, Sam; Rodriguez Lopez, Carlos M; Roberts, Claire T; Bianco-Miotto, Tina; Breen, James
2018-02-01
Genotyping-by-sequencing (GBS) or restriction-site associated DNA marker sequencing (RAD-seq) is a practical and cost-effective method for analysing large genomes from high diversity species. This method of sequencing, coupled with methylation-sensitive enzymes (often referred to as methylation-sensitive restriction enzyme sequencing or MRE-seq), is an effective tool to study DNA methylation in parts of the genome that are inaccessible in other sequencing techniques or are not annotated in microarray technologies. Current software tools do not fulfil all methylation-sensitive restriction sequencing assays for determining differences in DNA methylation between samples. To fill this computational need, we present msgbsR, an R package that contains tools for the analysis of methylation-sensitive restriction enzyme sequencing experiments. msgbsR can be used to identify and quantify read counts at methylated sites directly from alignment files (BAM files) and enables verification of restriction enzyme cut sites with the correct recognition sequence of the individual enzyme. In addition, msgbsR assesses DNA methylation based on read coverage, similar to RNA sequencing experiments, rather than methylation proportion and is a useful tool in analysing differential methylation on large populations. The package is fully documented and available freely online as a Bioconductor package ( https://bioconductor.org/packages/release/bioc/html/msgbsR.html ).
V, Pavana Jyothi; S, Akila; Selvan, Malini K; Naidu, Hariprasad; Raghunathan, Shwethaa; Kota, Sathish; Sundaram, R C Raja; Rana, Samir Kumar; Raj, G Dhinakar; Srinivasan, V A; Mohana Subramanian, B
2016-12-01
Canine parvovirus (CPV) is a non-enveloped single-stranded DNA virus with an icosahedral capsid. Mini-sequencing based CPV typing was developed earlier to detect and differentiate all the CPV types and FPV in a single reaction. This technique was further evaluated in the present study by performing the mini-sequencing directly from fecal samples, which avoided the tedious virus isolation steps in a cell culture system. Fecal swab samples were collected from 84 dogs with enteritis symptoms suggestive of parvoviral infection, from different locations across India. Seventy-six of these samples were positive by PCR; the subsequent mini-sequencing reaction typed 74 of them as type 2a virus and 2 samples as type 2b. Additionally, 25 of the positive samples were typed by cycle sequencing of PCR products. Direct CPV typing from fecal samples using mini-sequencing showed 100% correlation with CPV typing by cycle sequencing. Moreover, CPV typing was achieved by mini-sequencing even with faintly positive PCR amplicons, which was not possible by cycle sequencing. Therefore, the mini-sequencing technique is recommended for regular epidemiological follow-up of CPV types, since it is a rapid, highly sensitive, high-capacity method for CPV typing.
Fröhlich, K U
1994-04-01
A new method for the presentation of alignments of long sequences is described. The degree of identity for the aligned sequences is averaged for sections of a fixed number of residues. The resulting values are converted to shades of gray, with white corresponding to lack of identity and black corresponding to perfect identity. A sequence alignment is represented as a bar filled with varying shades of gray. The display is compact and allows for a fast and intuitive recognition of the distribution of regions with a high similarity. It is well suited for the presentation of alignments of long sequences, e.g. of protein superfamilies, in plenary lectures. The method is implemented as a HyperCard stack for Apple Macintosh computers. Several options for the modification of the output are available (e.g. background reduction, size of the summation window, consideration of amino acid similarity, inclusion of graphic markers to indicate specific domains). The output is a PostScript file which can be printed, imported as EPS or processed further with Adobe Illustrator.
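The averaging-and-shading step described above can be illustrated with a minimal Python sketch. It is not the HyperCard implementation: the function names, the window size of 10, the 8-bit gray scale, and the example sequences are all assumptions made for illustration.

```python
def window_identity(seq_a, seq_b, window=10):
    """Fraction of identical residues in each fixed-size window of an aligned pair."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    values = []
    for start in range(0, len(seq_a), window):
        a = seq_a[start:start + window]
        b = seq_b[start:start + window]
        # A gap character never counts as an identity.
        matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
        values.append(matches / len(a))
    return values


def to_gray(identity):
    """Map identity in [0, 1] to an 8-bit gray level: 0.0 -> 255 (white), 1.0 -> 0 (black)."""
    return round(255 * (1.0 - identity))


# Hypothetical aligned pair: first window identical, second window unrelated.
aligned_a = "MKTAYIAKQR" + "QISFVKSHFS"
aligned_b = "MKTAYIAKQR" + "G" * 10
identities = window_identity(aligned_a, aligned_b, window=10)
shades = [to_gray(v) for v in identities]
```

Each entry of `shades` would fill one segment of the bar; a larger window gives a more compact but coarser display.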
Diaz, Francisco J; Berg, Michel J; Krebill, Ron; Welty, Timothy; Gidal, Barry E; Alloway, Rita; Privitera, Michael
2013-12-01
Due to concern and debate in the epilepsy medical community and to the current interest of the US Food and Drug Administration (FDA) in revising approaches to the approval of generic drugs, the FDA is currently supporting ongoing bioequivalence studies of antiepileptic drugs, the EQUIGEN studies. During the design of these crossover studies, the researchers could not find commercial or non-commercial statistical software that quickly allowed computation of sample sizes for their designs, particularly software implementing the FDA requirement of using random-effects linear models for the analyses of bioequivalence studies. This article presents tables for sample-size evaluations of average bioequivalence studies based on the two crossover designs used in the EQUIGEN studies: the four-period, two-sequence, two-formulation design, and the six-period, three-sequence, three-formulation design. Sample-size computations assume that random-effects linear models are used in bioequivalence analyses with crossover designs. Random-effects linear models have been traditionally viewed by many pharmacologists and clinical researchers as just mathematical devices to analyze repeated-measures data. In contrast, a modern view of these models attributes an important mathematical role in theoretical formulations in personalized medicine to them, because these models not only have parameters that represent average patients, but also have parameters that represent individual patients. Moreover, the notation and language of random-effects linear models have evolved over the years. Thus, another goal of this article is to provide a presentation of the statistical modeling of data from bioequivalence studies that highlights the modern view of these models, with special emphasis on power analyses and sample-size computations.
Mollusk genes encoding lysine tRNA (UUU) contain introns.
Matsuo, M; Abe, Y; Saruta, Y; Okada, N
1995-11-20
New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(UUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
Rapid identification of causative species in patients with Old World leishmaniasis.
Minodier, P; Piarroux, R; Gambarelli, F; Joblet, C; Dumon, H
1997-01-01
Conventional methods for identifying the species of Leishmania parasite causing an infection have limitations. The present study aimed to develop a DNA-based alternative for this purpose. Thirty-three patients living in Marseilles (in the south of France) were suffering from visceral or cutaneous leishmaniasis. DNA of the parasite in clinical samples (bone marrow, peripheral blood, or skin) from these patients was amplified by PCR and directly sequenced. The sequences observed were compared to those of 30 strains of the genus causing Old World leishmaniasis collected in Europe, Africa, or Asia. In the analysis of the sequences of the strains, two different sequence patterns for Leishmania infantum, one sequence for Leishmania donovani, one sequence for Leishmania major, two sequences for Leishmania tropica, and one sequence for Leishmania aethiopica were obtained. Four sequences were observed among the strains from the patients: one was similar to the sequence for the L. major strains, two were identical to the sequences for the L. infantum strains, and the last sequence was not observed within the strains but had a high degree of homology with the sequences of the L. infantum and L. donovani strains. The L. infantum strains from all immunocompetent patients had the same sequence. The L. infantum strains from immunodeficient patients suffering from visceral leishmaniasis had three different sequences. This fact might signify that some variants of L. infantum acquire pathogenicity exclusively in immunocompromised patients. To dispense with the sequencing step, a restriction assay with HaeIII was used. Some restriction patterns might support genetic exchanges in members of the genus Leishmania. PMID:9316906
1-deoxy-d-xylulose-5-phosphate reductoisomerases and method of use
Croteau, Rodney B.; Lange, Bernd M.
2001-01-01
The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.
1-deoxy-D-xylulose-5-phosphate reductoisomerases, and methods of use
Croteau, Rodney B.; Lange, Bernd M.
2002-07-16
The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.
A safety mechanism for observational learning.
Badets, Arnaud; Boutin, Arnaud; Michelet, Thomas
2018-04-01
This empirical article presents the first evidence of a "safety mechanism" based on an observational-learning paradigm. It is accepted that during observational learning, a person can use different strategies to learn a motor skill, but it is unknown whether the learner is able to circumvent the encoding of an uncompleted observed skill. In this study, participants were tested in a dyadic protocol in which an observer watched a participant practicing two different motor sequences during a learning phase. During this phase, one of the two motor sequences was interrupted by a stop signal that precluded motor learning. The results of the subsequent retention test revealed that both groups learned the two motor sequences, but only the physical practice group showed worse performance for the interrupted sequence. The observers were consequently able to use a safety strategy to learn both sequences equally. Our findings are discussed in light of the implications of the action observation network for sequence learning and the cognitive mechanisms of error-based observation.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae).
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-04-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-01-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans. PMID:27180575
NASA Astrophysics Data System (ADS)
Ramírez-Rojas, Alejandro; Telesca, Luciano; Lovallo, Michele; Flores, Leticia
2015-04-01
By using the method of the visibility graph (VG), five magnitude time series extracted from the seismic catalog of the Mexican subduction zone were investigated. The five seismic sequences represent the seismicity which occurred between 2005 and 2012 in five seismic areas: Guerrero, Chiapas, Oaxaca, Jalisco and Michoacan. Among the five seismic sequences, the Jalisco sequence shows VG properties significantly different from those shown by the other four. Such a difference could be inherent in the different tectonic settings of Jalisco with respect to those characterizing the other four areas. The VG properties of the seismic sequences have been put in relationship with the more typical seismological characteristics (b-value and a-value of the Gutenberg-Richter law). The present study was supported by the Bilateral Project Italy-Mexico "Experimental Stick-slip models of tectonic faults: innovative statistical approaches applied to synthetic seismic sequences", jointly funded by MAECI (Italy) and AMEXCID (Mexico) in the framework of the Bilateral Agreement for Scientific and Technological Cooperation PE 2014-2016
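As a rough illustration of how a magnitude time series is turned into a visibility graph, the sketch below assumes the standard natural-visibility criterion (two samples are linked when every sample between them lies strictly below the straight line joining them). The abstract does not state which VG variant was used, and the magnitudes in the example are invented.

```python
def visibility_edges(series):
    """Edges (a, b) of the natural visibility graph of an evenly sampled series."""
    n = len(series)
    edges = []
    for a in range(n):
        for b in range(a + 1, n):
            # Sample c must lie strictly below the line joining samples a and b.
            visible = all(
                series[c] < series[b] + (series[a] - series[b]) * (b - c) / (b - a)
                for c in range(a + 1, b)
            )
            if visible:
                edges.append((a, b))
    return edges


def degrees(n, edges):
    """Node degrees, a typical starting point for VG statistics."""
    deg = [0] * n
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


# Hypothetical magnitude sequence (event order used as the time index).
magnitudes = [4.0, 1.0, 3.0, 1.0, 2.0]
edges = visibility_edges(magnitudes)
node_degrees = degrees(len(magnitudes), edges)
```

The double loop is quadratic in the number of events, which is adequate for a sketch but would need a faster algorithm for a full catalog.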
Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech
2015-01-01
Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
Lorenzo, J Ramiro; Alonso, Leonardo G; Sánchez, Ignacio E
2015-01-01
Asparagine residues in proteins undergo spontaneous deamidation, a post-translational modification that may act as a molecular clock for the regulation of protein function and turnover. Asparagine deamidation is modulated by protein local sequence, secondary structure and hydrogen bonding. We present NGOME, an algorithm able to predict non-enzymatic deamidation of internal asparagine residues in proteins in the absence of structural data, using sequence-based predictions of secondary structure and intrinsic disorder. Compared to previous algorithms, NGOME does not require three-dimensional structures yet yields better predictions than available sequence-only methods. Four case studies of specific proteins show how NGOME may help the user identify deamidation-prone asparagine residues, often related to protein gain of function, protein degradation or protein misfolding in pathological processes. A fifth case study applies NGOME at a proteomic scale and unveils a correlation between asparagine deamidation and protein degradation in yeast. NGOME is freely available as a webserver at the National EMBnet node Argentina, URL: http://www.embnet.qb.fcen.uba.ar/ in the subpage "Protein and nucleic acid structure and sequence analysis".
Next-Generation Sequencing of Aquatic Oligochaetes: Comparison of Experimental Communities
Vivien, Régis; Lejzerowicz, Franck; Pawlowski, Jan
2016-01-01
Aquatic oligochaetes are a common group of freshwater benthic invertebrates known to be very sensitive to environmental changes and currently used as bioindicators in some countries. However, more extensive application of oligochaetes for assessing the ecological quality of sediments in watercourses and lakes would require overcoming the difficulties related to morphology-based identification of oligochaete species. This study tested the Next-Generation Sequencing (NGS) of a standard cytochrome c oxidase I (COI) barcode as a tool for the rapid assessment of oligochaete diversity in environmental samples, based on mixed specimen samples. To determine the composition of each sample, we Sanger-sequenced every specimen present in these samples. Our study showed that a large majority of OTUs (Operational Taxonomic Units) could be detected by NGS analyses. We also observed congruence between the NGS and specimen abundance data for several but not all OTUs. Because the differences in sequence abundance data were consistent across samples, we exploited these variations to empirically design correction factors. We showed that such factors increased the congruence between the values of oligochaete-based indices inferred from the NGS and the Sanger-sequenced specimen data. The validation of these correction factors by further experimental studies will be needed for the adaptation and use of NGS technology in biomonitoring studies based on oligochaete communities. PMID:26866802
Hsing, Michael; Cherkasov, Artem
2008-06-25
Insertions and deletions (indels) represent a common type of sequence variations, which are less studied and pose many important biological questions. Recent research has shown that the presence of sizable indels in protein sequences may be indicative of protein essentiality and their role in protein interaction networks. Examples of utilization of indels for structure-based drug design have also been recently demonstrated. Nonetheless many structural and functional characteristics of indels remain less researched or unknown. We have created a web-based resource, Indel PDB, representing a structural database of insertions/deletions identified from the sequence alignments of highly similar proteins found in the Protein Data Bank (PDB). Indel PDB utilized large amounts of available structural information to characterize 1-, 2- and 3-dimensional features of indel sites. Indel PDB contains 117,266 non-redundant indel sites extracted from 11,294 indel-containing proteins. Unlike loop databases, Indel PDB features more indel sequences with secondary structures including alpha-helices and beta-sheets in addition to loops. The insertion fragments have been characterized by their sequences, lengths, locations, secondary structure composition, solvent accessibility, protein domain association and three dimensional structures. By utilizing the data available in Indel PDB, we have studied and presented here several sequence and structural features of indels. We anticipate that Indel PDB will not only enable future functional studies of indels, but will also assist protein modeling efforts and identification of indel-directed drug binding sites.
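To make the notion of an indel site concrete, the following sketch locates gap runs in a pairwise alignment and reports their position and length. It is only an illustration: the function name, the tuple layout, and the example alignment are assumptions, and the actual Indel PDB pipeline is not described in the abstract.

```python
def indel_sites(ref, other):
    """Return (start, length, gapped_sequence) for each gap run in an aligned pair."""
    if len(ref) != len(other):
        raise ValueError("aligned sequences must have equal length")
    sites = []
    i, n = 0, len(ref)
    while i < n:
        if ref[i] == "-" or other[i] == "-":
            gap_in = "ref" if ref[i] == "-" else "other"
            gapped = ref if gap_in == "ref" else other
            start = i
            # Extend over the whole run of gap characters.
            while i < n and gapped[i] == "-":
                i += 1
            sites.append((start, i - start, gap_in))
        else:
            i += 1
    return sites


# Hypothetical alignment of two highly similar proteins.
aligned_ref = "MKT--AYIAK"
aligned_other = "MKTGGAY--K"
sites = indel_sites(aligned_ref, aligned_other)
```

A gap in `ref` corresponds to an insertion in `other` relative to `ref`, and vice versa; the structural features listed in the abstract would then be computed for the residues at each reported site.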
Moura, Felipe Arruda; van Emmerik, Richard E A; Santana, Juliana Exel; Martins, Luiz Eduardo Barreto; Barros, Ricardo Machado Leite de; Cunha, Sergio Augusto
2016-12-01
The purpose of this study was to investigate the coordination between the spread of opposing teams during football matches using cross-correlation and vector coding techniques. Using a video-based tracking system, we obtained the trajectories of 257 players during 10 matches. Team spread was calculated as a function of time. For a general coordination description, we calculated the cross-correlation between the signals. Vector coding was used to identify the coordination patterns between teams during offensive sequences that ended in shots on goal or defensive tackles. Cross-correlation showed that opposing teams tend to present in-phase coordination, with a short time lag. During offensive sequences, vector coding results showed that, although in-phase coordination dominated, other patterns were observed. We verified that during the early stages, offensive sequences ending in shots on goal present greater anti-phase and attacking team phase periods, compared to sequences ending in tackles. Results suggest that the attacking team may seek to behave contrary to its opponent (or may lead the opponent's behaviour) at the beginning of the attacking play, with regard to the distribution strategy, to increase the chances of a shot on goal. The techniques allowed the coordination patterns between teams to be detected, providing additional information about football dynamics and players' interaction.
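The cross-correlation step can be sketched as follows, assuming a plain Pearson correlation evaluated at a range of integer lags. The function names and the two spread series are hypothetical, and the vector coding analysis is not reproduced here.

```python
from statistics import mean


def cross_correlation(x, y, lag):
    """Pearson correlation between x[t] and y[t + lag]."""
    if lag >= 0:
        xs, ys = x[:len(x) - lag], y[lag:]
    else:
        xs, ys = x[-lag:], y[:len(y) + lag]
    mx, my = mean(xs), mean(ys)
    num = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    den = (sum((a - mx) ** 2 for a in xs) * sum((b - my) ** 2 for b in ys)) ** 0.5
    return num / den if den else 0.0


def best_lag(x, y, max_lag):
    """Lag in [-max_lag, max_lag] with the highest correlation."""
    return max(range(-max_lag, max_lag + 1), key=lambda k: cross_correlation(x, y, k))


# Hypothetical team-spread signals (metres); team B follows team A by one sample.
spread_a = [10, 12, 15, 13, 11, 14, 16, 12]
spread_b = [9, 10, 12, 15, 13, 11, 14, 16]
lag = best_lag(spread_a, spread_b, max_lag=2)
peak = cross_correlation(spread_a, spread_b, lag)
```

A positive correlation peak at a small lag is what the abstract describes as in-phase coordination with a short time lag.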
Le Bras, Stéphanie; Cohen-Tannoudji, Michel; Guyot, Valérie; Vandormael-Pournin, Sandrine; Coumailleau, Franck; Babinet, Charles; Baldacci, Patricia
2002-08-21
The DDK syndrome is defined as the embryonic lethality of F1 mouse embryos from crosses between DDK females and males from other strains (named hereafter as non-DDK strains). Genetically controlled by the Ovum mutant (Om) locus, it is due to a deleterious interaction between a maternal factor present in DDK oocytes and the non-DDK paternal pronucleus. Therefore, the DDK syndrome constitutes a unique genetic tool to study the crucial interactions that take place between the parental genomes and the egg cytoplasm during mammalian development. In this paper, we present an extensive analysis performed by exon trapping on the Om region. Twenty-seven trapped sequences were from genes in the databases: beta-adaptin, CCT zeta2, DNA LigaseIII, Notchless, Rad51l3 and Scya1. Twenty-eight other sequences presented similarities with expressed sequence tags and genomic sequences whereas 57 did not. The pattern of expression of 37 of these markers was established. Importantly, five of them are expressed in DDK oocytes and are candidate genes for the maternal factor, and 20 are candidate genes for the paternal factor since they are expressed in testis. This data is an important step towards identifying the genes responsible for the DDK syndrome.
Motor Sequence Learning-Induced Neural Efficiency in Functional Brain Connectivity
Karim, Helmet T; Huppert, Theodore J; Erickson, Kirk I; Wollam, Mariegold E; Sparto, Patrick J; Sejdić, Ervin; VanSwearingen, Jessie M
2016-01-01
Previous studies have shown the functional neural circuitry differences before and after an explicitly learned motor sequence task, but have not assessed these changes during the process of motor skill learning. Functional magnetic resonance imaging activity was measured while participants (n=13) were asked to tap their fingers to visually presented sequences in blocks that were either the same sequence repeated (learning block) or random sequences (control block). Motor learning was associated with a decrease in brain activity during learning compared to control. Lower brain activation was noted in the posterior parietal association area and bilateral thalamus during the later periods of learning (not during the control). Compared to the control condition, we found the task-related motor learning was associated with decreased connectivity between the putamen and left inferior frontal gyrus and left middle cingulate brain regions. Motor learning was associated with changes in network activity, spatial extent, and connectivity. PMID:27845228
Genome survey sequencing of red swamp crayfish Procambarus clarkii.
Shi, Linlin; Yi, Shaokui; Li, Yanhe
2018-06-21
The red swamp crayfish, Procambarus clarkii, is currently an important commercial aquatic species in China. The crayfish is an active research focus, and its genetic improvement is urgently needed for crayfish aquaculture in China. However, knowledge of its genomic landscape is limited. In this study, the P. clarkii genome was surveyed using Illumina's Solexa sequencing platform. Meanwhile, its genome size was estimated using flow cytometry. Interestingly, the estimated genome size is about 8.50 Gb by flow cytometry and 1.86 Gb by genome survey sequencing. Based on the assembled genome sequences, a total of 136,962 genes and 152,268 exons were predicted, and the predicted genes ranged from 150 to 12,807 bp in length. The survey sequences could help accelerate gene discovery for genetic diversity and evolutionary analyses, even though they could not be successfully applied to estimate the P. clarkii genome size.
Hwang, Byungjin; Bang, Duhee
2016-01-01
All synthetic DNA materials require prior programming of the building blocks of the oligonucleotide sequences. The development of a programmable microarray platform provides cost-effective and time-efficient solutions in the field of data storage using DNA. However, the scalability of the synthesis is not on par with the accelerating sequencing capacity. Here, we report on a new paradigm of generating genetic material (writing) using a degenerate oligonucleotide and optomechanical retrieval method that leverages sequencing (reading) throughput to generate the desired number of oligonucleotides. As a proof of concept, we demonstrate the feasibility of our concept in digital information storage in DNA. In simulation, the ability to store data is expected to exponentially increase with increase in degenerate space. The present study highlights the major framework change in conventional DNA writing paradigm as a sequencer itself can become a potential source of making genetic materials. PMID:27876825
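The claim that capacity grows exponentially with the degenerate space can be illustrated by counting how many distinct sequences a degenerate oligonucleotide represents. The sketch below uses the standard IUPAC nucleotide codes; the function names are hypothetical, and reading log2 of the count as addressable bits is an illustrative assumption, not the encoding scheme of the study.

```python
import math

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degenerate_space(oligo):
    """Number of distinct concrete sequences encoded by a degenerate oligo."""
    size = 1
    for symbol in oligo.upper():
        size *= len(IUPAC[symbol])
    return size


def addressable_bits(oligo):
    """log2 of the degenerate space (assumed here as a rough capacity measure)."""
    return math.log2(degenerate_space(oligo))


fixed = degenerate_space("ACGT")
four_n = degenerate_space("NNNN")
bits = addressable_bits("NNNN")
```

Each added fully degenerate position multiplies the space by four, which is the exponential growth the abstract refers to.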
Solving Assembly Sequence Planning using Angle Modulated Simulated Kalman Filter
NASA Astrophysics Data System (ADS)
Mustapa, Ainizar; Yusof, Zulkifli Md.; Adam, Asrul; Muhammad, Badaruddin; Ibrahim, Zuwairie
2018-03-01
This paper presents an implementation of the Simulated Kalman Filter (SKF) algorithm for optimizing an Assembly Sequence Planning (ASP) problem. The SKF search strategy contains three simple steps: predict, measure, and estimate. The main objective of ASP is to determine the sequence of component installation that shortens assembly time or saves assembly costs. Initially, a permutation sequence is generated to represent each agent. Each agent is then subjected to a precedence matrix constraint to produce a feasible assembly sequence. Next, the Angle Modulated SKF (AMSKF) is proposed for solving the ASP problem. The main idea of the angle modulated approach to solving combinatorial optimization problems is to use a function, g(x), to create a continuous signal. The performance of the proposed AMSKF is compared against previous works that solved ASP by applying BGSA, BPSO, and MSPSO. Using a case study of ASP, the results show that AMSKF outperformed all of these algorithms in obtaining the best solution.
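The precedence-matrix constraint can be sketched as a simple feasibility check on a candidate permutation. The convention that `precedence[i][j] == 1` means component i must be installed before component j is an assumption made here, as are the function name and the four-component example; the SKF search and the angle-modulation function g(x) are not reproduced.

```python
from itertools import permutations


def is_feasible(sequence, precedence):
    """True if the installation order respects every precedence constraint.

    Assumed convention: precedence[i][j] == 1 means i must come before j.
    """
    position = {component: idx for idx, component in enumerate(sequence)}
    n = len(precedence)
    for i in range(n):
        for j in range(n):
            if precedence[i][j] == 1 and position[i] > position[j]:
                return False
    return True


# Hypothetical product: component 0 first, components 1 and 2 before component 3.
precedence = [
    [0, 1, 1, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
ok = is_feasible([0, 2, 1, 3], precedence)
bad = is_feasible([1, 0, 2, 3], precedence)
n_feasible = sum(is_feasible(p, precedence) for p in permutations(range(4)))
```

In a search loop, a check like this would be used to reject or repair the permutation decoded from each agent before its assembly cost is evaluated.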
Hwang, Byungjin; Bang, Duhee
2016-11-23
All synthetic DNA materials require prior programming of the building blocks of the oligonucleotide sequences. The development of a programmable microarray platform provides cost-effective and time-efficient solutions in the field of data storage using DNA. However, the scalability of the synthesis is not on par with the accelerating sequencing capacity. Here, we report on a new paradigm of generating genetic material (writing) using a degenerate oligonucleotide and optomechanical retrieval method that leverages sequencing (reading) throughput to generate the desired number of oligonucleotides. As a proof of concept, we demonstrate the feasibility of our concept in digital information storage in DNA. In simulation, the ability to store data is expected to exponentially increase with increase in degenerate space. The present study highlights the major framework change in conventional DNA writing paradigm as a sequencer itself can become a potential source of making genetic materials.
Designing pH induced fold switch in proteins
NASA Astrophysics Data System (ADS)
Baruah, Anupaul; Biswas, Parbati
2015-05-01
This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Sobti, Ranbir Chander; Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh
2009-11-01
The present study aimed to obtain the nucleotide sequences of part of the COII mitochondrial gene amplified from individuals of five species of termites (Isoptera: Termitidae: Macrotermitinae). Four of them belonged to the genus Odontotermes (O. obesus, O. horni, O. bhagwatii and Odontotermes sp.) and one to Microtermes (M. obesi). Partial COII gene fragments were amplified by using specific primers. The sequences so obtained were characterized to calculate the frequency of each nucleotide base, and a high A + T content was observed. The interspecific pairwise sequence divergence in Odontotermes species ranged from 6.5% to 17.1% across the COII fragment. M. obesi sequence diversity ranged from 2.5% with Odontotermes sp. to 19.0% with O. bhagwatii. Phylogenetic trees drawn on the basis of the distance neighbour-joining method revealed three main clades clustering all the individuals according to their genera and families.
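The two quantities reported above, A + T content and pairwise sequence divergence, can be sketched as follows. The abstract does not state which distance model was used, so the uncorrected p-distance here is an assumption, and both function names are hypothetical.

```python
def at_content(seq: str) -> float:
    """Fraction of A and T bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("A") + seq.count("T")) / len(seq)


def p_distance(a: str, b: str) -> float:
    """Uncorrected proportion of differing sites between two aligned sequences.

    Assumption: the inputs are already aligned to equal length; columns
    with a gap ('-') in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared
```

Multiplying `p_distance` by 100 gives a percentage comparable in form to the divergence figures quoted in the abstract, although a corrected model would give somewhat larger values for distant pairs.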
Chandranaik, B M; Singh, Raj Kumar; Hosamani, Mahusudan; Krishnappa, Giriappa; Harish, Balur R; Chethana, C S; Renukaprasad, C
2011-02-01
The present paper describes the isolation of buffalo pox virus from scab lesions and its molecular characterization through B5R gene sequencing. During our study, pustular pox lesions were observed on the teats and mammary parenchyma of cattle and buffaloes, and the disease was of significant zoonotic importance since similar lesions were produced on the hands, legs, and face of people in close contact with the affected animals. The collected scab materials were subjected to virus isolation in 9-11-day-old chicken embryos by the chorioallantoic membrane route and in the Vero cell line. The virus was confirmed by a sensitive and rapid diagnostic polymerase chain reaction using primers that amplify the "A type inclusion" gene, and further, the B5R gene of the virus was sequenced and compared with the corresponding sequences of other orthopoxviruses. The results showed high sequence homology of our isolates with other orthopoxviruses.
Krzeminska, Urszula; Wilson, Robyn; Rahman, Sadequr; Song, Beng Kah; Seneviratne, Sampath; Gan, Han Ming; Austin, Christopher M
2016-07-01
The complete mitochondrial genomes of two jungle crows (Corvus macrorhynchos) were sequenced. DNA was extracted from tissue samples obtained from shed feathers collected in the field in Sri Lanka and sequenced using the Illumina MiSeq Personal Sequencer. Jungle crow mitogenomes have a structural organization typical of the genus Corvus and are 16,927 bp and 17,066 bp in length, both comprising 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal subunit genes, and a non-coding control region. In addition, we complement already available house crow (Corvus splendens) mitogenome resources by sequencing an individual from Singapore. A phylogenetic tree constructed from Corvidae family mitogenome sequences available on GenBank is presented. We confirm the monophyly of the genus Corvus and propose to use complete mitogenome resources for further intra- and interspecies genetic studies.
Transformation of temporal sequences in the zebra finch auditory system
Lim, Yoonseob; Lagoy, Ryan; Shinn-Cunningham, Barbara G; Gardner, Timothy J
2016-01-01
This study examines how temporally patterned stimuli are transformed as they propagate from primary to secondary zones in the thalamorecipient auditory pallium in zebra finches. Using a new class of synthetic click stimuli, we find a robust mapping from temporal sequences in the primary zone to distinct population vectors in secondary auditory areas. We tested whether songbirds could discriminate synthetic click sequences in an operant setup and found that a robust behavioral discrimination is present for click sequences composed of intervals ranging from 11 ms to 40 ms, but breaks down for stimuli composed of longer inter-click intervals. This work suggests that the analog of the songbird auditory cortex transforms temporal patterns to sequence-selective population responses or ‘spatial codes’, and that these distinct population responses contribute to behavioral discrimination of temporally complex sounds. DOI: http://dx.doi.org/10.7554/eLife.18205.001 PMID:27897971
Tosto, D S; Hopp, H E
1996-01-01
The internal transcribed spacer region (ITS1 and ITS2) of the 18S-25S nuclear ribosomal DNA sequence and the intervening 5.8S region from five species of the genus Oxalis were amplified by polymerase chain reaction and subjected to direct DNA sequencing. On the basis of cytogenetic studies some species of this genus were postulated to be related by the number of chromosomes. Sequence homologies in the ITS1, 5.8S and ITS2 among species are in good agreement with previous relationships established on the basis of chromosome numbers. We also identified a highly conserved sequence of six bp in the ITS1, reported to be present in a wide range of flowering plants, but not in the Oxalidaceae family to which the genus Oxalis belongs.
Zhang, Hanyuan; Vieira Resende E Silva, Bruno; Cui, Juan
2018-05-01
Small RNA sequencing is the most widely used tool for microRNA (miRNA) discovery, and shows great potential for the efficient study of miRNA cross-species transport, i.e., by detecting the presence of exogenous miRNA sequences in the host species. Because of the increased appreciation of dietary miRNAs and their far-reaching implication in human health, research interests are currently growing with regard to exogenous miRNAs bioavailability, mechanisms of cross-species transport and miRNA function in cellular biological processes. In this article, we present microRNA Discovery (miRDis), a new small RNA sequencing data analysis pipeline for both endogenous and exogenous miRNA detection. Specifically, we developed and deployed a Web service that supports the annotation and expression profiling data of known host miRNAs and the detection of novel miRNAs, other noncoding RNAs, and the exogenous miRNAs from dietary species. As a proof-of-concept, we analyzed a set of human plasma sequencing data from a milk-feeding study where 225 human miRNAs were detected in the plasma samples and 44 show elevated expression after milk intake. By examining the bovine-specific sequences, data indicate that three bovine miRNAs (bta-miR-378, -181* and -150) are present in human plasma possibly because of the dietary uptake. Further evaluation based on different sets of public data demonstrates that miRDis outperforms other state-of-the-art tools in both detection and quantification of miRNA from either animal or plant sources. The miRDis Web server is available at: http://sbbi.unl.edu/miRDis/index.php.
Chen, Tsute; Yu, Wen-Han; Izard, Jacques; Baranova, Oxana V.; Lakshmanan, Abirami; Dewhirst, Floyd E.
2010-01-01
The human oral microbiome is the most studied human microflora, but 53% of the species have not yet been validly named and 35% remain uncultivated. The uncultivated taxa are known primarily from 16S rRNA sequence information. Sequence information tied solely to obscure isolate or clone numbers, and usually lacking accurate phylogenetic placement, is a major impediment to working with human oral microbiome data. The goal of creating the Human Oral Microbiome Database (HOMD) is to provide the scientific community with a body site-specific comprehensive database for the more than 600 prokaryote species that are present in the human oral cavity based on a curated 16S rRNA gene-based provisional naming scheme. Currently, two primary types of information are provided in HOMD: taxonomic and genomic. Named oral species and taxa identified from 16S rRNA gene sequence analysis of oral isolates and cloning studies were placed into defined 16S rRNA phylotypes and each given a unique Human Oral Taxon (HOT) number. The HOT interlinks phenotypic, phylogenetic, genomic, clinical and bibliographic information for each taxon. A BLAST search tool is provided to match user 16S rRNA gene sequences to a curated, full length, 16S rRNA gene reference data set. For genomic analysis, HOMD provides a comprehensive set of analysis tools and maintains frequently updated annotations for all the human oral microbial genomes that have been sequenced and publicly released. Oral bacterial genome sequences, determined as part of the Human Microbiome Project, are being added to the HOMD as they become available. We provide HOMD as a conceptual model for the presentation of microbiome data for other human body sites. Database URL: http://www.homd.org PMID:20624719
Using high throughput sequencing to explore the biodiversity in oral bacterial communities.
Diaz, P I; Dupuy, A K; Abusleme, L; Reese, B; Obergfell, C; Choquette, L; Dongari-Bagtzoglou, A; Peterson, D E; Terzi, E; Strausbaugh, L D
2012-06-01
High throughput sequencing of 16S ribosomal RNA gene amplicons is a cost-effective method for characterization of oral bacterial communities. However, before undertaking large-scale studies, it is necessary to understand the technique-associated limitations and intrinsic variability of the oral ecosystem. In this work we evaluated bias in species representation using an in vitro-assembled mock community of oral bacteria. We then characterized the bacterial communities in saliva and buccal mucosa of five healthy subjects to investigate the power of high throughput sequencing in revealing their diversity and biogeography patterns. Mock community analysis showed primer and DNA isolation biases and an overestimation of diversity that was reduced after eliminating singleton operational taxonomic units (OTUs). Sequencing of salivary and mucosal communities found a total of 455 OTUs (0.3% dissimilarity) with only 78 of these present in all subjects. We demonstrate that this variability was partly the result of incomplete richness coverage even at great sequencing depths, and so comparing communities by their structure was more effective than comparisons based solely on membership. With respect to oral biogeography, we found inter-subject variability in community structure was lower than site differences between salivary and mucosal communities within subjects. These differences were evident at very low sequencing depths and were mostly caused by the abundance of Streptococcus mitis and Gemella haemolysans in mucosa. In summary, we present an experimental and data analysis framework that will facilitate design and interpretation of pyrosequencing-based studies. Despite challenges associated with this technique, we demonstrate its power for evaluation of oral diversity and biogeography patterns. © 2012 John Wiley & Sons A/S.
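Two ideas in this abstract lend themselves to a short sketch: discarding singleton OTUs, and the difference between comparing communities by membership and by structure. The abstract does not name the metrics used, so Jaccard dissimilarity (membership) and Bray-Curtis dissimilarity (structure) are assumptions here, as is defining a singleton as an OTU with a total count of one across all samples. All names and counts are hypothetical.

```python
from collections import Counter


def drop_singletons(sample_counts):
    """Remove OTUs whose total count across all samples is exactly 1."""
    totals = Counter()
    for counts in sample_counts.values():
        totals.update(counts)  # adds the per-sample counts to the totals
    return {
        sample: {otu: n for otu, n in counts.items() if totals[otu] > 1}
        for sample, counts in sample_counts.items()
    }


def jaccard_dissimilarity(a, b):
    """Membership-based: uses only presence or absence of each OTU."""
    present_a = {otu for otu, n in a.items() if n > 0}
    present_b = {otu for otu, n in b.items() if n > 0}
    union = present_a | present_b
    if not union:
        return 0.0
    return 1 - len(present_a & present_b) / len(union)


def bray_curtis(a, b):
    """Structure-based: uses the abundance of each OTU."""
    otus = set(a) | set(b)
    shared = sum(min(a.get(o, 0), b.get(o, 0)) for o in otus)
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        return 0.0
    return 1 - 2 * shared / total


# Made-up counts for two sites from one subject.
samples = {
    "saliva": {"otu1": 50, "otu2": 1, "otu3": 10},
    "mucosa": {"otu1": 40, "otu3": 20},
}
filtered = drop_singletons(samples)
```

In this toy data the two sites share every remaining OTU, so a membership measure sees no difference while an abundance-based measure still separates them, which is the kind of contrast the abstract draws.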
Alvares, Keith; Dixit, Saryu N; Lux, Elizabeth; Veis, Arthur
2009-09-18
Studies of mineralization of embryonic spicules and of the sea urchin genome have identified several putative mineralization-related proteins. These predicted proteins have not been isolated or confirmed in mature mineralized tissues. Mature Lytechinus variegatus teeth were demineralized with 0.6 N HCl after prior removal of non-mineralized constituents with 4.0 M guanidinium HCl. The HCl-extracted proteins were fractionated on ceramic hydroxyapatite and separated into bound and unbound pools. Gel electrophoresis compared the protein distributions. The differentially present bands were purified and digested with trypsin, and the tryptic peptides were separated by high pressure liquid chromatography. NH2-terminal sequences were determined by Edman degradation and compared with the genomic sequence bank data. Two of the putative mineralization-related proteins were found. Their complete amino acid sequences were cloned from our L. variegatus cDNA library. Apatite-binding UTMP16 was found to be present in two isoforms; both isoforms had a signal sequence, a Ser-Asp-rich extracellular matrix domain, and a transmembrane and cytosolic insertion sequence. UTMP19, although rich in Glu and Thr did not bind to apatite. It had neither signal peptide nor transmembrane domain but did have typical nuclear localization and nuclear exit signal sequences. Both proteins were phosphorylated and good substrates for phosphatase. Immunolocalization studies with anti-UTMP16 show it to concentrate at the syncytial membranes in contact with the mineral. On the basis of our TOF-SIMS analyses of magnesium ion and Asp mapping of the mineral phase composition, we speculate that UTMP16 may be important in establishing the high magnesium columns that fuse the calcite plates together to enhance the mechanical strength of the mineralized tooth.
Identification and analysis of pig chimeric mRNAs using RNA sequencing data
2012-01-01
Background Gene fusion is ubiquitous over the course of evolution. It is expected to increase the diversity and complexity of transcriptomes and proteomes through chimeric sequence segments or altered regulation. However, chimeric mRNAs in pigs remain poorly characterized. Here we identified chimeric mRNAs in pigs and analyzed their expression across individuals and breeds using RNA-sequencing data. Results The present study identified 669 putative chimeric mRNAs in pigs, of which 251 chimeric candidates were detected in a set of RNA-sequencing data. Of the candidates, 618 had clear trans-splicing sites, 537 of which obeyed the canonical GU-AG splice rule. Only two putative pig chimera variants whose fusion junction overlapped with that of a known human chimeric mRNA were found. A set of unique chimeric events were considered middle variances in the expression across individuals and breeds, and revealed non-significant variance between sexes. Furthermore, for 458 putative chimeric mRNAs, the genomic region of the 5′ partner gene shares a similar DNA sequence with that of the 3′ partner gene. Eighty-one of those shared DNA sequences significantly matched known DNA-binding motifs in the JASPAR CORE database. Four DNA motifs shared in parental genomic regions had significant similarity with known human CTCF binding sites. Conclusions The present study provided detailed information on some pig chimeric mRNAs. We propose a model in which trans-acting factors, such as CTCF, induce the spatial organisation of parental genes into the same transcriptional factory so that the parental genes are coordinately transcribed to give rise to chimeric mRNAs. PMID:22925561
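The canonical GU-AG splice rule mentioned in the Results can be expressed as a simple check on the sequence removed at a junction. This is only an illustration of the rule itself; how the study scored its trans-splicing sites is not described in the abstract, and the function name is hypothetical.

```python
def is_canonical_intron(intron: str) -> bool:
    """Return True if the sequence starts with GU and ends with AG.

    Accepts DNA or RNA letters in either case; T is treated as U so that
    the genomic GT...AG form is recognised as well.
    """
    rna = intron.upper().replace("T", "U")
    return len(rna) >= 4 and rna.startswith("GU") and rna.endswith("AG")
```

Applied to each candidate junction, a check of this kind would yield a fraction comparable in form to the 537 of 618 reported above.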
Phylogenetic analysis of human immunodeficiency virus type 2 isolated from Cuban individuals.
Machado, Liuber Y; Díaz, Héctor M; Noa, Enrique; Martín, Dayamí; Blanco, Madeline; Díaz, Dervel F; Sánchez, Yordank R; Nibot, Carmen; Sánchez, Lourdes; Dubed, Marta
2014-08-01
The presence of infection by human immunodeficiency virus type 2 (HIV-2) in Cuba has been previously documented. However, genetic information on the strains that circulate in the Cuban people is still unknown. The present work constitutes the first study concerning the phylogenetic relationship of HIV-2 Cuban isolates conducted on 13 Cuban patients who were diagnosed with HIV-2. The env sequences were analyzed for the construction of a phylogenetic tree with reference sequences of HIV-2. Phylogenetic analysis of the env gene showed that all the Cuban sequences clustered in group A of HIV-2. The analysis indicated several independent introductions of HIV-2 into Cuba. The results of the study will reinforce the program on the epidemiological surveillance of the infection in Cuba and make possible further molecular evolutionary studies.
Microbial Metagenomics: Beyond the Genome
NASA Astrophysics Data System (ADS)
Gilbert, Jack A.; Dupont, Christopher L.
2011-01-01
Metagenomics literally means “beyond the genome.” Marine microbial metagenomic databases presently comprise ~400 billion base pairs of DNA, only ~3% of that found in 1 ml of seawater. Very soon a trillion-base-pair sequence run will be feasible, so it is time to reflect on what we have learned from metagenomics. We review the impact of metagenomics on our understanding of marine microbial communities. We consider the studies facilitated by data generated through the Global Ocean Sampling expedition, as well as the revolution wrought at the individual laboratory level through next generation sequencing technologies. We review recent studies and discoveries since 2008, provide a discussion of bioinformatic analyses, including conceptual pipelines and sequence annotation and predict the future of metagenomics, with suggestions of collaborative community studies tailored toward answering some of the fundamental questions in marine microbial ecology.
ERIC Educational Resources Information Center
Noland, Mildred Jean
A study was conducted investigating whether a sequence of visuals presented in a serial manner differs in connotative meaning from the same set of visuals presented simultaneously. How the meanings of pairs of shots relate to their constituent visuals was also explored. Sixteen pairs of visuals were presented to both male and female subjects in…
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA
2010-01-01
Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology
Udy, Dylan B.; Voorhies, Mark; Chan, Patricia P.; Lowe, Todd M.; Dumont, Sophie
2015-01-01
The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes—and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics. PMID:26252667
A sample of potential disk hosting first ascent red giants
NASA Astrophysics Data System (ADS)
Steele, Amy; Debes, John
2018-01-01
Observations of (sub)giants with planets and disks provide the first set of proof that disks can survive the first stages of post-main-sequence evolution, even though the disks are expected to dissipate by this time. The infrared (IR) excesses present around a number of post-main-sequence (PMS) stars could be due to a traditional debris disk with planets (e.g. kappa CrB), some remnant of enhanced mass loss (e.g. the shell-like structure of R Sculptoris), and/or background contamination. We present a sample of potential disk hosting first ascent red giants. These stars all have infrared excesses at 22 microns, and possibly host circumstellar debris. We summarize the characteristics of the sample to better inform the incidence rates of thermally emitting material around giant stars. A thorough follow-up study of these candidates would serve as the first step in probing the composition of the dust in these systems that have left the main sequence, providing clues to the degree of disk processing that occurs beyond the main-sequence.
VisRseq: R-based visual framework for analysis of sequencing data
2015-01-01
Background Several tools have been developed to enable biologists to perform initial browsing and exploration of sequencing data. However the computational tool set for further analyses often requires significant computational expertise to use and many of the biologists with the knowledge needed to interpret these data must rely on programming experts. Results We present VisRseq, a framework for analysis of sequencing datasets that provides a computationally rich and accessible framework for integrative and interactive analyses without requiring programming expertise. We achieve this aim by providing R apps, which offer a semi-auto generated and unified graphical user interface for computational packages in R and repositories such as Bioconductor. To address the interactivity limitation inherent in R libraries, our framework includes several native apps that provide exploration and brushing operations as well as an integrated genome browser. The apps can be chained together to create more powerful analysis workflows. Conclusions To validate the usability of VisRseq for analysis of sequencing data, we present two case studies performed by our collaborators and report their workflow and insights. PMID:26328469
VisRseq: R-based visual framework for analysis of sequencing data.
Younesy, Hamid; Möller, Torsten; Lorincz, Matthew C; Karimi, Mohammad M; Jones, Steven J M
2015-01-01
Several tools have been developed to enable biologists to perform initial browsing and exploration of sequencing data. However the computational tool set for further analyses often requires significant computational expertise to use and many of the biologists with the knowledge needed to interpret these data must rely on programming experts. We present VisRseq, a framework for analysis of sequencing datasets that provides a computationally rich and accessible framework for integrative and interactive analyses without requiring programming expertise. We achieve this aim by providing R apps, which offer a semi-auto generated and unified graphical user interface for computational packages in R and repositories such as Bioconductor. To address the interactivity limitation inherent in R libraries, our framework includes several native apps that provide exploration and brushing operations as well as an integrated genome browser. The apps can be chained together to create more powerful analysis workflows. To validate the usability of VisRseq for analysis of sequencing data, we present two case studies performed by our collaborators and report their workflow and insights.
Digital video steganalysis exploiting collusion sensitivity
NASA Astrophysics Data System (ADS)
Budhia, Udit; Kundur, Deepa
2004-09-01
In this paper we present an effective steganalysis technique for digital video sequences based on the collusion attack. Steganalysis is the process of detecting with a high probability and low complexity the presence of covert data in multimedia. Existing algorithms for steganalysis target detecting covert information in still images. When applied directly to video sequences these approaches are suboptimal. In this paper, we present a method that overcomes this limitation by using redundant information present in the temporal domain to detect covert messages in the form of Gaussian watermarks. Our gains are achieved by exploiting the collusion attack that has recently been studied in the field of digital video watermarking, and more sophisticated pattern recognition tools. Applications of our scheme include cybersecurity and cyberforensics.
SeqCompress: an algorithm for biological sequence compression.
Sardaraz, Muhammad; Tahir, Muhammad; Ikram, Ataul Aziz; Bajwa, Hassan
2014-10-01
The growth of Next Generation Sequencing technologies presents significant research challenges, specifically the design of bioinformatics tools that handle massive amounts of data efficiently. The cost of storing biological sequence data has become a noticeable proportion of the total cost of generation and analysis. In particular, the DNA sequencing rate is increasing significantly faster than disk storage capacity, and may eventually exceed the available storage. It is essential to develop algorithms that handle large data sets via better memory management. This article presents a DNA sequence compression algorithm, SeqCompress, that copes with the space complexity of biological sequences. The algorithm is based on lossless data compression and uses a statistical model as well as arithmetic coding to compress DNA sequences. The proposed algorithm is compared with recent specialized compression tools for biological sequences. Experimental results show that the proposed algorithm has better compression gain than other existing algorithms. Copyright © 2014 Elsevier Inc. All rights reserved.
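The pairing of a statistical model with arithmetic coding can be illustrated by computing the ideal code length a model assigns to a sequence, which an arithmetic coder approaches to within a few bits. This is not the SeqCompress algorithm, whose details are not given in the abstract. It is a generic sketch assuming an adaptive order-k context model with add-one smoothing over the alphabet A, C, G, T, and the function name is hypothetical.

```python
import math
from collections import defaultdict


def model_code_length_bits(seq: str, order: int = 2) -> float:
    """Ideal code length in bits of `seq` under an adaptive context model.

    Each base is predicted from the preceding `order` bases. Counts start
    at 1 for every base (add-one smoothing) and are updated after each
    symbol, as a decoder could do in lockstep. Input must contain only
    the letters A, C, G and T.
    """
    alphabet = "ACGT"
    counts = defaultdict(lambda: dict.fromkeys(alphabet, 1))
    bits = 0.0
    for i, base in enumerate(seq):
        context = seq[max(0, i - order):i]
        table = counts[context]
        probability = table[base] / sum(table.values())
        bits -= math.log2(probability)  # cost of coding this base
        table[base] += 1                # adapt the model
    return bits


# A highly repetitive sequence should cost far less than the plain
# 2 bits per base of a fixed-width encoding.
repetitive = "ACGT" * 50
baseline_bits = 2 * len(repetitive)
modelled_bits = model_code_length_bits(repetitive)
```

Comparing `modelled_bits` with `baseline_bits` shows where the compression gain of a model-based coder comes from: the better the model predicts the next base, the fewer bits that base costs.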
Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization.
Bauer, Markus; Klau, Gunnar W; Reinert, Knut
2007-07-27
The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account. We present a graph-based representation for sequence-structure alignments, which we model as an integer linear program (ILP). We sketch how we compute an optimal or near-optimal solution to the ILP using methods from combinatorial optimization, and present results on a recently published benchmark set for RNA alignments. The implementation of our algorithm yields better alignments in terms of two published scores than the other programs that we tested: This is especially the case with an increasing number of input sequences. Our program LARA is freely available for academic purposes from http://www.planet-lisa.net.
Nanopore-CMOS Interfaces for DNA Sequencing
Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim
2016-01-01
DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces. PMID:27509529
Hierarchical Traces for Reduced NSM Memory Requirements
NASA Astrophysics Data System (ADS)
Dahl, Torbjørn S.
This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based reinforcement learning algorithm. A hierarchical memory representation reduces the memory requirements by allowing traces to share common sub-sequences. We present moderated mechanisms for estimating discounted future rewards and for dealing with hidden state using hierarchical memory. We also present an experimental analysis of how the sub-sequence length affects the memory compression achieved and show that the reduced memory requirements do not affect the speed of learning. Finally, we analyse and discuss the persistence of the sub-sequences independent of specific trace instances.
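The idea of letting traces share common sub-sequences can be sketched with a shared lookup table. The paper's actual memory representation is not described in the abstract; this sketch assumes fixed-length, non-overlapping chunks, and the function name and trace contents are hypothetical.

```python
def compress_traces(traces, sub_len):
    """Store each trace as indices into a shared table of sub-sequences.

    Every trace is cut into consecutive chunks of `sub_len` steps. A chunk
    seen before is stored once and referenced by its index.
    Returns (table, encoded_traces).
    """
    index_of = {}  # chunk (as a tuple) -> index in the shared table
    encoded = []
    for trace in traces:
        ids = []
        for start in range(0, len(trace), sub_len):
            chunk = tuple(trace[start:start + sub_len])
            # len(index_of) is evaluated before insertion, so a new chunk
            # receives the next free index.
            ids.append(index_of.setdefault(chunk, len(index_of)))
        encoded.append(ids)
    return list(index_of), encoded


# Three made-up traces of observation-action labels.
traces = [
    ["a", "b", "c", "d"],
    ["a", "b", "e", "f"],
    ["c", "d", "a", "b"],
]
table, encoded = compress_traces(traces, sub_len=2)
```

With these traces the twelve stored steps reduce to three shared chunks plus six indices. Varying `sub_len` changes how often chunks repeat, which relates to the effect of sub-sequence length on compression that the abstract mentions.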
Sequencing Adventure Activities: A New Perspective.
ERIC Educational Resources Information Center
Bisson, Christian
Sequencing in adventure education involves putting activities in an order appropriate to the needs of the group. Contrary to the common assumption that each adventure sequence is unique, a review of literature concerning five sequencing models reveals a certain universality. These models present sequences that move through four phases: group…
The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika
2010-01-27
Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences is important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~30K unique sequences (UniSeqs) representing ~19K clusters were generated from ~98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66 percent of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases. Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is as diverse as the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.
Seegelke, Christian; Hughes, Charmayne M L
2015-12-01
It has been proposed that the preparation of goal-directed actions involves internal movement simulation, or motor imagery. Evidence suggests that motor imagery is critically involved in the prediction of action consequences and contributes heavily to movement planning processes. The present study examined whether the sensitivity towards end-state comfort and the possibility/impossibility to perform an action sequence are considered during motor imagery. Participants performed a mental rotation task in which two images were simultaneously presented. The image on the left depicted the start posture of a right hand when grasping a bar, while the right image depicted the hand posture at the end of the action sequence. The right image displayed the bar in a vertical orientation with the hand in a comfortable (thumb-up) or in an uncomfortable (thumb-down) posture, while the bar in the left image was rotated in the picture plane in steps of 45°. Crucially, the two images formed either a physically possible or a physically impossible action sequence. Results revealed strikingly different response time patterns for the two action sequence conditions. In general, response times increased almost monotonically with increasing angular disparity for the possible action sequences. However, slight deviations from this monotonicity were apparent when the sequences contained an uncomfortable as opposed to a comfortable final posture. In contrast, for the impossible sequences, response times did not follow a typical mental rotation function, but instead were uniformly very slow. These findings suggest that both biomechanical constraints (i.e., end-state comfort) and the awareness of the possibility/impossibility to perform an action sequence are considered during motor imagery. 
We conclude that motor representations contain information about the spatiotemporal movement organization and the possibility of performing an action, which are crucially involved in anticipation and planning of action sequences. Copyright © 2015 Elsevier Inc. All rights reserved.
Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan
2011-07-01
A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
Bastien, Géraldine; Arnal, Grégory; Bozonnet, Sophie; Laguerre, Sandrine; Ferreira, Fernando; Fauré, Régis; Henrissat, Bernard; Lefèvre, Fabrice; Robe, Patrick; Bouchez, Olivier; Noirot, Céline; Dumon, Claire; O'Donohue, Michael
2013-05-14
The metagenomic analysis of gut microbiomes has emerged as a powerful strategy for the identification of biomass-degrading enzymes, which will no doubt be useful for the development of advanced biorefining processes. In the present study, we have performed a functional metagenomic analysis on comb and gut microbiomes associated with the fungus-growing termite, Pseudacanthotermes militaris. Using whole termite abdomens and fungal-comb material respectively, two fosmid-based metagenomic libraries were created and screened for the presence of xylan-degrading enzymes. This revealed 101 positive clones, corresponding to an extremely high global hit rate of 0.49%. Many clones displayed either β-D-xylosidase (EC 3.2.1.37) or α-L-arabinofuranosidase (EC 3.2.1.55) activity, while others displayed the ability to degrade AZCL-xylan or AZCL-β-(1,3)-β-(1,4)-glucan. Using secondary screening it was possible to pinpoint clones of interest that were used to prepare fosmid DNA. Sequencing of fosmid DNA generated 1.46 Mbp of sequence data, and bioinformatics analysis revealed 63 sequences encoding putative carbohydrate-active enzymes, with many of these forming parts of sequence clusters, probably having carbohydrate degradation and metabolic functions. Taxonomic assignment of the different sequences revealed that Firmicutes and Bacteroidetes were predominant phyla in the gut sample, while microbial diversity in the comb sample resembled that of typical soil samples. Cloning and expression in E. coli of six enzyme candidates identified in the libraries provided access to individual enzyme activities, which all proved to be coherent with the primary and secondary functional screens. This study shows that the gut microbiome of P. militaris possesses the potential to degrade biomass components, such as arabinoxylans and arabinans. 
Moreover, the data presented suggest that prokaryotic microorganisms present in the comb could also play a part in the degradation of biomass within the termite mound, although further investigation will be needed to clarify the complex synergies that might exist between the different microbiomes that constitute the termitosphere of fungus-growing termites. This study exemplifies the power of functional metagenomics for the discovery of biomass-active enzymes and has provided a collection of potentially interesting biocatalysts for further study.
Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction
Laehnemann, David; Borkhardt, Arndt
2016-01-01
Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and the data types for which these hold, providing guidance on which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159
Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi
2014-09-18
Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. Using only the results of ordinary analyses such as BLAST and conservation analyses, it was hard to determine which structure a given sequence would adopt. However, by combining these analyses with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
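To make the 98% figure concrete, the following is a minimal sketch of how percent identity between two pre-aligned, equal-length sequences can be computed. The function name `percent_identity` and the two toy 50-residue sequences are hypothetical illustrations. They are not the GA and GB sequences from the study, and the abstract does not state how the authors computed identity.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two pre-aligned sequences match.

    Assumes the sequences are already aligned and of equal length
    (no gap handling); this is an illustrative simplification.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Hypothetical toy sequences: 50 residues, differing at one position.
toy_a = "A" * 49 + "L"
toy_b = "A" * 49 + "Y"
toy_identity = percent_identity(toy_a, toy_b)
print(f"{toy_identity:.1f}% identity")
```

In this toy case a single substitution in 50 residues leaves 49 of 50 positions matching, which illustrates how few differences separate sequences at this identity level.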
Pulseq: A rapid and hardware-independent pulse sequence prototyping framework.
Layton, Kelvin J; Kroboth, Stefan; Jia, Feng; Littin, Sebastian; Yu, Huijun; Leupold, Jochen; Nielsen, Jon-Fredrik; Stöcker, Tony; Zaitsev, Maxim
2017-04-01
Implementing new magnetic resonance experiments, or sequences, often involves extensive programming on vendor-specific platforms, which can be time consuming and costly. This situation is exacerbated when research sequences need to be implemented on several platforms simultaneously, for example, at different field strengths. This work presents an alternative programming environment that is hardware-independent, open-source, and promotes rapid sequence prototyping. A novel file format is described to efficiently store the hardware events and timing information required for an MR pulse sequence. Platform-dependent interpreter modules convert the file to appropriate instructions to run the sequence on MR hardware. Sequences can be designed in high-level languages, such as MATLAB, or with a graphical interface. Spin physics simulation tools are incorporated into the framework, allowing for comparison between real and virtual experiments. Minimal effort is required to implement relatively advanced sequences using the tools provided. Sequences are executed on three different MR platforms, demonstrating the flexibility of the approach. A high-level, flexible and hardware-independent approach to sequence programming is ideal for the rapid development of new sequences. The framework is currently not suitable for large patient studies or routine scanning although this would be possible with deeper integration into existing workflows. Magn Reson Med 77:1544-1552, 2017. © 2016 International Society for Magnetic Resonance in Medicine.
Paparo, M.; Benko, J. M.; Hareter, M.; ...
2016-05-11
In this study, a sequence search method was developed to search the regular frequency spacing in δ Scuti stars through visual inspection and an algorithmic search. We searched for sequences of quasi-equally spaced frequencies, containing at least four members per sequence, in 90 δ Scuti stars observed by CoRoT. We found an unexpectedly large number of independent series of regular frequency spacing in 77 δ Scuti stars (from one to eight sequences) in the non-asymptotic regime. We introduce the sequence search method presenting the sequences and echelle diagram of CoRoT 102675756 and the structure of the algorithmic search. Four sequences (echelle ridges) were found in the 5–21 d⁻¹ region where the pairs of the sequences are shifted (between 0.5 and 0.59 d⁻¹) by twice the value of the estimated rotational splitting frequency (0.269 d⁻¹). The general conclusions for the whole sample are also presented in this paper. The statistics of the spacings derived by the sequence search method, by FT (Fourier transform of the frequencies), and the statistics of the shifts are also compared. In many stars more than one almost equally valid spacing appeared. The model frequencies of FG Vir and their rotationally split components were used to formulate the possible explanation that one spacing is the large separation while the other is the sum of the large separation and the rotational frequency. In CoRoT 102675756, the two spacings (2.249 and 1.977 d⁻¹) are in better agreement with the sum of a possible 1.710 d⁻¹ large separation and two or one times, respectively, the value of the rotational frequency.
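The idea of searching for quasi-equally spaced frequencies, and the closing arithmetic on the two spacings, can be illustrated with a short sketch. This is not the authors' algorithm, which the abstract does not specify. It is a simple greedy chain search written under stated assumptions: the function `find_sequences`, its tolerance parameter, and the toy frequency list are all hypothetical. Only the numerical values 1.710, 0.269, 2.249 and 1.977 d⁻¹ and the four-member minimum come from the abstract.

```python
from typing import List


def find_sequences(freqs: List[float], spacing: float, tol: float,
                   min_len: int = 4) -> List[List[float]]:
    """Greedy search for chains of frequencies separated by ~`spacing`.

    Starting from each unused frequency, repeatedly step to the unused
    frequency closest to (current + spacing), accepting it only if it
    lies within `tol`. Chains shorter than `min_len` are discarded.
    """
    ordered = sorted(freqs)
    used = set()
    sequences = []
    for i in range(len(ordered)):
        if i in used:
            continue
        chain = [i]
        while True:
            target = ordered[chain[-1]] + spacing
            candidates = [j for j in range(len(ordered))
                          if j not in used and j not in chain
                          and abs(ordered[j] - target) <= tol]
            if not candidates:
                break
            chain.append(min(candidates,
                             key=lambda j: abs(ordered[j] - target)))
        if len(chain) >= min_len:
            used.update(chain)
            sequences.append([ordered[j] for j in chain])
    return sequences


# Hypothetical toy frequencies (d^-1), not data from the paper.
toy_freqs = [5.0, 6.1, 7.25, 9.5, 11.74, 13.0]
found = find_sequences(toy_freqs, spacing=2.249, tol=0.1)

# Arithmetic check using the values quoted in the abstract (d^-1).
large_separation = 1.710
rotational = 0.269
spacing_two_rot = large_separation + 2 * rotational  # compare with 2.249
spacing_one_rot = large_separation + rotational      # compare with 1.977
shift = 2 * rotational                               # quoted range: 0.5 to 0.59
```

With the quoted values, the two sums come to about 2.248 and 1.979 d⁻¹, within a few thousandths of the observed spacings, and twice the rotational splitting (about 0.538 d⁻¹) falls inside the quoted shift range.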
PCV2d-2 is the predominant type of PCV2 DNA in pig samples collected in the U.S. during 2014-2016.
Xiao, Chao-Ting; Harmon, Karen M; Halbur, Patrick G; Opriessnig, Tanja
2016-12-25
Porcine circovirus type 2 (PCV2) vaccination was introduced in the US in 2006 and since has been adopted by most pig producers. While porcine circovirus associated disease (PCVAD) outbreaks are now relatively uncommon in the US, PCV2 remains a concern which is emphasized by increasing numbers of PCR and sequencing requests for PCV2. In the present study, randomly selected lung tissues from 586 pigs submitted in 2015 were tested for presence of PCV2 DNA. Positive samples were further characterized by sequencing and combined with available PCV2 open-reading-frame (ORF) 2 sequences from the client database of the Iowa State University Veterinary Diagnostic Laboratory. The prevalence of PCV2 in the randomly selected lung tissues was 23% (135/586) with 11.3% PCV2a, 29% PCV2b and 71.8% for PCV2d subgroup PCV2d-2. A total of 455 ORF2 sequences obtained from 2014 through 2016 were analyzed and PCV2d accounted for 66.7% of the 2014 sequences, 71.8% of the 2015 sequences, and 72% of the 2016 sequences. Interestingly, only 1.9% (9/455) of the sequences belonged to the recently identified PCV2e genotype. The present data indicate that despite an almost 100% PCV2 vaccine coverage in the US, PCV2 DNA can still be detected in almost 1 of 4 randomly selected pig tissues. PCV2d-2 is now the predominant genotype in the USA suggesting that PCV2d-2 may have some advantage over PCV2a and PCV2b in its ability to replicate in pigs under vaccination pressure. Copyright © 2016. Published by Elsevier B.V.
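The two proportions quoted with explicit counts can be recomputed directly. The sketch below uses only the counts given in the abstract (135/586 and 9/455); the helper name `percent` is a hypothetical convenience, not something from the study.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


# Counts taken from the abstract.
pcv2_prevalence = percent(135, 586)  # PCV2-positive lung tissues
pcv2e_share = percent(9, 455)        # ORF2 sequences assigned to PCV2e

print(f"PCV2 prevalence: {pcv2_prevalence:.1f}%")
print(f"PCV2e share:     {pcv2e_share:.2f}%")
```

The first ratio comes to about 23.0%, matching the reported 23% and the "almost 1 of 4" summary. The second comes to roughly 1.98%, which the abstract reports as 1.9%.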