Sample records for sequencing reveals patterns

  1. EventThread: Visual Summarization and Stage Analysis of Event Sequence Data.

    PubMed

    Guo, Shunan; Xu, Ke; Zhao, Rongwen; Gotz, David; Zha, Hongyuan; Cao, Nan

    2018-01-01

    Event sequence data such as electronic health records, a person's academic records, or car service records, are ordered series of events which have occurred over a period of time. Analyzing collections of event sequences can reveal common or semantically important sequential patterns. For example, event sequence analysis might reveal frequently used care plans for treating a disease, typical publishing patterns of professors, and the patterns of service that result in a well-maintained car. It is challenging, however, to visually explore large numbers of event sequences, or sequences with large numbers of event types. Existing methods focus on extracting explicitly matching patterns of events using statistical analysis to create stages of event progression over time. However, these methods fail to capture latent clusters of similar but not identical evolutions of event sequences. In this paper, we introduce a novel visualization system named EventThread which clusters event sequences into threads based on tensor analysis and visualizes the latent stage categories and evolution patterns by interactively grouping the threads by similarity into time-specific clusters. We demonstrate the effectiveness of EventThread through usage scenarios in three different application domains and via interviews with an expert user.

  2. The role of consolidation in learning context-dependent phonotactic patterns in speech and digital sequence production.

    PubMed

    Anderson, Nathaniel D; Dell, Gary S

    2018-04-03

    Speakers implicitly learn novel phonotactic patterns by producing strings of syllables. The learning is revealed in their speech errors. First-order patterns, such as "/f/ must be a syllable onset," can be distinguished from contingent, or second-order, patterns, such as "/f/ must be an onset if the vowel is /a/, but a coda if the vowel is /o/." A meta-analysis of 19 experiments clearly demonstrated that first-order patterns affect speech errors to a very great extent in a single experimental session, but second-order vowel-contingent patterns only affect errors on the second day of testing, suggesting the need for a consolidation period. Two experiments tested an analogue to these studies involving sequences of button pushes, with fingers as "consonants" and thumbs as "vowels." The button-push errors revealed two of the key speech-error findings: first-order patterns are learned quickly, but second-order thumb-contingent patterns are only strongly revealed in the errors on the second day of testing. The influence of computational complexity on the implicit learning of phonotactic patterns in speech production may be a general feature of sequence production.

  3. Discovering weighted patterns in intron sequences using self-adaptive harmony search and back-propagation algorithms.

    PubMed

    Huang, Yin-Fu; Wang, Chia-Ming; Liou, Sing-Wu

    2013-01-01

    A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. Testing the weights under a lazy nearest neighbor classifier revealed the significance of these weighted patterns. A comparison with the popular intron consensus model shows that the discovered weighted patterns make the originally ambiguous 5SS and 3SS header patterns more specific and concrete.

  4. Discovering Weighted Patterns in Intron Sequences Using Self-Adaptive Harmony Search and Back-Propagation Algorithms

    PubMed Central

    Wang, Chia-Ming; Liou, Sing-Wu

    2013-01-01

    A hybrid self-adaptive harmony search and back-propagation mining system was proposed to discover weighted patterns in human intron sequences. Testing the weights under a lazy nearest neighbor classifier revealed the significance of these weighted patterns. A comparison with the popular intron consensus model shows that the discovered weighted patterns make the originally ambiguous 5SS and 3SS header patterns more specific and concrete. PMID:23737711

  5. Reduced representation bisulphite sequencing of the cattle genome reveals DNA methylation patterns

    USDA-ARS's Scientific Manuscript database

    Using reduced representation bisulphite sequencing (RRBS), we obtained the first single-base-resolution maps of bovine DNA methylation in ten somatic tissues. In total, we observed 1,868,049 cytosines in the CG-enriched regions. Similar to the methylation patterns in other species, the CG context wa...

  6. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  7. The Relationships between Navigational Patterns and Informational Processing Styles of Hypermedia Users.

    ERIC Educational Resources Information Center

    Lee, Mi Jar; Harvey, Francis A.

    This study investigated the relationships between hypermedia users' information processing styles and navigational patterns. Three aspects of navigational patterns were investigated: navigational depth patterns that reveal how comprehensively users access; navigational path patterns that display what sequences users follow; and navigational method…

  8. A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching.

    PubMed

    Romero, José R; Carballido, Jessica A; Garbus, Ingrid; Echenique, Viviana C; Ponzoni, Ignacio

    2016-01-01

    The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories: motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa, revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka.

  9. G-quadruplex prediction in E. coli genome reveals a conserved putative G-quadruplex-Hairpin-Duplex switch.

    PubMed

    Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman

    2016-11-02

    Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
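
    The abstract does not say which "generally accepted algorithm" it means. The sketch below assumes the widely used folding rule of four runs of at least three guanines separated by loops of one to seven nucleotides, and shows how the loop sequences, which that rule ignores, could be pulled out for later comparison. The pattern, the function name and the example sequence are illustrative assumptions, not the authors' code.

```python
import re

# Assumed motif: G{3+} N{1-7} G{3+} N{1-7} G{3+} N{1-7} G{3+}.
# The three capture groups hold the loop sequences between the G-runs.
G4_PATTERN = re.compile(
    r"G{3,}([ACGT]{1,7})G{3,}([ACGT]{1,7})G{3,}([ACGT]{1,7})G{3,}"
)


def find_putative_g4(sequence):
    """Return (start, end, loops) for each non-overlapping motif match.

    Only the given strand is scanned. Greedy matching yields one of
    possibly several valid ways to split a match into G-runs and loops.
    """
    hits = []
    for match in G4_PATTERN.finditer(sequence.upper()):
        hits.append((match.start(), match.end(), match.groups()))
    return hits


# Hypothetical example: a telomere-like repeat embedded in flanking bases.
hits = find_putative_g4("ACGGGTTAGGGTTAGGGTTAGGGTT")
print(hits)  # [(2, 23, ('TTA', 'TTA', 'TTA'))]
```

    The extracted loop tuples could then feed a distance measure and a clustering step; how the study actually compared loop sequences is not described in the abstract.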

  10. Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry.

    PubMed

    Asara, John M; Schweitzer, Mary H; Freimark, Lisa M; Phillips, Matthew; Cantley, Lewis C

    2007-04-13

    Fossilized bones from extinct taxa harbor the potential for obtaining protein or DNA sequences that could reveal evolutionary links to extant species. We used mass spectrometry to obtain protein sequences from bones of a 160,000- to 600,000-year-old extinct mastodon (Mammut americanum) and a 68-million-year-old dinosaur (Tyrannosaurus rex). The presence of T. rex sequences indicates that their peptide bonds were remarkably stable. Mass spectrometry can thus be used to determine unique sequences from ancient organisms from peptide fragmentation patterns, a valuable tool to study the evolution and adaptation of ancient taxa from which genomic sequences are unlikely to be obtained.

  11. Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing

    USDA-ARS's Scientific Manuscript database

    Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RN...

  12. Ancient DNA sequence revealed by error-correcting codes.

    PubMed

    Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

    2015-07-10

    A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.

  13. Ancient DNA sequence revealed by error-correcting codes

    PubMed Central

    Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

    2015-01-01

    A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228

  14. Analysis of noise-induced temporal correlations in neuronal spike sequences

    NASA Astrophysics Data System (ADS)

    Reinoso, José A.; Torrent, M. C.; Masoller, Cristina

    2016-11-01

    We investigate temporal correlations in sequences of noise-induced neuronal spikes, using a symbolic method of time-series analysis. We focus on the sequence of time-intervals between consecutive spikes (inter-spike-intervals, ISIs). The analysis method, known as ordinal analysis, transforms the ISI sequence into a sequence of ordinal patterns (OPs), which are defined in terms of the relative ordering of consecutive ISIs. The ISI sequences are obtained from extensive simulations of two neuron models (FitzHugh-Nagumo, FHN, and integrate-and-fire, IF), with correlated noise. We find that, as the noise strength increases, temporal order gradually emerges, revealed by the existence of more frequent ordinal patterns in the ISI sequence. While in the FHN model the most frequent OP depends on the noise strength, in the IF model it is independent of the noise strength. In both models, the correlation time of the noise affects the OP probabilities but does not modify the most probable pattern.
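
    The ordinal transformation described above can be sketched in a few lines. This is a minimal illustration that assumes patterns of length 3 and breaks ties by order of appearance; the function name and the example values are hypothetical, and the neuron-model simulations themselves are not reproduced.

```python
from collections import Counter
from itertools import permutations


def ordinal_pattern_probabilities(isis, length=3):
    """Estimate the probability of each ordinal pattern in an ISI sequence.

    Each window of `length` consecutive intervals is replaced by the
    permutation of indices that sorts it in ascending order. Ties are
    broken by order of appearance because sorted() is stable.
    """
    if len(isis) < length:
        raise ValueError("sequence is shorter than the pattern length")
    counts = Counter()
    for i in range(len(isis) - length + 1):
        window = isis[i:i + length]
        pattern = tuple(sorted(range(length), key=lambda k: window[k]))
        counts[pattern] += 1
    total = sum(counts.values())
    # Report every possible pattern, including those never observed.
    return {p: counts[p] / total for p in permutations(range(length))}


# Hypothetical ISIs: strictly increasing, so only pattern (0, 1, 2) occurs.
probs = ordinal_pattern_probabilities([1.0, 2.0, 3.0, 4.0])
print(probs[(0, 1, 2)])  # 1.0
```

    For an uncorrelated sequence all 3! = 6 patterns would be expected to be roughly equally probable, so deviations from 1/6 are what indicate temporal order.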

  15. Quantification of fetal heart rate regularity using symbolic dynamics

    NASA Astrophysics Data System (ADS)

    van Leeuwen, P.; Cysarz, D.; Lange, S.; Geue, D.; Groenemeyer, D.

    2007-03-01

    Fetal heart rate complexity was examined on the basis of RR interval time series obtained in the second and third trimester of pregnancy. In each fetal RR interval time series, short term beat-to-beat heart rate changes were coded in 8-bit binary sequences. Redundancies of the 2^8 different binary patterns were reduced by two different procedures. The complexity of these sequences was quantified using the approximate entropy (ApEn), resulting in discrete ApEn values which were used for classifying the sequences into 17 pattern sets. Also, the sequences were grouped into 20 pattern classes with respect to identity after rotation or inversion of the binary value. There was a specific, nonuniform distribution of the sequences in the pattern sets and this differed from the distribution found in surrogate data. In the course of gestation, the number of sequences increased in seven pattern sets, decreased in four and remained unchanged in six. Sequences that occurred less often over time, both regular and irregular, were characterized by patterns reflecting frequent beat-to-beat reversals in heart rate. They were also predominant in the surrogate data, suggesting that these patterns are associated with stochastic heart beat trains. Sequences that occurred more frequently over time were relatively rare in the surrogate data. Some of these sequences had a high degree of regularity and corresponded to prolonged heart rate accelerations or decelerations which may be associated with directed fetal activity or movement or baroreflex activity. Application of the pattern classes revealed that those sequences with a high degree of irregularity correspond to heart rate patterns resulting from complex physiological activity such as fetal breathing movements. The results suggest that the development of the autonomic nervous system and the emergence of fetal behavioral states lead to increases in not only irregular but also regular heart rate patterns. Using symbolic dynamics to examine the cardiovascular system may thus lead to new insight with respect to fetal development.
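
    The symbolic coding and the grouping into pattern classes can be illustrated as follows. The abstract does not give the exact coding rule, so this sketch assumes a bit is 1 when an RR interval is longer than the previous one, that words are taken from a sliding window, and that "inversion" means the bitwise complement; all function names are hypothetical. Under those assumptions, grouping all 256 possible 8-bit words by rotation and complement gives 20 classes, which agrees with the number reported.

```python
def rr_to_bits(rr_intervals):
    """Code each beat-to-beat change as 1 (interval lengthened) or 0 (otherwise)."""
    return [1 if b > a else 0 for a, b in zip(rr_intervals, rr_intervals[1:])]


def sliding_words(bits, width=8):
    """Pack every window of `width` consecutive bits into an integer word."""
    words = []
    for i in range(len(bits) - width + 1):
        value = 0
        for bit in bits[i:i + width]:
            value = (value << 1) | bit
        words.append(value)
    return words


def canonical_class(word, width=8):
    """Smallest word reachable by cyclic rotation and/or bitwise complement."""
    mask = (1 << width) - 1
    candidates = []
    for w in (word, ~word & mask):
        for r in range(width):
            candidates.append(((w << r) | (w >> (width - r))) & mask)
    return min(candidates)


# Count the equivalence classes over all 256 possible 8-bit words.
n_classes = len({canonical_class(w) for w in range(256)})
print(n_classes)  # 20
```

    The approximate entropy step that assigns words to the 17 pattern sets is not shown here.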

  16. Pneumocystis jirovecii multilocus genotyping profiles in patients from Portugal and Spain.

    PubMed

    Esteves, F; Montes-Cano, M A; de la Horra, C; Costa, M C; Calderón, E J; Antunes, F; Matos, O

    2008-04-01

    Pneumonia caused by the opportunistic organism Pneumocystis jirovecii is a clinically important infection affecting AIDS and other immunocompromised patients. The present study aimed to compare and characterise the frequency pattern of DNA sequences from the P. jirovecii mitochondrial large-subunit rRNA (mtLSU rRNA) gene, the dihydropteroate synthase (DHPS) gene and the internal transcribed spacer (ITS) regions of the nuclear rRNA operon in specimens from Lisbon (Portugal) and Seville (Spain). Total DNA was extracted and used for specific molecular sequence analysis of the three loci. In both populations, mtLSU rRNA gene analysis revealed an overall prevalence of genotype 1. In the Portuguese population, genotype 2 was the second most common, followed by genotype 3. Inversely, in the Spanish population, genotype 3 was the second most common, followed by genotype 2. The DHPS wild-type sequence was the genotype observed most frequently in both populations, and the DHPS genotype frequency pattern was identical to distribution patterns revealed in other European studies. ITS types showed a significant diversity in both populations because of the high sequence variability in these genomic regions. The most prevalent ITS type in the Portuguese population was Eg, followed by Cg. In contrast to other European studies, Bi was the most common ITS type in the Spanish samples, followed by Eg. A statistically significant association between mtLSU rRNA genotype 1 and ITS type Eg was revealed.

  17. CpG PatternFinder: a Windows-based utility program for easy and rapid identification of the CpG methylation status of DNA.

    PubMed

    Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C

    2007-09-01

    The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information; (ii) to tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment; (iii) to align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status; and (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware program available on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
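
    The core comparison behind steps (i) and (iii) can be sketched as below. This is not the CpG PatternFinder implementation: it assumes the sample read is already aligned to the reference at equal length and relies on the standard bisulfite logic that unmethylated cytosines read as T while methylated cytosines remain C. The function name and the example sequences are hypothetical.

```python
def cpg_methylation_status(reference, sample):
    """Classify each reference CpG and estimate bisulfite conversion efficiency.

    Returns (status_by_position, efficiency). Efficiency is the fraction of
    non-CpG cytosines read as T, or None if no such position was informative.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to the same length")
    ref, smp = reference.upper(), sample.upper()
    status = {}
    converted = informative = 0
    for i, base in enumerate(ref):
        if base != "C":
            continue
        is_cpg = i + 1 < len(ref) and ref[i + 1] == "G"
        if is_cpg:
            if smp[i] == "C":
                status[i] = "methylated"
            elif smp[i] == "T":
                status[i] = "unmethylated"
            else:
                status[i] = "undetermined"  # gap or sequencing error
        elif smp[i] in ("C", "T"):
            # Non-CpG cytosines are assumed unmethylated, so they should convert.
            informative += 1
            converted += smp[i] == "T"
    efficiency = converted / informative if informative else None
    return status, efficiency


# Hypothetical aligned pair: CpGs at positions 1 and 5, other Cs at 4 and 9.
status, efficiency = cpg_methylation_status("ACGTCCGATCA", "ACGTTTGATTA")
print(status)      # {1: 'methylated', 5: 'unmethylated'}
print(efficiency)  # 1.0
```

    Treating every non-CpG cytosine as unmethylated is a simplification that fits mammalian DNA better than plant DNA, where non-CpG methylation is common.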

  18. Levels of integration in cognitive control and sequence processing in the prefrontal cortex.

    PubMed

    Bahlmann, Jörg; Korb, Franziska M; Gratton, Caterina; Friederici, Angela D

    2012-01-01

    Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex.

  19. Levels of Integration in Cognitive Control and Sequence Processing in the Prefrontal Cortex

    PubMed Central

    Bahlmann, Jörg; Korb, Franziska M.; Gratton, Caterina; Friederici, Angela D.

    2012-01-01

    Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex. PMID:22952762

  20. Analysis of DNA methylation in Arabidopsis thaliana based on methylation-sensitive AFLP markers.

    PubMed

    Cervera, M T; Ruiz-García, L; Martínez-Zapater, J M

    2002-12-01

    AFLP analysis using restriction enzyme isoschizomers that differ in their sensitivity to methylation of their recognition sites has been used to analyse the methylation state of anonymous CCGG sequences in Arabidopsis thaliana. The technique was modified to improve the quality of fingerprints and to visualise larger numbers of scorable fragments. Sequencing of amplified fragments indicated that detection was generally associated with non-methylation of the cytosine to which the isoschizomer is sensitive. Comparison of EcoRI/HpaII and EcoRI/MspI patterns in different ecotypes revealed that 35-43% of CCGG sites were differentially digested by the isoschizomers. Interestingly, the pattern of digestion among different plants belonging to the same ecotype is highly conserved, with the rate of intra-ecotype methylation-sensitive polymorphisms being less than 1%. However, pairwise comparisons of methylation patterns between samples belonging to different ecotypes revealed differences in up to 34% of the methylation-sensitive polymorphisms. The lack of correlation between inter-ecotype similarity matrices based on methylation-insensitive or methylation-sensitive polymorphisms suggests that whatever the mechanisms regulating methylation may be, they are not related to nucleotide sequence variation.

  1. Analysis of the origin of predictability in human communications

    NASA Astrophysics Data System (ADS)

    Zhang, Lin; Liu, Yani; Wu, Ye; Xiao, Jinghua

    2014-01-01

    Human behavior in daily life can be traced through communications via electronic devices. E-mails, short messages, and cell-phone calls can be used to investigate the predictability of patterns in communication partners, because these three are the most representative and common forms of daily communication. In this paper, we show that all three channels exhibit clear predictability in partner patterns and that short-message users’ sequences have the highest predictability of the three. We also reveal that people with fewer communication partners have higher predictability. Finally, we investigate the origin of predictability, which has two sources. One is the intrinsic pattern in the partner sequence: people tend to communicate with one fixed partner after another fixed one. The other is the burst, that is, communicating with the same partner several times in a row. The high burst in short-message communication is one of the main reasons for its high predictability, the intrinsic pattern in the e-mail partner sequence is the main reason for its predictability, and the predictability of the cell-phone call partner sequence comes from both sources.
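
    The two sources of predictability named above can each be given a simple numerical proxy. The abstract does not state which measures the authors used, so the sketch below is an assumption: burst is taken as the fraction of consecutive contacts with the same partner, and the intrinsic pattern is summarized by the first-order conditional entropy of the next partner given the current one (lower means more predictable). Function names and data are hypothetical.

```python
from collections import Counter, defaultdict
from math import log2


def burst_fraction(partners):
    """Fraction of consecutive contacts that repeat the same partner."""
    pairs = list(zip(partners, partners[1:]))
    if not pairs:
        raise ValueError("need at least two contacts")
    return sum(a == b for a, b in pairs) / len(pairs)


def conditional_entropy(partners):
    """H(next partner | current partner) in bits, from transition counts."""
    transitions = defaultdict(Counter)
    for current, following in zip(partners, partners[1:]):
        transitions[current][following] += 1
    total = sum(sum(c.values()) for c in transitions.values())
    entropy = 0.0
    for counts in transitions.values():
        row_total = sum(counts.values())
        for k in counts.values():
            # Joint probability times log of the conditional probability.
            entropy -= (k / total) * log2(k / row_total)
    return entropy


# Hypothetical sequences: one bursty, one a strict cycle with no bursts.
bursty = ["a", "a", "a", "b", "b", "c"]
cyclic = ["a", "b", "c", "a", "b", "c"]
print(burst_fraction(bursty))       # 0.6
print(burst_fraction(cyclic))       # 0.0
print(conditional_entropy(cyclic))  # 0.0
```

    The cyclic sequence has no bursts yet is fully predictable from the previous partner, which is the distinction between the two sources that the abstract draws.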

  2. Reduced representation bisulphite sequencing of the ten bovine somatic tissues reveals DNA methylation patterns

    USDA-ARS's Scientific Manuscript database

    As a major component of epigenetics, DNA methylation has been shown to function widely in individual development and in various diseases. It has been well studied in model organisms and humans, but data for economically important animals are limited. Using reduced representation bisulphite sequencing (RRBS),...

  3. Kangaroo – A pattern-matching program for biological sequences

    PubMed Central

    2002-01-01

    Background: Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily available query tools to perform simple searches, such as identifying transcription binding sites, protein motifs, or repetitive DNA sequences. However, in many cases simple pattern-matching searches can reveal a wealth of information. We present in this paper a regular expression pattern-matching tool that was used to identify short repetitive DNA sequences in human coding regions for the purpose of identifying potential mutation sites in mismatch repair deficient cells. Results: Kangaroo is a web-based regular expression pattern-matching program that can search for patterns in DNA, protein, or coding region sequences in ten different organisms. The program is implemented to facilitate a wide range of queries with no restriction on the length or complexity of the query expression. The program is accessible on the web at http://bioinfo.mshri.on.ca/kangaroo/ and the source code is freely distributed at http://sourceforge.net/projects/slritools/. Conclusion: A low-level simple pattern-matching application can prove to be a useful tool in many research settings. For example, Kangaroo was used to identify potential genetic targets in a human colorectal cancer variant that is characterized by a high frequency of mutations in coding regions containing mononucleotide repeats. PMID:12150718
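
    The kind of query described above, finding mononucleotide repeats in a coding sequence, can be expressed with an ordinary regular expression. The sketch below uses Python's standard re module rather than Kangaroo itself; the minimum run length of 8 and the example sequence are assumptions chosen only for illustration.

```python
import re


def mononucleotide_repeats(sequence, min_length=8):
    """Find runs of a single nucleotide at least `min_length` bases long.

    Returns a list of (start, base, run_length) tuples.
    """
    if min_length < 2:
        raise ValueError("min_length must be at least 2")
    # ([ACGT]) captures one base; \1{n,} requires it to repeat n more times.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_length - 1))
    return [
        (m.start(), m.group(1), len(m.group()))
        for m in pattern.finditer(sequence.upper())
    ]


# Hypothetical coding fragment with an A8 run and a T10 run.
fragment = "ATG" + "A" * 8 + "GCCC" + "T" * 10 + "GA"
runs = mononucleotide_repeats(fragment)
print(runs)  # [(3, 'A', 8), (15, 'T', 10)]
```

    The backreference keeps the expression short for any base; an explicit alternation such as A{8,}|C{8,}|G{8,}|T{8,} would match the same runs.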

  4. Evaluating, Comparing, and Interpreting Protein Domain Hierarchies

    PubMed Central

    2014-01-01

    Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108

  5. Complete mitochondrial genome sequences of three bats species and whole genome mitochondrial analyses reveal patterns of codon bias and lend support to a basal split in Chiroptera.

    PubMed

    Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A

    2012-01-15

    Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.

  6. SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

    PubMed

    Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

    2016-06-15

    Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. 
    Contact: yasu@bio.keio.ac.jp. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  7. Plastome Sequences of Lygodium japonicum and Marsilea crenata Reveal the Genome Organization Transformation from Basal Ferns to Core Leptosporangiates

    PubMed Central

    Gao, Lei; Wang, Bo; Wang, Zhi-Wei; Zhou, Yuan; Su, Ying-Juan; Wang, Ting

    2013-01-01

    Previous studies have shown that core leptosporangiates, the most species-rich group of extant ferns (monilophytes), have a distinct plastid genome (plastome) organization pattern from basal fern lineages. However, the details of genome structure transformation from ancestral ferns to core leptosporangiates remain unclear because of limited plastome data available. Here, we have determined the complete chloroplast genome sequences of Lygodium japonicum (Lygodiaceae), a member of schizaeoid ferns (Schizaeales), and Marsilea crenata (Marsileaceae), a representative of heterosporous ferns (Salviniales). The two species represent the sister and the basal lineages of core leptosporangiates, respectively, for which the plastome sequences are currently unavailable. Comparative genomic analysis of all sequenced fern plastomes reveals that the gene order of L. japonicum plastome occupies an intermediate position between that of basal ferns and core leptosporangiates. The two exons of the fern ndhB gene have a unique pattern of intragenic copy number variances. Specifically, the substitution rate heterogeneity between the two exons is congruent with their copy number changes, confirming the constraint role that inverted repeats may play on the substitution rate of chloroplast gene sequences. PMID:23821521

  8. TP53 Mutation Status of Tubo-ovarian and Peritoneal High-grade Serous Carcinoma with a Wild-type p53 Immunostaining Pattern.

    PubMed

    Na, Kiyong; Sung, Ji-Youn; Kim, Hyun-Soo

    2017-12-01

    Diffuse and strong nuclear p53 immunoreactivity and a complete lack of p53 expression are regarded as indicative of missense and nonsense mutations, respectively, of the TP53 gene. Tubo-ovarian and peritoneal high-grade serous carcinoma (HGSC) is characterized by aberrant p53 expression induced by a TP53 mutation. However, our experience with some HGSC cases with a wild-type p53 immunostaining pattern led us to comprehensively review previous cases and investigate the TP53 mutational status of the exceptional cases. We analyzed the immunophenotype of 153 cases of HGSC and performed TP53 gene sequencing analysis in those with a wild-type p53 immunostaining pattern. Immunostaining revealed that 109 (71.3%) cases displayed diffuse and strong p53 expression (missense mutation pattern), while 39 (25.5%) had no p53 expression (nonsense mutation pattern). The remaining five cases of HGSC showed a wild-type p53 immunostaining pattern. Direct sequencing analysis revealed that three of these cases harbored nonsense TP53 mutations and two had novel splice site deletions. TP53 mutation is almost invariably present in HGSC, and p53 immunostaining can be used as a surrogate marker of TP53 mutation. In cases with a wild-type p53 immunostaining pattern, direct sequencing for TP53 mutational status can be helpful to confirm the presence of a TP53 mutation. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  9. Measuring patterns in team interaction sequences using a discrete recurrence approach.

    PubMed

    Gorman, Jamie C; Cooke, Nancy J; Amazeen, Polemnia G; Fouse, Shannon

    2012-08-01

    Recurrence-based measures of communication determinism and pattern information are described and validated using previously collected team interaction data. Team coordination dynamics has revealed that "mixing" team membership can lead to flexible interaction processes, but keeping a team "intact" can lead to rigid interaction processes. We hypothesized that communication of intact teams would have greater determinism and higher pattern information compared to that of mixed teams. Determinism and pattern information were measured from three-person Uninhabited Air Vehicle team communication sequences over a series of 40-minute missions. Because team members communicated using push-to-talk buttons, communication sequences were automatically generated during each mission. The Composition x Mission determinism effect was significant. Intact teams' determinism increased over missions, whereas mixed teams' determinism did not change. Intact teams had significantly higher maximum pattern information than mixed teams. Results from these new communication analysis methods converge with content-based methods and support our hypotheses. Because they are not content based, and because they are automatic and fast, these new methods may be amenable to real-time communication pattern analysis.

  10. Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.

    PubMed

    Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong

    2015-06-09

    Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seemingly identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats, consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.

  11. Mitochondrial DNA analyses reveal low genetic diversity in Culex quinquefasciatus from residential areas in Malaysia.

    PubMed

    Low, V L; Lim, P E; Chen, C D; Lim, Y A L; Tan, T K; Norma-Rashid, Y; Lee, H L; Sofian-Azirun, M

    2014-06-01

    The present study explored the intraspecific genetic diversity, dispersal patterns and phylogeographic relationships of Culex quinquefasciatus Say (Diptera: Culicidae) in Malaysia using reference data available in GenBank in order to reveal this species' phylogenetic relationships. A statistical parsimony network of 70 taxa aligned as 624 characters of the cytochrome c oxidase subunit I (COI) gene and 685 characters of the cytochrome c oxidase subunit II (COII) gene revealed three haplotypes (A1-A3) and four haplotypes (B1-B4), respectively. The concatenated sequences of both COI and COII genes with a total of 1309 characters revealed seven haplotypes (AB1-AB7). Analysis using TCS indicated that haplotype AB1 was the common ancestor and the most widespread haplotype in Malaysia. The genetic distance based on concatenated sequences of both COI and COII genes ranged from 0.00076 to 0.00229. Sequence alignment of Cx. quinquefasciatus from Malaysia and other countries revealed four haplotypes (AA1-AA4) by the COI gene and nine haplotypes (BB1-BB9) by the COII gene. Phylogenetic analyses demonstrated that Malaysian Cx. quinquefasciatus share the same genetic lineage as East African and Asian Cx. quinquefasciatus. This study has inferred the genetic lineages, dispersal patterns and hypothetical ancestral genotypes of Cx. quinquefasciatus. © 2013 The Royal Entomological Society.
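    To make the haplotype and distance figures above more concrete, here is a minimal sketch of collapsing aligned sequences into haplotypes and computing an uncorrected pairwise distance (p-distance). The abstract does not state which distance model was used, so p-distance is an assumption, and the toy sequences and function names are hypothetical. Under that assumption, the reported range is consistent with one to three differing sites across the 1309 concatenated characters.

```python
def p_distance(a, b):
    """Proportion of aligned sites that differ (uncorrected p-distance)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)

def collapse_haplotypes(aligned):
    """Group sample names by identical aligned sequence."""
    haplotypes = {}
    for name, seq in aligned.items():
        haplotypes.setdefault(seq, []).append(name)
    return haplotypes

# Made-up toy alignment, not the COI/COII data from the study.
toy = {"s1": "ACGTACGT", "s2": "ACGTACGT", "s3": "ACGTACGA", "s4": "ACCTACGA"}
haps = collapse_haplotypes(toy)
print(len(haps), "haplotypes")

# Arithmetic check against the reported range, assuming p-distance:
min_d = 1 / 1309  # one differing site
max_d = 3 / 1309  # three differing sites
print(round(min_d, 5), round(max_d, 5))
```

    If a corrected substitution model was used instead, the values would differ slightly, although at such small distances the correction is negligible.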

  12. Multilocus sequence analysis of Thermoanaerobacter isolates reveals recombining, but differentiated, populations from geothermal springs of the Uzon Caldera, Kamchatka, Russia

    PubMed Central

    Wagner, Isaac D.; Varghese, Litty B.; Hemme, Christopher L.; Wiegel, Juergen

    2013-01-01

    Thermal environments have island-like characteristics and provide a unique opportunity to study population structure and diversity patterns of microbial taxa inhabiting these sites. Strains having ≥98% 16S rRNA gene sequence similarity to the obligately anaerobic Firmicutes Thermoanaerobacter uzonensis were isolated from seven geothermal springs, separated by up to 1600 m, within the Uzon Caldera (Kamchatka, Russian Far East). The intraspecies variation and spatial patterns of diversity for this taxon were assessed by multilocus sequence analysis (MLSA) of 106 strains. Analysis of eight protein-coding loci (gyrB, lepA, leuS, pyrG, recA, recG, rplB, and rpoB) revealed that all loci were polymorphic and that nucleotide substitutions were mostly synonymous. There were 148 variable nucleotide sites across 8003 bp concatenates of the protein-coding loci. While pairwise FST values indicated a small but significant level of genetic differentiation between most subpopulations, there was a negligible relationship between genetic divergence and spatial separation. Strains with the same allelic profile were only isolated from the same hot spring, occasionally from consecutive years, and single locus variant (SLV) sequence types were usually derived from the same spring. While recombination occurred, there was an “epidemic” population structure in which a particular T. uzonensis sequence type rose in frequency relative to the rest of the population. These results demonstrate spatial diversity patterns for an anaerobic bacterial species in a relatively small geographic location and reinforce the view that terrestrial geothermal springs are excellent places to look for biogeographic diversity patterns regardless of the involved distances. PMID:23801987

  13. Comparative whole genome DNA methylation profiling of cattle sperm and somatic tissues reveals striking hypomethylated patterns in sperm

    USDA-ARS's Scientific Manuscript database

    Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperm through comparison with three bovine somatic tissues (mammary gland, brain and blood). Large differences between them were observed in the methylation patterns of global CpGs, pericentromeric satellites, p...

  14. Endophyte Microbiome Diversity in Micropropagated Atriplex canescens and Atriplex torreyi var griffithsii

    PubMed Central

    Lucero, Mary E.; Unc, Adrian; Cooke, Peter; Dowd, Scot; Sun, Shulei

    2011-01-01

    Microbial diversity associated with micropropagated Atriplex species was assessed using microscopy, isolate culturing, and sequencing. Light, electron, and confocal microscopy revealed microbial cells in aseptically regenerated leaves and roots. Clone libraries and tag-encoded FLX amplicon pyrosequencing (TEFAP) analysis amplified sequences from callus homologous to diverse fungal and bacterial taxa. Culturing isolated some seed borne endophyte taxa which could be readily propagated apart from the host. Microbial cells were observed within biofilm-like residues associated with plant cell surfaces and intercellular spaces. Various universal primers amplified both plant and microbial sequences, with different primers revealing different patterns of fungal diversity. Bacterial and fungal TEFAP followed by alignment with sequences from curated databases revealed 7 bacterial and 17 ascomycete taxa in A. canescens, and 5 bacterial taxa in A. torreyi. Additional diversity was observed among isolates and clone libraries. Micropropagated Atriplex retains a complex, intimately associated microbiome which includes diverse strains well poised to interact in manners that influence host physiology. Microbiome analysis was facilitated by high throughput sequencing methods, but primer biases continue to limit recovery of diverse sequences from even moderately complex communities. PMID:21437280

  15. Frequency of EBV LMP-1 Promoter and Coding Variations in Burkitt Lymphoma Samples in Africa and South America and Peripheral Blood in Uganda.

    PubMed

    Liao, Hsiao-Mei; Liu, Hebing; Lei, Heiyan; Li, Bingjie; Chin, Pei-Ju; Tsai, Shien; Bhatia, Kishor; Gutierrez, Marina; Epelman, Sidnei; Biggar, Robert J; Nkrumah, Francis; Neequaye, Janet; Ogwang, Martin D; Reynolds, Steven J; Lo, Shyh-Ching; Mbulaiteye, Sam M

    2018-06-02

    Epstein-Barr virus (EBV) is linked to several cancers, including endemic Burkitt lymphoma (eBL), but causal variants are unknown. We recently reported novel sequence variants in the LMP-1 gene and promoter in EBV genomes sequenced from 13 of 14 BL biopsies. Alignments of the novel sequence variants for 114 published EBV genomes, including 27 from BL cases, revealed four LMP-1 variant patterns, designated A to D. Pattern A variant was found in 48% of BL EBV genomes. Here, we used PCR-Sanger sequencing to evaluate 50 additional BL biopsies from Ghana, Brazil, and Argentina, and peripheral blood samples from 113 eBL cases and 115 controls in Uganda. Pattern A was found in 60.9% of 64 BL biopsies evaluated. Compared to PCR-negative subjects in Uganda, detection of Pattern A in peripheral blood was associated with eBL case status (odds ratio [OR] 31.7, 95% confidence interval: 6.8–149), controlling for relevant confounders. Variant Pattern A and Pattern D were associated with eBL case status, but with lower ORs (9.7 and 13.6, respectively). Our results support the hypothesis that EBV LMP-1 Pattern A may be associated with eBL, but it is not the sole associated variant. Further research is needed to replicate and elucidate our findings.

  16. Global Diversity of Desert Hypolithic Cyanobacteria.

    PubMed

    Lacap-Bugler, Donnabella C; Lee, Kevin K; Archer, Stephen; Gillman, Len N; Lau, Maggie C Y; Leuzinger, Sebastian; Lee, Charles K; Maki, Teruya; McKay, Christopher P; Perrott, John K; de Los Rios-Murillo, Asunción; Warren-Rhodes, Kimberley A; Hopkins, David W; Pointing, Stephen B

    2017-01-01

    Global patterns in diversity were estimated for cyanobacteria-dominated hypolithic communities that colonize ventral surfaces of quartz stones and are common in desert environments. A total of 64 hypolithic communities were recovered from deserts on every continent plus a tropical moisture sufficient location. Community diversity was estimated using a combined t-RFLP fingerprinting and high throughput sequencing approach. The t-RFLP analysis revealed desert communities were different from the single non-desert location. A striking pattern also emerged where Antarctic desert communities were clearly distinct from all other deserts. Some overlap in community similarity occurred for hot, cold and tundra deserts. A further observation was that the producer-consumer ratio displayed a significant negative correlation with growing season, such that shorter growing seasons supported communities with greater abundance of producers, and this pattern was independent of macroclimate. High-throughput sequencing of 16S rRNA and nifH genes from four representative samples validated the t-RFLP study and revealed patterns of taxonomic and putative diazotrophic diversity for desert communities from the Taklimakan Desert, Tibetan Plateau, Canadian Arctic and Antarctic. All communities were dominated by cyanobacteria and among these 21 taxa were potentially endemic to any given desert location. Some others occurred in all but the most extreme hot and polar deserts suggesting they were relatively less well adapted to environmental stress. The t-RFLP and sequencing data revealed the two most abundant cyanobacterial taxa were Phormidium in Antarctic and Tibetan deserts and Chroococcidiopsis in hot and cold deserts. The Arctic tundra displayed a more heterogenous cyanobacterial assemblage and this was attributed to the maritime-influenced sampling location.
    The most abundant heterotrophic taxa were ubiquitous among samples and belonged to the Acidobacteria, Actinobacteria, Bacteroidetes, and Proteobacteria. Sequencing using nitrogenase gene-specific primers revealed all putative diazotrophs were Proteobacteria of the orders Burkholderiales, Rhizobiales, and Rhodospirillales. We envisage cyanobacterial carbon input to the system is accompanied by nitrogen fixation largely from non-cyanobacterial taxa. Overall the results indicate desert hypoliths worldwide are dominated by cyanobacteria and that growing season is a useful predictor of their abundance. Differences in cyanobacterial taxa encountered may reflect their adaptation to different moisture availability regimes in polar and non-polar deserts.

  17. Global Diversity of Desert Hypolithic Cyanobacteria

    PubMed Central

    Lacap-Bugler, Donnabella C.; Lee, Kevin K.; Archer, Stephen; Gillman, Len N.; Lau, Maggie C.Y.; Leuzinger, Sebastian; Lee, Charles K.; Maki, Teruya; McKay, Christopher P.; Perrott, John K.; de los Rios-Murillo, Asunción; Warren-Rhodes, Kimberley A.; Hopkins, David W.; Pointing, Stephen B.

    2017-01-01

    Global patterns in diversity were estimated for cyanobacteria-dominated hypolithic communities that colonize ventral surfaces of quartz stones and are common in desert environments. A total of 64 hypolithic communities were recovered from deserts on every continent plus a tropical moisture sufficient location. Community diversity was estimated using a combined t-RFLP fingerprinting and high throughput sequencing approach. The t-RFLP analysis revealed desert communities were different from the single non-desert location. A striking pattern also emerged where Antarctic desert communities were clearly distinct from all other deserts. Some overlap in community similarity occurred for hot, cold and tundra deserts. A further observation was that the producer-consumer ratio displayed a significant negative correlation with growing season, such that shorter growing seasons supported communities with greater abundance of producers, and this pattern was independent of macroclimate. High-throughput sequencing of 16S rRNA and nifH genes from four representative samples validated the t-RFLP study and revealed patterns of taxonomic and putative diazotrophic diversity for desert communities from the Taklimakan Desert, Tibetan Plateau, Canadian Arctic and Antarctic. All communities were dominated by cyanobacteria and among these 21 taxa were potentially endemic to any given desert location. Some others occurred in all but the most extreme hot and polar deserts suggesting they were relatively less well adapted to environmental stress. The t-RFLP and sequencing data revealed the two most abundant cyanobacterial taxa were Phormidium in Antarctic and Tibetan deserts and Chroococcidiopsis in hot and cold deserts. The Arctic tundra displayed a more heterogenous cyanobacterial assemblage and this was attributed to the maritime-influenced sampling location. 
    The most abundant heterotrophic taxa were ubiquitous among samples and belonged to the Acidobacteria, Actinobacteria, Bacteroidetes, and Proteobacteria. Sequencing using nitrogenase gene-specific primers revealed all putative diazotrophs were Proteobacteria of the orders Burkholderiales, Rhizobiales, and Rhodospirillales. We envisage cyanobacterial carbon input to the system is accompanied by nitrogen fixation largely from non-cyanobacterial taxa. Overall the results indicate desert hypoliths worldwide are dominated by cyanobacteria and that growing season is a useful predictor of their abundance. Differences in cyanobacterial taxa encountered may reflect their adaptation to different moisture availability regimes in polar and non-polar deserts. PMID:28559886

  18. Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna

    Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.

  19. Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

    DOE PAGES

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; ...

    2015-04-09

    Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.

  20. Molecular Networking and Pattern-Based Genome Mining Improves discovery of biosynthetic gene clusters and their products from Salinispora species

    PubMed Central

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; Sarkar, Anindita; Li, Jie; Ziemert, Nadine; Wang, Mingxun; Bandeira, Nuno; Moore, Bradley S.; Dorrestein, Pieter C.; Jensen, Paul R.

    2015-01-01

    Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. Here we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. These efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches. PMID:25865308

  1. Sequence of the tomato chloroplast DNA and evolutionary comparison of solanaceous plastid genomes.

    PubMed

    Kahlau, Sabine; Aspinall, Sue; Gray, John C; Bock, Ralph

    2006-08-01

    Tomato, Solanum lycopersicum (formerly Lycopersicon esculentum), has long been one of the classical model species of plant genetics. More recently, solanaceous species have become a model of evolutionary genomics, with several EST projects and a tomato genome project having been initiated. As a first contribution toward deciphering the genetic information of tomato, we present here the complete sequence of the tomato chloroplast genome (plastome). The size of this circular genome is 155,461 base pairs (bp), with an average AT content of 62.14%. It contains 114 genes and conserved open reading frames (ycfs). Comparison with the previously sequenced plastid DNAs of Nicotiana tabacum and Atropa belladonna reveals patterns of plastid genome evolution in the Solanaceae family and identifies varying degrees of conservation of individual plastid genes. In addition, we discovered several new sites of RNA editing by cytidine-to-uridine conversion. A detailed comparison of editing patterns in the three solanaceous species highlights the dynamics of RNA editing site evolution in chloroplasts. To assess the level of intraspecific plastome variation in tomato, the plastome of a second tomato cultivar was sequenced. Comparison of the two genotypes (IPA-6, bred in South America, and Ailsa Craig, bred in Europe) revealed no nucleotide differences, suggesting that the plastomes of modern tomato cultivars display very little, if any, sequence variation.

  2. Capturing the Temporal Sequence of Interaction in Young Siblings

    PubMed Central

    Steele, Fiona; Jenkins, Jennifer

    2015-01-01

    We explored whether young children exhibit subtypes of behavioral sequences during sibling interaction. Ten-minute, free-play observations of over 300 sibling dyads were coded for positivity, negativity and disengagement. The data were analyzed using growth mixture modeling (GMM). Younger (18-month-old) children’s temporal behavioral sequences showed a harmonious (53%) and a casual (47%) class. Older (approximately four-year-old) children’s behavior was more differentiated revealing a harmonious (25%), a deteriorating (31%), a recovery (22%) and a casual (22%) class. A more positive maternal affective climate was associated with more positive patterns. Siblings’ sequential behavioral patterns tended to be complementary rather than reciprocal in nature. The study illustrates a novel use of GMM and makes a theoretical contribution by showing that young children exhibit distinct types of temporal behavioral sequences that are related to parenting processes. PMID:25996957

  3. A Case-by-Case Evolutionary Analysis of Four Imprinted Retrogenes

    PubMed Central

    McCole, Ruth B; Loughran, Noeleen B; Chahal, Mandeep; Fernandes, Luis P; Roberts, Roland G; Fraternali, Franca; O'Connell, Mary J; Oakey, Rebecca J

    2011-01-01

    Retroposition is a widespread phenomenon resulting in the generation of new genes that are initially related to a parent gene via very high coding sequence similarity. We examine the evolutionary fate of four retrogenes generated by such an event: mouse Inpp5f_v2, Mcts2, Nap1l5, and U2af1-rs1. These genes are all subject to the epigenetic phenomenon of parental imprinting. We first provide new data on the age of these retrogene insertions. Using codon-based models of sequence evolution, we show these retrogenes have diverse evolutionary trajectories, including divergence from the parent coding sequence under positive selection pressure, purifying selection pressure maintaining parent-retrogene similarity, and neutral evolution. Examination of the expression pattern of retrogenes shows an atypical, broad pattern across multiple tissues. Protein 3D structure modeling reveals that a positively selected residue in U2af1-rs1, not shared by its parent, may influence protein conformation. Our case-by-case analysis of the evolution of four imprinted retrogenes reveals that this interesting class of imprinted genes, while similar in regulation and sequence characteristics, follows very varied evolutionary paths. PMID:21166792

  4. Partial bisulfite conversion for unique template sequencing

    PubMed Central

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

    2018-01-01

    We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135,000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
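    The idea of clustering reads by conversion pattern can be sketched as follows. This is a simplified illustration, not the muSeq software: it assumes reads that are already aligned to the reference over the same coordinates with no indels or sequencing errors, it considers only C-to-T conversions, and all names and sequences are hypothetical.

```python
from collections import defaultdict

def conversion_signature(reference, read):
    """Positions where the reference has C and the read shows T."""
    return tuple(
        i for i, (ref_base, read_base) in enumerate(zip(reference, read))
        if ref_base == "C" and read_base == "T"
    )

def cluster_by_signature(reference, reads):
    """Group read names by identical C-to-T conversion signature."""
    clusters = defaultdict(list)
    for name, seq in reads.items():
        clusters[conversion_signature(reference, seq)].append(name)
    return dict(clusters)

# Made-up toy data: three templates, two of them read twice.
reference = "ACCGTCAC"
reads = {
    "r1": "ATCGTCAC",  # conversion at position 1
    "r2": "ATCGTCAC",  # same signature as r1
    "r3": "ACCGTTAT",  # conversions at positions 5 and 7
    "r4": "ACCGTTAT",  # same signature as r3
    "r5": "ACTGTCAC",  # conversion at position 2
}

clusters = cluster_by_signature(reference, reads)
print(len(clusters), "inferred templates")
```

    In this toy case five reads collapse to three inferred templates, which is the sense in which a shared signature supports template counting. Handling partially overlapping reads, G-to-A conversions on the opposite strand, and sequencing error would need considerably more logic.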

  5. Metatranscriptome sequence analysis reveals diel periodicity of microbial community gene expression in the ocean's interior

    NASA Astrophysics Data System (ADS)

    Vislova, A.; Aylward, F.; Sosa, O.; DeLong, E.

    2016-02-01

    Previous work has revealed diel periodicity of gene expression in key metabolic pathways in both autotrophic and heterotrophic microbes in the surface ocean. In this study, we investigated patterns of diel periodicity of gene expression in depth profiles (25, 75, 125 and 250 meters). We postulated that microbial diel transcriptional signals would be increasingly dampened with depth, and that the timing of peak expression of specific transcripts would be shifted in time between depths, in accordance with depth-dependent diel light variability. Bacterioplankton were sampled from four depths every four hours at station ALOHA (22° 45' N 158° W) over 2 days. RNA was extracted from cells preserved on filters, converted to cDNA, and sequenced on the Illumina platform. Surprisingly, harmonic regression analysis revealed an increasing proportion of genes with diel periodic expression patterns with increasing depth between 25 and 125 meters. At 250 meters, the proportion of genes exhibiting diel expression patterns decreased by an order of magnitude compared to the photic zone. Community composition, functional gene categories, and diel patterns of gene expression were significantly different between the photic zone and 250 meter samples. The signals driving diel periodic gene expression in microbes at 250 meters are under further investigation. These data are now beginning to provide a better understanding of the tempo and mode of microbial dynamics among specific taxa, throughout the ocean's interior.
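
    As a rough illustration of what a harmonic regression does, the Python sketch below fits a single 24-hour cosine to a time series by ordinary least squares and reports the mean level, amplitude and time of peak expression. It is only a sketch under stated assumptions: the function names and the synthetic signal are hypothetical, the sampling grid (every four hours over two days) mirrors the abstract, and the significance test used to call a gene "periodic" is not reproduced here.

```python
import math


def solve_linear(matrix, rhs):
    """Solve a small dense linear system by Gauss-Jordan elimination."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        # Partial pivoting: bring the largest entry in this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_diel_harmonic(times_h, expression, period_h=24.0):
    """Least-squares fit of y = m + a*cos(wt) + b*sin(wt) with w = 2*pi/period."""
    omega = 2.0 * math.pi / period_h
    rows = [[1.0, math.cos(omega * t), math.sin(omega * t)] for t in times_h]
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, expression)) for i in range(3)]
    mean, a, b = solve_linear(xtx, xty)
    amplitude = math.hypot(a, b)
    # a*cos(wt) + b*sin(wt) = A*cos(wt - phi), so the peak falls at t = phi / w.
    peak_time_h = (math.atan2(b, a) / omega) % period_h
    return mean, amplitude, peak_time_h


# Synthetic transcript (hypothetical): mean 10, amplitude 3, peak at hour 6.
times = [4.0 * k for k in range(12)]  # every 4 h over 2 days
signal = [10.0 + 3.0 * math.cos(2.0 * math.pi * (t - 6.0) / 24.0) for t in times]
mean, amplitude, peak_time_h = fit_diel_harmonic(times, signal)
```

    Comparing the fitted peak times of the same transcript at different depths is one way the hypothesized depth-dependent phase shift could be examined.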

  6. Cloning and sequence analysis of Hemonchus contortus HC58cDNA.

    PubMed

    Muleke, Charles I; Ruofeng, Yan; Lixin, Xu; Xinwen, Bo; Xiangrui, Li

    2007-06-01

    The complete coding sequence of Hemonchus contortus HC58cDNA was generated by rapid amplification of cDNA ends and polymerase chain reaction using primers based on the 5' and 3' ends of the parasite mRNA, accession no. AF305964. The HC58cDNA gene was 851 bp long, with an open reading frame of 717 bp encoding a 239-amino-acid precursor of an approximately 27 kDa protein. Analysis of the amino acid sequence revealed conserved residues of cysteine, histidine, asparagine, occluding loop pattern, hemoglobinase motif and glutamine of the oxyanion hole characteristic of cathepsin B like proteases (CBL). Comparison of the predicted amino acid sequences showed the protein shared 33.5-58.7% identity to cathepsin B homologues in the papain clan CA family (family C1). Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the CBL, suggesting that HC58cDNA was a member of the papain family.

  7. Molecular phylogeny of Coxsackievirus A16 in Shenzhen, China, from 2005 to 2009.

    PubMed

    Zong, Wenping; He, Yaqing; Yu, Shouyi; Yang, Hong; Xian, Huixia; Liao, Yuxue; Hu, Guifang

    2011-04-01

    Phylogenetic analysis of a Coxsackievirus A16 (CA16) sequence from Shenzhen, China, and other Chinese and international CA16 sequences revealed a pattern of endemic cocirculation of strains of clusters B2a and B2b within subtype B2 viruses. Amino acid evolution and nucleotide variation in the VP1 region were slight for 5 years.

  8. Sequence Dependencies of DNA Deformability and Hydration in the Minor Groove

    PubMed Central

    Yonetani, Yoshiteru; Kono, Hidetoshi

    2009-01-01

    DNA deformability and hydration are both sequence-dependent and are essential in specific DNA sequence recognition by proteins. However, the relationship between the two is not well understood. Here, systematic molecular dynamics simulations of 136 DNA sequences that differ from each other in their central tetramer revealed that sequence dependence of hydration is clearly correlated with that of deformability. We show that this correlation can be illustrated by four typical cases. Most rigid basepair steps are highly likely to form an ordered hydration pattern composed of one water molecule forming a bridge between the bases of distinct strands, but a few exceptions favor another ordered hydration composed of two water molecules forming such a bridge. Steps with medium deformability can display both of these hydration patterns with frequent transition. Highly flexible steps do not have any stable hydration pattern. A detailed picture of this correlation demonstrates that motions of hydration water molecules and DNA bases are tightly coupled with each other at the atomic level. These results contribute to our understanding of the entropic contribution from water molecules in protein or drug binding and could be applied for the purpose of predicting binding sites. PMID:19686662

  9. Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

    PubMed Central

    Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

    2014-01-01

    The increasing number of dengue virus (DENV) genome sequences available allows the factors contributing to DENV evolution to be identified. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistical methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide positions in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our large-scale analysis reveals new features of DENV genomic evolution. PMID:25136631
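
    The quantities GC, GC1, GC2 and GC3 named above are simple base-composition fractions, and a minimal Python sketch makes the definitions concrete. The function name and the toy coding sequence are hypothetical, the input is assumed to be an in-frame coding sequence, and the ENC and RSCU statistics mentioned in the abstract are not computed here.

```python
def gc_by_codon_position(cds):
    """Return overall GC content and GC content at codon positions 1, 2 and 3.

    Assumes `cds` is an in-frame coding sequence; any trailing bases that do
    not complete a codon are ignored.
    """
    cds = cds.upper()
    cds = cds[: len(cds) - len(cds) % 3]

    def gc_fraction(bases):
        return sum(base in "GC" for base in bases) / len(bases) if bases else 0.0

    return {
        "GC": gc_fraction(cds),
        "GC1": gc_fraction(cds[0::3]),  # first base of every codon
        "GC2": gc_fraction(cds[1::3]),  # second base of every codon
        "GC3": gc_fraction(cds[2::3]),  # third (wobble) base of every codon
    }


# Toy coding sequence (hypothetical): ATG GCC AAA TGA
stats = gc_by_codon_position("ATGGCCAAATGA")
```

    Plotting a codon-usage statistic such as ENC against GC3 for each genome is then a matter of computing these fractions per sequence.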

  10. A communal catalogue reveals Earth’s multiscale microbial diversity

    DOE PAGES

    Thompson, Luke R.; Sanders, Jon G.; McDonald, Daniel; ...

    2017-11-01

    Our growing awareness of the importance and diversity of the microbial world contrasts starkly with our limited understanding of its fundamental structure. Despite remarkable advances in DNA sequence generation, a lack of standardized protocols and common analytical framework impedes useful comparison between studies, hindering development of global inferences about microbial life on Earth. Here, we show that with coordinated protocols, exact microbial 16S rRNA gene sequences can be followed across scores of individual studies, revealing patterns of diversity, community structure, and life history strategy at a planetary scale. Using 27,751 crowdsourced environmental samples comprising more than 2.2 billion reads, we find sharp divides between host-associated and free-living communities. We show that the distribution of taxonomic and sequence diversity follows consistent trends across sample types and along gradients of environmental parameters, highlighting some of the global evolutionary patterns and ecological principles that underpin Earth’s microbiome. Here, this dataset provides the most complete environmental survey of our microbial world to date, and serves as a growing reference to provide immediate global context to future microbial surveys.

  11. A communal catalogue reveals Earth’s multiscale microbial diversity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thompson, Luke R.; Sanders, Jon G.; McDonald, Daniel

    Our growing awareness of the importance and diversity of the microbial world contrasts starkly with our limited understanding of its fundamental structure. Despite remarkable advances in DNA sequence generation, a lack of standardized protocols and common analytical framework impedes useful comparison between studies, hindering development of global inferences about microbial life on Earth. Here, we show that with coordinated protocols, exact microbial 16S rRNA gene sequences can be followed across scores of individual studies, revealing patterns of diversity, community structure, and life history strategy at a planetary scale. Using 27,751 crowdsourced environmental samples comprising more than 2.2 billion reads, we find sharp divides between host-associated and free-living communities. We show that the distribution of taxonomic and sequence diversity follows consistent trends across sample types and along gradients of environmental parameters, highlighting some of the global evolutionary patterns and ecological principles that underpin Earth’s microbiome. Here, this dataset provides the most complete environmental survey of our microbial world to date, and serves as a growing reference to provide immediate global context to future microbial surveys.

  12. Sequencing of Seven Haloarchaeal Genomes Reveals Patterns of Genomic Flux

    PubMed Central

    Lynch, Erin A.; Langille, Morgan G. I.; Darling, Aaron; Wilbanks, Elizabeth G.; Haltiner, Caitlin; Shao, Katie S. Y.; Starr, Michael O.; Teiling, Clotilde; Harkins, Timothy T.; Edwards, Robert A.; Eisen, Jonathan A.; Facciotti, Marc T.

    2012-01-01

    We report the sequencing of seven genomes from two haloarchaeal genera, Haloferax and Haloarcula. Ease of cultivation and the existence of well-developed genetic and biochemical tools for several diverse haloarchaeal species make haloarchaea a model group for the study of archaeal biology. The unique physiological properties of these organisms also make them good candidates for novel enzyme discovery for biotechnological applications. Seven genomes were sequenced to ∼20× coverage and assembled to an average of 50 contigs (range 5 scaffolds - 168 contigs). Comparisons of protein-coding gene complements revealed large-scale differences in COG functional group enrichment between these genera. Analysis of genes encoding machinery for DNA metabolism reveals genera-specific expansions of the general transcription factor TATA binding protein as well as a history of extensive duplication and horizontal transfer of the proliferating cell nuclear antigen. Insights gained from this study emphasize the importance of haloarchaea for investigation of archaeal biology. PMID:22848480

  13. Observations of Displacement-driven Maturation along a Subduction-Transform Edge Propagator Fault

    NASA Astrophysics Data System (ADS)

    Neely, J. S.; Furlong, K. P.

    2016-12-01

    The Solomon Islands-Vanuatu composite subduction zone represents a tectonically complex region along the Pacific-Australia plate boundary in the southwest Pacific Ocean. Here the Australia plate subducts under the Pacific plate in two parts - the Solomon Trench and the Vanuatu Trench - with the two segments separated by a transform fault produced by a tear in the approaching Australia plate. As a result of the Australia plate tearing, the two subducting sections are offset by the 280 km long San Cristobal Trough (SCT) transform fault, which acts as a Subduction-Transform Edge Propagator (STEP) fault. The formation of this transform fault provides an opportunity to study the evolution of a newly created transform plate boundary. As distance from the tear increases, both the magnitude and frequency of earthquakes along the transform increase, reflecting the coalescence of fault segments into a through-going structure. Over the past few decades, there have been several instances of larger magnitude earthquakes migrating westward along the STEP through a rapid succession of events. A recent May 2015 sequence of MW 6.8, MW 6.9, and MW 6.8 earthquakes followed this pattern, with an east to west migration over three days. However, neither this 2015 sequence, nor a previous 1993 progression, ruptured into or nucleated a large earthquake within the region near the tear. SCT sequence termination outside the region of the newly formed fault occurs even though Coulomb Failure Stress analyses reveal that the tear end of the SCT is positively loaded for failure by the earthquake sequence. Changing seismicity patterns along the SCT are also mapped by b-value variations that correspond to the rupture patterns of these propagating sequences. These seismicity pattern changes along the SCT reveal a fault maturation process with strain localization driven by cumulative slip corresponding to approximately 80-100 km of displacement.

  14. Genome-wide identification of conserved microRNA and their response to drought stress in Dongxiang wild rice (Oryza rufipogon Griff.).

    PubMed

    Zhang, Fantao; Luo, Xiangdong; Zhou, Yi; Xie, Jiankun

    2016-04-01

    To identify drought stress-responsive conserved microRNA (miRNA) from Dongxiang wild rice (Oryza rufipogon Griff., DXWR) on a genome-wide scale, high-throughput sequencing technology was used to sequence libraries of DXWR samples, treated with and without drought stress. 505 conserved miRNAs corresponding to 215 families were identified. 17 were significantly down-regulated and 16 were up-regulated under drought stress. Stem-loop qRT-PCR revealed the same expression patterns as high-throughput sequencing, suggesting the accuracy of the sequencing result was high. Potential target genes of the drought-responsive miRNA were predicted to be involved in diverse biological processes. Furthermore, 16 miRNA families were identified for the first time as being involved in the drought stress response in plants. These results present a comprehensive view of the conserved miRNA and their expression patterns under drought stress for DXWR, which will provide valuable information and sequence resources for future basic studies.

  15. Constructing Patient Specific Clinical Trajectories from Electronic Healthcare Reimbursement Claims using Sequential Pattern Mining

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pullum, Laura L; Hobson, Tanner C

    We examine the use of electronic healthcare reimbursement claims (EHRC) for analyzing healthcare delivery and practice patterns across the United States (US). By analyzing over 1 billion EHRCs, we track patterns of clinical procedures administered to patients with heart disease (HD) using sequential pattern mining algorithms. Our analyses reveal that the clinical procedures performed on HD patients are highly varied leading up to and after the primary diagnosis. The discovered clinical procedure sequences reveal significant differences in the overall costs incurred across different parts of the US, indicating significant heterogeneity in treating HD patients. We show that a data-driven approach to understand patient specific clinical trajectories constructed from EHRC can provide quantitative insights into how to better manage and treat patients.

  16. Early history of European domestic cattle as revealed by ancient DNA.

    PubMed

    Bollongino, R; Edwards, C J; Alt, K W; Burger, J; Bradley, D G

    2006-03-22

    We present an extensive ancient DNA analysis of mainly Neolithic cattle bones sampled from archaeological sites along the route of Neolithic expansion, from Turkey to North-Central Europe and Britain. We place this first reasonable population sample of Neolithic cattle mitochondrial DNA sequence diversity in context to illustrate the continuity of haplotype variation patterns from the first European domestic cattle to the present. Interestingly, the dominant Central European pattern, a starburst phylogeny around the modal sequence, T3, has a Neolithic origin, and the reduced diversity within this cluster in the ancient samples accords with their shorter history of post-domestic accumulation of mutation.

  17. Data-Driven Sequence of Changes to Anatomical Brain Connectivity in Sporadic Alzheimer's Disease.

    PubMed

    Oxtoby, Neil P; Garbarino, Sara; Firth, Nicholas C; Warren, Jason D; Schott, Jonathan M; Alexander, Daniel C

    2017-01-01

    Model-based investigations of transneuronal spreading mechanisms in neurodegenerative diseases relate the pattern of pathology severity to the brain's connectivity matrix, which reveals information about how pathology propagates through the connectivity network. Such network models typically use networks based on functional or structural connectivity in young and healthy individuals, and only end-stage patterns of pathology, thereby ignoring/excluding the effects of normal aging and disease progression. Here, we examine the sequence of changes in the elderly brain's anatomical connectivity over the course of a neurodegenerative disease. We do this in a data-driven manner that is not dependent upon clinical disease stage, by using event-based disease progression modeling. Using data from the Alzheimer's Disease Neuroimaging Initiative dataset, we sequence the progressive decline of anatomical connectivity, as quantified by graph-theory metrics, in the Alzheimer's disease brain. Ours is the first single model to contribute to understanding all three of the nature, the location, and the sequence of changes to anatomical connectivity in the human brain due to Alzheimer's disease. Our experimental results reveal new insights into Alzheimer's disease: that degeneration of anatomical connectivity in the brain may be a viable, even early, biomarker and should be considered when studying such neurodegenerative diseases.

  18. Genetic diversity of Babesia bovis in virulent and attenuated strains.

    PubMed

    Mazuz, M L; Molad, T; Fish, L; Leibovitz, B; Wolkomirsky, R; Fleiderovitz, L; Shkap, V

    2012-03-01

    The aim of this study was to compare the genetic diversity of the single copy Bv80 gene sequences of Babesia bovis in populations of attenuated and virulent parasites. PCR/RT-PCR followed by cloning and sequence analyses of 4 attenuated and 4 virulent strains were performed. Multiple fragments in the range of 420 to 744 bp were amplified by PCR or RT-PCR. Cloning of the PCR fragments and sequence analyses revealed the presence of mixed subpopulations in either virulent or attenuated parasites with a total of 19 variants with 12 different sequences that differed in number and type of tandem repeats. High levels of intra- and inter-strain diversity of the Bv80 gene, with the presence of mixed populations of parasites were found in both the virulent field isolates and the attenuated vaccine strains. In addition, during the attenuation process, sequence analyses showed changes in the pattern of the parasite subpopulations. Despite high polymorphism found by sequence analyses, the patterns observed and the number of repeats, order, or motifs found could not discriminate between virulent field isolates and attenuated vaccine strains of the parasite.

  19. Partial bisulfite conversion for unique template sequencing.

    PubMed

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

    2018-01-25

    We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. A sequential coalescent algorithm for chromosomal inversions

    PubMed Central

    Peischl, S; Koch, E; Guerrero, R F; Kirkpatrick, M

    2013-01-01

    Chromosomal inversions are common in natural populations and are believed to be involved in many important evolutionary phenomena, including speciation, the evolution of sex chromosomes and local adaptation. While recent advances in sequencing and genotyping methods are leading to rapidly increasing amounts of genome-wide sequence data that reveal interesting patterns of genetic variation within inverted regions, efficient simulation methods to study these patterns are largely missing. In this work, we extend the sequential Markovian coalescent, an approximation to the coalescent with recombination, to include the effects of polymorphic inversions on patterns of recombination. Results show that our algorithm is fast, memory-efficient and accurate, making it feasible to simulate large inversions in large populations for the first time. The SMC algorithm enables studies of patterns of genetic variation (for example, linkage disequilibria) and tests of hypotheses (using simulation-based approaches) that were previously intractable. PMID:23632894

  1. Archaeal β diversity patterns under the seafloor along geochemical gradients

    NASA Astrophysics Data System (ADS)

    Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya

    2014-09-01

    Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.

  2. Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.

    PubMed

    Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M

    1999-10-01

    This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.

  3. Molecular Diagnostic Tools for Detection and Differentiation of Phytoplasmas Based on Chaperonin-60 Reveal Differences in Host Plant Infection Patterns

    PubMed Central

    Dumonceaux, Tim J.; Green, Margaret; Hammond, Christine; Perez, Edel; Olivier, Chrystel

    2014-01-01

    Phytoplasmas (‘Candidatus Phytoplasma’ spp.) are insect-vectored bacteria that infect a wide variety of plants, including many agriculturally important species. The infections can cause devastating yield losses by inducing morphological changes that dramatically alter inflorescence development. Detection of phytoplasma infection typically utilizes sequences located within the 16S–23S rRNA-encoding locus, and these sequences are necessary for strain identification by currently accepted standards for phytoplasma classification. However, these methods can generate PCR products >1400 bp that are less divergent in sequence than protein-encoding genes, limiting strain resolution in certain cases. We describe a method for accessing the chaperonin-60 (cpn60) gene sequence from a diverse array of ‘Ca. Phytoplasma’ spp. Two degenerate primer sets were designed based on the known sequence diversity of cpn60 from ‘Ca. Phytoplasma’ spp. and used to amplify cpn60 gene fragments from various reference samples and infected plant tissues. Forty-three cpn60 sequences were thereby determined. The cpn60 PCR-gel electrophoresis method was highly sensitive compared to 16S-23S-targeted PCR-gel electrophoresis. The topology of a phylogenetic tree generated using cpn60 sequences was congruent with that reported for 16S rRNA-encoding genes. The cpn60 sequences were used to design a hybridization array using oligonucleotide-coupled fluorescent microspheres, providing rapid diagnosis and typing of phytoplasma infections. The oligonucleotide-coupled fluorescent microsphere assay revealed samples that were infected simultaneously with two subtypes of phytoplasma. These tools were applied to show that two host plants, Brassica napus and Camelina sativa, displayed different phytoplasma infection patterns. PMID:25551224

  4. Arpeggio: harmonic compression of ChIP-seq data reveals protein-chromatin interaction signatures

    PubMed Central

    Stanton, Kelly Patrick; Parisi, Fabio; Strino, Francesco; Rabin, Neta; Asp, Patrik; Kluger, Yuval

    2013-01-01

    Researchers generating new genome-wide data in an exploratory sequencing study can gain biological insights by comparing their data with well-annotated data sets possessing similar genomic patterns. Data compression techniques are needed for efficient comparisons of a new genomic experiment with large repositories of publicly available profiles. Furthermore, data representations that allow comparisons of genomic signals from different platforms and across species enhance our ability to leverage these large repositories. Here, we present a signal processing approach that characterizes protein–chromatin interaction patterns at length scales of several kilobases. This allows us to efficiently compare numerous chromatin-immunoprecipitation sequencing (ChIP-seq) data sets consisting of many types of DNA-binding proteins collected from a variety of cells, conditions and organisms. Importantly, these interaction patterns broadly reflect the biological properties of the binding events. To generate these profiles, termed Arpeggio profiles, we applied harmonic deconvolution techniques to the autocorrelation profiles of the ChIP-seq signals. We used 806 publicly available ChIP-seq experiments and showed that Arpeggio profiles with similar spectral densities shared biological properties. Arpeggio profiles of ChIP-seq data sets revealed characteristics that are not easily detected by standard peak finders. They also allowed us to relate sequencing data sets from different genomes, experimental platforms and protocols. Arpeggio is freely available at http://sourceforge.net/p/arpeggio/wiki/Home/. PMID:23873955

  5. Arpeggio: harmonic compression of ChIP-seq data reveals protein-chromatin interaction signatures.

    PubMed

    Stanton, Kelly Patrick; Parisi, Fabio; Strino, Francesco; Rabin, Neta; Asp, Patrik; Kluger, Yuval

    2013-09-01

    Researchers generating new genome-wide data in an exploratory sequencing study can gain biological insights by comparing their data with well-annotated data sets possessing similar genomic patterns. Data compression techniques are needed for efficient comparisons of a new genomic experiment with large repositories of publicly available profiles. Furthermore, data representations that allow comparisons of genomic signals from different platforms and across species enhance our ability to leverage these large repositories. Here, we present a signal processing approach that characterizes protein-chromatin interaction patterns at length scales of several kilobases. This allows us to efficiently compare numerous chromatin-immunoprecipitation sequencing (ChIP-seq) data sets consisting of many types of DNA-binding proteins collected from a variety of cells, conditions and organisms. Importantly, these interaction patterns broadly reflect the biological properties of the binding events. To generate these profiles, termed Arpeggio profiles, we applied harmonic deconvolution techniques to the autocorrelation profiles of the ChIP-seq signals. We used 806 publicly available ChIP-seq experiments and showed that Arpeggio profiles with similar spectral densities shared biological properties. Arpeggio profiles of ChIP-seq data sets revealed characteristics that are not easily detected by standard peak finders. They also allowed us to relate sequencing data sets from different genomes, experimental platforms and protocols. Arpeggio is freely available at http://sourceforge.net/p/arpeggio/wiki/Home/.

  6. Site-targeted mutagenesis for stabilization of recombinant monoclonal antibody expressed in tobacco (Nicotiana tabacum) plants

    PubMed Central

    Hehle, Verena K.; Paul, Matthew J.; Roberts, Victoria A.; van Dolleweerd, Craig J.; Ma, Julian K.-C.

    2016-01-01

    This study examined the degradation pattern of a murine IgG1κ monoclonal antibody expressed in and extracted from transformed Nicotiana tabacum. Gel electrophoresis of leaf extracts revealed a consistent pattern of recombinant immunoglobulin bands, including intact and full-length antibody, as well as smaller antibody fragments. N-terminal sequencing revealed these smaller fragments to be proteolytic cleavage products and identified a limited number of protease-sensitive sites in the antibody light and heavy chain sequences. No strictly conserved target sequence was evident, although the peptide bonds that were susceptible to proteolysis were predominantly and consistently located within or near to the interdomain or solvent-exposed regions in the antibody structure. Amino acids surrounding identified cleavage sites were mutated in an attempt to increase resistance. Different Guy’s 13 antibody heavy and light chain mutant combinations were expressed transiently in N. tabacum and demonstrated intensity shifts in the fragmentation pattern, resulting in alterations to the full-length antibody-to-fragment ratio. The work strengthens the understanding of proteolytic cleavage of antibodies expressed in plants and presents a novel approach to stabilize full-length antibody by site-directed mutagenesis. PMID:26712217

  7. Exome sequencing of a colorectal cancer family reveals shared mutation pattern and predisposition circuitry along tumor pathways.

    PubMed

    Suleiman, Suleiman H; Koko, Mahmoud E; Nasir, Wafaa H; Elfateh, Ommnyiah; Elgizouli, Ubai K; Abdallah, Mohammed O E; Alfarouk, Khalid O; Hussain, Ayman; Faisal, Shima; Ibrahim, Fathelrahamn M A; Romano, Maurizio; Sultan, Ali; Banks, Lawrence; Newport, Melanie; Baralle, Francesco; Elhassan, Ahmed M; Mohamed, Hiba S; Ibrahim, Muntaser E

    2015-01-01

    The molecular basis of cancer and of its multiple phenotypes is not yet fully understood. Next Generation Sequencing promises new insight into the role of genetic interactions in shaping the complexity of cancer. Aiming to outline the differences in mutation patterns between familial colorectal cancer cases and controls, we analyzed whole exomes of cancer tissues and control samples from an extended colorectal cancer pedigree, providing one of the first data sets of exome sequencing of cancer in an African population against a background of large effective size, typically with an excess of variants. Tumors showed an hMSH2 loss-of-function SNV consistent with Lynch syndrome. Sets of genes harboring insertions-deletions in tumor tissues revealed, however, significant GO enrichment, a feature that was not seen in control samples, suggesting that ordered insertions-deletions are central to tumorigenesis in this type of cancer. Network analysis identified multiple hub genes of centrality. ELAVL1/HuR showed remarkable centrality, interacting specially with genes harboring non-synonymous SNVs, thus reinforcing the proposition of targeted mutagenesis in cancer pathways. A likely explanation for such a mutation pattern is DNA/RNA editing, suggested here by a nucleotide transition-to-transversion ratio that significantly departed from expected values (p-value 5e-6). NFKB1 also showed significant centrality along with ELAVL1, raising the suspicion of viral etiology given the known interaction between oncogenic viruses and these proteins.
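
    The transition-to-transversion ratio mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not say how the ratio or its expected value was computed, so the function name `ti_tv_ratio` and the toy variant list below are hypothetical and serve only to show the definition: transitions are purine-to-purine or pyrimidine-to-pyrimidine changes, and all other substitutions are transversions.

```python
# Illustrative sketch only: the variant list is made up, not data from the study.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def is_transition(ref, alt):
    """A transition swaps a purine for a purine or a pyrimidine for a pyrimidine."""
    return ({ref, alt} <= PURINES) or ({ref, alt} <= PYRIMIDINES)

def ti_tv_ratio(snvs):
    """Return transitions / transversions for a list of (ref, alt) base pairs."""
    transitions = sum(1 for ref, alt in snvs if is_transition(ref, alt))
    transversions = len(snvs) - transitions
    if transversions == 0:
        raise ValueError("no transversions; ratio is undefined")
    return transitions / transversions

# Hypothetical single-nucleotide variants as (reference, alternate) pairs.
toy_snvs = [("A", "G"), ("C", "T"), ("A", "C"), ("G", "T"), ("T", "C"), ("G", "A")]
ratio = ti_tv_ratio(toy_snvs)  # 4 transitions, 2 transversions
```

    Testing whether an observed ratio departs from an expected value, as the abstract reports, would need a statistical test that this sketch does not attempt.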

  8. WebLogo: A Sequence Logo Generator

    PubMed Central

    Crooks, Gavin E.; Hon, Gary; Chandonia, John-Marc; Brenner, Steven E.

    2004-01-01

    WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. PMID:15173120
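
    The stack and symbol heights described above can be sketched as follows. This is not WebLogo's source code: it is a minimal illustration that assumes a gap-free DNA alignment (alphabet size 4) and omits any small-sample correction. The function name `logo_heights` and the toy alignment are hypothetical.

```python
import math
from collections import Counter

def logo_heights(alignment, alphabet_size=4):
    """For each column return (stack height in bits, {symbol: symbol height}).

    Stack height = log2(alphabet_size) - Shannon entropy of the column.
    Symbol height = relative frequency * stack height.
    """
    columns = []
    for column in zip(*alignment):
        counts = Counter(column)
        total = len(column)
        freqs = {sym: n / total for sym, n in counts.items()}
        entropy = -sum(p * math.log2(p) for p in freqs.values())
        info = math.log2(alphabet_size) - entropy
        columns.append((info, {sym: p * info for sym, p in freqs.items()}))
    return columns

# Hypothetical four-sequence alignment, four positions wide.
toy_alignment = ["ACGT", "ACGA", "ACTT", "ACGC"]
heights = logo_heights(toy_alignment)
```

    In this toy alignment the first column is fully conserved and reaches the 2-bit maximum for DNA, while the last column (T, A, T, C) carries only 0.5 bits.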

  9. Unlinking the methylome pattern from nucleotide sequence, revealed by large-scale in vivo genome engineering and methylome editing in medaka fish

    PubMed Central

    Nakamura, Ryohei; Uno, Ayako; Kumagai, Masahiko; Fukushima, Hiroto S.; Morishita, Shinichi; Takeda, Hiroyuki

    2017-01-01

    The heavily methylated vertebrate genomes are punctuated by stretches of poorly methylated DNA sequences that usually mark gene regulatory regions. It is known that the methylation state of these regions confers transcriptional control over their associated genes. Given its governance over the transcriptome, cellular functions and identity, the genome-wide DNA methylation pattern is tightly regulated and evidently predefined. However, how the methylation pattern is determined in vivo remains enigmatic. Based on in silico and in vitro evidence, recent studies proposed that the regional hypomethylated state is primarily determined by local DNA sequence, e.g., high CpG density and presence of specific transcription factor binding sites. Nonetheless, the dependency of DNA methylation on nucleotide sequence has not been carefully validated in vertebrates in vivo. Herein, with the use of medaka (Oryzias latipes) as a model, the sequence dependency of DNA methylation was intensively tested in vivo. Our statistical modeling confirmed the strong statistical association between nucleotide sequence pattern and methylation state in the medaka genome. However, by manipulating the methylation state of a number of genomic sequences and reintegrating them into medaka embryos, we demonstrated that artificially conferred DNA methylation states were predominantly and robustly maintained in vivo, regardless of their sequences and endogenous states. This feature was also observed in the medaka transgene that had passed across generations. Thus, despite the observed statistical association, nucleotide sequence was unable to autonomously determine its own methylation state in medaka in vivo. Our results apparently argue against the notion that nucleotide sequence governs DNA methylation, but instead suggest the involvement of other epigenetic factors in defining and maintaining the DNA methylation landscape. Further investigation in other vertebrate models in vivo will be needed for the generalization of our observations made in medaka. PMID:29267279

  10. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids.

    PubMed

    Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

    2015-08-13

    Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of three different stages of fruit ripening (pre-climacteric, climacteric and post-climacteric) were also analyzed. The analysis of the RNA-seq atlas presented here reveals strong differences in gene expression patterns between different organs, especially between root and flower, but also reveals similarities among the gene expression patterns in other organs, such as stem, leaves and aerial buds (vegetative organs) or seed and fruit (storage organs). Important regulators, functional categories, and differentially expressed genes involved in avocado fruit ripening were identified. Additionally, to demonstrate the utility of the avocado gene expression atlas, we investigated the expression patterns of genes implicated in fatty acid metabolism and fruit ripening. A description of transcriptomic changes occurring during fruit ripening was obtained in Mexican avocado, contributing to a dynamic view of the expression patterns of genes involved in fatty acid biosynthesis and the fruit ripening process.

  11. An evolutionary conserved pattern of 18S rRNA sequence complementarity to mRNA 5′ UTRs and its implications for eukaryotic gene translation regulation

    PubMed Central

    Pánek, Josef; Kolář, Michal; Vohradský, Jiří; Shivaya Valášek, Leoš

    2013-01-01

    There are several key mechanisms regulating eukaryotic gene expression at the level of protein synthesis. Interestingly, the least explored mechanisms of translational control are those that involve the translating ribosome per se, mediated for example via predicted interactions between the ribosomal RNAs (rRNAs) and mRNAs. Here, we took advantage of robustly growing large-scale data sets of mRNA sequences for numerous organisms, solved ribosomal structures and computational power to computationally explore the mRNA–rRNA complementarity that is statistically significant across the species. Our predictions reveal highly specific sequence complementarity of 18S rRNA sequences with mRNA 5′ untranslated regions (UTRs) forming a well-defined 3D pattern on the rRNA sequence of the 40S subunit. Broader evolutionary conservation of this pattern may imply that 5′ UTRs of eukaryotic mRNAs, which have already emerged from the mRNA-binding channel, may contact several complementary spots on 18S rRNA situated near the exit of the mRNA binding channel and on the middle-to-lower body of the solvent-exposed 40S ribosome including its left foot. We discuss physiological significance of this structurally conserved pattern and, in the context of previously published experimental results, propose that it modulates scanning of the 40S subunit through 5′ UTRs of mRNAs. PMID:23804757

  12. Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses

    PubMed Central

    KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu

    2017-01-01

    Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450

  13. Mitochondrial genome sequencing reveals potential origins of the scabies mite Sarcoptes scabiei infesting two iconic Australian marsupials.

    PubMed

    Fraser, Tamieka A; Shao, Renfu; Fountain-Jones, Nicholas M; Charleston, Michael; Martin, Alynn; Whiteley, Pam; Holme, Roz; Carver, Scott; Polkinghorne, Adam

    2017-11-28

    Debilitating skin infestations caused by the mite, Sarcoptes scabiei, have a profound impact on human and animal health globally. In Australia, this impact is evident across different segments of Australian society, with a growing recognition that it can contribute to rapid declines of native Australian marsupials. Cross-host transmission has been suggested to play a significant role in the epidemiology and origin of mite infestations in different species but a chronic lack of genetic resources has made further inferences difficult. To investigate the origins and molecular epidemiology of S. scabiei in Australian wildlife, we sequenced the mitochondrial genomes of S. scabiei from diseased wombats (Vombatus ursinus) and koalas (Phascolarctos cinereus) spanning New South Wales, Victoria and Tasmania, and compared them with the recently sequenced mitochondrial genome sequences of S. scabiei from humans. We found unique S. scabiei haplotypes among individual wombat and koala hosts with high sequence similarity (99.1%-100%). Phylogenetic analysis of near full-length mitochondrial genomes revealed three clades of S. scabiei (one human and two marsupial), with no apparent geographic or host species pattern, suggestive of multiple introductions. The availability of additional mitochondrial gene sequences also enabled a re-evaluation of a range of putative molecular markers of S. scabiei, revealing that cox1 is the most informative gene for molecular epidemiological investigations. Utilising this gene target, we provide additional evidence to support cross-host transmission between different animal hosts. Our results suggest a history of parasite invasion through colonisation of Australia from hosts across the globe and the potential for cross-host transmission being a common feature of the epidemiology of this neglected pathogen. If this is the case, comparable patterns may exist elsewhere in the 'New World'. This work provides a basis for expanded molecular studies into mange epidemiology in humans and animals in Australia and other geographic regions.

  14. Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain

    PubMed Central

    Silva, Alcino J.; Johnson, John P.; White, Raymond L.

    1987-01-01

    A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. PMID:2884636

  15. Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences.

    PubMed Central

    Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M

    1994-01-01

    Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions was found, with a significantly lower entropy in the brain- than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. PMID:7933130
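
    The per-position entropy comparison described in this abstract can be sketched roughly as below. The sequences are short made-up peptides, not V3 data, and the names `positional_entropies` and `lower_entropy_positions` are hypothetical. The sketch only flags positions where one alignment is less variable than the other; it does not reproduce the significance testing the abstract reports.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of the residues observed at one alignment position."""
    total = len(column)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(column).values())

def positional_entropies(alignment):
    """Entropy at every position of an alignment of equal-length sequences."""
    return [column_entropy(col) for col in zip(*alignment)]

def lower_entropy_positions(set_a, set_b):
    """Positions (0-based) where set_a is strictly less variable than set_b."""
    ent_a = positional_entropies(set_a)
    ent_b = positional_entropies(set_b)
    return [i for i, (a, b) in enumerate(zip(ent_a, ent_b)) if a < b]

# Hypothetical toy alignments standing in for brain- and blood-derived sequences.
brain = ["CTRPN", "CTRPN", "CTRPS"]
blood = ["CTRPN", "CIKPG", "CTKPS"]
candidate_sites = lower_entropy_positions(brain, blood)
```

    A noncontiguous set of such low-entropy positions is the kind of result the abstract calls a signature pattern, although a real analysis would also have to establish that the difference is statistically significant.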

  16. Cholera outbreaks (2012) in three districts of Nepal reveal clonal transmission of multi-drug resistant Vibrio cholerae O1

    PubMed Central

    2014-01-01

    Background: Although endemic cholera causes significant morbidity and mortality each year in Nepal, lack of information about the causal bacterium often hinders cholera intervention and prevention. In 2012, diarrheal outbreaks affected three districts of Nepal with confirmed cases of mortality. This study was designed to understand the drug response patterns, source, and transmission of Vibrio cholerae associated with 2012 cholera outbreaks in Nepal. Methods: V. cholerae (n = 28) isolated from 2012 diarrhea outbreaks {n = 22; Kathmandu (n = 12), Doti (n = 9), Bajhang (n = 1)}, and surface water (n = 6; Kathmandu) were tested for antimicrobial response. Virulence properties and DNA fingerprinting of the strains were determined by multi-locus genetic screening employing polymerase chain reaction, DNA sequencing, and pulsed-field gel electrophoresis (PFGE). Results: All V. cholerae strains isolated from patients and surface water were confirmed to be toxigenic, belonging to serogroup O1, Ogawa serotype, biotype El Tor, and possessed classical biotype cholera toxin (CTX). Double-mismatch amplification mutation assay (DMAMA)-PCR revealed the V. cholerae strains to possess the B-7 allele of ctx subunit B. DNA sequencing of tcpA revealed a point mutation at amino acid position 64 (N → S) while the ctxAB promoter revealed four copies of the tandem heptamer repeat sequence 5'-TTTTGAT-3'. V. cholerae possessed all the ORFs of the Vibrio seventh pandemic island (VSP)-I but lacked the ORFs 498–511 of VSP-II. All strains were multidrug resistant with resistance to trimethoprim-sulfamethoxazole (SXT), nalidixic acid (NA), and streptomycin (S); all carried the SXT genetic element. DNA sequencing and deduced amino acid sequence of gyrA and parC of the NAR strains (n = 4) revealed point mutations at amino acid positions 83 (S → I), and 85 (S → L), respectively. Similar PFGE (NotI) pattern revealed the Nepalese V. cholerae to be clonal, and related closely with V. cholerae associated with cholera in Bangladesh and Haiti. Conclusions: In 2012, diarrhea outbreaks in three districts of Nepal were due to transmission of multidrug resistant V. cholerae El Tor possessing cholera toxin (ctx) B-7 allele, which is clonal and related closely with V. cholerae associated with cholera in Bangladesh and Haiti. PMID:25022982

  17. Negatively supercoiled simian virus 40 DNA contains Z-DNA segments within transcriptional enhancer sequences

    NASA Technical Reports Server (NTRS)

    Nordheim, A.; Rich, A.

    1983-01-01

    Three 8-base pair (bp) segments of alternating purine-pyrimidine from the simian virus 40 enhancer region form Z-DNA on negative supercoiling; minichromosome DNase I-hypersensitive sites determined by others bracket these three segments. A survey of transcriptional enhancer sequences reveals a pattern of potential Z-DNA-forming regions which occur in pairs 50-80 bp apart. This may influence local chromatin structure and may be related to transcriptional activation.

  18. Inactivity periods and postural change speed can explain atypical postural change patterns of Caenorhabditis elegans mutants.

    PubMed

    Fukunaga, Tsukasa; Iwasaki, Wataru

    2017-01-19

    With rapid advances in genome sequencing and editing technologies, systematic and quantitative analysis of animal behavior is expected to be another key to facilitating data-driven behavioral genetics. The nematode Caenorhabditis elegans is a model organism in this field. Several video-tracking systems are available for automatically recording behavioral data for the nematode, but computational methods for analyzing these data are still under development. In this study, we applied the Gaussian mixture model-based binning method to time-series postural data for 322 C. elegans strains. We revealed that the occurrence patterns of the postural states and the transition patterns among these states have a relationship as expected, and such a relationship must be taken into account to identify strains with atypical behaviors that are different from those of wild type. Based on this observation, we identified several strains that exhibit atypical transition patterns that cannot be fully explained by their occurrence patterns of postural states. Surprisingly, we found that two simple factors, overall acceleration of postural movement and elimination of inactivity periods, explained the behavioral characteristics of strains with very atypical transition patterns; therefore, computational analysis of animal behavior must be accompanied by evaluation of the effects of these simple factors. Finally, we found that the npr-1 and npr-3 mutants have similar behavioral patterns that were not predictable by sequence homology, proving that our data-driven approach can reveal the functions of genes that have not yet been characterized. We propose that elimination of inactivity periods and overall acceleration of postural change speed can explain behavioral phenotypes of strains with very atypical postural transition patterns. Our methods and results constitute guidelines for effectively finding strains that show "truly" interesting behaviors and systematically uncovering novel gene functions by bioimage-informatic approaches.

  19. A Natural View of Microbial Biodiversity within Hot Spring Cyanobacterial Mat Communities

    PubMed Central

    Ward, David M.; Ferris, Michael J.; Nold, Stephen C.; Bateson, Mary M.

    1998-01-01

    This review summarizes a decade of research in which we have used molecular methods, in conjunction with more traditional approaches, to study hot spring cyanobacterial mats as models for understanding principles of microbial community ecology. Molecular methods reveal that the composition of these communities is grossly oversimplified by microscopic and cultivation methods. For example, none of 31 unique 16S rRNA sequences detected in the Octopus Spring mat, Yellowstone National Park, matches that of any prokaryote previously cultivated from geothermal systems; 11 are contributed by genetically diverse cyanobacteria, even though a single cyanobacterial species was suspected based on morphologic and culture analysis. By studying the basis for the incongruity between culture and molecular samplings of community composition, we are beginning to cultivate isolates whose 16S rRNA sequences are readily detected. By placing the genetic diversity detected in context with the well-defined natural environmental gradients typical of hot spring mat systems, the relationship between gene and species diversity is clarified and ecological patterns of species occurrence emerge. By combining these ecological patterns with the evolutionary patterns inherently revealed by phylogenetic analysis of gene sequence data, we find that it may be possible to understand microbial biodiversity within these systems by using principles similar to those developed by evolutionary ecologists to understand biodiversity of larger species. We hope that such an approach guides microbial ecologists to a more realistic and predictive understanding of microbial species occurrence and responsiveness in both natural and disturbed habitats. PMID:9841675

  20. A natural view of microbial biodiversity within hot spring cyanobacterial mat communities

    NASA Technical Reports Server (NTRS)

    Ward, D. M.; Ferris, M. J.; Nold, S. C.; Bateson, M. M.

    1998-01-01

    This review summarizes a decade of research in which we have used molecular methods, in conjunction with more traditional approaches, to study hot spring cyanobacterial mats as models for understanding principles of microbial community ecology. Molecular methods reveal that the composition of these communities is grossly oversimplified by microscopic and cultivation methods. For example, none of 31 unique 16S rRNA sequences detected in the Octopus Spring mat, Yellowstone National Park, matches that of any prokaryote previously cultivated from geothermal systems; 11 are contributed by genetically diverse cyanobacteria, even though a single cyanobacterial species was suspected based on morphologic and culture analysis. By studying the basis for the incongruity between culture and molecular samplings of community composition, we are beginning to cultivate isolates whose 16S rRNA sequences are readily detected. By placing the genetic diversity detected in context with the well-defined natural environmental gradients typical of hot spring mat systems, the relationship between gene and species diversity is clarified and ecological patterns of species occurrence emerge. By combining these ecological patterns with the evolutionary patterns inherently revealed by phylogenetic analysis of gene sequence data, we find that it may be possible to understand microbial biodiversity within these systems by using principles similar to those developed by evolutionary ecologists to understand biodiversity of larger species. We hope that such an approach guides microbial ecologists to a more realistic and predictive understanding of microbial species occurrence and responsiveness in both natural and disturbed habitats.

  1. Exome sequencing of a colorectal cancer family reveals shared mutation pattern and predisposition circuitry along tumor pathways

    PubMed Central

    Suleiman, Suleiman H.; Koko, Mahmoud E.; Nasir, Wafaa H.; Elfateh, Ommnyiah; Elgizouli, Ubai K.; Abdallah, Mohammed O. E.; Alfarouk, Khalid O.; Hussain, Ayman; Faisal, Shima; Ibrahim, Fathelrahamn M. A.; Romano, Maurizio; Sultan, Ali; Banks, Lawrence; Newport, Melanie; Baralle, Francesco; Elhassan, Ahmed M.; Mohamed, Hiba S.; Ibrahim, Muntaser E.

    2015-01-01

    The molecular basis of cancer and of its multiple phenotypes is not yet fully understood. Next Generation Sequencing promises new insight into the role of genetic interactions in shaping the complexity of cancer. Aiming to outline the differences in mutation patterns between familial colorectal cancer cases and controls, we analyzed whole exomes of cancer tissues and control samples from an extended colorectal cancer pedigree, providing one of the first data sets of exome sequencing of cancer in an African population against a background of large effective size, typically with an excess of variants. Tumors showed an hMSH2 loss-of-function SNV consistent with Lynch syndrome. Sets of genes harboring insertions–deletions in tumor tissues revealed, however, significant GO enrichment, a feature that was not seen in control samples, suggesting that ordered insertions–deletions are central to tumorigenesis in this type of cancer. Network analysis identified multiple hub genes of centrality. ELAVL1/HuR showed remarkable centrality, interacting specially with genes harboring non-synonymous SNVs, thus reinforcing the proposition of targeted mutagenesis in cancer pathways. A likely explanation for such a mutation pattern is DNA/RNA editing, suggested here by a nucleotide transition-to-transversion ratio that significantly departed from expected values (p-value 5e-6). NFKB1 also showed significant centrality along with ELAVL1, raising the suspicion of viral etiology given the known interaction between oncogenic viruses and these proteins. PMID:26442106

  2. Re-analysis of human immunodeficiency virus type 1 isolates from Cyprus and Greece, initially designated 'subtype I', reveals a unique complex A/G/H/K/? mosaic pattern.

    PubMed

    Paraskevis, D; Magiorkinis, M; Vandamme, A M; Kostrikis, L G; Hatzakis, A

    2001-03-01

    Human immunodeficiency virus type 1 (HIV-1) has been classified into three main groups and 11 distinct subtypes. Moreover, several circulating recombinant forms (CRFs) of HIV-1 have been recently documented to have spread widely causing extensive HIV-1 epidemics. A subtype, initially designated I (CRF04_cpx), was documented in Cyprus and Greece and was found to comprise regions of sequence derived from subtypes A and G as well as regions of unclassified sequence. Re-analysis of the three full-length CRF04_cpx sequences that were available revealed a mosaic genomic organization of unique complexity comprising regions of sequence from at least five distinct subtypes, A, G, H, K and unclassified regions. These strains account for approximately 2% of the total HIV-1-infected population in Greece, thus providing evidence of the great capability of HIV-1 to recombine and produce highly divergent strains which can be spread successfully through different infection routes.

  3. Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences.

    PubMed

    Kovanen, Lauri; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2013-11-05

    Recent studies on electronic communication records have shown that human communication has complex temporal structure. We study how communication patterns that involve multiple individuals are affected by attributes such as sex and age. To this end, we represent the communication records as a colored temporal network where node color is used to represent individuals' attributes, and identify patterns known as temporal motifs. We then construct a null model for the occurrence of temporal motifs that takes into account the interaction frequencies and connectivity between nodes of different colors. This null model allows us to detect significant patterns in call sequences that cannot be observed in a static network that uses interaction frequencies as link weights. We find sex-related differences in communication patterns in a large dataset of mobile phone records and show the existence of temporal homophily, the tendency of similar individuals to participate in communication patterns beyond what would be expected on the basis of their average interaction frequencies. We also show that temporal patterns differ between dense and sparse neighborhoods in the network. Because this result is also independent of interaction frequencies, it can be seen as an extension of Granovetter's hypothesis to temporal networks.

  4. Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences

    PubMed Central

    Kovanen, Lauri; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2013-01-01

    Recent studies on electronic communication records have shown that human communication has complex temporal structure. We study how communication patterns that involve multiple individuals are affected by attributes such as sex and age. To this end, we represent the communication records as a colored temporal network where node color is used to represent individuals’ attributes, and identify patterns known as temporal motifs. We then construct a null model for the occurrence of temporal motifs that takes into account the interaction frequencies and connectivity between nodes of different colors. This null model allows us to detect significant patterns in call sequences that cannot be observed in a static network that uses interaction frequencies as link weights. We find sex-related differences in communication patterns in a large dataset of mobile phone records and show the existence of temporal homophily, the tendency of similar individuals to participate in communication patterns beyond what would be expected on the basis of their average interaction frequencies. We also show that temporal patterns differ between dense and sparse neighborhoods in the network. Because this result is also independent of interaction frequencies, it can be seen as an extension of Granovetter’s hypothesis to temporal networks. PMID:24145424

  5. The organization and expression of the mdm2 gene.

    PubMed

    de Oca Luna, R M; Tabor, A D; Eberspaecher, H; Hulboy, D L; Worth, L L; Colman, M S; Finlay, C A; Lozano, G

    1996-05-01

    The mdm2 gene encodes a zinc finger protein that negatively regulates p53 function by binding and masking the p53 transcriptional activation domain. Two different promoters control expression of mdm2, one of which is also transactivated by p53. We cloned and characterized the mdm2 gene from a murine 129 library. It contained at least 12 exons and spanned approximately 25 kb of DNA. Sequencing of the mdm2 gene revealed three nucleotide differences that resulted in amino acid substitutions in the previously published mdm2 sequence. Sequencing of normal BalbC/J DNA and the original cosmid clone isolated from the 3T3DM cell line revealed that they are identical, suggesting that the published sequence is in error at these three positions. In addition, we analyzed the expression pattern of mdm2 and found ubiquitous low-level expression throughout embryo development and in adult tissues. Analysis of mRNA from numerous tissues for several mdm2 spliced variants that had been identified in the transformed 3T3DM cell line revealed that these variants could not be detected in the developing embryo or in adult tissues.

  6. The role of replay and theta sequences in mediating hippocampal-prefrontal interactions for memory and cognition.

    PubMed

    Zielinski, Mark C; Tang, Wenbo; Jadhav, Shantanu P

    2017-12-18

    Sequential activity is seen in the hippocampus during multiple network patterns, prominently as replay activity during both awake and sleep sharp-wave ripples (SWRs), and as theta sequences during active exploration. Although various mnemonic and cognitive functions have been ascribed to these hippocampal sequences, evidence for these proposed functions remains primarily phenomenological. Here, we briefly review current knowledge about replay events and theta sequences in spatial memory tasks. We reason that in order to gain a mechanistic and causal understanding of how these patterns influence memory and cognitive processing, it is important to consider how these sequences influence activity in other regions, and in particular, the prefrontal cortex, which is crucial for memory-guided behavior. For spatial memory tasks, we posit that hippocampal-prefrontal interactions mediated by replay and theta sequences play complementary and overlapping roles at different stages in learning, supporting memory encoding and retrieval, deliberative decision making, planning, and guiding future actions. This framework offers testable predictions for future physiology and closed-loop feedback inactivation experiments for specifically targeting hippocampal sequences as well as coordinated prefrontal activity in different network states, with the potential to reveal their causal roles in memory-guided behavior. © 2017 Wiley Periodicals, Inc.

  7. The expression of the clock gene cycle has rhythmic pattern and is affected by photoperiod in the moth Sesamia nonagrioides.

    PubMed

    Kontogiannatos, Dimitrios; Gkouvitsas, Theodoros; Kourti, Anna

    2017-06-01

    To obtain clues to the link between the molecular mechanism of circadian and photoperiod clocks, we have cloned the circadian clock gene cycle (Sncyc) in the corn stalk borer, Sesamia nonagrioides, which undergoes facultative diapause controlled by photoperiod. Sequence analysis revealed a high degree of conservation among insects for this gene. SnCYC consists of 667 amino acids and structural analysis showed that it contains a BCTR domain in its C-terminal in addition to the common domains found in Drosophila CYC, i.e. bHLH, PAS-A, PAS-B domains. The results revealed that the sequence of Sncyc showed a similarity to that of its mammalian orthologue, Bmal1. We also investigated the expression patterns of Sncyc in the brain of larvae growing under long-day 16L: 8D (LD), constant darkness (DD) and short-day 10L: 14D (SD) conditions using qRT-PCR assays. Sncyc mRNA expression was rhythmic in LD, DD and SD cycles. Also, it is remarkable that the photoperiodic conditions affect the expression patterns and/or amplitudes of circadian clock gene Sncyc. This gene is associated with diapause in S. nonagrioides, because under SD (diapause conditions) the photoperiodic signal altered mRNA accumulation. Sequence and expression analysis of cyc in S. nonagrioides shows interesting differences compared to Drosophila where this gene does not oscillate or change in expression patterns in response to photoperiod, suggesting that this species is an interesting new model to study the molecular control of insect circadian and photoperiodic clocks. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Evolutionary transitions between beneficial and phytopathogenic Rhodococcus challenge disease management

    PubMed Central

    Thomas, William J; Gordon, Michael I; Stevens, Danielle M; Creason, Allison L; Belcher, Michael S; Serdani, Maryna; Wiseman, Michele S; Grünwald, Niklaus J; Putnam, Melodie L

    2017-01-01

    Understanding how bacteria affect plant health is crucial for developing sustainable crop production systems. We coupled ecological sampling and genome sequencing to characterize the population genetic history of Rhodococcus and the distribution patterns of virulence plasmids in isolates from nurseries. Analysis of chromosome sequences shows that plants host multiple lineages of Rhodococcus, and suggested that these bacteria are transmitted due to independent introductions, reservoir populations, and point source outbreaks. We demonstrate that isolates lacking virulence genes promote beneficial plant growth, and that the acquisition of a virulence plasmid is sufficient to transition beneficial symbionts to phytopathogens. This evolutionary transition, along with the distribution patterns of plasmids, reveals the impact of horizontal gene transfer in rapidly generating new pathogenic lineages and provides an alternative explanation for pathogen transmission patterns. Results also uncovered a misdiagnosed epidemic that implicated beneficial Rhodococcus bacteria as pathogens of pistachio. The misdiagnosis perpetuated the unnecessary removal of trees and exacerbated economic losses. PMID:29231813

  9. Evolutionary transitions between beneficial and phytopathogenic Rhodococcus challenge disease management.

    PubMed

    Savory, Elizabeth A; Fuller, Skylar L; Weisberg, Alexandra J; Thomas, William J; Gordon, Michael I; Stevens, Danielle M; Creason, Allison L; Belcher, Michael S; Serdani, Maryna; Wiseman, Michele S; Grünwald, Niklaus J; Putnam, Melodie L; Chang, Jeff H

    2017-12-12

    Understanding how bacteria affect plant health is crucial for developing sustainable crop production systems. We coupled ecological sampling and genome sequencing to characterize the population genetic history of Rhodococcus and the distribution patterns of virulence plasmids in isolates from nurseries. Analysis of chromosome sequences shows that plants host multiple lineages of Rhodococcus, and suggested that these bacteria are transmitted due to independent introductions, reservoir populations, and point source outbreaks. We demonstrate that isolates lacking virulence genes promote beneficial plant growth, and that the acquisition of a virulence plasmid is sufficient to transition beneficial symbionts to phytopathogens. This evolutionary transition, along with the distribution patterns of plasmids, reveals the impact of horizontal gene transfer in rapidly generating new pathogenic lineages and provides an alternative explanation for pathogen transmission patterns. Results also uncovered a misdiagnosed epidemic that implicated beneficial Rhodococcus bacteria as pathogens of pistachio. The misdiagnosis perpetuated the unnecessary removal of trees and exacerbated economic losses.

  10. Transposable elements in fish chromosomes: a study in the marine cobia species.

    PubMed

    Costa, G W W F; Cioffi, M B; Bertollo, L A C; Molina, W F

    2013-01-01

    Rachycentron canadum, a unique representative of the Rachycentridae family, has been the subject of considerable biotechnological interest due to its potential use in marine fish farming. This species has undergone extensive research concerning the location of genes and multigene families on its chromosomes. Although most of the genome of some organisms is composed of repeated DNA sequences, aspects of the origin and dispersion of these elements are still largely unknown. The physical mapping of repetitive sequences on the chromosomes of R. canadum proved to be relevant for evolutionary and applied purposes. Therefore, here, we present the mapping by fluorescence in situ hybridization of the transposable element (TE) Tol2, the non-LTR retrotransposons Rex1 and Rex3, together with the 18S and 5S rRNA genes in the chromosome of this species. The Tol2 TE, belonging to the family of hAT transposons, is homogeneously distributed in the euchromatic regions of the chromosomes but with huge colocalization with the 18S rDNA sites. The hybridization signals for Rex1 and Rex3 revealed a semi-arbitrary distribution pattern, presenting differentiated dispersion in euchromatic and heterochromatic regions. Rex1 elements are associated preferentially in heterochromatic regions, while Rex3 shows a scarce distribution in the euchromatic regions of the chromosomes. The colocalization of TEs with 18S and 5S rDNA revealed complex chromosomal regions of repetitive sequences. In addition, the nonpreferential distribution of Rex1 and Rex3 in all heterochromatic regions, as well as the preferential distribution of the Tol2 transposon associated with 18S rDNA sequences, reveals a distinct pattern of organization of TEs in the genome of this species. A heterogeneous chromosomal colonization of TEs may confer different evolutionary rates to the heterochromatic regions of this species.

  11. Some maternal lineages of domestic horses may have origins in East Asia revealed with further evidence of mitochondrial genomes and HVR-1 sequences.

    PubMed

    Ma, Hongying; Wu, Yajiang; Xiang, Hai; Yang, Yunzhou; Wang, Min; Zhao, Chunjiang; Wu, Changxin

    2018-01-01

    There are large populations of indigenous horse (Equus caballus) in China and some other parts of East Asia. However, their matrilineal genetic diversity and origin remained poorly understood. Using a combination of mitochondrial DNA (mtDNA) and hypervariable region (HVR-1) sequences, we aim to investigate the origin of matrilineal inheritance in these domestic horses. To investigate patterns of matrilineal inheritance in domestic horses, we conducted a phylogenetic study using 31 de novo mtDNA genomes together with 317 others from the GenBank. In terms of the updated phylogeny, a total of 5,180 horse mitochondrial HVR-1 sequences were analyzed. Eighteen haplogroups (Aw-Rw) were uncovered from the analysis of the whole mitochondrial genomes, most of which have a divergence time before the earliest domestication of wild horses (about 5,800 years ago) and during the Upper Paleolithic (35-10 KYA). The distribution of some haplogroups shows geographic patterns. The Lw haplogroup contained a significantly higher proportion of European horses than the horses from other regions, while haplogroups Jw, Rw, and some maternal lineages of Cw, have a higher frequency in the horses from East Asia. The 5,180 sequences of horse mitochondrial HVR-1 form nine major haplogroups (A-I). We revealed a corresponding relationship between the haplotypes of HVR-1 and those of whole mitochondrial DNA sequences. The data of the HVR-1 sequences also suggests that Jw, Rw, and some haplotypes of Cw may have originated in East Asia while Lw probably formed in Europe. Our study supports the hypothesis of the multiple origins of the maternal lineage of domestic horses and suggests that some maternal lineages of domestic horses may have originated in East Asia.

  12. Molecular evolution of the leptin exon 3 in some species of the family Canidae.

    PubMed

    Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

    2003-01-01

    The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris)--16 different breeds, the Chinese raccoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) was studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical.

  13. Observations of diffusion-limited aggregation-like patterns by atmospheric plasma jet

    NASA Astrophysics Data System (ADS)

    Chiu, Ching-Yang; Chu, Hong-Yu

    2017-11-01

    We report on the observations of diffusion-limited aggregation-like patterns during the thin film removal process by an atmospheric plasma jet. The fractal patterns are found to have various structures like dense branching and tree-like patterns. The determination of surface morphology reveals that the footprints of discharge bursts are not as random as expected. We propose a diffusion-limited aggregation model with a few extra requirements by analogy with the experimental results, and thereby present the beauty of nature. We show that the model simulates not only the shapes of the patterns similar to the experimental observations, but also the growing sequences of fluctuating, oscillatory, and zigzag traces.

  14. Brain histamine depletion enhances the behavioural sequences complexity of mice tested in the open-field: Partial reversal effect of the dopamine D2/D3 antagonist sulpiride.

    PubMed

    Santangelo, Andrea; Provensi, Gustavo; Costa, Alessia; Blandina, Patrizio; Ricca, Valdo; Crescimanno, Giuseppe; Casarrubea, Maurizio; Passani, M Beatrice

    2017-02-01

    Markers of histaminergic dysregulation were found in several neuropsychiatric disorders characterized by repetitive behaviours, thoughts and stereotypies. We analysed the effect of acute histamine depletion by means of i.c.v. injections of alpha-fluoromethylhistidine, a blocker of histidine decarboxylase, on the temporal organization of motor sequences of CD1 mice behaviour in the open-field test. An ethogram encompassing 9 behavioural components was employed. Durations and frequencies were only slightly affected by treatments. However, as revealed by multivariate t-pattern analysis, histamine depletion was associated with a striking increase in the number of behavioural patterns. We found 42 patterns of different composition occurring, on average, 520.90 ± 50.23 times per mouse in the histamine depleted (HD) group, whereas controls showed 12 different patterns occurring on average 223.30 ± 20.64 times. Exploratory and grooming behaviours clustered separately, and the increased pattern complexity involved exclusively exploratory patterns. To test the hypothesis of a histamine-dopamine interplay on behavioural pattern phenotype, non-sedative doses of the D2/D3 antagonist sulpiride (12.5-25-50 mg/kg) were additionally administered to different groups of HD mice. Sulpiride counterbalanced the enhancement of exploratory patterns of different composition, but it did not affect the mean number of patterns at any of the doses used. Our results provide new insights on the role of histamine on repetitive behavioural sequences of freely moving mice. Histamine deficiency is correlated with a general enhancement of pattern complexity. This study supports a putative involvement of histamine in the pathophysiology of tics and related disorders. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. High-Throughput Sequencing for Detection of Subpopulations of Bacteria Not Previously Associated with Artisanal Cheeses

    PubMed Central

    Quigley, Lisa; O'Sullivan, Orla; Beresford, Tom P.; Ross, R. Paul; Fitzgerald, Gerald F.

    2012-01-01

    Here, high-throughput sequencing was employed to reveal the highly diverse bacterial populations present in 62 Irish artisanal cheeses and, in some cases, associated cheese rinds. Using this approach, we revealed the presence of several genera not previously associated with cheese, including Faecalibacterium, Prevotella, and Helcococcus and, for the first time, detected the presence of Arthrobacter and Brachybacterium in goats' milk cheese. Our analysis confirmed many previously observed patterns, such as the dominance of typical cheese bacteria, the fact that the microbiota of raw and pasteurized milk cheeses differ, and that the level of cheese maturation has a significant influence on Lactobacillus populations. It was also noted that cheeses containing adjunct ingredients had lower proportions of Lactococcus species. It is thus apparent that high-throughput sequencing-based investigations can provide valuable insights into the microbial populations of artisanal foods. PMID:22685131

  16. High-throughput sequencing for detection of subpopulations of bacteria not previously associated with artisanal cheeses.

    PubMed

    Quigley, Lisa; O'Sullivan, Orla; Beresford, Tom P; Ross, R Paul; Fitzgerald, Gerald F; Cotter, Paul D

    2012-08-01

    Here, high-throughput sequencing was employed to reveal the highly diverse bacterial populations present in 62 Irish artisanal cheeses and, in some cases, associated cheese rinds. Using this approach, we revealed the presence of several genera not previously associated with cheese, including Faecalibacterium, Prevotella, and Helcococcus and, for the first time, detected the presence of Arthrobacter and Brachybacterium in goats' milk cheese. Our analysis confirmed many previously observed patterns, such as the dominance of typical cheese bacteria, the fact that the microbiota of raw and pasteurized milk cheeses differ, and that the level of cheese maturation has a significant influence on Lactobacillus populations. It was also noted that cheeses containing adjunct ingredients had lower proportions of Lactococcus species. It is thus apparent that high-throughput sequencing-based investigations can provide valuable insights into the microbial populations of artisanal foods.

  17. Analysis of evolutionary patterns of genes in Campylobacter jejuni and C. coli

    USDA-ARS's Scientific Manuscript database

    Background: In order to investigate the population genetics structure of thermophilic Campylobacter spp., we extracted a set of 1029 core gene families (CGF) from 25 sequenced genomes of C. jejuni, C. coli and C. lari. Based on these CGFs we employed different approaches to reveal the evolutionary ...

  18. Revealing representational content with pattern-information fMRI--an introductory guide.

    PubMed

    Mur, Marieke; Bandettini, Peter A; Kriegeskorte, Nikolaus

    2009-03-01

    Conventional statistical analysis methods for functional magnetic resonance imaging (fMRI) data are very successful at detecting brain regions that are activated as a whole during specific mental activities. The overall activation of a region is usually taken to indicate involvement of the region in the task. However, such activation analysis does not consider the multivoxel patterns of activity within a brain region. These patterns of activity, which are thought to reflect neuronal population codes, can be investigated by pattern-information analysis. In this framework, a region's multivariate pattern information is taken to indicate representational content. This tutorial introduction motivates pattern-information analysis, explains its underlying assumptions, introduces the most widespread methods in an intuitive way, and outlines the basic sequence of analysis steps.

  19. Sequence heterogeneities of genes encoding 16S rRNAs in Paenibacillus polymyxa detected by temperature gradient gel electrophoresis.

    PubMed Central

    Nübel, U; Engelen, B; Felske, A; Snaidr, J; Wieshuber, A; Amann, R I; Ludwig, W; Backhaus, H

    1996-01-01

    Sequence heterogeneities in 16S rRNA genes from individual strains of Paenibacillus polymyxa were detected by sequence-dependent separation of PCR products by temperature gradient gel electrophoresis (TGGE). A fragment of the 16S rRNA genes, comprising variable regions V6 to V8, was used as a target sequence for amplifications. PCR products from P. polymyxa (type strain) emerged as a well-defined pattern of bands in the gradient gel. Six plasmids with different inserts, individually demonstrating the migration characteristics of single bands of the pattern, were obtained by cloning the PCR products. Their sequences were analyzed as a representative sample of the total heterogeneity. A total of 10 variant nucleotide positions in the fragment of 347 bp was observed, with all substitutions conserving the relevant secondary structures of the V6 and V8 regions in the RNA molecules. Hybridizations with specifically designed probes demonstrated different chromosomal locations of the respective rRNA genes. Amplifications of reverse-transcribed rRNA from ribosome preparations, as well as whole-cell hybridizations, revealed a predominant representation of particular sequences in ribosomes of exponentially growing laboratory cultures. Different strains of P. polymyxa showed not only remarkably differing patterns of PCR products in TGGE analysis but also discriminative whole-cell labeling with the designed oligonucleotide probes, indicating the different representation of individual sequences in active ribosomes. Our results demonstrate the usefulness of TGGE for the structural analysis of heterogeneous rRNA genes together with their expression, stress problems of the generation of meaningful data for 16S rRNA sequences and probe designs, and might have consequences for evolutionary concepts. PMID:8824607

  20. Tracing the phylogeographic history of Southeast Asian long-tailed macaques through mitogenomes of museum specimens.

    PubMed

    Yao, Lu; Li, Hongjie; Martin, Robert D; Moreau, Corrie S; Malhi, Ripan S

    2017-11-01

    The biogeographical history of Southeast Asia is complicated due to the continuous emergences and disappearances of land bridges throughout the Pleistocene. Here, we use long-tailed macaques (Macaca fascicularis), which are widely distributed throughout the mainland and islands of Southeast Asia, as a model for better understanding the biogeographical patterns of diversification in this geographically complex region. A reliable intraspecific phylogeny including individuals from localities on oceanic islands, continental islands, and the mainland is needed to trace relatedness along with the pattern and timing of colonization in this region. We used high-throughput sequencing techniques to sequence mitochondrial genomes (mitogenomes) from 95 Southeast Asian M. fascicularis specimens housed at natural history museums around the world. To achieve a comprehensive picture, we more than tripled the mitogenome sample size for M. fascicularis from previous studies, and for the first time included documented samples from the Philippines and several small Indonesian islands. Confirming the result from a previous, recent intraspecific phylogeny for M. fascicularis, the newly reconstructed phylogeny of 135 specimens divides the samples into two major clades: Clade A includes haplotypes from the mainland and some from northern Sumatra, while Clade B includes all insular haplotypes along with lineages from southern Sumatra. This study resolves a previous disparity by revealing a disjunction in the origin of Sumatran macaques, with separate lineages originating within the two major clades, suggesting that at least two major migrations to Sumatra occurred. However, our dated phylogeny reveals that the two major clades split ∼1.88Ma, which is earlier than in previously published phylogenies. Our new data reveal that most Philippine macaque lineages diverged from the Borneo stock within the last ∼0.06-0.43Ma. Finally, our study provides insight into successful sequencing of DNA across museums and shotgun sequencing of DNA specimens as a method to sequence the mitogenome. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Timing, sequencing, and executive control in repetitive movement production.

    PubMed

    Krampe, Ralf Th; Mayr, Ulrich; Kliegl, Reinhold

    2005-06-01

    The authors demonstrate that the timing and sequencing of target durations require low-level timing and executive control. Sixteen young (M_age = 19 years) and 16 older (M_age = 70 years) adults participated in 2 experiments. In Experiment 1, individual mean-variance functions for low-level timing (isochronous tapping) and the sequencing of multiple targets (rhythm production) revealed (a) a dissociation of low-level timing and sequencing in both age groups, (b) negligible age differences for low-level timing, and (c) large age differences for sequencing. Experiment 2 supported the distinction between low-level timing and executive functions: Selection against a dominant rhythm and switching between rhythms impaired performances in both age groups and induced pronounced perseveration of the dominant pattern in older adults. ((c) 2005 APA, all rights reserved).

  2. Isolation and characterization of major histocompatibility complex class II B genes in cranes.

    PubMed

    Kohyama, Tetsuo I; Akiyama, Takuya; Nishida, Chizuko; Takami, Kazutoshi; Onuma, Manabu; Momose, Kunikazu; Masuda, Ryuichi

    2015-11-01

    In this study, we isolated and characterized the major histocompatibility complex (MHC) class II B genes in cranes. Genomic sequences spanning exons 1 to 4 were amplified and determined in 13 crane species and three other species closely related to cranes. In all, 55 unique sequences were identified, and at least two polymorphic MHC class II B loci were found in most species. An analysis of sequence polymorphisms showed the signature of positive selection and recombination. A phylogenetic reconstruction based on exon 2 sequences indicated that trans-species polymorphism has persisted for at least 10 million years, whereas phylogenetic analyses of the sequences flanking exon 2 revealed a pattern of concerted evolution. These results suggest that both balancing selection and recombination play important roles in the crane MHC evolution.

  3. A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

    PubMed Central

    Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.

    2003-01-01

    We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein-coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452

  4. Microbial Characterization of Qatari Barchan Sand Dunes

    PubMed Central

    Chatziefthimiou, Aspassia D.; Nguyen, Hanh; Richer, Renee; Louge, Michel; Sultan, Ali A.; Schloss, Patrick; Hay, Anthony G.

    2016-01-01

    This study represents the first characterization of sand microbiota in migrating barchan sand dunes. Bacterial communities were studied through direct counts and cultivation, as well as 16S rRNA gene and metagenomic sequence analysis to gain an understanding of microbial abundance, diversity, and potential metabolic capabilities. Direct on-grain cell counts gave an average of 5.3 ± 0.4 × 10^5 cells g^-1 of sand. Cultured isolates (N = 64) selected for 16S rRNA gene sequencing belonged to the phyla Actinobacteria (58%), Firmicutes (27%) and Proteobacteria (15%). Deep-sequencing of 16S rRNA gene amplicons from 18 dunes demonstrated a high relative abundance of Proteobacteria, particularly enteric bacteria, and a dune-specific pattern of bacterial community composition that correlated with dune size. Shotgun metagenome sequences of two representative dunes were analyzed and found to have similar relative bacterial abundance, though the relative abundances of eukaryotic, viral and enterobacterial sequences were greater in sand from the dune closer to a camel-pen. Functional analysis revealed patterns similar to those observed in desert soils; however, the increased relative abundance of genes encoding sporulation and dormancy is consistent with the dune microbiome being well-adapted to the exceptionally hyper-arid Qatari desert. PMID:27655399

  5. Effects of tonal language background on tests of temporal sequencing in children.

    PubMed

    Mukari, Siti Zamratol-Mai S; Yu, Xuan; Ishak, Wan Syafira; Mazlan, Rafidah

    2015-01-01

    The aims of the present study were to determine the effects of language background on the performance of the pitch pattern sequence test (PPST) and duration pattern sequence test (DPST). As temporal order sequencing may be affected by age and working memory, these factors were also studied. Performance of tonal and non-tonal language speakers on PPST and DPST was compared. Twenty-eight native Mandarin (tonal language) speakers and twenty-nine native Malay (non-tonal language) speakers between seven and nine years old participated in this study. The results revealed that relative to native Malay speakers, native Mandarin speakers demonstrated better scores on the PPST in both humming and verbal labeling responses. However, a similar language effect was not apparent in the DPST. An age effect was only significant in the PPST (verbal labeling). Finally, no significant effect of working memory was found on the PPST and the DPST. These findings suggest that the PPST is affected by tonal language background, and highlight the importance of developing different normative values for tonal and non-tonal language speakers.

  6. Analysis of SINE and LINE repeat content of Y chromosomes in the platypus, Ornithorhynchus anatinus.

    PubMed

    Kortschak, R Daniel; Tsend-Ayush, Enkhjargal; Grützner, Frank

    2009-01-01

    Monotremes feature an extraordinary sex-chromosome system that consists of five X and five Y chromosomes in males. These sex chromosomes share homology with bird sex chromosomes but no homology with the therian X. The genome of a female platypus was recently completed, providing unique insights into sequence and gene content of autosomes and X chromosomes, but no Y-specific sequence has so far been analysed. Here we report the isolation, sequencing and analysis of approximately 700 kb of sequence of the non-recombining regions of Y2, Y3 and Y5, which revealed differences in base composition and repeat content between autosomes and sex chromosomes, and within the sex chromosomes themselves. This provides the first insights into repeat content of Y chromosomes in platypus, which overall show similar patterns of repeat composition to Y chromosomes in other species. Interestingly, we also observed differences between the various Y chromosomes, and in combination with timing and activity patterns we provide an approach that can be used to examine the evolutionary history of the platypus sex-chromosome chain.

  7. Nonneutral mitochondrial DNA variation in humans and chimpanzees

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nachman, M.W.; Aquadro, C.F.; Brown, W.M.

    1996-03-01

    We sequenced the NADH dehydrogenase subunit 3 (ND3) gene from a sample of 61 humans, five common chimpanzees, and one gorilla to test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution. Within humans and within chimpanzees, the ratio of replacement to silent nucleotide substitutions was higher than observed in comparisons between species, contrary to neutral expectations. To test the generality of this result, we reanalyzed published human RFLP data from the entire mitochondrial genome. Gains of restriction sites relative to a known human mtDNA sequence were used to infer unambiguous nucleotide substitutions. We also compared the complete mtDNA sequences of three humans. Both the RFLP data and the sequence data reveal a higher ratio of replacement to silent nucleotide substitutions within humans than is seen between species. This pattern is observed at most or all human mitochondrial genes and is inconsistent with a strictly neutral model. These data suggest that many mitochondrial protein polymorphisms are slightly deleterious, consistent with studies of human mitochondrial diseases. 59 refs., 2 figs., 8 tabs.

  8. A family of cellular proteins related to snake venom disintegrins.

    PubMed

    Weskamp, G; Blobel, C P

    1994-03-29

    Disintegrins are short soluble integrin ligands that were initially identified in snake venom. A previously recognized cellular protein with a disintegrin domain was the guinea pig sperm protein PH-30, a protein implicated in sperm-egg membrane binding and fusion. Here we present peptide sequences that are characteristic for several cellular disintegrin-domain proteins. These peptide sequences were deduced from cDNA sequence tags that were generated by polymerase chain reaction from various mouse tissues and a mouse muscle cell line. Northern blot analysis with four sequence tags revealed distinct mRNA expression patterns. Evidently, cellular proteins containing a disintegrin domain define a superfamily of potential integrin ligands that are likely to function in important cell-cell and cell-matrix interactions.

  9. Proliferating cell nuclear antigen (Pcna) as a direct downstream target gene of Hoxc8

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Min, Hyehyun; Lee, Ji-Yeon; Bok, Jinwoong

    2010-02-19

    Hoxc8 is a member of Hox family transcription factors that play crucial roles in spatiotemporal body patterning during embryogenesis. Hox proteins contain a conserved 61 amino acid homeodomain, which is responsible for recognition and binding of the proteins onto Hox-specific DNA binding motifs and regulates expression of their target genes. Previously, using proteome analysis, we identified Proliferating cell nuclear antigen (Pcna) as one of the putative target genes of Hoxc8. Here, we asked whether Hoxc8 regulates Pcna expression by directly binding to the regulatory sequence of Pcna. In mouse embryos at embryonic day 11.5, the expression pattern of Pcna was similar to that of Hoxc8 along the anteroposterior body axis. Moreover, Pcna transcript levels as well as cell proliferation rate were increased by overexpression of Hoxc8 in C3H10T1/2 mouse embryonic fibroblast cells. Characterization of 2.3 kb genomic sequence upstream of Pcna coding region revealed that the upstream sequence contains several Hox core binding sequences and one Hox-Pbx binding sequence. Direct binding of Hoxc8 proteins to the Pcna regulatory sequence was verified by chromatin immunoprecipitation assay. Taken together, our data suggest that Pcna is a direct downstream target of Hoxc8.

  10. Evolution of bacterial-like phosphoprotein phosphatases in photosynthetic eukaryotes features ancestral mitochondrial or archaeal origin and possible lateral gene transfer.

    PubMed

    Uhrig, R Glen; Kerk, David; Moorhead, Greg B

    2013-12-01

    Protein phosphorylation is a reversible regulatory process catalyzed by the opposing reactions of protein kinases and phosphatases, which are central to the proper functioning of the cell. Dysfunction of members in either the protein kinase or phosphatase family can have wide-ranging deleterious effects in both metazoans and plants alike. Previously, three bacterial-like phosphoprotein phosphatase classes were uncovered in eukaryotes and named according to the bacterial sequences with which they have the greatest similarity: Shewanella-like (SLP), Rhizobiales-like (RLPH), and ApaH-like (ALPH) phosphatases. Utilizing the wealth of data resulting from recently sequenced complete eukaryotic genomes, we conducted database searching by hidden Markov models, multiple sequence alignment, and phylogenetic tree inference with Bayesian and maximum likelihood methods to elucidate the pattern of evolution of eukaryotic bacterial-like phosphoprotein phosphatase sequences, which are predominantly distributed in photosynthetic eukaryotes. We uncovered a pattern of ancestral mitochondrial (SLP and RLPH) or archaeal (ALPH) gene entry into eukaryotes, supplemented by possible instances of lateral gene transfer between bacteria and eukaryotes. In addition to the previously known green algal and plant SLP1 and SLP2 protein forms, a more ancestral third form (SLP3) was found in green algae. Data from in silico subcellular localization predictions revealed class-specific differences in plants likely to result in distinct functions, and for SLP sequences, distinctive and possibly functionally significant differences between plants and nonphotosynthetic eukaryotes. Conserved carboxyl-terminal sequence motifs with class-specific patterns of residue substitutions, most prominent in photosynthetic organisms, raise the possibility of complex interactions with regulatory proteins.

  11. Functional MRI reveals expert-novice differences during sport-related anticipation.

    PubMed

    Wright, Michael J; Bishop, Daniel T; Jackson, Robin C; Abernethy, Bruce

    2010-01-27

    We examined the effect of expertise on cortical activation during sports anticipation using functional MRI. In experiment 1, recreational players predicted badminton stroke direction and the pattern of active clusters was consistent with a proposed perception-of-action network. This pattern was not replicated in a stimulus-matched, action-unrelated control task. In experiment 2, players of three different skill levels anticipated stroke direction from clips occluded either 160 ms before or 80 ms after racquet-shuttle contact. Early-occluded sequences produced more activation than late-occluded sequences overall, in most cortical regions of interest, but experts showed an additional enhancement in medial, dorsolateral and ventrolateral frontal cortex. Anticipation in open-skill sports engages cortical areas integral to observing and understanding others' actions; such activity is enhanced in experts.

  12. Molecular evolution of the leptin exon 3 in some species of the family Canidae

    PubMed Central

    Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

    2003-01-01

    The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris) – 16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) was studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical. PMID:12939206

  13. Evolution Analysis of Simple Sequence Repeats in Plant Genome.

    PubMed

    Qin, Zhen; Wang, Yanping; Wang, Qingmei; Li, Aixian; Hou, Fuyun; Zhang, Liming

    2015-01-01

    Simple sequence repeats (SSRs) are widespread units on genome sequences, and play many important roles in plants. In order to reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analysis of twelve sequenced plant genomes. First, in the twelve studied plant genomes, the main SSRs were those which contain repeats of 1-3 nucleotide combinations. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With the increase of SSR repeat number the percentage of A/T in C. reinhardtii had no significant change, while the percentage of A/T in terrestrial plant species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of the plant kingdom and the repeat number increased in terrestrial plant species. This trend was more obvious in dicotyledons than monocotyledons. The percentage of CG/GC showed the opposite pattern to the AT/TA. Fourth, in trinucleotide SSRs, the percentages of combinations including two or three A/T were in a rising trend along with the evolution of the plant kingdom; meanwhile with the increase of SSR repeat number in plant species, different species chose different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed their specific patterns related to evolutionary position or specific changes of genome sequences. The results showed that SSRs not only had the general pattern in the evolution of the plant kingdom, but also were associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provided new insights for the analysis of plant genome evolution.
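
    A minimal sketch of how mono-, di- and trinucleotide SSRs of the kind tallied in this abstract could be located in a sequence. The function name `find_ssrs`, the `min_repeats` threshold and the regular expression are illustrative assumptions, not the authors' pipeline.

```python
import re
from typing import List, Tuple


def find_ssrs(seq: str, min_repeats: int = 4) -> List[Tuple[int, str, int]]:
    """Return (start, motif, repeat_count) for tandem repeats of 1-3 nt motifs.

    Assumption: an SSR is a motif of 1-3 unambiguous bases repeated at least
    `min_repeats` times in tandem. Real studies use motif-specific thresholds.
    """
    # The lazy {1,3}? makes the engine try the shortest motif first, so a run
    # such as AAAAAA is reported with motif "A" rather than "AA" or "AAA".
    pattern = re.compile(r"([ACGT]{1,3}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


if __name__ == "__main__":
    print(find_ssrs("GGATATATATCC"))  # [(2, 'AT', 4)]
```

    Motif composition statistics such as the A/T share can then be derived by counting the returned motifs per genome.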

  14. VDJ-Seq: Deep Sequencing Analysis of Rearranged Immunoglobulin Heavy Chain Gene to Reveal Clonal Evolution Patterns of B Cell Lymphoma.

    PubMed

    Jiang, Yanwen; Nie, Kui; Redmond, David; Melnick, Ari M; Tam, Wayne; Elemento, Olivier

    2015-12-28

    Understanding tumor clonality is critical to understanding the mechanisms involved in tumorigenesis and disease progression. In addition, understanding the clonal composition changes that occur within a tumor in response to certain micro-environments or treatments may lead to the design of more sophisticated and effective approaches to eradicate tumor cells. However, tracking tumor clonal sub-populations has been challenging due to the lack of distinguishable markers. To address this problem, a VDJ-seq protocol was created to trace the clonal evolution patterns of diffuse large B cell lymphoma (DLBCL) relapse by exploiting VDJ recombination and somatic hypermutation (SHM), two unique features of B cell lymphomas. In this protocol, Next-Generation sequencing (NGS) libraries with indexing potential were constructed from amplified rearranged immunoglobulin heavy chain (IgH) VDJ region from pairs of primary diagnosis and relapse DLBCL samples. On average more than half a million VDJ sequences per sample were obtained after sequencing, which contain both VDJ rearrangement and SHM information. In addition, customized bioinformatics pipelines were developed to fully utilize sequence information for the characterization of IgH-VDJ repertoire within these samples. Furthermore, the pipeline allows the reconstruction and comparison of the clonal architecture of individual tumors, which enables the examination of the clonal heterogeneity within the diagnosis tumors and deduction of clonal evolution patterns between diagnosis and relapse tumor pairs. When applying this analysis to several diagnosis-relapse pairs, we uncovered key evidence that multiple distinctive tumor evolutionary patterns could lead to DLBCL relapse. Additionally, this approach can be expanded into other clinical aspects, such as identification of minimal residual disease, monitoring relapse progress and treatment response, and investigation of immune repertoires in non-lymphoma contexts.

  15. Basic leucine zipper family in barley: genome-wide characterization of members and expression analysis.

    PubMed

    Pourabed, Ehsan; Ghane Golmohamadi, Farzan; Soleymani Monfared, Peyman; Razavi, Seyed Morteza; Shobbar, Zahra-Sadat

    2015-01-01

    The basic leucine zipper (bZIP) family is one of the largest and most diverse transcription factor families in eukaryotes, participating in many essential plant processes. We identified 141 bZIP proteins encoded by 89 genes from the Hordeum vulgare genome. HvbZIPs were classified into 11 groups based on their DNA-binding motif. Amino acid sequence alignment of the HvbZIPs basic-hinge regions revealed some highly conserved residues within each group. The leucine zipper heptads were analyzed to predict their dimerization properties. 34 conserved motifs were identified outside the bZIP domain. Phylogenetic analysis indicated that major diversification within the bZIP family predated the monocot/dicot divergence, although intra-species duplication and parallel evolution seem to have occurred afterward. Localization of HvbZIPs on the barley chromosomes revealed that different groups have been distributed on seven chromosomes of barley. Six types of intron pattern were detected within the basic-hinge regions. Most of the detected cis-elements in the promoter and UTR sequences were involved in seed development or abiotic stress response. Microarray data analysis revealed differential expression patterns of HvbZIPs in response to ABA treatment, drought, and cold stresses and during barley grain development and germination. This information would be helpful for functional characterization of bZIP transcription factors in barley.

  16. Phylogeography of the dark kangaroo mouse, Microdipodops megacephalus: cryptic lineages and dispersal routes in North America's Great Basin.

    PubMed

    Hafner, John C; Upham, Nathan S

    2011-06-01

    AIM: The rodent genus Microdipodops (kangaroo mice) includes two sand-obligate endemics of the Great Basin Desert: M. megacephalus and M. pallidus. The dark kangaroo mouse, M. megacephalus, is distributed throughout the Great Basin and our principal aims were to formulate phylogenetic hypotheses for this taxon and make phylogeographical comparisons with its congener. LOCATION: The Great Basin Desert of western North America. METHODS: DNA sequence data from three mitochondrial genes were examined from 186 individuals of M. megacephalus, representing 47 general localities. Phylogenetic inference was used to analyse the sequence data. Directional analysis of phylogeographical patterns was used to examine haplotype sharing patterns and recover routes of gene exchange. Haplotype-area curves were constructed to evaluate the relationship between genetic variation and distributional island size for M. megacephalus and M. pallidus. RESULTS: Microdipodops megacephalus is a rare desert rodent (trapping success was 2.67%). Temporal comparison of trapping data shows that kangaroo mice are becoming less abundant in the study area. The distribution has changed slightly since the 1930s but many northern populations now appear to be small, fragmented, or locally extinct. Four principal phylogroups (the Idaho isolate and the western, central and eastern clades) are evident; mean sequence divergence between phylogroups for cytochrome b is c. 8%. Data from haplotype sharing show two trends: a north-south trend and a web-shaped trend. Analyses of haplotype-area curves reveal significant positive relationships. MAIN CONCLUSIONS: The four phylogroups of M. megacephalus appear to represent morphologically cryptic species; in comparison, a companion study revealed two cryptic lineages in M. pallidus. Estimated divergence times of the principal clades of M. megacephalus (c. 2-4 Ma) indicate that these kangaroo mice were Pleistocene invaders into the Great Basin coincident with the formation of sandy habitats. The north-south and web patterns from directional analyses reveal past routes of gene flow and provide evidence for source-sink population regulation. The web pattern was not seen in the companion study of M. pallidus. Significant haplotype-area curves indicate that the distributional islands are now in approximate genetic equilibrium. The patterns described here are potentially useful to conservation biologists and wildlife managers and may serve as a model for other sand-obligate organisms of the Great Basin.

  17. Linear and Nonlinear Statistical Characterization of DNA

    NASA Astrophysics Data System (ADS)

    Norio Oiwa, Nestor; Goldman, Carla; Glazier, James

    2002-03-01

    We find spatial order in the distribution of protein-coding (including RNAs) and control segments of GenBank genomic sequences, irrespective of ATCG content. This is achieved by correlations, histograms, fractal dimensions and singularity spectra. Estimates of these quantities in complete nuclear genomes indicate that coding sequences are long-range correlated and their disposition is self-similar (multifractal) for eukaryotes. These characteristics are absent in prokaryotes, where there are few noncoding sequences, suggesting that 'junk' DNA plays a relevant role in genome structure and function. Concerning the genetic message of ATCG sequences, we build a random walk (Levy flight), using DNA symmetry arguments, where we associate A, T, C and G with left, right, down and up steps, respectively. Nonlinear analysis of mitochondrial DNA walks reveals a multifractal pattern based on palindromic sequences, which fold in hairpins and loops.
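
    The step mapping described in this abstract (A, T, C, G as left, right, down, up) can be illustrated with a short sketch. Only the mapping comes from the abstract; the function name `dna_walk`, the unit step length and the decision to skip ambiguous symbols are assumptions made for illustration, and this plain unit-step walk is not the authors' Levy-flight construction.

```python
from typing import List, Tuple

# Mapping stated in the abstract: A = left, T = right, C = down, G = up.
STEPS = {"A": (-1, 0), "T": (1, 0), "C": (0, -1), "G": (0, 1)}


def dna_walk(sequence: str) -> List[Tuple[int, int]]:
    """Return every (x, y) position visited, starting at the origin."""
    x, y = 0, 0
    path = [(x, y)]
    for base in sequence.upper():
        if base not in STEPS:
            continue  # assumption: ambiguous symbols such as N are skipped
        dx, dy = STEPS[base]
        x, y = x + dx, y + dy
        path.append((x, y))
    return path


if __name__ == "__main__":
    print(dna_walk("AAG"))  # [(0, 0), (-1, 0), (-2, 0), (-2, 1)]
```

    Because A and T (and likewise C and G) map to opposite steps, a segment followed by its reverse complement retraces its own path in reverse order.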

  18. Genome Sequence of the Bacterium Streptomyces davawensis JCM 4913 and Heterologous Production of the Unique Antibiotic Roseoflavin

    PubMed Central

    Jankowitsch, Frank; Schwarz, Julia; Rückert, Christian; Gust, Bertolt; Szczepanowski, Rafael; Blom, Jochen; Pelzer, Stefan; Kalinowski, Jörn

    2012-01-01

    Streptomyces davawensis JCM 4913 synthesizes the antibiotic roseoflavin, a structural riboflavin (vitamin B2) analog. Here, we report the 9,466,619-bp linear chromosome of S. davawensis JCM 4913 and an 89,331-bp linear plasmid. The sequence has an average G+C content of 70.58% and contains six rRNA operons (16S-23S-5S) and 69 tRNA genes. The 8,616 predicted protein-coding sequences include 32 clusters coding for secondary metabolites, several of which are unique to S. davawensis. The chromosome contains long terminal inverted repeats of 33,255 bp each and atypical telomeres. Sequence analysis with regard to riboflavin biosynthesis revealed three different patterns of gene organization in Streptomyces species. Heterologous expression of a set of genes present on a subgenomic fragment of S. davawensis resulted in the production of roseoflavin by the host Streptomyces coelicolor M1152. Phylogenetic analysis revealed that S. davawensis is a close relative of Streptomyces cinnabarinus, and much to our surprise, we found that the latter bacterium is a roseoflavin producer as well. PMID:23043000

  19. Determinants of Base-Pair Substitution Patterns Revealed by Whole-Genome Sequencing of DNA Mismatch Repair Defective Escherichia coli.

    PubMed

    Foster, Patricia L; Niccum, Brittany A; Popodi, Ellen; Townes, Jesse P; Lee, Heewook; MohammedIsmail, Wazim; Tang, Haixu

    2018-06-15

    Mismatch repair (MMR) is a major contributor to replication fidelity, but its impact varies with sequence context and the nature of the mismatch. Mutation accumulation experiments followed by whole-genome sequencing of MMR-defective E. coli strains yielded ≈30,000 base-pair substitutions, revealing mutational patterns across the entire chromosome. The base-pair substitution spectrum was dominated by A:T > G:C transitions, which occurred predominantly at the center base of 5'N A C3'+5'G T N3' triplets. Surprisingly, growth on minimal medium or at low temperature attenuated these mutations. Mononucleotide runs were also hotspots for base-pair substitutions, and the rate at which these occurred increased with run length. Comparison with ≈2000 base-pair substitutions accumulated in MMR-proficient strains revealed that both kinds of hotspots appeared in the wild-type spectrum and so are likely to be sites of frequent replication errors. In MMR-defective strains transitions were strand biased, occurring twice as often when A and C rather than T and G were on the lagging-strand template. Loss of nucleotide diphosphate kinase increases the cellular concentration of dCTP, which resulted in increased rates of mutations due to misinsertion of C opposite A and T. In an mmr ndk double mutant strain, these mutations were more frequent when the template A and T were on the leading strand, suggesting that lagging-strand synthesis was more error-prone or less well corrected by proofreading than was leading strand synthesis. Copyright © 2018, Genetics.
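
    The hotspot notation 5'NAC3'+5'GTN3' in this abstract names the same site read from either strand, since GTN is the reverse complement of NAC. A minimal sketch of how a triplet context could be normalised to the strand carrying the A is given below. The names `reverse_complement` and `at_site_context` are hypothetical, and the sketch treats the reference as linear, so it ignores the wrap-around of a circular chromosome.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def at_site_context(ref: str, pos: int) -> str:
    """Triplet around an A:T pair at 0-based `pos`, read on the strand with A.

    Assumption: `ref` is a linear, upper- or lower-case ACGT string.
    """
    if pos < 1 or pos + 1 >= len(ref):
        raise ValueError("position needs a neighbour on both sides")
    triplet = ref[pos - 1:pos + 2].upper()
    if triplet[1] not in "AT":
        raise ValueError("centre base is not part of an A:T pair")
    return triplet if triplet[1] == "A" else reverse_complement(triplet)


if __name__ == "__main__":
    # The T at index 2 sits in GTC, which is GAC on the opposite strand.
    print(at_site_context("CGTCA", 2))  # GAC
```

    Tallying these normalised triplets over all A:T transitions would show whether contexts ending in AC are over-represented.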

  20. Group 16SrXI phytoplasma strains, including subgroup 16SrXI-B and a new subgroup, 16SrXI-D, are associated with sugar cane white leaf.

    PubMed

    Zhang, Rong-Yue; Li, Wen-Feng; Huang, Ying-Kun; Wang, Xiao-Yan; Shan, Hong-Li; Luo, Zhi-Ming; Yin, Jiong

    2016-01-01

    Sugar cane white leaf (SCWL) is a serious disease caused by phytoplasmas. In this study, we performed nested PCR with phytoplasma universal primer pairs (P1/P7 and R16F2n/R16R2) for the 16S rRNA gene to detect SCWL phytoplasmas in 31 SCWL samples collected from Baoshan and Lincang, Yunnan, China. We cloned and sequenced the nested PCR products, revealing that the 16S rRNA gene sequences from 31 SCWL samples were all 1247 bp in length and shared more than 99 % nucleotide sequence similarity with the 16S rRNA gene sequences of SCWL phytoplasmas from various countries. Based on the reported 16S rRNA gene sequence data from SCWL isolates of various countries, we conducted phylogenetic and virtual RFLP analysis. In the resulting phylogenetic tree, all SCWL isolates clustered into two branches, with the Lincang and Baoshan SCWL phytoplasma isolates belonging to different branches. The virtual RFLP patterns show that phytoplasmas of the Lincang branch belong to subgroup 16SrXI-B. However, the virtual RFLP patterns revealed by HaeIII digestion of phytoplasmas of the Baoshan branch differed from those of subgroup 16SrXI-B. According to the results of phylogenetic and virtual RFLP analysis, we propose that the phytoplasmas of the Baoshan branch represent a new subgroup, 16SrXI-D. These findings suggest that SCWL is caused by phytoplasmas from group 16SrXI, including subgroup 16SrXI-B and a new subgroup, 16SrXI-D.
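
    A virtual RFLP pattern is the list of fragment lengths obtained by cutting a sequence in silico at every recognition site. The sketch below assumes a linear sequence and uses HaeIII (recognition site GGCC, blunt cut between GG and CC) as the default; the function name `virtual_digest` and its parameters are illustrative and do not reproduce the software used in the study.

```python
from typing import List


def virtual_digest(seq: str, site: str = "GGCC", cut_offset: int = 2) -> List[int]:
    """Return fragment lengths after cutting a linear sequence at every site.

    `cut_offset` is the number of bases of the site left on the upstream
    fragment (2 for HaeIII, which cuts GG^CC).
    """
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    print(virtual_digest("AAGGCCTTTTGGCCA"))  # [4, 8, 3]
```

    Two sequences whose sorted fragment lists differ for a given enzyme would show distinct virtual RFLP patterns for that enzyme.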

  1. Hits to the left, flops to the right: different emotions during listening to music are reflected in cortical lateralisation patterns.

    PubMed

    Altenmüller, Eckart; Schürmann, Kristian; Lim, Vanessa K; Parlitz, Dietrich

    2002-01-01

    In order to investigate the neurobiological mechanisms accompanying emotional valence judgements during listening to complex auditory stimuli, cortical direct current (dc)-electroencephalography (EEG) activation patterns were recorded from 16 right-handed students. Students listened to 160 short sequences taken from the repertoires of jazz, rock-pop, classical music and environmental sounds (each n=40). Emotional valence of the perceived stimuli was rated on a 5-step scale after each sequence. Brain activation patterns during listening revealed widespread bilateral fronto-temporal activation, but a highly significant lateralisation effect: positive emotional attributions were accompanied by an increase in left temporal activation, negative by a more bilateral pattern with preponderance of the right fronto-temporal cortex. Female participants demonstrated greater valence-related differences than males. No differences related to the four stimulus categories could be detected, suggesting that the actual auditory brain activation patterns were more determined by their affective emotional valence than by differences in acoustical "fine" structure. The results are consistent with a model of hemispheric specialisation concerning perceived positive or negative emotions proposed by Heilman [Journal of Neuropsychiatry and Clinical Neuroscience 9 (1997) 439].

  2. Understanding the unique flowering sequence in Dipsacus fullonum: Evidence from geometrical changes during head development

    PubMed Central

    Naghiloo, Somayeh; Claßen-Bockhoff, Regine

    2017-01-01

    The genus Dipsacus is characterized by a remarkable bidirectional flowering sequence and a rare phyllotactic pattern. Considering that flower initiation and flowering sequence may be interconnected, we document the development of the head meristem in Dipsacus fullonum. Our results indicate a gradual change in the geometry of the head meristem beginning with a dome shaped stage, continuing with a remarkable widening in the middle part of the head meristem and ending in a spindle-like form. Quantitative data confirm that meristem expansion is higher in the middle part than at the base of the meristem. Likewise, the size of the flower primordia in the middle part of the young head is significantly larger than at the base soon after initiation. We conclude that the change in the geometry of the meristem and the availability of newly generated space result in the promotion of the middle flowers and the bidirectional flowering sequence at anthesis. Our investigation on phyllotactic patterns reveals a high tendency (30%) of the head meristem to insert or lose parastichies. This finding can also be attributed to changes in the expansion rate of the meristem. Dependent on the spatio-temporal relation between meristem expansion and primordia initiation, either flower primordia are promoted or additional parastichies appear. Our results emphasize the important role of geometry in flower development and phyllotactic pattern formation. PMID:28328952

  3. Heterochrony and patterns of cranial suture closure in hystricognath rodents

    PubMed Central

    Wilson, Laura A B; Sánchez-Villagra, Marcelo R

    2009-01-01

    Sutures, joints that allow one bone to articulate with another through intervening fibrous connective tissue, serve as major sites of bone expansion during postnatal craniofacial growth in the vertebrate skull and represent an aspect of cranial ontogeny which may exhibit functional and phylogenetic correlates. Suture evolution among hystricognath rodents, an ecologically diverse group represented here by 26 species, is examined using sequence heterochrony methods, i.e. event pairing and parsimov. Although minor nuances in suture closure sequence exist between species, the overall sequence was found to be conserved both across the hystricognath group and, to an increasing degree, within selected clades. At species level, suture closure pattern exhibited a significant positive correlation with patterns previously reported for hominoids. Patterns for most clades revealed the first sutures to close are those contacting the exoccipital, interparietal, and palatine bones. Heterochronic shifts were found along 19 of 35 branches within the hystricognath phylogeny. The number of shifts per node ranged from one to seven events and, overall, involved 21 of 34 suture sites. The topology generated by parsimony analyses of the event pair matrix yielded only one grouping that was congruent with the evolutionary relationships, compiled from morphological and molecular studies, taken as framework. Sutures contacting the exoccipital displayed the highest levels of most complete closure across all species. Level of suture closure is negatively correlated with cranial length (P < 0.05). Differing life history and locomotory strategies are coupled in part with differing suture closure patterns among several species. PMID:19245501

  4. A New Approach for Mining Order-Preserving Submatrices Based on All Common Subsequences.

    PubMed

    Xue, Yun; Liao, Zhengling; Li, Meihang; Luo, Jie; Kuang, Qiuhua; Hu, Xiaohui; Li, Tiechen

    2015-01-01

    Order-preserving submatrices (OPSMs) have been applied in many fields, such as DNA microarray data analysis, automatic recommendation systems, and target marketing systems, as an important unsupervised learning model. Unfortunately, most existing methods are heuristic algorithms that cannot reveal all OPSMs, because the problem is NP-complete. In particular, deep OPSMs, corresponding to long patterns with few supporting sequences, incur explosive computational costs and are completely pruned by most popular methods. In this paper, we propose an exact method to discover all OPSMs based on frequent sequential pattern mining. First, an existing algorithm was adjusted to disclose all common subsequences (ACS) between every two row sequences, so that no deep OPSMs are missed. Then, an improved data structure for prefix tree was used to store and traverse ACS, and the Apriori principle was employed to efficiently mine the frequent sequential patterns. Finally, experiments were implemented on gene and synthetic datasets. Results demonstrated the effectiveness and efficiency of this method.
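
    The link between OPSMs and common subsequences can be sketched as follows: if each row is rewritten as its column indices sorted by value, any common subsequence of two such index sequences is a set of columns on which both rows rise in the same order. The sketch below finds one longest common subsequence by dynamic programming, which is a simplification of the all-common-subsequences (ACS) step named in the abstract. The names `column_order` and `longest_common_subsequence` are hypothetical, and ties between equal values are broken by column index as an assumption.

```python
from typing import List, Sequence


def column_order(row: Sequence[float]) -> List[int]:
    """Column indices of `row` sorted by increasing value."""
    return sorted(range(len(row)), key=lambda j: row[j])


def longest_common_subsequence(a: Sequence[int], b: Sequence[int]) -> List[int]:
    """One longest common subsequence of a and b (standard O(len(a)*len(b)) DP)."""
    n, m = len(a), len(b)
    # dp[i][j] = LCS length of the suffixes a[i:] and b[j:]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk forward through the table to recover one optimal subsequence.
    out, i, j = [], 0, 0
    while i < n and j < m:
        if a[i] == b[j]:
            out.append(a[i])
            i, j = i + 1, j + 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out


if __name__ == "__main__":
    row1, row2 = [1, 5, 3, 9], [2, 8, 7, 4]
    shared = longest_common_subsequence(column_order(row1), column_order(row2))
    print(shared)  # [0, 2, 1]: both rows increase across columns 0, 2, 1
```

    An exact OPSM miner would need every common subsequence across many row pairs rather than a single longest one, which is where the prefix tree and Apriori pruning mentioned in the abstract come in.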

  5. Simultaneous Differentiation and Typing of Entamoeba histolytica and Entamoeba dispar

    PubMed Central

    Zaki, Mehreen; Meelu, Parool; Sun, Wei; Clark, C. Graham

    2002-01-01

    Sequences corresponding to some of the polymorphic loci previously reported from Entamoeba histolytica have been detected in Entamoeba dispar. Comparison of nucleotide sequences of two loci between E. dispar strain SAW760 and E. histolytica strain HM-1:IMSS revealed significant differences in both repeat and flanking regions. The tandem repeat units varied not only in sequence but also in number and arrangement between the two species at both the loci. Using the sequences obtained, primer pairs aimed at amplifying species-specific products were designed and tested on a variety of E. histolytica and E. dispar samples. Amplification results were in complete agreement with the original species classification in all cases, and the PCR products displayed discernible size and pattern variations among the isolates. PMID:11923344

  6. cDNA-AFLP analysis reveals differential gene expression in compatible interaction of wheat challenged with Puccinia striiformis f. sp. tritici

    PubMed Central

    Wang, Xiaojie; Tang, Chunlei; Zhang, Gang; Li, Yingchun; Wang, Chenfang; Liu, Bo; Qu, Zhipeng; Zhao, Jie; Han, Qingmei; Huang, Lili; Chen, Xianming; Kang, Zhensheng

    2009-01-01

    Background: Puccinia striiformis f. sp. tritici is a fungal pathogen causing stripe rust, one of the most important wheat diseases worldwide. The fungus is strictly biotrophic and thus completely dependent on living host cells for its reproduction, which makes it difficult to study genes of the pathogen. In spite of its economic importance, little is known about the molecular basis of compatible interaction between the pathogen and wheat host. In this study, we identified wheat and P. striiformis genes associated with the infection process by conducting a large-scale transcriptomic analysis using cDNA-AFLP. Results: Of the total 54,912 transcript derived fragments (TDFs) obtained using cDNA-AFLP with 64 primer pairs, 2,306 (4.2%) displayed altered expression patterns after inoculation, of which 966 were up-regulated and 1,340 down-regulated. 186 TDFs produced reliable sequences after sequencing of 208 TDFs selected, of which 74 (40%) had known functions through BLAST searching the GenBank database. The majority of the latter group had predicted gene products involved in energy (13%), signal transduction (5.4%), disease/defence (5.9%) and metabolism (5% of the sequenced TDFs). BLAST searching of the wheat stem rust fungus genome database identified 18 TDFs possibly from the stripe rust pathogen, of which 9 were validated as being of pathogen origin using PCR-based assays followed by sequencing confirmation. Of the 186 reliable TDFs, 29 homologous to genes known to play a role in disease/defense, signal transduction or uncharacterized genes were further selected for validation of cDNA-AFLP expression patterns using qRT-PCR analyses. Results confirmed the altered expression patterns of 28 (96.5%) genes revealed by the cDNA-AFLP technique. Conclusion: The results show that cDNA-AFLP is a reliable technique for studying expression patterns of genes involved in the wheat-stripe rust interactions. Genes involved in compatible interactions between wheat and the stripe rust pathogen were identified and their expression patterns were determined. The present study should be helpful in elucidating the molecular basis of the infection process, and identifying genes that can be targeted for inhibiting the growth and reproduction of the pathogen. Moreover, this study can also be used to elucidate the defence responses of the genes that were of plant origin. PMID:19566949

  7. Thermodynamics of complexity and pattern manipulation.

    PubMed

    Garner, Andrew J P; Thompson, Jayne; Vedral, Vlatko; Gu, Mile

    2017-04-01

    Many organisms capitalize on their ability to predict the environment to maximize available free energy and reinvest this energy to create new complex structures. This functionality relies on the manipulation of patterns (temporally ordered sequences of data). Here, we propose a framework to describe pattern manipulators (devices that convert thermodynamic work to patterns or vice versa) and use them to build a "pattern engine" that facilitates a thermodynamic cycle of pattern creation and consumption. We show that the least heat dissipation is achieved by the provably simplest devices, the ones that exhibit desired operational behavior while maintaining the least internal memory. We derive the ultimate limits of this heat dissipation and show that it is generally nonzero and connected with the pattern's intrinsic crypticity, a complexity-theoretic quantity that captures the puzzling difference between the amount of information the pattern's past behavior reveals about its future and the amount one needs to communicate about this past to optimally predict the future.

  8. Mapping the pericentric heterochromatin by comparative genomic hybridization analysis and chromosome deletions in Drosophila melanogaster

    PubMed Central

    He, Bing; Caudy, Amy; Parsons, Lance; Rosebrock, Adam; Pane, Attilio; Raj, Sandeep; Wieschaus, Eric

    2012-01-01

    Heterochromatin represents a significant portion of eukaryotic genomes and has essential structural and regulatory functions. Its molecular organization is largely unknown due to difficulties in sequencing through and assembling repetitive sequences enriched in the heterochromatin. Here we developed a novel strategy using chromosomal rearrangements and embryonic phenotypes to position unmapped Drosophila melanogaster heterochromatic sequence to specific chromosomal regions. By excluding sequences that can be mapped to the assembled euchromatic arms, we identified sequences that are specific to heterochromatin and used them to design heterochromatin specific probes (“H-probes”) for microarray. By comparative genomic hybridization (CGH) analyses of embryos deficient for each chromosome or chromosome arm, we were able to map most of our H-probes to specific chromosome arms. We also positioned sequences mapped to the second and X chromosomes to finer intervals by analyzing smaller deletions with breakpoints in heterochromatin. Using this approach, we were able to map >40% (13.9 Mb) of the previously unmapped heterochromatin sequences assembled by the whole-genome sequencing effort on arm U and arm Uextra to specific locations. We also identified and mapped 110 kb of novel heterochromatic sequences. Subsequent analyses revealed that sequences located within different heterochromatic regions have distinct properties, such as sequence composition, degree of repetitiveness, and level of underreplication in polytenized tissues. Surprisingly, although heterochromatin is generally considered to be transcriptionally silent, we detected region-specific temporal patterns of transcription in heterochromatin during oogenesis and early embryonic development. Our study provides a useful approach to elucidate the molecular organization and function of heterochromatin and reveals region-specific variation of heterochromatin. PMID:22745230

  9. Comparative study of the hemagglutinin and neuraminidase genes of influenza A virus H3N2, H9N2, and H5N1 subtypes using bioinformatics techniques.

    PubMed

    Ahn, Insung; Son, Hyeon S

    2007-07-01

    To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using Java code. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.
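
    The two per-sequence statistics named in this abstract, GC content and RSCU, can be illustrated with a minimal sketch. It is not the authors' Java code: it assumes the standard genetic code and the usual definition of RSCU (observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally). The function names gc_content and rscu are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def rscu(seq):
    """Relative synonymous codon usage for each codon of every amino acid seen.

    Assumes seq is an in-frame coding sequence; trailing bases that do not
    fill a codon, and codons with ambiguous bases, are ignored.
    """
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    counts = Counter(
        seq[i:i + 3] for i in range(0, usable, 3) if seq[i:i + 3] in CODON_TABLE
    )

    # Group codons into synonymous families, one family per amino acid.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        if aa == "*":
            continue  # stop codons are excluded
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        expected = total / len(family)  # count under equal synonymous usage
        for codon in family:
            values[codon] = counts[codon] / expected
    return values
```

    An RSCU of 1.0 means a codon is used as often as expected under equal usage; values above or below 1.0 indicate preference or avoidance. A table of such values per isolate is the kind of input a correspondence analysis would take.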

  10. Sequence and Structure Analysis of Distantly-Related Viruses Reveals Extensive Gene Transfer between Viruses and Hosts and among Viruses

    PubMed Central

    Caprari, Silvia; Metzler, Saskia; Lengauer, Thomas; Kalinina, Olga V.

    2015-01-01

    The origin and evolution of viruses is a subject of ongoing debate. In this study, we provide a full account of the evolutionary relationships between proteins of significant sequence and structural similarity found in viruses that belong to different classes according to the Baltimore classification. We show that such proteins can be found in viruses from all Baltimore classes. For protein families that include these proteins, we observe two patterns of the taxonomic spread. In the first pattern, they can be found in a large number of viruses from all implicated Baltimore classes. In the other pattern, the instances of the corresponding protein in species from each Baltimore class are restricted to a few compact clades. Proteins with the first pattern of distribution are products of so-called viral hallmark genes reported previously. Additionally, this pattern is displayed by the envelope glycoproteins from Flaviviridae and Bunyaviridae and helicases of superfamilies 1 and 2 that have homologs in cellular organisms. The second pattern can often be explained by horizontal gene transfer from the host or between viruses, an example being Orthomyxoviridae and Coronaviridae hemagglutinin esterases. Another facet of horizontal gene transfer comprises multiple independent introduction events of genes from cellular organisms into otherwise unrelated viruses. PMID:26492264

  11. High-resolution characterization of a hepatocellular carcinoma genome.

    PubMed

    Totoki, Yasushi; Tatsuno, Kenji; Yamamoto, Shogo; Arai, Yasuhito; Hosoda, Fumie; Ishikawa, Shumpei; Tsutsumi, Shuichi; Sonoda, Kohtaro; Totsuka, Hirohiko; Shirakihara, Takuya; Sakamoto, Hiromi; Wang, Linghua; Ojima, Hidenori; Shimada, Kazuaki; Kosuge, Tomoo; Okusaka, Takuji; Kato, Kazuto; Kusuda, Jun; Yoshida, Teruhiko; Aburatani, Hiroyuki; Shibata, Tatsuhiro

    2011-05-01

    Hepatocellular carcinoma, one of the most common virus-associated cancers, is the third most frequent cause of cancer-related death worldwide. By massively parallel sequencing of a primary hepatitis C virus-positive hepatocellular carcinoma (36× coverage) and matched lymphocytes (>28× coverage) from the same individual, we identified more than 11,000 somatic substitutions of the tumor genome that showed predominance of T>C/A>G transition and a decrease of the T>C substitution on the transcribed strand, suggesting preferential DNA repair. Gene annotation enrichment analysis of 63 validated non-synonymous substitutions revealed enrichment of phosphoproteins. We further validated 22 chromosomal rearrangements, generating four fusion transcripts that had altered transcriptional regulation (BCORL1-ELF4) or promoter activity. Whole-exome sequencing at a higher sequence depth (>76× coverage) revealed a TSC1 nonsense substitution in a subpopulation of the tumor cells. This first high-resolution characterization of a virus-associated cancer genome identified previously uncharacterized mutation patterns, intra-chromosomal rearrangements and fusion genes, as well as genetic heterogeneity within the tumor.
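
    The notation "T>C/A>G" reflects that a T>C change on one strand is an A>G change on the complementary strand, so the two are counted as one substitution class. A minimal sketch of that collapsing step follows. It assumes the common convention of reporting each substitution with a pyrimidine (C or T) as the reference base, which may differ from the authors' pipeline; the function names collapse and spectrum are hypothetical.

```python
from collections import Counter

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def collapse(ref, alt):
    """Return the strand-collapsed class of a single-base substitution.

    Substitutions with a purine reference (A or G) are rewritten as the
    equivalent change on the opposite strand, so A>G becomes T>C.
    """
    if ref in "CT":
        return f"{ref}>{alt}"
    return f"{COMPLEMENT[ref]}>{COMPLEMENT[alt]}"


def spectrum(substitutions):
    """Count substitutions per collapsed class from (ref, alt) pairs."""
    return Counter(collapse(ref, alt) for ref, alt in substitutions)
```

    With this convention the twelve possible single-base changes reduce to six classes. Detecting a strand bias such as the one reported here would additionally require knowing which strand of each gene is transcribed, which this sketch does not model.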

  12. Not all order memory is equal: Test demands reveal dissociations in memory for sequence information.

    PubMed

    Jonker, Tanya R; MacLeod, Colin M

    2017-02-01

    Remembering the order of a sequence of events is a fundamental feature of episodic memory. Indeed, a number of formal models represent temporal context as part of the memory system, and memory for order has been researched extensively. Yet, the nature of the code(s) underlying sequence memory is still relatively unknown. Across 4 experiments that manipulated encoding task, we found evidence for 3 dissociable facets of order memory. Experiment 1 introduced a test requiring a judgment of which of 2 alternatives had immediately followed a word during encoding. This measure revealed better retention of interitem associations following relational encoding (silent reading) than relatively item-specific encoding (judging referent size), a pattern consistent with that observed in previous research using order reconstruction tests. In sharp contrast, Experiment 2 demonstrated the reverse pattern: Memory for the studied order of 2 sequentially presented items was actually better following item-specific encoding than following relational encoding. Experiment 3 reproduced this dissociation in a single experiment using both tests. Experiment 4 extended these findings by further dissociating the roles of relational encoding and item strength in the 2 tests. Taken together, these results indicate that memory for event sequence is influenced by (a) interitem associations, (b) the emphasized directionality of an association, and (c) an item's strength independent of other items. Memory for order is more complicated than has been portrayed in theories of memory and its nuances should be carefully considered when designing tests and models of temporal and relational memory. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  13. Dissecting enzyme function with microfluidic-based deep mutational scanning.

    PubMed

    Romero, Philip A; Tran, Tuan M; Abate, Adam R

    2015-06-09

    Natural enzymes are incredibly proficient catalysts, but engineering them to have new or improved functions is challenging due to the complexity of how an enzyme's sequence relates to its biochemical properties. Here, we present an ultrahigh-throughput method for mapping enzyme sequence-function relationships that combines droplet microfluidic screening with next-generation DNA sequencing. We apply our method to map the activity of millions of glycosidase sequence variants. Microfluidic-based deep mutational scanning provides a comprehensive and unbiased view of the enzyme function landscape. The mapping displays expected patterns of mutational tolerance and a strong correspondence to sequence variation within the enzyme family, but also reveals previously unreported sites that are crucial for glycosidase function. We modified the screening protocol to include a high-temperature incubation step, and the resulting thermotolerance landscape allowed the discovery of mutations that enhance enzyme thermostability. Droplet microfluidics provides a general platform for enzyme screening that, when combined with DNA-sequencing technologies, enables high-throughput mapping of enzyme sequence space.

  14. Increased complexity of circRNA expression during species evolution.

    PubMed

    Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li

    2017-08-03

    Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue-specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most to circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.

  15. Comparative analysis of DNA methylation polymorphism in drought sensitive (HPKC2) and tolerant (HPK4) genotypes of horse Gram (Macrotyloma uniflorum).

    PubMed

    Bhardwaj, Jyoti; Mahajan, Monika; Yadav, Sudesh Kumar

    2013-08-01

    DNA methylation is known as an epigenetic modification that affects gene expression in plants. Variation in CpG methylation behavior was studied in two natural horse gram (Macrotyloma uniflorum [Lam.] Verdc.) genotypes, HPKC2 (drought-sensitive) and HPK4 (drought-tolerant). The methylation pattern in both genotypes was studied through methylation-sensitive amplified polymorphism. The results revealed that methylation was higher in HPKC2 (10.1%) than in HPK4 (8.6%). Sequencing demonstrated sequence homology with the DRE binding factor (cbf1), the POZ/BTB protein, and the Ty1-copia retrotransposon among some of the polymorphic fragments showing alteration in methylation behavior. Differences in DNA methylation patterns could explain the differential drought tolerance and the epigenetic signature of these two horse gram genotypes.

  16. Stratigraphic architecture and gamma ray logs of deeper ramp carbonates (Upper Jurassic, SW Germany)

    NASA Astrophysics Data System (ADS)

    Pawellek, T.; Aigner, T.

    2003-07-01

    The objective of this paper is to contribute to the development of sequence stratigraphic models for extensive epicontinental carbonate systems deposited over cratonic areas. Epicontinental carbonates of the SW German Upper Jurassic were analysed in terms of microfacies, sedimentology and sequence stratigraphy based on 2.5 km of core, 70 borehole gamma ray logs and 24 quarries. Facies analysis revealed six major facies belts across the deeper parts of the carbonate ramp, situated generally below fair-weather wave base, and mostly below average storm wave base but in the reach of occasional storm events. Observed stratigraphic patterns differ in some aspects from widely published sequence stratigraphic models: Elementary sedimentary cycles are mostly more or less symmetrical and are, thus, referred to as "genetic sequences" or "genetic units" [AAPG Bull. 55 (1971) 1137; Frazier, D.E., 1974. Depositional episodes: their relationship to the Quaternary stratigraphic framework in the northwestern portion of the Gulf Basin. University of Texas, Austin, Bureau of Economic Geology Geological Circular 71-1; AAPG Bull. 73 (1989) 125; Galloway, W.E., Hobday, D.K., 1996. Terrigenous Clastic Depositional Systems. 489 pp., Springer; Cross, T.A., Baker, M.R., Chapin, M.S., Clark, M.S., Gardner, M.H., Hanson, M.S., Lessenger, M.A., Little, L.D., McDonough, K.J., Sonnenfeld, M.D., Valasek, D.W., Williams, M.R., Witter, D.N., 1993. Applications of high-resolution sequence stratigraphy to reservoir analysis. Edition Technip 1993, 11-33; Bull. Cent. Rech. Explor. Prod. Elf-Aquitaine 16 (1992) 357; Homewood, P., Mauriaud, P., Lafont, F., 2000. Best practices in sequence stratigraphy. Elf EP Mem. 25, 81 pp.; Homewood, P., Eberli, G.P., 2000. Genetic stratigraphy on the exploration and production scales. Elf EP Mem. 24, 290 pp.], in contrast to the asymmetrical, shallowing-upward "parasequences" of the EXXON approach. 
Neither sequence boundaries nor maximum flooding surfaces could be clearly delineated. Cycle boundaries are generally not represented by sharp stratal surfaces but are always transitional and, thus, referred to as "turnarounds" [Nor. Pet. Soc. Spec. Publ. 8 (1998) 171]. Several types of genetic sequences were delineated. Both major types of facies and sequences show characteristic gamma ray log signatures. Based on the cycle stacking and the gamma ray patterns, a hierarchy of sequences was recognized, probably driven in part by 100,000- and 400,000-year Milankovitch signals. The cyclicity allowed regional correlations across various depositional environments such as sponge-microbial bioherms and coeval basins. The basin-wide correlation revealed evidence for a subtle clinoform-type stratigraphic architecture along very gentle slopes, rather than a so far assumed simple "layer cake" pattern.

  17. Genome Sequence, Structural Proteins, and Capsid Organization of the Cyanophage Syn5: A “Horned” Bacteriophage of Marine Synechococcus

    PubMed Central

    Pope, Welkin H.; Weigele, Peter R.; Chang, Juan; Pedulla, Marisa L.; Ford, Michael E.; Houtz, Jennifer M.; Jiang, Wen; Chiu, Wah; Hatfull, Graham F.; Hendrix, Roger W.; King, Jonathan

    2010-01-01

    Marine Synechococcus spp and marine Prochlorococcus spp are numerically dominant photoautotrophs in the open oceans and contributors to the global carbon cycle. Syn5 is a short-tailed cyanophage isolated from the Sargasso Sea on Synechococcus strain WH8109. Syn5 has been grown in WH8109 to high titer in the laboratory and purified and concentrated retaining infectivity. Genome sequencing and annotation of Syn5 revealed that the linear genome is 46,214bp with a 237bp terminal direct repeat. Sixty-one open reading frames (ORFs) were identified. Based on genomic organization and sequence similarity to known protein sequences within GenBank, Syn5 shares features with T7-like phages. The presence of a putative integrase suggests access to a temperate life-cycle. Assignment of eleven ORFs to structural proteins found within the phage virion was confirmed by mass-spectrometry and N-terminal sequencing. Eight of these identified structural proteins exhibited amino acid sequence similarity to enteric phage proteins. The remaining three virion proteins did not resemble any known phage sequences in GenBank as of August 2006. Cryoelectron micrographs of purified Syn5 virions revealed that the capsid has a single “horn”, a novel fibrous structure protruding from the opposing end of the capsid from the tail of the virion. The tail appendage displayed an apparent three-fold rather than six-fold symmetry. An 18Å-resolution icosahedral reconstruction of the capsid revealed a T=7 lattice, but with an unusual pattern of surface knobs. This phage/host system should allow detailed investigation of the physiology and biochemistry of phage propagation in marine photosynthetic bacteria. PMID:17383677

  18. Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.

    PubMed

    Neuwald, Andrew F; Altschul, Stephen F

    2016-12-01

    Over evolutionary time, members of a superfamily of homologous proteins sharing a common structural core diverge into subgroups filling various functional niches. At the sequence level, such divergence appears as correlations that arise from residue patterns distinct to each subgroup. Such a superfamily may be viewed as a population of sequences corresponding to a complex, high-dimensional probability distribution. Here we model this distribution as hierarchical interrelated hidden Markov models (hiHMMs), which describe these sequence correlations implicitly. By characterizing such correlations one may hope to obtain information regarding functionally-relevant properties that have thus far evaded detection. To do so, we infer a hiHMM distribution from sequence data using Bayes' theorem and Markov chain Monte Carlo (MCMC) sampling, which is widely recognized as the most effective approach for characterizing a complex, high dimensional distribution. Other routines then map correlated residue patterns to available structures with a view to hypothesis generation. When applied to N-acetyltransferases, this reveals sequence and structural features indicative of functionally important, yet generally unknown biochemical properties. Even for sets of proteins for which nothing is known beyond unannotated sequences and structures, this can lead to helpful insights. We describe, for example, a putative coenzyme-A-induced-fit substrate binding mechanism mediated by arginine residue switching between salt bridge and π-π stacking interactions. A suite of programs implementing this approach is available (psed.igs.umaryland.edu).

  19. Whole-exome sequencing reveals the spectrum of gene mutations and the clonal evolution patterns in paediatric acute myeloid leukaemia.

    PubMed

    Shiba, Norio; Yoshida, Kenichi; Shiraishi, Yuichi; Okuno, Yusuke; Yamato, Genki; Hara, Yusuke; Nagata, Yasunobu; Chiba, Kenichi; Tanaka, Hiroko; Terui, Kiminori; Kato, Motohiro; Park, Myoung-Ja; Ohki, Kentaro; Shimada, Akira; Takita, Junko; Tomizawa, Daisuke; Kudo, Kazuko; Arakawa, Hirokazu; Adachi, Souichi; Taga, Takashi; Tawa, Akio; Ito, Etsuro; Horibe, Keizo; Sanada, Masashi; Miyano, Satoru; Ogawa, Seishi; Hayashi, Yasuhide

    2016-11-01

    Acute myeloid leukaemia (AML) is a molecularly and clinically heterogeneous disease. Targeted sequencing efforts have identified several mutations with diagnostic and prognostic values in KIT, NPM1, CEBPA and FLT3 in both adult and paediatric AML. In addition, massively parallel sequencing enabled the discovery of recurrent mutations (i.e. IDH1/2 and DNMT3A) in adult AML. In this study, whole-exome sequencing (WES) of 22 paediatric AML patients revealed mutations in components of the cohesin complex (RAD21 and SMC3), BCORL1 and ASXL2 in addition to previously known gene mutations. We also revealed intratumoural heterogeneities in many patients, implicating multiple clonal evolution events in the development of AML. Furthermore, targeted deep sequencing in 182 paediatric AML patients identified three major categories of recurrently mutated genes: cohesin complex genes [STAG2, RAD21 and SMC3 in 17 patients (8·3%)], epigenetic regulators [ASXL1/ASXL2 in 17 patients (8·3%), BCOR/BCORL1 in 7 patients (3·4%)] and signalling molecules. We also performed WES in four patients with relapsed AML. Relapsed AML evolved from one of the subclones at the initial phase and was accompanied by many additional mutations, including common driver mutations that were absent or existed only with lower allele frequency in the diagnostic samples, indicating a multistep process causing leukaemia recurrence. © 2016 John Wiley & Sons Ltd.

  20. Sequence stratigraphy of the Triassic in the Barentsz Sea

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Skjold, L.JU.; Van Veen, P.M.; Gjelberg, J.

    1990-05-01

    A regional study of the Triassic in the Barentsz Sea (20-32°E, 71-74°N) revealed sequences that correlate seismically for hundreds of kilometers. Recent offshore drilling results enabled them to establish a biostratigraphic time framework. Comparisons with information from onshore outcrops (such as the Svalbard Archipelago) aided the piecing together of these superregional sequences. Seismic character analysis identified three units with composite progradational patterns (Induan, Olenekian, and Anisian). Fluvial, deltaic, and marine deposits can be distinguished and located relative to the paleocoastlines. Corresponding downlap surfaces suggest the development of condensed intervals, predicted to consist of organic-rich source rocks, as was later confirmed by drilling. Regional predictions based on this sequence-stratigraphic approach have proved valuable when correlating and evaluating well information. The sequences identified also help define third-order sea level curves for the area; these improve published curves thought to have global significance.

  1. Genetic diversity of Clostridium perfringens type A isolates from animals, food poisoning outbreaks and sludge

    PubMed Central

    Johansson, Anders; Aspan, Anna; Bagge, Elisabeth; Båverud, Viveca; Engström, Björn E; Johansson, Karl-Erik

    2006-01-01

    Background Clostridium perfringens, a serious pathogen, causes enteric diseases in domestic animals and food poisoning in humans. The epidemiological relationship between C. perfringens isolates from the same source has previously been investigated chiefly by pulsed-field gel electrophoresis (PFGE). In this study the genetic diversity of C. perfringens isolated from various animals, from food poisoning outbreaks and from sludge was investigated. Results We used PFGE to examine the genetic diversity of 95 C. perfringens type A isolates from eight different sources. The isolates were also examined for the presence of the beta2 toxin gene (cpb2) and the enterotoxin gene (cpe). The cpb2 gene from the 28 cpb2-positive isolates was also partially sequenced (519 bp, corresponding to positions 188 to 706 in the consensus cpb2 sequence). The results of PFGE revealed a wide genetic diversity among the C. perfringens type A isolates. The genetic relatedness of the isolates ranged from 58 to 100% and 56 distinct PFGE types were identified. Almost all clusters with similar patterns comprised isolates with a known epidemiological correlation. Most of the isolates from pig, horse and sheep carried the cpb2 gene. All isolates originating from food poisoning outbreaks carried the cpe gene and three of these also carried cpb2. Two evolutionarily different populations were identified by sequence analysis of the partially sequenced cpb2 genes from our study and cpb2 sequences previously deposited in GenBank. Conclusion As revealed by PFGE, there was a wide genetic diversity among C. perfringens isolates from different sources. Epidemiologically related isolates showed a high genetic similarity, as expected, while isolates with no obvious epidemiological relationship expressed a lesser degree of genetic similarity. The wide diversity revealed by PFGE was not reflected in the 16S rRNA sequences, which had a considerable degree of sequence similarity. 
Sequence comparison of the partially sequenced cpb2 gene revealed two genetically different populations. This is to our knowledge the first study in which the genetic diversity of C. perfringens isolates from different animal species, from food poisoning outbreaks and from sludge has been investigated. PMID:16737528

  2. Arnica (Asteraceae) phylogeny revisited using RPB2: complex patterns and multiple d-paralogues.

    PubMed

    Ekenäs, Catarina; Heidari, Nahid; Andreasen, Katarina

    2012-08-01

    The region coding for the second largest subunit of RNA polymerase II (RPB2) was explored for resolving interspecific relationships in Arnica and lower level taxa in general. The region between exons 17 and 23 was cloned and sequenced for 33 accessions of Arnica and four outgroup taxa. Three paralogues of the RPB2-d copy (RPB2-dA, B and C) were detected in Arnica and outgroup taxa, indicating that the duplications must have occurred before the divergence of Arnica. Parsimony and Bayesian analyses of separate alignments of the three copies reveal complex patterns in Arnica, likely reflecting a history of lineage sorting in combination with apomixis, polyploidization, and possibly hybridization. Cloned sequences of some taxa do not form monophyletic clades within paralogues, but form multiple strongly supported clades with sequences of other taxa. Some well supported groups are present in more than one paralogue and many groups are in line with earlier hypotheses regarding interspecific relationships within the genus. Low levels of homoplasy in combination with relatively high sequence variation indicate that the introns of the RPB2 region could be suitable for phylogenetic studies in low level taxonomy. Copyright © 2012. Published by Elsevier Inc.

  3. A Spiking Neural Network System for Robust Sequence Recognition.

    PubMed

    Yu, Qiang; Yan, Rui; Tang, Huajin; Tan, Kay Chen; Li, Haizhou

    2016-03-01

    This paper proposes a biologically plausible network architecture with spiking neurons for sequence recognition. This architecture is a unified and consistent system with functional parts of sensory encoding, learning, and decoding. This is the first systematic model attempting to reveal the neural mechanisms considering both the upstream and the downstream neurons together. The whole system is a consistent temporal framework, where the precise timing of spikes is employed for information processing and cognitive computing. Experimental results show that the system is competent to perform the sequence recognition, being robust to noisy sensory inputs and invariant to changes in the intervals between input stimuli within a certain range. The classification ability of the temporal learning rule used in the system is investigated through two benchmark tasks that outperform the other two widely used learning rules for classification. The results also demonstrate the computational power of spiking neurons over perceptrons for processing spatiotemporal patterns. In summary, the system provides a general way with spiking neurons to encode external stimuli into spatiotemporal spikes, to learn the encoded spike patterns with temporal learning rules, and to decode the sequence order with downstream neurons. The system structure would be beneficial for developments in both hardware and software.

  4. Focal expression of mutant huntingtin in the songbird basal ganglia disrupts cortico-basal ganglia networks and vocal sequences

    PubMed Central

    Tanaka, Masashi; Singh Alvarado, Jonnathan; Murugan, Malavika; Mooney, Richard

    2016-01-01

    The basal ganglia (BG) promote complex sequential movements by helping to select elementary motor gestures appropriate to a given behavioral context. Indeed, Huntington’s disease (HD), which causes striatal atrophy in the BG, is characterized by hyperkinesia and chorea. How striatal cell loss alters activity in the BG and downstream motor cortical regions to cause these disorganized movements remains unknown. Here, we show that expressing the genetic mutation that causes HD in a song-related region of the songbird BG destabilizes syllable sequences and increases overall vocal activity, but leaves the structure of individual syllables intact. These behavioral changes are paralleled by the selective loss of striatal neurons and reduction of inhibitory synapses on pallidal neurons that serve as the BG output. Chronic recordings in singing birds revealed disrupted temporal patterns of activity in pallidal neurons and downstream cortical neurons. Moreover, reversible inactivation of the cortical neurons rescued the disorganized vocal sequences in transfected birds. These findings shed light on a key role of temporal patterns of cortico-BG activity in the regulation of complex motor sequences and show how a genetic mutation alters cortico-BG networks to cause disorganized movements. PMID:26951661

  5. Molecular and phenotypic characterization of Listeria monocytogenes from U.S. Department of Agriculture Food Safety and Inspection Service surveillance of ready-to-eat foods and processing facilities.

    PubMed

    Ward, Todd J; Evans, Peter; Wiedmann, Martin; Usgaard, Thomas; Roof, Sherry E; Stroika, Steven G; Hise, Kelley

    2010-05-01

    A panel of 501 Listeria monocytogenes isolates obtained from the U.S. Department of Agriculture Food Safety and Inspection Service monitoring programs for ready-to-eat (RTE) foods were subtyped by multilocus genotyping (MLGT) and by sequencing the virulence gene inlA, which codes for internalin. MLGT analyses confirmed that clonal lineages associated with previous epidemic outbreaks were rare (7.6%) contaminants of RTE meat and poultry products and their production environments. Conversely, sequence analyses revealed mutations leading to 11 different premature stop codons (PMSCs) in inlA, including three novel PMSC mutations, and revealed that the frequency of these virulence-attenuating mutations among RTE isolates (48.5%) was substantially higher than previously appreciated. Significant differences (P < 0.001) in the frequency of inlA PMSCs were observed between lineages and between major serogroups, which could partially explain differences in association of these subtypes with human listeriosis. Interrogation of single-nucleotide polymorphisms responsible for PMSCs in inlA improved strain resolution among isolates with the 10 most common pulsed-field gel electrophoresis (PFGE) patterns, 8 of which included isolates with a PMSC in inlA. The presence or absence of PMSCs in inlA accounted for significant differences (P < 0.05) in Caco-2 invasion efficiencies among isolates with identical PFGE patterns, and the proportion of PulseNet entries from clinical sources was significantly higher (P < 0.001) for PFGE patterns exclusively from isolates with full-length inlA. These results indicated that integration of PFGE and DNA sequence-based subtyping provides an improved framework for prediction of relative risk associated with L. monocytogenes strains from RTE foods.

  6. Phylodynamics of the HIV-1 CRF02_AG clade in Cameroon

    PubMed Central

    Faria, Nuno Rodrigues; Suchard, Marc A; Abecasis, Ana; Sousa, J. D.; Ndembi, Nicaise; Camacho, R.J.; Vandamme, Anne-Mieke; Peeters, Martine; Lemey, Philippe

    2015-01-01

    Evolutionary analyses have revealed an origin of pandemic HIV-1 group M in the Congo River basin in the first part of the XXth century, but the patterns of historical viral spread in or around its epicentre remain largely unexplored. Here, we combine epidemiologic and molecular sequence data to investigate the spatiotemporal patterns of the CRF02_AG clade. By explicitly integrating prevalence counts and genetic population size estimates we date the epidemic emergence of CRF02_AG at 1973.1 (1972.1, 1975.3 95% CI). To infer their phylogeographic signature at a regional scale, we analyze pol and env time-stamped sequence data from 8 countries using a Bayesian phylogeographic approach based on a discrete asymmetric model. Our data confirm a spatial origin of this clade in the Democratic Republic of Congo (DRC) and suggest that viral dissemination to Cameroon occurred at an early stage of the evolutionary history of CRF02_AG. We find considerable support for epidemiological linkage between neighbouring countries. Compilation of ethnographic data suggests that well-supported viral migration was related to chance exportation events rather than to sustained human migratory flows. Finally, using sequence data from 15 locations in Cameroon, we use relaxed random walk models to explore the spatiotemporal dynamics of CRF02_AG at a finer geographical detail. Phylogeographic dispersal in continuous space reveals that at least two distinct CRF02_AG lineages are circulating in overlapping regions that are evolving at different evolutionary and diffusion rates. Altogether, by combining molecular and epidemiological data, our results provide a time scale for CRF02_AG, place its spatial root within the putative root of group-M diversity and propose a scenario for the spatiotemporal patterns of a successful HIV-1 lineage both at a regional and country-scale. PMID:21565285

  7. High-resolution SAR11 ecotype dynamics at the Bermuda Atlantic Time-series Study site by phylogenetic placement of pyrosequences

    PubMed Central

    Vergin, Kevin L; Beszteri, Bánk; Monier, Adam; Cameron Thrash, J; Temperton, Ben; Treusch, Alexander H; Kilpert, Fabian; Worden, Alexandra Z; Giovannoni, Stephen J

    2013-01-01

    Advances in next-generation sequencing technologies are providing longer nucleotide sequence reads that contain more information about phylogenetic relationships. We sought to use this information to understand the evolution and ecology of bacterioplankton at our long-term study site in the Western Sargasso Sea. A bioinformatics pipeline called PhyloAssigner was developed to align pyrosequencing reads to a reference multiple sequence alignment of 16S ribosomal RNA (rRNA) genes and assign them phylogenetic positions in a reference tree using a maximum likelihood algorithm. Here, we used this pipeline to investigate the ecologically important SAR11 clade of Alphaproteobacteria. A combined set of 2.7 million pyrosequencing reads from the 16S rRNA V1–V2 regions, representing 9 years at the Bermuda Atlantic Time-series Study (BATS) site, was quality checked and parsed into a comprehensive bacterial tree, yielding 929 036 Alphaproteobacteria reads. Phylogenetic structure within the SAR11 clade was linked to seasonally recurring spatiotemporal patterns. This analysis resolved four new SAR11 ecotypes in addition to five others that had been described previously at BATS. The data support a conclusion reached previously that the SAR11 clade diversified by subdivision of niche space in the ocean water column, but the new data reveal a more complex pattern in which deep branches of the clade diversified repeatedly across depth strata and seasonal regimes. The new data also revealed the presence of an unrecognized clade of Alphaproteobacteria, here named SMA-1 (Sargasso Mesopelagic Alphaproteobacteria, group 1), in the upper mesopelagic zone. The high-resolution phylogenetic analyses performed herein highlight significant, previously unknown, patterns of evolutionary diversification, within perhaps the most widely distributed heterotrophic marine bacterial clade, and strongly links to ecosystem regimes. PMID:23466704

  8. High-resolution SAR11 ecotype dynamics at the Bermuda Atlantic Time-series Study site by phylogenetic placement of pyrosequences.

    PubMed

    Vergin, Kevin L; Beszteri, Bánk; Monier, Adam; Thrash, J Cameron; Temperton, Ben; Treusch, Alexander H; Kilpert, Fabian; Worden, Alexandra Z; Giovannoni, Stephen J

    2013-07-01

    Advances in next-generation sequencing technologies are providing longer nucleotide sequence reads that contain more information about phylogenetic relationships. We sought to use this information to understand the evolution and ecology of bacterioplankton at our long-term study site in the Western Sargasso Sea. A bioinformatics pipeline called PhyloAssigner was developed to align pyrosequencing reads to a reference multiple sequence alignment of 16S ribosomal RNA (rRNA) genes and assign them phylogenetic positions in a reference tree using a maximum likelihood algorithm. Here, we used this pipeline to investigate the ecologically important SAR11 clade of Alphaproteobacteria. A combined set of 2.7 million pyrosequencing reads from the 16S rRNA V1-V2 regions, representing 9 years at the Bermuda Atlantic Time-series Study (BATS) site, was quality checked and parsed into a comprehensive bacterial tree, yielding 929 036 Alphaproteobacteria reads. Phylogenetic structure within the SAR11 clade was linked to seasonally recurring spatiotemporal patterns. This analysis resolved four new SAR11 ecotypes in addition to five others that had been described previously at BATS. The data support a conclusion reached previously that the SAR11 clade diversified by subdivision of niche space in the ocean water column, but the new data reveal a more complex pattern in which deep branches of the clade diversified repeatedly across depth strata and seasonal regimes. The new data also revealed the presence of an unrecognized clade of Alphaproteobacteria, here named SMA-1 (Sargasso Mesopelagic Alphaproteobacteria, group 1), in the upper mesopelagic zone. The high-resolution phylogenetic analyses performed herein highlight significant, previously unknown, patterns of evolutionary diversification, within perhaps the most widely distributed heterotrophic marine bacterial clade, and strongly links to ecosystem regimes.

  9. Thermodynamics of complexity and pattern manipulation

    NASA Astrophysics Data System (ADS)

    Garner, Andrew J. P.; Thompson, Jayne; Vedral, Vlatko; Gu, Mile

    2017-04-01

    Many organisms capitalize on their ability to predict the environment to maximize available free energy and reinvest this energy to create new complex structures. This functionality relies on the manipulation of patterns—temporally ordered sequences of data. Here, we propose a framework to describe pattern manipulators—devices that convert thermodynamic work to patterns or vice versa—and use them to build a "pattern engine" that facilitates a thermodynamic cycle of pattern creation and consumption. We show that the least heat dissipation is achieved by the provably simplest devices, the ones that exhibit desired operational behavior while maintaining the least internal memory. We derive the ultimate limits of this heat dissipation and show that it is generally nonzero and connected with the pattern's intrinsic crypticity—a complexity theoretic quantity that captures the puzzling difference between the amount of information the pattern's past behavior reveals about its future and the amount one needs to communicate about this past to optimally predict the future.

  10. Newly discovered young CORE-SINEs in marsupial genomes.

    PubMed

    Munemasa, Maruo; Nikaido, Masato; Nishihara, Hidenori; Donnellan, Stephen; Austin, Christopher C; Okada, Norihiro

    2008-01-15

    Although recent mammalian genome projects have uncovered a large part of genomic component of various groups, several repetitive sequences still remain to be characterized and classified for particular groups. The short interspersed repetitive elements (SINEs) distributed among marsupial genomes are one example. We have identified and characterized two new SINEs from marsupial genomes that belong to the CORE-SINE family, characterized by a highly conserved "CORE" domain. PCR and genomic dot blot analyses revealed that the distribution of each SINE shows distinct patterns among the marsupial genomes, implying different timing of their retroposition during the evolution of marsupials. The members of Mar3 (Marsupialia 3) SINE are distributed throughout the genomes of all marsupials, whereas the Mac1 (Macropodoidea 1) SINE is distributed specifically in the genomes of kangaroos. Sequence alignment of the Mar3 SINEs revealed that they can be further divided into four subgroups, each of which has diagnostic nucleotides. The insertion patterns of each SINE at particular genomic loci, together with the distribution patterns of each SINE, suggest that the Mar3 SINEs have intensively amplified after the radiation of diprotodontians, whereas the Mac1 SINE has amplified only slightly after the divergence of hypsiprimnodons from other macropods. By compiling the information of CORE-SINEs characterized to date, we propose a comprehensive picture of how SINE evolution occurred in the genomes of marsupials.

  11. Phylogenetic relationships among Synallaxini spinetails (Aves: Furnariidae) reveal a new biogeographic pattern across the Amazon and Paraná river basins.

    PubMed

    Claramunt, Santiago

    2014-09-01

    Relationships among genera in the tribe Synallaxini have proved difficult to resolve. In this study, I investigate relationships among Synallaxis, Certhiaxis and Schoeniophylax using DNA sequences from the mitochondrion and three nuclear regions. I implemented novel primers and protocols for amplifying and sequencing autosomal and sex-linked introns in Furnariidae that resolved basal relationships in the Synallaxini with strong support. Synallaxis propinqua is sister to Schoeniophylax phryganophilus, and together they form a clade with Certhiaxis. The results are robust to analytical approaches when all genomic regions are analyzed jointly (parsimony, maximum likelihood, and species-tree analysis) and the same basal relationships are recovered by most genomic regions when analyzed separately. A sister relationship between S. propinqua, an Amazonian river island specialist, and S. phryganophilus, from the Paraná River basin region, reveals a new biogeographic pattern shared by at least four other pairs of taxa with similar distributions and ecologies. Estimates of divergence times for these five pairs span from the late Miocene to the Pleistocene. Identification of the historical events that produced this pattern is difficult and further advances will require additional studies of the taxa involved and a better understanding of the recent environmental history of South America. A new classification is proposed for the Synallaxini, including the description of a new genus for S. propinqua. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.

    PubMed

    Sakai, Ryo; Aerts, Jan

    2014-01-01

    The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
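
    The "amount of information present at every position" that a sequence logo conveys can be illustrated with a short calculation. The sketch below is not the authors' Processing software; it assumes the standard logo formula (maximum bits minus Shannon entropy of the column), omits any small-sample correction, and treats gap characters as ordinary symbols. The function name column_information and the example alignment are hypothetical.

```python
import math
from collections import Counter


def column_information(alignment, alphabet_size=4):
    """Per-column information content in bits for equal-length aligned sequences.

    Assumption: information = log2(alphabet_size) - Shannon entropy of the column,
    with no small-sample correction. Use alphabet_size=20 for amino acids.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same aligned length")
    max_bits = math.log2(alphabet_size)
    info = []
    for i in range(length):
        counts = Counter(seq[i] for seq in alignment)
        total = sum(counts.values())
        entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
        info.append(max_bits - entropy)
    return info


# Hypothetical four-sequence nucleotide alignment.
example = ["ACGT", "ACGA", "ACTT", "AGGT"]
bits = column_information(example)
```

    A fully conserved column reaches the maximum (2 bits for nucleotides), and more variable columns score lower. Comparing two such profiles side by side is the kind of task for which the abstract says the plain logo falls short.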

  13. Sequence analysis of the pyruvylated galactan sulfate-derived oligosaccharides by negative-ion electrospray tandem mass spectrometry.

    PubMed

    Li, Na; Mao, Wenjun; Liu, Xue; Wang, Shuyao; Xia, Zheng; Cao, Sujian; Li, Lin; Zhang, Qi; Liu, Shan

    2016-10-04

    Five sulfated oligosaccharide fragments, F1-F5, were prepared from a pyruvylated galactan sulfate from the green alga Codium divaricatum, by partial depolymerization using mild acid hydrolysis and purification with gel-permeation chromatography. Negative-ion electrospray tandem mass spectrometry with collision-induced dissociation (ES-CID-MS/MS) is attempted for sequence determination of the sulfated oligosaccharides. The sequence of F1 with homogeneous disaccharide composition was first characterized to be Galp-(4SO4)-(1 → 3)-Galp by detailed nuclear magnetic resonance spectroscopic analyses. The fragmentation pattern of F1 in the product ion spectra was established on the basis of negative-ion ES-CID MS/MS, which was then applied to sequence analysis of other sulfated oligosaccharides. The sequences of F2 and F3 were deduced to be Galp-(4SO4)-(1 → 3)-Galp-(1 → 3)-Galp-(1 → 3)-Galp and 3,4-O-(1-carboxyethylidene)-Galp-(6SO4)-(1 → 3)-Galp, respectively. The sequences of major fragments in F4 and F5 were also deduced. The investigation demonstrated that negative-ion ES-CID-MS/MS was an efficient method for the sequence analysis of the pyruvylated galactan sulfate-derived oligosaccharides which revealed the patterns of substitution and glycosidic linkages. The pyruvylated galactan sulfate-derived oligosaccharides were novel sulfated oligosaccharides different from other algal polysaccharide-derived oligosaccharides. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Syntax-induced pattern deafness

    PubMed Central

    Endress, Ansgar D.; Hauser, Marc D.

    2009-01-01

    Perceptual systems often force systematically biased interpretations upon sensory input. These interpretations are obligatory, inaccessible to conscious control, and prevent observers from perceiving alternative percepts. Here we report a similarly impenetrable phenomenon in the domain of language, where the syntactic system prevents listeners from detecting a simple perceptual pattern. Healthy human adults listened to three-word sequences conforming to patterns readily learned even by honeybees, rats, and sleeping human neonates. Specifically, sequences either started or ended with two words from the same syntactic category (e.g., noun–noun–verb or verb–verb–noun). Although participants readily processed the categories and learned repetition patterns over nonsyntactic categories (e.g., animal–animal–clothes), they failed to learn the repetition pattern over syntactic categories, even when explicitly instructed to look for it. Further experiments revealed that participants successfully learned the repetition patterns only when they were consistent with syntactically possible structures, irrespective of whether these structures were attested in English or in other languages unknown to the participants. When the repetition patterns did not match such syntactically possible structures, participants failed to learn them. Our results suggest that when human adults hear a string of nouns and verbs, their syntactic system obligatorily attempts an interpretation (e.g., in terms of subjects, objects, and predicates). As a result, subjects fail to perceive the simpler pattern of repetitions—a form of syntax-induced pattern deafness that is reminiscent of how other perceptual systems force specific interpretations upon sensory input. PMID:19920182
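
    The repetition patterns described above (two same-category words at the start or at the end of a three-word sequence) can be stated precisely in a few lines. This is only an illustration of the stimulus structure, not the experimental procedure; the lexicon, the category labels, and the function name repetition_pattern are hypothetical.

```python
def repetition_pattern(categories):
    """Classify a three-item category sequence as 'AAB', 'ABB', or 'other'."""
    if len(categories) != 3:
        raise ValueError("expected exactly three categories")
    first, second, third = categories
    if first == second and second != third:
        return "AAB"  # repetition at the start, e.g. noun-noun-verb
    if second == third and first != second:
        return "ABB"  # repetition at the end, e.g. verb-noun-noun
    return "other"


# Hypothetical lexicon mapping words to syntactic categories.
LEXICON = {"dog": "noun", "table": "noun", "run": "verb", "eat": "verb"}


def classify_words(words):
    """Look up each word's category, then classify the resulting pattern."""
    return repetition_pattern([LEXICON[w] for w in words])


start_repeat = classify_words(["dog", "table", "run"])  # noun-noun-verb
end_repeat = classify_words(["dog", "run", "eat"])      # noun-verb-verb
```

    The same function applies unchanged to nonsyntactic categories such as animal-animal-clothes, which is what makes the participants' failure on syntactic categories notable.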

  15. The paradox of HBV evolution as revealed from a 16th century mummy

    PubMed Central

    Duggan, Ana T.; Poinar, Debi; Poinar, Hendrik N.

    2018-01-01

    Hepatitis B virus (HBV) is a ubiquitous viral pathogen associated with large-scale morbidity and mortality in humans. However, there is considerable uncertainty over the time-scale of its origin and evolution. Initial shotgun data from a mid-16th century Italian child mummy, that was previously paleopathologically identified as having been infected with Variola virus (VARV, the agent of smallpox), showed no DNA reads for VARV yet did for hepatitis B virus (HBV). Previously, electron microscopy provided evidence for the presence of VARV in this sample, although similar analyses conducted here did not reveal any VARV particles. We attempted to enrich and sequence for both VARV and HBV DNA. Although we did not recover any reads identified as VARV, we were successful in reconstructing an HBV genome at 163.8X coverage. Strikingly, both the HBV sequence and that of the associated host mitochondrial DNA displayed a nearly identical cytosine deamination pattern near the termini of DNA fragments, characteristic of an ancient origin. In contrast, phylogenetic analyses revealed a close relationship between the putative ancient virus and contemporary HBV strains (of genotype D), at first suggesting contamination. In addressing this paradox we demonstrate that HBV evolution is characterized by a marked lack of temporal structure. This confounds attempts to use molecular clock-based methods to date the origin of this virus over the time-frame sampled so far, and means that phylogenetic measures alone cannot yet be used to determine HBV sequence authenticity. If genuine, this phylogenetic pattern indicates that the genotypes of HBV diversified long before the 16th century, and enables comparison of potential pathogenic similarities between modern and ancient HBV. These results have important implications for our understanding of the emergence and evolution of this common viral pathogen. PMID:29300782

  16. Multilocus Sequence Analysis of Nectar Pseudomonads Reveals High Genetic Diversity and Contrasting Recombination Patterns

    PubMed Central

    Álvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M.

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas ‘sensu stricto’ isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria. PMID:24116076

  17. Y chromosomal haplotype characteristics of domestic sheep (Ovis aries) in China.

    PubMed

    Wang, Yutao; Xu, Lei; Yan, Wei; Li, Shaobin; Wang, Jiqing; Liu, Xiu; Hu, Jiang; Luo, Yuzhu

    2015-07-10

    Investigating variation in the male-specific region of the Y chromosome provides valuable information for understanding the origin and evolution of domestic sheep. One SNP, OY1 (g.88A>G), in the upstream region of the SRY gene, and the microsatellite locus SRYM18 on the ovine Y chromosome were analyzed in one hundred and forty-five samples collected from eleven breeds in China. SNP OY1 was analyzed using the PCR-SSCP method and sequencing. Two different PCR-SSCP patterns were observed, which sequence analysis showed to correspond to the two alleles of SNP OY1 (g.88A>G); allele A-OY1 was the most common (82.8%). Sequencing of the SRYM18 region revealed one novel size fragment (A2) with different repetitive units. Seven haplotypes (H4, H5, H6, H7, H8, H9 and H12) and two novel haplotypes (Ha and Hb) were established using combined genotype analysis. H6 showed the highest frequency (43.4%) across all breeds, and H8 the second highest (24.1%). Ha was only found in one breed (Tan), while Hb was present in three breeds (Gansu alpine, White Suffolk and Duolang). Our findings reveal one novel allele in the SRYM18 region and two novel male haplotypes of domestic sheep in China. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Reprint of: Initial uncertainty impacts statistical learning in sound sequence processing.

    PubMed

    Todd, Juanita; Provost, Alexander; Whitson, Lisa; Mullens, Daniel

    2018-05-18

    This paper features two studies confirming a lasting impact of first learning on how subsequent experience is weighted in early relevance-filtering processes. In both studies participants were exposed to sequences of sound that contained a regular pattern on two different timescales. Regular patterning in sound is readily detected by the auditory system and used to form "prediction models" that define the most likely properties of sound to be encountered in a given context. The presence and strength of these prediction models is inferred from changes in automatically elicited components of auditory evoked potentials. Both studies employed sound sequences that contained both a local and longer-term pattern. The local pattern was defined by a regular repeating pure tone occasionally interrupted by a rare deviating tone (p=0.125) that was physically different (a 30 ms vs. 60 ms duration difference in one condition and a 1000 Hz vs. 1500 Hz frequency difference in the other). The longer-term pattern was defined by the rate at which the two tones alternated probabilities (i.e., the tone that was first rare became common and the tone that was first common became rare). There was no task related to the tones and participants were asked to ignore them while focussing attention on a movie with subtitles. Auditory-evoked potentials revealed long lasting modulatory influences based on whether the tone was initially encountered as rare and unpredictable or common and predictable. The results are interpreted as evidence that probability (or indeed predictability) assigns a differential information-value to the two tones that in turn affects the extent to which prediction models are updated and imposed. These effects are exposed for both common and rare occurrences of the tones. The studies contribute to a body of work that reveals that probabilistic information is not faithfully represented in these early evoked potentials and instead exposes that predictability (or conversely uncertainty) may trigger value-based learning modulations even in task-irrelevant incidental learning. Copyright © 2017 IBRO. Published by Elsevier Ltd. All rights reserved.

  19. Distinct biological subtypes and patterns of genome evolution in lymphoma revealed by circulating tumor DNA.

    PubMed

    Scherer, Florian; Kurtz, David M; Newman, Aaron M; Stehr, Henning; Craig, Alexander F M; Esfahani, Mohammad Shahrokh; Lovejoy, Alexander F; Chabon, Jacob J; Klass, Daniel M; Liu, Chih Long; Zhou, Li; Glover, Cynthia; Visser, Brendan C; Poultsides, George A; Advani, Ranjana H; Maeda, Lauren S; Gupta, Neel K; Levy, Ronald; Ohgami, Robert S; Kunder, Christian A; Diehn, Maximilian; Alizadeh, Ash A

    2016-11-09

    Patients with diffuse large B cell lymphoma (DLBCL) exhibit marked diversity in tumor behavior and outcomes, yet the identification of poor-risk groups remains challenging. In addition, the biology underlying these differences is incompletely understood. We hypothesized that characterization of mutational heterogeneity and genomic evolution using circulating tumor DNA (ctDNA) profiling could reveal molecular determinants of adverse outcomes. To address this hypothesis, we applied cancer personalized profiling by deep sequencing (CAPP-Seq) analysis to tumor biopsies and cell-free DNA samples from 92 lymphoma patients and 24 healthy subjects. At diagnosis, the amount of ctDNA was found to strongly correlate with clinical indices and was independently predictive of patient outcomes. We demonstrate that ctDNA genotyping can classify transcriptionally defined tumor subtypes, including DLBCL cell of origin, directly from plasma. By simultaneously tracking multiple somatic mutations in ctDNA, our approach outperformed immunoglobulin sequencing and radiographic imaging for the detection of minimal residual disease and facilitated noninvasive identification of emergent resistance mutations to targeted therapies. In addition, we identified distinct patterns of clonal evolution distinguishing indolent follicular lymphomas from those that transformed into DLBCL, allowing for potential noninvasive prediction of histological transformation. Collectively, our results demonstrate that ctDNA analysis reveals biological factors that underlie lymphoma clinical outcomes and could facilitate individualized therapy. Copyright © 2016, American Association for the Advancement of Science.

  20. Chronodes: Interactive Multifocus Exploration of Event Sequences

    PubMed Central

    POLACK, PETER J.; CHEN, SHANG-TSE; KAHNG, MINSUK; DE BARBARO, KAYA; BASOLE, RAHUL; SHARMIN, MOUSHUMI; CHAU, DUEN HORNG

    2018-01-01

    The advent of mobile health (mHealth) technologies challenges the capabilities of current visualizations, interactive tools, and algorithms. We present Chronodes, an interactive system that unifies data mining and human-centric visualization techniques to support explorative analysis of longitudinal mHealth data. Chronodes extracts and visualizes frequent event sequences that reveal chronological patterns across multiple participant timelines of mHealth data. It then combines novel interaction and visualization techniques to enable multifocus event sequence analysis, which allows health researchers to interactively define, explore, and compare groups of participant behaviors using event sequence combinations. Through summarizing insights gained from a pilot study with 20 behavioral and biomedical health experts, we discuss Chronodes’s efficacy and potential impact in the mHealth domain. Ultimately, we outline important open challenges in mHealth, and offer recommendations and design guidelines for future research. PMID:29515937
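
    The idea of extracting frequent event sequences across participant timelines can be sketched as follows. This is not the Chronodes algorithm, which the abstract does not specify; it is a minimal stand-in that counts contiguous event runs of a fixed length and keeps those found in at least a minimum number of timelines. The function name, its parameters, and the example timelines are hypothetical.

```python
from collections import Counter


def frequent_subsequences(timelines, length=2, min_support=2):
    """Return contiguous event runs of the given length and their support.

    Support is the number of timelines containing the run at least once.
    """
    support = Counter()
    for events in timelines:
        # A set ensures each run is counted once per timeline.
        seen = {
            tuple(events[i:i + length])
            for i in range(len(events) - length + 1)
        }
        support.update(seen)
    return {run: n for run, n in support.items() if n >= min_support}


# Hypothetical participant timelines of logged events.
timelines = [
    ["wake", "walk", "eat", "sleep"],
    ["wake", "walk", "sleep"],
    ["eat", "sleep", "wake"],
]
patterns = frequent_subsequences(timelines, length=2, min_support=2)
```

    Counting support per timeline rather than per occurrence keeps one very active participant from dominating the result, which matters when groups of participants are being compared.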

  1. Fine-scale phylogenetic architecture of a complex bacterial community.

    PubMed

    Acinas, Silvia G; Klepac-Ceraj, Vanja; Hunt, Dana E; Pharino, Chanathip; Ceraj, Ivica; Distel, Daniel L; Polz, Martin F

    2004-07-29

    Although molecular data have revealed the vast scope of microbial diversity, two fundamental questions remain unanswered even for well-defined natural microbial communities: how many bacterial types co-exist, and are such types naturally organized into phylogenetically discrete units of potential ecological significance? It has been argued that without such information, the environmental function, population biology and biogeography of microorganisms cannot be rigorously explored. Here we address these questions by comprehensive sampling of two large 16S ribosomal RNA clone libraries from a coastal bacterioplankton community. We show that compensation for artefacts generated by common library construction techniques reveals fine-scale patterns of community composition. At least 516 ribotypes (unique rRNA sequences) were detected in the sample and, by statistical extrapolation, at least 1,633 co-existing ribotypes in the sampled population. More than 50% of the ribotypes fall into discrete clusters containing less than 1% sequence divergence. This pattern cannot be accounted for by interoperon variation, indicating a large predominance of closely related taxa in this community. We propose that such microdiverse clusters arise by selective sweeps and persist because competitive mechanisms are too weak to purge diversity from within them.
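
    The notion of "discrete clusters containing less than 1% sequence divergence" can be made concrete with a small sketch. The abstract does not describe the authors' clustering procedure, so the code below assumes single-linkage grouping of pre-aligned, equal-length sequences by uncorrected pairwise distance; the function names, the threshold default, and the toy sequences are hypothetical.

```python
def p_distance(a, b):
    """Fraction of differing positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def cluster_by_divergence(seqs, threshold=0.01):
    """Single-linkage clusters: link any pair whose distance is below threshold.

    Returns a sorted list of clusters, each a list of sequence indices.
    """
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(seqs)):
        for j in range(i + 1, len(seqs)):
            if p_distance(seqs[i], seqs[j]) < threshold:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy 200-nt sequences: one near-identical variant and one divergent sequence.
base = "ACGT" * 50
variant = base[:10] + "T" + base[11:]   # 1 mismatch  -> 0.5% divergence
distant = "T" * 10 + base[10:]          # 8 mismatches -> 4% divergence
clusters = cluster_by_divergence([base, variant, distant])
```

    The all-pairs comparison is quadratic in the number of sequences, which is acceptable for clone libraries of a few thousand sequences but would need indexing or heuristics for larger data sets.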

  2. Effect of oxygen minimum zone formation on communities of marine protists.

    PubMed

    Orsi, William; Song, Young C; Hallam, Steven; Edgcomb, Virginia

    2012-08-01

    Changes in ocean temperature and circulation patterns compounded by human activities are leading to oxygen minimum zone (OMZ) expansion with concomitant alteration in nutrient and climate active trace gas cycling. Here, we report the response of microbial eukaryote populations to seasonal changes in water column oxygen-deficiency using Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia, as a model ecosystem. We combine small subunit ribosomal RNA gene sequencing approaches with multivariate statistical methods to reveal shifts in operational taxonomic units during successive stages of seasonal stratification and renewal. A meta-analysis is used to identify common and unique patterns of community composition between Saanich Inlet and the anoxic/sulfidic Cariaco Basin (Venezuela) and Framvaren Fjord (Norway) to show shared and unique responses of microbial eukaryotes to oxygen and sulfide in these three environments. Our analyses also reveal temporal fluctuations in rare populations of microbial eukaryotes, particularly anaerobic ciliates, that may be of significant importance to the biogeochemical cycling of methane in OMZs. Eukaryotic 18S rRNA gene sequences recovered from the Saanich Inlet water column were deposited in GenBank under accession numbers HQ864863–HQ871151.

  3. Comparative analysis of taxonomic, functional, and metabolic patterns of microbiomes from 14 full-scale biogas reactors by metagenomic sequencing and radioisotopic analysis.

    PubMed

    Luo, Gang; Fotidis, Ioannis A; Angelidaki, Irini

    2016-01-01

    Biogas production is a very complex process because of the diversity and interactions of the microorganisms mediating it. Only limited and diffuse knowledge exists about how the taxonomic and functional patterns of microbiomes vary across different biogas reactors, and about their relationships with the metabolic patterns. The present study used metagenomic sequencing and radioisotopic analysis to assess the taxonomic, functional, and metabolic patterns of microbiomes from 14 full-scale biogas reactors operated under various conditions and treating either sludge or manure. The results from metagenomic analysis showed that the dominant methanogenic pathway revealed by radioisotopic analysis was not always correlated with the taxonomic and functional compositions. Radioisotopic experiments found that the aceticlastic methanogenic pathway was dominant, while metagenomic analysis showed a higher relative abundance of hydrogenotrophic methanogens. Principal coordinates analysis showed that the sludge-based samples were clearly distinct from the manure-based samples for both taxonomic and functional patterns, and canonical correspondence analysis showed that both temperature and free ammonia were crucial environmental variables shaping the taxonomic and functional patterns. The study further showed that the overall patterns of functional genes were strongly correlated with the overall patterns of taxonomic composition across different biogas reactors. A discrepancy was found between the metabolic patterns determined by metagenomic analysis and the metabolic pathways determined by radioisotopic analysis. In addition, a clear correlation between taxonomic and functional patterns was demonstrated for biogas reactors, and the environmental factors shaping both taxonomic and functional gene patterns were identified.

  4. Encephalitozoonosis in two inland bearded dragons (Pogona vitticeps).

    PubMed

    Richter, B; Csokai, J; Graner, I; Eisenberg, T; Pantchev, N; Eskens, H U; Nedorost, N

    2013-02-01

    Microsporidiosis is reported rarely in reptiles. Sporadic multisystemic granulomatous disease of captive bearded dragons (Pogona vitticeps) has been associated with microsporidia showing Encephalitozoon-like morphology. Two such cases are described herein. Both animals displayed clinical signs suggestive of renal failure. Necropsy examination revealed granulomatous lesions in the liver and adrenal area in both animals, and in several other organs in one animal. The lesions were associated with intracellular protozoa consistent with microsporidia. Ultrastructural examination of the organisms revealed morphology similar to Encephalitozoon spp. Immunohistochemistry and chromogenic in-situ hybridization for Encephalitozoon cuniculi were positive in both animals. Nucleotide sequencing of the partial small subunit ribosomal RNA gene and the complete internal transcribed spacer (ITS) region revealed high similarity with published E. cuniculi sequences in both animals. However, the ITS region showed a GTTT-repeat pattern distinct from mammalian E. cuniculi strains. This may be a novel E. cuniculi strain associated with multisystemic granulomatous disease in bearded dragons. Copyright © 2012 Elsevier Ltd. All rights reserved.

  5. Genetic population structure in the yellow mongoose, Cynictis penicillata.

    PubMed

    Van Vuuren, B J; Robinson, T J

    1997-12-01

    Phylogeographic structure was determined for the yellow mongoose, Cynictis penicillata, using mtDNA RFLPs and control region sequences. The RFLP analysis revealed 13 haplotypes which showed weak geographical patterning consistent with a recent range expansion from a refugial population(s). An analysis of molecular variance (AMOVA) revealed no correspondence between mtDNA phylogeography and subspecies delimitation, nor between matrilines and areas characterized by a high incidence of the viverrid-type rabies, of which the yellow mongoose is the principal vector. The lack of structure was also shown by control region sequences although four of the maternal lineages shared a near-perfect 81 bp repeat. We speculate that regional hot spots of the viverrid rabies biotype reflect population density differences in the yellow mongoose that are not underscored by genetic partitioning, at least at the level of resolution provided by our analyses.

  6. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

    PubMed

    Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R

    2001-11-23

    Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.

  7. Promoter mapping of the mouse Tcp-10bt gene in transgenic mice identifies essential male germ cell regulatory sequences.

    PubMed

    Ewulonu, U K; Snyder, L; Silver, L M; Schimenti, J C

    1996-03-01

    Transgenic mice were generated to localize essential promoter elements in the mouse testis-expressed Tcp-10 genes. These genes are expressed exclusively in male germ cells, and exhibit a diffuse range of transcriptional start sites, possibly due to the absence of a TATA box. A series of transgene constructs containing different amounts of 5' flanking DNA revealed that all sequences necessary for appropriate temporal and tissue-specific transcription of Tcp-10 reside between positions -1 and -973. All transgenic animals containing these sequences expressed a chimeric transgene at high levels, in a pattern that paralleled the endogenous genes. These experiments further defined a 227-bp fragment from -746 to -973 that was absolutely essential for expression. In a gel-shift assay, this 227-bp fragment bound nuclear protein from testis, but not other tissues, to yield two retarded bands. Sequence analysis of this fragment revealed a half-site for the AP-2 transcription factor recognition sequence. Gel-shift assays using native or mutant oligonucleotides demonstrated that the putative AP-2 recognition sequence was essential for generating the retarded bands. Since the binding activity is testis-specific, but AP-2 expression is not exclusive to male germ cells, it is possible that transcription of Tcp-10 requires interaction between AP-2 and a germ cell-specific transcription factor.

  8. Molecular characterization of Myf5 and comparative expression patterns of myogenic regulatory factors in Siniperca chuatsi.

    PubMed

    Zhu, Xin; Li, Yu-Long; Liu, Li; Wang, Jian-Hua; Li, Hong-Hui; Wu, Ping; Chu, Wu-Ying; Zhang, Jian-She

    2016-01-01

    Myogenic regulatory factors (MRFs) are muscle-specific basic helix-loop-helix (bHLH) transcription factors that play an essential role in regulating skeletal muscle development and growth. To characterize Myf5 at the molecular level and compare the expression patterns of the four MRFs, we cloned the Myf5 cDNA sequence and analyzed the MRF expression patterns using quantitative real-time polymerase chain reaction in Chinese perch (Siniperca chuatsi). Sequence analysis indicated that Chinese perch Myf5 and other MRFs shared a highly conserved bHLH domain with those of other vertebrates. Sequence alignment and a phylogenetic tree showed that Chinese perch MRFs had the highest identity with the MRFs of Epinephelus coioides. Spatio-temporal expression patterns revealed that the MRFs were primarily expressed in muscle, especially in white muscle. During embryonic development, Myf5, MyoD and MyoG mRNAs increased steeply at the neurula stage, and their highest expression level was predominantly observed at the hatching period, whereas the highest expression level of MRF4 was observed at the muscular effect stage. The expression patterns during post-embryonic development showed that the Myf5, MyoD and MyoG mRNAs were highest at 90 days post-hatching (dph). Furthermore, starvation and refeeding results showed that the transcription of the MRFs in the fast skeletal muscle of Chinese perch responded quickly to a single meal after 7 days of fasting. This indicates that the MRFs might contribute to muscle recovery after refeeding in Chinese perch. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Horizontal Transfer of Segments of the 16S rRNA Genes between Species of the Streptococcus anginosus Group

    PubMed Central

    Schouls, Leo M.; Schot, Corrie S.; Jacobs, Jan A.

    2003-01-01

    The nature of variation in the 16S rRNA gene of members of the Streptococcus anginosus group was investigated by hybridization and DNA sequencing. A collection of 708 strains was analyzed by reverse line blot hybridization. This revealed the presence of distinct reaction patterns representing 11 different hybridization groups. The 16S rRNA genes of two strains of each hybridization group were sequenced to near-completion, and the sequence data confirmed the reverse line blot hybridization results. Closer inspection of the sequences revealed mosaic-like structures, strongly suggesting horizontal transfer of segments of the 16S rRNA gene between different species belonging to the Streptococcus anginosus group. Southern blot hybridization further showed that within a single strain all copies of the 16S rRNA gene had the same composition, indicating that the apparent mosaic structures were not PCR-induced artifacts. These findings indicate that the highly conserved rRNA genes are also subject to recombination and that these events may be fixed in the population. Such recombination may lead to the construction of incorrect phylogenetic trees based on the 16S rRNA genes. PMID:14645285

  10. Sequencing of Single Pollen Nuclei Reveals Meiotic Recombination Events at Megabase Resolution and Circumvents Segregation Distortion Caused by Postmeiotic Processes

    PubMed Central

    Dreissig, Steven; Fuchs, Jörg; Himmelbach, Axel; Mascher, Martin; Houben, Andreas

    2017-01-01

    Meiotic recombination is a fundamental mechanism to generate novel allelic combinations which can be harnessed by breeders to achieve crop improvement. The recombination landscape of many crop species, including the major crop barley, is characterized by a dearth of recombination in 65% of the genome. In addition, segregation distortion caused by selection on genetically linked loci is a frequent and undesirable phenomenon in double haploid populations which hampers genetic mapping and breeding. Here, we present an approach to directly investigate recombination at the DNA sequence level by combining flow-sorting of haploid pollen nuclei of barley with single-cell genome sequencing. We confirm the skewed distribution of recombination events toward distal chromosomal regions at megabase resolution and show that segregation distortion is almost absent if directly measured in pollen. Furthermore, we show a bimodal distribution of inter-crossover distances, which supports the existence of two classes of crossovers which are sensitive or less sensitive to physical interference. We conclude that single pollen nuclei sequencing is an approach capable of revealing recombination patterns in the absence of segregation distortion. PMID:29018459

  11. Preparation of Meloidogyne javanica near-isogenic lines virulent and avirulent against the tomato resistance gene Mi and preliminary analyses of the genetic variation between the two lines.

    PubMed

    Xu, Jian-Hua; Narabu, Takashi; Li, Hong-Mei; Fu, Peng

    2002-01-01

    Meloidogyne javanica, which reproduces by mitotic parthenogenesis, is an economically important pathogen of a wide range of crops. A pair of near-isogenic lines virulent and avirulent toward the tomato resistance gene Mi were prepared for M. javanica by continuously selecting an avirulent population on the resistant tomato cultivar Momotaro over 19 generations. Random amplified polymorphic DNA (RAPD) analysis with 102 primers revealed that RAPD patterns were highly conserved between the virulent and avirulent lines, confirming that the two lines were genomically very similar. Nevertheless, with one of the primers a distinct polymorphic fragment, specific for the avirulent lines, was amplified. Southern hybridization results indicated that the polymorphic fragment and its homologs were deleted from the genome of the virulent line during the process of virulence acquisition. Sequence analysis and homology searches of public databases, however, revealed no published sequences significantly similar to the sequence of the fragment, precluding a prediction of the potential function of the sequence. The successful preparation of the near-isogenic Mi-virulent and avirulent lines laid a firm foundation for the further identification and isolation of virulence-related genes in M. javanica.

  12. Sexual Selection of Protamine 1 in Mammals.

    PubMed

    Lüke, Lena; Tourmente, Maximiliano; Roldan, Eduardo R S

    2016-01-01

    Protamines have a crucial role in male fertility. They are involved in sperm chromatin packaging and influence the shape of the sperm head and, hence, are important for sperm performance. Protamine structure is basic with numerous arginine-rich DNA-binding domains. Postcopulatory sexual selection is thought to play an important role in protamine sequence evolution and expression. Here, we analyze patterns of evolution and sexual selection (in the form of sperm competition) acting on protamine 1 gene sequence in 237 mammalian species. We assessed common patterns as well as differences between the major mammalian subclasses (Eutheria, Metatheria) and clades. We found that a high arginine content in protamine 1 associates with a lower sperm head width, which may have an impact on sperm swimming velocity. Increase in arginine content in protamine 1 across mammals appears to take place in a way consistent with sexual selection. In metatherians, increase in sequence length correlates with sexual selection. Differences in selective pressures on sequences and codon sites were observed between mammalian clades. Our study revealed a complex evolutionary pattern of protamine 1, with different selective constraints, and effects of sexual selection, between mammalian groups. In contrast, the effect of arginine content on head shape, and the possible involvement of sperm competition, was identified across all mammals. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. Transcriptional dynamics of the developing sweet cherry (Prunus avium L.) fruit: sequencing, annotation and expression profiling of exocarp-associated genes

    PubMed Central

    Alkio, Merianne; Jonas, Uwe; Declercq, Myriam; Van Nocker, Steven; Knoche, Moritz

    2014-01-01

    The exocarp, or skin, of fleshy fruit is a specialized tissue that protects the fruit, attracts seed-dispersing fruit eaters, and has great economic relevance for fruit quality. Development of the exocarp involves regulated activities of many genes. This research analyzed global gene expression in the exocarp of developing sweet cherry (Prunus avium L., ‘Regina’), a fruit crop species with few public genomic resources. A catalog of transcript models (contigs) representing expressed genes was constructed from de novo assembled short complementary DNA (cDNA) sequences generated from developing fruit between flowering and maturity at 14 time points. Expression levels in each sample were estimated for 34 695 contigs from numbers of reads mapping to each contig. Contigs were annotated functionally based on BLAST, gene ontology and InterProScan analyses. Coregulated genes were detected using partitional clustering of expression patterns. The results are discussed with emphasis on genes putatively involved in cuticle deposition, cell wall metabolism and sugar transport. The high temporal resolution of the expression patterns presented here reveals finely tuned developmental specialization of individual members of gene families. Moreover, the de novo assembled sweet cherry fruit transcriptome with 7760 full-length protein coding sequences and over 20 000 other, annotated cDNA sequences together with their developmental expression patterns is expected to accelerate molecular research on this important tree fruit crop. PMID:26504533

  14. Evidence for Sequence Scrambling and Divergent H/D Exchange Reactions of Doubly-Charged Isobaric b-Type Fragment Ions

    NASA Astrophysics Data System (ADS)

    Zekavat, Behrooz; Miladi, Mahsan; Al-Fdeilat, Abdullah H.; Somogyi, Arpad; Solouki, Touradj

    2014-02-01

    To date, only a limited number of reports are available on structural variants of multiply-charged b-fragment ions. We report on observed bimodal gas-phase hydrogen/deuterium exchange (HDX) reaction kinetics and patterns for substance P b10(2+) that point to the presence of isomeric structures. We also compare HDX reactions, post-ion mobility/collision-induced dissociation (post-IM/CID), and sustained off-resonance irradiation-collision induced dissociation (SORI-CID) of substance P b10(2+) and a cyclic peptide with an identical amino acid (AA) sequence order to substance P b10. The observed HDX patterns and reaction kinetics and SORI-CID pattern for the doubly charged head-to-tail cyclized peptide were different from either of the presumed isomers of substance P b10(2+), suggesting that b10(2+) may not exist exclusively as a head-to-tail cyclized structure. Ultra-high mass measurement accuracy was used to assign identities of the observed SORI-CID fragment ions of substance P b10(2+); over 30% of the observed SORI-CID fragment ions from substance P b10(2+) had rearranged (scrambled) AA sequences. Moreover, post-IM/CID experiments revealed the presence of two conformer types for substance P b10(2+), whereas only one conformer type was observed for the head-to-tail cyclized peptide. We also show that AA sequence scrambling from CID of doubly-charged b-fragment ions is not unique to substance P b10(2+).

  15. Evidence for sequence scrambling and divergent H/D exchange reactions of doubly-charged isobaric b-type fragment ions.

    PubMed

    Zekavat, Behrooz; Miladi, Mahsan; Al-Fdeilat, Abdullah H; Somogyi, Arpad; Solouki, Touradj

    2014-02-01

    To date, only a limited number of reports are available on structural variants of multiply-charged b-fragment ions. We report on observed bimodal gas-phase hydrogen/deuterium exchange (HDX) reaction kinetics and patterns for substance P b10(2+) that point to the presence of isomeric structures. We also compare HDX reactions, post-ion mobility/collision-induced dissociation (post-IM/CID), and sustained off-resonance irradiation-collision induced dissociation (SORI-CID) of substance P b10(2+) and a cyclic peptide with an identical amino acid (AA) sequence order to substance P b10. The observed HDX patterns and reaction kinetics and SORI-CID pattern for the doubly charged head-to-tail cyclized peptide were different from either of the presumed isomers of substance P b10(2+), suggesting that b10(2+) may not exist exclusively as a head-to-tail cyclized structure. Ultra-high mass measurement accuracy was used to assign identities of the observed SORI-CID fragment ions of substance P b10(2+); over 30% of the observed SORI-CID fragment ions from substance P b10(2+) had rearranged (scrambled) AA sequences. Moreover, post-IM/CID experiments revealed the presence of two conformer types for substance P b10(2+), whereas only one conformer type was observed for the head-to-tail cyclized peptide. We also show that AA sequence scrambling from CID of doubly-charged b-fragment ions is not unique to substance P b10(2+).

  16. Computational identification of epitopes in the glycoproteins of novel bunyavirus (SFTS virus) recognized by a human monoclonal antibody (MAb 4-5)

    NASA Astrophysics Data System (ADS)

    Zhang, Wenshuai; Zeng, Xiaoyan; Zhang, Li; Peng, Haiyan; Jiao, Yongjun; Zeng, Jun; Treutlein, Herbert R.

    2013-06-01

    In this work, we have developed a new approach to predict the epitopes of antigens that are recognized by a specific antibody. Our method is based on the "multiple copy simultaneous search" (MCSS) approach, which identifies optimal locations of small chemical functional groups on the surface of the antibody and then identifies sequence patterns of peptides that can bind to that surface. The identified sequence patterns are then used to search the amino-acid sequence of the antigen protein. The approach was validated by reproducing the binding epitope of the HIV gp120 envelope glycoprotein for the human neutralizing antibody as revealed in the available crystal structure. Our method was then applied to predict the epitopes of two glycoproteins of a newly discovered bunyavirus recognized by an antibody named MAb 4-5. These predicted epitopes can be verified by experimental methods. We also discuss the involvement of different amino acids in the antigen-antibody recognition based on the distributions of MCSS minima of different functional groups.

  17. Molecular characterization of the probiotic strain Bacillus cereus var. toyoi NCIMB 40112 and differentiation from food poisoning strains.

    PubMed

    Klein, Günter

    2011-07-01

    Bacillus cereus var. toyoi strain NCIMB 40112 (Toyocerin), a probiotic authorized in the European Union as feed additive for swine, bovines, poultry, and rabbits, was characterized by DNA fingerprinting applying pulsed-field gel electrophoresis and multilocus sequence typing and was compared with reference strains (of clinical and environmental origins). The probiotic strain was clearly characterized by pulsed-field gel electrophoresis using the restriction enzymes Apa I and Sma I resulting in unique DNA patterns. The comparison to the clinical reference strain B. cereus DSM 4312 was done with the same restriction enzymes, and again a clear differentiation of the two strains was possible by the resulting DNA patterns. The use of the restriction enzymes Apa I and Sma I is recommended for further studies. Furthermore, multilocus sequence typing analysis revealed a sequence type (ST 111) that was different from all known STs of B. cereus strains from food poisoning incidents. Thus, a strain characterization and differentiation from food poisoning strains for the probiotic strain was possible. Copyright ©, International Association for Food Protection

  18. Commensurability-driven structural defects in double emulsions produced with two-step microfluidic techniques.

    PubMed

    Schmit, Alexandre; Salkin, Louis; Courbin, Laurent; Panizza, Pascal

    2014-07-14

    The combination of two drop makers such as flow focusing geometries or T-junctions is commonly used in microfluidics to fabricate monodisperse double emulsions and novel fluid-based materials. Here we investigate the physics of the encapsulation of small droplets inside large drops that is at the core of such processes. The number of droplets per drop studied over time for large sequences of consecutive drops reveals that the dynamics of these systems are complex: we find a succession of well-defined elementary patterns and defects. We present a simple model based on a discrete approach that predicts the nature of these patterns and their non-trivial scheme of arrangement in a sequence as a function of the ratio of the two timescales of the problem, the production times of droplets and drops. Experiments validate our model as they concur very well with predictions.
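
    The abstract does not spell out its discrete model, so the following is only a toy sketch of how a timescale ratio can generate repeating droplet-count patterns. It assumes (our assumption, not the authors') that droplets arrive at a fixed period, that each drop captures every droplet arriving during its own production window, and that the first droplet arrives at time zero. The function name and parameters are hypothetical.

```python
from fractions import Fraction
from math import ceil


def droplets_per_drop(drop_period, droplet_period, n_drops):
    """Count droplets captured by each of n_drops consecutive drops.

    Toy assumption: droplet k arrives at time k * droplet_period and
    drop n captures all droplets arriving in [n * T, (n + 1) * T),
    where T is drop_period. Exact fractions avoid rounding artifacts.
    """
    ratio = Fraction(drop_period) / Fraction(droplet_period)
    # Droplets with index k in [n * ratio, (n + 1) * ratio) fall in drop n.
    return [ceil((n + 1) * ratio) - ceil(n * ratio) for n in range(n_drops)]


# A ratio of 5/2 alternates between two counts; 7/3 gives a period-3 pattern.
counts_half = droplets_per_drop(Fraction(5, 2), 1, 6)
counts_third = droplets_per_drop(Fraction(7, 3), 1, 6)
print(counts_half)   # [3, 2, 3, 2, 3, 2]
print(counts_third)  # [3, 2, 2, 3, 2, 2]
```

    In this sketch a rational ratio yields a periodic sequence of counts; whether this matches the elementary patterns and defects reported in the paper is not something the abstract lets us confirm.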

  19. Migratory flyway and geographical distance are barriers to the gene flow of influenza virus among North American birds

    USGS Publications Warehouse

    Lam, Tommy Tsan-Yuk; Ip, Hon S.; Ghedin, Elodie; Wentworth, David E.; Halpin, Rebecca A.; Stockwell, Timothy B.; Spiro, David J.; Dusek, Robert J.; Bortner, James B.; Hoskins, Jenny; Bales, Bradley D.; Yparraguirre, Dan R.; Holmes, Edward C.

    2012-01-01

    Despite the importance of migratory birds in the ecology and evolution of avian influenza virus (AIV), there is a lack of information on the patterns of AIV spread at the intra-continental scale. We applied a variety of statistical phylogeographic techniques to a plethora of viral genome sequence data to determine the strength, pattern and determinants of gene flow in AIV sampled from wild birds in North America. These analyses revealed a clear isolation-by-distance of AIV among sampling localities. In addition, we show that phylogeographic models incorporating information on the avian flyway of sampling proved a better fit to the observed sequence data than those specifying homogeneous or random rates of gene flow among localities. In sum, these data strongly suggest that the intra-continental spread of AIV by migratory birds is subject to major ecological barriers, including spatial distance and avian flyway.

  20. Systematic Observation of an Expert Driver's Gaze Strategy—An On-Road Case Study

    PubMed Central

    Lappi, Otto; Rinkkala, Paavo; Pekkanen, Jami

    2017-01-01

    In this paper we present and qualitatively analyze an expert driver's gaze behavior in natural driving on a real road, with no specific experimental task or instruction. Previous eye tracking research on naturalistic tasks has revealed recurring patterns of gaze behavior that are surprisingly regular and repeatable. Lappi (2016) identified in the literature seven “qualitative laws of gaze behavior in the wild”: recurring patterns that tend to go together, the more so the more naturalistic the setting, all of them expected in extended sequences of fully naturalistic behavior. However, no study to date has observed all in a single experiment. Here, we wanted to do just that: present observations supporting all the “laws” in a single behavioral sequence by a single subject. We discuss the laws in terms of unresolved issues in driver modeling and open challenges for experimental and theoretical development. PMID:28496422

  1. Analysis of a four generation family reveals the widespread sequence-dependent maintenance of allelic DNA methylation in somatic and germ cells

    PubMed Central

    Tang, Aifa; Huang, Yi; Li, Zesong; Wan, Shengqing; Mou, Lisha; Yin, Guangliang; Li, Ning; Xie, Jun; Xia, Yudong; Li, Xianxin; Luo, Liya; Zhang, Junwen; Chen, Shen; Wu, Song; Sun, Jihua; Sun, Xiaojuan; Jiang, Zhimao; Chen, Jing; Li, Yingrui; Wang, Jian; Wang, Jun; Cai, Zhiming; Gui, Yaoting

    2016-01-01

    Differential methylation of the homologous chromosomes, a well-known mechanism leading to genomic imprinting and X-chromosome inactivation, is widely reported at the non-imprinted regions on autosomes. To evaluate the transgenerational DNA methylation patterns in humans, we analyzed the DNA methylomes of somatic and germ cells in a four-generation family. We found that allelic asymmetry of DNA methylation was pervasive at the non-imprinted loci and was likely regulated by cis-acting genetic variants. We also observed that the allelic methylation patterns for the vast majority of the cis-regulated loci were shared between the somatic and germ cells from the same individual. These results demonstrated the interaction between genetic and epigenetic variations and suggested the possibility of widespread sequence-dependent transmission of DNA methylation during spermatogenesis. PMID:26758766

  2. Association mining of dependency between time series

    NASA Astrophysics Data System (ADS)

    Hafez, Alaaeldin

    2001-03-01

    Time series analysis is considered a crucial component of strategic control across a broad variety of disciplines in business, science and engineering. Time series data is a sequence of observations collected over intervals of time. Each time series describes a phenomenon as a function of time. Analysis of time series data includes discovering trends (or patterns) in a time series sequence. In the last few years, data mining has emerged and been recognized as a new technology for data analysis. Data mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In this paper, we adapt and innovate data mining techniques to analyze time series data. By using data mining techniques, maximal frequent patterns are discovered and used in predicting future sequences or trends, where trends describe the behavior of a sequence. In order to include different types of time series (e.g. irregular and non-systematic), we consider past frequent patterns of the same time sequences (local patterns) and of other dependent time sequences (global patterns). We use the word 'dependent' instead of the word 'similar' to emphasize real-life time series where two time series sequences could be completely different (in values, shapes, etc.) but still react to the same conditions in a dependent way. In this paper, we propose the Dependence Mining Technique, which could be used in predicting time series sequences. The proposed technique consists of three phases: (a) for all time series sequences, generate their trend sequences; (b) discover maximal frequent trend patterns and generate pattern vectors (to keep information about frequent trend patterns); (c) use trend pattern vectors to predict future time series sequences.
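
    The first two phases described above can be illustrated with a minimal sketch. The abstract does not define how trends are encoded or how patterns are mined, so the symbols ('U', 'D', 'S'), the fixed pattern length, and both function names below are assumptions made purely for illustration. The sketch counts frequent fixed-length trend patterns, a simplification of the maximal frequent patterns the abstract mentions.

```python
from collections import Counter


def trend_sequence(values, tolerance=0.0):
    """Encode consecutive changes as 'U' (up), 'D' (down) or 'S' (steady)."""
    symbols = []
    for prev, curr in zip(values, values[1:]):
        delta = curr - prev
        if delta > tolerance:
            symbols.append("U")
        elif delta < -tolerance:
            symbols.append("D")
        else:
            symbols.append("S")
    return "".join(symbols)


def frequent_trend_patterns(trends, length, min_support):
    """Return trend substrings of a given length that occur in at least
    min_support of the trend sequences (each sequence counted once)."""
    support = Counter()
    for trend in trends:
        # A set ensures a pattern is counted at most once per sequence.
        seen = {trend[i:i + length] for i in range(len(trend) - length + 1)}
        support.update(seen)
    return {p: n for p, n in support.items() if n >= min_support}


# Two series with different values but the same behaviour, plus one outlier.
series = [
    [1, 2, 3, 2, 3, 4],
    [10, 12, 15, 11, 13, 18],
    [5, 5, 4, 3, 3, 2],
]
trends = [trend_sequence(s) for s in series]
patterns = frequent_trend_patterns(trends, length=3, min_support=2)
print(trends)    # ['UUDUU', 'UUDUU', 'SDDSD']
print(patterns)
```

    The first two series differ in every value yet share the trend sequence 'UUDUU', which mirrors the abstract's point about series that are "dependent" rather than "similar".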

  3. Petroleum system elements within the Late Cretaceous and Early Paleogene sediments of Nigeria's inland basins: An integrated sequence stratigraphic approach

    NASA Astrophysics Data System (ADS)

    Dim, Chidozie Izuchukwu Princeton; Onuoha, K. Mosto; Okeugo, Chukwudike Gabriel; Ozumba, Bertram Maduka

    2017-06-01

    Sequence stratigraphic studies have been carried out using subsurface well and 2D seismic data in the Late Cretaceous and Early Paleogene sediments of the Anambra Basin and the proximal onshore section of the Niger Delta Basin in southeastern Nigeria. The aim was to establish the stratigraphic framework for better understanding of the reservoir, source and seal rock presence and distribution in the basin. Thirteen stratigraphic bounding surfaces (consisting of six maximum flooding surfaces - MFSs and seven sequence boundaries - SBs) were recognized and calibrated using a newly modified chronostratigraphic chart. Stratigraphic surfaces were matched with corresponding foraminiferal and palynological biozones, aiding correlation across wells in this study. Well log sequence stratigraphic correlation reveals that stratal packages within the basin are segmented into six depositional sequences ranging in age from Late Cretaceous to Early Paleogene. Gross depositional environment maps generated at various MFSs show that sediment packages deposited within shelfal to deep marine settings reflect continuous rise and fall of sea levels within a regressive cycle. Each of these sequences consists of three system tracts (lowstand system tract - LST, transgressive system tract - TST and highstand system tract - HST) that are associated with mainly progradational and retrogradational sediment stacking patterns. Well correlation reveals that the sand and shale units of the LSTs, HSTs and TSTs, which constitute the reservoir and source/seal packages respectively, are laterally continuous and thicken basinwards due to structural influences. Interpretation of the seismic section reveals the presence of hanging wall, footwall, horst block and collapsed crest structures. These structural features generally aid migration and offer an entrapment mechanism for hydrocarbon accumulation. The combination of these reservoir, source, seal and trap elements forms a good petroleum system that is viable for hydrocarbon exploration and development.

  4. Evolution of rDNA in Nicotiana Allopolyploids: A Potential Link between rDNA Homogenization and Epigenetics

    PubMed Central

    Kovarik, Ales; Dadejova, Martina; Lim, Yoong K.; Chase, Mark W.; Clarkson, James J.; Knapp, Sandra; Leitch, Andrew R.

    2008-01-01

    Background The evolution and biology of rDNA have interested biologists for many years, in part, because of two intriguing processes: (1) nucleolar dominance and (2) sequence homogenization. We review patterns of evolution in rDNA in the angiosperm genus Nicotiana to determine consequences of allopolyploidy on these processes. Scope Allopolyploid species of Nicotiana are ideal for studying rDNA evolution because phylogenetic reconstruction of DNA sequences has revealed patterns of species divergence and their parents. From these studies we also know that polyploids formed over widely different timeframes (thousands to millions of years), enabling comparative and temporal studies of rDNA structure, activity and chromosomal distribution. In addition studies on synthetic polyploids enable the consequences of de novo polyploidy on rDNA activity to be determined. Conclusions We propose that rDNA epigenetic expression patterns established even in F1 hybrids have a material influence on the likely patterns of divergence of rDNA. It is the active rDNA units that are vulnerable to homogenization, which probably acts to reduce mutational load across the active array. Those rDNA units that are epigenetically silenced may be less vulnerable to sequence homogenization. Selection cannot act on these silenced genes, and they are likely to accumulate mutations and eventually be eliminated from the genome. It is likely that whole silenced arrays will be deleted in polyploids of 1 million years of age and older. PMID:18310159

  5. Understanding continental megathrust earthquake potential through geological mountain building processes: an example in Nepal Himalaya

    NASA Astrophysics Data System (ADS)

    Zhang, Huai; Zhang, Zhen; Wang, Liangshu; Leroy, Yves; Shi, Yaolin

    2017-04-01

    How to reconcile continental megathrust earthquake characteristics, for instance mapping large-great earthquake sequences onto the geological mountain building process and partitioning seismic and aseismic slip, is a fundamental and unresolved question. Here, we address these issues by focusing on a typical continental collisional belt, the great Nepal Himalaya. We first show that refined Nepal Himalaya thrusting sequences, with the scale of the large earthquake cycle accurately defined, provide new geodynamical hints on long-term earthquake potential, in association with either the seismic-aseismic slip partition up to the interpretation of the binary interseismic coupling pattern on the Main Himalayan Thrust (MHT), or the large-great earthquake classification via seismic cycle patterns on the MHT. Subsequently, sequential limit analysis is adopted to retrieve the detailed thrusting sequences of the Nepal Himalaya mountain wedge. Our model results exhibit an apparent thrusting concentration phenomenon with four thrusting clusters, termed thrusting 'families', which facilitate the development of the respective sub-structural regions. Within the hinterland thrusting family, the total aseismic shortening and the corresponding spatio-temporal release pattern are revealed by mapping projection, whereas in the other three families, mapping projection delivers long-term large (M<8)-great (M>8) earthquake recurrence information, including total lifespans, frequencies and large-great earthquake alternation information, by identifying rupture distances along the MHT. In addition, this partition is universal in continental-continental collisional orogenic belts with an identified interseismic coupling pattern, but is not applicable in a continental-oceanic megathrust context.

  6. Base changes in tumour DNA have the power to reveal the causes and evolution of cancer

    DOE PAGES

    Hollstein, M.; Alexandrov, L. B.; Wild, C. P.; ...

    2016-06-06

    Next-generation sequencing (NGS) technology has demonstrated that the cancer genomes are peppered with mutations. Although most somatic tumour mutations are unlikely to have any role in the cancer process per se, the spectra of DNA sequence changes in tumour mutation catalogues have the potential to identify the mutagens, and to reveal the mutagenic processes responsible for human cancer. Very recently, a novel approach for data mining of the vast compilations of tumour NGS data succeeded in separating and precisely defining at least 30 distinct patterns of sequence change hidden in mutation databases. At least half of these mutational signatures can be readily assigned to known human carcinogenic exposures or endogenous mechanisms of mutagenesis. A quantum leap in our knowledge of mutagenesis in human cancers has resulted, stimulating a flurry of research activity. We trace here the major findings leading first to the hypothesis that carcinogenic insults leave characteristic imprints on the DNA sequence of tumours, and culminating in empirical evidence from NGS data that well-defined carcinogen mutational signatures are indeed present in tumour genomic DNA from a variety of cancer types. The notion that tumour DNAs can divulge environmental sources of mutation is now a well-accepted fact. This approach to cancer aetiology has also incriminated various endogenous, enzyme-driven processes that increase the somatic mutation load in sporadic cancers. The tasks now confronting the field of molecular epidemiology are to assign mutagenic processes to orphan and newly discovered tumour mutation patterns, and to determine whether avoidable cancer risk factors influence signatures produced by endogenous enzymatic mechanisms. As a result, innovative research with experimental models and exploitation of the geographical heterogeneity in cancer incidence can address these challenges.

  7. Sequence and expression variation in SUPPRESSOR of OVEREXPRESSION of CONSTANS 1 (SOC1): homeolog evolution in Indian Brassicas.

    PubMed

    Sri, Tanu; Mayee, Pratiksha; Singh, Anandita

    2015-09-01

    Whole genome sequence analyses allow unravelling of evolutionary consequences of the meso-triplication event in Brassicaceae (∼14-20 million years ago (MYA)), such as differential gene fractionation and diversification in homeologous sub-genomes. This study presents a simple gene-centric approach involving microsynteny and natural genetic variation analysis for understanding SUPPRESSOR of OVEREXPRESSION of CONSTANS 1 (SOC1) homeolog evolution in Brassica. Analysis of microsynteny in Brassica rapa homeologous regions containing SOC1 revealed differential gene fractionation correlating to reported fractionation status of sub-genomes of origin, viz. least fractionated (LF), moderately fractionated 1 (MF1) and most fractionated (MF2), respectively. Screening 18 cultivars of 6 Brassica species led to the identification of 8 genomic and 27 transcript variants of SOC1, including splice-forms. Co-occurrence of both interrupted and intronless SOC1 genes was detected in a few Brassica species. In silico analysis characterised Brassica SOC1 as a MADS intervening, K-box, C-terminal (MIKC(C)) transcription factor, with highly conserved MADS and I domains relative to the K-box and C-terminal domains. Phylogenetic analyses and multiple sequence alignments depicting a shared pattern of silent/non-silent mutations assigned Brassica SOC1 homologs into groups based on shared diploid base genome. In addition, a sub-genome structure in uncharacterised Brassica genomes was inferred. Expression analysis of putative MF2 and LF (Brassica diploid base genome A (AA)) sub-genome-specific SOC1 homeologs of Brassica juncea revealed a near identical expression pattern. However, the MF2-specific homeolog exhibited significantly higher expression, implying regulatory diversification. In conclusion, evidence for polyploidy-induced sequence and regulatory evolution in Brassica SOC1 is presented, wherein differential homeolog expression is implicated in functional diversification.

  8. Base changes in tumour DNA have the power to reveal the causes and evolution of cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hollstein, M.; Alexandrov, L. B.; Wild, C. P.

    Next-generation sequencing (NGS) technology has demonstrated that the cancer genomes are peppered with mutations. Although most somatic tumour mutations are unlikely to have any role in the cancer process per se, the spectra of DNA sequence changes in tumour mutation catalogues have the potential to identify the mutagens, and to reveal the mutagenic processes responsible for human cancer. Very recently, a novel approach for data mining of the vast compilations of tumour NGS data succeeded in separating and precisely defining at least 30 distinct patterns of sequence change hidden in mutation databases. At least half of these mutational signatures can be readily assigned to known human carcinogenic exposures or endogenous mechanisms of mutagenesis. A quantum leap in our knowledge of mutagenesis in human cancers has resulted, stimulating a flurry of research activity. We trace here the major findings leading first to the hypothesis that carcinogenic insults leave characteristic imprints on the DNA sequence of tumours, and culminating in empirical evidence from NGS data that well-defined carcinogen mutational signatures are indeed present in tumour genomic DNA from a variety of cancer types. The notion that tumour DNAs can divulge environmental sources of mutation is now a well-accepted fact. This approach to cancer aetiology has also incriminated various endogenous, enzyme-driven processes that increase the somatic mutation load in sporadic cancers. The tasks now confronting the field of molecular epidemiology are to assign mutagenic processes to orphan and newly discovered tumour mutation patterns, and to determine whether avoidable cancer risk factors influence signatures produced by endogenous enzymatic mechanisms. As a result, innovative research with experimental models and exploitation of the geographical heterogeneity in cancer incidence can address these challenges.

  9. Genomic sequencing and in vivo footprinting of an expression-specific DNase I-hypersensitive site of avian vitellogenin II promoter reveal a demethylation of a mCpG and a change in specific interactions of proteins with DNA.

    PubMed Central

    Saluz, H P; Feavers, I M; Jiricny, J; Jost, J P

    1988-01-01

    Genomic sequencing was used to study the in vivo methylation pattern of two CpG sites in the promoter region of the avian vitellogenin gene. The CpG at position +10 was fully methylated in DNA isolated from tissues that do not express the gene but was unmethylated in the liver of mature hens and estradiol-treated roosters. In the latter tissue, this site became demethylated and DNase I hypersensitive after estradiol treatment. A second CpG (position -52) was unmethylated in all tissues examined. In vivo genomic footprinting with dimethyl sulfate revealed different patterns of DNA protection in silent and expressed genes. In rooster liver cells, at least 10 base pairs of DNA, including the methylated CpG, were protected by protein(s). Gel-shift assays indicated that a protein factor, present in rooster liver nuclear extract, bound at this site only when it was methylated. In hen liver cells, the same unmethylated CpG lies within a protected region of approximately 20 base pairs. In vitro DNase I protection and gel-shift assays indicate that this sequence is bound by a protein, which binds both double- and single-stranded DNA. For the latter substrate, this factor was shown to bind solely the noncoding (i.e., mRNA-like) strand. PMID:3413118

  10. Prasinoviruses reveal a complex evolutionary history and a patchy environmental distribution

    NASA Astrophysics Data System (ADS)

    Finke, J. F.; Suttle, C.

    2016-02-01

    Prasinophytes constitute a group of eukaryotic phytoplankton that has a global distribution and is a major component of coastal and oceanic communities. Members of this group are infected by large double-stranded DNA viruses that can be significant agents of mortality, and which show evidence of substantial horizontal transfer of genes from their hosts and other organisms. However, information on the genetic diversity of these viruses and their environmental distribution is limited. This study examines the genetic repertoire, phylogeny and environmental distribution of large double-stranded DNA viruses infecting Micromonas pusilla and other prasinophytes. The genomes of viruses infecting M. pusilla were sequenced and compared to those of viruses infecting other prasinophytes, revealing a relatively small set of core genes and a larger flexible pan genome. Comparing genomes among prasinoviruses highlights their variable genetic content and complex evolutionary history. While some of the pan genome is clearly host derived, many open reading frames are most similar to those found in other eukaryotes and bacteria. Gene content of the viruses is congruent with phylogenetic analysis of viral DNA polymerase sequences and indicates that two clades of M. pusilla viruses are less related to each other than to other prasinoviruses. Moreover, the environmental distribution of prasinovirus DNA polymerase sequences indicates a complex pattern of virus-host interactions in nature. Ultimately, these patterns are influenced by the genetic repertoire encoded by prasinoviruses, and the distribution of the hosts they infect.

  11. Marine turtle mitogenome phylogenetics and evolution.

    PubMed

    Duchene, Sebastián; Frey, Amy; Alfaro-Núñez, Alonzo; Dutton, Peter H; Gilbert, M Thomas P; Morin, Phillip A

    2012-10-01

    The sea turtles are a group of Cretaceous origin containing seven recognized living species: leatherback, hawksbill, Kemp's ridley, olive ridley, loggerhead, green, and flatback. The leatherback is the single member of the Dermochelyidae family, whereas all other sea turtles belong in Cheloniidae. Analyses of partial mitochondrial sequences and some nuclear markers have revealed phylogenetic inconsistencies within Cheloniidae, especially regarding the placement of the flatback. Population genetic studies based on D-Loop sequences have shown considerable structuring in species with broad geographic distributions, shedding light on complex migration patterns and possible geographic or climatic events as driving forces of sea-turtle distribution. We have sequenced complete mitogenomes for all sea-turtle species, including samples from their geographic range extremes, and performed phylogenetic analyses to assess sea-turtle evolution with a large molecular dataset. We found variation in the length of the ATP8 gene and a highly variable site in ND4 near a proton translocation channel in the resulting protein. Complete mitogenomes show strong support and resolution for phylogenetic relationships among all sea turtles, and reveal phylogeographic patterns within globally-distributed species. Although there was clear concordance between phylogenies and geographic origin of samples in most taxa, we found evidence of more recent dispersal events in the loggerhead and olive ridley turtles, suggesting more recent migrations (<1 Myr) in these species. Overall, our results demonstrate the complexity of sea-turtle diversity, and indicate the need for further research in phylogeography and molecular evolution. Published by Elsevier Inc.

  12. Phylogeography and Sex-Biased Dispersal across Riverine Manatee Populations (Trichechus inunguis and Trichechus manatus) in South America

    PubMed Central

    Satizábal, Paula; Mignucci-Giannoni, Antonio A.; Duchêne, Sebastián; Caicedo-Herrera, Dalila; Perea-Sicchar, Carlos M.; García-Dávila, Carmen R.; Trujillo, Fernando; Caballero, Susana J.

    2012-01-01

    Phylogeographic patterns and sex-biased dispersal were studied in riverine populations of West Indian (Trichechus manatus) and Amazonian manatees (T. inunguis) in South America, using 410 bp D-loop (Control Region, Mitochondrial DNA) sequences and 15 nuclear microsatellite loci. This multi-locus approach was key to disentangling complex patterns of gene flow among populations. D-loop analyses revealed population structuring among all Colombian rivers for T. manatus, while microsatellite data suggested no structure. Two main populations of T. inunguis separating the Colombian and Peruvian Amazon were supported by analysis of the D-loop and microsatellite data. Overall, we provide molecular evidence for differences in dispersal patterns between sexes, demonstrating male-biased gene flow dispersal in riverine manatees. These results are in contrast with previously reported levels of population structure shown by microsatellite data in marine manatee populations, revealing low habitat restrictions to gene flow in riverine habitats, and more significant dispersal limitations for males in marine environments. PMID:23285054

  13. Phylogeography and sex-biased dispersal across riverine manatee populations (Trichechus inunguis and Trichechus manatus) in South America.

    PubMed

    Satizábal, Paula; Mignucci-Giannoni, Antonio A; Duchêne, Sebastián; Caicedo-Herrera, Dalila; Perea-Sicchar, Carlos M; García-Dávila, Carmen R; Trujillo, Fernando; Caballero, Susana J

    2012-01-01

    Phylogeographic patterns and sex-biased dispersal were studied in riverine populations of West Indian (Trichechus manatus) and Amazonian manatees (T. inunguis) in South America, using 410 bp D-loop (Control Region, Mitochondrial DNA) sequences and 15 nuclear microsatellite loci. This multi-locus approach was key to disentangling complex patterns of gene flow among populations. D-loop analyses revealed population structuring among all Colombian rivers for T. manatus, while microsatellite data suggested no structure. Two main populations of T. inunguis separating the Colombian and Peruvian Amazon were supported by analysis of the D-loop and microsatellite data. Overall, we provide molecular evidence for differences in dispersal patterns between sexes, demonstrating male-biased gene flow dispersal in riverine manatees. These results are in contrast with previously reported levels of population structure shown by microsatellite data in marine manatee populations, revealing low habitat restrictions to gene flow in riverine habitats, and more significant dispersal limitations for males in marine environments.

  14. Sequence investigation of 34 forensic autosomal STRs with massively parallel sequencing.

    PubMed

    Zhang, Suhua; Niu, Yong; Bian, Yingnan; Dong, Rixia; Liu, Xiling; Bao, Yun; Jin, Chao; Zheng, Hancheng; Li, Chengtao

    2018-05-01

    STRs vary not only in the length of the repeat units and the number of repeats but also in the region with which they conform to an incremental repeat pattern. Massively parallel sequencing (MPS) offers new possibilities in the analysis of STRs since it can simultaneously sequence multiple targets in a single reaction and capture potential internal sequence variations. Here, we sequenced 34 STRs applied in the forensic community of China with a custom-designed panel. MPS performance was evaluated from sequencing reads analysis, concordance study and sensitivity testing. High coverage sequencing data were obtained to determine the constitute ratios and heterozygous balance. No actual inconsistent genotypes were observed between capillary electrophoresis (CE) and MPS, demonstrating the reliability of the panel and the MPS technology. With the sequencing data from the 200 investigated individuals, 346 and 418 alleles were obtained via CE and MPS technologies, respectively, at the 34 STRs, indicating that MPS technology provides higher discrimination than CE detection. The whole study demonstrated that STR genotyping with the custom panel and MPS technology has the potential not only to reveal length and sequence variations but also to satisfy the demands of high throughput and high multiplexing with acceptable sensitivity.

  15. Is multiple-sequence alignment required for accurate inference of phylogeny?

    PubMed

    Höhl, Michael; Ragan, Mark A

    2007-04-01

    The process of inferring phylogenetic trees from molecular sequences almost always starts with a multiple alignment of these sequences but can also be based on methods that do not involve multiple sequence alignment. Very little is known about the accuracy with which such alignment-free methods recover the correct phylogeny or about the potential for increasing their accuracy. We conducted a large-scale comparison of ten alignment-free methods, among them one new approach that does not calculate distances and a faster variant of our pattern-based approach; all distance-based alignment-free methods are freely available from http://www.bioinformatics.org.au (as Python package decaf+py). We show that most methods exhibit a higher overall reconstruction accuracy in the presence of high among-site rate variation. Under all conditions that we considered, variants of the pattern-based approach were significantly better than the other alignment-free methods. The new pattern-based variant achieved a speed-up of an order of magnitude in the distance calculation step, accompanied by a small loss of tree reconstruction accuracy. A method of Bayesian inference from k-mers did not improve on classical alignment-free (and distance-based) methods but may still offer other advantages due to its Bayesian nature. We found the optimal word length k of word-based methods to be stable across various data sets, and we provide parameter ranges for two different alphabets. The influence of these alphabets was analyzed to reveal a trade-off in reconstruction accuracy between long and short branches. We have mapped the phylogenetic accuracy for many alignment-free methods, among them several recently introduced ones, and increased our understanding of their behavior in response to biologically important parameters. In all experiments, the pattern-based approach emerged as superior, at the expense of higher resource consumption. Nonetheless, no alignment-free method that we examined recovers the correct phylogeny as accurately as does an approach based on maximum-likelihood distance estimates of multiply aligned sequences.
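
    The abstract above refers to word-based (k-mer) alignment-free distances with a word length k. As a rough illustration only, the sketch below computes a Euclidean distance between relative k-mer frequency vectors of two sequences. It is a generic textbook-style measure, not the pattern-based approach of the paper and not the API of the decaf+py package; the function names kmer_counts and kmer_distance are hypothetical.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping word of length k in seq (hypothetical helper)."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a: str, b: str, k: int = 3) -> float:
    """Euclidean distance between the relative k-mer frequencies of a and b.

    Assumption: a plain word-frequency distance, chosen for illustration;
    it is not one of the specific methods compared in the study.
    """
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    na, nb = sum(ca.values()), sum(cb.values())
    if na == 0 or nb == 0:
        raise ValueError("both sequences must be at least k characters long")
    # A Counter returns 0 for words that are absent from one sequence.
    words = set(ca) | set(cb)
    return sqrt(sum((ca[w] / na - cb[w] / nb) ** 2 for w in words))
```

    A matrix of such pairwise distances could then be passed to a distance-based tree-building method such as neighbor joining; the choice of k and of the alphabet is what the abstract reports as affecting accuracy.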

  16. Comparison of Five Major Trichome Regulatory Genes in Brassica villosa with Orthologues within the Brassicaceae

    PubMed Central

    Nayidu, Naghabushana K.; Kagale, Sateesh; Taheri, Ali; Withana-Gamage, Thushan S.; Parkin, Isobel A. P.; Sharpe, Andrew G.; Gruber, Margaret Y.

    2014-01-01

    Coding sequences for major trichome regulatory genes, including the positive regulators GLABRA 1 (GL1), GLABRA 2 (GL2), ENHANCER OF GLABRA 3 (EGL3), and TRANSPARENT TESTA GLABRA 1 (TTG1) and the negative regulator TRIPTYCHON (TRY), were cloned from wild Brassica villosa, which is characterized by dense trichome coverage over most of the plant. Transcript (FPKM) levels from RNA sequencing indicated much higher expression of the GL2 and TTG1 regulatory genes in B. villosa leaves compared with expression levels of GL1 and EGL3 genes in either B. villosa or the reference genome species, glabrous B. oleracea; however, cotyledon TTG1 expression was high in both species. RNA sequencing and Q-PCR also revealed an unusual expression pattern for the negative regulators TRY and CPC, which were much more highly expressed in trichome-rich B. villosa leaves than in glabrous B. oleracea leaves and in glabrous cotyledons from both species. The B. villosa TRY expression pattern also contrasted with TRY expression patterns in two diploid Brassica species, and with the Arabidopsis model for expression of negative regulators of trichome development. Further unique sequence polymorphisms, protein characteristics, and gene evolution studies highlighted specific amino acids in GL1 and GL2 coding sequences that distinguished glabrous species from hairy species and several variants that were specific for each B. villosa gene. Positive selection was observed for GL1 between hairy and non-hairy plants, and as expected the origin of the four expressed positive trichome regulatory genes in B. villosa was predicted to be from B. oleracea. In particular, the unpredicted expression patterns for TRY and CPC in B. villosa suggest additional characterization is needed to determine the function of the expanded families of trichome regulatory genes in more complex polyploid species within the Brassicaceae. PMID:24755905

  17. Synthesis of compact patterns for NMR relaxation decay in intelligent "electronic tongue" for analyzing heavy oil composition

    NASA Astrophysics Data System (ADS)

    Lapshenkov, E. M.; Volkov, V. Y.; Kulagin, V. P.

    2018-05-01

    The article is devoted to the problem of creating a pattern of the NMR sensor signal for subsequent recognition by an artificial neural network in the intelligent device "the electronic tongue". The specific problem of removing redundant data from the spin-spin relaxation signal pattern that is used as a source of information in analyzing the composition of oil and petroleum products is considered. A method is proposed that makes it possible to remove redundant data from the relaxation decay pattern without introducing additional distortion. This method is based on combining relaxation decay curve intervals whose increments are below the noise level such that the increment of the combined interval is above the noise level. In this case, the relaxation decay curve samples that are located inside the combined intervals are removed from the pattern. This method was tested on heavy-oil NMR signal patterns that were created by using the Carr-Purcell-Meiboom-Gill (CPMG) sequence for recording the relaxation process. The parameters of the CPMG sequence are: 100 μs time interval between 180° pulses and 0.4 s duration of measurement. As a result, it was revealed that the proposed method allowed the number of samples to be reduced approximately 15-fold (from 4000 to 270), and the maximum detected root mean square error (RMS error) equals 0.00239 (equivalent to a signal-to-noise ratio of 418).
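
    The interval-combining step described in the abstract above can be sketched roughly as follows. This is a minimal reading of the abstract, not the authors' implementation: consecutive samples are merged until the signal has changed by more than the noise level since the last retained sample, and the samples inside each merged interval are dropped. The function name compact_decay and the choice to always retain the final sample are assumptions made for illustration.

```python
def compact_decay(samples, noise_level):
    """Return the indices of the decay-curve samples to keep.

    Assumed interpretation: a sample is kept only when the signal differs
    from the last kept sample by more than noise_level; everything in
    between is treated as redundant and removed.
    """
    if not samples:
        return []
    kept = [0]  # always keep the first sample
    for i in range(1, len(samples)):
        if abs(samples[i] - samples[kept[-1]]) > noise_level:
            kept.append(i)
    # Assumption: keep the final sample so the curve's end point survives.
    if kept[-1] != len(samples) - 1:
        kept.append(len(samples) - 1)
    return kept


# Toy decay curve; with a noise level of 0.05 only three samples remain.
curve = [1.0, 0.99, 0.98, 0.9, 0.89, 0.5]
print(compact_decay(curve, noise_level=0.05))  # [0, 3, 5]
```

    The figures quoted in the abstract are mutually consistent: 0.4 s of measurement at one echo per 100 μs gives 4000 samples, and for a signal normalized to unit amplitude (an assumption), 1/0.00239 is approximately 418.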

  18. Mammalian genome projects reveal new growth hormone (GH) sequences. Characterization of the GH-encoding genes of armadillo (Dasypus novemcinctus), hedgehog (Erinaceus europaeus), bat (Myotis lucifugus), hyrax (Procavia capensis), shrew (Sorex araneus), ground squirrel (Spermophilus tridecemlineatus), elephant (Loxodonta africana), cat (Felis catus) and opossum (Monodelphis domestica).

    PubMed

    Wallis, Michael

    2008-01-15

    Mammalian growth hormone (GH) sequences have been shown previously to display episodic evolution: the sequence is generally strongly conserved but on at least two occasions during mammalian evolution (on lineages leading to higher primates and ruminants) bursts of rapid evolution occurred. However, the number of mammalian orders studied previously has been relatively limited, and the availability of sequence data via mammalian genome projects provides the potential for extending the range of GH gene sequences examined. Complete or nearly complete GH gene sequences for six mammalian species for which no data were previously available have been extracted from the genome databases: Dasypus novemcinctus (nine-banded armadillo), Erinaceus europaeus (western European hedgehog), Myotis lucifugus (little brown bat), Procavia capensis (cape rock hyrax), Sorex araneus (European shrew), Spermophilus tridecemlineatus (13-lined ground squirrel). In addition, incomplete data for several other species have been extended. Examination of the data in detail and comparison with previously available sequences has allowed assessment of the reliability of deduced sequences. Several of the new sequences differ substantially from the consensus sequence previously determined for eutherian GHs, indicating greater variability than previously recognised, and confirming the episodic pattern of evolution. The episodic pattern is not seen for signal sequences, 5' upstream sequence or synonymous substitutions; it is specific to the mature protein sequence, suggesting that it relates to the hormonal function. The substitutions accumulated during the course of GH evolution have occurred mainly on the side of the hormone facing away from the receptor, in a non-random fashion, and it is suggested that this may reflect interaction of the receptor-bound hormone with other proteins or small ligands.

  19. Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.

    PubMed

    Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M

    2010-12-15

    Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.

  20. Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays

    PubMed Central

    Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel

    2006-01-01

    Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921

  1. Functionally conserved cis-regulatory elements of COL18A1 identified through zebrafish transgenesis.

    PubMed

    Kague, Erika; Bessling, Seneca L; Lee, Josephine; Hu, Gui; Passos-Bueno, Maria Rita; Fisher, Shannon

    2010-01-15

    Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. Copyright 2009 Elsevier Inc. All rights reserved.

  2. Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius)

    PubMed Central

    Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

    2015-01-01

    Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002

  3. Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius).

    PubMed

    Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

    2015-06-01

    Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs.

  4. Structator: fast index-based search for RNA sequence-structure patterns

    PubMed Central

    2011-01-01

    Background: The secondary structure of RNA molecules is intimately related to their function and often more conserved than the sequence. Hence, the important task of searching databases for RNAs requires matching sequence-structure patterns. Unfortunately, current tools for this task have, in the best case, a running time that is only linear in the size of sequence databases. Furthermore, established index data structures for fast sequence matching, like suffix trees or arrays, cannot benefit from the complementarity constraints introduced by the secondary structure of RNAs. Results: We present a novel method and readily applicable software for time efficient matching of RNA sequence-structure patterns in sequence databases. Our approach is based on affix arrays, a recently introduced index data structure, preprocessed from the target database. Affix arrays support bidirectional pattern search, which is required for efficiently handling the structural constraints of the pattern. Structural patterns like stem-loops can be matched inside out, such that the loop region is matched first and then the pairing bases on the boundaries are matched consecutively. This allows base pairing information to be exploited for search space reduction and leads to an expected running time that is sublinear in the size of the sequence database. The incorporation of a new chaining approach in the search of RNA sequence-structure patterns enables the description of molecules folding into complex secondary structures with multiple ordered patterns. The chaining approach removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our method runs up to two orders of magnitude faster than previous methods. Conclusions: The presented method's sublinear expected running time makes it well suited for RNA sequence-structure pattern matching in large sequence databases. RNA molecules containing several stem-loop substructures can be described by multiple sequence-structure patterns and their matches are efficiently handled by a novel chaining method. Beyond our algorithmic contributions, we provide with Structator a complete and robust open-source software solution for index-based search of RNA sequence-structure patterns. The Structator software is available at http://www.zbh.uni-hamburg.de/Structator. PMID:21619640
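
    The inside-out matching order described in this abstract (loop first, then the pairing bases outward) can be illustrated with a small sketch. This is not Structator's algorithm: it uses a plain linear scan instead of an affix array, so it has none of the sublinear behaviour the abstract reports. The function name `find_stem_loops`, the exact-string loop pattern, and the decision to accept G-U wobble pairs are all assumptions made for illustration.

```python
# Allowed base pairs (Watson-Crick plus G-U wobble; the wobble pairs are an assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def find_stem_loops(seq, loop, stem_len):
    """Return (start, end) slices of hairpins in `seq`.

    A hit is an exact occurrence of `loop` flanked by `stem_len`
    complementary base pairs, verified from the loop outward.
    """
    hits = []
    start = seq.find(loop)
    while start != -1:
        left, right = start - 1, start + len(loop)
        paired = 0
        # Extend the stem one base pair at a time, moving away from the loop.
        while (paired < stem_len and left >= 0 and right < len(seq)
               and (seq[left], seq[right]) in PAIRS):
            left -= 1
            right += 1
            paired += 1
        if paired == stem_len:
            hits.append((left + 1, right))
        start = seq.find(loop, start + 1)
    return hits


if __name__ == "__main__":
    rna = "AAGGCGAAAGCCUU"
    for s, e in find_stem_loops(rna, "GAAA", 3):
        print(s, e, rna[s:e])
```

    Matching the loop first means a candidate is discarded as soon as one flanking pair fails, which is the search-space reduction the abstract refers to; the index structure that makes this fast on large databases is outside the scope of this sketch.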

  5. Regulatory sequence analysis tools.

    PubMed

    van Helden, Jacques

    2003-07-01

    The web resource Regulatory Sequence Analysis Tools (RSAT) (http://rsat.ulb.ac.be/rsat) offers a collection of software tools dedicated to the prediction of regulatory sites in non-coding DNA sequences. These tools include sequence retrieval, pattern discovery, pattern matching, genome-scale pattern matching, feature-map drawing, random sequence generation and other utilities. Alternative formats are supported for the representation of regulatory motifs (strings or position-specific scoring matrices) and several algorithms are proposed for pattern discovery. RSAT currently holds >100 fully sequenced genomes and these data are regularly updated from GenBank.

  6. Global ecological pattern of ammonia-oxidizing archaea.

    PubMed

    Cao, Huiluo; Auguet, Jean-Christophe; Gu, Ji-Dong

    2013-01-01

    The global distribution of ammonia-oxidizing archaea (AOA), which play a pivotal role in the nitrification process, has been confirmed through numerous ecological studies. Though newly available amoA (ammonia monooxygenase subunit A) gene sequences from new environments are accumulating rapidly in public repositories, a lack of information on the ecological and evolutionary factors shaping community assembly of AOA on the global scale is apparent. We conducted a meta-analysis on uncultured AOA using over ca. 6,200 archaeal amoA gene sequences, so as to reveal their community distribution patterns along a wide spectrum of physicochemical conditions and habitat types. The sequences were dereplicated at 95% identity level resulting in a dataset containing 1,476 archaeal amoA gene sequences from eight habitat types: namely soil, freshwater, freshwater sediment, estuarine sediment, marine water, marine sediment, geothermal system, and symbiosis. The updated comprehensive amoA phylogeny was composed of three major monophyletic clusters (i.e. Nitrosopumilus, Nitrosotalea, Nitrosocaldus) and a non-monophyletic cluster constituted mostly by soil and sediment sequences that we named Nitrososphaera. Diversity measurements indicated that marine and estuarine sediments as well as symbionts might be the largest reservoirs of AOA diversity. Phylogenetic analyses were further carried out using macroevolutionary analyses to explore the diversification pattern and rates of nitrifying archaea. In contrast to other habitats that displayed constant diversification rates, marine planktonic AOA interestingly exhibit a very recent and accelerating diversification rate congruent with the lowest phylogenetic diversity observed in their habitats. This result suggested the existence of AOA communities with different evolutionary history in the different habitats. 
    Based on an up-to-date amoA phylogeny, this analysis provided insights into the possible evolutionary mechanisms and environmental parameters that shape AOA community assembly at global scale.

  7. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Devos, Nicolas; Szövényi, Péter; Weston, David J.

    The goal of this study was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine whether the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses.

  8. Phylogenetic Affiliation of Soil Bacteria That Degrade Aliphatic Polyesters Available Commercially as Biodegradable Plastics

    PubMed Central

    Suyama, Tetsushi; Tokiwa, Yutaka; Ouichanpagdee, Pornpimol; Kanagawa, Takahiro; Kamagata, Yoichi

    1998-01-01

    Thirty-nine morphologically different soil bacteria capable of degrading poly(β-hydroxyalkanoate), poly(ɛ-caprolactone), poly(hexamethylene carbonate), or poly(tetramethylene succinate) were isolated. Their phylogenetic positions were determined by 16S ribosomal DNA sequencing, and all of them fell into the classes Firmicutes and Proteobacteria. Determinations of substrate utilization revealed characteristic patterns of substrate specificities. PMID:9835597

  9. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta)

    DOE PAGES

    Devos, Nicolas; Szövényi, Péter; Weston, David J.; ...

    2016-02-22

    The goal of this study was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine whether the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses.

  10. FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

    PubMed Central

    Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

    2017-01-01

    In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678

  11. Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

    PubMed

    Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

    2013-08-01

    To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattered across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded those of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from the sampling process or from the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to humans and from human to human in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.

  12. A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design.

    PubMed

    Smith, Robin P; Riesenfeld, Samantha J; Holloway, Alisha K; Li, Qiang; Murphy, Karl K; Feliciano, Natalie M; Orecchia, Lorenzo; Oksenberg, Nir; Pollard, Katherine S; Ahituv, Nadav

    2013-07-18

    Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
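
    The counts quoted in this abstract can be checked with a few lines of arithmetic on k-mers. The sketch below only enumerates 6-mers and groups each one with its reverse complement; it does not reproduce the authors' de Bruijn graph decomposition or their library design. The helper names (`revcomp`, `canonical`, `windows`) and the example oligomer are hypothetical.

```python
from itertools import product

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string over the alphabet ACGT."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical(kmer):
    """Strand-independent representative: the smaller of a k-mer and its reverse complement."""
    return min(kmer, revcomp(kmer))


def windows(oligo, k=6):
    """All overlapping k-mers read along one strand of `oligo`."""
    return [oligo[i:i + k] for i in range(len(oligo) - k + 1)]


all_6mers = ["".join(p) for p in product("ACGT", repeat=6)]
self_complementary = [k for k in all_6mers if k == revcomp(k)]
strand_classes = {canonical(k) for k in all_6mers}

if __name__ == "__main__":
    print(len(all_6mers))            # number of distinct 6-mers
    print(len(self_complementary))   # 6-mers equal to their own reverse complement
    print(len(strand_classes))       # classes when a 6-mer and its reverse complement are merged
    print(len(windows("ACGTTGCAAGCTTAG")))  # 6-mer windows along one strand of a 15-bp oligomer
```

    Treating a k-mer and its reverse complement as one class is what "reverse-complement aware" means for a double-stranded oligomer: reading either strand covers the same class.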

  13. Extended turn construction and test question sequences in the conversations of three speakers with agrammatic aphasia

    PubMed Central

    Beckley, Firle; Best, Wendy; Johnson, Fiona; Edwards, Susan; Maxim, Jane

    2013-01-01

    The application of Conversation Analysis (CA) to the investigation of agrammatic aphasia reveals that utterances produced by speakers with agrammatism engaged in everyday conversation differ significantly from utterances produced in response to decontextualised assessment and therapy tasks. Early studies have demonstrated that speakers with agrammatism construct turns from sequences of nouns, adjectives, discourse markers and conjunctions, packaged by a distinct pattern of prosody. This article presents examples of turn construction methods deployed by three people with agrammatism as they take an extended turn, in order to recount a past event, initiate a discussion or have a disagreement. This is followed by examples of sequences occurring in the talk of two of these speakers that result in different, and more limited, turn construction opportunities, namely “test” questions asked in order to initiate a new topic of talk, despite the conversation partner knowing the answer. The contrast between extended turns and test question sequences illustrates the effect of interactional context on aphasic turn construction practices, and the potential of less than optimal sequences to mask turn construction skills. It is suggested that the interactional motivation for test question sequences in these data is to invite people with aphasia to contribute to conversation, rather than to practise saying words in an attempt to improve language skills. The idea that test question sequences may have their origins in early attempts to deal with acute aphasia, and the potential for conversation partnerships to become “stuck” in such interactional patterns after they may have outlived their usefulness, are discussed with a view to clinical implications. PMID:23848370

  14. A communal catalogue reveals Earth's multiscale microbial diversity.

    PubMed

    Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob

    2017-11-23

    Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.

  15. Recovering complete and draft population genomes from metagenome datasets

    DOE PAGES

    Sangwan, Naseer; Xia, Fangfang; Gilbert, Jack A.

    2016-03-08

    Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improve the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins, i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution.

  16. Protein sequences bound to mineral surfaces persist into deep time

    PubMed Central

    Demarchi, Beatrice; Hall, Shaun; Roncal-Herrero, Teresa; Freeman, Colin L; Woolley, Jos; Crisp, Molly K; Wilson, Julie; Fotakis, Anna; Fischer, Roman; Kessler, Benedikt M; Rakownikow Jersie-Christensen, Rosa; Olsen, Jesper V; Haile, James; Thomas, Jessica; Marean, Curtis W; Parkington, John; Presslee, Samantha; Lee-Thorp, Julia; Ditchfield, Peter; Hamilton, Jacqueline F; Ward, Martyn W; Wang, Chunting Michelle; Shaw, Marvin D; Harrison, Terry; Domínguez-Rodrigo, Manuel; MacPhee, Ross DE; Kwekason, Amandus; Ecker, Michaela; Kolska Horwitz, Liora; Chazan, Michael; Kröger, Roland; Thomas-Oates, Jane; Harding, John H; Cappellini, Enrico; Penkman, Kirsty; Collins, Matthew J

    2016-01-01

    Proteins persist longer in the fossil record than DNA, but the longevity, survival mechanisms and substrates remain contested. Here, we demonstrate the role of mineral binding in preserving the protein sequence in ostrich (Struthionidae) eggshell, including from the palaeontological sites of Laetoli (3.8 Ma) and Olduvai Gorge (1.3 Ma) in Tanzania. By tracking protein diagenesis back in time we find consistent patterns of preservation, demonstrating authenticity of the surviving sequences. Molecular dynamics simulations of struthiocalcin-1 and -2, the dominant proteins within the eggshell, reveal that distinct domains bind to the mineral surface. It is the domain with the strongest calculated binding energy to the calcite surface that is selectively preserved. Thermal age calculations demonstrate that the Laetoli and Olduvai peptides are 50 times older than any previously authenticated sequence (equivalent to ~16 Ma at a constant 10°C). DOI: http://dx.doi.org/10.7554/eLife.17092.001 PMID:27668515

  17. Recovering complete and draft population genomes from metagenome datasets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sangwan, Naseer; Xia, Fangfang; Gilbert, Jack A.

    Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improve the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins, i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution.

  18. Undesirable Choice Biases with Small Differences in the Spatial Structure of Chance Stimulus Sequences.

    PubMed

    Herrera, David; Treviño, Mario

    2015-01-01

    In two-alternative discrimination tasks, experimenters usually randomize the location of the rewarded stimulus so that systematic behavior with respect to irrelevant stimuli can only produce chance performance on the learning curves. One way to achieve this is to use random numbers derived from a discrete binomial distribution to create a 'full random training schedule' (FRS). When using FRS, however, sporadic but long laterally-biased training sequences occur by chance and such 'input biases' are thought to promote the generation of laterally-biased choices (i.e., 'output biases'). As an alternative, a 'Gellerman-like training schedule' (GLS) can be used. It removes most input biases by prohibiting the reward from appearing on the same location for more than three consecutive trials. The sequence of past rewards obtained from choosing a particular discriminative stimulus influences the probability of choosing that same stimulus on subsequent trials. Assuming that the long-term average ratio of choices matches the long-term average ratio of reinforcers, we hypothesized that a reduced amount of input biases in GLS compared to FRS should lead to a reduced production of output biases. We compared the choice patterns produced by a 'Rational Decision Maker' (RDM) in response to computer-generated FRS and GLS training sequences. To create a virtual RDM, we implemented an algorithm that generated choices based on past rewards. Our simulations revealed that, although the GLS presented fewer input biases than the FRS, the virtual RDM produced more output biases with GLS than with FRS under a variety of test conditions. Our results reveal that the statistical and temporal properties of training sequences interacted with the RDM to influence the production of output biases. Thus, discrete changes in the training paradigms did not translate linearly into modifications in the pattern of choices generated by a RDM. 
    Virtual RDMs could be further employed to guide the selection of proper training schedules for perceptual decision-making studies.
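
    The two kinds of training schedule contrasted in this abstract can be sketched as follows. The abstract states only that the FRS draws reward locations at random and that the GLS prohibits the reward from appearing at the same location for more than three consecutive trials. How the authors enforce that limit is not given, so the forced switch below is an assumption, as are the function names and the "L"/"R" encoding. The sketch does not model the Rational Decision Maker.

```python
import random


def full_random_schedule(n_trials, rng):
    """FRS sketch: each trial's rewarded side is an independent fair draw."""
    return [rng.choice("LR") for _ in range(n_trials)]


def gellerman_like_schedule(n_trials, rng, max_run=3):
    """GLS sketch: fair draws, but never more than `max_run` identical sides in a row.

    When a draw would extend a run past `max_run`, the other side is used
    instead (an assumed rule; the source states only the constraint).
    """
    schedule = []
    for _ in range(n_trials):
        side = rng.choice("LR")
        if len(schedule) >= max_run and all(s == side for s in schedule[-max_run:]):
            side = "R" if side == "L" else "L"
        schedule.append(side)
    return schedule


def longest_run(schedule):
    """Length of the longest stretch of identical consecutive entries."""
    best = current = 0
    previous = None
    for side in schedule:
        current = current + 1 if side == previous else 1
        best = max(best, current)
        previous = side
    return best


if __name__ == "__main__":
    rng = random.Random(0)
    frs = full_random_schedule(200, rng)
    gls = gellerman_like_schedule(200, rng)
    print("longest FRS run:", longest_run(frs))
    print("longest GLS run:", longest_run(gls))
```

    `longest_run` gives a simple measure of the "input bias" the abstract describes: an unconstrained schedule can contain long one-sided stretches by chance, while the constrained one cannot exceed `max_run`.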

  19. Digital Gene Expression Analysis Based on De Novo Transcriptome Assembly Reveals New Genes Associated with Floral Organ Differentiation of the Orchid Plant Cymbidium ensifolium

    PubMed Central

    Yang, Fengxi; Zhu, Genfa

    2015-01-01

    Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data is available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL) unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly in different floral mutant corresponding to different floral morphogenesis, which validated the specialized roles of them in floral patterning and further supported the effectiveness of our in silico analysis. 
    This dataset generated in our study provides new insights into the molecular mechanisms underlying floral patterning of Cymbidium and supports a valuable resource for molecular breeding of the orchid plant. PMID:26580566

  20. Analysis of HIV-1 intersubtype recombination breakpoints suggests region with high pairing probability may be a more fundamental factor than sequence similarity affecting HIV-1 recombination.

    PubMed

    Jia, Lei; Li, Lin; Gui, Tao; Liu, Siyang; Li, Hanping; Han, Jingwan; Guo, Wei; Liu, Yongjian; Li, Jingyun

    2016-09-21

    With increasing data on HIV-1, a more relevant molecular model describing mechanism details of HIV-1 genetic recombination usually requires upgrades. Currently an incomplete structural understanding of the copy choice mechanism along with several other issues in the field that lack elucidation led us to perform an analysis of the correlation between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarity to further explore structural mechanisms. Near full length sequences of URFs from Asia, Europe, and Africa (one sequence/patient), and representative sequences of worldwide CRFs were retrieved from the Los Alamos HIV database. Their recombination patterns were analyzed by jpHMM in detail. Then the relationships between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarities were investigated. Pearson correlation test showed that all URF groups and the CRF group exhibit the same breakpoint distribution pattern. Additionally, the Wilcoxon two-sample test indicated a significant and inexplicable limitation of recombination in regions with high pairing probability. These regions have been found to be strongly conserved across distinct biological states (i.e., strong intersubtype similarity), and genetic similarity has been determined to be a very important factor promoting recombination. Thus, the results revealed an unexpected disagreement between intersubtype similarity and breakpoint distribution, which were further confirmed by genetic similarity analysis. Our analysis reveals a critical conflict between results from natural HIV-1 isolates and those from HIV-1-based assay vectors in which genetic similarity has been shown to be a very critical factor promoting recombination. These results indicate the region with high-pairing probabilities may be a more fundamental factor affecting HIV-1 recombination than sequence similarity in natural HIV-1 infections. Our findings will be relevant in furthering the understanding of HIV-1 recombination mechanisms.
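
    The abstract above compares breakpoint distributions between groups with a Pearson correlation test. As a rough illustration only, the sketch below computes the Pearson coefficient for two lists of per-window breakpoint counts. The function name, the window counts, and the group labels are hypothetical and are not taken from the study.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-deviations and sums of squared deviations
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant list")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical breakpoint counts per genome window for two groups
group_one = [12, 3, 7, 20, 5]
group_two = [10, 4, 6, 18, 7]
r = pearson_r(group_one, group_two)
```

    A coefficient near 1 would indicate that the two groups concentrate breakpoints in the same windows; a significance test would still be needed before drawing conclusions.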

  1. MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

    PubMed

    Kim, Taehyung; Tyndel, Marc S; Huang, Haiming; Sidhu, Sachdev S; Bader, Gary D; Gfeller, David; Kim, Philip M

    2012-03-01

    Peptide recognition domains and transcription factors play crucial roles in cellular signaling. They bind linear stretches of amino acids or nucleotides, respectively, with high specificity. Experimental techniques that assess the binding specificity of these domains, such as microarrays or phage display, can retrieve thousands of distinct ligands, providing detailed insight into binding specificity. In particular, the advent of next-generation sequencing has recently increased the throughput of such methods by several orders of magnitude. These advances have helped reveal the presence of distinct binding specificity classes that co-exist within a set of ligands interacting with the same target. Here, we introduce a software system called MUSI that can rapidly analyze very large data sets of binding sequences to determine the relevant binding specificity patterns. Our pipeline provides two major advances. First, it can detect previously unrecognized multiple specificity patterns in any data set. Second, it offers integrated processing of very large data sets from next-generation sequencing machines. The results are visualized as multiple sequence logos describing the different binding preferences of the protein under investigation. We demonstrate the performance of MUSI by analyzing recent phage display data for human SH3 domains as well as microarray data for mouse transcription factors.

  2. Linkage Study Revealed Complex Haplotypes in a Multifamily due to Different Mutations in CAPN3 Gene in an Iranian Ethnic Group.

    PubMed

    Mojbafan, Marzieh; Tonekaboni, Seyed Hassan; Abiri, Maryam; Kianfar, Soudeh; Sarhadi, Ameneh; Nilipour, Yalda; Tavakkoly-Bazzaz, Javad; Zeinali, Sirous

    2016-07-01

    Calpainopathy is an autosomal recessive form of limb girdle muscular dystrophies which is caused by mutation in CAPN3 gene. In the present study, co-segregation of this disorder was analyzed with four short tandem repeat markers linked to the CAPN3 gene. Three apparently unrelated Iranian families with same ethnicity were investigated. Haplotype analysis and sequencing of the CAPN3 gene were performed. DNA sample from one of the patients was simultaneously sent for next-generation sequencing. DNA sequencing identified two mutations. It was seen as a homozygous c.2105C>T in exon 19 in one family, a homozygous novel mutation c.380G>A in exon 3 in another family, and a compound heterozygote form of these two mutations in the third family. Next-generation sequencing also confirmed our results. It was expected that, due to the rare nature of limb girdle muscular dystrophies, affected individuals from the same ethnic group share similar mutations. Haplotype analysis showed two different homozygote patterns in two families, yet a compound heterozygote pattern in the third family as seen in the mutation analysis. This study shows that haplotype analysis would help in determining presence of different founders.

  3. DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation.

    PubMed

    Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H; Proukakis, Christos

    2017-01-01

    Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array "waves", and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance.

  4. DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation

    PubMed Central

    Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.

    2017-01-01

    Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077

  5. Prediction during statistical learning, and implications for the implicit/explicit divide

    PubMed Central

    Dale, Rick; Duran, Nicholas D.; Morehead, J. Ryan

    2012-01-01

    Accounts of statistical learning, both implicit and explicit, often invoke predictive processes as central to learning, yet practically all experiments employ non-predictive measures during training. We argue that the common theoretical assumption of anticipation and prediction needs clearer, more direct evidence for it during learning. We offer a novel experimental context to explore prediction, and report results from a simple sequential learning task designed to promote predictive behaviors in participants as they responded to a short sequence of simple stimulus events. Predictive tendencies in participants were measured using their computer mouse, the trajectories of which served as a means of tapping into predictive behavior while participants were exposed to very short and simple sequences of events. A total of 143 participants were randomly assigned to stimulus sequences along a continuum of regularity. Analysis of computer-mouse trajectories revealed that (a) participants almost always anticipate events in some manner, (b) participants exhibit two stable patterns of behavior, either reacting to vs. predicting future events, (c) the extent to which participants predict relates to performance on a recall test, and (d) explicit reports of perceiving patterns in the brief sequence correlates with extent of prediction. We end with a discussion of implicit and explicit statistical learning and of the role prediction may play in both kinds of learning. PMID:22723817

  6. Accurate and High-Coverage Immune Repertoire Sequencing Reveals Characteristics of Antibody Repertoire Diversification in Young Children with Malaria

    NASA Astrophysics Data System (ADS)

    Jiang, Ning

    Accurately measuring the immune repertoire sequence composition, diversity, and abundance is important in studying repertoire response in infections, vaccinations, and cancer immunology. Using molecular identifiers (MIDs) to tag mRNA molecules is an effective method in improving the accuracy of immune repertoire sequencing (IR-seq). However, it is still difficult to use IR-seq on small amount of clinical samples to achieve a high coverage of the repertoire diversities. This is especially challenging in studying infections and vaccinations where B cell subpopulations with fewer cells, such as memory B cells or plasmablasts, are often of great interest to study somatic mutation patterns and diversity changes. Here, we describe an approach of IR-seq based on the use of MIDs in combination with a clustering method that can reveal more than 80% of the antibody diversity in a sample and can be applied to as few as 1,000 B cells. We applied this to study the antibody repertoires of young children before and during an acute malaria infection. We discovered unexpectedly high levels of somatic hypermutation (SHM) in infants and revealed characteristics of antibody repertoire development in young children that would have a profound impact on immunization in children.

  7. Bacteremia due to Moraxella atlantae in a cancer patient.

    PubMed

    De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario

    2002-07-01

    A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the Moraxellaceae. After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed.

  8. Bacteremia Due to Moraxella atlantae in a Cancer Patient

    PubMed Central

    De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario

    2002-01-01

    A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the Moraxellaceae. After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed. PMID:12089312

  9. Substantial Regional Variation in Substitution Rates in the Human Genome: Importance of GC Content, Gene Density, and Telomere-Specific Effects

    NASA Astrophysics Data System (ADS)

    Arndt, Peter F.; Hwa, Terence; Petrov, Dmitri A.

    2005-06-01

    This study presents the first global, 1 Mbp level analysis of patterns of nucleotide substitutions along the human lineage. The study is based on the analysis of a large amount of repetitive elements deposited into the human genome since the mammalian radiation, yielding a number of results that would have been difficult to obtain using the more conventional comparative method of analysis. This analysis revealed substantial and consistent variability of rates of substitution, with the variability ranging up to 2-fold among different regions. The rates of substitutions of C or G nucleotides with A or T nucleotides vary much more sharply than the reverse rates suggesting that much of that variation is due to differences in mutation rates rather than in the probabilities of fixation of C/G vs. A/T nucleotides across the genome. For all types of substitution we observe substantially more hotspots than coldspots, with hotspots showing substantial clustering over tens of Mbp's. Our analysis revealed that GC-content of surrounding sequences is the best predictor of the rates of substitution. The pattern of substitution appears very different near telomeres compared to the rest of the genome and cannot be explained by the genome-wide correlations of the substitution rates with GC content or exon density. The telomere pattern of substitution is consistent with natural selection or biased gene conversion acting to increase the GC-content of the sequences that are within 10-15 Mbp away from the telomere.

  10. Characterization of the patterns of polymorphism in a "cryptic repeat" reveals a novel type of hypervariable sequence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jacobson, D.P.; Schmeling, P.; Sommer, S.S.

    Alternating purine and pyrimidine repeats (RY(i)) are an abundant source of polymorphism. The subset with long tandem repeats of GT or AC (GT(i)) has been studied extensively, but cryptic RY(i) (i.e., no single tandem repeat predominates) have received little attention. The factor IX gene has a polymorphic cryptic RY(i) of 142-216 bp. Previously, there were four known polymorphic alleles, of the form AB, A_2B, A_2B_2, and A_3B_2, where A = (GT)(AC)_3(AT)_3(GT)(AT)_4 and B = A with an additional 3' AT dinucleotide. To further characterize this locus, the authors examined more than 1,700 additional human chromosomes and determined the sequences of the homologous sites in orangutans and chimpanzees. The novel alleles found in humans expand the repertoire of A/B alleles to A_{0-4}B_1 and A_{1-3}B_2. The A_nB_2 series are abundant in Caucasians but are absent in blacks and Asians. Conversely, the A_0B_1 allele is common in blacks but is not found in more than 1,700 Caucasian chromosomes. The data are compatible with a model in which recombination is more frequent than polymerase slippage at this locus. In orangutans, the RY(i) is present, but the sequence is markedly different. An A/B-type of pattern was discerned in which B differs from A by an additional six (AT) dinucleotides at the 3' end. In chimpanzees, the size of the RY(i) locus was greatly expanded, and the sequence showed a novel pattern of hypervariability in which there are many tandem repeats of the form (GT)_n(AC)_o(AT)_p(GT)_q(AT)_s, where n, o, p, q, and s are different integers. The sequences of the factor IX intron 1 cryptic RY(i) in three primates provide perspective on the range of possible patterns of polymorphism. Analysis of the patterns suggests how the RY(i) can be conserved during evolution, while the precise sequence varies. 25 refs., 5 figs., 3 tabs.
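
    The allele notation in this abstract can be expanded mechanically. The sketch below is an illustration only: it builds unit A as one GT, three AC, three AT, one GT, and four AT dinucleotides, builds unit B as A plus one 3' AT, and assumes that an allele such as "A3B2" means three A units followed by two B units. The helper names are hypothetical, and the 142-216 bp locus size quoted in the abstract presumably covers more sequence than these units alone, so the lengths here are not expected to match it.

```python
def expand(units):
    """Concatenate (motif, count) pairs into one DNA string."""
    return "".join(motif * count for motif, count in units)

# Unit A as described in the abstract
UNIT_A = expand([("GT", 1), ("AC", 3), ("AT", 3), ("GT", 1), ("AT", 4)])
# Unit B is A with one additional 3' AT dinucleotide
UNIT_B = UNIT_A + "AT"

def allele(n_a, n_b):
    """Assumed layout: n_a copies of A followed by n_b copies of B."""
    return UNIT_A * n_a + UNIT_B * n_b

def is_alternating_ry(seq):
    """True if purines (A, G) and pyrimidines (C, T) strictly alternate."""
    purines = set("AG")
    return all((a in purines) != (b in purines) for a, b in zip(seq, seq[1:]))

a3b2 = allele(3, 2)
```

    Under these assumptions every A/B allele remains a strictly alternating purine/pyrimidine tract, which is the property that makes the locus an RY(i) repeat even though no single dinucleotide predominates.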

  11. Characterization of a prototype strain of hepatitis E virus.

    PubMed

    Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

    1992-01-15

    A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.

  12. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Boore, Jeffrey L.; Staton, Joseph

    We have determined the sequence of about half (7470 nts) of the mitochondrial genome of the sipunculid Phascolopsis gouldii, the first representative of this phylum to be so studied. All of the 19 identified genes are transcribed from the same DNA strand. The arrangement of these genes is remarkably similar to that of the oligochaete annelid Lumbricus terrestris. Comparison of both the inferred amino acid sequences and the gene arrangements of a variety of diverse metazoan taxa reveals that the phylum Sipuncula is more closely related to Annelida than to Mollusca. This requires reinterpretation of the homology of several embryological features and of patterns of animal body plan evolution.

  13. Patterns and Sequences: Interactive Exploration of Clickstreams to Understand Common Visitor Paths.

    PubMed

    Liu, Zhicheng; Wang, Yang; Dontcheva, Mira; Hoffman, Matthew; Walker, Seth; Wilson, Alan

    2017-01-01

    Modern web clickstream data consists of long, high-dimensional sequences of multivariate events, making it difficult to analyze. Following the overarching principle that the visual interface should provide information about the dataset at multiple levels of granularity and allow users to easily navigate across these levels, we identify four levels of granularity in clickstream analysis: patterns, segments, sequences and events. We present an analytic pipeline consisting of three stages: pattern mining, pattern pruning and coordinated exploration between patterns and sequences. Based on this approach, we discuss properties of maximal sequential patterns, propose methods to reduce the number of patterns and describe design considerations for visualizing the extracted sequential patterns and the corresponding raw sequences. We demonstrate the viability of our approach through an analysis scenario and discuss the strengths and limitations of the methods based on user feedback.

  14. An Exploration of Rhythmic Grouping of Speech Sequences by French- and German-Learning Infants

    PubMed Central

    Abboub, Nawal; Boll-Avetisyan, Natalie; Bhatara, Anjali; Höhle, Barbara; Nazzi, Thierry

    2016-01-01

    Rhythm in music and speech can be characterized by a constellation of several acoustic cues. Individually, these cues have different effects on rhythmic perception: sequences of sounds alternating in duration are perceived as short-long pairs (weak-strong/iambic pattern), whereas sequences of sounds alternating in intensity or pitch are perceived as loud-soft, or high-low pairs (strong-weak/trochaic pattern). This perceptual bias, called the Iambic-Trochaic Law (ITL), has been claimed to be a universal property of the auditory system applying in both the music and the language domains. Recent studies have shown that language experience can modulate the effects of the ITL on rhythmic perception of both speech and non-speech sequences in adults, and of non-speech sequences in 7.5-month-old infants. The goal of the present study was to explore whether language experience also modulates infants’ grouping of speech. To do so, we presented sequences of syllables to monolingual French- and German-learning 7.5-month-olds. Using the Headturn Preference Procedure (HPP), we examined whether they were able to perceive a rhythmic structure in sequences of syllables that alternated in duration, pitch, or intensity. Our findings show that both French- and German-learning infants perceived a rhythmic structure when it was cued by duration or pitch but not intensity. Our findings also show differences in how these infants use duration and pitch cues to group syllable sequences, suggesting that pitch cues were the easier ones to use. Moreover, performance did not differ across languages, failing to reveal early language effects on rhythmic perception. These results contribute to our understanding of the origin of rhythmic perception and perceptual mechanisms shared across music and speech, which may bootstrap language acquisition. PMID:27378887

  15. Deep sequencing reveals double mutations in cis of MPL exon 10 in myeloproliferative neoplasms.

    PubMed

    Pietra, Daniela; Brisci, Angela; Rumi, Elisa; Boggi, Sabrina; Elena, Chiara; Pietrelli, Alessandro; Bordoni, Roberta; Ferrari, Maurizio; Passamonti, Francesco; De Bellis, Gianluca; Cremonesi, Laura; Cazzola, Mario

    2011-04-01

    Somatic mutations of MPL exon 10, mainly involving a W515 substitution, have been described in JAK2 (V617F)-negative patients with essential thrombocythemia and primary myelofibrosis. We used direct sequencing and high-resolution melt analysis to identify mutations of MPL exon 10 in 570 patients with myeloproliferative neoplasms, and allele specific PCR and deep sequencing to further characterize a subset of mutated patients. Somatic mutations were detected in 33 of 221 patients (15%) with JAK2 (V617F)-negative essential thrombocythemia or primary myelofibrosis. Only one patient with essential thrombocythemia carried both JAK2 (V617F) and MPL (W515L). High-resolution melt analysis identified abnormal patterns in all the MPL mutated cases, while direct sequencing did not detect the mutant MPL in one fifth of them. In 3 cases carrying double MPL mutations, deep sequencing analysis showed identical load and location in cis of the paired lesions, indicating their simultaneous occurrence on the same chromosome.

  16. Conserved intergenic sequences revealed by CTAG-profiling in Salmonella: thermodynamic modeling for function prediction

    NASA Astrophysics Data System (ADS)

    Tang, Le; Zhu, Songling; Mastriani, Emilio; Fang, Xin; Zhou, Yu-Jie; Li, Yong-Guo; Johnston, Randal N.; Guo, Zheng; Liu, Gui-Rong; Liu, Shu-Lin

    2017-03-01

    Highly conserved short sequences help identify functional genomic regions and facilitate genomic annotation. We used Salmonella as the model to search the genome for evolutionarily conserved regions and focused on the tetranucleotide sequence CTAG for its potentially important functions. In Salmonella, CTAG is highly conserved across the lineages and large numbers of CTAG-containing short sequences fall in intergenic regions, strongly indicating their biological importance. Computer modeling demonstrated stable stem-loop structures in some of the CTAG-containing intergenic regions, and substitution of a nucleotide of the CTAG sequence would radically rearrange the free energy and disrupt the structure. The postulated degeneration of CTAG takes distinct patterns among Salmonella lineages and provides novel information about genomic divergence and evolution of these bacterial pathogens. Comparison of the vertically and horizontally transmitted genomic segments showed different CTAG distribution landscapes, with the genome amelioration process to remove CTAG taking place inward from both terminals of the horizontally acquired segment.
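
    As a rough illustration of what locating CTAG sites involves, the sketch below lists every position of the tetranucleotide in a sequence and checks that CTAG equals its own reverse complement, so a single-strand scan finds the sites on both strands. This is not the CTAG-profiling or thermodynamic modeling procedure of the study; the function names and the example sequence are hypothetical.

```python
def find_motif(seq, motif="CTAG"):
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    seq = seq.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions

def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Hypothetical intergenic fragment, used only to exercise the functions
fragment = "AACTAGGCTAGCTAG"
hits = find_motif(fragment)
is_palindromic = reverse_complement("CTAG") == "CTAG"
```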

  17. Chromosome rearrangements via template switching between diverged repeated sequences

    PubMed Central

    Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.

    2014-01-01

    Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035

  18. Reduced attentional focus and the influence on expert anticipatory perception.

    PubMed

    Gorman, Adam D; Abernethy, Bruce; Farrow, Damian

    2018-01-01

    The anticipatory memory encodings of expert and novice basketball players were examined under conditions of both full (attended condition) and reduced (unattended condition) attention (see also Gorman, Abernethy, & Farrow in Attention, Perception, & Psychophysics, 75, 835-844, 2013a). Participants completed a typical pattern recall task using dynamic playing sequences from basketball, and their responses were compared to both the original target pattern as well as to the series of patterns that occurred immediately after and immediately before the target image. The latter had not previously been employed in a pattern recall task when examining the anticipatory encoding of pattern information. Results revealed that the overall extent of the forward displacement for both the attended and unattended patterns was generally significantly greater for the experts, with the expert advantage tending to be most prominent for the attacking patterns. The novel addition of both forward and backward scenes may provide a more precise measure of the anticipatory effect, suggesting that future research in this domain should use a similar methodological design.

  19. M Protein Gene (emm Type) Analysis of Group A Beta-Hemolytic Streptococci from Ethiopia Reveals Unique Patterns

    PubMed Central

    Tewodros, Wezenet; Kronvall, Göran

    2005-01-01

    The genetic diversity of group A streptococcal (GAS) isolates obtained in 1990 from Ethiopian children with various streptococcal diseases was studied by using emm gene sequence analysis. A total of 217 GAS isolates were included: 155 and 62 isolates from throat and skin, respectively. A total of 78 different emm/st types were detected among the 217 isolates. Of these, 166 (76.5%) belonged to 52 validated reference emm types, 26 (11.9%) belonged to 16 already recognized sequence types (st types) and 25 (11.5%) belonged to 10 undocumented new sequence types. Resistance to tetracycline (148 of 217) was not correlated to emm type. Isolation rate of the classical rheumatogenic and nephritogenic strains was low from cases of acute rheumatic fever (ARF) and acute glomerulonephritis (AGN), respectively. Instead, the recently discovered st types were overrepresented among isolates from patients with ARF (3 of 7) and AGN (9 of 16) (P < 0.01) compared to isolates from subjects with tonsillitis and from healthy carriers (10 of 57 and 16 of 90, respectively). In contrast to rheumatogenic strains from the temperate regions, more than half of the isolates from ARF (four of seven) carried the genetic marker for skin preference, emm pattern D, although most of them (six of seven) were isolated from throat. Of 57 tonsillitis-associated isolates, 16 (28%) belonged to emm pattern D compared to <1% in temperate regions. As in other reports emm patterns A to C were strongly associated with throat, whereas emm pattern D did not correlate to skin. This first large-scale emm typing report from Africa has demonstrated a heterogeneous GAS population and contrasting nature of GAS epidemiology in the region. PMID:16145079

  20. Using long-term experimental evolution to uncover the patterns and determinants of molecular evolution of an Escherichia coli natural isolate in the streptomycin treated mouse gut

    PubMed Central

    Ghalayini, Mohamed; Magnan, Mélanie; Glodt, Jérémy; Pintard, Coralie; Dion, Sara; Denamur, Erick; Tenaillon, Olivier

    2017-01-01

    Though microbial ecology of the gut is now a major focus of interest, little is known about the molecular determinants of microbial adaptation in the gut. Experimental evolution coupled with whole genome sequencing can provide insights into the adaptive process. In vitro experiments have revealed some conserved patterns: intermediate convergence, epistatic interactions between beneficial mutations and mutations in global regulators. To test the relevance of these patterns and to identify the selective pressures acting in vivo, we have performed a long-term adaptation of an E. coli natural isolate, the streptomycin resistant strain 536, in the digestive tract of streptomycin treated mice. After a year of evolution, a clone from 15 replicates was sequenced. Consistent with in vitro observations, the identified mutations revealed a strong pattern of convergence at the mutation, gene, operon and functional levels. Yet, the rate of molecular evolution was lower than in vitro and no mutations in global regulators were recovered. More specific targets were observed: the dgo operon, involved in the galactonate pathway that improved growth on D-galactonate, and rluD and gidB, implicated in the maturation of the ribosomes, whose mutations improved growth only in the presence of streptomycin. As in vitro, the non-random associations of mutations within the same pathways suggested a role of epistasis in shaping the adaptive landscape. Overall, we show that an “evolve and sequence” approach coupled to an analysis of convergence, when applied to a natural isolate, can be used to study adaptation in vivo and uncover the specific selective pressures of that environment. PMID:27661780

  1. Multiple introductions and onward transmission of HIV-1 subtype B strains in Shanghai, China.

    PubMed

    Li, Xiaoshan; Zhu, Kexin; Xue, Yile; Wei, Feiran; Gao, Rong; Duerr, Ralf; Fang, Kun; Li, Wei; Song, Yue; Du, Guoping; Yan, Wenjuan; Musa, Taha Hussein; Ge, You; Ji, Yu; Zhong, Ping; Wei, Pingmin

    2017-08-01

    To investigate the viral genetic evolution, spatial origins and patterns of transmission of HIV-1 subtype B in Shanghai, China. A total of 242 Shanghai subtype B and 1519 reference pol sequences were subjected to phylogenetic inference and genetic transmission network analyses. Phylogenetic analysis revealed that subtype B strains circulating in Shanghai were genetically diverse and closely associated with viral sequence lineages in Beijing (76 of 242 [31.4%]), Central China (Henan/Hebei/Hunan/Hubei) (43 of 242 [17.8%]), Chinese Taiwan (20 of 242 [8.3%]), Japan (6 of 242 [2.5%]), and Korea (7 of 242 [2.9%]), suggesting multiple introductions into Shanghai from mainland China and Taiwan, Japan, and Korea. Interestingly, a monophyletic Shanghai lineage (SH-L) (36 of 242 [14.9%]) of HIV-1 subtype B most likely originated from an Argentine strain, transferred through Liaoning infected individuals. In-depth analyses of 195 Shanghai subtype B sequences revealed that a total of 37.9% (n = 74) sequences contributed to 35 transmission networks, whereof 33.8% (n = 25) of the sequences associated with infected individuals from other provinces. Our new findings reflect the evolution complexity and transmission dynamics of HIV-1 subtype B in Shanghai, which would provide critical information for the design of effective prevention measures against HIV transmission. Copyright © 2017 The British Infection Association. Published by Elsevier Ltd. All rights reserved.

  2. Reduced representation genome sequencing reveals patterns of genetic diversity and selection in apple.

    PubMed

    Ma, Baiquan; Liao, Liao; Peng, Qian; Fang, Ting; Zhou, Hui; Korban, Schuyler S; Han, Yuepeng

    2017-03-01

    Identifying DNA sequence variations is a fundamental step towards deciphering the genetic basis of traits of interest. Here, a total of 20 cultivated and 10 wild apples were genotyped using specific-locus amplified fragment sequencing, and 39,635 single nucleotide polymorphisms with no missing genotypes and evenly distributed along the genome were selected to investigate patterns of genome-wide genetic variations between cultivated and wild apples. Overall, wild apples displayed higher levels of genetic diversity than cultivated apples. Linkage disequilibrium (LD) decayed quite rapidly in cultivated and wild apples, with r² values below 0.2 at 440 and 280 bp, respectively. Moreover, bidirectional gene flow and different distribution patterns of LD blocks were detected between domesticated and wild apples. Most LD blocks unique to cultivated apples were located within QTL regions controlling fruit quality, thus suggesting that fruit quality had probably undergone selection during apple domestication. The genome of the earliest cultivated apple in China, Nai, was highly similar to that of Malus sieversii, and contained a small portion of genetic material from other wild apple species. This suggested that introgression could have been an important driving force during initial domestication of apple. These findings will facilitate future breeding and genetic dissection of complex traits in apple. © 2017 Institute of Botany, Chinese Academy of Sciences.

  3. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    PubMed Central

    2010-01-01

    Background: Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results: Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions: We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840

  4. Interaction of healthcare worker hands and portable medical equipment: a sequence analysis to show potential transmission opportunities.

    PubMed

    Jinadatha, Chetan; Villamaria, Frank C; Coppin, John D; Dale, Charles R; Williams, Marjory D; Whitworth, Ryan; Stibich, Mark

    2017-12-28

    While research has demonstrated the importance of a clean health care environment, there is a lack of research on the role portable medical equipment (PME) plays in the transmission cycle of healthcare-acquired infections (HAIs). This study investigated the patterns and sequence of contact events among health care workers, patients, surfaces, and medical equipment in a hospital environment. Research staff observed patient care events over six different 24 h periods on six different hospital units. Each encounter was recorded as a sequence of events, analyzed using sequence analysis, and visually represented by network plots. In addition, a point prevalence microbial sample was taken from the computer on wheels (COW). The most touched items during patient care were the individual patient (850), bedrail (375), bed surface (302), and bedside table (223). Three of the top ten most common subsequences included touching PME and the patient: computer on wheels ➔ patient (62 of 274 total sequences, 22.6%, contained this sequence), patient ➔ COW (20.4%), and patient ➔ IV pump (16.1%). The network plots revealed large interconnectedness among objects in the room, the patient, PME, and the healthcare worker. Our results demonstrated that PME such as the COW and IV pump were two of the most highly touched items during patient care. Even with proper hand sanitization and personal protective equipment, this sequence analysis reveals the potential for contamination from the patient and environment, to a vector such as portable medical equipment, and ultimately to another patient in the hospital.
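
    The subsequence percentages quoted in this abstract (for example, 62 of 274 sequences containing COW ➔ patient) can be illustrated with a minimal sketch. It assumes a "subsequence" means one item touched immediately after another, which the abstract does not specify; the function name pair_prevalence and the encounter data are hypothetical, not taken from the study.

```python
from collections import Counter

def pair_prevalence(sequences):
    """Fraction of sequences that contain each ordered adjacent pair (a, b)."""
    counts = Counter()
    for seq in sequences:
        # A set ensures a pair is counted at most once per sequence,
        # matching "N of M total sequences contained this sequence".
        counts.update(set(zip(seq, seq[1:])))
    total = len(sequences)
    return {pair: n / total for pair, n in counts.items()}

# Hypothetical encounters for illustration only (not study data).
encounters = [
    ["COW", "patient", "bedrail"],
    ["patient", "COW", "patient"],
    ["bedrail", "patient", "IV pump"],
    ["patient", "bedrail"],
]
prevalence = pair_prevalence(encounters)
print(prevalence[("COW", "patient")])
```

    Counting each pair once per encounter, rather than once per occurrence, keeps the result a proportion of encounters, which is how the abstract reports its figures.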

  5. Asymmetry of perceived key movement in chorale sequences: converging evidence from a probe-tone analysis.

    PubMed

    Cuddy, L L; Thompson, W F

    1992-01-01

    In a probe-tone experiment, two groups of listeners--one trained, the other untrained, in traditional music theory--rated the goodness of fit of each of the 12 notes of the chromatic scale to four-voice harmonic sequences. Sequences were 12 simplified excerpts from Bach chorales, 4 nonmodulating, and 8 modulating. Modulations occurred either one or two steps in either the clockwise or the counterclockwise direction on the cycle of fifths. A consistent pattern of probe-tone ratings was obtained for each sequence, with no significant differences between listener groups. Two methods of analysis (Fourier analysis and regression analysis) revealed a directional asymmetry in the perceived key movement conveyed by modulating sequences. For a given modulation distance, modulations in the counterclockwise direction effected a clearer shift in tonal organization toward the final key than did clockwise modulations. The nature of the directional asymmetry was consistent with results reported for identification and rating of key change in the sequences (Thompson & Cuddy, 1989a). Further, according to the multiple-regression analysis, probe-tone ratings did not merely reflect the distribution of tones in the sequence. Rather, ratings were sensitive to the temporal structure of the tonal organization in the sequence.

  6. Characterization and Expression Analysis of Receptor for Activated C Kinase from Silk-producing Insect Antheraea pernyi.

    PubMed

    Zhu, Bao-Jian; Yu, Hao; Tian, Sen; Dai, Li-Shang; Sun, Yu; Liu, Chao-Liang

    2016-01-01

    The receptor for activated C kinase (RACK) is an important scaffold protein with regulatory functions in cells. However, its role in the immune response of Antheraea pernyi to pathogen challenge remains unclear. To investigate the biological functions of RACK in the wild silkworm A. pernyi, cloning was performed and the expression patterns of the RACK gene were analyzed. Sequence analysis revealed that the RACK gene was 1120 bp long and contained a 960-bp open reading frame. The deduced RACK protein sequence showed high identity with its homologs from other insects. SDS-PAGE and western blot analysis demonstrated successful expression of a 36-kDa recombinant RACK protein in Escherichia coli. The titer of a rabbit-raised antibody against recombinant RACK protein was about 1:20000, as determined by ELISA. Real-time PCR analysis showed that RACK expression was higher in fat bodies than in other examined A. pernyi tissues. The expression of RACK mRNA in fat bodies of fifth-instar larvae of A. pernyi was clearly induced after nucleopolyhedrovirus, E. coli or Beauveria bassiana challenge. However, the expression patterns of RACK were different in response to these pathogens. Our data suggest that RACK may play a role in the innate immune responses of A. pernyi.

  7. The Tara Oceans voyage reveals global diversity and distribution patterns of marine planktonic ciliates

    PubMed Central

    Gimmler, Anna; Korn, Ralf; de Vargas, Colomban; Audic, Stéphane; Stoeck, Thorsten

    2016-01-01

    Illumina reads of the SSU-rDNA-V9 region obtained from the circumglobal Tara Oceans expedition allow the investigation of protistan plankton diversity patterns on a global scale. We analyzed 6,137,350 V9-amplicons from ocean surface waters and the deep chlorophyll maximum, which were taxonomically assigned to the phylum Ciliophora. For open ocean samples global planktonic ciliate diversity is relatively low (ca. 1,300 observed and predicted ciliate OTUs). We found that 17% of all detected ciliate OTUs occurred in all oceanic regions under study. On average, local ciliate OTU richness represented 27% of the global ciliate OTU richness, indicating that a large proportion of ciliates is widely distributed. Yet, more than half of these OTUs shared <90% sequence similarity with reference sequences of described ciliates. While alpha-diversity measures (richness and exp(Shannon H)) are hardly affected by contemporary environmental conditions, species (OTU) turnover and community similarity (β-diversity) across taxonomic groups showed strong correlation to environmental parameters. Logistic regression models predicted significant correlations between the occurrence of specific ciliate genera and individual nutrients, the oceanic carbonate system and temperature. Planktonic ciliates displayed distinct vertical distributions relative to chlorophyll a. In contrast, the Tara Oceans dataset did not reveal any evidence that latitude is structuring ciliate communities. PMID:27633177
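
    The alpha-diversity measure exp(Shannon H) named in this abstract can be sketched as follows. This is a generic illustration of the standard formula, H = -sum(p_i ln p_i) over OTU proportions p_i, and not the authors' pipeline; the function names and the read counts are hypothetical.

```python
import math

def shannon_h(counts):
    """Shannon entropy H (natural log) from per-OTU read counts."""
    total = sum(counts)
    # Zero counts contribute nothing, since p * ln(p) tends to 0 as p tends to 0.
    props = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p) for p in props)

def effective_richness(counts):
    """exp(H): the number of equally abundant OTUs giving the same entropy."""
    return math.exp(shannon_h(counts))

# Hypothetical communities with the same richness (4 OTUs) but different evenness.
even = [25, 25, 25, 25]
skewed = [97, 1, 1, 1]
print(effective_richness(even), effective_richness(skewed))
```

    Both hypothetical communities have a richness of 4, but exp(H) is lower for the skewed one, which is why richness and exp(Shannon H) are reported as separate measures.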

  8. Bacterial Community Dynamics during Production of Registered Designation of Origin Salers Cheese as Evaluated by 16S rRNA Gene Single-Strand Conformation Polymorphism Analysis

    PubMed Central

    Duthoit, Frédérique; Godon, Jean-Jacques; Montel, Marie-Christine

    2003-01-01

    Microbial dynamics during processing and ripening of traditional cheeses such as registered designation of origin Salers cheese, an artisanal cheese produced in France, play an important role in the elaboration of sensory qualities. The aim of the present study was to obtain a picture of the dynamics of the microbial ecosystem of RDO Salers cheese by using culture-independent methods. This included DNA extraction, PCR, and single-strand conformation polymorphism (SSCP) analysis. Bacterial and high-GC% gram-positive bacterial primers were used to amplify V2 or V3 regions of the 16S rRNA gene. SSCP patterns revealed changes during the manufacturing of the cheese. Patterns of the ecosystems of cheeses that were provided by three farmers were also quite different. Cloning and sequencing of the 16S rRNA gene revealed sequences related to lactic acid bacteria (Lactococcus lactis, Streptococcus thermophilus, Enterococcus faecium, Leuconostoc mesenteroides, Leuconostoc pseudomesenteroides, Lactobacillus plantarum, and Lactobacillus pentosus), which were predominant during manufacturing and ripening. Bacteria belonging to the high-GC% gram-positive group (essentially corynebacteria) were found by using specific primers. The present molecular approach can effectively describe the ecosystem of artisanal dairy products. PMID:12839752

  9. Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

    PubMed Central

    2011-01-01

    Background: Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results: A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. Conclusions: The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods, highlighting the need for deep-sequencing expression profiling methods. PMID:21592389

  10. Molecular diversity of arbuscular mycorrhizal fungi and their distribution patterns related to host-plants and habitats in a hot and arid ecosystem, southwest China.

    PubMed

    Li, Ling-Fei; Li, Tao; Zhang, Yan; Zhao, Zhi-Wei

    2010-03-01

    The communities of arbuscular mycorrhizal fungi (AMF) colonizing the roots of Bothriochloa pertusa, Cajanus cajan and Heteropogon contortus in a fallow land (FL) and an undisturbed land (UL) were characterized. The large subunit rDNA genes of AMF from roots were amplified and cloned. A total of 2353 clones were screened by restriction fragment length polymorphism, and 428 clones were subsequently sequenced. A total of 393 AMF sequences, which were grouped into 100 operational taxonomic units, were obtained. Phylogenetic analysis revealed that the AMF sequences belonged to Glomus, Acaulospora and Scutellospora, and that Glomus was the dominant genus. Of the 393 AMF sequences, 81% were novel. The diversity of AMF colonizing the same plant species was higher in the UL than in the FL, providing strong molecular evidence that soil disturbance reduced AMF population and species richness. The results revealed that AMF communities were significantly different among host-plant species and between the two habitats. The similarity of AMF communities colonizing different plant species within a habitat was higher than that of the same plant species from different habitats. The molecular evidence supported our previous hypothesis, based on morphological analyses, that AMF communities were more influenced by habitat than by host preference.

  11. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses.

    PubMed

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-11-26

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics.

  12. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses

    PubMed Central

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-01-01

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics. PMID:27898037

  13. Deep COI sequencing of standardized benthic samples unveils overlooked diversity of Jordanian coral reefs in the northern Red Sea.

    PubMed

    Al-Rshaidat, Mamoon M D; Snider, Allison; Rosebraugh, Sydney; Devine, Amanda M; Devine, Thomas D; Plaisance, Laetitia; Knowlton, Nancy; Leray, Matthieu

    2016-09-01

    High-throughput sequencing (HTS) of DNA barcodes (metabarcoding), particularly when combined with standardized sampling protocols, is one of the most promising approaches for censusing overlooked cryptic invertebrate communities. We present biodiversity estimates based on sequencing of the cytochrome c oxidase subunit 1 (COI) gene for coral reefs of the Gulf of Aqaba, a semi-enclosed system in the northern Red Sea. Samples were obtained from standardized sampling devices (Autonomous Reef Monitoring Structures (ARMS)) deployed for 18 months. DNA barcoding of non-sessile specimens >2 mm revealed 83 OTUs in six phyla, of which only 25% matched a reference sequence in public databases. Metabarcoding of the 2 mm - 500 μm and sessile bulk fractions revealed 1197 OTUs in 15 animal phyla, of which only 4.9% matched reference barcodes. These results highlight the scarcity of COI data for cryptobenthic organisms of the Red Sea. Compared with data obtained using similar methods, our results suggest that Gulf of Aqaba reefs are less diverse than two Pacific coral reefs but much more diverse than an Atlantic oyster reef at a similar latitude. The standardized approaches used here show promise for establishing baseline data on biodiversity, monitoring the impacts of environmental change, and quantifying patterns of diversity at regional and global scales.

  14. Influence of volcanic activity on the population genetic structure of Hawaiian Tetragnatha spiders: Fragmentation, rapid population growth and the potential for accelerated evolution

    USGS Publications Warehouse

    Vandergast, A.G.; Gillespie, R.G.; Roderick, G.K.

    2004-01-01

    Volcanic activity on the island of Hawaii results in a cyclical pattern of habitat destruction and fragmentation by lava, followed by habitat regeneration on newly formed substrates. While this pattern has been hypothesized to promote the diversification of Hawaiian lineages, there have been few attempts to link geological processes to measurable changes in population structure. We investigated the genetic structure of three species of Hawaiian spiders in forests fragmented by a 150-year-old lava flow on Mauna Loa Volcano, island of Hawaii: Tetragnatha quasimodo (forest and lava flow generalist), T. anuenue and T. brevignatha (forest specialists). To estimate fragmentation effects on population subdivision in each species, we examined variation in mitochondrial and nuclear genomes (DNA sequences and allozymes, respectively). Population subdivision was higher for forest specialists than for the generalist in fragments separated by lava. Patterns of mtDNA sequence evolution also revealed that forest specialists have undergone rapid expansion, while the generalist has experienced more gradual population growth. Results confirm that patterns of neutral genetic variation reflect patterns of volcanic activity in some Tetragnatha species. Our study further suggests that population subdivision and expansion can occur across small spatial and temporal scales, which may facilitate the rapid spread of new character states, leading to speciation as hypothesized by H. L. Carson 30 years ago.

  15. Identification of Y-Chromosome Sequences in Turner Syndrome.

    PubMed

    Silva-Grecco, Roseane Lopes da; Trovó-Marqui, Alessandra Bernadete; Sousa, Tiago Alves de; Croce, Lilian Da; Balarin, Marly Aparecida Spadotto

    2016-05-01

    To investigate the presence of Y-chromosome sequences and determine their frequency in patients with Turner syndrome. The study included 23 patients with Turner syndrome from Brazil, who gave written informed consent for participating in the study. Cytogenetic analyses were performed in peripheral blood lymphocytes, with 100 metaphases per patient. Genomic DNA was also extracted from peripheral blood lymphocytes, and gene sequences DYZ1, DYZ3, ZFY and SRY were amplified by Polymerase Chain Reaction. The cytogenetic analysis showed a 45,X karyotype in 9 patients (39.2 %) and a mosaic pattern in 14 (60.8 %). In 8.7 % (2 out of 23) of the patients, Y-chromosome sequences were found. This prevalence is very similar to those reported previously. The initial karyotype analysis of these patients did not reveal Y-chromosome material, but they were found positive for Y-specific sequences in the lymphocyte DNA analysis. The PCR technique showed that 2 (8.7 %) of the patients with Turner syndrome had Y-chromosome sequences, both presenting marker chromosomes on cytogenetic analysis.

  16. Cenozoic sedimentation in the Mumbai Offshore Basin: Implications for tectonic evolution of the western continental margin of India

    NASA Astrophysics Data System (ADS)

    Nair, Nisha; Pandey, Dhananjai K.

    2018-02-01

    Interpretation of multichannel seismic (MCS) reflection data along the Mumbai Offshore Basin (MOB) revealed the tectonic processes that led to the development of sedimentary basins during Cenozoic evolution. Structural interpretation along three selected MCS profiles from the MOB revealed seven major sedimentary sequences (∼3.0 s TWT thick) and the associated complex fault patterns. These stratigraphic sequences are interpreted to host detritus of syn- to post-rift events during the rift-drift process. The acoustic basement appeared to be faulted, with interspersed intrusive bodies. The sections also depicted the presence of slumping of sediments, subsidence, marginal basins, rollover anticlines, mud diapirs, etc., accompanied by normal to thrust faults related to recent tectonics. The presence of upthrusts in the slope region marks the locations of local compression during collision. Forward gravity modeling, constrained with seismic and drilling results, revealed that the crustal structure beneath the MOB has undergone extensional tectonics and contains intrusive bodies. Results from the seismo-gravity modeling in association with litholog data from drilled wells from the western continental margin of India (WCMI) are presented here.

  17. Characterisation and expression of microRNAs in developing wings of the neotropical butterfly Heliconius melpomene

    PubMed Central

    2011-01-01

    Background: Heliconius butterflies are an excellent system for studies of adaptive convergent and divergent phenotypic traits. Wing colour patterns are used as signals to both predators and potential mates and are inherited in a Mendelian manner. The underlying genetic mechanisms of pattern formation have been studied for many years and shed light on broad issues, such as the repeatability of evolution. In Heliconius melpomene, the yellow hindwing bar is controlled by the HmYb locus. MicroRNAs (miRNAs) are important post-transcriptional regulators of gene expression that have key roles in many biological processes, including development. miRNAs could act as regulators of genes involved in wing development, patterning and pigmentation. For this reason we characterised miRNAs in developing butterfly wings and examined differences in their expression between colour pattern races. Results: We sequenced small RNA libraries from two colour pattern races and detected 142 Heliconius miRNAs with homology to others found in miRBase. Several highly abundant miRNAs were differentially represented in the libraries between colour pattern races. These candidates were tested further using Northern blots, showing that differences in expression were primarily due to developmental stage rather than colour pattern. Assembly of sequenced reads to the HmYb region identified hme-miR-193 and hme-miR-2788, located 2380 bp apart in an intergenic region. These two miRNAs are expressed in wings and show an upregulation between 24 and 72 hours post-pupation, indicating a potential role in butterfly wing development. A search for miRNAs in all available H. melpomene BAC sequences (~2.5 Mb) did not reveal any other miRNAs and no novel miRNAs were predicted. Conclusions: Here we describe the first butterfly miRNAs and characterise their expression in developing wings. Some show differences in expression across developing pupal stages and may have important functions in butterfly wing development. Two miRNAs were located in the HmYb region and were expressed in developing pupal wings. Future work will examine the expression of these miRNAs in different colour pattern races and identify miRNA targets among wing patterning genes. PMID:21266089

  18. Characterisation and expression of microRNAs in developing wings of the neotropical butterfly Heliconius melpomene.

    PubMed

    Surridge, Alison K; Lopez-Gomollon, Sara; Moxon, Simon; Maroja, Luana S; Rathjen, Tina; Nadeau, Nicola J; Dalmay, Tamas; Jiggins, Chris D

    2011-01-26

    Heliconius butterflies are an excellent system for studies of adaptive convergent and divergent phenotypic traits. Wing colour patterns are used as signals to both predators and potential mates and are inherited in a Mendelian manner. The underlying genetic mechanisms of pattern formation have been studied for many years and shed light on broad issues, such as the repeatability of evolution. In Heliconius melpomene, the yellow hindwing bar is controlled by the HmYb locus. MicroRNAs (miRNAs) are important post-transcriptional regulators of gene expression that have key roles in many biological processes, including development. miRNAs could act as regulators of genes involved in wing development, patterning and pigmentation. For this reason we characterised miRNAs in developing butterfly wings and examined differences in their expression between colour pattern races. We sequenced small RNA libraries from two colour pattern races and detected 142 Heliconius miRNAs with homology to others found in miRBase. Several highly abundant miRNAs were differentially represented in the libraries between colour pattern races. These candidates were tested further using Northern blots, showing that differences in expression were primarily due to developmental stage rather than colour pattern. Assembly of sequenced reads to the HmYb region identified hme-miR-193 and hme-miR-2788, located 2380 bp apart in an intergenic region. These two miRNAs are expressed in wings and show an upregulation between 24 and 72 hours post-pupation, indicating a potential role in butterfly wing development. A search for miRNAs in all available H. melpomene BAC sequences (~2.5 Mb) did not reveal any other miRNAs and no novel miRNAs were predicted. Here we describe the first butterfly miRNAs and characterise their expression in developing wings. Some show differences in expression across developing pupal stages and may have important functions in butterfly wing development. Two miRNAs were located in the HmYb region and were expressed in developing pupal wings. Future work will examine the expression of these miRNAs in different colour pattern races and identify miRNA targets among wing patterning genes.

  19. Sequential Pattern Mining of Electronic Healthcare Reimbursement Claims: Experiences and Challenges in Uncovering How Patients are Treated by Physicians

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pullum, Laura L; Ramanathan, Arvind; Hobson, Tanner C

    We examine the use of electronic healthcare reimbursement claims (EHRC) for analyzing healthcare delivery and practice patterns across the United States (US). We show that EHRCs are correlated with disease incidence estimates published by the Centers for Disease Control. Further, by analyzing over 1 billion EHRCs, we track patterns of clinical procedures administered to patients with autism spectrum disorder (ASD), heart disease (HD) and breast cancer (BC) using sequential pattern mining algorithms. Our analyses reveal that in contrast to treating HD and BC, clinical procedures for ASD diagnoses are highly varied leading up to and after the ASD diagnoses. The discovered clinical procedure sequences also reveal significant differences in the overall costs incurred across different parts of the US, indicating a lack of consensus amongst practitioners in treating ASD patients. We show that a data-driven approach to understand clinical trajectories using EHRC can provide quantitative insights into how to better manage and treat patients. Based on our experience, we also discuss emerging challenges in using EHRC datasets for gaining insights into the state of contemporary healthcare delivery and practice in the US.

  20. Identification of MicroRNAs in Helicoverpa armigera and Spodoptera litura Based on Deep Sequencing and Homology Analysis

    PubMed Central

    Ge, Xie; Zhang, Yong; Jiang, Jianhao; Zhong, Yi; Yang, Xiaonan; Li, Zhiqian; Huang, Yongping; Tan, Anjiang

    2013-01-01

    The current identification of microRNAs (miRNAs) in insects is largely dependent on genome sequences. However, the lack of available genome sequences inhibits the identification of miRNAs in various insect species. In this study, we used a miRNA database of the silkworm Bombyx mori as a reference to identify miRNAs in Helicoverpa armigera and Spodoptera litura using deep sequencing and homology analysis. Because all three species belong to the Lepidoptera, the experiment produced reliable results. Our study identified 97 and 91 conserved miRNAs in H. armigera and S. litura, respectively. Using the genome of B. mori and BAC sequences of H. armigera as references, 1 novel miRNA and 8 novel miRNA candidates were identified in H. armigera, and 4 novel miRNA candidates were identified in S. litura. An evolutionary analysis revealed that most of the identified miRNAs were insect-specific, and more than 20 miRNAs were Lepidoptera-specific. The investigation of the expression patterns of miR-2a, miR-34, miR-2796-3p and miR-11 revealed their potential roles in insect development. miRNA target prediction revealed that conserved miRNA target sites exist in various genes in the 3 species. Conserved miRNA target sites for the Hsp90 gene among the 3 species were validated in the mammalian 293T cell line using a dual-luciferase reporter assay. Our study provides a new approach with which to identify miRNAs in insects lacking genome information and contributes to the functional analysis of insect miRNAs. PMID:23289012

  1. Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections.

    PubMed

    Chambers, E Anne; Hebert, Paul D N

    2016-01-01

    High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale.

  2. Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections

    PubMed Central

    Chambers, E. Anne; Hebert, Paul D. N.

    2016-01-01

    Background: High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. Methodology/Principal Findings: This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. Conclusions/Significance: This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale. PMID:27116180

  3. Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome.

    PubMed

    Singh, Vinod Kumar; Krishnamachari, Annangarachari

    2016-09-01

    Genome-wide experimental studies in Saccharomyces cerevisiae reveal that an autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS-like patterns in the genome. However, only a few hundred of these sites act as replicating sites and the rest are considered dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that bind to the origin-recognition complex (ORC), denoted as ORC-ACS, and non-replicating ACS sequences (nrACS) that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS show marked differences in nucleotide composition and context features in their vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within their nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.

  4. Soil bacterial diversity patterns and drivers along an elevational gradient on Shennongjia Mountain, China

    PubMed Central

    Zhang, Yuguang; Cong, Jing; Lu, Hui; Li, Guangliang; Xue, Yadong; Deng, Ye; Li, Hui; Zhou, Jizhong; Li, Diqiang

    2015-01-01

    Understanding the elevational patterns of biological diversity and their driving factors is indispensable for developing ecological theories. Elevational gradients may minimize the impact of environmental factors and are ideal places to study soil microbial elevational patterns. In this study, we selected four typical vegetation types from 1000 to 2800 m above sea level on the northern slope of Shennongjia Mountain in central China, and analysed the soil bacterial community composition, elevational patterns and the relationship between soil bacterial diversity and environmental factors using 16S rRNA Illumina sequencing and multivariate statistical analysis. The results revealed that the dominant bacterial phyla were Acidobacteria, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria and Verrucomicrobia, which accounted for over 75% of the bacterial sequences obtained from tested samples, and the soil bacterial operational taxonomic unit (OTU) richness showed a significant monotonic decreasing trend (P < 0.01) with increasing elevation. The similarity of soil bacterial population composition decreased significantly (P < 0.01) as elevational distance increased, as measured by the Jaccard and Bray–Curtis indices. Canonical correspondence analysis and Mantel test analysis indicated that plant diversity and soil pH were significantly correlated (P < 0.01) with the soil bacterial community. Therefore, the soil bacterial diversity on Shennongjia Mountain had a significant and different elevational pattern, and plant diversity and soil pH may be the key factors in shaping the soil bacterial spatial pattern. PMID:26032124

  5. Hybridization Reveals the Evolving Genomic Architecture of Speciation

    PubMed Central

    Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

    2014-01-01

    SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670

  6. Frequent inter-species transmission and geographic subdivision in avian influenza viruses from wild birds.

    PubMed

    Chen, Rubing; Holmes, Edward C

    2009-01-05

    Revealing the factors that shape the genetic structure of avian influenza viruses (AIVs) in wild bird populations is essential to understanding their evolution. However, the relationship between epidemiological dynamics and patterns of genetic diversity in AIV is not well understood, especially at the continental scale. To address this question, we undertook a phylogeographic analysis of complete genome sequences of AIV sampled from wild birds in North America. In particular, we asked whether host species, geographic location or sampling time played the major role in shaping patterns of viral genetic diversity. Strikingly, our analysis revealed no strong species effect, yet a significant viral clustering by time and place of sampling, as well as the circulation of multiple viral lineages in single locations. These results suggest that AIVs can readily infect many of the bird species that share breeding/feeding areas.

  7. Amino acid positions subject to multiple coevolutionary constraints can be robustly identified by their eigenvector network centrality scores.

    PubMed

    Parente, Daniel J; Ray, J Christian J; Swint-Kruse, Liskin

    2015-12-01

    As proteins evolve, amino acid positions key to protein structure or function are subject to mutational constraints. These positions can be detected by analyzing sequence families for amino acid conservation or for coevolution between pairs of positions. Coevolutionary scores are usually rank-ordered and thresholded to reveal the top pairwise scores, but they also can be treated as weighted networks. Here, we used network analyses to bypass a major complication of coevolution studies: For a given sequence alignment, alternative algorithms usually identify different, top pairwise scores. We reconciled results from five commonly-used, mathematically divergent algorithms (ELSC, McBASC, OMES, SCA, and ZNMI), using the LacI/GalR and 1,6-bisphosphate aldolase protein families as models. Calculations used unthresholded coevolution scores from which column-specific properties such as sequence entropy and random noise were subtracted; "central" positions were identified by calculating various network centrality scores. When compared among algorithms, network centrality methods, particularly eigenvector centrality, showed markedly better agreement than comparisons of the top pairwise scores. Positions with large centrality scores occurred at key structural locations and/or were functionally sensitive to mutations. Further, the top central positions often differed from those with top pairwise coevolution scores: instead of a few strong scores, central positions often had multiple, moderate scores. We conclude that eigenvector centrality calculations reveal a robust evolutionary pattern of constraints, detectable by divergent algorithms, that occur at key protein locations. Finally, we discuss the fact that multiple patterns coexist in evolutionary data that, together, give rise to emergent protein functions. © 2015 Wiley Periodicals, Inc.

  8. Deep sequencing of small RNA repertoires in mice reveals metabolic disorders-associated hepatic miRNAs.

    PubMed

    Liang, Tingming; Liu, Chang; Ye, Zhenchao

    2013-01-01

    Obesity and associated metabolic disorders contribute importantly to the metabolic syndrome. On the other hand, microRNAs (miRNAs) are a class of small non-coding RNAs that repress target gene expression by inducing mRNA degradation and/or translation repression. Dysregulation of specific miRNAs in obesity may influence energy metabolism and cause insulin resistance, which leads to dyslipidemia, steatosis hepatis and type 2 diabetes. In the present study, we comprehensively analyzed and validated dysregulated miRNAs in ob/ob mouse liver, as well as miRNA groups based on miRNA gene cluster and gene family by using deep sequencing miRNA datasets. We found that over 13.8% of the total analyzed miRNAs were dysregulated, of which 37 miRNA species showed significantly differential expression. Further RT-qPCR analysis of some selected miRNAs validated the similar expression patterns observed in deep sequencing. Interestingly, we found that miRNA gene clusters and families always showed consistent dysregulation patterns in ob/ob mouse liver, although they had various enrichment levels. Functional enrichment analysis revealed the versatile physiological roles (over six signal pathways and five human diseases) of these miRNAs. Biological studies indicated that overexpression of miR-126 or inhibition of miR-24 in AML-12 cells attenuated free fatty acids-induced fat accumulation. Taken together, our data strongly suggest that obesity and metabolic disturbance are tightly associated with functional miRNAs. We also identified hepatic miRNA candidates serving as potential biomarkers for the diagnosis of the metabolic syndrome.

  9. Influences of Plant Species, Season and Location on Leaf Endophytic Bacterial Communities of Non-Cultivated Plants

    PubMed Central

    Ding, Tao; Melcher, Ulrich

    2016-01-01

    Bacteria are known to be associated endophytically with plants. Research on endophytic bacteria has identified their importance in food safety, agricultural production and phytoremediation. However, the diversity of endophytic bacterial communities and the forces that shape their compositions in non-cultivated plants are largely uncharacterized. In this study, we explored the diversity, community structure, and dynamics of endophytic bacteria in different plant species in the Tallgrass Prairie Preserve of northern Oklahoma, USA. High throughput sequencing of amplified segments of bacterial rDNA from 81 samples collected at four sampling times from five plant species at four locations identified 335 distinct OTUs at 97% sequence similarity, representing 16 phyla. Proteobacteria was the dominant phylum in the communities, followed by the phyla Bacteroidetes and Actinobacteria. Bacteria from four classes of Proteobacteria were detected with Alphaproteobacteria as the dominant class. Analysis of molecular variance revealed that host plant species and collecting date had significant influences on the compositions of the leaf endophytic bacterial communities. The proportion of Alphaproteobacteria was much higher in the communities from Asclepias viridis than from other plant species and differed from month to month. The most dominant bacterial groups identified in LDA Effect Size analysis showed host-specific patterns, indicating mutual selection between host plants and endophytic bacteria and that leaf endophytic bacterial compositions were dynamic, varying with the host plant’s growing season in three distinct patterns. In summary, next generation sequencing has revealed variations in the taxonomic compositions of leaf endophytic bacterial communities dependent primarily on the nature of the plant host species. PMID:26974817

  10. Influences of Plant Species, Season and Location on Leaf Endophytic Bacterial Communities of Non-Cultivated Plants.

    PubMed

    Ding, Tao; Melcher, Ulrich

    2016-01-01

    Bacteria are known to be associated endophytically with plants. Research on endophytic bacteria has identified their importance in food safety, agricultural production and phytoremediation. However, the diversity of endophytic bacterial communities and the forces that shape their compositions in non-cultivated plants are largely uncharacterized. In this study, we explored the diversity, community structure, and dynamics of endophytic bacteria in different plant species in the Tallgrass Prairie Preserve of northern Oklahoma, USA. High throughput sequencing of amplified segments of bacterial rDNA from 81 samples collected at four sampling times from five plant species at four locations identified 335 distinct OTUs at 97% sequence similarity, representing 16 phyla. Proteobacteria was the dominant phylum in the communities, followed by the phyla Bacteroidetes and Actinobacteria. Bacteria from four classes of Proteobacteria were detected with Alphaproteobacteria as the dominant class. Analysis of molecular variance revealed that host plant species and collecting date had significant influences on the compositions of the leaf endophytic bacterial communities. The proportion of Alphaproteobacteria was much higher in the communities from Asclepias viridis than from other plant species and differed from month to month. The most dominant bacterial groups identified in LDA Effect Size analysis showed host-specific patterns, indicating mutual selection between host plants and endophytic bacteria and that leaf endophytic bacterial compositions were dynamic, varying with the host plant's growing season in three distinct patterns. In summary, next generation sequencing has revealed variations in the taxonomic compositions of leaf endophytic bacterial communities dependent primarily on the nature of the plant host species.

  11. Single-strand conformation polymorphism (SSCP)-based mutation scanning approaches to fingerprint sequence variation in ribosomal DNA of ascaridoid nematodes.

    PubMed

    Zhu, X Q; Gasser, R B

    1998-06-01

    In this study, we assessed single-strand conformation polymorphism (SSCP)-based approaches for their capacity to fingerprint sequence variation in ribosomal DNA (rDNA) of ascaridoid nematodes of veterinary and/or human health significance. The second internal transcribed spacer region (ITS-2) of rDNA was utilised as the target region because it is known to provide species-specific markers for this group of parasites. ITS-2 was amplified by PCR from genomic DNA derived from individual parasites and subjected to analysis. Direct SSCP analysis of amplicons from seven taxa (Toxocara vitulorum, Toxocara cati, Toxocara canis, Toxascaris leonina, Baylisascaris procyonis, Ascaris suum and Parascaris equorum) showed that the single-strand (ss) ITS-2 patterns produced allowed their unequivocal identification to species. While no variation in SSCP patterns was detected in the ITS-2 within four species for which multiple samples were available, the method allowed the direct display of four distinct sequence types of ITS-2 among individual worms of T. cati. Comparison of SSCP/sequencing with the methods of dideoxy fingerprinting (ddF) and restriction endonuclease fingerprinting (REF) revealed that also ddF allowed the definition of the four sequence types, whereas REF displayed three of four. The findings indicate the usefulness of the SSCP-based approaches for the identification of ascaridoid nematodes to species, the direct display of sequence variation in rDNA and the detection of population variation. The ability to fingerprint microheterogeneity in ITS-2 rDNA using such approaches also has implications for studying fundamental aspects relating to mutational change in rDNA.

  12. Evolution of toll-like receptors in the context of terrestrial ungulates and cetaceans diversification.

    PubMed

    Ishengoma, Edson; Agaba, Morris

    2017-02-16

    Toll-like receptors (TLRs) are the frontline actors in the innate immune response to various pathogens and are expected to be targets of natural selection in species adapted to habitats with contrasting pathogen burdens. The recent publication of genome sequences of giraffe and okapi together afforded the opportunity to examine the evolution of selected TLRs in a broad range of terrestrial ungulates and cetaceans during their complex habitat diversification. Through direct sequence comparisons and standard evolutionary approaches, the extent of nucleotide and protein sequence diversity in seven Toll-like receptors (TLR2, TLR3, TLR4, TLR5, TLR7, TLR9 and TLR10) between giraffe and closely related species was determined. In addition, comparison of the patterning of key TLR motifs and domains between giraffe and related species was performed. The quantification of selection pressure and divergence on TLRs among terrestrial ungulates and cetaceans was also performed. Sequence analysis shows that giraffe has 94-99% nucleotide identity with okapi and cattle for all TLRs analyzed. Variations in the number of Leucine-rich repeats were observed in some TLRs between giraffe, okapi and cattle. Patterning of key TLR domains did not reveal any significant differences in the domain architecture among giraffe, okapi and cattle. Molecular evolutionary analysis for selection pressure identifies positive selection on key sites for all TLRs examined, suggesting that pervasive evolutionary pressure has taken place during the evolution of terrestrial ungulates and cetaceans. Analysis of positively selected sites showed some sites to be part of Leucine-rich motifs, suggesting functional relevance in species-specific recognition of pathogen associated molecular patterns. Notably, clade analysis reveals significant selection divergence between terrestrial ungulates and cetaceans in viral sensing TLR3. Mapping of giraffe TLR3 key substitutions to the structure of the receptor indicates that at least one of the giraffe altered sites coincides with a TLR3 residue known to play a critical role in receptor signaling activity. There is overall structural conservation in TLRs among giraffe, okapi and cattle, indicating that the mechanism for innate immune response utilizing TLR pathways may not have changed very much during the evolution of these species. However, a broader phylogenetic analysis revealed signatures of adaptive evolution among terrestrial ungulates and cetaceans, including the observed selection divergence in TLR3. This suggests that long term ecological dynamics have led to species-specific innovation and functional variation in the mechanisms mediating innate immunity in terrestrial ungulates and cetaceans.

  13. The evolution and phylogeography of the African elephant inferred from mitochondrial DNA sequence and nuclear microsatellite markers.

    PubMed

    Eggert, Lori S; Rasner, Caylor A; Woodruff, David S

    2002-10-07

    Recent genetic results support the recognition of two African elephant species: Loxodonta africana, the savannah elephant, and Loxodonta cyclotis, the forest elephant. The study, however, did not include the populations of West Africa, where the taxonomic affinities of elephants have been much debated. We examined mitochondrial cytochrome b control region sequences and four microsatellite loci to investigate the genetic differences between the forest and savannah elephants of West and Central Africa. We then combined our data with published control region sequences from across Africa to examine patterns at the continental level. Our analysis reveals several deeply divergent lineages that do not correspond with the currently recognized taxonomy: (i) the forest elephants of Central Africa; (ii) the forest and savannah elephants of West Africa; and (iii) the savannah elephants of eastern, southern and Central Africa. We propose that the complex phylogeographic patterns we detect in African elephants result from repeated continental-scale climatic changes over their five-to-six million year evolutionary history. Until there is consensus on the taxonomy, we suggest that the genetic and ecological distinctness of these lineages should be an important factor in conservation management planning.

  14. Shotgun Bisulfite Sequencing of the Betula platyphylla Genome Reveals the Tree’s DNA Methylation Patterning

    PubMed Central

    Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng

    2014-01-01

    DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241

  15. Unilateral congenital terminal finger absences: a condition that differs from symbrachydactyly.

    PubMed

    Knight, Jeffrey B; Pritsch, Tamir; Ezaki, Marybeth; Oishi, Scott N

    2012-01-01

    To describe a type of nonhereditary unilateral transverse deficiency, which we have named hypodactyly, that is distinct from symbrachydactyly or amniotic disruption sequence. We identified 19 patients with unilateral congenital anomalies consisting of absent or short bulbous fingers that lack terminal ectodermal elements. Medical records and radiographs were retrospectively reviewed and contrasted with the typical findings of symbrachydactyly and amniotic disruption sequence. No associated syndromes or potentially causative diagnoses were identified in the hypodactyly patients. The digital absences were of a truncated pattern with thickened, tubular soft tissue coverage. Radiographs revealed a pattern of severity progression that is different from that of symbrachydactyly. Distal phalanges were the bony elements absent most frequently, followed sequentially by the middle phalanx and proximal phalanx. In all cases, metacarpals were present. Unlike symbrachydactyly, the ulnar 2 digits were more involved than the index and long fingers, and the thumb was the least involved digit. Hypodactyly appears to be a congenital hand anomaly that is clinically and radiographically different from symbrachydactyly or amniotic disruption sequence and is presumed to be caused by a distinct pathomechanism. Prognostic IV. Copyright © 2012 American Society for Surgery of the Hand. Published by Elsevier Inc. All rights reserved.

  16. Strong regularities in world wide web surfing

    PubMed

    Huberman; Pirolli; Pitkow; Lukose

    1998-04-03

    One of the most common modes of accessing information in the World Wide Web is surfing from one document to another along hyperlinks. Several large empirical studies have revealed common patterns of surfing behavior. A model that assumes that users make a sequence of decisions to proceed to another page, continuing as long as the value of the current page exceeds some threshold, yields the probability distribution for the number of pages that a user visits within a given Web site. This model was verified by comparing its predictions with detailed measurements of surfing patterns. The model also explains the observed Zipf-like distributions in page hits observed at Web sites.

  17. Proximal dominant hereditary motor and sensory neuropathy with proximal dominance association with mutation in the TRK-fused gene.

    PubMed

    Lee, Sang-Soo; Lee, Hye Jin; Park, Jin-Mo; Hong, Young Bin; Park, Kee-Duk; Yoo, Jeong Hyun; Koo, Heasoo; Jung, Sung-Chul; Park, Hyung Soon; Lee, Ji Hyun; Lee, Min Goo; Hyun, Young Se; Nakhro, Khriezhanou; Chung, Ki Wha; Choi, Byung-Ok

    2013-05-01

    Hereditary motor and sensory neuropathy with proximal dominance (HMSN-P) has been reported as a rare type of autosomal dominant adult-onset Charcot-Marie-Tooth disease. HMSN-P has been described only in Japanese descendants since 1997, and the causative gene has not been found. To identify the genetic cause of HMSN-P in a Korean family and determine the pathogenic mechanism. Genetic and observational analysis. Translational research center for rare neurologic disease. Twenty-eight individuals (12 men and 16 women) from a Korean family with HMSN-P. Whole-exome sequencing, linkage analysis, and magnetic resonance imaging. Through whole-exome sequencing, we revealed that HMSN-P is caused by a mutation in the TRK-fused gene (TFG). Clinical heterogeneities were revealed in HMSN-P between Korean and Japanese patients. The patients in the present report showed faster progression of the disease compared with the Japanese patients, and sensory nerve action potentials of the sural nerve were lost in the early stages of the disease. Moreover, tremor and hyperlipidemia were frequently found. Magnetic resonance imaging of the lower extremity revealed a distinct proximal dominant and sequential pattern of muscular involvement with a clearly different pattern than patients with Charcot-Marie-Tooth disease type 1A. Particularly, endoneural blood vessels revealed marked narrowing of the lumen with swollen vesicular endothelial cells. The underlying cause of HMSN-P proves to be a mutation in TFG that lies on chromosome 3q13.2. This disease is not limited to Japanese descendants, and marked narrowing of endoneural blood vessels was noted in the present study. We believe that TFG can affect the peripheral nerve tissue.

  18. Skull ontogeny: developmental patterns of fishes conserved across major tetrapod clades.

    PubMed

    Schoch, Rainer R

    2006-01-01

    In vertebrates, the ontogeny of the bony skull forms a particularly complex part of embryonic development. Although this area used to be restricted to neontology, recent discoveries of fossil ontogenies provide an additional source of data. One of the most detailed ossification sequences is known from Permo-Carboniferous amphibians, the branchiosaurids. These temnospondyls form a near-perfect link between the piscine osteichthyans and the various clades of extant tetrapods, retaining a full complement of dermal bones in the skull. For the first time, the broader evolutionary significance of these event sequences is analyzed, focusing on the identification of sequence heterochronies. A set of 120 event pairs was analyzed by event pair cracking, which helped identify active movers. A cladistic analysis of the event pair data was also carried out, highlighting some shared patterns between widely divergent clades of tetrapods. The analyses revealed an unexpected degree of similarity between the widely divergent taxa. Most interesting is the apparently modular composition of the cranial sequence: five clusters of bones were discovered in each of which the elements form in the same time window: (1) jaw bones, (2) marginal palatal elements, (3) circumorbital bones, (4) skull roof elements, and (5) neurocranial ossifications. In the studied taxa, these "modules" have in most cases been shifted fore and back on the trajectory relative to the Amia sequence, but did not disintegrate. Such "modules" might indicate a high degree of evolutionary limitation (constraint).
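
    The event-pair encoding mentioned in this abstract can be illustrated with a minimal sketch. It assumes one common convention (0 = first event precedes the second, 1 = simultaneous, 2 = first event follows the second); the abstract does not state which coding was used, and the bone names and ranks below are hypothetical. Note that 120 is the number of unordered pairs among 16 events, although the abstract does not give the event count.

```python
from itertools import combinations


def event_pairs(ranks):
    """Encode the relative timing of every pair of events.

    ranks maps an event name to its position in an ossification
    sequence (ties allowed). Assumed coding: 0 = a before b,
    1 = simultaneous, 2 = a after b.
    """
    codes = {}
    for a, b in combinations(sorted(ranks), 2):
        if ranks[a] < ranks[b]:
            codes[(a, b)] = 0
        elif ranks[a] == ranks[b]:
            codes[(a, b)] = 1
        else:
            codes[(a, b)] = 2
    return codes


# Hypothetical three-bone sequence: maxilla first, then frontal and parietal together.
example = event_pairs({"maxilla": 1, "frontal": 2, "parietal": 2})
```

    Comparing such pair codes between taxa is what allows shifts in relative timing (sequence heterochronies) to be detected; the cracking step that identifies which events actively moved is not shown here.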

  19. Spatial and temporal plasticity of chromatin during programmed DNA-reorganization in Stylonychia macronuclear development

    PubMed Central

    Postberg, Jan; Heyse, Katharina; Cremer, Marion; Cremer, Thomas; Lipps, Hans J

    2008-01-01

    Background: In this study we exploit the unique genome organization of ciliates to characterize the biological function of histone modification patterns and chromatin plasticity for the processing of specific DNA sequences during a nuclear differentiation process. Ciliates are single-cell eukaryotes containing two morphologically and functionally specialized types of nuclei, the somatic macronucleus and the germline micronucleus. In the course of sexual reproduction a new macronucleus develops from a micronuclear derivative. During this process specific DNA sequences are eliminated from the genome, while sequences that will be transcribed in the mature macronucleus are retained. Results: We show by immunofluorescence microscopy, Western analyses and chromatin immunoprecipitation (ChIP) experiments that each nuclear type establishes its specific histone modification signature. Our analyses reveal that the early macronuclear anlage adopts a permissive chromatin state immediately after the fusion of two heterochromatic germline micronuclei. As macronuclear development progresses, repressive histone modifications that specify sequences to be eliminated are introduced de novo. ChIP analyses demonstrate that permissive histone modifications are associated with sequences that will be retained in the new macronucleus. Furthermore, our data support the hypothesis that a PIWI-family protein is involved in a transnuclear cross-talk and in the RNAi-dependent control of developmental chromatin reorganization. Conclusion: Based on these data we present a comprehensive analysis of the spatial and temporal pattern of histone modifications during this nuclear differentiation process. Results obtained in this study may also be relevant for our understanding of chromatin plasticity during metazoan embryogenesis. PMID:19014664

  20. The impact of sampling, PCR, and sequencing replication on discerning changes in drinking water bacterial community over diurnal time-scales.

    PubMed

    Bautista-de Los Santos, Quyen Melina; Schroeder, Joanna L; Blakemore, Oliver; Moses, Jonathan; Haffey, Mark; Sloan, William; Pinto, Ameet J

    2016-03-01

    High-throughput and deep DNA sequencing, particularly amplicon sequencing, is being increasingly utilized to reveal spatial and temporal dynamics of bacterial communities in drinking water systems. Whilst the sampling and methodological biases associated with PCR and sequencing have been studied in other environments, they have not been quantified for drinking water. These biases are likely to have the greatest effect on the ability to characterize subtle spatio-temporal patterns influenced by process/environmental conditions. In such cases, intra-sample variability may swamp any underlying small, systematic variation. To evaluate this, we undertook a study with replication at multiple levels including sampling sites, sample collection, PCR amplification, and high throughput sequencing of 16S rRNA amplicons. The variability inherent to the PCR amplification and sequencing steps is significant enough to mask differences between bacterial communities from replicate samples. This was largely driven by greater variability in detection of rare bacteria (relative abundance <0.01%) across PCR/sequencing replicates as compared to replicate samples. Despite this, we captured significant changes in bacterial community over diurnal time-scales and find that the extent and pattern of diurnal changes is specific to each sampling location. Further, we find diurnal changes in bacterial community arise due to differences in the presence/absence of the low abundance bacteria and changes in the relative abundance of dominant bacteria. Finally, we show that bacterial community composition is significantly different across sampling sites for time-periods during which there are typically rapid changes in water use. This suggests hydraulic changes (driven by changes in water demand) contribute to shaping the bacterial community in bulk drinking water over diurnal time-scales.

  1. Characterization of a prototype strain of hepatitis E virus.

    PubMed Central

    Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

    1992-01-01

    A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. PMID:1731327

  2. Evidence for Long-Timescale Patterns of Synaptic Inputs in CA1 of Awake Behaving Mice.

    PubMed

    Kolb, Ilya; Talei Franzesi, Giovanni; Wang, Michael; Kodandaramaiah, Suhasa B; Forest, Craig R; Boyden, Edward S; Singer, Annabelle C

    2018-02-14

    Repeated sequences of neural activity are a pervasive feature of neural networks in vivo and in vitro. In the hippocampus, sequential firing of many neurons over periods of 100-300 ms reoccurs during behavior and during periods of quiescence. However, it is not known whether the hippocampus produces longer sequences of activity or whether such sequences are restricted to specific network states. Furthermore, whether long repeated patterns of activity are transmitted to single cells downstream is unclear. To answer these questions, we recorded intracellularly from hippocampal CA1 of awake, behaving male mice to examine both subthreshold activity and spiking output in single neurons. In eight of nine recordings, we discovered long (900 ms) reoccurring subthreshold fluctuations or "repeats." Repeats generally were high-amplitude, nonoscillatory events reoccurring with 10 ms precision. Using statistical controls, we determined that repeats occurred more often than would be expected from unstructured network activity (e.g., by chance). Most spikes occurred during a repeat, and when a repeat contained a spike, the spike reoccurred with precision on the order of ≤20 ms, showing that long repeated patterns of subthreshold activity are strongly connected to spike output. Unexpectedly, we found that repeats occurred independently of classic hippocampal network states like theta oscillations or sharp-wave ripples. Together, these results reveal surprisingly long patterns of repeated activity in the hippocampal network that occur nonstochastically, are transmitted to single downstream neurons, and strongly shape their output. This suggests that the timescale of information transmission in the hippocampal network is much longer than previously thought. SIGNIFICANCE STATEMENT We found long (≥900 ms), repeated, subthreshold patterns of activity in CA1 of awake, behaving mice. These repeated patterns ("repeats") occurred more often than expected by chance and with 10 ms precision. Most spikes occurred within repeats and reoccurred with a precision on the order of 20 ms. Surprisingly, there was no correlation between repeat occurrence and classical network states such as theta oscillations and sharp-wave ripples. These results provide strong evidence that long patterns of activity are repeated and transmitted to downstream neurons, suggesting that the hippocampus can generate longer sequences of repeated activity than previously thought.

  3. rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison.

    PubMed

    Hahn, Lars; Leimeister, Chris-André; Ounit, Rachid; Lonardi, Stefano; Morgenstern, Burkhard

    2016-10-01

    Many algorithms for sequence analysis rely on word matching or word statistics. Often, these approaches can be improved if binary patterns representing match and don't-care positions are used as a filter, such that only those positions of words are considered that correspond to the match positions of the patterns. The performance of these approaches, however, depends on the underlying patterns. Herein, we show that the overlap complexity of a pattern set that was introduced by Ilie and Ilie is closely related to the variance of the number of matches between two evolutionarily related sequences with respect to this pattern set. We propose a modified hill-climbing algorithm to optimize pattern sets for database searching, read mapping and alignment-free sequence comparison of nucleic-acid sequences; our implementation of this algorithm is called rasbhari. Depending on the application at hand, rasbhari can either minimize the overlap complexity of pattern sets, maximize their sensitivity in database searching or minimize the variance of the number of pattern-based matches in alignment-free sequence comparison. We show that, for database searching, rasbhari generates pattern sets with slightly higher sensitivity than existing approaches. In our Spaced Words approach to alignment-free sequence comparison, pattern sets calculated with rasbhari led to more accurate estimates of phylogenetic distances than the randomly generated pattern sets that we previously used. Finally, we used rasbhari to generate patterns for short read classification with CLARK-S. Here too, the sensitivity of the results could be improved, compared to the default patterns of the program. We integrated rasbhari into Spaced Words; the source code of rasbhari is freely available at http://rasbhari.gobics.de/.
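
    The idea of a binary pattern with match and don't-care positions can be shown with a minimal sketch. This is not the rasbhari implementation or its optimization procedure; the function names and the example pattern are hypothetical and only illustrate how one fixed pattern filters word matches ('1' = match position, '0' = don't-care position).

```python
from collections import Counter


def spaced_words(seq, pattern):
    """Count the spaced words of seq under a binary pattern.

    For each window of len(pattern) characters, only the characters
    at '1' (match) positions are kept; '0' positions are ignored.
    """
    keep = [i for i, p in enumerate(pattern) if p == "1"]
    n_windows = len(seq) - len(pattern) + 1
    return Counter(
        "".join(seq[start + i] for i in keep) for start in range(n_windows)
    )


def spaced_matches(a, b, pattern):
    """Number of window pairs (one from a, one from b) that agree at all match positions."""
    words_a = spaced_words(a, pattern)
    words_b = spaced_words(b, pattern)
    return sum(count * words_b[word] for word, count in words_a.items())


# The two sequences differ at one position. The contiguous pattern "1111"
# finds no shared word, while "1101" tolerates the mismatch at its
# don't-care position.
contiguous = spaced_matches("ACGTAC", "ACTTAC", "1111")
spaced = spaced_matches("ACGTAC", "ACTTAC", "1101")
```

    The variance of this match count over a set of patterns is the kind of quantity the abstract relates to overlap complexity; computing or optimizing it is beyond this sketch.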

  4. Isolation and characterization of a novel herpesvirus from a free-ranging eastern grey kangaroo (Macropus giganteus).

    PubMed

    Vaz, Paola Karinna; Motha, Julian; McCowan, Christina; Ficorilli, Nino; Whiteley, Pam Lizette; Wilks, Colin Reginald; Hartley, Carol Anne; Gilkerson, James Rudkin; Browning, Glenn Francis; Devlin, Joanne Maree

    2013-01-01

    We isolated a macropodid herpesvirus from a free-ranging eastern grey kangaroo (Macropus giganteus) displaying clinical signs of respiratory disease and possibly neurologic disease. Sequence analysis of the herpesvirus glycoprotein G (gG) and glycoprotein B (gB) genes revealed that the virus was an alphaherpesvirus most closely related to macropodid herpesvirus 2 (MaHV-2) with 82.7% gG and 94.6% gB amino acid sequence identity. Serologic analyses showed similar cross-neutralization patterns to those of MaHV-2. The two viruses had different growth characteristics in cell culture. Most notably, this virus formed significantly larger plaques and extensive syncytia when compared with MaHV-2. No syncytia were observed for MaHV-2. Restriction endonuclease analysis of whole viral genomes demonstrated distinct restriction endonuclease cleavage patterns for all three macropodid herpesviruses. These studies suggest that a distinct macropodid alphaherpesvirus may be capable of infecting and causing disease in eastern grey kangaroos.

  5. Conserved patterns hidden within group A Streptococcus M protein hypervariability recognize human C4b-binding protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buffalo, Cosmo Z.; Bahn-Suh, Adrian J.; Hirakis, Sophia P.

    No vaccine exists against group A Streptococcus (GAS), a leading cause of worldwide morbidity and mortality. A severe hurdle is the hypervariability of its major antigen, the M protein, with >200 different M types known. Neutralizing antibodies typically recognize M protein hypervariable regions (HVRs) and confer narrow protection. In stark contrast, human C4b-binding protein (C4BP), which is recruited to the GAS surface to block phagocytic killing, interacts with a remarkably large number of M protein HVRs (apparently ~90%). Such broad recognition is rare, and we discovered a unique mechanism for this through the structure determination of four sequence-diverse M proteins in complexes with C4BP. The structures revealed a uniform and tolerant ‘reading head’ in C4BP, which detected conserved sequence patterns hidden within hypervariability. Our results open up possibilities for rational therapies that target the M–C4BP interaction, and also inform a path towards vaccine design.

  6. Analysis of the skin transcriptome in two oujiang color varieties of common carp.

    PubMed

    Wang, Chenghui; Wachholtz, Michael; Wang, Jun; Liao, Xiaolin; Lu, Guoqing

    2014-01-01

    Body color and coloration patterns are important phenotypic traits to maintain survival and reproduction activities. The Oujiang color varieties of common carp (Cyprinus carpio var. color), with a narrow distribution in Zhejiang Province of China and a history of aquaculture for over 1,200 years, consistently exhibit a variety of body color patterns. The molecular mechanism underlying diverse color patterns in these variants is unknown. For practical purposes, it is essential to develop molecular markers that can distinguish different phenotypes and assist selective breeding. In this exploratory study, we conducted Roche 454 transcriptome sequencing of two pooled skin tissue samples of Oujiang common carp, which correspond to distinct color patterns, red with big black spots (RB) and whole white (WW), and a total of 737,525 sequence reads were generated. The reads obtained in this study were co-assembled with common carp Roche 454 sequencing reads downloaded from the NCBI SRA database, resulting in 43,923 isotigs and 546,676 singletons. Over 31 thousand (31,445; 71.6%) isotigs were found with significant BLAST matches (E<1e-10) to the nr protein database, which corresponds to 12,597 annotated zebrafish genes. A total of 70,947 isotigs and singletons (transcripts) were annotated with Gene Ontology, and 60,221 transcripts were found with corresponding EC numbers. Out of 145 zebrafish pigmentation genes, orthologs for 117 were recovered in Oujiang color carp transcriptome, including 18 found only among singletons. Our transcriptome analysis revealed over 52,902 SNPs in Oujiang common carp, and identified 63 SNP markers that are putatively unique either for RB or WW. The transcriptome of Oujiang color varieties of common carp obtained through this study, along with the pigmentation genes recovered and the color pattern-specific molecular markers developed, will facilitate future research on the molecular mechanism of color patterns and promote aquaculture of Oujiang color varieties of common carp through molecular marker assisted-selective breeding.

  7. Scaling of theory-of-mind understandings in Chinese children.

    PubMed

    Wellman, Henry M; Fang, Fuxi; Liu, David; Zhu, Liqi; Liu, Guoxiong

    2006-12-01

    Prior research demonstrates that understanding of theory of mind develops at different paces in children raised in different cultures. Are these differences simply differences in timing, or do they represent different patterns of cultural learning? That is, to what extent are sequences of theory-of-mind understanding universal, and to what extent are they culture-specific? We addressed these questions by using a theory-of-mind scale to examine performance of 140 Chinese children living in Beijing and to compare their performance with that of 135 English-speaking children living in the United States and Australia. Results reveal a common sequence of understanding, as well as sociocultural differences in children's developing theories of mind.

  8. Substance-Abusing Female Offenders as Victims: Chronological Sequencing of Pathways Into the Criminal Justice System

    PubMed Central

    Smith, Vivian C.

    2017-01-01

    This study assesses the entrance of substance-abusing female offenders (N=1,209) into the criminal justice system through temporal patterns (using age of first victimization, drug use and arrest). Nine pathways were identified. Unexpectedly, the leading path was a sequence where drug use preceded arrest in absence of childhood victimization. However, women under a path inclusive of victimization possessed more risk factors. Findings support feminist pathway research, which states that childhood victimization is generally present in female offenders’ lives. Nevertheless, results also revealed that a drug pathway without childhood abuse proved to be as important and even more dominant among criminal justice-involved women. PMID:28824349

  9. Isolation of two new retrotransposon sequences and development of molecular and cytological markers for Dasypyrum villosum (L.).

    PubMed

    Zhang, Jie; Jiang, Yun; Xuan, Pu; Guo, Yuanlin; Deng, Guangbing; Yu, Maoqun; Long, Hai

    2017-10-01

    Dasypyrum villosum is a valuable genetic resource for wheat improvement. With the aim to efficiently monitor the D. villosum chromatin introduced into common wheat, two novel retrotransposon sequences were isolated by RAPD, and were successfully converted to D. villosum-specific SCAR markers. In addition, we constructed a chromosomal karyotype of D. villosum. Our results revealed that different accessions of D. villosum showed slightly different signal patterns, indicating that distribution of repeats did not diverge significantly among D. villosum accessions. The two SCAR markers and FISH karyotype of D. villosum could be used for efficient and precise identification of D. villosum chromatin in wheat breeding.

  10. A confidence interval analysis of sampling effort, sequencing depth, and taxonomic resolution of fungal community ecology in the era of high-throughput sequencing.

    PubMed

    Oono, Ryoko

    2017-01-01

    High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions 'how and why are communities different?' This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences.
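
    To make the notion of a confidence interval on a community dissimilarity concrete, here is a minimal sketch. The abstract does not say how the intervals were computed; the percentile bootstrap over samples, the Bray-Curtis measure, and all names and counts below are assumptions chosen for illustration, not the study's method.

```python
import random


def bray_curtis(x, y):
    """Bray-Curtis dissimilarity between two abundance vectors of equal length."""
    total = sum(x) + sum(y)
    if total == 0:
        return 0.0
    return sum(abs(a - b) for a, b in zip(x, y)) / total


def mean_dissimilarity(group_a, group_b):
    """Average Bray-Curtis dissimilarity over all between-group sample pairs."""
    values = [bray_curtis(a, b) for a in group_a for b in group_b]
    return sum(values) / len(values)


def bootstrap_ci(group_a, group_b, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval, resampling samples within each group."""
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        resampled_a = [rng.choice(group_a) for _ in group_a]
        resampled_b = [rng.choice(group_b) for _ in group_b]
        stats.append(mean_dissimilarity(resampled_a, resampled_b))
    stats.sort()
    lower = stats[int(alpha / 2 * n_boot)]
    upper = stats[max(int((1 - alpha / 2) * n_boot) - 1, 0)]
    return lower, upper


# Hypothetical read counts per taxon for two small groups of samples.
site_1 = [[30, 5, 0, 1], [28, 7, 1, 0], [25, 9, 0, 2]]
site_2 = [[10, 20, 4, 0], [12, 18, 6, 1], [8, 22, 3, 0]]
interval = bootstrap_ci(site_1, site_2)
```

    Rerunning such a procedure with more samples per group, or with counts subsampled to a lower depth, is one way to see how sampling effort and sequencing depth change the width of the interval.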

  11. A confidence interval analysis of sampling effort, sequencing depth, and taxonomic resolution of fungal community ecology in the era of high-throughput sequencing

    PubMed Central

    2017-01-01

    High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions ‘how and why are communities different?’ This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences. PMID:29253889

  12. Intronic sequences are required for AINTEGUMENTA-LIKE6 expression in Arabidopsis flowers.

    PubMed

    Krizek, Beth A

    2015-10-12

    The AINTEGUMENTA-LIKE6/PLETHORA3 (AIL6/PLT3) gene of Arabidopsis thaliana is a key regulator of growth and patterning in both shoots and roots. AIL6 encodes an AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) transcription factor that is expressed in the root stem cell niche, the peripheral region of the shoot apical meristem and young lateral organ primordia. In flowers, AIL6 acts redundantly with AINTEGUMENTA (ANT) to regulate floral organ positioning, growth, identity and patterning. Experiments were undertaken to define the genomic regions required for AIL6 function and expression in flowers. Transgenic plants expressing a copy of the coding region of AIL6 in the context of 7.7 kb of 5' sequence and 919 bp of 3' sequence (AIL6:cAIL6-3') fail to fully complement AIL6 function when assayed in the ant-4 ail6-2 double mutant background. In contrast, a genomic copy of AIL6 with the same amount of 5' and 3' sequence (AIL6:gAIL6-3') can fully complement ant-4 ail6-2. In addition, a genomic copy of AIL6 with 590 bp of 5' sequence and 919 bp of 3' sequence (AIL6m:gAIL6-3') complements ant-4 ail6-2 and contains all regulatory elements needed to confer normal AIL6 expression in inflorescences. Efforts to map cis-regulatory elements reveal that the third intron of AIL6 contains enhancer elements that confer expression in young flowers but in a broader pattern than that of AIL6 mRNA in wild-type flowers. Some AIL6:gAIL6-3' and AIL6m:gAIL6-3' lines confer an over-rescue phenotype in the ant-4 ail6-2 background that is correlated with higher levels of AIL6 mRNA accumulation. The results presented here indicate that AIL6 intronic sequences serve as transcriptional enhancer elements. In addition, the results show that increased expression of AIL6 can partially compensate for loss of ANT function in flowers.

  13. Empirical synchronized flow in oversaturated city traffic.

    PubMed

    Kerner, Boris S; Hemmerle, Peter; Koller, Micha; Hermanns, Gerhard; Klenov, Sergey L; Rehborn, Hubert; Schreckenberg, Michael

    2014-09-01

    Based on a study of anonymized GPS probe vehicle traces measured by personal navigation devices in vehicles randomly distributed in city traffic, empirical synchronized flow in oversaturated city traffic has been revealed. It turns out that real oversaturated city traffic resulting from speed breakdown in a city in most cases can be considered random spatiotemporal alternations between sequences of moving queues and synchronized flow patterns in which the moving queues do not occur.

  14. DNA barcoding of gypsy moths from China (Lepidoptera: Erebidae) reveals new haplotypes and divergence patterns within gypsy moth subspecies

    Treesearch

    Fang Chen; Youqing Luo; Melody A. Keena; Ying Wu; Peng Wu; Juan Shi

    2015-01-01

    The gypsy moth from Asia (two subspecies) is considered a greater threat to North America than European gypsy moth, because of a broader host range and females being capable of flight. Variation within and among gypsy moths from China (nine locations), one of the native countries of Asian gypsy moth, were compared using DNA barcode sequences (658 bp of mtDNA cytochrome...

  15. Targeted capture sequencing in Whitebark pine reveals range-wide demographic and adaptive patterns despite challenges of a large, repetitive genome

    Treesearch

    John V. Syring; Jacob A. Tennessen; Tara N. Jennings; Jill Wegrzyn; Camille Scelfo-Dalbey; Richard Cronn

    2016-01-01

    Whitebark pine (Pinus albicaulis) inhabits an expansive range in western North America, and it is a keystone species of subalpine environments. Whitebark is susceptible to multiple threats – climate change, white pine blister rust, mountain pine beetle, and fire exclusion – and it is suffering significant mortality range-wide, prompting the tree to be listed as ‘...

  16. Molecular Characterization of Tomato 3-Dehydroquinate Dehydratase-Shikimate:NADP Oxidoreductase

    PubMed Central

    Bischoff, Markus; Schaller, Andreas; Bieri, Fabian; Kessler, Felix; Amrhein, Nikolaus; Schmid, Jürg

    2001-01-01

    Analysis of cDNAs encoding the bifunctional 3-dehydroquinate dehydratase-shikimate:NADP oxidoreductase (DHQase-SORase) from tomato (Lycopersicon esculentum) revealed two classes of cDNAs that differed by 57 bp within the coding regions, but were otherwise identical. Comparison of these cDNA sequences with the sequence of the corresponding single gene unequivocally proved that the primary transcript is differentially spliced, potentially giving rise to two polypeptides that differ by 19 amino acids. Quantitative real-time polymerase chain reaction revealed that the longer transcript constitutes at most 1% to 2% of DHQase-SORase transcripts. Expression of the respective polypeptides in Escherichia coli mutants lacking the DHQase or the SORase activity gave functional complementation only in case of the shorter polypeptide, indicating that skipping of a potential exon is a prerequisite for the production of an enzymatically active protein. The deduced amino acid sequence revealed that the DHQase-SORase is most likely synthesized as a precursor with a very short (13-amino acid) plastid-specific transit peptide. Like other genes encoding enzymes of the prechorismate pathway in tomato, this gene is elicitor-inducible. Tissue-specific expression resembles the patterns obtained for 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase 2 and dehydroquinate synthase genes. This work completes our studies of the prechorismate pathway in that cDNAs for all seven enzymes (including isozymes) of the prechorismate pathway from tomato have now been characterized. PMID:11299368

  17. Revealing glacier flow and surge dynamics from animated satellite image sequences: examples from the Karakoram

    NASA Astrophysics Data System (ADS)

    Paul, F.

    2015-04-01

    Although animated images are very popular on the Internet, they have so far found only limited use for glaciological applications. With long time-series of satellite images becoming increasingly available and glaciers being well recognized for their rapid changes and variable flow dynamics, animated sequences of multiple satellite images reveal glacier dynamics in a time-lapse mode, making the otherwise slow changes of glacier movement visible and understandable for a wide public. For this study animated image sequences were created from freely available image quick-looks of orthorectified Landsat scenes for four regions in the central Karakoram mountain range. The animations play automatically in a web-browser and might help to demonstrate glacier flow dynamics for educational purposes. The animations revealed highly complex patterns of glacier flow and surge dynamics over a 15-year time period (1998-2013). In contrast to other regions, surging glaciers in the Karakoram are often small (around 10 km²), steep, debris free, and advance for several years at comparably low annual rates (a few hundred m a⁻¹). The advance periods of individual glaciers are generally out of phase, indicating a limited climatic control on their dynamics. On the other hand, nearly all other glaciers in the region are either stable or slightly advancing, indicating balanced or even positive mass budgets over the past few years to decades.

  18. Cheese rind communities provide tractable systems for in situ and in vitro studies of microbial diversity

    PubMed Central

    Wolfe, Benjamin E.; Button, Julie E.; Santarelli, Marcela; Dutton, Rachel J.

    2014-01-01

    SUMMARY Tractable microbial communities are needed to bridge the gap between observations of patterns of microbial diversity and mechanisms that can explain these patterns. We developed cheese rinds as model microbial communities by characterizing in situ patterns of diversity and by developing an in vitro system for community reconstruction. Sequencing of 137 different rind communities across 10 countries revealed 24 widely distributed and culturable genera of bacteria and fungi as dominant community members. Reproducible community types formed independent of geographic location of production. Intensive temporal sampling demonstrated that assembly of these communities is highly reproducible. Patterns of community composition and succession observed in situ can be recapitulated in a simple in vitro system. Widespread positive and negative interactions were identified between bacterial and fungal community members. Cheese rind microbial communities represent an experimentally tractable system for defining mechanisms that influence microbial community assembly and function. PMID:25036636

  19. Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

    PubMed

    Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

    1997-05-01

    The complete nucleotide sequences of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were, respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses, suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.

  20. A Single Banana Streak Virus Integration Event in the Banana Genome as the Origin of Infectious Endogenous Pararetrovirus

    PubMed Central

    Gayral, Philippe; Noa-Carrazana, Juan-Carlos; Lescot, Magali; Lheureux, Fabrice; Lockhart, Benham E. L.; Matsumoto, Takashi; Piffanelli, Pietro; Iskra-Caruana, Marie-Line

    2008-01-01

    Sequencing of plant nuclear genomes reveals the widespread presence of integrated viral sequences known as endogenous pararetroviruses (EPRVs). Banana is one of the three plant species known to harbor infectious EPRVs. Musa balbisiana carries integrated copies of Banana streak virus (BSV), which are infectious by releasing virions in interspecific hybrids. Here, we analyze the organization of the EPRV of BSV Goldfinger (BSGfV) present in the wild diploid M. balbisiana cv. Pisang Klutuk Wulung (PKW) revealed by the study of Musa bacterial artificial chromosome resources and interspecific genetic cross. cv. PKW contains two similar EPRVs of BSGfV. Genotyping of these integrants and studies of their segregation pattern show an allelic insertion. Despite the fact that integrated BSGfV has undergone extensive rearrangement, both EPRVs contain the full-length viral genome. The high degree of sequence conservation between the integrated and episomal form of the virus indicates a recent integration event; however, only one allele is infectious. Analysis of BSGfV EPRV segregation among an F1 population from an interspecific genetic cross revealed that these EPRV sequences correspond to two alleles originating from a single integration event. We describe here for the first time the full genomic and genetic organization of the two EPRVs of BSGfV present in cv. PKW in response to the challenge facing both scientists and breeders to identify and generate genetic resources free from BSV. We discuss the consequences of this unique host-pathogen interaction in terms of genetic and genomic plant defenses versus strategies of infectious BSGfV EPRVs. PMID:18417582

  1. Distinct pattern of TP53 mutations in human immunodeficiency virus-related head and neck squamous cell carcinoma.

    PubMed

    Gleber-Netto, Frederico O; Zhao, Mei; Trivedi, Sanchit; Wang, Jiping; Jasser, Samar; McDowell, Christina; Kadara, Humam; Zhang, Jiexin; Wang, Jing; William, William N; Lee, J Jack; Nguyen, Minh Ly; Pai, Sara I; Walline, Heather M; Shin, Dong M; Ferris, Robert L; Carey, Thomas E; Myers, Jeffrey N; Pickering, Curtis R

    2018-01-01

    Human immunodeficiency virus-infected individuals (HIVIIs) have a higher incidence of head and neck squamous cell carcinoma (HNSCC), and clinical and histopathological differences have been observed in their tumors in comparison with those of HNSCC patients without a human immunodeficiency virus (HIV) infection. The reasons for these differences are not clear, and molecular differences between HIV-related HNSCC and non-HIV-related HNSCC may exist. This study compared the mutational patterns of HIV-related HNSCC and non-HIV-related HNSCC. The DNA of 20 samples of HIV-related HNSCCs and 32 samples of non-HIV-related HNSCCs was sequenced. DNA libraries covering exons of 18 genes frequently mutated in HNSCC (AJUBA, CASP8, CCND1, CDKN2A, EGFR, FAT1, FBXW7, HLA-A, HRAS, KEAP1, NFE2L2, NOTCH1, NOTCH2, NSD1, PIK3CA, TGFBR2, TP53, and TP63) were prepared and sequenced on an Ion Personal Genome Machine sequencer. DNA sequencing data were analyzed with Ion Reporter software. The human papillomavirus (HPV) status of the tumor samples was assessed with in situ hybridization, the MassARRAY HPV multiplex polymerase chain reaction assay, and p16 immunostaining. Mutation calls were compared among the studied groups. HIV-related HNSCC revealed a distinct pattern of mutations in comparison with non-HIV-related HNSCC. TP53 mutation frequencies were significantly lower in HIV-related HNSCC. Mutations in HIV+ patients tended to be TpC>T nucleotide changes for all mutated genes but especially for TP53. HNSCC in HIVIIs presents a distinct pattern of genetic mutations, particularly in the TP53 gene. HIV-related HNSCC may have a distinct biology, and an effect of the HIV virus on the pathogenesis of these tumors should not be ruled out. Cancer 2018;124:84-94. © 2017 American Cancer Society.

  2. Precursory seismic quiescence: A preliminary assessment of the hypothesis

    USGS Publications Warehouse

    Reasenberg, P.A.; Matthews, M.V.

    1988-01-01

    Numerous cases of precursory seismic quiescence have been reported in recent years. Some investigators have interpreted these observations as evidence that seismic quiescence is a somewhat reliable precursor to moderate or large earthquakes. However, because failures of the pattern to predict earthquakes may not, in general, be reported, and because numerous earthquakes are not preceded by quiescence, the validity and reliability of the quiescence precursor have not been established. We have analyzed the seismicity rate prior to, and in the source region of, 37 shallow earthquakes (M 5.3-7.0) in central California and Japan for patterns of rate fluctuation, especially precursory quiescence. Nonuniformity in rate for these pre-mainshock sequences is relatively high, and numerous intervals with significant (p<0.10) extrema in rate are observed in some of the sequences. In other sequences, however, the rate remains within normal limits up to the time of the mainshock. Overall, in terms of an observational basis for intermediate-term earthquake prediction, no evidence is found in the cases studied for a systematic, widespread or reliable pattern of quiescence prior to the mainshocks. In earthquake sequences comprising full seismic cycles for 5 sets of (M 3.7-5.1) repeat earthquakes on the San Andreas fault near Bear Valley, California, the seismicity rates are found to be uniform. A composite of the estimated rate fluctuations for the sequences, normalized to the length of the seismic cycle, reveals a weak pattern of a low rate in the first third of the cycle, and a high rate in the last few months. While these observations are qualitative, they may represent weak expressions of physical processes occurring in the source region over the seismic cycle. Re-examination of seismicity rate fluctuations in volumes along the creeping section of the San Andreas fault specified by Wyss and Burford (1985) qualitatively confirms the existence of low-rate intervals in volumes 361, 386, 382, 372 and 401. However, only the quiescence in volume 386 is found by the present study to be statistically significant. © 1988 Birkhäuser Verlag.

  3. Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure

    PubMed Central

    2013-01-01

    Background Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. Results We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. Conclusions The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. PMID:24025428

  4. Deletion in the EVC2 gene causes chondrodysplastic dwarfism in Tyrolean Grey cattle.

    PubMed

    Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord

    2014-01-01

    During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle.

  5. Deletion in the EVC2 Gene Causes Chondrodysplastic Dwarfism in Tyrolean Grey Cattle

    PubMed Central

    Murgiano, Leonardo; Jagannathan, Vidhya; Benazzi, Cinzia; Bolcato, Marilena; Brunetti, Barbara; Muscatello, Luisa Vera; Dittmer, Keren; Piffer, Christian; Gentile, Arcangelo; Drögemüller, Cord

    2014-01-01

    During the summer of 2013 seven Italian Tyrolean Grey calves were born with abnormally short limbs. Detailed clinical and pathological examination revealed similarities to chondrodysplastic dwarfism. Pedigree analysis showed a common founder, assuming autosomal monogenic recessive transmission of the defective allele. A positional cloning approach combining genome wide association and homozygosity mapping identified a single 1.6 Mb genomic region on BTA 6 that was associated with the disease. Whole genome re-sequencing of an affected calf revealed a single candidate causal mutation in the Ellis van Creveld syndrome 2 (EVC2) gene. This gene is known to be associated with chondrodysplastic dwarfism in Japanese Brown cattle, and dwarfism, abnormal nails and teeth, and dysostosis in humans with Ellis-van Creveld syndrome. Sanger sequencing confirmed the presence of a 2 bp deletion in exon 19 (c.2993_2994ACdel) that led to a premature stop codon in the coding sequence of bovine EVC2, and was concordant with the recessive pattern of inheritance in affected and carrier animals. This loss of function mutation confirms the important role of EVC2 in bone development. Genetic testing can now be used to eliminate this form of chondrodysplastic dwarfism from Tyrolean Grey cattle. PMID:24733244

  6. Deppdb--DNA electrostatic potential properties database: electrostatic properties of genome DNA.

    PubMed

    Osypov, Alexander A; Krutinin, Gleb G; Kamzolova, Svetlana G

    2010-06-01

    The electrostatic properties of genome DNA influence its interactions with different proteins, in particular, the regulation of transcription by RNA-polymerases. DEPPDB--DNA Electrostatic Potential Properties Database--was developed to hold and provide all available information on the electrostatic properties of genome DNA combined with its sequence and annotation of biological and structural properties of genome elements and whole genomes. Genomes in DEPPDB are organized on a taxonomical basis. Currently, the database contains all the completely sequenced bacterial and viral genomes according to NCBI RefSeq. General properties of the genome DNA electrostatic potential profile and principles of its formation are revealed. This potential correlates with the GC content but does not correspond to it exactly and strongly depends on both the sequence arrangement and its context (flanking regions). Analysis of the promoter regions for bacterial and viral RNA polymerases revealed a correspondence between the scale of these proteins' physical properties and electrostatic profile patterns. We also discovered a direct correlation between the potential value and the binding frequency of RNA polymerase to DNA, supporting the idea of the role of electrostatics in these interactions. This matches a pronounced tendency of the promoter regions to possess higher values of the electrostatic potential.

  7. Novel Sequence-Based Mapping of Recently Emerging H5NX Influenza Viruses Reveals Pandemic Vaccine Candidates

    PubMed Central

    Anderson, Christopher S.; DeDiego, Marta L.; Thakar, Juilee; Topham, David J.

    2016-01-01

    Recently, an avian influenza virus, H5NX subclade 2.3.4.4, emerged and spread to North America. This subclade has frequently reassorted, leading to multiple novel viruses capable of human infection. Four cases of human infections, three leading to death, have already occurred. Existing vaccine strains do not protect against these new viruses, raising a need to identify new vaccine candidate strains. We have developed a novel sequence-based mapping (SBM) tool capable of visualizing complex protein sequence data sets using a single intuitive map. We applied SBM on the complete set of avian H5 viruses in order to better understand hemagglutinin protein variance amongst H5 viruses and identify any patterns associated with this variation. The analysis successfully identified the original reassortments that led to the emergence of this new subclade of H5 viruses, as well as their known unusual ability to reassort among neuraminidase subtypes. In addition, our analysis revealed distinct clusters of 2.3.4.4 variants that would not be covered by existing strains in the H5 vaccine stockpile. The results suggest that our method may be useful for pandemic candidate vaccine virus selection. PMID:27494186

  8. Population and genomic analysis of the genus Halorubrum

    PubMed Central

    Fullmer, Matthew S.; Soucy, Shannon M.; Swithers, Kristen S.; Makkay, Andrea M.; Wheeler, Ryan; Ventosa, Antonio; Gogarten, J. Peter; Papke, R. Thane

    2014-01-01

    The Halobacteria are known to engage in frequent gene transfer and homologous recombination. For stably diverged lineages to persist some checks on the rate of between lineage recombination must exist. We surveyed a group of isolates from the Aran-Bidgol endorheic lake in Iran and sequenced a selection of them. Multilocus Sequence Analysis (MLSA) and Average Nucleotide Identity (ANI) revealed multiple clusters (phylogroups) of organisms present in the lake. Patterns of intein and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) presence/absence and their sequence similarity, GC usage along with the ANI and the identities of the genes used in the MLSA revealed that two of these clusters share an exchange bias toward others in their phylogroup while showing reduced rates of exchange with other organisms in the environment. However, a third cluster, composed in part of named species from other areas of central Asia, displayed many indications of variability in exchange partners, from within the lake as well as outside the lake. We conclude that barriers to gene exchange exist between the two purely Aran-Bidgol phylogroups, and that the third cluster with members from other regions is not a single population and likely reflects an amalgamation of several populations. PMID:24782836

  9. Seasonal and regional diversity of maple sap microbiota revealed using community PCR fingerprinting and 16S rRNA gene clone libraries.

    PubMed

    Filteau, Marie; Lagacé, Luc; LaPointe, Gisèle; Roy, Denis

    2010-04-01

    An arbitrary primed community PCR fingerprinting technique based on capillary electrophoresis was developed to study maple sap microbial community characteristics among 19 production sites in Québec over the tapping season. Presumptive fragment identification was made with corresponding fingerprint profiles of bacterial isolate cultures. Maple sap microbial communities were subsequently compared using a representative subset of 13 16S rRNA gene clone libraries followed by gene sequence analysis. Results from both methods indicated that all maple sap production sites and flow periods shared common microbiota members, but distinctive features also existed. Changes over the season in relative abundance of predominant populations showed evidence of a common pattern. Pseudomonas (64%) and Rahnella (8%) were the most abundantly and frequently represented genera of the 2239 sequences analyzed. Janthinobacterium, Leuconostoc, Lactococcus, Weissella, Epilithonimonas and Sphingomonas were revealed as occasional contaminants in maple sap. Maple sap microbiota showed a low level of deep diversity along with a high variation of similar 16S rRNA gene sequences within the Pseudomonas genus. Predominance of Pseudomonas is suggested as a typical feature of maple sap microbiota across geographical regions, production sites, and sap flow periods.

  10. Qualitative and quantitative assessment of Illumina's forensic STR and SNP kits on MiSeq FGx™.

    PubMed

    Sharma, Vishakha; Chow, Hoi Yan; Siegel, Donald; Wurmbach, Elisa

    2017-01-01

    Massively parallel sequencing (MPS) is a powerful tool transforming DNA analysis in multiple fields ranging from medicine, to environmental science, to evolutionary biology. In forensic applications, MPS offers the ability to significantly increase the discriminatory power of human identification as well as aid in mixture deconvolution. However, before the benefits of any new technology can be employed, its quality, consistency, sensitivity, and specificity must be rigorously evaluated in order to gain a detailed understanding of the technique including sources of error, error rates, and other restrictions/limitations. This extensive study assessed the performance of Illumina's MiSeq FGx MPS system and ForenSeq™ kit in nine experimental runs including 314 reaction samples. In-depth data analysis evaluated the consequences of different assay conditions on test results. Variables included: sample numbers per run, targets per run, DNA input per sample, and replications. Results are presented as heat maps revealing patterns for each locus. Data analysis focused on read numbers (allele coverage), drop-outs, drop-ins, and sequence analysis. The study revealed that loci with high read numbers performed better and resulted in fewer drop-outs and well balanced heterozygous alleles. Several loci were prone to drop-outs which led to falsely typed homozygotes and therefore to genotype errors. Sequence analysis of allele drop-in typically revealed a single nucleotide change (deletion, insertion, or substitution). Analyses of sequences, no template controls, and spurious alleles suggest no contamination during library preparation, pooling, and sequencing, but indicate that sequencing or PCR errors may have occurred due to DNA polymerase infidelities. Finally, we found utilizing Illumina's FGx System at recommended conditions does not guarantee 100% outcomes for all samples tested, including the positive control, and required manual editing due to low read numbers and/or allele drop-in. These findings are important for progressing towards implementation of MPS in forensic DNA testing.

  11. Suppressive subtractive hybridization approach revealed differential expression of hypersensitive response and reactive oxygen species production genes in tea (Camellia sinensis (L.) O. Kuntze) leaves during Pestalotiopsis thea infection.

    PubMed

    Senthilkumar, Palanisamy; Thirugnanasambantham, Krishnaraj; Mandal, Abul Kalam Azad

    2012-12-01

    Tea (Camellia sinensis (L.) O. Kuntze) is an economically important plant cultivated for its leaves. Infection of Pestalotiopsis theae in leaves causes gray blight disease and enormous loss to the tea industry. We used suppressive subtractive hybridization (SSH) technique to unravel the differential gene expression pattern during gray blight disease development in tea. Complementary DNA from P. theae-infected and uninfected leaves of disease tolerant cultivar UPASI-10 was used as tester and driver populations respectively. Subtraction efficiency was confirmed by comparing abundance of β-actin gene. A total of 377 and 720 clones with insert size >250 bp from forward and reverse library respectively were sequenced and analyzed. Basic Local Alignment Search Tool analysis revealed 17 sequences in forward SSH library have high degree of similarity with disease and hypersensitive response related genes and 20 sequences with hypothetical proteins while in reverse SSH library, 23 sequences have high degree of similarity with disease and stress response-related genes and 15 sequences with hypothetical proteins. Functional analysis indicated unknown (61 and 59 %) or hypothetical functions (23 and 18 %) for most of the differentially regulated genes in forward and reverse SSH library, respectively, while others have important role in different cellular activities. Majority of the upregulated genes are related to hypersensitive response and reactive oxygen species production. Based on these expressed sequence tag data, putative role of differentially expressed genes were discussed in relation to disease. We also demonstrated the efficiency of SSH as a tool in enriching gray blight disease related up- and downregulated genes in tea. The present study revealed that many genes related to disease resistance were suppressed during P. theae infection and enhancing these genes by the application of inducers may impart better disease tolerance to the plants.

  12. Automating the generation of lexical patterns for processing free text in clinical documents.

    PubMed

    Meng, Frank; Morioka, Craig

    2015-09-01

    Many tasks in natural language processing utilize lexical pattern-matching techniques, including information extraction (IE), negation identification, and syntactic parsing. However, it is generally difficult to derive patterns that achieve acceptable levels of recall while also remaining highly precise. We present a multiple sequence alignment (MSA)-based technique that automatically generates patterns, thereby leveraging language usage to determine the context of words that influence a given target. MSAs capture the commonalities among word sequences and are able to reveal areas of linguistic stability and variation. In this way, MSAs provide a systemic approach to generating lexical patterns that are generalizable, which will both increase recall levels and maintain high levels of precision. The MSA-generated patterns exhibited consistent F1, F0.5, and F2 scores compared to two baseline techniques for IE across four different tasks. Both baseline techniques performed well for some tasks and less well for others, but MSA was found to consistently perform at a high level for all four tasks. The performance of MSA on the four extraction tasks indicates the method's versatility. The results show that the MSA-based patterns are able to handle the extraction of individual data elements as well as relations between two concepts without the need for large amounts of manual intervention. We presented an MSA-based framework for generating lexical patterns that showed consistently high levels of both performance and recall over four different extraction tasks when compared to baseline methods. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved.
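
    The F-scores named in this abstract are weighted harmonic means of precision and recall. The abstract gives no formulas or values, so the following is only a minimal sketch of the standard F-beta definition; the function name `f_beta` and the sample precision/recall numbers are illustrative assumptions, not taken from the paper.

```python
def f_beta(precision: float, recall: float, beta: float = 1.0) -> float:
    """Standard F-beta score: weighted harmonic mean of precision and recall.

    beta > 1 weights recall more heavily (e.g. F2);
    beta < 1 weights precision more heavily (e.g. F0.5).
    """
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    b2 = beta * beta
    denom = b2 * precision + recall
    if denom == 0:
        # Both precision and recall are zero; the score is defined as 0 here.
        return 0.0
    return (1 + b2) * precision * recall / denom


# Hypothetical values, chosen only to show how beta shifts the score.
p, r = 1.0, 0.5
scores = {name: f_beta(p, r, b) for name, b in (("F1", 1.0), ("F0.5", 0.5), ("F2", 2.0))}
```

    With high precision and lower recall, as in the hypothetical values above, F0.5 comes out higher than F1, and F2 lower, which is why reporting all three indicates whether a method favors precision or recall.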

  13. Sequence memory based on coherent spin-interaction neural networks.

    PubMed

    Xia, Min; Wong, W K; Wang, Zhijie

    2014-12-01

    Sequence information processing, for instance sequence memory, plays an important role in many functions of the brain. In the workings of the human brain, the steady-state period is alterable. However, in the existing sequence memory models using heteroassociations, the steady-state period cannot be changed in the sequence recall. In this work, a novel neural network model for sequence memory with controllable steady-state period based on coherent spin-interaction is proposed. In the proposed model, neurons fire collectively in a phase-coherent manner, which lets a neuron group respond differently to different patterns and also lets different neuron groups respond differently to one pattern. The simulation results demonstrating the performance of the sequence memory are presented. By introducing a new coherent spin-interaction sequence memory model, the steady-state period can be controlled by dimension parameters and the overlap between the input pattern and the stored patterns. The sequence storage capacity is enlarged by coherent spin interaction compared with the existing sequence memory models. Furthermore, the sequence storage capacity has an exponential relationship to the dimension of the neural network.

  14. Genomic Investigation of a Legionellosis Outbreak in a Persistently Colonized Hotel.

    PubMed

    Sánchez-Busó, Leonor; Guiral, Silvia; Crespi, Sebastián; Moya, Víctor; Camaró, María L; Olmos, María P; Adrián, Francisco; Morera, Vicente; González-Morán, Francisco; Vanaclocha, Hermelinda; González-Candelas, Fernando

    2015-01-01

    A long-lasting legionellosis outbreak was reported between November 2011 and July 2012 in a hotel in Calpe (Spain) affecting 44 patients including six deaths. Intensive epidemiological and microbiological investigations were performed in order to detect the reservoirs. Clinical and environmental samples were tested for the presence and genetic characterization of Legionella pneumophila. Six of the isolates were subjected to whole-genome sequencing. Sequencing of 14 clinical and 260 environmental samples revealed sequence type (ST) 23 as the main responsible strain for the infections. This ST was found in the spa pool, from where it spread to other hotel public spaces, explaining the ST23 clinical cases, including guests who had not visited the spa. Uncultured clinical specimens showed profiles compatible with ST23, ST578, and mixed patterns. Profiles compatible with ST578 were obtained by direct sequencing from biofilm samples collected from the domestic water system, which provided evidence for the source of infection for non ST23 patients. Whole genome data from five ST23 strains and the identification of different STs and Legionella species showed that different hotel premises were likely colonized since the hotel opening thus explaining how different patients had been infected by distinct STs. Both epidemiological and molecular data are essential in the investigation of legionellosis outbreaks. Whole-genome sequencing data revealed significant intra-ST variability and allowed to make further inference on the short-term evolution of a local colonization of L. pneumophila.

  15. Mhc class II B gene evolution in East African cichlid fishes.

    PubMed

    Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J

    2000-06-01

    A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.

  16. Genomic Investigation of a Legionellosis Outbreak in a Persistently Colonized Hotel

    PubMed Central

    Sánchez-Busó, Leonor; Guiral, Silvia; Crespi, Sebastián; Moya, Víctor; Camaró, María L.; Olmos, María P.; Adrián, Francisco; Morera, Vicente; González-Morán, Francisco; Vanaclocha, Hermelinda; González-Candelas, Fernando

    2016-01-01

    Objectives: A long-lasting legionellosis outbreak was reported between November 2011 and July 2012 in a hotel in Calpe (Spain) affecting 44 patients including six deaths. Intensive epidemiological and microbiological investigations were performed in order to detect the reservoirs. Methods: Clinical and environmental samples were tested for the presence and genetic characterization of Legionella pneumophila. Six of the isolates were subjected to whole-genome sequencing. Results: Sequencing of 14 clinical and 260 environmental samples revealed sequence type (ST) 23 as the main strain responsible for the infections. This ST was found in the spa pool, from where it spread to other hotel public spaces, explaining the ST23 clinical cases, including guests who had not visited the spa. Uncultured clinical specimens showed profiles compatible with ST23, ST578, and mixed patterns. Profiles compatible with ST578 were obtained by direct sequencing from biofilm samples collected from the domestic water system, which provided evidence for the source of infection for non-ST23 patients. Whole genome data from five ST23 strains and the identification of different STs and Legionella species showed that different hotel premises were likely colonized since the hotel opened, thus explaining how different patients had been infected by distinct STs. Conclusions: Both epidemiological and molecular data are essential in the investigation of legionellosis outbreaks. Whole-genome sequencing data revealed significant intra-ST variability and allowed further inference on the short-term evolution of a local colonization of L. pneumophila. PMID:26834713

  17. Sequence and structural analyses of nuclear export signals in the NESdb database

    PubMed Central

    Xu, Darui; Farmer, Alicia; Collett, Garen; Grishin, Nick V.; Chook, Yuh Min

    2012-01-01

    We compiled >200 nuclear export signal (NES)–containing CRM1 cargoes in a database named NESdb. We analyzed the sequences and three-dimensional structures of natural, experimentally identified NESs and of false-positive NESs that were generated from the database in order to identify properties that might distinguish the two groups of sequences. Analyses of amino acid frequencies, sequence logos, and agreement with existing NES consensus sequences revealed strong preferences for the Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern and for negatively charged amino acids in the nonhydrophobic positions of experimentally identified NESs but not of false positives. Strong preferences against certain hydrophobic amino acids in the hydrophobic positions were also revealed. These findings led to a new and more precise NES consensus. More important, three-dimensional structures are now available for 68 NESs within 56 different cargo proteins. Analyses of these structures showed that experimentally identified NESs are more likely than the false positives to adopt α-helical conformations that transition to loops at their C-termini and more likely to be surface accessible within their protein domains or be present in disordered or unobserved parts of the structures. Such distinguishing features for real NESs might be useful in future NES prediction efforts. Finally, we also tested CRM1-binding of 40 NESs that were found in the 56 structures. We found that 16 of the NES peptides did not bind CRM1, hence illustrating how NESs are easily misidentified. PMID:22833565
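
    The Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern named in the abstract above can be illustrated with a simple sequence scan. The sketch below is only an illustration, not the NESdb authors' method: the abstract does not list which residues count as Φ, so the set L, I, V, F, M is an assumption, and the function name and the test peptide are hypothetical.

```python
import re

# Assumption: the hydrophobic positions (Φ) accept L, I, V, F or M.
# The abstract does not define this set; adjust it as needed.
HYDROPHOBIC = "LIVFM"

# Φ1-X3-Φ2-X2-Φ3-X-Φ4, wrapped in a lookahead so overlapping hits are reported.
NES_PATTERN = re.compile(
    rf"(?=([{HYDROPHOBIC}].{{3}}[{HYDROPHOBIC}].{{2}}[{HYDROPHOBIC}].[{HYDROPHOBIC}]))"
)


def find_nes_candidates(sequence):
    """Return (start_index, matched_segment) for every pattern match."""
    return [(m.start(), m.group(1)) for m in NES_PATTERN.finditer(sequence.upper())]


if __name__ == "__main__":
    # Illustrative peptide only; a match is a candidate, not evidence of CRM1 binding.
    print(find_nes_candidates("NSNELALKLAGLDINK"))
```

    As the abstract stresses, a pattern match alone misidentifies many NESs, so hits from a scan like this are candidates to be checked against structural and binding data.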

  18. BrEPS 2.0: Optimization of sequence pattern prediction for enzyme annotation.

    PubMed

    Dudek, Christian-Alexander; Dannheim, Henning; Schomburg, Dietmar

    2017-01-01

    The prediction of gene functions is crucial for a large number of different life science areas. Faster high throughput sequencing techniques generate more and larger datasets. The manual annotation by classical wet-lab experiments is not suitable for these large amounts of data. We showed earlier that the automatic sequence pattern-based BrEPS protocol, based on manually curated sequences, can be used for the prediction of enzymatic functions of genes. The growing sequence databases provide the opportunity for more reliable patterns, but are also a challenge for the implementation of automatic protocols. We reimplemented and optimized the BrEPS pattern generation to be applicable for larger datasets in an acceptable timescale. Primary improvement of the new BrEPS protocol is the enhanced data selection step. Manually curated annotations from Swiss-Prot are used as reliable source for function prediction of enzymes observed on protein level. The pool of sequences is extended by highly similar sequences from TrEMBL and SwissProt. This allows us to restrict the selection of Swiss-Prot entries, without losing the diversity of sequences needed to generate significant patterns. Additionally, a supporting pattern type was introduced by extending the patterns at semi-conserved positions with highly similar amino acids. Extended patterns have an increased complexity, increasing the chance to match more sequences, without losing the essential structural information of the pattern. To enhance the usability of the database, we introduced enzyme function prediction based on consensus EC numbers and IUBMB enzyme nomenclature. BrEPS is part of the Braunschweig Enzyme Database (BRENDA) and is available on a completely redesigned website and as download. The database can be downloaded and used with the BrEPScmd command line tool for large scale sequence analysis. 
The BrEPS website and downloads for the database creation tool, command line tool and database are freely accessible at http://breps.tu-bs.de.

  19. BrEPS 2.0: Optimization of sequence pattern prediction for enzyme annotation

    PubMed Central

    Schomburg, Dietmar

    2017-01-01

    The prediction of gene functions is crucial for a large number of different life science areas. Faster high throughput sequencing techniques generate more and larger datasets. The manual annotation by classical wet-lab experiments is not suitable for these large amounts of data. We showed earlier that the automatic sequence pattern-based BrEPS protocol, based on manually curated sequences, can be used for the prediction of enzymatic functions of genes. The growing sequence databases provide the opportunity for more reliable patterns, but are also a challenge for the implementation of automatic protocols. We reimplemented and optimized the BrEPS pattern generation to be applicable for larger datasets in an acceptable timescale. Primary improvement of the new BrEPS protocol is the enhanced data selection step. Manually curated annotations from Swiss-Prot are used as reliable source for function prediction of enzymes observed on protein level. The pool of sequences is extended by highly similar sequences from TrEMBL and SwissProt. This allows us to restrict the selection of Swiss-Prot entries, without losing the diversity of sequences needed to generate significant patterns. Additionally, a supporting pattern type was introduced by extending the patterns at semi-conserved positions with highly similar amino acids. Extended patterns have an increased complexity, increasing the chance to match more sequences, without losing the essential structural information of the pattern. To enhance the usability of the database, we introduced enzyme function prediction based on consensus EC numbers and IUBMB enzyme nomenclature. BrEPS is part of the Braunschweig Enzyme Database (BRENDA) and is available on a completely redesigned website and as download. The database can be downloaded and used with the BrEPScmd command line tool for large scale sequence analysis. 
The BrEPS website and downloads for the database creation tool, command line tool and database are freely accessible at http://breps.tu-bs.de. PMID:28750104

  20. Whole-Genome-Sequencing characterization of bloodstream infection-causing hypervirulent Klebsiella pneumoniae of capsular serotype K2 and ST374.

    PubMed

    Wang, Xiaoli; Xie, Yingzhou; Li, Gang; Liu, Jialin; Li, Xiaobin; Tian, Lijun; Sun, Jingyong; Ou, Hong-Yu; Qu, Hongping

    2018-01-01

    Hypervirulent K. pneumoniae variants (hvKP) have been increasingly reported worldwide, causing metastasis of severe infections such as liver abscesses and bacteremia. The capsular serotype K2 hvKP strains show diverse multi-locus sequence types (MLSTs), but with limited genetics and virulence information. In this study, we report a hypermucoviscous K. pneumoniae strain, RJF293, isolated from a human bloodstream sample in a Chinese hospital. It caused a metastatic infection and fatal septic shock in a critical patient. The microbiological features and genetic background were investigated with multiple approaches. Strain RJF293 was determined to be multilocus sequence type (ST) 374 and serotype K2, displayed a median lethal dose (LD50) of 1.5 × 10^2 CFU in BALB/c mice, and was as virulent as the ST23 K1 serotype hvKP strain NTUH-K2044 in a mouse lethality assay. Whole genome sequencing revealed that the RJF293 genome codes for 32 putative virulence factors and exhibits a unique presence/absence pattern in comparison to the other 105 completely sequenced K. pneumoniae genomes. Whole genome SNP-based phylogenetic analysis revealed that strain RJF293 formed a single clade, distant from those containing either ST66 or ST86 hvKP. Compared to the other sequenced hvKP chromosomes, RJF293 contains several strain-variable regions, including one prophage, one ICEKp1 family integrative and conjugative element and six large genomic islands. The sequencing of the first complete genome of an ST374 K2 hvKP clinical strain should reinforce our understanding of the epidemiology and virulence mechanisms of this bloodstream infection-causing hvKP with clinical significance.

  1. Whole-Genome-Sequencing characterization of bloodstream infection-causing hypervirulent Klebsiella pneumoniae of capsular serotype K2 and ST374

    PubMed Central

    Wang, Xiaoli; Xie, Yingzhou; Li, Gang; Liu, Jialin; Li, Xiaobin; Tian, Lijun; Sun, Jingyong; Qu, Hongping

    2018-01-01

    Hypervirulent K. pneumoniae variants (hvKP) have been increasingly reported worldwide, causing metastasis of severe infections such as liver abscesses and bacteremia. The capsular serotype K2 hvKP strains show diverse multi-locus sequence types (MLSTs), but with limited genetics and virulence information. In this study, we report a hypermucoviscous K. pneumoniae strain, RJF293, isolated from a human bloodstream sample in a Chinese hospital. It caused a metastatic infection and fatal septic shock in a critical patient. The microbiological features and genetic background were investigated with multiple approaches. Strain RJF293 was determined to be multilocus sequence type (ST) 374 and serotype K2, displayed a median lethal dose (LD50) of 1.5 × 10^2 CFU in BALB/c mice, and was as virulent as the ST23 K1 serotype hvKP strain NTUH-K2044 in a mouse lethality assay. Whole genome sequencing revealed that the RJF293 genome codes for 32 putative virulence factors and exhibits a unique presence/absence pattern in comparison to the other 105 completely sequenced K. pneumoniae genomes. Whole genome SNP-based phylogenetic analysis revealed that strain RJF293 formed a single clade, distant from those containing either ST66 or ST86 hvKP. Compared to the other sequenced hvKP chromosomes, RJF293 contains several strain-variable regions, including one prophage, one ICEKp1 family integrative and conjugative element and six large genomic islands. The sequencing of the first complete genome of an ST374 K2 hvKP clinical strain should reinforce our understanding of the epidemiology and virulence mechanisms of this bloodstream infection-causing hvKP with clinical significance. PMID:29338592

  2. Sequence analyses and evolutionary relationships among the energy-coupling proteins Enzyme I and HPr of the bacterial phosphoenolpyruvate: sugar phosphotransferase system.

    PubMed Central

    Reizer, J.; Hoischen, C.; Reizer, A.; Pham, T. N.; Saier, M. H.

    1993-01-01

    We have previously reported the overexpression, purification, and biochemical properties of the Bacillus subtilis Enzyme I of the phosphoenolpyruvate: sugar phosphotransferase system (PTS) (Reizer, J., et al., 1992, J. Biol. Chem. 267, 9158-9169). We now report the sequencing of the ptsI gene of B. subtilis encoding Enzyme I (570 amino acids and 63,076 Da). Putative transcriptional regulatory signals are identified, and the pts operon is shown to be subject to carbon source-dependent regulation. Multiple alignments of the B. subtilis Enzyme I with (1) six other sequenced Enzymes I of the PTS from various bacterial species, (2) phosphoenolpyruvate synthase of Escherichia coli, and (3) bacterial and plant pyruvate: phosphate dikinases (PPDKs) revealed regions of sequence similarity as well as divergence. Statistical analyses revealed that these three types of proteins comprise a homologous family, and the phylogenetic tree of the 11 sequenced protein members of this family was constructed. This tree was compared with that of the 12 sequenced HPr proteins or protein domains. Antibodies raised against the B. subtilis and E. coli Enzymes I exhibited immunological cross-reactivity with each other as well as with PPDK of Bacteroides symbiosus, providing support for the evolutionary relationships of these proteins suggested from the sequence comparisons. Putative flexible linkers tethering the N-terminal and the C-terminal domains of protein members of the Enzyme I family were identified, and their potential significance with regard to Enzyme I function is discussed. The codon choice pattern of the B. subtilis and E. coli ptsI and ptsH genes was found to exhibit a bias toward optimal codons in these organisms.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:7686067

  3. DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

    PubMed

    El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

    2007-01-01

    We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.

  4. Ancient DNA studies: new perspectives on old samples

    PubMed Central

    2012-01-01

    In spite of past controversies, the field of ancient DNA is now a reliable research area due to recent methodological improvements. A series of recent large-scale studies have revealed the true potential of ancient DNA samples to study the processes of evolution and to test models and assumptions commonly used to reconstruct patterns of evolution and to analyze population genetics and palaeoecological changes. Recent advances in DNA technologies, such as next-generation sequencing, make it possible to recover DNA information from archaeological and paleontological remains, allowing us to go back in time and study the genetic relationships between extinct organisms and their contemporary relatives. With the next-generation sequencing methodologies, DNA sequences can be retrieved even from samples (for example human remains) for which the technical pitfalls of classical methodologies required stringent criteria to guarantee the reliability of the results. In this paper, we review the methodologies applied to ancient DNA analysis and the perspectives that next-generation sequencing applications provide in this field. PMID:22697611

  5. Subclonal diversification of primary breast cancer revealed by multiregion sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yates, Lucy R.; Gerstung, Moritz; Knappskog, Stian

    Sequencing cancer genomes may enable tailoring of therapeutics to the underlying biological abnormalities driving a particular patient's tumor. However, sequencing-based strategies rely heavily on representative sampling of tumors. To understand the subclonal structure of primary breast cancer, we applied whole-genome and targeted sequencing to multiple samples from each of 50 patients' tumors (303 samples in total). The extent of subclonal diversification varied among cases and followed spatial patterns. No strict temporal order was evident, with point mutations and rearrangements affecting the most common breast cancer genes, including PIK3CA, TP53, PTEN, BRCA2 and MYC, occurring early in some tumors and late in others. In 13 out of 50 cancers, potentially targetable mutations were subclonal. Landmarks of disease progression, such as resistance to chemotherapy and the acquisition of invasive or metastatic potential, arose within detectable subclones of antecedent lesions. These findings highlight the importance of including analyses of subclonal structure and tumor evolution in clinical trials of primary breast cancer.

  6. Evidence for two attentional components in visual working memory.

    PubMed

    Allen, Richard J; Baddeley, Alan D; Hitch, Graham J

    2014-11-01

    How does executive attentional control contribute to memory for sequences of visual objects, and what does this reveal about storage and processing in working memory? Three experiments examined the impact of a concurrent executive load (backward counting) on memory for sequences of individually presented visual objects. Experiments 1 and 2 found disruptive concurrent load effects of equivalent magnitude on memory for shapes, colors, and colored shape conjunctions (as measured by single-probe recognition). These effects were present only for Items 1 and 2 in a 3-item sequence; the final item was always impervious to this disruption. This pattern of findings was precisely replicated in Experiment 3 when using a cued verbal recall measure of shape-color binding, with error analysis providing additional insights concerning attention-related loss of early-sequence items. These findings indicate an important role for executive processes in maintaining representations of earlier encountered stimuli in an active form alongside privileged storage of the most recent stimulus. PsycINFO Database Record (c) 2014 APA, all rights reserved.

  7. Subclonal diversification of primary breast cancer revealed by multiregion sequencing

    DOE PAGES

    Yates, Lucy R.; Gerstung, Moritz; Knappskog, Stian; ...

    2015-06-22

    Sequencing cancer genomes may enable tailoring of therapeutics to the underlying biological abnormalities driving a particular patient's tumor. However, sequencing-based strategies rely heavily on representative sampling of tumors. To understand the subclonal structure of primary breast cancer, we applied whole-genome and targeted sequencing to multiple samples from each of 50 patients' tumors (303 samples in total). The extent of subclonal diversification varied among cases and followed spatial patterns. No strict temporal order was evident, with point mutations and rearrangements affecting the most common breast cancer genes, including PIK3CA, TP53, PTEN, BRCA2 and MYC, occurring early in some tumors and late in others. In 13 out of 50 cancers, potentially targetable mutations were subclonal. Landmarks of disease progression, such as resistance to chemotherapy and the acquisition of invasive or metastatic potential, arose within detectable subclones of antecedent lesions. These findings highlight the importance of including analyses of subclonal structure and tumor evolution in clinical trials of primary breast cancer.

  8. Correlation between genome reduction and bacterial growth.

    PubMed

    Kurokawa, Masaomi; Seno, Shigeto; Matsuda, Hideo; Ying, Bei-Wen

    2016-12-01

    Genome reduction by removing dispensable genomic sequences in bacteria is commonly used in both fundamental and applied studies to determine the minimal genetic requirements for a living system or to develop highly efficient bioreactors. Nevertheless, whether and how the accumulative loss of dispensable genomic sequences disturbs bacterial growth remains unclear. To investigate the relationship between genome reduction and growth, a series of Escherichia coli strains carrying genomes reduced in a stepwise manner were used. Intensive growth analyses revealed that the accumulation of multiple genomic deletions caused decreases in the exponential growth rate and the saturated cell density in a deletion-length-dependent manner as well as gradual changes in the patterns of growth dynamics, regardless of the growth media. Accordingly, a perspective growth model linking genome evolution to genome engineering was proposed. This study provides the first demonstration of a quantitative connection between genomic sequence and bacterial growth, indicating that growth rate is potentially associated with dispensable genomic sequences. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  9. Evolutionary mechanisms involved in the virulence of infectious salmon anaemia virus (ISAV), a piscine orthomyxovirus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Markussen, Turhan; Jonassen, Christine Monceyron; Numanovic, Sanela

    2008-05-10

    Infectious salmon anaemia virus (ISAV) is an orthomyxovirus causing a multisystemic, emerging disease in Atlantic salmon. Here we present, for the first time, detailed sequence analyses of the full-genome sequence of a presumed avirulent isolate displaying a full-length hemagglutinin-esterase (HE) gene (HPR0), and compare this with full-genome sequences of 11 Norwegian ISAV isolates from clinically diseased fish. These analyses revealed the presence of a virulence marker right upstream of the putative cleavage site R267 in the fusion (F) protein, suggesting a Q266 → L266 substitution to be a prerequisite for virulence. To gain virulence in isolates lacking this substitution, a sequence insertion near the cleavage site seems to be required. This strongly suggests the involvement of a protease recognition pattern at the cleavage site of the fusion protein as a determinant of virulence, as seen in highly pathogenic influenza A virus H5 or H7 and the paramyxovirus Newcastle disease virus.

  10. Insights into fungal communities in composts revealed by 454-pyrosequencing: implications for human health and safety.

    PubMed

    De Gannes, Vidya; Eudoxie, Gaius; Hickey, William J

    2013-01-01

    Fungal community composition in composts of lignocellulosic wastes was assessed via 454-pyrosequencing of ITS1 libraries derived from the three major composting phases. Ascomycota represented most (93%) of the 27,987 fungal sequences. A total of 102 genera, 120 species, and 222 operational taxonomic units (OTUs; >97% similarity) were identified. Thirty genera predominated (ca. 94% of the sequences), and at the species level, sequences matching Chaetomium funicola and Fusarium oxysporum were the most abundant (26 and 12%, respectively). In all composts, fungal diversity in the mature phase exceeded that of the mesophilic phase, but there was no consistent pattern in diversity changes occurring in the thermophilic phase. Fifteen species of human pathogens were identified, eight of which have not been previously identified in composts. This study demonstrated that deep sequencing can elucidate fungal community diversity in composts, and that this information can have important implications for compost use and human health.

  11. Insights into fungal communities in composts revealed by 454-pyrosequencing: implications for human health and safety

    PubMed Central

    De Gannes, Vidya; Eudoxie, Gaius; Hickey, William J.

    2013-01-01

    Fungal community composition in composts of lignocellulosic wastes was assessed via 454-pyrosequencing of ITS1 libraries derived from the three major composting phases. Ascomycota represented most (93%) of the 27,987 fungal sequences. A total of 102 genera, 120 species, and 222 operational taxonomic units (OTUs; >97% similarity) were identified. Thirty genera predominated (ca. 94% of the sequences), and at the species level, sequences matching Chaetomium funicola and Fusarium oxysporum were the most abundant (26 and 12%, respectively). In all composts, fungal diversity in the mature phase exceeded that of the mesophilic phase, but there was no consistent pattern in diversity changes occurring in the thermophilic phase. Fifteen species of human pathogens were identified, eight of which have not been previously identified in composts. This study demonstrated that deep sequencing can elucidate fungal community diversity in composts, and that this information can have important implications for compost use and human health. PMID:23785368

  12. Variation along ITS markers across strains of Fibrocapsa japonica (Raphidophyceae) suggests hybridisation events and recent range expansion

    NASA Astrophysics Data System (ADS)

    Kooistra, Wiebe H. C. F.; de Boer, M. Karin; Vrieling, Engel G.; Connell, Laurie B.; Gieskes, Winfried W. C.

    2001-12-01

    The flagellate micro-alga Fibrocapsa japonica can form harmful algal blooms along all temperate coastal regions of the world. The species was first observed in coastal waters of Japan and the western US in the 1970s; it has been reported regularly worldwide since. To unravel whether this apparent range expansion can be tracked, we assessed genetic variation among nuclear ribosomal DNA ITS sequences, obtained from sixteen global strains collected over the course of three decades. Ten sequence positions showed polymorphism across the strains. Nine out of these revealed ambiguities in several or most sequences sampled. The oldest strain collected (LB-2161) was the only one without such intra-individual polymorphism. In the others, the proportion of ambiguities at variable sites increased with more recent collection date. The pattern does not result from loss of variation due to sexual reproduction and random drift in culture because sister cultures CS-332 and NIES-136 showed virtually the same ITS-pattern after seven years of separation. Neither are the patterns explained by recent range expansion of a single genotype, because in that case one would expect lowest genetic diversity in the recently invaded North Sea; instead, polymorphism is highest there. Recent ballast-water-mediated mixing of formerly isolated populations and subsequent ongoing sexual reproduction among them can explain the increase in ambiguities. The species' capacity to form harmful blooms may well have been enhanced through increased genetic diversity of regional populations.

  13. Designing and conducting in silico analysis for identifying of Echinococcus spp. with discrimination of novel haplotypes: an approach to better understanding of parasite taxonomic.

    PubMed

    Spotin, Adel; Gholami, Shirzad; Nasab, Abbas Najafi; Fallah, Esmaeil; Oskouei, Mahmoud Mahami; Semnani, Vahid; Shariatzadeh, Seyyed Ali; Shahbazi, Abbas

    2015-04-01

    The definitive identification of Echinococcus species is currently carried out by sequencing and phylogenetic strategies. However, polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) patterns are not broadly used because of the heterogeneity of the Echinococcus genome in different regions of the world. Therefore, designing and conducting a standardized pattern should be considered locally in under-studied areas. In this investigation, an in silico mapping was designed and developed for eight Echinococcus spp. on the basis of regional sequences in Iran and the world. A total of 60 Echinococcus isolates were collected from the liver and lungs of 15 human, 15 sheep, 15 cattle, and 15 camel cases in Semnan province, Central Iran. DNA samples were extracted and examined by polymerase chain reaction of ribosomal DNA (rDNA) internal transcribed spacer 1 (ITS1) and PCR-RFLP via Rsa1 endonuclease enzyme. Moreover, 15 amplicons of cytochrome oxidase 1 (Cox1) were directly sequenced in order to identify the strains/haplotypes. PCR-RFLP and phylogenetic analyses firmly revealed the presence of the G1 and G6 genotypes with heterogeneity (three novel haplotypes) of the Cox1 gene, although no other expected genotypes were found in the region. The findings show that the identification of novel haplotypes, along with discrimination of Echinococcus spp. through regional patterns, can unambiguously illustrate the real taxonomic status of the parasite in Central Iran.
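
    The in silico PCR-RFLP idea in the abstract above amounts to predicting restriction fragment lengths from a sequence. The following is a minimal sketch under stated assumptions, not the authors' pipeline: it uses the GTAC recognition site of the Rsa1 enzyme named in the abstract (usually written RsaI), with a blunt cut between T and A, and it runs on a made-up toy sequence rather than a real ITS1 amplicon. The function name is hypothetical.

```python
def digest_fragment_lengths(sequence, site="GTAC", cut_offset=2):
    """Return fragment lengths after cutting a linear sequence at every site.

    cut_offset is the number of bases of the site left on the upstream
    fragment (2 for a GT^AC blunt cut).
    """
    sequence = sequence.upper()
    cut_positions = []
    pos = sequence.find(site)
    while pos != -1:
        cut_positions.append(pos + cut_offset)
        pos = sequence.find(site, pos + 1)
    # Fragment boundaries: sequence start, each cut, sequence end.
    bounds = [0] + cut_positions + [len(sequence)]
    return [bounds[i + 1] - bounds[i] for i in range(len(bounds) - 1)]


if __name__ == "__main__":
    toy_amplicon = "AAGTACCCGTACT"  # toy sequence, not real Echinococcus data
    print(digest_fragment_lengths(toy_amplicon))
```

    Comparing the predicted fragment lengths across reference sequences of different species is what lets a banding pattern on a gel be assigned to a genotype.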

  14. Normal and compound poisson approximations for pattern occurrences in NGS reads.

    PubMed

    Zhai, Zhiyuan; Reinert, Gesine; Song, Kai; Waterman, Michael S; Luan, Yihui; Sun, Fengzhu

    2012-06-01

    Next generation sequencing (NGS) technologies are now widely used in many biological studies. In NGS, sequence reads are randomly sampled from the genome sequence of interest. Most computational approaches for NGS data first map the reads to the genome and then analyze the data based on the mapped reads. Since many organisms have unknown genome sequences and many reads cannot be uniquely mapped to the genomes even if the genome sequences are known, alternative analytical methods are needed for the study of NGS data. Here we suggest using word patterns to analyze NGS data. Word pattern counting (the study of the probabilistic distribution of the number of occurrences of word patterns in one or multiple long sequences) has played an important role in molecular sequence analysis. However, no studies are available on the distribution of the number of occurrences of word patterns in NGS reads. In this article, we build probabilistic models for the background sequence and the sampling process of the sequence reads from the genome. Based on the models, we provide normal and compound Poisson approximations for the number of occurrences of word patterns from the sequence reads, with bounds on the approximation error. The main challenge is to consider the randomness in generating the long background sequence, as well as in the sampling of the reads using NGS. We show the accuracy of these approximations under a variety of conditions for different patterns with various characteristics. Under realistic assumptions, the compound Poisson approximation seems to outperform the normal approximation in most situations. These approximate distributions can be used to evaluate the statistical significance of the occurrence of patterns from NGS data. The theory and the computational algorithm for calculating the approximate distributions are then used to analyze ChIP-Seq data using transcription factor GABP. 
Software is available online (www-rcf.usc.edu/~fsun/Programs/NGS_motif_power/NGS_motif_power.html). In addition, Supplementary Material can be found online (www.liebertonline.com/cmb).
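
As a rough illustration of the word-pattern counting described in this abstract, the sketch below counts overlapping occurrences of a word across a set of reads and compares the total with the count expected under a simple i.i.d. letter model. It is not the authors' software and does not implement their normal or compound Poisson approximations; the function names, the toy reads, and the i.i.d. background are all assumptions made for illustration.

```python
from collections import Counter


def count_word(reads, word):
    """Count overlapping occurrences of `word` summed over all reads."""
    total = 0
    for read in reads:
        start = read.find(word)
        while start != -1:
            total += 1
            # Step forward by one so overlapping matches are counted.
            start = read.find(word, start + 1)
    return total


def expected_count_iid(reads, word):
    """Expected count if letters were i.i.d. (an assumed, simplified background).

    Letter frequencies are estimated from the reads themselves.
    """
    letters = Counter("".join(reads))
    n = sum(letters.values())
    p_word = 1.0
    for ch in word:
        p_word *= letters[ch] / n
    # Number of start positions at which the word could occur.
    positions = sum(max(len(r) - len(word) + 1, 0) for r in reads)
    return positions * p_word


# Hypothetical toy reads, not real sequencing data.
reads = ["ACGTACGT", "TTACGA", "GGGG"]
observed = count_word(reads, "ACG")
expected = expected_count_iid(reads, "ACG")
print(observed, round(expected, 4))
```

Judging whether the gap between the observed and expected counts is significant is where the approximate distributions from the article would be needed; this sketch stops short of that step.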

  15. The Neural Correlates of Implicit Sequence Learning in Schizophrenia

    PubMed Central

    Marvel, Cherie L.; Turner, Beth M.; O’Leary, Daniel S.; Johnson, Hans J.; Pierson, Ronald K.; Boles Ponto, Laura L.; Andreasen, Nancy C.

    2009-01-01

    Twenty-seven schizophrenia spectrum patients and 25 healthy controls performed a probabilistic version of the serial reaction time task (SRT) that included sequence trials embedded within random trials. Patients showed diminished, yet measurable, sequence learning. Postexperimental analyses revealed that a group of patients performed above chance when generating short spans of the sequence. This high-generation group showed SRT learning that was similar in magnitude to that of controls. Their learning was evident from the very 1st block; however, unlike controls, learning did not develop further with continued testing. A subset of 12 patients and 11 controls performed the SRT in conjunction with positron emission tomography. High-generation performance, which corresponded to SRT learning in patients, correlated to activity in the premotor cortex and parahippocampus. These areas have been associated with stimulus-driven visuospatial processing. Taken together, these results suggest that a subset of patients who showed moderate success on the SRT used an explicit stimulus-driven strategy to process the sequential stimuli. This adaptive strategy facilitated sequence learning but may have interfered with conventional implicit learning of the overall stimulus pattern. PMID:17983290

  16. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  17. Sequence diversity of the leukotoxin (lktA) gene in caprine and ovine strains of Mannheimia haemolytica.

    PubMed

    Vougidou, C; Sandalakis, V; Psaroulaki, A; Petridou, E; Ekateriniadou, L

    2013-04-20

    Mannheimia haemolytica is the aetiological agent of pneumonic pasteurellosis in small ruminants. The primary virulence factor of the bacterium is a leukotoxin (LktA), which induces apoptosis in susceptible cells via mitochondrial targeting. It has been previously shown that certain lktA alleles are associated either with cattle or sheep. The objective of the present study was to investigate lktA sequence variation among ovine and caprine M haemolytica strains isolated from pneumonic lungs, revealing any potential adaptation for the caprine host, for which there is no available data. Furthermore, we investigated amino acid variation in the N-terminal part of the sequences and its effect on targeting mitochondria. Data analysis showed that the prevalent caprine genotype differed at a single non-synonymous site from a previously described uncommon bovine allele, whereas the ovine sequences represented new, distinct alleles. N-terminal sequence differences did not affect the mitochondrial targeting ability of the isolates; interestingly enough in one case, mitochondrial matrix targeting was indicated rather than membrane association, suggesting an alternative LktA trafficking pattern.

  18. Continuities in stone flaking technology at Liang Bua, Flores, Indonesia.

    PubMed

    Moore, M W; Sutikna, T; Jatmiko; Morwood, M J; Brumm, A

    2009-11-01

    This study examines trends in stone tool reduction technology at Liang Bua, Flores, Indonesia, where excavations have revealed a stratified artifact sequence spanning 95k.yr. The reduction sequence practiced throughout the Pleistocene was straightforward and unchanging. Large flakes were produced off-site and carried into the cave where they were reduced centripetally and bifacially by four techniques: freehand, burination, truncation, and bipolar. The locus of technological complexity at Liang Bua was not in knapping products, but in the way techniques were integrated. This reduction sequence persisted across the Pleistocene/Holocene boundary with a minor shift favoring unifacial flaking after 11ka. Other stone-related changes occurred at the same time, including the first appearance of edge-glossed flakes, a change in raw material selection, and more frequent fire-induced damage to stone artifacts. Later in the Holocene, technological complexity was generated by "adding-on" rectangular-sectioned stone adzes to the reduction sequence. The Pleistocene pattern is directly associated with Homo floresiensis skeletal remains and the Holocene changes correlate with the appearance of Homo sapiens. The one reduction sequence continues across this hominin replacement.

  19. Effects of 16S rDNA sampling on estimates of the number of endosymbiont lineages in sucking lice

    PubMed Central

    Burleigh, J. Gordon; Light, Jessica E.; Reed, David L.

    2016-01-01

    Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthiraptera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain. PMID:27547523

  20. Inferring Multiple Refugia and Phylogeographical Patterns in Pinus massoniana Based on Nucleotide Sequence Variation and DNA Fingerprinting

    PubMed Central

    Lin, Chung-Jian; Huang, Chi-Chung; Huang, Chao-Ching; Chiang, Yu-Chung; Chiang, Tzen-Yuh

    2012-01-01

    Background: Pinus massoniana, an ecologically and economically important conifer, is widespread across central and southern mainland China and Taiwan. In this study, we tested the central–marginal paradigm that predicts that the marginal populations tend to be less polymorphic than the central ones in their genetic composition, and examined a founders' effect in the island population. Methodology/Principal Findings: We examined the phylogeography and population structuring of the P. massoniana based on nucleotide sequences of cpDNA atpB-rbcL intergenic spacer, intron regions of the AdhC2 locus, and microsatellite fingerprints. SAMOVA analysis of nucleotide sequences indicated that most genetic variants resided among geographical regions. High levels of genetic diversity in the marginal populations in the south region, a pattern seemingly contradicting the central–marginal paradigm, and the fixation of private haplotypes in most populations indicate that multiple refugia may have existed over the glacial maxima. STRUCTURE analyses on microsatellites revealed that genetic structure of mainland populations was mediated with recent genetic exchanges mostly via pollen flow, and that the genetic composition in east region was intermixed between south and west regions, a pattern likely shaped by gene introgression and maintenance of ancestral polymorphisms. As expected, the small island population in Taiwan was genetically differentiated from mainland populations. Conclusions/Significance: The marginal populations in south region possessed divergent gene pools, suggesting that the past glaciations might have low impacts on these populations at low latitudes. Estimates of ancestral population sizes interestingly reflect a recent expansion in mainland from a rather smaller population, a pattern that seemingly agrees with the pollen record. PMID:22952747

  1. Virus-like attachment sites as structural landmarks of plants retrotransposons.

    PubMed

    Ochoa Cruz, Edgar Andres; Cruz, Guilherme Marcello Queiroga; Vieira, Andréia Prata; Van Sluys, Marie-Anne

    2016-01-01

    The genomic data available nowadays has enabled the study of repetitive sequences and their relationship to viruses. Among them, long terminal repeat retrotransposons (LTR-RTs) are the largest component of most plant genomes, the Gypsy and Copia superfamilies being the most common. Recently it has been found that Del lineage, an LTR-RT of Gypsy superfamily, has putative virus-like attachment (vl-att) sites. This signature, originally described for retroviruses, is recognized by retroviral integrase conferring specificity to the integration process. Here we retrieved 26,092 putative complete LTR-RTs from 10 lineages found in 10 fully sequenced angiosperm genomes and found putative vl-att sites that are a conserved structural landmark across these genomes. Furthermore, we reveal that each plant genome has a distinguishable LTR-RT lineage amplification pattern that could be related to the vl-att sites diversity. We used these patterns to generate a specific quick-response (QR) code for each genome that could be used as a barcode of identification of plants in the future. The universal distribution of vl-att sites represents a new structural feature common to plant LTR-RTs and retroviruses. This is an important finding that expands the information about the structural similarity between LTR-RT and retroviruses. We speculate that the sequence diversity of vl-att sites could be important for the life cycle of retrotransposons, as it was shown for retroviruses. All the structural vl-att site signatures are strong candidates for further functional studies. Moreover, this is the first identification of specific LTR-RT content and their amplification patterns in a large dataset of LTR-RT lineages and angiosperm genomes. These distribution patterns could be used in the future with biotechnological identification purposes.

  2. Response of heat shock protein genes of the oriental fruit moth under diapause and thermal stress reveals multiple patterns dependent on the nature of stress exposure.

    PubMed

    Zhang, Bo; Peng, Yu; Zheng, Jincheng; Liang, Lina; Hoffmann, Ary A; Ma, Chun-Sen

    2016-07-01

    Heat shock protein gene (Hsp) families are thought to be important in thermal adaptation, but their expression patterns under various thermal stresses have still been poorly characterized outside of model systems. We have therefore characterized Hsp genes and their stress responses in the oriental fruit moth (OFM), Grapholita molesta, a widespread global orchard pest, and compared patterns of expression in this species to that of other insects. Genes from four Hsp families showed variable expression levels among tissues and developmental stages. Members of the Hsp40, 70, and 90 families were highly expressed under short exposures to heat and cold. Expression of Hsp40, 70, and Hsc70 family members increased in OFM undergoing diapause, while Hsp90 was downregulated. We found that there was strong sequence conservation of members of large Hsp families (Hsp40, Hsp60, Hsp70, Hsc70) across taxa, but this was not always matched by conservation of expression patterns. When the large Hsps as well as small Hsps from OFM were compared under acute and ramping heat stress, two groups of sHsps expression patterns were apparent, depending on whether expression increased or decreased immediately after stress exposure. These results highlight potential differences in conservation of function as opposed to sequence in this gene family and also point to Hsp genes potentially useful as bioindicators of diapause and thermal stress in OFM.

  3. Whole-exome sequencing reveals a novel missense mutation in the MARS gene related to a rare Charcot-Marie-Tooth neuropathy type 2U.

    PubMed

    Sagi-Dain, Lena; Shemer, Lilach; Zelnik, Nathanel; Zoabi, Yusri; Orit, Sadeh; Adir, Vardit; Schif, Aharon; Peleg, Amir

    2018-06-01

    Charcot-Marie-Tooth (CMT) is a heterogeneous group of progressive disorders, characterized by chronic motor and sensory polyneuropathy. This hereditary disorder is related to numerous genes and varying inheritance patterns. Thus, many patients do not reach a final genetic diagnosis. We describe a 13-year-old girl presenting with progressive bilateral leg weakness and gait instability. Extensive laboratory studies and spinal magnetic resonance imaging scan were normal. Nerve conduction studies revealed severe lower limb peripheral neuropathy with prominent demyelinative component. Following presumptive diagnosis of chronic inflammatory demyelinating polyneuropathy, the patient received treatment with steroids and intravenous immunoglobulins courses for several months, with no apparent improvement. Whole-exome sequencing revealed a novel heterozygous c.2209C>T (p.Arg737Trp) mutation in the MARS gene (OMIM 156560). This gene has recently been related to CMT type 2U. In-silico prediction programs classified this mutation as a probable cause for protein malfunction. Allele frequency data reported this variant in 0.003% of representative Caucasian population. Family segregation analysis study revealed that the patient had inherited the variant from her 60-years old mother, reported as healthy. Neurologic examination of the mother demonstrated decreased tendon reflexes, while nerve conduction studies were consistent with demyelinative and axonal sensory-motor polyneuropathy. Our report highlights the importance of next-generation sequencing approach to facilitate the proper molecular diagnosis of highly heterogeneous neurologic disorders. Amongst other numerous benefits, this approach might prevent unnecessary diagnostic testing and potentially harmful medical treatment. © 2018 Peripheral Nerve Society.

  4. Using high throughput sequencing to explore the biodiversity in oral bacterial communities.

    PubMed

    Diaz, P I; Dupuy, A K; Abusleme, L; Reese, B; Obergfell, C; Choquette, L; Dongari-Bagtzoglou, A; Peterson, D E; Terzi, E; Strausbaugh, L D

    2012-06-01

    High throughput sequencing of 16S ribosomal RNA gene amplicons is a cost-effective method for characterization of oral bacterial communities. However, before undertaking large-scale studies, it is necessary to understand the technique-associated limitations and intrinsic variability of the oral ecosystem. In this work we evaluated bias in species representation using an in vitro-assembled mock community of oral bacteria. We then characterized the bacterial communities in saliva and buccal mucosa of five healthy subjects to investigate the power of high throughput sequencing in revealing their diversity and biogeography patterns. Mock community analysis showed primer and DNA isolation biases and an overestimation of diversity that was reduced after eliminating singleton operational taxonomic units (OTUs). Sequencing of salivary and mucosal communities found a total of 455 OTUs (0.3% dissimilarity) with only 78 of these present in all subjects. We demonstrate that this variability was partly the result of incomplete richness coverage even at great sequencing depths, and so comparing communities by their structure was more effective than comparisons based solely on membership. With respect to oral biogeography, we found inter-subject variability in community structure was lower than site differences between salivary and mucosal communities within subjects. These differences were evident at very low sequencing depths and were mostly caused by the abundance of Streptococcus mitis and Gemella haemolysans in mucosa. In summary, we present an experimental and data analysis framework that will facilitate design and interpretation of pyrosequencing-based studies. Despite challenges associated with this technique, we demonstrate its power for evaluation of oral diversity and biogeography patterns. © 2012 John Wiley & Sons A/S.
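
The singleton-removal step mentioned in this abstract can be sketched as follows. This is only an illustration, not the authors' pipeline: it assumes a singleton is an OTU supported by exactly one read in the whole dataset, and the OTU names and counts are hypothetical.

```python
def drop_singletons(otu_counts):
    """Return a copy of the OTU table without OTUs seen exactly once.

    `otu_counts` maps an OTU identifier to its total read count
    (assumed definition of a singleton: total count == 1).
    """
    return {otu: n for otu, n in otu_counts.items() if n > 1}


# Hypothetical read counts, not data from the study.
otu_counts = {"OTU_1": 120, "OTU_2": 1, "OTU_3": 7, "OTU_4": 1}
filtered = drop_singletons(otu_counts)

richness_before = len(otu_counts)  # number of OTUs before filtering
richness_after = len(filtered)     # number of OTUs after filtering
print(richness_before, richness_after)
```

Dropping singletons lowers the observed richness, which is the direction of the correction the abstract reports for the mock community.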

  5. Molecular Population Genetics of the Alcohol Dehydrogenase Gene Region of DROSOPHILA MELANOGASTER

    PubMed Central

    Aquadro, Charles F.; Desse, Susan F.; Bland, Molly M.; Langley, Charles H.; Laurie-Ahlberg, Cathy C.

    1986-01-01

    Variation in the DNA restriction map of a 13-kb region of chromosome II including the alcohol dehydrogenase structural gene (Adh) was examined in Drosophila melanogaster from natural populations. Detailed analysis of 48 D. melanogaster lines representing four eastern United States populations revealed extensive DNA sequence variation due to base substitutions, insertions and deletions. Cloning of this region from several lines allowed characterization of length variation as due to unique sequence insertions or deletions [nine sizes; 21–200 base pairs (bp)] or transposable element insertions (several sizes, 340 bp to 10.2 kb, representing four different elements). Despite this extensive variation in sequences flanking the Adh gene, only one length polymorphism is clearly associated with altered Adh expression (a copia element approximately 250 bp 5' to the distal transcript start site). Nonetheless, the frequency spectra of transposable elements within and between Drosophila species suggest they are slightly deleterious. Strong nonrandom associations are observed among Adh region sequence variants, ADH allozyme (Fast vs. Slow), ADH enzyme activity and the chromosome inversion In(2L)t. Phylogenetic analysis of restriction map haplotypes suggests that the major twofold component of ADH activity variation (high vs. low, typical of Fast and Slow allozymes, respectively) is due to sequence variation tightly linked to and possibly distinct from that underlying the allozyme difference. The patterns of nucleotide and haplotype variation for Fast and Slow allozyme lines are consistent with the recent increase in frequency and spread of the Fast haplotype associated with high ADH activity. These data emphasize the important role of evolutionary history and strong nonrandom associations among tightly linked sequence variation as determinants of the patterns of variation observed in natural populations. PMID:3026893

  6. Sequence selection by dynamical symmetry breaking in an autocatalytic binary polymer model

    NASA Astrophysics Data System (ADS)

    Fellermann, Harold; Tanaka, Shinpei; Rasmussen, Steen

    2017-12-01

    Template-directed replication of nucleic acids is at the essence of all living beings and a major milestone for any origin of life scenario. We present an idealized model of prebiotic sequence replication, where binary polymers act as templates for their autocatalytic replication, thereby serving as each other's reactants and products in an intertwined molecular ecology. Our model demonstrates how autocatalysis alters the qualitative and quantitative system dynamics in counterintuitive ways. Most notably, numerical simulations reveal a very strong intrinsic selection mechanism that favors the appearance of a few population structures with highly ordered and repetitive sequence patterns when starting from a pool of monomers. We demonstrate both analytically and through simulation how this "selection of the dullest" is caused by continued symmetry breaking through random fluctuations in the transient dynamics that are amplified by autocatalysis and eventually propagate to the population level. The impact of these observations on related prebiotic mathematical models is discussed.

  7. Genome Sequencing and Analysis of the Tasmanian Devil and Its Transmissible Cancer

    PubMed Central

    Murchison, Elizabeth P.; Schulz-Trieglaff, Ole B.; Ning, Zemin; Alexandrov, Ludmil B.; Bauer, Markus J.; Fu, Beiyuan; Hims, Matthew; Ding, Zhihao; Ivakhno, Sergii; Stewart, Caitlin; Ng, Bee Ling; Wong, Wendy; Aken, Bronwen; White, Simon; Alsop, Amber; Becq, Jennifer; Bignell, Graham R.; Cheetham, R. Keira; Cheng, William; Connor, Thomas R.; Cox, Anthony J.; Feng, Zhi-Ping; Gu, Yong; Grocock, Russell J.; Harris, Simon R.; Khrebtukova, Irina; Kingsbury, Zoya; Kowarsky, Mark; Kreiss, Alexandre; Luo, Shujun; Marshall, John; McBride, David J.; Murray, Lisa; Pearse, Anne-Maree; Raine, Keiran; Rasolonjatovo, Isabelle; Shaw, Richard; Tedder, Philip; Tregidgo, Carolyn; Vilella, Albert J.; Wedge, David C.; Woods, Gregory M.; Gormley, Niall; Humphray, Sean; Schroth, Gary; Smith, Geoffrey; Hall, Kevin; Searle, Stephen M.J.; Carter, Nigel P.; Papenfuss, Anthony T.; Futreal, P. Andrew; Campbell, Peter J.; Yang, Fengtang; Bentley, David R.; Evers, Dirk J.; Stratton, Michael R.

    2012-01-01

    Summary: The Tasmanian devil (Sarcophilus harrisii), the largest marsupial carnivore, is endangered due to a transmissible facial cancer spread by direct transfer of living cancer cells through biting. Here we describe the sequencing, assembly, and annotation of the Tasmanian devil genome and whole-genome sequences for two geographically distant subclones of the cancer. Genomic analysis suggests that the cancer first arose from a female Tasmanian devil and that the clone has subsequently genetically diverged during its spread across Tasmania. The devil cancer genome contains more than 17,000 somatic base substitution mutations and bears the imprint of a distinct mutational process. Genotyping of somatic mutations in 104 geographically and temporally distributed Tasmanian devil tumors reveals the pattern of evolution and spread of this parasitic clonal lineage, with evidence of a selective sweep in one geographical area and persistence of parallel lineages in other populations. PMID:22341448

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kvon, Evgeny Z.; Kamneva, Olga K.; Melo, Uirá S.

    The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. In this paper, we identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologs caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution.

  9. Lineage tracing reveals multipotent stem cells maintain human adenomas and the pattern of clonal expansion in tumor evolution

    PubMed Central

    Humphries, Adam; Cereser, Biancastella; Gay, Laura J.; Miller, Daniel S. J.; Das, Bibek; Gutteridge, Alice; Elia, George; Nye, Emma; Jeffery, Rosemary; Poulsom, Richard; Novelli, Marco R.; Rodriguez-Justo, Manuel; McDonald, Stuart A. C.; Wright, Nicholas A.; Graham, Trevor A.

    2013-01-01

    The genetic and morphological development of colorectal cancer is a paradigm for tumorigenesis. However, the dynamics of clonal evolution underpinning carcinogenesis remain poorly understood. Here we identify multipotential stem cells within human colorectal adenomas and use methylation patterns of nonexpressed genes to characterize clonal evolution. Numerous individual crypts from six colonic adenomas and a hyperplastic polyp were microdissected and characterized for genetic lesions. Clones deficient in cytochrome c oxidase (CCO−) were identified by histochemical staining followed by mtDNA sequencing. Topographical maps of clone locations were constructed using a combination of these data. Multilineage differentiation within clones was demonstrated by immunofluorescence. Methylation patterns of adenomatous crypts were determined by clonal bisulphite sequencing; methylation pattern diversity was compared with a mathematical model to infer clonal dynamics. Individual adenomatous crypts were clonal for mtDNA mutations and contained both mucin-secreting and neuroendocrine cells, demonstrating that the crypt contained a multipotent stem cell. The intracrypt methylation pattern was consistent with the crypts containing multiple competing stem cells. Adenomas were epigenetically diverse populations, suggesting that they were relatively mitotically old populations. Intratumor clones typically showed less diversity in methylation pattern than the tumor as a whole. Mathematical modeling suggested that recent clonal sweeps encompassing the whole adenoma had not occurred. Adenomatous crypts within human tumors contain actively dividing stem cells. Adenomas appeared to be relatively mitotically old populations, pocketed with occasional newly generated subclones that were the result of recent rapid clonal expansion. Relative stasis and occasional rapid subclone growth may characterize colorectal tumorigenesis. PMID:23766371

  10. Lineage tracing reveals multipotent stem cells maintain human adenomas and the pattern of clonal expansion in tumor evolution.

    PubMed

    Humphries, Adam; Cereser, Biancastella; Gay, Laura J; Miller, Daniel S J; Das, Bibek; Gutteridge, Alice; Elia, George; Nye, Emma; Jeffery, Rosemary; Poulsom, Richard; Novelli, Marco R; Rodriguez-Justo, Manuel; McDonald, Stuart A C; Wright, Nicholas A; Graham, Trevor A

    2013-07-02

    The genetic and morphological development of colorectal cancer is a paradigm for tumorigenesis. However, the dynamics of clonal evolution underpinning carcinogenesis remain poorly understood. Here we identify multipotential stem cells within human colorectal adenomas and use methylation patterns of nonexpressed genes to characterize clonal evolution. Numerous individual crypts from six colonic adenomas and a hyperplastic polyp were microdissected and characterized for genetic lesions. Clones deficient in cytochrome c oxidase (CCO(-)) were identified by histochemical staining followed by mtDNA sequencing. Topographical maps of clone locations were constructed using a combination of these data. Multilineage differentiation within clones was demonstrated by immunofluorescence. Methylation patterns of adenomatous crypts were determined by clonal bisulphite sequencing; methylation pattern diversity was compared with a mathematical model to infer clonal dynamics. Individual adenomatous crypts were clonal for mtDNA mutations and contained both mucin-secreting and neuroendocrine cells, demonstrating that the crypt contained a multipotent stem cell. The intracrypt methylation pattern was consistent with the crypts containing multiple competing stem cells. Adenomas were epigenetically diverse populations, suggesting that they were relatively mitotically old populations. Intratumor clones typically showed less diversity in methylation pattern than the tumor as a whole. Mathematical modeling suggested that recent clonal sweeps encompassing the whole adenoma had not occurred. Adenomatous crypts within human tumors contain actively dividing stem cells. Adenomas appeared to be relatively mitotically old populations, pocketed with occasional newly generated subclones that were the result of recent rapid clonal expansion. Relative stasis and occasional rapid subclone growth may characterize colorectal tumorigenesis.

  11. The First Chameleon Transcriptome: Comparative Genomic Analysis of the OXPHOS System Reveals Loss of COX8 in Iguanian Lizards

    PubMed Central

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system. PMID:24009133

  12. The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

    PubMed

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.

  13. Elucidation of hepatitis C virus transmission and early diversification by single genome sequencing.

    PubMed

    Li, Hui; Stoddard, Mark B; Wang, Shuyi; Blair, Lily M; Giorgi, Elena E; Parrish, Erica H; Learn, Gerald H; Hraber, Peter; Goepfert, Paul A; Saag, Michael S; Denny, Thomas N; Haynes, Barton F; Hahn, Beatrice H; Ribeiro, Ruy M; Perelson, Alan S; Korber, Bette T; Bhattacharya, Tanmoy; Shaw, George M

    2012-01-01

    A precise molecular identification of transmitted hepatitis C virus (HCV) genomes could illuminate key aspects of transmission biology, immunopathogenesis and natural history. We used single genome sequencing of 2,922 half or quarter genomes from plasma viral RNA to identify transmitted/founder (T/F) viruses in 17 subjects with acute community-acquired HCV infection. Sequences from 13 of 17 acute subjects, but none of 14 chronic controls, exhibited one or more discrete low diversity viral lineages. Sequences within each lineage generally revealed a star-like phylogeny of mutations that coalesced to unambiguous T/F viral genomes. Numbers of transmitted viruses leading to productive clinical infection were estimated to range from 1 to 37 or more (median = 4). Four acutely infected subjects showed a distinctly different pattern of virus diversity that deviated from a star-like phylogeny. In these cases, empirical analysis and mathematical modeling suggested high multiplicity virus transmission from individuals who themselves were acutely infected or had experienced a virus population bottleneck due to antiviral drug therapy. These results provide new quantitative and qualitative insights into HCV transmission, revealing for the first time virus-host interactions that successful vaccines or treatment interventions will need to overcome. Our findings further suggest a novel experimental strategy for identifying full-length T/F genomes for proteome-wide analyses of HCV biology and adaptation to antiviral drug or immune pressures.

  14. Distribution and Diversity of Microbial Eukaryotes in Bathypelagic Waters of the South China Sea.

    PubMed

    Xu, Dapeng; Jiao, Nianzhi; Ren, Rui; Warren, Alan

    2017-05-01

    Little is known about the biodiversity of microbial eukaryotes in the South China Sea, especially in waters at bathyal depths. Here, we employed SSU rDNA gene sequencing to reveal the diversity and community structure across depth and distance gradients in the South China Sea. Vertically, the highest alpha diversity was found at 75-m depth. The communities of microbial eukaryotes were clustered into shallow-, middle-, and deep-water groups according to the depth from which they were collected, indicating a depth-related diversity and distribution pattern. Rhizaria sequences dominated the microeukaryote community and occurred in all samples except those from less than 50-m deep, being most abundant near the sea floor where they contributed ca. 64-97% and 40-74% of the total sequences and OTUs recovered, respectively. A large portion of rhizarian OTUs has neither a nearest named neighbor nor a nearest neighbor in the GenBank database, which indicates the presence of new phylotypes in the South China Sea. Given their overwhelming abundance and richness, further phylogenetic analysis of rhizarians was performed and three new genetic clusters were revealed containing sequences retrieved from the deep waters of the South China Sea. Our results shed light on the diversity and community structure of microbial eukaryotes in this not yet fully explored area. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.

  15. Whole-Genome Sequencing of Sake Yeast Saccharomyces cerevisiae Kyokai no. 7

    PubMed Central

    Akao, Takeshi; Yashiro, Isao; Hosoyama, Akira; Kitagaki, Hiroshi; Horikawa, Hiroshi; Watanabe, Daisuke; Akada, Rinji; Ando, Yoshinori; Harashima, Satoshi; Inoue, Toyohisa; Inoue, Yoshiharu; Kajiwara, Susumu; Kitamoto, Katsuhiko; Kitamoto, Noriyuki; Kobayashi, Osamu; Kuhara, Satoru; Masubuchi, Takashi; Mizoguchi, Haruhiko; Nakao, Yoshihiro; Nakazato, Atsumi; Namise, Masahiro; Oba, Takahiro; Ogata, Tomoo; Ohta, Akinori; Sato, Masahide; Shibasaki, Seiji; Takatsume, Yoshifumi; Tanimoto, Shota; Tsuboi, Hirokazu; Nishimura, Akira; Yoda, Koji; Ishikawa, Takeaki; Iwashita, Kazuhiro; Fujita, Nobuyuki; Shimoi, Hitoshi

    2011-01-01

    The term ‘sake yeast’ is generally used to indicate the Saccharomyces cerevisiae strains that possess characteristics distinct from others including the laboratory strain S288C and are well suited for sake brewery. Here, we report the draft whole-genome shotgun sequence of a commonly used diploid sake yeast strain, Kyokai no. 7 (K7). The assembled sequence of K7 was nearly identical to that of the S288C, except for several subtelomeric polymorphisms and two large inversions in K7. A survey of heterozygous bases between the homologous chromosomes revealed the presence of mosaic-like uneven distribution of heterozygosity in K7. The distribution patterns appeared to have resulted from repeated losses of heterozygosity in the ancestral lineage of K7. Analysis of genes revealed the presence of both K7-acquired and K7-lost genes, in addition to numerous others with segmentations and terminal discrepancies in comparison with those of S288C. The distribution of Ty element also largely differed in the two strains. Interestingly, two regions in chromosomes I and VII of S288C have apparently been replaced by Ty elements in K7. Sequence comparisons suggest that these gene conversions were caused by cDNA-mediated recombination of Ty elements. The present study advances our understanding of the functional and evolutionary genomics of the sake yeast. PMID:21900213

  16. Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics.

    PubMed

    Beres, Stephen B; Carroll, Ronan K; Shea, Patrick R; Sitkiewicz, Izabela; Martinez-Gutierrez, Juan Carlos; Low, Donald E; McGeer, Allison; Willey, Barbara M; Green, Karen; Tyrrell, Gregory J; Goldman, Thomas D; Feldgarden, Michael; Birren, Bruce W; Fofanov, Yuriy; Boos, John; Wheaton, William D; Honisch, Christiane; Musser, James M

    2010-03-02

    Understanding the fine-structure molecular architecture of bacterial epidemics has been a long-sought goal of infectious disease research. We used short-read-length DNA sequencing coupled with mass spectroscopy analysis of SNPs to study the molecular pathogenomics of three successive epidemics of invasive infections involving 344 serotype M3 group A Streptococcus in Ontario, Canada. Sequencing the genome of 95 strains from the three epidemics, coupled with analysis of 280 biallelic SNPs in all 344 strains, revealed an unexpectedly complex population structure composed of a dynamic mixture of distinct clonally related complexes. We discovered that each epidemic is dominated by micro- and macrobursts of multiple emergent clones, some with distinct strain genotype-patient phenotype relationships. On average, strains were differentiated from one another by only 49 SNPs and 11 insertion-deletion events (indels) in the core genome. Ten percent of SNPs are strain specific; that is, each strain has a unique genome sequence. We identified nonrandom temporal-spatial patterns of strain distribution within and between the epidemic peaks. The extensive full-genome data permitted us to identify genes with significantly increased rates of nonsynonymous (amino acid-altering) nucleotide polymorphisms, thereby providing clues about selective forces operative in the host. Comparative expression microarray analysis revealed that closely related strains differentiated by seemingly modest genetic changes can have significantly divergent transcriptomes. We conclude that enhanced understanding of bacterial epidemics requires a deep-sequencing, geographically centric, comparative pathogenomics strategy.

  17. Elucidation of Hepatitis C Virus Transmission and Early Diversification by Single Genome Sequencing

    PubMed Central

    Li, Hui; Stoddard, Mark B.; Wang, Shuyi; Blair, Lily M.; Giorgi, Elena E.; Parrish, Erica H.; Learn, Gerald H.; Hraber, Peter; Goepfert, Paul A.; Saag, Michael S.; Denny, Thomas N.; Haynes, Barton F.; Hahn, Beatrice H.; Ribeiro, Ruy M.; Perelson, Alan S.; Korber, Bette T.; Bhattacharya, Tanmoy; Shaw, George M.

    2012-01-01

    A precise molecular identification of transmitted hepatitis C virus (HCV) genomes could illuminate key aspects of transmission biology, immunopathogenesis and natural history. We used single genome sequencing of 2,922 half or quarter genomes from plasma viral RNA to identify transmitted/founder (T/F) viruses in 17 subjects with acute community-acquired HCV infection. Sequences from 13 of 17 acute subjects, but none of 14 chronic controls, exhibited one or more discrete low diversity viral lineages. Sequences within each lineage generally revealed a star-like phylogeny of mutations that coalesced to unambiguous T/F viral genomes. Numbers of transmitted viruses leading to productive clinical infection were estimated to range from 1 to 37 or more (median = 4). Four acutely infected subjects showed a distinctly different pattern of virus diversity that deviated from a star-like phylogeny. In these cases, empirical analysis and mathematical modeling suggested high multiplicity virus transmission from individuals who themselves were acutely infected or had experienced a virus population bottleneck due to antiviral drug therapy. These results provide new quantitative and qualitative insights into HCV transmission, revealing for the first time virus-host interactions that successful vaccines or treatment interventions will need to overcome. Our findings further suggest a novel experimental strategy for identifying full-length T/F genomes for proteome-wide analyses of HCV biology and adaptation to antiviral drug or immune pressures. PMID:22927816

  18. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes.

    PubMed

    Lin, Feng-Jiau; Liu, Yuan; Sha, Zhongli; Tsang, Ling Ming; Chu, Ka Hou; Chan, Tin-Yam; Liu, Ruiyu; Cui, Zhaoxia

    2012-11-16

    The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophilus) share a unique gene order specific to their infraorders, and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genomes from more taxa, in particular the reptant infraorders Polychelida and Glypheidea, is required in further analysis. Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidence for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda.

  19. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes

    PubMed Central

    2012-01-01

    Background: The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. Results: We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophilus) share a unique gene order specific to their infraorders, and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genomes from more taxa, in particular the reptant infraorders Polychelida and Glypheidea, is required in further analysis. Conclusions: Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidence for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda. PMID:23153176

  20. Network analysis reveals seasonal variation of co-occurrence correlations between Cyanobacteria and other bacterioplankton.

    PubMed

    Zhao, Dayong; Shen, Feng; Zeng, Jin; Huang, Rui; Yu, Zhongbo; Wu, Qinglong L

    2016-12-15

    Association network approaches have recently been proposed as a means for exploring the associations between bacterial communities. In the present study, high-throughput sequencing was employed to investigate the seasonal variations in the composition of bacterioplankton communities in six eutrophic urban lakes of Nanjing City, China. Over 150,000 16S rRNA sequences were derived from 52 water samples, and correlation-based network analyses were conducted. Our results demonstrated that the architecture of the co-occurrence networks varied in different seasons. Cyanobacteria played various roles in the ecological networks during different seasons. Co-occurrence patterns revealed that members of Cyanobacteria shared a very similar niche and they had weak positive correlations with other phyla in summer. To explore the effect of environmental factors on species-species co-occurrence networks and to determine the most influential environmental factors, the original positive network was simplified by module partitioning and by calculating module eigengenes. Module eigengene analysis indicated that temperature only affected some Cyanobacteria; the rest were mainly affected by nitrogen associated factors throughout the year. Cyanobacteria were dominant in summer which may result from strong co-occurrence patterns and suitable living conditions. Overall, this study has improved our understanding of the roles of Cyanobacteria and other bacterioplankton in ecological networks. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. Revising the recent evolutionary history of equids using ancient DNA.

    PubMed

    Orlando, Ludovic; Metcalf, Jessica L; Alberdi, Maria T; Telles-Antunes, Miguel; Bonjean, Dominique; Otte, Marcel; Martin, Fabiana; Eisenmann, Véra; Mashkour, Marjan; Morello, Flavia; Prado, Jose L; Salas-Gismondi, Rodolfo; Shockey, Bruce J; Wrinn, Patrick J; Vasil'ev, Sergei K; Ovodov, Nikolai D; Cherry, Michael I; Hopwood, Blair; Male, Dean; Austin, Jeremy J; Hänni, Catherine; Cooper, Alan

    2009-12-22

    The rich fossil record of the family Equidae (Mammalia: Perissodactyla) over the past 55 MY has made it an icon for the patterns and processes of macroevolution. Despite this, many aspects of equid phylogenetic relationships and taxonomy remain unresolved. Recent genetic analyses of extinct equids have revealed unexpected evolutionary patterns and a need for major revisions at the generic, subgeneric, and species levels. To investigate this issue we examine 35 ancient equid specimens from four geographic regions (South America, Europe, Southwest Asia, and South Africa), of which 22 delivered 87-688 bp of reproducible aDNA mitochondrial sequence. Phylogenetic analyses support a major revision of the recent evolutionary history of equids and reveal two new species, a South American hippidion and a descendant of a basal lineage potentially related to Middle Pleistocene equids. Sequences from specimens assigned to the giant extinct Cape zebra, Equus capensis, formed a separate clade within the modern plain zebra species, a phenotypically plastic group that also included the extinct quagga. In addition, we revise the currently recognized extinction times for two hemione-related equid groups. However, it is apparent that the current dataset cannot solve all of the taxonomic and phylogenetic questions relevant to the evolution of Equus. In light of these findings, we propose a rapid DNA barcoding approach to evaluate the taxonomic status of the many Late Pleistocene fossil Equidae species that have been described from purely morphological analyses.

  2. Overlay improvement by exposure map based mask registration optimization

    NASA Astrophysics Data System (ADS)

    Shi, Irene; Guo, Eric; Chen, Ming; Lu, Max; Li, Gordon; Li, Rivan; Tian, Eric

    2015-03-01

    As semiconductor devices continue to shrink, the design rules of advanced devices tighten dramatically. [1] One of the main challenges of the lithography step is layer-to-layer overlay control. Furthermore, because DPT (Double Patterning Technology) has been adopted for advanced technology nodes such as 28nm and 14nm, the corresponding overlay budget has become even tighter. [2][3] Since in-die mask registration (pattern placement) measurement was introduced, model analysis with a KLA SOV (sources of variation) tool has shown that the registration difference between masks is a significant error source for wafer layer-to-layer overlay in the 28nm process. [4][5] Mask registration optimization would therefore substantially improve wafer overlay performance. It was reported that a laser-based registration control (RegC) process could be applied after pattern generation or after pellicle mounting, allowing fine tuning of the mask registration. [6] In this paper we propose a novel method of mask registration correction that can be applied before mask writing, based on the mask exposure map and taking into account mask chip layout, writing sequence, and pattern density distribution. Our experimental data show that if pattern density on the mask stays low, the in-die mask registration residual error (3 sigma) is always under 5nm regardless of blank type and the writer POSCOR (position correction) file applied; this indicates that random error induced by material or equipment occupies a relatively fixed share of the mask registration error budget. In real production, comparing mask registration differences across critical production layers reveals that the registration residual error of line-space layers with higher pattern density is always much larger than that of contact-hole layers with lower pattern density. Additionally, the mask registration difference between layers with similar pattern density can also be kept under 5nm. We assume that mask registration error, excluding random error, is mostly induced by charge accumulation during mask writing, which may be calculated from the surrounding exposed pattern density. Multi-loading test mask registration results show that with an x-direction writing sequence, mask registration behavior in the x direction is mainly related to the sequence direction, whereas mask registration in the y direction is strongly affected by the pattern density distribution map. This indicates that part of the mask registration error is due to charging from the nearby environment. If the exposure sequence is chip by chip, as in the normal multi-chip layout case, mask registration in both the x and y directions is affected analogously, which has also been confirmed by real data. Therefore, we set up a simple model to predict the mask registration error from the mask exposure map and, if needed, correct it with a given POSCOR (position correction) file for advanced mask writing.

  3. The 3of5 web application for complex and comprehensive pattern matching in protein sequences.

    PubMed

    Seiler, Markus; Mehrle, Alexander; Poustka, Annemarie; Wiemann, Stefan

    2006-03-16

    The identification of patterns in biological sequences is a key challenge in genome analysis and in proteomics. Frequently such patterns are complex and highly variable, especially in protein sequences. They are frequently described using terms of regular expressions (RegEx) because of the user-friendly terminology. Limitations arise for queries with the increasing complexity of patterns and are accompanied by requirements for enhanced capabilities. This is especially true for patterns containing ambiguous characters and positions and/or length ambiguities. We have implemented the 3of5 web application in order to enable complex pattern matching in protein sequences. 3of5 is named after a special use of its main feature, the novel n-of-m pattern type. This feature allows for an extensive specification of variable patterns where the individual elements may vary in their position, order, and content within a defined stretch of sequence. The number of distinct elements can be constrained by operators, and individual characters may be excluded. The n-of-m pattern type can be combined with common regular expression terms and thus also allows for a comprehensive description of complex patterns. 3of5 increases the fidelity of pattern matching and finds ALL possible solutions in protein sequences in cases of length-ambiguous patterns instead of simply reporting the longest or shortest hits. Grouping and combined search for patterns provides a hierarchical arrangement of larger pattern sets. The algorithm is implemented as an internet application and is freely accessible. The application is available at http://dkfz.de/mga2/3of5/3of5.html. The 3of5 application offers an extended vocabulary for the definition of search patterns and thus allows the user to comprehensively specify and identify peptide patterns with variable elements. The n-of-m pattern type offers improved accuracy for pattern matching in combination with the ability to find all solutions, without compromising the user friendliness of regular expression terms.
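
    The two ideas in this abstract, reporting every hit of a length-ambiguous pattern and an n-of-m constraint, can be illustrated with a minimal Python sketch. It is not the 3of5 algorithm or its query syntax: the function names `all_matches` and `n_of_m_windows` are hypothetical, and the n-of-m rule is simplified here to "at least n residues from an allowed set within a window of m positions".

```python
import re


def all_matches(pattern, seq, max_len=None):
    """Return every (start, end) span whose substring fully matches pattern.

    Unlike re.finditer, which reports one hit per start position, this
    enumerates all lengths, so length-ambiguous patterns yield all solutions.
    """
    compiled = re.compile(pattern)
    spans = []
    n = len(seq)
    for start in range(n):
        stop = n if max_len is None else min(n, start + max_len)
        for end in range(start + 1, stop + 1):
            # fullmatch with pos/endpos tests exactly seq[start:end]
            if compiled.fullmatch(seq, start, end):
                spans.append((start, end))
    return spans


def n_of_m_windows(seq, allowed, n, m):
    """Return start indices of length-m windows holding at least n allowed residues.

    Simplified, assumed reading of an n-of-m pattern (e.g. 3 basic residues in 5).
    """
    return [
        i
        for i in range(len(seq) - m + 1)
        if sum(c in allowed for c in seq[i:i + m]) >= n
    ]


if __name__ == "__main__":
    print(all_matches(r"K.{1,3}R", "AKAARR"))
    print(n_of_m_windows("AKKAKAAA", set("KR"), 3, 5))
```

    The exhaustive span enumeration is quadratic in sequence length, which is acceptable for single proteins; the optional `max_len` argument bounds the work for longer inputs.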

  4. The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

    PubMed

    Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

    2015-01-01

    Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.

  5. The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats.

    PubMed

    Alverson, Andrew J; Zhuo, Shi; Rice, Danny W; Sloan, Daniel B; Palmer, Jeffrey D

    2011-01-20

    The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean), and show that despite its unexceptional size (401,262 nt), the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt) repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.

  6. Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

    PubMed

    Bergman, C M; Kreitman, M

    2001-08-01

    Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
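
    The notion of a length distribution of ungapped conserved blocks can be made concrete with a short Python sketch. It assumes a pairwise alignment given as two equal-length strings with "-" for gaps, and it treats a block as a run of identical, ungapped columns; the abstract does not state the authors' exact block definition, so this criterion and the name `conserved_block_lengths` are illustrative assumptions.

```python
from statistics import median


def conserved_block_lengths(a, b, min_len=1):
    """Return lengths of runs of identical, ungapped columns in a pairwise alignment.

    a, b: aligned sequences of equal length, with '-' marking gaps.
    min_len: shortest run to report (must be >= 1).
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    lengths, run = [], 0
    for x, y in zip(a, b):
        if x == y and x != "-":
            run += 1  # extend the current conserved block
        else:
            if run >= min_len:
                lengths.append(run)
            run = 0  # a mismatch or gap ends the block
    if run >= min_len:
        lengths.append(run)
    return lengths


if __name__ == "__main__":
    blocks = conserved_block_lengths("ACGT-ACGTA", "ACGTTACCTA")
    print(blocks, median(blocks))
    # Fraction of alignment columns that fall in conserved blocks
    print(sum(blocks) / len("ACGT-ACGTA"))
```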

  7. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network

    PubMed Central

    Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.

    2017-01-01

    Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266

  8. A cultivation-independent PCR-RFLP assay targeting oprF gene for detection and identification of Pseudomonas spp. in samples from fibrocystic pediatric patients.

    PubMed

    Lagares, Antonio; Agaras, Betina; Bettiol, Marisa P; Gatti, Blanca M; Valverde, Claudio

    2015-07-01

    Species-specific genetic markers are crucial to develop faithful and sensitive molecular methods for the detection and identification of Pseudomonas aeruginosa (Pa). We have previously set up a PCR-RFLP protocol targeting oprF, the gene encoding the genus-specific outer membrane porin F, whose strong conservation and marked sequence diversity allowed detection and differentiation of environmental isolates (Agaras et al., 2012). Here, we evaluated the ability of the PCR-RFLP assay to genotype clinical isolates previously identified as Pa by conventional microbiological methods within a collection of 62 presumptive Pa isolates from different pediatric clinical samples and different sections of the Hospital de Niños "Sor María Ludovica" from La Plata, Argentina. All isolates but one gave an oprF amplicon consistent with that from reference Pa strains. The sequence of the smaller-sized amplicon revealed that the isolate was in fact a Pseudomonas mendocina strain. The oprF RFLP pattern generated with TaqI or HaeIII nucleases matched those of reference Pa strains for 59 isolates (96%). The other two Pa isolates (4%) revealed a different RFLP pattern based on HaeIII digestion, although oprF sequencing confirmed that Pa identification was correct. We next tested the effectiveness of the PCR-RFLP to detect pseudomonads in clinical samples of pediatric fibrocystic patients directly without sample cultivation. The expected amplicon and its cognate RFLP profile were obtained for all samples in which Pa was previously detected by cultivation-dependent methods. Altogether, these results provide the basis for the application of the oprF PCR-RFLP protocol to directly detect and identify Pa and other non-Pa pseudomonads in fibrocystic clinical samples. Copyright © 2015 Elsevier B.V. All rights reserved.
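
    An RFLP pattern is the set of fragment lengths left after a restriction enzyme cuts an amplicon, which a small in-silico digest in Python can illustrate. The recognition sites used are the standard ones for TaqI (T^CGA) and HaeIII (GG^CC); the input sequences are toy strings rather than real oprF amplicons, and the helper names `digest` and `rflp_pattern` are hypothetical.

```python
# Recognition site and cut offset within the site (top strand, 5'->3').
ENZYMES = {
    "TaqI": ("TCGA", 1),    # T^CGA
    "HaeIII": ("GGCC", 2),  # GG^CC
}


def digest(seq, site, cut_offset):
    """Return fragment lengths after cutting a linear sequence at every site."""
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)  # step by one to catch overlapping sites
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


def rflp_pattern(seq, enzyme):
    """Return fragment lengths sorted largest first, as read off a gel."""
    site, offset = ENZYMES[enzyme]
    return sorted(digest(seq.upper(), site, offset), reverse=True)


if __name__ == "__main__":
    print(rflp_pattern("AAGGCCTTTT", "HaeIII"))
    print(rflp_pattern("aatcgattttcgaa", "TaqI"))
```

    Both sites are palindromic, so scanning one strand is sufficient. Comparing two isolates then reduces to comparing their sorted fragment lists, within whatever size tolerance the gel resolution allows.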

  9. Bathymetric and geographic population structure in the pan-Atlantic deep-sea bivalve Deminucula atacellana (Schenck, 1939).

    PubMed

    Zardus, John D; Etter, Ron J; Chase, Michael R; Rex, Michael A; Boyle, Elizabeth E

    2006-03-01

    The deep-sea soft-sediment environment hosts a diverse and highly endemic fauna of uncertain origin. We know little about how this fauna evolved because geographic patterns of genetic variation, the essential information for inferring patterns of population differentiation and speciation, are poorly understood. Using formalin-fixed specimens from archival collections, we quantify patterns of genetic variation in the protobranch bivalve Deminucula atacellana, a species widespread throughout the Atlantic Ocean at bathyal and abyssal depths. Samples were taken from 18 localities in the North American, West European and Argentine basins. A hypervariable region of mitochondrial 16S rDNA was amplified by polymerase chain reaction (PCR) and sequenced from 130 individuals, revealing 21 haplotypes. With several important exceptions, haplotypes are unique to each basin. Overall gene diversity is high (h = 0.73) with pronounced population structure (Phi(ST) = 0.877) and highly significant geographic associations (P < 0.0001). Sequences cluster into four major clades corresponding to differences in geography and depth. Genetic divergence was much greater among populations at different depths within the same basin than among those at similar depths but separated by thousands of kilometres. Isolation by distance probably explains much of the interbasin variation. Depth-related divergence may reflect historical patterns of colonization or strong environmental selective gradients. Broadly distributed deep-sea organisms can possess highly genetically divergent populations, despite the lack of any morphological divergence.
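
    As a rough illustration of the gene diversity statistic quoted above, the sketch below computes haplotype diversity from haplotype counts. It assumes that h denotes Nei's unbiased gene diversity, h = n/(n-1) * (1 - sum of squared haplotype frequencies), which the abstract does not state explicitly. The counts are hypothetical and are not the study's data.

```python
def gene_diversity(haplotype_counts):
    """Nei's unbiased gene (haplotype) diversity from a list of counts."""
    n = sum(haplotype_counts)
    if n < 2:
        raise ValueError("need at least two sampled individuals")
    sum_sq_freq = sum((c / n) ** 2 for c in haplotype_counts)
    return n / (n - 1) * (1 - sum_sq_freq)


if __name__ == "__main__":
    # Hypothetical sample: one common haplotype and several rare ones.
    counts = [12, 4, 2, 1, 1]
    print(round(gene_diversity(counts), 3))
```

    The value is 0 when every individual carries the same haplotype and 1 when every individual carries a different one.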

  10. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

    PubMed Central

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  11. Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle

    PubMed Central

    Selvan, A. Sakthivel; Gupta, I. D.; Verma, A.; Chaudhari, M. V.; Magotra, A.

    2016-01-01

    Aim: The present study was undertaken to characterize the cluster of differentiation 14 (CD14) gene and to analyze its combined genotypes in order to explore their association with clinical mastitis in Karan Fries (KF) cows maintained in the National Dairy Research Institute herd, Karnal. Materials and Methods: Genomic DNA was extracted by the phenol-chloroform method from the blood of 94 randomly selected lactating KF cattle. After checking its quality and quantity, polymerase chain reaction (PCR) was carried out using six sets of reported gene-specific primers to amplify the complete KF CD14 gene. The forward and reverse sequences for each PCR fragment were assembled to form the complete sequence for the respective region of the KF CD14 gene. Multiple sequence alignments of the edited sequence with the corresponding reported Bos taurus reference sequence (EU148610.1) were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs). Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of the KF CD14 gene with other species. Restriction fragment length polymorphism (RFLP) analysis was carried out in all KF cows using the restriction enzymes (RE) Helicobacter pylori 188I (Hpy188I) (contig 2) and Haemophilus influenzae I (HinfI) (contig 4). Cows were assigned genotypes obtained by PCR-RFLP analysis, and the association study was done using the Chi-square (χ²) test. The genotypes of contigs (loci) 2 and 4 were combined for each animal to construct combined genotype patterns. Results: Two types of KF sequences were obtained: one of 2630 bp with one insertion at nucleotide (nt) position 616 and one deletion at nt position 1117, and another of 2629 bp with only one deletion at nt position 615. ClustalW multiple alignment of the KF CD14 gene sequence with the B. taurus sequence (EU148610.1) revealed 24 nt changes (SNPs). Cows were also screened using PCR-RFLP with Hpy188I (contig 2) and HinfI (contig 4) RE, which revealed three genotypes each that differed significantly regarding mastitis incidence. These two loci can form at most nine combined genotype patterns, of which only eight were observed: AACC, AACD, AADD, ABCD, ABDD, BBCC, BBCD, and BBDD. The combined genotype ABCC was not observed in the studied population of KF cows. Out of 94 animals, those with the AACD combined genotype (10.63%) were found not to be affected by mastitis, and those with the ABDD combined genotype had the highest mastitis incidence, 15.96%. Conclusion: AACD-typed cows were found to be the least susceptible to mastitis compared with the other combined genotypes. PMID:27536026
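
    The combined-genotype arithmetic in this abstract can be checked with a short enumeration. This is only a sketch: the per-locus genotype labels (AA/AB/BB for contig 2, CC/CD/DD for contig 4) are inferred from the combined genotype names listed above, and the animal counts are back-calculated from the reported percentages rather than taken from the paper.

```python
from itertools import product

# Per-locus genotypes, inferred from the combined genotype names in the abstract.
contig2_genotypes = ["AA", "AB", "BB"]  # Hpy188I locus
contig4_genotypes = ["CC", "CD", "DD"]  # HinfI locus

# Every combination of one genotype per locus: 3 x 3 = 9 patterns.
possible = [a + b for a, b in product(contig2_genotypes, contig4_genotypes)]

# The eight combined genotypes reported as observed.
observed = {"AACC", "AACD", "AADD", "ABCD", "ABDD", "BBCC", "BBCD", "BBDD"}

missing = [g for g in possible if g not in observed]

# Animal counts implied by the reported percentages of the 94 cows.
herd_size = 94
implied_aacd = round(0.1063 * herd_size)
implied_abdd = round(0.1596 * herd_size)

if __name__ == "__main__":
    print(len(possible), "possible;", "missing:", missing)
    print("implied AACD animals:", implied_aacd)
    print("implied ABDD animals:", implied_abdd)
```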

  12. Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle.

    PubMed

    Selvan, A Sakthivel; Gupta, I D; Verma, A; Chaudhari, M V; Magotra, A

    2016-07-01

    The present study was undertaken to characterize the cluster of differentiation 14 (CD14) gene and to analyze its combined genotypes in order to explore their association with clinical mastitis in Karan Fries (KF) cows maintained in the National Dairy Research Institute herd, Karnal. Genomic DNA was extracted by the phenol-chloroform method from the blood of 94 randomly selected lactating KF cattle. After checking its quality and quantity, polymerase chain reaction (PCR) was carried out using six sets of reported gene-specific primers to amplify the complete KF CD14 gene. The forward and reverse sequences for each PCR fragment were assembled to form the complete sequence for the respective region of the KF CD14 gene. Multiple sequence alignments of the edited sequence with the corresponding reported Bos taurus reference sequence (EU148610.1) were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs). Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of the KF CD14 gene with other species. Restriction fragment length polymorphism (RFLP) analysis was carried out in all KF cows using the restriction enzymes (RE) Helicobacter pylori 188I (Hpy188I) (contig 2) and Haemophilus influenzae I (HinfI) (contig 4). Cows were assigned genotypes obtained by PCR-RFLP analysis, and the association study was done using the Chi-square (χ²) test. The genotypes of contigs (loci) 2 and 4 were combined for each animal to construct combined genotype patterns. Two types of KF sequences were obtained: one of 2630 bp with one insertion at nucleotide (nt) position 616 and one deletion at nt position 1117, and another of 2629 bp with only one deletion at nt position 615. ClustalW multiple alignment of the KF CD14 gene sequence with the B. taurus sequence (EU148610.1) revealed 24 nt changes (SNPs). Cows were also screened using PCR-RFLP with Hpy188I (contig 2) and HinfI (contig 4) RE, which revealed three genotypes each that differed significantly regarding mastitis incidence. These two loci can form at most nine combined genotype patterns, of which only eight were observed: AACC, AACD, AADD, ABCD, ABDD, BBCC, BBCD, and BBDD. The combined genotype ABCC was not observed in the studied population of KF cows. Out of 94 animals, those with the AACD combined genotype (10.63%) were found not to be affected by mastitis, and those with the ABDD combined genotype had the highest mastitis incidence, 15.96%. AACD-typed cows were found to be the least susceptible to mastitis compared with the other combined genotypes.

  13. Single cell sequencing reveals heterogeneity within ovarian cancer epithelium and cancer associated stromal cells.

    PubMed

    Winterhoff, Boris J; Maile, Makayla; Mitra, Amit Kumar; Sebe, Attila; Bazzaro, Martina; Geller, Melissa A; Abrahante, Juan E; Klein, Molly; Hellweg, Raffaele; Mullany, Sally A; Beckman, Kenneth; Daniel, Jerry; Starr, Timothy K

    2017-03-01

    The purpose of this study was to determine the level of heterogeneity in high grade serous ovarian cancer (HGSOC) by analyzing RNA expression in single epithelial and cancer associated stromal cells. In addition, we explored the possibility of identifying subgroups based on pathway activation and pre-defined signatures from cancer stem cells and chemo-resistant cells. A fresh HGSOC tumor specimen derived from ovary was enzymatically digested and depleted of immune infiltrating cells. RNA sequencing was performed on 92 single cells and 66 of these single cell datasets passed quality control checks. Sequences were analyzed using multiple bioinformatics tools, including clustering, principal components analysis, and geneset enrichment analysis to identify subgroups and activated pathways. Immunohistochemistry for ovarian cancer, stem cell and stromal markers was performed on adjacent tumor sections. Analysis of the gene expression patterns identified two major subsets of cells characterized by epithelial and stromal gene expression patterns. The epithelial group was characterized by proliferative genes including genes associated with oxidative phosphorylation and MYC activity, while the stromal group was characterized by increased expression of extracellular matrix (ECM) genes and genes associated with epithelial-to-mesenchymal transition (EMT). Neither group expressed a signature correlating with published chemo-resistant gene signatures, but many cells, predominantly in the stromal subgroup, expressed markers associated with cancer stem cells. Single cell sequencing provides a means of identifying subpopulations of cancer cells within a single patient. Single cell sequence analysis may prove to be critical for understanding the etiology, progression and drug resistance in ovarian cancer. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Misconceptions on Missing Data in RAD-seq Phylogenetics with a Deep-scale Example from Flowering Plants.

    PubMed

    Eaton, Deren A R; Spriggs, Elizabeth L; Park, Brian; Donoghue, Michael J

    2017-05-01

    Restriction-site associated DNA (RAD) sequencing and related methods rely on the conservation of enzyme recognition sites to isolate homologous DNA fragments for sequencing, with the consequence that mutations disrupting these sites lead to missing information. There is thus a clear expectation for how missing data should be distributed, with fewer loci recovered between more distantly related samples. This observation has led to a related expectation: that RAD-seq data are insufficiently informative for resolving deeper scale phylogenetic relationships. Here we investigate the relationship between missing information among samples at the tips of a tree and information at edges within it. We re-analyze and review the distribution of missing data across ten RAD-seq data sets and carry out simulations to determine expected patterns of missing information. We also present new empirical results for the angiosperm clade Viburnum (Adoxaceae, with a crown age >50 Ma) for which we examine phylogenetic information at different depths in the tree and with varied sequencing effort. The total number of loci, the proportion that are shared, and phylogenetic informativeness varied dramatically across the examined RAD-seq data sets. Insufficient or uneven sequencing coverage accounted for similar proportions of missing data as dropout from mutation-disruption. Simulations reveal that mutation-disruption, which results in phylogenetically distributed missing data, can be distinguished from the more stochastic patterns of missing data caused by low sequencing coverage. In Viburnum, doubling sequencing coverage nearly doubled the number of parsimony informative sites, and increased by >10X the number of loci with data shared across >40 taxa. Our analysis leads to a set of practical recommendations for maximizing phylogenetic information in RAD-seq studies. 
[hierarchical redundancy; phylogenetic informativeness; quartet informativeness; Restriction-site associated DNA (RAD) sequencing; sequencing coverage; Viburnum.]. © The authors 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please e-mail: journals.permission@oup.com.

  15. Clonal relationship and differentiation among Mycobacterium abscessus isolates as determined using the semiautomated repetitive extragenic palindromic sequence PCR-based DiversiLab system.

    PubMed

    Mougari, Faiza; Raskine, Laurent; Ferroni, Agnes; Marcon, Estelle; Sermet-Gaudelus, Isabelle; Veziris, Nicolas; Heym, Beate; Gaillard, Jean-Louis; Nassif, Xavier; Cambau, Emmanuelle

    2014-06-01

    Mycobacterium abscessus is a rapidly growing mycobacterium that causes respiratory tract infections in predisposed patients, such as those with cystic fibrosis and nosocomial skin and soft tissue infections. In order to investigate the clonal relationships between the strains causing epidemic episodes, we evaluated the discriminatory power of the semiautomated DiversiLab (DL) repetitive extragenic palindromic sequence PCR (REP-PCR) test for M. abscessus genotyping. Since M. abscessus was shown to be composed of subspecies (M. abscessus subsp. massiliense, M. abscessus subsp. bolletii, and M. abscessus subsp. abscessus), we also evaluated the ability of this technique to differentiate subspecies. The technique was applied to two collections of clinical isolates, (i) 83 M. abscessus original isolates (43 M. abscessus subsp. abscessus, 12 M. abscessus subsp. bolletii, and 28 M. abscessus subsp. massiliense) from infected patients and (ii) 35 repeated isolates obtained over 1 year from four cystic fibrosis patients. The DL REP-PCR test was standardized for DNA extraction, DNA amplification, and electrophoresis pattern comparisons. Among the isolates from distinct patients, 53/83 (62%) isolates showed a specific pattern, and 30 were distributed in 11 clusters and 6 patterns, with 2 to 4 isolates per pattern. The clusters and patterns did not fully correlate with multilocus sequence typing (MLST) analysis results. This revealed a high genomic diversity between patients, with a discriminatory power of 98% (Simpson's diversity index). However, since some isolates shared identical patterns, this raises the question of whether it is due to transmission between patients or a common reservoir. Multiple isolates from the same patient showed identical patterns, except for one patient infected by two strains. Between the M. abscessus subspecies, the indexes were <70%, indicating that the DL REP-PCR test is not an accurate tool for identifying organisms to the subspecies level. REP-PCR appears to be a rapid genotyping method that is useful for investigating epidemics of M. abscessus infections. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
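
    For readers unfamiliar with the discriminatory power figure quoted above, the sketch below computes Simpson's index of diversity from the number of isolates sharing each pattern. It assumes the form commonly used for typing methods, D = 1 - sum(n_j(n_j - 1)) / (N(N - 1)); the abstract names only "Simpson's diversity index", so the exact variant used by the authors is an assumption. The pattern sizes are hypothetical and do not reproduce the study's clusters.

```python
def discriminatory_power(pattern_sizes):
    """Probability that two isolates drawn at random have different patterns.

    `pattern_sizes` holds the number of isolates assigned to each pattern.
    """
    n = sum(pattern_sizes)
    if n < 2:
        raise ValueError("need at least two isolates")
    same_pattern_pairs = sum(s * (s - 1) for s in pattern_sizes)
    return 1 - same_pattern_pairs / (n * (n - 1))


if __name__ == "__main__":
    # Hypothetical collection: many unique patterns plus a few small shared ones.
    sizes = [1] * 20 + [2, 2, 3, 4]
    print(round(discriminatory_power(sizes), 3))
```

    The index approaches 1 when almost every isolate has its own pattern and drops toward 0 as isolates pile up in a few shared patterns.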

  16. Mining co-occurrence and sequence patterns from cancer diagnoses in New York State.

    PubMed

    Wang, Yu; Hou, Wei; Wang, Fusheng

    2018-01-01

    The goal of this study is to discover disease co-occurrence and sequence patterns from large scale cancer diagnosis histories in New York State. In particular, we want to identify disparities among different patient groups. Our study will provide essential knowledge for clinical researchers to further investigate comorbidities and disease progression for improving the management of multiple diseases. We used inpatient discharge and outpatient visit records from the New York State Statewide Planning and Research Cooperative System (SPARCS) from 2011-2015. We grouped each patient's visit history to generate diagnosis sequences for the seven most common cancer types. We performed frequent disease co-occurrence mining using the Apriori algorithm, and frequent disease sequence pattern discovery using the cSPADE algorithm. Different types of cancer demonstrated distinct patterns. Disparities in both disease co-occurrence and sequence patterns were observed among patients in different age groups. There were also considerable disparities in disease co-occurrence patterns with respect to different claim types (i.e., inpatient, outpatient, emergency department and ambulatory surgery). Disparities between genders were mostly found where the cancer types were gender specific. Supports of most patterns were usually higher for males than for females. Compared with secondary diagnosis codes, primary diagnosis codes gave more stable results. Two disease sequences consisting of the same diagnoses but in different orders usually had different supports. Our results suggest that the methods adopted can generate potentially interesting and clinically meaningful disease co-occurrence and sequence patterns, and identify disparities among various patient groups. These patterns could imply comorbidities and disease progressions.
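
    The notion of "support" that underlies both kinds of pattern can be shown on a toy example. The sketch below is not Apriori or cSPADE themselves; it only computes the support that those algorithms threshold on, once for an unordered co-occurrence and once for an ordered sequence. The patient histories are hypothetical, and the diagnosis codes are used purely as labels.

```python
# Hypothetical visit histories, one ordered list of diagnosis codes per patient.
histories = [
    ["C50", "I10", "E11"],
    ["I10", "C50"],
    ["C50", "E11"],
    ["I10", "E11", "C50"],
]


def cooccurrence_support(histories, itemset):
    """Fraction of patients whose history contains every code, in any order."""
    items = set(itemset)
    hits = sum(1 for h in histories if items <= set(h))
    return hits / len(histories)


def sequence_support(histories, pattern):
    """Fraction of patients whose history contains the codes in this order."""
    def contains(history):
        remaining = iter(history)
        # `code in remaining` consumes the iterator up to the match, so each
        # later code must appear after the earlier ones.
        return all(code in remaining for code in pattern)

    return sum(1 for h in histories if contains(h)) / len(histories)


if __name__ == "__main__":
    print(cooccurrence_support(histories, ["C50", "I10"]))
    print(sequence_support(histories, ["C50", "I10"]))
    print(sequence_support(histories, ["I10", "C50"]))
```

    On this toy data the two orderings of the same pair receive different supports, which is the effect the abstract reports for real diagnosis sequences.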

  17. Hybridization and massive mtDNA unidirectional introgression between the closely related Neotropical toads Rhinella marina and R. schneideri inferred from mtDNA and nuclear markers

    PubMed Central

    2011-01-01

    Background The classical perspective that interspecific hybridization in animals is rare has been changing due to a growing list of empirical examples showing the occurrence of gene flow between closely related species. Using sequence data from cyt b mitochondrial gene and three intron nuclear genes (RPL9, c-myc, and RPL3) we investigated patterns of nucleotide polymorphism and divergence between two closely related toad species R. marina and R. schneideri. By comparing levels of differentiation at nuclear and mtDNA levels we were able to describe patterns of introgression and infer the history of hybridization between these species. Results All nuclear loci are essentially concordant in revealing two well differentiated groups of haplotypes, corresponding to the morphologically-defined species R. marina and R. schneideri. Mitochondrial DNA analysis also revealed two well-differentiated groups of haplotypes but, in stark contrast with the nuclear genealogies, all R. schneideri sequences are clustered with sequences of R. marina from the right Amazon bank (RAB), while R. marina sequences from the left Amazon bank (LAB) are monophyletic. An Isolation-with-Migration (IM) analysis using nuclear data showed that R. marina and R. schneideri diverged at ≈ 1.69 Myr (early Pleistocene), while R. marina populations from LAB and RAB diverged at ≈ 0.33 Myr (middle Pleistocene). This time of divergence is not consistent with the split between LAB and RAB populations obtained with mtDNA data (≈ 1.59 Myr), which is notably similar to the estimate obtained with nuclear genes between R. marina and R. schneideri. Coalescent simulations of mtDNA phylogeny under the speciation history inferred from nuclear genes rejected the hypothesis of incomplete lineage sorting to explain the conflicting signal between mtDNA and nuclear-based phylogenies. 
Conclusions The cytonuclear discordance seems to reflect the occurrence of interspecific hybridization between these two closely related toad species. Overall, our results suggest a phenomenon of extensive mtDNA unidirectional introgression from the previously occurring R. schneideri into the invading R. marina. We hypothesize that climatic-induced range shifts during the Pleistocene/Holocene may have played an important role in the observed patterns of introgression. PMID:21939538

  18. High resolution optical DNA mapping

    NASA Astrophysics Data System (ADS)

    Baday, Murat

    Many types of diseases including cancer and autism are associated with copy-number variations in the genome. Most of these variations could not be identified with existing sequencing and optical DNA mapping methods. We have developed a multi-color super-resolution technique, with potential for high throughput and low cost, which can allow us to recognize more of these variations. Our technique has achieved a 10-fold improvement in the resolution of optical DNA mapping. Using a 180 kb BAC clone as a model system, we resolved dense patterns from 108 fluorescent labels of two different colors representing two different sequence motifs. Overall, a detailed DNA map with 100 bp resolution was achieved, which has the potential to reveal detailed information about genetic variance and to facilitate medical diagnosis of genetic disease.

  19. Ecology and evolution of rabies virus in Europe.

    PubMed

    Bourhy, H; Kissi, B; Audry, L; Smreczak, M; Sadkowska-Todys, M; Kulonen, K; Tordo, N; Zmudzinski, J F; Holmes, E C

    1999-10-01

    The evolution of rabies viruses of predominantly European origin was studied by comparing nucleotide sequences of the nucleoprotein and glycoprotein genes, and by typing isolates using RFLP. Phylogenetic analysis of the gene sequence data revealed a number of distinct groups, each associated with a particular geographical area. Such a pattern suggests that rabies virus has spread westwards and southwards across Europe during this century, but that physical barriers such as the Vistula river in Poland have enabled localized evolution. During this dispersal process, two species jumps took place - one into red foxes and another into raccoon dogs, although it is unclear whether virus strains are preferentially adapted to particular animal species or whether ecological forces explain the occurrence of the phylogenetic groups.

  20. Simultaneously measuring multiple protein interactions and their correlations in a cell by Protein-interactome Footprinting

    PubMed Central

    Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui

    2017-01-01

    Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein pair is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimer. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015

  1. Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

    PubMed Central

    Tetreault, Hannah M.; Ungerer, Mark C.

    2016-01-01

    The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667

  2. Temporal and Motor Representation of Rhythm in Fronto-Parietal Cortical Areas: An fMRI Study

    PubMed Central

    Konoike, Naho; Kotozaki, Yuka; Jeong, Hyeonjeong; Miyazaki, Atsuko; Sakaki, Kohei; Shinada, Takamitsu; Sugiura, Motoaki; Kawashima, Ryuta; Nakamura, Katsuki

    2015-01-01

    When sounds occur with temporally structured patterns, we can feel a rhythm. To memorize a rhythm, perception of its temporal patterns and organization of them into a hierarchically structured sequence are necessary. On the other hand, rhythm perception can often cause unintentional body movements. Thus, we hypothesized that rhythm information can be manifested in two different ways; temporal and motor representations. The motor representation depends on effectors, such as the finger or foot, whereas the temporal representation is effector-independent. We tested our hypothesis with a working memory paradigm to elucidate neuronal correlates of temporal or motor representation of rhythm and to reveal the neural networks associated with these representations. We measured brain activity by fMRI while participants memorized rhythms and reproduced them by tapping with the right finger, left finger, or foot, or by articulation. The right inferior frontal gyrus and the inferior parietal lobule exhibited significant effector-independent activations during encoding and retrieval of rhythm information, whereas the left inferior parietal lobule and supplementary motor area (SMA) showed effector-dependent activations during retrieval. These results suggest that temporal sequences of rhythm are probably represented in the right fronto-parietal network, whereas motor sequences of rhythm can be represented in the SMA-parietal network. PMID:26076024

  3. A new earthworm cellulase and its possible role in the innate immunity.

    PubMed

    Park, In Yong; Cha, Ju Roung; Ok, Suk-Mi; Shin, Chuog; Kim, Jin-Se; Kwak, Hee-Jin; Yu, Yun-Sang; Kim, Yu-Kyung; Medina, Brenda; Cho, Sung-Jin; Park, Soon Cheol

    2017-02-01

    A new endogenous cellulase (Ean-EG) from the earthworm Eisenia andrei and its expression pattern are reported. Based on the deduced amino acid sequence, the open reading frame (ORF) of Ean-EG consisted of 1368 bp, corresponding to a polypeptide of 456 amino acid residues that contains the conserved region specific to GHF9, including the amino acid residues essential for enzyme activity. In multiple alignments and phylogenetic analysis, the deduced amino acid sequence of Ean-EG showed the highest sequence similarity (about 79%) to that of an annelid (Pheretima hilgendorfi) and could be clustered together with other GHF9 cellulases, indicating that Ean-EG could be categorized as a member of the GHF9 to which most animal cellulases belong. In situ hybridization of Ean-EG mRNA revealed that the most distinct expression was in epithelial cells, with positive hybridization signals in epidermis, chloragogen tissue cells, coelomic cell-aggregate, and even blood vessel. This suggests that, at least in the earthworm Eisenia andrei, cellulase function is not limited to the digestive process but possibly extends to innate immunity. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Quantification of DNA cleavage specificity in Hi-C experiments.

    PubMed

    Meluzzi, Dario; Arya, Gaurav

    2016-01-08

    Hi-C experiments produce large numbers of DNA sequence read pairs that are typically analyzed to deduce genomewide interactions between arbitrary loci. A key step in these experiments is the cleavage of cross-linked chromatin with a restriction endonuclease. Although this cleavage should happen specifically at the enzyme's recognition sequence, an unknown proportion of cleavage events may involve other sequences, owing to the enzyme's star activity or to random DNA breakage. A quantitative estimation of these non-specific cleavages may enable simulating realistic Hi-C read pairs for validation of downstream analyses, monitoring the reproducibility of experimental conditions and investigating biophysical properties that correlate with DNA cleavage patterns. Here we describe a computational method for analyzing Hi-C read pairs to estimate the fractions of cleavages at different possible targets. The method relies on expressing an observed local target distribution downstream of aligned reads as a linear combination of known conditional local target distributions. We validated this method using Hi-C read pairs obtained by computer simulation. Application of the method to experimental Hi-C datasets from murine cells revealed interesting similarities and differences in patterns of cleavage across the various experiments considered. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
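
    The idea of expressing an observed distribution as a linear combination of known distributions can be illustrated with a deliberately simplified case. The sketch below assumes only two cleavage sources (specific and non-specific) whose weights sum to one, and fits the specific fraction by least squares. This is an illustrative reduction, not the authors' method, and all distributions and names are hypothetical.

```python
def estimate_specific_fraction(observed, specific, nonspecific):
    """Least-squares estimate of f in: observed ~ f*specific + (1-f)*nonspecific.

    All three arguments are equal-length lists of probabilities over the
    same bins. The result is clipped to the interval [0, 1].
    """
    num = sum((o - n) * (s - n)
              for o, s, n in zip(observed, specific, nonspecific))
    den = sum((s - n) ** 2 for s, n in zip(specific, nonspecific))
    if den == 0:
        raise ValueError("the two reference distributions are identical")
    return min(1.0, max(0.0, num / den))


if __name__ == "__main__":
    # Hypothetical reference distributions over three distance bins.
    specific = [0.7, 0.2, 0.1]
    nonspecific = [0.2, 0.3, 0.5]
    # Synthetic observation built from an 80/20 mixture of the two.
    observed = [0.8 * s + 0.2 * n for s, n in zip(specific, nonspecific)]
    print(round(estimate_specific_fraction(observed, specific, nonspecific), 3))
```

    With more than two candidate targets the same idea becomes a constrained multi-variable least-squares problem, which needs a proper solver rather than this closed form.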

  5. Distribution of Interstitial Telomeric Sequences in Primates and the Pygmy Tree Shrew (Scandentia).

    PubMed

    Mazzoleni, Sofia; Schillaci, Odessa; Sineo, Luca; Dumas, Francesca

    2017-01-01

    It has been hypothesized that interstitial telomeric sequences (ITSs), i.e., repeated telomeric DNA sequences found at intrachromosomal sites in many vertebrates, could be correlated to chromosomal rearrangements and plasticity. To test this hypothesis, we hybridized a telomeric PNA probe through FISH on representative species of 2 primate infraorders, Strepsirrhini (Lemur catta, Otolemur garnettii, Nycticebus coucang) and Catarrhini (Erythrocebus patas, Cercopithecus petaurista, Chlorocebus aethiops, Colobus guereza), as well as on 1 species of the order Scandentia, Tupaia minor, used as an outgroup for primates in phylogenetic reconstructions. In almost all primate species analyzed, we found a telomeric pattern only. In Tupaia, the hybridization revealed many bright ITSs on at least 11 chromosome pairs, both biarmed and acrocentric. These ITS signals in Tupaia correspond to fusion points of ancestral human syntenic associations, but are also present in other chromosomes showing synteny to only a single human chromosome. This distribution pattern was compared to that of the heterochromatin regions detected through sequential C-banding performed after FISH. Our results in the analyzed species, compared with literature data on ITSs in primates, allowed us to discuss different mechanisms responsible for the origin and distribution of ITSs, supporting the correlation between rearrangements and ITSs. © 2017 S. Karger AG, Basel.

  6. Necessary Sequencing Depth and Clustering Method to Obtain Relatively Stable Diversity Patterns in Studying Fish Gut Microbiota.

    PubMed

    Xiao, Fanshu; Yu, Yuhe; Li, Jinjin; Juneau, Philippe; Yan, Qingyun

    2018-05-25

    The 16S rRNA gene has been one of the most commonly used molecular markers for estimating bacterial diversity over the past decades. However, sequencing depth is not consistent across studies (from thousands to millions of sequences per sample), and the clustering methods used to generate OTUs may also differ among studies. These inconsistent premises make effective comparisons among studies difficult or unreliable. This study aims to examine the sequencing depth and clustering method needed to ensure stable diversity patterns when studying fish gut microbiota. A dataset of 42 samples of Siniperca chuatsi (a carnivorous fish) gut microbiota was used to test how sequencing depth and clustering may affect the alpha and beta diversity patterns of fish intestinal microbiota. Interestingly, we found that the sequencing depth (resampling 1000-11,000 sequences per sample) and the clustering methods (UPARSE and UCLUST) did not bias the estimates of the diversity patterns during fish development from larva to adult. Although we should acknowledge that a suitable sequencing depth may differ case by case, our finding indicates that shallow sequencing, such as 1000 sequences per sample, may also be enough to reflect the general diversity patterns of fish gut microbiota. However, we have shown in the present study that strict pre-processing of the original sequences is required to ensure reliable results. This study provides evidence to help make a sound scientific choice of sequencing depth and clustering method for future studies on fish gut microbiota patterns while reducing the costs of the analysis as much as possible.
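
    Resampling a sample to a fixed depth and then computing an alpha diversity index, as described in the abstract above, might look like the following sketch. The OTU counts are made up, and the Shannon index is only one of several possible diversity measures; the abstract does not say which indices were used.

```python
import numpy as np

def rarefy(counts, depth, rng):
    """Subsample `depth` reads without replacement from an OTU count vector."""
    counts = np.asarray(counts)
    if depth > counts.sum():
        raise ValueError("depth exceeds the number of reads in the sample")
    reads = np.repeat(np.arange(counts.size), counts)  # one entry per read
    picked = rng.choice(reads, size=depth, replace=False)
    return np.bincount(picked, minlength=counts.size)

def shannon(counts):
    """Shannon diversity index H' = -sum(p * ln p) over non-zero OTUs."""
    counts = np.asarray(counts, dtype=float)
    p = counts[counts > 0] / counts.sum()
    return float(-(p * np.log(p)).sum())

rng = np.random.default_rng(0)
sample = np.array([6000, 3000, 1500, 400, 100])  # made-up counts, 11,000 reads
shallow = rarefy(sample, 1000, rng)              # resample to 1000 reads
```

    Comparing `shannon(sample)` with `shannon(shallow)` over many random draws gives a rough sense of how much a shallow depth perturbs the index for a given community.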

  7. Minimizing eddy currents induced in the ground plane of a large phased-array ultrasound applicator for echo-planar imaging-based MR thermometry.

    PubMed

    Lechner-Greite, Silke M; Hehn, Nicolas; Werner, Beat; Zadicario, Eyal; Tarasek, Matthew; Yeo, Desmond

    2016-01-01

    The study aims to investigate different ground plane segmentation designs of an ultrasound transducer to reduce gradient field induced eddy currents and the associated geometric distortion and temperature map errors in echo-planar imaging (EPI)-based MR thermometry in transcranial magnetic resonance (MR)-guided focused ultrasound (tcMRgFUS). Six different ground plane segmentations were considered and the efficacy of each in suppressing eddy currents was investigated in silico and in operando. For the latter case, the segmented ground planes were implemented in a transducer mockup model for validation. Robust spoiled gradient (SPGR) echo sequences and multi-shot EPI sequences were acquired. For each sequence and pattern, geometric distortions were quantified in the magnitude images and expressed in millimeters. Phase images were used for extracting the temperature maps on the basis of the temperature-dependent proton resonance frequency shift phenomenon. The means, standard deviations, and signal-to-noise ratios (SNRs) were extracted and contrasted with the geometric distortions of all patterns. The geometric distortion analysis and temperature map evaluations showed that more than one pattern could be considered the best-performing transducer. In the sagittal plane, the star (d) (3.46 ± 2.33 mm) and star-ring patterns (f) (2.72 ± 2.8 mm) showed smaller geometric distortions than the currently available seven-segment sheet (c) (5.54 ± 4.21 mm) and were both comparable to the reference scenario (a) (2.77 ± 2.24 mm). Contrasting these results with the temperature maps revealed that (d) performs as well as (a) in SPGR and EPI. We demonstrated that segmenting the transducer ground plane into a star pattern reduces eddy currents to a level wherein multi-plane EPI for accurate MR thermometry in tcMRgFUS is feasible.
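
    The temperature maps mentioned above rest on the proton resonance frequency (PRF) shift relation, in which a phase difference between two images is proportional to the temperature change. A minimal sketch of that conversion follows. The thermal coefficient of about -0.01 ppm/°C is a commonly quoted value for water-based tissue, and the field strength and echo time in the usage note are placeholders; none of these numbers come from the abstract.

```python
import math

GAMMA_HZ_PER_T = 42.576e6   # proton gyromagnetic ratio divided by 2*pi
ALPHA_PER_C = -0.01e-6      # assumed PRF thermal coefficient (-0.01 ppm/°C)

def prf_temperature_change(delta_phase_rad, b0_tesla, te_seconds):
    """Temperature change (°C) from a phase difference (rad) between
    a heated image and a baseline image."""
    scale = 2 * math.pi * GAMMA_HZ_PER_T * ALPHA_PER_C * b0_tesla * te_seconds
    return delta_phase_rad / scale
```

    For example, `prf_temperature_change(-0.08, 3.0, 0.01)` converts a phase difference of -0.08 rad at an assumed 3 T and 10 ms echo time into a temperature rise of roughly 1 °C. Because the map is linear in phase, any eddy-current-induced phase error translates directly into a temperature error.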

  8. LINE-1 retrotransposons: from 'parasite' sequences to functional elements.

    PubMed

    Paço, Ana; Adega, Filomena; Chaves, Raquel

    2015-02-01

    Long interspersed nuclear elements-1 (LINE-1) are the most abundant and active retrotransposons in mammalian genomes. Traditionally, the occurrence of LINE-1 sequences in the genomes of mammals has been explained by the selfish DNA hypothesis. More recently, however, it has also been argued that these sequences could play important roles in these genomes, such as in the regulation of gene expression, genome modelling and X-chromosome inactivation. The non-random chromosomal distribution is a striking feature of these retroelements that somehow reflects their functionality. In the present study, we have isolated and analysed a fraction of the open reading frame 2 (ORF2) LINE-1 sequence from three rodent species, Cricetus cricetus, Peromyscus eremicus and Praomys tullbergi. Physical mapping of the isolated sequences revealed an interspersed longitudinal AT pattern of distribution along all the chromosomes of the complement in the three genomes. A detailed analysis shows that these sequences are preferentially located in the euchromatic regions, although some signals could be detected in the heterochromatin. In addition, a coincidence between the location of imprinted gene regions (such as the Xist and Tsix gene regions) and the LINE-1 retroelements was also observed. According to these results, we propose an involvement of LINE-1 sequences in different genomic events, such as gene imprinting, X-chromosome inactivation and the evolution of repetitive sequences located at the heterochromatic regions (e.g. satellite DNA sequences) of the rodent genomes analysed.

  9. Systematic and fully automated identification of protein sequence patterns.

    PubMed

    Hart, R K; Royyuru, A K; Stolovitzky, G; Califano, A

    2000-01-01

    We present an efficient algorithm to systematically and automatically identify patterns in protein sequence families. The procedure is based on the Splash deterministic pattern discovery algorithm and on a framework to assess the statistical significance of patterns. We demonstrate its application to the fully automated discovery of patterns in 974 PROSITE families (the complete subset of PROSITE families which are defined by patterns and contain DR records). Splash generates patterns with better specificity and undiminished sensitivity, or vice versa, in 28% of the families; identical statistics were obtained in 48% of the families, worse statistics in 15%, and mixed behavior in the remaining 9%. In about 75% of the cases, Splash patterns identify sequence sites that overlap more than 50% with the corresponding PROSITE pattern. The procedure is sufficiently rapid to enable its use for daily curation of existing motif and profile databases. Third, our results show that the statistical significance of discovered patterns correlates well with their biological significance. The trypsin subfamily of serine proteases is used to illustrate this method's ability to exhaustively discover all motifs in a family that are statistically and biologically significant. Finally, we discuss applications of sequence patterns to multiple sequence alignment and the training of more sensitive score-based motif models, akin to the procedure used by PSI-BLAST. All results are available at http://www.research.ibm.com/spat/.
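
    Comparing discovered patterns against PROSITE patterns requires locating pattern matches in sequences. One simple way to do that, sketched here, is to translate PROSITE-style pattern syntax into a regular expression. This has nothing to do with how Splash itself works; the helper name and the example patterns are hypothetical, and only the most common syntax elements are handled.

```python
import re

def prosite_to_regex(pattern):
    """Translate a PROSITE-style pattern string into a Python regex string."""
    regex = pattern.rstrip(".")                          # drop trailing period
    regex = regex.replace("-", "")                       # element separators
    regex = regex.replace("{", "[^").replace("}", "]")   # excluded residues
    regex = regex.replace("(", "{").replace(")", "}")    # repeat counts
    regex = regex.replace("x", ".")                      # any residue
    regex = regex.replace("<", "^").replace(">", "$")    # sequence termini
    return regex

def match_spans(pattern, sequence):
    """Start/end positions of every non-overlapping match in `sequence`."""
    compiled = re.compile(prosite_to_regex(pattern))
    return [m.span() for m in compiled.finditer(sequence)]
```

    For instance, `prosite_to_regex("C-x(2,4)-C-{P}.")` yields `C.{2,4}C[^P]`. The spans returned by `match_spans` for two patterns could then be intersected to estimate how much the matched sites overlap.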

  10. Analysis of ELA-DQB exon 2 polymorphism in Argentine Creole horses by PCR-RFLP and PCR-SSCP.

    PubMed

    Villegas-Castagnasso, E E; Díaz, S; Giovambattista, G; Dulout, F N; Peral-García, P

    2003-08-01

    The second exon of equine leucocyte antigen (ELA)-DQB genes was amplified from genomic DNA of 32 Argentine Creole horses by PCR. Amplified DNA was analysed by PCR-restriction fragment length polymorphism (RFLP) and PCR-single-strand conformation polymorphism (SSCP). The PCR-RFLP analysis revealed two HaeIII patterns, four RsaI patterns, five MspI patterns and two HinfI patterns. EcoRI showed no variation in the analysed sample. Additional patterns that did not account for known exon 2 DNA sequences were observed, suggesting the existence of novel ELA-DQB alleles. PCR-SSCP analysis exhibited seven different band patterns, and the number of bands per animal ranged from four to nine. Both methods indicated that at least two DQB genes are present. The presence of more than two alleles in each animal showed that the primers employed in this work are not specific for a unique DQB locus. The improvement of this PCR-RFLP method should provide a simple and rapid technique for an accurate definition of ELA-DQB typing in horses.
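
    An RFLP pattern is the set of fragment lengths produced when an amplicon is cut at every recognition site of an enzyme. The sketch below computes such fragment lengths in silico for the five enzymes named in the abstract, using their standard recognition sites and top-strand cut positions. The test sequence in the usage note is invented and is not an ELA-DQB sequence.

```python
import re

# Recognition site (as a regex) and cut offset on the top strand, 5'->3'.
ENZYMES = {
    "HaeIII": ("GGCC", 2),        # GG^CC
    "RsaI": ("GTAC", 2),          # GT^AC
    "MspI": ("CCGG", 1),          # C^CGG
    "HinfI": ("GA[ACGT]TC", 1),   # G^ANTC
    "EcoRI": ("GAATTC", 1),       # G^AATTC
}

def fragment_lengths(seq, enzyme):
    """Fragment lengths after complete digestion of a linear sequence."""
    site, offset = ENZYMES[enzyme]
    # A lookahead is used so that overlapping sites are all reported.
    cuts = [m.start() + offset
            for m in re.finditer(f"(?={site})", seq.upper())]
    edges = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(edges, edges[1:])]
```

    For the invented sequence `"AAAAGGCCTTTTTTGGCCAA"`, HaeIII cuts twice and gives fragments of 6, 10 and 4 bases, while EcoRI finds no site and leaves the 20-base sequence intact. Two alleles that differ at a recognition site would yield different fragment lists, which is what the gel patterns reflect.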

  11. Spatial patterns of fasting and fed antropyloric pressure waves in humans.

    PubMed Central

    Sun, W M; Hebbard, G S; Malbert, C H; Jones, K L; Doran, S; Horowitz, M; Dent, J

    1997-01-01

    1. Gastric mechanics were investigated by categorizing the temporal and spatial patterning of pressure waves associated with individual gastric contractions. 2. In twelve healthy volunteers, intraluminal pressures were monitored from nine side hole recording points spaced at 1.5 cm intervals along the antrum, pylorus and duodenum. 3. Pressure wave sequences that occurred during phase II fasting contractions (n = 221) and after food (n = 778) were evaluated. 4. The most common pattern of pressure wave onset along the antrum was a variable combination of antegrade, synchronous and retrograde propagation between side hole pairs. This variable pattern accounted for 42% of sequences after food, and 34% during fasting (P < 0.05). Other common pressure wave sequence patterns were: purely antegrade, 29% after food and 42% during fasting (P < 0.05); purely synchronous, 23% fed and 17% fasting; and purely retrograde, 6% fed and 8% fasting. The length of sequences was shorter after food (P < 0.05). Some sequences 'skipped' individual recording points. 5. The spatial patterning of gastric pressure wave sequences is diverse, and may explain the differing mechanical outcomes among individual gastric contractions. 6. Better understanding of gastric mechanics may be gained from temporally precise correlations of luminal flows and pressures and gastric wall motion during individual gastric contraction sequences. PMID:9306286

  12. Diversity of the small subunit ribosomal RNA gene of the arbuscular mycorrhizal fungi colonizing Clintonia borealis from a mixed-wood boreal forest.

    PubMed

    DeBellis, Tonia; Widden, Paul

    2006-11-01

    Arbuscular mycorrhizal fungi (AMF) communities in Clintonia borealis roots from a boreal mixed forest in northwestern Québec were investigated. Roots were sampled from 100 m² plots whose overstory was dominated by either trembling aspen (Populus tremuloides Michx.), white birch (Betula papyrifera Marsh.), or mixed white spruce (Picea glauca (Moench) Voss) and balsam fir (Abies balsamea (L.) Mill.). Part of the 18S ribosomal gene of the AMF was amplified and the resulting PCR products were cloned. Restriction analysis of the 576 resulting clones yielded 92 different restriction patterns, which were then sequenced. Fifty-two sequences closely matched other Glomus sequences from GenBank. Phylogenetic analysis revealed 10 different AMF sequence types, most of which clustered with other uncultured AM sequences from plant roots from various field sites. Compared with other AMF communities from comparable studies, richness and diversity were higher than observed in an arable field, but lower than seen in a tropical forest and a temperate wetland. The AMF communities from Clintonia roots under the different canopy types did not differ significantly and the dominant sequence type, which clustered with AM sequences from a variety of environments and hosts at distant geographical locations, represented 66.9% of all the clones analyzed.

  13. Disrupted implicit motor sequence learning in schizophrenia and bipolar disorder revealed with ambidextrous Serial Reaction Time Task.

    PubMed

    Chrobak, Adrian Andrzej; Siuda-Krzywicka, Katarzyna; Siwek, Grzegorz Przemysław; Tereszko, Anna; Janeczko, Weronika; Starowicz-Filip, Anna; Siwek, Marcin; Dudek, Dominika

    2017-10-03

    Impairment of implicit motor sequence learning has been shown in schizophrenia (SZ) and, most recently, in bipolar disorder (BD), and has been connected to cerebellar abnormalities. The goal of this study was to compare implicit motor sequence learning in BD and SZ. We examined 33 patients with BD, 33 patients with SZ and 31 healthy controls using an ambidextrous Serial Reaction Time Task (SRTT), which allows exploring asymmetries in performance depending on the hand used. BD and SZ patients presented impaired implicit motor sequence learning, although the pattern of their impairments was different. While BD patients showed no signs of implicit motor sequence learning for either hand, the SZ group presented some features of motor learning when performing with the right, but not with the left, hand. To the best of our knowledge, this is the first study comparing implicit motor sequence learning in BD and SZ. We show that both diseases share impairments in this domain; however, in the case of SZ this impairment differs depending on the hand performing the SRTT. We propose that implicit motor sequence learning impairments constitute an overlapping symptom in BD and SZ and suggest further neuroimaging studies to verify cerebellar underpinnings as their cause. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Singular over-representation of an octameric palindrome, HIP1, in DNA from many cyanobacteria.

    PubMed

    Robinson, N J; Robinson, P J; Gupta, A; Bleasby, A J; Whitton, B A; Morby, A P

    1995-03-11

    An octameric palindrome (5'-GCGATCGC-3') is abundant in cyanobacterial sequences within databases (GenBank/EMBL) and was designated HIP1 (highly iterated palindrome). The frequency of occurrence of all 256 octameric palindromes has now been determined in sub-databases, revealing large and unique over-representation of HIP1 in cyanobacterial entries. DNA sequences from other bacteria were searched for any over-represented octameric palindromes analogous to HIP1. Only two sequences were identified, in the genomes of a thermophile and halophilic archaebacteria, although these were less abundant than HIP1 in cyanobacteria and relate to codon usage. To test the proposed widespread distribution of HIP1 in DNA from the cyanobacterium Synechococcus PCC 6301, randomly selected genomic clones were partly sequenced. HIP1 constituted 2.5% of the novel sequences, equivalent to a site on average once every 320 nucleotides. An oligonucleotide including HIP1 was also tested in PCR. Multiple products were obtained using template DNA from cyanobacterial strains in which HIP1 is abundant in known sequences, and some strains generated characteristic HIP-PCR banding patterns. However, analysis of DNA from one strain (not previously represented in databases) by random sequencing, HIP-PCR and PvuI digestion confirms that not all cyanobacterial genomes are rich in HIP1.
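
    The figure of 256 octameric palindromes follows from the fact that a sequence equal to its own reverse complement is fully determined by its first four bases (4^4 = 256). The sketch below enumerates them and counts overlapping occurrences of a motif; the function names and the test string in the usage note are made up for illustration and are not part of the original analysis.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def octameric_palindromes():
    """All 8-mers that equal their own reverse complement."""
    result = []
    for half in product("ACGT", repeat=4):
        left = "".join(half)
        result.append(left + reverse_complement(left))
    return result

def count_overlapping(seq, motif):
    """Number of occurrences of `motif` in `seq`, overlaps included."""
    count, start = 0, seq.find(motif)
    while start != -1:
        count += 1
        start = seq.find(motif, start + 1)
    return count

HIP1 = "GCGATCGC"
palindromes = octameric_palindromes()
```

    Counting every palindrome in a set of sequences, for example with `{p: count_overlapping(genome, p) for p in palindromes}` for some sequence string `genome`, gives the frequency table from which over-representation can be judged. One site every 320 nucleotides corresponds to 8/320 = 2.5% of the sequence, matching the figure in the abstract.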

  15. Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns

    PubMed Central

    2013-01-01

    Background It is well known that the search for homologous RNAs is more effective if both sequence and structure information is incorporated into the search. However, current tools for searching with RNA sequence-structure patterns cannot fully handle mutations occurring on both these levels or are simply not fast enough for searching large sequence databases because of the high computational costs of the underlying sequence-structure alignment problem. Results We present new fast index-based and online algorithms for approximate matching of RNA sequence-structure patterns supporting a full set of edit operations on single bases and base pairs. Our methods efficiently compute semi-global alignments of structural RNA patterns and substrings of the target sequence whose costs satisfy a user-defined sequence-structure edit distance threshold. For this purpose, we introduce a new computing scheme to optimally reuse the entries of the required dynamic programming matrices for all substrings and combine it with a technique for avoiding the alignment computation of non-matching substrings. Our new index-based methods exploit suffix arrays preprocessed from the target database and achieve running times that are sublinear in the size of the searched sequences. To support the description of RNA molecules that fold into complex secondary structures with multiple ordered sequence-structure patterns, we use fast algorithms for the local or global chaining of approximate sequence-structure pattern matches. The chaining step removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our improved online algorithm is faster than the best previous method by up to a factor of 45. Our best new index-based algorithm achieves a speedup by a factor of 560. Conclusions The presented methods achieve considerable speedups compared to the best previous method. This, together with the expected sublinear running time of the presented index-based algorithms, allows for the first time approximate matching of RNA sequence-structure patterns in large sequence databases. Beyond the algorithmic contributions, we provide with RaligNAtor a robust and well documented open-source software package implementing the algorithms presented in this manuscript. The RaligNAtor software is available at http://www.zbh.uni-hamburg.de/ralignator. PMID:23865810
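
    As a much simplified illustration of semi-global alignment under an edit distance threshold, the sketch below implements the classic dynamic programming scheme for approximate substring matching on plain sequences. It deliberately ignores base pairs and structure, uses no index, and is not the RaligNAtor algorithm; the function name and unit edit costs are assumptions.

```python
def approximate_matches(pattern, text, max_edits):
    """Return (end_position, edit_distance) pairs for substrings of `text`
    that match `pattern` with at most `max_edits` edits.
    End positions are 1-based."""
    m = len(pattern)
    # column[i] = fewest edits aligning pattern[:i] to a substring ending here
    column = list(range(m + 1))
    hits = []
    for j, char in enumerate(text, start=1):
        previous = column
        column = [0] * (m + 1)   # row 0 stays 0: a match may start anywhere
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == char else 1
            column[i] = min(previous[i - 1] + cost,  # match or substitution
                            previous[i] + 1,         # extra text character
                            column[i - 1] + 1)       # missing pattern character
        if column[m] <= max_edits:
            hits.append((j, column[m]))
    return hits
```

    Only one column of the matrix is kept at a time, so memory use is linear in the pattern length. The running time is proportional to pattern length times text length, which is the cost that index-based methods aim to avoid on large databases.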

  16. Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure.

    PubMed

    Wu, Jianzhong; Fujisawa, Masaki; Tian, Zhixi; Yamagata, Harumi; Kamiya, Kozue; Shibata, Michie; Hosokawa, Satomi; Ito, Yukiyo; Hamada, Masao; Katagiri, Satoshi; Kurita, Kanako; Yamamoto, Mayu; Kikuta, Ari; Machita, Kayo; Karasawa, Wataru; Kanamori, Hiroyuki; Namiki, Nobukazu; Mizuno, Hiroshi; Ma, Jianxin; Sasaki, Takuji; Matsumoto, Takashi

    2009-12-01

    Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.

  17. Burkholderia pseudomallei sequencing identifies genomic clades with distinct recombination, accessory, and epigenetic profiles

    PubMed Central

    Nandi, Tannistha; Holden, Matthew T.G.; Didelot, Xavier; Mehershahi, Kurosh; Boddey, Justin A.; Beacham, Ifor; Peak, Ian; Harting, John; Baybayan, Primo; Guo, Yan; Wang, Susana; How, Lee Chee; Sim, Bernice; Essex-Lopresti, Angela; Sarkar-Tyson, Mitali; Nelson, Michelle; Smither, Sophie; Ong, Catherine; Aw, Lay Tin; Hoon, Chua Hui; Michell, Stephen; Studholme, David J.; Titball, Richard; Chen, Swaine L.; Parkhill, Julian

    2015-01-01

    Burkholderia pseudomallei (Bp) is the causative agent of the infectious disease melioidosis. To investigate population diversity, recombination, and horizontal gene transfer in closely related Bp isolates, we performed whole-genome sequencing (WGS) on 106 clinical, animal, and environmental strains from a restricted Asian locale. Whole-genome phylogenies resolved multiple genomic clades of Bp, largely congruent with multilocus sequence typing (MLST). We discovered widespread recombination in the Bp core genome, involving hundreds of regions associated with multiple haplotypes. Highly recombinant regions exhibited functional enrichments that may contribute to virulence. We observed clade-specific patterns of recombination and accessory gene exchange, and provide evidence that this is likely due to ongoing recombination between clade members. Reciprocally, interclade exchanges were rarely observed, suggesting mechanisms restricting gene flow between clades. Interrogation of accessory elements revealed that each clade harbored a distinct complement of restriction-modification (RM) systems, predicted to cause clade-specific patterns of DNA methylation. Using methylome sequencing, we confirmed that representative strains from separate clades indeed exhibit distinct methylation profiles. Finally, using an E. coli system, we demonstrate that Bp RM systems can inhibit uptake of non-self DNA. Our data suggest that RM systems borne on mobile elements, besides preventing foreign DNA invasion, may also contribute to limiting exchanges of genetic material between individuals of the same species. Genomic clades may thus represent functional units of genetic isolation in Bp, modulating intraspecies genetic diversity. PMID:25236617

  18. Heteroassociative storage of hippocampal pattern sequences in the CA3 subregion

    PubMed Central

    Recio, Renan S.; Reyes, Marcelo B.

    2018-01-01

    Background Recent research suggests that the CA3 subregion of the hippocampus has properties of both an autoassociative network, due to its ability to complete partial cues, tolerate noise, and store associations between memories, and a heteroassociative one, due to its ability to store and retrieve sequences of patterns. Although there are several computational models of the CA3 as an autoassociative network, more detailed evaluations of its heteroassociative properties are missing. Methods We developed a model of the CA3 subregion containing 10,000 integrate-and-fire neurons with both recurrent excitatory and inhibitory connections, and which exhibits coupled oscillations in the gamma and theta ranges. We stored thousands of pattern sequences using a heteroassociative learning rule with competitive synaptic scaling. Results We showed that a purely heteroassociative network model can (i) retrieve pattern sequences from partial cues with external noise and incomplete connectivity, (ii) achieve homeostasis in the number of connections per neuron, through synaptic scaling, when many patterns are stored, and (iii) continuously update the set of retrievable patterns, guaranteeing that the last stored patterns can be retrieved and older ones can be forgotten. Discussion Heteroassociative networks with synaptic scaling rules seem sufficient to achieve many desirable features regarding connectivity homeostasis, pattern sequence retrieval, noise tolerance and updating of the set of retrievable patterns. PMID:29312826
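
    The core idea of heteroassociative storage, linking each pattern to its successor rather than to itself, can be shown with a toy binary network. The sketch below is far simpler than the model in the abstract: it has no spiking neurons, oscillations, or synaptic scaling, and it assumes mutually orthogonal +/-1 patterns (rows of a Hadamard matrix) so that retrieval is exact. All names are hypothetical.

```python
import numpy as np

def hadamard(n):
    """Sylvester construction of an n x n matrix of +/-1 entries with
    mutually orthogonal rows (n must be a power of two)."""
    h = np.array([[1]])
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h

def store_sequence(patterns):
    """Heteroassociative Hebbian rule: link each pattern to its successor."""
    n = patterns.shape[1]
    weights = np.zeros((n, n))
    for current, following in zip(patterns[:-1], patterns[1:]):
        weights += np.outer(following, current) / n
    return weights

def recall_next(weights, cue):
    """One retrieval step: threshold the weighted input at zero."""
    return np.where(weights @ cue >= 0, 1, -1)

patterns = hadamard(64)[1:6]      # five orthogonal +/-1 patterns, 64 units
weights = store_sequence(patterns)

noisy_cue = patterns[0].copy()
noisy_cue[:4] *= -1               # corrupt four units of the first pattern
retrieved = recall_next(weights, noisy_cue)
```

    Here `retrieved` equals the second stored pattern even though the cue was corrupted, and applying `recall_next` repeatedly walks through the rest of the sequence. With random, non-orthogonal patterns the same rule still works approximately, with crosstalk growing as more sequences are stored.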

  19. Effect of oxygen minimum zone formation on communities of marine protists

    PubMed Central

    Orsi, William; Song, Young C; Hallam, Steven; Edgcomb, Virginia

    2012-01-01

    Changes in ocean temperature and circulation patterns compounded by human activities are leading to oxygen minimum zone (OMZ) expansion with concomitant alteration in nutrient and climate active trace gas cycling. Here, we report the response of microbial eukaryote populations to seasonal changes in water column oxygen-deficiency using Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island British Columbia, as a model ecosystem. We combine small subunit ribosomal RNA gene sequencing approaches with multivariate statistical methods to reveal shifts in operational taxonomic units during successive stages of seasonal stratification and renewal. A meta-analysis is used to identify common and unique patterns of community composition between Saanich Inlet and the anoxic/sulfidic Cariaco Basin (Venezuela) and Framvaren Fjord (Norway) to show shared and unique responses of microbial eukaryotes to oxygen and sulfide in these three environments. Our analyses also reveal temporal fluctuations in rare populations of microbial eukaryotes, particularly anaerobic ciliates, that may be of significant importance to the biogeochemical cycling of methane in OMZs. PMID:22402396

  20. Source environment feature related phylogenetic distribution pattern of anoxygenic photosynthetic bacteria as revealed by pufM analysis.

    PubMed

    Zeng, Yonghui; Jiao, Nianzhi

    2007-06-01

    Anoxygenic photosynthesis, performed primarily by anoxygenic photosynthetic bacteria (APB), is thought to have arisen on Earth more than 3 billion years ago. The long-established APB are distributed in almost every corner where light can reach. However, the relationship between APB phylogeny and source environments has been largely unexplored. Here we retrieved the pufM sequences and related source information of 89 pufM-containing species from the public database. Phylogenetic analysis revealed that horizontal gene transfer (HGT) most likely occurred within 11 out of a total of 21 pufM subgroups, not only among species within the same class but also among species of different phyla or subphyla. A clear phylogenetic distribution pattern related to source environment features was observed, with all species from oxic habitats and those from anoxic habitats clustering into independent subgroups, respectively. HGT among ancient APB and subsequent long-term evolution and adaptation to separated niches may have contributed to the coupling of environment and pufM phylogeny.

  1. Diminishing-returns epistasis decreases adaptability along an evolutionary trajectory.

    PubMed

    Wünsche, Andrea; Dinh, Duy M; Satterwhite, Rebecca S; Arenas, Carolina Diaz; Stoebel, Daniel M; Cooper, Tim F

    2017-03-01

    Populations evolving in constant environments exhibit declining adaptability. Understanding the basis of this pattern could reveal underlying processes determining the repeatability of evolutionary outcomes. In principle, declining adaptability can be due to a decrease in the effect size of beneficial mutations, a decrease in the rate at which they occur, or some combination of both. By evolving Escherichia coli populations started from different steps along a single evolutionary trajectory, we show that declining adaptability is best explained by a decrease in the size of available beneficial mutations. This pattern reflected the dominant influence of negative genetic interactions that caused new beneficial mutations to confer smaller benefits in fitter genotypes. Genome sequencing revealed that starting genotypes that were more similar to one another did not exhibit greater similarity in terms of new beneficial mutations, supporting the view that epistasis acts globally, having a greater influence on the effect than on the identity of available mutations along an adaptive trajectory. Our findings provide support for a general mechanism that leads to predictable phenotypic evolutionary trajectories.

  2. A framework for the comparative study of language.

    PubMed

    Uriagereka, Juan; Reggia, James A; Wilkinson, Gerald S

    2013-07-18

    Comparative studies of language are difficult because few language precursors are recognized. In this paper we propose a framework for designing experiments that test for structural and semantic patterns indicative of simple or complex grammars as originally described by Chomsky. We argue that a key issue is whether animals can recognize full recursion, which is the hallmark of context-free grammar. We discuss limitations of recent experiments that have attempted to address this issue, and point out that experiments aimed at detecting patterns that follow a Fibonacci series have advantages over other artificial context-free grammars. We also argue that experiments using complex sequences of behaviors could, in principle, provide evidence for fully recursive thought. Some of these ideas could also be approached using artificial life simulations, which have the potential to reveal the types of evolutionary transitions that could occur over time. Because the framework we propose has specific memory and computational requirements, future experiments could target candidate genes with the goal of revealing the genetic underpinnings of complex cognition.

  3. Whole genome sequence revealed the fine transmission map of carbapenem-resistant Klebsiella pneumonia isolates within a nosocomial outbreak.

    PubMed

    Sui, Wenjun; Zhou, Haijian; Du, Pengcheng; Wang, Lijun; Qin, Tian; Wang, Mei; Ren, Hongyu; Huang, Yanfei; Hou, Jing; Chen, Chen; Lu, Xinxin

    2018-01-01

    Carbapenem-resistant Klebsiella pneumoniae (CRKP) is a major cause of nosocomial infections worldwide. The transmission route of CRKP isolates within an outbreak is rarely described. This study aimed to reveal the molecular characteristics and transmission route of CRKP isolates within an outbreak of nosocomial infection. Case information collection, active screening and targeted environmental monitoring were carried out. The antibiotic susceptibility, drug-resistance genes, molecular subtype and whole genome sequence of CRKP strains were analyzed. Between October and December 2011, 26 CRKP isolates were collected from eight patients in a surgical intensive care unit and subsequent transfer wards of Beijing Tongren Hospital, China. All 26 isolates harbored blaKPC-2, blaSHV-1, and blaCTX-M-15 genes, had the same or similar pulsed-field gel electrophoresis patterns, and belonged to the sequence type 11 (ST11) clone. By comprehensive consideration of genomic and epidemiological information, a putative transmission map was constructed, including identifying one case as an independent event distinct from the other seven cases, and revealing two transmissions starting from the same case. This study provided the first report confirming an outbreak caused by a K. pneumoniae ST11 clone co-harboring the blaKPC-2, blaCTX-M-15, and blaSHV-1 genes, and suggested that comprehensive consideration of genomic and epidemiological data can yield a fine transmission map of an outbreak and facilitate the control of nosocomial transmission.

  4. A Search for Gene Fusions/Translocations in Breast Cancer

    DTIC Science & Technology

    2013-11-01

    Ramnarayanan K, Brenner JC, Yu J , Kim JH, Han B, Tan P, Kumar-Sinha C, Lonigro RJ, Palanisamy N, Maher CA, Chinnaiyan AM. Science. 2008 Dec 12;322...Barrette TR, Grasso C, Yu J , Lonigro RJ, Schroth G, Kumar-Sinha C, Chinnaiyan AM. Proc Natl Acad Sci U S A. 2009 Jul 10. [Epub ahead of print]. PMID...CA, Palanisamy N, Mehra R, Kominsky HD, Siddiqui J , Yu J , Qin ZS, Chinnaiyan AM. Deep sequencing reveals distinct patterns of DNA methylation in

  5. Business Planning in the Light of Neuro-fuzzy and Predictive Forecasting

    NASA Astrophysics Data System (ADS)

    Chakrabarti, Prasun; Basu, Jayanta Kumar; Kim, Tai-Hoon

    In this paper we have pointed out gain sensing on forecast-based techniques. We have cited an idea of neural-based gain forecasting. Testing of the sequence of gain patterns is also verified using statistical analysis of fuzzy value assignment. The paper also suggests realization of a stable gain condition using K-Means clustering of data mining. A new concept of 3D-based gain sensing has been pointed out. The paper also reveals what type of trend analysis can be observed for probabilistic gain prediction.

  6. Ongoing outbreak of invasive listeriosis, Germany, 2012 to 2015.

    PubMed

    Ruppitsch, Werner; Prager, Rita; Halbedel, Sven; Hyden, Patrick; Pietzka, Ariane; Huhulescu, Steliana; Lohr, Dorothee; Schönberger, Katharina; Aichinger, Elisabeth; Hauri, Anja; Stark, Klaus; Vygen, Sabine; Tietze, Erhard; Allerberger, Franz; Wilking, Hendrik

    2015-01-01

    Listeriosis patient isolates in Germany have shown a new identical pulsed-field gel electrophoresis (PFGE) pattern since 2012 (n = 66). Almost all isolates (Listeria monocytogenes serotype 1/2a) belonged to cases living in southern Germany, indicating an outbreak with a so far unknown source. Case numbers in 2015 are high (n = 28). No outbreak cases outside Germany have been reported. Next generation sequencing revealed the unique cluster type CT1248 and confirmed the outbreak. Investigations into the source are ongoing.

  7. An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.

    PubMed

    Ye, Kai; Kosters, Walter A; Ijzerman, Adriaan P

    2007-03-15

    Pattern discovery in protein sequences is often based on multiple sequence alignments (MSA). The procedure can be computationally intensive and often requires manual adjustment, which may be particularly difficult for a set of deviating sequences. In contrast, two algorithms, PRATT2 (http://www.ebi.ac.uk/pratt/) and TEIRESIAS (http://cbcsrv.watson.ibm.com/) are used to directly identify frequent patterns from unaligned biological sequences without an attempt to align them. Here we propose a new algorithm with more efficiency and more functionality than both PRATT2 and TEIRESIAS, and discuss some of its applications to G protein-coupled receptors, a protein family of important drug targets. In this study, we designed and implemented six algorithms to mine three different pattern types from either one or two datasets using a pattern growth approach. We compared our approach to PRATT2 and TEIRESIAS in efficiency, completeness and the diversity of pattern types. Compared to PRATT2, our approach is faster, capable of processing large datasets and able to identify the so-called type III patterns. Our approach is comparable to TEIRESIAS in the discovery of the so-called type I patterns but has additional functionality such as mining the so-called type II and type III patterns and finding discriminating patterns between two datasets. The source code for pattern growth algorithms and their pseudo-code are available at http://www.liacs.nl/home/kosters/pg/.

  8. Circulation of Endemic Type 2 Vaccine-Derived Poliovirus in Egypt from 1983 to 1993

    PubMed Central

    Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

    2003-01-01

    From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5′ untranslated region (5′ UTR) and noncapsid-3′ UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide. PMID:12857906

  9. Generation of “LYmph Node Derived Antibody Libraries” (LYNDAL) for selecting fully human antibody fragments with therapeutic potential

    PubMed Central

    Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela AE; Krauss, Jürgen

    2014-01-01

    The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro, the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential. PMID:24256717

  10. Generation of “LYmph Node Derived Antibody Libraries” (LYNDAL) for selecting fully human antibody fragments with therapeutic potential.

    PubMed

    Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela A E; Krauss, Jürgen

    2014-01-01

    The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro, the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential.

  11. Genetic characterization of Strongyloides spp. from captive, semi-captive and wild Bornean orangutans (Pongo pygmaeus) in Central and East Kalimantan, Borneo, Indonesia.

    PubMed

    Labes, E M; Nurcahyo, W; Wijayanti, N; Deplazes, P; Mathis, A

    2011-09-01

    Orangutans (Pongo spp.), Asia's only great apes, are threatened in their survival due to habitat loss, hunting and infections. Nematodes of the genus Strongyloides may represent a severe cause of death in wild and captive individuals. In order to better understand which Strongyloides species/subspecies infect orangutans under different conditions, larvae were isolated from fecal material collected in Indonesia from 9 captive, 2 semi-captive and 9 wild individuals, 18 captive groups of Bornean orangutans and from 1 human working with wild orangutans. Genotyping was done at the genomic rDNA locus (part of the 18S rRNA gene and internal transcribed spacer 1, ITS1) by sequencing amplicons. Thirty isolates, including the one from the human, could be identified as S. fuelleborni fuelleborni with 18S rRNA gene identities of 98·5-100%, with a corresponding published sequence. The ITS1 sequences could be determined for 17 of these isolates revealing a huge variability and 2 main clusters without obvious pattern with regard to attributes of the hosts. The ITS1 amplicons of 2 isolates were cloned and sequenced, revealing considerable variability indicative of mixed infections. One isolate from a captive individual was identified as S. stercoralis (18S rRNA) and showed 99% identity (ITS1) with S. stercoralis sequences from geographically distinct locations and host species. The findings are significant with regard to the zoonotic nature of these parasites and might contribute to the conservation of remaining orangutan populations.

  12. Circulation of endemic type 2 vaccine-derived poliovirus in Egypt from 1983 to 1993.

    PubMed

    Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

    2003-08-01

    From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5' untranslated region (5' UTR) and noncapsid-3' UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide.

  13. Complex codon usage pattern and compositional features of retroviruses.

    PubMed

    RoyChoudhury, Sourav; Mukherjee, Debaprasad

    2013-01-01

    Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat to world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORF sequences from the 56 available retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have played a dominant role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes in spite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
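
    The relative synonymous codon usage (RSCU) measure mentioned in the abstract above is commonly defined as the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The following is a minimal sketch under that assumption; the function name `rscu` and the four-family table `SYNONYMOUS_FAMILIES` are hypothetical illustrations, not the authors' code, and the table covers only the amino acids of the four codons named in the abstract.

```python
from collections import Counter

# Hypothetical subset of synonymous codon families (standard genetic code),
# limited to the amino acids whose codons are named in the abstract.
SYNONYMOUS_FAMILIES = {
    "Lys": ["AAA", "AAG"],
    "Glu": ["GAA", "GAG"],
    "Gly": ["GGA", "GGC", "GGG", "GGT"],
    "Arg": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGT"],
}


def rscu(orf):
    """Return RSCU values for codons in families observed in an in-frame ORF."""
    orf = orf.upper()
    usable = len(orf) - len(orf) % 3  # drop any trailing partial codon
    counts = Counter(orf[i:i + 3] for i in range(0, usable, 3))
    values = {}
    for family in SYNONYMOUS_FAMILIES.values():
        total = sum(counts[codon] for codon in family)
        if total == 0:
            continue  # amino acid absent from this ORF; RSCU is undefined
        expected = total / len(family)  # count under equal synonymous usage
        for codon in family:
            values[codon] = counts[codon] / expected
    return values


if __name__ == "__main__":
    # Toy ORF with three lysine codons: AAA twice, AAG once.
    print(rscu("AAAAAAAAG"))
```

    A value above 1 indicates a codon used more often than expected under equal usage, and a value below 1 indicates the opposite.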

  14. Genome-Wide Methylome Analyses Reveal Novel Epigenetic Regulation Patterns in Schizophrenia and Bipolar Disorder

    PubMed Central

    Li, Yongsheng; Camarillo, Cynthia; Xu, Juan; Arana, Tania Bedard; Xiao, Yun; Zhao, Zheng; Chen, Hong; Ramirez, Mercedes; Zavala, Juan; Escamilla, Michael A.; Armas, Regina; Mendoza, Ricardo; Ontiveros, Alfonso; Nicolini, Humberto; Jerez Magaña, Alvaro Antonio; Rubin, Lewis P.; Li, Xia; Xu, Chun

    2015-01-01

    Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes in SZ and BP subjects to control, both in an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters, 3′-UTRs and 5′-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs), in flanking regions, and in CGI-sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGI hypermethylation and gene repression and (2) CGI 3′-shore hypomethylation and increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders. PMID:25734057

  15. Motor and cognitive stereotypies in the BTBR T+tf/J mouse model of autism

    PubMed Central

    Pearson, BL; Pobbe, RLH; Defensor, EB; Oasay, L; Bolivar, VJ; Blanchard, DC; Blanchard, RJ

    2010-01-01

    The BTBR T+tf/J inbred mouse strain displays a variety of persistent phenotypic alterations similar to those exhibited in autism spectrum disorders. The unique genetic background of the BTBR strain is thought to underlie its lack of reciprocal social interactions, elevated repetitive self-directed grooming and restricted exploratory behaviors. In order to clarify the existence, range and mechanisms of abnormal repetitive behaviors within BTBR mice, we performed detailed analyses of the microstructure of self-grooming patterns and noted increased overall grooming, higher percentages of interruptions in grooming bouts and a concomitant decrease in the proportion of incorrect sequence transitions compared to C57BL/6J inbred mice. Analyses of active phase home cage behavior also revealed an increase in stereotypic bar-biting behavior in the BTBR strain relative to B6 mice. Finally, in a novel object investigation task, BTBR mice exhibited greater baseline preference for specific unfamiliar objects as well as more patterned sequences of sequential investigations of those items. These results suggest that the repetitive, stereotyped behavior patterns of BTBR mice are relatively pervasive and reflect both motor and cognitive mechanisms. Furthermore, other pre-clinical mouse models of autism spectrum disorders may benefit from these more detailed analyses of stereotypic behavior. PMID:21040460

  16. Molecular Microbial Analysis of Lactobacillus Strains Isolated from the Gut of Calves for Potential Probiotic Use

    PubMed Central

    Soto, Lorena P.; Frizzo, Laureano S.; Bertozzi, Ezequiel; Avataneo, Elizabeth; Sequeira, Gabriel J.; Rosmini, Marcelo R.

    2010-01-01

    The intestinal microbiota has an influence on the growth and health status of the hosts. This is of particular interest in animals reared using intensive farming practices. Hence, it is necessary to know more about complexity of the beneficial intestinal microbiota. The use of molecular methods has revolutionized microbial identification by improving its quality and effectiveness. The specific aim of the study was to analyze predominant species of Lactobacillus in intestinal microbial ecosystem of young calves. Forty-two lactic acid bacteria (LAB) isolated from intestinal tract of young calves were characterized by: Amplified Ribosomal DNA Restriction Analysis (ARDRA), by using Hae III, Msp I, and Hinf I restriction enzymes, and 16S rDNA gene sequencing. ARDRA screening revealed nine unique patterns among 42 isolates, with the same pattern for 29 of the isolates. Gene fragments of 16S rDNA of 19 strains representing different patterns were sequenced to confirm the identification of these species. These results confirmed that ARDRA is a good tool for identification and discrimination of bacterial species isolated from complex ecosystem and between closely related groups. This paper provides information about the LAB species predominant in intestinal tract of young calves that could provide beneficial effects when administered as probiotic. PMID:20445780

  17. Microbial eukaryotic distributions and diversity patterns in a deep-sea methane seep ecosystem.

    PubMed

    Pasulka, Alexis L; Levin, Lisa A; Steele, Josh A; Case, David H; Landry, Michael R; Orphan, Victoria J

    2016-09-01

    Although chemosynthetic ecosystems are known to support diverse assemblages of microorganisms, the ecological and environmental factors that structure microbial eukaryotes (heterotrophic protists and fungi) are poorly characterized. In this study, we examined the geographic, geochemical and ecological factors that influence microbial eukaryotic composition and distribution patterns within Hydrate Ridge, a methane seep ecosystem off the coast of Oregon using a combination of high-throughput 18S rRNA tag sequencing, terminal restriction fragment length polymorphism fingerprinting, and cloning and sequencing of full-length 18S rRNA genes. Microbial eukaryotic composition and diversity varied as a function of substrate (carbonate versus sediment), activity (low activity versus active seep sites), sulfide concentration, and region (North versus South Hydrate Ridge). Sulfide concentration was correlated with changes in microbial eukaryotic composition and richness. This work also revealed the influence of oxygen content in the overlying water column and water depth on microbial eukaryotic composition and diversity, and identified distinct patterns from those previously observed for bacteria, archaea and macrofauna in methane seep ecosystems. Characterizing the structure of microbial eukaryotic communities in response to environmental variability is a key step towards understanding if and how microbial eukaryotes influence seep ecosystem structure and function. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  18. CodonLogo: a sequence logo-based viewer for codon patterns.

    PubMed

    Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

    2012-07-15

    Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
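
    The stack and symbol heights described in the abstract above can be sketched as follows, treating each codon as one symbol of a 64-letter alphabet. This is an illustrative sketch only, not the CodonLogo or WebLogo3 implementation: the function names `codon_columns` and `stack_heights` are hypothetical, the input is assumed to be in-frame, gap-free and of equal length, and the small-sample correction that logo tools commonly apply is omitted.

```python
import math
from collections import Counter


def codon_columns(aligned_seqs):
    """Split equal-length, in-frame aligned sequences into columns of codons."""
    length = len(aligned_seqs[0])
    if length % 3 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must share a length divisible by 3")
    return [[s[i:i + 3] for s in aligned_seqs] for i in range(0, length, 3)]


def stack_heights(column, alphabet_size=64):
    """Return (stack height in bits, per-codon heights) for one column.

    Stack height is the column's information content, log2(alphabet size)
    minus Shannon entropy; each codon's height is its frequency times that.
    """
    counts = Counter(column)
    total = sum(counts.values())
    freqs = {codon: n / total for codon, n in counts.items()}
    entropy = -sum(p * math.log2(p) for p in freqs.values())
    info = math.log2(alphabet_size) - entropy
    return info, {codon: p * info for codon, p in freqs.items()}


if __name__ == "__main__":
    # Toy alignment: first codon fully conserved, second split between two codons.
    alignment = ["ATGAAA", "ATGAAG", "ATGAAA", "ATGAAG"]
    for column in codon_columns(alignment):
        print(stack_heights(column))
```

    With a 64-letter alphabet a fully conserved codon column reaches 6 bits, whereas a nucleotide logo tops out at 2 bits per position, which is one way to see why codon-level and nucleotide-level conservation can look different.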

  19. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies

    PubMed Central

    Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity. PMID:27454301
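
    The polymorphism information content (PIC) values reported in the abstract above can be illustrated with a short calculation. The abstract does not say which formula was used, so this sketch assumes the widely used definition of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over pairs i<j of 2 * p_i^2 * p_j^2, where p_i are allele frequencies at one marker. The function name `pic` and the example frequencies are hypothetical.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for one marker (Botstein-style formula)."""
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction term summed over every unordered pair of distinct alleles.
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2))
    return 1 - homozygosity - cross


if __name__ == "__main__":
    # A biallelic SNP with equal allele frequencies gives the biallelic maximum.
    print(pic([0.5, 0.5]))
    # A skewed biallelic SNP is less informative.
    print(pic([0.9, 0.1]))
```

    Under this definition a biallelic marker cannot exceed 0.375, which is consistent with the SNP means in the abstract (0.15 to 0.30) being lower than the mean for the multi-allelic SSRs (0.42).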

  20. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    PubMed

    Gimode, Davis; Odeny, Damaris A; de Villiers, Etienne P; Wanyonyi, Solomon; Dida, Mathews M; Mneney, Emmarold E; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity.

  1. Microbial eukaryote diversity in the marine oxygen minimum zone off northern Chile.

    PubMed

    Parris, Darren J; Ganesh, Sangita; Edgcomb, Virginia P; DeLong, Edward F; Stewart, Frank J

    2014-01-01

    Molecular surveys are revealing diverse eukaryotic assemblages in oxygen-limited ocean waters. These communities may play pivotal ecological roles through autotrophy, feeding, and a wide range of symbiotic associations with prokaryotes. We used 18S rRNA gene sequencing to provide the first snapshot of pelagic microeukaryotic community structure in two cellular size fractions (0.2-1.6 μm, >1.6 μm) from seven depths through the anoxic oxygen minimum zone (OMZ) off northern Chile. Sequencing of >154,000 amplicons revealed contrasting patterns of phylogenetic diversity across size fractions and depths. Protist and total eukaryote diversity in the >1.6 μm fraction peaked at the chlorophyll maximum in the upper photic zone before declining by ~50% in the OMZ. In contrast, diversity in the 0.2-1.6 μm fraction, though also elevated in the upper photic zone, increased four-fold from the lower oxycline to a maximum at the anoxic OMZ core. Dinoflagellates of the Dinophyceae and endosymbiotic Syndiniales clades dominated the protist assemblage at all depths (~40-70% of sequences). Other protist groups varied with depth, with the anoxic zone community of the larger size fraction enriched in euglenozoan flagellates and acantharean radiolarians (up to 18 and 40% of all sequences, respectively). The OMZ 0.2-1.6 μm fraction was dominated (11-99%) by Syndiniales, which exhibited depth-specific variation in composition and total richness despite uniform oxygen conditions. Metazoan sequences, though confined primarily to the 1.6 μm fraction above the OMZ, were also detected within the anoxic zone where groups such as copepods increased in abundance relative to the oxycline and upper OMZ. These data, compared to those from other low-oxygen sites, reveal variation in OMZ microeukaryote composition, helping to identify clades with potential adaptations to oxygen-depletion.

  2. Significant strain accumulation between the deformation front and landward out-of-sequence thrusts in accretionary wedge of SW Taiwan revealed by cGPS and SAR interferometry

    NASA Astrophysics Data System (ADS)

    Tsai, M. C.

    2017-12-01

    High strain accumulation across the fold-and-thrust belt in southwestern Taiwan is revealed by continuous GPS (cGPS) and SAR interferometry. This high strain is generally accommodated by the major active structures in the fold-and-thrust belt of the Western Foothills in SW Taiwan, which is connected to the accretionary wedge in the incipient arc-continent collision zone. The active structures across the zone of high strain accumulation include the deformation front around the Tainan Tableland and the Hochiali, Hsiaokangshan, Fangshan and Chishan faults. Among these active structures, the deformation pattern revealed by cGPS and SAR interferometry suggests that the Fangshan transfer fault may be a left-lateral fault zone with a thrust component, accommodating the westward differential motion of thrust sheets on both sides of the fault. In addition, the Chishan fault is connected to the splay fault bordering the lower slope and upper slope of the accretionary wedge, and could be the major seismogenic fault and an out-of-sequence thrust fault in SW Taiwan. Large earthquakes resulting from the reactivation of out-of-sequence thrusts have been observed along the Nankai accretionary wedge; thus the assessment of the major seismogenic structures by strain accumulation between the frontal décollement and out-of-sequence thrusts is a crucial topic. According to the background seismicity, low seismicity and mid-crust to mantle events are observed inland and in the lower- and upper-slope domains offshore SW Taiwan, which rheologically implies that the upper crust of the accretionary wedge is more or less aseismic. This result may suggest that the excess fluid pressure from the accretionary wedge has not only significantly weakened the prism materials and the major fault zone, but also causes landward extension of the accretionary wedge, which is why low seismicity is observed in the SW Taiwan area. Key words: Continuous GPS, SAR interferometry, strain rate, out-of-sequence thrust.

  3. Microbial eukaryote diversity in the marine oxygen minimum zone off northern Chile

    PubMed Central

    Parris, Darren J.; Ganesh, Sangita; Edgcomb, Virginia P.; DeLong, Edward F.; Stewart, Frank J.

    2014-01-01

    Molecular surveys are revealing diverse eukaryotic assemblages in oxygen-limited ocean waters. These communities may play pivotal ecological roles through autotrophy, feeding, and a wide range of symbiotic associations with prokaryotes. We used 18S rRNA gene sequencing to provide the first snapshot of pelagic microeukaryotic community structure in two cellular size fractions (0.2–1.6 μm, >1.6 μm) from seven depths through the anoxic oxygen minimum zone (OMZ) off northern Chile. Sequencing of >154,000 amplicons revealed contrasting patterns of phylogenetic diversity across size fractions and depths. Protist and total eukaryote diversity in the >1.6 μm fraction peaked at the chlorophyll maximum in the upper photic zone before declining by ~50% in the OMZ. In contrast, diversity in the 0.2–1.6 μm fraction, though also elevated in the upper photic zone, increased four-fold from the lower oxycline to a maximum at the anoxic OMZ core. Dinoflagellates of the Dinophyceae and endosymbiotic Syndiniales clades dominated the protist assemblage at all depths (~40–70% of sequences). Other protist groups varied with depth, with the anoxic zone community of the larger size fraction enriched in euglenozoan flagellates and acantharean radiolarians (up to 18 and 40% of all sequences, respectively). The OMZ 0.2–1.6 μm fraction was dominated (11–99%) by Syndiniales, which exhibited depth-specific variation in composition and total richness despite uniform oxygen conditions. Metazoan sequences, though confined primarily to the >1.6 μm fraction above the OMZ, were also detected within the anoxic zone where groups such as copepods increased in abundance relative to the oxycline and upper OMZ. These data, compared to those from other low-oxygen sites, reveal variation in OMZ microeukaryote composition, helping to identify clades with potential adaptations to oxygen-depletion. PMID:25389417

  4. Molecular evolution of the HoxA cluster in the three major gnathostome lineages

    PubMed Central

    Chiu, Chi-hua; Amemiya, Chris; Dewar, Ken; Kim, Chang-Bae; Ruddle, Frank H.; Wagner, Günter P.

    2002-01-01

    The duplication of Hox clusters and their maintenance in a lineage has a prominent but little understood role in chordate evolution. Here we examined how Hox cluster duplication may influence changes in cluster architecture and patterns of noncoding sequence evolution. We sequenced the entire duplicated HoxAa and HoxAb clusters of zebrafish (Danio rerio) and extended the 5′ (posterior) part of the HoxM (HoxA-like) cluster of horn shark (Heterodontus francisci) containing the hoxa11 and hoxa13 orthologs as well as intergenic and flanking noncoding sequences. The duplicated HoxA clusters in zebrafish each house considerably fewer genes and are dramatically shorter than the single HoxA clusters of human and horn shark. We compared the intergenic sequences of the HoxA clusters of human, horn shark, zebrafish (Aa, Ab), and striped bass and found extensive conservation of noncoding sequence motifs, i.e., phylogenetic footprints, between the human and horn shark, representing two of the three gnathostome lineages. These are putative cis-regulatory elements that may play a role in the regulation of the ancestral HoxA cluster. In contrast, homologous regions of the duplicated HoxAa and HoxAb clusters of zebrafish and the HoxA cluster of striped bass revealed a striking loss of conservation of these putative cis-regulatory sequences in the 3′ (anterior) segment of the cluster, where zebrafish only retains single representatives of group 1, 3, 4, and 5 (HoxAa) and group 2 (HoxAb) genes and in the 5′ part of the clusters, where zebrafish retains two copies of the group 13, 11, and 9 genes, i.e., AbdB-like genes. In analyzing patterns of cis-sequence evolution in the 5′ part of the clusters, we explicitly looked for evidence of complementary loss of conserved noncoding sequences, as predicted by the duplication-degeneration-complementation model in which genetic redundancy after gene duplication is resolved because of the fixation of complementary degenerative mutations. Our data did not yield evidence supporting this prediction. We conclude that changes in the pattern of cis-sequence conservation after Hox cluster duplication are more consistent with being the outcome of adaptive modification rather than passive mechanisms that erode redundancy created by the duplication event. These results support the view that genome duplications may provide a mechanism whereby master control genes undergo radical modifications conducive to major alterations in body plan. Such genomic revolutions may contribute significantly to the evolutionary process. PMID:11943847

  5. Extensive variation at MHC DRB in the New Zealand sea lion (Phocarctos hookeri) provides evidence for balancing selection

    PubMed Central

    Osborne, A J; Zavodna, M; Chilvers, B L; Robertson, B C; Negro, S S; Kennedy, M A; Gemmell, N J

    2013-01-01

    Marine mammals are often reported to possess reduced variation of major histocompatibility complex (MHC) genes compared with their terrestrial counterparts. We evaluated diversity at two MHC class II B genes, DQB and DRB, in the New Zealand sea lion (Phocarctos hookeri, NZSL) a species that has suffered high mortality owing to bacterial epizootics, using Sanger sequencing and haplotype reconstruction, together with next-generation sequencing. Despite this species' prolonged history of small population size and highly restricted distribution, we demonstrate extensive diversity at MHC DRB with 26 alleles, whereas MHC DQB is dimorphic. We identify four DRB codons, predicted to be involved in antigen binding, that are evolving under adaptive evolution. Our data suggest diversity at DRB may be maintained by balancing selection, consistent with the role of this locus as an antigen-binding region and the species' recent history of mass mortality during a series of bacterial epizootics. Phylogenetic analyses of DQB and DRB sequences from pinnipeds and other carnivores revealed significant allelic diversity, but little phylogenetic depth or structure among pinniped alleles; thus, we could neither confirm nor refute the possibility of trans-species polymorphism in this group. The phylogenetic pattern observed however, suggests some significant evolutionary constraint on these loci in the recent past, with the pattern consistent with that expected following an epizootic event. These data may help further elucidate some of the genetic factors underlying the unusually high susceptibility to bacterial infection of the threatened NZSL, and help us to better understand the extent and pattern of MHC diversity in pinnipeds. PMID:23572124

  6. Effects of historical climate change, habitat connectivity, and vicariance on genetic structure and diversity across the range of the Red Tree Vole (Phenacomys longicaudus) in the Pacific Northwest United States

    USGS Publications Warehouse

    Miller, Mark P.; Bellinger, R.M.; Forsman, E.D.; Haig, Susan M.

    2006-01-01

    Phylogeographical analyses conducted in the Pacific Northwestern United States have often revealed concordant patterns of genetic diversity among taxa. These studies demonstrate distinct North/South genetic discontinuities that have been attributed to Pleistocene glaciation. We examined phylogeographical patterns of red tree voles (Phenacomys longicaudus) in western Oregon by analysing mitochondrial control region sequences for 169 individuals from 18 areas across the species' range. Cytochrome b sequences were also analysed from a subset of our samples to confirm the presence of major haplotype groups. Phylogenetic network analyses suggested the presence of two haplotype groups corresponding to northern and southern regions of P. longicaudus' range. Spatial genetic analyses (samova and Genetic Landscape Shapes) of control region sequences demonstrated a primary genetic discontinuity separating northern and southern sampling areas, while a secondary discontinuity separated northern sampling areas into eastern and western groups divided by the Willamette Valley. The North/South discontinuity likely corresponds to a region of secondary contact between lineages rather than an overt barrier. Although the Cordilleran ice sheet (maximum ~12 000 years ago) did not move southward to directly affect the region occupied by P. longicaudus, climate change during glaciation fragmented the forest landscape that it inhabits. Signatures of historical fragmentation were reflected by positive associations between latitude and variables such as Tajima's D and patterns associated with location-specific alleles. Genetic distances between southern sampling areas were smaller, suggesting that forest fragmentation was reduced in southern vs. northern regions.

  7. Molecular cloning, mRNA expression and tissue distribution analysis of Slc7a11 gene in alpaca (Lama paco) skins associated with different coat colors.

    PubMed

    Tian, Xue; Meng, Xiaolin; Wang, Liangyan; Song, Yunfei; Zhang, Danli; Ji, Yuankai; Li, Xuejun; Dong, Changsheng

    2015-01-25

    Slc7a11, encoding solute carrier family 7 member 11 (anionic amino acid transporter light chain, xCT), has been identified as a critical genetic regulator of pheomelanin synthesis in hair and melanocytes. To better understand the molecular characterization of Slc7a11 and its expression patterns in the skin of white versus brown alpaca (Lama paco), we cloned the full-length coding sequence (CDS) of the alpaca Slc7a11 gene and analyzed the expression patterns using Real Time PCR, Western blotting and immunohistochemistry. The full-length CDS of 1512 bp encodes a 503 amino acid polypeptide. Sequence analysis showed that alpaca xCT contains 12 transmembrane regions consistent with the highly conserved amino acid permease (AA_permease_2) domain, similar to other vertebrates. Sequence alignment and phylogenetic analysis revealed that alpaca xCT had the highest identity and shared the same branch with Camelus ferus. Real Time PCR and Western blotting suggested that xCT was expressed at significantly high levels in brown alpaca skin, and transcripts and protein possessed the same expression pattern in white and brown alpaca skins. Additionally, immunohistochemical analysis further demonstrated that xCT staining was robustly increased in the matrix and root sheath of brown alpaca skin compared with that of white. These results suggest that Slc7a11 functions in alpaca coat color regulation and offer essential information for further exploration on the role of Slc7a11 in melanogenesis. Copyright © 2014 Elsevier B.V. All rights reserved.

  8. Deep sequencing reveals distinct patterns of DNA methylation in prostate cancer.

    PubMed

    Kim, Jung H; Dhanasekaran, Saravana M; Prensner, John R; Cao, Xuhong; Robinson, Daniel; Kalyana-Sundaram, Shanker; Huang, Christina; Shankar, Sunita; Jing, Xiaojun; Iyer, Matthew; Hu, Ming; Sam, Lee; Grasso, Catherine; Maher, Christopher A; Palanisamy, Nallasivam; Mehra, Rohit; Kominsky, Hal D; Siddiqui, Javed; Yu, Jindan; Qin, Zhaohui S; Chinnaiyan, Arul M

    2011-07-01

    Beginning with precursor lesions, aberrant DNA methylation marks the entire spectrum of prostate cancer progression. We mapped the global DNA methylation patterns in select prostate tissues and cell lines using MethylPlex-next-generation sequencing (M-NGS). Hidden Markov model-based next-generation sequence analysis identified ∼68,000 methylated regions per sample. While global CpG island (CGI) methylation was not differential between benign adjacent and cancer samples, overall promoter CGI methylation significantly increased from ~12.6% in benign samples to 19.3% and 21.8% in localized and metastatic cancer tissues, respectively (P-value < 2 × 10⁻¹⁶). We found distinct patterns of promoter methylation around transcription start sites, where methylation occurred not only on the CGIs, but also on flanking regions and CGI sparse promoters. Among the 6691 methylated promoters in prostate tissues, 2481 differentially methylated regions (DMRs) are cancer-specific, including numerous novel DMRs. A novel cancer-specific DMR in the WFDC2 promoter showed frequent methylation in cancer (17/22 tissues, 6/6 cell lines), but not in the benign tissues (0/10) and normal PrEC cells. Integration of LNCaP DNA methylation and H3K4me3 data suggested an epigenetic mechanism for alternate transcription start site utilization, and these modifications segregated into distinct regions when present on the same promoter. Finally, we observed differences in repeat element methylation, particularly LINE-1, between ERG gene fusion-positive and -negative cancers, and we confirmed this observation using pyrosequencing on a tissue panel. This comprehensive methylome map will further our understanding of epigenetic regulation in prostate cancer progression.

  9. Movement initiation-locked activity of the anterior putamen predicts future movement instability in periodic bimanual movement.

    PubMed

    Aramaki, Yu; Haruno, Masahiko; Osu, Rieko; Sadato, Norihiro

    2011-07-06

    In periodic bimanual movements, anti-phase-coordinated patterns often change into in-phase patterns suddenly and involuntarily. Because behavior in the initial period of a sequence of cycles often does not show any obvious errors, it is difficult to predict subsequent movement errors in the later period of the cyclical sequence. Here, we evaluated performance in the later period of the cyclical sequence of bimanual periodic movements using human brain activity measured with functional magnetic resonance imaging as well as using initial movement features. Eighteen subjects performed a 30 s bimanual finger-tapping task. We calculated differences in initiation-locked transient brain activity between antiphase and in-phase tapping conditions. Correlation analysis revealed that the difference in the anterior putamen activity during antiphase compared with in-phase tapping conditions was strongly correlated with future instability as measured by the mean absolute deviation of the left-hand intertap interval during antiphase movements relative to in-phase movements (r = 0.81). Among the initial movement features we measured, only the number of taps to establish the antiphase movement pattern exhibited a significant correlation. However, the correlation coefficient of 0.60 was not high enough to predict the characteristics of subsequent movement. There was no significant correlation between putamen activity and initial movement features. It is likely that initiating unskilled difficult movements requires increased anterior putamen activity, and this activity increase may facilitate the initiation of movement via the basal ganglia-thalamocortical circuit. Our results suggest that initiation-locked transient activity of the anterior putamen can be used to predict future motor performance.

  10. [Clinical value of MRI united-sequences examination in diagnosis and differentiation of morphological sub-type of hilar and extrahepatic big bile duct cholangiocarcinoma].

    PubMed

    Yin, Long-Lin; Song, Bin; Guan, Ying; Li, Ying-Chun; Chen, Guang-Wen; Zhao, Li-Ming; Lai, Li

    2014-09-01

    To investigate MRI features and associated histological and pathological changes of hilar and extrahepatic big bile duct cholangiocarcinoma with different morphological sub-types, and its value in differentiating between nodular cholangiocarcinoma (NCC) and intraductal growing cholangiocarcinoma (IDCC). Imaging data of 152 patients with pathologically confirmed hilar and extrahepatic big bile duct cholangiocarcinoma were reviewed, which included 86 periductal infiltrating cholangiocarcinoma (PDCC), 55 NCC, and 11 IDCC. Imaging features of the three morphological sub-types were compared. Each of the subtypes demonstrated its unique imaging features. Significant differences (P < 0.05) were found between NCC and IDCC in tumor shape, dynamic enhanced pattern, enhancement degree during equilibrium phase, multiplicity or singleness of tumor, changes in wall and lumen of bile duct at the tumor-bearing segment, dilatation of the bile duct upstream or downstream of the tumor, and invasion of adjacent organs. Imaging features reveal tumor growth patterns of hilar and extrahepatic big bile duct cholangiocarcinoma. MRI united-sequences examination can accurately describe those imaging features for differential diagnosis.

  11. Molecular phylogeny and larval morphological diversity of the lanternfish genus Hygophum (Teleostei: Myctophidae).

    PubMed

    Yamaguchi, M; Miya, M; Okiyama, M; Nishida, M

    2000-04-01

    Larvae of the deep-sea lanternfish genus Hygophum (Myctophidae) exhibit a remarkable morphological diversity that is quite unexpected, considering their homogeneous adult morphology. In an attempt to elucidate the evolutionary patterns of such larval morphological diversity, nucleotide sequences of a portion of the mitochondrially encoded 16S ribosomal RNA gene were determined for seven Hygophum species and three outgroup taxa. Secondary structure-based alignment resulted in a character matrix consisting of 1172 bp of unambiguously aligned sequences, which were subjected to phylogenetic analyses using maximum-parsimony, maximum-likelihood, and neighbor-joining methods. The resultant tree topologies from the three methods were congruent, with most nodes, including that of the genus Hygophum, being strongly supported by various tree statistics. The most parsimonious reconstruction of the three previously recognized, distinct larval morphs onto the molecular phylogeny revealed that one of the morphs had originated as the common ancestor of the genus, the other two having diversified separately in two subsequent major clades. The patterns of such diversification are discussed in terms of the unusual larval eye morphology and geographic distribution. Copyright 2000 Academic Press.

  12. A novel homozygous variant in the SMOC1 gene underlying Waardenburg anophthalmia syndrome.

    PubMed

    Ullah, Asmat; Umair, Muhammad; Ahmad, Farooq; Muhammad, Dost; Basit, Sulman; Ahmad, Wasim

    2017-01-01

    Waardenburg anophthalmia syndrome (WAS), also known as ophthalmo-acromelic syndrome or anophthalmia-syndactyly, is a rare congenital disorder that segregates in an autosomal recessive pattern. Clinical features of the syndrome include malformation of the eyes and the skeleton. Mostly, WAS is caused by mutations in the SMOC1 gene. The present report describes a large consanguineous family of Pakistani origin segregating Waardenburg anophthalmia syndrome in an autosomal recessive pattern. Genotyping followed by Sanger sequencing was performed to search for a candidate gene. SNP genotyping using Affymetrix GeneChip Human Mapping 250K Nsp array established a single homozygous region among affected members on chromosome 14q23.1-q24.3 harboring the SMOC1 gene. Sequencing of the gene revealed a novel homozygous missense mutation (c.812G>A; p.Cys271Tyr) in the family. This is the first report of Waardenburg anophthalmia syndrome caused by a SMOC1 variant in a Pakistani population. The mutation identified in the present investigation extends the body of evidence implicating the gene SMOC1 in causing WAS.

  13. Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.)

    PubMed Central

    Zhao, Jie

    2010-01-01

    Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940

  14. Characterization of the molecular features and expression patterns of two serine proteases in Hermetia illucens (Diptera: Stratiomyidae) larvae.

    PubMed

    Kim, Wontae; Bae, Sungwoo; Kim, Ayoung; Park, Kwanho; Lee, Sangbeom; Choi, Youngcheol; Han, Sangmi; Park, Younghan; Koh, Youngho

    2011-06-01

    To investigate the molecular scavenging capabilities of the larvae of Hermetia illucens, two serine proteases (SPs) were cloned and characterized. Multiple sequence alignments and phylogenetic tree analysis of the deduced amino acid sequences of Hi-SP1 and Hi-SP2 suggested that Hi-SP1 may be a chymotrypsin-like and Hi-SP2 a trypsin-like protease. Hi-SP1 and Hi-SP2 3-D homology models revealed that a catalytic triad, three disulfide bonds, and a substrate-binding pocket were highly conserved, as would be expected of an SP. E. coli-expressed Hi-SP1 and Hi-SP2 showed chymotrypsin and trypsin activities, respectively. Hi-SP2 mRNAs were consistently expressed during larval development. In contrast, the expression of Hi-SP1 mRNA fluctuated between feeding and molting stages and disappeared at the pupal stages. These expression pattern differences suggest that Hi-SP1 may be a larval-specific chymotrypsin-like protease involved with food digestion, while Hi-SP2 may be a trypsin-like protease with diverse functions at different stages.

  15. Of mice and (Viking?) men: phylogeography of British and Irish house mice.

    PubMed

    Searle, Jeremy B; Jones, Catherine S; Gündüz, Islam; Scascitelli, Moira; Jones, Eleanor P; Herman, Jeremy S; Rambau, R Victor; Noble, Leslie R; Berry, R J; Giménez, Mabel D; Jóhannesdóttir, Fríða

    2009-01-22

    The west European subspecies of house mouse (Mus musculus domesticus) has gained much of its current widespread distribution through commensalism with humans. This means that the phylogeography of M. m. domesticus should reflect patterns of human movements. We studied restriction fragment length polymorphism (RFLP) and DNA sequence variations in mouse mitochondrial (mt) DNA throughout the British Isles (328 mice from 105 localities, including previously published data). There is a major mtDNA lineage revealed by both RFLP and sequence analyses, which is restricted to the northern and western peripheries of the British Isles, and also occurs in Norway. This distribution of the 'Orkney' lineage fits well with the sphere of influence of the Norwegian Vikings and was probably generated through inadvertent transport by them. To form viable populations, house mice would have required large human settlements such as the Norwegian Vikings founded. The other parts of the British Isles (essentially most of mainland Britain) are characterized by house mice with different mtDNA sequences, some of which are also found in Germany, and which probably reflect both Iron Age movements of people and mice and earlier development of large human settlements. MtDNA studies on house mice have the potential to reveal novel aspects of human history.

  16. Of mice and (Viking?) men: phylogeography of British and Irish house mice

    PubMed Central

    Searle, Jeremy B.; Jones, Catherine S.; Gündüz, İslam; Scascitelli, Moira; Jones, Eleanor P.; Herman, Jeremy S.; Rambau, R. Victor; Noble, Leslie R.; Berry, R.J.; Giménez, Mabel D.; Jóhannesdóttir, Fríða

    2008-01-01

    The west European subspecies of house mouse (Mus musculus domesticus) has gained much of its current widespread distribution through commensalism with humans. This means that the phylogeography of M. m. domesticus should reflect patterns of human movements. We studied restriction fragment length polymorphism (RFLP) and DNA sequence variations in mouse mitochondrial (mt) DNA throughout the British Isles (328 mice from 105 localities, including previously published data). There is a major mtDNA lineage revealed by both RFLP and sequence analyses, which is restricted to the northern and western peripheries of the British Isles, and also occurs in Norway. This distribution of the ‘Orkney’ lineage fits well with the sphere of influence of the Norwegian Vikings and was probably generated through inadvertent transport by them. To form viable populations, house mice would have required large human settlements such as the Norwegian Vikings founded. The other parts of the British Isles (essentially most of mainland Britain) are characterized by house mice with different mtDNA sequences, some of which are also found in Germany, and which probably reflect both Iron Age movements of people and mice and earlier development of large human settlements. MtDNA studies on house mice have the potential to reveal novel aspects of human history. PMID:18826939

  17. Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element.

    PubMed

    Watanabe, Kenneth A; Homayouni, Arielle; Gu, Lingkun; Huang, Kuan-Ying; Ho, Tuan-Hua David; Shen, Qingxi J

    2017-09-01

    Seeds serve as a great model to study plant responses to drought stress, which is largely mediated by abscisic acid (ABA). The ABA responsive element (ABRE) is a key cis-regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA-induced genes in rice aleurone cells, suggesting other ABREs may exist. To identify novel ABREs, RNA sequencing was performed on aleurone cells of rice seeds treated with 20 μM ABA. Gibbs sampling was used to identify enriched elements, and particle bombardment-mediated transient expression studies were performed to verify the function. Gene ontology analysis was performed to predict the roles of genes containing the novel ABREs. This study revealed 2443 ABA-inducible genes and a novel ABRE, designated as ABREN, which was experimentally verified to mediate ABA signalling in rice aleurone cells. Many of the ABREN-containing genes are predicted to be involved in stress responses and transcription. Analysis of other species suggests that the ABREN may be monocot specific. This study also revealed interesting expression patterns of genes involved in ABA metabolism and signalling. Collectively, this study advanced our understanding of diverse cis-regulatory sequences and the transcriptomes underlying ABA responses in rice aleurone cells. © 2017 John Wiley & Sons Ltd.

  18. Mitochondrial genomes reveal recombination in the presumed asexual Fusarium oxysporum species complex.

    PubMed

    Brankovics, Balázs; van Dam, Peter; Rep, Martijn; de Hoog, G Sybren; van der Lee, Theo A J; Waalwijk, Cees; van Diepeningen, Anne D

    2017-09-18

    The Fusarium oxysporum species complex (FOSC) contains several phylogenetic lineages. Phylogenetic studies identified two to three major clades within the FOSC. The mitochondrial sequences are highly informative phylogenetic markers, but have been mostly neglected due to technical difficulties. A total of 61 complete mitogenomes of FOSC strains were de novo assembled and annotated. Length variations and intron patterns support the separation of three phylogenetic species. The variable region of the mitogenome that is typical for the genus Fusarium shows two new variants in the FOSC. The variant typical for Fusarium is found in members of all three clades, while variant 2 is found in clades 2 and 3 and variant 3 only in clade 2. The extended set of loci, analyzed using a new implementation of the genealogical concordance species recognition method, supports the identification of three phylogenetic species within the FOSC. Comparative analysis of the mitogenomes in the FOSC revealed ongoing mitochondrial recombination within, but not between, phylogenetic species. The recombination indicates the presence of a parasexual cycle in F. oxysporum. The obstacles hindering the usage of the mitogenomes are resolved by using next generation sequencing and selective genome assemblers, such as GRAbB. Complete mitogenome sequences offer a stable basis and reference point for phylogenetic and population genetic studies.

  19. Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human palaeogenomics.

    PubMed

    García-Garcerà, Marc; Gigli, Elena; Sanchez-Quinto, Federico; Ramirez, Oscar; Calafell, Francesc; Civit, Sergi; Lalueza-Fox, Carles

    2011-01-01

    Despite the successful retrieval of genomes from past remains, the prospects for human palaeogenomics remain unclear because of the difficulty of distinguishing contaminant from endogenous DNA sequences. Previous sequence data generated on high-throughput sequencing platforms indicate that fragmentation of ancient DNA sequences is a characteristic trait primarily arising due to depurination processes that create abasic sites leading to DNA breaks. METHODOLOGY/PRINCIPAL FINDINGS: To investigate whether this pattern is present in ancient remains from a temperate environment, we have 454-FLX pyrosequenced different samples dated between 5,500 and 49,000 years ago: a bone from an extinct goat (Myotragus balearicus) that was treated with a depurinating agent (bleach), an Iberian lynx bone not subjected to any treatment, a human Neolithic sample from Barcelona (Spain), and a Neandertal sample from the El Sidrón site (Asturias, Spain). The efficiency of retrieval of endogenous sequences is below 1% in all cases. We have used the non-human samples to identify human sequences (0.35 and 1.4%, respectively) that we positively know are contaminants. We observed that bleach treatment appears to create a depurination-associated fragmentation pattern in resulting contaminant sequences that is indistinguishable from previously described endogenous sequences. Furthermore, the nucleotide composition pattern observed in 5' and 3' ends of contaminant sequences is much more complex than the flat pattern previously described in some Neandertal contaminants. Although much research on samples with known contaminant histories is needed, our results suggest that endogenous and contaminant sequences cannot be distinguished by the fragmentation pattern alone.

  20. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences.

    PubMed

    Chen, Zhuo; Xu, Shixia; Zhou, Kaiya; Yang, Guang

    2011-10-27

    A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Monodontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Monodontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future.

  1. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences

    PubMed Central

    2011-01-01

    Background: A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be examined with more markers in an effort to resolve the remaining portions of the phylogeny. Results: An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Monodontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Monodontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Conclusions: Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future. PMID:22029548

  2. Conflicting patterns of genetic structure produced by nuclear and mitochondrial markers in the Oregon Slender Salamander (Batrachoseps wrighti): implications for conservation efforts and species management

    USGS Publications Warehouse

    Miller, Mark; Haig, Susan M.; Wagner, R.S.

    2005-01-01

    Endemic to Oregon in the northwestern US, the Oregon slender salamander (Batrachoseps wrighti) is a terrestrial plethodontid found associated with late successional mesic forests. Consequently, forest management practices such as timber harvesting may impact their persistence. Therefore, to infer possible future effects of these practices on population structure and differentiation, we used mitochondrial DNA sequences (cytochrome b) and RAPD markers to analyze 22 populations across their range. Phylogenetic analyses of sequence data (774 bp) revealed two historical lineages corresponding to northern and southern-distributed populations. Relationships among haplotypes and haplotype diversity within lineages suggested that the northern region may have more recently been colonized compared to the southern region. In contrast to the mitochondrial data, analyses of 46 RAPD loci suggested an overall pattern of isolation-by-distance in the set of populations examined and no particularly strong clustering of populations based on genetic distances. We propose two non-exclusive hypotheses to account for discrepancies between mitochondrial and nuclear data sets. First, our data may reflect an overall ancestral pattern of isolation-by-distance that has subsequently been influenced by vicariance. Alternately, our analyses may suggest that male-mediated gene flow and female philopatry are important contributors to the pattern of genetic diversity. We discuss the importance of distinguishing between these two hypotheses for the purposes of identifying conservation units and note that, regardless of the relative contribution of each mechanism towards the observed pattern of diversity, protection of habitat will likely prove critical for the long-term persistence of this species.

  3. Chromosomal structures and repetitive sequences divergence in Cucumis species revealed by comparative cytogenetic mapping.

    PubMed

    Zhang, Yunxia; Cheng, Chunyan; Li, Ji; Yang, Shuqiong; Wang, Yunzhu; Li, Ziang; Chen, Jinfeng; Lou, Qunfeng

    2015-09-25

    Differentiation and copy number of repetitive sequences directly affect chromosome structure, which contributes to reproductive isolation and speciation. Comparative cytogenetic mapping has proven to be an efficient tool to elucidate the differentiation and distribution of repetitive sequences in genomes. In the present study, the distinct chromosomal structures of five Cucumis species were revealed through the genomic in situ hybridization (GISH) technique and comparative cytogenetic mapping of major satellite repeats. Chromosome structures of five Cucumis species were investigated using GISH and comparative mapping of specific satellites. Southern hybridization was employed to study the proliferation of satellites, whose structural characteristics were helpful for analyzing chromosome evolution. Preferential distribution of repetitive DNAs at the subtelomeric regions was found in C. sativus, C. hystrix and C. metuliferus, while the majority were positioned at the pericentromeric heterochromatin regions in C. melo and C. anguria. Further, comparative GISH (cGISH) using genomic DNA of other species as probes revealed high homology of repeats between C. sativus and C. hystrix. Specific satellites including 45S rDNA, Type I/II, Type III, Type IV, CentM and telomeric repeat were then comparatively mapped in these species. Type I/II and Type IV produced bright signals at the subtelomeric regions of C. sativus and C. hystrix simultaneously, which might explain the significance of their amplification in the divergence of Cucumis subgenus from the ancient ancestor. Unique positioning of Type III and CentM only at the centromeric domains of C. sativus and C. melo, respectively, combined with unique Southern bands, revealed rapid evolutionary patterns of centromeric DNA in Cucumis. Obvious interstitial telomeric repeats were observed in chromosomes 1 and 2 of C. sativus, which might provide evidence of the fusion hypothesis of chromosome evolution from x = 12 to x = 7 in Cucumis species. In addition, a significant correlation was found between gene density along the chromosome and GISH band intensity in C. sativus and C. melo. In summary, comparative cytogenetic mapping of major satellites and GISH revealed the distinct differentiation of chromosome structure during species formation. The evolution of repetitive sequences was the main force for the divergence of Cucumis species from a common ancestor.

  4. "Change deafness" arising from inter-feature masking within a single auditory object.

    PubMed

    Barascud, Nicolas; Griffiths, Timothy D; McAlpine, David; Chait, Maria

    2014-03-01

    Our ability to detect prominent changes in complex acoustic scenes depends not only on the ear's sensitivity but also on the capacity of the brain to process competing incoming information. Here, employing a combination of psychophysics and magnetoencephalography (MEG), we investigate listeners' sensitivity in situations when two features belonging to the same auditory object change in close succession. The auditory object under investigation is a sequence of tone pips characterized by a regularly repeating frequency pattern. Signals consisted of an initial, regularly alternating sequence of three short (60 msec) pure tone pips (in the form ABCABC…) followed by a long pure tone with a frequency that is either expected based on the on-going regular pattern ("LONG-expected") or constitutes a pattern violation ("LONG-unexpected"). The change in LONG-expected is manifest as a change in duration (when the long pure tone exceeds the established duration of a tone pip), whereas the change in LONG-unexpected is manifest as a change in both the frequency pattern and the duration. Our results reveal a form of "change deafness," in that although changes in both the frequency pattern and the expected duration appear to be processed effectively by the auditory system (cortical signatures of both changes are evident in the MEG data), listeners often fail to detect changes in the frequency pattern when that change is closely followed by a change in duration. By systematically manipulating the properties of the changing features and measuring behavioral and MEG responses, we demonstrate that feature changes within the same auditory object, which occur close together in time, appear to compete for perceptual resources.

  5. Contrasting elevational diversity patterns for soil bacteria between two ecosystems divided by the treeline.

    PubMed

    Li, Guixiang; Xu, Guorui; Shen, Congcong; Tang, Yong; Zhang, Yuxin; Ma, Keming

    2016-11-01

    Above- and below-ground organisms are closely linked, but how the elevational distribution pattern of soil microbes shifts across the treeline remains unknown. Sampling 140 plots along a transect, we investigated the soil bacterial distribution pattern from a temperate forest up to a subalpine meadow along an elevational gradient using Illumina sequencing. Our results revealed distinct elevational patterns of bacterial diversity above and below the treeline in response to changes in soil conditions: a hollow elevational pattern in the forest (correlated with soil temperature, pH, and C:N ratio) and a significantly decreasing pattern in the meadow (correlated with soil pH and available phosphorus). The bacterial community structure was also distinct between the forest and meadow, relating to soil pH in the forest and soil temperature in the meadow. Soil bacteria did not follow the distribution pattern of herb diversity, but bacterial community structure could be predicted by herb community composition. These results suggest that plant communities have an important influence on soil characteristics, and thus change the elevational distribution of soil bacteria. Our findings are useful for future assessments of climate change impacts on microbial communities.

  6. Genetic Basis of Melanin Pigmentation in Butterfly Wings

    PubMed Central

    Zhang, Linlin; Martin, Arnaud; Perry, Michael W.; van der Burg, Karin R. L.; Matsuoka, Yuji; Monteiro, Antónia; Reed, Robert D.

    2017-01-01

    Despite the variety, prominence, and adaptive significance of butterfly wing patterns, surprisingly little is known about the genetic basis of wing color diversity. Even though there is intense interest in wing pattern evolution and development, the technical challenge of genetically manipulating butterflies has slowed efforts to functionally characterize color pattern development genes. To identify candidate wing pigmentation genes, we used RNA sequencing to characterize transcription across multiple stages of butterfly wing development, and between different color pattern elements, in the painted lady butterfly Vanessa cardui. This allowed us to pinpoint genes specifically associated with red and black pigment patterns. To test the functions of a subset of genes associated with presumptive melanin pigmentation, we used clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 genome editing in four different butterfly genera. pale, Ddc, and yellow knockouts displayed reduction of melanin pigmentation, consistent with previous findings in other insects. Interestingly, however, yellow-d, ebony, and black knockouts revealed that these genes have localized effects on tuning the color of red, brown, and ochre pattern elements. These results point to previously undescribed mechanisms for modulating the color of specific wing pattern elements in butterflies, and provide an expanded portrait of the insect melanin pathway. PMID:28193726

  7. The neural correlates of implicit sequence learning in schizophrenia.

    PubMed

    Marvel, Cherie L; Turner, Beth M; O'Leary, Daniel S; Johnson, Hans J; Pierson, Ronald K; Ponto, Laura L Boles; Andreasen, Nancy C

    2007-11-01

    Twenty-seven schizophrenia spectrum patients and 25 healthy controls performed a probabilistic version of the serial reaction time task (SRT) that included sequence trials embedded within random trials. Patients showed diminished, yet measurable, sequence learning. Postexperimental analyses revealed that a group of patients performed above chance when generating short spans of the sequence. This high-generation group showed SRT learning that was similar in magnitude to that of controls. Their learning was evident from the very 1st block; however, unlike controls, learning did not develop further with continued testing. A subset of 12 patients and 11 controls performed the SRT in conjunction with positron emission tomography. High-generation performance, which corresponded to SRT learning in patients, correlated to activity in the premotor cortex and parahippocampus. These areas have been associated with stimulus-driven visuospatial processing. Taken together, these results suggest that a subset of patients who showed moderate success on the SRT used an explicit stimulus-driven strategy to process the sequential stimuli. This adaptive strategy facilitated sequence learning but may have interfered with conventional implicit learning of the overall stimulus pattern. PsycINFO Database Record (c) 2007 APA, all rights reserved.

  8. Taxonomic evaluation of selected Ganoderma species and database sequence validation

    PubMed Central

    Jargalmaa, Suldbold; Eimes, John A.; Park, Myung Soo; Park, Jae Young; Oh, Seung-Yoon

    2017-01-01

    Species in the genus Ganoderma include several ecologically important and pathogenic fungal species whose medicinal and economic value is substantial. Due to the highly similar morphological features within the Ganoderma, identification of species has relied heavily on DNA sequencing using BLAST searches, which are only reliable if the GenBank submissions are accurately labeled. In this study, we examined 113 specimens collected from 1969 to 2016 from various regions in Korea using morphological features and multigene analysis (internal transcribed spacer, translation elongation factor 1-α, and the second largest subunit of RNA polymerase II). These specimens were identified as four Ganoderma species: G. sichuanense, G. cf. adspersum, G. cf. applanatum, and G. cf. gibbosum. With the exception of G. sichuanense, these species were difficult to distinguish based solely on morphological features. However, phylogenetic analysis at three different loci yielded concordant phylogenetic information, and supported the four species distinctions with high bootstrap support. A survey of over 600 Ganoderma sequences available on GenBank revealed that 65% of sequences were either misidentified or ambiguously labeled. Here, we suggest corrected annotations for GenBank sequences based on our phylogenetic validation and provide updated global distribution patterns for these Ganoderma species. PMID:28761785

  9. Genetic diversity assessment of anoxygenic photosynthetic bacteria by distance-based grouping analysis of pufM sequences.

    PubMed

    Zeng, Y H; Chen, X H; Jiao, N Z

    2007-12-01

    This study assessed how completely the diversity of anoxygenic phototrophic bacteria (APB) has been sampled in natural environments. All nucleotide sequences of the APB marker gene pufM from cultures and environmental clones were retrieved from the GenBank database. A set of cutoff values (sequence distances 0.06, 0.15 and 0.48 for species, genus, and (sub)phylum levels, respectively) was established using a distance-based grouping program. Analysis of the environmental clones revealed that current efforts on APB isolation and sampling in natural environments are largely inadequate. Analysis of the average distance between each identified genus and an uncultured environmental pufM sequence indicated that the majority of cultured APB genera lack environmental representatives. The distance-based grouping method is fast and efficient for bulk analysis of functional gene sequences. The results clearly show that we are at a relatively early stage in sampling the global richness of APB species. Periodic assessment will facilitate in-depth analysis of the potential biogeographical distribution patterns of APB. This is the first attempt to assess the present understanding of APB diversity in natural environments. The method used is also useful for assessing the diversity of other functional genes.
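
    The idea of grouping sequences at fixed distance cutoffs can be illustrated with a minimal sketch. It is not the program used in the study: it assumes pre-aligned sequences of equal length, an uncorrected p-distance, and single-linkage grouping, and the names p_distance and group_by_cutoff are hypothetical. Only the cutoff values (0.06, 0.15, 0.48) come from the abstract.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def group_by_cutoff(seqs: dict[str, str], cutoff: float) -> list[set[str]]:
    """Single-linkage grouping (an assumption of this sketch): two sequences
    share a group if a chain of pairwise distances <= cutoff connects them."""
    parent = {name: name for name in seqs}

    def find(x: str) -> str:
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if p_distance(seqs[a], seqs[b]) <= cutoff:
            parent[find(a)] = find(b)

    groups: dict[str, set[str]] = {}
    for name in seqs:
        groups.setdefault(find(name), set()).add(name)
    return list(groups.values())


# Toy, made-up sequences; real input would be aligned pufM sequences.
toy = {"a": "ACGTACGTAC", "b": "ACGTACGTAT", "c": "TTTTTTTTTT"}
for level, cutoff in [("species", 0.06), ("genus", 0.15), ("(sub)phylum", 0.48)]:
    print(level, len(group_by_cutoff(toy, cutoff)))
```

    Counting the groups at each cutoff gives a rough richness estimate per taxonomic level; a different linkage rule or a corrected distance would change the group counts.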

  10. Progressive Recruitment of Mesenchymal Progenitors Reveals a Time-Dependent Process of Cell Fate Acquisition in Mouse and Human Nephrogenesis.

    PubMed

    Lindström, Nils O; De Sena Brandine, Guilherme; Tran, Tracy; Ransick, Andrew; Suh, Gio; Guo, Jinjin; Kim, Albert D; Parvez, Riana K; Ruffins, Seth W; Rutledge, Elisabeth A; Thornton, Matthew E; Grubbs, Brendan; McMahon, Jill A; Smith, Andrew D; McMahon, Andrew P

    2018-06-04

    Mammalian nephrons arise from a limited nephron progenitor pool through a reiterative inductive process extending over days (mouse) or weeks (human) of kidney development. Here, we present evidence that human nephron patterning reflects a time-dependent process of recruitment of mesenchymal progenitors into an epithelial nephron precursor. Progressive recruitment predicted from high-resolution image analysis and three-dimensional reconstruction of human nephrogenesis was confirmed through direct visualization and cell fate analysis of mouse kidney organ cultures. Single-cell RNA sequencing of the human nephrogenic niche provided molecular insights into these early patterning processes and predicted developmental trajectories adopted by nephron progenitor cells in forming segment-specific domains of the human nephron. The temporal-recruitment model for nephron polarity and patterning suggested by direct analysis of human kidney development provides a framework for integrating signaling pathways driving mammalian nephrogenesis. Copyright © 2018 Elsevier Inc. All rights reserved.

  11. Effect of migration patterns on maternal genetic structure: a case of Tai-Kadai migration from China to Thailand.

    PubMed

    Kampuansai, Jatupol; Kutanan, Wibhu; Tassi, Francesca; Kaewgahya, Massupa; Ghirotto, Silvia; Kangwanpong, Daoroong

    2017-02-01

    The migration of the Tai-Kadai speaking people from southern China to northern Thailand over the past hundreds of years has revealed numerous patterns that have likely been influenced by routes, purposes and periods of time. To study the effects of different migration patterns on Tai-Kadai maternal genetic structure, mitochondrial DNA hypervariable region I sequences from the Yong and the Lue people having well-documented histories in northern Thailand were analyzed. Although the Yong and Lue people were historically close relatives who shared Xishuangbanna Dai ancestors, significant genetic differences have been observed among them. The Yong people who have been known to practice mass migration have exhibited a closer genetic affinity to their Dai ancestors than have the Lue people. Genetic heterogeneity and a sudden reduced effective population size within the Lue group is likely a direct result of the circumstances of the founder effect.

  12. Dynamic gene expression of Lin-28 during embryonic development in mouse and chicken.

    PubMed

    Yokoyama, Shigetoshi; Hashimoto, Megumi; Shimizu, Hirohito; Ueno-Kudoh, Hiroe; Uchibe, Kenta; Kimura, Ichiro; Asahara, Hiroshi

    2008-02-01

    The Caenorhabditis elegans heterochronic gene lin-28 regulates developmental timing in the nematode trunk. We report the dynamic expression patterns of Lin-28 homologues in mouse and chick embryos. Whole mount in situ hybridization revealed specific and intriguing expression patterns of Lin-28 in the developing mouse and chick limb bud. Mouse Lin-28 expression was detected in both the forelimb and hindlimb at E9.5, but disappeared from the forelimb at E10.5, and finally from the forelimb and hindlimb at E11.5. Chicken Lin-28, which was first detected in the limb primordium at stage 15/16, was also downregulated as the stage proceeded. The amino acid sequences of mouse and chicken Lin-28 genes are highly conserved and the similar expression patterns of Lin-28 during limb development in mouse and chicken suggest that this heterochronic gene is also conserved during vertebrate limb development.

  13. The Oncoprotein BRD4-NUT Generates Aberrant Histone Modification Patterns.

    PubMed

    Zee, Barry M; Dibona, Amy B; Alekseyenko, Artyom A; French, Christopher A; Kuroda, Mitzi I

    2016-01-01

    Defects in chromatin proteins frequently manifest in diseases. A striking case of a chromatin-centric disease is NUT-midline carcinoma (NMC), which is characterized by expression of NUT as a fusion partner most frequently with BRD4. ChIP-sequencing studies from NMC patients revealed that BRD4-NUT (B4N) covers large genomic regions and elevates transcription within these domains. To investigate how B4N modulates chromatin, we performed affinity purification of B4N when ectopically expressed in 293-TREx cells and quantified the associated histone posttranslational modifications (PTM) using proteomics. We observed significant enrichment of acetylation particularly on H3 K18 and of combinatorial patterns such as H3 K27 acetylation paired with K36 methylation. We postulate that B4N complexes override the preexisting histone code with new PTM patterns that reflect aberrant transcription and that epigenetically modulate the nucleosome environment toward the NMC state.

  14. The Oncoprotein BRD4-NUT Generates Aberrant Histone Modification Patterns

    PubMed Central

    Zee, Barry M.; Dibona, Amy B.; Alekseyenko, Artyom A.; French, Christopher A.; Kuroda, Mitzi I.

    2016-01-01

    Defects in chromatin proteins frequently manifest in diseases. A striking case of a chromatin-centric disease is NUT-midline carcinoma (NMC), which is characterized by expression of NUT as a fusion partner most frequently with BRD4. ChIP-sequencing studies from NMC patients revealed that BRD4-NUT (B4N) covers large genomic regions and elevates transcription within these domains. To investigate how B4N modulates chromatin, we performed affinity purification of B4N when ectopically expressed in 293-TREx cells and quantified the associated histone posttranslational modifications (PTM) using proteomics. We observed significant enrichment of acetylation particularly on H3 K18 and of combinatorial patterns such as H3 K27 acetylation paired with K36 methylation. We postulate that B4N complexes override the preexisting histone code with new PTM patterns that reflect aberrant transcription and that epigenetically modulate the nucleosome environment toward the NMC state. PMID:27698495

  15. Ontogeny of hepatic energy metabolism genes in mice as revealed by RNA-sequencing.

    PubMed

    Renaud, Helen J; Cui, Yue Julia; Lu, Hong; Zhong, Xiao-bo; Klaassen, Curtis D

    2014-01-01

    The liver plays a central role in metabolic homeostasis by coordinating synthesis, storage, breakdown, and redistribution of nutrients. Hepatic energy metabolism is dynamically regulated throughout different life stages due to different demands for energy during growth and development. However, changes in gene expression patterns throughout ontogeny for factors important in hepatic energy metabolism are not well understood. We performed detailed transcript analysis of energy metabolism genes during various stages of liver development in mice. Livers from male C57BL/6J mice were collected at twelve ages, including perinatal and postnatal time points (n = 3/age). The mRNA was quantified by RNA-Sequencing, with transcript abundance estimated by Cufflinks. One thousand sixty energy metabolism genes were examined; 794 were above detection, of which 627 were significantly changed during at least one developmental age compared to adult liver. Two-way hierarchical clustering revealed three major clusters dependent on age: GD17.5-Day 5 (perinatal-enriched), Day 10-Day 20 (pre-weaning-enriched), and Day 25-Day 60 (adolescence/adulthood-enriched). Clustering analysis of cumulative mRNA expression values for individual pathways of energy metabolism revealed three patterns of enrichment: glycolysis, ketogenesis, and glycogenesis were all perinatally-enriched; glycogenolysis was the only pathway enriched during pre-weaning ages; whereas lipid droplet metabolism, cholesterol and bile acid metabolism, gluconeogenesis, and lipid metabolism were all enriched in adolescence/adulthood. This study reveals novel findings such as the divergent expression of the fatty acid β-oxidation enzymes Acyl-CoA oxidase 1 and Carnitine palmitoyltransferase 1a, indicating a switch from mitochondrial to peroxisomal β-oxidation after weaning; as well as the dynamic ontogeny of genes implicated in obesity such as Stearoyl-CoA desaturase 1 and Elongation of very long chain fatty acids-like 3. These data shed new light on the ontogeny of homeostatic regulation of hepatic energy metabolism, which could ultimately provide new therapeutic targets for metabolic diseases.

  16. Efficient Mining of Interesting Patterns in Large Biological Sequences

    PubMed Central

    Rashid, Md. Mamunur; Karim, Md. Rezaul; Jeong, Byeong-Soo

    2012-01-01

    Pattern discovery in biological sequences (e.g., DNA sequences) is one of the most challenging tasks in computational biology and bioinformatics. So far, in most approaches, the number of occurrences is a major measure of determining whether a pattern is interesting or not. In computational biology, however, a pattern that is not frequent may still be considered very informative if its actual support frequency exceeds the prior expectation by a large margin. In this paper, we propose a new interesting measure that can provide meaningful biological information. We also propose an efficient index-based method for mining such interesting patterns. Experimental results show that our approach can find interesting patterns within an acceptable computation time. PMID:23105928
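
    The notion that an infrequent pattern can still be informative when its observed support far exceeds expectation can be sketched as an observed-to-expected ratio. This is only an illustration, not the interestingness measure or index-based method proposed in the paper: the zero-order background model (independent bases with frequencies taken from the sequence itself) and the function names are assumptions.

```python
from collections import Counter


def count_overlapping(seq: str, pattern: str) -> int:
    """Number of (possibly overlapping) occurrences of pattern in seq."""
    k = len(pattern)
    return sum(seq[i:i + k] == pattern for i in range(len(seq) - k + 1))


def expected_count(seq: str, pattern: str) -> float:
    """Expected occurrences if bases were drawn independently with the
    single-base frequencies observed in seq (a zero-order model)."""
    n = len(seq)
    freq = Counter(seq)
    prob = 1.0
    for base in pattern:
        prob *= freq[base] / n
    return prob * (n - len(pattern) + 1)


def surprise_ratio(seq: str, pattern: str) -> float:
    """Observed / expected support; values well above 1 flag a pattern
    that occurs more often than the background model predicts."""
    expected = expected_count(seq, pattern)
    if expected == 0:
        return float("inf")
    return count_overlapping(seq, pattern) / expected


# Made-up sequence: "ACGT" occurs 5 times although about 0.07 are expected.
seq = "ACGT" * 5
print(count_overlapping(seq, "ACGT"), expected_count(seq, "ACGT"))
print(surprise_ratio(seq, "ACGT"))
```

    A pattern with a low raw count can still score highly here if the background model predicts far fewer occurrences, which is the intuition the abstract describes.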

  17. Efficient mining of interesting patterns in large biological sequences.

    PubMed

    Rashid, Md Mamunur; Karim, Md Rezaul; Jeong, Byeong-Soo; Choi, Ho-Jin

    2012-03-01

    Pattern discovery in biological sequences (e.g., DNA sequences) is one of the most challenging tasks in computational biology and bioinformatics. So far, in most approaches, the number of occurrences is a major measure of determining whether a pattern is interesting or not. In computational biology, however, a pattern that is not frequent may still be considered very informative if its actual support frequency exceeds the prior expectation by a large margin. In this paper, we propose a new interesting measure that can provide meaningful biological information. We also propose an efficient index-based method for mining such interesting patterns. Experimental results show that our approach can find interesting patterns within an acceptable computation time.

  18. Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing

    PubMed Central

    Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin

    2012-01-01

    Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633

  19. Multiplexed Sequence Encoding: A Framework for DNA Communication.

    PubMed

    Zakeri, Bijan; Carr, Peter A; Lu, Timothy K

    2016-01-01

    Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication (data encoding, data transfer, and data extraction) and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system, Multiplexed Sequence Encoding (MuSE), that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
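
    As a rough illustration of the two ideas named above (converting plaintext into DNA and splitting a message across several molecules), the sketch below maps each byte to four bases and uses XOR-based secret sharing. It is not the iKeys or MuSE scheme: the two-bit base mapping, the XOR construction, and every function name are assumptions made for this example, and no attempt is made to avoid homopolymers.

```python
import secrets

BASES = "ACGT"  # assumed mapping: A=00, C=01, G=10, T=11


def bytes_to_dna(data: bytes) -> str:
    """Encode each byte as four bases, most significant bit pair first."""
    return "".join(
        BASES[(byte >> shift) & 0b11] for byte in data for shift in (6, 4, 2, 0)
    )


def dna_to_bytes(dna: str) -> bytes:
    """Inverse of bytes_to_dna."""
    if len(dna) % 4:
        raise ValueError("DNA length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(dna), 4):
        byte = 0
        for base in dna[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        out.append(byte)
    return bytes(out)


def split_message(message: str, n_shares: int = 2) -> list[str]:
    """Split a message into n DNA strings; all are needed to recover it."""
    data = message.encode("utf-8")
    shares = [secrets.token_bytes(len(data)) for _ in range(n_shares - 1)]
    last = bytes(data)
    for share in shares:
        last = bytes(x ^ y for x, y in zip(last, share))
    shares.append(last)
    return [bytes_to_dna(share) for share in shares]


def combine_shares(dna_shares: list[str]) -> str:
    """XOR all decoded shares together to reveal the message."""
    data = bytes(len(dna_shares[0]) // 4)
    for dna in dna_shares:
        data = bytes(x ^ y for x, y in zip(data, dna_to_bytes(dna)))
    return data.decode("utf-8")


shares = split_message("hello", n_shares=3)
print(shares)
print(combine_shares(shares))
```

    Each share on its own is indistinguishable from random sequence; only the combination of all shares reveals the text, which loosely mirrors the requirement of a combination key.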

  20. Molecular Cloning and Sequence Analysis of a Phenylalanine Ammonia-Lyase Gene from Dendrobium

    PubMed Central

    Cai, Yongping; Lin, Yi

    2013-01-01

    In this study, a phenylalanine ammonia-lyase (PAL) gene was cloned from Dendrobium candidum using homology cloning and RACE. The full-length PAL cDNA of D. candidum (designated Dc-PAL1, GenBank No. JQ765748) is 2,458 bp long and contains a complete open reading frame (ORF) of 2,142 bp, which encodes 713 amino acid residues. The amino acid sequence of DcPAL1 has more than 80% sequence identity with the PAL proteins of other plants, as indicated by multiple alignments. The dominant sites and catalytic active sites found in the PAL proteins of Arabidopsis thaliana and Nicotiana tabacum are also present in DcPAL1. Phylogenetic tree analysis revealed that DcPAL is more closely related to PALs from Orchidaceae plants than to those of other plants. The differential expression patterns of PAL in protocorm-like body, leaf, stem, and root suggest that the PAL gene performs multiple physiological functions in Dendrobium candidum. PMID:23638048

  1. Use of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells.

    PubMed

    Xin, Yurong; Kim, Jinrang; Ni, Min; Wei, Yi; Okamoto, Haruka; Lee, Joseph; Adler, Christina; Cavino, Katie; Murphy, Andrew J; Yancopoulos, George D; Lin, Hsin Chieh; Gromada, Jesper

    2016-03-22

    This study provides an assessment of the Fluidigm C1 platform for RNA sequencing of single mouse pancreatic islet cells. The system combines microfluidic technology and nanoliter-scale reactions. We sequenced 622 cells, allowing identification of 341 islet cells with high-quality gene expression profiles. The cells clustered into populations of α-cells (5%), β-cells (92%), δ-cells (1%), and pancreatic polypeptide cells (2%). We identified cell-type-specific transcription factors and pathways primarily involved in nutrient sensing and oxidation and cell signaling. Unexpectedly, 281 cells had to be removed from the analysis due to low viability, low sequencing quality, or contamination resulting in the detection of more than one islet hormone. Collectively, we provide a resource for identification of high-quality gene expression datasets to help expand insights into genes and pathways characterizing islet cell types. We reveal limitations in the C1 Fluidigm cell capture process resulting in contaminated cells with altered gene expression patterns. This calls for caution when interpreting single-cell transcriptomics data using the C1 Fluidigm system.

  2. Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

    PubMed

    Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

    1997-12-01

    A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites, the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. A single N-linked glycosylation site existed on each of the N- and C-lobes. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identity with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3%, Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). Long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.

  3. Isolation, purification and functional characterization of alpha-BnIA from Conus bandanus venom.

    PubMed

    Nguyen, Bao; Le Caer, Jean-Pierre; Aráoz, Romulo; Thai, Robert; Lamthanh, Hung; Benoit, Evelyne; Molgó, Jordi

    2014-12-01

    We report the isolation and characterization by a proteomic approach of a native conopeptide, named BnIA, from the crude venom of Conus bandanus, a molluscivorous cone snail species, collected on the south central coast of Vietnam. Its primary sequence was determined by matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometry using collision-induced dissociation and confirmed by Edman degradation of the pure native fraction. BnIA was present in high amounts in the crude venom, and the complete sequence of the 16 amino acid peptide was GCCSHPACSVNNPDIC*, with C-terminal amidation deduced from Edman degradation and theoretical monoisotopic mass calculation. Sequence alignment revealed that its -C1C2X4C3X7C4- pattern belongs to the A-superfamily of conopeptides. The cysteine connectivity of BnIA was 1-3/2-4, as determined by a partial-reduction technique, like that of other α4/7-conotoxins reported previously in other Conus species. Additionally, we found that native α-BnIA shared the same sequence alignment as Mr1.1, from the closely related molluscivorous Conus marmoreus venom, in specimens collected in the same coastal region of Vietnam. Functional studies revealed that native α-BnIA inhibited acetylcholine-evoked currents reversibly in oocytes expressing the human α7 nicotinic acetylcholine receptors, and blocked nerve-evoked skeletal muscle contractions in isolated mouse neuromuscular preparations, but with ∼200-times less potency. Copyright © 2014 Elsevier Ltd. All rights reserved.
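
    The -C1C2X4C3X7C4- cysteine framework quoted in the abstract can be checked against the reported sequence with a regular expression. The following is a minimal sketch, not the authors' alignment procedure; the function name cysteine_loops and the choice to exclude cysteine from the loop residues are assumptions made for illustration.

```python
import re

# Sequence as given in the abstract, without the C-terminal amidation mark (*).
BNIA = "GCCSHPACSVNNPDIC"

# C1C2-X4-C3-X7-C4: two adjacent cysteines, a 4-residue loop, a cysteine,
# a 7-residue loop, and a final cysteine. Loop residues are assumed non-Cys.
ALPHA_4_7 = re.compile(r"CC([^C]{4})C([^C]{7})C")


def cysteine_loops(peptide):
    """Return the two inter-cysteine loops, or None if the framework is absent."""
    match = ALPHA_4_7.search(peptide)
    if match is None:
        return None
    return match.group(1), match.group(2)


loops = cysteine_loops(BNIA)
print(loops)
```

    The two captured loops have lengths 4 and 7, which is what the "α4/7" label in the abstract refers to.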

  4. Isosteric And Non-Isosteric Base Pairs In RNA Motifs: Molecular Dynamics And Bioinformatics Study Of The Sarcin-Ricin Internal Loop

    PubMed Central

    Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří

    2013-01-01

    The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of the SR motif. A SHAPE probing experiment was also performed to confirm the fidelity of the MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine the frequency of occurrence of different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non-isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. The MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that the inability to form a stable cWW geometry is an important factor in the case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit-solvent MD simulations neatly reflect the viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333

  5. CoLIde: a bioinformatics tool for CO-expression-based small RNA Loci Identification using high-throughput sequencing data.

    PubMed

    Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent

    2013-07-01

    Small RNAs (sRNAs) are 20-25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied to ordered (e.g., time-dependent) or unordered (e.g., organ, mutant) series of samples, with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench, which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk.
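
    The locus definition in the abstract (a union of regions that share an expression pattern and lie close together on the genome) can be illustrated with a short sketch. This is not the CoLIde implementation: the Region type, the string encoding of patterns, the max_gap threshold and the single-chromosome, single-strand setting are all assumptions chosen for illustration, and the size class and significance analyses mentioned in the abstract are omitted.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Region:
    start: int
    end: int
    pattern: str  # hypothetical encoding, e.g. "UD" = up then down across samples


def merge_into_loci(regions: List[Region], max_gap: int = 200) -> List[Region]:
    """Merge neighbouring regions with identical patterns into loci.

    A region joins the preceding locus when the patterns match and the gap
    between them is at most max_gap nucleotides (an assumed threshold).
    """
    loci: List[Region] = []
    for region in sorted(regions, key=lambda r: r.start):
        last = loci[-1] if loci else None
        if last is not None and last.pattern == region.pattern \
                and region.start - last.end <= max_gap:
            last.end = max(last.end, region.end)  # extend the current locus
        else:
            # Copy the region so the caller's input is never modified.
            loci.append(Region(region.start, region.end, region.pattern))
    return loci


example = [
    Region(100, 150, "UD"),
    Region(180, 220, "UD"),    # same pattern, 30 nt away: merged
    Region(230, 260, "DU"),    # different pattern: new locus
    Region(1000, 1050, "UD"),  # same pattern but far away: new locus
]
for locus in merge_into_loci(example):
    print(locus)
```

    In this toy input, the first two regions form one locus, while the third and fourth each start a new one.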

  6. Using ProMED-Mail and MedWorm blogs for cross-domain pattern analysis in epidemic intelligence.

    PubMed

    Stewart, Avaré; Denecke, Kerstin

    2010-01-01

    In this work we motivate the use of user-generated content from medical blogs for gathering facts about disease reporting events to support biosurveillance investigation. Given the characteristics of blogs, the extraction of such events is made more difficult by noise and data abundance. We address the problem of automatically inferring disease reporting event extraction patterns in this noisier setting. The sublanguage used in outbreak reports is exploited to align with the sequences of disease reporting sentences in blogs. Based on our Cross Domain Pattern Analysis Framework, experimental results show that Phase-Level sequences tend to produce more overlap across the domains than Word-Level sequences. The cross-domain alignment process is effective at filtering noisy sequences from blogs and extracting good candidate sequence patterns from an abundance of text.

  7. SPMBR: a scalable algorithm for mining sequential patterns based on bitmaps

    NASA Astrophysics Data System (ADS)

    Xu, Xiwei; Zhang, Changhai

    2013-12-01

    Some existing sequential pattern mining algorithms generate too many candidate sequences, which increases the processing cost of support counting. We therefore present an effective and scalable algorithm called SPMBR (Sequential Patterns Mining based on Bitmap Representation) for mining sequential patterns in large databases. The main difference from previous work on sequential pattern mining is that the database is represented by bitmaps, for which a simplified bitmap structure is presented. The algorithm first generates candidate sequences by SE (Sequence Extension) and IE (Item Extension), and then obtains all frequent sequences by comparing the original bitmap with the extended item bitmap. This method simplifies the problem of mining sequential patterns and avoids the high processing cost of support counting. Both theoretical analysis and experiments indicate that the performance of SPMBR is superior for large transaction databases, that much less memory is required for storing temporal data during mining, and that all sequential patterns can be mined.
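
    The abstract does not give SPMBR's simplified bitmap structure, so the sketch below instead follows the general bitmap scheme used by earlier bitmap-based miners such as SPAM: each item gets one bitmap per customer sequence, sequence extension (SE) and item extension (IE) become bitwise operations, and support is counted without rescanning the database. All function names and the toy database are assumptions for illustration.

```python
from typing import Dict, List, Set

Bitmap = List[int]  # one int per customer sequence; bit k marks itemset k


def build_item_bitmaps(db: List[List[Set[str]]]) -> Dict[str, Bitmap]:
    """Build, for every item, a bitmap of the itemset positions where it occurs."""
    bitmaps: Dict[str, Bitmap] = {}
    for sid, sequence in enumerate(db):
        for pos, itemset in enumerate(sequence):
            for item in itemset:
                bitmaps.setdefault(item, [0] * len(db))[sid] |= 1 << pos
    return bitmaps


def support(bitmap: Bitmap) -> int:
    """Number of customer sequences that contain the pattern at least once."""
    return sum(1 for bits in bitmap if bits)


def item_extension(prefix: Bitmap, item: Bitmap) -> Bitmap:
    """IE: the item must occur in the same itemset as the prefix's last element."""
    return [p & i for p, i in zip(prefix, item)]


def sequence_extension(prefix: Bitmap, item: Bitmap) -> Bitmap:
    """SE: the item must occur in an itemset strictly after the prefix ends."""
    result = []
    for p, i in zip(prefix, item):
        if p == 0:
            result.append(0)
            continue
        lowest = p & -p                  # earliest position where the prefix ends
        later = ~((lowest << 1) - 1)     # mask of all strictly later positions
        result.append(later & i)
    return result


# Toy database: three customer sequences, each a list of itemsets.
db = [
    [{"a", "b"}, {"c"}, {"a"}],
    [{"a"}, {"b", "c"}],
    [{"b"}, {"c"}],
]
bm = build_item_bitmaps(db)
print(support(sequence_extension(bm["a"], bm["c"])))  # pattern <a><c>
print(support(item_extension(bm["a"], bm["b"])))      # pattern <(a b)>
```

    A full miner would apply these two extensions recursively and prune every candidate whose support falls below a minimum threshold.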

  8. Bat Caliciviruses and Human Noroviruses Are Antigenically Similar and Have Overlapping Histo-Blood Group Antigen Binding Profiles.

    PubMed

    Kocher, Jacob F; Lindesmith, Lisa C; Debbink, Kari; Beall, Anne; Mallory, Michael L; Yount, Boyd L; Graham, Rachel L; Huynh, Jeremy; Gates, J Edward; Donaldson, Eric F; Baric, Ralph S

    2018-05-22

    Emerging zoonotic viral diseases remain a challenge to global public health. Recent surveillance studies have implicated bats as potential reservoirs for a number of viral pathogens, including coronaviruses and Ebola viruses. Caliciviridae represent a major viral family contributing to emerging diseases in both human and animal populations and have been recently identified in bats. In this study, we blended metagenomics, phylogenetics, homology modeling, and in vitro assays to characterize two novel bat calicivirus (BtCalV) capsid sequences, corresponding to strain BtCalV/A10/USA/2009, identified in Perimyotis subflavus near Little Orleans, MD, and bat norovirus. We observed that bat norovirus formed virus-like particles and had epitopes and receptor-binding patterns similar to those of human noroviruses. To determine whether these observations stretch across multiple bat caliciviruses, we characterized a novel bat calicivirus, BtCalV/A10/USA/2009. Phylogenetic analysis revealed that BtCalV/A10/USA/2009 likely represents a novel Caliciviridae genus and is most closely related to "recoviruses." Homology modeling revealed that the capsid sequences of BtCalV/A10/USA/2009 and bat norovirus resembled human norovirus capsid sequences and retained host ligand binding within the receptor-binding domains similar to that seen with human noroviruses. Both caliciviruses bound histo-blood group antigens in patterns that overlapped those seen with human and animal noroviruses. Taken together, our results indicate the potential for bat caliciviruses to bind histo-blood group antigens and overcome a significant barrier to cross-species transmission. Additionally, we have shown that bat norovirus maintains antigenic epitopes similar to those seen with human noroviruses, providing further evidence of evolutionary descent. Our results reiterate the importance of surveillance of wild-animal populations, especially of bats, for novel viral pathogens. 
    IMPORTANCE Caliciviruses are rapidly evolving viruses that cause pandemic outbreaks associated with significant morbidity and mortality globally. The animal reservoirs for human caliciviruses are unknown; bats represent critical reservoir species for several emerging and zoonotic diseases. Recent reports have identified several bat caliciviruses but have not characterized biological functions associated with disease risk, including their potential emergence in other mammalian populations. In this report, we identified a novel bat calicivirus that is most closely related to nonhuman primate caliciviruses. Using this new bat calicivirus and a second norovirus-like bat calicivirus capsid gene sequence, we generated virus-like particles that have host carbohydrate ligand binding patterns similar to those of human and animal noroviruses and that share antigens with human noroviruses. The similarities to human noroviruses with respect to binding patterns and antigenic epitopes illustrate the potential for bat caliciviruses to emerge in other species and the importance of pathogen surveillance in wild-animal populations. Copyright © 2018 Kocher et al.

  9. Relationships between early literacy and nonlinguistic rhythmic processes in kindergarteners.

    PubMed

    Ozernov-Palchik, Ola; Wolf, Maryanne; Patel, Aniruddh D

    2018-03-01

    A growing number of studies report links between nonlinguistic rhythmic abilities and certain linguistic abilities, particularly phonological skills. The current study investigated the relationship between nonlinguistic rhythmic processing, phonological abilities, and early literacy abilities in kindergarteners. A distinctive aspect of the current work was the exploration of whether processing of different types of rhythmic patterns is differentially related to kindergarteners' phonological and reading-related abilities. Specifically, we examined the processing of metrical versus nonmetrical rhythmic patterns, that is, patterns capable of being subdivided into equal temporal intervals or not (Povel & Essens, 1985). This is an important comparison because most music involves metrical sequences, in which rhythm often has an underlying temporal grid of isochronous units. In contrast, nonmetrical sequences are arguably more typical of speech rhythm, which is temporally structured but does not involve an underlying grid of equal temporal units. A rhythm discrimination app with metrical and nonmetrical patterns was administered to 74 kindergarteners in conjunction with cognitive and preliteracy measures. Findings support a relationship among rhythm perception, phonological awareness, and letter-sound knowledge (an essential precursor of reading). A mediation analysis revealed that the association between rhythm perception and letter-sound knowledge is mediated through phonological awareness. Furthermore, metrical perception accounted for unique variance in letter-sound knowledge above all other language and cognitive measures. These results point to a unique role for temporal regularity processing in the association between musical rhythm and literacy in young children. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Entropic Movement Complexity Reflects Subjective Creativity Rankings of Visualized Hand Motion Trajectories

    PubMed Central

    Peng, Zhen; Braun, Daniel A.

    2015-01-01

    In a previous study we have shown that human motion trajectories can be characterized by translating continuous trajectories into symbol sequences with well-defined complexity measures. Here we test the hypothesis that the motion complexity individuals generate in their movements might be correlated to the degree of creativity assigned by a human observer to the visualized motion trajectories. We asked participants to generate 55 novel hand movement patterns in virtual reality, where each pattern had to be repeated 10 times in a row to ensure reproducibility. This allowed us to estimate a probability distribution over trajectories for each pattern. We assessed motion complexity not only by the previously proposed complexity measures on symbolic sequences, but we also propose two novel complexity measures that can be directly applied to the distributions over trajectories based on the frameworks of Gaussian Processes and Probabilistic Movement Primitives. In contrast to previous studies, these new methods allow computing complexities of individual motion patterns from very few sample trajectories. We compared the different complexity measures to how a group of independent jurors rank ordered the recorded motion trajectories according to their personal creativity judgment. We found three entropic complexity measures that correlate significantly with human creativity judgment and discuss differences between the measures. We also test whether these complexity measures correlate with individual creativity in divergent thinking tasks, but do not find any consistent correlation. Our results suggest that entropic complexity measures of hand motion may reveal domain-specific individual differences in kinesthetic creativity. PMID:26733896

  11. Genetic population structure of the lionfish Pterois miles (Scorpaenidae, Pteroinae) in the Gulf of Aqaba and northern Red Sea.

    PubMed

    Kochzius, Marc; Blohm, Dietmar

    2005-03-14

    The aim of this study is to reveal gene flow between populations of the coral reef dwelling lionfish Pterois miles in the Gulf of Aqaba and northern Red Sea. Due to the fjord-like hydrography and topology of the Gulf of Aqaba, isolation of populations might be possible. Analysis of 5' mitochondrial control region sequences from 94 P. miles specimens detected 32 polymorphic sites, yielding 38 haplotypes. Sequence divergence among different haplotypes ranged from 0.6% to 9.9% and genetic diversity was high (h=0.85, pi=1.9%). AMOVA indicates panmixia between the Gulf of Aqaba and northern Red Sea, but analysis of migration patterns shows an almost unidirectional migration originating from the Red Sea.

  12. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    PubMed Central

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  13. Binary Gene Expression Patterning of the Molt Cycle: The Case of Chitin Metabolism

    PubMed Central

    Abehsera, Shai; Glazer, Lilah; Tynyakov, Jenny; Plaschkes, Inbar; Chalifa-Caspi, Vered; Khalaila, Isam; Aflalo, Eliahu D.; Sagi, Amir

    2015-01-01

    In crustaceans, like all arthropods, growth is accompanied by a molting cycle. This cycle comprises major physiological events in which mineralized chitinous structures are built and degraded. These events are in turn governed by genes whose patterns of expression are presumably linked to the molting cycle. To study these genes we performed next generation sequencing and constructed a molt-related transcriptomic library from two exoskeletal-forming tissues of the crayfish Cherax quadricarinatus, namely the gastrolith and the mandible cuticle-forming epithelium. To simplify the study of such a complex process as molting, a novel approach, binary patterning of gene expression, was employed. This approach revealed that key genes involved in the synthesis and breakdown of chitin exhibit a molt-related pattern in the gastrolith-forming epithelium. On the other hand, the same genes in the mandible cuticle-forming epithelium showed a molt-independent pattern of expression. Genes related to the metabolism of glucosamine-6-phosphate, a chitin precursor synthesized from simple sugars, showed a molt-related pattern of expression in both tissues. The binary patterning approach unfolds typical patterns of gene expression during the molt cycle of a crustacean. The use of such a simplifying integrative tool for assessing gene patterning seems appropriate for the study of complex biological processes. PMID:25919476

  14. Calmodulin Gene Family in Potato: Developmental and Touch-Induced Expression of the mRNA Encoding a Novel Isoform

    NASA Technical Reports Server (NTRS)

    Takezawa, D.; Liu, Z. H.; An, G.; Poovaiah, B. W.

    1995-01-01

    Eight genomic clones of potato calmodulin (PCM1 to 8) were isolated and characterized. Sequence comparisons of different genes revealed that the deduced amino acid sequence of PCM1 had several unique substitutions, especially in the fourth Ca(2+)-binding area. The expression patterns of different genes were studied by northern analysis using the 3'-untranslated regions as probes. The expression of PCM1, 5, and 8 was highest in the stolon tip and it decreased during tuber development. The expression of PCM6 did not vary much in the tissues tested, except in the leaves, where the expression was lower; whereas, the expression of PCM4 was very low in all the tissues. The expression of PCM2 and PCM3 was not detected in any of the tissues tested. Among these genes, only PCM1 showed increased expression following touch stimulation. To study the regulation of PCM1, transgenic potato plants carrying the PCM1 promoter fused to the beta-glucuronidase (GUS) reporter gene were produced. GUS expression was found to be developmentally regulated and touch-responsive, indicating a positive correlation between the expression of PCM1 and GUS mRNAs. These results suggest that the 5'-flanking region of PCM1 controls developmental and touch-induced expression. X-Gluc staining patterns revealed that GUS localization is high in meristematic tissues such as the stem apex, stolon tip, and vascular regions.

  15. Characterization of emergent HIV resistance in treatment-naive subjects enrolled in a vicriviroc phase 2 trial.

    PubMed

    McNicholas, Paul; Wei, Yi; Whitcomb, Jeannette; Greaves, Wayne; Black, Todd A; Tremblay, Cecile L; Strizki, Julie M

    2010-05-15

    Vicriviroc is a C-C motif chemokine receptor 5 (CCR5) antagonist that is in clinical development for the treatment of human immunodeficiency virus type 1 (HIV-1) infection. This study explored the molecular basis for the development of phenotypically resistant virus. HIV-1 RNA from treatment-naive subjects who experienced virological failure in a phase 2 dose-finding trial was evaluated for coreceptor usage and susceptibility. For viruses that exhibited reduced susceptibility to vicriviroc, envelope clones were phenotypically and genotypically characterized. Twenty-six vicriviroc-treated subjects experienced virological failure; for 24 the virus remained CCR5-tropic, and 2 had dual/X4 virus. Reduced susceptibility to vicriviroc, manifested as decreases in the maximum percent inhibition value (no increase in median inhibitory concentration), was detected in 4 of the 26 subjects who experienced virological failure. Clonal analysis of envelopes in samples from these 4 subjects revealed multiple sequence changes in gp160, principally within the variable domain 1/variable domain 2, variable domain 3, and variable domain 4 loops. However, no consistent pattern of mutations was observed across subjects. In this study, only a small proportion of treatment failures were associated with tropism changes or reduced susceptibility to vicriviroc. Genotypic analysis of cloned env sequences revealed no specific mutational pattern associated with reduced susceptibility to vicriviroc, although numerous changes were observed in the variable domain 3 loop and in other regions of gp160.

  16. Holarctic phylogeography of the testate amoeba Hyalosphenia papilio (Amoebozoa: Arcellinida) reveals extensive genetic diversity explained more by environment than dispersal limitation.

    PubMed

    Heger, Thierry J; Mitchell, Edward A D; Leander, Brian S

    2013-10-01

    Although free-living protists play essential roles in aquatic and soil ecology, little is known about their diversity and phylogeography, especially in terrestrial ecosystems. We used mitochondrial cytochrome c oxidase subunit 1 (COI) gene sequences to investigate the genetic diversity and phylogeography of the testate amoeba morphospecies Hyalosphenia papilio in 42 Sphagnum (moss)-dominated peatlands in North America, Europe and Asia. Based on a ≥1% sequence divergence threshold, our results from single-cell PCRs of 301 individuals revealed 12 different genetic lineages, and both the general mixed Yule-coalescent (GMYC) model and the automatic barcode gap discovery (ABGD) methods largely support the hypothesis that these 12 H. papilio lineages correspond to evolutionarily independent units (i.e. cryptic species). Our data also showed a high degree of genetic heterogeneity within different geographical regions. Furthermore, we used variation partitioning based on partial redundancy analyses (pRDA) to evaluate the contributions of climate and dispersal limitations to the distribution patterns of the different genetic lineages. The largest fraction of the variation in genetic lineage distribution was attributed to purely climatic factors (21%), followed by the joint effect of spatial and bioclimatic factors (13%), and a purely spatial effect (3%). Therefore, these data suggest that the distribution patterns of H. papilio genetic lineages in the Northern Hemisphere are more influenced by climatic conditions than by dispersal limitations. © 2013 John Wiley & Sons Ltd.

  17. Revising the recent evolutionary history of equids using ancient DNA

    PubMed Central

    Orlando, Ludovic; Metcalf, Jessica L.; Alberdi, Maria T.; Telles-Antunes, Miguel; Bonjean, Dominique; Otte, Marcel; Martin, Fabiana; Eisenmann, Véra; Mashkour, Marjan; Morello, Flavia; Prado, Jose L.; Salas-Gismondi, Rodolfo; Shockey, Bruce J.; Wrinn, Patrick J.; Vasil'ev, Sergei K.; Ovodov, Nikolai D.; Cherry, Michael I.; Hopwood, Blair; Male, Dean; Austin, Jeremy J.; Hänni, Catherine; Cooper, Alan

    2009-01-01

    The rich fossil record of the family Equidae (Mammalia: Perissodactyla) over the past 55 MY has made it an icon for the patterns and processes of macroevolution. Despite this, many aspects of equid phylogenetic relationships and taxonomy remain unresolved. Recent genetic analyses of extinct equids have revealed unexpected evolutionary patterns and a need for major revisions at the generic, subgeneric, and species levels. To investigate this issue we examine 35 ancient equid specimens from four geographic regions (South America, Europe, Southwest Asia, and South Africa), of which 22 delivered 87–688 bp of reproducible aDNA mitochondrial sequence. Phylogenetic analyses support a major revision of the recent evolutionary history of equids and reveal two new species, a South American hippidion and a descendant of a basal lineage potentially related to Middle Pleistocene equids. Sequences from specimens assigned to the giant extinct Cape zebra, Equus capensis, formed a separate clade within the modern plain zebra species, a phenotypically plastic group that also included the extinct quagga. In addition, we revise the currently recognized extinction times for two hemione-related equid groups. However, it is apparent that the current dataset cannot solve all of the taxonomic and phylogenetic questions relevant to the evolution of Equus. In light of these findings, we propose a rapid DNA barcoding approach to evaluate the taxonomic status of the many Late Pleistocene fossil Equidae species that have been described from purely morphological analyses. PMID:20007379

  18. An analysis of dinosaurian biogeography: evidence for the existence of vicariance and dispersal patterns caused by geological events.

    PubMed

    Upchurch, Paul; Hunn, Craig A; Norman, David B

    2002-03-22

    As the supercontinent Pangaea fragmented during the Mesozoic era, dinosaur faunas were divided into isolated populations living on separate continents. It has been predicted, therefore, that dinosaur distributions should display a branching ('vicariance') pattern that corresponds with the sequence and timing of continental break-up. Several recent studies, however, minimize the importance of plate tectonics and instead suggest that dispersal and regional extinction were the main controls on dinosaur biogeography. Here, in order to test the vicariance hypothesis, we apply a cladistic biogeographical method to a large dataset on dinosaur relationships and distributions. We also introduce a methodological refinement termed 'time-slicing', which is shown to be a key step in the detection of ancient biogeographical patterns. These analyses reveal biogeographical patterns that closely correlate with palaeogeography. The results provide the first statistically robust evidence that, from Middle Jurassic to mid-Cretaceous times, tectonic events had a major role in determining where and when particular dinosaur groups flourished. The fact that evolutionary trees for extinct organisms preserve such distribution patterns opens up a new and fruitful direction for palaeobiogeographical research.

  19. Genotyping of clinical and environmental multidrug resistant Enterococcus faecium strains.

    PubMed

    Shokoohizadeh, Leili; Mobarez, Ashraf Mohabati; Alebouyeh, Masoud; Zali, Mohammad Reza; Ranjbar, Reza

    2017-01-01

    Multidrug resistant (MDR) Enterococcus faecium is a nosocomial pathogen and clonal complex 17 (CC17) is the main genetic subpopulation of E. faecium in hospitals worldwide. There has thus far been no report of major E. faecium clones in Iranian hospitals. The present study analyzed strains of MDR E. faecium obtained from patients and the Intensive Care Unit environments using pulsed field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) to determine the antibiotic resistance patterns and genetic features of the dominant clones of E. faecium. PFGE and MLST analysis revealed the presence of 17 and 15 different subtypes, respectively. Of these, 18 (86%) isolates belonged to CC17. Most strains in this clonal complex harbored the esp gene and exhibited resistance to vancomycin, teicoplanin, ampicillin, ciprofloxacin, gentamicin, and erythromycin. The MLST results revealed 12 new sequence types (ST) for the first time. Approximately 50% of the STs were associated with ST203. Detection of E. faecium strains belonging to CC17 on medical equipment and in clinical specimens verified the circulation of high-risk MDR clones among the patients and in hospital environments in Iran.

  20. Evidence for Phex haploinsufficiency in murine X-linked hypophosphatemia.

    PubMed

    Wang, L; Du, L; Ecarot, B

    1999-04-01

    Mutations in the PHEX gene (phosphate-regulating gene with homology to endopeptidases on the X-chromosome) are responsible for X-linked hypophosphatemia (HYP). We previously reported the full-length coding sequence of murine Phex cDNA and provided evidence of Phex expression in bone and tooth. Here, we report the cloning of the entire 3.5-kb 3'UTR of the Phex gene, yielding a total of 6248 bp for the Phex transcript. Southern blot and RT-PCR analyses revealed that the 3' end of the coding sequence and the 3'UTR of the Phex gene, spanning exons 16 to 22, are deleted in Hyp, the mouse model for HYP. Northern blot analysis of bone revealed lack of expression of stable Phex mRNA from the mutant allele and expression of Phex transcripts from the wild-type allele in Hyp heterozygous females. Expression of the Phex protein in heterozygotes was confirmed by Western analysis with antibodies raised against a COOH-terminal peptide of the mouse Phex protein. Taken together, these results indicate that the dominant pattern of Hyp inheritance in mice is due to Phex haploinsufficiency.

  1. Agonal sequences in four filmed hangings: analysis of respiratory and movement responses to asphyxia by hanging.

    PubMed

    Sauvageau, Anny

    2009-01-01

    The human pathophysiology of asphyxia by hanging is still poorly understood, despite great advances in forensic science. In that context, filmed hangings may hold the key to answer questions regarding the sequence of events leading to death in human asphyxia. Four filmed hangings were analyzed. Rapid loss of consciousness was observed between 13 sec and 18 sec after onset of hanging, closely followed by convulsions (at 14-19 sec). A complex pattern of decerebration rigidity (19-21 sec in most cases), followed by a quick phase of decortication rigidity (1 min 00 sec-1 min 08 sec in most cases), an extended phase of decortication rigidity (1 min 04 sec-1 min 32 sec) and loss of muscle tone (1 min 38 sec-2 min 47 sec) was revealed. Very deep respiratory attempts started between 20 and 22 sec, the last respiratory attempt being detected between 2 min 00 sec and 2 min 04 sec. Despite differences in the types of hanging, this unique study reveals similarities that are further discussed.

  2. Population genomics reveals a candidate gene involved in bumble bee pigmentation.

    PubMed

    Pimsler, Meaghan L; Jackson, Jason M; Lozier, Jeffrey D

    2017-05-01

    Variation in bumble bee color patterns is well-documented within and between species. Identifying the genetic mechanisms underlying such variation may be useful in revealing evolutionary forces shaping rapid phenotypic diversification. The widespread North American species Bombus bifarius exhibits regional variation in abdominal color forms, ranging from red-banded to black-banded phenotypes and including geographically and phenotypically intermediate forms. Identifying genomic regions linked to this variation has been complicated by strong, near species level, genome-wide differentiation between red- and black-banded forms. Here, we instead focus on the closely related black-banded and intermediate forms that both belong to the subspecies B. bifarius nearcticus. We analyze an RNA sequencing (RNAseq) data set and identify a cluster of single nucleotide polymorphisms (SNPs) within one gene, Xanthine dehydrogenase/oxidase-like, that exhibit highly unusual differentiation compared to the rest of the sequenced genome. Homologs of this gene contribute to pigmentation in other insects, and results thus represent a strong candidate for investigating the genetic basis of pigment variation in B. bifarius and other bumble bee mimicry complexes.

  3. Temporal and Spatial Diversity of Bacterial Communities in Coastal Waters of the South China Sea

    PubMed Central

    Du, Jikun; Xiao, Kai; Li, Li; Ding, Xian; Liu, Helu; Lu, Yongjun; Zhou, Shining

    2013-01-01

    Bacteria are recognized as important drivers of biogeochemical processes in all aquatic ecosystems. Temporal and geographical patterns in ocean bacterial communities have been observed in many studies, but the temporal and spatial patterns in the bacterial communities from the South China Sea remained unexplored. To determine the spatiotemporal patterns, we generated 16S rRNA datasets for 15 samples collected from the five regularly distributed sites of the South China Sea in three seasons (spring, summer, winter). A total of 491 representative sequences were analyzed by MOTHUR, yielding 282 operational taxonomic units (OTUs) grouped at 97% stringency. Significant temporal variations of bacterial diversity were observed. Richness and diversity indices indicated that summer samples were the most diverse. The main bacterial group in spring and summer samples was Alphaproteobacteria, followed by Cyanobacteria and Gammaproteobacteria, whereas Cyanobacteria dominated the winter samples. Spatial patterns in the samples were observed that samples collected from the coastal (D151, D221) waters and offshore (D157, D1512, D224) waters clustered separately, the coastal samples harbored more diverse bacterial communities. However, the temporal pattern of the coastal site D151 was contrary to that of the coastal site D221. The LIBSHUFF statistics revealed noticeable differences among the spring, summer and winter libraries collected at five sites. The UPGMA tree showed there were temporal and spatial heterogeneity of bacterial community composition in coastal waters of the South China Sea. The water salinity (P=0.001) contributed significantly to the bacteria-environment relationship. Our results revealed that bacterial community structures were influenced by environmental factors and community-level changes in 16S-based diversity were better explained by spatial patterns than by temporal patterns. PMID:23785512

  4. Molecular population dynamics of DNA structures in a bcl-2 promoter sequence is regulated by small molecules and the transcription factor hnRNP LL

    PubMed Central

    Cui, Yunxi; Koirala, Deepak; Kang, HyunJin; Dhakal, Soma; Yangyuoru, Philip; Hurley, Laurence H.; Mao, Hanbin

    2014-01-01

    Minute difference in free energy change of unfolding among structures in an oligonucleotide sequence can lead to a complex population equilibrium, which is rather challenging for ensemble techniques to decipher. Herein, we introduce a new method, molecular population dynamics (MPD), to describe the intricate equilibrium among non-B deoxyribonucleic acid (DNA) structures. Using mechanical unfolding in laser tweezers, we identified six DNA species in a cytosine (C)-rich bcl-2 promoter sequence. Population patterns of these species with and without a small molecule (IMC-76 or IMC-48) or the transcription factor hnRNP LL are compared to reveal the MPD of different species. With a pattern recognition algorithm, we found that IMC-48 and hnRNP LL share 80% similarity in stabilizing i-motifs with 60 s incubation. In contrast, IMC-76 demonstrates an opposite behavior, preferring flexible DNA hairpins. With 120–180 s incubation, IMC-48 and hnRNP LL destabilize i-motifs, which has been previously proposed to activate bcl-2 transcriptions. These results provide strong support, from the population equilibrium perspective, that small molecules and hnRNP LL can modulate bcl-2 transcription through interaction with i-motifs. The excellent agreement with biochemical results firmly validates the MPD analyses, which, we expect, can be widely applicable to investigate complex equilibrium of biomacromolecules. PMID:24609386

  5. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  6. The membrane skeleton in Paramecium: Molecular characterization of a novel epiplasmin family and preliminary GFP expression results.

    PubMed

    Pomel, Sébastien; Diogon, Marie; Bouchard, Philippe; Pradel, Lydie; Ravet, Viviane; Coffe, Gérard; Viguès, Bernard

    2006-02-01

    Previous attempts to identify the membrane skeleton of Paramecium cells have revealed a protein pattern that is both complex and specific. The most prominent structural elements, epiplasmic scales, are centered around ciliary units and are closely apposed to the cytoplasmic side of the inner alveolar membrane. We sought to characterize epiplasmic scale proteins (epiplasmins) at the molecular level. PCR approaches enabled the cloning and sequencing of two closely related genes by amplifications of sequences from a macronuclear genomic library. Using these two genes (EPI-1 and EPI-2), we have contributed to the annotation of the Paramecium tetraurelia macronuclear genome and identified 39 additional (paralogous) sequences. Two orthologous sequences were found in the Tetrahymena thermophila genome. Structural analysis of the 43 sequences indicates that the hallmark of this new multigenic family is a 79 aa domain flanked by two Q-, P- and V-rich stretches of sequence that are much more variable in amino-acid composition. Such features clearly distinguish members of the multigenic family from epiplasmic proteins previously sequenced in other ciliates. The expression of Green Fluorescent Protein (GFP)-tagged epiplasmin showed significant labeling of epiplasmic scales as well as oral structures. We expect that the GFP construct described herein will prove to be a useful tool for comparative subcellular localization of different putative epiplasmins in Paramecium.

  7. Structural and sequence features of two residue turns in beta-hairpins.

    PubMed

    Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu

    2014-09-01

    Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.

  8. Within-genome evolution of REPINs: a new family of miniature mobile DNA in bacteria.

    PubMed

    Bertels, Frederic; Rainey, Paul B

    2011-06-01

    Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT-containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA.

  9. Detecting Abnormal Vehicular Dynamics at Intersections Based on an Unsupervised Learning Approach and a Stochastic Model

    PubMed Central

    Jiménez-Hernández, Hugo; González-Barbosa, Jose-Joel; Garcia-Ramírez, Teresa

    2010-01-01

    This investigation demonstrates an unsupervised approach for modeling traffic flow and detecting abnormal vehicle behaviors at intersections. In the first stage, the approach reveals and records the different states of the system. These states are the result of coding and grouping the historical motion of vehicles as long binary strings. In the second stage, using sequences of the recorded states, a stochastic graph model based on a Markovian approach is built. A behavior is labeled abnormal when current motion pattern cannot be recognized as any state of the system or a particular sequence of states cannot be parsed with the stochastic model. The approach is tested with several sequences of images acquired from a vehicular intersection where the traffic flow and duration used in connection with the traffic lights are continuously changed throughout the day. Finally, the low complexity and the flexibility of the approach make it reliable for use in real time systems. PMID:22163616

  10. Functional and mechanistic diversity of distal transcription enhancers

    PubMed Central

    Bulger, Michael; Groudine, Mark

    2013-01-01

    Biological differences among metazoans, and between cell types in a given organism, arise in large part due to differences in gene expression patterns. The sequencing of multiple metazoan genomes, coupled with recent advances in genome-wide analysis of histone modifications and transcription factor binding, has revealed that among regulatory DNA sequences, gene-distal enhancers appear to exhibit the greatest diversity and cell-type specificity. Moreover, such elements are emerging as important targets for mutations that can give rise to disease and to genetic variability that underlies evolutionary change. Studies of long-range interactions between distal genomic sequences in the nucleus indicate that enhancers are often important determinants of nuclear organization, contributing to a general model for enhancer function that involves direct enhancer-promoter contact. In a number of systems, however, mechanisms for enhancer function are emerging that do not fit solely within such a model, suggesting that enhancers as a class of DNA regulatory element may be functionally and mechanistically diverse. PMID:21295696

  11. Detecting abnormal vehicular dynamics at intersections based on an unsupervised learning approach and a stochastic model.

    PubMed

    Jiménez-Hernández, Hugo; González-Barbosa, Jose-Joel; Garcia-Ramírez, Teresa

    2010-01-01

    This investigation demonstrates an unsupervised approach for modeling traffic flow and detecting abnormal vehicle behaviors at intersections. In the first stage, the approach reveals and records the different states of the system. These states are the result of coding and grouping the historical motion of vehicles as long binary strings. In the second stage, using sequences of the recorded states, a stochastic graph model based on a Markovian approach is built. A behavior is labeled abnormal when current motion pattern cannot be recognized as any state of the system or a particular sequence of states cannot be parsed with the stochastic model. The approach is tested with several sequences of images acquired from a vehicular intersection where the traffic flow and duration used in connection with the traffic lights are continuously changed throughout the day. Finally, the low complexity and the flexibility of the approach make it reliable for use in real time systems.

  12. Progressive Loss of Function in a Limb Enhancer during Snake Evolution.

    PubMed

    Kvon, Evgeny Z; Kamneva, Olga K; Melo, Uirá S; Barozzi, Iros; Osterwalder, Marco; Mannion, Brandon J; Tissières, Virginie; Pickle, Catherine S; Plajzer-Frick, Ingrid; Lee, Elizabeth A; Kato, Momoe; Garvin, Tyler H; Akiyama, Jennifer A; Afzal, Veena; Lopez-Rios, Javier; Rubin, Edward M; Dickel, Diane E; Pennacchio, Len A; Visel, Axel

    2016-10-20

    The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. We identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologs caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Molecular diversity of arbuscular mycorrhizal fungi in relation to soil chemical properties and heavy metal contamination.

    PubMed

    Zarei, Mehdi; Hempel, Stefan; Wubet, Tesfaye; Schäfer, Tina; Savaghebi, Gholamreza; Jouzani, Gholamreza Salehi; Nekouei, Mojtaba Khayam; Buscot, François

    2010-08-01

    Abundance and diversity of arbuscular mycorrhizal fungi (AMF) associated with dominant plant species were studied along a transect from highly lead (Pb) and zinc (Zn) polluted to non-polluted soil at the Anguran open pit mine in Iran. Using an established primer set for AMF in the internal transcribed spacer (ITS) region of rDNA, nine different AMF sequence types were distinguished after phylogenetic analyses, showing remarkable differences in their distribution patterns along the transect. With decreasing Pb and Zn concentration, the number of AMF sequence types increased, however one sequence type was only found in the highly contaminated area. Multivariate statistical analysis revealed that further factors than HM soil concentration affect the AMF community at contaminated sites. Specifically, the soils' calcium carbonate equivalent and available P proved to be of importance, which illustrates that field studies on AMF distribution should also consider important environmental factors and their possible interactions. Copyright 2010 Elsevier Ltd. All rights reserved.

  14. Sequence and Analysis of the Tomato JOINTLESS Locus

    PubMed Central

    Mao, Long; Begum, Dilara; Goff, Stephen A.; Wing, Rod A.

    2001-01-01

    A 119-kb bacterial artificial chromosome from the JOINTLESS locus on the tomato (Lycopersicon esculentum) chromosome 11 contained 15 putative genes. Repetitive sequences in this region include one copia-like LTR retrotransposon, 13 simple sequence repeats, three copies of a novel type III foldback transposon, and four putative short DNA repeats. Database searches showed that the foldback transposon and the short DNA repeats seemed to be associated preferably with genes. The predicted tomato genes were compared with the complete Arabidopsis genome. Eleven out of 15 tomato open reading frames were found to be colinear with segments on five Arabidopsis bacterial artificial chromosome/P1-derived artificial chromosome clones. The synteny patterns, however, did not reveal duplicated segments in Arabidopsis, where over half of the genome is duplicated. Our analysis indicated that the microsynteny between the tomato and Arabidopsis genomes was still conserved at a very small scale but was complicated by the large number of gene families in the Arabidopsis genome. PMID:11457984

  15. The genome of the Lactobacillus sanfranciscensis temperate phage EV3

    PubMed Central

    2013-01-01

    Background: Bacteriophage infection modulates microbial consortia, and transduction is one of the most important mechanisms involved in bacterial evolution. However, phage contamination brings food fermentations to a halt, causing economic setbacks. The number of phage genome sequences of lactic acid bacteria, especially of lactobacilli, is still limited. We analysed the genome of a temperate phage active on Lactobacillus sanfranciscensis, the predominant strain in type I sourdough fermentations. Results: Sequencing of the DNA of EV3 phage revealed a genome of 34,834 bp and a G + C content of 36.45%. Of the 43 open reading frames (ORFs) identified, all but eight shared homology with other phages of lactobacilli. A similar genomic organization and mosaic pattern of identities align EV3 with the closely related Lactobacillus vaginalis ATCC 49540 prophage. Four unknown ORFs that had no homologies in the databases or predicted functions were identified. Notably, EV3 encodes a putative dextranase. Conclusions: EV3 is the first L. sanfranciscensis phage that has been completely sequenced so far. PMID:24308641

  16. Molecular genetic analysis of ancient cattle bones excavated from archaeological sites in Jeju, Korea.

    PubMed

    Kim, Jae-Hwan; Oh, Ju-Hyung; Song, Ji-Hoon; Jeon, Jin-Tae; Han, Sang-Hyun; Jung, Yong-Hwan; Oh, Moon-You

    2005-12-31

    Ancient cattle bones were excavated from archaeological sites in Jeju, Korea. We used molecular genetic techniques to identify the species and establish its relationship to extant cattle breeds. Ancient DNA was extracted from four sources: a humerus (Gonae site, A.D. 700-800), two fragments of radius, and a tooth (Kwakji site, A.D. 0-900). The mitochondrial DNA (mtDNA) D-loop regions were cloned, sequenced, and compared with previously reported sequences of various cattle breeds (9 Asian, 8 European, and 3 African). The results revealed that these bones were of the breed, Bos taurus, and a phylogenetic tree indicated that the four cattle bones formed a monophyletic group with Jeju native black cattle. However, the patterns of sequence variation and reports from archaeological sites suggest that a few wild cattle, with a different maternal lineage, may have existed on Jeju Island. Our results will contribute to further studies of the origin of Jeju native cattle and the possible existence of local wild cattle.

  17. Great majority of recombination events in Arabidopsis are gene conversion events

    PubMed Central

    Yang, Sihai; Yuan, Yang; Wang, Long; Li, Jing; Wang, Wen; Liu, Haoxuan; Chen, Jian-Qun; Hurst, Laurence D.; Tian, Dacheng

    2012-01-01

    The evolutionary importance of meiosis may not solely be associated with allelic shuffling caused by crossing-over but also have to do with its more immediate effects such as gene conversion. Although estimates of the crossing-over rate are often well resolved, the gene conversion rate is much less clear. In Arabidopsis, for example, next-generation sequencing approaches suggest that the two rates are about the same, which contrasts with indirect measures, these suggesting an excess of gene conversion. Here, we provide analysis of this problem by sequencing 40 F2 Arabidopsis plants and their parents. Small gene conversion tracts, with biased gene conversion content, represent over 90% (probably nearer 99%) of all recombination events. The rate of alteration of protein sequence caused by gene conversion is over 600 times that caused by mutation. Finally, our analysis reveals recombination hot spots and unexpectedly high recombination rates near centromeres. This may be responsible for the previously unexplained pattern of high genetic diversity near Arabidopsis centromeres. PMID:23213238

  18. The complete mitochondrial genome of the green lizard Lacerta viridis viridis (Reptilia: Lacertidae) and its phylogenetic position within squamate reptiles.

    PubMed

    Böhme, M U; Fritzsch, G; Tippmann, A; Schlegel, M; Berendonk, T U

    2007-06-01

    For the first time the complete mitochondrial genome was sequenced for a member of Lacertidae. Lacerta viridis viridis was sequenced in order to compare the phylogenetic relationships of this family to other reptilian lineages. Using the long-polymerase chain reaction (long PCR) we characterized a mitochondrial genome, 17,156 bp long showing a typical vertebrate pattern with 13 protein coding genes, 22 transfer RNAs (tRNA), two ribosomal RNAs (rRNA) and one major noncoding region. The noncoding region of L. v. viridis was characterized by a conspicuous 35 bp tandem repeat at its 5' terminus. A phylogenetic study including all currently available squamate mitochondrial sequences demonstrates the position of Lacertidae within a monophyletic squamate group. We obtained a narrow relationship of Lacertidae to Scincidae, Iguanidae, Varanidae, Anguidae, and Cordylidae. Although, the internal relationships within this group yielded only a weak resolution and low bootstrap support, the revealed relationships were more congruent with morphological studies than with recent molecular analyses.

  19. The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

    PubMed Central

    Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

    2016-01-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095

  20. Novel SNP array analysis and exome sequencing detect a homozygous exon 7 deletion of MEGF10 causing early onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD)

    PubMed Central

    Pierson, Tyler Mark; Markello, Thomas; Accardi, John; Wolfe, Lynne; Adams, David; Sincan, Murat; Tarazi, Noor M.; Fajardo, Karin Fuentes; Cherukuri, Praveen F.; Bajraktari, Ilda; Meilleur, Katy G.; Donkervoort, Sandra; Jain, Mina; Hu, Ying; Lehky, Tanya J.; Cruz, Pedro; Mullikin, James C.; Bonnemann, Carsten; Gahl, William A.; Boerkoel, Cornelius F.; Tifft, Cynthia J.

    2013-01-01

    Early-onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD) is a myopathic disorder associated with mutations in MEGF10. By novel analysis of SNP array hybridization and exome sequence coverage, we diagnosed a 10-year old girl with EMARDD following identification of a novel homozygous deletion of exon 7 in MEGF10. In contrast to previously reported EMARDD patients, her weakness was more prominent proximally than distally, and involved her legs more than her arms. MRI of her pelvis and thighs showed muscle atrophy and fatty replacement. Ultrasound of several muscle groups revealed dense homogenous increases in echogenicity. Cloning and sequencing of the deletion breakpoint identified features suggesting the mutation arose by fork stalling and template switching. These findings constitute the first genomic deletion causing EMARDD, expand the clinical phenotype, and provide new insight into the pattern and histology of its muscular pathology. PMID:23453856
