sequence derived insights: Topics by Science.gov

Sample records for sequence derived insights

Insights from Human/Mouse genome comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pennacchio, Len A.

2003-03-30

Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestrymore » of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.« less
Draft genome sequence of the extremely acidophilic biomining bacterium Acidithiobacillus thiooxidans ATCC 19377 provides insights into the evolution of the Acidithiobacillus genus.

PubMed

Valdes, Jorge; Ossandon, Francisco; Quatrini, Raquel; Dopson, Mark; Holmes, David S

2011-12-01

Acidithiobacillus thiooxidans is a mesophilic, extremely acidophilic, chemolithoautotrophic gammaproteobacterium that derives energy from the oxidation of sulfur and inorganic sulfur compounds. Here we present the draft genome sequence of A. thiooxidans ATCC 19377, which has allowed the identification of genes for survival and colonization of extremely acidic environments.
Short-term application of dexamethasone on stem cells derived from human gingiva reduces the expression of RUNX2 and β-catenin.

PubMed

Kim, Bo-Bae; Kim, Minji; Park, Yun-Hee; Ko, Youngkyung; Park, Jun-Beom

2017-06-01

Objective Next-generation sequencing was performed to evaluate the effects of short-term application of dexamethasone on human gingiva-derived mesenchymal stem cells. Methods Human gingiva-derived stem cells were treated with a final concentration of 10 -7 M dexamethasone and the same concentration of vehicle control. This was followed by mRNA sequencing and data analysis, gene ontology and pathway analysis, quantitative real-time polymerase chain reaction of mRNA, and western blot analysis of RUNX2 and β-catenin. Results In total, 26,364 mRNAs were differentially expressed. Comparison of the results of dexamethasone versus control at 2 hours revealed that 7 mRNAs were upregulated and 25 mRNAs were downregulated. The application of dexamethasone reduced the expression of RUNX2 and β-catenin in human gingiva-derived mesenchymal stem cells. Conclusion The effects of dexamethasone on stem cells were evaluated with mRNA sequencing, and validation of the expression was performed with qualitative real-time polymerase chain reaction and western blot analysis. The results of this study can provide new insights into the role of mRNA sequencing in maxillofacial areas.
Question 7: Comparative Genomics and Early Cell Evolution: A Cautionary Methodological Note

NASA Astrophysics Data System (ADS)

Islas, Sara; Hernández-Morales, Ricardo; Lazcano, Antonio

2007-10-01

Inventories of the gene content of the last common ancestor (LCA), i.e., the cenancestor, include sequences that may have undergone horizontal transfer events, as well as sequences that have originated in different pre-cenancestral epochs. However, the universal distribution of highly conserved genes involved in RNA metabolism provide insights into early stages of cell evolution during which RNA played a much more conspicuous biological role, and is consistent with the hypothesis that extant living systems were preceded by an RNA/protein world. Insights into the traits of primitive entities from which the LCA evolved may be derived from the analysis of paralogous gene families, including those formed by sequences that resulted from internal elongation events. Three major types of paralogous gene families can be recognized. The importance of this grouping for understanding the traits of early cells is discussed.
Whole-Genome Sequencing of Theileria parva Strains Provides Insight into Parasite Migration and Diversification in the African Continent

PubMed Central

Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

2013-01-01

The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814–121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent. PMID:23404454
Whole-genome sequencing of Theileria parva strains provides insight into parasite migration and diversification in the African continent.

PubMed

Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

2013-06-01

The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814-121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent.
The repetitive landscape of the chicken genome.

PubMed

Wicker, Thomas; Robertson, Jon S; Schulze, Stefan R; Feltus, F Alex; Magrini, Vincent; Morrison, Jason A; Mardis, Elaine R; Wilson, Richard K; Peterson, Daniel G; Paterson, Andrew H; Ivarie, Robert

2005-01-01

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
The repetitive landscape of the chicken genome

PubMed Central

Wicker, Thomas; Robertson, Jon S.; Schulze, Stefan R.; Feltus, F. Alex; Magrini, Vincent; Morrison, Jason A.; Mardis, Elaine R.; Wilson, Richard K.; Peterson, Daniel G.; Paterson, Andrew H.; Ivarie, Robert

2005-01-01

Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7× coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available. PMID:15256510
Information-Theoretic Uncertainty of SCFG-Modeled Folding Space of The Non-coding RNA

PubMed Central

Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I.; Malmberg, Russell L.

2012-01-01

RNA secondary structure ensembles define probability distributions for alternative equilibrium secondary structures of an RNA sequence. Shannon’s Entropy is a measure for the amount of diversity present in any ensemble. In this work, Shannon’s entropy of the SCFG ensemble on an RNA sequence is derived and implemented in polynomial time for both structurally ambiguous and unambiguous grammars. Micro RNA sequences generally have low folding entropy, as previously discovered. Surprisingly, signs of significantly high folding entropy were observed in certain ncRNA families. More effective models coupled with targeted randomization tests can lead to a better insight into folding features of these families. PMID:23160142
microRNA Expression Profiling: Technologies, Insights, and Prospects.

PubMed

Roden, Christine; Mastriano, Stephen; Wang, Nayi; Lu, Jun

2015-01-01

Since the early days of microRNA (miRNA) research, miRNA expression profiling technologies have provided important tools toward both better understanding of the biological functions of miRNAs and using miRNA expression as potential diagnostics. Multiple technologies, such as microarrays, next-generation sequencing, bead-based detection system, single-molecule measurements, and quantitative RT-PCR, have enabled accurate quantification of miRNAs and the subsequent derivation of key insights into diverse biological processes. As a class of ~22 nt long small noncoding RNAs, miRNAs present unique challenges in expression profiling that require careful experimental design and data analyses. We will particularly discuss how normalization and the presence of miRNA isoforms can impact data interpretation. We will present one example in which the consideration in data normalization has provided insights that helped to establish the global miRNA expression as a tumor suppressor. Finally, we discuss two future prospects of using miRNA profiling technologies to understand single cell variability and derive new rules for the functions of miRNA isoforms.
Genomewide Function Conservation and Phylogeny in the Herpesviridae

PubMed Central

Albà, M. Mar; Das, Rhiju; Orengo, Christine A.; Kellam, Paul

2001-01-01

The Herpesviridae are a large group of well-characterized double-stranded DNA viruses for which many complete genome sequences have been determined. We have extracted protein sequences from all predicted open reading frames of 19 herpesvirus genomes. Sequence comparison and protein sequence clustering methods have been used to construct herpesvirus protein homologous families. This resulted in 1692 proteins being clustered into 243 multiprotein families and 196 singleton proteins. Predicted functions were assigned to each homologous family based on genome annotation and published data and each family classified into seven broad functional groups. Phylogenetic profiles were constructed for each herpesvirus from the homologous protein families and used to determine conserved functions and genomewide phylogenetic trees. These trees agreed with molecular-sequence-derived trees and allowed greater insight into the phylogeny of ungulate and murine gammaherpesviruses. PMID:11156614
Insights into mechanisms of bacterial antigenic variation derived from the complete genome sequence of Anaplasma marginale.

PubMed

Palmer, Guy H; Futse, James E; Knowles, Donald P; Brayton, Kelly A

2006-10-01

Persistence of Anaplasma spp. in the animal reservoir host is required for efficient tick-borne transmission of these pathogens to animals and humans. Using A. marginale infection of its natural reservoir host as a model, persistent infection has been shown to reflect sequential cycles in which antigenic variants emerge, replicate, and are controlled by the immune system. Variation in the immunodominant outer-membrane protein MSP2 is generated by a process of gene conversion, in which unique hypervariable region sequences (HVRs) located in pseudogenes are recombined into a single operon-linked msp2 expression site. Although organisms expressing whole HVRs derived from pseudogenes emerge early in infection, long-term persistent infection is dependent on the generation of complex mosaics in which segments from different HVRs recombine into the expression site. The resulting combinatorial diversity generates the number of variants both predicted and shown to emerge during persistence.
Exome sequencing and SNP analysis detect novel compound heterozygosity in fatty acid hydroxylase-associated neurodegeneration

PubMed Central

Pierson, Tyler Mark; Simeonov, Dimitre R; Sincan, Murat; Adams, David A; Markello, Thomas; Golas, Gretchen; Fuentes-Fajardo, Karin; Hansen, Nancy F; Cherukuri, Praveen F; Cruz, Pedro; Blackstone, Craig; Tifft, Cynthia; Boerkoel, Cornelius F; Gahl, William A

2012-01-01

Fatty acid hydroxylase-associated neurodegeneration due to fatty acid 2-hydroxylase deficiency presents with a wide range of phenotypes including spastic paraplegia, leukodystrophy, and/or brain iron deposition. All previously described families with this disorder were consanguineous, with homozygous mutations in the probands. We describe a 10-year-old male, from a non-consanguineous family, with progressive spastic paraplegia, dystonia, ataxia, and cognitive decline associated with a sural axonal neuropathy. The use of high-throughput sequencing techniques combined with SNP array analyses revealed a novel paternally derived missense mutation and an overlapping novel maternally derived ∼28-kb genomic deletion in FA2H. This patient provides further insight into the consistent features of this disorder and expands our understanding of its phenotypic presentation. The presence of a sural nerve axonal neuropathy had not been previously associated with this disorder and so may extend the phenotype. PMID:22146942
DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

PubMed

Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

2016-05-03

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.
Insights in KIR2.1 channel structure and function by an evolutionary approach; cloning and functional characterization of the first reptilian inward rectifier channel KIR2.1, derived from the California kingsnake (Lampropeltis getula californiae).

PubMed

Houtman, Marien J C; Korte, Sanne M; Ji, Yuan; Kok, Bart; Vos, Marc A; Stary-Weinzinger, Anna; van der Heyden, Marcel A G

2014-10-03

Potassium inward rectifier KIR2.1 channels contribute to the stable resting membrane potential in a variety of muscle and neuronal cell-types. Mutations in the KIR2.1 gene KCNJ2 have been associated with human disease, such as cardiac arrhythmias and periodic paralysis. Crystal structure and homology modelling of KIR2.1 channels combined with functional current measurements provided valuable insights in mechanisms underlying channel function. KIR2.1 channels have been cloned and analyzed from all main vertebrate phyla, except reptilians. To address this lacuna, we set out to clone reptilian KIR2.1 channels. Using a degenerated primer set we cloned the KCNJ2 coding regions from muscle tissue of turtle, snake, bear, quail and bream, and compared their deduced amino acid sequences with those of KIR2.1 sequences from 26 different animal species obtained from Genbank. Furthermore, expression constructs were prepared for functional electrophysiological studies of ectopically expressed KIR2.1 ion channels. In general, KCNJ2 gene evolution followed normal phylogenetic patterns, however turtle KIR2.1 ion channel sequence is more homologues to avians than to snake. Alignment of all 31 KIR2.1 sequences showed that all disease causing KIR2.1 mutations, except V93I, V123G and N318S, are fully conserved. Homology models were built to provide structural insights into species specific amino acid substitutions. Snake KIR2.1 channels became expressed at the plasmamembrane and produced typical barium sensitive (IC50 ∼6μM) inward rectifier currents. Copyright © 2014 Elsevier Inc. All rights reserved.
The Bulgarian vaccine Crimean-Congo haemorrhagic fever virus strain.

PubMed

Papa, Anna; Papadimitriou, Evangelia; Christova, Iva

2011-03-01

The Crimean-Congo haemorrhagic fever virus (CCHFV) is a 3-segmented RNA virus, which causes disease with a high fatality rate in humans. An inactivated suckling mouse brain-derived vaccine is used in Bulgaria for protection against CCHF. Strain V42/81 is currently used for the vaccine preparation. As the M-RNA segment plays a major role in the immune response, the full-length M segment sequence of the V42/81 strain was characterized. A great genetic diversity was observed among CCHFV strains. In order to gain an insight into the topology of the strain in the CCHFV phylogenetic trees, the full-length S and partial L segments were additionally sequenced and analyzed.
Insights into natural products biosynthesis from analysis of 490 polyketide synthases from Fusarium.

PubMed

Brown, Daren W; Proctor, Robert H

2016-04-01

Species of the fungus Fusarium collectively cause disease on almost all crop plants and produce numerous natural products (NPs), including some of the mycotoxins of greatest concern to agriculture. Many Fusarium NPs are derived from polyketide synthases (PKSs), large multi-domain enzymes that catalyze sequential condensation of simple carboxylic acids to form polyketides. To gain insight into the biosynthesis of polyketide-derived NPs in Fusarium, we retrieved 488 PKS gene sequences from genome sequences of 31 species of the fungus. In addition to these apparently functional PKS genes, the genomes collectively included 81 pseudogenized PKS genes. Phylogenetic analysis resolved the PKS genes into 67 clades, and based on multiple lines of evidence, we propose that homologs in each clade are responsible for synthesis of a polyketide that is distinct from those synthesized by PKSs in other clades. The presence and absence of PKS genes among the species examined indicated marked differences in distribution of PKS homologs. Comparisons of Fusarium PKS genes and genes flanking them to those from other Ascomycetes provided evidence that Fusarium has the genetic potential to synthesize multiple NPs that are the same or similar to those reported in other fungi, but that have not yet been reported in Fusarium. The results also highlight ways in which such analyses can help guide identification of novel Fusarium NPs and differences in NP biosynthetic capabilities that exist among fungi. Published by Elsevier Inc.
The Alveolate Perkinsus marinus: Biological Insights from EST Gene Discovery

PubMed Central

2010-01-01

Background Perkinsus marinus, a protozoan parasite of the eastern oyster Crassostrea virginica, has devastated natural and farmed oyster populations along the Atlantic and Gulf coasts of the United States. It is classified as a member of the Perkinsozoa, a recently established phylum considered close to the ancestor of ciliates, dinoflagellates, and apicomplexans, and a key taxon for understanding unique adaptations (e.g. parasitism) within the Alveolata. Despite intense parasite pressure, no disease-resistant oysters have been identified and no effective therapies have been developed to date. Results To gain insight into the biological basis of the parasite's virulence and pathogenesis mechanisms, and to identify genes encoding potential targets for intervention, we generated >31,000 5' expressed sequence tags (ESTs) derived from four trophozoite libraries generated from two P. marinus strains. Trimming and clustering of the sequence tags yielded 7,863 unique sequences, some of which carry a spliced leader. Similarity searches revealed that 55% of these had hits in protein sequence databases, of which 1,729 had their best hit with proteins from the chromalveolates (E-value ≤ 1e-5). Some sequences are similar to those proven to be targets for effective intervention in other protozoan parasites, and include not only proteases, antioxidant enzymes, and heat shock proteins, but also those associated with relict plastids, such as acetyl-CoA carboxylase and methyl erythrithol phosphate pathway components, and those involved in glycan assembly, protein folding/secretion, and parasite-host interactions. Conclusions Our transcriptome analysis of P. marinus, the first for any member of the Perkinsozoa, contributes new insight into its biology and taxonomic position. It provides a very informative, albeit preliminary, glimpse into the expression of genes encoding functionally relevant proteins as potential targets for chemotherapy, and evidence for the presence of a relict plastid. Further, although P. marinus sequences display significant similarity to those from both apicomplexans and dinoflagellates, the presence of trans-spliced transcripts confirms the previously established affinities with the latter. The EST analysis reported herein, together with the recently completed sequence of the P. marinus genome and the development of transfection methodology, should result in improved intervention strategies against dermo disease. PMID:20374649
Bi-PROF

PubMed Central

Gries, Jasmin; Schumacher, Dirk; Arand, Julia; Lutsik, Pavlo; Markelova, Maria Rivera; Fichtner, Iduna; Walter, Jörn; Sers, Christine; Tierling, Sascha

2013-01-01

The use of next generation sequencing has expanded our view on whole mammalian methylome patterns. In particular, it provides a genome-wide insight of local DNA methylation diversity at single nucleotide level and enables the examination of single chromosome sequence sections at a sufficient statistical power. We describe a bisulfite-based sequence profiling pipeline, Bi-PROF, which is based on the 454 GS-FLX Titanium technology that allows to obtain up to one million sequence stretches at single base pair resolution without laborious subcloning. To illustrate the performance of the experimental workflow connected to a bioinformatics program pipeline (BiQ Analyzer HT) we present a test analysis set of 68 different epigenetic marker regions (amplicons) in five individual patient-derived xenograft tissue samples of colorectal cancer and one healthy colon epithelium sample as a control. After the 454 GS-FLX Titanium run, sequence read processing and sample decoding, the obtained alignments are quality controlled and statistically evaluated. Comprehensive methylation pattern interpretation (profiling) assessed by analyzing 102-104 sequence reads per amplicon allows an unprecedented deep view on pattern formation and methylation marker heterogeneity in tissues concerned by complex diseases like cancer. PMID:23803588
CRISPR interference and priming varies with individual spacer sequences

PubMed Central

Xue, Chaoyou; Seetharam, Arun S.; Musharova, Olga; Severinov, Konstantin; J. Brouns, Stan J.; Severin, Andrew J.; Sashital, Dipali G.

2015-01-01

CRISPR–Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated) systems allow bacteria to adapt to infection by acquiring ‘spacer’ sequences from invader DNA into genomic CRISPR loci. Cas proteins use RNAs derived from these loci to target cognate sequences for destruction through CRISPR interference. Mutations in the protospacer adjacent motif (PAM) and seed regions block interference but promote rapid ‘primed’ adaptation. Here, we use multiple spacer sequences to reexamine the PAM and seed sequence requirements for interference and priming in the Escherichia coli Type I-E CRISPR–Cas system. Surprisingly, CRISPR interference is far more tolerant of mutations in the seed and the PAM than previously reported, and this mutational tolerance, as well as priming activity, is highly dependent on spacer sequence. We identify a large number of functional PAMs that can promote interference, priming or both activities, depending on the associated spacer sequence. Functional PAMs are preferentially acquired during unprimed ‘naïve’ adaptation, leading to a rapid priming response following infection. Our results provide numerous insights into the importance of both spacer and target sequences for interference and priming, and reveal that priming is a major pathway for adaptation during initial infection. PMID:26586800

In-depth genome analyses of viruses from vaccine-derived rabies cases and corresponding live-attenuated oral rabies vaccines.

PubMed

Pfaff, Florian; Müller, Thomas; Freuling, Conrad M; Fehlner-Gardiner, Christine; Nadin-Davis, Susan; Robardet, Emmanuelle; Cliquet, Florence; Vuta, Vlad; Hostnik, Peter; Mettenleiter, Thomas C; Beer, Martin; Höper, Dirk

2018-02-10

Live-attenuated rabies virus strains such as those derived from the field isolate Street Alabama Dufferin (SAD) have been used extensively and very effectively as oral rabies vaccines for the control of fox rabies in both Europe and Canada. Although these vaccines are safe, some cases of vaccine-derived rabies have been detected during rabies surveillance accompanying these campaigns. In recent analysis it was shown that some commercial SAD vaccines consist of diverse viral populations, rather than clonal genotypes. For cases of vaccine-derived rabies, only consensus sequence data have been available to date and information concerning their population diversity was thus lacking. In our study, we used high-throughput sequencing to analyze 11 cases of vaccine-derived rabies, and compared their viral population diversity to the related oral rabies vaccines using pairwise Manhattan distances. This extensive deep sequencing analysis of vaccine-derived rabies cases observed during oral vaccination programs provided deeper insights into the effect of accidental in vivo replication of genetically diverse vaccine strains in the central nervous system of target and non-target species under field conditions. The viral population in vaccine-derived cases appeared to be clonal in contrast to their parental vaccines. The change from a state of high population diversity present in the vaccine batches to a clonal genotype in the affected animal may indicate the presence of a strong bottleneck during infection. In conclusion, it is very likely that these few cases are the consequence of host factors and not the result of the selection of a more virulent genotype. Furthermore, this type of vaccine-derived rabies leads to the selection of clonal genotypes and the selected variants were genetically very similar to potent SAD vaccines that have undergone a history of in vitro selection. Copyright © 2018. Published by Elsevier Ltd.
omiRas: a Web server for differential expression analysis of miRNAs derived from small RNA-Seq data.

PubMed

Müller, Sören; Rycak, Lukas; Winter, Peter; Kahl, Günter; Koch, Ina; Rotter, Björn

2013-10-15

Small RNA deep sequencing is widely used to characterize non-coding RNAs (ncRNAs) differentially expressed between two conditions, e.g. healthy and diseased individuals and to reveal insights into molecular mechanisms underlying condition-specific phenotypic traits. The ncRNAome is composed of a multitude of RNAs, such as transfer RNA, small nucleolar RNA and microRNA (miRNA), to name few. Here we present omiRas, a Web server for the annotation, comparison and visualization of interaction networks of ncRNAs derived from next-generation sequencing experiments of two different conditions. The Web tool allows the user to submit raw sequencing data and results are presented as: (i) static annotation results including length distribution, mapping statistics, alignments and quantification tables for each library as well as lists of differentially expressed ncRNAs between conditions and (ii) an interactive network visualization of user-selected miRNAs and their target genes based on the combination of several miRNA-mRNA interaction databases. The omiRas Web server is implemented in Python, PostgreSQL, R and can be accessed at: http://tools.genxpro.net/omiras/.
Complete chloroplast and ribosomal sequences for 30 accessions elucidate evolution of Oryza AA genome species

PubMed Central

Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Yu, Yeisoo; Yang, Kiwoung; Choi, Beom-Soon; Koh, Hee-Jong; Waminal, Nomar Espinosa; Choi, Hong-Il; Kim, Nam-Hoon; Jang, Woojong; Park, Hyun-Seung; Lee, Jonghoon; Lee, Hyun Oh; Joh, Ho Jun; Lee, Hyeon Ju; Park, Jee Young; Perumal, Sampath; Jayakodi, Murukarthick; Lee, Yun Sun; Kim, Backki; Copetti, Dario; Kim, Soonok; Kim, Sunggil; Lim, Ki-Byung; Kim, Young-Dong; Lee, Jungho; Cho, Kwang-Su; Park, Beom-Seok; Wing, Rod A.; Yang, Tae-Jin

2015-01-01

Cytoplasmic chloroplast (cp) genomes and nuclear ribosomal DNA (nR) are the primary sequences used to understand plant diversity and evolution. We introduce a high-throughput method to simultaneously obtain complete cp and nR sequences using Illumina platform whole-genome sequence. We applied the method to 30 rice specimens belonging to nine Oryza species. Concurrent phylogenomic analysis using cp and nR of several of specimens of the same Oryza AA genome species provides insight into the evolution and domestication of cultivated rice, clarifying three ambiguous but important issues in the evolution of wild Oryza species. First, cp-based trees clearly classify each lineage but can be biased by inter-subspecies cross-hybridization events during speciation. Second, O. glumaepatula, a South American wild rice, includes two cytoplasm types, one of which is derived from a recent interspecies hybridization with O. longistminata. Third, the Australian O. rufipogan-type rice is a perennial form of O. meridionalis. PMID:26506948
Insights into fungal communities in composts revealed by 454-pyrosequencing: implications for human health and safety.

PubMed

De Gannes, Vidya; Eudoxie, Gaius; Hickey, William J

2013-01-01

Fungal community composition in composts of lignocellulosic wastes was assessed via 454-pyrosequencing of ITS1 libraries derived from the three major composting phases. Ascomycota represented most (93%) of the 27,987 fungal sequences. A total of 102 genera, 120 species, and 222 operational taxonomic units (OTUs; >97% similarity) were identified. Thirty genera predominated (ca. 94% of the sequences), and at the species level, sequences matching Chaetomium funicola and Fusarium oxysporum were the most abundant (26 and 12%, respectively). In all composts, fungal diversity in the mature phase exceeded that of the mesophilic phase, but there was no consistent pattern in diversity changes occurring in the thermophilic phase. Fifteen species of human pathogens were identified, eight of which have not been previously identified in composts. This study demonstrated that deep sequencing can elucidate fungal community diversity in composts, and that this information can have important implications for compost use and human health.
Insights into fungal communities in composts revealed by 454-pyrosequencing: implications for human health and safety

PubMed Central

De Gannes, Vidya; Eudoxie, Gaius; Hickey, William J.

2013-01-01

Fungal community composition in composts of lignocellulosic wastes was assessed via 454-pyrosequencing of ITS1 libraries derived from the three major composting phases. Ascomycota represented most (93%) of the 27,987 fungal sequences. A total of 102 genera, 120 species, and 222 operational taxonomic units (OTUs; >97% similarity) were identified. Thirty genera predominated (ca. 94% of the sequences), and at the species level, sequences matching Chaetomium funicola and Fusarium oxysporum were the most abundant (26 and 12%, respectively). In all composts, fungal diversity in the mature phase exceeded that of the mesophilic phase, but there was no consistent pattern in diversity changes occurring in the thermophilic phase. Fifteen species of human pathogens were identified, eight of which have not been previously identified in composts. This study demonstrated that deep sequencing can elucidate fungal community diversity in composts, and that this information can have important implications for compost use and human health. PMID:23785368
Multilayer material characterization using thermographic signal reconstruction

NASA Astrophysics Data System (ADS)

Shepard, Steven M.; Beemer, Maria Frendberg

2016-02-01

Active-thermography has become a well-established Nondestructive Testing (NDT) method for detection of subsurface flaws. In its simplest form, flaw detection is based on visual identification of contrast between a flaw and local intact regions in an IR image sequence of the surface temperature as the sample responds to thermal stimulation. However, additional information and insight can be obtained from the sequence, even in the absence of a flaw, through analysis of the logarithmic derivatives of individual pixel time histories using the Thermographic Signal Reconstruction (TSR) method. For example, the response of a flaw-free multilayer sample to thermal stimulation can be viewed as a simple transition between the responses of infinitely thick samples of the individual constituent layers over the lifetime of the thermal diffusion process. The transition is represented compactly and uniquely by the logarithmic derivatives, based on the ratio of thermal effusivities of the layers. A spectrum of derivative responses relative to thermal effusivity ratios allows prediction of the time scale and detectability of the interface, and measurement of the thermophysical properties of one layer if the properties of the other are known. A similar transition between steady diffusion states occurs for flat bottom holes, based on the hole aspect ratio.
High-Throughput Single-Cell RNA Sequencing and Data Analysis.

PubMed

Sagar; Herman, Josip Stefan; Pospisilik, John Andrew; Grün, Dominic

2018-01-01

Understanding biological systems at a single cell resolution may reveal several novel insights which remain masked by the conventional population-based techniques providing an average readout of the behavior of cells. Single-cell transcriptome sequencing holds the potential to identify novel cell types and characterize the cellular composition of any organ or tissue in health and disease. Here, we describe a customized high-throughput protocol for single-cell RNA-sequencing (scRNA-seq) combining flow cytometry and a nanoliter-scale robotic system. Since scRNA-seq requires amplification of a low amount of endogenous cellular RNA, leading to substantial technical noise in the dataset, downstream data filtering and analysis require special care. Therefore, we also briefly describe in-house state-of-the-art data analysis algorithms developed to identify cellular subpopulations including rare cell types as well as to derive lineage trees by ordering the identified subpopulations of cells along the inferred differentiation trajectories.
Population-Genomic Insights into Variation in Prevotella intermedia and Prevotella nigrescens Isolates and Its Association with Periodontal Disease

PubMed Central

Zhang, Yifei; Zhen, Min; Zhan, Yalin; Song, Yeqing; Zhang, Qian; Wang, Jinfeng

2017-01-01

High-throughput sequencing has helped to reveal the close relationship between Prevotella and periodontal disease, but the roles of subspecies diversity and genomic variation within this genus in periodontal diseases still need to be investigated. We performed a comparative genome analysis of 48 Prevotella intermedia and Prevotella nigrescens isolates that from the same cohort of subjects to identify the main drivers of their pathogenicity and adaptation to different environments. The comparisons were done between two species and between disease and health based on pooled sequences. The results showed that both P. intermedia and P. nigrescens have highly dynamic genomes and can take up various exogenous factors through horizontal gene transfer. The major differences between disease-derived and health-derived samples of P. intermedia and P. nigrescens were factors related to genome modification and recombination, indicating that the Prevotella isolates from disease sites may be more capable of genomic reconstruction. We also identified genetic elements specific to each sample, and found that disease groups had more unique virulence factors related to capsule and lipopolysaccharide synthesis, secretion systems, proteinases, and toxins, suggesting that strains from disease sites may have more specific virulence, particularly for P. intermedia. The differentially represented pathways between samples from disease and health were related to energy metabolism, carbohydrate and lipid metabolism, and amino acid metabolism, consistent with data from the whole subgingival microbiome in periodontal disease and health. Disease-derived samples had gained or lost several metabolic genes compared to healthy-derived samples, which could be linked with the difference in virulence performance between diseased and healthy sample groups. Our findings suggest that P. intermedia and P. nigrescens may serve as “crucial substances” in subgingival plaque, which may reflect changes in microbial and environmental dynamics in subgingival microbial ecosystems. This provides insight into the potential of P. intermedia and P. nigrescens as new predictive biomarkers and targets for effective interventions in periodontal disease. PMID:28983469
Population-Genomic Insights into Variation in Prevotella intermedia and Prevotella nigrescens Isolates and Its Association with Periodontal Disease.

PubMed

Zhang, Yifei; Zhen, Min; Zhan, Yalin; Song, Yeqing; Zhang, Qian; Wang, Jinfeng

2017-01-01

High-throughput sequencing has helped to reveal the close relationship between Prevotella and periodontal disease, but the roles of subspecies diversity and genomic variation within this genus in periodontal diseases still need to be investigated. We performed a comparative genome analysis of 48 Prevotella intermedia and Prevotella nigrescens isolates that from the same cohort of subjects to identify the main drivers of their pathogenicity and adaptation to different environments. The comparisons were done between two species and between disease and health based on pooled sequences. The results showed that both P. intermedia and P. nigrescens have highly dynamic genomes and can take up various exogenous factors through horizontal gene transfer. The major differences between disease-derived and health-derived samples of P. intermedia and P. nigrescens were factors related to genome modification and recombination, indicating that the Prevotella isolates from disease sites may be more capable of genomic reconstruction. We also identified genetic elements specific to each sample, and found that disease groups had more unique virulence factors related to capsule and lipopolysaccharide synthesis, secretion systems, proteinases, and toxins, suggesting that strains from disease sites may have more specific virulence, particularly for P. intermedia . The differentially represented pathways between samples from disease and health were related to energy metabolism, carbohydrate and lipid metabolism, and amino acid metabolism, consistent with data from the whole subgingival microbiome in periodontal disease and health. Disease-derived samples had gained or lost several metabolic genes compared to healthy-derived samples, which could be linked with the difference in virulence performance between diseased and healthy sample groups. Our findings suggest that P. intermedia and P. nigrescens may serve as "crucial substances" in subgingival plaque, which may reflect changes in microbial and environmental dynamics in subgingival microbial ecosystems. This provides insight into the potential of P. intermedia and P. nigrescens as new predictive biomarkers and targets for effective interventions in periodontal disease.
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons

PubMed Central

Haas, Brian J.; Gevers, Dirk; Earl, Ashlee M.; Feldgarden, Mike; Ward, Doyle V.; Giannoukos, Georgia; Ciulla, Dawn; Tabbaa, Diana; Highlander, Sarah K.; Sodergren, Erica; Methé, Barbara; DeSantis, Todd Z.; Petrosino, Joseph F.; Knight, Rob; Birren, Bruce W.

2011-01-01

Bacterial diversity among environmental samples is commonly assessed with PCR-amplified 16S rRNA gene (16S) sequences. Perceived diversity, however, can be influenced by sample preparation, primer selection, and formation of chimeric 16S amplification products. Chimeras are hybrid products between multiple parent sequences that can be falsely interpreted as novel organisms, thus inflating apparent diversity. We developed a new chimera detection tool called Chimera Slayer (CS). CS detects chimeras with greater sensitivity than previous methods, performs well on short sequences such as those produced by the 454 Life Sciences (Roche) Genome Sequencer, and can scale to large data sets. By benchmarking CS performance against sequences derived from a controlled DNA mixture of known organisms and a simulated chimera set, we provide insights into the factors that affect chimera formation such as sequence abundance, the extent of similarity between 16S genes, and PCR conditions. Chimeras were found to reproducibly form among independent amplifications and contributed to false perceptions of sample diversity and the false identification of novel taxa, with less-abundant species exhibiting chimera rates exceeding 70%. Shotgun metagenomic sequences of our mock community appear to be devoid of 16S chimeras, supporting a role for shotgun metagenomics in validating novel organisms discovered in targeted sequence surveys. PMID:21212162
Facile Recovery of Individual High-Molecular-Weight, Low-Copy-Number Natural Plasmids for Genomic Sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, L.E.; Detter, C,; Barrie, K.

2006-06-01

Sequencing of the large (>50 kb), low-copy-number (<5 per cell) plasmids that mediate horizontal gene transfer has been hindered by the difficulty and expense of isolating DNA from individual plasmids of this class. We report here that a kit method previously devised for purification of bacterial artificial chromosomes (BACs) can be adapted for effective preparation of individual plasmids up to 220 kb from wild gram-negative and gram-positive bacteria. Individual plasmid DNA recovered from less than 10 ml of Escherichia coli, Staphylococcus, and Corynebacterium cultures was of sufficient quantity and quality for construction of highcoverage libraries, as shown by sequencing fivemore » native plasmids ranging in size from 30 kb to 94 kb. We also report recommendations for vector screening to optimize plasmid sequence assembly, preliminary annotation of novel plasmid genomes, and insights on mobile genetic element biology derived from these sequences. Adaptation of this BAC method for large plasmid isolation removes one major technical hurdle to expanding our knowledge of the natural plasmid gene pool.« less
Whole-genome sequencing reveals novel insights into sulfur oxidation in the extremophile Acidithiobacillus thiooxidans.

PubMed

Yin, Huaqun; Zhang, Xian; Li, Xiaoqi; He, Zhili; Liang, Yili; Guo, Xue; Hu, Qi; Xiao, Yunhua; Cong, Jing; Ma, Liyuan; Niu, Jiaojiao; Liu, Xueduan

2014-07-04

Acidithiobacillus thiooxidans (A. thiooxidans), a chemolithoautotrophic extremophile, is widely used in the industrial recovery of copper (bioleaching or biomining). The organism grows and survives by autotrophically utilizing energy derived from the oxidation of elemental sulfur and reduced inorganic sulfur compounds (RISCs). However, the lack of genetic manipulation systems has restricted our exploration of its physiology. With the development of high-throughput sequencing technology, the whole genome sequence analysis of A. thiooxidans has allowed preliminary models to be built for genes/enzymes involved in key energy pathways like sulfur oxidation. The genome of A. thiooxidans A01 was sequenced and annotated. It contains key sulfur oxidation enzymes involved in the oxidation of elemental sulfur and RISCs, such as sulfur dioxygenase (SDO), sulfide quinone reductase (SQR), thiosulfate:quinone oxidoreductase (TQO), tetrathionate hydrolase (TetH), sulfur oxidizing protein (Sox) system and their associated electron transport components. Also, the sulfur oxygenase reductase (SOR) gene was detected in the draft genome sequence of A. thiooxidans A01, and multiple sequence alignment was performed to explore the function of groups of related protein sequences. In addition, another putative pathway was found in the cytoplasm of A. thiooxidans, which catalyzes sulfite to sulfate as the final product by phosphoadenosine phosphosulfate (PAPS) reductase and adenylylsulfate (APS) kinase. This differs from its closest relative Acidithiobacillus caldus, which is performed by sulfate adenylyltransferase (SAT). Furthermore, real-time quantitative PCR analysis showed that most of sulfur oxidation genes were more strongly expressed in the S0 medium than that in the Na2S2O3 medium at the mid-log phase. Sulfur oxidation model of A. thiooxidans A01 has been constructed based on previous studies from other sulfur oxidizing strains and its genome sequence analyses, providing insights into our understanding of its physiology and further analysis of potential functions of key sulfur oxidation genes.
Whole-genome sequencing reveals novel insights into sulfur oxidation in the extremophile Acidithiobacillus thiooxidans

PubMed Central

2014-01-01

Background Acidithiobacillus thiooxidans (A. thiooxidans), a chemolithoautotrophic extremophile, is widely used in the industrial recovery of copper (bioleaching or biomining). The organism grows and survives by autotrophically utilizing energy derived from the oxidation of elemental sulfur and reduced inorganic sulfur compounds (RISCs). However, the lack of genetic manipulation systems has restricted our exploration of its physiology. With the development of high-throughput sequencing technology, the whole genome sequence analysis of A. thiooxidans has allowed preliminary models to be built for genes/enzymes involved in key energy pathways like sulfur oxidation. Results The genome of A. thiooxidans A01 was sequenced and annotated. It contains key sulfur oxidation enzymes involved in the oxidation of elemental sulfur and RISCs, such as sulfur dioxygenase (SDO), sulfide quinone reductase (SQR), thiosulfate:quinone oxidoreductase (TQO), tetrathionate hydrolase (TetH), sulfur oxidizing protein (Sox) system and their associated electron transport components. Also, the sulfur oxygenase reductase (SOR) gene was detected in the draft genome sequence of A. thiooxidans A01, and multiple sequence alignment was performed to explore the function of groups of related protein sequences. In addition, another putative pathway was found in the cytoplasm of A. thiooxidans, which catalyzes sulfite to sulfate as the final product by phosphoadenosine phosphosulfate (PAPS) reductase and adenylylsulfate (APS) kinase. This differs from its closest relative Acidithiobacillus caldus, which is performed by sulfate adenylyltransferase (SAT). Furthermore, real-time quantitative PCR analysis showed that most of sulfur oxidation genes were more strongly expressed in the S0 medium than that in the Na2S2O3 medium at the mid-log phase. Conclusion Sulfur oxidation model of A. thiooxidans A01 has been constructed based on previous studies from other sulfur oxidizing strains and its genome sequence analyses, providing insights into our understanding of its physiology and further analysis of potential functions of key sulfur oxidation genes. PMID:24993543
Effects of support size and orientation on symmetric gaits in free-ranging tamarins of Amazonian Peru: implications for the functional significance of primate gait sequence patterns.

PubMed

Nyakatura, John A; Heymann, Eckhard W

2010-03-01

The adoption of a specific gait sequence pattern during symmetrical locomotion has been proposed to have been a key advantage for the exploitation of the fine branch niche in early primates. Diverse aspects of primate locomotion have been extensively studied in technically equipped laboratory settings, but evolutionary conclusions derived from these investigations have rarely been verified in wild primates. Bridging the gap from the lab to the field, we conducted an actual performance determination of symmetrical gaits in two free-ranging tamarin species (Saguinus mystax and Saguinus fuscicollis) of Amazonian Peru by analyzing high-speed video recordings of naturally occurring locomotor bouts. Tamarins arguably represent viable models for aspects of early primate locomotion. We tested three specific hypotheses derived from laboratory studies to test for the influence of support size and orientation and to gain further insight into the functional significance of primate gait sequence patterns: (1) The tamarins utilize symmetrical gaits at a higher rate on small supports than on larger ones. (2) During symmetrical locomotion on small supports, diagonal sequences are utilized at a higher rate than on larger supports. (3) On inclines, diagonal sequences are predominantly used and on declines, lateral sequences are predominantly used. Our results corroborated hypotheses 1 and 3. We found no clear support for hypothesis 2. In conclusion, our results add to the notion that primate gait plasticity, rather than uniform adoption of diagonal sequence gaits, enabled early primates to accommodate different support types and effectively exploit the small branch niche. Copyright 2009 Elsevier Ltd. All rights reserved.
Assessment of fungal diversity in a water-damaged office building.

PubMed

Green, Brett J; Lemons, Angela R; Park, Yeonmi; Cox-Ganser, Jean M; Park, Ju-Hyeong

2017-04-01

Recent studies have described fungal communities in indoor environments using gene sequencing-based approaches. In this study, dust-borne fungal communities were elucidated from a water-damaged office building located in the northeastern region of the United States using internal transcribed spacer (ITS) rRNA gene sequencing. Genomic DNA was extracted from 5 mg of floor dust derived from 22 samples collected from either the lower floors (n = 8) or a top floor (n = 14) of the office building. ITS gene sequencing resolved a total of 933 ITS sequences and was clustered into 216 fungal operational taxonomic units (OTUs). Analysis of fungal OTUs at the 97% similarity threshold showed a difference between the lower and top floors that was marginally significant (p = 0.049). Species richness and diversity indices were reduced in the lower floor samples compared to the top floor samples and there was a high degree of compositional dissimilarity within and between the two different areas within the building. Fungal OTUs were placed in the phyla Ascomycota (55%), Basidiomycota (41%), Zygomycota (3%), Glomeromycota (0.4%), Chytridiomycota (0.3%), and unassigned fungi (0.5%). The Ascomycota classes with the highest relative abundances included the Dothideomycetes (30%) and Eurotiomycetes (16%). The Basidiomycota consisted of the classes Ustilaginomycetes (14%), Tremellomycetes (11%), and Agaricomycetes (8%). Sequence reads derived from the plant pathogen Ustilago syntherismae were the most abundant in the analysis as were obligate Basidiomycota yeast species that accounted for 12% and 11% of fungal ITS sequences, respectively. ITS gene sequencing provides additional insight into the diversity of fungal OTUs. These data further highlight the contribution of fungi placed in the phylum Basidiomycota, obligate yeasts, as well as xerophilic species that are typically not resolved using traditional culture methods.
Unveiling the metabolic potential of two soil-derived microbial consortia selected on wheat straw

PubMed Central

Jiménez, Diego Javier; Chaves-Moreno, Diego; van Elsas, Jan Dirk

2015-01-01

Based on the premise that plant biomass can be efficiently degraded by mixed microbial cultures and/or enzymes, we here applied a targeted metagenomics-based approach to explore the metabolic potential of two forest soil-derived lignocellulolytic microbial consortia, denoted RWS and TWS (bred on wheat straw). Using the metagenomes of three selected batches of two experimental systems, about 1.2 Gb of sequence was generated. Comparative analyses revealed an overrepresentation of predicted carbohydrate transporters (ABC, TonB and phosphotransferases), two-component sensing systems and β-glucosidases/galactosidases in the two consortia as compared to the forest soil inoculum. Additionally, “profiling” of carbohydrate-active enzymes showed significant enrichments of several genes encoding glycosyl hydrolases of families GH2, GH43, GH92 and GH95. Sequence analyses revealed these to be most strongly affiliated to genes present on the genomes of Sphingobacterium, Bacteroides, Flavobacterium and Pedobacter spp. Assembly of the RWS and TWS metagenomes generated 16,536 and 15,902 contigs of ≥10 Kb, respectively. Thirteen contigs, containing 39 glycosyl hydrolase genes, constitute novel (hemi)cellulose utilization loci with affiliation to sequences primarily found in the Bacteroidetes. Overall, this study provides deep insight in the plant polysaccharide degrading capabilities of microbial consortia bred from forest soil, highlighting their biotechnological potential. PMID:26343383
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

PubMed Central

Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

2013-01-01

Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511
Molecular evidence for a uniform microbial community in sponges from different oceans.

PubMed

Hentschel, Ute; Hopke, Jörn; Horn, Matthias; Friedrich, Anja B; Wagner, Michael; Hacker, Jörg; Moore, Bradley S

2002-09-01

Sponges (class Porifera) are evolutionarily ancient metazoans that populate the tropical oceans in great abundances but also occur in temperate regions and even in freshwater. Sponges contain large numbers of bacteria that are embedded within the animal matrix. The phylogeny of these bacteria and the evolutionary age of the interaction are virtually unknown. In order to provide insights into the species richness of the microbial community of sponges, we performed a comprehensive diversity survey based on 190 sponge-derived 16S ribosomal DNA (rDNA) sequences. The sponges Aplysina aerophoba and Theonella swinhoei were chosen for construction of the bacterial 16S rDNA library because they are taxonomically distantly related and they populate nonoverlapping geographic regions. In both sponges, a uniform microbial community was discovered whose phylogenetic signature is distinctly different from that of marine plankton or marine sediments. Altogether 14 monophyletic, sponge-specific sequence clusters were identified that belong to at least seven different bacterial divisions. By definition, the sequences of each cluster are more closely related to each other than to a sequence from nonsponge sources. These monophyletic clusters comprise 70% of all publicly available sponge-derived 16S rDNA sequences, reflecting the generality of the observed phenomenon. This shared microbial fraction represents the smallest common denominator of the sponges investigated in this study. Bacteria that are exclusively found in certain host species or that occur only transiently would have been missed. A picture emerges where sponges can be viewed as highly concentrated reservoirs of so far uncultured and elusive marine microorganisms.
Phylogenetic Analysis of Aedes aegypti Based on Mitochondrial ND4 Gene Sequences in Almadinah, Saudi Arabia.

PubMed

Ali, Khalil H Al; El-Badry, Ayman A; Ali, Mouhanad Al; El-Sayed, Wael S M; El-Beshbishy, Hesham A

2016-06-01

Aedes aegypti is the main vector of the yellow fever and dengue virus. This mosquito has become the major indirect cause of morbidity and mortality of the human worldwide. Dengue virus activity has been reported recently in the western areas of Saudi Arabia. There is no vaccine for dengue virus until now, and the control of the disease depends on the control of the vector. The present study has aimed to perform phylogenetic analysis of Aedes aegypti based on mitochondrial NADH dehydrogenase subunit 4 ( ND4 ) gene at Almadinah, Saudi Arabia in order to get further insight into the epidemiology and transmission of this vector. Mitochondrial ND4 gene was sequenced in the eight isolated Aedes aegypti mosquitoes from Almadinah, Saudi Arabia, sequences were aligned, and phylogenetic analysis were performed and compared with 54 sequences of Aedes reported in the previous studies from Mexico, Thailand, Brazil, and Africa. Our results suggest that increased gene flow among Aedes aegypti populations occurs between Africa and Saudi Arabia. Phylogenetic relationship analysis showed two genetically distinct Aedes aegypti in Saudi Arabia derived from dual African ancestor.
Microbial community structure in the gut of the New Zealand insect Auckland tree weta (Hemideina thoracica).

PubMed

Waite, David W; Dsouza, Melissa; Biswas, Kristi; Ward, Darren F; Deines, Peter; Taylor, Michael W

2015-05-01

The endemic New Zealand weta is an enigmatic insect. Although the insect is well known by its distinctive name, considerable size, and morphology, many basic aspects of weta biology remain unknown. Here, we employed cultivation-independent enumeration techniques and rRNA gene sequencing to investigate the gut microbiota of the Auckland tree weta (Hemideina thoracica). Fluorescence in situ hybridisation performed on different sections of the gut revealed a bacterial community of fluctuating density, while rRNA gene-targeted amplicon pyrosequencing revealed the presence of a microbial community containing high bacterial diversity, but an apparent absence of archaea. Bacteria were further studied using full-length 16S rRNA gene sequences, with statistical testing of bacterial community membership against publicly available termite- and cockroach-derived sequences, revealing that the weta gut microbiota is similar to that of cockroaches. These data represent the first analysis of the weta microbiota and provide initial insights into the potential function of these microorganisms.

Using next generation transcriptome sequencing to predict an ectomycorrhizal metablome.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Larsen, P. E.; Sreedasyam, A.; Trivedi, G

Mycorrhizae, symbiotic interactions between soil fungi and tree roots, are ubiquitous in terrestrial ecosystems. The fungi contribute phosphorous, nitrogen and mobilized nutrients from organic matter in the soil and in return the fungus receives photosynthetically-derived carbohydrates. This union of plant and fungal metabolisms is the mycorrhizal metabolome. Understanding this symbiotic relationship at a molecular level provides important contributions to the understanding of forest ecosystems and global carbon cycling. We generated next generation short-read transcriptomic sequencing data from fully-formed ectomycorrhizae between Laccaria bicolor and aspen (Populus tremuloides) roots. The transcriptomic data was used to identify statistically significantly expressed gene models usingmore » a bootstrap-style approach, and these expressed genes were mapped to specific metabolic pathways. Integration of expressed genes that code for metabolic enzymes and the set of expressed membrane transporters generates a predictive model of the ectomycorrhizal metabolome. The generated model of mycorrhizal metabolome predicts that the specific compounds glycine, glutamate, and allantoin are synthesized by L. bicolor and that these compounds or their metabolites may be used for the benefit of aspen in exchange for the photosynthetically-derived sugars fructose and glucose. The analysis illustrates an approach to generate testable biological hypotheses to investigate the complex molecular interactions that drive ectomycorrhizal symbiosis. These models are consistent with experimental environmental data and provide insight into the molecular exchange processes for organisms in this complex ecosystem. The method used here for predicting metabolomic models of mycorrhizal systems from deep RNA sequencing data can be generalized and is broadly applicable to transcriptomic data derived from complex systems.« less
The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes

PubMed Central

Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

2012-01-01

The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined with the lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome have less genes (65) with a coding faction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

PubMed

Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-09-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

PubMed Central

Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-01-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341
The Early ANTP Gene Repertoire: Insights from the Placozoan Genome

PubMed Central

Schierwater, Bernd; Kamm, Kai; Srivastava, Mansi; Rokhsar, Daniel; Rosengarten, Rafael D.; Dellaporta, Stephen L.

2008-01-01

The evolution of ANTP genes in the Metazoa has been the subject of conflicting hypotheses derived from full or partial gene sequences and genomic organization in higher animals. Whole genome sequences have recently filled in some crucial gaps for the basal metazoan phyla Cnidaria and Porifera. Here we analyze the complete genome of Trichoplax adhaerens, representing the basal metazoan phylum Placozoa, for its set of ANTP class genes. The Trichoplax genome encodes representatives of Hox/ParaHox-like, NKL, and extended Hox genes. This repertoire possibly mirrors the condition of a hypothetical cnidarian-bilaterian ancestor. The evolution of the cnidarian and bilaterian ANTP gene repertoires can be deduced by a limited number of cis-duplications of NKL and “extended Hox” genes and the presence of a single ancestral “ProtoHox” gene. PMID:18716659
Characterization of Hepatitis C Virus (HCV) Envelope Diversification from Acute to Chronic Infection within a Sexually Transmitted HCV Cluster by Using Single-Molecule, Real-Time Sequencing

PubMed Central

Ho, Cynthia K. Y.; Raghwani, Jayna; Koekkoek, Sylvie; Liang, Richard H.; Van der Meer, Jan T. M.; Van Der Valk, Marc; De Jong, Menno; Pybus, Oliver G.

2016-01-01

ABSTRACT In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms. IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. PMID:28077634
Amyloid-like self-assembly of peptide sequences from the adenovirus fiber shaft: insights from molecular dynamics simulations.

PubMed

Tamamis, Phanourios; Kasotakis, Emmanouil; Mitraki, Anna; Archontis, Georgios

2009-11-26

The self-assembly of peptides and proteins into nanostructures is related to the fundamental problems of protein folding and misfolding and has potential applications in medicine, materials science and nanotechnology. Natural peptides, corresponding to sequence repeats from self-assembling proteins, may constitute elementary building blocks of such nanostructures. In this work, we study by implicit-solvent replica-exchange simulations the self-assembly of two amyloidogenic sequences derived from the naturally occurring fiber shaft of the adenovirus, the octapeptide NSGAITIG (asparagine-serine-glycine-alanine-isoleucine-threonine-isoleucine-glycine) and its hexapeptide counterpart, GAITIG. In accordance with their amyloidogenic capacity, both peptides form readily intermolecular beta-sheets, stabilized by extensive main- and side-chain contacts involving the C-terminal moieties (segments 3-8 and 2-6, respectively). The structural and energetic properties of these sheets are analyzed extensively. The N-terminal residues Asn1 and Ser2 of the octapeptide remain disordered in the sheets, suggesting that these residues are exposed at the exterior of the fibrils and accessible. On the basis of insight provided by the simulations, cysteine residues were recently substituted at positions 1 and 2 of NSGAITIG; the newly designed peptides maintain their amyloidogenic properties and can bind to silver, gold and platinum nanoparticles [Kasotakis et al. Biopolymers 2009, 92, 164-172]. Computational investigation can identify suitable positions for rational modification of peptide building blocks, aiming at the fabrication of novel biomaterials.
Identification of mediator complex 26 (Crsp7) gametologs on platypus X1 and Y5 sex chromosomes: a candidate testis-determining gene in monotremes?

PubMed

Tsend-Ayush, Enkhjargal; Kortschak, R Daniel; Bernard, Pascal; Lim, Shu Ly; Ryan, Janelle; Rosenkranz, Ruben; Borodina, Tatiana; Dohm, Juliane C; Himmelbauer, Heinz; Harley, Vincent R; Grützner, Frank

2012-01-01

The basal lineage of monotremes features an extraordinarily complex sex chromosome system which has provided novel insights into the evolution of mammalian sex chromosomes. Recently, sequence information from autosomes, X chromosomes, and XY-shared pseudoautosomal regions has become available. However, no gene has so far been described on any of the Y chromosome-specific regions. We analyzed sequences derived from Y-specific BAC clones to identify genes with potentially male-specific function. Here, we report the identification and characterization of the mediator complex protein gametologs on platypus Y5 (Crspy). We also identified the X-chromosomal copy which unexpectedly maps to X1 (Crspx). Sequence comparison shows extensive divergence between the X and Y copy, but we found no significant positive selection on either gametolog. Expression analysis shows widespread expression of Crspx. Crspy is expressed exclusively in males with particularly strong expression in testis and kidney. Reporter gene assays to investigate whether Crspx/y can act on the recently discovered mouse Sox9 testis-specific enhancer element did reveal a modest effect together with mouse Sox9 + Sf1, but showed overall no significant upregulation of the reporter gene. This is the first report of a differentiated functional male-specific gene on platypus Y chromosomes, providing new insights into sex chromosome evolution and a candidate gene for male-specific function in monotremes.
Deformation of the western Indian Plate boundary: insights from differential and multi-aperture InSAR data inversion for the 2008 Baluchistan (Western Pakistan) seismic sequence

NASA Astrophysics Data System (ADS)

Pezzo, Giuseppe; Merryman Boncori, John Peter; Atzori, Simone; Antonioli, Andrea; Salvi, Stefano

2014-07-01

In this study, we use Differential Synthetic Aperture Radar Interferometry (DInSAR) and multi-aperture interferometry (MAI) to constrain the sources of the three largest events of the 2008 Baluchistan (western Pakistan) seismic sequence, namely two Mw 6.4 events only 12 hr apart and an Mw 5.7 event that occurred 40 d later. The sequence took place in the Quetta Syntaxis, the most seismically active region of Baluchistan, tectonically located between the colliding Indian Plate and the Afghan Block of the Eurasian Plate. Surface displacements estimated from ascending and descending ENVISAT ASAR acquisitions were used to derive elastic dislocation models for the sources of the two main events. The estimated slip distributions have peak values of 120 and 130 cm on a pair of almost parallel and near-vertical faults striking NW-SE, and of 50 cm and 60 cm on two high-angle faults striking NE-SW. Values up to 50 cm were found for the largest aftershock on an NE-SW fault located between the sources of the main shocks. The MAI measurements, with their high sensitivity to the north-south motion component, are crucial in this area to accurately describe the coseismic displacement field. Our results provide insight into the deformation style of the Quetta Syntaxis, suggesting that right-lateral slip released at shallow depths on large NW fault planes is compatible with left-lateral activation on smaller NE-SW faults.
Sequence-Dependent Self-Assembly and Structural Diversity of Islet Amyloid Polypeptide-Derived β-Sheet Fibrils

DOE PAGES

Wang, Shih-Ting; Lin, Yiyang; Spencer, Ryan K.; ...

2017-08-03

Determining the structural origins of amyloid fibrillation is essential for understanding both the pathology of amyloidosis and the rational design of inhibitors to prevent or reverse amyloid formation. In this work, the decisive roles of peptide structures on amyloid self-assembly and morphological diversity were investigated by the design of eight amyloidogenic peptides derived from islet amyloid polypeptide. Among the segments, two distinct morphologies were highlighted in the form of twisted and planar (untwisted) ribbons with varied diameters, thicknesses, and lengths. In particular, transformation of amyloid fibrils from twisted ribbons into untwisted structures was triggered by substitution of the C-terminal serinemore » with threonine, where the side chain methyl group was responsible for the distinct morphological change. This effect was confirmed following serine substitution with alanine and valine and was ascribed to the restriction of intersheet torsional strain through the increased hydrophobic interactions and hydrogen bonding. We also studied the variation of fibril morphology (i.e., association and helicity) and peptide aggregation propensity by increasing the hydrophobicity of the peptide side group, capping the N-terminus, and extending sequence length. Lastly, we anticipate that our insights into sequence-dependent fibrillation and morphological diversity will shed light on the structural interpretation of amyloidogenesis and development of structure-specific imaging agents and aggregation inhibitors.« less
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)

NASA Astrophysics Data System (ADS)

Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd

2018-04-01

The three-dimensional structure of a protein can provide insights regarding its function. Functional relationship between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to conclude homology between proteins with similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellation of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs since available known 3D motifs currently, could not compensate the ever-increasing numbers of uncharacterized proteins to be annotated. Here, a computational platform for an automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching of hypothetical patterns of amino acid side chains in Protein Data Bank (PDB), without the need for prior knowledge on related sequence or structure of pattern of interest. We present an example of the searches, which is the detection of a hypothetical pattern derived from known structural motif of C2H2 structural pattern from zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute in improving functional association between proteins.
Cell and molecular biology of the spiny dogfish Squalus acanthias and little skate Leucoraja erinacea: insights from in vitro cultured cells.

PubMed

Barnes, D W

2012-04-01

Two of the most commonly used elasmobranch experimental model species are the spiny dogfish Squalus acanthias and the little skate Leucoraja erinacea. Comparative biology and genomics with these species have provided useful information in physiology, pharmacology, toxicology, immunology, evolutionary developmental biology and genetics. A wealth of information has been obtained using in vitro approaches to study isolated cells and tissues from these organisms under circumstances in which the extracellular environment can be controlled. In addition to classical work with primary cell cultures, continuously proliferating cell lines have been derived recently, representing the first cell lines from cartilaginous fishes. These lines have proved to be valuable tools with which to explore functional genomic and biological questions and to test hypotheses at the molecular level. In genomic experiments, complementary (c)DNA libraries have been constructed, and c. 8000 unique transcripts identified, with over 3000 representing previously unknown gene sequences. A sub-set of messenger (m)RNAs has been detected for which the 3' untranslated regions show elements that are remarkably well conserved evolutionarily, representing novel, potentially regulatory gene sequences. The cell culture systems provide physiologically valid tools to study functional roles of these sequences and other aspects of elasmobranch molecular cell biology and physiology. Information derived from the use of in vitro cell cultures is valuable in revealing gene diversity and information for genomic sequence assembly, as well as for identification of new genes and molecular markers, construction of gene-array probes and acquisition of full-length cDNA sequences. © 2012 The Author. Journal of Fish Biology © 2012 The Fisheries Society of the British Isles.
Gene regulation in amphioxus: An insight from transgenic studies in amphioxus and vertebrates.

PubMed

Kozmikova, Iryna; Kozmik, Zbynek

2015-12-01

Cephalochordates, commonly known as amphioxus or lancelets, are the most basal subphylum of chordates. Cephalochordates are thus key to understanding the origin of vertebrates and molecular mechanisms underlying vertebrate evolution. The evolution of developmental control mechanisms during invertebrate-to-vertebrate transition involved not only gene duplication events, but also specific changes in spatial and temporal expression of many genes. To get insight into the spatiotemporal regulation of gene expression during invertebrate-to-vertebrate transition, functional studies of amphioxus gene regulatory elements are highly warranted. Here, we review transgenic studies performed in amphioxus and vertebrates using promoters and enhancers derived from the genome of Branchiostoma floridae. We describe the current methods of transgenesis in amphioxus, provide evidence of Tol2 transposon-generated transgenic embryos of Branchiostoma lanceolatum and discuss possible future directions. We envision that comparative transgenic analysis of gene regulatory sequences in the context of amphioxus and vertebrate embryos will likely provide an important mechanistic insight into the evolution of vertebrate body plan. Copyright © 2015 Elsevier B.V. All rights reserved.
F-Nets and Software Cabling: Deriving a Formal Model and Language for Portable Parallel Programming

NASA Technical Reports Server (NTRS)

DiNucci, David C.; Saini, Subhash (Technical Monitor)

1998-01-01

Parallel programming is still being based upon antiquated sequence-based definitions of the terms "algorithm" and "computation", resulting in programs which are architecture dependent and difficult to design and analyze. By focusing on obstacles inherent in existing practice, a more portable model is derived here, which is then formalized into a model called Soviets which utilizes a combination of imperative and functional styles. This formalization suggests more general notions of algorithm and computation, as well as insights into the meaning of structured programming in a parallel setting. To illustrate how these principles can be applied, a very-high-level graphical architecture-independent parallel language, called Software Cabling, is described, with many of the features normally expected from today's computer languages (e.g. data abstraction, data parallelism, and object-based programming constructs).
Studying the organization of genes encoding plant cell wall degrading enzymes in Chrysomela tremula provides insights into a leaf beetle genome.

PubMed

Pauchet, Y; Saski, C A; Feltus, F A; Luyten, I; Quesneville, H; Heckel, D G

2014-06-01

The ability of herbivorous beetles from the superfamilies Chrysomeloidea and Curculionoidea to degrade plant cell wall polysaccharides has only recently begun to be appreciated. The presence of plant cell wall degrading enzymes (PCWDEs) in the beetle's digestive tract makes this degradation possible. Sequences encoding these beetle-derived PCWDEs were originally identified from transcriptomes and strikingly resemble those of saprophytic and phytopathogenic microorganisms, raising questions about their origin; e.g. are they insect- or microorganism-derived? To demonstrate unambiguously that the genes encoding PCWDEs found in beetle transcriptomes are indeed of insect origin, we generated a bacterial artificial chromosome library from the genome of the leaf beetle Chrysomela tremula, containing 18 432 clones with an average size of 143 kb. After hybridizing this library with probes derived from 12 C. tremula PCWDE-encoding genes and sequencing the positive clones, we demonstrated that the latter genes are encoded by the insect's genome and are surrounded by genes possessing orthologues in the genome of Tribolium castaneum as well as in three other beetle genomes. Our analyses showed that although the level of overall synteny between C. tremula and T. castaneum seems high, the degree of microsynteny between both species is relatively low, in contrast to the more closely related Colorado potato beetle. © 2014 The Royal Entomological Society.
On the distribution of interspecies correlation for Markov models of character evolution on Yule trees.

PubMed

Mulder, Willem H; Crawford, Forrest W

2015-01-07

Efforts to reconstruct phylogenetic trees and understand evolutionary processes depend fundamentally on stochastic models of speciation and mutation. The simplest continuous-time model for speciation in phylogenetic trees is the Yule process, in which new species are "born" from existing lineages at a constant rate. Recent work has illuminated some of the structural properties of Yule trees, but it remains mostly unknown how these properties affect sequence and trait patterns observed at the tips of the phylogenetic tree. Understanding the interplay between speciation and mutation under simple models of evolution is essential for deriving valid phylogenetic inference methods and gives insight into the optimal design of phylogenetic studies. In this work, we derive the probability distribution of interspecies covariance under Brownian motion and Ornstein-Uhlenbeck models of phenotypic change on a Yule tree. We compute the probability distribution of the number of mutations shared between two randomly chosen taxa in a Yule tree under discrete Markov mutation models. Our results suggest summary measures of phylogenetic information content, illuminate the correlation between site patterns in sequences or traits of related organisms, and provide heuristics for experimental design and reconstruction of phylogenetic trees. Copyright © 2014 Elsevier Ltd. All rights reserved.
De novo selection of oncogenes.

PubMed

Chacón, Kelly M; Petti, Lisa M; Scheideman, Elizabeth H; Pirazzoli, Valentina; Politi, Katerina; DiMaio, Daniel

2014-01-07

All cellular proteins are derived from preexisting ones by natural selection. Because of the random nature of this process, many potentially useful protein structures never arose or were discarded during evolution. Here, we used a single round of genetic selection in mouse cells to isolate chemically simple, biologically active transmembrane proteins that do not contain any amino acid sequences from preexisting proteins. We screened a retroviral library expressing hundreds of thousands of proteins consisting of hydrophobic amino acids in random order to isolate four 29-aa proteins that induced focus formation in mouse and human fibroblasts and tumors in mice. These proteins share no amino acid sequences with known cellular or viral proteins, and the simplest of them contains only seven different amino acids. They transformed cells by forming a stable complex with the platelet-derived growth factor β receptor transmembrane domain and causing ligand-independent receptor activation. We term this approach de novo selection and suggest that it can be used to generate structures and activities not observed in nature, create prototypes for novel research reagents and therapeutics, and provide insight into cell biology, transmembrane protein-protein interactions, and possibly virus evolution and the origin of life.
Terminal Restriction Fragment Length Polymorphism Analysis Program, a Web-Based Research Tool for Microbial Community Analysis

PubMed Central

Marsh, Terence L.; Saxman, Paul; Cole, James; Tiedje, James

2000-01-01

Rapid analysis of microbial communities has proven to be a difficult task. This is due, in part, to both the tremendous diversity of the microbial world and the high complexity of many microbial communities. Several techniques for community analysis have emerged over the past decade, and most take advantage of the molecular phylogeny derived from 16S rRNA comparative sequence analysis. We describe a web-based research tool located at the Ribosomal Database Project web site (http://www.cme.msu.edu/RDP/html/analyses.html) that facilitates microbial community analysis using terminal restriction fragment length polymorphism of 16S ribosomal DNA. The analysis function (designated TAP T-RFLP) permits the user to perform in silico restriction digestions of the entire 16S sequence database and derive terminal restriction fragment sizes, measured in base pairs, from the 5′ terminus of the user-specified primer to the 3′ terminus of the restriction endonuclease target site. The output can be sorted and viewed either phylogenetically or by size. It is anticipated that the site will guide experimental design as well as provide insight into interpreting results of community analysis with terminal restriction fragment length polymorphisms. PMID:10919828
The phylogeny of yellow fever virus 17D vaccines.

PubMed

Stock, Nina K; Boschetti, Nicola; Herzog, Christian; Appelhans, Marc S; Niedrig, Matthias

2012-02-01

In recent years the safety of the yellow fever live vaccine 17D came under scrutiny. The focus was on serious adverse events after vaccinations that resemble a wild type infection with yellow fever and whose reasons are still not known. Also the exact mechanism of attenuation of the vaccine remains unknown to this day. In this context, the standards of safety and surveillance in vaccine production and administration have been discussed. Therein embodied was the demand for improved documentation of the derivation of the seed virus used for yellow fever vaccine production. So far, there was just a historical genealogy available that is based on source area and passage level. However, there is a need for a documentation based on molecular information to get better insights into the mechanisms of pathology. In this work we sequenced the whole genome of different passages of the YFV-17D strain used by Crucell Switzerland AG for vaccine production. Using all other publically available 17D full genome sequences we compared the sequence variance of all vaccine strains and oppose a phylogenetic tree based on full genome sequences to the historical genealogy. Copyright © 2011 Elsevier Ltd. All rights reserved.
Global biogeographic sampling of bacterial secondary metabolism

PubMed Central

Charlop-Powers, Zachary; Owen, Jeremy G; Reddy, Boojala Vijay B; Ternei, Melinda A; Guimarães, Denise O; de Frias, Ulysses A; Pupo, Monica T; Seepe, Prudy; Feng, Zhiyang; Brady, Sean F

2015-01-01

Recent bacterial (meta)genome sequencing efforts suggest the existence of an enormous untapped reservoir of natural-product-encoding biosynthetic gene clusters in the environment. Here we use the pyro-sequencing of PCR amplicons derived from both nonribosomal peptide adenylation domains and polyketide ketosynthase domains to compare biosynthetic diversity in soil microbiomes from around the globe. We see large differences in domain populations from all except the most proximal and biome-similar samples, suggesting that most microbiomes will encode largely distinct collections of bacterial secondary metabolites. Our data indicate a correlation between two factors, geographic distance and biome-type, and the biosynthetic diversity found in soil environments. By assigning reads to known gene clusters we identify hotspots of biomedically relevant biosynthetic diversity. These observations not only provide new insights into the natural world, they also provide a road map for guiding future natural products discovery efforts. DOI: http://dx.doi.org/10.7554/eLife.05048.001 PMID:25599565

Comparison against 186 canid whole-genome sequences reveals survival strategies of an ancient clonally transmissible canine tumor

PubMed Central

Decker, Brennan; Davis, Brian W.; Rimbault, Maud; Long, Adrienne H.; Karlins, Eric; Jagannathan, Vidhya; Reiman, Rebecca; Parker, Heidi G.; Drögemüller, Cord; Corneveaux, Jason J.; Chapman, Erica S.; Trent, Jeffery M.; Leeb, Tosso; Huentelman, Matthew J.; Wayne, Robert K.; Karyadi, Danielle M.; Ostrander, Elaine A.

2015-01-01

Canine transmissible venereal tumor (CTVT) is a parasitic cancer clone that has propagated for thousands of years via sexual transfer of malignant cells. Little is understood about the mechanisms that converted an ancient tumor into the world's oldest known continuously propagating somatic cell lineage. We created the largest existing catalog of canine genome-wide variation and compared it against two CTVT genome sequences, thereby separating alleles derived from the founder's genome from somatic mutations that must drive clonal transmissibility. We show that CTVT has undergone continuous adaptation to its transmissible allograft niche, with overlapping mutations at every step of immunosurveillance, particularly self-antigen presentation and apoptosis. We also identified chronologically early somatic mutations in oncogenesis- and immune-related genes that may represent key initiators of clonal transmissibility. Thus, we provide the first insights into the specific genomic aberrations that underlie CTVT's dogged perseverance in canids around the world. PMID:26232412
Geostationary Lightning Mapper: Lessons Learned from Post Launch Test

NASA Astrophysics Data System (ADS)

Edgington, S.; Tillier, C. E.; Demroff, H.; VanBezooijen, R.; Christian, H. J., Jr.; Bitzer, P. M.

2017-12-01

Pre-launch calibration and algorithm design for the GOES Geostationary Lightning Mapper resulted in a successful and trouble-free on-orbit activation and post-launch test sequence. Within minutes of opening the GLM aperture door on January 4th, 2017, lightning was detected across the entire field of view. During the six-month post-launch test period, numerous processing parameters on board the instrument and in the ground processing algorithms were fine-tuned. Demonstrated on-orbit performance exceeded pre-launch predictions. We provide an overview of the ground calibration sequence, on-orbit tuning of the instrument, tuning of the ground processing algorithms (event filtering and navigation). We also touch on new insights obtained from analysis of a large and growing archive of raw GLM data, containing 3e8 flash detections derived from over 1e10 full-disk images of the Earth.
Complete Genome Sequence and Comparative Analysis of the Fish Pathogen Lactococcus garvieae

PubMed Central

Oshima, Kenshiro; Yoshizaki, Mariko; Kawanishi, Michiko; Nakaya, Kohei; Suzuki, Takehito; Miyauchi, Eiji; Ishii, Yasuo; Tanabe, Soichi; Murakami, Masaru; Hattori, Masahira

2011-01-01

Lactococcus garvieae causes fatal haemorrhagic septicaemia in fish such as yellowtail. The comparative analysis of genomes of a virulent strain Lg2 and a non-virulent strain ATCC 49156 of L. garvieae revealed that the two strains shared a high degree of sequence identity, but Lg2 had a 16.5-kb capsule gene cluster that is absent in ATCC 49156. The capsule gene cluster was composed of 15 genes, of which eight genes are highly conserved with those in exopolysaccharide biosynthesis gene cluster often found in Lactococcus lactis strains. Sequence analysis of the capsule gene cluster in the less virulent strain L. garvieae Lg2-S, Lg2-derived strain, showed that two conserved genes were disrupted by a single base pair deletion, respectively. These results strongly suggest that the capsule is crucial for virulence of Lg2. The capsule gene cluster of Lg2 may be a genomic island from several features such as the presence of insertion sequences flanked on both ends, different GC content from the chromosomal average, integration into the locus syntenic to other lactococcal genome sequences, and distribution in human gut microbiomes. The analysis also predicted other potential virulence factors such as haemolysin. The present study provides new insights into understanding of the virulence mechanisms of L. garvieae in fish. PMID:21829716
CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction

PubMed Central

Puton, Tomasz; Kozlowski, Lukasz P.; Rother, Kristian M.; Bujnicki, Janusz M.

2013-01-01

We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks. PMID:23435231
Inferring Short-Range Linkage Information from Sequencing Chromatograms

PubMed Central

Beggel, Bastian; Neumann-Fraune, Maria; Kaiser, Rolf; Verheyen, Jens; Lengauer, Thomas

2013-01-01

Direct Sanger sequencing of viral genome populations yields multiple ambiguous sequence positions. It is not straightforward to derive linkage information from sequencing chromatograms, which in turn hampers the correct interpretation of the sequence data. We present a method for determining the variants existing in a viral quasispecies in the case of two nearby ambiguous sequence positions by exploiting the effect of sequence context-dependent incorporation of dideoxynucleotides. The computational model was trained on data from sequencing chromatograms of clonal variants and was evaluated on two test sets of in vitro mixtures. The approach achieved high accuracies in identifying the mixture components of 97.4% on a test set in which the positions to be analyzed are only one base apart from each other, and of 84.5% on a test set in which the ambiguous positions are separated by three bases. In silico experiments suggest two major limitations of our approach in terms of accuracy. First, due to a basic limitation of Sanger sequencing, it is not possible to reliably detect minor variants with a relative frequency of no more than 10%. Second, the model cannot distinguish between mixtures of two or four clonal variants, if one of two sets of linear constraints is fulfilled. Furthermore, the approach requires repetitive sequencing of all variants that might be present in the mixture to be analyzed. Nevertheless, the effectiveness of our method on the two in vitro test sets shows that short-range linkage information of two ambiguous sequence positions can be inferred from Sanger sequencing chromatograms without any further assumptions on the mixture composition. Additionally, our model provides new insights into the established and widely used Sanger sequencing technology. The source code of our method is made available at http://bioinf.mpi-inf.mpg.de/publications/beggel/linkageinformation.zip. PMID:24376502
Genetic Diversity of Crimean Congo Hemorrhagic Fever Virus Strains from Iran

PubMed Central

Chinikar, Sadegh; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Nowotny, Norbert; Fooks, Anthony R.; Shah-Hosseini, Nariman

2016-01-01

Background: Crimean Congo hemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family and Nairovirus genus. It has a negative-sense, single stranded RNA genome approximately 19.2 kb, containing the Small, Medium, and Large segments. CCHFVs are relatively divergent in their genome sequence and grouped in seven distinct clades based on S-segment sequence analysis and six clades based on M-segment sequences. Our aim was to obtain new insights into the molecular epidemiology of CCHFV in Iran. Methods: We analyzed partial and complete nucleotide sequences of the S and M segments derived from 50 Iranian patients. The extracted RNA was amplified using one-step RT-PCR and then sequenced. The sequences were analyzed using Mega5 software. Results: Phylogenetic analysis of partial S segment sequences demonstrated that clade IV-(Asia 1), clade IV-(Asia 2) and clade V-(Europe) accounted for 80 %, 4 % and 14 % of the circulating genomic variants of CCHFV in Iran respectively. However, one of the Iranian strains (Iran-Kerman/22) was associated with none of other sequences and formed a new clade (VII). The phylogenetic analysis of complete S-segment nucleotide sequences from selected Iranian CCHFV strains complemented with representative strains from GenBank revealed similar topology as partial sequences with eight major clusters. A partial M segment phylogeny positioned the Iranian strains in either association with clade III (Asia-Africa) or clade V (Europe). Conclusion: The phylogenetic analysis revealed subtle links between distant geographic locations, which we propose might originate either from international livestock trade or from long-distance carriage of CCHFV by infected ticks via bird migration. PMID:27308271
Genome sequence of an industrial microorganism Streptomyces avermitilis: deducing the ability of producing secondary metabolites.

PubMed

Omura, S; Ikeda, H; Ishikawa, J; Hanamoto, A; Takahashi, C; Shinose, M; Takahashi, Y; Horikawa, H; Nakazawa, H; Osonoe, T; Kikuchi, H; Shiba, T; Sakaki, Y; Hattori, M

2001-10-09

Streptomyces avermitilis is a soil bacterium that carries out not only a complex morphological differentiation but also the production of secondary metabolites, one of which, avermectin, is commercially important in human and veterinary medicine. The major interest in this genus Streptomyces is the diversity of its production of secondary metabolites as an industrial microorganism. A major factor in its prominence as a producer of the variety of secondary metabolites is its possession of several metabolic pathways for biosynthesis. Here we report sequence analysis of S. avermitilis, covering 99% of its genome. At least 8.7 million base pairs exist in the linear chromosome; this is the largest bacterial genome sequence, and it provides insights into the intrinsic diversity of the production of the secondary metabolites of Streptomyces. Twenty-five kinds of secondary metabolite gene clusters were found in the genome of S. avermitilis. Four of them are concerned with the biosyntheses of melanin pigments, in which two clusters encode tyrosinase and its cofactor, another two encode an ochronotic pigment derived from homogentiginic acid, and another polyketide-derived melanin. The gene clusters for carotenoid and siderophore biosyntheses are composed of seven and five genes, respectively. There are eight kinds of gene clusters for type-I polyketide compound biosyntheses, and two clusters are involved in the biosyntheses of type-II polyketide-derived compounds. Furthermore, a polyketide synthase that resembles phloroglucinol synthase was detected. Eight clusters are involved in the biosyntheses of peptide compounds that are synthesized by nonribosomal peptide synthetases. These secondary metabolite clusters are widely located in the genome but half of them are near both ends of the genome. The total length of these clusters occupies about 6.4% of the genome.
Theory of Mind in Schizophrenia: Associations With Clinical and Cognitive Insight Controlling for Levels of Psychopathology.

PubMed

Popolo, Raffaele; Dimaggio, Giancarlo; Luther, Lauren; Vinci, Giancarlo; Salvatore, Giampaolo; Lysaker, Paul H

2016-03-01

Poor insight in schizophrenia is a risk factor for both poor outcomes and treatment adherence. Accordingly, interest in identifying causes of poor insight has increased. This study explored whether theory of mind (ToM) impairments are linked to poor clinical and cognitive insight independent of psychopathology. Participants with schizophrenia (n = 37) and control subjects (n = 40) completed assessments of ToM with the Hinting Task and the Brüne Picture Sequencing Task, clinical insight and psychopathology with the Positive and Negative Syndrome Scale, and cognitive insight with the Beck Cognitive Insight Scale. Results indicated that the schizophrenia group had greater impairments in ToM relative to control subjects. In the schizophrenia group, the Hinting Task performance was related to both cognitive and clinical insight, with only the relationship with cognitive insight persisting after controlling for psychopathology. Picture Sequencing Task performance was related to cognitive insight only. Future research directions and clinical implications are discussed.
Potential Links between Hepadnavirus and Bornavirus Sequences in the Host Genome and Cancer.

PubMed

Honda, Tomoyuki

2017-01-01

Various viruses leave their sequences in the host genomes during infection. Such events occur mainly in retrovirus infection but also sometimes in DNA and non-retroviral RNA virus infections. If viral sequences are integrated into the genomes of germ line cells, the sequences can become inherited as endogenous viral elements (EVEs). The integration events of viral sequences may have oncogenic potential. Because proviral integrations of some retroviruses and/or reactivation of endogenous retroviruses are closely linked to cancers, viral insertions related to non-retroviral viruses also possibly contribute to cancer development. This article focuses on genomic viral sequences derived from two non-retroviral viruses, whose endogenization is already reported, and discusses their possible contributions to cancer. Viral insertions of hepatitis B virus play roles in the development of hepatocellular carcinoma. Endogenous bornavirus-like elements, the only non-retroviral RNA virus-related EVEs found in the human genome, may also be involved in cancer formation. In addition, the possible contribution of the interactions between viruses and retrotransposons, which seem to be a major driving force for generating EVEs related to non-retroviral RNA viruses, to cancers will be discussed. Future studies regarding the possible links described here may open a new avenue for the development of novel therapeutics for tumor virus-related cancers and/or provide novel insights into EVE functions.
Recovering complete mitochondrial genome sequences from RNA-Seq: A case study of Polytomella non-photosynthetic green algae.

PubMed

Tian, Yao; Smith, David Roy

2016-05-01

Thousands of mitochondrial genomes have been sequenced, but there are comparatively few available mitochondrial transcriptomes. This might soon be changing. High-throughput RNA sequencing (RNA-Seq) techniques have made it fast and cheap to generate massive amounts of mitochondrial transcriptomic data. Here, we explore the utility of RNA-Seq for assembling mitochondrial genomes and studying their expression patterns. Specifically, we investigate the mitochondrial transcriptomes from Polytomella non-photosynthetic green algae, which have among the smallest, most reduced mitochondrial genomes from the Archaeplastida as well as fragmented rRNA-coding regions, palindromic genes, and linear chromosomes with telomeres. Isolation of whole genomic RNA from the four known Polytomella species followed by Illumina paired-end sequencing generated enough mitochondrial-derived reads to easily recover almost-entire mitochondrial genome sequences. Read-mapping and coverage statistics also gave insights into Polytomella mitochondrial transcriptional architecture, revealing polycistronic transcripts and the expression of telomeres and palindromic genes. Ultimately, RNA-Seq is a promising, cost-effective technique for studying mitochondrial genetics, but it does have drawbacks, which are discussed. One of its greatest potentials, as shown here, is that it can be used to generate near-complete mitochondrial genome sequences, which could be particularly useful in situations where there is a lack of available mtDNA data. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
DEEP MOTIF DASHBOARD: VISUALIZING AND UNDERSTANDING GENOMIC SEQUENCES USING DEEP NEURAL NETWORKS.

PubMed

Lanchantin, Jack; Singh, Ritambhara; Wang, Beilun; Qi, Yanjun

2017-01-01

Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding (TFBS) site classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard) which provides a suite of visualization strategies to extract motifs, or sequence patterns from deep neural network models for TFBS classification. We demonstrate how to visualize and understand three important DNN models: convolutional, recurrent, and convolutional-recurrent networks. Our first visualization method is finding a test sequence's saliency map which uses first-order derivatives to describe the importance of each nucleotide in making the final prediction. Second, considering recurrent models make predictions in a temporal manner (from one end of a TFBS sequence to the other), we introduce temporal output scores, indicating the prediction score of a model over time for a sequential input. Lastly, a class-specific visualization strategy finds the optimal input sequence for a given TFBS positive class via stochastic gradient optimization. Our experimental results indicate that a convolutional-recurrent architecture performs the best among the three architectures. The visualization techniques indicate that CNN-RNN makes predictions by modeling both motifs as well as dependencies among them.
Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences.

PubMed Central

Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M

1994-01-01

Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions were found, with a significantly lower entropy in the brain-than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. Images PMID:7933130
Supramolecular polymerization of a prebiotic nucleoside provides insights into the creation of sequence-controlled polymers.

PubMed

Wang, Jun; Bonnesen, Peter V; Rangel, E; Vallejo, E; Sanchez-Castillo, Ariadna; James Cleaves Ii, H; Baddorf, Arthur P; Sumpter, Bobby G; Pan, Minghu; Maksymovych, Petro; Fuentes-Cabrera, Miguel

2016-01-04

Self-assembly of a nucleoside on Au(111) was studied to ascertain whether polymerization on well-defined substrates constitutes a promising approach for making sequence-controlled polymers. Scanning tunneling microscopy and density functional theory were used to investigate the self-assembly on Au(111) of (RS)-N(9)-(2,3-dihydroxypropyl)adenine (DHPA), a plausibly prebiotic nucleoside analog of adenosine. It is found that DHPA molecules self-assemble into a hydrogen-bonded polymer that grows almost exclusively along the herringbone reconstruction pattern, has a two component sequence that is repeated over hundreds of nanometers, and is erasable with electron-induced excitation. Although the sequence is simple, more complicated ones are envisioned if two or more nucleoside types are combined. Because polymerization occurs on a substrate in a dry environment, the success of each combination can be gauged with high-resolution imaging and accurate modeling techniques. These characteristics make nucleoside self-assembly on a substrate an attractive approach for designing sequence-controlled polymers. Further, by choosing plausibly prebiotic nucleosides, insights may be provided into how nature created the first sequence-controlled polymers capable of storing information. Such insights, in turn, can inspire new ways of synthesizing sequence-controlled polymers.
Computational analysis of protein-protein interfaces involving an alpha helix: insights for terphenyl-like molecules binding.

PubMed

Isvoran, Adriana; Craciun, Dana; Martiny, Virginie; Sperandio, Olivier; Miteva, Maria A

2013-06-14

Protein-Protein Interactions (PPIs) are key for many cellular processes. The characterization of PPI interfaces and the prediction of putative ligand binding sites and hot spot residues are essential to design efficient small-molecule modulators of PPI. Terphenyl and its derivatives are small organic molecules known to mimic one face of protein-binding alpha-helical peptides. In this work we focus on several PPIs mediated by alpha-helical peptides. We performed computational sequence- and structure-based analyses in order to evaluate several key physicochemical and surface properties of proteins known to interact with alpha-helical peptides and/or terphenyl and its derivatives. Sequence-based analysis revealed low sequence identity between some of the analyzed proteins binding alpha-helical peptides. Structure-based analysis was performed to calculate the volume, the fractal dimension roughness and the hydrophobicity of the binding regions. Besides the overall hydrophobic character of the binding pockets, some specificities were detected. We showed that the hydrophobicity is not uniformly distributed in different alpha-helix binding pockets that can help to identify key hydrophobic hot spots. The presence of hydrophobic cavities at the protein surface with a more complex shape than the entire protein surface seems to be an important property related to the ability of proteins to bind alpha-helical peptides and low molecular weight mimetics. Characterization of similarities and specificities of PPI binding sites can be helpful for further development of small molecules targeting alpha-helix binding proteins.
Late-Quaternary biogeographic scenarios for the brown bear ( Ursus arctos), a wild mammal model species

NASA Astrophysics Data System (ADS)

Davison, John; Ho, Simon Y. W.; Bray, Sarah C.; Korsten, Marju; Tammeleht, Egle; Hindrikson, Maris; Østbye, Kjartan; Østbye, Eivind; Lauritzen, Stein-Erik; Austin, Jeremy; Cooper, Alan; Saarma, Urmas

2011-02-01

This review provides an up-to-date synthesis of the matrilineal phylogeography of a uniquely well-studied Holarctic mammal, the brown bear. We extend current knowledge by presenting a DNA sequence derived from one of the earliest known fossils of a polar bear (dated to 115 000 years before present), a species that shares a paraphyletic mitochondrial association with brown bears. A molecular clock analysis of 140 mitochondrial DNA sequences, including our new polar bear sequence, provides novel insights into the times of origin for different brown bear clades. We propose a number of regional biogeographic scenarios based on genetic data, divergence time estimates and paleontological records. The case of the brown bear provides an example for researchers working with less well-studied taxa: it shows clearly that phylogeographic models based on patterns of modern genetic variation alone can be substantially improved by including data on historical patterns of genetic diversity in the form of ancient DNA sequences derived from accurately dated samples and by using an approach to divergence-time estimation that suits the data under analysis. Using such approaches it has been possible to (i) establish that the processes shaping modern genetic diversity in brown bears acted recently, within the last three glacial cycles; (ii) distinguish among hypotheses concerning species' responses to climatic oscillations in accordance with the lack of phylogeographic structure that existed in brown bears prior to the last glacial maximum (LGM); (iii) reassess theories linking monophyletic brown bear populations to particular LGM refuge areas; and (iv) identify vicariance events and track analogous patterns of migration by brown bears out of Eurasia to North America and Japan.
Bioorthogonal Metabolic Labeling of Nascent RNA in Neurons Improves the Sensitivity of Transcriptome-Wide Profiling.

PubMed

Zajaczkowski, Esmi L; Zhao, Qiong-Yi; Zhang, Zong Hong; Li, Xiang; Wei, Wei; Marshall, Paul R; Leighton, Laura J; Nainar, Sarah; Feng, Chao; Spitale, Robert C; Bredy, Timothy W

2018-06-15

Transcriptome-wide expression profiling of neurons has provided important insights into the underlying molecular mechanisms and gene expression patterns that transpire during learning and memory formation. However, there is a paucity of tools for profiling stimulus-induced RNA within specific neuronal cell populations. A bioorthogonal method to chemically label nascent (i.e., newly transcribed) RNA in a cell-type-specific and temporally controlled manner, which is also amenable to bioconjugation via click chemistry, was recently developed and optimized within conventional immortalized cell lines. However, its value within a more fragile and complicated cellular system such as neurons, as well as for transcriptome-wide expression profiling, has yet to be demonstrated. Here, we report the visualization and sequencing of activity-dependent nascent RNA derived from neurons using this labeling method. This work has important implications for improving transcriptome-wide expression profiling and visualization of nascent RNA in neurons, which has the potential to provide valuable insights into the mechanisms underlying neural plasticity, learning, and memory.
Some User's Insights Into ADIFOR 2.0D

NASA Technical Reports Server (NTRS)

Giesy, Daniel P.

2002-01-01

Some insights are given which were gained by one user through experience with the use of the ADIFOR 2.0D software for automatic differentiation of Fortran code. These insights are generally in the area of the user interface with the generated derivative code - particularly the actual form of the interface and the use of derivative objects, including "seed" matrices. Some remarks are given as to how to iterate application of ADIFOR in order to generate second derivative code.
Comparative Sequence Analysis of Multidrug-Resistant IncA/C Plasmids from Salmonella enterica.

PubMed

Hoffmann, Maria; Pettengill, James B; Gonzalez-Escalona, Narjol; Miller, John; Ayers, Sherry L; Zhao, Shaohua; Allard, Marc W; McDermott, Patrick F; Brown, Eric W; Monday, Steven R

2017-01-01

Determinants of multidrug resistance (MDR) are often encoded on mobile elements, such as plasmids, transposons, and integrons, which have the potential to transfer among foodborne pathogens, as well as to other virulent pathogens, increasing the threats these traits pose to human and veterinary health. Our understanding of MDR among Salmonella has been limited by the lack of closed plasmid genomes for comparisons across resistance phenotypes, due to difficulties in effectively separating the DNA of these high-molecular weight, low-copy-number plasmids from chromosomal DNA. To resolve this problem, we demonstrate an efficient protocol for isolating, sequencing and closing IncA/C plasmids from Salmonella sp. using single molecule real-time sequencing on a Pacific Biosciences (Pacbio) RS II Sequencer. We obtained six Salmonella enterica isolates from poultry, representing six different serovars, each exhibiting the MDR-Ampc resistance profile. Salmonella plasmids were obtained using a modified mini preparation and transformed with Escherichia coli DH10Br. A Qiagen Large-Construct kit™ was used to recover highly concentrated and purified plasmid DNA that was sequenced using PacBio technology. These six closed IncA/C plasmids ranged in size from 104 to 191 kb and shared a stable, conserved backbone containing 98 core genes, with only six differences among those core genes. The plasmids encoded a number of antimicrobial resistance genes, including those for quaternary ammonium compounds and mercury. We then compared our six IncA/C plasmid sequences: first with 14 IncA/C plasmids derived from S. enterica available at the National Center for Biotechnology Information (NCBI), and then with an additional 38 IncA/C plasmids derived from different taxa. These comparisons allowed us to build an evolutionary picture of how antimicrobial resistance may be mediated by this common plasmid backbone. Our project provides detailed genetic information about resistance genes in plasmids, advances in plasmid sequencing, and phylogenetic analyses, and important insights about how MDR evolution occurs across diverse serotypes from different animal sources, particularly in agricultural settings where antimicrobial drug use practices vary.
Gene expression profiling of flax (Linum usitatissimum L.) under edaphic stress.

PubMed

Dmitriev, Alexey A; Kudryavtseva, Anna V; Krasnov, George S; Koroban, Nadezhda V; Speranskaya, Anna S; Krinitsina, Anastasia A; Belenikin, Maxim S; Snezhkina, Anastasiya V; Sadritdinova, Asiya F; Kishlyan, Natalya V; Rozhmina, Tatiana A; Yurkevich, Olga Yu; Muravenko, Olga V; Bolsheva, Nadezhda L; Melnikova, Nataliya V

2016-11-16

Cultivated flax (Linum usitatissimum L.) is widely used for production of textile, food, chemical and pharmaceutical products. However, various stresses decrease flax production. Search for genes, which are involved in stress response, is necessary for breeding of adaptive cultivars. Imbalanced concentration of nutrient elements in soil decrease flax yields and also results in heritable changes in some flax lines. The appearance of Linum Insertion Sequence 1 (LIS-1) is the most studied modification. However, LIS-1 function is still unclear. High-throughput sequencing of transcriptome of flax plants grown under normal (N), phosphate deficient (P), and nutrient excess (NPK) conditions was carried out using Illumina platform. The assembly of transcriptome was performed, and a total of 34924, 33797, and 33698 unique transcripts for N, P, and NPK sequencing libraries were identified, respectively. We have not revealed any LIS-1 derived mRNA in our sequencing data. The analysis of high-throughput sequencing data allowed us to identify genes with potentially differential expression under imbalanced nutrition. For further investigation with qPCR, 15 genes were chosen and their expression levels were evaluated in the extended sampling of 31 flax plants. Significant expression alterations were revealed for genes encoding WRKY and JAZ protein families under P and NPK conditions. Moreover, the alterations of WRKY family genes differed depending on LIS-1 presence in flax plant genome. Besides, we revealed slight and LIS-1 independent mRNA level changes of KRP2 and ING1 genes, which are adjacent to LIS-1, under nutrition stress. Differentially expressed genes were identified in flax plants, which were grown under phosphate deficiency and excess nutrition, on the basis of high-throughput sequencing and qPCR data. We showed that WRKY and JAS gene families participate in flax response to imbalanced nutrient content in soil. Besides, we have not identified any mRNA, which could be derived from LIS-1, in our transcriptome sequencing data. Expression of LIS-1 flanking genes, ING1 and KRP2, was suggested not to be nutrient stress-induced. Obtained results provide new insights into edaphic stress response in flax and the role of LIS-1 in these process.
Analysis of the Aedes albopictus C6/36 genome provides insight into cell line utility for viral propagation.

PubMed

Miller, Jason R; Koren, Sergey; Dilley, Kari A; Puri, Vinita; Brown, David M; Harkins, Derek M; Thibaud-Nissen, Françoise; Rosen, Benjamin; Chen, Xiao-Guang; Tu, Zhijian; Sharakhov, Igor V; Sharakhova, Maria V; Sebra, Robert; Stockwell, Timothy B; Bergman, Nicholas H; Sutton, Granger G; Phillippy, Adam M; Piermarini, Peter M; Shabman, Reed S

2018-03-01

The 50-year-old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome. The C6/36 genome assembly has the largest contig N50 (3.3 Mbp) of any mosquito assembly, presents the sequences of both haplotypes for most of the diploid genome, reveals independent null mutations in both alleles of the Dicer locus, and indicates a male-specific genome. Gene annotation was computed with publicly available mosquito transcript sequences. Gene expression data from cell line RNA sequence identified enrichment of growth-related pathways and conspicuous deficiency in aquaporins and inward rectifier K+ channels. As a test of utility, RNA sequence data from Zika-infected cells were mapped to the C6/36 genome and transcriptome assemblies. Host subtraction reduced the data set by 89%, enabling faster characterization of nonhost reads. The C6/36 genome sequence and annotation should enable additional uses of the cell line to study arbovirus vector interactions and interventions aimed at restricting the spread of human disease.

Deep Motif Dashboard: Visualizing and Understanding Genomic Sequences Using Deep Neural Networks

PubMed Central

Lanchantin, Jack; Singh, Ritambhara; Wang, Beilun; Qi, Yanjun

2018-01-01

Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding (TFBS) site classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard) which provides a suite of visualization strategies to extract motifs, or sequence patterns from deep neural network models for TFBS classification. We demonstrate how to visualize and understand three important DNN models: convolutional, recurrent, and convolutional-recurrent networks. Our first visualization method is finding a test sequence’s saliency map which uses first-order derivatives to describe the importance of each nucleotide in making the final prediction. Second, considering recurrent models make predictions in a temporal manner (from one end of a TFBS sequence to the other), we introduce temporal output scores, indicating the prediction score of a model over time for a sequential input. Lastly, a class-specific visualization strategy finds the optimal input sequence for a given TFBS positive class via stochastic gradient optimization. Our experimental results indicate that a convolutional-recurrent architecture performs the best among the three architectures. The visualization techniques indicate that CNN-RNN makes predictions by modeling both motifs as well as dependencies among them. PMID:27896980
Germline viral "fossils" guide in silico reconstruction of a mid-Cenozoic era marsupial adeno-associated virus.

PubMed

Smith, Richard H; Hallwirth, Claus V; Westerman, Michael; Hetherington, Nicola A; Tseng, Yu-Shan; Cecchini, Sylvain; Virag, Tamas; Ziegler, Mona-Larissa; Rogozin, Igor B; Koonin, Eugene V; Agbandje-McKenna, Mavis; Kotin, Robert M; Alexander, Ian E

2016-07-05

Germline endogenous viral elements (EVEs) genetically preserve viral nucleotide sequences useful to the study of viral evolution, gene mutation, and the phylogenetic relationships among host organisms. Here, we describe a lineage-specific, adeno-associated virus (AAV)-derived endogenous viral element (mAAV-EVE1) found within the germline of numerous closely related marsupial species. Molecular screening of a marsupial DNA panel indicated that mAAV-EVE1 occurs specifically within the marsupial suborder Macropodiformes (present-day kangaroos, wallabies, and related macropodoids), to the exclusion of other Diprotodontian lineages. Orthologous mAAV-EVE1 locus sequences from sixteen macropodoid species, representing a speciation history spanning an estimated 30 million years, facilitated compilation of an inferred ancestral sequence that recapitulates the genome of an ancient marsupial AAV that circulated among Australian metatherian fauna sometime during the late Eocene to early Oligocene. In silico gene reconstruction and molecular modelling indicate remarkable conservation of viral structure over a geologic timescale. Characterisation of AAV-EVE loci among disparate species affords insight into AAV evolution and, in the case of macropodoid species, may offer an additional genetic basis for assignment of phylogenetic relationships among the Macropodoidea. From an applied perspective, the identified AAV "fossils" provide novel capsid sequences for use in translational research and clinical applications.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.

PubMed

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites

PubMed Central

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
Insights into the Evolution of Mitochondrial Genome Size from Complete Sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae)

PubMed Central

Alverson, Andrew J.; Wei, XiaoXin; Rice, Danny W.; Stern, David B.; Barry, Kerrie; Palmer, Jeffrey D.

2010-01-01

The mitochondrial genomes of seed plants are unusually large and vary in size by at least an order of magnitude. Much of this variation occurs within a single family, the Cucurbitaceae, whose genomes range from an estimated 390 to 2,900 kb in size. We sequenced the mitochondrial genomes of Citrullus lanatus (watermelon: 379,236 nt) and Cucurbita pepo (zucchini: 982,833 nt)—the two smallest characterized cucurbit mitochondrial genomes—and determined their RNA editing content. The relatively compact Citrullus mitochondrial genome actually contains more and longer genes and introns, longer segmental duplications, and more discernibly nuclear-derived DNA. The large size of the Cucurbita mitochondrial genome reflects the accumulation of unprecedented amounts of both chloroplast sequences (>113 kb) and short repeated sequences (>370 kb). A low mutation rate has been hypothesized to underlie increases in both genome size and RNA editing frequency in plant mitochondria. However, despite its much larger genome, Cucurbita has a significantly higher synonymous substitution rate (and presumably mutation rate) than Citrullus but comparable levels of RNA editing. The evolution of mutation rate, genome size, and RNA editing are apparently decoupled in Cucurbitaceae, reflecting either simple stochastic variation or governance by different factors. PMID:20118192
Draft genome sequence of an inbred line of Chenopodium quinoa, an allotetraploid crop with great environmental adaptability and outstanding nutritional properties

PubMed Central

Yasui, Yasuo; Hirakawa, Hideki; Oikawa, Tetsuo; Toyoshima, Masami; Matsuzaki, Chiaki; Ueno, Mariko; Mizuno, Nobuyuki; Nagatoshi, Yukari; Imamura, Tomohiro; Miyago, Manami; Tanaka, Kojiro; Mise, Kazuyuki; Tanaka, Tsutomu; Mizukoshi, Hiroharu; Mori, Masashi; Fujita, Yasunari

2016-01-01

Chenopodium quinoa Willd. (quinoa) originated from the Andean region of South America, and is a pseudocereal crop of the Amaranthaceae family. Quinoa is emerging as an important crop with the potential to contribute to food security worldwide and is considered to be an optimal food source for astronauts, due to its outstanding nutritional profile and ability to tolerate stressful environments. Furthermore, plant pathologists use quinoa as a representative diagnostic host to identify virus species. However, molecular analysis of quinoa is limited by its genetic heterogeneity due to outcrossing and its genome complexity derived from allotetraploidy. To overcome these obstacles, we established the inbred and standard quinoa accession Kd that enables rigorous molecular analysis, and presented the draft genome sequence of Kd, using an optimized combination of high-throughput next generation sequencing on the Illumina Hiseq 2500 and PacBio RS II sequencers. The de novo genome assembly contained 25 k scaffolds consisting of 1 Gbp with N50 length of 86 kbp. Based on these data, we constructed the free-access Quinoa Genome DataBase (QGDB). Thus, these findings provide insights into the mechanisms underlying agronomically important traits of quinoa and the effect of allotetraploidy on genome evolution. PMID:27458999
EffectorP: predicting fungal effector proteins from secretomes using machine learning.

PubMed

Sperschneider, Jana; Gardiner, Donald M; Dodds, Peter N; Tini, Francesco; Covarelli, Lorenzo; Singh, Karam B; Manners, John M; Taylor, Jennifer M

2016-04-01

Eukaryotic filamentous plant pathogens secrete effector proteins that modulate the host cell to facilitate infection. Computational effector candidate identification and subsequent functional characterization delivers valuable insights into plant-pathogen interactions. However, effector prediction in fungi has been challenging due to a lack of unifying sequence features such as conserved N-terminal sequence motifs. Fungal effectors are commonly predicted from secretomes based on criteria such as small size and cysteine-rich, which suffers from poor accuracy. We present EffectorP which pioneers the application of machine learning to fungal effector prediction. EffectorP improves fungal effector prediction from secretomes based on a robust signal of sequence-derived properties, achieving sensitivity and specificity of over 80%. Features that discriminate fungal effectors from secreted noneffectors are predominantly sequence length, molecular weight and protein net charge, as well as cysteine, serine and tryptophan content. We demonstrate that EffectorP is powerful when combined with in planta expression data for predicting high-priority effector candidates. EffectorP is the first prediction program for fungal effectors based on machine learning. Our findings will facilitate functional fungal effector studies and improve our understanding of effectors in plant-pathogen interactions. EffectorP is available at http://effectorp.csiro.au. © 2015 CSIRO New Phytologist © 2015 New Phytologist Trust.
Supramolecular polymerization of a prebiotic nucleoside provides insights into the creation of sequence-controlled polymers

DOE PAGES

Wang, Jun; Bonnesen, Peter V; Rangel, E.; ...

2016-01-04

The self-assembly of a nucleoside on Au(111) was studied to ascertain whether polymerization on well-defined substrates constitutes a promising approach for making sequence-controlled polymers. Scanning tunneling microscopy and density functional theory were used to investigate the self-assembly on Au(111) of (RS)-N9-(2,3-dihydroxypropyl)adenine (DHPA), a plausibly prebiotic nucleoside analog of adenosine. It is found that DHPA molecules self-assemble into a hydrogen-bonded polymer that grows almost exclusively along the herringbone reconstruction pattern, has a two component sequence that is repeated over hundreds of nanometers, and is erasable with electron-induced excitation. Although the sequence is simple, more complicated ones are envisioned if two ormore » more nucleoside types are combined. Because polymerization occurs on a substrate in a dry environment, the success of each combination can be gauged with high-resolution imaging and accurate modeling techniques. The resulting characteristics make nucleoside self-assembly on a substrate an attractive approach for designing sequence-controlled polymers. Moreover, by choosing plausibly prebiotic nucleosides, insights may be provided into how nature created the first sequence-controlled polymers capable of storing information. Such insights, in turn, can inspire new ways of synthesizing sequence-controlled polymers.« less
Supramolecular polymerization of a prebiotic nucleoside provides insights into the creation of sequence-controlled polymers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Jun; Bonnesen, Peter V; Rangel, E.

The self-assembly of a nucleoside on Au(111) was studied to ascertain whether polymerization on well-defined substrates constitutes a promising approach for making sequence-controlled polymers. Scanning tunneling microscopy and density functional theory were used to investigate the self-assembly on Au(111) of (RS)-N9-(2,3-dihydroxypropyl)adenine (DHPA), a plausibly prebiotic nucleoside analog of adenosine. It is found that DHPA molecules self-assemble into a hydrogen-bonded polymer that grows almost exclusively along the herringbone reconstruction pattern, has a two component sequence that is repeated over hundreds of nanometers, and is erasable with electron-induced excitation. Although the sequence is simple, more complicated ones are envisioned if two ormore » more nucleoside types are combined. Because polymerization occurs on a substrate in a dry environment, the success of each combination can be gauged with high-resolution imaging and accurate modeling techniques. The resulting characteristics make nucleoside self-assembly on a substrate an attractive approach for designing sequence-controlled polymers. Moreover, by choosing plausibly prebiotic nucleosides, insights may be provided into how nature created the first sequence-controlled polymers capable of storing information. Such insights, in turn, can inspire new ways of synthesizing sequence-controlled polymers.« less
Estimation of a Killer Whale (Orcinus orca) Population’s Diet Using Sequencing Analysis of DNA from Feces

PubMed Central

Ford, Michael J.; Hempelmann, Jennifer; Hanson, M. Bradley; Ayres, Katherine L.; Baird, Robin W.; Emmons, Candice K.; Lundin, Jessica I.; Schorr, Gregory S.; Wasser, Samuel K.; Park, Linda K.

2016-01-01

Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population’s summer diet. PMID:26735849
Estimation of a Killer Whale (Orcinus orca) Population's Diet Using Sequencing Analysis of DNA from Feces.

PubMed

Ford, Michael J; Hempelmann, Jennifer; Hanson, M Bradley; Ayres, Katherine L; Baird, Robin W; Emmons, Candice K; Lundin, Jessica I; Schorr, Gregory S; Wasser, Samuel K; Park, Linda K

2016-01-01

Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population's summer diet.
Exploring Insight: Focus on Shifts of Attention

ERIC Educational Resources Information Center

Palatnik, Alik; Koichu, Boris

2015-01-01

The paper presents and analyses a sequence of events that preceded an insight solution to a challenging problem in the context of numerical sequences. A threeweek long solution process by a pair of ninth-grade students is analysed by means of the theory of shifts of attention. The goal for this article is to reveal the potential of this theory…
Probing the evolution, ecology and physiology of marine protists using transcriptomics.

PubMed

Caron, David A; Alexander, Harriet; Allen, Andrew E; Archibald, John M; Armbrust, E Virginia; Bachy, Charles; Bell, Callum J; Bharti, Arvind; Dyhrman, Sonya T; Guida, Stephanie M; Heidelberg, Karla B; Kaye, Jonathan Z; Metzner, Julia; Smith, Sarah R; Worden, Alexandra Z

2017-01-01

Protists, which are single-celled eukaryotes, critically influence the ecology and chemistry of marine ecosystems, but genome-based studies of these organisms have lagged behind those of other microorganisms. However, recent transcriptomic studies of cultured species, complemented by meta-omics analyses of natural communities, have increased the amount of genetic information available for poorly represented branches on the tree of eukaryotic life. This information is providing insights into the adaptations and interactions between protists and other microorganisms and macroorganisms, but many of the genes sequenced show no similarity to sequences currently available in public databases. A better understanding of these newly discovered genes will lead to a deeper appreciation of the functional diversity and metabolic processes in the ocean. In this Review, we summarize recent developments in our understanding of the ecology, physiology and evolution of protists, derived from transcriptomic studies of cultured strains and natural communities, and discuss how these novel large-scale genetic datasets will be used in the future.
The complete chloroplast DNA sequence of the green alga Nephroselmis olivacea: Insights into the architecture of ancestral chloroplast genomes

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

1999-01-01

Green plants seem to form two sister lineages: Chlorophyta, comprising the green algal classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae, and Chlorophyceae, and Streptophyta, comprising the Charophyceae and land plants. We have determined the complete chloroplast DNA (cpDNA) sequence (200,799 bp) of Nephroselmis olivacea, a member of the class (Prasinophyceae) thought to include descendants of the earliest-diverging green algae. The 127 genes identified in this genome represent the largest gene repertoire among the green algal and land plant cpDNAs completely sequenced to date. Of the Nephroselmis genes, 2 (ycf81 and ftsI, a gene involved in peptidoglycan synthesis) have not been identified in any previously investigated cpDNA; 5 genes [ftsW, rnE, ycf62, rnpB, and trnS(cga)] have been found only in cpDNAs of nongreen algae; and 10 others (ndh genes) have been described only in land plant cpDNAs. Nephroselmis and land plant cpDNAs share the same quadripartite structure—which is characterized by the presence of a large rRNA-encoding inverted repeat and two unequal single-copy regions—and very similar sets of genes in corresponding genomic regions. Given that our phylogenetic analyses place Nephroselmis within the Chlorophyta, these structural characteristics were most likely present in the cpDNA of the common ancestor of chlorophytes and streptophytes. Comparative analyses of chloroplast genomes indicate that the typical quadripartite architecture and gene-partitioning pattern of land plant cpDNAs are ancient features that may have been derived from the genome of the cyanobacterial progenitor of chloroplasts. Our phylogenetic data also offer insight into the chlorophyte ancestor of euglenophyte chloroplasts. PMID:10468594
Historical perspectives on tumor necrosis factor and its superfamily: 25 years later, a golden journey

PubMed Central

Gupta, Subash C.; Kim, Ji Hye

2012-01-01

Although activity that induced tumor regression was observed and termed tumor necrosis factor (TNF) as early as the 1960s, the true identity of TNF was not clear until 1984, when Aggarwal and coworkers reported, for the first time, the isolation of 2 cytotoxic factors: one, derived from macrophages (molecular mass 17 kDa), was named TNF, and the second, derived from lymphocytes (20 kDa), was named lymphotoxin. Because the 2 cytotoxic factors exhibited 50% amino acid sequence homology and bound to the same receptor, they came to be called TNF-α and TNF-β. Identification of the protein sequences led to cloning of their cDNA. Based on sequence homology to TNF-α, now a total of 19 members of the TNF superfamily have been identified, along with 29 interacting receptors, and several molecules that interact with the cytoplasmic domain of these receptors. The roles of the TNF superfamily in inflammation, apoptosis, proliferation, invasion, angiogenesis, metastasis, and morphogenesis have been documented. Their roles in immunologic, cardiovascular, neurologic, pulmonary, and metabolic diseases are becoming apparent. TNF superfamily members are active targets for drug development, as indicated by the recent approval and expanding market of TNF blockers used to treat rheumatoid arthritis, psoriasis, Crohns disease, and osteoporosis, with a total market of more than US $20 billion. As we learn more about this family, more therapeutics will probably emerge. In this review, we summarize the initial discovery of TNF-α, and the insights gained regarding the roles of this molecule and its related family members in normal physiology and disease. PMID:22053109
Lunar impact basins: Stratigraphy, sequence and ages from superposed impact crater populations measured from Lunar Orbiter Laser Altimeter (LOLA) data

NASA Astrophysics Data System (ADS)

Fassett, C. I.; Head, J. W.; Kadish, S. J.; Mazarico, E.; Neumann, G. A.; Smith, D. E.; Zuber, M. T.

2012-02-01

Impact basin formation is a fundamental process in the evolution of the Moon and records the history of impactors in the early solar system. In order to assess the stratigraphy, sequence, and ages of impact basins and the impactor population as a function of time, we have used topography from the Lunar Orbiter Laser Altimeter (LOLA) on the Lunar Reconnaissance Orbiter (LRO) to measure the superposed impact crater size-frequency distributions for 30 lunar basins (D ≥ 300 km). These data generally support the widely used Wilhelms sequence of lunar basins, although we find significantly higher densities of superposed craters on many lunar basins than derived by Wilhelms (50% higher densities). Our data also provide new insight into the timing of the transition between distinct crater populations characteristic of ancient and young lunar terrains. The transition from a lunar impact flux dominated by Population 1 to Population 2 occurred before the mid-Nectarian. This is before the end of the period of rapid cratering, and potentially before the end of the hypothesized Late Heavy Bombardment. LOLA-derived crater densities also suggest that many Pre-Nectarian basins, such as South Pole-Aitken, have been cratered to saturation equilibrium. Finally, both crater counts and stratigraphic observations based on LOLA data are applicable to specific basin stratigraphic problems of interest; for example, using these data, we suggest that Serenitatis is older than Nectaris, and Humboldtianum is younger than Crisium. Sample return missions to specific basins can anchor these measurements to a Pre-Imbrian absolute chronology.
Biased selection of propagation-related TUPs from phage display peptide libraries.

PubMed

Zade, Hesam Motaleb; Keshavarz, Reihaneh; Shekarabi, Hosna Sadat Zahed; Bakhshinejad, Babak

2017-08-01

Phage display is rapidly advancing as a screening strategy in drug discovery and drug delivery. Phage-encoded combinatorial peptide libraries can be screened through the affinity selection procedure of biopanning to find pharmaceutically relevant cell-specific ligands. However, the unwanted enrichment of target-unrelated peptides (TUPs) with no true affinity for the target presents an important barrier to the successful screening of phage display libraries. Propagation-related TUPs (Pr-TUPs) are an emerging but less-studied category of phage display-derived false-positive hits that are displayed on the surface of clones with faster propagation rates. Despite long regarded as an unbiased selection system, accumulating evidence suggests that biopanning may create biological bias toward selection of phage clones with certain displayed peptides. This bias can be dependent on or independent of the displayed sequence and may act as a major driving force for the isolation of fast-growing clones. Sequence-dependent bias is reflected by censorship or over-representation of some amino acids in the displayed peptide and sequence-independent bias is derived from either point mutations or rare recombination events occurring in the phage genome. It is of utmost interest to clean biopanning data by identifying and removing Pr-TUPs. Experimental and bioinformatic approaches can be exploited for Pr-TUP discovery. With no doubt, obtaining deeper insight into how Pr-TUPs emerge during biopanning and how they could be detected provides a basis for using cell-targeting peptides isolated from phage display screening in the development of disease-specific diagnostic and therapeutic platforms.
The complex evolutionary dynamics of ancient and recent polyploidy in Leucaena (Leguminosae; Mimosoideae).

PubMed

Govindarajulu, Rajanikanth; Hughes, Colin E; Alexander, Patrick J; Bailey, C Donovan

2011-12-01

The evolutionary history of Leucaena has been impacted by polyploidy, hybridization, and divergent allopatric species diversification, suggesting that this is an ideal group to investigate the evolutionary tempo of polyploidy and the complexities of reticulation and divergence in plant diversification. Parsimony- and ML-based phylogenetic approaches were applied to 105 accessions sequenced for six sequence characterized amplified region-based nuclear encoded loci, nrDNA ITS, and four cpDNA regions. Hypotheses for the origin of tetraploid species were inferred using results derived from a novel species tree and established gene tree methods and from data on genome sizes and geographic distributions. The combination of comprehensively sampled multilocus DNA sequence data sets and a novel methodology provide strong resolution and support for the origins of all five tetraploid species. A minimum of four allopolyploidization events are required to explain the origins of these species. The origin(s) of one tetraploid pair (L. involucrata/L. pallida) can be equally explained by two unique allopolyploidizations or a single event followed by divergent speciation. Alongside other recent findings, a comprehensive picture of the complex evolutionary dynamics of polyploidy in Leucaena is emerging that includes paleotetraploidization, diploidization of the last common ancestor to Leucaena, allopatric divergence among diploids, and recent allopolyploid origins for tetraploid species likely associated with human translocation of seed. These results provide insights into the role of divergence and reticulation in a well-characterized angiosperm lineage and into traits of diploid parents and derived tetraploids (particularly self-compatibility and year-round flowering) favoring the formation and establishment of novel tetraploids combinations.
Lunar Impact Basins: Stratigraphy, Sequence and Ages from Superposed Impact Crater Populations Measured from Lunar Orbiter Laser Altimeter (LOLA) Data

NASA Technical Reports Server (NTRS)

Fassett, C. I.; Head, J. W.; Kadish, S. J.; Mazarico, E.; Neumann, G. A.; Smith, D. E.; Zuber, M. T.

2012-01-01

Impact basin formation is a fundamental process in the evolution of the Moon and records the history of impactors in the early solar system. In order to assess the stratigraphy, sequence, and ages of impact basins and the impactor population as a function of time, we have used topography from the Lunar Orbiter Laser Altimeter (LOLA) on the Lunar Reconnaissance Orbiter (LRO) to measure the superposed impact crater size-frequency distributions for 30 lunar basins (D = 300 km). These data generally support the widely used Wilhelms sequence of lunar basins, although we find significantly higher densities of superposed craters on many lunar basins than derived by Wilhelms (50% higher densities). Our data also provide new insight into the timing of the transition between distinct crater populations characteristic of ancient and young lunar terrains. The transition from a lunar impact flux dominated by Population 1 to Population 2 occurred before the mid-Nectarian. This is before the end of the period of rapid cratering, and potentially before the end of the hypothesized Late Heavy Bombardment. LOLA-derived crater densities also suggest that many Pre-Nectarian basins, such as South Pole-Aitken, have been cratered to saturation equilibrium. Finally, both crater counts and stratigraphic observations based on LOLA data are applicable to specific basin stratigraphic problems of interest; for example, using these data, we suggest that Serenitatis is older than Nectaris, and Humboldtianum is younger than Crisium. Sample return missions to specific basins can anchor these measurements to a Pre-Imbrian absolute chronology.
Sequence stratigraphy of the Raha Formation, Bakr Oil Field, Gulf of Suez, Egypt: Insights from electrical well log and palynological data

NASA Astrophysics Data System (ADS)

Mansour, Ahmed; Mohamed, Omar; Tahoun, Sameh S.; Elewa, Ashraf M. T.

2018-03-01

The current paper provides a high resolution sequence stratigraphic study of the Raha Formation from the productive Bakr Oil Field, central Gulf of Suez, Egypt. Sixty cutting rock samples spanning the Cenomanian from three wells (Bakr-114, B-115 and B-109) in the Bakr Basin, were palynologically investigated. The documented palynomorphs assemblage of either terrestrially-derived sporomorphs or marine inhabited dinocysts, allowed two palynological zones as well as their encompassing depositional palaeoenvironment to be recognized. These zones are Afropollis jardinus-Crybelosporites pannuceus Assemblage Zone (early-middle Cenomanian) and Classopollis brasiliensis-Tricolpites sagax Assemblage Zone (late Cenomanian). Detailed analysis of the particulate organic matter compositions suggested that the depositional palaeoenvironment of the Raha Formation was fluctuating between supratidal and distal-inner neritic conditions, due to successive oscillations of the Neo-Tethyan Ocean during the Cenomanian. The pronounced peaks of particulate organic matter versus gamma ray are markedly used in delineating the depositional sequences of the Raha Formation and their bounding surfaces. The Raha Formation probably corresponds to a second-order depositional sequence, which can be further subdivided into eight third-order depositional sequences, of which six are complete and two are incomplete ones. These depositional sequences are significantly synchronized based on a simple 2-D correlation model between the three wells. According to the hierarchical duration system, the Cenomanian herein was approximately attributed to 6 Myr, each of which has lower order depositional sequences that took approximately 0.9 Myr. Based on the sequence stratigraphic approach together with palynofacies analysis and gamma ray data, a condensed section was defined in the B-115.

The American cranberry: first insights into the whole genome of a species adapted to bog habitat.

PubMed

Polashock, James; Zelzion, Ehud; Fajardo, Diego; Zalapa, Juan; Georgi, Laura; Bhattacharya, Debashish; Vorsa, Nicholi

2014-06-13

The American cranberry (Vaccinium macrocarpon Ait.) is one of only three widely-cultivated fruit crops native to North America- the other two are blueberry (Vaccinium spp.) and native grape (Vitis spp.). In terms of taxonomy, cranberries are in the core Ericales, an order for which genome sequence data are currently lacking. In addition, cranberries produce a host of important polyphenolic secondary compounds, some of which are beneficial to human health. Whereas next-generation sequencing technology is allowing the advancement of whole-genome sequencing, one major obstacle to the successful assembly from short-read sequence data of complex diploid (and higher ploidy) organisms is heterozygosity. Cranberry has the advantage of being diploid (2n = 2x = 24) and self-fertile. To minimize the issue of heterozygosity, we sequenced the genome of a fifth-generation inbred genotype (F ≥ 0.97) derived from five generations of selfing originating from the cultivar Ben Lear. The genome size of V. macrocarpon has been estimated to be about 470 Mb. Genomic sequences were assembled into 229,745 scaffolds representing 420 Mbp (N50 = 4,237 bp) with 20X average coverage. The number of predicted genes was 36,364 and represents 17.7% of the assembled genome. Of the predicted genes, 30,090 were assigned to candidate genes based on homology. Genes supported by transcriptome data totaled 13,170 (36%). Shotgun sequencing of the cranberry genome, with an average sequencing coverage of 20X, allowed efficient assembly and gene calling. The candidate genes identified represent a useful collection to further study important biochemical pathways and cellular processes and to use for marker development for breeding and the study of horticultural characteristics, such as disease resistance.
The American cranberry: first insights into the whole genome of a species adapted to bog habitat

PubMed Central

2014-01-01

Background The American cranberry (Vaccinium macrocarpon Ait.) is one of only three widely-cultivated fruit crops native to North America- the other two are blueberry (Vaccinium spp.) and native grape (Vitis spp.). In terms of taxonomy, cranberries are in the core Ericales, an order for which genome sequence data are currently lacking. In addition, cranberries produce a host of important polyphenolic secondary compounds, some of which are beneficial to human health. Whereas next-generation sequencing technology is allowing the advancement of whole-genome sequencing, one major obstacle to the successful assembly from short-read sequence data of complex diploid (and higher ploidy) organisms is heterozygosity. Cranberry has the advantage of being diploid (2n = 2x = 24) and self-fertile. To minimize the issue of heterozygosity, we sequenced the genome of a fifth-generation inbred genotype (F ≥ 0.97) derived from five generations of selfing originating from the cultivar Ben Lear. Results The genome size of V. macrocarpon has been estimated to be about 470 Mb. Genomic sequences were assembled into 229,745 scaffolds representing 420 Mbp (N50 = 4,237 bp) with 20X average coverage. The number of predicted genes was 36,364 and represents 17.7% of the assembled genome. Of the predicted genes, 30,090 were assigned to candidate genes based on homology. Genes supported by transcriptome data totaled 13,170 (36%). Conclusions Shotgun sequencing of the cranberry genome, with an average sequencing coverage of 20X, allowed efficient assembly and gene calling. The candidate genes identified represent a useful collection to further study important biochemical pathways and cellular processes and to use for marker development for breeding and the study of horticultural characteristics, such as disease resistance. PMID:24927653
Chloroplast DNA sequence of the green alga Oedogonium cardiacum (Chlorophyceae): Unique genome architecture, derived characters shared with the Chaetophorales and novel genes acquired through horizontal transfer

PubMed Central

Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique

2008-01-01

Background To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012
Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences.

PubMed

Klopfenstein, Ned B; Stewart, Jane E; Ota, Yuko; Hanna, John W; Richardson, Bryce A; Ross-Davis, Amy L; Elías-Román, Rubén D; Korhonen, Kari; Keča, Nenad; Iturritxa, Eugenia; Alvarado-Rosales, Dionicio; Solheim, Halvor; Brazee, Nicholas J; Łakomy, Piotr; Cleary, Michelle R; Hasegawa, Eri; Kikuchi, Taisei; Garza-Ocañas, Fortunato; Tsopelas, Panaghiotis; Rigling, Daniel; Prospero, Simone; Tsykun, Tetyana; Bérubé, Jean A; Stefani, Franck O P; Jafarpour, Saeideh; Antonín, Vladimír; Tomšovský, Michal; McDonald, Geral I; Woodward, Stephen; Kim, Mee-Sook

2017-01-01

Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence-based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation elongation factor 1-α (tef1) sequences are highly informative for phylogenetic analysis of Armillaria species within diverse global regions. This study used Neighbor-net and coalescence-based Bayesian analyses to examine phylogenetic relationships of newly determined and existing tef1 sequences derived from diverse Armillaria species from across the Northern Hemisphere, with Southern Hemisphere Armillaria species included for reference. Based on the Bayesian analysis of tef1 sequences, Armillaria species from the Northern Hemisphere are generally contained within the following four superclades, which are named according to the specific epithet of the most frequently cited species within the superclade: (i) Socialis/Tabescens (exannulate) superclade including Eurasian A. ectypa, North American A. socialis (A. tabescens), and Eurasian A. socialis (A. tabescens) clades; (ii) Mellea superclade including undescribed annulate North American Armillaria sp. (Mexico) and four separate clades of A. mellea (Europe and Iran, eastern Asia, and two groups from North America); (iii) Gallica superclade including Armillaria Nag E (Japan), multiple clades of A. gallica (Asia and Europe), A. calvescens (eastern North America), A. cepistipes (North America), A. altimontana (western USA), A. nabsnona (North America and Japan), and at least two A. gallica clades (North America); and (iv) Solidipes/Ostoyae superclade including two A. solidipes/ostoyae clades (North America), A. gemina (eastern USA), A. solidipes/ostoyae (Eurasia), A. cepistipes (Europe and Japan), A. sinapina (North America and Japan), and A. borealis (Eurasia) clade 2. Of note is that A. borealis (Eurasia) clade 1 appears basal to the Solidipes/Ostoyae and Gallica superclades. The Neighbor-net analysis showed similar phylogenetic relationships. This study further demonstrates the utility of tef1 for global phylogenetic studies of Armillaria species and provides critical insights into multiple taxonomic issues that warrant further study.
Selected Insights from Application of Whole Genome Sequencing for Outbreak Investigations

PubMed Central

Le, Vien Thi Minh; Diep, Binh An

2014-01-01

Purpose of review The advent of high-throughput whole genome sequencing has the potential to revolutionize the conduct of outbreak investigation. Because of its ultimate pathogen strain resolution, whole genome sequencing could augment traditional epidemiologic investigations of infectious disease outbreaks. Recent findings The combination of whole genome sequencing and intensive epidemiologic analysis provided new insights on the sources and transmission dynamics of large-scale epidemics caused by Escherichia coli and Vibrio cholerae, nosocomial outbreaks caused by methicillin-resistant Staphylococcus aureus, Klebsiella pneumonia, and Mycobacterium abscessus, community-centered outbreaks caused by Mycobacterium tuberculosis, and natural disaster-associated outbreak caused by environmentally acquired molds. Summary When combined with traditional epidemiologic investigation, whole genome sequencing has proven useful for elucidating sources and transmission dynamics of disease outbreaks. Development of a fully automated bioinformatics pipeline for analysis of whole genome sequence data is much needed to make this powerful tool more widely accessible. PMID:23856896
Conformational diversity in contryphans from Conus venom: cis-trans isomerisation and aromatic/proline interactions in the 23-membered ring of a 7-residue peptide disulfide loop.

PubMed

Sonti, Rajesh; Gowd, Konkallu Hanumae; Rao, K N Shashanka; Ragothama, Srinivasarao; Rodriguez, Alex; Perez, Juan Jesus; Balaram, Padmanabhan

2013-11-04

Conformational diversity or "shapeshifting" in cyclic peptide natural products can, in principle, confer a single molecular entity with the property of binding to multiple receptors. Conformational equilibria have been probed in the contryphans, which are peptides derived from Conus venom possessing a 23-membered cyclic disulfide moiety. The natural sequences derived from Conus inscriptus, GCV(D)LYPWC* (In936) and Conus loroisii, GCP(D)WDPWC* (Lo959) differ in the number of proline residues within the macrocyclic ring. Structural characterisation of distinct conformational states arising from cis-trans equilibria about Xxx-Pro bonds is reported. Isomerisation about the C2-P3 bond is observed in the case of Lo959 and about the Y5-P6 bond in In936. Evidence is presented for as many as four distinct species in the case of the synthetic analogue V3P In936. The Tyr-Pro-Trp segment in In936 is characterised by distinct sidechain orientations as a consequence of aromatic/proline interactions as evidenced by specific sidechain-sidechain nuclear Overhauser effects and ring current shifted proton chemical shifts. Molecular dynamics simulations suggest that Tyr5 and Trp7 sidechain conformations are correlated and depend on the geometry of the Xxx-Pro bond. Thermodynamic parameters are derived for the cis↔trans equilibrium for In936. Studies on synthetic analogues provide insights into the role of sequence effects in modulating isomerisation about Xxx-Pro bonds. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Advances in thermographic signal reconstruction

NASA Astrophysics Data System (ADS)

Shepard, Steven M.; Frendberg Beemer, Maria

2015-05-01

Since its introduction in 2001, the Thermographic Signal Reconstruction (TSR) method has emerged as one of the most widely used methods for enhancement and analysis of thermographic sequences, with applications extending beyond industrial NDT into biomedical research, art restoration and botany. The basic TSR process, in which a noise reduced replica of each pixel time history is created, yields improvement over unprocessed image data that is sufficient for many applications. However, examination of the resulting logarithmic time derivatives of each TSR pixel replica provides significant insight into the physical mechanisms underlying the active thermography process. The deterministic and invariant properties of the derivatives have enabled the successful implementation of automated defect recognition and measurement systems. Unlike most approaches to analysis of thermography data, TSR does not depend on flawbackground contrast, so that it can also be applied to characterization and measurement of thermal properties of flaw-free samples. We present a summary of recent advances in TSR, a review of the underlying theory and examples of its implementation.
Hepatitis B virus pathogenesis: Fresh insights into hepatitis B virus RNA.

PubMed

Sekiba, Kazuma; Otsuka, Motoyuki; Ohno, Motoko; Yamagami, Mari; Kishikawa, Takahiro; Suzuki, Tatsunori; Ishibashi, Rei; Seimiya, Takahiro; Tanaka, Eri; Koike, Kazuhiko

2018-06-07

Hepatitis B virus (HBV) is still a worldwide health concern. While divergent factors are involved in its pathogenesis, it is now clear that HBV RNAs, principally templates for viral proteins and viral DNAs, have diverse biological functions involved in HBV pathogenesis. These functions include viral replication, hepatic fibrosis and hepatocarcinogenesis. Depending on the sequence similarities, HBV RNAs may act as sponges for host miRNAs and may deregulate miRNA functions, possibly leading to pathological consequences. Some parts of the HBV RNA molecule may function as viral-derived miRNA, which regulates viral replication. HBV DNA can integrate into the host genomic DNA and produce novel viral-host fusion RNA, which may have pathological functions. To date, elimination of HBV-derived covalently closed circular DNA has not been achieved. However, RNA transcription silencing may be an alternative practical approach to treat HBV-induced pathogenesis. A full understanding of HBV RNA transcription and the biological functions of HBV RNA may open a new avenue for the development of novel HBV therapeutics.
Population genetic implications from sequence variation in four Y chromosome genes.

PubMed

Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

2000-06-20

Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.
Use of a Drosophila Genome-Wide Conserved Sequence Database to Identify Functionally Related cis-Regulatory Enhancers

PubMed Central

Brody, Thomas; Yavatkar, Amarendra S; Kuzin, Alexander; Kundu, Mukta; Tyson, Leonard J; Ross, Jermaine; Lin, Tzu-Yang; Lee, Chi-Hon; Awasaki, Takeshi; Lee, Tzumin; Odenwald, Ward F

2012-01-01

Background: Phylogenetic footprinting has revealed that cis-regulatory enhancers consist of conserved DNA sequence clusters (CSCs). Currently, there is no systematic approach for enhancer discovery and analysis that takes full-advantage of the sequence information within enhancer CSCs. Results: We have generated a Drosophila genome-wide database of conserved DNA consisting of >100,000 CSCs derived from EvoPrints spanning over 90% of the genome. cis-Decoder database search and alignment algorithms enable the discovery of functionally related enhancers. The program first identifies conserved repeat elements within an input enhancer and then searches the database for CSCs that score highly against the input CSC. Scoring is based on shared repeats as well as uniquely shared matches, and includes measures of the balance of shared elements, a diagnostic that has proven to be useful in predicting cis-regulatory function. To demonstrate the utility of these tools, a temporally-restricted CNS neuroblast enhancer was used to identify other functionally related enhancers and analyze their structural organization. Conclusions: cis-Decoder reveals that co-regulating enhancers consist of combinations of overlapping shared sequence elements, providing insights into the mode of integration of multiple regulating transcription factors. The database and accompanying algorithms should prove useful in the discovery and analysis of enhancers involved in any developmental process. Developmental Dynamics 241:169–189, 2012. © 2011 Wiley Periodicals, Inc. Key findings A genome-wide catalog of Drosophila conserved DNA sequence clusters. cis-Decoder discovers functionally related enhancers. Functionally related enhancers share balanced sequence element copy numbers. Many enhancers function during multiple phases of development. PMID:22174086
Single-molecule, full-length transcript sequencing provides insight into the extreme metabolism of the ruby-throated hummingbird Archilochus colubris.

PubMed

Workman, Rachael E; Myrka, Alexander M; Wong, G William; Tseng, Elizabeth; Welch, Kenneth C; Timp, Winston

2018-03-01

Hummingbirds oxidize ingested nectar sugars directly to fuel foraging but cannot sustain this fuel use during fasting periods, such as during the night or during long-distance migratory flights. Instead, fasting hummingbirds switch to oxidizing stored lipids that are derived from ingested sugars. The hummingbird liver plays a key role in moderating energy homeostasis and this remarkable capacity for fuel switching. Additionally, liver is the principle location of de novo lipogenesis, which can occur at exceptionally high rates, such as during premigratory fattening. Yet understanding how this tissue and whole organism moderates energy turnover is hampered by a lack of information regarding how relevant enzymes differ in sequence, expression, and regulation. We generated a de novo transcriptome of the hummingbird liver using PacBio full-length cDNA sequencing (Iso-Seq), yielding 8.6Gb of sequencing data, or 2.6M reads from 4 different size fractions. We analyzed data using the SMRTAnalysis v3.1 Iso-Seq pipeline, then clustered isoforms into gene families to generate de novo gene contigs using Cogent. We performed orthology analysis to identify closely related sequences between our transcriptome and other avian and human gene sets. Finally, we closely examined homology of critical lipid metabolism genes between our transcriptome data and avian and human genomes. We confirmed high levels of sequence divergence within hummingbird lipogenic enzymes, suggesting a high probability of adaptive divergent function in the hepatic lipogenic pathways. Our results leverage cutting-edge technology and a novel bioinformatics pipeline to provide a first direct look at the transcriptome of this incredible organism.
Verifying Digital Components of Physical Systems: Experimental Evaluation of Test Quality

NASA Astrophysics Data System (ADS)

Laputenko, A. V.; López, J. E.; Yevtushenko, N. V.

2018-03-01

This paper continues the study of high quality test derivation for verifying digital components which are used in various physical systems; those are sensors, data transfer components, etc. We have used logic circuits b01-b010 of the package of ITC'99 benchmarks (Second Release) for experimental evaluation which as stated before, describe digital components of physical systems designed for various applications. Test sequences are derived for detecting the most known faults of the reference logic circuit using three different approaches to test derivation. Three widely used fault types such as stuck-at-faults, bridges, and faults which slightly modify the behavior of one gate are considered as possible faults of the reference behavior. The most interesting test sequences are short test sequences that can provide appropriate guarantees after testing, and thus, we experimentally study various approaches to the derivation of the so-called complete test suites which detect all fault types. In the first series of experiments, we compare two approaches for deriving complete test suites. In the first approach, a shortest test sequence is derived for testing each fault. In the second approach, a test sequence is pseudo-randomly generated by the use of an appropriate software for logic synthesis and verification (ABC system in our study) and thus, can be longer. However, after deleting sequences detecting the same set of faults, a test suite returned by the second approach is shorter. The latter underlines the fact that in many cases it is useless to spend `time and efforts' for deriving a shortest distinguishing sequence; it is better to use the test minimization afterwards. The performed experiments also show that the use of only randomly generated test sequences is not very efficient since such sequences do not detect all the faults of any type. After reaching the fault coverage around 70%, saturation is observed, and the fault coverage cannot be increased anymore. For deriving high quality short test suites, the approach that is the combination of randomly generated sequences together with sequences which are aimed to detect faults not detected by random tests, allows to reach the good fault coverage using shortest test sequences.
A-WINGS: an integrated genome database for Pleurocybella porrigens (Angel's wing oyster mushroom, Sugihiratake).

PubMed

Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro

2014-12-03

The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.
Draft genome sequence of an inbred line of Chenopodium quinoa, an allotetraploid crop with great environmental adaptability and outstanding nutritional properties.

PubMed

Yasui, Yasuo; Hirakawa, Hideki; Oikawa, Tetsuo; Toyoshima, Masami; Matsuzaki, Chiaki; Ueno, Mariko; Mizuno, Nobuyuki; Nagatoshi, Yukari; Imamura, Tomohiro; Miyago, Manami; Tanaka, Kojiro; Mise, Kazuyuki; Tanaka, Tsutomu; Mizukoshi, Hiroharu; Mori, Masashi; Fujita, Yasunari

2016-12-01

Chenopodium quinoa Willd. (quinoa) originated from the Andean region of South America, and is a pseudocereal crop of the Amaranthaceae family. Quinoa is emerging as an important crop with the potential to contribute to food security worldwide and is considered to be an optimal food source for astronauts, due to its outstanding nutritional profile and ability to tolerate stressful environments. Furthermore, plant pathologists use quinoa as a representative diagnostic host to identify virus species. However, molecular analysis of quinoa is limited by its genetic heterogeneity due to outcrossing and its genome complexity derived from allotetraploidy. To overcome these obstacles, we established the inbred and standard quinoa accession Kd that enables rigorous molecular analysis, and presented the draft genome sequence of Kd, using an optimized combination of high-throughput next generation sequencing on the Illumina Hiseq 2500 and PacBio RS II sequencers. The de novo genome assembly contained 25 k scaffolds consisting of 1 Gbp with N50 length of 86 kbp. Based on these data, we constructed the free-access Quinoa Genome DataBase (QGDB). Thus, these findings provide insights into the mechanisms underlying agronomically important traits of quinoa and the effect of allotetraploidy on genome evolution. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Land use type significantly affects microbial gene transcription in soil.

PubMed

Nacke, Heiko; Fischer, Christiane; Thürmer, Andrea; Meinicke, Peter; Daniel, Rolf

2014-05-01

Soil microorganisms play an essential role in sustaining biogeochemical processes and cycling of nutrients across different land use types. To gain insights into microbial gene transcription in forest and grassland soil, we isolated mRNA from 32 sampling sites. After sequencing of generated complementary DNA (cDNA), a total of 5,824,229 sequences could be further analyzed. We were able to assign nonribosomal cDNA sequences to all three domains of life. A dominance of bacterial sequences, which were affiliated to 25 different phyla, was found. Bacterial groups capable of aromatic compound degradation such as Phenylobacterium and Burkholderia were detected in significantly higher relative abundance in forest soil than in grassland soil. Accordingly, KEGG pathway categories related to degradation of aromatic ring-containing molecules (e.g., benzoate degradation) were identified in high abundance within forest soil-derived metatranscriptomic datasets. The impact of land use type forest on community composition and activity is evidently to a high degree caused by the presence of wood breakdown products. Correspondingly, bacterial groups known to be involved in lignin degradation and containing ligninolytic genes such as Burkholderia, Bradyrhizobium, and Azospirillum exhibited increased transcriptional activity in forest soil. Higher solar radiation in grassland presumably induced increased transcription of photosynthesis-related genes within this land use type. This is in accordance with high abundance of photosynthetic organisms and plant-infecting viruses in grassland.
Bacterial taxa–area and distance–decay relationships in marine environments

PubMed Central

Zinger, L; Boetius, A; Ramette, A

2014-01-01

The taxa–area relationship (TAR) and the distance–decay relationship (DDR) both describe spatial turnover of taxa and are central patterns of biodiversity. Here, we compared TAR and DDR of bacterial communities across different marine realms and ecosystems at the global scale. To obtain reliable global estimates for both relationships, we quantified the poorly assessed effects of sequencing depth, rare taxa removal and number of sampling sites. Slope coefficients of bacterial TARs were within the range of those of plants and animals, whereas slope coefficients of bacterial DDR were much lower. Slope coefficients were mostly affected by removing rare taxa and by the number of sampling sites considered in the calculations. TAR and DDR slope coefficients were overestimated at sequencing depth <4000 sequences per sample. Noticeably, bacterial TAR and DDR patterns did not correlate with each other both within and across ecosystem types, suggesting that (i) TAR cannot be directly derived from DDR and (ii) TAR and DDR may be influenced by different ecological factors. Nevertheless, we found marine bacterial TAR and DDR to be steeper in ecosystems associated with high environmental heterogeneity or spatial isolation, namely marine sediments and coastal environments compared with pelagic ecosystems. Hence, our study provides information on macroecological patterns of marine bacteria, as well as methodological and conceptual insights, at a time when biodiversity surveys increasingly make use of high-throughput sequencing technologies. PMID:24460915
The Medicago Genome Provides Insight into the Evolution of Rhizobial Symbioses

PubMed Central

Young, Nevin D.; Debellé, Frédéric; Oldroyd, Giles E. D.; Geurts, Rene; Cannon, Steven B.; Udvardi, Michael K.; Benedito, Vagner A.; Mayer, Klaus F. X.; Gouzy, Jérôme; Schoof, Heiko; Van de Peer, Yves; Proost, Sebastian; Cook, Douglas R.; Meyers, Blake C.; Spannagl, Manuel; Cheung, Foo; De Mita, Stéphane; Krishnakumar, Vivek; Gundlach, Heidrun; Zhou, Shiguo; Mudge, Joann; Bharti, Arvind K.; Murray, Jeremy D.; Naoumkina, Marina A.; Rosen, Benjamin; Silverstein, Kevin A. T.; Tang, Haibao; Rombauts, Stephane; Zhao, Patrick X.; Zhou, Peng; Barbe, Valérie; Bardou, Philippe; Bechner, Michael; Bellec, Arnaud; Berger, Anne; Bergès, Hélène; Bidwell, Shelby; Bisseling, Ton; Choisne, Nathalie; Couloux, Arnaud; Denny, Roxanne; Deshpande, Shweta; Dai, Xinbin; Doyle, Jeff; Dudez, Anne-Marie; Farmer, Andrew D.; Fouteau, Stéphanie; Franken, Carolien; Gibelin, Chrystel; Gish, John; Goldstein, Steven; González, Alvaro J.; Green, Pamela J.; Hallab, Asis; Hartog, Marijke; Hua, Axin; Humphray, Sean; Jeong, Dong-Hoon; Jing, Yi; Jöcker, Anika; Kenton, Steve M.; Kim, Dong-Jin; Klee, Kathrin; Lai, Hongshing; Lang, Chunting; Lin, Shaoping; Macmil, Simone L; Magdelenat, Ghislaine; Matthews, Lucy; McCorrison, Jamison; Monaghan, Erin L.; Mun, Jeong-Hwan; Najar, Fares Z.; Nicholson, Christine; Noirot, Céline; O’Bleness, Majesta; Paule, Charles R.; Poulain, Julie; Prion, Florent; Qin, Baifang; Qu, Chunmei; Retzel, Ernest F.; Riddle, Claire; Sallet, Erika; Samain, Sylvie; Samson, Nicolas; Sanders, Iryna; Saurat, Olivier; Scarpelli, Claude; Schiex, Thomas; Segurens, Béatrice; Severin, Andrew J.; Sherrier, D. Janine; Shi, Ruihua; Sims, Sarah; Singer, Susan R.; Sinharoy, Senjuti; Sterck, Lieven; Viollet, Agnès; Wang, Bing-Bing; Wang, Keqin; Wang, Mingyi; Wang, Xiaohong; Warfsmann, Jens; Weissenbach, Jean; White, Doug D.; White, Jim D.; Wiley, Graham B.; Wincker, Patrick; Xing, Yanbo; Yang, Limei; Yao, Ziyun; Ying, Fu; Zhai, Jixian; Zhou, Liping; Zuber, Antoine; Dénarié, Jean; Dixon, Richard A.; May, Gregory D.; Schwartz, David C.; Rogers, Jane; Quétier, Francis; Town, Christopher D.; Roe, Bruce A.

2011-01-01

Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation 1. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Mya). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species 2. Medicago truncatula (Mt) is a long-established model for the study of legume biology. Here we describe the draft sequence of the Mt euchromatin based on a recently completed BAC-assembly supplemented with Illumina-shotgun sequence, together capturing ~94% of all Mt genes. A whole-genome duplication (WGD) approximately 58 Mya played a major role in shaping the Mt genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the Mt genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max (Gm) and Lotus japonicus (Lj). Mt is a close relative of alfalfa (M. sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the Mt genome sequence provides significant opportunities to expand alfalfa’s genomic toolbox. PMID:22089132
Expanding the cerebrospinal fluid endopeptidome.

PubMed

Hansson, Karl T; Skillbäck, Tobias; Pernevik, Elin; Kern, Silke; Portelius, Erik; Höglund, Kina; Brinkmalm, Gunnar; Holmén-Larsson, Jessica; Blennow, Kaj; Zetterberg, Henrik; Gobom, Johan

2017-03-01

Biomarkers of neurodegenerative disorders are needed to assist in diagnosis, to monitor disease progression and therapeutic interventions, and to provide insight into disease mechanisms. One route to identify such biomarkers is by proteomic and peptidomic analysis of cerebrospinal fluid (CSF). In the current study, we performed an in-depth analysis of the human CSF endopeptidome to establish an inventory that may serve as a basis for future targeted biomarker studies. High-pH RP HPLC was employed for off-line sample prefractionation followed by low-pH nano-LC-MS analysis. Different software programs and scoring algorithms for peptide identification were employed and compared. A total of 18 031 endogenous peptides were identified at a FDR of 1%, increasing the number of known endogenous CSF peptides 10-fold compared to previous studies. The peptides were derived from 2 053 proteins of which more than 60 have been linked to neurodegeneration. Notably, among the findings were six peptides derived from microtubule-associated protein tau, three of which span the diagnostically interesting threonine-181 (Tau-F isoform). Also, 213 peptides from amyloid precursor protein were identified, 58 of which were partially or completely within the sequence of amyloid β 1-40/42, as well as 109 peptides from apolipoprotein E, spanning sequences that discriminate between the E2/E3/E4 isoforms of the protein. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hierarchical compression of Caenorhabditis elegans locomotion reveals phenotypic differences in the organization of behaviour.

PubMed

Gomez-Marin, Alex; Stephens, Greg J; Brown, André E X

2016-08-01

Regularities in animal behaviour offer insights into the underlying organizational and functional principles of nervous systems and automated tracking provides the opportunity to extract features of behaviour directly from large-scale video data. Yet how to effectively analyse such behavioural data remains an open question. Here, we explore whether a minimum description length principle can be exploited to identify meaningful behaviours and phenotypes. We apply a dictionary compression algorithm to behavioural sequences from the nematode worm Caenorhabditis elegans freely crawling on an agar plate both with and without food and during chemotaxis. We find that the motifs identified by the compression algorithm are rare but relevant for comparisons between worms in different environments, suggesting that hierarchical compression can be a useful step in behaviour analysis. We also use compressibility as a new quantitative phenotype and find that the behaviour of wild-isolated strains of C. elegans is more compressible than that of the laboratory strain N2 as well as the majority of mutant strains examined. Importantly, in distinction to more conventional phenotypes such as overall motor activity or aggregation behaviour, the increased compressibility of wild isolates is not explained by the loss of function of the gene npr-1, which suggests that erratic locomotion is a laboratory-derived trait with a novel genetic basis. Because hierarchical compression can be applied to any sequence, we anticipate that compressibility can offer insights into the organization of behaviour in other animals including humans. © 2016 The Authors.
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

PubMed Central

Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

2009-01-01

The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303

Optimizing and benchmarking de novo transcriptome sequencing: from library preparation to assembly evaluation.

PubMed

Hara, Yuichiro; Tatsumi, Kaori; Yoshida, Michio; Kajikawa, Eriko; Kiyonari, Hiroshi; Kuraku, Shigehiro

2015-11-18

RNA-seq enables gene expression profiling in selected spatiotemporal windows and yields massive sequence information with relatively low cost and time investment, even for non-model species. However, there remains a large room for optimizing its workflow, in order to take full advantage of continuously developing sequencing capacity. Transcriptome sequencing for three embryonic stages of Madagascar ground gecko (Paroedura picta) was performed with the Illumina platform. The output reads were assembled de novo for reconstructing transcript sequences. In order to evaluate the completeness of transcriptome assemblies, we prepared a reference gene set consisting of vertebrate one-to-one orthologs. To take advantage of increased read length of >150 nt, we demonstrated shortened RNA fragmentation time, which resulted in a dramatic shift of insert size distribution. To evaluate products of multiple de novo assembly runs incorporating reads with different RNA sources, read lengths, and insert sizes, we introduce a new reference gene set, core vertebrate genes (CVG), consisting of 233 genes that are shared as one-to-one orthologs by all vertebrate genomes examined (29 species)., The completeness assessment performed by the computational pipelines CEGMA and BUSCO referring to CVG, demonstrated higher accuracy and resolution than with the gene set previously established for this purpose. As a result of the assessment with CVG, we have derived the most comprehensive transcript sequence set of the Madagascar ground gecko by means of assembling individual libraries followed by clustering the assembled sequences based on their overall similarities. Our results provide several insights into optimizing de novo RNA-seq workflow, including the coordination between library insert size and read length, which manifested in improved connectivity of assemblies. The approach and assembly assessment with CVG demonstrated here would be applicable to transcriptome analysis of other species as well as whole genome analyses.
Unveiling the Hybrid Genome Structure of Escherichia coli RR1 (HB101 RecA+)

PubMed Central

Jeong, Haeyoung; Sim, Young Mi; Kim, Hyun Ju; Lee, Sang Jun

2017-01-01

There have been extensive genome sequencing studies for Escherichia coli strains, particularly for pathogenic isolates, because fast determination of pathogenic potential and/or drug resistance and their propagation routes is crucial. For laboratory E. coli strains, however, genome sequence information is limited except for several well-known strains. We determined the complete genome sequence of laboratory E. coli strain RR1 (HB101 RecA+), which has long been used as a general cloning host. A hybrid genome sequence of K-12 MG1655 and B BL21(DE3) was constructed based on the initial mapping of Illumina HiSeq reads to each reference, and iterative rounds of read mapping, variant detection, and consensus extraction were carried out. Finally, PCR and Sanger sequencing-based finishing were applied to resolve non-single nucleotide variant regions with aberrant read depths and breakpoints, most of them resulting from prophages and insertion sequence transpositions that are not present in the reference genome sequence. We found that 96.9% of the RR1 genome is derived from K-12, and identified exact crossover junctions between K-12 and B genomic fragments. However, because RR1 has experienced a series of genetic manipulations since branching from the common ancestor, it has a set of mutations different from those found in K-12 MG1655. As well as identifying all known genotypes of RR1 on the basis of genomic context, we found novel mutations. Our results extend current knowledge of the genotype of RR1 and its relatives, and provide insights into the pedigree, genomic background, and physiology of common laboratory strains. PMID:28421066
Genomics and epigenomics of clear cell renal cell carcinoma: recent developments and potential applications.

PubMed

Rydzanicz, Małgorzata; Wrzesiński, Tomasz; Bluyssen, Hans A R; Wesoły, Joanna

2013-12-01

Majority of clear cell renal cell carcinomas (ccRCCs) are diagnosed in the advanced metastatic stage resulting in dramatic decrease of patient survival. Thereby, early detection and monitoring of the disease may improve prognosis and treatment results. Recent technological advances enable the identification of genetic events associated with ccRCC and reveal significant molecular heterogeneity of ccRCC tumors. This review summarizes recent findings in ccRCC genomics and epigenomics derived from chromosomal aberrations, DNA sequencing and methylation, mRNA, miRNA expression profiling experiments. We provide a molecular insight into ccRCC pathology and recapitulate possible clinical applications of genomic alterations as predictive and prognostic biomarkers. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Genomes of the Mouse Collaborative Cross.

PubMed

Srivastava, Anuj; Morgan, Andrew P; Najarian, Maya L; Sarsani, Vishal Kumar; Sigmon, J Sebastian; Shorter, John R; Kashfeen, Anwica; McMullan, Rachel C; Williams, Lucy H; Giusti-Rodríguez, Paola; Ferris, Martin T; Sullivan, Patrick; Hock, Pablo; Miller, Darla R; Bell, Timothy A; McMillan, Leonard; Churchill, Gary A; de Villena, Fernando Pardo-Manuel

2017-06-01

The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions. Characterization of their genomes can be a community effort, reducing the burden on individual users. Here we present the genomes of the CC strains using two complementary approaches as a resource to improve power and interpretation of genetic experiments. Our study also provides a cautionary tale regarding the limitations imposed by such basic biological processes as mutation and selection. A distinct advantage of inbred panels is that genotyping only needs to be performed on the panel, not on each individual mouse. The initial CC genome data were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes, and there was uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains generated by paired-end short reads at 30× coverage of a single male per strain. Sequencing leads to a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples show a significant reduction in the genome-wide haplotype frequencies from two wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by mutation and drift in the CC genomes. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 ± 0.4 per gigabase per generation. The fixation of new mutations by genetic drift has introduced thousands of new variants into the CC strains. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function. Approximately one-third of the CC inbred strains have acquired large deletions (>10 kb) many of which overlap known coding genes and functional elements. The sequence of these mice is a critical resource to CC users, increases threefold the number of mouse inbred strain genomes available publicly, and provides insight into the effect of mutation and drift on common resources. Copyright © 2017 Srivastava et al.
Genomes of the Mouse Collaborative Cross

PubMed Central

Srivastava, Anuj; Morgan, Andrew P.; Najarian, Maya L.; Sarsani, Vishal Kumar; Sigmon, J. Sebastian; Shorter, John R.; Kashfeen, Anwica; McMullan, Rachel C.; Williams, Lucy H.; Giusti-Rodríguez, Paola; Ferris, Martin T.; Sullivan, Patrick; Hock, Pablo; Miller, Darla R.; Bell, Timothy A.; McMillan, Leonard; Churchill, Gary A.; de Villena, Fernando Pardo-Manuel

2017-01-01

The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions. Characterization of their genomes can be a community effort, reducing the burden on individual users. Here we present the genomes of the CC strains using two complementary approaches as a resource to improve power and interpretation of genetic experiments. Our study also provides a cautionary tale regarding the limitations imposed by such basic biological processes as mutation and selection. A distinct advantage of inbred panels is that genotyping only needs to be performed on the panel, not on each individual mouse. The initial CC genome data were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes, and there was uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains generated by paired-end short reads at 30× coverage of a single male per strain. Sequencing leads to a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples show a significant reduction in the genome-wide haplotype frequencies from two wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by mutation and drift in the CC genomes. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 ± 0.4 per gigabase per generation. The fixation of new mutations by genetic drift has introduced thousands of new variants into the CC strains. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function. Approximately one-third of the CC inbred strains have acquired large deletions (>10 kb) many of which overlap known coding genes and functional elements. The sequence of these mice is a critical resource to CC users, increases threefold the number of mouse inbred strain genomes available publicly, and provides insight into the effect of mutation and drift on common resources. PMID:28592495
Genomics of apicomplexan parasites.

PubMed

Swapna, Lakshmipuram Seshadri; Parkinson, John

2017-06-01

The increasing prevalence of infections involving intracellular apicomplexan parasites such as Plasmodium, Toxoplasma, and Cryptosporidium (the causative agents of malaria, toxoplasmosis, and cryptosporidiosis, respectively) represent a significant global healthcare burden. Despite their significance, few treatments are available; a situation that is likely to deteriorate with the emergence of new resistant strains of parasites. To lay the foundation for programs of drug discovery and vaccine development, genome sequences for many of these organisms have been generated, together with large-scale expression and proteomic datasets. Comparative analyses of these datasets are beginning to identify the molecular innovations supporting both conserved processes mediating fundamental roles in parasite survival and persistence, as well as lineage-specific adaptations associated with divergent life-cycle strategies. The challenge is how best to exploit these data to derive insights into parasite virulence and identify those genes representing the most amenable targets. In this review, we outline genomic datasets currently available for apicomplexans and discuss biological insights that have emerged as a consequence of their analysis. Of particular interest are systems-based resources, focusing on areas of metabolism and host invasion that are opening up opportunities for discovering new therapeutic targets.
Ranking ecological risks of multiple chemical stressors on amphibians.

PubMed

Fedorenkova, Anastasia; Vonk, J Arie; Lenders, H J Rob; Creemers, Raymond C M; Breure, Anton M; Hendriks, A Jan

2012-06-01

Populations of amphibians have been declining worldwide since the late 1960s. Despite global concern, no studies have quantitatively assessed the major causes of this decline. In the present study, species sensitivity distributions (SSDs) were developed to analyze the sensitivity of anurans for ammonium, nitrate, heavy metals (cadmium, copper), pesticides (18 compounds), and acidification (pH) based on laboratory toxicity data. Ecological risk (ER) was calculated as the probability that a measured environmental concentration of a particular stressor in habitats where anurans were observed would exceed the toxic effect concentrations derived from the species sensitivity distributions. The assessment of ER was used to rank the stressors according to their potential risk to anurans based on a case study of Dutch freshwater bodies. The derived ERs revealed that threats to populations of anurans decreased in the sequence of pH, copper, diazinon, ammonium, and endosulfan. Other stressors studied were of minor importance. The method of deriving ER by combining field observation data and laboratory data provides insight into potential threats to species in their habitats and can be used to prioritize stressors, which is necessary to achieve effective management in amphibian conservation. Copyright © 2012 SETAC.
Efficient Access to Imidazo[1,2- a]pyridines/pyrazines/pyrimidines via Catalyst-Free Annulation Reaction under Microwave Irradiation in Green Solvent.

PubMed

Rao, R Nishanth; Mm, Balamurali; Maiti, Barnali; Thakuria, Ranjit; Chanda, Kaushik

2018-03-12

An expeditious catalyst-free heteroannulation reaction for imidazo[1,2- a]pyridines/pyrimidines/pyrazines was developed in green solvent under microwave irradiation. Using H 2 O-IPA as the reaction medium, various substituted 2-aminopyridines/pyrazines/pyrimidines underwent annulation reaction with α-bromoketones under microwave irradiation to provide the corresponding imidazo[1,2- a]pyridines/pyrimidines/pyrazines in excellent yields. The synthetic methodology appears to be very simple and superior to the already reported procedures with the high abundance of commercial reagents and great ability in expanding the molecular diversity. The present synthetic sequence is visualized as an environmentally benign process which allows the introduction of three points of structural diversity to expand chemical space with excellent purity and yields. The anti-inflammatory and antimicrobial activities of the derivatives were evaluated. Screening results uncovered three derivatives with strong inhibition of albumin denaturation and two derivatives were active on Proteus and Klebsiella bacteria. These positive bioassay results implied that the library of potential anti-inflammatory agents could be rapidly prepared in an ecofriendly manner, and provided new insights into drug discovery for medicinal chemists.
Feedback shift register sequences versus uniformly distributed random sequences for correlation chromatography

NASA Technical Reports Server (NTRS)

Kaljurand, M.; Valentin, J. R.; Shao, M.

1996-01-01

Two alternative input sequences are commonly employed in correlation chromatography (CC). They are sequences derived according to the algorithm of the feedback shift register (i.e., pseudo random binary sequences (PRBS)) and sequences derived by using the uniform random binary sequences (URBS). These two sequences are compared. By applying the "cleaning" data processing technique to the correlograms that result from these sequences, we show that when the PRBS is used the S/N of the correlogram is much higher than the one resulting from using URBS.
Activation of Adhesion G Protein-coupled Receptors: AGONIST SPECIFICITY OF STACHEL SEQUENCE-DERIVED PEPTIDES.

PubMed

Demberg, Lilian M; Winkler, Jana; Wilde, Caroline; Simon, Kay-Uwe; Schön, Julia; Rothemund, Sven; Schöneberg, Torsten; Prömel, Simone; Liebscher, Ines

2017-03-17

Members of the adhesion G protein-coupled receptor (aGPCR) family carry an agonistic sequence within their large ectodomains. Peptides derived from this region, called the Stachel sequence, can activate the respective receptor. As the conserved core region of the Stachel sequence is highly similar between aGPCRs, the agonist specificity of Stachel sequence-derived peptides was tested between family members using cell culture-based second messenger assays. Stachel peptides derived from aGPCRs of subfamily VI (GPR110/ADGRF1, GPR116/ADGRF5) and subfamily VIII (GPR64/ADGRG2, GPR126/ADGRG6) are able to activate more than one member of the respective subfamily supporting their evolutionary relationship and defining them as pharmacological receptor subtypes. Extended functional analyses of the Stachel sequences and derived peptides revealed agonist promiscuity, not only within, but also between aGPCR subfamilies. For example, the Stachel -derived peptide of GPR110 (subfamily VI) can activate GPR64 and GPR126 (both subfamily VIII). Our results indicate that key residues in the Stachel sequence are very similar between aGPCRs allowing for agonist promiscuity of several Stachel -derived peptides. Therefore, aGPCRs appear to be pharmacologically more closely related than previously thought. Our findings have direct implications for many aGPCR studies, as potential functional overlap has to be considered for in vitro and in vivo studies. However, it also offers the possibility of a broader use of more potent peptides when the original Stachel sequence is less effective. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Co-Circulation and Evolution of Polioviruses and Species C Enteroviruses in a District of Madagascar

PubMed Central

Rakoto-Andrianarivelo, Mala; Guillot, Sophie; Iber, Jane; Balanant, Jean; Blondel, Bruno; Riquet, Franck; Martin, Javier; Kew, Olen; Randriamanalina, Bakolalao; Razafinimpiasa, Lalatiana; Rousset, Dominique; Delpeyroux, Francis

2007-01-01

Between October 2001 and April 2002, five cases of acute flaccid paralysis (AFP) associated with type 2 vaccine-derived polioviruses (VDPVs) were reported in the southern province of the Republic of Madagascar. To determine viral factors that favor the emergence of these pathogenic VDPVs, we analyzed in detail their genomic and phenotypic characteristics and compared them with co-circulating enteroviruses. These VDPVs appeared to belong to two independent recombinant lineages with sequences from the type 2 strain of the oral poliovaccine (OPV) in the 5′-half of the genome and sequences derived from unidentified species C enteroviruses (HEV-C) in the 3′-half. VDPV strains showed characteristics similar to those of wild neurovirulent viruses including neurovirulence in poliovirus-receptor transgenic mice. We looked for other VDPVs and for circulating enteroviruses in 316 stools collected from healthy children living in the small area where most of the AFP cases occurred. We found vaccine PVs, two VDPVs similar to those found in AFP cases, some echoviruses, and above all, many serotypes of coxsackie A viruses belonging to HEV-C, with substantial genetic diversity. Several coxsackie viruses A17 and A13 carried nucleotide sequences closely related to the 2C and the 3Dpol coding regions of the VDPVs, respectively. There was also evidence of multiple genetic recombination events among the HEV-C resulting in numerous recombinant genotypes. This indicates that co-circulation of HEV-C and OPV strains is associated with evolution by recombination, resulting in unexpectedly extensive viral diversity in small human populations in some tropical regions. This probably contributed to the emergence of recombinant VDPVs. These findings give further insight into viral ecosystems and the evolutionary processes that shape viral biodiversity. PMID:18085822
Determination of the sequences of protein-derived peptides and peptide mixtures by mass spectrometry

PubMed Central

Morris, Howard R.; Williams, Dudley H.; Ambler, Richard P.

1971-01-01

Micro-quantities of protein-derived peptides have been converted into N-acetylated permethyl derivatives, and their sequences determined by low-resolution mass spectrometry without prior knowledge of their amino acid compositions or lengths. A new strategy is suggested for the mass spectrometric sequencing of oligopeptides or proteins, involving gel filtration of protein hydrolysates and subsequent sequence analysis of peptide mixtures. Finally, results are given that demonstrate for the first time the use of mass spectrometry for the analysis of a protein-derived peptide mixture, again without prior knowledge of the protein or components within the mixture. PMID:5158904
Identification of ovule transcripts from the Apospory-Specific Genomic Region (ASGR)-carrier chromosome

PubMed Central

2011-01-01

Background Apomixis, asexual seed production in plants, holds great potential for agriculture as a means to fix hybrid vigor. Apospory is a form of apomixis where the embryo develops from an unreduced egg that is derived from a somatic nucellar cell, the aposporous initial, via mitosis. Understanding the molecular mechanism regulating aposporous initial specification will be a critical step toward elucidation of apomixis and also provide insight into developmental regulation and downstream signaling that results in apomixis. To discover candidate transcripts for regulating aposporous initial specification in P. squamulatum, we compared two transcriptomes derived from microdissected ovules at the stage of aposporous initial formation between the apomictic donor parent, P. squamulatum (accession PS26), and an apomictic derived backcross 8 (BC8) line containing only the Apospory-Specific Genomic Region (ASGR)-carrier chromosome from P. squamulatum. Toward this end, two transcriptomes derived from ovules of an apomictic donor parent and its apomictic backcross derivative at the stage of apospory initiation, were sequenced using 454-FLX technology. Results Using 454-FLX technology, we generated 332,567 reads with an average read length of 147 base pairs (bp) for the PS26 ovule transcriptome library and 363,637 reads with an average read length of 142 bp for the BC8 ovule transcriptome library. A total of 33,977 contigs from the PS26 ovule transcriptome library and 26,576 contigs from the BC8 ovule transcriptome library were assembled using the Multifunctional Inertial Reference Assembly program. Using stringent in silico parameters, 61 transcripts were predicted to map to the ASGR-carrier chromosome, of which 49 transcripts were verified as ASGR-carrier chromosome specific. One of the alien expressed genes could be assigned as tightly linked to the ASGR by screening of apomictic and sexual F1s. Only one transcript, which did not map to the ASGR, showed expression primarily in reproductive tissue. Conclusions Our results suggest that a strategy of comparative sequencing of transcriptomes between donor parent and backcross lines containing an alien chromosome of interest can be an efficient method of identifying transcripts derived from an alien chromosome in a chromosome addition line. PMID:21521529
Evaluating the protein coding potential of exonized transposable element sequences

PubMed Central

Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King

2007-01-01

Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
FARME DB: a functional antibiotic resistance element database

PubMed Central

Wallace, James C.; Port, Jesse A.; Smith, Marissa N.; Faustman, Elaine M.

2017-01-01

Antibiotic resistance (AR) is a major global public health threat but few resources exist that catalog AR genes outside of a clinical context. Current AR sequence databases are assembled almost exclusively from genomic sequences derived from clinical bacterial isolates and thus do not include many microbial sequences derived from environmental samples that confer resistance in functional metagenomic studies. These environmental metagenomic sequences often show little or no similarity to AR sequences from clinical isolates using standard classification criteria. In addition, existing AR databases provide no information about flanking sequences containing regulatory or mobile genetic elements. To help address this issue, we created an annotated database of DNA and protein sequences derived exclusively from environmental metagenomic sequences showing AR in laboratory experiments. Our Functional Antibiotic Resistant Metagenomic Element (FARME) database is a compilation of publically available DNA sequences and predicted protein sequences conferring AR as well as regulatory elements, mobile genetic elements and predicted proteins flanking antibiotic resistant genes. FARME is the first database to focus on functional metagenomic AR gene elements and provides a resource to better understand AR in the 99% of bacteria which cannot be cultured and the relationship between environmental AR sequences and antibiotic resistant genes derived from cultured isolates. Database URL: http://staff.washington.edu/jwallace/farme PMID:28077567
The major histocompatibility complex of tassel-eared squirrels. II. Genetic diversity associated with Abert squirrels.

PubMed

Wettstein, P J; States, J S

1986-01-01

The extent of polymorphism and the rate of divergence of class I and class II sequences mapping to the mammalian major histocompatibility complex (MHC) have been the subject of experimentation and speculation. To provide further insight into the evolution of the MHC we have initiated the analysis of two geographically isolated subspecies of tassel-eared squirrels. In the preceding communication we described the number and polymorphism of TSLA class I and class II sequences in Kaibab squirrels (S. aberti kaibabensis), which live north of the Grand Canyon. In this report we present a parallel analysis of Abert squirrels (S. aberti aberti), which live south of the Grand Canyon in northern Arizona. Genomic DNA from 12 Abert squirrels was digested with restriction enzymes, electrophoresed, blotted, and hybridized with DR alpha, DR beta, DQ alpha, DQ beta, and HLA-B7 probes. The results of these hybridizations were remarkably similar to those obtained in Kaibab squirrels. The majority of class I and class II bands were identical in size and number, suggesting that Abert and Kaibab squirrels have not significantly diverged in the TSLA complex despite their geographical separation. Relative polymorphism of class II sequences was similar to that observed with Kaibab squirrels: beta sequences exhibited higher polymorphism than alpha sequences. As in Kaibab squirrels, a number of alpha and beta sequences were apparently carried on the same fragments. In comparison to class II beta sequences, there was limited polymorphism in class I sequences, although a diverse number of class I genotypes were observed. Attempts to identify segregating TSLA haplotypes were futile in that the only families of sequences with concordant distributions were DQ alpha and DQ beta. These observations and those obtained with Kaibab squirrels suggest that the present-day TSLA haplotypes of both subspecies are derived from a limited number of common, progenitor haplotypes through repeated intra-TSLA recombination.
A simple and novel method for RNA-seq library preparation of single cell cDNA analysis by hyperactive Tn5 transposase.

PubMed

Brouilette, Scott; Kuersten, Scott; Mein, Charles; Bozek, Monika; Terry, Anna; Dias, Kerith-Rae; Bhaw-Rosun, Leena; Shintani, Yasunori; Coppen, Steven; Ikebe, Chiho; Sawhney, Vinit; Campbell, Niall; Kaneko, Masahiro; Tano, Nobuko; Ishida, Hidekazu; Suzuki, Ken; Yashiro, Kenta

2012-10-01

Deep sequencing of single cell-derived cDNAs offers novel insights into oncogenesis and embryogenesis. However, traditional library preparation for RNA-seq analysis requires multiple steps with consequent sample loss and stochastic variation at each step significantly affecting output. Thus, a simpler and better protocol is desirable. The recently developed hyperactive Tn5-mediated library preparation, which brings high quality libraries, is likely one of the solutions. Here, we tested the applicability of hyperactive Tn5-mediated library preparation to deep sequencing of single cell cDNA, optimized the protocol, and compared it with the conventional method based on sonication. This new technique does not require any expensive or special equipment, which secures wider availability. A library was constructed from only 100 ng of cDNA, which enables the saving of precious specimens. Only a few steps of robust enzymatic reaction resulted in saved time, enabling more specimens to be prepared at once, and with a more reproducible size distribution among the different specimens. The obtained RNA-seq results were comparable to the conventional method. Thus, this Tn5-mediated preparation is applicable for anyone who aims to carry out deep sequencing for single cell cDNAs. Copyright © 2012 Wiley Periodicals, Inc.
Centennial-scale records of total organic carbon in sediment cores from the South Yellow Sea, China

NASA Astrophysics Data System (ADS)

Zhu, Qing; Lin, Jia; Hong, Yuehui; Yuan, Lirong; Liu, Jinzhong; Xu, Xiaoming; Wang, Jianghai

2018-01-01

Global carbon cycling is a significant factor that controls climate change. The centennial-scale variations in total organic carbon (TOC) contents and its sources in marginal sea sediments may reflect the influence of human activities on global climate change. In this study, two fine-grained sediment cores from the Yellow Sea Cold Water Mass of the South Yellow Sea were used to systematically determine TOC contents and stable carbon isotope ratios. These results were combined with previous data of black carbon and 210Pb dating from which we reconstructed the centennial-scale initial sequences of TOC, terrigenous TOC (TOCter) and marine autogenous TOC (TOCmar) after selecting suitable models to correct the measured TOC (TOCcor). These sequences showed that the TOCter decreased with time in the both cores while the TOCmar increased, particularly the rapid growth in core H43 since the late 1960s. According to the correlation between the Huanghe (Yellow) River discharge and the TOCcor, TOCter, or TOCmar, we found that the TOCter in the two cores mainly derived from the Huanghe River and was transported by it, and that higher Huanghe River discharge could strengthen the decomposition of TOCmar. The newly obtained initial TOC sequences provide important insights into the interaction between human activities and natural processes.
Detailed investigation of the microbial community in foaming activated sludge reveals novel foam formers

PubMed Central

Guo, Feng; Wang, Zhi-Ping; Yu, Ke; Zhang, T.

2015-01-01

Foaming of activated sludge (AS) causes adverse impacts on wastewater treatment operation and hygiene. In this study, we investigated the microbial communities of foam, foaming AS and non-foaming AS in a sewage treatment plant via deep-sequencing of the taxonomic marker genes 16S rRNA and mycobacterial rpoB and a metagenomic approach. In addition to Actinobacteria, many genera (e.g., Clostridium XI, Arcobacter, Flavobacterium) were more abundant in the foam than in the AS. On the other hand, deep-sequencing of rpoB did not detect any obligate pathogenic mycobacteria in the foam. We found that unknown factors other than the abundance of Gordonia sp. could determine the foaming process, because abundance of the same species was stable before and after a foaming event over six months. More interestingly, although the dominant Gordonia foam former was the closest with G. amarae, it was identified as an undescribed Gordonia species by referring to the 16S rRNA gene, gyrB and, most convincingly, the reconstructed draft genome from metagenomic reads. Our results, based on metagenomics and deep sequencing, reveal that foams are derived from diverse taxa, which expands previous understanding and provides new insight into the underlying complications of the foaming phenomenon in AS. PMID:25560234
Modulation of Immune Signaling and Metabolism Highlights Host and Fungal Transcriptional Responses in Mouse Models of Invasive Pulmonary Aspergillosis.

PubMed

Kale, Shiv D; Ayubi, Tariq; Chung, Dawoon; Tubau-Juni, Nuria; Leber, Andrew; Dang, Ha X; Karyala, Saikumar; Hontecillas, Raquel; Lawrence, Christopher B; Cramer, Robert A; Bassaganya-Riera, Josep

2017-12-06

Incidences of invasive pulmonary aspergillosis, an infection caused predominantly by Aspergillus fumigatus, have increased due to the growing number of immunocompromised individuals. While A. fumigatus is reliant upon deficiencies in the host to facilitate invasive disease, the distinct mechanisms that govern the host-pathogen interaction remain enigmatic, particularly in the context of distinct immune modulating therapies. To gain insights into these mechanisms, RNA-Seq technology was utilized to sequence RNA derived from lungs of 2 clinically relevant, but immunologically distinct murine models of IPA on days 2 and 3 post inoculation when infection is established and active disease present. Our findings identify notable differences in host gene expression between the chemotherapeutic and steroid models at the interface of immunity and metabolism. RT-qPCR verified model specific and nonspecific expression of 23 immune-associated genes. Deep sequencing facilitated identification of highly expressed fungal genes. We utilized sequence similarity and gene expression to categorize the A. fumigatus putative in vivo secretome. RT-qPCR suggests model specific gene expression for nine putative fungal secreted proteins. Our analysis identifies contrasting responses by the host and fungus from day 2 to 3 between the two models. These differences may help tailor the identification, development, and deployment of host- and/or fungal-targeted therapeutics.

Repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome.

PubMed

Onozawa, Masahiro; Zhang, Zhenhua; Kim, Yoo Jung; Goldberg, Liat; Varga, Tamas; Bergsagel, P Leif; Kuehl, W Michael; Aplan, Peter D

2014-05-27

We used the I-SceI endonuclease to produce DNA double-strand breaks (DSBs) and observed that a fraction of these DSBs were repaired by insertion of sequences, which we termed "templated sequence insertions" (TSIs), derived from distant regions of the genome. These TSIs were derived from genic, retrotransposon, or telomere sequences and were not deleted from the donor site in the genome, leading to the hypothesis that they were derived from reverse-transcribed RNA. Cotransfection of RNA and an I-SceI expression vector demonstrated insertion of RNA-derived sequences at the DNA-DSB site, and TSIs were suppressed by reverse-transcriptase inhibitors. Both observations support the hypothesis that TSIs were derived from RNA templates. In addition, similar insertions were detected at sites of DNA DSBs induced by transcription activator-like effector nuclease proteins. Whole-genome sequencing of myeloma cell lines revealed additional TSIs, demonstrating that repair of DNA DSBs via insertion was not restricted to experimentally produced DNA DSBs. Analysis of publicly available databases revealed that many of these TSIs are polymorphic in the human genome. Taken together, these results indicate that insertional events should be considered as alternatives to gross chromosomal rearrangements in the interpretation of whole-genome sequence data and that this mutagenic form of DNA repair may play a role in genetic disease, exon shuffling, and mammalian evolution.
Insight into the evolution of the Solanaceae from the parental genomes of Petunia hybrida.

PubMed

Bombarely, Aureliano; Moser, Michel; Amrad, Avichai; Bao, Manzhu; Bapaume, Laure; Barry, Cornelius S; Bliek, Mattijs; Boersma, Maaike R; Borghi, Lorenzo; Bruggmann, Rémy; Bucher, Marcel; D'Agostino, Nunzio; Davies, Kevin; Druege, Uwe; Dudareva, Natalia; Egea-Cortines, Marcos; Delledonne, Massimo; Fernandez-Pozo, Noe; Franken, Philipp; Grandont, Laurie; Heslop-Harrison, J S; Hintzsche, Jennifer; Johns, Mitrick; Koes, Ronald; Lv, Xiaodan; Lyons, Eric; Malla, Diwa; Martinoia, Enrico; Mattson, Neil S; Morel, Patrice; Mueller, Lukas A; Muhlemann, Joëlle; Nouri, Eva; Passeri, Valentina; Pezzotti, Mario; Qi, Qinzhou; Reinhardt, Didier; Rich, Melanie; Richert-Pöggeler, Katja R; Robbins, Tim P; Schatz, Michael C; Schranz, M Eric; Schuurink, Robert C; Schwarzacher, Trude; Spelt, Kees; Tang, Haibao; Urbanus, Susan L; Vandenbussche, Michiel; Vijverberg, Kitty; Villarino, Gonzalo H; Warner, Ryan M; Weiss, Julia; Yue, Zhen; Zethof, Jan; Quattrocchio, Francesca; Sims, Thomas L; Kuhlemeier, Cris

2016-05-27

Petunia hybrida is a popular bedding plant that has a long history as a genetic model system. We report the whole-genome sequencing and assembly of inbred derivatives of its two wild parents, P. axillaris N and P. inflata S6. The assemblies include 91.3% and 90.2% coverage of their diploid genomes (1.4 Gb; 2n = 14) containing 32,928 and 36,697 protein-coding genes, respectively. The genomes reveal that the Petunia lineage has experienced at least two rounds of hexaploidization: the older gamma event, which is shared with most Eudicots, and a more recent Solanaceae event that is shared with tomato and other solanaceous species. Transcription factors involved in the shift from bee to moth pollination reside in particularly dynamic regions of the genome, which may have been key to the remarkable diversity of floral colour patterns and pollination systems. The high-quality genome sequences will enhance the value of Petunia as a model system for research on unique biological phenomena such as small RNAs, symbiosis, self-incompatibility and circadian rhythms.
Pre-service teachers’ approaches to a historical problem in mechanics

NASA Astrophysics Data System (ADS)

Malgieri, Massimiliano; Onorato, Pasquale; Mascheretti, Paolo; De Ambrosis, Anna

2014-09-01

In this paper we report on an activity sequence with a group of 29 pre-service physics teachers based on the reconstruction and analysis of a thought experiment that was crucial for Huygens’ derivation of the formula for the centre of oscillation of a physical pendulum. The sequence starts with student teachers approaching the historical problem and culminates in a guided inquiry activity in a video-based laboratory (VBL) setting using Tracker software. We collected data before, during and after the experimental activity by means of written questions, oral discussions and final reports. These documents provide insights into students’ initial and evolving conceptions, as well as their attitudes towards the activity. The analysis of data allows us to uncover and focus on relevant difficulties for future teachers in mastering the concepts of centre of mass and conservation of energy. Moreover, we find indications that the VBL environment makes a positive contribution by stimulating and improving students’ modelling abilities. In particular, we find a sharp increase in the percentage of students capable of producing coherent explanations and physical analyses for the Huygens’ pendulum system after the Tracker activity.
lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine.

PubMed

Sun, Lei; Liu, Hui; Zhang, Lin; Meng, Jia

2015-01-01

Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, which is evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study.
Functional Analysis With a Barcoder Yeast Gene Overexpression System

PubMed Central

Douglas, Alison C.; Smith, Andrew M.; Sharifpoor, Sara; Yan, Zhun; Durbic, Tanja; Heisler, Lawrence E.; Lee, Anna Y.; Ryan, Owen; Göttert, Hendrikje; Surendra, Anu; van Dyk, Dewald; Giaever, Guri; Boone, Charles; Nislow, Corey; Andrews, Brenda J.

2012-01-01

Systematic analysis of gene overexpression phenotypes provides an insight into gene function, enzyme targets, and biological pathways. Here, we describe a novel functional genomics platform that enables a highly parallel and systematic assessment of overexpression phenotypes in pooled cultures. First, we constructed a genome-level collection of ~5100 yeast barcoder strains, each of which carries a unique barcode, enabling pooled fitness assays with a barcode microarray or sequencing readout. Second, we constructed a yeast open reading frame (ORF) galactose-induced overexpression array by generating a genome-wide set of yeast transformants, each of which carries an individual plasmid-born and sequence-verified ORF derived from the Saccharomyces cerevisiae full-length EXpression-ready (FLEX) collection. We combined these collections genetically using synthetic genetic array methodology, generating ~5100 strains, each of which is barcoded and overexpresses a specific ORF, a set we termed “barFLEX.” Additional synthetic genetic array allows the barFLEX collection to be moved into different genetic backgrounds. As a proof-of-principle, we describe the properties of the barFLEX overexpression collection and its application in synthetic dosage lethality studies under different environmental conditions. PMID:23050238
From rags to riches: insights from the first genomic sequence of a plant pathogenic bacterium

PubMed Central

Keen, Noel T; Korsi Dumenyo, C; Yang, Ching-Hong; Cooksey, Donald A

2000-01-01

The recently published genomic sequence of Xylella fastidiosa is the first for a free-living plant pathogen and provides clues to mechanisms of pathogenesis and survival in insect vectors. The sequence data should lead to improved control of this pathogen. PMID:11178244
Insights into transcriptomes of Big and Low sagebrush

Treesearch

Mark D. Huynh; Justin T. Page; Bryce A. Richardson; Joshua A. Udall

2015-01-01

We report the sequencing and assembly of three transcriptomes from Big (Artemisia tridentatassp. wyomingensis and A. tridentatassp. tridentata) and Low (A. arbuscula ssp. arbuscula) sagebrush. The sequence reads are available in the Sequence Read Archive of NCBI. We demonstrate the utilities of these transcriptomes for gene discovery and phylogenomic analysis. An...
The Origins of 168, W23, and Other Bacillus subtilis Legacy Strains▿ †

PubMed Central

Zeigler, Daniel R.; Prágai, Zoltán; Rodriguez, Sabrina; Chevreux, Bastien; Muffler, Andrea; Albert, Thomas; Bai, Renyuan; Wyss, Markus; Perkins, John B.

2008-01-01

Bacillus subtilis is both a model organism for basic research and an industrial workhorse, yet there are major gaps in our understanding of the genomic heritage and provenance of many widely used strains. We analyzed 17 legacy strains dating to the early years of B. subtilis genetics. For three—NCIB 3610T, PY79, and SMY—we performed comparative genome sequencing. For the remainder, we used conventional sequencing to sample genomic regions expected to show sequence heterogeneity. Sequence comparisons showed that 168, its siblings (122, 160, and 166), and the type strains NCIB 3610 and ATCC 6051 are highly similar and are likely descendants of the original Marburg strain, although the 168 lineage shows genetic evidence of early domestication. Strains 23, W23, and W23SR are identical in sequence to each other but only 94.6% identical to the Marburg group in the sequenced regions. Strain 23, the probable W23 parent, likely arose from a contaminant in the mutagenesis experiments that produced 168. The remaining strains are all genomic hybrids, showing one or more “W23 islands” in a 168 genomic backbone. Each traces its origin to transformations of 168 derivatives with DNA from 23 or W23. The common prototrophic lab strain PY79 possesses substantial W23 islands at its trp and sac loci, along with large deletions that have reduced its genome 4.3%. SMY, reputed to be the parent of 168, is actually a 168-W23 hybrid that likely shares a recent ancestor with PY79. These data provide greater insight into the genomic history of these B. subtilis legacy strains. PMID:18723616
Single-molecule, full-length transcript sequencing provides insight into the extreme metabolism of the ruby-throated hummingbird Archilochus colubris

PubMed Central

Workman, Rachael E; Myrka, Alexander M; Wong, G William; Tseng, Elizabeth

2018-01-01

Abstract Background Hummingbirds oxidize ingested nectar sugars directly to fuel foraging but cannot sustain this fuel use during fasting periods, such as during the night or during long-distance migratory flights. Instead, fasting hummingbirds switch to oxidizing stored lipids that are derived from ingested sugars. The hummingbird liver plays a key role in moderating energy homeostasis and this remarkable capacity for fuel switching. Additionally, liver is the principle location of de novo lipogenesis, which can occur at exceptionally high rates, such as during premigratory fattening. Yet understanding how this tissue and whole organism moderates energy turnover is hampered by a lack of information regarding how relevant enzymes differ in sequence, expression, and regulation. Findings We generated a de novo transcriptome of the hummingbird liver using PacBio full-length cDNA sequencing (Iso-Seq), yielding 8.6Gb of sequencing data, or 2.6M reads from 4 different size fractions. We analyzed data using the SMRTAnalysis v3.1 Iso-Seq pipeline, then clustered isoforms into gene families to generate de novo gene contigs using Cogent. We performed orthology analysis to identify closely related sequences between our transcriptome and other avian and human gene sets. Finally, we closely examined homology of critical lipid metabolism genes between our transcriptome data and avian and human genomes. Conclusions We confirmed high levels of sequence divergence within hummingbird lipogenic enzymes, suggesting a high probability of adaptive divergent function in the hepatic lipogenic pathways. Our results leverage cutting-edge technology and a novel bioinformatics pipeline to provide a first direct look at the transcriptome of this incredible organism. PMID:29618047
Photosynthesis Is Widely Distributed among Proteobacteria as Demonstrated by the Phylogeny of PufLM Reaction Center Proteins

PubMed Central

Imhoff, Johannes F.; Rahn, Tanja; Künzel, Sven; Neulinger, Sven C.

2018-01-01

Two different photosystems for performing bacteriochlorophyll-mediated photosynthetic energy conversion are employed in different bacterial phyla. Those bacteria employing a photosystem II type of photosynthetic apparatus include the phototrophic purple bacteria (Proteobacteria), Gemmatimonas and Chloroflexus with their photosynthetic relatives. The proteins of the photosynthetic reaction center PufL and PufM are essential components and are common to all bacteria with a type-II photosynthetic apparatus, including the anaerobic as well as the aerobic phototrophic Proteobacteria. Therefore, PufL and PufM proteins and their genes are perfect tools to evaluate the phylogeny of the photosynthetic apparatus and to study the diversity of the bacteria employing this photosystem in nature. Almost complete pufLM gene sequences and the derived protein sequences from 152 type strains and 45 additional strains of phototrophic Proteobacteria employing photosystem II were compared. The results give interesting and comprehensive insights into the phylogeny of the photosynthetic apparatus and clearly define Chromatiales, Rhodobacterales, Sphingomonadales as major groups distinct from other Alphaproteobacteria, from Betaproteobacteria and from Caulobacterales (Brevundimonas subvibrioides). A special relationship exists between the PufLM sequences of those bacteria employing bacteriochlorophyll b instead of bacteriochlorophyll a. A clear phylogenetic association of aerobic phototrophic purple bacteria to anaerobic purple bacteria according to their PufLM sequences is demonstrated indicating multiple evolutionary lines from anaerobic to aerobic phototrophic purple bacteria. The impact of pufLM gene sequences for studies on the environmental diversity of phototrophic bacteria is discussed and the possibility of their identification on the species level in environmental samples is pointed out. PMID:29472894
Biochemical Characterization of a Mycobacteriophage Derived DnaB Ortholog Reveals New Insight into the Evolutionary Origin of DnaB Helicases

PubMed Central

Bhowmik, Priyanka; Das Gupta, Sujoy K.

2015-01-01

The bacterial replicative helicases known as DnaB are considered to be members of the RecA superfamily. All members of this superfamily, including DnaB, have a conserved C- terminal domain, known as the RecA core. We unearthed a series of mycobacteriophage encoded proteins in which the RecA core domain alone was present. These proteins were phylogenetically related to each other and formed a distinct clade within the RecA superfamily. A mycobacteriophage encoded protein, Wildcat Gp80 that roots deep in the DnaB family, was found to possess a core domain having significant sequence homology (Expect value < 10-5) with members of this novel cluster. This indicated that Wildcat Gp80, and by extrapolation, other members of the DnaB helicase family, may have evolved from a single domain RecA core polypeptide belonging to this novel group. Biochemical investigations confirmed that Wildcat Gp80 was a helicase. Surprisingly, our investigations also revealed that a thioredoxin tagged truncated version of the protein in which the N-terminal sequences were removed was fully capable of supporting helicase activity, although its ATP dependence properties were different. DnaB helicase activity is thus, primarily a function of the RecA core although additional N-terminal sequences may be necessary for fine tuning its activity and stability. Based on sequence comparison and biochemical studies we propose that DnaB helicases may have evolved from single domain RecA core proteins having helicase activities of their own, through the incorporation of additional N-terminal sequences. PMID:26237048
Human T-lymphotropic virus type 1 (HTLV-1) genetic typing in Kakeroma Island, an island at the crossroads of the ryukyuans and Wajin in Japan, providing further insights into the origin of the virus in Japan.

PubMed

Eguchi, Katsuyuki; Fujii, Hidefumi; Oshima, Kengo; Otani, Masashi; Matsuo, Toshiaki; Yamamoto, Taro

2009-08-01

Peripheral blood samples were collected from 23 human T-lymphotropic virus type-1 (HTLV-1) carriers residing in Kakeroma Island, Japan (Kagoshima Prefecture, Oshima County, Setouchi Town), one of the most highly endemic areas in Japan. The samples were subjected to amplification by PCR and sequencing of the Long Terminal Repeat in order to reconstruct a phylogenetic tree of HTLV-1 isolates. Restriction Fragment Length Polymorphism (RFLP) analysis of env region was also conducted for subgrouping of HTLV-1. Although one sample could not be amplified by PCR, and three more could not be sequenced due to the existence of conspicuous nonspecific bands or repeated sequences, the phylogenetic analysis revealed that the remaining 19 isolates obtained from Kakeroma Island belonged to either the Transcontinental or the Japanese subgroups of the Cosmopolitan subtype, one of the three major subtypes. The RFLP data corresponded closely with the typing data throughout the sequencing. The proportion of the Transcontinental subgroup among the isolates was 26.3% (5 of 19) by sequence analysis and 27.3% (6 of 22) by RFLP. Unlike in Taiwan, China and Okinawa, the Japanese subgroup was dominant in Kakeroma Island. The analysis would also suggest that the Japanese subgroup seems not to have derived from the Transcontinental subgroup, but rather that the Transcontinental subgroup came to Japan first and was followed later by the Japanese one. 2009 Wiley-Liss, Inc.
Whole transcriptome analysis of the poultry red mite Dermanyssus gallinae (De Geer, 1778).

PubMed

Schicht, Sabine; Qi, Weihong; Poveda, Lucy; Strube, Christina

2014-03-01

SUMMARY Although the poultry red mite Dermanyssus gallinae (De Geer, 1778) is the major parasitic pest in poultry farming causing substantial economic losses every year, nucleotide data are rare in the public databases. Therefore, de novo sequencing covering the transcriptome of D. gallinae was carried out resulting in a dataset of 232 097 singletons and 42 130 contiguous sequences (contigs) which were subsequently clustered into 24 140 isogroups consisting of 35 788 isotigs. After removal of sequences possibly originating from bacteria or the chicken host, 267 464 sequences (231 657 singletons, 56 contigs and 35 751 isotigs) remained, of which 10·3% showed homology to proteins derived from other organisms. The most significant Blast top-hit species was the mite Metaseiulus occidentalis followed by the tick Ixodes scapularis. To gain functional knowledge of D. gallinae transcripts, sequences were mapped to Gene Ontology terms, Kyoto Encyclopedia of Gene and Genomes (KEGG) pathways and parsed to InterProScan. The transcriptome dataset provides new insights in general mite genetics and lays a foundation for future studies on stage-specific transcriptomics as well as genomic, proteomic, and metabolomic explorations and might provide new perspectives to control this parasitic mite by identifying possible drug targets or vaccine candidates. It is also worth noting that in different tested species of the class Arachnida no 28S rRNA was detectable in the rRNA profile, indicating that 28S rRNA might consists of two separate, hydrogen-bonded fragments, whose (heat-induced) disruption may led to co-migration with 18S rRNA.
From Sequences to Insights in Microbial Ecology

PubMed Central

Knight, R.

2010-01-01

s4-3 Rapid declines in the cost of sequencing have made large volumes of DNA sequence data available to individual investigators. Now, data analysis is the rate-limiting step: providing a user with sequences alone typically leads to bewilderment, frustration, and skepticism about the technology. In this talk, I focus on how to extract insights from 16S rRNA data, including key lab steps (barcoding and normalization) and on which tools are available to perform routine but essential processing steps such as denoising, chimera detection, taxonomy assignment, and diversity analyses (including detection of biological clusters and gradients in the samples). Providing users with advice on these points and with a standard pipeline they can exploit (but modify if circumstances require) can greatly accelerate the rate of understanding, publication, and acquisition of funding for further studies.
Reprogramming neurodegeneration in the big data era.

PubMed

Zhou, Lujia; Verstreken, Patrik

2018-02-01

Recent genome-wide association studies (GWAS) have identified numerous genetic risk variants for late-onset Alzheimer's disease (AD) and Parkinson's disease (PD). However, deciphering the functional consequences of GWAS data is challenging due to a lack of reliable model systems to study the genetic variants that are often of low penetrance and non-coding identities. Pluripotent stem cell (PSC) technologies offer unprecedented opportunities for molecular phenotyping of GWAS variants in human neurons and microglia. Moreover, rapid technological advances in whole-genome RNA-sequencing and epigenome mapping fuel comprehensive and unbiased investigations of molecular alterations in PSC-derived disease models. Here, we review and discuss how integrated studies that utilize PSC technologies and genome-wide approaches may bring new mechanistic insight into the pathogenesis of AD and PD. Copyright © 2018 Elsevier Ltd. All rights reserved.
Quantitative functional characterization of conserved molecular interactions in the active site of mannitol 2-dehydrogenase

PubMed Central

Lucas, James E; Siegel, Justin B

2015-01-01

Enzyme active site residues are often highly conserved, indicating a significant role in function. In this study we quantitate the functional contribution for all conserved molecular interactions occurring within a Michaelis complex for mannitol 2-dehydrogenase derived from Pseudomonas fluorescens (pfMDH). Through systematic mutagenesis of active site residues, we reveal that the molecular interactions in pfMDH mediated by highly conserved residues not directly involved in reaction chemistry can be as important to catalysis as those directly involved in the reaction chemistry. This quantitative analysis of the molecular interactions within the pfMDH active site provides direct insight into the functional role of each molecular interaction, several of which were unexpected based on canonical sequence conservation and structural analyses. PMID:25752240
Origin and spread of photosynthesis based upon conserved sequence features in key bacteriochlorophyll biosynthesis proteins.

PubMed

Gupta, Radhey S

2012-11-01

The origin of photosynthesis and how this capability has spread to other bacterial phyla remain important unresolved questions. I describe here a number of conserved signature indels (CSIs) in key proteins involved in bacteriochlorophyll (Bchl) biosynthesis that provide important insights in these regards. The proteins BchL and BchX, which are essential for Bchl biosynthesis, are derived by gene duplication in a common ancestor of all phototrophs. More ancient gene duplication gave rise to the BchX-BchL proteins and the NifH protein of the nitrogenase complex. The sequence alignment of NifH-BchX-BchL proteins contain two CSIs that are uniquely shared by all NifH and BchX homologs, but not by any BchL homologs. These CSIs and phylogenetic analysis of NifH-BchX-BchL protein sequences strongly suggest that the BchX homologs are ancestral to BchL and that the Bchl-based anoxygenic photosynthesis originated prior to the chlorophyll (Chl)-based photosynthesis in cyanobacteria. Another CSI in the BchX-BchL sequence alignment that is uniquely shared by all BchX homologs and the BchL sequences from Heliobacteriaceae, but absent in all other BchL homologs, suggests that the BchL homologs from Heliobacteriaceae are primitive in comparison to all other photosynthetic lineages. Several other identified CSIs in the BchN homologs are commonly shared by all proteobacterial homologs and a clade consisting of the marine unicellular Cyanobacteria (Clade C). These CSIs in conjunction with the results of phylogenetic analyses and pair-wise sequence similarity on the BchL, BchN, and BchB proteins, where the homologs from Clade C Cyanobacteria and Proteobacteria exhibited close relationship, provide strong evidence that these two groups have incurred lateral gene transfers. Additionally, phylogenetic analyses and several CSIs in the BchL-N-B proteins that are uniquely shared by all Chlorobi and Chloroflexi homologs provide evidence that the genes for these proteins have also been laterally transferred between these groups. Other results and observations reported here indicate that the genes for the BchL-N-B proteins in Proteobacteria are derived from the Clade C Cyanobacteria, whereas those in Chlorobi were acquired from Chloroflexus or related bacteria by means of LGTs. Some implications of these observations regarding the origin and spread of photosynthesis are discussed.
Diversity of Secondary Structure in Catalytic Peptides with β-Turn-Biased Sequences

PubMed Central

2016-01-01

X-ray crystallography has been applied to the structural analysis of a series of tetrapeptides that were previously assessed for catalytic activity in an atroposelective bromination reaction. Common to the series is a central Pro-Xaa sequence, where Pro is either l- or d-proline, which was chosen to favor nucleation of canonical β-turn secondary structures. Crystallographic analysis of 35 different peptide sequences revealed a range of conformational states. The observed differences appear not only in cases where the Pro-Xaa loop-region is altered, but also when seemingly subtle alterations to the flanking residues are introduced. In many instances, distinct conformers of the same sequence were observed, either as symmetry-independent molecules within the same unit cell or as polymorphs. Computational studies using DFT provided additional insight into the analysis of solid-state structural features. Select X-ray crystal structures were compared to the corresponding solution structures derived from measured proton chemical shifts, 3J-values, and 1H–1H-NOESY contacts. These findings imply that the conformational space available to simple peptide-based catalysts is more diverse than precedent might suggest. The direct observation of multiple ground state conformations for peptides of this family, as well as the dynamic processes associated with conformational equilibria, underscore not only the challenge of designing peptide-based catalysts, but also the difficulty in predicting their accessible transition states. These findings implicate the advantages of low-barrier interconversions between conformations of peptide-based catalysts for multistep, enantioselective reactions. PMID:28029251
Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

PubMed

Olson, Nathan D; Treangen, Todd J; Hill, Christopher M; Cepeda-Espinoza, Victoria; Ghurye, Jay; Koren, Sergey; Pop, Mihai

2017-08-07

Metagenomic samples are snapshots of complex ecosystems at work. They comprise hundreds of known and unknown species, contain multiple strain variants and vary greatly within and across environments. Many microbes found in microbial communities are not easily grown in culture making their DNA sequence our only clue into their evolutionary history and biological function. Metagenomic assembly is a computational process aimed at reconstructing genes and genomes from metagenomic mixtures. Current methods have made significant strides in reconstructing DNA segments comprising operons, tandem gene arrays and syntenic blocks. Shorter, higher-throughput sequencing technologies have become the de facto standard in the field. Sequencers are now able to generate billions of short reads in only a few days. Multiple metagenomic assembly strategies, pipelines and assemblers have appeared in recent years. Owing to the inherent complexity of metagenome assembly, regardless of the assembly algorithm and sequencing method, metagenome assemblies contain errors. Recent developments in assembly validation tools have played a pivotal role in improving metagenomics assemblers. Here, we survey recent progress in the field of metagenomic assembly, provide an overview of key approaches for genomic and metagenomic assembly validation and demonstrate the insights that can be derived from assemblies through the use of assembly validation strategies. We also discuss the potential for impact of long-read technologies in metagenomics. We conclude with a discussion of future challenges and opportunities in the field of metagenomic assembly and validation. © The Author 2017. Published by Oxford University Press.
Longitudinal Analysis of Cerebrospinal Fluid and Plasma HIV-1 Envelope Sequences Isolated From a Single Donor with HIV Asymptomatic Neurocognitive Impairment.

PubMed

Vázquez-Santiago, Fabián; García, Yashira; Rivera-Román, Ivelisse; Noel, Richard J; Wojna, Valerie; Meléndez, Loyda M; Rivera-Amill, Vanessa

Combined antiretroviral treatment (cART) has changed the clinical presentation of HIV-associated neurocognitive disorders (HAND) to that of the milder forms of the disease. Asymptomatic neurocognitive impairment (ANI) is now more prevalent and is associated with increased morbidity and mortality risk in HIV-1-infected people. HIV-1 envelope ( env ) genetic heterogeneity has been detected within the central nervous system (CNS) of individuals with ANI. Changes within env determine co-receptor use, cellular tropism, and neuropathogenesis. We hypothesize that compartmental changes are associated with HIV-1 env C2V4 during ANI and sought to analyze paired HIV-1 env sequences from plasma and cerebrospinal fluid (CSF) of a female subject undergoing long-term cART. Paired plasma and CSF samples were collected at 12-month intervals and HIV-1 env C2V4 was cloned and sequenced. Phylogenetic analysis of paired samples consistently showed genetic variants unique to the CSF. Phenotypic prediction showed CCR5 (R5) variants for all CSF-derived sequences and showed minor X4 variants (or dual-tropic) in the plasma at later time points. Viral compartmentalization was evident throughout the study, suggesting that the occurrence of distinctive env strains may contribute to the neuropathogenesis of HAND. Our study provides new insights about the genetic characteristics within the C2V4 of HIV-1 env that persist after long-term cART and during the course of persistent ANI.

Moorea BIOCODE barcode library as a tool for understanding predator-prey interactions: insights into the diet of common predatory coral reef fishes

NASA Astrophysics Data System (ADS)

Leray, M.; Boehm, J. T.; Mills, S. C.; Meyer, C. P.

2012-06-01

Identifying species involved in consumer-resource interactions is one of the main limitations in the construction of food webs. DNA barcoding of prey items in predator guts provides a valuable tool for characterizing trophic interactions, but the method relies on the availability of reference sequences to which prey sequences can be matched. In this study, we demonstrate that the COI sequence library of the Moorea BIOCODE project, an ecosystem-level barcode initiative, enables the identification of a large proportion of semi-digested fish, crustacean and mollusks found in the guts of three Hawkfish and two Squirrelfish species. While most prey remains lacked diagnostic morphological characters, 94% of the prey found in 67 fishes had >98% sequence similarity with BIOCODE reference sequences. Using this species-level prey identification, we demonstrate how DNA barcoding can provide insights into resource partitioning, predator feeding behaviors and the consequences of predation on ecosystem function.
Novel Insights into Tree Biology and Genome Evolution as Revealed Through Genomics.

PubMed

Neale, David B; Martínez-García, Pedro J; De La Torre, Amanda R; Montanari, Sara; Wei, Xiao-Xin

2017-04-28

Reference genome sequences are the key to the discovery of genes and gene families that determine traits of interest. Recent progress in sequencing technologies has enabled a rapid increase in genome sequencing of tree species, allowing the dissection of complex characters of economic importance, such as fruit and wood quality and resistance to biotic and abiotic stresses. Although the number of reference genome sequences for trees lags behind those for other plant species, it is not too early to gain insight into the unique features that distinguish trees from nontree plants. Our review of the published data suggests that, although many gene families are conserved among herbaceous and tree species, some gene families, such as those involved in resistance to biotic and abiotic stresses and in the synthesis and transport of sugars, are often expanded in tree genomes. As the genomes of more tree species are sequenced, comparative genomics will further elucidate the complexity of tree genomes and how this relates to traits unique to trees.
Insights into Conifer Giga-Genomes1

PubMed Central

De La Torre, Amanda R.; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K.; Jansson, Stefan; Jones, Steven J.M.; Keeling, Christopher I.; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

2014-01-01

Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world’s forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20–30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325
Insights into conifer giga-genomes.

PubMed

De La Torre, Amanda R; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K; Jansson, Stefan; Jones, Steven J M; Keeling, Christopher I; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

2014-12-01

Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. © 2014 American Society of Plant Biologists. All Rights Reserved.
Remnants of an Ancient Deltaretrovirus in the Genomes of Horseshoe Bats (Rhinolophidae).

PubMed

Hron, Tomáš; Farkašová, Helena; Gifford, Robert J; Benda, Petr; Hulva, Pavel; Görföl, Tamás; Pačes, Jan; Elleder, Daniel

2018-04-10

Endogenous retrovirus (ERV) sequences provide a rich source of information about the long-term interactions between retroviruses and their hosts. However, most ERVs are derived from a subset of retrovirus groups, while ERVs derived from certain other groups remain extremely rare. In particular, only a single ERV sequence has been identified that shows evidence of being related to an ancient Deltaretrovirus , despite the large number of vertebrate genome sequences now available. In this report, we identify a second example of an ERV sequence putatively derived from a past deltaretroviral infection, in the genomes of several species of horseshoe bats (Rhinolophidae). This sequence represents a fragment of viral genome derived from a single integration. The time of the integration was estimated to be 11-19 million years ago. This finding, together with the previously identified endogenous Deltaretrovirus in long-fingered bats (Miniopteridae), suggest a close association of bats with ancient deltaretroviruses.
Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01

PubMed Central

Codoñer, Francisco M.; Martinez-Blanch, Juan F.; Acevedo-Piérart, Marcelo; Ormeño, M. Loreto; Ramón, Daniel

2016-01-01

Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. PMID:27881545
Advances in Cryptococcus genomics: insights into the evolution of pathogenesis.

PubMed

Cuomo, Christina A; Rhodes, Johanna; Desjardins, Christopher A

2018-01-01

Cryptococcus species are the causative agents of cryptococcal meningitis, a significant source of mortality in immunocompromised individuals. Initial work on the molecular epidemiology of this fungal pathogen utilized genotyping approaches to describe the genetic diversity and biogeography of two species, Cryptococcus neoformans and Cryptococcus gattii. Whole genome sequencing of representatives of both species resulted in reference assemblies enabling a wide array of downstream studies and genomic resources. With the increasing availability of whole genome sequencing, both species have now had hundreds of individual isolates sequenced, providing fine-scale insight into the evolution and diversification of Cryptococcus and allowing for the first genome-wide association studies to identify genetic variants associated with human virulence. Sequencing has also begun to examine the microevolution of isolates during prolonged infection and to identify variants specific to outbreak lineages, highlighting the potential role of hyper-mutation in evolving within short time scales. We can anticipate that further advances in sequencing technology and sequencing microbial genomes at scale, including metagenomics approaches, will continue to refine our view of how the evolution of Cryptococcus drives its success as a pathogen.
Seahorse-derived peptide suppresses invasive migration of HT1080 fibrosarcoma cells by competing with intracellular α-enolase for plasminogen binding and inhibiting uPA-mediated activation of plasminogen.

PubMed

Kim, Yong-Tae; Kim, Se-kwon; Jeon, You-Jin; Park, Sun Joo

2014-12-01

α-Enolase is a glycolytic enzyme and a surface receptor for plasminogen. α-Enolase-bound plasminogen promotes tumor cell invasion and cancer metastasis by activating plasmin and consequently degrading the extracellular matrix degradation. Therefore, α-enolase and plasminogen are novel targets for cancer therapy. We found that the amino acid sequence of a peptide purified from enzymatic hydrolysates of seahorse has striking similarities to that of α-enolase. In this study, we report that this peptide competes with cellular α-enolase for plasminogen binding and suppresses urokinase plasminogen activator (uPA)-mediated activation of plasminogen, which results in decreased invasive migration of HT1080 fibrosarcoma cells. In addition, the peptide treatment decreased the expression levels of uPA compared to that of untreated controls. These results provide new insight into the mechanism by which the seahorse-derived peptide suppresses invasive properties of human cancer cells. Our findings suggest that this peptide could emerge as a potential therapeutic agent for cancer.
Seahorse-derived peptide suppresses invasive migration of HT1080 fibrosarcoma cells by competing with intracellular α-enolase for plasminogen binding and inhibiting uPA-mediated activation of plasminogen

PubMed Central

Kim, Yong-Tae; Kim, Se-kwon; Jeon, You-Jin; Park, Sun Joo

2014-01-01

α-Enolase is a glycolytic enzyme and a surface receptor for plasminogen. α-Enolase-bound plasminogen promotes tumor cell invasion and cancer metastasis by activating plasmin and consequently degrading the extracellular matrix degradation. Therefore, α-enolase and plasminogen are novel targets for cancer therapy. We found that the amino acid sequence of a peptide purified from enzymatic hydrolysates of seahorse has striking similarities to that of α-enolase. In this study, we report that this peptide competes with cellular α-enolase for plasminogen binding and suppresses urokinase plasminogen activator (uPA)-mediated activation of plasminogen, which results in decreased invasive migration of HT1080 fibrosarcoma cells. In addition, the peptide treatment decreased the expression levels of uPA compared to that of untreated controls. These results provide new insight into the mechanism by which the seahorse-derived peptide suppresses invasive properties of human cancer cells. Our findings suggest that this peptide could emerge as a potential therapeutic agent for cancer. [BMB Reports 2014; 47(12): 691-696] PMID:24602611
A Genome-wide Analysis of Human Pluripotent Stem Cell-Derived Endothelial Cells in 2D or 3D Culture.

PubMed

Zhang, Jue; Schwartz, Michael P; Hou, Zhonggang; Bai, Yongsheng; Ardalani, Hamisha; Swanson, Scott; Steill, John; Ruotti, Victor; Elwell, Angela; Nguyen, Bao Kim; Bolin, Jennifer; Stewart, Ron; Thomson, James A; Murphy, William L

2017-04-11

A defined protocol for efficiently deriving endothelial cells from human pluripotent stem cells was established and vascular morphogenesis was used as a model system to understand how synthetic hydrogels influence global biological function compared with common 2D and 3D culture platforms. RNA sequencing demonstrated that gene expression profiles were similar for endothelial cells and pericytes cocultured in polyethylene glycol (PEG) hydrogels or Matrigel, while monoculture comparisons identified distinct vascular signatures for each cell type. Endothelial cells cultured on tissue-culture polystyrene adopted a proliferative phenotype compared with cells cultured on or encapsulated in PEG hydrogels. The proliferative phenotype correlated to increased FAK-ERK activity, and knockdown or inhibition of ERK signaling reduced proliferation and expression for cell-cycle genes while increasing expression for "3D-like" vasculature development genes. Our results provide insight into the influence of 2D and 3D culture formats on global biological processes that regulate cell function. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Tracking the fate of pasta (T. Durum semolina) immunogenic proteins by in vitro simulated digestion.

PubMed

Mamone, Gianfranco; Nitride, Chiara; Picariello, Gianluca; Addeo, Francesco; Ferranti, Pasquale; Mackie, Alan

2015-03-18

The aim of the present study was to identify and characterize the celiacogenic/immunogenic proteins and peptides released during digestion of pasta (Triticum durum semolina). Cooked pasta was digested using a harmonized in vitro static model of oral-gastro-duodenal digestion. The course of pasta protein digestion was monitored by SDS-PAGE, and gluten proteins were specifically analyzed by Western blot using sera of celiac patients. Among the allergens, nonspecific lipid-transfer protein was highly resistant to gastro-duodenal hydrolysis, while other digestion-stable allergens such as α-amylase/trypsin inhibitors were not detected being totally released in the pasta cooking water. To simulate the final stage of intestinal degradation, the gastro-duodenal digesta were incubated with porcine jejunal brush-border membrane hydrolases. Sixty-one peptides surviving the brush-border membrane peptidases were identified by liquid chromatography-mass spectrometry, including several gluten-derived sequences encrypting different motifs responsible for the induction of celiac disease. These results provide new insights into the persistence of wheat-derived peptides during digestion of cooked pasta samples.
Identification of diverse nerve growth factor-regulated genes by serial analysis of gene expression (SAGE) profiling

PubMed Central

Angelastro, James M.; Klimaschewski, Lars; Tang, Song; Vitolo, Ottavio V.; Weissman, Tamily A.; Donlin, Laura T.; Shelanski, Michael L.; Greene, Lloyd A.

2000-01-01

Neurotrophic factors such as nerve growth factor (NGF) promote a wide variety of responses in neurons, including differentiation, survival, plasticity, and repair. Such actions often require changes in gene expression. To identify the regulated genes and thereby to more fully understand the NGF mechanism, we carried out serial analysis of gene expression (SAGE) profiling of transcripts derived from rat PC12 cells before and after NGF-promoted neuronal differentiation. Multiple criteria supported the reliability of the profile. Approximately 157,000 SAGE tags were analyzed, representing at least 21,000 unique transcripts. Of these, nearly 800 were regulated by 6-fold or more in response to NGF. Approximately 150 of the regulated transcripts have been matched to named genes, the majority of which were not previously known to be NGF-responsive. Functional categorization of the regulated genes provides insight into the complex, integrated mechanism by which NGF promotes its multiple actions. It is anticipated that as genomic sequence information accrues the data derived here will continue to provide information about neurotrophic factor mechanisms. PMID:10984536
Helix Unwinding and Base Flipping Enable Human MTERF1 to Terminate Mitochondrial Transcription

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yakubovskaya, E.; Mejia, E; Byrnes, J

2010-01-01

Defects in mitochondrial gene expression are associated with aging and disease. Mterf proteins have been implicated in modulating transcription, replication and protein synthesis. We have solved the structure of a member of this family, the human mitochondrial transcriptional terminator MTERF1, bound to dsDNA containing the termination sequence. The structure indicates that upon sequence recognition MTERF1 unwinds the DNA molecule, promoting eversion of three nucleotides. Base flipping is critical for stable binding and transcriptional termination. Additional structural and biochemical results provide insight into the DNA binding mechanism and explain how MTERF1 recognizes its target sequence. Finally, we have demonstrated that themore » mitochondrial pathogenic G3249A and G3244A mutations interfere with key interactions for sequence recognition, eliminating termination. Our results provide insight into the role of mterf proteins and suggest a link between mitochondrial disease and the regulation of mitochondrial transcription.« less
Music and language perception: expectations, structural integration, and cognitive sequencing.

PubMed

Tillmann, Barbara

2012-10-01

Music can be described as sequences of events that are structured in pitch and time. Studying music processing provides insight into how complex event sequences are learned, perceived, and represented by the brain. Given the temporal nature of sound, expectations, structural integration, and cognitive sequencing are central in music perception (i.e., which sounds are most likely to come next and at what moment should they occur?). This paper focuses on similarities in music and language cognition research, showing that music cognition research provides insight into the understanding of not only music processing but also language processing and the processing of other structured stimuli. The hypothesis of shared resources between music and language processing and of domain-general dynamic attention has motivated the development of research to test music as a means to stimulate sensory, cognitive, and motor processes. Copyright © 2012 Cognitive Science Society, Inc.
A decade of pig genome sequencing: a window on pig domestication and evolution.

PubMed

Groenen, Martien A M

2016-03-29

Insight into how genomes change and adapt due to selection addresses key questions in evolutionary biology and in domestication of animals and plants by humans. In that regard, the pig and its close relatives found in Africa and Eurasia represent an excellent group of species that enables studies of the effect of both natural and human-mediated selection on the genome. The recent completion of the draft genome sequence of a domestic pig and the development of next-generation sequencing technology during the past decade have created unprecedented possibilities to address these questions in great detail. In this paper, I review recent whole-genome sequencing studies in the pig and closely-related species that provide insight into the demography, admixture and selection of these species and, in particular, how domestication and subsequent selection of Sus scrofa have shaped the genomes of these animals.
Insights into the sequence parameters for halophilic adaptation.

PubMed

Nath, Abhigyan

2016-03-01

The sequence parameters for halophilic adaptation are still not fully understood. To understand the molecular basis of protein hypersaline adaptation, a detailed analysis is carried out, and investigated the likely association of protein sequence attributes to halophilic adaptation. A two-stage strategy is implemented, where in the first stage a supervised machine learning classifier is build, giving an overall accuracy of 86 % on stratified tenfold cross validation and 90 % on blind testing set, which are better than the previously reported results. The second stage consists of statistical analysis of sequence features and possible extraction of halophilic molecular signatures. The results of this study showed that, halophilic proteins are characterized by lower average charge, lower K content, and lower S content. A statistically significant preference/avoidance list of sequence parameters is also reported giving insights into the molecular basis of halophilic adaptation. D, Q, E, H, P, T, V are significantly preferred while N, C, I, K, M, F, S are significantly avoided. Among amino acid physicochemical groups, small, polar, charged, acidic and hydrophilic groups are preferred over other groups. The halophilic proteins also showed a preference for higher average flexibility, higher average polarity and avoidance for higher average positive charge, average bulkiness and average hydrophobicity. Some interesting trends observed in dipeptide counts are also reported. Further a systematic statistical comparison is undertaken for gaining insights into the sequence feature distribution in different residue structural states. The current analysis may facilitate the understanding of the mechanism of halophilic adaptation clearer, which can be further used for rational design of halophilic proteins.
Insight into the structure and mechanism of nickel-containing superoxide dismutase derived from peptide-based mimics.

PubMed

Shearer, Jason

2014-08-19

Nickel superoxide dismutase (NiSOD) is a nickel-containing metalloenzyme that catalyzes the disproportionation of superoxide through a ping-pong mechanism that relies on accessing reduced Ni(II) and oxidized Ni(III) oxidation states. NiSOD is the most recently discovered SOD. Unlike the other known SODs (MnSOD, FeSOD, and (CuZn)SOD), which utilize "typical" biological nitrogen and oxygen donors, NiSOD utilizes a rather unexpected ligand set. In the reduced Ni(II) oxidation state, NiSOD utilizes nitrogen ligands derived from the N-terminal amine and an amidate along with two cysteinates sulfur donors. These are unusual biological ligands, especially for an SOD: amine and amidate donors are underrepresented as biological ligands, whereas cysteinates are highly susceptible to oxidative damage. An axial histidine imidazole binds to nickel upon oxidation to Ni(III). This bond is long (2.3-2.6 Å) owing to a tight hydrogen-bonding network. All of the ligating residues to Ni(II) and Ni(III) are found within the first 6 residues from the NiSOD N-terminus. Thus, small nickel-containing metallopeptides derived from the first 6-12 residues of the NiSOD sequence can reproduce many of the properties of NiSOD itself. Using these nickel-containing metallopeptide-based NiSOD mimics, we have shown that the minimal sequence needed for nickel binding and reproduction of the structural, spectroscopic, and functional properties of NiSOD is H2N-HCXXPC. Insight into how NiSOD avoids oxidative damage has also been gained. Using small NiN2S2 complexes and metallopeptide-based mimics, it was shown that the unusual nitrogen donor atoms protect the cysteinates from oxidative damage (both one-electron oxidation and oxygen atom insertion reactions) by fine-tuning the electronic structure of the nickel center. Changing the nitrogen donor set to a bis-amidate or bis-amine nitrogen donor led to catalytically nonviable species owing to nickel-cysteinate bond oxidative damage. Only the amine/amidate nitrogen donor atoms within the NiSOD ligand set produce a catalytically viable species. These metallopeptide-based mimics have also hinted at the detailed mechanism of SOD catalysis by NiSOD. One such aspect is that the axial imidazole likely remains ligated to the Ni center under rapid catalytic conditions (i.e., high superoxide loads). This reduces the degree of structural rearrangement about the nickel center, leading to higher catalytic rates. Metallopeptide-based mimics have also shown that, although an axial ligand to Ni(III) is required for catalysis, the rates are highest when this is a weak interaction, suggesting a reason for the long axial His-Ni(III) bond found in NiSOD. These mimics have also suggested a surprising mechanistic insight: O2(-) reduction via a "H(•)" tunneling event from a R-S(H(+))-Ni(II) moiety to O2(-) is possible. The importance of this mechanism in NiSOD has not been verified.
Draft Genome Sequence of Lactobacillus plantarum Strain IPLA 88

PubMed Central

Ladero, Victor; Alvarez-Sieiro, Patricia; Redruello, Begoña; del Rio, Beatriz; Linares, Daniel M.; Martin, M. Cruz; Fernández, María

2013-01-01

Here, we report a 3.2-Mbp draft assembly for the genome of Lactobacillus plantarum IPLA 88. The sequence of this sourdough isolate provides insight into the adaptation of this versatile species to different environments. PMID:23887921
Finding a (pine) needle in a haystack: chloroplast genome sequence divergence in rare and widespread pines

Treesearch

J.B. Whittall; J. Syring; M. Parks; J. Buenrostro; C. Dick; A. Liston; R. Cronn

2010-01-01

Critical to conservation efforts and other investigations at low taxonomic levels, DNA sequence data offer important insights into the distinctiveness, biogeographic partitioning, and evolutionary histories of species. The resolving power of DNA sequences is often limited by insufficient variability at the intraspecific level. This is particularly true of studies...
Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01.

PubMed

Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Acevedo-Piérart, Marcelo; Ormeño, M Loreto; Ramón, Daniel; Genovés, Salvador

2016-11-23

Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. Copyright © 2016 Chenoll et al.

Parallel metatranscriptome analyses of host and symbiont gene expression in the gut of the termite Reticulitermes flavipes

PubMed Central

Tartar, Aurélien; Wheeler, Marsha M; Zhou, Xuguo; Coy, Monique R; Boucias, Drion G; Scharf, Michael E

2009-01-01

Background Termite lignocellulose digestion is achieved through a collaboration of host plus prokaryotic and eukaryotic symbionts. In the present work, we took a combined host and symbiont metatranscriptomic approach for investigating the digestive contributions of host and symbiont in the lower termite Reticulitermes flavipes. Our approach consisted of parallel high-throughput sequencing from (i) a host gut cDNA library and (ii) a hindgut symbiont cDNA library. Subsequently, we undertook functional analyses of newly identified phenoloxidases with potential importance as pretreatment enzymes in industrial lignocellulose processing. Results Over 10,000 expressed sequence tags (ESTs) were sequenced from the 2 libraries that aligned into 6,555 putative transcripts, including 171 putative lignocellulase genes. Sequence analyses provided insights in two areas. First, a non-overlapping complement of host and symbiont (prokaryotic plus protist) glycohydrolase gene families known to participate in cellulose, hemicellulose, alpha carbohydrate, and chitin degradation were identified. Of these, cellulases are contributed by host plus symbiont genomes, whereas hemicellulases are contributed exclusively by symbiont genomes. Second, a diverse complement of previously unknown genes that encode proteins with homology to lignase, antioxidant, and detoxification enzymes were identified exclusively from the host library (laccase, catalase, peroxidase, superoxide dismutase, carboxylesterase, cytochrome P450). Subsequently, functional analyses of phenoloxidase activity provided results that were strongly consistent with patterns of laccase gene expression. In particular, phenoloxidase activity and laccase gene expression are mostly restricted to symbiont-free foregut plus salivary gland tissues, and phenoloxidase activity is inducible by lignin feeding. Conclusion To our knowledge, this is the first time that a dual host-symbiont transcriptome sequencing effort has been conducted in a single termite species. This sequence database represents an important new genomic resource for use in further studies of collaborative host-symbiont termite digestion, as well as development of coevolved host and symbiont-derived biocatalysts for use in industrial biomass-to-bioethanol applications. Additionally, this study demonstrates that: (i) phenoloxidase activities are prominent in the R. flavipes gut and are not symbiont derived, (ii) expands the known number of host and symbiont glycosyl hydrolase families in Reticulitermes, and (iii) supports previous models of lignin degradation and host-symbiont collaboration in cellulose/hemicellulose digestion in the termite gut. All sequences in this paper are available publicly with the accession numbers FL634956-FL640828 (Termite Gut library) and FL641015-FL645753 (Symbiont library). PMID:19832970
Phylogenetic analysis of higher-level relationships within Hydroidolina (Cnidaria: Hydrozoa) using mitochondrial genome data and insight into their mitochondrial transcription.

PubMed

Kayal, Ehsan; Bentlage, Bastian; Cartwright, Paulyn; Yanagihara, Angel A; Lindsay, Dhugal J; Hopcroft, Russell R; Collins, Allen G

2015-01-01

Hydrozoans display the most morphological diversity within the phylum Cnidaria. While recent molecular studies have provided some insights into their evolutionary history, sister group relationships remain mostly unresolved, particularly at mid-taxonomic levels. Specifically, within Hydroidolina, the most speciose hydrozoan subclass, the relationships and sometimes integrity of orders are highly unsettled. Here we obtained the near complete mitochondrial sequence of twenty-six hydroidolinan hydrozoan species from a range of sources (DNA and RNA-seq data, long-range PCR). Our analyses confirm previous inference of the evolution of mtDNA in Hydrozoa while introducing a novel genome organization. Using RNA-seq data, we propose a mechanism for the expression of mitochondrial mRNA in Hydroidolina that can be extrapolated to the other medusozoan taxa. Phylogenetic analyses using the full set of mitochondrial gene sequences provide some insights into the order-level relationships within Hydroidolina, including siphonophores as the first diverging clade, a well-supported clade comprised of Leptothecata-Filifera III-IV, and a second clade comprised of Aplanulata-Capitata s.s.-Filifera I-II. Finally, we describe our relatively inexpensive and accessible multiplexing strategy to sequence long-range PCR amplicons that can be adapted to most high-throughput sequencing platforms.
Phylogenetic analysis of higher-level relationships within Hydroidolina (Cnidaria: Hydrozoa) using mitochondrial genome data and insight into their mitochondrial transcription

PubMed Central

Bentlage, Bastian; Cartwright, Paulyn; Yanagihara, Angel A.; Lindsay, Dhugal J.; Hopcroft, Russell R.; Collins, Allen G.

2015-01-01

Hydrozoans display the most morphological diversity within the phylum Cnidaria. While recent molecular studies have provided some insights into their evolutionary history, sister group relationships remain mostly unresolved, particularly at mid-taxonomic levels. Specifically, within Hydroidolina, the most speciose hydrozoan subclass, the relationships and sometimes integrity of orders are highly unsettled. Here we obtained the near complete mitochondrial sequence of twenty-six hydroidolinan hydrozoan species from a range of sources (DNA and RNA-seq data, long-range PCR). Our analyses confirm previous inference of the evolution of mtDNA in Hydrozoa while introducing a novel genome organization. Using RNA-seq data, we propose a mechanism for the expression of mitochondrial mRNA in Hydroidolina that can be extrapolated to the other medusozoan taxa. Phylogenetic analyses using the full set of mitochondrial gene sequences provide some insights into the order-level relationships within Hydroidolina, including siphonophores as the first diverging clade, a well-supported clade comprised of Leptothecata-Filifera III–IV, and a second clade comprised of Aplanulata-Capitata s.s.-Filifera I–II. Finally, we describe our relatively inexpensive and accessible multiplexing strategy to sequence long-range PCR amplicons that can be adapted to most high-throughput sequencing platforms. PMID:26618080
Using Full Genomic Information to Predict Disease: Breaking Down the Barriers Between Complex and Mendelian Diseases.

PubMed

Jordan, Daniel M; Do, Ron

2018-04-11

While sequence-based genetic tests have long been available for specific loci, especially for Mendelian disease, the rapidly falling costs of genome-wide genotyping arrays, whole-exome sequencing, and whole-genome sequencing are moving us toward a future where full genomic information might inform the prognosis and treatment of a variety of diseases, including complex disease. Similarly, the availability of large populations with full genomic information has enabled new insights about the etiology and genetic architecture of complex disease. Insights from the latest generation of genomic studies suggest that our categorization of diseases as complex may conceal a wide spectrum of genetic architectures and causal mechanisms that ranges from Mendelian forms of complex disease to complex regulatory structures underlying Mendelian disease. Here, we review these insights, along with advances in the prediction of disease risk and outcomes from full genomic information. Expected final online publication date for the Annual Review of Genomics and Human Genetics Volume 19 is August 31, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Analysis and Prediction of Myristoylation Sites Using the mRMR Method, the IFS Method and an Extreme Learning Machine Algorithm.

PubMed

Wang, ShaoPeng; Zhang, Yu-Hang; Huang, GuoHua; Chen, Lei; Cai, Yu-Dong

2017-01-01

Myristoylation is an important hydrophobic post-translational modification that is covalently bound to the amino group of Gly residues on the N-terminus of proteins. The many diverse functions of myristoylation on proteins, such as membrane targeting, signal pathway regulation and apoptosis, are largely due to the lipid modification, whereas abnormal or irregular myristoylation on proteins can lead to several pathological changes in the cell. To better understand the function of myristoylated sites and to correctly identify them in protein sequences, this study conducted a novel computational investigation on identifying myristoylation sites in protein sequences. A training dataset with 196 positive and 84 negative peptide segments were obtained. Four types of features derived from the peptide segments following the myristoylation sites were used to specify myristoylatedand non-myristoylated sites. Then, feature selection methods including maximum relevance and minimum redundancy (mRMR), incremental feature selection (IFS), and a machine learning algorithm (extreme learning machine method) were adopted to extract optimal features for the algorithm to identify myristoylation sites in protein sequences, thereby building an optimal prediction model. As a result, 41 key features were extracted and used to build an optimal prediction model. The effectiveness of the optimal prediction model was further validated by its performance on a test dataset. Furthermore, detailed analyses were also performed on the extracted 41 features to gain insight into the mechanism of myristoylation modification. This study provided a new computational method for identifying myristoylation sites in protein sequences. We believe that it can be a useful tool to predict myristoylation sites from protein sequences. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Complete genome sequence of the haloalkaliphilic, hydrogen-producing bacterium Halanaerobium hydrogeniformans.

PubMed

Brown, Steven D; Begemann, Matthew B; Mormile, Melanie R; Wall, Judy D; Han, Cliff S; Goodwin, Lynne A; Pitluck, Samuel; Land, Miriam L; Hauser, Loren J; Elias, Dwayne A

2011-07-01

Halanaerobium hydrogenoformans is an alkaliphilic bacterium capable of biohydrogen production at pH 11 and 7% (wt/vol) salt. We present the 2.6-Mb genome sequence to provide insights into its physiology and potential for bioenergy applications.
Evolutionary Descent of Prion Genes from the ZIP Family of Metal Ion Transporters

PubMed Central

Schmitt-Ulms, Gerold; Ehsani, Sepehr; Watts, Joel C.; Westaway, David; Wille, Holger

2009-01-01

In the more than twenty years since its discovery, both the phylogenetic origin and cellular function of the prion protein (PrP) have remained enigmatic. Insights into a possible function of PrP may be obtained through the characterization of its molecular neighborhood in cells. Quantitative interactome data demonstrated the spatial proximity of two metal ion transporters of the ZIP family, ZIP6 and ZIP10, to mammalian prion proteins in vivo. A subsequent bioinformatic analysis revealed the unexpected presence of a PrP-like amino acid sequence within the N-terminal, extracellular domain of a distinct sub-branch of the ZIP protein family that includes ZIP5, ZIP6 and ZIP10. Additional structural threading and orthologous sequence alignment analyses argued that the prion gene family is phylogenetically derived from a ZIP-like ancestral molecule. The level of sequence homology and the presence of prion protein genes in most chordate species place the split from the ZIP-like ancestor gene at the base of the chordate lineage. This relationship explains structural and functional features found within mammalian prion proteins as elements of an ancient involvement in the transmembrane transport of divalent cations. The phylogenetic and spatial connection to ZIP proteins is expected to open new avenues of research to elucidate the biology of the prion protein in health and disease. PMID:19784368
Lucinidae/sulfur-oxidizing bacteria: ancestral heritage or opportunistic association? Further insights from the Bohol Sea (the Philippines).

PubMed

Brissac, Terry; Merçot, Hervé; Gros, Olivier

2011-01-01

The first studies of the 16S rRNA gene diversity of the bacterial symbionts found in lucinid clams did not clarify how symbiotic associations had evolved in this group. Indeed, although species-specific associations deriving from a putative ancestral symbiotic association have been described (coevolution scenario), associations between the same bacterial species and various host species (opportunistic scenario) have also been described. Here, we carried out a comparative molecular analysis of hosts, based on 18S and 28S rRNA gene sequences, and of symbionts, based on 16S rRNA gene sequences, to determine as to which evolutionary scenario led to modern lucinid/symbiont associations. For all sequences analyzed, we found only three bacterial symbiont species, two of which are harbored by lucinids colonizing mangrove swamps. The last symbiont is the most common and was found to be independent of biotope or depth. Another interesting feature is the similarity of ctenidial organization of lucinids from the Philippines to those described previously, with the exception that two bacterial morphotypes were observed in two different species (Gloverina rectangularis and Myrtea flabelliformis). Thus, there is apparently no specific association between Lucinidae and their symbionts, the association taking place according to which bacterial species is present in the environment. FEMS Microbiology Ecology © 2010 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. No claim to original French government works.
Role of Mitochondrial Inheritance on Prostate Cancer Outcome in African American Men. Addendum

DTIC Science & Technology

2016-11-01

DNA sequencing technique developed by our collaborator using single amplicon long-range PCR that permits deep coverage (10,000-20,000X on average) of...the mitochondrial genome. We have sequenced 652 samples derived from frozen fully using this technology. The additional DNA samples derived from...paraffin embedded (FFPE) tissue were more challenging, but have now been sequenced . Mapping of DNA variants in our sequenced genomes to mitochondrial
An insight into the sialome of the blood-sucking bug Triatoma infestans, a vector of Chagas' disease

PubMed Central

Assumpção, Teresa C. F.; Francischetti, Ivo M. B.; Andersen, John F.; Schwarz, Alexandra; Santana, Jaime M.; Ribeiro, José M. C.

2008-01-01

Triatoma infestans is a hemiptera, vector of Chagas’ disease, that feeds exclusively on vertebrate blood in all life stages. Hematophagous insects’ salivary glands (SG) produce potent pharmacological compounds that counteract host hemostasis, including anti-clotting, anti-platelet, and vasodilatory molecules. To obtain a further insight into the salivary biochemical and pharmacological complexity of this insect, a cDNA library from its salivary glands was randomly sequenced. Also, salivary proteins were submitted to two dimentional gel (2D-gel) electrophoresis followed by MS analysis. We present the analysis of a set of 1,534 (SG) cDNA sequences, 645 of which coded for proteins of a putative secretory nature. Most salivary proteins described as lipocalins matched peptide sequences obtained from proteomic results. PMID:18207082
Chiropteran influenza viruses: flu from bats or a relic from the past?

PubMed

Brunotte, Linda; Beer, Martin; Horie, Masayuki; Schwemmle, Martin

2016-02-01

The identification of influenza A-like genomic sequences in bats suggests the existence of distinct lineages of chiropteran influenza viruses in South and Central America. These viruses share similarities with conventional influenza A viruses but lack the canonical receptor-binding property and neuraminidase function. The inability to isolate infectious bat influenza viruses impeded further studies, however, reverse genetic analysis provided new insights into the molecular biology of these viruses. In this review, we highlight the recent developments in the field of the newly discovered bat-derived influenza A-like viruses. We also discuss whether bats are a neglected natural reservoir of influenza viruses, the risk associated with bat influenza viruses for humans and whether these viruses originate from the pool of avian IAV or vice versa. Copyright © 2016 Elsevier B.V. All rights reserved.
Capturing Motion and Depth Before Cinematography.

PubMed

Wade, Nicholas J

2016-01-01

Visual representations of biological states have traditionally faced two problems: they lacked motion and depth. Attempts were made to supply these wants over many centuries, but the major advances were made in the early-nineteenth century. Motion was synthesized by sequences of slightly different images presented in rapid succession and depth was added by presenting slightly different images to each eye. Apparent motion and depth were combined some years later, but they tended to be applied separately. The major figures in this early period were Wheatstone, Plateau, Horner, Duboscq, Claudet, and Purkinje. Others later in the century, like Marey and Muybridge, were stimulated to extend the uses to which apparent motion and photography could be applied to examining body movements. These developments occurred before the birth of cinematography, and significant insights were derived from attempts to combine motion and depth.
Complete Genome Sequence of Lactobacillus rhamnosus Strain BPL5 (CECT 8800), a Probiotic for Treatment of Bacterial Vaginosis.

PubMed

Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Ramón, Daniel; Genovés, Salvador; Menabrito, Marco

2016-04-21

ITALIC! Lactobacillus rhamnosusBPL5 (CECT 8800), is a probiotic strain suitable for the treatment of bacterial vaginosis. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insight into its functional activity. Copyright © 2016 Chenoll et al.
Draft Genome Sequence and Description of Janthinobacterium sp. Strain CG3, a Psychrotolerant Antarctic Supraglacial Stream Bacterium

PubMed Central

Smith, Heidi; Akiyama, Tatsuya; Franklin, Michael; Woyke, Tanja; Teshima, Hazuki; Davenport, Karen; Daligault, Hajnalka; Erkkila, Tracy; Goodwin, Lynne; Gu, Wei; Xu, Yan; Chain, Patrick

2013-01-01

Here we present the draft genome sequence of Janthinobacterium sp. strain CG3, a psychrotolerant non-violacein-producing bacterium that was isolated from the Cotton Glacier supraglacial stream. The genome sequence of this organism will provide insight as to the mechanisms necessary for bacteria to survive in UV-stressed icy environments. PMID:24265494
Contribution of Reactive and Proactive Control to Children's Working Memory Performance: Insight from Item Recall Durations in Response Sequence Planning

ERIC Educational Resources Information Center

Chevalier, Nicolas; James, Tiffany D.; Wiebe, Sandra A.; Nelson, Jennifer Mize; Espy, Kimberly Andrews

2014-01-01

The present study addressed whether developmental improvement in working memory span task performance relies upon a growing ability to proactively plan response sequences during childhood. Two hundred thirteen children completed a working memory span task in which they used a touchscreen to reproduce orally presented sequences of animal names.…
Draft Genome Sequence of Lactobacillus crispatus EM-LC1, an Isolate with Antimicrobial Activity Cultured from an Elderly Subject

PubMed Central

Power, Susan E.; Harris, Hugh M. B.; Bottacini, Francesca; Ross, R. Paul; O’Toole, Paul W.

2013-01-01

Here we report the 1.86-Mb draft genome sequence of Lactobacillus crispatus EM-LC1, a fecal isolate with antimicrobial activity. This genome sequence is expected to provide insights into the antimicrobial activity of L. crispatus and improve our knowledge of its potential probiotic traits. PMID:24356836
Comparative sequence analysis suggests a conserved gating mechanism for TRP channels

PubMed Central

Palovcak, Eugene; Delemotte, Lucie; Klein, Michael L.

2015-01-01

The transient receptor potential (TRP) channel superfamily plays a central role in transducing diverse sensory stimuli in eukaryotes. Although dissimilar in sequence and domain organization, all known TRP channels act as polymodal cellular sensors and form tetrameric assemblies similar to those of their distant relatives, the voltage-gated potassium (Kv) channels. Here, we investigated the related questions of whether the allosteric mechanism underlying polymodal gating is common to all TRP channels, and how this mechanism differs from that underpinning Kv channel voltage sensitivity. To provide insight into these questions, we performed comparative sequence analysis on large, comprehensive ensembles of TRP and Kv channel sequences, contextualizing the patterns of conservation and correlation observed in the TRP channel sequences in light of the well-studied Kv channels. We report sequence features that are specific to TRP channels and, based on insight from recent TRPV1 structures, we suggest a model of TRP channel gating that differs substantially from the one mediating voltage sensitivity in Kv channels. The common mechanism underlying polymodal gating involves the displacement of a defect in the H-bond network of S6 that changes the orientation of the pore-lining residues at the hydrophobic gate. PMID:26078053
First genetic classification of Cryptosporidium and Giardia from HIV/AIDS patients in Malaysia.

PubMed

Lim, Yvonne A L; Iqbal, Asma; Surin, Johari; Sim, Benedict L H; Jex, Aaron R; Nolan, Matthew J; Smith, Huw V; Gasser, Robin B

2011-07-01

Given the HIV epidemic in Malaysia, genetic information on opportunistic pathogens, such as Cryptosporidium and Giardia, in HIV/AIDS patients is pivotal to enhance our understanding of epidemiology, patient care, management and disease surveillance. In the present study, 122 faecal samples from HIV/AIDS patients were examined for the presence of Cryptosporidium oocysts and Giardia cysts using a conventional coproscopic approach. Such oocysts and cysts were detected in 22.1% and 5.7% of the 122 faecal samples, respectively. Genomic DNAs from selected samples were tested in a nested-PCR, targeting regions of the small subunit (SSU) of nuclear ribosomal RNA and the 60kDa glycoprotein (gp60) genes (for Cryptosporidium), and the triose-phosphate isomerase (tpi) gene (for Giardia), followed by direct sequencing. The sequencing of amplicons derived from SSU revealed that Cryptosporidium parvum was the most frequently detected species (64% of 25 samples tested), followed by C. hominis (24%), C. meleagridis (8%) and C. felis (4%). Sequencing of a region of gp60 identified C. parvum subgenotype IIdA15G2R1 and C. hominis subgenotypes IaA14R1, IbA10G2R2, IdA15R2, IeA11G2T3R1 and IfA11G1R2. Sequencing of amplicons derived from tpi revealed G. duodenalis assemblage A, which is of zoonotic importance. This is the first report of C. hominis, C. meleagridis and C. felis from Malaysian HIV/AIDS patients. Future work should focus on an extensive analysis of Cryptosporidium and Giardia in such patients as well as in domestic and wild animals, in order to improve the understanding of transmission patterns and dynamics in Malaysia. It would also be particularly interesting to establish the relationship among clinical manifestation, CD4 cell counts and genotypes/subgenotypes of Cryptosporidium and Giardia in HIV/AIDS patients. Such insights would assist in a better management of clinical disease in immuno-deficient patients as well as improved preventive and control strategies. Copyright © 2011 Elsevier B.V. All rights reserved.
Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species.

PubMed

Chen, Zhiwen; Feng, Kun; Grover, Corrinne E; Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F; Wang, Kunbo; Hua, Jinping

2016-01-01

The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support for the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium.
3DNALandscapes: a database for exploring the conformational features of DNA.

PubMed

Zheng, Guohui; Colasanti, Andrew V; Lu, Xiang-Jun; Olson, Wilma K

2010-01-01

3DNALandscapes, located at: http://3DNAscapes.rutgers.edu, is a new database for exploring the conformational features of DNA. In contrast to most structural databases, which archive the Cartesian coordinates and/or derived parameters and images for individual structures, 3DNALandscapes enables searches of conformational information across multiple structures. The database contains a wide variety of structural parameters and molecular images, computed with the 3DNA software package and known to be useful for characterizing and understanding the sequence-dependent spatial arrangements of the DNA sugar-phosphate backbone, sugar-base side groups, base pairs, base-pair steps, groove structure, etc. The data comprise all DNA-containing structures--both free and bound to proteins, drugs and other ligands--currently available in the Protein Data Bank. The web interface allows the user to link, report, plot and analyze this information from numerous perspectives and thereby gain insight into DNA conformation, deformability and interactions in different sequence and structural contexts. The data accumulated from known, well-resolved DNA structures can serve as useful benchmarks for the analysis and simulation of new structures. The collective data can also help to understand how DNA deforms in response to proteins and other molecules and undergoes conformational rearrangements.

Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species

PubMed Central

Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F.; Wang, Kunbo

2016-01-01

The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support for the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium. PMID:27309527
Composition and Activity of Microbial Communities along the Redox Gradient of an Alkaline, Hypersaline, Lake

PubMed Central

Edwardson, Christian F.; Hollibaugh, James T.

2018-01-01

We compared the composition of microbial communities obtained by sequencing 16S rRNA gene amplicons with taxonomy derived from metatranscriptomes from the same samples. Samples were collected from alkaline, hypersaline Mono Lake, California, USA at five depths that captured the major redox zones of the lake during the onset of meromixis. The prokaryotic community was dominated by bacteria from the phyla Proteobacteria, Firmicutes, and Bacteroidetes, while the picoeukaryotic chlorophyte Picocystis dominated the eukaryotes. Most (80%) of the abundant (>1% relative abundance) OTUs recovered as amplicons of 16S rRNA genes have been reported in previous surveys, indicating that Mono Lake's microbial community has remained stable over 12 years that have included periods of regular, annual overturn interspersed by episodes of prolonged meromixis that result in extremely reducing conditions in bottom water. Metatranscriptomic sequences binned predominately to the Gammaproteobacteria genera Thioalkalivibrio (4–13%) and Thioalkalimicrobium (0–14%); and to the Firmicutes genera Dethiobacter (0–5%) and Clostridium (1–4%), which were also abundant in the 16S rRNA gene amplicon libraries. This study provides insight into the taxonomic affiliations of transcriptionally active communities of the lake's water column under different redox conditions. PMID:29445359
Differential signatures of bacterial and mammalian IMP dehydrogenase enzymes.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, R.; Evans, G.; Rotella, F.

1999-06-01

IMP dehydrogenase (IMPDH) is an essential enzyme of de novo guanine nucleotide synthesis. IMPDH inhibitors have clinical utility as antiviral, anticancer or immunosuppressive agents. The essential nature of this enzyme suggests its therapeutic applications may be extended to the development of antimicrobial agents. Bacterial IMPDH enzymes show bio- chemical and kinetic characteristics that are different than the mammalian IMPDH enzymes, suggesting IMPDH may be an attractive target for the development of antimicrobial agents. We suggest that the biochemical and kinetic differences between bacterial and mammalian enzymes are a consequence of the variance of specific, identifiable amino acid residues. Identification ofmore » these residues or combination of residues that impart this mammalian or bacterial enzyme signature is a prerequisite for the rational identification of agents that specifically target the bacterial enzyme. We used sequence alignments of IMPDH proteins to identify sequence signatures associated with bacterial or eukaryotic IMPDH enzymes. These selections were further refined to discern those likely to have a role in catalysis using information derived from the bacterial and mammalian IMPDH crystal structures and site-specific mutagenesis. Candidate bacterial sequence signatures identified by this process include regions involved in subunit interactions, the active site flap and the NAD binding region. Analysis of sequence alignments in these regions indicates a pattern of catalytic residues conserved in all enzymes and a secondary pattern of amino acid conservation associated with the major phylogenetic groups. Elucidation of the basis for this mammalian/bacterial IMPDH signature will provide insight into the catalytic mechanism of this enzyme and the foundation for the development of highly specific inhibitors.« less
Genetic diversity, genetic structure and demographic history of Cycas simplicipinna (Cycadaceae) assessed by DNA sequences and SSR markers

PubMed Central

2014-01-01

Background Cycas simplicipinna (T. Smitinand) K. Hill. (Cycadaceae) is an endangered species in China. There were seven populations and 118 individuals that we could collect were genotyped in this study. Here, we assessed the genetic diversity, genetic structure and demographic history of this species. Results Analyses of data of DNA sequences (two maternally inherited intergenic spacers of chloroplast, cpDNA and one biparentally inherited internal transcribed spacer region ITS4-ITS5, nrDNA) and sixteen microsatellite loci (SSR) were conducted in the species. Of the 118 samples, 86 individuals from the seven populations were used for DNA sequencing and 115 individuals from six populations were used for the microsatellite study. We found high genetic diversity at the species level, low genetic diversity within each of the seven populations and high genetic differentiation among the populations. There was a clear genetic structure within populations of C. simplicipinna. A demographic history inferred from DNA sequencing data indicates that C. simplicipinna experienced a recent population contraction without retreating to a common refugium during the last glacial period. The results derived from SSR data also showed that C. simplicipinna underwent past effective population contraction, likely during the Pleistocene. Conclusions Some genetic features of C. simplicipinna such as having high genetic differentiation among the populations, a clear genetic structure and a recent population contraction could provide guidelines for protecting this endangered species from extinction. Furthermore, the genetic features with population dynamics of the species in our study would help provide insights and guidelines for protecting other endangered species effectively. PMID:25016306
Longitudinal Analysis of Cerebrospinal Fluid and Plasma HIV-1 Envelope Sequences Isolated From a Single Donor with HIV Asymptomatic Neurocognitive Impairment

PubMed Central

Vázquez-Santiago, Fabián; García, Yashira; Rivera-Román, Ivelisse; Noel, Richard J.; Wojna, Valerie; Meléndez, Loyda M.; Rivera-Amill, Vanessa

2015-01-01

Objective Combined antiretroviral treatment (cART) has changed the clinical presentation of HIV-associated neurocognitive disorders (HAND) to that of the milder forms of the disease. Asymptomatic neurocognitive impairment (ANI) is now more prevalent and is associated with increased morbidity and mortality risk in HIV-1–infected people. HIV-1 envelope (env) genetic heterogeneity has been detected within the central nervous system (CNS) of individuals with ANI. Changes within env determine co-receptor use, cellular tropism, and neuropathogenesis. We hypothesize that compartmental changes are associated with HIV-1 env C2V4 during ANI and sought to analyze paired HIV-1 env sequences from plasma and cerebrospinal fluid (CSF) of a female subject undergoing long-term cART. Methods Paired plasma and CSF samples were collected at 12-month intervals and HIV-1 env C2V4 was cloned and sequenced. Results Phylogenetic analysis of paired samples consistently showed genetic variants unique to the CSF. Phenotypic prediction showed CCR5 (R5) variants for all CSF-derived sequences and showed minor X4 variants (or dual-tropic) in the plasma at later time points. Viral compartmentalization was evident throughout the study, suggesting that the occurrence of distinctive env strains may contribute to the neuropathogenesis of HAND. Conclusions Our study provides new insights about the genetic characteristics within the C2V4 of HIV-1 env that persist after long-term cART and during the course of persistent ANI. PMID:26167513
DNA sequences and composition from 12 BAC clones-derived MUSB SSR markers mapped to cotton (Gossypium Hirsutum L. x G. Barbadense L.)chromosomes 11 and 21

USDA-ARS?s Scientific Manuscript database

To discover resistance (R) and/or pathogen-induced (PR) genes involved in disease response, 12 bacterial artificial chromosome (BAC) clones from cv. Acala Maxxa (G. hirsutum) were sequenced at the Clemson University, Genomics Institute, Clemson, SC. These BACs derived MUSB single sequence repeat (SS...
The Population History of Endogenous Retroviruses in Mule Deer (Odocoileus hemionus)

PubMed Central

2014-01-01

Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5′ and 3′ long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations. PMID:24336966
Finding a needle in the virus metagenome haystack--micro-metagenome analysis captures a snapshot of the diversity of a bacteriophage armoire.

PubMed

Ray, Jessica; Dondrup, Michael; Modha, Sejal; Steen, Ida Helene; Sandaa, Ruth-Anne; Clokie, Martha

2012-01-01

Viruses are ubiquitous in the oceans and critical components of marine microbial communities, regulating nutrient transfer to higher trophic levels or to the dissolved organic pool through lysis of host cells. Hydrothermal vent systems are oases of biological activity in the deep oceans, for which knowledge of biodiversity and its impact on global ocean biogeochemical cycling is still in its infancy. In order to gain biological insight into viral communities present in hydrothermal vent systems, we developed a method based on deep-sequencing of pulsed field gel electrophoretic bands representing key viral fractions present in seawater within and surrounding a hydrothermal plume derived from Loki's Castle vent field at the Arctic Mid-Ocean Ridge. The reduction in virus community complexity afforded by this novel approach enabled the near-complete reconstruction of a lambda-like phage genome from the virus fraction of the plume. Phylogenetic examination of distinct gene regions in this lambdoid phage genome unveiled diversity at loci encoding superinfection exclusion- and integrase-like proteins. This suggests the importance of fine-tuning lyosgenic conversion as a viral survival strategy, and provides insights into the nature of host-virus and virus-virus interactions, within hydrothermal plumes. By reducing the complexity of the viral community through targeted sequencing of prominent dsDNA viral fractions, this method has selectively mimicked virus dominance approaching that hitherto achieved only through culturing, thus enabling bioinformatic analysis to locate a lambdoid viral "needle" within the greater viral community "haystack". Such targeted analyses have great potential for accelerating the extraction of biological knowledge from diverse and poorly understood environmental viral communities.
Insights into archaeal evolution and symbiosis from the genomes of a nanoarchaeon and its inferred crenarchaeal host from Obsidian Pool, Yellowstone National Park.

PubMed

Podar, Mircea; Makarova, Kira S; Graham, David E; Wolf, Yuri I; Koonin, Eugene V; Reysenbach, Anna-Louise

2013-04-22

A single cultured marine organism, Nanoarchaeum equitans, represents the Nanoarchaeota branch of symbiotic Archaea, with a highly reduced genome and unusual features such as multiple split genes. The first terrestrial hyperthermophilic member of the Nanoarchaeota was collected from Obsidian Pool, a thermal feature in Yellowstone National Park, separated by single cell isolation, and sequenced together with its putative host, a Sulfolobales archaeon. Both the new Nanoarchaeota (Nst1) and N. equitans lack most biosynthetic capabilities, and phylogenetic analysis of ribosomal RNA and protein sequences indicates that the two form a deep-branching archaeal lineage. However, the Nst1 genome is more than 20% larger, and encodes a complete gluconeogenesis pathway as well as the full complement of archaeal flagellum proteins. With a larger genome, a smaller repertoire of split protein encoding genes and no split non-contiguous tRNAs, Nst1 appears to have experienced less severe genome reduction than N. equitans. These findings imply that, rather than representing ancestral characters, the extremely compact genomes and multiple split genes of Nanoarchaeota are derived characters associated with their symbiotic or parasitic lifestyle. The inferred host of Nst1 is potentially autotrophic, with a streamlined genome and simplified central and energetic metabolism as compared to other Sulfolobales. Comparison of the N. equitans and Nst1 genomes suggests that the marine and terrestrial lineages of Nanoarchaeota share a common ancestor that was already a symbiont of another archaeon. The two distinct Nanoarchaeota-host genomic data sets offer novel insights into the evolution of archaeal symbiosis and parasitism, enabling further studies of the cellular and molecular mechanisms of these relationships. This article was reviewed by Patrick Forterre, Bettina Siebers (nominated by Michael Galperin) and Purification Lopez-Garcia.
Exploring sequence requirements for C₃/C₄ carboxylate recognition in the Pseudomonas aeruginosa cephalosporinase: Insights into plasticity of the AmpC β-lactamase.

PubMed

Drawz, Sarah M; Taracila, Magdalena; Caselli, Emilia; Prati, Fabio; Bonomo, Robert A

2011-06-01

In Pseudomonas aeruginosa, the chromosomally encoded class C cephalosporinase (AmpC β-lactamase) is often responsible for high-level resistance to β-lactam antibiotics. Despite years of study of these important β-lactamases, knowledge regarding how amino acid sequence dictates function of the AmpC Pseudomonas-derived cephalosporinase (PDC) remains scarce. Insights into structure-function relationships are crucial to the design of both β-lactams and high-affinity inhibitors. In order to understand how PDC recognizes the C₃/C₄ carboxylate of β-lactams, we first examined a molecular model of a P. aeruginosa AmpC β-lactamase, PDC-3, in complex with a boronate inhibitor that possesses a side chain that mimics the thiazolidine/dihydrothiazine ring and the C₃/C₄ carboxylate characteristic of β-lactam substrates. We next tested the hypothesis generated by our model, i.e. that more than one amino acid residue is involved in recognition of the C₃/C₄ β-lactam carboxylate, and engineered alanine variants at three putative carboxylate binding amino acids. Antimicrobial susceptibility testing showed that the PDC-3 β-lactamase maintains a high level of activity despite the substitution of C₃/C₄ β-lactam carboxylate recognition residues. Enzyme kinetics were determined for a panel of nine penicillin and cephalosporin analog boronates synthesized as active site probes of the PDC-3 enzyme and the Arg349Ala variant. Our examination of the PDC-3 active site revealed that more than one residue could serve to interact with the C₃/C₄ carboxylate of the β-lactam. This functional versatility has implications for novel drug design, protein evolution, and resistance profile of this enzyme. Copyright © 2011 The Protein Society.
The population history of endogenous retroviruses in mule deer (Odocoileus heminous)

USGS Publications Warehouse

Kamath, Pauline L.; Elleder, Daniel; Bao, Le; Cross, Paul C.; Powell, John H.; Poss, Mary

2013-01-01

Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5′ and 3′ long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations.
Gestures and Insight in Advanced Mathematical Thinking

ERIC Educational Resources Information Center

Yoon, Caroline; Thomas, Michael O. J.; Dreyfus, Tommy

2011-01-01

What role do gestures play in advanced mathematical thinking? We argue that the role of gestures goes beyond merely communicating thought and supporting understanding--in some cases, gestures can help generate new mathematical insights. Gestures feature prominently in a case study of two participants working on a sequence of calculus activities.…
Molecular Epidemiological Survey and Genetic Characterization of Anaplasma Species in Mongolian Livestock.

PubMed

Ochirkhuu, Nyamsuren; Konnai, Satoru; Odbileg, Raadan; Murata, Shiro; Ohashi, Kazuhiko

2017-08-01

Anaplasma species are obligate intracellular rickettsial pathogens that cause great economic loss to the animal industry. Few studies on Anaplasma infections in Mongolian livestock have been conducted. This study examined the prevalence of Anaplasma marginale, Anaplasma ovis, Anaplasma phagocytophilum, and Anaplasma bovis by polymerase chain reaction assay in 928 blood samples collected from native cattle and dairy cattle (Bos taurus), yaks (Bos grunniens), sheep (Ovis aries), and goats (Capra aegagrus hircus) in four provinces of Ulaanbaatar city in Mongolia. We genetically characterized positive samples through sequencing analysis based on the heat-shock protein groEL, major surface protein 4 (msp4), and 16S rRNA genes. Only A. ovis was detected in Mongolian livestock (cattle, yaks, sheep, and goats), with 413 animals (44.5%) positive for groEL and 308 animals (33.2%) positive for msp4 genes. In the phylogenetic tree, we separated A. ovis sequences into two distinct clusters based on the groEL gene. One cluster comprised sequences derived mainly from sheep and goats, which was similar to that in A. ovis isolates from other countries. The other divergent cluster comprised sequences derived from cattle and yaks and appeared to be newly branched from that in previously published single isolates in Mongolian cattle. In addition, the msp4 gene of A. ovis using same and different samples with groEL gene of the pathogen demonstrated that all sequences derived from all animal species, except for three sequences derived from cattle and yak, were clustered together, and were identical or similar to those in isolates from other countries. We used 16S rRNA gene sequences to investigate the genetically divergent A. ovis and identified high homology of 99.3-100%. However, the sequences derived from cattle did not match those derived from sheep and goats. The results of this study on the prevalence and molecular characterization of A. ovis in Mongolian livestock can facilitate the control of infectious diseases in livestock.
The Pathogenomic Sequence Analysis of B. cereus and B. Thuringiensis isolates closely related to Bacillus anthracis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, C S; Xie, G; Challacombe, J F

The sequencing and analysis of two close relatives of Bacillus anthracis are reported. AFLP analysis of over 300 isolates of B. cereus, B. thuringiensis and B. anthracis identified two isolates as being very closely related to B. anthracis. One, a B. cereus, BcE33L, was isolated from a zebra carcass in Nambia; the second, a B. thuringiensis, 97-27, was isolated from a necrotic human wound. The B. cereus appears to be the closest anthracis relative sequenced to date. A core genome of over 3,900 genes was compiled for the Bacillus cereus group, including B anthracis. Comparative analysis of these two genomesmore » with other members of the B. cereus group provides insight into the evolutionary relationships among these organisms. Evidence is presented that differential regulation modulates virulence, rather than simple acquisition of virulence factors. These genome sequences provide insight into the molecular mechanisms contributing to the host range and virulence of this group of organisms.« less
The Pathogenomic Sequence Analysis of B. cereus and B.thuringiensis Isolates Closely Related to Bacillus anthracis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, Cliff S.; Xie, Gary; Challacombe, Jean F.

The sequencing and analysis of two close relatives of Bacillus anthracis are reported. AFLP analysis of over 300 isolates of B.cereus, B. thuringiensis and B. anthracis identified two isolates as being very closely related to B. anthracis. One, a B. cereus, BcE33L, was isolated from a zebra carcass in Nambia; the second, a B. thuringiensis, 97-27, was isolated from a necrotic human wound. The B. cereus appears to be the closest anthracis relative sequenced to date. A core genome of over 3,900 genes was compiled for the Bacillus cereus group, including Banthracis. Comparative analysis of these two genomes with othermore » members of the B. cereus group provides insight into the evolutionary relationships among these organisms. Evidence is presented that differential regulation modulates virulence, rather than simple acquisition of virulence factors. These genome sequences provide insight into the molecular mechanisms contributing to the host range and virulence of this group of organisms.« less
Genetic diversity in Trypanosoma theileri from Sri Lankan cattle and water buffaloes.

PubMed

Yokoyama, Naoaki; Sivakumar, Thillaiampalam; Fukushi, Shintaro; Tattiyapong, Muncharee; Tuvshintulga, Bumduuren; Kothalawala, Hemal; Silva, Seekkuge Susil Priyantha; Igarashi, Ikuo; Inoue, Noboru

2015-01-30

Trypanosoma theileri is a hemoprotozoan parasite that infects various ruminant species. We investigated the epidemiology of this parasite among cattle and water buffalo populations bred in Sri Lanka, using a diagnostic PCR assay based on the cathepsin L-like protein (CATL) gene. Blood DNA samples sourced from cattle (n=316) and water buffaloes (n=320) bred in different geographical areas of Sri Lanka were PCR screened for T. theileri. Parasite DNA was detected in cattle and water buffaloes alike in all the sampling locations. The overall T. theileri-positive rate was higher in water buffaloes (15.9%) than in cattle (7.6%). Subsequently, PCR amplicons were sequenced and the partial CATL sequences were phylogenetically analyzed. The identity values for the CATL gene were 89.6-99.7% among the cattle-derived sequences, compared with values of 90.7-100% for the buffalo-derived sequences. However, the cattle-derived sequences shared 88.2-100% identity values with those from buffaloes. In the phylogenetic tree, the Sri Lankan CATL gene sequences fell into two major clades (TthI and TthII), both of which contain CATL sequences from several other countries. Although most of the CATL sequences from Sri Lankan cattle and buffaloes clustered independently, two buffalo-derived sequences were observed to be closely related to those of the Sri Lankan cattle. Furthermore, a Sri Lankan buffalo sequence clustered with CATL gene sequences from Brazilian buffalo and Thai cattle. In addition to reporting the first PCR-based survey of T. theileri among Sri Lankan-bred cattle and water buffaloes, the present study found that some of the CATL gene fragments sourced from water buffaloes shared similarity with those determined from cattle in this country. Copyright © 2014 Elsevier B.V. All rights reserved.
A novel tomato mutant, Solanum lycopersicum elongated fruit1 (Slelf1), exhibits an elongated fruit shape caused by increased cell layers in the proximal region of the ovary.

PubMed

Chusreeaeom, Katarut; Ariizumi, Tohru; Asamizu, Erika; Okabe, Yoshihiro; Shirasawa, Kenta; Ezura, Hiroshi

2014-06-01

Genes controlling fruit morphology offer important insights into patterns and mechanisms determining organ shape and size. In cultivated tomato (Solanum lycopersicum L.), a variety of fruit shapes are displayed, including round-, bell pepper-, pear-, and elongate-shaped forms. In this study, we characterized a tomato mutant possessing elongated fruit morphology by histologically analyzing its fruit structure and genetically analyzing and mapping the genetic locus. The mutant line, Solanum lycopersicum elongated fruit 1 (Slelf1), was selected in a previous study from an ethylmethane sulfonate-mutagenized population generated in the background of Micro-Tom, a dwarf and rapid-growth variety. Histological analysis of the Slelf1 mutant revealed dramatically increased elongation of ovary and fruit. Until 6 days before flowering, ovaries were round and they began to elongate afterward. We also determined pericarp thickness and the number of cell layers in three designated fruit regions. We found that mesocarp thickness, as well as the number of cell layers, was increased in the proximal region of immature green fruits, making this the key sector of fruit elongation. Using 262 F2 individuals derived from a cross between Slelf1 and the cultivar Ailsa Craig, we constructed a genetic map, simple sequence repeat (SSR), cleaved amplified polymorphism sequence (CAPS), and derived CAPS (dCAPS) markers and mapped to the 12 tomato chromosomes. Genetic mapping placed the candidate gene locus within a 0.2 Mbp interval on the long arm of chromosome 8 and was likely different from previously known loci affecting fruit shape.
Genome Sequences of Multidrug-Resistant Salmonella enterica subsp. enterica Serovar Infantis Strains from Broiler Chicks in Hungary

PubMed Central

Wilk, Tímea; Szabó, Móni; Szmolka, Ama; Kiss, János; Barta, Endre; Nagy, Tibor

2016-01-01

Three strains of Salmonella enterica serovar Infantis isolated from healthy broiler chickens from 2012 to 2013 have been sequenced. Comparison of these and previously published S. Infantis genome sequences of broiler origin in 1996 and 2004 will provide new insight into the genome evolution and recent spread of S. Infantis in poultry. PMID:27979950
Complete genome sequence of Bifidobacterium breve CECT 7263, a strain isolated from human milk.

PubMed

Jiménez, Esther; Villar-Tajadura, M Antonia; Marín, María; Fontecha, Javier; Requena, Teresa; Arroyo, Rebeca; Fernández, Leónides; Rodríguez, Juan M

2012-07-01

Bifidobacterium breve is an actinobacterium frequently isolated from colonic microbiota of breastfeeding babies. Here, we report the complete and annotated genome sequence of a B. breve strain isolated from human milk, B. breve CECT 7263. The genome sequence will provide new insights into the biology of this potential probiotic organism and will allow the characterization of genes related to beneficial properties.
Complete Genome Sequences of Two Bacillus pumilus Strains from Cuatrociénegas, Coahuila, Mexico

PubMed Central

Alcaraz, Luis D.; Aguilar-Salinas, Bernardo; Islas, Africa

2018-01-01

ABSTRACT We assembled the complete genome sequences of Bacillus pumilus strains 145 and 150a from Cuatrociénegas, Mexico. We detected genes codifying for proteins potentially involved in antagonism (bacteriocins) and defense mechanisms (abortive infection bacteriophage proteins and 4-azaleucine resistance). Both strains harbored prophage sequences. Our results provide insights into understanding the establishment of microbial interactions. PMID:29700165

Genome Sequences of Shewanella baltica and Shewanella morhuae Strains Isolated from the Gastrointestinal Tract of Freshwater Fish.

PubMed

Castillo, Daniel; Gram, Lone; Dailey, Frank E

2018-06-21

We present here the genome sequences of Shewanella baltica strain CW2 and Shewanella morhuae strain CW7, isolated from the gastrointestinal tract of Salvelinus namaycush (lean lake trout) and Coregonus clupeaformis (whitefish), respectively. These genome sequences provide insights into the niche adaptation of these specific species in freshwater systems. Copyright © 2018 Castillo et al.
Deciphering the Cryptic Genome: Genome-wide Analyses of the Rice Pathogen Fusarium fujikuroi Reveal Complex Regulation of Secondary Metabolism and Novel Metabolites

PubMed Central

Studt, Lena; Niehaus, Eva-Maria; Espino, Jose J.; Huß, Kathleen; Michielse, Caroline B.; Albermann, Sabine; Wagner, Dominik; Bergner, Sonja V.; Connolly, Lanelle R.; Fischer, Andreas; Reuter, Gunter; Kleigrewe, Karin; Bald, Till; Wingfield, Brenda D.; Ophir, Ron; Freeman, Stanley; Hippler, Michael; Smith, Kristina M.; Brown, Daren W.; Proctor, Robert H.; Münsterkötter, Martin; Freitag, Michael; Humpf, Hans-Ulrich; Güldener, Ulrich; Tudzynski, Bettina

2013-01-01

The fungus Fusarium fujikuroi causes “bakanae” disease of rice due to its ability to produce gibberellins (GAs), but it is also known for producing harmful mycotoxins. However, the genetic capacity for the whole arsenal of natural compounds and their role in the fungus' interaction with rice remained unknown. Here, we present a high-quality genome sequence of F. fujikuroi that was assembled into 12 scaffolds corresponding to the 12 chromosomes described for the fungus. We used the genome sequence along with ChIP-seq, transcriptome, proteome, and HPLC-FTMS-based metabolome analyses to identify the potential secondary metabolite biosynthetic gene clusters and to examine their regulation in response to nitrogen availability and plant signals. The results indicate that expression of most but not all gene clusters correlate with proteome and ChIP-seq data. Comparison of the F. fujikuroi genome to those of six other fusaria revealed that only a small number of gene clusters are conserved among these species, thus providing new insights into the divergence of secondary metabolism in the genus Fusarium. Noteworthy, GA biosynthetic genes are present in some related species, but GA biosynthesis is limited to F. fujikuroi, suggesting that this provides a selective advantage during infection of the preferred host plant rice. Among the genome sequences analyzed, one cluster that includes a polyketide synthase gene (PKS19) and another that includes a non-ribosomal peptide synthetase gene (NRPS31) are unique to F. fujikuroi. The metabolites derived from these clusters were identified by HPLC-FTMS-based analyses of engineered F. fujikuroi strains overexpressing cluster genes. In planta expression studies suggest a specific role for the PKS19-derived product during rice infection. Thus, our results indicate that combined comparative genomics and genome-wide experimental analyses identified novel genes and secondary metabolites that contribute to the evolutionary success of F. fujikuroi as a rice pathogen. PMID:23825955
Whole Genome Sequence of Two Wild-Derived Mus musculus domesticus Inbred Strains, LEWES/EiJ and ZALENDE/EiJ, with Different Diploid Numbers

PubMed Central

Morgan, Andrew P.; Didion, John P.; Doran, Anthony G.; Holt, James M.; McMillan, Leonard; Keane, Thomas M.; de Villena, Fernando Pardo-Manuel

2016-01-01

Wild-derived mouse inbred strains are becoming increasingly popular for complex traits analysis, evolutionary studies, and systems genetics. Here, we report the whole-genome sequencing of two wild-derived mouse inbred strains, LEWES/EiJ and ZALENDE/EiJ, of Mus musculus domesticus origin. These two inbred strains were selected based on their geographic origin, karyotype, and use in ongoing research. We generated 14× and 18× coverage sequence, respectively, and discovered over 1.1 million novel variants, most of which are private to one of these strains. This report expands the number of wild-derived inbred genomes in the Mus genus from six to eight. The sequence variation can be accessed via an online query tool; variant calls (VCF format) and alignments (BAM format) are available for download from a dedicated ftp site. Finally, the sequencing data have also been stored in a lossless, compressed, and indexed format using the multi-string Burrows-Wheeler transform. All data can be used without restriction. PMID:27765810
Brassica ASTRA: an integrated database for Brassica genomic research.

PubMed

Love, Christopher G; Robinson, Andrew J; Lim, Geraldine A C; Hopkins, Clare J; Batley, Jacqueline; Barker, Gary; Spangenberg, German C; Edwards, David

2005-01-01

Brassica ASTRA is a public database for genomic information on Brassica species. The database incorporates expressed sequences with Swiss-Prot and GenBank comparative sequence annotation as well as secondary Gene Ontology (GO) annotation derived from the comparison with Arabidopsis TAIR GO annotations. Simple sequence repeat molecular markers are identified within resident sequences and mapped onto the closely related Arabidopsis genome sequence. Bacterial artificial chromosome (BAC) end sequences derived from the Multinational Brassica Genome Project are also mapped onto the Arabidopsis genome sequence enabling users to identify candidate Brassica BACs corresponding to syntenic regions of Arabidopsis. This information is maintained in a MySQL database with a web interface providing the primary means of interrogation. The database is accessible at http://hornbill.cspp.latrobe.edu.au.
Spin models inferred from patient-derived viral sequence data faithfully describe HIV fitness landscapes

NASA Astrophysics Data System (ADS)

Shekhar, Karthik; Ruberman, Claire F.; Ferguson, Andrew L.; Barton, John P.; Kardar, Mehran; Chakraborty, Arup K.

2013-12-01

Mutational escape from vaccine-induced immune responses has thwarted the development of a successful vaccine against AIDS, whose causative agent is HIV, a highly mutable virus. Knowing the virus' fitness as a function of its proteomic sequence can enable rational design of potent vaccines, as this information can focus vaccine-induced immune responses to target mutational vulnerabilities of the virus. Spin models have been proposed as a means to infer intrinsic fitness landscapes of HIV proteins from patient-derived viral protein sequences. These sequences are the product of nonequilibrium viral evolution driven by patient-specific immune responses and are subject to phylogenetic constraints. How can such sequence data allow inference of intrinsic fitness landscapes? We combined computer simulations and variational theory á la Feynman to show that, in most circumstances, spin models inferred from patient-derived viral sequences reflect the correct rank order of the fitness of mutant viral strains. Our findings are relevant for diverse viruses.
Evaluation of ribosomal RNA removal protocols for Salmonella RNA-Seq projects

USDA-ARS?s Scientific Manuscript database

Next generation sequencing is a powerful technology and its application to sequencing entire RNA populations of food-borne pathogens will provide valuable insights. A problem unique to prokaryotic RNA-Seq is the massive abundance of ribosomal RNA. Unlike eukaryotic messenger RNA (mRNA), bacterial ...
Transcriptome response to elevated CO2, water deficit, and thermal stress in peanut

USDA-ARS?s Scientific Manuscript database

Previously, our laboratories have performed gene expression studies using EST sequencing and spotted microarrays to investigate tissue-specific gene expression and response to abiotic stress. While these studies have provided valuable insight into these processes, they are constrained by sequencer t...
Next generation sequencing applications for microRNA biomarker discovery in toxicological studies

EPA Science Inventory

Next Generation Sequencing (NGS) technology will be reviewed for its base pair resolution, wide dynamic range, and insights into the genome and transcriptome, with special focus upon the biomarker potential of microRNAs (miRNAs). The first part of this presentation reviews commo...
First insight into the viral community of the cnidarian model metaorganism Aiptasia using RNA-Seq data

PubMed Central

Brüwer, Jan D.

2018-01-01

Current research posits that all multicellular organisms live in symbioses with associated microorganisms and form so-called metaorganisms or holobionts. Cnidarian metaorganisms are of specific interest given that stony corals provide the foundation of the globally threatened coral reef ecosystems. To gain first insight into viruses associated with the coral model system Aiptasia (sensu Exaiptasia pallida), we analyzed an existing RNA-Seq dataset of aposymbiotic, partially populated, and fully symbiotic Aiptasia CC7 anemones with Symbiodinium. Our approach included the selective removal of anemone host and algal endosymbiont sequences and subsequent microbial sequence annotation. Of a total of 297 million raw sequence reads, 8.6 million (∼3%) remained after host and endosymbiont sequence removal. Of these, 3,293 sequences could be assigned as of viral origin. Taxonomic annotation of these sequences suggests that Aiptasia is associated with a diverse viral community, comprising 116 viral taxa covering 40 families. The viral assemblage was dominated by viruses from the families Herpesviridae (12.00%), Partitiviridae (9.93%), and Picornaviridae (9.87%). Despite an overall stable viral assemblage, we found that some viral taxa exhibited significant changes in their relative abundance when Aiptasia engaged in a symbiotic relationship with Symbiodinium. Elucidation of viral taxa consistently present across all conditions revealed a core virome of 15 viral taxa from 11 viral families, encompassing many viruses previously reported as members of coral viromes. Despite the non-random selection of viral genetic material due to the nature of the sequencing data analyzed, our study provides a first insight into the viral community associated with Aiptasia. Similarities of the Aiptasia viral community with those of corals corroborate the application of Aiptasia as a model system to study coral holobionts. Further, the change in abundance of certain viral taxa across different symbiotic states suggests a role of viruses in the algal endosymbiosis, but the functional significance of this remains to be determined. PMID:29507840
Sequence Diversity, Intersubgroup Relationships, and Origins of the Mouse Leukemia Gammaretroviruses of Laboratory and Wild Mice.

PubMed

Bamunusinghe, Devinka; Naghashfar, Zohreh; Buckler-White, Alicia; Plishka, Ronald; Baliji, Surendranath; Liu, Qingping; Kassner, Joshua; Oler, Andrew J; Hartley, Janet; Kozak, Christine A

2016-04-01

Mouse leukemia viruses (MLVs) are found in the common inbred strains of laboratory mice and in the house mouse subspecies ofMus musculus Receptor usage and envelope (env) sequence variation define three MLV host range subgroups in laboratory mice: ecotropic, polytropic, and xenotropic MLVs (E-, P-, and X-MLVs, respectively). These exogenous MLVs derive from endogenous retroviruses (ERVs) that were acquired by the wild mouse progenitors of laboratory mice about 1 million years ago. We analyzed the genomes of seven MLVs isolated from Eurasian and American wild mice and three previously sequenced MLVs to describe their relationships and identify their possible ERV progenitors. The phylogenetic tree based on the receptor-determining regions ofenvproduced expected host range clusters, but these clusters are not maintained in trees generated from other virus regions. Colinear alignments of the viral genomes identified segmental homologies to ERVs of different host range subgroups. Six MLVs show close relationships to a small xenotropic ERV subgroup largely confined to the inbred mouse Y chromosome.envvariations define three E-MLV subtypes, one of which carries duplications of various sizes, sequences, and locations in the proline-rich region ofenv Outside theenvregion, all E-MLVs are related to different nonecotropic MLVs. These results document the diversity in gammaretroviruses isolated from globally distributedMussubspecies, provide insight into their origins and relationships, and indicate that recombination has had an important role in the evolution of these mutagenic and pathogenic agents. Laboratory mice carry mouse leukemia viruses (MLVs) of three host range groups which were acquired from their wild mouse progenitors. We sequenced the complete genomes of seven infectious MLVs isolated from geographically separated Eurasian and American wild mice and compared them with endogenous germ line retroviruses (ERVs) acquired early in house mouse evolution. We did this because the laboratory mouse viruses derive directly from specific ERVs or arise by recombination between different ERVs. The six distinctively different wild mouse viruses appear to be recombinants, often involving different host range subgroups, and most are related to a distinctive, largely Y-chromosome-linked MLV ERV subtype. MLVs with ecotropic host ranges show the greatest variability with extensive inter- and intrasubtype envelope differences and with homologies to other host range subgroups outside the envelope. The sequence diversity among these wild mouse isolates helps define their relationships and origins and emphasizes the importance of recombination in their evolution. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes.

PubMed

Gao, Lei; Yi, Xuan; Yang, Yong-Xia; Su, Ying-Juan; Wang, Ting

2009-06-11

Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The Alsophila cp genome is very similar to that of the polypod fern Adiantum in terms of gene content, gene order and GC content. However, there exist some striking differences between them: the trnR-UCG gene represents a putative molecular apomorphy of tree ferns; and the repeats observed at one inversion endpoint may be a vestige of some unknown rearrangement(s). This work provided fresh insights into the fern cp genome evolution as well as useful data for future phylogenetic studies.
The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

PubMed

Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

2012-01-01

Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences

PubMed Central

Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E.; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

2012-01-01

Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated. PMID:23071448
Geographic Patterns of Genetic Variation in a Broadly Distributed Marine Vertebrate: New Insights into Loggerhead Turtle Stock Structure from Expanded Mitochondrial DNA Sequences

PubMed Central

Shamblin, Brian M.; Bolten, Alan B.; Abreu-Grobois, F. Alberto; Bjorndal, Karen A.; Cardona, Luis; Carreras, Carlos; Clusa, Marcel; Monzón-Argüello, Catalina; Nairn, Campbell J.; Nielsen, Janne T.; Nel, Ronel; Soares, Luciano S.; Stewart, Kelly R.; Vilaça, Sibelle T.; Türkozan, Oguz; Yilmaz, Can; Dutton, Peter H.

2014-01-01

Previous genetic studies have demonstrated that natal homing shapes the stock structure of marine turtle nesting populations. However, widespread sharing of common haplotypes based on short segments of the mitochondrial control region often limits resolution of the demographic connectivity of populations. Recent studies employing longer control region sequences to resolve haplotype sharing have focused on regional assessments of genetic structure and phylogeography. Here we synthesize available control region sequences for loggerhead turtles from the Mediterranean Sea, Atlantic, and western Indian Ocean basins. These data represent six of the nine globally significant regional management units (RMUs) for the species and include novel sequence data from Brazil, Cape Verde, South Africa and Oman. Genetic tests of differentiation among 42 rookeries represented by short sequences (380 bp haplotypes from 3,486 samples) and 40 rookeries represented by long sequences (∼800 bp haplotypes from 3,434 samples) supported the distinction of the six RMUs analyzed as well as recognition of at least 18 demographically independent management units (MUs) with respect to female natal homing. A total of 59 haplotypes were resolved. These haplotypes belonged to two highly divergent global lineages, with haplogroup I represented primarily by CC-A1, CC-A4, and CC-A11 variants and haplogroup II represented by CC-A2 and derived variants. Geographic distribution patterns of haplogroup II haplotypes and the nested position of CC-A11.6 from Oman among the Atlantic haplotypes invoke recent colonization of the Indian Ocean from the Atlantic for both global lineages. The haplotypes we confirmed for western Indian Ocean RMUs allow reinterpretation of previous mixed stock analysis and further suggest that contemporary migratory connectivity between the Indian and Atlantic Oceans occurs on a broader scale than previously hypothesized. This study represents a valuable model for conducting comprehensive international cooperative data management and research in marine ecology. PMID:24465810
Sampling Daphnia's expressed genes: preservation, expansion and invention of crustacean genes with reference to insect genomes

PubMed Central

Colbourne, John K; Eads, Brian D; Shaw, Joseph; Bohuski, Elizabeth; Bauer, Darren J; Andrews, Justen

2007-01-01

Background Functional and comparative studies of insect genomes have shed light on the complement of genes, which in part, account for shared morphologies, developmental programs and life-histories. Contrasting the gene inventories of insects to those of the nematodes provides insight into the genomic changes responsible for their diversification. However, nematodes have weak relationships to insects, as each belongs to separate animal phyla. A better outgroup to distinguish lineage specific novelties would include other members of Arthropoda. For example, crustaceans are close allies to the insects (together forming Pancrustacea) and their fascinating aquatic lifestyle provides an important comparison for understanding the genetic basis of adaptations to life on land versus life in water. Results This study reports on the first characterization of cDNA libraries and sequences for the model crustacean Daphnia pulex. We analyzed 1,546 ESTs of which 1,414 represent approximately 787 nuclear genes, by measuring their sequence similarities with insect and nematode proteomes. The provisional annotation of genes is supported by expression data from microarray studies described in companion papers. Loci expected to be shared between crustaceans and insects because of their mutual biological features are identified, including genes for reproduction, regulation and cellular processes. We identify genes that are likely derived within Pancrustacea or lost within the nematodes. Moreover, lineage specific gene family expansions are identified, which suggest certain biological demands associated with their ecological setting. In particular, up to seven distinct ferritin loci are found in Daphnia compared to three in most insects. Finally, a substantial fraction of the sampled gene transcripts shares no sequence similarity with those from other arthropods. Genes functioning during development and reproduction are comparatively well conserved between crustaceans and insects. By contrast, genes that were responsive to environmental conditions (metal stress) and not sex-biased included the greatest proportion of genes with no matches to insect proteomes. Conclusion This study along with associated microarray experiments are the initial steps in a coordinated effort by the Daphnia Genomics Consortium to build the necessary genomic platform needed to discover genes that account for the phenotypic diversity within the genus and to gain new insights into crustacean biology. This effort will soon include the first crustacean genome sequence. PMID:17612412
Generating Models of Surgical Procedures using UMLS Concepts and Multiple Sequence Alignment

PubMed Central

Meng, Frank; D’Avolio, Leonard W.; Chen, Andrew A.; Taira, Ricky K.; Kangarloo, Hooshang

2005-01-01

Surgical procedures can be viewed as a process composed of a sequence of steps performed on, by, or with the patient’s anatomy. This sequence is typically the pattern followed by surgeons when generating surgical report narratives for documenting surgical procedures. This paper describes a methodology for semi-automatically deriving a model of conducted surgeries, utilizing a sequence of derived Unified Medical Language System (UMLS) concepts for representing surgical procedures. A multiple sequence alignment was computed from a collection of such sequences and was used for generating the model. These models have the potential of being useful in a variety of informatics applications such as information retrieval and automatic document generation. PMID:16779094
The Evolution of Ebola virus: Insights from the 2013–2016 Epidemic

PubMed Central

Holmes, Edward C.; Dudas, Gytis; Rambaut, Andrew; Andersen, Kristian G.

2017-01-01

Preface The 2013–2016 epidemic of Ebola virus disease in West Africa was of unprecedented magnitude and changed our perspective on this lethal but sporadically emerging virus. This outbreak also marked the beginning of large-scale real-time molecular epidemiology. Herein, we show how evolutionary analyses of Ebola virus genome sequences provided key insights into virus origins, evolution, and spread during the epidemic. We provide basic scientists, epidemiologists, medical practitioners, and other outbreak responders with an enhanced understanding of the utility and limitations of pathogen genomic sequencing. This will be crucially important in our attempts to track and control future infectious disease outbreaks. PMID:27734858
Maximizing ecological and evolutionary insight in bisulfite sequencing data sets

PubMed Central

Lea, Amanda J.; Vilgalys, Tauras P.; Durst, Paul A.P.; Tung, Jenny

2017-01-01

Preface Genome-scale bisulfite sequencing approaches have opened the door to ecological and evolutionary studies of DNA methylation in many organisms. These approaches can be powerful. However, they introduce new methodological and statistical considerations, some of which are particularly relevant to non-model systems. Here, we highlight how these considerations influence a study’s power to link methylation variation with a predictor variable of interest. Relative to current practice, we argue that sample sizes will need to increase to provide robust insights. We also provide recommendations for overcoming common challenges and an R Shiny app to aid in study design. PMID:29046582
A comprehensive molecular cytogenetic analysis of chromosome rearrangements in gibbons

PubMed Central

Capozzi, Oronzo; Carbone, Lucia; Stanyon, Roscoe R.; Marra, Annamaria; Yang, Fengtang; Whelan, Christopher W.; de Jong, Pieter J.; Rocchi, Mariano; Archidiacono, Nicoletta

2012-01-01

Chromosome rearrangements in small apes are up to 20 times more frequent than in most mammals. Because of their complexity, the full extent of chromosome evolution in these hominoids is not yet fully documented. However, previous work with array painting, BAC-FISH, and selective sequencing in two of the four karyomorphs has shown that high-resolution methods can precisely define chromosome breakpoints and map the complex flow of evolutionary chromosome rearrangements. Here we use these tools to precisely define the rearrangements that have occurred in the remaining two karyomorphs, genera Symphalangus (2n = 50) and Hoolock (2n = 38). This research provides the most comprehensive insight into the evolutionary origins of chromosome rearrangements involved in transforming small apes genome. Bioinformatics analyses of the human–gibbon synteny breakpoints revealed association with transposable elements and segmental duplications, providing some insight into the mechanisms that might have promoted rearrangements in small apes. In the near future, the comparison of gibbon genome sequences will provide novel insights to test hypotheses concerning the mechanisms of chromosome evolution. The precise definition of synteny block boundaries and orientation, chromosomal fusions, and centromere repositioning events presented here will facilitate genome sequence assembly for these close relatives of humans. PMID:22892276
Merozoite surface protein-1 genetic diversity in Plasmodium malariae and Plasmodium brasilianum from Brazil.

PubMed

Guimarães, Lilian O; Wunderlich, Gerhard; Alves, João M P; Bueno, Marina G; Röhe, Fabio; Catão-Dias, José L; Neves, Amanda; Malafronte, Rosely S; Curado, Izilda; Domingues, Wilson; Kirchgatter, Karin

2015-11-16

The merozoite surface protein 1 (MSP1) gene encodes the major surface antigen of invasive forms of the Plasmodium erythrocytic stages and is considered a candidate vaccine antigen against malaria. Due to its polymorphisms, MSP1 is also useful for strain discrimination and consists of a good genetic marker. Sequence diversity in MSP1 has been analyzed in field isolates of three human parasites: P. falciparum, P. vivax, and P. ovale. However, the extent of variation in another human parasite, P. malariae, remains unknown. This parasite shows widespread, uneven distribution in tropical and subtropical regions throughout South America, Asia, and Africa. Interestingly, it is genetically indistinguishable from P. brasilianum, a parasite known to infect New World monkeys in Central and South America. Specific fragments (1 to 5) covering 60 % of the MSP1 gene (mainly the putatively polymorphic regions), were amplified by PCR in isolates of P. malariae and P. brasilianum from different geographic origin and hosts. Sequencing of the PCR-amplified products or cloned PCR fragments was performed and the sequences were used to construct a phylogenetic tree by the maximum likelihood method. Data were computed to give insights into the evolutionary and phylogenetic relationships of these parasites. Except for fragment 4, sequences from all other fragments consisted of unpublished sequences. The most polymorphic gene region was fragment 2, and in samples where this region lacks polymorphism, all other regions are also identical. The low variability of the P. malariae msp1 sequences of these isolates and the identification of the same haplotype in those collected many years apart at different locations is compatible with a low transmission rate. We also found greater diversity among P. brasilianum isolates compared with P. malariae ones. Lastly, the sequences were segregated according to their geographic origins and hosts, showing a strong genetic and geographic structure. Our data show that there is a low level of sequence diversity and a possible absence of allelic dimorphism of MSP1 in these parasites as opposed to other Plasmodium species. P. brasilianum strains apparently show greater divergence in comparison to P. malariae, thus P. malariae could derive from P. brasilianum, as it has been proposed.

Enabling Language Help: Epistemic Maneuvering in Extended Information Request Sequences between EFL Teachers

ERIC Educational Resources Information Center

Leyland, Christopher

2014-01-01

Recent years have seen an upsurge in interest in epistemics/knowledge in interaction (e.g., Heritage, 2012a, 2012b; Stivers, Mondada & Steensig, 2011). Insights from such research are now being used by Second Language Acquisition (SLA) researchers yielding valuable insights into teacher-student interaction (e.g., Sert, 2013) and…
Cluster K Mycobacteriophages: Insights into the Evolutionary Origins of Mycobacteriophage TM4

PubMed Central

Pope, Welkin H.; Ferreira, Christina M.; Jacobs-Sera, Deborah; Benjamin, Robert C.; Davis, Ariangela J.; DeJong, Randall J.; Elgin, Sarah C. R.; Guilfoile, Forrest R.; Forsyth, Mark H.; Harris, Alexander D.; Harvey, Samuel E.; Hughes, Lee E.; Hynes, Peter M.; Jackson, Arrykka S.; Jalal, Marilyn D.; MacMurray, Elizabeth A.; Manley, Coreen M.; McDonough, Molly J.; Mosier, Jordan L.; Osterbann, Larissa J.; Rabinowitz, Hannah S.; Rhyan, Corwin N.; Russell, Daniel A.; Saha, Margaret S.; Shaffer, Christopher D.; Simon, Stephanie E.; Sims, Erika F.; Tovar, Isabel G.; Weisser, Emilie G.; Wertz, John T.; Weston-Hafer, Kathleen A.; Williamson, Kurt E.; Zhang, Bo; Cresawn, Steven G.; Jain, Paras; Piuri, Mariana; Jacobs, William R.; Hendrix, Roger W.; Hatfull, Graham F.

2011-01-01

Five newly isolated mycobacteriophages –Angelica, CrimD, Adephagia, Anaya, and Pixie – have similar genomic architectures to mycobacteriophage TM4, a previously characterized phage that is widely used in mycobacterial genetics. The nucleotide sequence similarities warrant grouping these into Cluster K, with subdivision into three subclusters: K1, K2, and K3. Although the overall genome architectures of these phages are similar, TM4 appears to have lost at least two segments of its genome, a central region containing the integration apparatus, and a segment at the right end. This suggests that TM4 is a recent derivative of a temperate parent, resolving a long-standing conundrum about its biology, in that it was reportedly recovered from a lysogenic strain of Mycobacterium avium, but it is not capable of forming lysogens in any mycobacterial host. Like TM4, all of the Cluster K phages infect both fast- and slow-growing mycobacteria, and all of them – with the exception of TM4 – form stable lysogens in both Mycobacterium smegmatis and Mycobacterium tuberculosis; immunity assays show that all five of these phages share the same immune specificity. TM4 infects these lysogens suggesting that it was either derived from a heteroimmune temperate parent or that it has acquired a virulent phenotype. We have also characterized a widely-used conditionally replicating derivative of TM4 and identified mutations conferring the temperature-sensitive phenotype. All of the Cluster K phages contain a series of well conserved 13 bp repeats associated with the translation initiation sites of a subset of the genes; approximately one half of these contain an additional sequence feature composed of imperfectly conserved 17 bp inverted repeats separated by a variable spacer. The K1 phages integrate into the host tmRNA and the Cluster K phages represent potential new tools for the genetics of M. tuberculosis and related species. PMID:22053209
Draft Genome Sequence of Bifidobacterium animalis subsp. lactis Strain CECT 8145, Able To Improve Metabolic Syndrome In Vivo.

PubMed

Chenoll, E; Codoñer, F M; Silva, A; Martinez-Blanch, J F; Martorell, P; Ramón, D; Genovés, S

2014-03-27

Bifidobacterium animalis subsp. lactis strain CECT 8145 is able to reduce body fat content and improve metabolic syndrome biomarkers. Here, we report the draft genome sequence of this strain, which may provide insights into its safety status and functional role.
The bovine lactation genome: Insights into the evolution of mammalian milk

USDA-ARS?s Scientific Manuscript database

The newly assembled Bos Taurus genome sequence enables the linkage of bovine milk and lactation data with other mammalian genomes. Using publicly available milk proteome data and mammary expressed sequence tags, 197 milk protein genes and over 6,000 mammary genes were identified in the bovine genome...
A new chicken genome assembly provides insight into avian genome structure

USDA-ARS?s Scientific Manuscript database

The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3) built from combined long single molecule sequencing t...
Genome Sequence of Sphingomonas sp. Strain PAMC 26605, Isolated from Arctic Lichen (Ochrolechia sp.)

PubMed Central

Shin, Seung Chul; Ahn, Do Hwan; Lee, Jong Kyu; Kim, Su Jin; Hong, Soon Gyu; Kim, Eun Hye

2012-01-01

The endosymbiotic bacterium Sphingomonas sp. strain PAMC 26605 was isolated from Arctic lichens (Ochrolechia sp.) on the Svalbard Islands. Here we report the draft genome sequence of this strain, which could provide further insights into the symbiotic mechanism of lichens in extreme environments. PMID:22374946
Complete Genome Sequence of Bifidobacterium breve CECT 7263, a Strain Isolated from Human Milk

PubMed Central

Jiménez, Esther; Villar-Tajadura, M. Antonia; Marín, María; Fontecha, Javier; Requena, Teresa; Arroyo, Rebeca; Fernández, Leónides

2012-01-01

Bifidobacterium breve is an actinobacterium frequently isolated from colonic microbiota of breastfeeding babies. Here, we report the complete and annotated genome sequence of a B. breve strain isolated from human milk, B. breve CECT 7263. The genome sequence will provide new insights into the biology of this potential probiotic organism and will allow the characterization of genes related to beneficial properties. PMID:22740680
Complete Genome Sequences of Two Bacillus pumilus Strains from Cuatrociénegas, Coahuila, Mexico.

PubMed

Zarza, Eugenia; Alcaraz, Luis D; Aguilar-Salinas, Bernardo; Islas, Africa; Olmedo-Álvarez, Gabriela

2018-04-26

We assembled the complete genome sequences of Bacillus pumilus strains 145 and 150a from Cuatrociénegas, Mexico. We detected genes codifying for proteins potentially involved in antagonism (bacteriocins) and defense mechanisms (abortive infection bacteriophage proteins and 4-azaleucine resistance). Both strains harbored prophage sequences. Our results provide insights into understanding the establishment of microbial interactions. Copyright © 2018 Zarza et al.
A Global Comparison of the Human and T. brucei Degradomes Gives Insights about Possible Parasite Drug Targets

PubMed Central

Mashiyama, Susan T.; Koupparis, Kyriacos; Caffrey, Conor R.; McKerrow, James H.; Babbitt, Patricia C.

2012-01-01

We performed a genome-level computational study of sequence and structure similarity, the latter using crystal structures and models, of the proteases of Homo sapiens and the human parasite Trypanosoma brucei. Using sequence and structure similarity networks to summarize the results, we constructed global views that show visually the relative abundance and variety of proteases in the degradome landscapes of these two species, and provide insights into evolutionary relationships between proteases. The results also indicate how broadly these sequence sets are covered by three-dimensional structures. These views facilitate cross-species comparisons and offer clues for drug design from knowledge about the sequences and structures of potential drug targets and their homologs. Two protease groups (“M32” and “C51”) that are very different in sequence from human proteases are examined in structural detail, illustrating the application of this global approach in mining new pathogen genomes for potential drug targets. Based on our analyses, a human ACE2 inhibitor was selected for experimental testing on one of these parasite proteases, TbM32, and was shown to inhibit it. These sequence and structure data, along with interactive versions of the protein similarity networks generated in this study, are available at http://babbittlab.ucsf.edu/resources.html. PMID:23236535
Extraction of Modal Parameters from Spacecraft Flight Data

NASA Technical Reports Server (NTRS)

James, George H.; Cao, Timothy T.; Fogt, Vincent A.; Wilson, Robert L.; Bartkowicz, Theodore J.

2010-01-01

The modeled response of spacecraft systems must be validated using flight data as ground tests cannot adequately represent the flight. Tools from the field of operational modal analysis would typically be brought to bear on such structures. However, spacecraft systems have several complicated issues: 1. High amplitudes of loads; 2. Compressive loads on the vehicle in flight; 3. Lack of generous time-synchronized flight data; 4. Changing properties during the flight; and 5. Major vehicle changes due to staging. A particularly vexing parameter to extract is modal damping. Damping estimation has become a more critical issue as new mass-driven vehicle designs seek to use the highest damping value possible. The paper will focus on recent efforts to utilize spacecraft flight data to extract system parameters, with a special interest on modal damping. This work utilizes the analysis of correlation functions derived from a sliding window technique applied to the time record. Four different case studies are reported in the sequence that drove the authors understanding. The insights derived from these four exercises are preliminary conclusions for the general state-of-the-art, but may be of specific utility to similar problems approached with similar tools.
Steady-State Modeling of Modular Multilevel Converter Under Unbalanced Grid Conditions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shi, Xiaojie M.; Wang, Zhiqiang; Liu, Bo

This paper presents a steady-state model of MMC for the second-order phase voltage ripple prediction under unbalanced conditions, taking the impact of negative-sequence current control into account. From the steady-state model, a circular relationship is found among current and voltage quantities, which can be used to evaluate the magnitudes and initial phase angles of different circulating current components. Moreover, in order to calculate the circulating current in a point-to-point MMC-based HVdc system under unbalanced grid conditions, the derivation of equivalent dc impedance of an MMC is discussed as well. According to the dc impedance model, an MMC inverter can bemore » represented as a series connected R-L-C branch, with its equivalent resistance and capacitance directly related to the circulating current control parameters. Experimental results from a scaled-down three-phase MMC system under an emulated single-line-to-ground fault are provided to support the theoretical analysis and derived model. In conclusion, this new models provides an insight into the impact of different control schemes on the fault characteristics and improves the understanding of the operation of MMC under unbalanced conditions.« less
Steady-State Modeling of Modular Multilevel Converter Under Unbalanced Grid Conditions

DOE PAGES

Shi, Xiaojie M.; Wang, Zhiqiang; Liu, Bo; ...

2016-11-16

This paper presents a steady-state model of MMC for the second-order phase voltage ripple prediction under unbalanced conditions, taking the impact of negative-sequence current control into account. From the steady-state model, a circular relationship is found among current and voltage quantities, which can be used to evaluate the magnitudes and initial phase angles of different circulating current components. Moreover, in order to calculate the circulating current in a point-to-point MMC-based HVdc system under unbalanced grid conditions, the derivation of equivalent dc impedance of an MMC is discussed as well. According to the dc impedance model, an MMC inverter can bemore » represented as a series connected R-L-C branch, with its equivalent resistance and capacitance directly related to the circulating current control parameters. Experimental results from a scaled-down three-phase MMC system under an emulated single-line-to-ground fault are provided to support the theoretical analysis and derived model. In conclusion, this new models provides an insight into the impact of different control schemes on the fault characteristics and improves the understanding of the operation of MMC under unbalanced conditions.« less
Analysis of simple sequence repeat (SSR) structure and sequence within Epichloë endophyte genomes reveals impacts on gene structure and insights into ancestral hybridization events.

PubMed

Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry

2017-01-01

Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Known for the beneficial characteristics they endow upon their grass hosts, the identification of these endophyte species has been of great interest agronomically and scientifically. The use of simple sequence repeat loci and the variation in repeat elements has been used to rapidly identify endophyte species and strains, however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
Performance comparison of SNP detection tools with illumina exome sequencing data—an assessment using both family pedigree information and sample-matched SNP array data

PubMed Central

Yi, Ming; Zhao, Yongmei; Jia, Li; He, Mei; Kebebew, Electron; Stephens, Robert M.

2014-01-01

To apply exome-seq-derived variants in the clinical setting, there is an urgent need to identify the best variant caller(s) from a large collection of available options. We have used an Illumina exome-seq dataset as a benchmark, with two validation scenarios—family pedigree information and SNP array data for the same samples, permitting global high-throughput cross-validation, to evaluate the quality of SNP calls derived from several popular variant discovery tools from both the open-source and commercial communities using a set of designated quality metrics. To the best of our knowledge, this is the first large-scale performance comparison of exome-seq variant discovery tools using high-throughput validation with both Mendelian inheritance checking and SNP array data, which allows us to gain insights into the accuracy of SNP calling through such high-throughput validation in an unprecedented way, whereas the previously reported comparison studies have only assessed concordance of these tools without directly assessing the quality of the derived SNPs. More importantly, the main purpose of our study was to establish a reusable procedure that applies high-throughput validation to compare the quality of SNP discovery tools with a focus on exome-seq, which can be used to compare any forthcoming tool(s) of interest. PMID:24831545
An integrated genomic and transcriptomic survey of mucormycosis-causing fungi

PubMed Central

Chibucos, Marcus C.; Soliman, Sameh; Gebremariam, Teclegiorgis; Lee, Hongkyu; Daugherty, Sean; Orvis, Joshua; Shetty, Amol C.; Crabtree, Jonathan; Hazen, Tracy H.; Etienne, Kizee A.; Kumari, Priti; O'Connor, Timothy D.; Rasko, David A.; Filler, Scott G.; Fraser, Claire M.; Lockhart, Shawn R.; Skory, Christopher D.; Ibrahim, Ashraf S.; Bruno, Vincent M.

2016-01-01

Mucormycosis is a life-threatening infection caused by Mucorales fungi. Here we sequence 30 fungal genomes, and perform transcriptomics with three representative Rhizopus and Mucor strains and with human airway epithelial cells during fungal invasion, to reveal key host and fungal determinants contributing to pathogenesis. Analysis of the host transcriptional response to Mucorales reveals platelet-derived growth factor receptor B (PDGFRB) signaling as part of a core response to divergent pathogenic fungi; inhibition of PDGFRB reduces Mucorales-induced damage to host cells. The unique presence of CotH invasins in all invasive Mucorales, and the correlation between CotH gene copy number and clinical prevalence, are consistent with an important role for these proteins in mucormycosis pathogenesis. Our work provides insight into the evolution of this medically and economically important group of fungi, and identifies several molecular pathways that might be exploited as potential therapeutic targets. PMID:27447865
Iron Age and Anglo-Saxon genomes from East England reveal British migration history.

PubMed

Schiffels, Stephan; Haak, Wolfgang; Paajanen, Pirita; Llamas, Bastien; Popescu, Elizabeth; Loe, Louise; Clarke, Rachel; Lyons, Alice; Mortimer, Richard; Sayer, Duncan; Tyler-Smith, Chris; Cooper, Alan; Durbin, Richard

2016-01-19

British population history has been shaped by a series of immigrations, including the early Anglo-Saxon migrations after 400 CE. It remains an open question how these events affected the genetic composition of the current British population. Here, we present whole-genome sequences from 10 individuals excavated close to Cambridge in the East of England, ranging from the late Iron Age to the middle Anglo-Saxon period. By analysing shared rare variants with hundreds of modern samples from Britain and Europe, we estimate that on average the contemporary East English population derives 38% of its ancestry from Anglo-Saxon migrations. We gain further insight with a new method, rarecoal, which infers population history and identifies fine-scale genetic ancestry from rare variants. Using rarecoal we find that the Anglo-Saxon samples are closely related to modern Dutch and Danish populations, while the Iron Age samples share ancestors with multiple Northern European populations including Britain.
Big Data Analytics for Genomic Medicine

PubMed Central

He, Karen Y.; Ge, Dongliang; He, Max M.

2017-01-01

Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients’ genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights through examining large-scale various data sets. While integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure exhibit challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identify clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from the EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. Additionally, we also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs. PMID:28212287
Exploring mitochondrial evolution and metabolism organization principles by comparative analysis of metabolic networks.

PubMed

Chang, Xiao; Wang, Zhuo; Hao, Pei; Li, Yuan-Yuan; Li, Yi-Xue

2010-06-01

The endosymbiotic theory proposed that mitochondrial genomes are derived from an alpha-proteobacterium-like endosymbiont, which was concluded from sequence analysis. We rebuilt the metabolic networks of mitochondria and 22 relative species, and studied the evolution of mitochondrial metabolism at the level of enzyme content and network topology. Our phylogenetic results based on network alignment and motif identification supported the endosymbiotic theory from the point of view of systems biology for the first time. It was found that the mitochondrial metabolic network were much more compact than the relative species, probably related to the higher efficiency of oxidative phosphorylation of the specialized organelle, and the network is highly clustered around the TCA cycle. Moreover, the mitochondrial metabolic network exhibited high functional specificity to the modules. This work provided insight to the understanding of mitochondria evolution, and the organization principle of mitochondrial metabolic network at the network level. Copyright 2010 Elsevier Inc. All rights reserved.
Telomerase Mechanism of Telomere Synthesis

PubMed Central

Wu, R. Alex; Upton, Heather E.; Vogan, Jacob M.; Collins, Kathleen

2017-01-01

Telomerase is the essential reverse transcriptase required for linear chromosome maintenance in most eukaryotes. Telomerase supplements the tandem array of simple-sequence repeats at chromosome ends to compensate for the DNA erosion inherent in genome replication. The template for telomerase reverse transcriptase is within the RNA subunit of the ribonucleoprotein complex, which in cells contains additional telomerase holoenzyme proteins that assemble the active ribonucleoprotein and promote its function at telomeres. Telomerase is distinct among polymerases in its reiterative reuse of an internal template. The template is precisely defined, processively copied, and regenerated by release of single-stranded product DNA. New specificities of nucleic acid handling that underlie the catalytic cycle of repeat synthesis derive from both active site specialization and new motif elaborations in protein and RNA subunits. Studies of telomerase provide unique insights into cellular requirements for genome stability, tissue renewal, and tumorigenesis as well as new perspectives on dynamic ribonucleoprotein machines. PMID:28141967
New insights into the molecular interaction of the C-terminal sequence of CXCL4 with fibroblast growth factor-2.

PubMed

Ragona, Laura; Tomaselli, Simona; Quemener, Cathy; Zetta, Lucia; Bikfalvi, Andreas

2009-04-24

Full-length CXCL4 chemokine and a peptide derived from its carboxyl-terminal domain exhibits significant antiangiogenic and anti-tumor activity in vivo and in vitro by interacting with fibroblast growth factor (FGF). In this study we used NMR spectroscopy to characterize at a molecular level the interactions between CXCL4 (47-70) and FGF-2 identifying the peptide residues mainly involved in the contact area with the growth factor. Altogether NMR data point to a major role of the hydrophobic contributions of the C-terminal region of CXCL4 (47-70) peptide in addition to specific contacts established by the N-terminal region through cysteine side chain. The proposed recognition mode constitutes a rationale for the observed effects of CXCL4 (47-70) on FGF-2 biological activity and lays the basis for developing novel inhibitors of angiogenesis.

Big Data Analytics for Genomic Medicine.

PubMed

He, Karen Y; Ge, Dongliang; He, Max M

2017-02-15

Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients' genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights through examining large-scale various data sets. While integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure exhibit challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identify clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from the EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. Additionally, we also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs.
Transferrin receptor facilitates TGF-β and BMP signaling activation to control craniofacial morphogenesis

PubMed Central

Lei, R; Zhang, K; Liu, K; Shao, X; Ding, Z; Wang, F; Hong, Y; Zhu, M; Li, H; Li, H

2016-01-01

The Pierre Robin Sequence (PRS), consisting of cleft palate, glossoptosis and micrognathia, is a common human birth defect. However, how this abnormality occurs remains largely unknown. Here we report that neural crest cell (NCC)-specific knockout of transferrin receptor (Tfrc), a well known transferrin transporter protein, caused micrognathia, cleft palate, severe respiratory distress and inability to suckle in mice, which highly resemble human PRS. Histological and anatomical analysis revealed that the cleft palate is due to the failure of palatal shelves elevation that resulted from a retarded extension of Meckel's cartilage. Interestingly, Tfrc deletion dramatically suppressed both transforming growth factor-β (TGF-β) and bone morphogenetic protein (BMP) signaling in cranial NCCs-derived mandibular tissues, suggesting that Tfrc may act as a facilitator of these two signaling pathways during craniofacial morphogenesis. Together, our study uncovers an unknown function of Tfrc in craniofacial development and provides novel insight into the etiology of PRS. PMID:27362800
Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy

Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the averagemore » nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.« less
ToF-SIMS analysis of a polymer microarray composed of poly(meth)acrylates with C6 derivative pendant groups.

PubMed

Hook, Andrew L; Scurr, David J

2016-04-01

Surface analysis plays a key role in understanding the function of materials, particularly in biological environments. Time-of-flight secondary ion mass spectrometry (ToF-SIMS) provides highly surface sensitive chemical information that can readily be acquired over large areas and has, thus, become an important surface analysis tool. However, the information-rich nature of ToF-SIMS complicates the interpretation and comparison of spectra, particularly in cases where multicomponent samples are being assessed. In this study, a method is presented to assess the chemical variance across 16 poly(meth)acrylates. Materials are selected to contain C 6 pendant groups, and ten replicates of each are printed as a polymer microarray. SIMS spectra are acquired for each material with the most intense and unique ions assessed for each material to identify the predominant and distinctive fragmentation pathways within the materials studied. Differentiating acrylate/methacrylate pairs is readily achieved using secondary ions derived from both the polymer backbone and pendant groups. Principal component analysis (PCA) is performed on the SIMS spectra of the 16 polymers, whereby the resulting principal components are able to distinguish phenyl from benzyl groups, mono-functional from multi-functional monomers and acrylates from methacrylates. The principal components are applied to copolymer series to assess the predictive capabilities of the PCA. Beyond being able to predict the copolymer ratio, in some cases, the SIMS analysis is able to provide insight into the molecular sequence of a copolymer. The insight gained in this study will be beneficial for developing structure-function relationships based upon ToF-SIMS data of polymer libraries. © 2016 The Authors Surface and Interface Analysis Published by John Wiley & Sons Ltd.
Ecotypic differentiation under farmers' selection: Molecular insights into the domestication of Pachyrhizus Rich. ex DC. (Fabaceae) in the Peruvian Andes.

PubMed

Delêtre, Marc; Soengas, Beatriz; Vidaurre, Prem Jai; Meneses, Rosa Isela; Delgado Vásquez, Octavio; Oré Balbín, Isabel; Santayana, Monica; Heider, Bettina; Sørensen, Marten

2017-06-01

Understanding the distribution of crop genetic diversity in relation to environmental factors can give insights into the eco-evolutionary processes involved in plant domestication. Yam beans ( Pachyrhizus Rich. ex DC.) are leguminous crops native to South and Central America that are grown for their tuberous roots but are seed-propagated. Using a landscape genetic approach, we examined correlations between environmental factors and phylogeographic patterns of genetic diversity in Pachyrhizus landrace populations. Molecular analyses based on chloroplast DNA sequencing and a new set of nuclear microsatellite markers revealed two distinct lineages, with strong genetic differentiation between Andean landraces (lineage A) and Amazonian landraces (lineage B). The comparison of different evolutionary scenarios for the diversification history of yam beans in the Andes using approximate Bayesian computation suggests that Pachyrhizus ahipa and Pachyrhizus tuberosus share a progenitor-derivative relationship, with environmental factors playing an important role in driving selection for divergent ecotypes. The new molecular data call for a revision of the taxonomy of Pachyrhizus but are congruent with paleoclimatic and archeological evidence, and suggest that selection for determinate growth was part of ecophysiological adaptations associated with the diversification of the P. tuberosus - P. ahipa complex during the Mid-Holocene.
Catalog of genetic progression of human cancers: breast cancer.

PubMed

Desmedt, Christine; Yates, Lucy; Kulka, Janina

2016-03-01

With the rapid development of next-generation sequencing, deeper insights are being gained into the molecular evolution that underlies the development and clinical progression of breast cancer. It is apparent that during evolution, breast cancers acquire thousands of mutations including single base pair substitutions, insertions, deletions, copy number aberrations, and structural rearrangements. As a consequence, at the whole genome level, no two cancers are identical and few cancers even share the same complement of "driver" mutations. Indeed, two samples from the same cancer may also exhibit extensive differences due to constant remodeling of the genome over time. In this review, we summarize recent studies that extend our understanding of the genomic basis of cancer progression. Key biological insights include the following: subclonal diversification begins early in cancer evolution, being detectable even in in situ lesions; geographical stratification of subclonal structure is frequent in primary tumors and can include therapeutically targetable alterations; multiple distant metastases typically arise from a common metastatic ancestor following a "metastatic cascade" model; systemic therapy can unmask preexisting resistant subclones or influence further treatment sensitivity and disease progression. We conclude the review by describing novel approaches such as the analysis of circulating DNA and patient-derived xenografts that promise to further our understanding of the genomic changes occurring during cancer evolution and guide treatment decision making.
Insight into the Structure of Amyloid Fibrils from the Analysis of Globular Proteins

PubMed Central

Trovato, Antonio; Chiti, Fabrizio; Maritan, Amos; Seno, Flavio

2006-01-01

The conversion from soluble states into cross-β fibrillar aggregates is a property shared by many different proteins and peptides and was hence conjectured to be a generic feature of polypeptide chains. Increasing evidence is now accumulating that such fibrillar assemblies are generally characterized by a parallel in-register alignment of β-strands contributed by distinct protein molecules. Here we assume a universal mechanism is responsible for β-structure formation and deduce sequence-specific interaction energies between pairs of protein fragments from a statistical analysis of the native folds of globular proteins. The derived fragment–fragment interaction was implemented within a novel algorithm, prediction of amyloid structure aggregation (PASTA), to investigate the role of sequence heterogeneity in driving specific aggregation into ordered self-propagating cross-β structures. The algorithm predicts that the parallel in-register arrangement of sequence portions that participate in the fibril cross-β core is favoured in most cases. However, the antiparallel arrangement is correctly discriminated when present in fibrils formed by short peptides. The predictions of the most aggregation-prone portions of initially unfolded polypeptide chains are also in excellent agreement with available experimental observations. These results corroborate the recent hypothesis that the amyloid structure is stabilised by the same physicochemical determinants as those operating in folded proteins. They also suggest that side chain–side chain interaction across neighbouring β-strands is a key determinant of amyloid fibril formation and of their self-propagating ability. PMID:17173479
Comparison of the complete genome sequences of four γ-hexachlorocyclohexane-degrading bacterial strains: insights into the evolution of bacteria able to degrade a recalcitrant man-made pesticide.

PubMed

Tabata, Michiro; Ohhata, Satoshi; Nikawadori, Yuki; Kishida, Kouhei; Sato, Takuya; Kawasumi, Toru; Kato, Hiromi; Ohtsubo, Yoshiyuki; Tsuda, Masataka; Nagata, Yuji

2016-12-01

γ-Hexachlorocyclohexane (γ-HCH) is a recalcitrant man-made chlorinated pesticide. Here, the complete genome sequences of four γ-HCH-degrading sphingomonad strains, which are most unlikely to have been derived from one ancestral γ-HCH degrader, were compared. Together with several experimental data, we showed that (i) all the four strains carry almost identical linA to linE genes for the conversion of γ-HCH to maleylacetate (designated "specific" lin genes), (ii) considerably different genes are used for the metabolism of maleylacetate in one of the four strains, and (iii) the linKLMN genes for the putative ABC transporter necessary for γ-HCH utilization exhibit structural divergence, which reflects the phylogenetic relationship of their hosts. Replicon organization and location of the lin genes in the four genomes are significantly different with one another, and that most of the specific lin genes are located on multiple sphingomonad-unique plasmids. Copies of IS6100, the most abundant insertion sequence in the four strains, are often located in close proximity to the specific lin genes. Analysis of the footprints of target duplication upon IS6100 transposition and the experimental detection of IS6100 transposition strongly suggested that the IS6100 transposition has caused dynamic genome rearrangements and the diversification of lin-flanking regions in the four strains. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Non-mydriatic video ophthalmoscope to measure fast temporal changes of the human retina

NASA Astrophysics Data System (ADS)

Tornow, Ralf P.; Kolář, Radim; Odstrčilík, Jan

2015-07-01

The analysis of fast temporal changes of the human retina can be used to get insight to normal physiological behavior and to detect pathological deviations. This can be important for the early detection of glaucoma and other eye diseases. We developed a small, lightweight, USB powered video ophthalmoscope that allows taking video sequences of the human retina with at least 25 frames per second without dilating the pupil. Short sequences (about 10 s) of the optic nerve head (20° x 15°) are recorded from subjects and registered offline using two-stage process (phase correlation and Lucas-Kanade approach) to compensate for eye movements. From registered video sequences, different parameters can be calculated. Two applications are described here: measurement of (i) cardiac cycle induced pulsatile reflection changes and (ii) eye movements and fixation pattern. Cardiac cycle induced pulsatile reflection changes are caused by changing blood volume in the retina. Waveform and pulse parameters like amplitude and rise time can be measured in any selected areas within the retinal image. Fixation pattern ΔY(ΔX) can be assessed from eye movements during video acquisition. The eye movements ΔX[t], ΔY[t] are derived from image registration results with high temporal (40 ms) and spatial (1,86 arcmin) resolution. Parameters of pulsatile reflection changes and fixation pattern can be affected in beginning glaucoma and the method described here may support early detection of glaucoma and other eye disease.
MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive.

PubMed

Bernstein, Matthew N; Doan, AnHai; Dewey, Colin N

2017-09-15

The NCBI's Sequence Read Archive (SRA) promises great biological insight if one could analyze the data in the aggregate; however, the data remain largely underutilized, in part, due to the poor structure of the metadata associated with each sample. The rules governing submissions to the SRA do not dictate a standardized set of terms that should be used to describe the biological samples from which the sequencing data are derived. As a result, the metadata include many synonyms, spelling variants and references to outside sources of information. Furthermore, manual annotation of the data remains intractable due to the large number of samples in the archive. For these reasons, it has been difficult to perform large-scale analyses that study the relationships between biomolecular processes and phenotype across diverse diseases, tissues and cell types present in the SRA. We present MetaSRA, a database of normalized SRA human sample-specific metadata following a schema inspired by the metadata organization of the ENCODE project. This schema involves mapping samples to terms in biomedical ontologies, labeling each sample with a sample-type category, and extracting real-valued properties. We automated these tasks via a novel computational pipeline. The MetaSRA is available at metasra.biostat.wisc.edu via both a searchable web interface and bulk downloads. Software implementing our computational pipeline is available at http://github.com/deweylab/metasra-pipeline. cdewey@biostat.wisc.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Removing the needle from the haystack: Enrichment of Wolbachia endosymbiont transcripts from host nematode RNA by Cappable-seq™.

PubMed

Luck, Ashley N; Slatko, Barton E; Foster, Jeremy M

2017-01-01

Efficient transcriptomic sequencing of microbial mRNA derived from host-microbe associations is often compromised by the much lower relative abundance of microbial RNA in the mixed total RNA sample. One solution to this problem is to perform extensive sequencing until an acceptable level of transcriptome coverage is obtained. More cost-effective methods include use of prokaryotic and/or eukaryotic rRNA depletion strategies, sometimes in conjunction with depletion of polyadenylated eukaryotic mRNA. Here, we report use of Cappable-seq™ to specifically enrich, in a single step, Wolbachia endobacterial mRNA transcripts from total RNA prepared from the parasitic filarial nematode, Brugia malayi. The obligate Wolbachia endosymbiont is a proven drug target for many human filarial infections, yet the precise nature of its symbiosis with the nematode host is poorly understood. Insightful analysis of the expression levels of Wolbachia genes predicted to underpin the mutualistic association and of known drug target genes at different life cycle stages or in response to drug treatments is typically challenged by low transcriptomic coverage. Cappable-seq resulted in up to ~ 5-fold increase in the number of reads mapping to Wolbachia. On average, coverage of Wolbachia transcripts from B. malayi microfilariae was enriched ~40-fold by Cappable-seq. Additionally, this method has an additional benefit of selectively removing abundant prokaryotic ribosomal RNAs.The deeper microbial transcriptome sequencing afforded by Cappable-seq facilitates more detailed characterization of gene expression levels of pathogens and symbionts present in animal tissues.
Indo-Burma Range: a belt of accreted microcontinents, ophiolites and Mesozoic-Paleogene flyschoid sediments

NASA Astrophysics Data System (ADS)

Acharyya, S. K.

2015-07-01

This study provides an insight into the lithotectonic evolution of the N-S trending Indo-Burma Range (IBR), constituting the southern flank of the Himalayan syntaxis. Paleogene flyschoid sediments (Disang-Barail) that represent a shallow marine to deltaic environment mainly comprise the west-central sector of IBR, possibly resting upon a continental base. On the east, these sequences are tectonically flanked by the Eocene olistostromal facies of the Disang, which developed through accretion of trench sediments during the subduction. The shelf and trench facies sequences of the Disang underwent overthrusting from the east, giving rise to two ophiolite suites ( Naga Hills Lower Ophiolite ( NHLO) and Victoria Hills Upper Ophiolite ( VHUO), but with different accretion history. The ophiolite and ophiolite cover rock package were subsequently overthrusted by the Proterozoic metamorphic sequence, originated from the Burmese continent. The NHLO suite of Late Jurassic to Early Eocene age is unconformably overlain by mid-Eocene shallow marine ophiolite-derived clastics. On the south, the VHUO of Mesozoic age is structurally underlain by continental metamorphic rocks. The entire package in Victoria Hills is unconformably overlain by shallow marine Late Albian sediments. Both the ophiolite suites and the sandwiched continental metamorphic rocks are thrust westward over the Paleogene shelf sediments. These dismembered ophiolites and continental metamorphic rocks suggest thin-skinned tectonic detachment processes in IBR, as reflected from the presence of klippe of continental metamorphic rocks over the NHLO and the flyschoid Disang floor sediments and half windows exposing the Disang beneath the NHLO.
Structure and binding analysis of Polyporus squamosus lectin in complex with the Neu5Acα2-6Galβ1-4GlcNAc human-type influenza receptor

PubMed Central

Kadirvelraj, Renuka; Grant, Oliver C; Goldstein, Irwin J; Winter, Harry C; Tateno, Hiroaki; Fadda, Elisa; Woods, Robert J

2011-01-01

Glycan chains that terminate in sialic acid (Neu5Ac) are frequently the receptors targeted by pathogens for initial adhesion. Carbohydrate-binding proteins (lectins) with specificity for Neu5Ac are particularly useful in the detection and isolation of sialylated glycoconjugates, such as those associated with pathogen adhesion as well as those characteristic of several diseases including cancer. Structural studies of lectins are essential in order to understand the origin of their specificity, which is particularly important when employing such reagents as diagnostic tools. Here, we report a crystallographic and molecular dynamics (MD) analysis of a lectin from Polyporus squamosus (PSL) that is specific for glycans terminating with the sequence Neu5Acα2-6Galβ. Because of its importance as a histological reagent, the PSL structure was solved (to 1.7 Å) in complex with a trisaccharide, whose sequence (Neu5Acα2-6Galβ1-4GlcNAc) is exploited by influenza A hemagglutinin for viral adhesion to human tissue. The structural data illuminate the origin of the high specificity of PSL for the Neu5Acα2-6Gal sequence. Theoretical binding free energies derived from the MD data confirm the key interactions identified crystallographically and provide additional insight into the relative contributions from each amino acid, as well as estimates of the importance of entropic and enthalpic contributions to binding. PMID:21436237
Enterovirus Migration Patterns between France and Tunisia.

PubMed

Othman, Ines; Mirand, Audrey; Slama, Ichrak; Mastouri, Maha; Peigue-Lafeuille, Hélène; Aouni, Mahjoub; Bailly, Jean-Luc

2015-01-01

The enterovirus (EV) types echovirus (E-) 5, E-9, and E-18, and coxsackievirus (CV-) A9 are infrequently reported in human diseases and their epidemiologic features are poorly defined. Virus transmission patterns between countries have been estimated with phylogenetic data derived from the 1D/VP1 and 3CD gene sequences of a sample of 74 strains obtained in France (2000-2012) and Tunisia (2011-2013) and from the publicly available sequences. The EV types (E-5, E-9, and E-18) exhibited a lower worldwide genetic diversity (respective number of genogroups: 4, 5, and 3) in comparison to CV-A9 (n = 10). The phylogenetic trees estimated with both 1D/VP1 and 3CD sequence data showed variations in the number of co-circulating lineages over the last 20 years among the four EV types. Despite the low number of genogroups in E-18, the virus exhibited the highest number of recombinant 3CD lineages (n = 10) versus 4 (E-5) to 8 (E-9). The phylogenies provided evidence of multiple transportation events between France and Tunisia involving E-5, E-9, E-18, and CV-A9 strains. Virus spread events between France and 17 other countries in five continents had high probabilities of occurrence as those between Tunisia and two European countries other than France. All transportation events were supported by BF values > 10. Inferring the source of virus transmission from phylogenetic data may provide insights into the patterns of sporadic and epidemic diseases caused by EVs.
Analytical and Clinical Validation of a Digital Sequencing Panel for Quantitative, Highly Accurate Evaluation of Cell-Free Circulating Tumor DNA

PubMed Central

Zill, Oliver A.; Sebisanovic, Dragan; Lopez, Rene; Blau, Sibel; Collisson, Eric A.; Divers, Stephen G.; Hoon, Dave S. B.; Kopetz, E. Scott; Lee, Jeeyun; Nikolinakos, Petros G.; Baca, Arthur M.; Kermani, Bahram G.; Eltoukhy, Helmy; Talasaz, AmirAli

2015-01-01

Next-generation sequencing of cell-free circulating solid tumor DNA addresses two challenges in contemporary cancer care. First this method of massively parallel and deep sequencing enables assessment of a comprehensive panel of genomic targets from a single sample, and second, it obviates the need for repeat invasive tissue biopsies. Digital SequencingTM is a novel method for high-quality sequencing of circulating tumor DNA simultaneously across a comprehensive panel of over 50 cancer-related genes with a simple blood test. Here we report the analytic and clinical validation of the gene panel. Analytic sensitivity down to 0.1% mutant allele fraction is demonstrated via serial dilution studies of known samples. Near-perfect analytic specificity (> 99.9999%) enables complete coverage of many genes without the false positives typically seen with traditional sequencing assays at mutant allele frequencies or fractions below 5%. We compared digital sequencing of plasma-derived cell-free DNA to tissue-based sequencing on 165 consecutive matched samples from five outside centers in patients with stage III-IV solid tumor cancers. Clinical sensitivity of plasma-derived NGS was 85.0%, comparable to 80.7% sensitivity for tissue. The assay success rate on 1,000 consecutive samples in clinical practice was 99.8%. Digital sequencing of plasma-derived DNA is indicated in advanced cancer patients to prevent repeated invasive biopsies when the initial biopsy is inadequate, unobtainable for genomic testing, or uninformative, or when the patient’s cancer has progressed despite treatment. Its clinical utility is derived from reduction in the costs, complications and delays associated with invasive tissue biopsies for genomic testing. PMID:26474073
TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites.

PubMed Central

Wallace, A. C.; Borkakoti, N.; Thornton, J. M.

1997-01-01

It is well established that sequence templates such as those in the PROSITE and PRINTS databases are powerful tools for predicting the biological function and tertiary structure for newly derived protein sequences. The number of X-ray and NMR protein structures is increasing rapidly and it is apparent that a 3D equivalent of the sequence templates is needed. Here, we describe an algorithm called TESS that automatically derives 3D templates from structures deposited in the Brookhaven Protein Data Bank. While a new sequence can be searched for sequence patterns, a new structure can be scanned against these 3D templates to identify functional sites. As examples, 3D templates are derived for enzymes with an O-His-O "catalytic triad" and for the ribonucleases and lysozymes. When these 3D templates are applied to a large data set of nonidentical proteins, several interesting hits are located. This suggests that the development of a 3D template database may help to identify the function of new protein structures, if unknown, as well as to design proteins with specific functions. PMID:9385633
cis-β-Bromostyrene derivatives from cinnamic acids via a tandem substitutive bromination-decarboxylation sequence.

PubMed

Tang, Khanh G; Kent, Greggory T; Erden, Ihsan; Wu, Weiming

2017-10-04

cis -β-Bromostyrene derivatives were synthesized stereospecifically from cinnamic acids through β-lactone intermediates. The synthetic sequence did not require the purification of the β-lactone intermediates although they were found to be stable and readily purified in most cases.
Characterization of circulating transfer RNA-Derived RNA fragments in cattle

USDA-ARS?s Scientific Manuscript database

The objective was to characterize naturally occurring circulating transfer RNA-derived RNA Fragments (tRFs) in cattle. Serum from eight clinically normal adult dairy cows was collected, and small non-coding RNAs were extracted immediately after collection and sequenced by Illumina MiSeq. Sequences a...
Efficient generation of megakaryocytes from human induced pluripotent stem cells using food and drug administration-approved pharmacological reagents.

PubMed

Liu, Yanfeng; Wang, Ying; Gao, Yongxing; Forbes, Jessica A; Qayyum, Rehan; Becker, Lewis; Cheng, Linzhao; Wang, Zack Z

2015-04-01

Megakaryocytes (MKs) are rare hematopoietic cells in the adult bone marrow and produce platelets that are critical to vascular hemostasis and wound healing. Ex vivo generation of MKs from human induced pluripotent stem cells (hiPSCs) provides a renewable cell source of platelets for treating thrombocytopenic patients and allows a better understanding of MK/platelet biology. The key requirements in this approach include developing a robust and consistent method to produce functional progeny cells, such as MKs from hiPSCs, and minimizing the risk and variation from the animal-derived products in cell cultures. In this study, we developed an efficient system to generate MKs from hiPSCs under a feeder-free and xeno-free condition, in which all animal-derived products were eliminated. Several crucial reagents were evaluated and replaced with Food and Drug Administration-approved pharmacological reagents, including romiplostim (Nplate, a thrombopoietin analog), oprelvekin (recombinant interleukin-11), and Plasbumin (human albumin). We used this method to induce MK generation from hiPSCs derived from 23 individuals in two steps: generation of CD34(+)CD45(+) hematopoietic progenitor cells (HPCs) for 14 days; and generation and expansion of CD41(+)CD42a(+) MKs from HPCs for an additional 5 days. After 19 days, we observed abundant CD41(+)CD42a(+) MKs that also expressed the MK markers CD42b and CD61 and displayed polyploidy (≥16% of derived cells with DNA contents >4N). Transcriptome analysis by RNA sequencing revealed that megakaryocytic-related genes were highly expressed. Additional maturation and investigation of hiPSC-derived MKs should provide insights into MK biology and lead to the generation of large numbers of platelets ex vivo. ©AlphaMed Press.
Generation of a Maize B Centromere Minimal Map Containing the Central Core Domain.

PubMed

Ellis, Nathanael A; Douglas, Ryan N; Jackson, Caroline E; Birchler, James A; Dawe, R Kelly

2015-10-28

The maize B centromere has been used as a model for centromere epigenetics and as the basis for building artificial chromosomes. However, there are no sequence resources for this important centromere. Here we used transposon display for the centromere-specific retroelement CRM2 to identify a collection of 40 sequence tags that flank CRM2 insertion points on the B chromosome. These were confirmed to lie within the centromere by assaying deletion breakpoints from centromere misdivision derivatives (intracentromere breakages caused by centromere fission). Markers were grouped together on the basis of their association with other markers in the misdivision series and assembled into a pseudocontig containing 10.1 kb of sequence. To identify sequences that interact directly with centromere proteins, we carried out chromatin immunoprecipitation using antibodies to centromeric histone H3 (CENH3), a defining feature of functional centromeric sequences. The CENH3 chromatin immunoprecipitation map was interpreted relative to the known transmission rates of centromere misdivision derivatives to identify a centromere core domain spanning 33 markers. A subset of seven markers was mapped in additional B centromere misdivision derivatives with the use of unique primer pairs. A derivative previously shown to have no canonical centromere sequences (Telo3-3) lacks these core markers. Our results provide a molecular map of the B chromosome centromere and identify key sequences within the map that interact directly with centromeric histone H3. Copyright © 2015 Ellis et al.

Sequence analysis of the pyruvylated galactan sulfate-derived oligosaccharides by negative-ion electrospray tandem mass spectrometry.

PubMed

Li, Na; Mao, Wenjun; Liu, Xue; Wang, Shuyao; Xia, Zheng; Cao, Sujian; Li, Lin; Zhang, Qi; Liu, Shan

2016-10-04

Five sulfated oligosaccharide fragments, F1-F5, were prepared from a pyruvylated galactan sulfate from the green alga Codium divaricatum, by partial depolymerization using mild acid hydrolysis and purification with gel-permeation chromatography. Negative-ion electrospray tandem mass spectrometry with collision-induced dissociation (ES-CID-MS/MS) is attempted for sequence determination of the sulfated oligosaccharides. The sequence of F1 with homogeneous disaccharide composition was first characterized to be Galp-(4SO4)-(1 → 3)-Galp by detailed nuclear magnetic resonance spectroscopic analyses. The fragmentation pattern of F1 in the product ion spectra was established on the basis of negative-ion ES-CID MS/MS, which was then applied to sequence analysis of other sulfated oligosaccharides. The sequences of F2 and F3 were deduced to be Galp-(4SO4)-(1 → 3)-Galp-(1 → 3)-Galp-(1 → 3)-Galp and 3,4-O-(1-carboxyethylidene)-Galp-(6SO4)-(1 → 3)-Galp, respectively. The sequences of major fragments in F4 and F5 were also deduced. The investigation demonstrated that negative-ion ES-CID-MS/MS was an efficient method for the sequence analysis of the pyruvylated galactan sulfate-derived oligosaccharides which revealed the patterns of substitution and glycosidic linkages. The pyruvylated galactan sulfate-derived oligosaccharides were novel sulfated oligosaccharides different from other algal polysaccharide-derived oligosaccharides. Copyright © 2016 Elsevier Ltd. All rights reserved.
Application of genomics for understanding plant virus-insect vector interactions and insect vector control

USDA-ARS?s Scientific Manuscript database

The ability to decipher DNA sequences provides new insights into the study of plant viruses and their interactions with host plants, including the intricate interactions that allow a virus to be transmitted by an insect vector. Next generation sequencing (NGS) provides a wealth of genetic informati...
The tomato genome sequence provides insight into fleshy fruit evolution

USDA-ARS?s Scientific Manuscript database

The genome of the inbred tomato cultivar ‘Heinz 1706’ was sequenced and assembled using a combination of Sanger and “next generation” technologies. The predicted genome size is ~900 Mb, consistent with prior estimates, of which 760 Mb were assembled in 91 scaffolds aligned to the 12 tomato chromosom...
Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution

USDA-ARS?s Scientific Manuscript database

Genetic and genomic analyses of Upland cotton (Gossypium hirsutum) are difficult because it has a complex allotetraploid (AADD; 2n = 4x = 52) genome. Here we sequenced, assembled and analyzed the world's most important cultivated cotton genome with 246.2 gigabase (Gb) clean data obtained using whol...
Giardia lamblia: Molecular Studies of an Early Branching Eukaryote

USDA-ARS?s Scientific Manuscript database

The rapid advance in our understanding of the biology of Giardia lamblia over the last several years is due in part to the complete DNA sequencing of the 11.7 Mb genome of this diplomonad. Insight on the molecular nature of G. lamblia has been gained by searching the genome using query sequences fr...
Genome Sequence of Sphingomonas sp. Strain PAMC 26621, an Arctic-Lichen-Associated Bacterium Isolated from a Cetraria sp.

PubMed Central

Lee, Hyoungseok; Shin, Seung Chul; Lee, Jungeun; Kim, Su Jin; Kim, Bum-Keun; Hong, Soon Gyu; Kim, Eun Hye

2012-01-01

The lichen-associated bacterial strain Sphingomonas sp. PAMC 26621 was isolated from an Arctic lichen Cetraria sp. on Svalbard Islands. Here we report the draft genome sequence of this strain, which could provide novel insights into the molecular principles of lichen-microbe interactions. PMID:22582384
BiPPred: Combined sequence- and structure-based prediction of peptide binding to the Hsp70 chaperone BiP.

PubMed

Schneider, Markus; Rosam, Mathias; Glaser, Manuel; Patronov, Atanas; Shah, Harpreet; Back, Katrin Christiane; Daake, Marina Angelika; Buchner, Johannes; Antes, Iris

2016-10-01

Substrate binding to Hsp70 chaperones is involved in many biological processes, and the identification of potential substrates is important for a comprehensive understanding of these events. We present a multi-scale pipeline for an accurate, yet efficient prediction of peptides binding to the Hsp70 chaperone BiP by combining sequence-based prediction with molecular docking and MMPBSA calculations. First, we measured the binding of 15mer peptides from known substrate proteins of BiP by peptide array (PA) experiments and performed an accuracy assessment of the PA data by fluorescence anisotropy studies. Several sequence-based prediction models were fitted using this and other peptide binding data. A structure-based position-specific scoring matrix (SB-PSSM) derived solely from structural modeling data forms the core of all models. The matrix elements are based on a combination of binding energy estimations, molecular dynamics simulations, and analysis of the BiP binding site, which led to new insights into the peptide binding specificities of the chaperone. Using this SB-PSSM, peptide binders could be predicted with high selectivity even without training of the model on experimental data. Additional training further increased the prediction accuracies. Subsequent molecular docking (DynaDock) and MMGBSA/MMPBSA-based binding affinity estimations for predicted binders allowed the identification of the correct binding mode of the peptides as well as the calculation of nearly quantitative binding affinities. The general concept behind the developed multi-scale pipeline can readily be applied to other protein-peptide complexes with linearly bound peptides, for which sufficient experimental binding data for the training of classical sequence-based prediction models is not available. Proteins 2016; 84:1390-1407. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus

PubMed Central

Coughlan, Simone; Taylor, Ali Shirley; Feane, Eoghan; Sanders, Mandy; Schonian, Gabriele; Cotton, James A.

2018-01-01

The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America, where the Viannia subgenus predominates, so far only L. (Viannia) braziliensis and L. (V.) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. (V.) naiffi and L. (V.) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with 10 other Viannia genomes revealed four traits common to all Viannia: aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia, there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in all bar L. lainsoni, L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from the 3’ end of chromosome 34. This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance. PMID:29765675
Integrating protein structural dynamics and evolutionary analysis with Bio3D.

PubMed

Skjærven, Lars; Yao, Xin-Qiu; Scarabelli, Guido; Grant, Barry J

2014-12-10

Popular bioinformatics approaches for studying protein functional dynamics include comparisons of crystallographic structures, molecular dynamics simulations and normal mode analysis. However, determining how observed displacements and predicted motions from these traditionally separate analyses relate to each other, as well as to the evolution of sequence, structure and function within large protein families, remains a considerable challenge. This is in part due to the general lack of tools that integrate information of molecular structure, dynamics and evolution. Here, we describe the integration of new methodologies for evolutionary sequence, structure and simulation analysis into the Bio3D package. This major update includes unique high-throughput normal mode analysis for examining and contrasting the dynamics of related proteins with non-identical sequences and structures, as well as new methods for quantifying dynamical couplings and their residue-wise dissection from correlation network analysis. These new methodologies are integrated with major biomolecular databases as well as established methods for evolutionary sequence and comparative structural analysis. New functionality for directly comparing results derived from normal modes, molecular dynamics and principal component analysis of heterogeneous experimental structure distributions is also included. We demonstrate these integrated capabilities with example applications to dihydrofolate reductase and heterotrimeric G-protein families along with a discussion of the mechanistic insight provided in each case. The integration of structural dynamics and evolutionary analysis in Bio3D enables researchers to go beyond a prediction of single protein dynamics to investigate dynamical features across large protein families. The Bio3D package is distributed with full source code and extensive documentation as a platform independent R package under a GPL2 license from http://thegrantlab.org/bio3d/ .
Exome sequencing of bilateral testicular germ cell tumors suggests independent development lineages.

PubMed

Brabrand, Sigmund; Johannessen, Bjarne; Axcrona, Ulrika; Kraggerud, Sigrid M; Berg, Kaja G; Bakken, Anne C; Bruun, Jarle; Fosså, Sophie D; Lothe, Ragnhild A; Lehne, Gustav; Skotheim, Rolf I

2015-02-01

Intratubular germ cell neoplasia, the precursor of testicular germ cell tumors (TGCTs), is hypothesized to arise during embryogenesis from developmentally arrested primordial germ cells (PGCs) or gonocytes. In early embryonal life, the PGCs migrate from the yolk sac to the dorsal body wall where the cell population separates before colonizing the genital ridges. However, whether the malignant transformation takes place before or after this separation is controversial. We have explored the somatic exome-wide mutational spectra of bilateral TGCT to provide novel insight into the in utero critical time frame of malignant transformation and TGCT pathogenesis. Exome sequencing was performed in five patients with bilateral TGCT (eight tumors), of these three patients in whom both tumors were available (six tumors) and two patients each with only one available tumor (two tumors). Selected loci were explored by Sanger sequencing in 71 patients with bilateral TGCT. From the exome-wide mutational spectra, no identical mutations in any of the three bilateral tumor pairs were identified. Exome sequencing of all eight tumors revealed 87 somatic non-synonymous mutations (median 10 per tumor; range 5-21), some in already known cancer genes such as CIITA, NEB, platelet-derived growth factor receptor α (PDGFRA), and WHSC1. SUPT6H was found recurrently mutated in two tumors. We suggest independent development lineages of bilateral TGCT. Thus, malignant transformation into intratubular germ cell neoplasia is likely to occur after the migration of PGCs. We reveal possible drivers of TGCT pathogenesis, such as mutated PDGFRA, potentially with therapeutic implications for TGCT patients. Copyright © 2014 Neoplasia Press, Inc. Published by Elsevier Inc. All rights reserved.
Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome

PubMed Central

Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz

2014-01-01

Background The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless there are currently no publicly available transcriptome reference sequences or genome for this species. Results A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Mesocricetus auratus Syrian hamster transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodents, primate and laurasiatheria species to gain insights about the genetic relatedness and placement of this species. Conclusions This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. Furthermore, these data provide the basis for development of expression microarrays that can be used in functional genomics studies. PMID:25398096
Genetic environment of the KPC gene in Acinetobacter baumannii ST2 clone from Puerto Rico and genomic insights into its drug resistance

PubMed Central

Martinez, Teresa; Martinez, Idali; Vazquez, Guillermo J.; Aquino, Edna E.

2016-01-01

Carbapenems are considered the last-resort antibiotics to treat infections caused by multidrug-resistant Gram-negative bacilli. The Klebsiella pneumoniae carbapenemase (KPC) enzyme hydrolyses β-lactam antibiotics including the carbapenems. KPC has been detected worldwide in Enterobacteriaceae and Pseudomonas aeruginosa isolates associated with transposon Tn4401 commonly located in plasmids. Acinetobacter baumannii has become an important multidrug-resistant nosocomial pathogen. KPC-producing A. baumannii has been reported to date only in Puerto Rico. The objective of this study was to determine the whole genomic sequence of a KPC-producing A. baumannii in order to (i) define its allelic diversity, (ii) identify the location and genetic environment of the blaKPC and (iii) detect additional mechanisms of antimicrobial resistance. Next-generation sequencing, Southern blot, PFGE, multilocus sequence typing and bioinformatics analysis were performed. The organism was assigned to the international ST2 clone. The blaKPC-2 was identified on a novel truncated version of Tn4401e (tentatively named Tn4401h), located in the chromosome within an IncA/C plasmid fragment derived from an Enterobacteriaceae, probably owing to insertion sequence IS26. A chromosomally located truncated Tn1 transposon harbouring a blaTEM-1 was found in a novel genetic environment within an antimicrobial resistance cluster. Additional resistance mechanisms included efflux pumps, non-β-lactam antibiotic inactivating enzymes within and outside a resistance island, two class 1 integrons, In439 and the novel In1252, as well as mutations in the topoisomerase and DNA gyrase genes which confer resistance to quinolones. The presence of the blaKPC in an already globally disseminated A. baumannii ST2 presents a serious threat of further dissemination. PMID:27259867
Cell and molecular biology of SAE, a cell line from the spiny dogfish shark, Squalus acanthias.

PubMed

Parton, Angela; Forest, David; Kobayashi, Hiroshi; Dowell, Lori; Bayne, Christopher; Barnes, David

2007-02-01

Cartilaginous fish, primarily sharks, rays and skates (elasmobranchs), appeared 450 million years ago. They are the most primitive vertebrates, exhibiting jaws and teeth, adaptive immunity, a pressurized circulatory system, thymus, spleen, and a liver comparable to that of humans. The most used elasmobranch in biomedical research is the spiny dogfish shark, Squalus acanthias. Comparative genomic analysis of the dogfish shark, the little skate (Leucoraja erincea), and other elasmobranchs have yielded insights into conserved functional domains of genes associated with human liver function, multidrug resistance, cystic fibrosis, and other biomedically relevant processes. While genomic information from these animals is informative in an evolutionary framework, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. We have derived the first multipassage, continuously proliferating cell line of a cartilaginous fish. The line was initiated from embryos of the spiny dogfish shark. The cells were maintained in a medium modified for fish species and supplemented with cell type-specific hormones, other proteins and sera, and plated on a collagen substrate. SAE cells have been cultured continuously for three years. These cells can be transfected by plasmids and have been cryopreserved. Expressed Sequence Tags generated from a normalized SAE cDNA library included a number of markers for cartilage and muscle, as well as proteins influencing tissue differentiation and development, suggesting that SAE cells may be of mesenchymal stem cell origin. Examination of SAE EST sequences also revealed a cartilaginous fish-specific repetitive sequence that may be evidence of an ancient mobile genetic element that most likely was introduced into the cartilaginous fish lineage after divergence from the lineage leading to teleosts.
Tracing the origin of disseminated tumor cells in breast cancer using single-cell sequencing.

PubMed

Demeulemeester, Jonas; Kumar, Parveen; Møller, Elen K; Nord, Silje; Wedge, David C; Peterson, April; Mathiesen, Randi R; Fjelldal, Renathe; Zamani Esteki, Masoud; Theunis, Koen; Fernandez Gallardo, Elia; Grundstad, A Jason; Borgen, Elin; Baumbusch, Lars O; Børresen-Dale, Anne-Lise; White, Kevin P; Kristensen, Vessela N; Van Loo, Peter; Voet, Thierry; Naume, Bjørn

2016-12-09

Single-cell micro-metastases of solid tumors often occur in the bone marrow. These disseminated tumor cells (DTCs) may resist therapy and lay dormant or progress to cause overt bone and visceral metastases. The molecular nature of DTCs remains elusive, as well as when and from where in the tumor they originate. Here, we apply single-cell sequencing to identify and trace the origin of DTCs in breast cancer. We sequence the genomes of 63 single cells isolated from six non-metastatic breast cancer patients. By comparing the cells' DNA copy number aberration (CNA) landscapes with those of the primary tumors and lymph node metastasis, we establish that 53% of the single cells morphologically classified as tumor cells are DTCs disseminating from the observed tumor. The remaining cells represent either non-aberrant "normal" cells or "aberrant cells of unknown origin" that have CNA landscapes discordant from the tumor. Further analyses suggest that the prevalence of aberrant cells of unknown origin is age-dependent and that at least a subset is hematopoietic in origin. Evolutionary reconstruction analysis of bulk tumor and DTC genomes enables ordering of CNA events in molecular pseudo-time and traced the origin of the DTCs to either the main tumor clone, primary tumor subclones, or subclones in an axillary lymph node metastasis. Single-cell sequencing of bone marrow epithelial-like cells, in parallel with intra-tumor genetic heterogeneity profiling from bulk DNA, is a powerful approach to identify and study DTCs, yielding insight into metastatic processes. A heterogeneous population of CNA-positive cells is present in the bone marrow of non-metastatic breast cancer patients, only part of which are derived from the observed tumor lineages.
Deformation along the western Indian plate boundary: new constraints from differential and multi-aperture InSAR data inversion for the 2008, Baluchistan (Western Pakistan) seismic sequence.

NASA Astrophysics Data System (ADS)

Pezzo, Giuseppe; Merryman Boncori, John Peter; Atzori, Simone; Antonioli, Andrea; Salvi, Stefano

2014-05-01

We use Synthetic Aperture Radar Differential Interferometry (DInSAR) and Multi-Aperture Interferometry (MAI) to constrain the sources of the three largest events of the 2008 Baluchistan (western Pakistan) seismic sequence, namely two Mw 6.4 events only 12 hours apart and an Mw 5.7event occurred 40 days later. The sequence took place in the Quetta Syntaxis, the most seismically active region of Baluchistan, tectonically located between the colliding Indian Plate and the Afghan block of the Eurasian Plate. Elastic dislocation modelling of the surface displacements, derived from ascending and descending ENVISAT ASAR acquisitions, yields slip distributions with peak values of 80 cm and 70 cm for the two main events on a pair of strike-slip near-vertical faults, and values up to 50 cm for the largest aftershock on a NE-SW strike-slip fault. The MAI measurements, with their high sensitivity to the north-south motion component, are crucial in this area to resolve the fault plane ambiguity of moment tensors. We also studied the relationships between the largest earthquakes of the sequence by means of the Coulomb Failure Function to verify the agreement of our source modelling with the stress variations induced by the October 28 earthquake on the October 29 fault plane, and the stress variations induced by the two mainshocks on the December 09 fault plane. Our results provide insight into the deformation style of the Quetta Syntaxis, suggesting that right-lateral slip released at intermediate depths on large NW fault planes is compatible with contemporaneous left-lateral activation on NE-SW minor faults at shallower depths, in agreement with a bookshelf deformation mechanism.
Analysis of SINE and LINE repeat content of Y chromosomes in the platypus, Ornithorhynchus anatinus.

PubMed

Kortschak, R Daniel; Tsend-Ayush, Enkhjargal; Grützner, Frank

2009-01-01

Monotremes feature an extraordinary sex-chromosome system that consists of five X and five Y chromosomes in males. These sex chromosomes share homology with bird sex chromosomes but no homology with the therian X. The genome of a female platypus was recently completed, providing unique insights into sequence and gene content of autosomes and X chromosomes, but no Y-specific sequence has so far been analysed. Here we report the isolation, sequencing and analysis of approximately 700 kb of sequence of the non-recombining regions of Y2, Y3 and Y5, which revealed differences in base composition and repeat content between autosomes and sex chromosomes, and within the sex chromosomes themselves. This provides the first insights into repeat content of Y chromosomes in platypus, which overall show similar patterns of repeat composition to Y chromosomes in other species. Interestingly, we also observed differences between the various Y chromosomes, and in combination with timing and activity patterns we provide an approach that can be used to examine the evolutionary history of the platypus sex-chromosome chain.
A Survey of Protein Structures from Archaeal Viruses

PubMed Central

Dellas, Nikki; Lawrence, C. Martin; Young, Mark J.

2013-01-01

Viruses that infect the third domain of life, Archaea, are a newly emerging field of interest. To date, all characterized archaeal viruses infect archaea that thrive in extreme conditions, such as halophilic, hyperthermophilic, and methanogenic environments. Viruses in general, especially those replicating in extreme environments, contain highly mosaic genomes with open reading frames (ORFs) whose sequences are often dissimilar to all other known ORFs. It has been estimated that approximately 85% of virally encoded ORFs do not match known sequences in the nucleic acid databases, and this percentage is even higher for archaeal viruses (typically 90%–100%). This statistic suggests that either virus genomes represent a larger segment of sequence space and/or that viruses encode genes of novel fold and/or function. Because the overall three-dimensional fold of a protein evolves more slowly than its sequence, efforts have been geared toward structural characterization of proteins encoded by archaeal viruses in order to gain insight into their potential functions. In this short review, we provide multiple examples where structural characterization of archaeal viral proteins has indeed provided significant functional and evolutionary insight. PMID:25371334
Optimization of parameter values for complex pulse sequences by simulated annealing: application to 3D MP-RAGE imaging of the brain.

PubMed

Epstein, F H; Mugler, J P; Brookeman, J R

1994-02-01

A number of pulse sequence techniques, including magnetization-prepared gradient echo (MP-GRE), segmented GRE, and hybrid RARE, employ a relatively large number of variable pulse sequence parameters and acquire the image data during a transient signal evolution. These sequences have recently been proposed and/or used for clinical applications in the brain, spine, liver, and coronary arteries. Thus, the need for a method of deriving optimal pulse sequence parameter values for this class of sequences now exists. Due to the complexity of these sequences, conventional optimization approaches, such as applying differential calculus to signal difference equations, are inadequate. We have developed a general framework for adapting the simulated annealing algorithm to pulse sequence parameter value optimization, and applied this framework to the specific case of optimizing the white matter-gray matter signal difference for a T1-weighted variable flip angle 3D MP-RAGE sequence. Using our algorithm, the values of 35 sequence parameters, including the magnetization-preparation RF pulse flip angle and delay time, 32 flip angles in the variable flip angle gradient-echo acquisition sequence, and the magnetization recovery time, were derived. Optimized 3D MP-RAGE achieved up to a 130% increase in white matter-gray matter signal difference compared with optimized 3D RF-spoiled FLASH with the same total acquisition time. The simulated annealing approach was effective at deriving optimal parameter values for a specific 3D MP-RAGE imaging objective, and may be useful for other imaging objectives and sequences in this general class.
The testes transcriptome derived from the New World Screwworm, Cochliomyia hominivorax SRA

USDA-ARS?s Scientific Manuscript database

In a collaboration with National Center for Genome Resources researchers, we sequenced and assembled the testes transcriptome derived from the Pacora, Panama, production plant strain J06 of the New World Screwworm, Cochliomyia hominivorax. This sequencing project produced 72,750,822 raw reads and th...
Genetic and transcriptomic analyses provide new insights on the early antiviral response to VHSV in resistant and susceptible rainbow trout.

PubMed

Verrier, Eloi R; Genet, Carine; Laloë, Denis; Jaffrezic, Florence; Rau, Andrea; Esquerre, Diane; Dechamp, Nicolas; Ciobotaru, Céline; Hervet, Caroline; Krieg, Francine; Jouneau, Luc; Klopp, Christophe; Quillet, Edwige; Boudinot, Pierre

2018-06-19

The viral hemorrhagic septicemia virus (VHSV) is a major threat for salmonid farming and for wild fish populations worldwide. Previous studies have highlighted the importance of innate factors regulated by a major quantitative trait locus (QTL) for the natural resistance to waterborne VHSV infection in rainbow trout. The aim of this study was to analyze the early transcriptomic response to VHSV inoculation in cell lines derived from previously described resistant and susceptible homozygous isogenic lines of rainbow trout to obtain insights into the molecular mechanisms responsible for the resistance to the viral infection. We first confirmed the presence of the major QTL in a backcross involving a highly resistant fish isogenic line (B57) and a highly susceptible one (A22), and were able to define the confidence interval of the QTL and to identify its precise position. We extended the definition of the QTL since it controls not only resistance to waterborne infection but also the kinetics of mortality after intra-peritoneal injection. Deep sequencing of the transcriptome of B57 and A22 derived cell lines exposed to inactivated VHSV showed a stronger response to virus inoculation in the resistant background. In line with our previous observations, an early and strong induction of interferon and interferon-stimulated genes was correlated with the resistance to VHSV, highlighting the major role of innate immune factors in natural trout resistance to the virus. Interestingly, major factors of the antiviral innate immunity were much more expressed in naive B57 cells compared to naive A22 cells, which likely contributes to the ability of B57 to mount a fast antiviral response after viral infection. These observations were further extended by the identification of several innate immune-related genes localized close to the QTL area on the rainbow trout genome. Taken together, our results improve our knowledge in virus-host interactions in vertebrates and provide novel insights in the molecular mechanisms explaining the resistance to VHSV in rainbow trout. Our data also provide a collection of potential markers for resistance and susceptibility of rainbow trout to VHSV infection.

Genomic insights from whole genome sequencing of four clonal outbreak Campylobacter jejuni assessed within the global C. jejuni population.

PubMed

Clark, Clifford G; Berry, Chrystal; Walker, Matthew; Petkau, Aaron; Barker, Dillon O R; Guan, Cai; Reimer, Aleisha; Taboada, Eduardo N

2016-12-03

Whole genome sequencing (WGS) is useful for determining clusters of human cases, investigating outbreaks, and defining the population genetics of bacteria. It also provides information about other aspects of bacterial biology, including classical typing results, virulence, and adaptive strategies of the organism. Cell culture invasion and protein expression patterns of four related multilocus sequence type 21 (ST21) C. jejuni isolates from a significant Canadian water-borne outbreak were previously associated with the presence of a CJIE1 prophage. Whole genome sequencing was used to examine the genetic diversity among these isolates and confirm that previous observations could be attributed to differential prophage carriage. Moreover, we sought to determine the presence of genome sequences that could be used as surrogate markers to delineate outbreak-associated isolates. Differential carriage of the CJIE1 prophage was identified as the major genetic difference among the four outbreak isolates. High quality single-nucleotide variant (hqSNV) and core genome multilocus sequence typing (cgMLST) clustered these isolates within expanded datasets consisting of additional C. jejuni strains. The number and location of homopolymeric tract regions was identical in all four outbreak isolates but differed from all other C. jejuni examined. Comparative genomics and PCR amplification enabled the identification of large chromosomal inversions of approximately 93 kb and 388 kb within the outbreak isolates associated with transducer-like proteins containing long nucleotide repeat sequences. The 93-kb inversion was characteristic of the outbreak-associated isolates, and the gene content of this inverted region displayed high synteny with the reference strain. The four outbreak isolates were clonally derived and differed mainly in the presence of the CJIE1 prophage, validating earlier findings linking the prophage to phenotypic differences in virulence assays and protein expression. The identification of large, genetically syntenous chromosomal inversions in the genomes of outbreak-associated isolates provided a unique method for discriminating outbreak isolates from the background population. Transducer-like proteins appear to be associated with the chromosomal inversions. CgMLST and hqSNV analysis also effectively delineated the outbreak isolates within the larger C. jejuni population structure.
Profiling of short RNAs during fleshy fruit development reveals stage-specific sRNAome expression patterns.

PubMed

Mohorianu, Irina; Schwach, Frank; Jing, Runchun; Lopez-Gomollon, Sara; Moxon, Simon; Szittya, Gyorgy; Sorefan, Karim; Moulton, Vincent; Dalmay, Tamas

2011-07-01

Plants feature a particularly diverse population of short (s)RNAs, the central component of all RNA silencing pathways. Next generation sequencing techniques enable deeper insights into this complex and highly conserved mechanism and allow identification and quantification of sRNAs. We employed deep sequencing to monitor the sRNAome of developing tomato fruits covering the period between closed flowers and ripened fruits by profiling sRNAs at 10 time-points. It is known that microRNAs (miRNAs) play an important role in development but very little information is available about the majority of sRNAs that are not miRNAs. Here we show distinctive patterns of sRNA expression that often coincide with stages of the developmental process such as flowering, early and late fruit maturation. Moreover, thousands of non-miRNA sRNAs are differentially expressed during fruit development and ripening. Some of these differentially expressed sRNAs derived from transposons but many derive from protein coding genes or regions that show homology to protein coding genes, several of which are known to play a role in flower and fruit development. These findings raise the possibility of a regulative role of these sRNAs during fruit onset and maturation in a crop species. We also identified six new miRNAs and experimentally validated two target mRNAs. These two mRNAs are targeted by the same miRNA but do not belong to the same gene family, which is rare for plant miRNAs. Expression pattern and putative function of these targets indicate a possible role in glutamate accumulation, which contributes to establishing the taste of the fruit. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Attenuation of an original US porcine epidemic diarrhea virus strain PC22A via serial cell culture passage.

PubMed

Lin, Chun-Ming; Hou, Yixuan; Marthaler, Douglas G; Gao, Xiang; Liu, Xinsheng; Zheng, Lanlan; Saif, Linda J; Wang, Qiuhong

2017-03-01

Although porcine epidemic diarrhea (PED) has caused huge economic losses in the pork industry worldwide, an effective live, attenuated vaccine is lacking. In this study, an original US, highly virulent PED virus (PEDV) strain PC22A was serially passaged in Vero CCL81 and Vero BI cells. The virus growth kinetics in cell culture, virulence in neonatal pigs and the whole genomic sequences of selected passages were examined. Increased virus titers and sizes of syncytia were observed at the 65th passage level (P65) and P120, respectively. Based on the severity of clinical signs, histopathological lesions and the distribution of PEDV antigens in the gut, the virulence of P100 and above, but not P95C13 (CCL81), was markedly reduced in 4-day-old, caesarian-derived, colostrum-deprived piglets. Subsequently, the attenuation of P120 and P160 was confirmed in 4-day-old, conventional suckling piglets. Compared with P120, P160 replicated less efficiently in the intestine of pigs and induced a lower rate of protection after challenge. Sequence analysis revealed that the virulent viruses [P3 and P95C13 (CCL81)] had one, one, sixteen (including an early termination of nine amino acids) and two amino acid differences in non-structure protein 1 (nsp1), nsp4, spike and membrane proteins, respectively, from the fully attenuated P160. However, the overall pattern of attenuation-related genetic changes in PC22A differed from those of the other four pairs of PEDV wild type strains and their attenuated derivatives. These results suggest that PEDV attenuation can occur through multiple molecular mechanisms. The knowledge provides insights into potential molecular mechanisms of PEDV attenuation. Copyright © 2017 Elsevier B.V. All rights reserved.
The effects of micronutrient deficiencies on bacterial species from the human gut microbiota.

PubMed

Hibberd, Matthew C; Wu, Meng; Rodionov, Dmitry A; Li, Xiaoqing; Cheng, Jiye; Griffin, Nicholas W; Barratt, Michael J; Giannone, Richard J; Hettich, Robert L; Osterman, Andrei L; Gordon, Jeffrey I

2017-05-17

Vitamin and mineral (micronutrient) deficiencies afflict 2 billion people. Although the impact of these imbalances on host biology has been studied extensively, much less is known about their effects on the gut microbiota of developing or adult humans. Therefore, we established a community of cultured, sequenced human gut-derived bacterial species in gnotobiotic mice and fed the animals a defined micronutrient-sufficient diet, followed by a derivative diet devoid of vitamin A, folate, iron, or zinc, followed by return to the sufficient diet. Acute vitamin A deficiency had the largest effect on bacterial community structure and metatranscriptome, with Bacteroides vulgatus, a prominent responder, increasing its abundance in the absence of vitamin A. Applying retinol selection to a library of 30,300 B. vulgatus transposon mutants revealed that disruption of acrR abrogated retinol sensitivity. Genetic complementation studies, microbial RNA sequencing, and transcription factor-binding assays disclosed that AcrR is a repressor of an adjacent AcrAB-TolC efflux system. Retinol efflux measurements in wild-type and acrR -mutant strains plus treatment with a pharmacologic inhibitor of the efflux system revealed that AcrAB-TolC is a determinant of retinol and bile acid sensitivity in B. vulgatus Acute vitamin A deficiency was associated with altered bile acid metabolism in vivo, raising the possibility that retinol, bile acid metabolites, and AcrAB-TolC interact to influence the fitness of B. vulgatus and perhaps other microbiota members. This type of preclinical model can help to develop mechanistic insights about the effects of, and more effective treatment strategies for micronutrient deficiencies. Copyright © 2017, American Association for the Advancement of Science.
Associations among dietary non-fiber carbohydrate, ruminal microbiota and epithelium G-protein-coupled receptor, and histone deacetylase regulations in goats.

PubMed

Shen, Hong; Lu, Zhongyan; Xu, Zhihui; Chen, Zhan; Shen, Zanming

2017-09-19

Diet-derived short-chain fatty acids (SCFAs) in the rumen have broad effects on the health and growth of ruminants. The microbe-G-protein-coupled receptor (GPR) and microbe-histone deacetylase (HDAC) axes might be the major pathway mediating these effects. Here, an integrated approach of transcriptome sequencing and 16S rRNA gene sequencing was applied to investigate the synergetic responses of rumen epithelium and rumen microbiota to the increased intake of dietary non-fiber carbohydrate (NFC) from 15 to 30% in the goat model. In addition to the analysis of the microbial composition and identification of the genes and signaling pathways related to the differentially expressed GPRs and HDACs, the combined data including the expression of HDACs and GPRs, the relative abundance of the bacteria, and the molar proportions of the individual SCFAs were used to identify the significant co-variation of the SCFAs, clades, and transcripts. The major bacterial clades promoted by the 30% NFC diet were related to lactate metabolism and cellulose degradation in the rumen. The predominant functions of the GPR and HDAC regulation network, under the 30% NFC diet, were related to the maintenance of epithelium integrity and the promotion of animal growth. In addition, the molar proportion of butyrate was inversely correlated with the expression of HDAC1, and the relative abundance of the bacteria belonging to Clostridum_IV was positively correlated with the expression of GPR1. This study revealed that the effects of rumen microbiota-derived SCFA on epithelium growth and metabolism were mediated by the GPR and HDAC regulation network. An understanding of these mechanisms and their relationships to dietary components provides better insights into the modulation of ruminal fermentation and metabolism in the promotion of livestock production.
Phylogenetic Diversity and Metabolic Potential Revealed in a Glacier Ice Metagenome▿ †

PubMed Central

Simon, Carola; Wiezer, Arnim; Strittmatter, Axel W.; Daniel, Rolf

2009-01-01

The largest part of the Earth's microbial biomass is stored in cold environments, which represent almost untapped reservoirs of novel species, processes, and genes. In this study, the first metagenomic survey of the metabolic potential and phylogenetic diversity of a microbial assemblage present in glacial ice is presented. DNA was isolated from glacial ice of the Northern Schneeferner, Germany. Pyrosequencing of this DNA yielded 1,076,539 reads (239.7 Mbp). The phylogenetic composition of the prokaryotic community was assessed by evaluation of a pyrosequencing-derived data set and sequencing of 16S rRNA genes. The Proteobacteria (mainly Betaproteobacteria), Bacteroidetes, and Actinobacteria were the predominant phylogenetic groups. In addition, isolation of psychrophilic microorganisms was performed, and 13 different bacterial isolates were recovered. Analysis of the 16S rRNA gene sequences of the isolates revealed that all were affiliated to the predominant groups. As expected for microorganisms residing in a low-nutrient environment, a high metabolic versatility with respect to degradation of organic substrates was detected by analysis of the pyrosequencing-derived data set. The presence of autotrophic microorganisms was indicated by identification of genes typical for different ways of carbon fixation. In accordance with the results of the phylogenetic studies, in which mainly aerobic and facultative aerobic bacteria were detected, genes typical for central metabolism of aerobes were found. Nevertheless, the capability of growth under anaerobic conditions was indicated by genes involved in dissimilatory nitrate/nitrite reduction. Numerous characteristics for metabolic adaptations associated with a psychrophilic lifestyle, such as formation of cryoprotectants and maintenance of membrane fluidity by the incorporation of unsaturated fatty acids, were detected. Thus, analysis of the glacial metagenome provided insights into the microbial life in frozen habitats on Earth, thereby possibly shedding light onto microbial life in analogous extraterrestrial environments. PMID:19801459
Metagenome-Assembled Genome Sequences of Acetobacterium sp. Strain MES1 and Desulfovibrio sp. Strain MES5 from a Cathode-Associated Acetogenic Microbial Community.

PubMed

Ross, Daniel E; Marshall, Christopher W; May, Harold D; Norman, R Sean

2017-09-07

Draft genome sequences of Acetobacterium sp. strain MES1 and Desulfovibrio sp. strain MES5 were obtained from the metagenome of a cathode-associated community enriched within a microbial electrosynthesis system (MES). The draft genome sequences provide insight into the functional potential of these microorganisms within an MES and a foundation for future comparative analyses. Copyright © 2017 Ross et al.
Complete Genome Sequence of Bifidobacterium bifidum S17▿

PubMed Central

Zhurina, Daria; Zomer, Aldert; Gleinser, Marita; Brancaccio, Vincenco Francesco; Auchter, Marc; Waidmann, Mark S.; Westermann, Christina; van Sinderen, Douwe; Riedel, Christian U.

2011-01-01

Here, we report on the first completely annotated genome sequence of a Bifidobacterium bifidum strain. B. bifidum S17, isolated from feces of a breast-fed infant, was shown to strongly adhere to intestinal epithelial cells and has potent anti-inflammatory activity in vitro and in vivo. The genome sequence will provide new insights into the biology of this potential probiotic organism and allow for the characterization of the molecular mechanisms underlying its beneficial properties. PMID:21037011
Genome-based classification of micromonosporae with a focus on their biotechnological and ecological potential.

PubMed

Carro, Lorena; Nouioui, Imen; Sangal, Vartul; Meier-Kolthoff, Jan P; Trujillo, Martha E; Montero-Calasanz, Maria Del Carmen; Sahin, Nevzat; Smith, Darren Lee; Kim, Kristi E; Peluso, Paul; Deshpande, Shweta; Woyke, Tanja; Shapiro, Nicole; Kyrpides, Nikos C; Klenk, Hans-Peter; Göker, Markus; Goodfellow, Michael

2018-01-11

There is a need to clarify relationships within the actinobacterial genus Micromonospora, the type genus of the family Micromonosporaceae, given its biotechnological and ecological importance. Here, draft genomes of 40 Micromonospora type strains and two non-type strains are made available through the Genomic Encyclopedia of Bacteria and Archaea project and used to generate a phylogenomic tree which showed they could be assigned to well supported phyletic lines that were not evident in corresponding trees based on single and concatenated sequences of conserved genes. DNA G+C ratios derived from genome sequences showed that corresponding data from species descriptions were imprecise. Emended descriptions include precise base composition data and approximate genome sizes of the type strains. antiSMASH analyses of the draft genomes show that micromonosporae have a previously unrealised potential to synthesize novel specialized metabolites. Close to one thousand biosynthetic gene clusters were detected, including NRPS, PKS, terpenes and siderophores clusters that were discontinuously distributed thereby opening up the prospect of prioritising gifted strains for natural product discovery. The distribution of key stress related genes provide an insight into how micromonosporae adapt to key environmental variables. Genes associated with plant interactions highlight the potential use of micromonosporae in agriculture and biotechnology.
Sequence diversity of NanA manifests in distinct enzyme kinetics and inhibitor susceptibility

NASA Astrophysics Data System (ADS)

Xu, Zhongli; von Grafenstein, Susanne; Walther, Elisabeth; Fuchs, Julian E.; Liedl, Klaus R.; Sauerbrei, Andreas; Schmidtke, Michaela

2016-04-01

Streptococcus pneumoniae is the leading pathogen causing bacterial pneumonia and meningitis. Its surface-associated virulence factor neuraminidase A (NanA) promotes the bacterial colonization by removing the terminal sialyl residues from glycoconjugates on eukaryotic cell surface. The predominant role of NanA in the pathogenesis of pneumococci renders it an attractive target for therapeutic intervention. Despite the highly conserved activity of NanA, our alignment of the 11 NanAs revealed the evolutionary diversity of this enzyme. The amino acid substitutions we identified, particularly those in the lectin domain and in the insertion domain next to the catalytic centre triggered our special interest. We synthesised the representative NanAs and the mutagenized derivatives from E. coli for enzyme kinetics study and neuraminidase inhibitor susceptibility test. Via molecular docking we got a deeper insight into the differences between the two major variants of NanA and their influence on the ligand-target interactions. In addition, our molecular dynamics simulations revealed a prominent intrinsic flexibility of the linker between the active site and the insertion domain, which influences the inhibitor binding. Our findings for the first time associated the primary sequence diversity of NanA with the biochemical properties of the enzyme and with the inhibitory efficiency of neuraminidase inhibitors.
Effects of dietary supplementation of Ulva pertusa and non-starch polysaccharide enzymes on gut microbiota of Siganus canaliculatus

NASA Astrophysics Data System (ADS)

Zhang, Xinxu; Wu, Huijuan; Li, Zhongzhen; Li, Yuanyou; Wang, Shuqi; Zhu, Dashi; Wen, Xiaobo; Li, Shengkang

2018-03-01

Fishes represent the highest diversity of vertebrates; however, our understanding of the compositions and functions of their gut microbiota is limited. In this study, we provided the first insight into the gut microbiota of the herbivorous fish Siganus canaliculatus by using three molecular ecology techniques based on the 16S rRNA genes (denaturing gradient gel electrophoresis, clone library construction, and highthroughput Illumina sequencing), and the Illumina sequencing technique is suggested here due to its higher overall coverage of the total 16S rRNA genes. A core gut microbiota of 29 bacterial groups, covering >99.9% of the total bacterial community, was found to be dominated by Proteobacteria and Firmicutes in fish fed three different diets with/without the supplementation of Ulva pertusa and non-starch polysaccharide (NSP) enzymes (cellulase, xylanase, and β-glucanase). Diverse potential NSP-degrading bacteria and probiotics (e.g., Ruminococcus, Clostridium and Lachnospiraceae) were detected in the intestine of the fish fed U. pertusa, suggesting that these microorganisms likely participated in the degradation of NSPs derived from U. pertusa. This study supports our previous conclusion that U. pertusa-based diets are suitable for the production of S. canaliculatus with lower costs without compromising quality.
Effects of dietary supplementation of Ulva pertusa and non-starch polysaccharide enzymes on gut microbiota of Siganus canaliculatus

NASA Astrophysics Data System (ADS)

Zhang, Xinxu; Wu, Huijuan; Li, Zhongzhen; Li, Yuanyou; Wang, Shuqi; Zhu, Dashi; Wen, Xiaobo; Li, Shengkang

2017-05-01

Fishes represent the highest diversity of vertebrates; however, our understanding of the compositions and functions of their gut microbiota is limited. In this study, we provided the first insight into the gut microbiota of the herbivorous fish Siganus canaliculatus (S. canaliculatus) by using three molecular ecology techniques based on the 16S rRNA genes (denaturing gradient gel electrophoresis, clone library construction, and high-throughput Illumina sequencing), and the Illumina sequencing technique is suggested here due to its higher overall coverage of the total 16S rRNA genes. A core gut microbiota of 29 bacterial groups, covering >99.9% of the total bacterial community, was found to be dominated by Proteobacteria and Firmicutes in fish fed three different diets with/without the supplementation of Ulva pertusa (U. pertusa) and non-starch polysaccharide (NSP) enzymes (cellulase, xylanase, and β-glucanase). Diverse potential NSP-degrading bacteria and probiotics (e.g., Ruminococcus, Clostridium and Lachnospiraceae) were detected in the intestine of the fish fed U. pertusa, suggesting that these microorganisms likely participated in the degradation of NSPs derived from U. pertusa. This study supports our previous conclusion that U. pertusa-based diets are suitable for the production of S. canaliculatus with lower costs without compromising quality.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Shih-Ting; Lin, Yiyang; Spencer, Ryan K.

Determining the structural origins of amyloid fibrillation is essential for understanding both the pathology of amyloidosis and the rational design of inhibitors to prevent or reverse amyloid formation. In this work, the decisive roles of peptide structures on amyloid self-assembly and morphological diversity were investigated by the design of eight amyloidogenic peptides derived from islet amyloid polypeptide. Among the segments, two distinct morphologies were highlighted in the form of twisted and planar (untwisted) ribbons with varied diameters, thicknesses, and lengths. In particular, transformation of amyloid fibrils from twisted ribbons into untwisted structures was triggered by substitution of the C-terminal serinemore » with threonine, where the side chain methyl group was responsible for the distinct morphological change. This effect was confirmed following serine substitution with alanine and valine and was ascribed to the restriction of intersheet torsional strain through the increased hydrophobic interactions and hydrogen bonding. We also studied the variation of fibril morphology (i.e., association and helicity) and peptide aggregation propensity by increasing the hydrophobicity of the peptide side group, capping the N-terminus, and extending sequence length. Lastly, we anticipate that our insights into sequence-dependent fibrillation and morphological diversity will shed light on the structural interpretation of amyloidogenesis and development of structure-specific imaging agents and aggregation inhibitors.« less
Exploring the functional side of the Ocean Sampling Day metagenomes

NASA Astrophysics Data System (ADS)

Antonio, F. G.; Kottmann, R.; Wallom, D.; Glöckner, F. O.

2016-02-01

The Ocean Sampling Day (OSD) is a simultaneous, collaborative, standardized, and global mega-sequencing campaign to analyze marine microbial community composition and functional traits. 150 metagenomes were sequenced from the first OSD in June 2014 including a rich set of environmental and oceanographic measurements. Unlike other ocean mega-surveys such as Global Ocean Sampling (GOS) or the TARA expedition that mostly sampled open ocean waters most of the OSD samples are from coastal sampling sites, an area not previously well studied in this regard. The result is that OSD adds more than three million new genes to the recently published Ocean Microbial-Reference Gene Catalog (Sunawaga et al., 2015). This allows us to significantly increase our knowledge of the ocean microbiome, identify hot-spots of novelty in terms of function and investigate the impact of human activities on oceans coastal areas where there is the largest interaction between dense human populations and the marine world. Additionally, these cumulative samples, related in time, space and environmental parameters, are providing insights into fundamental rules describing microbial diversity and function and contribute to the blue economy through the identification of novel ocean-derived biotechnologies. References: Sunagawa, Coelho, Chaffron, et al. (2015, May). Structure and function of the global ocean microbiome. Science, 348(6237), 126135.
Advances in Understanding Stimulus Responsive Phase Behavior of Intrinsically Disordered Protein Polymers.

PubMed

Ruff, Kiersten M; Roberts, Stefan; Chilkoti, Ashutosh; Pappu, Rohit V

2018-06-24

Proteins and synthetic polymers can undergo phase transitions in response to changes to intensive solution parameters such as temperature, proton chemical potentials (pH), and hydrostatic pressure. For proteins and protein-based polymers, the information required for stimulus responsive phase transitions is encoded in their amino acid sequence. Here, we review some of the key physical principles that govern the phase transitions of archetypal intrinsically disordered protein polymers (IDPPs). These are disordered proteins with highly repetitive amino acid sequences. Advances in recombinant technologies have enabled the design and synthesis of protein sequences of a variety of sequence complexities and lengths. We summarize insights that have been gleaned from the design and characterization of IDPPs that undergo thermo-responsive phase transitions and build on these insights to present a general framework for IDPPs with pH and pressure responsive phase behavior. In doing so, we connect the stimulus responsive phase behavior of IDPPs with repetitive sequences to the coil-to-globule transitions that these sequences undergo at the single chain level in response to changes in stimuli. The proposed framework and ongoing studies of stimulus responsive phase behavior of designed IDPPs have direct implications in bioengineering, where designing sequences with bespoke material properties broadens the spectrum of applications, and in biology and medicine for understanding the sequence-specific driving forces for the formation of protein-based membraneless organelles as well as biological matrices that act as scaffolds for cells and mediators of cell-to-cell communication. Copyright © 2018. Published by Elsevier Ltd.
Analytical Insights on the Position, Challenges, and Potential for Promoting OER in ODeL Institutions in Africa

ERIC Educational Resources Information Center

Muganda, Cornelia K.; Samzugi, Athuman S.; Mallinson, Brenda J.

2016-01-01

This paper shares analytical insights on the position, challenges and potential for promoting Open Educational Resources (OER) in African Open Distance and eLearning (ODeL) institutions. The researchers sought to use a participatory research approach as described by Krishnaswamy (2004), in convening a sequence of two workshops at the Open…
Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents. Genomics Insights

USDA-ARS?s Scientific Manuscript database

High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement; high-density linkage maps are especially important in paleopolyploids with exce...
Exploiting rice-sorghum synteny for targeted development of EST-SSRs to enrich the sorghum genetic linkage map.

PubMed

Ramu, P; Kassahun, B; Senthilvel, S; Ashok Kumar, C; Jayashree, B; Folkertsma, R T; Reddy, L Ananda; Kuruvinashetti, M S; Haussmann, B I G; Hash, C T

2009-11-01

The sequencing and detailed comparative functional analysis of genomes of a number of select botanical models open new doors into comparative genomics among the angiosperms, with potential benefits for improvement of many orphan crops that feed large populations. In this study, a set of simple sequence repeat (SSR) markers was developed by mining the expressed sequence tag (EST) database of sorghum. Among the SSR-containing sequences, only those sharing considerable homology with rice genomic sequences across the lengths of the 12 rice chromosomes were selected. Thus, 600 SSR-containing sorghum EST sequences (50 homologous sequences on each of the 12 rice chromosomes) were selected, with the intention of providing coverage for corresponding homologous regions of the sorghum genome. Primer pairs were designed and polymorphism detection ability was assessed using parental pairs of two existing sorghum mapping populations. About 28% of these new markers detected polymorphism in this 4-entry panel. A subset of 55 polymorphic EST-derived SSR markers were mapped onto the existing skeleton map of a recombinant inbred population derived from cross N13 x E 36-1, which is segregating for Striga resistance and the stay-green component of terminal drought tolerance. These new EST-derived SSR markers mapped across all 10 sorghum linkage groups, mostly to regions expected based on prior knowledge of rice-sorghum synteny. The ESTs from which these markers were derived were then mapped in silico onto the aligned sorghum genome sequence, and 88% of the best hits corresponded to linkage-based positions. This study demonstrates the utility of comparative genomic information in targeted development of markers to fill gaps in linkage maps of related crop species for which sufficient genomic tools are not available.
Cloning and Sequencing of Defective Particles Derived from the Autonomous Parvovirus Minute Virus of Mice for the Construction of Vectors with Minimal cis-Acting Sequences

PubMed Central

Clément, Nathalie; Avalosse, Bernard; El Bakkouri, Karim; Velu, Thierry; Brandenburger, Annick

2001-01-01

The production of wild-type-free stocks of recombinant parvovirus minute virus of mice [MVM(p)] is difficult due to the presence of homologous sequences in vector and helper genomes that cannot easily be eliminated from the overlapping coding sequences. We have therefore cloned and sequenced spontaneously occurring defective particles of MVM(p) with very small genomes to identify the minimal cis-acting sequences required for DNA amplification and virus production. One of them has lost all capsid-coding sequences but is still able to replicate in permissive cells when nonstructural proteins are provided in trans by a helper plasmid. Vectors derived from this particle produce stocks with no detectable wild-type MVM after cotransfection with new, matched, helper plasmids that present no homology downstream from the transgene. PMID:11152501
Eukaryotic diversity in premise drinking water using 18S rDNA sequencing: implications for health risks

EPA Science Inventory

The goal of this study was to characterize microbial eukaryotes over a 12 month period, so as to provide insight into the occurrence of potentially important predators and bacterial hosts in hot and cold premise plumbing. Nearly 6,300 partial (600 bp) 18S rRNA gene sequences from...

Draft genome sequence of Streptomyces sp. strain SS, which produces a series of uridyl peptide antibiotic sansanmycins.

PubMed

Wang, Lifei; Xie, Yunying; Li, Qinglian; He, Ning; Yao, Entai; Xu, Hongzhang; Yu, Ying; Chen, Ruxian; Hong, Bin

2012-12-01

Streptomyces sp. SS produces a series of uridyl peptide antibiotic sansanmycins. Here, we present a draft genome sequence of Streptomyces sp. SS containing the biosynthetic gene cluster for the antibiotics. The identification of the biosynthetic gene cluster of sansanmycins may provide further insight into biosynthetic mechanisms for uridyl peptide antibiotics.
Some Principles Involved in the Acquisition of Number Words.

ERIC Educational Resources Information Center

Pollmann, Thijs

2003-01-01

Offers linguistic insights into number acquisition. Argues that the particular rhythmical structure of speech forms for numerical sequence provides children with the raw material to develop a concept "decade word. Children have to learn by rote a second sequence--decade numbers (10, 20, 20, etc). This is an important step in the detection of the…
Complete Genome Sequence and Updated Annotation of Desulfovibrio alaskensis G20

DOE PAGES

Hauser, Loren J.; Land, Miriam L.; Brown, Steven D.; ...

2011-06-17

Desulfovibrio alaskensis G20 (formerly desulfuricans G20) is a Gram-negative mesophilic sulfate-reducing bacterium (SRB), known to corrode ferrous metals and to reduce toxic radionuclides and metals such as uranium and chromium to sparingly soluble and less toxic forms. We present the 3.7 Mb genome sequence to provide insights into its physiology.
The carrot genome provides insights into crop origins and a foundation for future crop improvement

USDA-ARS?s Scientific Manuscript database

The sequencing of the carrot genome was an effort that formally began in 2012 and culminated with the publication and release of the genome in 2016. A full genome sequence provides the ultimate foundation to study genetics, gene function, and evolution of a species. The primary goal of the carrot ge...
Draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic species of Archaea that grows optimally at 105°C

DOE PAGES

Utturkar, Sagar M.; Huber, Harald; Leptihn, Sebastian; ...

2016-02-25

We report here the draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic archaeon. In addition, the genome provides insights into molecular and cellular adaptation mechanisms to life in extreme environments and the evolution of early organisms on Earth.
Draft Genome Sequence of the Spore-Forming Probiotic Strain Bacillus coagulans Unique IS-2

PubMed Central

Upadrasta, Aditya; Pitta, Swetha

2016-01-01

Bacillus coagulans Unique IS-2 is a potential spore-forming probiotic that is commercially available on the market. The draft genome sequence presented here provides deep insight into the beneficial features of this strain for its safe use as a probiotic for various human and animal health applications. PMID:27103709
Improving Students' Conceptual Understanding of a Specific Content Learning: A Designed Teaching Sequence

ERIC Educational Resources Information Center

Ahmad, N. J.; Lah, Y. Che

2012-01-01

The efficacy of a teaching sequence designed for a specific content of learning of electrochemistry is described in this paper. The design of the teaching draws upon theoretical insights into perspectives on learning and empirical studies to improve the teaching of this topic. A case study involving two classes, the experimental and baseline…
Cryptic breakpoint identified by whole-genome mate-pair sequencing in a rare paternally inherited complex chromosomal rearrangement.

PubMed

Aristidou, Constantia; Theodosiou, Athina; Ketoni, Andria; Bak, Mads; Mehrjouy, Mana M; Tommerup, Niels; Sismani, Carolina

2018-01-01

Precise characterization of apparently balanced complex chromosomal rearrangements in non-affected individuals is crucial as they may result in reproductive failure, recurrent miscarriages or affected offspring. We present a family, where the non-affected father and daughter were found, using FISH and karyotyping, to be carriers of a three-way complex chromosomal rearrangement [t(6;7;10)(q16.2;q34;q26.1), de novo in the father]. The family suffered from two stillbirths, one miscarriage, and has a son with severe intellectual disability. In the present study, the family was revisited using whole-genome mate-pair sequencing. Interestingly, whole-genome mate-pair sequencing revealed a cryptic breakpoint on derivative (der) chromosome 6 rendering the rearrangement even more complex. FISH using a chromosome (chr) 6 custom-designed probe and a chr10 control probe confirmed that the interstitial chr6 segment, created by the two chr6 breakpoints, was translocated onto der(10). Breakpoints were successfully validated with Sanger sequencing, and small imbalances as well as microhomology were identified. Finally, the complex chromosomal rearrangement breakpoints disrupted the SIM1 , GRIK2 , CNTNAP2 , and PTPRE genes without causing any phenotype development. In contrast to the majority of maternally transmitted complex chromosomal rearrangement cases, our study investigated a rare case where a complex chromosomal rearrangement, which most probably resulted from a Type IV hexavalent during the pachytene stage of meiosis I, was stably transmitted from a fertile father to his non-affected daughter. Whole-genome mate-pair sequencing proved highly successful in identifying cryptic complexity, which consequently provided further insight into the meiotic segregation of chromosomes and the increased reproductive risk in individuals carrying the specific complex chromosomal rearrangement. We propose that such complex rearrangements should be characterized in detail using a combination of conventional cytogenetic and NGS-based approaches to aid in better prenatal preimplantation genetic diagnosis and counseling in couples with reproductive problems.
Identification of plant genes regulated in resistant potato Solanum sparsipilum during the early stages of infection by Globodera pallida.

PubMed

Jolivet, Katell; Grenier, Eric; Bouchet, Jean-Paul; Esquibet, Magali; Kerlan, Marie-Claire; Caromel, Bernard; Mugniéry, Didier; Lefebvre, Véronique

2007-04-01

Using a complementary (c)DNA-amplified fragment length polymorphism (AFLP) approach, we investigated differential gene expression linked to resistance mechanisms during the incompatible potato - Globodera pallida interaction. Expression was compared between a resistant and a susceptible potato clone, inoculated or not inoculated with G. pallida. These clones were issued from a cross between the resistant Solanum sparsipilum spl329.18 accession and the susceptible dihaploid S. tuberosum Caspar H3, and carried, respectively, resistant and susceptible alleles at the resistance quantitative trait loci (QTLs). Analysis was done on root fragments picked up at 4 time points, during a period of 6 days after infection, from penetration of the nematode in the root to degradation of the feeding site in resistant plants. A total of 2560 transcript-derived fragments (TDFs) were analyzed, resulting in the detection of 46 TDFs that were up- or downregulated. The number of TDFs that were up- or downregulated increased with time after inoculation. The majority of TDFs were upregulated at only 1 or 2 time points in response to infection. After isolation and sequencing of the TDFs of interest, a subset of 36 sequences were identified, among which 22 matched plant sequences and 2 matched nematode sequences. Some of the TDFs that matched plant genes showed clear homologies to genes involved in cell-cycle regulation, transcription regulation, resistance downstream signalling pathways, and defense mechanisms. Other sequences with homologies to plant genes of unknown function or without any significant similarity to known proteins were also found. Although not exhaustive, these results represent the most extensive list of genes with altered RNA levels after the incompatible G. pallida-potato interaction that has been published to date. The function of these genes could provide insight into resistance or plant defense mechanisms during incompatible potato-cyst nematode interactions.
Unexpected DNA affinity and sequence selectivity through core rigidity in guanidinium-based minor groove binders.

PubMed

Nagle, Padraic S; McKeever, Caitriona; Rodriguez, Fernando; Nguyen, Binh; Wilson, W David; Rozas, Isabel

2014-09-25

In this paper we report the design and biophysical evaluation of novel rigid-core symmetric and asymmetric dicationic DNA binders containing 9H-fluorene and 9,10-dihydroanthracene cores as well as the synthesis of one of these fluorene derivatives. First, the affinity toward particular DNA sequences of these compounds and flexible core derivatives was evaluated by means of surface plasmon resonance and thermal denaturation experiments finding that the position of the cations significantly influence the binding strength. Then their affinity and mode of binding were further studied by performing circular dichroism and UV studies and the results obtained were rationalized by means of DFT calculations. We found that the fluorene derivatives prepared have the ability to bind to the minor groove of certain DNA sequences and intercalate to others, whereas the dihydroanthracene compounds bind via intercalation to all the DNA sequences studied here.
Pre-main-sequence isochrones - II. Revising star and planet formation time-scales

NASA Astrophysics Data System (ADS)

Bell, Cameron P. M.; Naylor, Tim; Mayne, N. J.; Jeffries, R. D.; Littlefair, S. P.

2013-09-01

We have derived ages for 13 young (<30 Myr) star-forming regions and find that they are up to a factor of 2 older than the ages typically adopted in the literature. This result has wide-ranging implications, including that circumstellar discs survive longer (≃ 10-12 Myr) and that the average Class I lifetime is greater (≃1 Myr) than currently believed. For each star-forming region, we derived two ages from colour-magnitude diagrams. First, we fitted models of the evolution between the zero-age main sequence and terminal-age main sequence to derive a homogeneous set of main-sequence ages, distances and reddenings with statistically meaningful uncertainties. Our second age for each star-forming region was derived by fitting pre-main-sequence stars to new semi-empirical model isochrones. For the first time (for a set of clusters younger than 50 Myr), we find broad agreement between these two ages, and since these are derived from two distinct mass regimes that rely on different aspects of stellar physics, it gives us confidence in the new age scale. This agreement is largely due to our adoption of empirical colour-Teff relations and bolometric corrections for pre-main-sequence stars cooler than 4000 K. The revised ages for the star-forming regions in our sample are: ˜2 Myr for NGC 6611 (Eagle Nebula; M 16), IC 5146 (Cocoon Nebula), NGC 6530 (Lagoon Nebula; M 8) and NGC 2244 (Rosette Nebula); ˜6 Myr for σ Ori, Cep OB3b and IC 348; ≃10 Myr for λ Ori (Collinder 69); ≃11 Myr for NGC 2169; ≃12 Myr for NGC 2362; ≃13 Myr for NGC 7160; ≃14 Myr for χ Per (NGC 884); and ≃20 Myr for NGC 1960 (M 36).
Does typing of Chlamydia trachomatis using housekeeping multilocus sequence typing reveal different sexual networks among heterosexuals and men who have sex with men?

PubMed

Versteeg, Bart; Bruisten, Sylvia M; van der Ende, Arie; Pannekoek, Yvonne

2016-04-18

Chlamydia trachomatis infections remain the most common bacterial sexually transmitted infection worldwide. To gain more insight into the epidemiology and transmission of C. trachomatis, several schemes of multilocus sequence typing (MLST) have been developed. We investigated the clustering of C. trachomatis strains derived from men who have sex with men (MSM) and heterosexuals using the MLST scheme based on 7 housekeeping genes (MLST-7) adapted for clinical specimens and a high-resolution MLST scheme based on 6 polymorphic genes, including ompA (hr-MLST-6). Specimens from 100 C. trachomatis infected men who have sex with men (MSM) and 100 heterosexual women were randomly selected from previous studies and sequenced. We adapted the MLST-7 scheme to a nested assay to be suitable for direct typing of clinical specimens. All selected specimens were typed using both the adapted MLST-7 scheme and the hr-MLST-6 scheme. Clustering of C. trachomatis strains derived from MSM and heterosexuals was assessed using minimum spanning tree analysis. Sufficient chlamydial DNA was present in 188 of the 200 (94 %) selected samples. Using the adapted MLST-7 scheme, full MLST profiles were obtained for 187 of 188 tested specimens resulting in a high success rate of 99.5 %. Of these 187 specimens, 91 (48.7 %) were from MSM and 96 (51.3 %) from heterosexuals. We detected 21 sequence types (STs) using the adapted MLST-7 and 79 STs using the hr-MLST-6 scheme. Minimum spanning tree analyses was used to examine the clustering of MLST-7 data, which showed no reflection of separate transmission in MSM and heterosexual hosts. Moreover, typing using the hr-MLST-6 scheme identified genetically related clusters within each of clusters that were identified by using the MLST-7 scheme. No distinct transmission of C. trachomatis could be observed in MSM and heterosexuals using the adapted MLST-7 scheme in contrast to using the hr-MLST-6. In addition, we compared clustering of both MLST schemes and demonstrated that typing using the hr-MLST-6 scheme is able to identify genetically related clusters of C. trachomatis strains within each of the clusters that were identified by using the MLST-7 scheme.
Comparative mapping in intraspecific populations uncovers a high degree of macrosynteny between A- and B-genome diploid species of peanut

PubMed Central

2012-01-01

Background Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut. Results A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution. Conclusions Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut. PMID:23140574
Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

USDA-ARS?s Scientific Manuscript database

Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
HOMFLYPT polynomial is the best quantifier for topological cascades of vortex knots

NASA Astrophysics Data System (ADS)

Ricca, Renzo L.; Liu, Xin

2018-02-01

In this paper we derive and compare numerical sequences obtained by adapted polynomials such as HOMFLYPT, Jones and Alexander-Conway for the topological cascade of vortex torus knots and links that progressively untie by a single reconnection event at a time. Two cases are considered: the alternate sequence of knots and co-oriented links (with positive crossings) and the sequence of two-component links with oppositely oriented components (negative crossings). New recurrence equations are derived and sequences of numerical values are computed. In all cases the adapted HOMFLYPT polynomial proves to be the best quantifier for the topological cascade of torus knots and links.
From genomics to functional markers in the era of next-generation sequencing.

PubMed

Salgotra, R K; Gupta, B B; Stewart, C N

2014-03-01

The availability of complete genome sequences, along with other genomic resources for Arabidopsis, rice, pigeon pea, soybean and other crops, has revolutionized our understanding of the genetic make-up of plants. Next-generation DNA sequencing (NGS) has facilitated single nucleotide polymorphism discovery in plants. Functionally-characterized sequences can be identified and functional markers (FMs) for important traits can be developed at an ever-increasing ease. FMs are derived from sequence polymorphisms found in allelic variants of a functional gene. Linkage disequilibrium-based association mapping and homologous recombinants have been developed for identification of "perfect" markers for their use in crop improvement practices. Compared with many other molecular markers, FMs derived from the functionally characterized sequence genes using NGS techniques and their use provide opportunities to develop high-yielding plant genotypes resistant to various stresses at a fast pace.
Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

PubMed

Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

2016-06-01

Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.
Applications of alignment-free methods in epigenomics.

PubMed

Pinello, Luca; Lo Bosco, Giosuè; Yuan, Guo-Cheng

2014-05-01

Epigenetic mechanisms play an important role in the regulation of cell type-specific gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have supported a role of DNA sequences in recruitment of epigenetic regulators. Alignment-free methods have been applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic profiles. Here, we review recent advances in such applications, including the methods to map DNA sequence to feature space, sequence comparison and prediction models. Computational studies using these methods have provided important insights into the epigenetic regulatory mechanisms.
Draft genome sequence of the marine bacterium Streptomyces griseoaurantiacus M045, which produces novel manumycin-type antibiotics with a pABA core component.

PubMed

Li, Fuchao; Jiang, Peng; Zheng, Huajun; Wang, Shengyue; Zhao, Guoping; Qin, Song; Liu, Zhaopu

2011-07-01

Streptomyces griseoaurantiacus M045, isolated from marine sediment, produces manumycin and chinikomycin antibiotics. Here we present a high-quality draft genome sequence of S. griseoaurantiacus M045, the first marine Streptomyces species to be sequenced and annotated. The genome encodes several gene clusters for biosynthesis of secondary metabolites and has provided insight into genomic islands linking secondary metabolism to functional adaptation in marine S. griseoaurantiacus M045.
Draft Genome Sequence of the Yeast Starmerella bombicola NBRC10243, a Producer of Sophorolipids, Glycolipid Biosurfactants

PubMed Central

Matsuzawa, Tomohiko; Koike, Hideaki; Saika, Azusa; Fukuoka, Tokuma; Sato, Shun; Habe, Hiroshi; Kitamoto, Dai

2015-01-01

The yeast Starmerella bombicola NBRC10243 is an excellent producer of sophorolipids (SLs) from various feedstocks. Here, we report the draft genome sequence of S. bombicola NBRC10243. Analysis of the sequence may provide insight into the properties of this yeast that make it superior for use in the production of functional glycolipids and biomolecules, leading to the further development of S. bombicola NBRC10243 for industrial applications. PMID:25814600

Insights into the phylogenetic positions of photosynthetic bacteria obtained from 5S rRNA and 16S rRNA sequence data

NASA Technical Reports Server (NTRS)

Fox, G. E.

1985-01-01

Comparisons of complete 16S ribosomal ribonucleic acid (rRNA) sequences established that the secondary structure of these molecules is highly conserved. Earlier work with 5S rRNA secondary structure revealed that when structural conservation exists the alignment of sequences is straightforward. The constancy of structure implies minimal functional change. Under these conditions a uniform evolutionary rate can be expected so that conditions are favorable for phylogenetic tree construction.
Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, used for brewing the Japanese distilled spirit shochu.

PubMed

Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

2011-11-01

The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications.
Taking the Perspective of the Other Contributes to Awareness of Illness in Schizophrenia

PubMed Central

Langdon, Robyn; Ward, Philip

2009-01-01

Two approaches dominate research on the lack of awareness of illness that characterizes schizophrenia. The “deficit” approach uses standardized neuropsychological batteries to identify the neural underpinnings of intact insight; the “nondeficit” approach investigates the psychological defense mechanisms that motivate denial of illness. We adopt, instead, a cognitive neuropsychological approach to model the cognitive processes which underpin insight and which might be either damaged (because of neuropathology) or not used (because of motivational forces). We conceive of these processes in terms of a metacognitive capacity “to see ourselves as others see us.” We predict that a general difficulty with adopting other mental perspectives (with “seeing the world as others do”), indexed by performance deficits on theory of mind (ToM) tasks, will impair insight in schizophrenia. Thirty schizophrenic patients (also assessed for insight) and 26 healthy controls completed a battery of ToM tasks which varied presentation modality, response mode and instruction type (picture sequencing, joke appreciation and story comprehension tasks). While patients performed more poorly than controls on all ToM tasks, impairment in patients was not concordant across tasks. ToM scores from the picture sequencing and joke appreciation tasks, and not the story comprehension task, intercorrelated significantly in patients and predicted insight. Findings support the view that insight relies upon a cognitive capacity to adopt the other perspective, which, if intact, contributes to the metacognitive capacity to reflect upon “one's own” mental health from the other perspective. Findings also suggest that the nature of perspective-taking difficulty which disrupts insight in schizophrenia is best revealed using ToM tasks with “indirect” instructions. PMID:18495647
Human evolution: a tale from ancient genomes

PubMed Central

2017-01-01

The field of human ancient DNA (aDNA) has moved from mitochondrial sequencing that suffered from contamination and provided limited biological insights, to become a fully genomic discipline that is changing our conception of human history. Recent successes include the sequencing of extinct hominins, and true population genomic studies of Bronze Age populations. Among the emerging areas of aDNA research, the analysis of past epigenomes is set to provide more new insights into human adaptation and disease susceptibility through time. Starting as a mere curiosity, ancient human genetics has become a major player in the understanding of our evolutionary history. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994125
Nasopharyngeal teratoma, congenital diaphragmatic hernia and Dandy-Walker malformation - a yet uncharacterized syndrome.

PubMed

Gupta, N; Shastri, S; Singh, P K; Jana, M; Mridha, A; Verma, G; Kabra, M

2016-11-01

An association of congenital diaphragmatic hernia, dandy walker malformation and nasopharyngeal teratoma is very rare. Here, we report a fourth case with this association where chromosomal microarray and whole exome sequencing (WES) was performed to understand the underlying genetic basis. Findings of few variants especially a novel variation in HIRA provided some insights. An association of congenital diaphragmatic hernia, dandy walker malformation and nasopharyngeal teratoma is very rare. Here, we report a fourth case with this association where chromosomal microarray and whole exome sequencing (WES) was performed to understand the underlying genetic basis. Findings of few variants especially a novel variation in HIRA provided some insights. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Zone of Polarizing Activity Regulatory Sequence Mutations/Duplications with Preaxial Polydactyly and Longitudinal Preaxial Ray Deficiency in the Phenotype: A Review of Human Cases, Animal Models, and Insights Regarding the Pathogenesis

PubMed Central

2018-01-01

Clinicians and scientists interested in developmental biology have viewed preaxial polydactyly (PPD) and longitudinal preaxial ray deficiency (LPAD) as two different entities. Point mutations and duplications in the zone of polarizing activity regulatory sequence (ZRS) are associated with anterior ectopic expression of Sonic Hedgehog (SHH) in the limb bud and usually result in a PPD phenotype. However, some of these mutations/duplications also have LPAD in the phenotype. This unusual PPD-LPAD association in ZRS mutations/duplications has not been specifically reviewed in the literature. The author reviews this unusual entity and gives insights regarding its pathogenesis. PMID:29651423
A typing scheme for the honeybee pathogen Melissococcus plutonius allows detection of disease transmission events and a study of the distribution of variants.

PubMed

Haynes, Edward; Helgason, Thorunn; Young, J Peter W; Thwaites, Richard; Budge, Giles E

2013-08-01

Melissococcus plutonius is the bacterial pathogen that causes European Foulbrood of honeybees, a globally important honeybee brood disease. We have used next-generation sequencing to identify highly polymorphic regions in an otherwise genetically homogenous organism, and used these loci to create a modified MLST scheme. This synthesis of a proven typing scheme format with next-generation sequencing combines reliability and low costs with insights only available from high-throughput sequencing technologies. Using this scheme we show that the global distribution of M.plutonius variants is not uniform. We use the scheme in epidemiological studies to trace movements of infective material around England, insights that would have been impossible to confirm without the typing scheme. We also demonstrate the persistence of local variants over time. © 2013 Crown copyright. Reproduced with the permission of the Controller of Her Majesty's Stationary Office/Queen’s Printer for Scotland and Food and Environment Research Agency.
Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution

PubMed Central

Smith, Jeramiah J; Kuraku, Shigehiro; Holt, Carson; Sauka-Spengler, Tatjana; Jiang, Ning; Campbell, Michael S; Yandell, Mark D; Manousaki, Tereza; Meyer, Axel; Bloom, Ona E; Morgan, Jennifer R; Buxbaum, Joseph D; Sachidanandam, Ravi; Sims, Carrie; Garruss, Alexander S; Cook, Malcolm; Krumlauf, Robb; Wiedemann, Leanne M; Sower, Stacia A; Decatur, Wayne A; Hall, Jeffrey A; Amemiya, Chris T; Saha, Nil R; Buckley, Katherine M; Rast, Jonathan P; Das, Sabyasachi; Hirano, Masayuki; McCurley, Nathanael; Guo, Peng; Rohner, Nicolas; Tabin, Clifford J; Piccinelli, Paul; Elgar, Greg; Ruffier, Magali; Aken, Bronwen L; Searle, Stephen MJ; Muffato, Matthieu; Pignatelli, Miguel; Herrero, Javier; Jones, Matthew; Brown, C Titus; Chung-Davidson, Yu-Wen; Nanlohy, Kaben G; Libants, Scot V; Yeh, Chu-Yin; McCauley, David W; Langeland, James A; Pancer, Zeev; Fritzsch, Bernd; de Jong, Pieter J; Zhu, Baoli; Fulton, Lucinda L; Theising, Brenda; Flicek, Paul; Bronner, Marianne E; Warren, Wesley C; Clifton, Sandra W; Wilson, Richard K; Li, Weiming

2013-01-01

Lampreys are representatives of an ancient vertebrate lineage that diverged from our own ~500 million years ago. By virtue of this deeply shared ancestry, the sea lamprey (P. marinus) genome is uniquely poised to provide insight into the ancestry of vertebrate genomes and the underlying principles of vertebrate biology. Here, we present the first lamprey whole-genome sequence and assembly. We note challenges faced owing to its high content of repetitive elements and GC bases, as well as the absence of broad-scale sequence information from closely related species. Analyses of the assembly indicate that two whole-genome duplications likely occurred before the divergence of ancestral lamprey and gnathostome lineages. Moreover, the results help define key evolutionary events within vertebrate lineages, including the origin of myelin-associated proteins and the development of appendages. The lamprey genome provides an important resource for reconstructing vertebrate origins and the evolutionary events that have shaped the genomes of extant organisms. PMID:23435085
Involuntary memory chaining versus event cueing: Which is a better indicator of autobiographical memory organisation?

PubMed

Mace, John H; Clevinger, Amanda M; Martin, Cody

2010-11-01

Involuntary memory chains are spontaneous recollections of the past that occur in a sequence. Much like semantic memory priming, this memory phenomenon has provided some insights into the nature of associations in autobiographical memory. The event-cueing procedure (a laboratory-based memory sequencing task) has also provided some insights into the nature of autobiographical memory organisation. However, while both of these memory-sequencing phenomena have exhibited the same types of memory associations (conceptual associations and general-event or temporal associations), both have also produced discrepant results with respect to the relative proportions of such associations. This study investigated the possibility that the results from event cueing are artefacts of various memory production responses. Using a number of different approaches we demonstrated that these memory production responses cause overestimates of general-event association. We conclude that for this reason, the data from involuntary memory chains provide a better picture of the organisation of autobiographical memory.
Mechanistic insights into the recognition of 5-methylcytosine oxidation derivatives by the SUVH5 SRA domain.

PubMed

Rajakumara, Eerappa; Nakarakanti, Naveen Kumar; Nivya, M Angel; Satish, Mutyala

2016-02-04

5-Methylcytosine (5 mC) is associated with epigenetic gene silencing in mammals and plants. 5 mC is consecutively oxidized to 5-hydroxymethylcytosine (5 hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) by ten-eleven translocation enzymes. We performed binding and structural studies to investigate the molecular basis of the recognition of the 5 mC oxidation derivatives in the context of a CG sequence by the SET- and RING-associated domain (SRA) of the SUVH5 protein (SUVH5 SRA). Using calorimetric measurements, we demonstrate that the SRA domain binds to the hydroxymethylated CG (5hmCG) DNA duplex in a similar manner to methylated CG (5mCG). Interestingly, the SUVH5 SRA domain exhibits weaker affinity towards carboxylated CG (5caCG) and formylated CG (5fCG). We report the 2.6 Å resolution crystal structure of the SUVH5 SRA domain in a complex with fully hydroxymethyl-CG and demonstrate a dual flip-out mechanism, whereby the symmetrical 5hmCs are simultaneously extruded from the partner strands of the DNA duplex and are positioned within the binding pockets of individual SRA domains. The hydroxyl group of 5hmC establishes both intra- and intermolecular interactions in the binding pocket. Collectively, we show that SUVH5 SRA recognizes 5hmC in a similar manner to 5 mC, but exhibits weaker affinity towards 5 hmC oxidation derivatives.
Statistical Linkage Analysis of Substitutions in Patient-Derived Sequences of Genotype 1a Hepatitis C Virus Nonstructural Protein 3 Exposes Targets for Immunogen Design

PubMed Central

Quadeer, Ahmed A.; Louie, Raymond H. Y.; Shekhar, Karthik; Chakraborty, Arup K.; Hsing, I-Ming

2014-01-01

ABSTRACT Chronic hepatitis C virus (HCV) infection is one of the leading causes of liver failure and liver cancer, affecting around 3% of the world's population. The extreme sequence variability of the virus resulting from error-prone replication has thwarted the discovery of a universal prophylactic vaccine. It is known that vigorous and multispecific cellular immune responses, involving both helper CD4+ and cytotoxic CD8+ T cells, are associated with the spontaneous clearance of acute HCV infection. Escape mutations in viral epitopes can, however, abrogate protective T-cell responses, leading to viral persistence and associated pathologies. Despite the propensity of the virus to mutate, there might still exist substitutions that incur a fitness cost. In this paper, we identify groups of coevolving residues within HCV nonstructural protein 3 (NS3) by analyzing diverse sequences of this protein using ideas from random matrix theory and associated methods. Our analyses indicate that one of these groups comprises a large percentage of residues for which HCV appears to resist multiple simultaneous substitutions. Targeting multiple residues in this group through vaccine-induced immune responses should either lead to viral recognition or elicit escape substitutions that compromise viral fitness. Our predictions are supported by published clinical data, which suggested that immune genotypes associated with spontaneous clearance of HCV preferentially recognized and targeted this vulnerable group of residues. Moreover, mapping the sites of this group onto the available protein structure provided insight into its functional significance. An epitope-based immunogen is proposed as an alternative to the NS3 epitopes in the peptide-based vaccine IC41. IMPORTANCE Despite much experimental work on HCV, a thorough statistical study of the HCV sequences for the purpose of immunogen design was missing in the literature. Such a study is vital to identify epistatic couplings among residues that can provide useful insights for designing a potent vaccine. In this work, ideas from random matrix theory were applied to characterize the statistics of substitutions within the diverse publicly available sequences of the genotype 1a HCV NS3 protein, leading to a group of sites for which HCV appears to resist simultaneous substitutions possibly due to deleterious effect on viral fitness. Our analysis leads to completely novel immunogen designs for HCV. In addition, the NS3 epitopes used in the recently proposed peptide-based vaccine IC41 were analyzed in the context of our framework. Our analysis predicts that alternative NS3 epitopes may be worth exploring as they might be more efficacious. PMID:24760894
Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences

Treesearch

Ned B. Klopfenstein; Jane E. Stewart; Yuko Ota; John W. Hanna; Bryce A. Richardson; Amy L. Ross-Davis; Ruben D. Elias-Roman; Kari Korhonen; Nenad Keca; Eugenia Iturritxa; Dionicio Alvarado-Rosales; Halvor Solheim; Nicholas J. Brazee; Piotr Lakomy; Michelle R. Cleary; Eri Hasegawa; Taisei Kikuchi; Fortunato Garza-Ocanas; Panaghiotis Tsopelas; Daniel Rigling; Simone Prospero; Tetyana Tsykun; Jean A. Berube; Franck O. P. Stefani; Saeideh Jafarpour; Vladimir Antonin; Michal Tomsovsky; Geral I. McDonald; Stephen Woodward; Mee-Sook Kim

2017-01-01

Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequenceâbased analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation...
Insights into phylogeny, sex function and age of Fragaria based on whole chloroplast genome sequencing

Treesearch

Wambui Njunguna; Aaron Liston; Richard Cronn; Tia-Lynn Ashman; Nahla Bassil

2013-01-01

The cultivated strawberry is one of the youngest domesticated plants, developed in France in the 1700s from chance hybridization between two western hemisphere octoploid species. However, little is known about the evolution of the species that gave rise to this important fruit crop. Phylogenetic analysis of chloroplast genome sequences of 21 Fragaria...
Draft Genome Sequence of Two Marine Plantactinospora spp. from the Gulf of California.

PubMed

Contreras-Castro, Luis; Maldonado, Luis A; Quintana, Erika T; Raggi, Luciana; Sánchez-Flores, Alejandro

2018-05-24

Plantactinospora sp. strains BB1 and BC1 were isolated in 2009 from sediment samples of the Gulf of California from among almost 300 actinobacteria. Genome mining of their ∼8.5-Mb sequences showed the bioprospecting potential of these rare actinomycetes, providing an insight to their ecological and biotechnological importance. Copyright © 2018 Contreras-Castro et al.
Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

PubMed

Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

2016-03-20

We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from an acid mine drainage (AMD). This work provides data to gain insights about adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments. Copyright © 2016 Elsevier B.V. All rights reserved.
Complete Genome Sequence of the Naphthalene-Degrading Bacterium Pseudomonas stutzeri AN10 (CCUG 29243)

PubMed Central

Brunet-Galmés, Isabel; Busquets, Antonio; Peña, Arantxa; Gomila, Margarita; Nogales, Balbina; García-Valdés, Elena; Lalucat, Jorge; Bennasar, Antonio

2012-01-01

Pseudomonas stutzeri AN10 (CCUG 29243) can be considered a model strain for aerobic naphthalene degradation. We report the complete genome sequence of this bacterium. Its 4.71-Mb chromosome provides insights into other biodegradative capabilities of strain AN10 (i.e., benzoate catabolism) and suggests a high number of horizontal gene transfer events. PMID:23144395
Genome Sequence of the Thermophile Bacillus coagulans Hammer, the Type Strain of the Species

PubMed Central

Su, Fei; Tao, Fei; Tang, Hongzhi

2012-01-01

Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans. PMID:23105047
Genome sequence of the thermophile Bacillus coagulans Hammer, the type strain of the species.

PubMed

Su, Fei; Tao, Fei; Tang, Hongzhi; Xu, Ping

2012-11-01

Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans.
Draft Genome Sequence of the Psychrophilic and Alkaliphilic Rhodonellum psychrophilum Strain GCM71T.

PubMed

Hauptmann, Aviaja L; Glaring, Mikkel A; Hallin, Peter F; Priemé, Anders; Stougaard, Peter

2013-12-05

Rhodonellum psychrophilum GCM71(T), isolated from the cold and alkaline submarine ikaite columns in the Ikka Fjord in Greenland, displays optimal growth at 5 to 10°C and pH 10. Here, we report the draft genome sequence of this strain, which may provide insight into the mechanisms of adaptation to these extreme conditions.
Draft Genome Sequences of Two Mycobacterium bovis Strains Isolated from Beef Cattle in Paraguay

PubMed Central

Sanabria, Lidia; Lagrave, Lorena; Nishibe, Christiane; Ribas, Augusto C. A.; Zumárraga, Martín J.; Araújo, Flábio R.

2017-01-01

ABSTRACT This work reports the draft genome sequences of the Mycobacterium bovis strains M1009 and M1010, isolated from the lymph nodes of two infected cows on a beef farm in Paraguay. Comparative genomics between these strains and other regional strains may provide more insights regarding M. bovis epidemiology in South America. PMID:28705977

A mechanistic insight into the amyloidogenic structure of hIAPP peptide revealed from sequence analysis and molecular dynamics simulation.

PubMed

Chakraborty, Sandipan; Chatterjee, Barnali; Basu, Soumalee

2012-07-01

A collective approach of sequence analysis, phylogenetic tree and in silico prediction of amyloidogenecity using bioinformatics tools have been used to correlate the observed species-specific variations in IAPP sequences with the amyloid forming propensity. Observed substitution patterns indicate that probable changes in local hydrophobicity are instrumental in altering the aggregation propensity of the peptide. In particular, residues at 17th, 22nd and 23rd positions of the IAPP peptide are found to be crucial for amyloid formation. Proline25 primarily dictates the observed non-amyloidogenecity in rodents. Furthermore, extensive molecular dynamics simulation of 0.24 μs have been carried out with human IAPP (hIAPP) fragment 19-27, the portion showing maximum sequence variation across different species, to understand the native folding characteristic of this region. Principal component analysis in combination with free energy landscape analysis illustrates a four residue turn spanning from residue 22 to 25. The results provide a structural insight into the intramolecular β-sheet structure of amylin which probably is the template for nucleation of fibril formation and growth, a pathogenic feature of type II diabetes. Copyright © 2012 Elsevier B.V. All rights reserved.
Exome capture sequencing reveals new insights into hepatitis B virus-induced hepatocellular carcinoma at the early stage of tumorigenesis.

PubMed

Chen, Yong; Wang, Lijuan; Xu, Hexiang; Liu, Xingxiang; Zhao, Yingren

2013-10-01

Hepatocellular carcinoma (HCC), the most common type of liver cancer, is the third primary cause of cancer-related mortality worldwide. The molecular mechanisms underlying the initiation and formation of HCC remain obscure. In the present study, we performed exome sequencing using tumor and normal tissues from 3 hepatitis B virus (HBV)-positive BCLC stage A HCC patients. Bioinformatic analysis was performed to find candidate protein-altering somatic mutations. Eighty damaging mutations were validated and 59 genes were reported to be mutated in HBV-related HCCs for the first time here. Further analysis using whole genome sequencing (WGS) data of 88 HBV-related HCC patients from the European Genome-phenome Archive database showed that mutations in 33 of the 59 genes were also detected in other samples. Variants of two newly found genes, ZNF717 and PARP4, were detected in more than 10% of the WGS samples. Several other genes, such as FLNA and CNTN2, are also noteworthy. Thus, the exome sequencing analysis of three BCLC stage A patients provides new insights into the molecular events governing the early steps of HBV-induced HCC tumorigenesis.
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain

PubMed Central

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-01-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Insights on pumping well interpretation from flow dimension analysis: The learnings of a multi-context field database

NASA Astrophysics Data System (ADS)

Ferroud, Anouck; Chesnaux, Romain; Rafini, Silvain

2018-01-01

The flow dimension parameter n, derived from the Generalized Radial Flow model, is a valuable tool to investigate the actual flow regimes that really occur during a pumping test rather than suppose them to be radial, as postulated by the Theis-derived models. A numerical approach has shown that, when the flow dimension is not radial, using the derivative analysis rather than the conventional Theis and Cooper-Jacob methods helps to estimate much more accurately the hydraulic conductivity of the aquifer. Although n has been analysed in numerous studies including field-based studies, there is a striking lack of knowledge about its occurrence in nature and how it may be related to the hydrogeological setting. This study provides an overview of the occurrence of n in natural aquifers located in various geological contexts including crystalline rock, carbonate rock and granular aquifers. A comprehensive database is compiled from governmental and industrial sources, based on 69 constant-rate pumping tests. By means of a sequential analysis approach, we systematically performed a flow dimension analysis in which straight segments on drawdown-log derivative time series are interpreted as successive, specific and independent flow regimes. To reduce the uncertainties inherent in the identification of n sequences, we used the proprietary SIREN code to execute a dual simultaneous fit on both the drawdown and the drawdown-log derivative signals. Using the stated database, we investigate the frequency with which the radial and non-radial flow regimes occur in fractured rock and granular aquifers, and also provide outcomes that indicate the lack of applicability of Theis-derived models in representing nature. The results also emphasize the complexity of hydraulic signatures observed in nature by pointing out n sequential signals and non-integer n values that are frequently observed in the database.
Problem solving stages in the five square problem

PubMed Central

Fedor, Anna; Szathmáry, Eörs; Öllinger, Michael

2015-01-01

According to the restructuring hypothesis, insight problem solving typically progresses through consecutive stages of search, impasse, insight, and search again for someone, who solves the task. The order of these stages was determined through self-reports of problem solvers and has never been verified behaviorally. We asked whether individual analysis of problem solving attempts of participants revealed the same order of problem solving stages as defined by the theory and whether their subjective feelings corresponded to the problem solving stages they were in. Our participants tried to solve the Five-Square problem in an online task, while we recorded the time and trajectory of their stick movements. After the task they were asked about their feelings related to insight and some of them also had the possibility of reporting impasse while working on the task. We found that the majority of participants did not follow the classic four-stage model of insight, but had more complex sequences of problem solving stages, with search and impasse recurring several times. This means that the classic four-stage model is not sufficient to describe variability on the individual level. We revised the classic model and we provide a new model that can generate all sequences found. Solvers reported insight more often than non-solvers and non-solvers reported impasse more often than solvers, as expected; but participants did not report impasse more often during behaviorally defined impasse stages than during other stages. This shows that impasse reports might be unreliable indicators of impasse. Our study highlights the importance of individual analysis of problem solving behavior to verify insight theory. PMID:26300794
Problem solving stages in the five square problem.

PubMed

Fedor, Anna; Szathmáry, Eörs; Öllinger, Michael

2015-01-01

According to the restructuring hypothesis, insight problem solving typically progresses through consecutive stages of search, impasse, insight, and search again for someone, who solves the task. The order of these stages was determined through self-reports of problem solvers and has never been verified behaviorally. We asked whether individual analysis of problem solving attempts of participants revealed the same order of problem solving stages as defined by the theory and whether their subjective feelings corresponded to the problem solving stages they were in. Our participants tried to solve the Five-Square problem in an online task, while we recorded the time and trajectory of their stick movements. After the task they were asked about their feelings related to insight and some of them also had the possibility of reporting impasse while working on the task. We found that the majority of participants did not follow the classic four-stage model of insight, but had more complex sequences of problem solving stages, with search and impasse recurring several times. This means that the classic four-stage model is not sufficient to describe variability on the individual level. We revised the classic model and we provide a new model that can generate all sequences found. Solvers reported insight more often than non-solvers and non-solvers reported impasse more often than solvers, as expected; but participants did not report impasse more often during behaviorally defined impasse stages than during other stages. This shows that impasse reports might be unreliable indicators of impasse. Our study highlights the importance of individual analysis of problem solving behavior to verify insight theory.
Designing deep sequencing experiments: detecting structural variation and estimating transcript abundance.

PubMed

Bashir, Ali; Bansal, Vikas; Bafna, Vineet

2010-06-18

Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

PubMed

Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K

2017-05-23

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE PAGES

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...

2017-05-08

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

PubMed Central

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo

2017-01-01

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates

PubMed Central

Willems, Thomas; Gymrek, Melissa; Poznik, G. David; Tyler-Smith, Chris; Erlich, Yaniv

2016-01-01

Short tandem repeats (STRs) are mutation-prone loci that span nearly 1% of the human genome. Previous studies have estimated the mutation rates of highly polymorphic STRs by using capillary electrophoresis and pedigree-based designs. Although this work has provided insights into the mutational dynamics of highly mutable STRs, the mutation rates of most others remain unknown. Here, we harnessed whole-genome sequencing data to estimate the mutation rates of Y chromosome STRs (Y-STRs) with 2–6 bp repeat units that are accessible to Illumina sequencing. We genotyped 4,500 Y-STRs by using data from the 1000 Genomes Project and the Simons Genome Diversity Project. Next, we developed MUTEA, an algorithm that infers STR mutation rates from population-scale data by using a high-resolution SNP-based phylogeny. After extensive intrinsic and extrinsic validations, we harnessed MUTEA to derive mutation-rate estimates for 702 polymorphic STRs by tracing each locus over 222,000 meioses, resulting in the largest collection of Y-STR mutation rates to date. Using our estimates, we identified determinants of STR mutation rates and built a model to predict rates for STRs across the genome. These predictions indicate that the load of de novo STR mutations is at least 75 mutations per generation, rivaling the load of all other known variant types. Finally, we identified Y-STRs with potential applications in forensics and genetic genealogy, assessed the ability to differentiate between the Y chromosomes of father-son pairs, and imputed Y-STR genotypes. PMID:27126583
Enterovirus Migration Patterns between France and Tunisia

PubMed Central

Othman, Ines; Mirand, Audrey; Slama, Ichrak; Mastouri, Maha; Peigue-Lafeuille, Hélène; Aouni, Mahjoub; Bailly, Jean-Luc

2015-01-01

The enterovirus (EV) types echovirus (E-) 5, E-9, and E-18, and coxsackievirus (CV-) A9 are infrequently reported in human diseases and their epidemiologic features are poorly defined. Virus transmission patterns between countries have been estimated with phylogenetic data derived from the 1D/VP1 and 3CD gene sequences of a sample of 74 strains obtained in France (2000–2012) and Tunisia (2011–2013) and from the publicly available sequences. The EV types (E-5, E-9, and E-18) exhibited a lower worldwide genetic diversity (respective number of genogroups: 4, 5, and 3) in comparison to CV-A9 (n = 10). The phylogenetic trees estimated with both 1D/VP1 and 3CD sequence data showed variations in the number of co-circulating lineages over the last 20 years among the four EV types. Despite the low number of genogroups in E-18, the virus exhibited the highest number of recombinant 3CD lineages (n = 10) versus 4 (E-5) to 8 (E-9). The phylogenies provided evidence of multiple transportation events between France and Tunisia involving E-5, E-9, E-18, and CV-A9 strains. Virus spread events between France and 17 other countries in five continents had high probabilities of occurrence as those between Tunisia and two European countries other than France. All transportation events were supported by BF values > 10. Inferring the source of virus transmission from phylogenetic data may provide insights into the patterns of sporadic and epidemic diseases caused by EVs. PMID:26709514
Comparative genomics of Enterococcus faecalis from healthy Norwegian infants

PubMed Central

Solheim, Margrete; Aakra, Ågot; Snipen, Lars G; Brede, Dag A; Nes, Ingolf F

2009-01-01

Background Enterococcus faecalis, traditionally considered a harmless commensal of the intestinal tract, is now ranked among the leading causes of nosocomial infections. In an attempt to gain insight into the genetic make-up of commensal E. faecalis, we have studied genomic variation in a collection of community-derived E. faecalis isolated from the feces of Norwegian infants. Results The E. faecalis isolates were first sequence typed by multilocus sequence typing (MLST) and characterized with respect to antibiotic resistance and properties associated with virulence. A subset of the isolates was compared to the vancomycin resistant strain E. faecalis V583 (V583) by whole genome microarray comparison (comparative genomic hybridization (CGH)). Several of the putative enterococcal virulence factors were found to be highly prevalent among the commensal baby isolates. The genomic variation as observed by CGH was less between isolates displaying the same MLST sequence type than between isolates belonging to different evolutionary lineages. Conclusion The variations in gene content observed among the investigated commensal E. faecalis is comparable to the genetic variation previously reported among strains of various origins thought to be representative of the major E. faecalis lineages. Previous MLST analysis of E. faecalis have identified so-called high-risk enterococcal clonal complexes (HiRECC), defined as genetically distinct subpopulations, epidemiologically associated with enterococcal infections. The observed correlation between CGH and MLST presented here, may offer a method for the identification of lineage-specific genes, and may therefore add clues on how to distinguish pathogenic from commensal E. faecalis. In this work, information on the core genome of E. faecalis is also substantially extended. PMID:19393078
Genome sequence of an Australian kangaroo, Macropus eugenii, provides insight into the evolution of mammalian reproduction and development

PubMed Central

2011-01-01

Background We present the genome sequence of the tammar wallaby, Macropus eugenii, which is a member of the kangaroo family and the first representative of the iconic hopping mammals that symbolize Australia to be sequenced. The tammar has many unusual biological characteristics, including the longest period of embryonic diapause of any mammal, extremely synchronized seasonal breeding and prolonged and sophisticated lactation within a well-defined pouch. Like other marsupials, it gives birth to highly altricial young, and has a small number of very large chromosomes, making it a valuable model for genomics, reproduction and development. Results The genome has been sequenced to 2 × coverage using Sanger sequencing, enhanced with additional next generation sequencing and the integration of extensive physical and linkage maps to build the genome assembly. We also sequenced the tammar transcriptome across many tissues and developmental time points. Our analyses of these data shed light on mammalian reproduction, development and genome evolution: there is innovation in reproductive and lactational genes, rapid evolution of germ cell genes, and incomplete, locus-specific X inactivation. We also observe novel retrotransposons and a highly rearranged major histocompatibility complex, with many class I genes located outside the complex. Novel microRNAs in the tammar HOX clusters uncover new potential mammalian HOX regulatory elements. Conclusions Analyses of these resources enhance our understanding of marsupial gene evolution, identify marsupial-specific conserved non-coding elements and critical genes across a range of biological systems, including reproduction, development and immunity, and provide new insight into marsupial and mammalian biology and genome evolution. PMID:21854559
Genome sequence of an Australian kangaroo, Macropus eugenii, provides insight into the evolution of mammalian reproduction and development.

PubMed

Renfree, Marilyn B; Papenfuss, Anthony T; Deakin, Janine E; Lindsay, James; Heider, Thomas; Belov, Katherine; Rens, Willem; Waters, Paul D; Pharo, Elizabeth A; Shaw, Geoff; Wong, Emily S W; Lefèvre, Christophe M; Nicholas, Kevin R; Kuroki, Yoko; Wakefield, Matthew J; Zenger, Kyall R; Wang, Chenwei; Ferguson-Smith, Malcolm; Nicholas, Frank W; Hickford, Danielle; Yu, Hongshi; Short, Kirsty R; Siddle, Hannah V; Frankenberg, Stephen R; Chew, Keng Yih; Menzies, Brandon R; Stringer, Jessica M; Suzuki, Shunsuke; Hore, Timothy A; Delbridge, Margaret L; Patel, Hardip R; Mohammadi, Amir; Schneider, Nanette Y; Hu, Yanqiu; O'Hara, William; Al Nadaf, Shafagh; Wu, Chen; Feng, Zhi-Ping; Cocks, Benjamin G; Wang, Jianghui; Flicek, Paul; Searle, Stephen M J; Fairley, Susan; Beal, Kathryn; Herrero, Javier; Carone, Dawn M; Suzuki, Yutaka; Sugano, Sumio; Toyoda, Atsushi; Sakaki, Yoshiyuki; Kondo, Shinji; Nishida, Yuichiro; Tatsumoto, Shoji; Mandiou, Ion; Hsu, Arthur; McColl, Kaighin A; Lansdell, Benjamin; Weinstock, George; Kuczek, Elizabeth; McGrath, Annette; Wilson, Peter; Men, Artem; Hazar-Rethinam, Mehlika; Hall, Allison; Davis, John; Wood, David; Williams, Sarah; Sundaravadanam, Yogi; Muzny, Donna M; Jhangiani, Shalini N; Lewis, Lora R; Morgan, Margaret B; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Nazareth, Lynne; Cree, Andrew; Fowler, Gerald; Kovar, Christie L; Dinh, Huyen H; Joshi, Vandita; Jing, Chyn; Lara, Fremiet; Thornton, Rebecca; Chen, Lei; Deng, Jixin; Liu, Yue; Shen, Joshua Y; Song, Xing-Zhi; Edson, Janette; Troon, Carmen; Thomas, Daniel; Stephens, Amber; Yapa, Lankesha; Levchenko, Tanya; Gibbs, Richard A; Cooper, Desmond W; Speed, Terence P; Fujiyama, Asao; Graves, Jennifer A M; O'Neill, Rachel J; Pask, Andrew J; Forrest, Susan M; Worley, Kim C

2011-08-29

We present the genome sequence of the tammar wallaby, Macropus eugenii, which is a member of the kangaroo family and the first representative of the iconic hopping mammals that symbolize Australia to be sequenced. The tammar has many unusual biological characteristics, including the longest period of embryonic diapause of any mammal, extremely synchronized seasonal breeding and prolonged and sophisticated lactation within a well-defined pouch. Like other marsupials, it gives birth to highly altricial young, and has a small number of very large chromosomes, making it a valuable model for genomics, reproduction and development. The genome has been sequenced to 2 × coverage using Sanger sequencing, enhanced with additional next generation sequencing and the integration of extensive physical and linkage maps to build the genome assembly. We also sequenced the tammar transcriptome across many tissues and developmental time points. Our analyses of these data shed light on mammalian reproduction, development and genome evolution: there is innovation in reproductive and lactational genes, rapid evolution of germ cell genes, and incomplete, locus-specific X inactivation. We also observe novel retrotransposons and a highly rearranged major histocompatibility complex, with many class I genes located outside the complex. Novel microRNAs in the tammar HOX clusters uncover new potential mammalian HOX regulatory elements. Analyses of these resources enhance our understanding of marsupial gene evolution, identify marsupial-specific conserved non-coding elements and critical genes across a range of biological systems, including reproduction, development and immunity, and provide new insight into marsupial and mammalian biology and genome evolution.
Diversity of Glycosyl Hydrolases from Cellulose-Depleting Communities Enriched from Casts of Two Earthworm Species▿ †

PubMed Central

Beloqui, Ana; Nechitaylo, Taras Y.; López-Cortés, Nieves; Ghazi, Azam; Guazzaroni, María-Eugenia; Polaina, Julio; Strittmatter, Axel W.; Reva, Oleg; Waliczek, Agnes; Yakimov, Michail M.; Golyshina, Olga V.; Ferrer, Manuel; Golyshin, Peter N.

2010-01-01

The guts and casts of earthworms contain microbial assemblages that process large amounts of organic polymeric substrates from plant litter and soil; however, the enzymatic potential of these microbial communities remains largely unexplored. In the present work, we retrieved carbohydrate-modifying enzymes through the activity screening of metagenomic fosmid libraries from cellulose-depleting microbial communities established with the fresh casts of two earthworm species, Aporrectodea caliginosa and Lumbricus terrestris, as inocula. Eight glycosyl hydrolases (GHs) from the A. caliginosa-derived community were multidomain endo-β-glucanases, β-glucosidases, β-cellobiohydrolases, β-galactosidase, and β-xylosidases of known GH families. In contrast, two GHs derived from the L. terrestris microbiome had no similarity to any known GHs and represented two novel families of β-galactosidases/α-arabinopyranosidases. Members of these families were annotated in public databases as conserved hypothetical proteins, with one being structurally related to isomerases/dehydratases. This study provides insight into their biochemistry, domain structures, and active-site architecture. The two communities were similar in bacterial composition but significantly different with regard to their eukaryotic inhabitants. Further sequence analysis of fosmids and plasmids bearing the GH-encoding genes, along with oligonucleotide usage pattern analysis, suggested that those apparently originated from Gammaproteobacteria (pseudomonads and Cellvibrio-like organisms), Betaproteobacteria (Comamonadaceae), and Alphaproteobacteria (Rhizobiales). PMID:20622123
Spatial and temporal expression of the Grainyhead-like transcription factor family during murine development.

PubMed

Auden, Alana; Caddy, Jacinta; Wilanowski, Tomasz; Ting, Stephen B; Cunningham, John M; Jane, Stephen M

2006-10-01

The Drosophila transcription factor Grainyhead (grh) is expressed in ectoderm-derived tissues where it regulates several key developmental events including cuticle formation, tracheal elongation and dorsal closure. Our laboratory has recently identified three novel mammalian homologues of the grh gene, Grainyhead-like 1, -2 and -3 (Grhl1-3) that rewrite the phylogeny of this family. Using gene targeting in mice, we have shown that Grhl3 is essential for neural tube closure, skin barrier formation and wound healing. Despite their extensive sequence homology, Grhl1 and Grhl2 are unable to compensate for loss of Grhl3 in these developmental processes. To explore this lack of redundancy, and to gain further insights into the functions of this gene family in mammalian development we have performed an extensive in situ hybridisation analysis. We demonstrate that, although all three Grhl genes are highly expressed in the developing epidermis, they display subtle differences in the timing and level of expression. Surprisingly, we also demonstrate differential expression patterns in non-ectoderm-derived tissues, including the heart, the lung, and the metanephric kidney. These findings expand our understanding of the unique role of Grhl3 in neurulation and epidermal morphogenesis, and provide a focus for further functional analysis of the Grhl genes during mouse embryogenesis.
Age-Related Changes and Reference Values of Bicaudate Ratio and Sagittal Brainstem Diameters on MRI.

PubMed

Garbade, Sven F; Boy, Nikolas; Heringer, Jana; Kölker, Stefan; Harting, Inga

2018-06-05

Cranial magnetic resonance imaging (MRI) plays an important role in the diagnosis of neurometabolic diseases, and, in addition, temporal patterns of signal and volume changes allow insight into the underlying pathogenesis. While assessment of volume changes by visual inspection is subjective, volumetric approaches are often not feasible with rare neurometabolic diseases, where MRIs are often acquired with different scanners and protocols. Linear surrogate parameters of brain volume, for example, the bicaudate ratio, present a robust alternative that can be derived from standard imaging sequences. Due to the continuing postnatal brain and skull development and later brain involution, it is, however, necessary to compare patient values with age age-adapted normal values.In this article, we present age-dependent normal values derived from 993 standard scans of patients with normal MRI findings (age range: 0-80 years; mean = 19.9; median = 12.8 years) for bicaudate ratio as a measure of global supratentorial volume, as well as the maximal anteroposterior diameters of mesencephalon, pons, and medulla oblongata as parameters of brainstem volume. The provided data allow quantitative, objective assessment of brain volume changes instead of the usually performed visual and therefore subjective assessment. Georg Thieme Verlag KG Stuttgart · New York.
Isolation and sequence analysis of the wheat B genome subtelomeric DNA.

PubMed

Salina, Elena A; Sergeeva, Ekaterina M; Adonina, Irina G; Shcherban, Andrey B; Afonnikov, Dmitry A; Belcram, Harry; Huneau, Cecile; Chalhoub, Boulos

2009-09-05

Telomeric and subtelomeric regions are essential for genome stability and regular chromosome replication. In this work, we have characterized the wheat BAC (bacterial artificial chromosome) clones containing Spelt1 and Spelt52 sequences, which belong to the subtelomeric repeats of the B/G genomes of wheats and Aegilops species from the section Sitopsis. The BAC library from Triticum aestivum cv. Renan was screened using Spelt1 and Spelt52 as probes. Nine positive clones were isolated; of them, clone 2050O8 was localized mainly to the distal parts of wheat chromosomes by in situ hybridization. The distribution of the other clones indicated the presence of different types of repetitive sequences in BACs. Use of different approaches allowed us to prove that seven of the nine isolated clones belonged to the subtelomeric chromosomal regions. Clone 2050O8 was sequenced and its sequence of 119,737 bp was annotated. It is composed of 33% transposable elements (TEs), 8.2% Spelt52 (namely, the subfamily Spelt52.2) and five non-TE-related genes. DNA transposons are predominant, making up 24.6% of the entire BAC clone, whereas retroelements account for 8.4% of the clone length. The full-length CACTA transposon Caspar covers 11,666 bp, encoding a transposase and CTG-2 proteins, and this transposon accounts for 40% of the DNA transposons. The in situ hybridization data for 2050O8 derived subclones in combination with the BLAST search against wheat mapped ESTs (expressed sequence tags) suggest that clone 2050O8 is located in the terminal bin 4BL-10 (0.95-1.0). Additionally, four of the predicted 2050O8 genes showed significant homology to four putative orthologous rice genes in the distal part of rice chromosome 3S and confirm the synteny to wheat 4BL. Satellite DNA sequences from the subtelomeric regions of diploid wheat progenitor can be used for selecting the BAC clones from the corresponding regions of hexaploid wheat chromosomes. It has been demonstrated for the first time that Spelt52 sequences were involved in the evolution of terminal regions of common wheat chromosomes. Our research provides new insights into the microcollinearity in the terminal regions of wheat chromosomes 4BL and rice chromosome 3S.

The complete mitochondrial genomes of three parasitic nematodes of birds: a unique gene order and insights into nematode phylogeny

PubMed Central

2013-01-01

Background Analyses of mitochondrial (mt) genome sequences in recent years challenge the current working hypothesis of Nematoda phylogeny proposed from morphology, ecology and nuclear small subunit rRNA gene sequences, and raise the need to sequence additional mt genomes for a broad range of nematode lineages. Results We sequenced the complete mt genomes of three Ascaridia species (family Ascaridiidae) that infest chickens, pigeons and parrots, respectively. These three Ascaridia species have an identical arrangement of mt genes to each other but differ substantially from other nematodes. Phylogenetic analyses of the mt genome sequences of the Ascaridia species, together with 62 other nematode species, support the monophylies of seven high-level taxa of the phylum Nematoda: 1) the subclass Dorylaimia; 2) the orders Rhabditida, Trichinellida and Mermithida; 3) the suborder Rhabditina; and 4) the infraorders Spiruromorpha and Oxyuridomorpha. Analyses of mt genome sequences, however, reject the monophylies of the suborders Spirurina and Tylenchina, and the infraorders Rhabditomorpha, Panagrolaimomorpha and Tylenchomorpha. Monophyly of the infraorder Ascaridomorpha varies depending on the methods of phylogenetic analysis. The Ascaridomorpha was more closely related to the infraorders Rhabditomorpha and Diplogasteromorpha (suborder Rhabditina) than they were to the other two infraorders of the Spirurina: Oxyuridorpha and Spiruromorpha. The closer relationship among Ascaridomorpha, Rhabditomorpha and Diplogasteromorpha was also supported by a shared common pattern of mitochondrial gene arrangement. Conclusions Analyses of mitochondrial genome sequences and gene arrangement has provided novel insights into the phylogenetic relationships among several major lineages of nematodes. Many lineages of nematodes, however, are underrepresented or not represented in these analyses. Expanding taxon sampling is necessary for future phylogenetic studies of nematodes with mt genome sequences. PMID:23800363
A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)

USDA-ARS?s Scientific Manuscript database

The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...
Application of whole genome shotgun sequencing for detection and characterization of genetically modified organisms and derived products.

PubMed

Holst-Jensen, Arne; Spilsberg, Bjørn; Arulandhu, Alfred J; Kok, Esther; Shi, Jianxin; Zel, Jana

2016-07-01

The emergence of high-throughput, massive or next-generation sequencing technologies has created a completely new foundation for molecular analyses. Various selective enrichment processes are commonly applied to facilitate detection of predefined (known) targets. Such approaches, however, inevitably introduce a bias and are prone to miss unknown targets. Here we review the application of high-throughput sequencing technologies and the preparation of fit-for-purpose whole genome shotgun sequencing libraries for the detection and characterization of genetically modified and derived products. The potential impact of these new sequencing technologies for the characterization, breeding selection, risk assessment, and traceability of genetically modified organisms and genetically modified products is yet to be fully acknowledged. The published literature is reviewed, and the prospects for future developments and use of the new sequencing technologies for these purposes are discussed.
Structural basis for the inhibition of poly(ADP-ribose) polymerases 1 and 2 by BMN 673, a potent inhibitor derived from dihydropyridophthalazinone

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aoyagi-Scharber, Mika, E-mail: maoyagi@bmrn.com; Gardberg, Anna S.; Yip, Bryan K.

2014-08-29

BMN 673, a novel PARP1/2 inhibitor in clinical development with substantial tumor cytotoxicity, forms extensive hydrogen-bonding and π-stacking in the nicotinamide pocket, with its unique disubstituted scaffold extending towards the less conserved edges of the pocket. These interactions might provide structural insight into the ability of BMN 673 to both inhibit catalysis and affect DNA-binding activity. Poly(ADP-ribose) polymerases 1 and 2 (PARP1 and PARP2), which are involved in DNA damage response, are targets of anticancer therapeutics. BMN 673 is a novel PARP1/2 inhibitor with substantially increased PARP-mediated tumor cytotoxicity and is now in later-stage clinical development for BRCA-deficient breast cancers.more » In co-crystal structures, BMN 673 is anchored to the nicotinamide-binding pocket via an extensive network of hydrogen-bonding and π-stacking interactions, including those mediated by active-site water molecules. The novel di-branched scaffold of BMN 673 extends the binding interactions towards the outer edges of the pocket, which exhibit the least sequence homology among PARP enzymes. The crystallographic structural analyses reported here therefore not only provide critical insights into the molecular basis for the exceptionally high potency of the clinical development candidate BMN 673, but also new opportunities for increasing inhibitor selectivity.« less
Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome

PubMed Central

Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

2014-01-01

Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895
Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

PubMed

Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

2014-01-01

Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.
The 2.1Å Crystal Structure of an Acyl-CoA Synthetase from Methanosarcina acetivorans reveals an alternate acyl binding pocket for small branched acyl substrates†,‡

PubMed Central

Shah, Manish B.; Ingram-Smith, Cheryl; Cooper, Leroy L.; Qu, Jun; Meng, Yu; Smith, Kerry S.; Gulick, Andrew M.

2009-01-01

The acyl-AMP forming family of adenylating enzymes catalyze two-step reactions to activate a carboxylate with the chemical energy derived from ATP hydrolysis. X-ray crystal structures have been determined for multiple members of this family and, together with biochemical studies, provide insights into the active site and catalytic mechanisms used by these enzymes. These studies have shown that the enzymes use a domain rotation of 140° to reconfigure a single active site to catalyze the two partial reactions. We present here the crystal structure of a new medium chain acyl-CoA synthetase from Methanosarcina acetivorans. The binding pocket for the three substrates is analyzed, with many conserved residues present in the AMP binding pocket. The CoA binding pocket is compared to the pockets of both acetyl-CoA synthetase and 4-chlorobenzoate:CoA ligase. Most interestingly, the acyl binding pocket of the new structure is compared with other acyl- and aryl-CoA synthetases. A comparison of the acyl-binding pocket of the acyl-CoA synthetase from M. acetivorans with other structures identifies a shallow pocket that is used to bind the medium chain carboxylates. These insights emphasize the high sequence and structural diversity among this family in the area of the acyl binding pocket. PMID:19544569
A β-solenoid model of the Pmel17 repeat domain: insights to the formation of functional amyloid fibrils

NASA Astrophysics Data System (ADS)

Louros, Nikolaos N.; Baltoumas, Fotis A.; Hamodrakas, Stavros J.; Iconomidou, Vassiliki A.

2016-02-01

Pmel17 is a multidomain protein involved in biosynthesis of melanin. This process is facilitated by the formation of Pmel17 amyloid fibrils that serve as a scaffold, important for pigment deposition in melanosomes. A specific luminal domain of human Pmel17, containing 10 tandem imperfect repeats, designated as repeat domain (RPT), forms amyloid fibrils in a pH-controlled mechanism in vitro and has been proposed to be essential for the formation of the fibrillar matrix. Currently, no three-dimensional structure has been resolved for the RPT domain of Pmel17. Here, we examine the structure of the RPT domain by performing sequence threading. The resulting model was subjected to energy minimization and validated through extensive molecular dynamics simulations. Structural analysis indicated that the RPT model exhibits several distinct properties of β-solenoid structures, which have been proposed to be polymerizing components of amyloid fibrils. The derived model is stabilized by an extensive network of hydrogen bonds generated by stacking of highly conserved polar residues of the RPT domain. Furthermore, the key role of invariant glutamate residues is proposed, supporting a pH-dependent mechanism for RPT domain assembly. Conclusively, our work attempts to provide structural insights into the RPT domain structure and to elucidate its contribution to Pmel17 amyloid fibril formation.
Probing nitrobenzhydrol uncaging mechanisms using FerriCast.

PubMed

Kennedy, Daniel P; Brown, Daniel C; Burdette, Shawn C

2010-10-15

The FerriCast derivative FC-NDBF was synthesized from 3-methyl-2-nitrodibenzofuran (NDBF). The photochemistry of the target Fe(3+) photocage and several related congeners provides mechanistic insight into the uncaging quantum yields of nitrobenzhydrol-derived ligands.
The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

PubMed Central

de Cambiaire, Jean-Charles; Otis, Christian; Turmel, Monique; Lemieux, Claude

2007-01-01

Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate that the IR was lost on at least two separate occasions. The intriguing similarities of the derived features exhibited by Leptosira cpDNA and its chlorophycean counterparts suggest that the same evolutionary forces shaped the IR-lacking chloroplast genomes in these two algal lineages. PMID:17610731
ASFinder: a tool for genome-wide identification of alternatively splicing transcripts from EST-derived sequences.

PubMed

Min, Xiang Jia

2013-01-01

Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
Insights into archaeal evolution and symbiosis from the genomes of a nanoarchaeon and its inferred crenarchaeal host from Obsidian Pool, Yellowstone National Park

PubMed Central

2013-01-01

Background A single cultured marine organism, Nanoarchaeum equitans, represents the Nanoarchaeota branch of symbiotic Archaea, with a highly reduced genome and unusual features such as multiple split genes. Results The first terrestrial hyperthermophilic member of the Nanoarchaeota was collected from Obsidian Pool, a thermal feature in Yellowstone National Park, separated by single cell isolation, and sequenced together with its putative host, a Sulfolobales archaeon. Both the new Nanoarchaeota (Nst1) and N. equitans lack most biosynthetic capabilities, and phylogenetic analysis of ribosomal RNA and protein sequences indicates that the two form a deep-branching archaeal lineage. However, the Nst1 genome is more than 20% larger, and encodes a complete gluconeogenesis pathway as well as the full complement of archaeal flagellum proteins. With a larger genome, a smaller repertoire of split protein encoding genes and no split non-contiguous tRNAs, Nst1 appears to have experienced less severe genome reduction than N. equitans. These findings imply that, rather than representing ancestral characters, the extremely compact genomes and multiple split genes of Nanoarchaeota are derived characters associated with their symbiotic or parasitic lifestyle. The inferred host of Nst1 is potentially autotrophic, with a streamlined genome and simplified central and energetic metabolism as compared to other Sulfolobales. Conclusions Comparison of the N. equitans and Nst1 genomes suggests that the marine and terrestrial lineages of Nanoarchaeota share a common ancestor that was already a symbiont of another archaeon. The two distinct Nanoarchaeota-host genomic data sets offer novel insights into the evolution of archaeal symbiosis and parasitism, enabling further studies of the cellular and molecular mechanisms of these relationships. Reviewers This article was reviewed by Patrick Forterre, Bettina Siebers (nominated by Michael Galperin) and Purification Lopez-Garcia PMID:23607440
A new RT-PCR assay for the identification of the predominant recombination types in 2C and 3D genomic regions of vaccine-derived poliovirus strains.

PubMed

Pliaka, V; Dedepsidis, E; Kyriakopoulou, Z; Mpirli, K; Tsakogiannis, D; Pratti, A; Levidiotou-Stefanou, S; Markoulatos, P

2010-06-01

In the post-eradication era of wild polioviruses, the only remaining sources of poliovirus infection worldwide would be the vaccine-derived polioviruses (VDPVs). As the preponderance of countries certified to be polio-free has switched from OPV (oral poliovirus vaccine) to IPV (inactivated poliovirus vaccine), importation of recombinant evolved derivatives of vaccinal strains would have serious implication for public health. To test the robustness of the proposed RT-PCR screening analysis, eleven recombinant vaccine-derived polioviruses that were characterized previously by sequencing by our group, in addition to three recently identified recombinant environmental isolates were assayed. Although the most definitive characterization of VDPVs is by genomic sequencing, in this study we describe a new, inexpensive and broadly applicable RT-PCR assay for the identification of the predominant recombination types S3/Sx in 2C and S2/Sx in 3D genomic regions respectively of VDPVs, that can be readily implemented in laboratories lacking sequencing facilities as a first approach for the early detection of vaccine-derived poliovirus (VDPVs).
Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome

PubMed Central

2011-01-01

Background One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents and, the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a future genome sequence for oak. PMID:21645357
Characterization and engineering of the biosynthesis gene cluster for antitumor macrolides PM100117 and PM100118 from a marine actinobacteria: generation of a novel improved derivative.

PubMed

Salcedo, Raúl García; Olano, Carlos; Gómez, Cristina; Fernández, Rogelio; Braña, Alfredo F; Méndez, Carmen; de la Calle, Fernando; Salas, José A

2016-02-22

PM100117 and PM100118 are glycosylated polyketides with remarkable antitumor activity, which derive from the marine symbiotic actinobacteria Streptomyces caniferus GUA-06-05-006A. Structurally, PM100117 and PM100118 are composed of a macrocyclic lactone, three deoxysugar units and a naphthoquinone (NQ) chromophore that shows a clear structural similarity to menaquinone. Whole-genome sequencing of S. caniferus GUA-06-05-006A has enabled the identification of PM100117 and PM100118 biosynthesis gene cluster, which has been characterized on the basis of bioinformatics and genetic engineering data. The product of four genes shows high identity to proteins involved in the biosynthesis of menaquinone via futalosine. Deletion of one of these genes led to a decay in PM100117 and PM100118 production, and to the accumulation of several derivatives lacking NQ. Likewise, five additional genes have been genetically characterized to be involved in the biosynthesis of this moiety. Moreover, the generation of a mutant in a gene coding for a putative cytochrome P450 has led to the production of PM100117 and PM100118 structural analogues showing an enhanced in vitro cytotoxic activity relative to the parental products. Although a number of compounds structurally related to PM100117 and PM100118 has been discovered, this is, to our knowledge, the first insight reported into their biosynthesis. The structural resemblance of the NQ moiety to menaquinone, and the presence in the cluster of four putative menaquinone biosynthetic genes, suggests a connection between the biosynthesis pathways of both compounds. The availability of the PM100117 and PM100118 biosynthetic gene cluster will surely pave a way to the combinatorial engineering of more derivatives.
Surface circulation in the Iroise Sea (western Brittany) derived from high resolution current mapping by HF radars

NASA Astrophysics Data System (ADS)

Sentchev, Alexei; Forget, Philippe; Barbin, Yves; Marié, Louis; Ardhuin, Fabrice

2010-05-01

The use of high frequency radar (HFR) systems for near-real-time coastal ocean monitoring necessities that short time scale motions of the radar-derived velocities are better understood. While the ocean radar systems are able to describe coastal flow patterns with unprecedented details, the data they produce are often too sparse or gappy for applications such as the identification of coherent structures and fronts or understanding transport and mixing processes. In this study, we address two challenges. First, we report results from the HF radar system (WERA) which is routinely operating since 2006 on the western Brittany coast to monitor surface circulation in the Iroise Sea, over an area extending up to 100 km offshore. To obtain more reliable records of vector current fields at high space and time resolution, the Multiple Signal Classification (MUSIC) direction finding algorithm is employed in conjunction with the variational interpolation (2dVar) of radar-derived velocities. This provides surface current maps at 1 km spacing and time resolution of 20 min. Removing the influence of the sea state on radar-derived current measurements is discussed and performed on some data sequences. Second, we examine in deep continuous 2d velocity records for a number of periods, exploring the different modes of variability of surface currents in the region. Given the extent, duration, and resolution of surface current velocity measurements, new quantitative insights from various time series and spatial analysis on higher frequency kinematics will be discussed. By better characterizing the full spectrum of flow regimes that contribute to the surface currents and their shears, a more complete picture of the circulation in the Iroise Sea can be obtained.
Robust temporal alignment of multimodal cardiac sequences

NASA Astrophysics Data System (ADS)

Perissinotto, Andrea; Queirós, Sandro; Morais, Pedro; Baptista, Maria J.; Monaghan, Mark; Rodrigues, Nuno F.; D'hooge, Jan; Vilaça, João. L.; Barbosa, Daniel

2015-03-01

Given the dynamic nature of cardiac function, correct temporal alignment of pre-operative models and intraoperative images is crucial for augmented reality in cardiac image-guided interventions. As such, the current study focuses on the development of an image-based strategy for temporal alignment of multimodal cardiac imaging sequences, such as cine Magnetic Resonance Imaging (MRI) or 3D Ultrasound (US). First, we derive a robust, modality-independent signal from the image sequences, estimated by computing the normalized cross-correlation between each frame in the temporal sequence and the end-diastolic frame. This signal is a resembler for the left-ventricle (LV) volume curve over time, whose variation indicates different temporal landmarks of the cardiac cycle. We then perform the temporal alignment of these surrogate signals derived from MRI and US sequences of the same patient through Dynamic Time Warping (DTW), allowing to synchronize both sequences. The proposed framework was evaluated in 98 patients, which have undergone both 3D+t MRI and US scans. The end-systolic frame could be accurately estimated as the minimum of the image-derived surrogate signal, presenting a relative error of 1.6 +/- 1.9% and 4.0 +/- 4.2% for the MRI and US sequences, respectively, thus supporting its association with key temporal instants of the cardiac cycle. The use of DTW reduces the desynchronization of the cardiac events in MRI and US sequences, allowing to temporally align multimodal cardiac imaging sequences. Overall, a generic, fast and accurate method for temporal synchronization of MRI and US sequences of the same patient was introduced. This approach could be straightforwardly used for the correct temporal alignment of pre-operative MRI information and intra-operative US images.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed Central

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS

PubMed Central

Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T.; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J.; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A.; Lempicki, Richard A.; Huang, Da Wei

2013-01-01

PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results. PMID:24179701

A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS.

PubMed

Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A; Lempicki, Richard A; Huang, Da Wei

2013-07-31

PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results.
Differential recognition of the ORF2 region in a complete genome sequence of porcine circovirus type 2 (PCV2) isolated from boar bone marrow in Korea.

PubMed

Kweon, Chang-Hee; Nguyen, Lien Thi Kim; Yoo, Mi-Sun; Kang, Seung-Won

2015-09-15

Porcine circovirus type 2 (PCV2) is the causative agent of post-weaning multisystemic wasting syndrome (PMWS) in swine. Here, a phylogenetic tree was constructed using PCV2 nucleotide sequences derived from the bone marrow of Korean boar and previously reported PCV2 sequences isolated from various countries. PCV2 from Korean boar bone marrow (KC188796) was classified into the group containing PCV2a-Canada and other PCV2 strain from Korea. While the ORF1 region of the PCV2 genome was highly conserved, ORF2 (the capsid protein coding region) was relatively variable. The nucleotide sequences for bone marrow-derived PCV2 were 93.4-99.0% homologous to the other reference sequences. The deduced amino acid sequences for the ORF1 and ORF2 coding regions were 97.4-99.3% and 84.5-97.4% homologous with the other reference strains, respectively, indicating that KC188796 did not differ markedly from the other PCV2 strains. Phylogenetic analysis demonstrated that bone marrow-derived PCV2 was highly similar to PCV2a from Canada and may be related to persistent PCV2 infections in swine. Copyright © 2015 Elsevier B.V. All rights reserved.
The Construction of Impossibility: A Logic-Based Analysis of Conjuring Tricks

PubMed Central

Smith, Wally; Dignum, Frank; Sonenberg, Liz

2016-01-01

Psychologists and cognitive scientists have long drawn insights and evidence from stage magic about human perceptual and attentional errors. We present a complementary analysis of conjuring tricks that seeks to understand the experience of impossibility that they produce. Our account is first motivated by insights about the constructional aspects of conjuring drawn from magicians' instructional texts. A view is then presented of the logical nature of impossibility as an unresolvable contradiction between a perception-supported belief about a situation and a memory-supported expectation. We argue that this condition of impossibility is constructed not simply through misperceptions and misattentions, but rather it is an outcome of a trick's whole structure of events. This structure is conceptualized as two parallel event sequences: an effect sequence that the spectator is intended to believe; and a method sequence that the magician understands as happening. We illustrate the value of this approach through an analysis of a simple close-up trick, Martin Gardner's Turnabout. A formalism called propositional dynamic logic is used to describe some of its logical aspects. This elucidates the nature and importance of the relationship between a trick's effect sequence and its method sequence, characterized by the careful arrangement of four evidence relationships: similarity, perceptual equivalence, structural equivalence, and congruence. The analysis further identifies two characteristics of magical apparatus that enable the construction of apparent impossibility: substitutable elements and stable occlusion. PMID:27378959
What can we learn about lyssavirus genomes using 454 sequencing?

PubMed

Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin

2012-01-01

The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.
The number of reduced alignments between two DNA sequences

PubMed Central

2014-01-01

Background In this study we consider DNA sequences as mathematical strings. Total and reduced alignments between two DNA sequences have been considered in the literature to measure their similarity. Results for explicit representations of some alignments have been already obtained. Results We present exact, explicit and computable formulas for the number of different possible alignments between two DNA sequences and a new formula for a class of reduced alignments. Conclusions A unified approach for a wide class of alignments between two DNA sequences has been provided. The formula is computable and, if complemented by software development, will provide a deeper insight into the theory of sequence alignment and give rise to new comparison methods. AMS Subject Classification Primary 92B05, 33C20, secondary 39A14, 65Q30 PMID:24684679
Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

PubMed

Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip

2018-05-06

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Construction of an Integrated High Density Simple Sequence Repeat Linkage Map in Cultivated Strawberry (Fragaria × ananassa) and its Applicability

PubMed Central

Isobe, Sachiko N.; Hirakawa, Hideki; Sato, Shusei; Maeda, Fumi; Ishikawa, Masami; Mori, Toshiki; Yamamoto, Yuko; Shirasawa, Kenta; Kimura, Mitsuhiro; Fukami, Masanobu; Hashizume, Fujio; Tsuji, Tomoko; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Tsuruoka, Hisano; Minami, Chiharu; Takahashi, Chika; Wada, Tsuyuko; Ono, Akiko; Kawashima, Kumiko; Nakazaki, Naomi; Kishida, Yoshie; Kohara, Mitsuyo; Nakayama, Shinobu; Yamada, Manabu; Fujishiro, Tsunakazu; Watanabe, Akiko; Tabata, Satoshi

2013-01-01

The cultivated strawberry (Fragaria× ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 of simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers. PMID:23248204
Genomic sequence of the xylose fermenting, insect-inhabitingyeast, Pichia stipitis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jeffries, Thomas W.; Grigoriev, Igor; Grimwood, Jane

2007-06-25

Xylose is a major constituent of angiosperm lignocellulose,so its fermentation is important for bioconversion to fuels andchemicals. Pichia stipitis is the best-studied native xylose fermentingyeast. Genes from P. stipitis have been used to engineer xylosemetabolism in Saccharomycescerevisiae, and the regulation of the P.stipitis genome offers insights into the mechanisms of xylose metabolismin yeasts. We have sequenced, assembled and finished the genome ofP.stipitis. As such, it is one of only a handful of completely finishedeukaryotic organisms undergoing analysis and manual curation. Thesequence has revealed aspects of genome organization, numerous genes forbiocoversion, preliminary insights into regulation of central metabolicpathways, numerous examples ofmore » co-localized genes with related functions,and evidence of how P. stipitis manages to achieve redox balance whilegrowing on xylose under microaerobic conditions.« less
Draft Genome Sequence of an Anaerobic and Extremophilic Bacterium, Caldanaerobacter yonseiensis, Isolated from a Geothermal Hot Stream

PubMed Central

Lee, Sang-Jae; Lee, Yong-Jik; Park, Gun-Seok; Kim, Byoung-Chan; Lee, Sang Jun; Shin, Jae-Ho

2013-01-01

Caldanaerobacter yonseiensis is a strictly anaerobic, thermophilic, spore-forming bacterium, which was isolated from a geothermal hot stream in Indonesia. This bacterium utilizes xylose and produces a variety of proteases. Here, we report the draft genome sequence of C. yonseiensis, which reveals insights into the pentose phosphate pathway and protein degradation metabolism in thermophilic microorganisms. PMID:24201201
Draft genome sequence of marine Streptomyces sp. strain W007, which produces angucyclinone antibiotics with a benz[a]anthracene skeleton.

PubMed

Qin, Song; Zhang, Hongyu; Li, Fuchao; Zhu, Benwei; Zheng, Huajun

2012-03-01

A series of angucyclinone antibiotics have been isolated from marine Streptomyces sp. strain W007 and identified. Here, a draft genome sequence of Streptomyces sp. W007 is presented. The genome contains an intact biosynthetic gene cluster for angucyclinone antibiotics, which provides insight into the combinatorial biosynthesis of angucyclinone antibiotics produced by marine streptomycetes.
Evidence of Anticipatory Eye Movements in the Spatial Hebb Repetition Effect: Insights for Modeling Sequence Learning

ERIC Educational Resources Information Center

Tremblay, Sebastien; Saint-Aubin, Jean

2009-01-01

In the present study, the authors offer a window onto the mechanisms that drive the Hebb repetition effect through the analysis of eye movement and recall performance. In a spatial serial recall task in which sequences of dots are to be remembered in order, when one particular series is repeated every 4 trials, memory performance markedly improves…
Draft Genome Sequence of the Marine Bacterium Pseudomonas aestusnigri VGXO14T.

PubMed

Gomila, Margarita; Mulet, Magdalena; Lalucat, Jorge; García-Valdés, Elena

2017-08-10

The type strain of Pseudomonas aestusnigri (VGXO14), isolated from a crude oil-polluted marine sand sample, is a member of the P. pertucinogena phylogenetic group. Here, we report the genome sequence (3.83 Mb) of P. aestusnigri to gain insights into the biology and taxonomy of marine Pseudomonas spp. adapted to polluted marine habitats. Copyright © 2017 Gomila et al.
Draft Genome Sequence of the Marine Bacterium Pseudomonas aestusnigri VGXO14T

PubMed Central

2017-01-01

ABSTRACT The type strain of Pseudomonas aestusnigri (VGXO14), isolated from a crude oil-polluted marine sand sample, is a member of the P. pertucinogena phylogenetic group. Here, we report the genome sequence (3.83 Mb) of P. aestusnigri to gain insights into the biology and taxonomy of marine Pseudomonas spp. adapted to polluted marine habitats. PMID:28798177
Draft Genome Sequence of Oil-Degrading Bacterium Gallaecimonas pentaromativorans Strain YA_1 from the Southwest Indian Ocean

PubMed Central

Xu, Yiyuan; Ren, Chong; Chen, Ruixuan

2016-01-01

Gallaecimonas pentaromativorans has been previously reported to be capable of degrading crude oil and diesel oil. G. pentaromativorans strain YA_1 was isolated from the southwest Indian Ocean and can degrade crude oil. This study reports the draft genome sequence of G. pentaromativorans, which can provide insights into the mechanisms of microbial oil biodegradation. PMID:27491993
Rhipicephalus (Boophilus) microplus strain Deutsch, 5 BAC clone sequencing, including two encoding Cytochrome P450s and one encoding CzEst9 carboxylesterase

USDA-ARS?s Scientific Manuscript database

The cattle tick, Rhipicephalus (Boophilus) microplus, has a genome over 2.4 times the size of the human genome, and with over 70% of repetitive DNA, this genome would prove very costly to sequence at today's prices and difficult to assemble and analyze. BAC clones give insight into the genome struct...
Draft Genome Sequence of the Psychrophilic and Alkaliphilic Rhodonellum psychrophilum Strain GCM71T

PubMed Central

Hauptmann, Aviaja L.; Glaring, Mikkel A.; Hallin, Peter F.; Priemé, Anders

2013-01-01

Rhodonellum psychrophilum GCM71T, isolated from the cold and alkaline submarine ikaite columns in the Ikka Fjord in Greenland, displays optimal growth at 5 to 10°C and pH 10. Here, we report the draft genome sequence of this strain, which may provide insight into the mechanisms of adaptation to these extreme conditions. PMID:24309741
Subgenome-specific assembly of vitamin E biosynthesis genes and expression patterns during seed development provide insight into the evolution of the oat genome

USDA-ARS?s Scientific Manuscript database

Vitamin E is essential for humans and thus must be a component of a healthy diet. Among the cereal grains, hexaploid oats (Avena sativa L.) have high vitamin E content. To date, no gene sequences in the vitamin E biosynthesis pathway have been reported for oats. Using deep sequencing and orthology-g...
A century of typhus, lice and Rickettsia.

PubMed

Andersson, J O; Andersson, S G

2000-03-01

At the beginning of the 20th century, it was discovered at the Pasteur Institute in Tunis that epidemic typhus is transmitted by the human body louse. The complete genome sequence of its causative agent, Rickettsia prowazekii, was determined at Uppsala University in Sweden at the end of the century. In this mini-review, we discuss insights gained from the genome sequence of this fascinating and deadly organism.
Draft genome sequence of Inquilinus limosus strain MP06, a multidrug-resistant clinical isolate

PubMed Central

Pino, Marylú; Conza, José Di; Gutkind, Gabriel

2015-01-01

The bacterium, Inquilinus limosus, with its remarkable antimicrobial multiresistant profile, has increasingly been isolated in cystic fibrosis patients. We report draft genome sequence of a strain MP06, which is of considerable interest in elucidating the associated mechanisms of antibiotic resistance in this bacterium and for an insight about its persistence in airways of these patients. PMID:26691451
A Chill Sequence to the Bushveld Complex - Insight into the First Stages of Emplacement and the Parental Magmas to the World's Largest Layered Intrusion

NASA Astrophysics Data System (ADS)

Wilson, A.

2012-04-01

Evidence of the initial stages of magma emplacement in large mafic chambers is commonly lacking because of resorption of early-formed chills and complicated by the fact that the first magmas that entered the chamber were usually more evolved than the true parental magma. Deep drilling has revealed a rare occurrence of a chill sequence from the eastern Bushveld Complex at the base of a previously unrecognized thick succession of ultramafic rocks that forms part of the Lower Zone. The chill sequence (1.8 m thick) includes a true chill against quartzite floor rock, crystalline quench textured and orthopyroxene spinifex textured rocks. Importantly the chill composition represents a relatively evolved magma formed by the separation of high-Mg olivines prior to its emplacement, probably in a conduit or a pre-chamber. An overlying pyroxene dunite represents the extract that gave rise to the chill and was emplaced either as a crystal slurry derived from the feeder conduit or as the crystallization product from a slightly later influx of primitive magma of komatiitic composition. This highly-Mg rich pyroxene dunite most likely acted as a barrier to the thermal erosion of the chill sequence as the chamber filled. The olivine in the pyroxene dunite layer is the most primitive yet recorded for the Bushveld Complex at Mg# 0.915, and the cores of associated orthopyroxene are Mg# 0.93. Compositions of the orthopyroxene in the quench and spinifex textured units range from Mg# 0.91 to 0.72 and preserve cores close to the original liquidus as well as tracking the complete in-situ solidification process. Olivine contains abundant dendritic exsolution structures of Cr-spinel and Al-rich clinopyroxene indicating that they formed at high temperature from incorporation of Ca, Al and Cr into olivine, with little time to equilibrate before emplacement. Chromite in the section is the most primitive yet recorded for the Bushveld Complex. The komatiite magma that was initially emplaced into the Bushveld chamber contained 19-20% MgO but trace element analysis indicates that it was derived from melting of a more primitive komatiite source which digested about 40% of typical Kaapvaal basement to give the strong crustal signature represented by trace elements and Sr isotopes. The evolved B1 magma, which compositionally is only broadly constrained, is regarded as the parental magma to the Lower and Critical Zones, but this is shown to represent a number of different magmas also derived from a komatiitic source with relatively high degrees of crustal contamination. The komatiite source to the Bushveld magmas could have been derived from subducted Archean ocean crust such as the silica- rich but highly depleted Commondale-type komatiites, as well as Barberton-type komatiites and komatiitic basalts. A mantle peridotite source is not considered a suitable bulk source because the Ni content in the Bushveld olivines (up to 4000 ppm) is indicative of a pyroxenite source in the mantle.

Continuum theory for cluster morphologies of soft colloids.

PubMed

Kosmrlj, A; Pauschenwein, G J; Kahl, G; Ziherl, P

2011-06-09

We introduce a continuum description of the thermodynamics of colloids with a core-corona architecture. In the case of thick coronas, their overlap can be treated approximately by replacing the exact one-particle density distribution by a suitably shaped step profile, which provides a convenient way of modeling the spherical, columnar, lamellar, and inverted cluster morphologies predicted by numerical simulations and the more involved theories. We use the model to study monodisperse particles with the hard-core/square-shoulder pair interaction as the simplest representatives of the core-corona class. We derive approximate analytical expressions for the enthalpies of the cluster morphologies which offer a clear insight into the mechanisms at work, and we calculate the lattice spacing and the cluster size for all morphologies of the phase sequence as well as the phase-transition pressures. By comparing the results with the exact crystalline minimum-enthalpy configurations, we show that the accuracy of the theory increases with shoulder width. We discuss possible extensions of the theory that could account for the finite-temperature effects.
Single-Cell Analysis of Human Pancreas Reveals Transcriptional Signatures of Aging and Somatic Mutation Patterns.

PubMed

Enge, Martin; Arda, H Efsun; Mignardi, Marco; Beausang, John; Bottino, Rita; Kim, Seung K; Quake, Stephen R

2017-10-05

As organisms age, cells accumulate genetic and epigenetic errors that eventually lead to impaired organ function or catastrophic transformation such as cancer. Because aging reflects a stochastic process of increasing disorder, cells in an organ will be individually affected in different ways, thus rendering bulk analyses of postmitotic adult cells difficult to interpret. Here, we directly measure the effects of aging in human tissue by performing single-cell transcriptome analysis of 2,544 human pancreas cells from eight donors spanning six decades of life. We find that islet endocrine cells from older donors display increased levels of transcriptional noise and potential fate drift. By determining the mutational history of individual cells, we uncover a novel mutational signature in healthy aging endocrine cells. Our results demonstrate the feasibility of using single-cell RNA sequencing (RNA-seq) data from primary cells to derive insights into genetic and transcriptional processes that operate on aging human tissue. Copyright © 2017 Elsevier Inc. All rights reserved.
Genetic dissection of agronomically important traits in closely related temperate japonica rice cultivars

PubMed Central

Hori, Kiyosumi; Yamamoto, Toshio; Yano, Masahiro

2017-01-01

Many quantitative trait loci (QTLs) for agronomically important traits such as grain yield, disease resistance, and stress tolerance of rice (Oryza sativa L.) have been detected by using segregating populations derived from crosses between indica and japonica subspecies or with wild relatives. However, the QTLs involved in the control of natural variation in agronomic traits among closely related cultivars are still unclear. Decoding the whole genome sequences of Nipponbare and other temperate japonica rice cultivars has accelerated the collection of a huge number of single nucleotide polymorphisms (SNPs). These SNPs are good resource for developing polymorphic DNA markers and for detecting QTLs distributed across all rice chromosomes. The temperate japonica rice cultivar Koshihikari has remained the top cultivar for about 40 years since 1979 in Japan. Unraveling the genetic factors in Koshihikari will provide important insights into improving agronomic traits in temperate japonica rice cultivars. Here we describe recent progress in our studies as an example of genetic analysis in closely related cultivars. PMID:29398936
DOE Office of Scientific and Technical Information (OSTI.GOV)

Bae, Brian; Cobb, Ryan E.; DeSieno, Matthew A.

The enzyme FrbF from Streptomyces rubellomurinus has attracted significant attention due to its role in the biosynthesis of the antimalarial phosphonate FR-900098. The enzyme catalyzes acetyl transfer onto the hydroxamate of the FR-900098 precursors cytidine 5'-monophosphate-3-aminopropylphosphonate and cytidine 5'-monophosphate-N-hydroxy-3-aminopropylphosphonate. Despite the established function as a bona fide N-acetyltransferase, FrbF shows no sequence similarity to any member of the GCN5-like N-acetyltransferase (GNAT) superfamily. Here, we present the 2.0 {angstrom} resolution crystal structure of FrbF in complex with acetyl-CoA, which demonstrates a unique architecture that is distinct from those of canonical GNAT-like acetyltransferases. We also utilized the co-crystal structure to guide structure-functionmore » studies that identified the roles of putative active site residues in the acetyltransferase mechanism. The combined biochemical and structural analyses of FrbF provide insights into this previously uncharacterized family of N-acetyltransferases and also provide a molecular framework toward the production of novel N-acyl derivatives of FR-900098.« less
Iron Age and Anglo-Saxon genomes from East England reveal British migration history

PubMed Central

Schiffels, Stephan; Haak, Wolfgang; Paajanen, Pirita; Llamas, Bastien; Popescu, Elizabeth; Loe, Louise; Clarke, Rachel; Lyons, Alice; Mortimer, Richard; Sayer, Duncan; Tyler-Smith, Chris; Cooper, Alan; Durbin, Richard

2016-01-01

British population history has been shaped by a series of immigrations, including the early Anglo-Saxon migrations after 400 CE. It remains an open question how these events affected the genetic composition of the current British population. Here, we present whole-genome sequences from 10 individuals excavated close to Cambridge in the East of England, ranging from the late Iron Age to the middle Anglo-Saxon period. By analysing shared rare variants with hundreds of modern samples from Britain and Europe, we estimate that on average the contemporary East English population derives 38% of its ancestry from Anglo-Saxon migrations. We gain further insight with a new method, rarecoal, which infers population history and identifies fine-scale genetic ancestry from rare variants. Using rarecoal we find that the Anglo-Saxon samples are closely related to modern Dutch and Danish populations, while the Iron Age samples share ancestors with multiple Northern European populations including Britain. PMID:26783965
Focused Review: Cytotoxic and Antioxidant Potentials of Mangrove-Derived Streptomyces

PubMed Central

Ser, Hooi-Leng; Tan, Loh Teng-Hern; Law, Jodi Woan-Fei; Chan, Kok-Gan; Duangjai, Acharaporn; Saokaew, Surasak; Pusparajah, Priyia; Ab Mutalib, Nurul-Syakima; Khan, Tahir Mehmood; Goh, Bey-Hing; Lee, Learn-Han

2017-01-01

Human life expectancy is rapidly increasing with an associated increasing burden of chronic diseases, such as neurodegenerative diseases and cancer. However, there is limited progress in finding effective treatment for these conditions. For this reason, members of the genus Streptomyces have been explored extensively over the past decades as these filamentous bacteria are highly efficient in producing bioactive compounds with human health benefits. Being ubiquitous in nature, streptomycetes can be found in both terrestrial and marine environments. Previously, two Streptomyces strains (MUSC 137T and MUM 256) isolated from mangrove sediments in Peninsular Malaysia demonstrated potent antioxidant and cytotoxic activities against several human cancer cell lines on bioactivity screening. These results illustrate the importance of streptomycetes from underexplored regions aside from the terrestrial ecosystem. Here we provide the insights and significance of Streptomyces species in the search of anticancer and/or chemopreventive agents and highlight the impact of next generation sequencing on drug discovery from the Streptomyces arsenal. PMID:29163380
FOX and ETS family transcription factors regulate the pigment cell lineage in planarians.

PubMed

He, Xinwen; Lindsay-Mosher, Nicole; Li, Yan; Molinaro, Alyssa M; Pellettieri, Jason; Pearson, Bret J

2017-12-15

Many pigment cells acquire unique structural properties and gene expression profiles during animal development. The underlying differentiation pathways have been well characterized in cells formed during embryogenesis, such as the neural crest-derived melanocyte. However, much less is known about the developmental origins of pigment cells produced in adult organisms during tissue homeostasis and repair. Here we report a lineage analysis of ommochrome- and porphyrin-producing cells in the brown, freshwater planarian Schmidtea mediterranea Using an RNA-sequencing approach, we identified two classes of markers expressed in sequential fashion when new pigment cells are generated during regeneration or in response to pigment cell ablation. We also report roles for FOXF-1 and ETS-1 transcription factors, as well as for an FGFR-like molecule, in the specification and maintenance of this cell type. Together, our results provide insights into mechanisms of adult pigment cell development in the strikingly colorful Platyhelminthes phylum. © 2017. Published by The Company of Biologists Ltd.
Chromatin Remodeling BAF (SWI/SNF) Complexes in Neural Development and Disorders

PubMed Central

Sokpor, Godwin; Xie, Yuanbin; Rosenbusch, Joachim; Tuoc, Tran

2017-01-01

The ATP-dependent BRG1/BRM associated factor (BAF) chromatin remodeling complexes are crucial in regulating gene expression by controlling chromatin dynamics. Over the last decade, it has become increasingly clear that during neural development in mammals, distinct ontogenetic stage-specific BAF complexes derived from combinatorial assembly of their subunits are formed in neural progenitors and post-mitotic neural cells. Proper functioning of the BAF complexes plays critical roles in neural development, including the establishment and maintenance of neural fates and functionality. Indeed, recent human exome sequencing and genome-wide association studies have revealed that mutations in BAF complex subunits are linked to neurodevelopmental disorders such as Coffin-Siris syndrome, Nicolaides-Baraitser syndrome, Kleefstra's syndrome spectrum, Hirschsprung's disease, autism spectrum disorder, and schizophrenia. In this review, we focus on the latest insights into the functions of BAF complexes during neural development and the plausible mechanistic basis of how mutations in known BAF subunits are associated with certain neurodevelopmental disorders. PMID:28824374
Chromatin Remodeling BAF (SWI/SNF) Complexes in Neural Development and Disorders.

PubMed

Sokpor, Godwin; Xie, Yuanbin; Rosenbusch, Joachim; Tuoc, Tran

2017-01-01

The ATP-dependent BRG1/BRM associated factor (BAF) chromatin remodeling complexes are crucial in regulating gene expression by controlling chromatin dynamics. Over the last decade, it has become increasingly clear that during neural development in mammals, distinct ontogenetic stage-specific BAF complexes derived from combinatorial assembly of their subunits are formed in neural progenitors and post-mitotic neural cells. Proper functioning of the BAF complexes plays critical roles in neural development, including the establishment and maintenance of neural fates and functionality. Indeed, recent human exome sequencing and genome-wide association studies have revealed that mutations in BAF complex subunits are linked to neurodevelopmental disorders such as Coffin-Siris syndrome, Nicolaides-Baraitser syndrome, Kleefstra's syndrome spectrum, Hirschsprung's disease, autism spectrum disorder, and schizophrenia. In this review, we focus on the latest insights into the functions of BAF complexes during neural development and the plausible mechanistic basis of how mutations in known BAF subunits are associated with certain neurodevelopmental disorders.
Glycomacropeptide Sustains Microbiota Diversity and Promotes Specific Taxa in an Artificial Colon Model of Elderly Gut Microbiota.

PubMed

Ntemiri, Alexandra; Chonchúir, Fodhla Ní; O'Callaghan, Tom F; Stanton, Catherine; Ross, R Paul; O'Toole, Paul W

2017-03-01

The potential of milk-derived glycomacropeptide (GMP) and lactose for modulating the human gut microbiota of older people, in whom loss of diversity correlates with inferior health, was investigated. We used an in vitro batch fermentation (artificial colon model) to simulate colonic fermentation processes of two GMP products, i.e., a commercially available GMP concentrate and a semipurified GMP concentrate, and lactose. Faecal samples were collected from healthy and frail older people. Samples were analyzed by Illumina Miseq sequencing of rRNA gene amplicons. The commercial GMP preparation had a positive effect on the growth of Coprococcus and Clostridium cluster XIVb and sustained a higher faecal microbiota diversity compared to control substrates or lactose. Lactose fermentation promoted the growth of Proteobacteria including Escherichia/Shigella. This work provides an in-depth insight on the potential of GMP and lactose for modulating the gut microbiota and contributes more evidence confirming the prebiotic activity of GMP.
A PDGF/VEGF homologue provides new insights into the nucleus grafting operation and immune response in the pearl oyster Pinctada fucata.

PubMed

Huang, Xian-De; Zhang, Hua; He, Mao-Xian

2017-12-30

The platelet-derived growth factor/vascular endothelial growth factor (PDGF/VEGF, PVF) family of proteins have been implicated in a wide range of biological functions in vertebrates, including cell proliferation, cell differentiation, cell migration, neural development and especially angiogenesis/vasculogenesis. In this study, a PVF gene, belonging to the PDGF/VEGF family, was cloned and characterized from Pinctada fucata. It contained an ORF of 1110bp encoding a putative protein of 369 amino acids. The deduced amino acid sequence presented the typical structural features of PDGF family members and the N-terminal signal peptide for secretion. Comparative phylogenetic analysis revealed that PfPVF shows relatively high identity with other invertebrate PVF homologues. Furthermore, gene expression analysis revealed that PfPVF is involved in not only the nucleus grafting operation and but also the response to immune stimulation. The study may help to increase understanding of the functions of molluscan PVF. Copyright © 2017 Elsevier B.V. All rights reserved.
How to design 13C para-hydrogen-induced polarization experiments for MRI applications.

PubMed

Reineri, Francesca; Viale, Alessandra; Dastrù, Walter; Gobetto, Roberto; Aime, Silvio

2011-01-01

The application of hyperpolarization techniques for MRI purposes is gathering increasing attention, especially for nuclei such as (13)C or (129)Xe. Among the different proposed methods, ParaHydrogen Induced Polarization requires relatively cheap equipment. The setup of an MRI experiment by means of parahydrogen requires the application of skills and methodologies that derive from different fields of knowledge. The basic theory and a practical insight of this method are presented here. Parahydrogenation of alkynes, having a labelled (13)CO group adjacent to the triple bond, catalyzed by Rh(I) complexes containing a chelating phosphine, represents the best choice for producing and maintaining high heteronuclear polarization effect. In order to transform anti-phase into in-phase (net) (13)C polarization for MRI application it is necessary to set up the described magnetic field cycle procedure. In vitro and in vivo images have been acquired using fast imaging sequences (RARE and trueFISP). Copyright © 2010 John Wiley & Sons, Ltd.
A comparative genomic hybridization approach to study gene copy number variations among Chinese hamster cell lines.

PubMed

Vishwanathan, Nandita; Bandyopadhyay, Arpan; Fu, Hsu-Yuan; Johnson, Kathryn C; Springer, Nathan M; Hu, Wei-Shou

2017-08-01

Chinese Hamster Ovary (CHO) cells are aneuploid in nature. The genome of recombinant protein producing CHO cell lines continuously undergoes changes in its structure and organization. We analyzed nine cell lines, including parental cell lines, using a comparative genomic hybridization (CGH) array focused on gene-containing regions. The comparison of CGH with copy-number estimates from sequencing data showed good correlation. Hierarchical clustering of the gene copy number variation data from CGH data revealed the lineage relationships between the cell lines. On analyzing the clones of a clonal population, some regions with altered genomic copy number status were identified indicating genomic changes during passaging. A CGH array is thus an effective tool in quantifying genomic alterations in industrial cell lines and can provide insights into the changes in the genomic structure during cell line derivation and long term culture. Biotechnol. Bioeng. 2017;114: 1903-1908. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
A DNA methylation map of human cancer at single base-pair resolution.

PubMed

Vidal, E; Sayols, S; Moran, S; Guillaumet-Adkins, A; Schroeder, M P; Royo, R; Orozco, M; Gut, M; Gut, I; Lopez-Bigas, N; Heyn, H; Esteller, M

2017-10-05

Although single base-pair resolution DNA methylation landscapes for embryonic and different somatic cell types provided important insights into epigenetic dynamics and cell-type specificity, such comprehensive profiling is incomplete across human cancer types. This prompted us to perform genome-wide DNA methylation profiling of 22 samples derived from normal tissues and associated neoplasms, including primary tumors and cancer cell lines. Unlike their invariant normal counterparts, cancer samples exhibited highly variable CpG methylation levels in a large proportion of the genome, involving progressive changes during tumor evolution. The whole-genome sequencing results from selected samples were replicated in a large cohort of 1112 primary tumors of various cancer types using genome-scale DNA methylation analysis. Specifically, we determined DNA hypermethylation of promoters and enhancers regulating tumor-suppressor genes, with potential cancer-driving effects. DNA hypermethylation events showed evidence of positive selection, mutual exclusivity and tissue specificity, suggesting their active participation in neoplastic transformation. Our data highlight the extensive changes in DNA methylation that occur in cancer onset, progression and dissemination.
Aqueous mineralogy and stratigraphy at and around the proposed Mawrth Vallis MSL Landing Site: New insights into the aqueous history of the region

USGS Publications Warehouse

Dobrea, Eldar Z. Noe; Michalski, Joseph; Swayze, Gregg

2011-01-01

In this work, we have confirmed the mineralogical stratigraphy previously inferred by other authors, but also demonstrate the presence of additional minerals, including a possible acid-leaching product near the top of the sequence, an Mh-OH bearing phyllosilicate at the to of the sequence, and potentially a Ca-sulfate at the bottom of the phyllosilicate sequence. The latter has important implications regarding the relative timing of sulfate vs clay formation on Mars.
Genome Sequence of the White Koji Mold Aspergillus kawachii IFO 4308, Used for Brewing the Japanese Distilled Spirit Shochu

PubMed Central

Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

2011-01-01

The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications. PMID:22045919
The genome of the sea urchin Strongylocentrotus purpuratus.

PubMed

Sodergren, Erica; Weinstock, George M; Davidson, Eric H; Cameron, R Andrew; Gibbs, Richard A; Angerer, Robert C; Angerer, Lynne M; Arnone, Maria Ina; Burgess, David R; Burke, Robert D; Coffman, James A; Dean, Michael; Elphick, Maurice R; Ettensohn, Charles A; Foltz, Kathy R; Hamdoun, Amro; Hynes, Richard O; Klein, William H; Marzluff, William; McClay, David R; Morris, Robert L; Mushegian, Arcady; Rast, Jonathan P; Smith, L Courtney; Thorndyke, Michael C; Vacquier, Victor D; Wessel, Gary M; Wray, Greg; Zhang, Lan; Elsik, Christine G; Ermolaeva, Olga; Hlavina, Wratko; Hofmann, Gretchen; Kitts, Paul; Landrum, Melissa J; Mackey, Aaron J; Maglott, Donna; Panopoulou, Georgia; Poustka, Albert J; Pruitt, Kim; Sapojnikov, Victor; Song, Xingzhi; Souvorov, Alexandre; Solovyev, Victor; Wei, Zheng; Whittaker, Charles A; Worley, Kim; Durbin, K James; Shen, Yufeng; Fedrigo, Olivier; Garfield, David; Haygood, Ralph; Primus, Alexander; Satija, Rahul; Severson, Tonya; Gonzalez-Garay, Manuel L; Jackson, Andrew R; Milosavljevic, Aleksandar; Tong, Mark; Killian, Christopher E; Livingston, Brian T; Wilt, Fred H; Adams, Nikki; Bellé, Robert; Carbonneau, Seth; Cheung, Rocky; Cormier, Patrick; Cosson, Bertrand; Croce, Jenifer; Fernandez-Guerra, Antonio; Genevière, Anne-Marie; Goel, Manisha; Kelkar, Hemant; Morales, Julia; Mulner-Lorillon, Odile; Robertson, Anthony J; Goldstone, Jared V; Cole, Bryan; Epel, David; Gold, Bert; Hahn, Mark E; Howard-Ashby, Meredith; Scally, Mark; Stegeman, John J; Allgood, Erin L; Cool, Jonah; Judkins, Kyle M; McCafferty, Shawn S; Musante, Ashlan M; Obar, Robert A; Rawson, Amanda P; Rossetti, Blair J; Gibbons, Ian R; Hoffman, Matthew P; Leone, Andrew; Istrail, Sorin; Materna, Stefan C; Samanta, Manoj P; Stolc, Viktor; Tongprasit, Waraporn; Tu, Qiang; Bergeron, Karl-Frederik; Brandhorst, Bruce P; Whittle, James; Berney, Kevin; Bottjer, David J; Calestani, Cristina; Peterson, Kevin; Chow, Elly; Yuan, Qiu Autumn; Elhaik, Eran; Graur, Dan; Reese, Justin T; Bosdet, Ian; Heesun, Shin; Marra, Marco A; Schein, Jacqueline; Anderson, Michele K; Brockton, Virginia; Buckley, Katherine M; Cohen, Avis H; Fugmann, Sebastian D; Hibino, Taku; Loza-Coll, Mariano; Majeske, Audrey J; Messier, Cynthia; Nair, Sham V; Pancer, Zeev; Terwilliger, David P; Agca, Cavit; Arboleda, Enrique; Chen, Nansheng; Churcher, Allison M; Hallböök, F; Humphrey, Glen W; Idris, Mohammed M; Kiyama, Takae; Liang, Shuguang; Mellott, Dan; Mu, Xiuqian; Murray, Greg; Olinski, Robert P; Raible, Florian; Rowe, Matthew; Taylor, John S; Tessmar-Raible, Kristin; Wang, D; Wilson, Karen H; Yaguchi, Shunsuke; Gaasterland, Terry; Galindo, Blanca E; Gunaratne, Herath J; Juliano, Celina; Kinukawa, Masashi; Moy, Gary W; Neill, Anna T; Nomura, Mamoru; Raisch, Michael; Reade, Anna; Roux, Michelle M; Song, Jia L; Su, Yi-Hsien; Townley, Ian K; Voronina, Ekaterina; Wong, Julian L; Amore, Gabriele; Branno, Margherita; Brown, Euan R; Cavalieri, Vincenzo; Duboc, Véronique; Duloquin, Louise; Flytzanis, Constantin; Gache, Christian; Lapraz, François; Lepage, Thierry; Locascio, Annamaria; Martinez, Pedro; Matassi, Giorgio; Matranga, Valeria; Range, Ryan; Rizzo, Francesca; Röttinger, Eric; Beane, Wendy; Bradham, Cynthia; Byrum, Christine; Glenn, Tom; Hussain, Sofia; Manning, Gerard; Miranda, Esther; Thomason, Rebecca; Walton, Katherine; Wikramanayke, Athula; Wu, Shu-Yu; Xu, Ronghui; Brown, C Titus; Chen, Lili; Gray, Rachel F; Lee, Pei Yun; Nam, Jongmin; Oliveri, Paola; Smith, Joel; Muzny, Donna; Bell, Stephanie; Chacko, Joseph; Cree, Andrew; Curry, Stacey; Davis, Clay; Dinh, Huyen; Dugan-Rocha, Shannon; Fowler, Jerry; Gill, Rachel; Hamilton, Cerrissa; Hernandez, Judith; Hines, Sandra; Hume, Jennifer; Jackson, Laronda; Jolivet, Angela; Kovar, Christie; Lee, Sandra; Lewis, Lora; Miner, George; Morgan, Margaret; Nazareth, Lynne V; Okwuonu, Geoffrey; Parker, David; Pu, Ling-Ling; Thorn, Rachel; Wright, Rita

2006-11-10

We report the sequence and analysis of the 814-megabase genome of the sea urchin Strongylocentrotus purpuratus, a model for developmental and systems biology. The sequencing strategy combined whole-genome shotgun and bacterial artificial chromosome (BAC) sequences. This use of BAC clones, aided by a pooling strategy, overcame difficulties associated with high heterozygosity of the genome. The genome encodes about 23,300 genes, including many previously thought to be vertebrate innovations or known only outside the deuterostomes. This echinoderm genome provides an evolutionary outgroup for the chordates and yields insights into the evolution of deuterostomes.
Bed roughness of palaeo-ice streams: insights and implications for contemporary ice sheet dynamics

NASA Astrophysics Data System (ADS)

Falcini, Francesca; Rippin, David; Selby, Katherine; Krabbendam, Maarten

2017-04-01

Bed roughness is the vertical variation of elevation along a horizontal transect. It is an important control on ice stream location and dynamics, with a correspondingly important role in determining the behaviour of ice sheets. Previous studies of bed roughness have been limited to insights derived from Radio Echo Sounding (RES) profiles across parts of Antarctica and Greenland. Such an approach has been necessary due to the inaccessibility of the underlying bed. This approach has led to important insights, such as identifying a general link between smooth beds and fast ice flow, as well as rough beds and slow ice flow. However, these insights are mainly derived from relatively coarse datasets, so that links between roughness and flow are generalised and rather simplistic. Here, we explore the use of DTMs from the well-preserved footprints of palaeo-ice streams, coupled with high resolution models of palaeo-ice flow, as a tool for investigating basal controls on the behaviour of contemporary, active ice streams in much greater detail. Initially, artificial transects were set up across the Minch palaeo-ice stream (NW Scotland) to mimic RES flight lines from past studies in Antarctica. We then explored how increasing data-resolution impacted upon the roughness measurements that were derived. Our work on the Minch palaeo-ice stream indicates that different roughness signatures are associated with different glacial landforms, and we discuss the potential for using these insights to infer, from RES-based roughness measurements, the occurrence of particular landform assemblages that may exist beneath contemporary ice sheets.
Characterization of EST-derived and non-EST simple sequence repeats in an F₁ hybrid population of Vitis vinifera L.

PubMed

Kayesh, E; Bilkish, N; Liu, G S; Chen, W; Leng, X P; Fang, J G

2014-03-31

Among different classes of molecular markers, expressed sequence tags (ESTs) are a new resource for developing simple sequence repeat (SSR) functional markers for genotyping and genetic mapping in F1 hybrid populations of Vitis vinifera L. Recently, because of the availability of an enormous amount of data for ESTs in the public domain, the emphasis has shifted from genomic SSRs to EST-SSRs, which belong to transcribed regions of the genome and may have a role in gene expression or function. The objective of this study was to assess the polymorphisms among 94 F1 hybrids from "Early Rose" and "Red Globe" using 25 EST-derived and 25 non-EST SSR markers. A total collection of 362,375 grape ESTs that were retrieved from the National Center for Biotechnology Information (NCBI) and 2522 EST-SSR sequences were identified. From them, 205 primer pairs were randomly selected, including 176 pairs that were EST-derived and 29 non-EST SSR primer pairs, for polymerase chain reaction amplification. A total of 131 alleles were amplified using 50 pairs of primers; 78 alleles were amplified using EST-derived SSR primers and 53 were from non-EST SSR primers. At most, 6 and 5 alleles were amplified by EST-derived and non-EST SSR primers, respectively. The EST-derived SSR markers showed a maximum polymorphic information content (PIC) value of 1 and a minimum of 0.33 while non-EST SSR markers had maximum and minimum PIC values of 1 and 0.25, respectively. The average PIC value was 0.56 for EST-derived SSR markers and 0.45 for non-EST SSR markers.
Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

NASA Technical Reports Server (NTRS)

Gatlin, L. L.

1974-01-01

Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

The complete plastome sequence of Rubus takesimensis endemic to Ulleung Island, Korea: Insights into molecular evolution of anagenetically derived species in Rubus (Rosaceae).

PubMed

Yang, Ji Young; Pak, Jae-Hong; Kim, Seung-Chul

2018-08-20

Previous phylogenetic studies have suggested that Rubus takesimensis (Rosaceae), which is endemic to Ulleung Island, Korea, is closely related to R. crataegifolius, which is broadly distributed across East Asia. A recent phylogeographic study also suggested the possible polyphyletic origins of R. takesimensis from multiple source populations of its continental progenitor R. crataegifolius in China, Japan, Korea, and the Russian Far East. However, even though the progenitor-derivative relationship between R. crataegifolius and R. takesimensis has been established, little is known about the chloroplast genome (i.e., plastome) evolution of anagenetically derived species on oceanic islands and their continental progenitor species. In the present study, we characterized the complete plastome of R. takesimensis and compared it to those of R. crataegifolius and four other Rubus species. The R. takesimensis plastome was 155,760 base pairs (bp) long, a total of 46 bp longer than the plastome of R. crataegifolius (28 from LSC and 18 from SSC). No structural or content rearrangements were found between the species pairs. Four highly variable intergenic regions (rpl32/trnL, rps4/trnT, trnT/trnL, and psbZ/trnG) were identified between R. takesimensis and R. crataegifolius. Compared to the plastomes of other congeneric species (R. corchorifolius, R. fockeanus, and R. niveus), six highly variable intergenic regions (ndhC/psaC, rps16/trnQ, trnK/rps16, trnL/trnF, trnM/atpE, and trnQ/psbK) were also identified. A total of 116 simple sequence repeats (SSRs), including 48 mononucleotide, 64 dinucleotide, and four trinucleotide repeat motifs were characterized in R. takesimensis. The plastome resources generated by the present study will help to elucidate plastome evolution within the genus and to resolve phylogenetic relationships within highly complex and reticulated lineages. Phylogenetic analysis supported both the monophyly of Rubus and the sister relationship between R. crataegifolius and R. takesimensis. Copyright © 2018. Published by Elsevier B.V.
Functionalization of peptide nucleolipid bioconjugates and their structure anti-cancer activity relationship studies.

PubMed

Rana, Niki; Cultrara, Christopher; Phillips, Mariana; Sabatino, David

2017-09-01

In the search for more potent peptide-based anti-cancer conjugates the generation of new, functionally diverse nucleolipid derived D-(KLAKLAK) 2 -AK sequences has enabled a structure and anti-cancer activity relationship study. A reductive amination approach was key for the synthesis of alkylamine, diamine and polyamine derived nucleolipids as well as those incorporating heterocyclic functionality. The carboxy-derived nucleolipids were then coupled to the C-terminus of the D-(KLAKLAK) 2 -AK killer peptide sequence and produced with and without the FITC fluorophore for investigating biological activity in cancer cells. The amphiphilic, α-helical peptide-nucleolipid bioconjugates were found to exhibit variable effects on the viability of MM.1S cells, with the histamine derived nucleolipid peptide bioconjugate displaying the most significant anti-cancer effects. Thus, functionally diverse nucleolipids have been developed to fine-tune the structure and anti-cancer properties of killer peptide sequences, such as D-(KLAKLAK) 2 -AK. Copyright © 2017 Elsevier Ltd. All rights reserved.
Illustrative case studies in the return of exome and genome sequencing results

PubMed Central

Amendola, Laura M; Lautenbach, Denise; Scollon, Sarah; Bernhardt, Barbara; Biswas, Sawona; East, Kelly; Everett, Jessica; Gilmore, Marian J; Himes, Patricia; Raymond, Victoria M; Wynn, Julia; Hart, Ragan; Jarvik, Gail P

2015-01-01

Whole genome and exome sequencing tests are increasingly being ordered in clinical practice, creating a need for research exploring the return of results from these tests. A goal of the Clinical Sequencing and Exploratory Research (CSER) consortium is to gain experience with this process to develop best practice recommendations for offering exome and genome testing and returning results. Genetic counselors in the CSER consortium have an integral role in the return of results from these genomic sequencing tests and have gained valuable insight. We present seven emerging themes related to return of exome and genome sequencing results accompanied by case descriptions illustrating important lessons learned, counseling challenges specific to these tests and considerations for future research and practice. PMID:26478737
Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution

PubMed Central

Modahl, Cassandra M.; Mackessy, Stephen P.

2016-01-01

Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
Anti-infective activity of apolipoprotein domain derived peptides in vitro: identification of novel antimicrobial peptides related to apolipoprotein B with anti-HIV activity

PubMed Central

2010-01-01

Background Previous reports have shown that peptides derived from the apolipoprotein E receptor binding region and the amphipathic α-helical domains of apolipoprotein AI have broad anti-infective activity and antiviral activity respectively. Lipoproteins and viruses share a similar cell biological niche, being of overlapping size and displaying similar interactions with mammalian cells and receptors, which may have led to other antiviral sequences arising within apolipoproteins, in addition to those previously reported. We therefore designed a series of peptides based around either apolipoprotein receptor binding regions, or amphipathic α-helical domains, and tested these for antiviral and antibacterial activity. Results Of the nineteen new peptides tested, seven showed some anti-infective activity, with two of these being derived from two apolipoproteins not previously used to derive anti-infective sequences. Apolipoprotein J (151-170) - based on a predicted amphipathic alpha-helical domain from apolipoprotein J - had measurable anti-HSV1 activity, as did apolipoprotein B (3359-3367) dp (apoBdp), the latter being derived from the LDL receptor binding domain B of apolipoprotein B. The more active peptide - apoBdp - showed similarity to the previously reported apoE derived anti-infective peptide, and further modification of the apoBdp sequence to align the charge distribution more closely to that of apoEdp or to introduce aromatic residues resulted in increased breadth and potency of activity. The most active peptide of this type showed similar potent anti-HIV activity, comparable to that we previously reported for the apoE derived peptide apoEdpL-W. Conclusions These data suggest that further antimicrobial peptides may be obtained using human apolipoprotein sequences, selecting regions with either amphipathic α-helical structure, or those linked to receptor-binding regions. The finding that an amphipathic α-helical region of apolipoprotein J has antiviral activity comparable with that for the previously reported apolipoprotein AI derived peptide 18A, suggests that full-length apolipoprotein J may also have such activity, as has been reported for full-length apolipoprotein AI. Although the strength of the anti-infective activity of the sequences identified was limited, this could be increased substantially by developing related mutant peptides. Indeed the apolipoprotein B-derived peptide mutants uncovered by the present study may have utility as HIV therapeutics or microbicides. PMID:20298574
In vitro gene expression by cationized derivatives of an artificial protein with repeated RGD sequences, Pronectin.

PubMed

Hosseinkhani, Hossein; Tabata, Yasuhiko

2003-01-09

The objective of this study is to investigate the efficiency of a non-viral gene carrier with RGD sequences, Pronectin F(+) for gene transfection. The Pronectin F(+) was cationized by introducing ethylenediamine (Ed), spermidine (Sd), and spermine (Sm) to the hydroxyl groups while the corresponding gelatin derivative was prepared similarly because gelatin also has one RGD sequence per molecule. The zeta potential and molecular size of Pronectin F(+) and gelatin derivatives were examined before and after polyion complexation with a plasmid DNA of luciferase. When complexed with the plasmid DNA at the Pronectin F(+)/plasmid DNA mixing ratio of 50, the complex exhibited a zeta potential of about 10 mV, which is similar to that of the gelatin derivative-plasmid DNA complex. Irrespective of the type of Pronectin F(+) and gelatin derivatives, their complexation enabled the apparent molecular size of plasmid DNA to reduce to about 200 nm, the size decreasing with the increased derivative/plasmid DNA weight mixing ratio. The rat gastric mucosal (RGM)-1 cells treated with both complexes exhibited significantly stronger luciferase activities than free plasmid DNA although the enhanced extent was significant for the Sm derivative compared with the corresponding Ed and Sd derivatives. Cell attachment was enhanced by the Pronectin F(+) derivative to a significant high extent compared with the gelatin derivative. The amount of plasmid DNA internalized into the cells was enhanced by the complexation with every Pronectin F(+) derivative compared with the gelatin derivative. For both of Pronectin F(+) and gelatin carriers, the buffering capacity of Sm derivatives was higher than that of Ed and Sd derivatives and comparable to that of polyethyleneimine. It is likely that the high efficiency of gene transfection for the Sm derivative is due to the superior buffering effect. We conclude that the Sm derivative of Pronectin F(+) is promising as a non-viral vector of gene transfection.
A high-speed on-chip pseudo-random binary sequence generator for multi-tone phase calibration

NASA Astrophysics Data System (ADS)

Gommé, Liesbeth; Vandersteen, Gerd; Rolain, Yves

2011-07-01

An on-chip reference generator is conceived by adopting the technique of decimating a pseudo-random binary sequence (PRBS) signal in parallel sequences. This is of great benefit when high-speed generation of PRBS and PRBS-derived signals is the objective. The design implemented standard CMOS logic is available in commercial libraries to provide the logic functions for the generator. The design allows the user to select the periodicity of the PRBS and the PRBS-derived signals. The characterization of the on-chip generator marks its performance and reveals promising specifications.
Symmetric convolution of asymmetric multidimensional sequences using discrete trigonometric transforms.

PubMed

Foltz, T M; Welsh, B M

1999-01-01

This paper uses the fact that the discrete Fourier transform diagonalizes a circulant matrix to provide an alternate derivation of the symmetric convolution-multiplication property for discrete trigonometric transforms. Derived in this manner, the symmetric convolution-multiplication property extends easily to multiple dimensions using the notion of block circulant matrices and generalizes to multidimensional asymmetric sequences. The symmetric convolution of multidimensional asymmetric sequences can then be accomplished by taking the product of the trigonometric transforms of the sequences and then applying an inverse trigonometric transform to the result. An example is given of how this theory can be used for applying a two-dimensional (2-D) finite impulse response (FIR) filter with nonlinear phase which models atmospheric turbulence.
Micronuclear DNA of Oxytricha nova contains sequences with autonomously replicating activity in Saccharomyces cerevisiae.

PubMed Central

Colombo, M M; Swanton, M T; Donini, P; Prescott, D M

1984-01-01

Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. Images PMID:6092934
Revising Star and Planet Formation Timescales

NASA Astrophysics Data System (ADS)

Bell, Cameron P. M.; Naylor, Tim; Mayne, N. J.; Jeffries, R. D.; Littlefair, S. P.

2013-07-01

We have derived ages for 13 young (<30 Myr) star-forming regions and find that they are up to a factor of 2 older than the ages typically adopted in the literature. This result has wide-ranging implications, including that circumstellar discs survive longer (≃ 10-12 Myr) and that the average Class I lifetime is greater (≃1 Myr) than currently believed. For each star-forming region, we derived two ages from colour-magnitude diagrams. First, we fitted models of the evolution between the zero-age main sequence and terminal-age main sequence to derive a homogeneous set of main-sequence ages, distances and reddenings with statistically meaningful uncertainties. Our second age for each star-forming region was derived by fitting pre-main-sequence stars to new semi-empirical model isochrones. For the first time (for a set of clusters younger than 50 Myr), we find broad agreement between these two ages, and since these are derived from two distinct mass regimes that rely on different aspects of stellar physics, it gives us confidence in the new age scale. This agreement is largely due to our adoption of empirical colour-Teff relations and bolometric corrections for pre-main-sequence stars cooler than 4000 K. The revised ages for the star-forming regions in our sample are: 2 Myr for NGC 6611 (Eagle Nebula; M 16), IC 5146 (Cocoon Nebula), NGC 6530 (Lagoon Nebula; M 8) and NGC 2244 (Rosette Nebula); 6 Myr for σ Ori, Cep OB3b and IC 348; ≃10 Myr for λ Ori (Collinder 69); ≃11 Myr for NGC 2169; ≃12 Myr for NGC 2362; ≃13 Myr for NGC 7160; ≃14 Myr for χ Per (NGC 884); and ≃20 Myr for NGC 1960 (M 36).
Molecular epidemiology over an 11-year period (2000 to 2010) of extended-spectrum β-lactamase-producing Escherichia coli causing bacteremia in a centralized Canadian region.

PubMed

Peirano, Gisele; van der Bij, Akke K; Gregson, Daniel B; Pitout, Johann D D

2012-02-01

A study was designed to assess the importance of sequence types among extended-spectrum β-lactamase (ESBL)-producing Escherichia coli isolates causing bacteremia over an 11-year period (2000 to 2010) in a centralized Canadian region. A total of 197 patients with incident infections were identified; the majority presented with community-onset urosepsis, with a significant increase in the prevalence of ESBL-producing E. coli during the later part of the study. The majority of E. coli isolates produced either CTX-M-15 or CTX-M-14. We identified 7 different major sequence types among 91% of isolates (i.e., the ST10 clonal complex, ST38, ST131, ST315, ST393, ST405, and ST648) and provided insight into their clinical and molecular characteristics. ST38 was the most antimicrobial-susceptible sequence type and predominated during 2000 to 2004 but disappeared after 2008. ST131 was the most antimicrobial-resistant sequence type, and the influx of a single pulsotype of this sequence type was responsible for the significant increase of ESBL-producing E. coli strains since 2007. During 2010, 49/63 (78%) of the ESBL-producing E. coli isolates belonged to ST131, and this sequence type had established itself as a major drug-resistant pathogen in Calgary, Alberta, Canada, posing an important new public health threat within our region. We urgently need well-designed epidemiological and molecular studies to understand the dynamics of transmission, risk factors, and reservoirs for E. coli ST131. This will provide insight into the emergence and spread of this multiresistant sequence type.
Purification and cDNA cloning of a protein derived from Flammulina velutipes that increases the permeability of the intestinal Caco-2 cell monolayer.

PubMed

Watanabe, H; Narai, A; Shimizu, M

1999-06-01

A new protein that decreases transepithelial electrical resistance (TEER) in the human intestinal Caco-2 cell monolayer was found in a water-soluble fraction of the mushroom Flammulina velutipes. This protein, termed TEER-decreasing protein (TDP), is not cytotoxic and does not induce cell detachment, but rapidly increases the tight junctional permeability for water-soluble marker substances such as Lucifer Yellow CH (Mr 457) through the paracellular pathway. TDP was isolated and purified from the aqueous extract of F. velutipes by chromatographic means. Purified TDP was found to be a simple, nonglycosylated protein without intermolecular disulfide bonds, and the apparent molecular mass as estimated by SDS/PAGE and gel filtration is 30 kDa. It was revealed that the N-terminal amino-acid sequence of purified TDP is identical to the recently reported N-terminal sequence of flammutoxin, a membrane-perturbing hemolytic protein, for which the complete primary structure has not yet been reported [Tomita, T., Ishikawa, D., Noguchi, T., Katayama, E., and Hashimoto, Y. (1998) Biochem. J. 333, 24794-24799]. The cDNA coding for TDP was cloned by 5' and 3' rapid amplification of cDNA ends. The ORF encodes a protein with 272 amino-acid residues showing no homology to known proteins. Relevant studies using TDP cDNA will provide insight into the structure-function relationships of membrane pore-forming toxins.
Defining RNA motif-aminoglycoside interactions via two-dimensional combinatorial screening and structure-activity relationships through sequencing.

PubMed

Velagapudi, Sai Pradeep; Disney, Matthew D

2013-10-15

RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems is largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3×3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure-activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif-aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. Copyright © 2013 Elsevier Ltd. All rights reserved.
Defining RNA motif–aminoglycoside interactions via two-dimensional combinatorial screening and structure–activity relationships through sequencing

PubMed Central

Velagapudi, Sai Pradeep; Disney, Matthew D.

2013-01-01

RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems is largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3 × 3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure–activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif–aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. PMID:23719281
Identification of Differentially Expressed Genes Associated with Apple Fruit Ripening and Softening by Suppression Subtractive Hybridization

PubMed Central

Zhang, Zongying; Jiang, Shenghui; Wang, Nan; Li, Min; Ji, Xiaohao; Sun, Shasha; Liu, Jingxuan; Wang, Deyun; Xu, Haifeng; Qi, Sumin; Wu, Shujing; Fei, Zhangjun; Feng, Shouqian; Chen, Xuesen

2015-01-01

Apple is one of the most economically important horticultural fruit crops worldwide. It is critical to gain insights into fruit ripening and softening to improve apple fruit quality and extend shelf life. In this study, forward and reverse suppression subtractive hybridization libraries were generated from ‘Taishanzaoxia’ apple fruits sampled around the ethylene climacteric to isolate ripening- and softening-related genes. A set of 648 unigenes were derived from sequence alignment and cluster assembly of 918 expressed sequence tags. According to gene ontology functional classification, 390 out of 443 unigenes (88%) were assigned to the biological process category, 356 unigenes (80%) were classified in the molecular function category, and 381 unigenes (86%) were allocated to the cellular component category. A total of 26 unigenes differentially expressed during fruit development period were analyzed by quantitative RT-PCR. These genes were involved in cell wall modification, anthocyanin biosynthesis, aroma production, stress response, metabolism, transcription, or were non-annotated. Some genes associated with cell wall modification, anthocyanin biosynthesis and aroma production were up-regulated and significantly correlated with ethylene production, suggesting that fruit texture, coloration and aroma may be regulated by ethylene in ‘Taishanzaoxia’. Some of the identified unigenes associated with fruit ripening and softening have not been characterized in public databases. The results contribute to an improved characterization of changes in gene expression during apple fruit ripening and softening. PMID:26719904
Plant Omics Data Center: An Integrated Web Repository for Interspecies Gene Expression Networks with NLP-Based Curation

PubMed Central

Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro

2015-01-01

Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. PMID:25505034
Handling the influence of chemical shift in amplitude-modulated heteronuclear dipolar recoupling solid-state NMR

DOE Office of Scientific and Technical Information (OSTI.GOV)

Basse, Kristoffer; Shankar, Ravi; Bjerring, Morten

We present a theoretical analysis of the influence of chemical shifts on amplitude-modulated heteronuclear dipolar recoupling experiments in solid-state NMR spectroscopy. The method is demonstrated using the Rotor Echo Short Pulse IRrAdiaTION mediated Cross-Polarization ({sup RESPIRATION}CP) experiment as an example. By going into the pulse sequence rf interaction frame and employing a quintuple-mode operator-based Floquet approach, we describe how chemical shift offset and anisotropic chemical shift affect the efficiency of heteronuclear polarization transfer. In this description, it becomes transparent that the main attribute leading to non-ideal performance is a fictitious field along the rf field axis, which is generated frommore » second-order cross terms arising mainly between chemical shift tensors and themselves. This insight is useful for the development of improved recoupling experiments. We discuss the validity of this approach and present quaternion calculations to determine the effective resonance conditions in a combined rf field and chemical shift offset interaction frame transformation. Based on this, we derive a broad-banded version of the {sup RESPIRATION}CP experiment. The new sequence is experimentally verified using SNNFGAILSS amyloid fibrils where simultaneous {sup 15}N → {sup 13}CO and {sup 15}N → {sup 13}C{sub α} coherence transfer is demonstrated on high-field NMR instrumentation, requiring great offset stability.« less
Mitogen-Activated Protein Kinase Signaling in Plant-Interacting Fungi: Distinct Messages from Conserved Messengers[W

PubMed Central

Hamel, Louis-Philippe; Nicole, Marie-Claude; Duplessis, Sébastien; Ellis, Brian E.

2012-01-01

Mitogen-activated protein kinases (MAPKs) are evolutionarily conserved proteins that function as key signal transduction components in fungi, plants, and mammals. During interaction between phytopathogenic fungi and plants, fungal MAPKs help to promote mechanical and/or enzymatic penetration of host tissues, while plant MAPKs are required for activation of plant immunity. However, new insights suggest that MAPK cascades in both organisms do not operate independently but that they mutually contribute to a highly interconnected molecular dialogue between the plant and the fungus. As a result, some pathogenesis-related processes controlled by fungal MAPKs lead to the activation of plant signaling, including the recruitment of plant MAPK cascades. Conversely, plant MAPKs promote defense mechanisms that threaten the survival of fungal cells, leading to a stress response mediated in part by fungal MAPK cascades. In this review, we make use of the genomic data available following completion of whole-genome sequencing projects to analyze the structure of MAPK protein families in 24 fungal taxa, including both plant pathogens and mycorrhizal symbionts. Based on conserved patterns of sequence diversification, we also propose the adoption of a unified fungal MAPK nomenclature derived from that established for the model species Saccharomyces cerevisiae. Finally, we summarize current knowledge of the functions of MAPK cascades in phytopathogenic fungi and highlight the central role played by MAPK signaling during the molecular dialogue between plants and invading fungal pathogens. PMID:22517321
Proteomics and Deep Sequencing Comparison of Seasonally Active Venom Glands in the Platypus Reveals Novel Venom Peptides and Distinct Expression Profiles*

PubMed Central

Wong, Emily S. W.; Morgenstern, David; Mofiz, Ehtesham; Gombert, Sara; Morris, Katrina M.; Temple-Smith, Peter; Renfree, Marilyn B.; Whittington, Camilla M.; King, Glenn F.; Warren, Wesley C.; Papenfuss, Anthony T.; Belov, Katherine

2012-01-01

The platypus is a venomous monotreme. Male platypuses possess a spur on their hind legs that is connected to glands in the pelvic region. They produce venom only during the breeding season, presumably to fight off conspecifics. We have taken advantage of this unique seasonal production of venom to compare the transcriptomes of in- and out-of-season venom glands, in conjunction with proteomic analysis, to identify previously undiscovered venom genes. Comparison of the venom glands revealed distinct gene expression profiles that are consistent with changes in venom gland morphology and venom volumes in and out of the breeding season. Venom proteins were identified through shot-gun sequenced venom proteomes of three animals using RNA-seq-derived transcripts for peptide-spectral matching. 5,157 genes were expressed in the venom glands, 1,821 genes were up-regulated in the in-season gland, and 10 proteins were identified in the venom. New classes of platypus-venom proteins identified included antimicrobials, amide oxidase, serpin protease inhibitor, proteins associated with the mammalian stress response pathway, cytokines, and other immune molecules. Five putative toxins have only been identified in platypus venom: growth differentiation factor 15, nucleobindin-2, CD55, a CXC-chemokine, and corticotropin-releasing factor-binding protein. These novel venom proteins have potential biomedical and therapeutic applications and provide insights into venom evolution. PMID:22899769
Proteomics and deep sequencing comparison of seasonally active venom glands in the platypus reveals novel venom peptides and distinct expression profiles.

PubMed

Wong, Emily S W; Morgenstern, David; Mofiz, Ehtesham; Gombert, Sara; Morris, Katrina M; Temple-Smith, Peter; Renfree, Marilyn B; Whittington, Camilla M; King, Glenn F; Warren, Wesley C; Papenfuss, Anthony T; Belov, Katherine

2012-11-01

The platypus is a venomous monotreme. Male platypuses possess a spur on their hind legs that is connected to glands in the pelvic region. They produce venom only during the breeding season, presumably to fight off conspecifics. We have taken advantage of this unique seasonal production of venom to compare the transcriptomes of in- and out-of-season venom glands, in conjunction with proteomic analysis, to identify previously undiscovered venom genes. Comparison of the venom glands revealed distinct gene expression profiles that are consistent with changes in venom gland morphology and venom volumes in and out of the breeding season. Venom proteins were identified through shot-gun sequenced venom proteomes of three animals using RNA-seq-derived transcripts for peptide-spectral matching. 5,157 genes were expressed in the venom glands, 1,821 genes were up-regulated in the in-season gland, and 10 proteins were identified in the venom. New classes of platypus-venom proteins identified included antimicrobials, amide oxidase, serpin protease inhibitor, proteins associated with the mammalian stress response pathway, cytokines, and other immune molecules. Five putative toxins have only been identified in platypus venom: growth differentiation factor 15, nucleobindin-2, CD55, a CXC-chemokine, and corticotropin-releasing factor-binding protein. These novel venom proteins have potential biomedical and therapeutic applications and provide insights into venom evolution.

SL2-like spliced leader RNAs in the basal nematode Prionchulus punctatus: New insight into the evolution of nematode SL2 RNAs.

PubMed

Harrison, Neale; Kalbfleisch, Andreas; Connolly, Bernadette; Pettitt, Jonathan; Müller, Berndt

2010-08-01

Spliced-leader (SL) trans-splicing has been found in all molecularly characterized nematode species to date, and it is likely to be a nematode synapomorphy. Most information regarding SL trans-splicing has come from the study of nematodes from a single monophyletic group, the Rhabditida, all of which employ SL RNAs that are identical to, or variants of, the SL1 RNA first characterized in Caenorhabditis elegans. In contrast, the more distantly related Trichinella spiralis, belonging to the subclass Dorylaimia, utilizes a distinct set of SL RNAs that display considerable sequence diversity. To investigate whether this is true of other members of the Dorylaimia, we have characterized SL RNAs from Prionchulus punctatus. Surprisingly, this revealed the presence of a set of SLs that show clear sequence similarity to the SL2 family of spliced leaders, which have previously only been found within the rhabditine group (which includes C. elegans). Expression of one of the P. punctatus SL RNAs in C. elegans reveals that it can compete specifically with the endogenous C. elegans SL2 spliced leaders, being spliced to the pre-mRNAs derived from downstream genes in operons, but does not compete with the SL1 spliced leaders. This discovery raises the possibility that SL2-like spliced leaders were present in the last common ancestor of the nematode phylum.
Insights into the Biosynthesis of the Benzoquinone Ansamycins Geldanamycin and Herbimycin, Obtained by Gene Sequencing and Disruption†

PubMed Central

Rascher, Andreas; Hu, Zhihao; Buchanan, Greg O.; Reid, Ralph; Hutchinson, C. Richard

2005-01-01

Geldanamycin and the closely related herbimycins A, B, and C were the first benzoquinone ansamycins to be extensively studied for their antitumor properties as small-molecule inhibitors of the Hsp90 protein chaperone complex. These compounds are produced by two different Streptomyces hygroscopicus strains and have the same modular polyketide synthase (PKS)-derived carbon skeleton but different substitution patterns at C-11, C-15, and C-17. To set the stage for structural modification by genetic engineering, we previously identified the gene cluster responsible for geldanamycin biosynthesis. We have now cloned and sequenced a 115-kb segment of the herbimycin biosynthetic gene cluster from S. hygroscopicus AM 3672, including the genes for the PKS and most of the post-PKS tailoring enzymes. The similarities and differences between the gene clusters and biosynthetic pathways for these closely related ansamycins are interpreted with support from the results of gene inactivation experiments. In addition, the organization and functions of genes involved in the biosynthesis of the 3-amino-5-hydroxybenzoic acid (AHBA) starter unit and the post-PKS modifications of progeldanamycin were assessed by inactivating the subclusters of AHBA biosynthetic genes and two oxygenase genes (gdmM and gdmL) that were proposed to be involved in formation of the geldanamycin benzoquinoid system. A resulting novel geldanamycin analog, KOS-1806, was isolated and characterized. PMID:16085885
VIT-CMJ2: Endophyte of Agaricus bisporus in Production of Bioactive Compounds.

PubMed

Gautam, Chandan Kumar; Madhav, Mukund; Sinha, Astha; Jabez Osborne, William

2016-06-01

Agaricus bisporus is an edible basidiomycete fungus. Both the body and the mycelium contain compounds comprising a wide range of antimicrobial molecules, contributing in improvement of immunity and tumor-retardation. The presence of endophytes capable of producing bioactive compounds was investigated in Agaricus bisporus . Endophytes from Agaricus bisporus was isolated on LB agar. The obtained isolates were characterized morphologically and biochemically. Further 16S rRNA sequencing was implemented for molecular analysis of isolates. The isolate was mass produced and the bioactive compounds were extracted using ethyl acetate, chloroform and hexane. Agar well diffusion method was carried out to seek the potential of any antimicrobial activity of the crude bioactive compounds against known pathogens. GC-MS and FT-IR analysis were performed for the identification of bioactive compounds. VIT-CMJ2 was identified as Enterobacter sp. as revealed by 16S rRNA sequencing. Chloroform extract of VIT-CMJ2 showed a maximum zone of inhibition of 19 mm against Salmonella typhi followed by hexane and ethyl acetate extracts. The GC-MS analysis revealed the presence of several bioactive compounds having effective antimicrobial activity like butyl ester, Behenicalcohol, S , S-dioxide derivatives and some others which were later confirmed by FT-IR spectral stretches. The present study shows the insight on the way endophytes interact with Agaricus bisporus ; thereby improving the nutritional profile.
VIT-CMJ2: Endophyte of Agaricus bisporus in Production of Bioactive Compounds

PubMed Central

Gautam, Chandan Kumar; Madhav, Mukund; Sinha, Astha; Jabez Osborne, William

2016-01-01

Background Agaricus bisporus is an edible basidiomycete fungus. Both the body and the mycelium contain compounds comprising a wide range of antimicrobial molecules, contributing in improvement of immunity and tumor-retardation. Objectives The presence of endophytes capable of producing bioactive compounds was investigated in Agaricus bisporus. Materials and Methods Endophytes from Agaricus bisporus was isolated on LB agar. The obtained isolates were characterized morphologically and biochemically. Further 16S rRNA sequencing was implemented for molecular analysis of isolates. The isolate was mass produced and the bioactive compounds were extracted using ethyl acetate, chloroform and hexane. Agar well diffusion method was carried out to seek the potential of any antimicrobial activity of the crude bioactive compounds against known pathogens. GC-MS and FT-IR analysis were performed for the identification of bioactive compounds. Results VIT-CMJ2 was identified as Enterobacter sp. as revealed by 16S rRNA sequencing. Chloroform extract of VIT-CMJ2 showed a maximum zone of inhibition of 19 mm against Salmonella typhi followed by hexane and ethyl acetate extracts. The GC-MS analysis revealed the presence of several bioactive compounds having effective antimicrobial activity like butyl ester, Behenicalcohol, S , S-dioxide derivatives and some others which were later confirmed by FT-IR spectral stretches. Conclusions The present study shows the insight on the way endophytes interact with Agaricus bisporus; thereby improving the nutritional profile. PMID:28959322
The determination of high-resolution spatio-temporal glacier motion fields from time-lapse sequences

NASA Astrophysics Data System (ADS)

Schwalbe, Ellen; Maas, Hans-Gerd

2017-12-01

This paper presents a comprehensive method for the determination of glacier surface motion vector fields at high spatial and temporal resolution. These vector fields can be derived from monocular terrestrial camera image sequences and are a valuable data source for glaciological analysis of the motion behaviour of glaciers. The measurement concepts for the acquisition of image sequences are presented, and an automated monoscopic image sequence processing chain is developed. Motion vector fields can be derived with high precision by applying automatic subpixel-accuracy image matching techniques on grey value patterns in the image sequences. Well-established matching techniques have been adapted to the special characteristics of the glacier data in order to achieve high reliability in automatic image sequence processing, including the handling of moving shadows as well as motion effects induced by small instabilities in the camera set-up. Suitable geo-referencing techniques were developed to transform image measurements into a reference coordinate system.The result of monoscopic image sequence analysis is a dense raster of glacier surface point trajectories for each image sequence. Each translation vector component in these trajectories can be determined with an accuracy of a few centimetres for points at a distance of several kilometres from the camera. Extensive practical validation experiments have shown that motion vector and trajectory fields derived from monocular image sequences can be used for the determination of high-resolution velocity fields of glaciers, including the analysis of tidal effects on glacier movement, the investigation of a glacier's motion behaviour during calving events, the determination of the position and migration of the grounding line and the detection of subglacial channels during glacier lake outburst floods.
A better sequence-read simulator program for metagenomics.

PubMed

Johnson, Stephen; Trost, Brett; Long, Jeffrey R; Pittet, Vanessa; Kusalik, Anthony

2014-01-01

There are many programs available for generating simulated whole-genome shotgun sequence reads. The data generated by many of these programs follow predefined models, which limits their use to the authors' original intentions. For example, many models assume that read lengths follow a uniform or normal distribution. Other programs generate models from actual sequencing data, but are limited to reads from single-genome studies. To our knowledge, there are no programs that allow a user to generate simulated data following non-parametric read-length distributions and quality profiles based on empirically-derived information from metagenomics sequencing data. We present BEAR (Better Emulation for Artificial Reads), a program that uses a machine-learning approach to generate reads with lengths and quality values that closely match empirically-derived distributions. BEAR can emulate reads from various sequencing platforms, including Illumina, 454, and Ion Torrent. BEAR requires minimal user input, as it automatically determines appropriate parameter settings from user-supplied data. BEAR also uses a unique method for deriving run-specific error rates, and extracts useful statistics from the metagenomic data itself, such as quality-error models. Many existing simulators are specific to a particular sequencing technology; however, BEAR is not restricted in this way. Because of its flexibility, BEAR is particularly useful for emulating the behaviour of technologies like Ion Torrent, for which no dedicated sequencing simulators are currently available. BEAR is also the first metagenomic sequencing simulator program that automates the process of generating abundances, which can be an arduous task. BEAR is useful for evaluating data processing tools in genomics. It has many advantages over existing comparable software, such as generating more realistic reads and being independent of sequencing technology, and has features particularly useful for metagenomics work.
Characterization of circulating transfer RNA-derived RNA fragments in cattle

PubMed Central

Casas, Eduardo; Cai, Guohong; Neill, John D.

2015-01-01

The objective was to characterize naturally occurring circulating transfer RNA-derived RNA fragments (tRFs) in cattle1. Serum from eight clinically normal adult dairy cows was collected, and small non-coding RNAs were extracted immediately after collection and sequenced by Illumina MiSeq. Sequences aligned to transfer RNA (tRNA) genes or their flanking sequences were characterized. Sequences aligned to the beginning of 5′ end of the mature tRNA were classified as tRF5; those aligned to the 3′ end of mature tRNA were classified as tRF3; and those aligned to the beginning of the 3′ end flanking sequences were classified as tRF1. There were 3,190,962 sequences that mapped to transfer RNA and small non-coding RNAs in the bovine genome. Of these, 2,323,520 were identified as tRF5s, 562 were tRF3s, and 81 were tRF1s. There were 866,799 sequences identified as other small non-coding RNAs (microRNA, rRNA, snoRNA, etc.) and were excluded from the study. The tRF5s ranged from 28 to 40 nucleotides; and 98.7% ranged from 30 to 34 nucleotides in length. The tRFs with the greatest number of sequences were derived from tRNA of histidine, glutamic acid, lysine, glycine, and valine. There was no association between number of codons for each amino acid and number of tRFs in the samples. The reason for tRF5s being the most abundant can only be explained if these sequences are associated with function within the animal. PMID:26379699
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Genome Sequencing and Assembly by Long Reads in Plants

PubMed Central

Li, Changsheng; Lin, Feng; An, Dong; Huang, Ruidong

2017-01-01

Plant genomes generated by Sanger and Next Generation Sequencing (NGS) have provided insight into species diversity and evolution. However, Sanger sequencing is limited in its applications due to high cost, labor intensity, and low throughput, while NGS reads are too short to resolve abundant repeats and polyploidy, leading to incomplete or ambiguous assemblies. The advent and improvement of long-read sequencing by Third Generation Sequencing (TGS) methods such as PacBio and Nanopore have shown promise in producing high-quality assemblies for complex genomes. Here, we review the development of sequencing, introducing the application as well as considerations of experimental design in TGS of plant genomes. We also introduce recent revolutionary scaffolding technologies including BioNano, Hi-C, and 10× Genomics. We expect that the informative guidance for genome sequencing and assembly by long reads will benefit the initiation of scientists’ projects. PMID:29283420
The first genetic map of the American cranberry: exploration of synteny conservation and quantitative trait loci.

PubMed

Georgi, Laura; Johnson-Cicalese, Jennifer; Honig, Josh; Das, Sushma Parankush; Rajah, Veeran D; Bhattacharya, Debashish; Bassil, Nahla; Rowland, Lisa J; Polashock, James; Vorsa, Nicholi

2013-03-01

The first genetic map of cranberry (Vaccinium macrocarpon) has been constructed, comprising 14 linkage groups totaling 879.9 cM with an estimated coverage of 82.2 %. This map, based on four mapping populations segregating for field fruit-rot resistance, contains 136 distinct loci. Mapped markers include blueberry-derived simple sequence repeat (SSR) and cranberry-derived sequence-characterized amplified region markers previously used for fingerprinting cranberry cultivars. In addition, SSR markers were developed near cranberry sequences resembling genes involved in flavonoid biosynthesis or defense against necrotrophic pathogens, or conserved orthologous set (COS) sequences. The cranberry SSRs were developed from next-generation cranberry genomic sequence assemblies; thus, the positions of these SSRs on the genomic map provide information about the genomic location of the sequence scaffold from which they were derived. The use of SSR markers near COS and other functional sequences, plus 33 SSR markers from blueberry, facilitates comparisons of this map with maps of other plant species. Regions of the cranberry map were identified that showed conservation of synteny with Vitis vinifera and Arabidopsis thaliana. Positioned on this map are quantitative trait loci (QTL) for field fruit-rot resistance (FFRR), fruit weight, titratable acidity, and sound fruit yield (SFY). The SFY QTL is adjacent to one of the fruit weight QTL and may reflect pleiotropy. Two of the FFRR QTL are in regions of conserved synteny with grape and span defense gene markers, and the third FFRR QTL spans a flavonoid biosynthetic gene.
Sequential de novo centromere formation and inactivation on a chromosomal fragment in maize.

PubMed

Liu, Yalin; Su, Handong; Pang, Junling; Gao, Zhi; Wang, Xiu-Jie; Birchler, James A; Han, Fangpu

2015-03-17

The ability of centromeres to alternate between active and inactive states indicates significant epigenetic aspects controlling centromere assembly and function. In maize (Zea mays), misdivision of the B chromosome centromere on a translocation with the short arm of chromosome 9 (TB-9Sb) can produce many variants with varying centromere sizes and centromeric DNA sequences. In such derivatives of TB-9Sb, we found a de novo centromere on chromosome derivative 3-3, which has no canonical centromeric repeat sequences. This centromere is derived from a 288-kb region on the short arm of chromosome 9, and is 19 megabases (Mb) removed from the translocation breakpoint of chromosome 9 in TB-9Sb. The functional B centromere in progenitor telo2-2 is deleted from derivative 3-3, but some B-repeat sequences remain. The de novo centromere of derivative 3-3 becomes inactive in three further derivatives with new centromeres being formed elsewhere on each chromosome. Our results suggest that de novo centromere initiation is quite common and can persist on chromosomal fragments without a canonical centromere. However, we hypothesize that when de novo centromeres are initiated in opposition to a larger normal centromere, they are cleared from the chromosome by inactivation, thus maintaining karyotype integrity.
Sequential de novo centromere formation and inactivation on a chromosomal fragment in maize

PubMed Central

Liu, Yalin; Su, Handong; Pang, Junling; Gao, Zhi; Wang, Xiu-Jie; Birchler, James A.; Han, Fangpu

2015-01-01

The ability of centromeres to alternate between active and inactive states indicates significant epigenetic aspects controlling centromere assembly and function. In maize (Zea mays), misdivision of the B chromosome centromere on a translocation with the short arm of chromosome 9 (TB-9Sb) can produce many variants with varying centromere sizes and centromeric DNA sequences. In such derivatives of TB-9Sb, we found a de novo centromere on chromosome derivative 3-3, which has no canonical centromeric repeat sequences. This centromere is derived from a 288-kb region on the short arm of chromosome 9, and is 19 megabases (Mb) removed from the translocation breakpoint of chromosome 9 in TB-9Sb. The functional B centromere in progenitor telo2-2 is deleted from derivative 3-3, but some B-repeat sequences remain. The de novo centromere of derivative 3-3 becomes inactive in three further derivatives with new centromeres being formed elsewhere on each chromosome. Our results suggest that de novo centromere initiation is quite common and can persist on chromosomal fragments without a canonical centromere. However, we hypothesize that when de novo centromeres are initiated in opposition to a larger normal centromere, they are cleared from the chromosome by inactivation, thus maintaining karyotype integrity. PMID:25733907
Deriving high-resolution protein backbone structure propensities from all crystal data using the information maximization device.

PubMed

Solis, Armando D

2014-01-01

The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download.
A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

PubMed

Torrent, C; Gabus, C; Darlix, J L

1994-02-01

Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.
Genome sequencing and comparative genomics of honey bee microsporidia, Nosema apis reveal novel insights into host-parasite interactions.

PubMed

Chen, Yan ping; Pettis, Jeffery S; Zhao, Yan; Liu, Xinyue; Tallon, Luke J; Sadzewicz, Lisa D; Li, Renhua; Zheng, Huoqing; Huang, Shaokang; Zhang, Xuan; Hamilton, Michele C; Pernal, Stephen F; Melathopoulos, Andony P; Yan, Xianghe; Evans, Jay D

2013-07-05

The microsporidia parasite Nosema contributes to the steep global decline of honey bees that are critical pollinators of food crops. There are two species of Nosema that have been found to infect honey bees, Nosema apis and N. ceranae. Genome sequencing of N. apis and comparative genome analysis with N. ceranae, a fully sequenced microsporidia species, reveal novel insights into host-parasite interactions underlying the parasite infections. We applied the whole-genome shotgun sequencing approach to sequence and assemble the genome of N. apis which has an estimated size of 8.5 Mbp. We predicted 2,771 protein- coding genes and predicted the function of each putative protein using the Gene Ontology. The comparative genomic analysis led to identification of 1,356 orthologs that are conserved between the two Nosema species and genes that are unique characteristics of the individual species, thereby providing a list of virulence factors and new genetic tools for studying host-parasite interactions. We also identified a highly abundant motif in the upstream promoter regions of N. apis genes. This motif is also conserved in N. ceranae and other microsporidia species and likely plays a role in gene regulation across the microsporidia. The availability of the N. apis genome sequence is a significant addition to the rapidly expanding body of microsprodian genomic data which has been improving our understanding of eukaryotic genome diversity and evolution in a broad sense. The predicted virulent genes and transcriptional regulatory elements are potential targets for innovative therapeutics to break down the life cycle of the parasite.
Draft Genome Sequence of a Tetrabromobisphenol A–Degrading Strain, Ochrobactrum sp. T, Isolated from an Electronic Waste Recycling Site

PubMed Central

Liang, Zhishu; Li, Guiying; Zhang, Guoxia; Das, Ranjit

2016-01-01

Ochrobactrum sp. T was previously isolated from a sludge sample collected from an electronic waste recycling site and characterized as a unique tetrabromobisphenol A (TBBPA)–degrading bacterium. Here, the draft genome sequence (3.9 Mb) of Ochrobactrum sp. T is reported to provide insights into its diversity and its TBBPA biodegradation mechanism in polluted environments. PMID:27445374
Genome Sequence of the Mucoromycotina Fungus Umbelopsis isabellina, an Effective Producer of Lipids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Takeda, Itaru; Tamano, Koichi; Yamane, Noriko

2014-02-27

Umbelopsis isabellina is a fungus in the subdivision Mucoromycotina, many members of which have been shown to be oleaginous and have become important organisms for producing oil because of their high level of intracellular lipid accumulation from various feedstocks. The genome sequence of U. isabellina NBRC 7884 was determined and annotated, and this information might provide insights into the oleaginous properties of this fungus.
High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing

DTIC Science & Technology

2010-10-14

High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and Massively Parallel Sequencing...Venezuelan equine encephalitis virus (VEEV) genome. We initially used a capillary electrophoresis method to gain insight into the role of the VEEV...Smith JM, Schmaljohn CS (2010) High-Resolution Functional Mapping of the Venezuelan Equine Encephalitis Virus Genome by Insertional Mutagenesis and
Genomic insights into the taxonomic status of the Bacillus cereus group

PubMed Central

Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

2015-01-01

The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441
Strategies for high-altitude adaptation revealed from high-quality draft genome of non-violacein producing Janthinobacterium lividum ERGS5:01.

PubMed

Kumar, Rakshak; Acharya, Vishal; Singh, Dharam; Kumar, Sanjay

2018-01-01

A light pink coloured bacterial strain ERGS5:01 isolated from glacial stream water of Sikkim Himalaya was affiliated to Janthinobacterium lividum based on 16S rRNA gene sequence identity and phylogenetic clustering. Whole genome sequencing was performed for the strain to confirm its taxonomy as it lacked the typical violet pigmentation of the genus and also to decipher its survival strategy at the aquatic ecosystem of high elevation. The PacBio RSII sequencing generated genome of 5,168,928 bp with 4575 protein-coding genes and 118 RNA genes. Whole genome-based multilocus sequence analysis clustering, in silico DDH similarity value of 95.1% and, the ANI value of 99.25% established the identity of the strain ERGS5:01 (MCC 2953) as a non-violacein producing J. lividum . The genome comparisons across genus Janthinobacterium revealed an open pan-genome with the scope of the addition of new orthologous cluster to complete the genomic inventory. The genomic insight provided the genetic basis of freezing and frequent freeze-thaw cycle tolerance and, for industrially important enzymes. Extended insight into the genome provided clues of crucial genes associated with adaptation in the harsh aquatic ecosystem of high altitude.

Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

PubMed

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-06-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies

PubMed Central

Santiago-Rodriguez, Tasha M.; Luciani, Stefania; Toranzos, Gary A.; Marota, Isolina; Giuffra, Valentina; Cano, Raul J.

2017-01-01

Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypotheses that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights of ancient dietary habits, and the preserved putative human gut resistome prior the antibiotic therapy era. PMID:29112136
Gut Microbiome and Putative Resistome of Inca and Italian Nobility Mummies.

PubMed

Santiago-Rodriguez, Tasha M; Fornaciari, Gino; Luciani, Stefania; Toranzos, Gary A; Marota, Isolina; Giuffra, Valentina; Cano, Raul J

2017-11-07

Little is still known about the microbiome resulting from the process of mummification of the human gut. In the present study, the gut microbiota, genes associated with metabolism, and putative resistome of Inca and Italian nobility mummies were characterized by using high-throughput sequencing. The Italian nobility mummies exhibited a higher bacterial diversity as compared to the Inca mummies when using 16S ribosomal (rRNA) gene amplicon sequencing, but both groups showed bacterial and fungal taxa when using shotgun metagenomic sequencing that may resemble both the thanatomicrobiome and extant human gut microbiomes. Identification of sequences associated with plants, animals, and carbohydrate-active enzymes (CAZymes) may provide further insights into the dietary habits of Inca and Italian nobility mummies. Putative antibiotic-resistance genes in the Inca and Italian nobility mummies support a human gut resistome prior to the antibiotic therapy era. The higher proportion of putative antibiotic-resistance genes in the Inca compared to Italian nobility mummies may support the hypotheses that a greater exposure to the environment may result in a greater acquisition of antibiotic-resistance genes. The present study adds knowledge of the microbiome resulting from the process of mummification of the human gut, insights of ancient dietary habits, and the preserved putative human gut resistome prior the antibiotic therapy era.
Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome.

PubMed

de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

2013-07-01

The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees.
Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome

PubMed Central

de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

2013-01-01

The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees. PMID:23885214
Structural Analysis of Biodiversity

PubMed Central

Sirovich, Lawrence; Stoeckle, Mark Y.; Zhang, Yu

2010-01-01

Large, recently-available genomic databases cover a wide range of life forms, suggesting opportunity for insights into genetic structure of biodiversity. In this study we refine our recently-described technique using indicator vectors to analyze and visualize nucleotide sequences. The indicator vector approach generates correlation matrices, dubbed Klee diagrams, which represent a novel way of assembling and viewing large genomic datasets. To explore its potential utility, here we apply the improved algorithm to a collection of almost 17000 DNA barcode sequences covering 12 widely-separated animal taxa, demonstrating that indicator vectors for classification gave correct assignment in all 11000 test cases. Indicator vector analysis revealed discontinuities corresponding to species- and higher-level taxonomic divisions, suggesting an efficient approach to classification of organisms from poorly-studied groups. As compared to standard distance metrics, indicator vectors preserve diagnostic character probabilities, enable automated classification of test sequences, and generate high-information density single-page displays. These results support application of indicator vectors for comparative analysis of large nucleotide data sets and raise prospect of gaining insight into broad-scale patterns in the genetic structure of biodiversity. PMID:20195371
Cluster-Based Multipolling Sequencing Algorithm for Collecting RFID Data in Wireless LANs

NASA Astrophysics Data System (ADS)

Choi, Woo-Yong; Chatterjee, Mainak

2015-03-01

With the growing use of RFID (Radio Frequency Identification), it is becoming important to devise ways to read RFID tags in real time. Access points (APs) of IEEE 802.11-based wireless Local Area Networks (LANs) are being integrated with RFID networks that can efficiently collect real-time RFID data. Several schemes, such as multipolling methods based on the dynamic search algorithm and random sequencing, have been proposed. However, as the number of RFID readers associated with an AP increases, it becomes difficult for the dynamic search algorithm to derive the multipolling sequence in real time. Though multipolling methods can eliminate the polling overhead, we still need to enhance the performance of the multipolling methods based on random sequencing. To that extent, we propose a real-time cluster-based multipolling sequencing algorithm that drastically eliminates more than 90% of the polling overhead, particularly so when the dynamic search algorithm fails to derive the multipolling sequence in real time.
Paleovirology of bornaviruses: What can be learned from molecular fossils of bornaviruses.

PubMed

Horie, Masayuki; Tomonaga, Keizo

2018-04-06

Endogenous viral elements (EVEs) are virus-derived sequences embedded in eukaryotic genomes formed by germline integration of viral sequences. As many EVEs were integrated into eukaryotic genomes millions of years ago, EVEs are considered molecular fossils of viruses. EVEs can be valuable informational sources about ancient viruses, including their time scale, geographical distribution, genetic information, and hosts. Although integration of viral sequences is not required for replications of viruses other than retroviruses, many non-retroviral EVEs have been reported to exist in eukaryotes. Investigation of these EVEs has expanded our knowledge regarding virus-host interactions, as well as provided information on ancient viruses. Among them, EVEs derived from bornaviruses, non-retroviral RNA viruses, have been relatively well studied. Bornavirus-derived EVEs are widely distributed in animal genomes, including the human genome, and the history of bornaviruses can be dated back to more than 65 million years. Although there are several reports focusing on the biological significance of bornavirus-derived sequences in mammals, paleovirology of bornaviruses has not yet been well described and summarized. In this paper, we describe what can be learned about bornaviruses from endogenous bornavirus-like elements from the view of paleovirology using published results and our novel data. Copyright © 2018 Elsevier B.V. All rights reserved.
New fundamental parameters for attitude representation

NASA Astrophysics Data System (ADS)

Patera, Russell P.

2017-08-01

A new attitude parameter set is developed to clarify the geometry of combining finite rotations in a rotational sequence and in combining infinitesimal angular increments generated by angular rate. The resulting parameter set of six Pivot Parameters represents a rotation as a great circle arc on a unit sphere that can be located at any clocking location in the rotation plane. Two rotations are combined by linking their arcs at either of the two intersection points of the respective rotation planes. In a similar fashion, linking rotational increments produced by angular rate is used to derive the associated kinematical equations, which are linear and have no singularities. Included in this paper is the derivation of twelve Pivot Parameter elements that represent all twelve Euler Angle sequences, which enables efficient conversions between Pivot Parameters and any Euler Angle sequence. Applications of this new parameter set include the derivation of quaternions and the quaternion composition rule, as well as, the derivation of the analytical solution to time dependent coning motion. The relationships between Pivot Parameters and traditional parameter sets are included in this work. Pivot Parameters are well suited for a variety of aerospace applications due to their effective composition rule, singularity free kinematic equations, efficient conversion to and from Euler Angle sequences and clarity of their geometrical foundation.
Guidance to rational use of pharmaceuticals in gallbladder sarcomatoid carcinoma using patient-derived cancer cells and whole exome sequencing.

PubMed

Feng, Feiling; Cheng, Qingbao; Yang, Liang; Zhang, Dadong; Ji, Shunlong; Zhang, Qiangzu; Lin, Yihui; Li, Fugen; Xiong, Lei; Liu, Chen; Jiang, Xiaoqing

2017-01-17

Gallbladder sarcomatoid carcinoma is a rare cancer with no clinical standard treatment. With the rapid development of next generation sequencing, it has been able to provide reasonable treatment options for patients based on genetic variations. However, most cancer drugs are not approval for gallbladder sarcomatoid carcinoma indications. The correlation between drug response and a genetic variation needs to be further elucidated. Three patient-derived cells-JXQ-3D-001, JXQ-3D-002, and JXQ-3D-003, were derived from biopsy samples of one gallbladder sarcomatoid carcinoma patient with progression and have been characterized. In order to study the relationship between drug sensitivity and gene alteration, genetic mutations of three patient-derived cells were discovered by whole exome sequencing, and drug screening has been performed based on the gene alterations and related signaling pathways that are associated with drug targets. It has been found that there are differences in biological characteristics such as morphology, cell proliferation, cell migration and colony formation activity among these three patient-derived cells although they are derived from the same patient. Their sensitivities to the chemotherapy drugs-Fluorouracil, Doxorubicin, and Cisplatin are distinct. Moreover, none of common chemotherapy drugs could inhibit the proliferations of all three patient-derived cells. Comprehensive analysis of their whole exome sequencing demonstrated that tumor-associated genes TP53, AKT2, FGFR3, FGF10, SDHA, and PI3KCA were mutated or amplified. Part of these alterations are actionable. By screening a set of compounds that are associated with the genetic alteration, it has been found that GDC-0941 and PF-04691502 for PI3K-AKT-mTOR pathway inhibitors could dramatically decrease the proliferation of three patient-derived cells. Importantly, expression of phosphorylated AKT and phosphorylated S6 were markedly decreased after treatments with PI3K-AKT-mTOR pathway inhibitors GDC-0941 (0.5 μM) and PF-04691502 (0.1 μM) in all three patient-derived cells. These data suggested that inhibition of the PI3K-AKT-mTOR pathway that was activated by PIK3CA amplification in all three patient-derived cells could reduce the cell proliferation. A patient-derived cell model combined with whole exome sequencing is a powerful tool to elucidate relationship between drug sensitivities and genetic alternations. In these gallbladder sarcomatoid carcinoma patient-derived cells, it is found that PIK3CA amplification could be used as a biomarker to indicate PI3K-AKT-mTOR pathway activation. Block of the pathway may benefit the gallbladder sarcomatoid carcinoma patient with this alternation in hypothesis. The real efficacy needs to be confirmed in vivo or in a clinical trial.
Functionally conserved enhancers with divergent sequences in distant vertebrates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Song; Oksenberg, Nir; Takayama, Sachiko

To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.
Functionally conserved enhancers with divergent sequences in distant vertebrates

DOE PAGES

Yang, Song; Oksenberg, Nir; Takayama, Sachiko; ...

2015-10-30

To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.
Inferences from structural comparison: flexibility, secondary structure wobble and sequence alignment optimization.

PubMed

Zhang, Gaihua; Su, Zhen

2012-01-01

Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

PubMed

Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

2013-08-01

To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
Insights on the poster preparation and presentation process.

PubMed

Moore, L W; Augspurger, P; King, M O; Proffitt, C

2001-05-01

Dissemination of research findings and effective clinical innovations is key to the growth and development of the nursing profession. Several avenues exist for the dissemination of information. One forum for communication that has gained increased recognition over the past decade is the poster presentation. Poster presentations are often a significant part of regional, national, and international nursing conferences. Although posters are frequently used to disseminate information to the nursing community, little is reported about actual poster presenters' experiences with preparation and presentation of their posters. The purpose of this article is to present insights derived from information shared by poster presenters regarding the poster preparation and presentation process. Such insights derived from the personal experiences of poster presenters may assist others to efficiently and effectively prepare and present scholarly posters that disseminate information to the nursing community. Copyright 2001 by W.B. Saunders Company
Mechanistic insights into the recognition of 5-methylcytosine oxidation derivatives by the SUVH5 SRA domain

PubMed Central

Rajakumara, Eerappa; Nakarakanti, Naveen Kumar; Nivya, M. Angel; Satish, Mutyala

2016-01-01

5-Methylcytosine (5 mC) is associated with epigenetic gene silencing in mammals and plants. 5 mC is consecutively oxidized to 5-hydroxymethylcytosine (5 hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) by ten-eleven translocation enzymes. We performed binding and structural studies to investigate the molecular basis of the recognition of the 5 mC oxidation derivatives in the context of a CG sequence by the SET- and RING-associated domain (SRA) of the SUVH5 protein (SUVH5 SRA). Using calorimetric measurements, we demonstrate that the SRA domain binds to the hydroxymethylated CG (5hmCG) DNA duplex in a similar manner to methylated CG (5mCG). Interestingly, the SUVH5 SRA domain exhibits weaker affinity towards carboxylated CG (5caCG) and formylated CG (5fCG). We report the 2.6 Å resolution crystal structure of the SUVH5 SRA domain in a complex with fully hydroxymethyl-CG and demonstrate a dual flip-out mechanism, whereby the symmetrical 5hmCs are simultaneously extruded from the partner strands of the DNA duplex and are positioned within the binding pockets of individual SRA domains. The hydroxyl group of 5hmC establishes both intra- and intermolecular interactions in the binding pocket. Collectively, we show that SUVH5 SRA recognizes 5hmC in a similar manner to 5 mC, but exhibits weaker affinity towards 5 hmC oxidation derivatives. PMID:26841909
Full analytical solution of the bloch equation when using a hyperbolic-secant driving function.

PubMed

Zhang, Jinjin; Garwood, Michael; Park, Jang-Yeon

2017-04-01

The frequency-swept pulse known as the hyperbolic-secant (HS) pulse is popular in NMR for achieving adiabatic spin inversion. The HS pulse has also shown utility for achieving excitation and refocusing in gradient-echo and spin-echo sequences, including new ultrashort echo-time imaging (e.g., Sweep Imaging with Fourier Transform, SWIFT) and B 1 mapping techniques. To facilitate the analysis of these techniques, the complete theoretical solution of the Bloch equation, as driven by the HS pulse, was derived for an arbitrary state of initial magnetization. The solution of the Bloch-Riccati equation for transverse and longitudinal magnetization for an arbitrary initial state was derived analytically in terms of HS pulse parameters. The analytical solution was compared with the solutions using both the Runge-Kutta method and the small-tip approximation. The analytical solution was demonstrated on different initial states at different frequency offsets with/without a combination of HS pulses. Evolution of the transverse magnetization was influenced significantly by the choice of HS pulse parameters. The deviation of the magnitude of the transverse magnetization, as obtained by comparing the small-tip approximation to the analytical solution, was < 5% for flip angles < 30 °, but > 10% for the flip angles > 40 °. The derived analytical solution provides insights into the influence of HS pulse parameters on the magnetization evolution. Magn Reson Med 77:1630-1638, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
Computational and Experimental Insight Into Single-Molecule Piezoelectric Materials

NASA Astrophysics Data System (ADS)

Marvin, Christopher Wayne

Piezoelectric materials allow for the harvesting of ambient waste energy from the environment. Producing lightweight, highly responsive materials is a challenge for this type of material, requiring polymer, foam, or bio-inspired materials. In this dissertation, I explore the origin of the piezoelectric effect in single molecules through density functional theory (DFT), analyze the piezoresponse of bio-inspired peptidic materials through the use of atomic and piezoresponse force microscopy (AFM and PFM), and develop a novel class of materials combining flexible polyurethane foams and non-piezoelectric, polar dopants. For the DFT calculations, functional group, regiochemical, and heteroatom derivatives of [6]helicene were examined for their influence on the piezoelectric response. An aza[6]helicene derivative was found to have a piezoelectric response (108 pm/V) comparable to ceramics such as lead zirconium titanate (200+ pm/V). These computed materials have the possibility to compete with current field-leading piezomaterials such as lead zirconium titanate (PZT), zinc oxide (ZnO), and polyvinylidene difluoride (PVDF) and its derivatives. The use of AFM/PFM allows for the demonstration of the piezoelectric effect of the selfassembled monolayer (SAM) peptidic systems. Through PFM, the influence that the helicity and sequence of the peptide has on the overall response of the molecule can be analyzed. Finally, development of a novel class of piezoelectrics, the foam-based materials, expands the current understanding of the qualities required for a piezoelectric material from ceramic and rigid materials to more flexible, organic materials. Through the exploration of these novel types of piezoelectric materials, new design rules and figures of merit have been developed.
Identifying binding modes of two synthetic derivatives of adrenalin to the α2C-adrenoceptor by using molecular modeling; insights into the α2C-adrenoceptor activation.

PubMed

Gholami, Samira; Bordbar, A Khalegh; Lohrasebi, Amir

2017-04-01

Although, α2C adrenergic receptor (AR) mediates a number of physiological functions in vivo and has great therapeutic potential, the absence of its crystal structure is a major difficulty in the activation mechanism studies and drug design endeavors. Here, a homology model of α2C AR has been presented by means of multiple sequence alignment. The used templates were the latest crystal structures of the other ARs (Protein Data Bank IDs: 2R4R, 2RH1, 4GPO, 3P0G, 4BVN and 4LDO) that have 38.4% identity with the query. We then conducted docking simulations to understand and analyze the binding of noradrenaline (NOR), and its derivatives, namely arachidonoyl adrenalin (AA-AD) and arachidonoyl noradrenalin (AA-NOR) to the receptor. The existence of H-bonds between the ligands and SER218 residue implies the same binding site of derivatives with respect to the NOR. AA-AD and AA-NOR bind to the receptor with the larger binding affinities. The presence of salt bridge between ARG149 and GLU377 in the free receptor, obtained from molecular dynamics studies proved that the receptor still is in its basal state before binding process take places. The activation process is characterized by increasing in the RMSD values of the backbone receptor in the bound state, increasing the RMSF of the transmembrane involved in the activation process and the disappearance of the ARG149-GLU377 salt bridge. Copyright © 2017 Elsevier B.V. All rights reserved.
Rediscovering Medicinal Plants' Potential with OMICS: Microsatellite Survey in Expressed Sequence Tags of Eleven Traditional Plants with Potent Antidiabetic Properties

PubMed Central

Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar

2014-01-01

Abstract Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes. PMID:24802971

Some links on this page may take you to non-federal websites. Their policies may differ from this site.