Sample records for previously determined sequences

  1. Complete Genome Sequence of the Avian Paramyxovirus Serotype 5 Strain APMV-5/budgerigar/Japan/TI/75.

    PubMed

    Hiono, Takahiro; Matsuno, Keita; Tuchiya, Kotaro; Lin, Zhifeng; Okamatsu, Masatoshi; Sakoda, Yoshihiro

    2016-09-22

    Here, we report the complete genome sequence of the avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/TI/75, which was determined using the Illumina MiSeq platform. The determined sequence shares 97% homology and similar genetic features with the previously known genome sequence of avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/Kunitachi/74. Copyright © 2016 Hiono et al.

  2. Complete nucleotide sequence of a novel Hibiscus-infecting Cilevirus from Florida and its relationship with closely associated Cileviruses

    USDA-ARS's Scientific Manuscript database

    The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...

  3. Partial gene sequences for the A subunit of methyl-coenzyme M reductase (mcrI) as a phylogenetic tool for the family Methanosarcinaceae

    NASA Technical Reports Server (NTRS)

    Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.

    1995-01-01

    Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 +/- 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.

  4. Things Fall Apart: A Recurrence of Tiling

    ERIC Educational Resources Information Center

    Rogers, Douglas G.

    2005-01-01

    A study investigates recurrence relations, sequences in which each term is determined by one or more previous terms. Results provide another approach to the problem of finding closed forms for recursively-defined sequences.

  5. Thermodynamic characterization of tandem mismatches found in naturally occurring RNA

    PubMed Central

    Christiansen, Martha E.; Znosko, Brent M.

    2009-01-01

    Although all sequence symmetric tandem mismatches and some sequence asymmetric tandem mismatches have been thermodynamically characterized and a model has been proposed to predict the stability of previously unmeasured sequence asymmetric tandem mismatches [Christiansen,M.E. and Znosko,B.M. (2008) Biochemistry, 47, 4329–4336], experimental thermodynamic data for frequently occurring tandem mismatches is lacking. Since experimental data is preferred over a predictive model, the thermodynamic parameters for 25 frequently occurring tandem mismatches were determined. These new experimental values, on average, are 1.0 kcal/mol different from the values predicted for these mismatches using the previous model. The data for the sequence asymmetric tandem mismatches reported here were then combined with the data for 72 sequence asymmetric tandem mismatches that were published previously, and the parameters used to predict the thermodynamics of previously unmeasured sequence asymmetric tandem mismatches were updated. The average absolute difference between the measured values and the values predicted using these updated parameters is 0.5 kcal/mol. This updated model improves the prediction for tandem mismatches that were predicted rather poorly by the previous model. This new experimental data and updated predictive model allow for more accurate calculations of the free energy of RNA duplexes containing tandem mismatches, and, furthermore, should allow for improved prediction of secondary structure from sequence. PMID:19509311

  6. Genetic characterization of L-Zagreb mumps vaccine strain.

    PubMed

    Ivancic, Jelena; Gulija, Tanja Kosutic; Forcic, Dubravko; Baricevic, Marijana; Jug, Renata; Mesko-Prejac, Majda; Mazuran, Renata

    2005-04-01

    Eleven mumps vaccine strains, all containing live attenuated virus, have been used throughout the world. Although L-Zagreb mumps vaccine has been licensed since 1972, only its partial nucleotide sequence was previously determined (accession numbers , and ). Therefore, we sequenced the entire genome of L-Zagreb vaccine strain (Institute of Immunology Inc., Zagreb, Croatia). In order to investigate the genetic stability of the vaccine, sequences of both L-Zagreb master seed and currently produced vaccine batch were determined and no difference between them was observed. A phylogenetic analysis based on SH gene sequence has shown that L-Zagreb strain does not belong to any of established mumps genotypes and that it is most similar to old, laboratory preserved European strains (1950s-1970s). L-Zagreb nucleotide and deduced protein sequences were compared with other mumps virus sequences obtained from the GenBank. Emphasis was put on functionally important protein regions and known antigenic epitopes. The extensive comparisons of nucleotide and deduced protein sequences between L-Zagreb vaccine strain and other previously determined mumps virus sequences have shown that while the functional regions of HN, V, and L proteins are well conserved among various mumps strains, there can be a substantial amino acid difference in antigenic epitopes of all proteins and in functional regions of F protein. No molecular pattern was identified that can be used as a distinction marker between virulent and attenuated strains.

  7. Genetic and phylogenetic analysis of a novel parvovirus isolated from chickens in Guangxi, China.

    PubMed

    Feng, Bin; Xie, Zhixun; Deng, Xianwen; Xie, Liji; Xie, Zhiqin; Huang, Li; Fan, Qin; Luo, Sisi; Huang, Jiaoling; Zhang, Yanfang; Zeng, Tingting; Wang, Sheng; Wang, Leyi

    2016-11-01

    A previously unidentified chicken parvovirus (ChPV) strain, associated with runting-stunting syndrome (RSS), is now endemic among chickens in China. To explore the genetic diversity of ChPV strains, we determined the first complete genome sequence of a novel ChPV isolate (GX-CH-PV-7) identified in chickens in Guangxi, China, and showed moderate genome sequence similarity to reference strains. Analysis showed that the viral genome sequence is 86.4 %-93.9 % identical to those of other ChPVs. Genetic and phylogenetic analyses showed that this newly emergent GX-CH-PV-7 is closely related to Gallus gallus enteric parvovirus isolate ChPV 798 from the USA, indicating that they may share a common ancestor. The complete DNA sequence is 4612 bp long with an A+T content of 56.66 %. We determined the first complete genome sequence of a previously unidentified ChPV strain to elucidate its origin and evolutionary status.

  8. Enhanced arbovirus surveillance with deep sequencing: Identification of novel rhabdoviruses and bunyaviruses in Australian mosquitoes.

    PubMed

    Coffey, Lark L; Page, Brady L; Greninger, Alexander L; Herring, Belinda L; Russell, Richard C; Doggett, Stephen L; Haniotis, John; Wang, Chunlin; Deng, Xutao; Delwart, Eric L

    2014-01-05

    Viral metagenomics characterizes known and identifies unknown viruses based on sequence similarities to any previously sequenced viral genomes. A metagenomics approach was used to identify virus sequences in Australian mosquitoes causing cytopathic effects in inoculated mammalian cell cultures. Sequence comparisons revealed strains of Liao Ning virus (Reovirus, Seadornavirus), previously detected only in China, livestock-infecting Stretch Lagoon virus (Reovirus, Orbivirus), two novel dimarhabdoviruses, named Beaumont and North Creek viruses, and two novel orthobunyaviruses, named Murrumbidgee and Salt Ash viruses. The novel virus proteomes diverged by ≥ 50% relative to their closest previously genetically characterized viral relatives. Deep sequencing also generated genomes of Warrego and Wallal viruses, orbiviruses linked to kangaroo blindness, whose genomes had not been fully characterized. This study highlights viral metagenomics in concert with traditional arbovirus surveillance to characterize known and new arboviruses in field-collected mosquitoes. Follow-up epidemiological studies are required to determine whether the novel viruses infect humans. © 2013 Elsevier Inc. All rights reserved.

  9. Mammalian genome projects reveal new growth hormone (GH) sequences. Characterization of the GH-encoding genes of armadillo (Dasypus novemcinctus), hedgehog (Erinaceus europaeus), bat (Myotis lucifugus), hyrax (Procavia capensis), shrew (Sorex araneus), ground squirrel (Spermophilus tridecemlineatus), elephant (Loxodonta africana), cat (Felis catus) and opossum (Monodelphis domestica).

    PubMed

    Wallis, Michael

    2008-01-15

    Mammalian growth hormone (GH) sequences have been shown previously to display episodic evolution: the sequence is generally strongly conserved but on at least two occasions during mammalian evolution (on lineages leading to higher primates and ruminants) bursts of rapid evolution occurred. However, the number of mammalian orders studied previously has been relatively limited, and the availability of sequence data via mammalian genome projects provides the potential for extending the range of GH gene sequences examined. Complete or nearly complete GH gene sequences for six mammalian species for which no data were previously available have been extracted from the genome databases-Dasypus novemcinctus (nine-banded armadillo), Erinaceus europaeus (western European hedgehog), Myotis lucifugus (little brown bat), Procavia capensis (cape rock hyrax), Sorex araneus (European shrew), Spermophilus tridecemlineatus (13-lined ground squirrel). In addition incomplete data for several other species have been extended. Examination of the data in detail and comparison with previously available sequences has allowed assessment of the reliability of deduced sequences. Several of the new sequences differ substantially from the consensus sequence previously determined for eutherian GHs, indicating greater variability than previously recognised, and confirming the episodic pattern of evolution. The episodic pattern is not seen for signal sequences, 5' upstream sequence or synonymous substitutions-it is specific to the mature protein sequence, suggesting that it relates to the hormonal function. The substitutions accumulated during the course of GH evolution have occurred mainly on the side of the hormone facing away from the receptor, in a non-random fashion, and it is suggested that this may reflect interaction of the receptor-bound hormone with other proteins or small ligands.

  10. Determination of evolutionary relationships of outbreak-associated Listeria monocytogenes strains of serotypes 1/2a and 1/2b by whole-genome sequencing

    USDA-ARS's Scientific Manuscript database

    We used whole-genome sequencing to determine evolutionary relationships among 20 outbreak-associated clinical isolates of Listeria monocytogenes serotypes 1/2a and 1/2b. Isolates from 6 of 11 outbreaks fell outside the clonal groups or “epidemic clones” that have been previously associated with outb...

  11. Terminal region sequence variations in variola virus DNA.

    PubMed

    Massung, R F; Loparev, V N; Knight, J C; Totmenin, A V; Chizhikov, V E; Parsons, J M; Safronov, P F; Gutorov, V V; Shchelkunov, S N; Esposito, J J

    1996-07-15

    Genome DNA terminal region sequences were determined for a Brazilian alastrim variola minor virus strain Garcia-1966 that was associated with a 0.8% case-fatality rate and African smallpox strains Congo-1970 and Somalia-1977 associated with variola major (9.6%) and minor (0.4%) mortality rates, respectively. A base sequence identity of ≥ 98.8% was determined after aligning 30 kb of the left- or right-end region sequences with cognate sequences previously determined for Asian variola major strains India-1967 (31% death rate) and Bangladesh-1975 (18.5% death rate). The deduced amino acid sequences of putative proteins of ≥ 65 amino acids also showed relatively high identity, although the Asian and African viruses were clearly more related to each other than to alastrim virus. Alastrim virus contained only 10 of 70 proteins that were 100% identical to homologs in Asian strains, and 7 alastrim-specific proteins were noted.

  12. Identification and characterization of Theileria ovis surface protein (ToSp) resembled TaSp in Theileria annulata.

    PubMed

    Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E

    2016-05-01

    Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions which causes high economic losses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequence of TaSp is, at least in part, very conserved in T. annulata, Theileria lestoquardi and Theileria china I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animals. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequences of TaSp genes of T. annulata, T. china I, T. china II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1, respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequences of T. annulata, T. china I, T. china II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N terminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which makes this region attractive for designing a suitable tool for serological diagnosis.

  13. The complete sequence of Cymbidium mosaic virus from Vanilla fragrans in Hainan, China.

    PubMed

    He, Zhen; Jiang, Dongmei; Liu, Aiqin; Sang, Liwei; Li, Wenfeng; Li, Shifang

    2011-06-01

    The complete nucleotide sequence of Cymbidium mosaic virus (CymMV) isolated from vanilla in Hainan province, China was determined for the first time. It comprised 6,224 nucleotides; sequence analysis suggested that the isolate we obtained was a member of the genus Potexvirus, and its sequence shared 86.67-96.61% identity with previously reported sequences. Phylogenetic analysis suggested that CymMV from Vanilla fragrans was clustered into subgroup A and the isolates in this subgroup displayed little regional difference.

  14. Simultaneous activation of parallel sensory pathways promotes a grooming sequence in Drosophila

    PubMed Central

    Hampel, Stefanie; McKellar, Claire E

    2017-01-01

    A central model that describes how behavioral sequences are produced features a neural architecture that readies different movements simultaneously, and a mechanism where prioritized suppression between the movements determines their sequential performance. We previously described a model whereby suppression drives a Drosophila grooming sequence that is induced by simultaneous activation of different sensory pathways that each elicit a distinct movement (Seeds et al., 2014). Here, we confirm this model using transgenic expression to identify and optogenetically activate sensory neurons that elicit specific grooming movements. Simultaneous activation of different sensory pathways elicits a grooming sequence that resembles the naturally induced sequence. Moreover, the sequence proceeds after the sensory excitation is terminated, indicating that a persistent trace of this excitation induces the next grooming movement once the previous one is performed. This reveals a mechanism whereby parallel sensory inputs can be integrated and stored to elicit a delayed and sequential grooming response. PMID:28887878

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wemmer, D.E.; Kumar, N.V.; Metrione, R.M.

    Toxin II from Radianthus paumotensis (RpII) has been investigated by high-resolution NMR and chemical sequencing methods. Resonance assignments have been obtained for this protein by the sequential approach. NMR assignments could not be made consistent with the previously reported primary sequence for this protein, and chemical methods have been used to determine a sequence with which the NMR data are consistent. Analysis of the 2D NOE spectra shows that the protein secondary structure is comprised of two sequences of β-sheet, probably joined into a distorted continuous sheet, connected by turns and extended loops, without any regular α-helical segments. The residues previously implicated in activity in this class of proteins, D8 and R13, occur in a loop region.

  16. An extended sequence specificity for UV-induced DNA damage.

    PubMed

    Chung, Long H; Murray, Vincent

    2018-01-01

    The sequence specificity of UV-induced DNA damage was determined with a higher precision and accuracy than previously reported. UV light induces two major damage adducts: cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). Employing capillary electrophoresis with laser-induced fluorescence and taking advantage of the distinct properties of the CPDs and 6-4PPs, we studied the sequence specificity of UV-induced DNA damage in a purified DNA sequence using two approaches: end-labelling and a polymerase stop/linear amplification assay. A mitochondrial DNA sequence that contained a random nucleotide composition was employed as the target DNA sequence. With previous methodology, the UV sequence specificity was determined at a dinucleotide or trinucleotide level; however, in this paper, we have extended the UV sequence specificity to a hexanucleotide level. With the end-labelling technique (for 6-4PPs), the consensus sequence was found to be 5'-GCTC*AC (where C* is the breakage site); while with the linear amplification procedure, it was 5'-TCTT*AC. With end-labelling, the dinucleotide frequency of occurrence was highest for 5'-TC*, 5'-TT* and 5'-CC*; whereas it was 5'-TT* for linear amplification. The influence of neighbouring nucleotides on the degree of UV-induced DNA damage was also examined. The core sequences consisted of pyrimidine nucleotides 5'-CTC* and 5'-CTT* while an A at position "1" and C at position "2" enhanced UV-induced DNA damage. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.

  17. Molecular characterization of faba bean necrotic yellows viruses in Tunisia.

    PubMed

    Kraberger, Simona; Kumari, Safaa G; Najar, Asma; Stainton, Daisy; Martin, Darren P; Varsani, Arvind

    2018-03-01

    Faba bean necrotic yellows virus (FBNYV) (genus Nanovirus; family Nanoviridae) has a genome comprising eight individually encapsidated circular single-stranded DNA components. It has frequently been found infecting faba bean (Vicia faba L.) and chickpea (Cicer arietinum L.) in association with satellite molecules (alphasatellites). Genome sequences of FBNYV from Azerbaijan, Egypt, Iran, Morocco, Spain and Syria have been determined previously and we now report the first five genome sequences of FBNYV and associated alphasatellites from faba bean sampled in Tunisia. In addition, we have determined the genome sequences of two additional FBNYV isolates from chickpea plants sampled in Syria and Iran. All individual FBNYV genome component sequences that were determined here share > 84% nucleotide sequence identity with FBNYV sequences available in public databases, with the DNA-M component displaying the highest degree of diversity. As with other studied nanoviruses, recombination and genome component reassortment occurs frequently both between FBNYV genomes and between genomes of nanoviruses belonging to other species.

  18. Sequence requirement of the ade6-4095 meiotic recombination hotspot in Schizosaccharomyces pombe.

    PubMed

    Foulis, Steven J; Fowler, Kyle R; Steiner, Walter W

    2018-02-01

    Homologous recombination occurs at a greatly elevated frequency in meiosis compared to mitosis and is initiated by programmed double-strand DNA breaks (DSBs). DSBs do not occur at uniform frequency throughout the genome in most organisms, but occur preferentially at a limited number of sites referred to as hotspots. The locations of hotspots have been determined at nucleotide-level resolution in both the budding and fission yeasts, and while several patterns have emerged regarding preferred locations for DSB hotspots, it remains unclear why particular sites experience DSBs at much higher frequency than other sites with seemingly similar properties. Short sequence motifs, which are often sites for binding of transcription factors, are known to be responsible for a number of hotspots. In this study we identified the minimum sequence required for activity of one such motif identified in a screen of random sequences capable of producing recombination hotspots. The experimentally determined sequence, GGTCTRGACC, closely matches the previously inferred sequence. Full hotspot activity requires an effective sequence length of 9.5 bp, whereas moderate activity requires an effective sequence length of approximately 8.2 bp and shows significant association with DSB hotspots. In combination with our previous work, this result is consistent with a large number of different sequence motifs capable of producing recombination hotspots, and supports a model in which hotspots can be rapidly regenerated by mutation as they are lost through recombination.

  19. Stable integration and expression of heterologous genes in several lactobacilli using an integration vector constructed from the integrase and attP sequences of phage ΦAT3 isolated from Lactobacillus casei ATCC 393.

    PubMed

    Lin, Chao-Fen; Lo, Ta-Chun; Kuo, Yang-Cheng; Lin, Thy-Hou

    2013-04-01

    An integration vector capable of stably integrating and maintaining in the chromosomes of several lactobacilli over hundreds of generations has been constructed. The major integration machinery used is based on the ΦAT3 integrase (int) and attP sequences determined previously. A novel core sequence located at the 3' end of the tRNA(leu) gene is identified in Lactobacillus fermentum ATCC 14931 as the integration target by the integration vector though most of such sequences found in other lactobacilli are similar to that determined previously. Due to the lack of an appropriate attB site in Lactococcus lactis MG1363, the integration vector is found to be unable to integrate into the chromosome of the strain. However, such integration can be successfully restored by cotransforming the integration vector with a replicative one harboring both attB and erythromycin resistance sequences into the strain. Furthermore, the integration vector constructed carries a promoter region of placT from the chromosome of Lactobacillus rhamnosus TCELL-1 which is used to express green fluorescence and luminance protein genes in the lactobacilli studied.

  20. Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease.

    PubMed

    Carss, Keren J; Arno, Gavin; Erwood, Marie; Stephens, Jonathan; Sanchis-Juan, Alba; Hull, Sarah; Megy, Karyn; Grozeva, Detelina; Dewhurst, Eleanor; Malka, Samantha; Plagnol, Vincent; Penkett, Christopher; Stirrups, Kathleen; Rizzo, Roberta; Wright, Genevieve; Josifova, Dragana; Bitner-Glindzicz, Maria; Scott, Richard H; Clement, Emma; Allen, Louise; Armstrong, Ruth; Brady, Angela F; Carmichael, Jenny; Chitre, Manali; Henderson, Robert H H; Hurst, Jane; MacLaren, Robert E; Murphy, Elaine; Paterson, Joan; Rosser, Elisabeth; Thompson, Dorothy A; Wakeling, Emma; Ouwehand, Willem H; Michaelides, Michel; Moore, Anthony T; Webster, Andrew R; Raymond, F Lucy

    2017-01-05

    Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. Copyright © 2017. Published by Elsevier Inc.

  1. Novel primer specific false terminations during DNA sequencing reactions: danger of inaccuracy of mutation analysis in molecular diagnostics

    PubMed Central

    Anwar, R; Booth, A; Churchill, A J; Markham, A F

    1996-01-01

    The determination of nucleotide sequence is fundamental to the identification and molecular analysis of genes. Direct sequencing of PCR products is now becoming a commonplace procedure for haplotype analysis, and for defining mutations and polymorphism within genes, particularly for diagnostic purposes. A previously unrecognised phenomenon, primer related variability, observed in sequence data generated using Taq cycle sequencing and T7 Sequenase sequencing, is reported. This suggests that caution is necessary when interpreting DNA sequence data. This is particularly important in situations where treatment may be dependent on the accuracy of the molecular diagnosis. PMID:16696096

  2. Draft Genome Sequence of Xylella fastidiosa subsp. fastidiosa Strain Stag's Leap.

    PubMed

    Chen, J; Wu, F; Zheng, Z; Deng, X; Burbank, L P; Stenger, D C

    2016-04-21

    Xylella fastidiosa subsp. fastidiosa causes Pierce's disease of grapevine. Presented here is the draft genome sequence of the Stag's Leap strain, previously used in pathogenicity/virulence assays to evaluate grapevine germplasm bearing Pierce's disease resistance and a phenotypic assessment of knockout mutants to determine gene function. Copyright © 2016 Chen et al.

  3. BIOCHEMICAL AND PHYLOGENETIC CHARACTERIZATION OF TWO NOVEL DEEP-SEA THERMOCOCCUS ISOLATES WITH POTENTIALLY BIOTECHNOLOGICAL APPLICATIONS

    EPA Science Inventory

    The partial 16S rDNA gene sequences of two thermophilic archaeal strains, TY and TYS, previously isolated from the Guaymas Basin hydrothermal vent site were determined. Lipid analyses and a comparative analysis performed with 16S rDNA sequences of similar thermophilic species sho...

  4. The determination of complete human mitochondrial DNA sequences in single cells: implications for the study of somatic mitochondrial DNA point mutations

    PubMed Central

    Taylor, Robert W.; Taylor, Geoffrey A.; Durham, Steve E.; Turnbull, Douglass M.

    2001-01-01

    Studies of single cells have previously shown intracellular clonal expansion of mitochondrial DNA (mtDNA) mutations to levels that can cause a focal cytochrome c oxidase (COX) defect. Whilst techniques are available to study mtDNA rearrangements at the level of the single cell, recent interest has focused on the possible role of somatic mtDNA point mutations in ageing, neurodegenerative disease and cancer. We have therefore developed a method that permits the reliable determination of the entire mtDNA sequence from single cells without amplifying contaminating, nuclear-embedded pseudogenes. Sequencing and PCR–RFLP analyses of individual COX-negative muscle fibres from a patient with a previously described heteroplasmic COX II (T7587C) mutation indicate that mutant loads as low as 30% can be reliably detected by sequencing. This technique will be particularly useful in identifying the mtDNA mutational spectra in age-related COX-negative cells and will increase our understanding of the pathogenetic mechanisms by which they occur. PMID:11470889

  5. Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

    PubMed

    Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

    2010-10-01

    Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for the sequence variations. Fourteen of them had the full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V, V2292I (P<0.05) within eISDR were significantly associated with resistance to treatment. However, the extent of amino acid variations within previously defined ISDR was not associated with resistance to treatment as previously reported. Four amino acid variations, I248V (P=0.03-0.06) within E1, R445K (P=0.02-0.05) and S747T (P=0.03) within E2, and I861V (P=0.01) within NS2, which are located outside the eISDR, may also associate with treatment outcome as identified by a prescreening of variations within 14 HCV 6a full genomes. (c) 2010 Elsevier B.V. All rights reserved.

  6. Ribosomal DNA intergenic spacer sequence in foxtail millet, Setaria italica (L.) P. Beauv. and its characterization and application to typing of foxtail millet landraces.

    PubMed

    Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto

    2005-02-01

    We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.

  7. Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat.

    PubMed

    Harris, J Kirk; Caporaso, J Gregory; Walker, Jeffrey J; Spear, John R; Gold, Nicholas J; Robertson, Charles E; Hugenholtz, Philip; Goodrich, Julia; McDonald, Daniel; Knights, Dan; Marshall, Paul; Tufo, Henry; Knight, Rob; Pace, Norman R

    2013-01-01

    The microbial mats of Guerrero Negro (GN), Baja California Sur, Mexico historically were considered a simple environment, dominated by cyanobacteria and sulfate-reducing bacteria. Culture-independent rRNA community profiling instead revealed these microbial mats as among the most phylogenetically diverse environments known. A preliminary molecular survey of the GN mat based on only ∼1500 small subunit rRNA gene sequences discovered several new phylum-level groups in the bacterial phylogenetic domain and many previously undetected lower-level taxa. We determined an additional ∼119,000 nearly full-length sequences and 28,000 >200 nucleotide 454 reads from a 10-layer depth profile of the GN mat. With this unprecedented coverage of long sequences from one environment, we confirm the mat is phylogenetically stratified, presumably corresponding to light and geochemical gradients throughout the depth of the mat. Previous shotgun metagenomic data from the same depth profile show the same stratified pattern and suggest that metagenome properties may be predictable from rRNA gene sequences. We verify previously identified novel lineages and identify new phylogenetic diversity at lower taxonomic levels, for example, thousands of operational taxonomic units at the family-genus levels differ considerably from known sequences. The new sequences populate parts of the bacterial phylogenetic tree that previously were poorly described, but indicate that any comprehensive survey of GN diversity has only begun. Finally, we show that taxonomic conclusions are generally congruent between Sanger and 454 sequencing technologies, with the taxonomic resolution achieved dependent on the abundance of reference sequences in the relevant region of the rRNA tree of life.

  8. Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human palaeogenomics.

    PubMed

    García-Garcerà, Marc; Gigli, Elena; Sanchez-Quinto, Federico; Ramirez, Oscar; Calafell, Francesc; Civit, Sergi; Lalueza-Fox, Carles

    2011-01-01

    Despite the successful retrieval of genomes from past remains, the prospects for human palaeogenomics remain unclear because of the difficulty of distinguishing contaminant from endogenous DNA sequences. Previous sequence data generated on high-throughput sequencing platforms indicate that fragmentation of ancient DNA sequences is a characteristic trait primarily arising due to depurination processes that create abasic sites leading to DNA breaks. METHODOLOGY/PRINCIPAL FINDINGS: To investigate whether this pattern is present in ancient remains from a temperate environment, we have 454-FLX pyrosequenced different samples dated between 5,500 and 49,000 years ago: a bone from an extinct goat (Myotragus balearicus) that was treated with a depurinating agent (bleach), an Iberian lynx bone not subjected to any treatment, a human Neolithic sample from Barcelona (Spain), and a Neandertal sample from the El Sidrón site (Asturias, Spain). The efficiency of retrieval of endogenous sequences is below 1% in all cases. We have used the non-human samples to identify human sequences (0.35 and 1.4%, respectively), that we positively know are contaminants. We observed that bleach treatment appears to create a depurination-associated fragmentation pattern in resulting contaminant sequences that is indistinguishable from previously described endogenous sequences. Furthermore, the nucleotide composition pattern observed in 5' and 3' ends of contaminant sequences is much more complex than the flat pattern previously described in some Neandertal contaminants. Although much research on samples with known contaminant histories is needed, our results suggest that endogenous and contaminant sequences cannot be distinguished by the fragmentation pattern alone.

  9. The Use and Effectiveness of Triple Multiplex System for Coding Region Single Nucleotide Polymorphism in Mitochondrial DNA Typing of Archaeologically Obtained Human Skeletons from Premodern Joseon Tombs of Korea

    PubMed Central

    Oh, Chang Seok; Lee, Soong Deok; Kim, Yi-Suk; Shin, Dong Hoon

    2015-01-01

    A previous study showed that East Asian mtDNA haplogroups, especially those of Koreans, could be successfully assigned by the coupled use of analyses on coding region SNP markers and control region mutation motifs. In this study, we tried to see whether the same triple multiplex analysis for coding region SNPs could also be applied to ancient samples from East Asia as a complement to sequence analysis of the mtDNA control region. From the study of Joseon skeleton samples, we found that the mtDNA haplogroup determined by coding region SNP markers falls within the same haplogroup that sequence analysis of the control region can assign. Considering that ancient samples in previous studies showed no small number of errors in control region mtDNA sequencing, coding region SNP analysis can be used as a good complement to conventional haplogroup determination, especially for archaeological human bone samples buried underground over long periods. PMID:26345190

  10. A metagenomic viral discovery approach identifies potential zoonotic and novel mammalian viruses in Neoromicia bats within South Africa.

    PubMed

    Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C J; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H; Cui, Helen; Markotter, Wanda

    2018-01-01

    Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard.

  11. A metagenomic viral discovery approach identifies potential zoonotic and novel mammalian viruses in Neoromicia bats within South Africa

    PubMed Central

    Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C. J.; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H.; Cui, Helen; Markotter, Wanda

    2018-01-01

    Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and herpesvirus sequences that were widespread throughout Neoromicia populations in South Africa. Furthermore, similar adenovirus sequences were detected within these populations throughout several years. With the exception of the coronaviruses, the study represents the first report of sequence data from several viral families within a Southern African insectivorous bat genus; highlighting the need for continued investigations in this regard. PMID:29579103

  12. First full-length genome sequence of the polerovirus luffa aphid-borne yellows virus (LABYV) reveals the presence of at least two consensus sequences in an isolate from Thailand.

    PubMed

    Knierim, Dennis; Maiss, Edgar; Kenyon, Lawrence; Winter, Stephan; Menzel, Wulf

    2015-10-01

    Luffa aphid-borne yellows virus (LABYV) was proposed as the name for a previously undescribed polerovirus based on partial genome sequences obtained from samples of cucurbit plants collected in Thailand between 2008 and 2013. In this study, we determined the first full-length genome sequence of LABYV. Based on phylogenetic analysis and genome properties, it is clear that this virus represents a distinct species in the genus Polerovirus. Analysis of sequences from sample TH24, which was collected in 2010 from a luffa plant in Thailand, reveals the presence of two different full-length genome consensus sequences.

  13. First complete genome sequence of infectious laryngotracheitis virus

    PubMed Central

    2011-01-01

    Background Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528

  14. [Multiplexing mapping of human cDNAs]. Final report, September 1, 1991--February 28, 1994

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    Using PCR with automated product analysis, 329 human brain cDNA sequences have been assigned to individual human chromosomes. Primers were designed from single-pass cDNA sequences (expressed sequence tags, ESTs). Primers were used in PCR reactions with DNA from somatic cell hybrid mapping panels as templates, often with multiplexing. Many of the mapped ESTs match sequence database records. To evaluate these matches, the position of the primers relative to the matching region (In), the BLAST scores and the Poisson probability values of the EST/sequence record match were determined. In cases where the gene product stringently identified by the sequence match had already been mapped, the gene locus determined by EST was consistent with the previous position, which strongly supports the validity of assigning unknown genes to human chromosomes based on the EST sequence matches. In the present cases mapping the ESTs to a chromosome can also be considered to have mapped the known gene product: rolipram-sensitive cAMP phosphodiesterase, chromosome 1; protein phosphatase 2Aβ, chromosome 4; alpha-catenin, chromosome 5; the ELE1 oncogene, chromosome 10q11.2 or q2.1-q23; MXII protein, chromosome 10q24-qter; ribosomal protein L18a homologue, chromosome 14; ribosomal protein L3, chromosome 17; and moesin, Xp11-cen. There were also ESTs mapped that were closely related to non-human sequence records. These matches therefore can be considered to identify human counterparts of known gene products, or members of known gene families. Examples of these include membrane proteins, translation-associated proteins, structural proteins, and enzymes. These data then demonstrate that single pass sequence information is sufficient to design PCR primers useful for assigning cDNA sequences to human chromosomes. When the EST sequence matches previous sequence database records, the chromosome assignments of the EST can be used to make preliminary assignments of the human gene to a chromosome.

  15. Precise detection of chromosomal translocation or inversion breakpoints by whole-genome sequencing.

    PubMed

    Suzuki, Toshifumi; Tsurusaki, Yoshinori; Nakashima, Mitsuko; Miyake, Noriko; Saitsu, Hirotomo; Takeda, Satoru; Matsumoto, Naomichi

    2014-12-01

    Structural variations (SVs), including translocations, inversions, deletions and duplications, are potentially associated with Mendelian diseases and contiguous gene syndromes. Determination of SV-related breakpoints at the nucleotide level is important to reveal the genetic causes for diseases. Whole-genome sequencing (WGS) by next-generation sequencers is expected to determine structural abnormalities more directly and efficiently than conventional methods. In this study, 14 SVs (9 balanced translocations, 1 inversion and 4 microdeletions) in 9 patients were analyzed by WGS with shallow (5×) to moderate (20×) read coverage. Among 28 breakpoints (as each SV has two breakpoints), 19 SV breakpoints had been determined previously at the nucleotide level by other methods and 9 were uncharacterized. BreakDancer and Integrative Genomics Viewer determined 20 breakpoints (16 translocation, 2 inversion and 2 deletion breakpoints), but did not detect 8 breakpoints (2 translocation and 6 deletion breakpoints). These data indicate the efficacy of WGS for the precise determination of translocation and inversion breakpoints.
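    The breakpoint counts in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers reported above; the per-type detection rates it prints are derived here for illustration and are not figures stated by the authors.

```python
# Counts as reported in the abstract; each SV contributes two breakpoints.
sv_counts = {"translocation": 9, "inversion": 1, "deletion": 4}
detected = {"translocation": 16, "inversion": 2, "deletion": 2}

# Total breakpoints per SV type, and the share detected by WGS.
total = {kind: 2 * n for kind, n in sv_counts.items()}
rates = {kind: 100.0 * detected[kind] / total[kind] for kind in total}

for kind in total:
    print(f"{kind}: {detected[kind]}/{total[kind]} detected ({rates[kind]:.1f}%)")
print(f"overall: {sum(detected.values())}/{sum(total.values())} detected")
```

    The per-type split shows that most of the undetected breakpoints were deletion breakpoints, which fits the abstract's conclusion that WGS is effective specifically for translocation and inversion breakpoints.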

  16. Non-invasive fetal sex determination by maternal plasma sequencing and application in X-linked disorder counseling.

    PubMed

    Pan, Xiaoyu; Zhang, Chunlei; Li, Xuchao; Chen, Shengpei; Ge, Huijuan; Zhang, Yanyan; Chen, Fang; Jiang, Hui; Jiang, Fuman; Zhang, Hongyun; Wang, Wei; Zhang, Xiuqing

    2014-12-01

    To develop a fetal sex determination method based on maternal plasma sequencing (MPS), assess its performance and potential use in X-linked disorder counseling. 900 cases of MPS data from a previous study were reviewed, in which 100 and 800 cases were used as training and validation set, respectively. The percentage of uniquely mapped sequencing reads on Y chromosome was calculated and used to classify male and female cases. Eight pregnant women who are carriers of Duchenne muscular dystrophy (DMD) mutations were recruited, whose plasma were subjected to multiplex sequencing and fetal sex determination analysis. In the training set, a sensitivity of 96% and false positive rate of 0% for male cases detection were reached in our method. The blinded validation results showed 421 in 423 male cases and 374 in 377 female cases were successfully identified, revealing sensitivity and specificity of 99.53% and 99.20% for fetal sex determination, at as early as 12 gestational weeks. Fetal sex for all eight DMD genetic counseling cases were correctly identified, which were confirmed by amniocentesis. Based on MPS, high accuracy of non-invasive fetal sex determination can be achieved. This method can potentially be used for prenatal genetic counseling.
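    The sensitivity and specificity quoted for the validation set follow directly from the reported case counts. A minimal sketch, assuming that male cases are treated as the positive class (the function name is illustrative, not taken from the study):

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) as percentages."""
    sensitivity = 100.0 * tp / (tp + fn)
    specificity = 100.0 * tn / (tn + fp)
    return sensitivity, specificity

# Validation counts reported in the abstract.
male_total, male_correct = 423, 421
female_total, female_correct = 377, 374

sens, spec = sensitivity_specificity(
    tp=male_correct,
    fn=male_total - male_correct,        # male cases not identified as male
    tn=female_correct,
    fp=female_total - female_correct,    # female cases not identified as female
)
print(f"sensitivity = {sens:.2f}%, specificity = {spec:.2f}%")
# prints "sensitivity = 99.53%, specificity = 99.20%"
```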

  17. Complete genome sequence of Clostridium pasteurianum NRRL B-598, a non-type strain producing butanol.

    PubMed

    Sedlar, Karel; Kolek, Jan; Skutkova, Helena; Branska, Barbora; Provaznik, Ivo; Patakova, Petra

    2015-11-20

    The strain Clostridium pasteurianum NRRL B-598 is a non-type, oxygen-tolerant, spore-forming, mesophilic and heterofermentative strain with high hydrogen production and the ability to carry out acetone-butanol fermentation (ethanol production being negligible). Here, we present the annotated complete genome sequence of this bacterium, replacing the previous draft genome assembly. The genome consisting of a single circular 6,186,879 bp chromosome with no plasmid was determined using PacBio RSII and Roche 454 sequencing. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Nucleotide Sequence of the blaRTG-2 (CARB-5) Gene and Phylogeny of a New Group of Carbenicillinases

    PubMed Central

    Choury, Daniele; Szajnert, Marie-France; Joly-Guillou, Marie-Laure; Azibi, Kemal; Delpech, Marc; Paul, Gérard

    2000-01-01

    We determined the nucleotide sequence of the bla gene for the Acinetobacter calcoaceticus β-lactamase previously described as CARB-5. Alignment of the deduced amino acid sequence with those of known β-lactamases revealed that CARB-5 possesses an RTG triad in box VII, as described for the Proteus mirabilis GN79 enzyme, instead of the RSG consensus characteristic of the other carbenicillinases. Phylogenetic studies showed that these RTG enzymes constitute a new, separate group, possibly ancestors of the carbenicillinase family. PMID:10722515

  19. Species composition of the genus Saprolegnia in fin fish aquaculture environments, as determined by nucleotide sequence analysis of the nuclear rDNA ITS regions.

    PubMed

    de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E

    2015-01-01

    The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  20. Unveiling the complete genome sequence of clerodendrum chlorotic spot virus, a putative dichorhavirus infecting ornamental plants.

    PubMed

    Ramos-González, Pedro Luis; Chabi-Jesus, Camila; Banguela-Castillo, Alexander; Tassi, Aline Daniele; Rodrigues, Mariane da Costa; Kitajima, Elliot Watanabe; Harakava, Ricardo; Freitas-Astúa, Juliana

    2018-06-04

    The genus Dichorhavirus includes plant-infecting rhabdoviruses with bisegmented genomes that are horizontally transmitted by false spider mites of the genus Brevipalpus. The complete genome sequences of three isolates of the putative dichorhavirus clerodendrum chlorotic spot virus were determined using next-generation sequencing (Illumina) and traditional RT-PCR. Their genome organization, sequence similarity and phylogenetic relationship to other viruses, and transmissibility by Brevipalpus yothersi mites support the assignment of these viruses to a new species of dichorhavirus, as suggested previously. New data are discussed stressing the reliability of the current rules for species demarcation and taxonomic status criteria within the genus Dichorhavirus.

  1. Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.

    PubMed

    Cutcutache, Ioana; Suzuki, Yuka; Tan, Iain Beehuat; Ramgopal, Subhashini; Zhang, Shenli; Ramnarayanan, Kalpana; Gan, Anna; Lee, Heng Hong; Tay, Su Ting; Ooi, Aikseng; Ong, Choon Kiat; Bolthouse, Jonathan T; Lane, Brian R; Anema, John G; Kahnoski, Richard J; Tan, Patrick; Teh, Bin Tean; Rozen, Steven G

    2015-07-01

    Testicular germ cell tumors are the most common cancer diagnosed in young men, and seminomas are the most common type of these cancers. There have been no exome-wide examinations of genes mutated in seminomas or of overall rates of nonsilent somatic mutations in these tumors. The objective was to analyze somatic mutations in seminomas to determine which genes are affected and to determine rates of nonsilent mutations. Eight seminomas and matched normal samples were surgically obtained from eight patients. DNA was extracted from tissue samples and exome sequenced on massively parallel Illumina DNA sequencers. Single-nucleotide polymorphism chip-based copy number analysis was also performed to assess copy number alterations. The DNA sequencing read data were analyzed to detect somatic mutations including single-nucleotide substitutions and short insertions and deletions. The detected mutations were validated by independent sequencing and further checked for subclonality. The rate of nonsynonymous somatic mutations averaged 0.31 mutations/Mb. We detected nonsilent somatic mutations in 96 genes that were not previously known to be mutated in seminomas, of which some may be driver mutations. Many of the mutations appear to have been present in subclonal populations. In addition, two genes, KIT and KRAS, were affected in two tumors each with mutations that were previously observed in other cancers and are presumably oncogenic. Our study, the first report on exome sequencing of seminomas, detected somatic mutations in 96 new genes, several of which may be targetable drivers. Furthermore, our results show that seminoma mutation rates are five times higher than previously thought, but are nevertheless low compared to other common cancers. Similar low rates are seen in other cancers that also have excellent rates of remission achieved with chemotherapy. We examined the DNA sequences of seminomas, the most common type of testicular germ cell cancer. Our study identified 96 new genes in which mutations occurred during seminoma development, some of which might contribute to cancer development or progression. The study also showed that the rates of DNA mutations during seminoma development are higher than previously thought, but still lower than for other common solid-organ cancers. Such low rates are also observed among other cancers that, like seminomas, show excellent rates of disease remission after chemotherapy. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  2. Nucleotide sequences of the tet(M) genes from the American and Dutch type tetracycline resistance plasmids of Neisseria gonorrhoeae.

    PubMed

    Gascoyne-Binzi, D M; Heritage, J; Hawkey, P M

    1993-11-01

    High-level tetracycline-resistant Neisseria gonorrhoeae (TRNG) has been associated with the presence of a plasmid approximately 25.2 MDa in size which carries a Tet M tetracycline resistance determinant. Two different plasmid types, American and Dutch, have previously been described, based on the restriction endonuclease digestion pattern. In this study, the tet(M) genes from the two plasmid types have been amplified by the polymerase chain reaction (PCR) and then sequenced. The gene sequences from the two plasmids shared 96.8% identity, and showed similarities with different segments of the tet(M) gene sequences from Tn1545, Tn916 and Ureaplasma urealyticum. The data suggest that it is highly likely that the Tet M determinant found in the American type plasmid has a different origin from that present in the Dutch plasmid.

  3. Measuring patterns in team interaction sequences using a discrete recurrence approach.

    PubMed

    Gorman, Jamie C; Cooke, Nancy J; Amazeen, Polemnia G; Fouse, Shannon

    2012-08-01

    Recurrence-based measures of communication determinism and pattern information are described and validated using previously collected team interaction data. Team coordination dynamics has revealed that "mixing" team membership can lead to flexible interaction processes, but keeping a team "intact" can lead to rigid interaction processes. We hypothesized that communication of intact teams would have greater determinism and higher pattern information compared to that of mixed teams. Determinism and pattern information were measured from three-person Uninhabited Air Vehicle team communication sequences over a series of 40-minute missions. Because team members communicated using push-to-talk buttons, communication sequences were automatically generated during each mission. The Composition x Mission determinism effect was significant. Intact teams' determinism increased over missions, whereas mixed teams' determinism did not change. Intact teams had significantly higher maximum pattern information than mixed teams. Results from these new communication analysis methods converge with content-based methods and support our hypotheses. Because they are not content based, and because they are automatic and fast, these new methods may be amenable to real-time communication pattern analysis.
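    To make the idea of determinism in a discrete sequence concrete, the sketch below computes the usual recurrence-quantification form of the measure: the percentage of recurrent points (positions i < j where the same speaker code occurs) that lie on diagonal runs of at least `min_line` points. This is a generic illustration under stated assumptions; the abstract does not give the authors' exact definition or parameters, and the function name, the `min_line` default and the letter-coded speaker sequences are hypothetical.

```python
def determinism(sequence, min_line=2):
    """Percent of recurrent points lying on diagonal lines of length >= min_line."""
    n = len(sequence)
    recurrent = 0   # all recurrent points above the main diagonal
    on_lines = 0    # recurrent points that belong to long enough diagonal runs
    # Each lag corresponds to one diagonal of the recurrence plot.
    for lag in range(1, n):
        run = 0
        for i in range(n - lag):
            if sequence[i] == sequence[i + lag]:
                recurrent += 1
                run += 1
            else:
                if run >= min_line:
                    on_lines += run
                run = 0
        # Close a run that reaches the end of the diagonal.
        if run >= min_line:
            on_lines += run
    return 100.0 * on_lines / recurrent if recurrent else 0.0

# Hypothetical speaker codes: a rigid turn-taking cycle versus isolated repeats.
print(determinism("ABCABCABC"))  # 100.0
print(determinism("ABACADAE"))   # 0.0
```

    A strictly repeating turn-taking cycle scores 100%, while a sequence whose repeats never occur in the same order scores 0%, which is the sense in which higher determinism indicates more rigid interaction.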

  4. Two-Stage orders sequencing system for mixed-model assembly

    NASA Astrophysics Data System (ADS)

    Zemczak, M.; Skolud, B.; Krenczyk, D.

    2015-11-01

    In the paper, the authors focus on the NP-hard problem of order sequencing, formulated similarly to the Car Sequencing Problem (CSP). The object of the research is an assembly line in an automotive industry company, on which a few different product models, each in a certain number of versions, are assembled on shared resources set in a line. This type of production is usually referred to as mixed-model production, and arose from the necessity of manufacturing customized products on the basis of very specific orders from single clients. Because competition in the automotive market is strong, producers are nowadays obliged to let each client determine a large number of features of the product they are willing to buy. Due to the previously mentioned nature of the problem (NP-hard), only satisfactory solutions are sought in the given time period, as a method for finding the optimal solution has not yet been found. Most of the researchers who applied inexact methods (e.g. evolutionary algorithms) to sequencing problems dropped the research after the testing phase, as they were not able to obtain reproducible results and encountered problems in determining the quality of the solutions obtained. Therefore, a new approach to solving the problem, presented in this paper as a sequencing system, is being developed. The sequencing system consists of a set of defined rules implemented in a computer environment. The system itself works in two stages. The first stage determines the place in the storage buffer to which certain production orders should be sent. In the second stage, precise sets of sequences are determined and evaluated for certain parts of the storage buffer under certain criteria.
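    For readers unfamiliar with the Car Sequencing Problem, the sketch below shows how a candidate sequence is typically scored in the standard textbook formulation, where each option may appear in at most p of any q consecutive cars. This is an assumption about the general CSP setting, not the rule set of the sequencing system described in the abstract; the model names, option names and limits are hypothetical.

```python
def count_violations(sequence, options, limits):
    """Count windows that break an 'at most p in any q consecutive cars' rule."""
    violations = 0
    for option, (p, q) in limits.items():
        # 1 where the car at that position requires the option, else 0.
        flags = [1 if option in options[model] else 0 for model in sequence]
        # Slide a window of q consecutive positions along the line.
        for start in range(max(0, len(flags) - q + 1)):
            if sum(flags[start:start + q]) > p:
                violations += 1
    return violations

# Hypothetical data: the sunroof station can handle 1 car in every 2.
options = {"base": set(), "lux": {"sunroof"}}
limits = {"sunroof": (1, 2)}

print(count_violations(["lux", "lux", "base", "lux"], options, limits))   # 1
print(count_violations(["lux", "base", "lux", "base"], options, limits))  # 0
```

    A violation count of this kind is one common way to compare candidate sequences when only satisfactory, rather than provably optimal, solutions are sought.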

  5. Beryllium abundances along the evolutionary sequence of the open cluster IC 4651 - A new test for hydrodynamical stellar models

    NASA Astrophysics Data System (ADS)

    Smiljanic, R.; Pasquini, L.; Charbonnel, C.; Lagarde, N.

    2010-02-01

    Context. Previous analyses of lithium abundances in main sequence and red giant stars have revealed the action of mixing mechanisms other than convection in stellar interiors. Beryllium abundances in stars with Li abundance determinations can offer valuable complementary information on the nature of these mechanisms. Aims: Our aim is to derive Be abundances along the whole evolutionary sequence of an open cluster. We focus on the well-studied open cluster IC 4651. These Be abundances are used with previously determined Li abundances, in the same sample stars, to investigate the mixing mechanisms in a range of stellar masses and evolutionary stages. Methods: Atmospheric parameters were adopted from a previous abundance analysis by the same authors. New Be abundances have been determined from high-resolution, high signal-to-noise UVES spectra using spectrum synthesis and model atmospheres. The careful synthetic modeling of the Be lines region is used to calculate reliable abundances in rapidly rotating stars. The observed behavior of Be and Li is compared to theoretical predictions from stellar models including rotation-induced mixing, internal gravity waves, atomic diffusion, and thermohaline mixing. Results: Beryllium is detected in all the main sequence and turn-off sample stars, both slow- and fast-rotating stars, including the Li-dip stars, but is not detected in the red giants. Confirming previous results, we find that the Li dip is also a Be dip, although the depletion of Be is more modest than for Li in the corresponding effective temperature range. For post-main-sequence stars, the Be dilution starts earlier within the Hertzsprung gap than expected from classical predictions, as does the Li dilution. A clear dispersion in the Be abundances is also observed. Theoretical stellar models including the hydrodynamical transport processes mentioned above are able to reproduce all the observed features well. 
    These results show a good theoretical understanding of the Li and Be behavior along the color-magnitude diagram of this intermediate-age cluster for stars more massive than 1.2 M⊙. Based on observations made with the ESO VLT, at Paranal Observatory, under programs 065.L-0427 and 067.D-0126. Current address: European Southern Observatory, Karl-Schwarzschild-Str. 2, 85748 Garching bei München, Germany.

  6. Equally parsimonious pathways through an RNA sequence space are not equally likely

    NASA Technical Reports Server (NTRS)

    Lee, Y. H.; DSouza, L. M.; Fox, G. E.

    1997-01-01

    An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
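
    The counts quoted above follow from simple combinatorics: k independent single-site changes can be applied in k! orders and pass through 2^k - 2 distinct intermediates. The Python sketch below enumerates both and filters out paths that touch an invalid intermediate. The function names are hypothetical, and the invalid set in the example is made up for illustration; it is not the set of intermediates the authors found to be invalid.

```python
from itertools import permutations


def intermediates_and_paths(k: int):
    """Enumerate intermediates and paths for k independent single-site changes.

    A state is the frozenset of change indices already applied. The two
    endpoints (no changes, all k changes) are not counted as intermediates.
    """
    paths = []
    for order in permutations(range(k)):
        applied = set()
        path = []
        for site in order[:-1]:  # the final change lands on the endpoint
            applied.add(site)
            path.append(frozenset(applied))
        paths.append(tuple(path))
    states = {state for path in paths for state in path}
    return states, paths


def valid_paths(paths, invalid):
    """Keep only the paths whose intermediates are all outside `invalid`."""
    return [p for p in paths if not any(state in invalid for state in p)]


states4, paths4 = intermediates_and_paths(4)
states3, paths3 = intermediates_and_paths(3)
print(len(states4), len(paths4))  # 14 intermediates, 24 paths
print(len(states3), len(paths3))  # 6 intermediates, 6 paths

# Hypothetical example: if the single mutant carrying only change 0 is
# invalid, every path that makes change 0 first is excluded.
print(len(valid_paths(paths3, {frozenset({0})})))  # 4 of 6 paths remain
```

    Together the two trajectories give the 20 intermediates and 30 paths mentioned in the abstract.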

  7. "I Saw the Madre": Evaluating Predictions about Codeswitched Determiner-Noun Sequences Using Spanish-English and Welsh-English Data

    ERIC Educational Resources Information Center

    Herring, Jon Russell; Deuchar, Margaret; Couto, M. Carmen Parafita; Quintanilla, Monica Moro

    2010-01-01

    Previous work on intrasentential codeswitching has noted that switches between determiners and their noun complements are frequent in both Spanish-English and Welsh-English data. Two major recent theories of codeswitching, the Matrix Language Frame model and a Minimalist Program approach, make potentially competing predictions regarding the source…

  8. Comprehensive analysis of the T-cell receptor beta chain gene in rhesus monkey by high throughput sequencing

    PubMed Central

    Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui

    2015-01-01

    Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410
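
    The repertoire statistics mentioned above (total versus unique sequences, CDR3 length distribution) reduce to counting. A minimal Python sketch follows, assuming the CDR3 regions have already been extracted as strings; the function name and the toy sequences are hypothetical, and the abstract does not say whether the authors computed length distributions over unique sequences or over all reads (unique sequences are used here).

```python
from collections import Counter
from typing import Dict, Iterable


def repertoire_summary(cdr3_reads: Iterable[str]) -> Dict[str, object]:
    """Summarise a list of CDR3 strings, one entry per sequenced read."""
    clonotypes = Counter(cdr3_reads)  # unique CDR3 -> read count
    # Length distribution over unique CDR3 sequences (an assumption).
    lengths = Counter(len(seq) for seq in clonotypes)
    return {
        "total_reads": sum(clonotypes.values()),
        "unique_cdr3": len(clonotypes),
        "length_distribution": dict(sorted(lengths.items())),
    }


# Toy input, not real rhesus monkey data.
reads = ["CASSLGQ", "CASSLGQ", "CASSPDRG", "CASRT"]
print(repertoire_summary(reads))
```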

  9. Unraveling the sequence and structure of the protein osteocalcin from a 42 ka fossil horse

    NASA Astrophysics Data System (ADS)

    Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Andrews, Philip C.; Leykam, Joseph; Stafford, Thomas W.; Kelly, Robert L.; Walker, Danny N.; Buckley, Mike; Humpula, James

    2006-04-01

    We report the first complete amino acid sequence and evidence of secondary structure for osteocalcin from a temperate fossil. The osteocalcin derives from a 42 ka equid bone excavated from Juniper Cave, Wyoming. Results were determined by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-MS) and Edman sequencing with independent confirmation of the sequence in two laboratories. The ancient sequence was compared to that of three modern taxa: horse (Equus caballus), zebra (Equus grevyi), and donkey (Equus asinus). Although there was no difference in sequence among modern taxa, MALDI-MS and Edman sequencing show that residues 48 and 49 of our modern horse are Thr, Ala rather than Pro, Val as previously reported (Carstanjen B., Wattiez, R., Armory, H., Lepage, O.M., Remy, B., 2002. Isolation and characterization of equine osteocalcin. Ann. Med. Vet. 146(1), 31-38). MALDI-MS and Edman sequencing data indicate that the osteocalcin sequence of the 42 ka fossil is similar to that of modern horse. Previously inaccessible structural attributes for ancient osteocalcin were observed. Glu 39 rather than Gln 39 is consistent with deamidation, a process known to occur during fossilization and aging. Two post-translational modifications were documented: Hyp 9 and a disulfide bridge. The latter suggests at least partial retention of secondary structure. As has been done for ancient DNA research, we recommend standards for preparation and criteria for authenticating results of ancient protein sequencing.

  10. Nucleotide sequence of a cluster of early and late genes in a conserved segment of the vaccinia virus genome.

    PubMed Central

    Plucienniczak, A; Schroeder, E; Zettlmeissl, G; Streeck, R E

    1985-01-01

    The nucleotide sequence of a 7.6 kb vaccinia DNA segment from a genomic region conserved among different orthopoxviruses has been determined. This segment contains a tight cluster of 12 partly overlapping open reading frames, most of which can be correlated with previously identified early and late proteins and mRNAs. Regulatory signals used by vaccinia virus have been studied. Presumptive promoter regions are rich in A and T and carry the consensus sequences TATA and AATAA spaced at 20-24 base pairs. Tandem repeats of a CTATTC consensus sequence are proposed to be involved in the termination of early transcription. PMID:2987815

  11. Nucleotide sequence of the ribosomal RNA gene of Physarum polycephalum: intron 2 and its flanking regions of the 26S rRNA gene.

    PubMed Central

    Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y

    1981-01-01

    The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc. Natl. Acad. Sci. USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. PMID:6171776

  12. The bark of Robinia pseudoacacia contains a complex mixture of lectins. Characterization of the proteins and the cDNA clones.

    PubMed Central

    Van Damme, E J; Barre, A; Smeets, K; Torrekens, S; Van Leuven, F; Rougé, P; Peumans, W J

    1995-01-01

    Two lectins were isolated from the inner bark of Robinia pseudoacacia (black locust). The first (and major) lectin (called RPbAI) is composed of five isolectins that originate from the association of 31.5- and 29-kD polypeptides into tetramers. In contrast, the second (minor) lectin (called RPbAII) is a homotetramer composed of 26-kD subunits. The cDNA clones encoding the polypeptides of RPbAI and RPbAII were isolated and their sequences determined. Apparently all three polypeptides are translated from mRNAs of approximately 1.2 kb. Alignment of the deduced amino acid sequences of the different clones indicates that the 31.5- and 29-kD RPbAI polypeptides show approximately 80% sequence identity and are homologous to the previously reported legume seed lectins, whereas the 26-kD RPbAII polypeptide shows only 33% sequence identity to the previously described legume lectins. Modeling the 31.5-kD subunit of RPbAI predicts that its three-dimensional structure is strongly related to the three-dimensional models that have been determined thus far for a few legume lectins. Southern blot analysis of genomic DNA isolated from Robinia has revealed that the Robinia bark lectins are the result of the expression of a small family of lectin genes. PMID:7716244

  13. A complete, multi-level conformational clustering of antibody complementarity-determining regions

    PubMed Central

    Nikoloudis, Dimitris; Pitts, Jim E.

    2014-01-01

    Classification of antibody complementarity-determining region (CDR) conformations is an important step that drives antibody modelling and engineering, prediction from sequence, directed mutagenesis and induced-fit studies, and allows inferences on sequence-to-structure relations. Most of the previous work performed conformational clustering on a reduced set of structures or after application of various structure pre-filtering criteria. In this study, it was judged that a clustering of every available CDR conformation would produce a complete and redundant repertoire, increase the number of sequence examples and allow better decisions on structure validity in the future. In order to cope with the potential increase in data noise, a first-level statistical clustering was performed using structure superposition Root-Mean-Square Deviation (RMSD) as a distance-criterion, coupled with second- and third-level clustering that employed Ramachandran regions for a deeper qualitative classification. The classification of a total of 12,712 CDR conformations is thus presented, along with rich annotation and cluster descriptions, and the results are compared to previous major studies. The present repertoire has procured an improved image of our current CDR Knowledge-Base, with a novel nesting of conformational sensitivity and specificity that can serve as a systematic framework for improved prediction from sequence as well as a number of future studies that would aid in knowledge-based antibody engineering such as humanisation. PMID:25071986

  14. Methods for determining the genetic affinity of microorganisms and viruses

    NASA Technical Reports Server (NTRS)

    Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

    2012-01-01

    Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.

  15. Ancient DNA sequence revealed by error-correcting codes.

    PubMed

    Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

    2015-07-10

    A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.

  16. Ancient DNA sequence revealed by error-correcting codes

    PubMed Central

    Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

    2015-01-01

    A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228

  17. Phylogenetic position of the pentastomida and [pan]crustacean relationships

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lavrov, Dennis V.; Brown, Wesley M.; Boore, Jeffrey L.

    2004-01-31

    Pentastomids are a small group of vermiform animals with unique morphology and parasitic lifestyle. They are generally recognized as being related to the Arthropoda, however the nature of this relationship is controversial. We have determined the complete sequence of the mitochondrial DNA (mtDNA) of the pentastomid Armillifer armillatus and complete, or nearly complete, mtDNA sequences from representatives of four previously unsampled groups of Crustacea: Remipedia (Speleonectes tulumensis), Cephalocarida (Hutchinsoniella macracantha), Cirripedia (Pollicipes polymerus), and Branchiura (Argulus americanus). Analyses of the mtDNA gene arrangements and sequences determined in this study indicate unambiguously that pentastomids are a group of modified crustaceans likely related to branchiurans. In addition, gene arrangement comparisons strongly support an unforeseen assemblage of pentastomids with maxillopod and cephalocarid crustaceans, to the exclusion of remipedes, branchiopods, malacostracans and insects.

  18. Isolating Viral and Host RNA Sequences from Archival Material and Production of cDNA Libraries for High-Throughput DNA Sequencing

    PubMed Central

    Xiao, Yongli; Sheng, Zong-Mei; Taubenberger, Jeffery K.

    2015-01-01

    The vast majority of surgical biopsy and post-mortem tissue samples are formalin-fixed and paraffin-embedded (FFPE), but this process leads to RNA degradation that limits gene expression analysis. As an example, the viral RNA genome of the 1918 pandemic influenza A virus was previously determined in a 9-year effort by overlapping RT-PCR from post-mortem samples. Using the protocols described here, the full genome of the 1918 virus at high coverage was determined in one high-throughput sequencing run of a cDNA library derived from total RNA of a 1918 FFPE sample after duplex-specific nuclease treatments. This basic methodological approach should assist in the analysis of FFPE tissue samples isolated over the past century from a variety of infectious diseases. PMID:26344216

  19. Monitoring Error Rates In Illumina Sequencing.

    PubMed

    Manley, Leigh J; Ma, Duanduan; Levine, Stuart S

    2016-12-01

    Guaranteeing high-quality next-generation sequencing data in a rapidly changing environment is an ongoing challenge. The introduction of the Illumina NextSeq 500 and the deprecation of specific metrics from Illumina's Sequencing Analysis Viewer (SAV; Illumina, San Diego, CA, USA) have made it more difficult to determine directly the baseline error rate of sequencing runs. To improve our ability to measure base quality, we have created an open-source tool to construct the Percent Perfect Reads (PPR) plot, previously provided by the Illumina sequencers. The PPR program is compatible with HiSeq 2000/2500, MiSeq, and NextSeq 500 instruments and provides an alternative to Illumina's quality value (Q) scores for determining run quality. Whereas Q scores are representative of run quality, they are often overestimated and are sourced from different look-up tables for each platform. The PPR's unique capabilities as a cross-instrument comparison device, as a troubleshooting tool, and as a tool for monitoring instrument performance can provide an increase in clarity over SAV metrics that is often crucial for maintaining instrument health. These capabilities are highlighted.
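
    To make the idea of a Percent Perfect Reads curve concrete, here is a minimal Python sketch. It assumes that the value at cycle c is the percentage of reads with no mismatch to their aligned reference in cycles 1 through c; the abstract does not define the metric, so this definition, the function name, and the toy reads are all assumptions, and the sketch is not the authors' tool.

```python
from typing import List, Sequence


def percent_perfect_reads(reads: Sequence[str],
                          references: Sequence[str]) -> List[float]:
    """Per-cycle percentage of reads that are still error-free.

    reads[i] is compared base by base with references[i], which is assumed
    to be the already-aligned reference segment of at least the same length.
    """
    if not reads:
        return []
    n_cycles = min(len(r) for r in reads)
    perfect = [0] * n_cycles
    for read, ref in zip(reads, references):
        for cycle in range(n_cycles):
            if read[cycle] != ref[cycle]:
                break  # first error: the read is imperfect from here on
            perfect[cycle] += 1
    return [100.0 * count / len(reads) for count in perfect]


# Toy data: four 4-base reads against the same reference segment.
reads = ["ACGT", "ACGA", "TCGT", "ACGT"]
refs = ["ACGT"] * 4
print(percent_perfect_reads(reads, refs))  # [75.0, 75.0, 75.0, 50.0]
```

    Because the curve depends only on mismatches against a reference, it can be compared across instruments in a way that platform-specific Q-score tables cannot.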

  20. Structural and sequence features of two residue turns in beta-hairpins.

    PubMed

    Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu

    2014-09-01

    Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.

  1. Use of conserved key amino acid positions to morph protein folds.

    PubMed

    Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E

    2002-07-15

    By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.

  2. The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

    PubMed

    Lin, A; McNally, J; Wool, I G

    1983-09-10

    The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.

  3. A two-step recognition of signal sequences determines the translocation efficiency of proteins.

    PubMed Central

    Belin, D; Bost, S; Vassalli, J D; Strub, K

    1996-01-01

    The cytosolic and secreted, N-glycosylated, forms of plasminogen activator inhibitor-2 (PAI-2) are generated by facultative translocation. To study the molecular events that result in the bi-topological distribution of proteins, we determined in vitro the capacities of several signal sequences to bind the signal recognition particle (SRP) during targeting, and to promote vectorial transport of murine PAI-2 (mPAI-2). Interestingly, the six signal sequences we compared (mPAI-2 and three mutated derivatives thereof, ovalbumin and preprolactin) were found to have differential activities in the two events. For example, the mPAI-2 signal sequence first binds SRP with moderate efficiency and secondly promotes the vectorial transport of only a fraction of the SRP-bound nascent chains. Our results provide evidence that the translocation efficiency of proteins can be controlled by the recognition of their signal sequences at two steps: during SRP-mediated targeting and during formation of a committed translocation complex. This second recognition may occur at several time points during the insertion/translocation step. In conclusion, signal sequences have a more complex structure than previously anticipated, allowing for multiple and independent interactions with the translocation machinery. PMID:8599930

  4. A two-step recognition of signal sequences determines the translocation efficiency of proteins.

    PubMed

    Belin, D; Bost, S; Vassalli, J D; Strub, K

    1996-02-01

    The cytosolic and secreted, N-glycosylated, forms of plasminogen activator inhibitor-2 (PAI-2) are generated by facultative translocation. To study the molecular events that result in the bi-topological distribution of proteins, we determined in vitro the capacities of several signal sequences to bind the signal recognition particle (SRP) during targeting, and to promote vectorial transport of murine PAI-2 (mPAI-2). Interestingly, the six signal sequences we compared (mPAI-2 and three mutated derivatives thereof, ovalbumin and preprolactin) were found to have differential activities in the two events. For example, the mPAI-2 signal sequence first binds SRP with moderate efficiency and secondly promotes the vectorial transport of only a fraction of the SRP-bound nascent chains. Our results provide evidence that the translocation efficiency of proteins can be controlled by the recognition of their signal sequences at two steps: during SRP-mediated targeting and during formation of a committed translocation complex. This second recognition may occur at several time points during the insertion/translocation step. In conclusion, signal sequences have a more complex structure than previously anticipated, allowing for multiple and independent interactions with the translocation machinery.

  5. Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.

    PubMed

    Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro

    2010-05-07

    Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.

  6. Genomic Characterization of the Genus Nairovirus (Family Bunyaviridae).

    PubMed

    Kuhn, Jens H; Wiley, Michael R; Rodriguez, Sergio E; Bào, Yīmíng; Prieto, Karla; Travassos da Rosa, Amelia P A; Guzman, Hilda; Savji, Nazir; Ladner, Jason T; Tesh, Robert B; Wada, Jiro; Jahrling, Peter B; Bente, Dennis A; Palacios, Gustavo

    2016-06-10

    Nairovirus, one of five bunyaviral genera, includes seven species. Genomic sequence information is limited for members of the Dera Ghazi Khan, Hughes, Qalyub, Sakhalin, and Thiafora nairovirus species. We used next-generation sequencing and historical virus-culture samples to determine 14 complete and nine coding-complete nairoviral genome sequences to further characterize these species. Previously unsequenced viruses include Abu Mina, Clo Mor, Great Saltee, Hughes, Raza, Sakhalin, Soldado, and Tillamook viruses. In addition, we present genomic sequence information on additional isolates of previously sequenced Avalon, Dugbe, Sapphire II, and Zirqa viruses. Finally, we identify Tunis virus, previously thought to be a phlebovirus, as an isolate of Abu Hammad virus. Phylogenetic analyses indicate the need for reassignment of Sapphire II virus to Dera Ghazi Khan nairovirus and reassignment of Hazara, Tofla, and Nairobi sheep disease viruses to novel species. We also propose new species for the Kasokero group (Kasokero, Leopards Hill, Yogue viruses), the Ketarah group (Gossas, Issyk-kul, Keterah/soft tick viruses) and the Burana group (Wēnzhōu tick virus, Huángpí tick virus 1, Tǎchéng tick virus 1). Our analyses emphasize the sister relationship of nairoviruses and arenaviruses, and indicate that several nairo-like viruses (Shāyáng spider virus 1, Xīnzhōu spider virus, Sānxiá water strider virus 1, South Bay virus, Wǔhàn millipede virus 2) require establishment of novel genera in a larger nairovirus-arenavirus supergroup.

  7. Resolving the Origin of Rabbit Hemorrhagic Disease Virus: Insights from an Investigation of the Viral Stocks Released in Australia

    PubMed Central

    Eden, John-Sebastian; Read, Andrew J.; Duckworth, Janine A.; Strive, Tanja

    2015-01-01

    To resolve the evolutionary history of rabbit hemorrhagic disease virus (RHDV), we performed a genomic analysis of the viral stocks imported and released as a biocontrol measure in Australia, as well as a global phylogenetic analysis. Importantly, conflicts were identified between the sequences determined here and those previously published that may have affected evolutionary rate estimates. By removing likely erroneous sequences, we show that RHDV emerged only shortly before its initial description in China. PMID:26378178

  8. Use of 16S rRNA Sequencing for Identification of Actinobacillus ureae Isolated from a Cerebrospinal Fluid Sample

    PubMed Central

    Whitelaw, A. C.; Shankland, I. M.; Elisha, B. G.

    2002-01-01

    Actinobacillus ureae, previously Pasteurella ureae, has on rare occasions been described as a cause of human infection. Owing to its rarity, it may not be easily identified in clinical microbiology laboratories by standard tests. This report describes a patient with acute bacterial meningitis due to A. ureae. The identity of the isolate was determined by means of DNA sequence analysis of a portion of the 16S rRNA gene. PMID:11825992

  9. Analysis of MHC class I genes across horse MHC haplotypes

    PubMed Central

    Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

    2010-01-01

    The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063

  10. The complete genome sequence of a virus associated with cotton blue disease, cotton leafroll dwarf virus, confirms that it is a new member of the genus Polerovirus.

    PubMed

    Distéfano, Ana J; Bonacic Kresic, Ivan; Hopp, H Esteban

    2010-11-01

    Cotton blue disease is the most important virus disease of cotton in the southern part of America. The complete nucleotide sequence of the ssRNA genome of the cotton blue disease-associated virus was determined for the first time. It comprised 5,866 nucleotides, and the deduced genomic organization resembled that of members of the genus Polerovirus. Sequence homology comparison and phylogenetic analysis confirm that this virus (previously proposed name: cotton leafroll dwarf virus) is a member of a new species within the genus Polerovirus.

  11. The Sex Determination Gene Shows No Founder Effect in the Giant Honey Bee, Apis dorsata

    PubMed Central

    Yan, Wei Yu; Wu, Xiao Bo; Zeng, Zhi Jiang; Huang, Zachary Y.

    2012-01-01

    Background: All honey bee species (Apis spp) share the same sex determination mechanism using the complementary sex determination (csd) gene. Only individuals heterozygous at the csd locus develop into females, and the homozygous develop into diploid males, which do not survive. The honeybees are therefore under selection pressure to generate new csd alleles. Previous studies have shown that the csd gene is under balancing selection. We hypothesize that, due to the long separation of Hainan Island, China, from the mainland, the giant honey bees (Apis dorsata) should show a founder effect for the csd gene, with many different alleles clustered together, and these would be absent on the mainland. Methodology/Principal Findings: We sampled A. dorsata workers from both Hainan and Guangxi Provinces and then cloned and sequenced region 3 of the csd gene and constructed phylogenetic trees. We failed to find any clustering of the csd alleles according to their geographical origin, i.e. the Hainan and Guangxi samples did not form separate clades. Further analysis including previously published csd sequences also failed to show any clade-forming in either the Philippines or Malaysia. Conclusions/Significance: Results from this study and those from previous studies did not support the expectations of a founder effect. We conclude that because of the extremely high mating frequency of A. dorsata queens, a founder effect does not apply in this species. PMID:22511940

  12. The sex determination gene shows no founder effect in the giant honey bee, Apis dorsata.

    PubMed

    Liu, Zhi Yong; Wang, Zi Long; Yan, Wei Yu; Wu, Xiao Bo; Zeng, Zhi Jiang; Huang, Zachary Y

    2012-01-01

    All honey bee species (Apis spp) share the same sex determination mechanism using the complementary sex determination (csd) gene. Only individuals heterozygous at the csd locus develop into females, and the homozygous develop into diploid males, which do not survive. The honeybees are therefore under selection pressure to generate new csd alleles. Previous studies have shown that the csd gene is under balancing selection. We hypothesize that, due to the long separation of Hainan Island, China, from the mainland, the giant honey bees (Apis dorsata) should show a founder effect for the csd gene, with many different alleles clustered together, and these would be absent on the mainland. We sampled A. dorsata workers from both Hainan and Guangxi Provinces and then cloned and sequenced region 3 of the csd gene and constructed phylogenetic trees. We failed to find any clustering of the csd alleles according to their geographical origin, i.e. the Hainan and Guangxi samples did not form separate clades. Further analysis including previously published csd sequences also failed to show any clade-forming in either the Philippines or Malaysia. Results from this study and those from previous studies did not support the expectations of a founder effect. We conclude that because of the extremely high mating frequency of A. dorsata queens, a founder effect does not apply in this species.

  13. Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

    PubMed

    Kimura, M; Kimura, J; Hatakeyama, T

    1988-11-21

    The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
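
    The percent-identity figures quoted above come from pairwise comparison of aligned sequences. As a rough illustration only, the sketch below counts identical residues over the ungapped columns of two pre-aligned sequences. The function name, the gap convention, the choice of denominator, and the toy fragments are all assumptions made for this example; the abstract does not say how its percentages were normalized.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identical residues over columns where neither sequence has a gap.

    Assumes the two sequences are already aligned (equal length). Using only
    ungapped columns as the denominator is one common convention, not
    necessarily the one used in the study above.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns that contain a gap in either sequence
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Toy, made-up aligned fragments (NOT real S11 or S19 sequences).
seq_a = "MKRT-LAVGG"
seq_b = "MKKTALAV-G"
identity = percent_identity(seq_a, seq_b)
print(f"{identity:.1f}% identical over ungapped columns")
```

    Different denominators (alignment length, shorter sequence length) give different percentages for the same alignment, so figures from different studies are comparable only when the convention is known.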

  14. The nucleotide sequence of Beneckea harveyi 5S rRNA. [bioluminescent marine bacterium]

    NASA Technical Reports Server (NTRS)

    Luehrsen, K. R.; Fox, G. E.

    1981-01-01

    The primary sequence of the 5S ribosomal RNA isolated from the free-living bioluminescent marine bacterium Beneckea harveyi is reported and discussed in regard to indications of phylogenetic relationships with the bacteria Escherichia coli and Photobacterium phosphoreum. Sequences were determined for oligonucleotide products generated by digestion with ribonuclease T1, pancreatic ribonuclease and ribonuclease T2. The presence of heterogeneity is indicated for two sites. The B. harveyi sequence can be arranged into the same four-helix secondary structures as E. coli and other prokaryotic 5S rRNAs. Examination of the 5S rRNA sequences of the three bacteria indicates that B. harveyi and P. phosphoreum are specifically related and share a common ancestor which diverged from an ancestor of E. coli at a somewhat earlier time, consistent with previous studies.

  15. An approach for Ewing test selection to support the clinical assessment of cardiac autonomic neuropathy.

    PubMed

    Stranieri, Andrew; Abawajy, Jemal; Kelarev, Andrei; Huda, Shamsul; Chowdhury, Morshed; Jelinek, Herbert F

    2013-07-01

    This article addresses the problem of determining optimal sequences of tests for the clinical assessment of cardiac autonomic neuropathy (CAN). We investigate the accuracy of using only one of the recommended Ewing tests to classify CAN and the additional accuracy obtained by adding the remaining tests of the Ewing battery. This is important as not all five Ewing tests can always be applied in each situation in practice. We used a new and unique database of the diabetes screening research initiative project, which is more than ten times larger than the data set used by Ewing in his original investigation of CAN. We utilized decision trees and the optimal decision path finder (ODPF) procedure for identifying optimal sequences of tests. We present experimental results on the accuracy of using each one of the recommended Ewing tests to classify CAN and the additional accuracy that can be achieved by adding the remaining tests of the Ewing battery. We found the best sequences of tests for a cost-function equal to the number of tests. The accuracies achieved by the initial segments of the optimal sequences for 2, 3 and 4 categories of CAN are, respectively, 80.80, 91.33, 93.97 and 94.14; 79.86, 89.29, 91.16 and 91.76; and 78.90, 86.21, 88.15 and 88.93. They show significant improvement compared to the sequence considered previously in the literature and the mathematical expectations of the accuracies of a random sequence of tests. The complete outcomes obtained for all subsets of the Ewing features are required for determining optimal sequences of tests for any cost-function with the use of the ODPF procedure. We have also found the two most significant additional features that can increase the accuracy when some of the Ewing attributes cannot be obtained. The outcomes obtained can be used to determine the optimal sequences of tests for each individual cost-function by following the ODPF procedure. The results show that the best single Ewing test for diagnosing CAN is the deep breathing heart rate variation test. Optimal sequences found for the cost-function equal to the number of tests guarantee that the best accuracy is achieved after any number of tests and provide an improvement in comparison with the previous ordering of tests or a random sequence. Copyright © 2013 Elsevier B.V. All rights reserved.

  16. The primary structure of aspartate aminotransferase from pig heart muscle. Partial sequences determined by digestion with thermolysin and elastase

    PubMed Central

    Bossa, Francesco; Barra, Donatella; Carloni, Massimo; Fasella, Paolo; Riva, Francesca; Doonan, Shawn; Doonan, Hilary J.; Hanford, Robin; Vernon, Charles A.; Walker, John M.

    1973-01-01

    Peptides produced by thermolytic digestion of aminoethylated aspartate aminotransferase and of the oxidized enzyme were isolated and their amino acid sequences determined. Digestion by elastase of the carboxymethylated enzyme gave peptides representing approximately 40% of the primary structure. Fragments from these digests overlapped with previously reported sequences of peptides obtained by peptic and tryptic digestion (Doonan et al., 1972), giving ten composite peptides containing 395 amino acid residues. The amino acid composition of these composite peptides agrees well with that of the intact enzyme. Confirmatory results for some of the present data have been deposited as Supplementary Publication 50018 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973) 131, 5. PMID:4748834

  17. Structural determinants of nuclear export signal orientation in binding to exportin CRM1

    DOE PAGES

    Fung, Ho Yee Joyce; Fu, Szu-Chin; Brautigam, Chad A.; ...

    2015-09-08

    The Chromosome Region of Maintenance 1 (CRM1) protein mediates nuclear export of hundreds of proteins through recognition of their nuclear export signals (NESs), which are highly variable in sequence and structure. The plasticity of the CRM1-NES interaction is not well understood, as there are many NES sequences that seem incompatible with structures of the NES-bound CRM1 groove. Crystal structures of CRM1 bound to two different NESs with unusual sequences showed the NES peptides binding the CRM1 groove in the opposite orientation (minus) to that of previously studied NESs (plus). A comparison of minus and plus NESs identified structural and sequence determinants for NES orientation. The binding of NESs to CRM1 in both orientations results in a large expansion in NES consensus patterns and therefore a corresponding expansion of potential NESs in the proteome.

  18. Purification and sequence analysis of 4-methyl-5-nitrocatechol oxygenase from Burkholderia sp. strain DNT.

    PubMed Central

    Haigler, B E; Suen, W C; Spain, J C

    1996-01-01

    4-Methyl-5-nitrocatechol (MNC) is an intermediate in the degradation of 2,4-dinitrotoluene by Burkholderia sp. strain DNT. In the presence of NADPH and oxygen, MNC monooxygenase catalyzes the removal of the nitro group from MNC to form 2-hydroxy-5-methylquinone. The gene (dntB) encoding MNC monooxygenase has been previously cloned and characterized. In order to examine the properties of MNC monooxygenase and to compare it with other enzymes, we sequenced the gene encoding the MNC monooxygenase and purified the enzyme from strain DNT. dntB was localized within a 2.2-kb ApaI DNA fragment. Sequence analysis of this fragment revealed an open reading frame of 1,644 bp with an N-terminal amino acid sequence identical to that of purified MNC monooxygenase from strain DNT. Comparison of the derived amino acid sequences with those of other genes showed that DntB contains the highly conserved ADP and flavin adenine dinucleotide (FAD) binding motifs characteristic of flavoprotein hydroxylases. MNC monooxygenase was purified to homogeneity from strain DNT by anion exchange and gel filtration chromatography. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a single protein with a molecular weight of 60,200, which is consistent with the size determined from the gene sequence. The native molecular weight determined by gel filtration was 65,000, which indicates that the native enzyme is a monomer. It used either NADH or NADPH as electron donors, and NADPH was the preferred cofactor. The purified enzyme contained 1 mol of FAD per mol of protein, which is also consistent with the detection of an FAD binding motif in the amino acid sequence of DntB. MNC monooxygenase has a narrow substrate specificity. MNC and 4-nitrocatechol are good substrates whereas 3-methyl-4-nitrophenol, 3-methyl-4-nitrocatechol, 4-nitrophenol, 3-nitrophenol, and 4-chlorocatechol were not. 
    These studies suggest that MNC monooxygenase is a flavoprotein that shares some properties with previously studied nitrophenol oxygenases. PMID:8830701
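
    The abstract's statement that the 60,200 gel-derived molecular weight is consistent with the gene sequence can be checked with back-of-envelope arithmetic. The sketch below is only an illustration: the average residue mass of about 110 Da is a common rule of thumb rather than a value from the paper, and whether the reported 1,644-bp open reading frame includes the stop codon is an assumption. The function names are hypothetical.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average residue mass (assumption)


def residues_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of encoded amino acids for an ORF of the given length in bp."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # Assumption: the quoted ORF length counts the stop codon.
    return codons - 1 if includes_stop else codons


def estimated_mass_da(n_residues: int) -> float:
    """Crude mass estimate from residue count; ignores actual composition."""
    return n_residues * AVG_RESIDUE_MASS_DA


n_residues = residues_from_orf(1644)        # ORF length from the abstract
approx_mass = estimated_mass_da(n_residues)

# Ratio of native (gel filtration) to subunit (SDS-PAGE) molecular weight;
# a value close to 1 is what one expects for a monomer.
native_to_subunit = 65_000 / 60_200

print(n_residues, approx_mass, round(native_to_subunit, 2))
```

    Under these assumptions the estimate lands close to the SDS-PAGE value, and a native-to-subunit ratio near 1 matches the abstract's conclusion that the native enzyme is a monomer. An exact mass would require the actual amino acid composition.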

  19. Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population

    PubMed Central

    Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C. Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B.; Nauck, Markus; Kaminski, Wolfgang E.

    2017-01-01

    The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its “a” determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the “a” determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of “a” determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. 
    Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated. PMID:28472040

  20. Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population.

    PubMed

    Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-Suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B; Nauck, Markus; Kaminski, Wolfgang E

    2017-01-01

    The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its "a" determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the "a" determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of "a" determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. 
    Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated.

  1. Next-Generation Sequencing Reveals Significant Bacterial Diversity of Botrytized Wine

    PubMed Central

    Bokulich, Nicholas A.; Joseph, C. M. Lucy; Allen, Greg; Benson, Andrew K.; Mills, David A.

    2012-01-01

    While wine fermentation has long been known to involve complex microbial communities, the composition and role of bacteria other than a select set of lactic acid bacteria (LAB) has often been assumed either negligible or detrimental. This study served as a pilot study for using barcoded amplicon next-generation sequencing to profile bacterial community structure in wines and grape musts, comparing the taxonomic depth achieved by sequencing two different domains of prokaryotic 16S rDNA (V4 and V5). This study was designed to serve two goals: 1) to empirically determine the most taxonomically informative 16S rDNA target region for barcoded amplicon sequencing of wine, comparing V4 and V5 domains of bacterial 16S rDNA to terminal restriction fragment length polymorphism (TRFLP) of LAB communities; and 2) to explore the bacterial communities of wine fermentation to better understand the biodiversity of wine at a depth previously unattainable using other techniques. Analysis of amplicons from the V4 and V5 provided similar views of the bacterial communities of botrytized wine fermentations, revealing a broad diversity of low-abundance taxa not traditionally associated with wine, as well as atypical LAB communities initially detected by TRFLP. The V4 domain was determined as the more suitable read for wine ecology studies, as it provided greater taxonomic depth for profiling LAB communities. In addition, targeted enrichment was used to isolate two species of Alphaproteobacteria from a finished fermentation. Significant differences in diversity between inoculated and uninoculated samples suggest that Saccharomyces inoculation exerts selective pressure on bacterial diversity in these fermentations, most notably suppressing abundance of acetic acid bacteria. 
    These results determine the bacterial diversity of botrytized wines to be far higher than previously realized, providing further insight into the fermentation dynamics of these wines, and demonstrate the utility of next-generation sequencing for wine ecology studies. PMID:22563494

  2. Axial and Torsional Load-Type Sequencing in Cumulative Fatigue: Low Amplitude Followed by High Amplitude Loading

    NASA Technical Reports Server (NTRS)

    Bonacuse, Peter J.; Kalluri, Sreeramesh

    2001-01-01

    The experiments described herein were performed to determine whether damage imposed by axial loading interacts with damage imposed by torsional loading. This paper is a follow-on to a study that investigated the effects of load-type sequencing on the cumulative fatigue behavior of a cobalt-base superalloy, Haynes 188, at 538 °C. Both the current and the previous study were used to test the applicability of cumulative fatigue damage models to conditions where damage is imposed by different loading modes. In the previous study, axial and torsional two-load-level cumulative fatigue experiments were conducted, in varied combinations, with the low-cycle fatigue (high amplitude loading) applied first. In the present study, the high-cycle fatigue (low amplitude loading) is applied first. As in the previous study, four sequences (axial/axial, torsion/torsion, axial/torsion, and torsion/axial) of two-load-level cumulative fatigue experiments were performed. The amount of fatigue damage contributed by each of the imposed loads was estimated by both the Palmgren-Miner linear damage rule (LDR) and the non-linear damage curve approach (DCA). Life predictions for the various cumulative loading combinations are compared with experimental results.
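
    The two damage models named above can be illustrated for a two-load-level test. The sketch below is a minimal example under stated assumptions: the LDR predicts failure when the life fractions sum to 1, and the DCA is written in its commonly cited Manson-Halford two-level form with an exponent of 0.4, which may differ from the exact formulation used in the report. The function names and all cycle counts are hypothetical and are not taken from the Haynes 188 experiments.

```python
def ldr_remaining_fraction(first_fraction: float) -> float:
    """Palmgren-Miner linear damage rule: n1/N1 + n2/N2 = 1 at failure."""
    return 1.0 - first_fraction


def dca_remaining_fraction(first_fraction: float,
                           life_first: float,
                           life_second: float,
                           exponent: float = 0.4) -> float:
    """Two-level damage curve approach (commonly cited Manson-Halford form):

        n2/N2 = 1 - (n1/N1) ** ((N1/N2) ** exponent)

    The 0.4 exponent is the usual textbook value, assumed here.
    """
    return 1.0 - first_fraction ** ((life_first / life_second) ** exponent)


# Hypothetical low-amplitude-then-high-amplitude sequence:
N1 = 100_000.0   # cycles to failure at the first (low amplitude) level
N2 = 1_000.0     # cycles to failure at the second (high amplitude) level
used = 0.5       # life fraction n1/N1 consumed at the first level

ldr = ldr_remaining_fraction(used)
dca = dca_remaining_fraction(used, N1, N2)
print(f"LDR remaining life fraction: {ldr:.3f}")
print(f"DCA remaining life fraction: {dca:.3f}")
```

    For a low-amplitude-first sequence such as this one, the DCA in this form predicts a larger remaining life fraction at the second level than the LDR does, so the sum of life fractions exceeds 1. Comparing both predictions with measured lives is the kind of test the abstract describes.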

  3. The genome sequence of 'Mycobacterium massiliense' strain CIP 108297 suggests the independent taxonomic status of the Mycobacterium abscessus complex at the subspecies level.

    PubMed

    Cho, Yong-Joon; Yi, Hana; Chun, Jongsik; Cho, Sang-Nae; Daley, Charles L; Koh, Won-Jung; Shin, Sung Jae

    2013-01-01

    Members of the Mycobacterium abscessus complex are rapidly growing mycobacteria that are emerging as human pathogens. The M. abscessus complex was previously composed of three species, namely M. abscessus sensu stricto, 'M. massiliense', and 'M. bolletii'. In 2011, 'M. massiliense' and 'M. bolletii' were united and reclassified as a single subspecies within M. abscessus: M. abscessus subsp. bolletii. However, the placement of 'M. massiliense' within the boundary of M. abscessus subsp. bolletii remains highly controversial with regard to clinical aspects. In this study, we revisited the taxonomic status of members of the M. abscessus complex based on comparative analysis of the whole-genome sequences of 53 strains. The genome sequence of the previous type strain of 'Mycobacterium massiliense' (CIP 108297) was determined using next-generation sequencing. The genome tree based on average nucleotide identity (ANI) values supported the differentiation of 'M. bolletii' and 'M. massiliense' at the subspecies level. The genome tree also clearly illustrated that 'M. bolletii' and 'M. massiliense' form a distinct phylogenetic clade within the radiation of the M. abscessus complex. The genomic distances observed in this study suggest that the current M. abscessus subsp. bolletii taxon should be divided into two subspecies, M. abscessus subsp. massiliense subsp. nov. and M. abscessus subsp. bolletii, to correspondingly accommodate the previously known 'M. massiliense' and 'M. bolletii' strains.

  4. Structural characterization of the thermally tolerant pectin methylesterase purified from citrus sinensis fruit and its gene sequence.

    PubMed

    Savary, Brett J; Vasu, Prasanna; Cameron, Randall G; McCollum, T Gregory; Nuñez, Alberto

    2013-12-26

    Despite the longstanding importance of the thermally tolerant pectin methylesterase (TT-PME) activity in citrus juice processing and product quality, the unequivocal identification of the protein and its corresponding gene has remained elusive. TT-PME was purified from sweet orange [ Citrus sinensis (L.) Osbeck] finisher pulp (8.0 mg/1.3 kg tissue) with an improved purification scheme that provided 20-fold increased enzyme yield over previous results. Structural characterization of electrophoretically pure TT-PME by MALDI-TOF MS determined molecular masses of approximately 47900 and 53000 Da for two principal glycoisoforms. De novo sequences generated from tryptic peptides by MALDI-TOF/TOF MS matched multiple anonymous Citrus EST cDNA accessions. The complete tt-pme cDNA (1710 base pair) was cloned from a fruit mRNA library using RT- and RLM-RACE PCR. Citrus TT-PME is a novel isoform that showed higher sequence identity with the multiply glycosylated kiwifruit PME than to previously described Citrus thermally labile PME isoforms.

  5. Complete Sequence of the mitochondrial genome of the tapeworm Hymenolepis diminuta: Gene arrangements indicate that platyhelminths are eutrochozoans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    von Nickisch-Rosenegk, Markus; Brown, Wesley M.; Boore, Jeffrey L.

    2001-01-01

    Using "long-PCR" we have amplified in overlapping fragments the complete mitochondrial genome of the tapeworm Hymenolepis diminuta (Platyhelminthes: Cestoda) and determined its 13,900 nucleotide sequence. The gene content is the same as that typically found for animal mitochondrial DNA (mtDNA) except that atp8 appears to be lacking, a condition found previously for several other animals. Despite the small size of this mtDNA, there are two large non-coding regions, one of which contains 13 repeats of a 31 nucleotide sequence and a potential stem-loop structure of 25 base pairs with an 11-member loop. Large potential secondary structures are identified also for the non-coding regions of two other cestode mtDNAs. Comparison of the mitochondrial gene arrangement of H. diminuta with those previously published supports a phylogenetic position of flatworms as members of the Eutrochozoa, rather than being basal to either a clade of protostomes or a clade of coelomates.

  6. Molecular diagnosis of putative Stargardt disease probands by exome sequencing

    PubMed Central

    2012-01-01

    Background: The commonest genetic form of juvenile or early adult onset macular degeneration is Stargardt Disease (STGD) caused by recessive mutations in the gene ABCA4. However, high phenotypic and allelic heterogeneity and a small but non-trivial amount of locus heterogeneity currently impede conclusive molecular diagnosis in a significant proportion of cases. Methods: We performed whole exome sequencing (WES) of nine putative Stargardt Disease probands and searched for potentially disease-causing genetic variants in previously identified retinal or macular dystrophy genes. Follow-up dideoxy sequencing was performed for confirmation and to screen for mutations in an additional set of affected individuals lacking a definitive molecular diagnosis. Results: Whole exome sequencing revealed seven likely disease-causing variants across four genes, providing a confident genetic diagnosis in six previously uncharacterized participants. We identified four previously missed mutations in ABCA4 across three individuals. Likely disease-causing mutations in RDS/PRPH2, ELOVL, and CRB1 were also identified. Conclusions: Our findings highlight the enormous potential of whole exome sequencing in Stargardt Disease molecular diagnosis and research. WES adequately assayed all coding sequences and canonical splice sites of ABCA4 in this study. Additionally, WES enables the identification of disease-related alleles in other genes. This work highlights the importance of collecting parental genetic material for WES testing as the current knowledge of human genome variation limits the determination of causality between identified variants and disease. While larger sample sizes are required to establish the precision and accuracy of this type of testing, this study supports WES for inherited early onset macular degeneration disorders as an alternative to standard mutation screening techniques. PMID:22863181

  7. Effector diversification within compartments of the Leptosphaeria maculans genome affected by repeat induced point mutations

    USDA-ARS's Scientific Manuscript database

    The genome sequence of the phytopathogenic fungus Leptosphaeria maculans has been determined. It has a unique bipartite structure, divided between distinct GC-equilibrated and AT-rich regions (isochores), reminiscent of some plants and animals but not previously observed in fungi. The GC-equilibrate...

  8. Discriminating power of microsatellites in cranberry organelles for taxonomic studies in Vaccinium and Ericaceae

    USDA-ARS's Scientific Manuscript database

    Simple sequence repeats (SSRs) in chloroplast and mitochondrial DNA, which have not been previously developed or explored in the Ericaceae family or Vaccinium genus, can be powerful tools for determining evolutionary relationships between taxa. In this study, 30 chloroplast and 23 mitochondria, and ...

  9. The nature of the embedded population in the Rho Ophiuchi dark cloud - Mid-infrared observations

    NASA Technical Reports Server (NTRS)

    Lada, C. J.; Wilking, B. A.

    1984-01-01

    In combination with previous IR and optical data, the present 10-20 micron observations of previously identified members of the embedded population of the Rho Ophiuchi dark cloud allow determinations to be made of the broadband energy distributions for 32 of the 44 sources. The majority of the sources are found to emit the bulk of their luminosity in the 1-20 micron range, and to be surrounded by dust shells. Because they are, in light of these characteristics, probably pre-main-sequence in nature, relatively accurate bolometric luminosities for these objects can be obtained through integration of their energy distributions. It is found that 44 percent of the sources are less luminous than the sun, and are among the lowest luminosity pre-main-sequence/protostellar objects observed to date.

  10. The span of correlations in dolphin whistle sequences

    NASA Astrophysics Data System (ADS)

    Ferrer-i-Cancho, Ramon; McCowan, Brenda

    2012-06-01

    Long-range correlations are found in symbolic sequences from human language, music and DNA. Determining the span of correlations in dolphin whistle sequences is crucial for shedding light on their communicative complexity. Dolphin whistles share various statistical properties with human words, i.e. Zipf's law for word frequencies (namely that the probability of the ith most frequent word of a text is about i^(-α)) and a parallel of the tendency of more frequent words to have more meanings. The finding of Zipf's law for word frequencies in dolphin whistles has been the topic of an intense debate on its implications. One of the major arguments against the relevance of Zipf's law in dolphin whistles is that it is not possible to distinguish the outcome of a die-rolling experiment from that of a linguistic or communicative source producing Zipf's law for word frequencies. Here we show that statistically significant whistle-whistle correlations extend back to the second previous whistle in the sequence, using a global randomization test, and to the fourth previous whistle, using a local randomization test. None of these correlations are expected by a die-rolling experiment and other simple explanations of Zipf's law for word frequencies, such as Simon's model, that produce sequences of unpredictable elements.
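
    The rank-frequency form of Zipf's law quoted above can be illustrated with a minimal Python sketch. The function name, the number of whistle types and the value of the exponent α are assumptions chosen for illustration; none of them come from the study.

```python
def zipf_probabilities(n_types, alpha=1.0):
    """Return normalized probabilities p(i) proportional to i^(-alpha).

    n_types and alpha are illustrative parameters, not values from the study.
    """
    weights = [rank ** (-alpha) for rank in range(1, n_types + 1)]
    total = sum(weights)
    return [w / total for w in weights]


# With alpha = 1, the most frequent type is twice as likely as the second.
probs = zipf_probabilities(5, alpha=1.0)
ratio_first_to_second = probs[0] / probs[1]
```

    Normalizing by the sum of the weights turns the proportionality "about i^(-α)" into a proper probability distribution over a finite repertoire.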

  11. Human Splice-Site Prediction with Deep Neural Networks.

    PubMed

    Naito, Tatsuhiko

    2018-04-18

    Accurate splice-site prediction is essential to delineate gene structures from sequence data. Several computational techniques have been applied to create a system to predict canonical splice sites. For classification tasks, deep neural networks (DNNs) have achieved record-breaking results and often outperformed other supervised learning techniques. In this study, a new method of splice-site prediction using DNNs was proposed. The proposed system receives an input sequence and returns an answer as to whether it is a splice site. The input is 140 nucleotides long, with the consensus sequence (i.e., "GT" and "AG" for the donor and acceptor sites, respectively) in the middle. Each input sequence is passed to the pretrained DNN model, which determines the probability that the input is a splice site. The model consists of convolutional layers and bidirectional long short-term memory network layers. The pretraining and validation were conducted using the data set tested in previously reported methods. The performance evaluation results showed that the proposed method can outperform the previous methods. In addition, the pattern learned by the DNNs was visualized as position frequency matrices (PFMs). Some of the PFMs were very similar to the consensus sequence. The trained DNN model and the brief source code for the prediction system have been uploaded. Further improvement will be achieved following the further development of DNNs.
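
    As a rough illustration of what a position frequency matrix (PFM) is, the following Python sketch counts nucleotides per position over a set of equal-length sequences. It is not the authors' visualization code; the function name and the toy input are hypothetical.

```python
def position_frequency_matrix(seqs):
    """Count A/C/G/T occurrences at each position of equal-length sequences.

    Characters other than A, C, G, T (for example N) are ignored.
    """
    if not seqs:
        raise ValueError("at least one sequence is required")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same length")
    pfm = {base: [0] * length for base in "ACGT"}
    for seq in seqs:
        for pos, base in enumerate(seq.upper()):
            if base in pfm:
                pfm[base][pos] += 1
    return pfm


# Toy windows sharing a central "GT", loosely mimicking a donor-site motif.
toy_pfm = position_frequency_matrix(["AGGT", "CGGT", "AGGT"])
```

    Each column of the resulting matrix can be compared with a consensus such as "GT" by checking which base dominates at that position.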

  12. Cost-Effective Sequencing of Full-Length cDNA Clones Powered by a De Novo-Reference Hybrid Assembly

    PubMed Central

    Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka

    2010-01-01

    Background Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. Methodology We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence ∼800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. Conclusions The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only ∼US$3 per clone, demonstrating a significant advantage over previous approaches. PMID:20479877

  13. Defining objective clusters for rabies virus sequences using affinity propagation clustering

    PubMed Central

    Fischer, Susanne; Freuling, Conrad M.; Pfaff, Florian; Bodenhofer, Ulrich; Höper, Dirk; Fischer, Mareike; Marston, Denise A.; Fooks, Anthony R.; Mettenleiter, Thomas C.; Conraths, Franz J.; Homeier-Bachmann, Timo

    2018-01-01

    Rabies is caused by lyssaviruses, and is one of the oldest known zoonoses. In recent years, more than 21,000 nucleotide sequences of rabies viruses (RABV), from the prototype species rabies lyssavirus, have been deposited in public databases. Subsequent phylogenetic analyses in combination with metadata suggest geographic distributions of RABV. However, these analyses experience technical difficulties in defining verifiable criteria for cluster allocations in phylogenetic trees, inviting a more rational approach. Therefore, we applied a relatively new mathematical clustering algorithm named ‘affinity propagation clustering’ (AP) to propose a standardized sub-species classification utilizing full-genome RABV sequences. Because AP has the advantage that it is computationally fast and works for any meaningful measure of similarity between data samples, it has previously been applied successfully in bioinformatics, for analysis of microarray and gene expression data; however, cluster analysis of sequences is still in its infancy. Existing (516) and original (46) full genome RABV sequences were used to demonstrate the application of AP for RABV clustering. On a global scale, AP proposed four clusters, i.e. New World cluster, Arctic/Arctic-like, Cosmopolitan, and Asian as previously assigned by phylogenetic studies. By combining AP with established phylogenetic analyses, it is possible to resolve phylogenetic relationships between verifiably determined clusters and sequences. This workflow will be useful in confirming cluster distributions in a uniform transparent manner, not only for RABV, but also for other comparative sequence analyses. PMID:29357361

  14. The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

    PubMed

    Khoe, Clairine V; Chung, Long H; Murray, Vincent

    2018-06-01

    The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
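
    The two consensus tetranucleotides proposed above, 5'-YTC*Y and 5'-YTT*C (Y = T or C), can be located in a sequence with a short Python sketch. The pattern names and the helper function are hypothetical; the sketch only finds matching positions and says nothing about damage intensity.

```python
import re

# Y (pyrimidine) is written as the character class [CT].
# Lookaheads are used so that overlapping matches are all reported.
CONSENSUS_64PP = re.compile(r"(?=([CT]TC[CT]))")  # 5'-YTCY
CONSENSUS_CPD = re.compile(r"(?=([CT]TTC))")      # 5'-YTTC


def find_sites(seq, pattern):
    """Return (0-based start, matched tetranucleotide) for every match."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq.upper())]


# The two highest-intensity hexanucleotides reported in the abstract.
sites_64pp = find_sites("GTTCCC", CONSENSUS_64PP)
sites_cpd = find_sites("TTTTCG", CONSENSUS_CPD)
```

    In both hexanucleotides the single match starts at the second base, which is consistent with the starred damage positions quoted in the abstract.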

  15. A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions.

    PubMed

    Conte, Matthew A; Gammerdinger, William J; Bartie, Kerry L; Penman, David J; Kocher, Thomas D

    2017-05-02

    Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. A total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species.
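
    The contig NG50 statistic mentioned above is commonly defined as the length of the shortest contig in the set of longest contigs that together cover at least half of the estimated genome size. A minimal Python sketch under that definition follows; the function name and the toy contig lengths are illustrative and unrelated to the tilapia assembly.

```python
def ng50(contig_lengths, genome_size):
    """Return the NG50 of an assembly, or None if it covers < 50% of the genome.

    genome_size is an externally estimated genome length (an input assumption).
    """
    target = genome_size / 2
    covered = 0
    for length in sorted(contig_lengths, reverse=True):
        covered += length
        if covered >= target:
            return length
    return None


# Toy example: contigs of 50, 40 and 30 reach half of a 200-unit genome.
toy_ng50 = ng50([10, 50, 30, 20, 40], genome_size=200)
```

    Unlike N50, which uses the total assembly length as the reference, NG50 uses the estimated genome size, so assemblies of different completeness remain comparable.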

  16. Frontoxins, three-finger toxins from Micrurus frontalis venom, decrease miniature endplate potential amplitude at frog neuromuscular junction.

    PubMed

    Moreira, K G; Prates, M V; Andrade, F A C; Silva, L P; Beirão, P S L; Kushmerick, C; Naves, L A; Bloch, C

    2010-08-01

    Neurotoxicity is a major symptom of envenomation caused by Brazilian coral snake Micrurus frontalis. Due to the small amount of material that can be collected, no neurotoxin has been fully sequenced from this venom. In this work we report six new three-finger like toxins isolated from the venom of the coral snake M. frontalis which we named Frontoxin (FTx) I-VI. Toxins were purified using multiple steps of RP-HPLC. Molecular masses were determined by MALDI-TOF and ESI ion-trap mass spectrometry. The complete amino acid sequences of FTx II, III, IV and V were determined by sequencing of overlapping proteolytic fragments by Edman degradation and by de novo sequencing. The amino acid sequences of FTx I, II, III and VI predict 4 conserved disulphide bonds and structural similarity to previously reported short-chain alpha-neurotoxins. FTx IV and V each contained 10 conserved cysteines and share high similarity with long-chain alpha-neurotoxins. At the frog neuromuscular junction FTx II, III and IV reduced miniature endplate potential amplitudes in a time- and concentration-dependent manner, suggesting Frontoxins block nicotinic acetylcholine receptors. Copyright 2010 Elsevier Ltd. All rights reserved.

  17. The siRNA Non-seed Region and Its Target Sequences Are Auxiliary Determinants of Off-Target Effects.

    PubMed

    Kamola, Piotr J; Nakano, Yuko; Takahashi, Tomoko; Wilson, Paul A; Ui-Tei, Kumiko

    2015-12-01

    RNA interference (RNAi) is a powerful tool for post-transcriptional gene silencing. However, the siRNA guide strand may bind unintended off-target transcripts via partial sequence complementarity by a mechanism closely mirroring micro RNA (miRNA) silencing. To better understand these off-target effects, we investigated the correlation between sequence features within various subsections of siRNA guide strands, and its corresponding target sequences, with off-target activities. Our results confirm previous reports that strength of base-pairing in the siRNA seed region is the primary factor determining the efficiency of off-target silencing. However, the degree of downregulation of off-target transcripts with shared seed sequence is not necessarily similar, suggesting that there are additional auxiliary factors that influence the silencing potential. Here, we demonstrate that both the melting temperature (Tm) in a subsection of siRNA non-seed region, and the GC contents of its corresponding target sequences, are negatively correlated with the efficiency of off-target effect. Analysis of experimentally validated miRNA targets demonstrated a similar trend, indicating a putative conserved mechanistic feature of seed region-dependent targeting mechanism. These observations may prove useful as parameters for off-target prediction algorithms and improve siRNA 'specificity' design rules.
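
    Two of the sequence features discussed above, the seed/non-seed split of a guide strand and GC content, can be sketched in Python. The seed boundaries (guide positions 2 to 8) are a commonly used convention and an assumption here, since the abstract does not state them; the guide sequence and function names are made up for illustration.

```python
def gc_content(seq):
    """Return the fraction of G and C bases in a DNA or RNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(base in "GC" for base in seq) / len(seq)


def split_guide(guide, seed_start=2, seed_end=8):
    """Split a guide strand into (seed, non_seed) using 1-based positions.

    The default seed span of positions 2-8 is an assumed convention.
    """
    seed = guide[seed_start - 1:seed_end]
    non_seed = guide[seed_end:]
    return seed, non_seed


# Arbitrary 21-nt guide strand used only to exercise the helpers.
seed, non_seed = split_guide("UACGGAUCCAGUUCGAAGCUA")
non_seed_gc = gc_content(non_seed)
```

    A melting-temperature estimate for the non-seed region would need a thermodynamic model, which is deliberately left out of this sketch.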

  18. Probability of coding of a DNA sequence: an algorithm to predict translated reading frames from their thermodynamic characteristics.

    PubMed Central

    Tramontano, A; Macchiato, M F

    1986-01-01

    An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761
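
    The abstract refers to overlapping reading frames and to frames on the complementary strand. A minimal Python sketch that enumerates all six reading frames of a DNA sequence is given below; it does not implement the thermodynamic scoring itself, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def six_frames(seq):
    """Map (strand, offset) to the list of complete codons in that frame."""
    frames = {}
    for strand, strand_seq in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for offset in range(3):
            codons = [strand_seq[i:i + 3]
                      for i in range(offset, len(strand_seq) - 2, 3)]
            frames[(strand, offset)] = codons
    return frames


toy_frames = six_frames("ATGAAATAG")
```

    A coding-probability score such as the one described in the abstract would then be evaluated separately for each of the six codon lists.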

  19. Solving Assembly Sequence Planning using Angle Modulated Simulated Kalman Filter

    NASA Astrophysics Data System (ADS)

    Mustapa, Ainizar; Yusof, Zulkifli Md.; Adam, Asrul; Muhammad, Badaruddin; Ibrahim, Zuwairie

    2018-03-01

    This paper presents an implementation of Simulated Kalman Filter (SKF) algorithm for optimizing an Assembly Sequence Planning (ASP) problem. The SKF search strategy contains three simple steps; predict-measure-estimate. The main objective of the ASP is to determine the sequence of component installation to shorten assembly time or save assembly costs. Initially, permutation sequence is generated to represent each agent. Each agent is then subjected to a precedence matrix constraint to produce feasible assembly sequence. Next, the Angle Modulated SKF (AMSKF) is proposed for solving ASP problem. The main idea of the angle modulated approach in solving combinatorial optimization problem is to use a function, g(x), to create a continuous signal. The performance of the proposed AMSKF is compared against previous works in solving ASP by applying BGSA, BPSO, and MSPSO. Using a case study of ASP, the results show that AMSKF outperformed all the algorithms in obtaining the best solution.
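
    The precedence-matrix constraint mentioned above can be illustrated with a small Python feasibility check. The matrix convention (precedence[i][j] is truthy when component i must be installed before component j) and the function name are assumptions for illustration; the paper's actual encoding may differ.

```python
def is_feasible(sequence, precedence):
    """Check whether an assembly sequence respects a precedence matrix.

    sequence   : permutation of component indices 0..n-1
    precedence : n x n matrix; precedence[i][j] truthy means i before j
                 (an assumed convention, not taken from the paper)
    """
    position = {component: step for step, component in enumerate(sequence)}
    n = len(sequence)
    return all(
        position[i] < position[j]
        for i in range(n)
        for j in range(n)
        if precedence[i][j]
    )


# Toy chain: component 0 before 1, and 1 before 2.
toy_precedence = [
    [0, 1, 0],
    [0, 0, 1],
    [0, 0, 0],
]
forward_ok = is_feasible([0, 1, 2], toy_precedence)
reverse_ok = is_feasible([2, 1, 0], toy_precedence)
```

    In a population-based optimizer, a check like this would be applied to each agent's permutation before its assembly time or cost is evaluated.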

  20. Cytogenetic evidence for asexual evolution of bdelloid rotifers.

    PubMed

    Mark Welch, Jessica L; Mark Welch, David B; Meselson, Matthew

    2004-02-10

    DNA sequencing has shown individual bdelloid rotifer genomes to contain two or more diverged copies of every gene examined and has revealed no closely similar copies. These and other findings are consistent with long-term asexual evolution of bdelloids. It is not entirely ruled out, however, that bdelloid genomes consist of previously undetected pairs of sequences so similar as to be identical over the regions sequenced, as might result if bdelloids were highly inbred sexual diploids or polyploids. Here, we employ fluorescent in situ hybridization with cosmid probes to determine the copy number and chromosomal distribution of the heat shock gene hsp82 and adjacent sequences in the bdelloid Philodina roseola. We conclude that the four copies identified by sequencing are the only ones present and that each is on a separate chromosome. Bdelloids therefore are not highly homozygous sexually reproducing diploids or polyploids.

  1. Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

    PubMed

    Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

    1998-10-20

    Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92.2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.

  2. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  3. Three-dimensional T1rho-weighted MRI at 1.5 Tesla.

    PubMed

    Borthakur, Arijitt; Wheaton, Andrew; Charagundla, Sridhar R; Shapiro, Erik M; Regatte, Ravinder R; Akella, Sarma V S; Kneeland, J Bruce; Reddy, Ravinder

    2003-06-01

    To design and implement a magnetic resonance imaging (MRI) pulse sequence capable of performing three-dimensional T(1rho)-weighted MRI on a 1.5-T clinical scanner, and determine the optimal sequence parameters, both theoretically and experimentally, so that the energy deposition by the radiofrequency pulses in the sequence, measured as the specific absorption rate (SAR), does not exceed safety guidelines for imaging human subjects. A three-pulse cluster was pre-encoded to a three-dimensional gradient-echo imaging sequence to create a three-dimensional, T(1rho)-weighted MRI pulse sequence. Imaging experiments were performed on a GE clinical scanner with a custom-built knee-coil. We validated the performance of this sequence by imaging articular cartilage of a bovine patella and comparing T(1rho) values measured by this sequence to those obtained with a previously tested two-dimensional imaging sequence. Using a previously developed model for SAR calculation, the imaging parameters were adjusted such that the energy deposition by the radiofrequency pulses in the sequence did not exceed safety guidelines for imaging human subjects. The actual temperature increase due to the sequence was measured in a phantom by a MRI-based temperature mapping technique. Following these experiments, the performance of this sequence was demonstrated in vivo by obtaining T(1rho)-weighted images of the knee joint of a healthy individual. Calculated T(1rho) of articular cartilage in the specimen was similar for both three-dimensional and two-dimensional methods (84 +/- 2 msec and 80 +/- 3 msec, respectively). The temperature increase in the phantom resulting from the sequence was 0.015 degrees C, which is well below the established safety guidelines. Images of the human knee joint in vivo demonstrate a clear delineation of cartilage from surrounding tissues. We developed and implemented a three-dimensional T(1rho)-weighted pulse sequence on a 1.5-T clinical scanner. Copyright 2003 Wiley-Liss, Inc.

  4. Brain cDNA clone for human cholinesterase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McTiernan, C.; Adkins, S.; Chatonnet, A.

    1987-10-01

    A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.

  5. Beyond Bacteria: A Study of the Enteric Microbial Consortium in Extremely Low Birth Weight Infants

    PubMed Central

    Cotton, Charles Michael; Goldberg, Ronald N.; Wynn, James L.; Jackson, Robert B.; Seed, Patrick C.

    2011-01-01

    Extremely low birth weight (ELBW) infants have high morbidity and mortality, frequently due to invasive infections from bacteria, fungi, and viruses. The microbial communities present in the gastrointestinal tracts of preterm infants may serve as a reservoir for invasive organisms and remain poorly characterized. We used deep pyrosequencing to examine the gut-associated microbiome of 11 ELBW infants in the first postnatal month, with a first time determination of the eukaryote microbiota such as fungi and nematodes, including bacteria and viruses that have not been previously described. Among the fungi observed, Candida sp. and Clavispora sp. dominated the sequences, but a range of environmental molds were also observed. Surprisingly, seventy-one percent of the infant fecal samples tested contained ribosomal sequences corresponding to the parasitic organism Trichinella. Ribosomal DNA sequences for the roundworm symbiont Xenorhabdus accompanied these sequences in the infant with the greatest proportion of Trichinella sequences. When examining ribosomal DNA sequences in aggregate, Enterobacteriales, Pseudomonas, Staphylococcus, and Enterococcus were the most abundant bacterial taxa in a low diversity bacterial community (mean Shannon-Weaver Index of 1.02±0.69), with relatively little change within individual infants through time. To supplement the ribosomal sequence data, shotgun sequencing was performed on DNA from multiple displacement amplification (MDA) of total fecal genomic DNA from two infants. In addition to the organisms mentioned previously, the metagenome also revealed sequences for gram positive and gram negative bacteriophages, as well as human adenovirus C. Together, these data reveal surprising eukaryotic and viral microbial diversity in ELBW enteric microbiota dominated by types of bacteria known to cause invasive disease in these infants. PMID:22174751
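
    The Shannon-Weaver diversity index cited above is H = -Σ p_i ln p_i over the relative abundances p_i of the taxa in a sample. A minimal Python sketch follows; the natural logarithm is an assumption (the abstract does not state the base), and the function name and counts are invented for illustration.

```python
import math


def shannon_index(counts):
    """Return H = -sum(p * ln p) for a list of per-taxon read counts.

    Taxa with zero counts contribute nothing and are skipped.
    """
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    proportions = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p) for p in proportions)


# Two equally abundant taxa give H = ln 2; a single taxon gives H = 0.
h_even = shannon_index([5, 5])
h_single = shannon_index([7, 0, 0])
```

    Low values, like the mean reported in the abstract, indicate a community dominated by few taxa.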

  6. Evaluation of the Abbott RealTime HCV genotype II plus RUO (PLUS) assay with reference to core and NS5B sequencing.

    PubMed

    Mallory, Melanie A; Lucic, Danijela; Ebbert, Mark T W; Cloherty, Gavin A; Toolsie, Dan; Hillyard, David R

    2017-05-01

    HCV genotyping remains a critical tool for guiding initiation of therapy and selecting the most appropriate treatment regimen. Current commercial genotyping assays may have difficulty identifying 1a, 1b and genotype 6. To evaluate the concordance for identifying 1a, 1b, and genotype 6 between two methods: the PLUS assay and core/NS5B sequencing. This study included 236 plasma and serum samples previously genotyped by core/NS5B sequencing. Of these, 25 samples were also previously tested by the Abbott RealTime HCV GT II Research Use Only (RUO) assay and yielded ambiguous results. The remaining 211 samples were routine genotype 1 (n=169) and genotype 6 (n=42). Genotypes obtained from sequence data were determined using a laboratory-developed HCV sequence analysis tool and the NCBI non-redundant database. Agreement between the PLUS assay and core/NS5B sequencing for genotype 1 samples was 95.8% (162/169), with 96% (127/132) and 95% (35/37) agreement for 1a and 1b samples respectively. PLUS results agreed with core/NS5B sequencing for 83% (35/42) of unselected genotype 6 samples, with the remaining seven "not detected" by the PLUS assay. Among the 25 samples with ambiguous GT II results, 15 were concordant by PLUS and core/NS5B sequencing, nine were not detected by PLUS, and one sample had an internal control failure. The PLUS assay is an automated method that identifies 1a, 1b and genotype 6 with good agreement with gold-standard core/NS5B sequencing and can aid in the resolution of certain genotype samples with ambiguous GT II results. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Protein structure determination by exhaustive search of Protein Data Bank derived databases.

    PubMed

    Stokes-Rees, Ian; Sliz, Piotr

    2010-12-14

    Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aid of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.

  8. Molecular epidemiology of drug-resistant Neisseria gonorrhoeae in Russia (Current Status, 2015).

    PubMed

    Kubanov, Alexey; Vorobyev, Denis; Chestkov, Aleksandr; Leinsoo, Arvo; Shaskolskiy, Boris; Dementieva, Ekaterina; Solomka, Viktoria; Plakhova, Xenia; Gryadunov, Dmitry; Deryabin, Dmitriy

    2016-08-09

    The widespread distribution of Neisseria gonorrhoeae strains that are resistant to previously used and clinically implemented antibiotics is a significant global public health problem. In line with WHO standards, the national Gonococcal Antimicrobial Surveillance Programme (RU-GASP) has been in existence in Russia since 2004; herein, the current status (2015) is described, including associations between N. gonorrhoeae antimicrobial susceptibility, primary genetic resistance determinants and specific strain sequence types. A total of 124 N. gonorrhoeae strains obtained from 9 regions in Russia in 2015 were examined using N. gonorrhoeae Multi-Antigen Sequence Typing (NG-MAST), an antimicrobial susceptibility test according to European Committee on Antimicrobial Susceptibility Testing (EUCAST) criteria and an oligonucleotide microarray for the identification of mutations in the penA, ponA, rpsJ, gyrA and parC genes responsible for penicillin G, tetracycline, and fluoroquinolone resistance. Genogroup (G) isolates were evaluated based on their porB and tbpB sequence types (STs). NG-MAST analysis showed a diversified population of N. gonorrhoeae in Russia with 58 sequence types, 35 of which were described for the first time. The STs 807, 1544, 1993, 5714, 9476 and 12531, which were typical for some Russian Federation regions and several countries of the former Soviet Union, were represented by five or more isolates. The internationally widespread ST 1407 was represented by a single strain in the present study. Division into genogroups facilitated an exploration of the associations between N. gonorrhoeae sequence type, antimicrobial resistance spectra and genetic resistance determinant contents. Preliminarily susceptible (G-807, G-12531) and resistant (G-5714, G-9476) genogroups were revealed. The variability in the most frequently observed STs and genogroups in each participating region indicated geographically restricted antimicrobial susceptibility in N. gonorrhoeae populations. Resistance or intermediate susceptibility to previously recommended antimicrobials, such as penicillin G (60.5 %), ciprofloxacin (41.1 %) and tetracycline (25 %), is common in the N. gonorrhoeae population. Based on previous reports and current data, ceftriaxone and spectinomycin should be recommended for first-line empiric antimicrobial monotherapy for gonorrhoea in Russia.

  9. Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences.

    PubMed

    Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H

    2007-02-01

    Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.
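
    The likelihood ratio test mentioned in this abstract compares a null model with a nested alternative by referring twice the log-likelihood difference to a chi-square distribution. The following is a minimal sketch of that arithmetic only; the log-likelihood values and the choice of degrees of freedom are hypothetical, since the abstract does not state which models or parameters were compared.

```python
import math


def lrt_pvalue(lnl_null, lnl_alt, df=1):
    """Return (statistic, p-value) for a likelihood ratio test of nested models.

    Only df = 1 and df = 2 are handled, because their chi-square tail
    probabilities have simple closed forms.
    """
    # Test statistic: twice the gain in log-likelihood (floored at zero).
    stat = max(0.0, 2.0 * (lnl_alt - lnl_null))
    if df == 1:
        p_value = math.erfc(math.sqrt(stat / 2.0))
    elif df == 2:
        p_value = math.exp(-stat / 2.0)
    else:
        raise ValueError("this sketch supports df = 1 or df = 2 only")
    return stat, p_value


# Hypothetical log-likelihoods for one sequence pair (not from the study).
stat, p_value = lrt_pvalue(lnl_null=-1000.0, lnl_alt=-997.0, df=2)
print(f"2*delta lnL = {stat:.2f}, p = {p_value:.4f}")
```

    In practice the log-likelihoods would come from a codon-model fitting program; the sketch covers only the final comparison step.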

  10. Identification of putative plant pathogenic determinants from a draft genome sequence of an opportunistic Klebsiella pneumoniae strain

    USDA-ARS?s Scientific Manuscript database

    Klebsiella pneumoniae has been known historically as a causal agent of bacterial pneumonia. More recently, K. pneumoniae representatives have been shown to have a broad ecological distribution and are recognized nitrogen-fixers. Previously, we demonstrated the capacity of K. pneumoniae strain Kp 5-1R...

  11. A Mathematics Entrance Exam for General (Non-Majors) Physics

    ERIC Educational Resources Information Center

    Chediak, Alex

    2010-01-01

    In a previous issue of "The Physics Teacher", John Hubisz explained how a mathematics background check has been used at three different colleges to determine the appropriate physics sequence for incoming students. Based on their performance, students are placed into either calculus-based physics (CBP), algebra-trig physics (ATP), or a year of…

  12. Sequencing of emerging canine distemper virus strain reveals new distinct genetic lineage in the United States associated with disease in wildlife and domestic canine populations.

    PubMed

    Riley, Matthew C; Wilkes, Rebecca P

    2015-12-18

    Recent outbreaks of canine distemper have prompted examination of strains from clinical samples submitted to the University of Tennessee College of Veterinary Medicine (UTCVM) Clinical Virology Lab. We previously described a new strain of CDV that significantly diverged from all genotypes reported to date including America 2, the genotype proposed to be the main lineage currently circulating in the US. The aim of this study was to determine when this new strain appeared and how widespread it is in animal populations, given that it has also been detected in fully vaccinated adult dogs. Additionally, we sequenced complete viral genomes to characterize the strain and determine if variation is confined to known variable regions of the genome or if the changes are also present in more conserved regions. Archived clinical samples were genotyped using real-time RT-PCR amplification and sequencing. The genomes of two unrelated viruses from a dog and fox each from a different state were sequenced and aligned with previously published genomes. Phylogenetic analysis was performed using coding, non-coding and genome-length sequences. Virus neutralization assays were used to evaluate potential antigenic differences between this strain and a vaccine strain and mixed ANOVA test was used to compare the titers. Genotyping revealed this strain first appeared in 2011 and was detected in dogs from multiple states in the Southeast region of the United States. It was the main strain detected among the clinical samples that were typed from 2011-2013, including wildlife submissions. Genome sequencing demonstrated that it is highly conserved within a new lineage and preliminary serologic testing showed significant differences in neutralizing antibody titers between this strain and the strain commonly used in vaccines. This new strain represents an emerging CDV in domestic dogs in the US, may be associated with a stable reservoir in the wildlife population, and could facilitate vaccine escape.

  13. Individual microRNAs (miRNAs) display distinct mRNA targeting "rules".

    PubMed

    Wang, Wang-Xia; Wilfred, Bernard R; Xie, Kevin; Jennings, Mary H; Hu, Yanling; Stromberg, Arnold J; Nelson, Peter T

    2010-01-01

    MicroRNAs (miRNAs) guide Argonaute (AGO)-containing microribonucleoprotein (miRNP) complexes to target mRNAs. It has been assumed that miRNAs behave similarly to each other with regard to mRNA target recognition. The usual assumptions, which are based on prior studies, are that miRNAs preferentially target sequences in the 3' UTR of mRNAs, guided by the 5' "seed" portion of the miRNAs. Here we isolated AGO- and miRNA-containing miRNPs from human H4 tumor cells by co-immunoprecipitation (co-IP) with anti-AGO antibody. Cells were transfected with miR-107, miR-124, miR-128, miR-320, or a negative control miRNA. Co-IPed RNAs were subjected to downstream high-density Affymetrix Human Gene 1.0 ST microarray analyses using an assay we validated previously (a "RIP-Chip" experimental design). RIP-Chip data provided a list of mRNAs recruited into the AGO-miRNP in correlation to each miRNA. These experimentally identified miRNA targets were analyzed for complementary six-nucleotide "seed" sequences within the transfected miRNAs. We found that miR-124 targets tended to have sequences in the 3' UTR that would be recognized by the 5' seed of miR-124, as described in previous studies. By contrast, miR-107 targets tended to have "seed" sequences in the mRNA open reading frame, but not the 3' UTR. Further, mRNA targets of miR-128 and miR-320 are less enriched for 6-mer seed sequences in comparison to miR-107 and miR-124. In sum, our data support the importance of the 5' seed in determining binding characteristics for some miRNAs; however, the "binding rules" are complex, and individual miRNAs can have distinct sequence determinants that lead to mRNA targeting.
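
    The 6-mer seed analysis described in this abstract can be illustrated by counting, separately in an open reading frame and a 3' UTR, the sites that are reverse-complementary to a miRNA seed. This is only a sketch under stated assumptions: the seed is taken as miRNA nucleotides 2 to 7 (a common convention that the abstract does not confirm), and the miRNA, ORF and UTR strings are invented toy sequences rather than real transcripts.

```python
# Map each RNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def seed_site(mirna, start=2, length=6):
    """Return the mRNA motif (5'->3') that pairs with the miRNA seed.

    `start` is 1-based; nucleotides 2-7 are an assumed seed definition.
    """
    seed = mirna[start - 1:start - 1 + length]
    return seed.translate(COMPLEMENT)[::-1]


def count_sites(mirna, region):
    """Count (possibly overlapping) seed-match sites in an mRNA region."""
    site = seed_site(mirna)
    return sum(
        region[i:i + len(site)] == site
        for i in range(len(region) - len(site) + 1)
    )


# Toy sequences, invented for illustration only.
toy_mirna = "UACGGAUCCAGUCGAUUGCAA"
toy_orf = "AUGGCAUCCGUAAGC"
toy_utr = "GGAUCCGUUUAUCCGUAA"

orf_hits = count_sites(toy_mirna, toy_orf)
utr_hits = count_sites(toy_mirna, toy_utr)
print(seed_site(toy_mirna), orf_hits, utr_hits)
```

    Comparing such counts between regions, and against a background set of transcripts, is the kind of enrichment question the abstract raises; the statistics for that comparison are not shown here.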

  14. Sequential congruency effects: disentangling priming and conflict adaptation.

    PubMed

    Puccioni, Olga; Vallesi, Antonino

    2012-09-01

    Responding to the color of a word is slower and less accurate if the word refers to a different color (incongruent condition) than if it refers to the same color (congruent condition). This phenomenon, known as the Stroop effect, is modulated by sequential effects: it is bigger when the current trial is preceded by a congruent condition than by an incongruent one in the previous trial. Whether this phenomenon is due to priming mechanisms or to cognitive control is still debated. To disentangle the contribution of priming with respect to conflict adaptation mechanisms in determining sequential effects, two experiments were designed here with a four-alternative forced choice (4-AFC) Stroop task: in the first one only trials with complete alternations of features were used, while in the second experiment all possible types of repetitions were presented. Both response times (RTs) and errors were evaluated. Conflict adaptation effects on RTs were limited to congruent trials and were exclusively due to priming: they disappeared in the priming-free experiment and, in the second experiment, they occurred in sequences with feature repetitions but not in complete alternation sequences. Error results, instead, support the presence of conflict adaptation effects in incongruent trials. In priming-free sequences (experiment 1 and complete alternation sequences of experiment 2) with incongruent previous trials there was no error Stroop effect, while this effect was significant with congruent previous trials. These results indicate that cognitive control may modulate performance above and beyond priming effects.

  15. ProteomeVis: a web app for exploration of protein properties from structure to sequence evolution across organisms' proteomes.

    PubMed

    Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I

    2018-05-08

    Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value < 10^-10) and -0.46 (p-value < 10^-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. Contact: shakhnovich@chemistry.harvard.edu.

  16. Recent horizontal transfer of mellifera subfamily mariner transposons into insect lineages representing four different orders shows that selection acts only during horizontal transfer.

    PubMed

    Lampe, David J; Witherspoon, David J; Soto-Adames, Felipe N; Robertson, Hugh M

    2003-04-01

    We report the isolation and sequencing of genomic copies of mariner transposons involved in recent horizontal transfers into the genomes of the European earwig, Forficula auricularia; the European honey bee, Apis mellifera; the Mediterranean fruit fly, Ceratitis capitata; and a blister beetle, Epicauta funebris, insects from four different orders. These elements are in the mellifera subfamily and are the second documented example of full-length mariner elements involved in this kind of phenomenon. We applied maximum likelihood methods to the coding sequences and determined that the copies in each genome were evolving neutrally, whereas reconstructed ancestral coding sequences appeared to be under selection, which strengthens our previous hypothesis that the primary selective constraint on mariner sequence evolution is the act of horizontal transfer between genomes.

  17. Sequence-dependent effects in drug-DNA interaction: the crystal structure of Hoechst 33258 bound to the d(CGCAAATTTGCG)2 duplex.

    PubMed Central

    Spink, N; Brown, D G; Skelly, J V; Neidle, S

    1994-01-01

    The bis-benzimidazole drug Hoechst 33258 has been co-crystallized with the dodecanucleotide sequence d(CGCAAATTTGCG)2. The structure has been solved by molecular replacement and refined to an R factor of 18.5% for 2125 reflections collected on a Xentronics area detector. The drug is bound in the minor groove, at the five base-pair site 5'-ATTTG and is in a unique orientation. This is displaced by one base pair in the 5' direction compared to previously-determined structures of this drug with the sequence d(CGCGAATTCGCG)2. Reasons for this difference in behaviour are discussed in terms of several sequence-dependent structural features of the DNA, with particular reference to differences in propeller twist and minor-groove width. PMID:7515488

  18. The Applied Development of a Tiered Multilocus Sequence Typing (MLST) Scheme for Dichelobacter nodosus.

    PubMed

    Blanchard, Adam M; Jolley, Keith A; Maiden, Martin C J; Coffey, Tracey J; Maboni, Grazieli; Staley, Ceri E; Bollard, Nicola J; Warry, Andrew; Emes, Richard D; Davies, Peers L; Tötemeyer, Sabine

    2018-01-01

    Dichelobacter nodosus (D. nodosus) is the causative pathogen of ovine footrot, a disease that has a significant welfare and financial impact on the global sheep industry. Previous studies into the phylogenetics of D. nodosus have focused on Australia and Scandinavia, meaning the current diversity in the United Kingdom (U.K.) population and its relationship globally is poorly understood. Numerous epidemiological methods are available for bacterial typing; however, few account for whole genome diversity or provide the opportunity for future application of new computational techniques. Multilocus sequence typing (MLST) measures nucleotide variations within several loci with slow accumulation of variation to enable the designation of allele numbers to determine a sequence type. The usage of whole genome sequence data enables the application not only of MLST but also of core and whole genome MLST for higher levels of strain discrimination with a negligible increase in experimental cost. An MLST database was developed alongside a seven-locus scheme using publicly available whole genome data from the sequence read archive. Sequence type designation and strain discrimination were compared to previously published data to ensure reproducibility. Multiple D. nodosus isolates from U.K. farms were directly compared to populations from other countries. The U.K. isolates define new clades within the global population of D. nodosus and predominantly consist of serogroups A, B and H; however, serogroups C, D, E, and I were also found. The scheme is publicly available at https://pubmlst.org/dnodosus/.
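
    The MLST logic summarized in this abstract (each distinct sequence at a locus receives an allele number, and each distinct combination of allele numbers receives a sequence type) can be sketched as follows. The locus names, isolate names and sequences are hypothetical, only three loci are used instead of seven for brevity, and numbering by order of first appearance is an assumption; a curated database would assign numbers centrally.

```python
def assign_alleles(isolates, loci):
    """Give each distinct sequence at a locus an allele number (1, 2, ...).

    `isolates` maps isolate name -> {locus name -> sequence}.
    Returns isolate name -> tuple of allele numbers, ordered as `loci`.
    """
    allele_ids = {locus: {} for locus in loci}
    profiles = {}
    for name, seqs in isolates.items():
        profile = []
        for locus in loci:
            table = allele_ids[locus]
            # A new sequence receives the next unused number for this locus.
            profile.append(table.setdefault(seqs[locus], len(table) + 1))
        profiles[name] = tuple(profile)
    return profiles


def assign_sequence_types(profiles):
    """Give each distinct allelic profile a sequence type number."""
    st_ids = {}
    return {
        name: st_ids.setdefault(profile, len(st_ids) + 1)
        for name, profile in profiles.items()
    }


# Hypothetical loci and toy sequences, for illustration only.
loci = ["locusA", "locusB", "locusC"]
isolates = {
    "isolate1": {"locusA": "ACGT", "locusB": "GGCA", "locusC": "TTAG"},
    "isolate2": {"locusA": "ACGT", "locusB": "GGCA", "locusC": "TTAG"},
    "isolate3": {"locusA": "ACGT", "locusB": "GGTA", "locusC": "TTAG"},
}
profiles = assign_alleles(isolates, loci)
sequence_types = assign_sequence_types(profiles)
print(profiles, sequence_types)
```

    A single nucleotide difference at one locus is enough to produce a new allele number and therefore a new sequence type, as with the third toy isolate.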

  19. Distributed biotin–streptavidin transcription roadblocks for mapping cotranscriptional RNA folding

    PubMed Central

    Strobel, Eric J.; Nedialkov, Yuri; Artsimovitch, Irina

    2017-01-01

    RNA folding during transcription directs an order of folding that can determine RNA structure and function. However, the experimental study of cotranscriptional RNA folding has been limited by the lack of easily approachable methods that can interrogate nascent RNA structure at nucleotide resolution. To address this, we previously developed cotranscriptional selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) to simultaneously probe all intermediate RNA transcripts during transcription by stalling elongation complexes at catalytically dead EcoRI E111Q roadblocks. While effective, the distribution of elongation complexes using EcoRI E111Q requires laborious PCR using many different oligonucleotides for each sequence analyzed. Here, we improve the broad applicability of cotranscriptional SHAPE-Seq by developing a sequence-independent biotin–streptavidin (SAv) roadblocking strategy that simplifies the preparation of roadblocking DNA templates. We first determine the properties of biotin–SAv roadblocks. We then show that randomly distributed biotin–SAv roadblocks can be used in cotranscriptional SHAPE-Seq experiments to identify the same RNA structural transitions related to a riboswitch decision-making process that we previously identified using EcoRI E111Q. Lastly, we find that EcoRI E111Q maps nascent RNA structure to specific transcript lengths more precisely than biotin–SAv and propose guidelines to leverage the complementary strengths of each transcription roadblock in cotranscriptional SHAPE-Seq. PMID:28398514

  20. Distributed biotin-streptavidin transcription roadblocks for mapping cotranscriptional RNA folding.

    PubMed

    Strobel, Eric J; Watters, Kyle E; Nedialkov, Yuri; Artsimovitch, Irina; Lucks, Julius B

    2017-07-07

    RNA folding during transcription directs an order of folding that can determine RNA structure and function. However, the experimental study of cotranscriptional RNA folding has been limited by the lack of easily approachable methods that can interrogate nascent RNA structure at nucleotide resolution. To address this, we previously developed cotranscriptional selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) to simultaneously probe all intermediate RNA transcripts during transcription by stalling elongation complexes at catalytically dead EcoRI E111Q roadblocks. While effective, the distribution of elongation complexes using EcoRI E111Q requires laborious PCR using many different oligonucleotides for each sequence analyzed. Here, we improve the broad applicability of cotranscriptional SHAPE-Seq by developing a sequence-independent biotin-streptavidin (SAv) roadblocking strategy that simplifies the preparation of roadblocking DNA templates. We first determine the properties of biotin-SAv roadblocks. We then show that randomly distributed biotin-SAv roadblocks can be used in cotranscriptional SHAPE-Seq experiments to identify the same RNA structural transitions related to a riboswitch decision-making process that we previously identified using EcoRI E111Q. Lastly, we find that EcoRI E111Q maps nascent RNA structure to specific transcript lengths more precisely than biotin-SAv and propose guidelines to leverage the complementary strengths of each transcription roadblock in cotranscriptional SHAPE-Seq. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Allelic variants of hereditary prions: The bimodularity principle.

    PubMed

    Tikhodeyev, Oleg N; Tarasov, Oleg V; Bondarev, Stanislav A

    2017-01-02

    Modern biology requires modern genetic concepts equally valid for all discovered mechanisms of inheritance, either "canonical" (mediated by DNA sequences) or epigenetic. Applying basic genetic terms such as "gene" and "allele" to protein hereditary factors is one of the necessary steps toward these concepts. The basic idea that different variants of the same prion protein can be considered as alleles has been previously proposed by Chernoff and Tuite. In this paper, the notion of prion allele is further developed. We propose the idea that any prion allele is a bimodular hereditary system that depends on a certain DNA sequence (DNA determinant) and a certain epigenetic mark (epigenetic determinant). Alteration of any of these 2 determinants may lead to establishment of a new prion allele. The bimodularity principle is valid not only for hereditary prions; it seems to be universal for any epigenetic hereditary factor.

  2. Allelic variants of hereditary prions: The bimodularity principle

    PubMed Central

    Tikhodeyev, Oleg N.; Tarasov, Oleg V.; Bondarev, Stanislav A.

    2017-01-01

    Modern biology requires modern genetic concepts equally valid for all discovered mechanisms of inheritance, either “canonical” (mediated by DNA sequences) or epigenetic. Applying basic genetic terms such as “gene” and “allele” to protein hereditary factors is one of the necessary steps toward these concepts. The basic idea that different variants of the same prion protein can be considered as alleles has been previously proposed by Chernoff and Tuite. In this paper, the notion of prion allele is further developed. We propose the idea that any prion allele is a bimodular hereditary system that depends on a certain DNA sequence (DNA determinant) and a certain epigenetic mark (epigenetic determinant). Alteration of any of these 2 determinants may lead to establishment of a new prion allele. The bimodularity principle is valid not only for hereditary prions; it seems to be universal for any epigenetic hereditary factor. PMID:28281926

  3. Aromatic claw: A new fold with high aromatic content that evades structural prediction: Aromatic Claw

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sachleben, Joseph R.; Adhikari, Aashish N.; Gawlak, Grzegorz

    2016-11-10

    We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet with a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.

  4. Efficient high-throughput sequencing of a laser microdissected chromosome arm

    PubMed Central

    2013-01-01

    Background: Genomic sequence assemblies are key tools for a broad range of gene function and evolutionary studies. The diploid amphibian Xenopus tropicalis plays a pivotal role in these fields due to its combination of experimental flexibility, diploid genome, and early-branching tetrapod taxonomic position, having diverged from the amniote lineage ~360 million years ago. A genome assembly and a genetic linkage map have recently been made available. Unfortunately, large gaps in the linkage map attenuate long-range integrity of the genome assembly. Results: We laser dissected the short arm of X. tropicalis chromosome 7 for next generation sequencing and computational mapping to the reference genome. This arm is of particular interest as it encodes the sex determination locus, but its genetic map contains large gaps which undermine available genome assemblies. Whole genome amplification of 15 laser-microdissected 7p arms followed by next generation sequencing yielded ~35 million reads, over four million of which uniquely mapped to the X. tropicalis genome. Our analysis placed more than 200 previously unmapped scaffolds on the analyzed chromosome arm, providing valuable low-resolution physical map information for de novo genome assembly. Conclusion: We present a new approach for improving and validating genetic maps and sequence assemblies. Whole genome amplification of 15 microdissected chromosome arms provided sufficient high-quality material for localizing previously unmapped scaffolds and genes as well as recognizing mislocalized scaffolds. PMID:23714049

  5. Strongylus asini (Nematoda, Strongyloidea): genetic relationships with other Strongylus species determined by ribosomal DNA.

    PubMed

    Hung, G C; Jacobs, D E; Krecek, R C; Gasser, R B; Chilton, N B

    1996-12-01

    Genomic DNA was isolated from adult Strongylus asini collected from zebra. The second ribosomal transcribed spacer (ITS-2) was amplified and sequenced using polymerase chain reaction (PCR) based techniques. The DNA sequence was compared with previously published data for 3 related Strongylus species. A PCR-linked restriction fragment length polymorphism method allowed the 4 species to be differentiated unequivocally. The ITS-2 sequence of S. asini was found to be more similar to those of S. edentatus (87.1%) and S. equinus (95.3%) than to that of S. vulgaris (73.9%). This result confirms that S. asini and S. vulgaris represent separate species and supports the retention of the 4 species within 1 genus.

  6. Chorea-acanthocytosis

    PubMed Central

    Walker, Susan; Dad, Rubina; Thiruvahindrapuram, Bhooma; Ullah, Muhammed Ikram; Ahmad, Arsalan; Hassan, Muhammad Jawad; Scherer, Stephen W.

    2018-01-01

    Objective: To determine a molecular diagnosis for a large multigenerational family of South Asian ancestry with seizures, hyperactivity, and episodes of tongue biting. Methods: Two affected individuals from the family were analyzed by whole-genome sequencing on the Illumina HiSeq X platform, and rare variants were prioritized for interpretation with respect to the phenotype. Results: A previously undescribed, 1-kb homozygous deletion was identified in both individuals sequenced, which spanned 2 exons of the VPS13A gene, and was found to segregate in other family members. Conclusions: VPS13A is associated with autosomal recessive chorea-acanthocytosis, a diagnosis consistent with the phenotype observed in this family. Whole-genome sequencing presents a comprehensive and agnostic approach for detecting diagnostic mutations in families with rare neurologic disorders. PMID:29845114

  7. Burkholderia sp. induces functional nodules on the South African invasive legume Dipogon lignosus (Phaseoleae) in New Zealand soils.

    PubMed

    Liu, Wendy Y Y; Ridgway, Hayley J; James, Trevor K; James, Euan K; Chen, Wen-Ming; Sprent, Janet I; Young, J Peter W; Andrews, Mitchell

    2014-10-01

    The South African invasive legume Dipogon lignosus (Phaseoleae) produces nodules with both determinate and indeterminate characteristics in New Zealand (NZ) soils. Ten bacterial isolates produced functional nodules on D. lignosus. The 16S ribosomal RNA (rRNA) gene sequences identified one isolate as Bradyrhizobium sp., one isolate as Rhizobium sp. and eight isolates as Burkholderia sp. The Bradyrhizobium sp. and Rhizobium sp. 16S rRNA sequences were identical to those of strains previously isolated from crop plants and may have originated from inocula used on crops. Both 16S rRNA and DNA recombinase A (recA) gene sequences placed the eight Burkholderia isolates separate from previously described Burkholderia rhizobial species. However, the isolates showed a very close relationship to Burkholderia rhizobial strains isolated from South African plants with respect to their nitrogenase iron protein (nifH), N-acyltransferase nodulation protein A (nodA) and N-acetylglucosaminyl transferase nodulation protein C (nodC) gene sequences. Gene sequences and enterobacterial repetitive intergenic consensus (ERIC) PCR and repetitive element palindromic PCR (rep-PCR) banding patterns indicated that the eight Burkholderia isolates separated into five clones of one strain and three of another. One strain was tested and shown to produce functional nodules on a range of South African plants previously reported to be nodulated by Burkholderia tuberum STM678(T) which was isolated from the Cape Region. Thus, evidence is strong that the Burkholderia strains isolated here originated in South Africa and were somehow transported with the plants from their native habitat to NZ. It is possible that the strains are of a new species capable of nodulating legumes.

  8. Evaluation of nearest-neighbor methods for detection of chimeric small-subunit rRNA sequences

    NASA Technical Reports Server (NTRS)

    Robison-Cox, J. F.; Bateson, M. M.; Ward, D. M.

    1995-01-01

    Detection of chimeric artifacts formed when PCR is used to retrieve naturally occurring small-subunit (SSU) rRNA sequences may rely on demonstrating that different sequence domains have different phylogenetic affiliations. We evaluated the CHECK_CHIMERA method of the Ribosomal Database Project and another method which we developed, both based on determining nearest neighbors of different sequence domains, for their ability to discern artificially generated SSU rRNA chimeras from authentic Ribosomal Database Project sequences. The reliability of both methods decreases when the parental sequences which contribute to chimera formation are more than 82 to 84% similar. Detection is also complicated by the occurrence of authentic SSU rRNA sequences that behave like chimeras. We developed a naive statistical test based on CHECK_CHIMERA output and used it to evaluate previously reported SSU rRNA chimeras. Application of this test also suggests that chimeras might be formed by retrieving SSU rRNAs as cDNA. The amount of uncertainty associated with nearest-neighbor analyses indicates that such tests alone are insufficient and that better methods are needed.
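
    The nearest-neighbor idea in this abstract (a sequence is suspect when different domains of it have different closest relatives) can be sketched as below. This is not the CHECK_CHIMERA program or the authors' second method; it is a simplified illustration that assumes pre-aligned sequences of equal length, a single known breakpoint, and toy reference sequences invented for the example.

```python
def identity(a, b):
    """Fraction of matching positions between two aligned, equal-length strings."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def nearest_neighbor(fragment, references):
    """Name of the reference fragment most similar to `fragment`."""
    return max(references, key=lambda name: identity(fragment, references[name]))


def flag_chimera(query, references, breakpoint):
    """Compare the nearest neighbors of the two domains around `breakpoint`.

    Returns (is_suspect, left_neighbor, right_neighbor).
    """
    left_refs = {name: seq[:breakpoint] for name, seq in references.items()}
    right_refs = {name: seq[breakpoint:] for name, seq in references.items()}
    left = nearest_neighbor(query[:breakpoint], left_refs)
    right = nearest_neighbor(query[breakpoint:], right_refs)
    return left != right, left, right


# Toy "references" and queries, invented for illustration only.
references = {
    "ref_A": "ACGTACGTACGTACGTACGT",
    "ref_B": "TTGCATTGCATTGCATTGCA",
}
chimeric_query = references["ref_A"][:10] + references["ref_B"][10:]
authentic_query = "ACGTACGTACGTACGTACGA"  # ref_A with one mismatch

chimera_result = flag_chimera(chimeric_query, references, breakpoint=10)
authentic_result = flag_chimera(authentic_query, references, breakpoint=10)
print(chimera_result, authentic_result)
```

    As the abstract notes, this kind of test loses power when the parental sequences are very similar, because both domains then tend to share the same nearest neighbor.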

  9. Hairpin structures with conserved sequence motifs determine the 3' ends of non-polyadenylated invertebrate iridovirus transcripts.

    PubMed

    İnce, İkbal Agah; Pijlman, Gorben P; Vlak, Just M; van Oers, Monique M

    2017-11-01

    Previously, we observed that the transcripts of Invertebrate iridescent virus 6 (IIV6) are not polyadenylated, in line with the absence of canonical poly(A) motifs (AATAAA) downstream of the open reading frames (ORFs) in the genome. Here, we determined the 3' ends of the transcripts of fifty-four IIV6 virion protein genes in infected Drosophila Schneider 2 (S2) cells. By using ligation-based amplification of cDNA ends (LACE) it was shown that the IIV6 mRNAs often ended with a CAUUA motif. In silico analysis showed that the 3'-untranslated regions of IIV6 genes have the ability to form hairpin structures (22-56 nt in length) and that for about half of all IIV6 genes these 3' sequences contained complementary TAATG and CATTA motifs. We also show that a hairpin in the 3' flanking region with conserved sequence motifs is a conserved feature in invertebrate-infecting iridoviruses (genera Iridovirus and Chloriridovirus). Copyright © 2017 Elsevier Inc. All rights reserved.
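
    The statement that TAATG and CATTA are complementary can be checked directly: each motif is the reverse complement of the other, which is what allows the two to pair as opposite strands of a hairpin stem. The helper names below are illustrative and not taken from the study.

```python
# Map each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def can_form_stem(upstream, downstream):
    """True if two motifs on the same strand can base-pair as a hairpin stem."""
    return reverse_complement(upstream) == downstream


pairs = can_form_stem("TAATG", "CATTA")
print(reverse_complement("TAATG"), pairs)
```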

  10. Brain transcriptome sequencing and assembly of three songbird model systems for the study of social behavior

    PubMed Central

    Mukai, Motoko; Gonser, Rusty A.; Wingfield, John C.; London, Sarah E.; Tuttle, Elaina M.; Clayton, David F.

    2014-01-01

    Emberizid sparrows (Emberizidae) have played a prominent role in the study of avian vocal communication and social behavior. We present here brain transcriptomes for three emberizid model systems, song sparrow Melospiza melodia, white-throated sparrow Zonotrichia albicollis, and Gambel’s white-crowned sparrow Zonotrichia leucophrys gambelii. Each of the assemblies covered, fully or in part, over 89% of the previously annotated protein coding genes in the zebra finch Taeniopygia guttata, with 16,846, 15,805, and 16,646 unique BLAST hits in song, white-throated and white-crowned sparrows, respectively. As in previous studies, we find tissue of origin (auditory forebrain versus hypothalamus and whole brain) as an important determinant of overall expression profile. We also demonstrate the successful isolation of RNA and RNA-sequencing from post-mortem samples from building strikes and suggest that such an approach could be useful when traditional sampling opportunities are limited. These transcriptomes will be an important resource for the study of social behavior in birds and for data driven annotation of forthcoming whole genome sequences for these and other bird species. PMID:24883256

  11. Rapid Hypothesis Testing with Candida albicans through Gene Disruption with Short Homology Regions

    PubMed Central

    Wilson, R. Bryce; Davis, Dana; Mitchell, Aaron P.

    1999-01-01

    Disruption of newly identified genes in the pathogen Candida albicans is a vital step in determination of gene function. Several gene disruption methods described previously employ long regions of homology flanking a selectable marker. Here, we describe disruption of C. albicans genes with PCR products that have 50 to 60 bp of homology to a genomic sequence on each end of a selectable marker. We used the method to disrupt two known genes, ARG5 and ADE2, and two sequences newly identified through the Candida genome project, HRM101 and ENX3. HRM101 and ENX3 are homologous to genes in the conserved RIM101 (previously called RIM1) and PacC pathways of Saccharomyces cerevisiae and Aspergillus nidulans. We show that three independent hrm101/hrm101 mutants and two independent enx3/enx3 mutants are defective in filamentation on Spider medium. These observations argue that HRM101 and ENX3 sequences are indeed portions of genes and that the respective gene products have related functions. PMID:10074081

  12. Petri net modeling of high-order genetic systems using grammatical evolution.

    PubMed

    Moore, Jason H; Hahn, Lance W

    2003-11-01

    Understanding how DNA sequence variations impact human health through a hierarchy of biochemical and physiological systems is expected to improve the diagnosis, prevention, and treatment of common, complex human diseases. We have previously developed a hierarchical dynamic systems approach based on Petri nets for generating biochemical network models that are consistent with genetic models of disease susceptibility. This modeling approach uses an evolutionary computation approach called grammatical evolution as a search strategy for optimal Petri net models. We have previously demonstrated that this approach routinely identifies biochemical network models that are consistent with a variety of genetic models in which disease susceptibility is determined by nonlinear interactions between two DNA sequence variations. In the present study, we evaluate whether the Petri net approach is capable of identifying biochemical networks that are consistent with disease susceptibility due to higher order nonlinear interactions between three DNA sequence variations. The results indicate that our model-building approach is capable of routinely identifying good, but not perfect, Petri net models. Ideas for improving the algorithm for this high-dimensional problem are presented.

  13. Hepatitis E Virus of Subtype 3a in a Pig Farm, South-Eastern France.

    PubMed

    Colson, P; Saint-Jacques, P; Ferretti, A; Davoust, B

    2015-12-01

    Hepatitis E virus (HEV) has emerged during the past decade as a causative agent of autochthonous hepatitis and is a clinical concern in Western developed countries. It has been increasingly recognized that pigs are a major reservoir of HEV of genotypes 3 and 4 worldwide and pig-derived food items represent a potential source of infections by these viruses in humans. Hepatitis E virus RNA testing was performed here on faeces from rectal swabs sampled in 2012 from 50 3-month-old farm pigs from the same farm in south-eastern France as in a previous study conducted in 2007. Pig HEV sequences corresponding to genomic fragments of ORF2 and ORF1 genes were obtained after RT-PCR amplification with in-house protocols. Hepatitis E virus genotype was determined by phylogenetic analysis. Prevalence was similar to that determined 5 years earlier (68% versus 62%). Two robust phylogenetic clusters of HEV subtypes 3a and 3f were identified, and the sequences obtained in 2012 differed largely from those obtained in 2007. Notably, HEV sequences obtained in 2012 from a majority (62%) of the infected pigs belonged to subtype 3a, which had not previously been described in France in humans, pigs or wild boars. Further studies are needed to assess the circulation of HEV-3a in pigs and humans in this country. In addition, along with previous findings, this study supports the need for increased information to the public on the risk of HEV infection through contacts with pigs or consumption of pig-derived products in France. © 2015 Blackwell Verlag GmbH.

  14. Genomic insights from whole genome sequencing of four clonal outbreak Campylobacter jejuni assessed within the global C. jejuni population.

    PubMed

    Clark, Clifford G; Berry, Chrystal; Walker, Matthew; Petkau, Aaron; Barker, Dillon O R; Guan, Cai; Reimer, Aleisha; Taboada, Eduardo N

    2016-12-03

    Whole genome sequencing (WGS) is useful for determining clusters of human cases, investigating outbreaks, and defining the population genetics of bacteria. It also provides information about other aspects of bacterial biology, including classical typing results, virulence, and adaptive strategies of the organism. Cell culture invasion and protein expression patterns of four related multilocus sequence type 21 (ST21) C. jejuni isolates from a significant Canadian water-borne outbreak were previously associated with the presence of a CJIE1 prophage. Whole genome sequencing was used to examine the genetic diversity among these isolates and confirm that previous observations could be attributed to differential prophage carriage. Moreover, we sought to determine the presence of genome sequences that could be used as surrogate markers to delineate outbreak-associated isolates. Differential carriage of the CJIE1 prophage was identified as the major genetic difference among the four outbreak isolates. High quality single-nucleotide variant (hqSNV) and core genome multilocus sequence typing (cgMLST) clustered these isolates within expanded datasets consisting of additional C. jejuni strains. The number and location of homopolymeric tract regions were identical in all four outbreak isolates but differed from all other C. jejuni examined. Comparative genomics and PCR amplification enabled the identification of large chromosomal inversions of approximately 93 kb and 388 kb within the outbreak isolates associated with transducer-like proteins containing long nucleotide repeat sequences. The 93-kb inversion was characteristic of the outbreak-associated isolates, and the gene content of this inverted region displayed high synteny with the reference strain. The four outbreak isolates were clonally derived and differed mainly in the presence of the CJIE1 prophage, validating earlier findings linking the prophage to phenotypic differences in virulence assays and protein expression. The identification of large, genetically syntenous chromosomal inversions in the genomes of outbreak-associated isolates provided a unique method for discriminating outbreak isolates from the background population. Transducer-like proteins appear to be associated with the chromosomal inversions. CgMLST and hqSNV analysis also effectively delineated the outbreak isolates within the larger C. jejuni population structure.

  15. Deletion endpoint allele-specificity in the developmentally regulated elimination of an internal sequence (IES) in Paramecium.

    PubMed Central

    Dubrana, K; Le Mouël, A; Amar, L

    1997-01-01

    Ciliated protozoa undergo thousands of site-specific DNA deletion events during the programmed development of micronuclear genomes to macronuclear genomes. Two deletion elements, W1 and W2, were identified in the Paramecium primaurelia wild-type 156 strain. Here, we report the characterization of both elements in wild-type strain 168 and show that they display variant deletion patterns when compared with those of strain 156. The W1(168) element is defective for deletion. The W2(168) element is excised utilizing two alternative boundaries on one side, both of which differ from the boundary utilized to excise the W2(156) element. By crossing the 156 and 168 strains, we demonstrate that the definition of each deletion endpoint is controlled by cis-acting determinant(s) rather than by strain-specific trans-acting factor(s). Sequence comparison of all deleted DNA segments indicates that the 5'-TA-3' terminal sequence is strictly required at their ends. Furthermore, the identity of the first eight base pairs of these ends to a previously established consensus sequence correlates with the frequency of the corresponding deletion events. Our data imply the existence of an adaptive convergent evolution of these Paramecium deleted DNA segment end sequences. PMID:9171098

  16. Completion of full length genome sequence of novel avian paramyxovirus strain APMV/Shimane67 isolated from migratory wild geese in Japan.

    PubMed

    Yamamoto, Eiji; Ito, Toshihiro; Ito, Hiroshi

    2016-11-01

    The nucleotide sequences of nucleocapsid protein (N); phosphoprotein (P); matrix protein (M); hemagglutinin-neuraminidase (HN); and large polymerase protein (L) genes, 3'-end leader, 5'-end trailer and intergenic regions of the avian paramyxovirus (APMV) strain goose/Shimane/67/2000 (APMV/Shimane67) were determined. Together with previously reported data on the fusion protein (F) gene sequence [46], the determination of the genome sequence of APMV/Shimane67 has been completed in this study. The genome of APMV/Shimane67 is 16,146 nucleotides in length and contains six genes in the order of 3'-N-P-M-F-HN-L-5'. The features of the APMV/Shimane67 genome (e.g., nucleotide length of whole genome and each of the six genes, and predicted amino acid length of each of the six genes) were distinct from those of other APMV serotypes. Phylogenetic analysis indicated that although APMV/Shimane67 was grouped with APMV-1, -9 and -12, the evolutionary distance between APMV/Shimane67 and these viruses was longer than that observed between intra-serotype viruses. These results show that the genome sequence of APMV/Shimane67 contains specific characteristics and is distinguishable from other types of APMV.

  17. About the Concept of Angle in Elementary School: Misconceptions and Teaching Sequences

    ERIC Educational Resources Information Center

    Devichi, Claude; Munier, Valerie

    2013-01-01

    This paper reports classroom research dealing with the difficulties encountered by schoolchildren in the acquisition of angle concept. Two obstacles were pointed out in previous studies: the side-length obstacle and the salience of the prototypical right angle. The first aim of the present study is to determine the extent to which a teaching…

  18. A single base pair in the right terminal domain of Tomato planta macho viroid is a virulence determinant factor on tomato

    USDA-ARS?s Scientific Manuscript database

    Tomato planta macho viroid (TPMVd), including isolates previously designated as Mexican papita viroid (MPVd), causes serious disease on tomatoes in North America. Two predominant variants, sharing 93.8% sequence identity, incited distinct severe (MPVd-S) or mild (MPVd-M) symptoms on tomato. To ide...

  19. Investigation of the protein osteocalcin of Camelops hesternus: Sequence, structure and phylogenetic implications

    NASA Astrophysics Data System (ADS)

    Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.

    2007-12-01

    Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel (Camelus bactrianus); dromedary camel (Camelus dromedarius) and guanaco (Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.

  20. Whole exome sequencing for determination of tumor mutation load in liquid biopsy from advanced cancer patients.

    PubMed

    Koeppel, Florence; Blanchard, Steven; Jovelet, Cécile; Genin, Bérengère; Marcaillou, Charles; Martin, Emmanuel; Rouleau, Etienne; Solary, Eric; Soria, Jean-Charles; André, Fabrice; Lacroix, Ludovic

    2017-01-01

    Tumor mutation load (TML) has been proposed as a biomarker of patient response to immunotherapy in several studies. TML is usually determined by tumor biopsy DNA (tDNA) whole exome sequencing (WES), therefore TML evaluation is limited by informative biopsy availability. Circulating cell free DNA (cfDNA) provided by liquid biopsy is a surrogate specimen to biopsy for molecular profiling. Nevertheless performing WES on DNA from plasma is technically challenging and the ability to determine tumor mutation load from liquid biopsies remains to be demonstrated. In the current study, WES was performed on cfDNA from 32 metastatic patients of various cancer types included into MOSCATO 01 (NCT01566019) and/or MATCHR (NCT02517892) molecular triage trials. Results from targeted gene sequencing (TGS) and WES performed on cfDNA were compared to results from tumor tissue biopsy. In cfDNA samples, WES mutation detection sensitivity was 92% compared to targeted sequencing (TGS). When comparing cfDNA-WES to tDNA-WES, mutation detection sensitivity was 53%, consistent with previously published prospective study comparing cfDNA-TGS to tDNA-TGS. For samples in which presence of tumor DNA was confirmed in cfDNA, tumor mutation load from liquid biopsy was correlated with tumor biopsy. Taken together, this study demonstrated that liquid biopsy may be applied to determine tumor mutation load. Qualification of liquid biopsy for interpretation is a crucial point to use cfDNA for mutational load estimation.

  1. Whole exome sequencing for determination of tumor mutation load in liquid biopsy from advanced cancer patients

    PubMed Central

    Blanchard, Steven; Jovelet, Cécile; Genin, Bérengère; Marcaillou, Charles; Martin, Emmanuel; Rouleau, Etienne; Solary, Eric; Soria, Jean-Charles; André, Fabrice; Lacroix, Ludovic

    2017-01-01

    Tumor mutation load (TML) has been proposed as a biomarker of patient response to immunotherapy in several studies. TML is usually determined by tumor biopsy DNA (tDNA) whole exome sequencing (WES), therefore TML evaluation is limited by informative biopsy availability. Circulating cell free DNA (cfDNA) provided by liquid biopsy is a surrogate specimen to biopsy for molecular profiling. Nevertheless performing WES on DNA from plasma is technically challenging and the ability to determine tumor mutation load from liquid biopsies remains to be demonstrated. In the current study, WES was performed on cfDNA from 32 metastatic patients of various cancer types included into MOSCATO 01 (NCT01566019) and/or MATCHR (NCT02517892) molecular triage trials. Results from targeted gene sequencing (TGS) and WES performed on cfDNA were compared to results from tumor tissue biopsy. In cfDNA samples, WES mutation detection sensitivity was 92% compared to targeted sequencing (TGS). When comparing cfDNA-WES to tDNA-WES, mutation detection sensitivity was 53%, consistent with previously published prospective study comparing cfDNA-TGS to tDNA-TGS. For samples in which presence of tumor DNA was confirmed in cfDNA, tumor mutation load from liquid biopsy was correlated with tumor biopsy. Taken together, this study demonstrated that liquid biopsy may be applied to determine tumor mutation load. Qualification of liquid biopsy for interpretation is a crucial point to use cfDNA for mutational load estimation. PMID:29161279

  2. Axolotl hemoglobin: cDNA-derived amino acid sequences of two alpha globins and a beta globin from an adult Ambystoma mexicanum.

    PubMed

    Shishikura, Fumio; Takeuchi, Hiro-aki; Nagai, Takatoshi

    2005-11-01

    Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.

  3. Accounting for biases in riboprofiling data indicates a major role for proline in stalling translation.

    PubMed

    Artieri, Carlo G; Fraser, Hunter B

    2014-12-01

    The recent advent of ribosome profiling (sequencing of short ribosome-bound fragments of mRNA) has offered an unprecedented opportunity to interrogate the sequence features responsible for modulating translational rates. Nevertheless, numerous analyses of the first riboprofiling data set have produced equivocal and often incompatible results. Here we analyze three independent yeast riboprofiling data sets, including two with much higher coverage than previously available, and find that all three show substantial technical sequence biases that confound interpretations of ribosomal occupancy. After accounting for these biases, we find no effect of previously implicated factors on ribosomal pausing. Rather, we find that incorporation of proline, whose unique side-chain stalls peptide synthesis in vitro, also slows the ribosome in vivo. We also reanalyze a method that implicated positively charged amino acids as the major determinant of ribosomal stalling and demonstrate that it produces false signals of stalling in low-coverage data. Our results suggest that any analysis of riboprofiling data should account for sequencing biases and sparse coverage. To this end, we establish a robust methodology that enables analysis of ribosome profiling data without prior assumptions regarding which positions spanned by the ribosome cause stalling. © 2014 Artieri and Fraser; Published by Cold Spring Harbor Laboratory Press.

  4. Protein and gene structure of a blue laccase from Pleurotus ostreatus.

    PubMed Central

    Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G

    1999-01-01

    A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329

  5. Characterization and complete genome sequence of a previously uncharacterized panicovirus from Bermuda grass detected by high throughput sequencing

    USDA-ARS?s Scientific Manuscript database

    Bermuda grass samples were examined by transmission electron microscopy and 28-30 nm spherical virus particles were observed. Total RNA from these plants was subjected to high throughput sequencing (HTS). The nearly full genome sequence of a previously uncharacterized Panicovirus was identified from...

  6. Structure and Genetic Content of the Megaplasmids of Neurotoxigenic Clostridium butyricum Type E Strains from Italy

    PubMed Central

    Iacobino, Angelo; Scalfaro, Concetta; Franciosa, Giovanna

    2013-01-01

    We determined the genetic maps of the megaplasmids of six neurotoxigenic Clostridium butyricum type E strains from Italy using molecular and bioinformatics techniques. The megaplasmids are circular, not linear as we had previously proposed. The differently-sized megaplasmids share a genetic region that includes structural, metabolic and regulatory genes. In addition, we found that a 168 kb genetic region is present only in the larger megaplasmids of two tested strains, whereas it is absent from the smaller megaplasmids of the four remaining strains. The genetic region unique to the larger megaplasmids contains, among other features, a locus for clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR associated (cas) genes, i.e. a bacterial adaptive immune system providing sequence-specific protection from invading genetic elements. Some CRISPR spacer sequences of the neurotoxigenic C. butyricum type E strains showed homology to prophage, phage and plasmid sequences from closely related clostridia species or from distant species, all sharing the intestinal habitat, suggesting that the CRISPR locus might be involved in the microorganism adaptation to the human or animal intestinal environment. Besides, we report here that each of four distinct CRISPR spacers partially matched DNA sequences of different prophages and phages, at identical nucleotide locations. This suggests that, at least in neurotoxigenic C. butyricum type E, the CRISPR locus is potentially able to recognize the same conserved DNA sequence of different invading genetic elements, besides targeting sequences unique to previously encountered invading DNA, as currently predicted for a CRISPR locus. Thus, the results of this study introduce the possibility that CRISPR loci can provide resistance to a wider range of invading DNA elements than previously appreciated. Whether it is more advantageous for the peculiar neurotoxigenic C. butyricum type E strains to maintain or to lose the CRISPR-cas system remains an open question. PMID:23967192

  7. Evolution and spread of Ebola virus in Liberia, 2014–2015

    PubMed Central

    Ladner, Jason T.; Wiley, Michael R.; Mate, Suzanne; Dudas, Gytis; Prieto, Karla; Lovett, Sean; Nagle, Elyse R.; Beitzel, Brett; Gilbert, Merle L.; Fakoli, Lawrence; Diclaro, Joseph W.; Schoepp, Randal J.; Fair, Joseph; Kuhn, Jens H.; Hensley, Lisa E.; Park, Daniel J.; Sabeti, Pardis C.; Rambaut, Andrew; Sanchez-Lockhart, Mariano; Bolay, Fatorma K.; Kugelman, Jeffrey R.; Palacios, Gustavo

    2015-01-01

    SUMMARY The 2013–present Western African Ebola virus disease (EVD) outbreak is the largest ever recorded with >28,000 reported cases. Ebola virus (EBOV) genome sequencing has played an important role throughout this outbreak; however, relatively few sequences have been determined from patients in Liberia, the second worst-affected country. Here, we report 140 EBOV genome sequences from the second wave of the Liberian outbreak and analyze them in combination with 782 previously published sequences from throughout the Western African outbreak. While multiple early introductions of EBOV to Liberia are evident, the majority of Liberian EVD cases are consistent with a single introduction, followed by spread and diversification within the country. Movement of the virus within Liberia was widespread and reintroductions from Liberia served as an important source for the continuation of the already ongoing EVD outbreak in Guinea. Overall, little evidence was found for incremental adaptation of EBOV to the human host. PMID:26651942

  8. Relationships in subtribe Diocleinae (Leguminosae; Papilionoideae) inferred from internal transcribed spacer sequences from nuclear ribosomal DNA.

    PubMed

    Varela, Eduardo S; Lima, João P M S; Galdino, Alexsandro S; Pinto, Luciano da S; Bezerra, Walderly M; Nunes, Edson P; Alves, Maria A O; Grangeiro, Thalles B

    2004-01-01

    The complete sequences of nuclear ribosomal DNA (nrDNA) internal transcribed spacer regions (ITS/5.8S) were determined for species belonging to six genera from the subtribe Diocleinae as well as for the anomalous genera Calopogonium and Pachyrhizus. Phylogenetic trees constructed by distance matrix, maximum parsimony and maximum likelihood methods showed that Calopogonium and Pachyrhizus were outside the clade Diocleinae (Canavalia, Camptosema, Cratylia, Dioclea, Cymbosema, and Galactia). This finding supports previous morphological, phytochemical, and molecular evidence that Calopogonium and Pachyrhizus do not belong to the subtribe Diocleinae. Within the true Diocleinae clade, the clustering of genera and species was congruent with morphology-based classifications, suggesting that ITS/5.8S sequences can provide enough informative sites to allow resolution below the genus level. This is the first evidence of the phylogeny of subtribe Diocleinae based on nuclear DNA sequences.

  9. Sequence harmony: detecting functional specificity from alignments

    PubMed Central

    Feenstra, K. Anton; Pirovano, Walter; Krab, Klaas; Heringa, Jaap

    2007-01-01

    Multiple sequence alignments are often used for the identification of key specificity-determining residues within protein families. We present a web server implementation of the Sequence Harmony (SH) method previously introduced. SH accurately detects subfamily specific positions from a multiple alignment by scoring compositional differences between subfamilies, without imposing conservation. The SH web server allows a quick selection of subtype specific sites from a multiple alignment given a subfamily grouping. In addition, it allows the predicted sites to be directly mapped onto a protein structure and displayed. We demonstrate the use of the SH server using the family of plant mitochondrial alternative oxidases (AOX). In addition, we illustrate the usefulness of combining sequence and structural information by showing that the predicted sites are clustered into a few distinct regions in an AOX homology model. The SH web server can be accessed at www.ibi.vu.nl/programs/seqharmwww. PMID:17584793

  10. Nucleotide sequencing analysis of a LEU gene of Candida maltosa which complements leuB mutation of Escherichia coli and leu2 mutation of Saccharomyces cerevisiae.

    PubMed

    Takagi, M; Kobayashi, N; Sugimoto, M; Fujii, T; Watari, J; Yano, K

    1987-01-01

    The expression of a LEU gene from Candida maltosa (designated as C-LEU2) isolated previously (Kawamura et al. 1983) was shown to be regulated, when transferred into Saccharomyces cerevisiae, by leucine and threonine in the medium, as in the case of LEU2 gene of S. cerevisiae. The coding region together with the regulatory region was subcloned and the nucleotide sequence was determined. When the sequence of the coding region was compared with that of LEU2, the homology was 72% for base pairs and 76% for deduced amino acids. Comparison of the regulatory region of C-LEU2 with those of LEU1 and LEU2 suggested a few short consensus sequences which are involved in regulation of gene expression by leucine and threonine in the medium.

  11. Diversity and phylogeography of begomovirus-associated beta satellites of okra in India

    PubMed Central

    2011-01-01

    Background Okra (Abelmoschus esculentus; family Malvaceae) is grown in temperate as well as subtropical regions of the world, both for human consumption as a vegetable and for industrial uses. Okra yields are affected by the diseases caused by phytopathogenic viruses. India is the largest producer of okra and in this region viruses of the genus Begomovirus are a major biotic constraint to production. Begomoviruses affecting okra across the Old World are associated with specific, symptom modulating satellites (beta satellites). We describe a comprehensive analysis of the diversity of beta satellites associated with okra in India. Results The full-length sequences of 36 beta satellites, isolated from okra exhibiting typical begomovirus symptoms (leaf curl and yellow vein), were determined. The sequences segregated into four groups. Two groups correspond to the beta satellites Okra leaf curl beta satellite (OLCuB) and Bhendi yellow vein beta satellite (BYVB) that have previously been identified in okra from the sub-continent. One sequence was distinct from all other, previously isolated beta satellites and represents a new species for which we propose the name Bhendi yellow vein India beta satellite (BYVIB). This new beta satellite was nevertheless closely related to BYVB and OLCuB. Most surprising was the identification of Croton yellow vein mosaic beta satellite (CroYVMB) in okra; a beta satellite not previously identified in a malvaceous plant species. The okra beta satellites were shown to have distinct geographic host ranges with BYVB occurring across India whereas OLCuB was only identified in northwestern India. Okra infections with CroYVMB were only identified across the northern and eastern central regions of India. A more detailed analysis of the sequences showed that OLCuB, BYVB and BYVIB share the highest identity with respect to the βC1 gene. βC1 is the only gene encoded by beta satellites, the product of which is the major pathogenicity determinant of begomovirus-beta satellite complexes and is involved in overcoming host defenses based on RNAi. Conclusion The diversity of beta satellites in okra across the sub-continent is higher than previously realized and is higher than for any other malvaceous plant species so far analyzed. The beta satellites identified in okra show geographic segregation, which has implications for the development and introduction of resistant okra varieties. However, the finding that the βC1 genes of the major okra beta satellites (OLCuB, BYVB and BYVIB) share high sequence identity provides a possible avenue to achieve broad-spectrum resistance. PMID:22188644

  12. Linking soil biology and chemistry in biological soil crust using isolate exometabolomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Swenson, Tami L.; Karaoz, Ulas; Swenson, Joel M.

    Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). For this study, we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negatively correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.

  13. Linking soil biology and chemistry in biological soil crust using isolate exometabolomics

    DOE PAGES

    Swenson, Tami L.; Karaoz, Ulas; Swenson, Joel M.; ...

    2018-01-02

    Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). For this study, we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negatively correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.

  14. Connecting the dots between genes, biochemistry, and disease susceptibility: systems biology modeling in human genetics.

    PubMed

    Moore, Jason H; Boczko, Erik M; Summar, Marshall L

    2005-02-01

    Understanding how DNA sequence variations impact human health through a hierarchy of biochemical and physiological systems is expected to improve the diagnosis, prevention, and treatment of common, complex human diseases. We have previously developed a hierarchical dynamic systems approach based on Petri nets for generating biochemical network models that are consistent with genetic models of disease susceptibility. This modeling approach uses an evolutionary computation approach called grammatical evolution as a search strategy for optimal Petri net models. We have previously demonstrated that this approach routinely identifies biochemical network models that are consistent with a variety of genetic models in which disease susceptibility is determined by nonlinear interactions between two or more DNA sequence variations. We review here this approach and then discuss how it can be used to model biochemical and metabolic data in the context of genetic studies of human disease susceptibility.
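
    The Petri net formalism mentioned in this abstract can be illustrated with a minimal token-firing sketch. This is not the authors' hierarchical model or their grammatical evolution search; the place names, the single transition, and the arc weights below are hypothetical and chosen only to show how a transition consumes and produces tokens.

```python
# Minimal Petri net sketch (hypothetical places and transition, for illustration only).

def enabled(marking, transition):
    """A transition is enabled when every input place holds enough tokens."""
    return all(marking.get(place, 0) >= weight
               for place, weight in transition["inputs"].items())

def fire(marking, transition):
    """Return the new marking after firing; the input marking is not modified."""
    if not enabled(marking, transition):
        raise ValueError("transition is not enabled")
    new_marking = dict(marking)
    for place, weight in transition["inputs"].items():
        new_marking[place] -= weight
    for place, weight in transition["outputs"].items():
        new_marking[place] = new_marking.get(place, 0) + weight
    return new_marking

# Hypothetical reaction: substrate + enzyme -> product + enzyme
catalysis = {
    "inputs": {"substrate": 1, "enzyme": 1},
    "outputs": {"product": 1, "enzyme": 1},
}

marking = {"substrate": 2, "enzyme": 1, "product": 0}
while enabled(marking, catalysis):
    marking = fire(marking, catalysis)
```

    In this toy net the enzyme token is returned on every firing, so the loop stops only when the substrate place is empty.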

  15. Spatial constraints govern competition of mutant clones in human epidermis.

    PubMed

    Lynch, M D; Lynch, C N S; Craythorne, E; Liakath-Ali, K; Mallipeddi, R; Barker, J N; Watt, F M

    2017-10-24

    Deep sequencing can detect somatic DNA mutations in tissues permitting inference of clonal relationships. This has been applied to human epidermis, where sun exposure leads to the accumulation of mutations and an increased risk of skin cancer. However, previous studies have yielded conflicting conclusions about the relative importance of positive selection and neutral drift in clonal evolution. Here, we sequenced larger areas of skin than previously, focusing on cancer-prone skin spanning five decades of life. The mutant clones identified were too large to be accounted for solely by neutral drift. Rather, using mathematical modelling and computational lattice-based simulations, we show that observed clone size distributions can be explained by a combination of neutral drift and stochastic nucleation of mutations at the boundary of expanding mutant clones that have a competitive advantage. These findings demonstrate that spatial context and cell competition cooperate to determine the fate of a mutant stem cell.

  16. Linking soil biology and chemistry in biological soil crust using isolate exometabolomics.

    PubMed

    Swenson, Tami L; Karaoz, Ulas; Swenson, Joel M; Bowen, Benjamin P; Northen, Trent R

    2018-01-02

    Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). Here we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negatively correlated with the abundance of the isolate's closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust.

  17. Sequencing of adenine in DNA by scanning tunneling microscopy

    NASA Astrophysics Data System (ADS)

    Tanaka, Hiroyuki; Taniguchi, Masateru

    2017-08-01

    The development of DNA sequencing technology utilizing the detection of a tunnel current is important for next-generation sequencer technologies based on single-molecule analysis technology. Using a scanning tunneling microscope, we previously reported that dI/dV measurements and dI/dV mapping revealed that the guanine base (purine base) of DNA adsorbed onto the Cu(111) surface has a characteristic peak at Vs = -1.6 V. If, in addition to guanine, the other purine base of DNA, namely, adenine, can be distinguished, then by reading all the purine bases of each single strand of a DNA double helix, the entire base sequence of the original double helix can be determined due to the complementarity of the DNA base pair. Therefore, the ability to read adenine is important from the viewpoint of sequencing. Here, we report on the identification of adenine by STM topographic and spectroscopic measurements using a synthetic DNA oligomer and viral DNA.
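
    The complementarity argument in this abstract can be sketched in a few lines: if only purines (A and G) are readable on each strand, every base pair still contributes exactly one purine, so the two purine-only reads together fix the whole sequence. The function names and the example sequence below are hypothetical, and the sketch assumes error-free reads that are already in register, which a real STM measurement would not guarantee.

```python
# Hypothetical illustration: rebuild a duplex sequence from purine-only reads.
COMPLEMENT = {"A": "T", "G": "C", "C": "G", "T": "A"}

def purine_read(strand):
    """Mask pyrimidines, keeping only what a purine-only readout would see."""
    return "".join(base if base in "AG" else "-" for base in strand)

def reverse_complement(strand):
    """Return the opposite strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(strand))

def reconstruct(top_purines, bottom_purines):
    """Combine purine reads of both strands (each written 5'->3')."""
    # Reverse the bottom read so index i pairs with index i of the top strand.
    aligned_bottom = bottom_purines[::-1]
    sequence = []
    for top, bottom in zip(top_purines, aligned_bottom):
        if top in "AG":
            sequence.append(top)                 # purine read directly on the top strand
        elif bottom in "AG":
            sequence.append(COMPLEMENT[bottom])  # pyrimidine inferred from its partner
        else:
            raise ValueError("no purine read at this base pair")
    return "".join(sequence)

top = "GATTACAGC"  # made-up example sequence
bottom = reverse_complement(top)
recovered = reconstruct(purine_read(top), purine_read(bottom))
```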

  18. Cassini Imaging Science: First Results at Saturn

    NASA Astrophysics Data System (ADS)

    Porco, C. C.

    The Cassini Imaging Science experiment at Saturn will commence in early February, 2004 -- five months before Cassini's arrival at Saturn. Approach observations consist of repeated multi-spectral `movie' sequences of Saturn and its rings, image sequences designed to search for previously unseen satellites between the outer edge of the ring system and the orbit of Hyperion, images of known satellites for orbit refinement, observations of Phoebe during Cassini's closest approach to the satellite, and repeated multi-spectral `movie' sequences of Titan to detect and track clouds (for wind determination) and to sense the surface. During Saturn Orbit Insertion, the highest resolution images (~ 100 m) obtained during the whole orbital tour will be collected of the dark side of the rings. Finally, imaging sequences are planned for Cassini's first Titan flyby, on July 2, from a distance of ~ 350,000 km, yielding an image scale of ~ 2.1 km on the South polar region. The highlights of these observation sequences will be presented.

  19. Phylogenetic relationships among the major lineages of the birds-of-paradise (Paradisaeidae) using mitochondrial DNA gene sequences.

    PubMed

    Nunn, G B; Cracraft, J

    1996-06-01

    Complete mitochondrial cytochrome b gene sequences were determined from 12 species of the Australo-Papuan birds-of-paradise (Paradisaeidae) representing 9 genera. Phylogenetic analysis of these and 5 previously published sequences reveals a radiation of the main paradisaeinine lineages that took place over a relatively short evolutionary time scale. The core paradisaeinines are resolved as the monophyletic sister-group to the crow-like manucodines. The genus Parotia is basal to other paradisaeinines and is not closely related to the morphologically similar genera Ptiloris and Lophorina. Three major clades within the paradisaeinine ingroup include: (1) Cicinnurus and Diphyllodes, (2) Ptiloris and Lophorina, and (3) the genus Paradisaea. The monotypic genus Seleucidis is apparently closely related to clades (1) and (2). Cytochrome b sequences did not provide evidence for the monophyly of the sicklebill genera Epimachus and Drepanornis. The paradisaeid tree is characterized by short internodal distances. Thus, some clades cannot be strongly resolved by cytochrome b sequences alone.

  20. Large-scale deletions of the ABCA1 gene in patients with hypoalphalipoproteinemia.

    PubMed

    Dron, Jacqueline S; Wang, Jian; Berberich, Amanda J; Iacocca, Michael A; Cao, Henian; Yang, Ping; Knoll, Joan; Tremblay, Karine; Brisson, Diane; Netzer, Christian; Gouni-Berthold, Ioanna; Gaudet, Daniel; Hegele, Robert A

    2018-06-04

    Copy-number variations (CNVs) have been studied in the context of familial hypercholesterolemia but have not yet been evaluated in patients with extremes of high-density lipoprotein (HDL) cholesterol levels. We evaluated targeted next-generation sequencing data from patients with very low HDL cholesterol (i.e. hypoalphalipoproteinemia) using the VarSeq-CNV caller algorithm to screen for CNVs disrupting the ABCA1, LCAT or APOA1 genes. In four individuals, we found three unique deletions in ABCA1: a heterozygous deletion of exon 4, a heterozygous deletion spanning exons 8 to 31, and a heterozygous deletion of the entire ABCA1 gene. Breakpoints were identified using Sanger sequencing, and the full-gene deletion was also confirmed using exome sequencing and the Affymetrix CytoScan™ HD Array. Before now, large-scale deletions in candidate HDL genes have not been associated with hypoalphalipoproteinemia; our findings indicate that CNVs in ABCA1 may be a previously unappreciated genetic determinant of low HDL cholesterol levels. By coupling bioinformatic analyses with next-generation sequencing data, we can successfully assess the spectrum of genetic determinants of many dyslipidemias, now including hypoalphalipoproteinemia. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.

  1. Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.

    PubMed

    Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K

    2000-07-01

    A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.

  2. Novel Inhibitor Cystine Knot Peptides from Momordica charantia

    PubMed Central

    Clark, Richard J.; Tang, Jun; Zeng, Guang-Zhi; Franco, Octavio L.; Cantacessi, Cinzia; Craik, David J.; Daly, Norelle L.; Tan, Ning-Hua

    2013-01-01

    Two new peptides, MCh-1 and MCh-2, along with three known trypsin inhibitors (MCTI-I, MCTI-II and MCTI-III), were isolated from the seeds of the tropical vine Momordica charantia. The sequences of the peptides were determined using mass spectrometry and NMR spectroscopy. Using a strategy involving partial reduction and stepwise alkylation of the peptides, followed by enzymatic digestion and tandem mass spectrometry sequencing, the disulfide connectivity of MCh-1 was elucidated to be CysI-CysIV, CysII-CysV and CysIII-CysVI. The three-dimensional structures of MCh-1 and MCh-2 were determined using NMR spectroscopy and found to contain the inhibitor cystine knot (ICK) motif. The sequences of the novel peptides differ significantly from peptides previously isolated from this plant. Therefore, this study expands the known peptide diversity in M. charantia and the range of sequences that can be accommodated by the ICK motif. Furthermore, we show that a stable two-disulfide intermediate is involved in the oxidative folding of MCh-1. This disulfide intermediate is structurally homologous to the proposed ancestral fold of ICK peptides, and provides a possible pathway for the evolution of this structural motif, which is highly prevalent in nature. PMID:24116036

  3. Second generation noninvasive fetal genome analysis reveals de novo mutations, single-base parental inheritance, and preferred DNA ends

    PubMed Central

    Chan, K. C. Allen; Jiang, Peiyong; Sun, Kun; Cheng, Yvonne K. Y.; Tong, Yu K.; Cheng, Suk Hang; Wong, Ada I. C.; Hudecova, Irena; Leung, Tak Y.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis

    2016-01-01

    Plasma DNA obtained from a pregnant woman was sequenced to a depth of 270× haploid genome coverage. Comparing the maternal plasma DNA sequencing data with the parental genomic DNA data and using a series of bioinformatics filters, fetal de novo mutations were detected at a sensitivity of 85% and a positive predictive value of 74%. These results represent a 169-fold improvement in the positive predictive value over previous attempts. Improvements in the interpretation of the sequence information of every base position in the genome allowed us to interrogate the maternal inheritance of the fetus for 618,271 of 656,676 (94.2%) heterozygous SNPs within the maternal genome. The fetal genotype at each of these sites was deduced individually, unlike previously, where the inheritance was determined for a collection of sites within a haplotype. These results represent a 90-fold enhancement in the resolution in determining the fetus’s maternal inheritance. Selected genomic locations were more likely to be found at the ends of plasma DNA molecules. We found that a subset of such preferred ends exhibited selectivity for fetal- or maternal-derived DNA in maternal plasma. The ratio of the number of maternal plasma DNA molecules with fetal preferred ends to those with maternal preferred ends showed a correlation with the fetal DNA fraction. Finally, this second generation approach for noninvasive fetal whole-genome analysis was validated in a pregnancy diagnosed with cardiofaciocutaneous syndrome with maternal plasma DNA sequenced to 195× coverage. The causative de novo BRAF mutation was successfully detected through the maternal plasma DNA analysis. PMID:27799561
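
    The detection metrics quoted in this abstract follow standard definitions, sketched below. The SNP totals are the ones given in the abstract; the true-positive, false-negative and false-positive counts are invented purely to illustrate the formulas and are not taken from the study.

```python
# Standard definitions of sensitivity and positive predictive value (PPV).

def sensitivity(true_pos, false_neg):
    """Fraction of real mutations that were detected."""
    return true_pos / (true_pos + false_neg)

def ppv(true_pos, false_pos):
    """Fraction of reported mutations that are real."""
    return true_pos / (true_pos + false_pos)

# Heterozygous maternal SNPs for which fetal inheritance was interrogated
# (both numbers are from the abstract).
interrogated, total = 618_271, 656_676
coverage_pct = 100 * interrogated / total

# Invented counts, chosen only so the rates land near the reported 85% and 74%.
example_sensitivity = sensitivity(true_pos=85, false_neg=15)
example_ppv = ppv(true_pos=85, false_pos=30)
```

    Rounded to one decimal place, the interrogated fraction reproduces the 94.2% stated in the abstract.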

  4. Identification of Y-Chromosome Sequences in Turner Syndrome.

    PubMed

    Silva-Grecco, Roseane Lopes da; Trovó-Marqui, Alessandra Bernadete; Sousa, Tiago Alves de; Croce, Lilian Da; Balarin, Marly Aparecida Spadotto

    2016-05-01

    To investigate the presence of Y-chromosome sequences and determine their frequency in patients with Turner syndrome. The study included 23 patients with Turner syndrome from Brazil, who gave written informed consent for participating in the study. Cytogenetic analyses were performed in peripheral blood lymphocytes, with 100 metaphases per patient. Genomic DNA was also extracted from peripheral blood lymphocytes, and gene sequences DYZ1, DYZ3, ZFY and SRY were amplified by Polymerase Chain Reaction. The cytogenetic analysis showed a 45,X karyotype in 9 patients (39.2 %) and a mosaic pattern in 14 (60.8 %). In 8.7 % (2 out of 23) of the patients, Y-chromosome sequences were found. This prevalence is very similar to those reported previously. The initial karyotype analysis of these patients did not reveal Y-chromosome material, but they were found positive for Y-specific sequences in the lymphocyte DNA analysis. The PCR technique showed that 2 (8.7 %) of the patients with Turner syndrome had Y-chromosome sequences, both presenting marker chromosomes on cytogenetic analysis.

  5. Alanine-170 and proline-172 are critical determinants for extracellular CD20 epitopes; heterogeneity in the fine specificity of CD20 monoclonal antibodies is defined by additional requirements imposed by both amino acid sequence and quaternary structure.

    PubMed

    Polyak, Maria J; Deans, Julie P

    2002-05-01

    In vivo ablation of malignant B cells can be achieved using antibodies directed against the CD20 antigen. Fine specificity differences among CD20 monoclonal antibodies (mAbs) are assumed not to be a factor in determining their efficacy because evidence from antibody-blocking studies indicates limited epitope diversity with only 2 overlapping extracellular CD20 epitopes. However, in this report a high degree of heterogeneity among antihuman CD20 mAbs is demonstrated. Mutation of alanine and proline at positions 170 and 172 (AxP) (single-letter amino acid codes; x indicates the identical amino acid at the same position in the murine and human CD20 sequences) in human CD20 abrogated the binding of all CD20 mAbs tested. Introduction of AxP into the equivalent positions in the murine sequence, which is not otherwise recognized by antihuman CD20 mAbs, fully reconstituted the epitope recognized by B1, the prototypic anti-CD20 mAb. 2H7, a mAb previously thought to recognize the same epitope as B1, did not recognize the murine AxP mutant. Reconstitution of the 2H7 epitope was achieved with additional mutations replacing VDxxD in the murine sequence for INxxN (positions 162-166 in the human sequence). The integrity of the 2H7 epitope, unlike that of B1, further depends on the maintenance of CD20 in an oligomeric complex. The majority of 16 antihuman CD20 mAbs tested, including rituximab, bound to murine CD20 containing the AxP mutations. Heterogeneity in the fine specificity of these antibodies was indicated by marked differences in their ability to induce homotypic cellular aggregation and translocation of CD20 to a detergent-insoluble membrane compartment previously identified as lipid rafts.

  6. Age determination of vessel wall hematoma in spontaneous cervical artery dissection: A multi-sequence 3T Cardiovascular Magnetic resonance study

    PubMed Central

    2011-01-01

    Background Previously proposed classifications for carotid plaque and cerebral parenchymal hemorrhages are used to estimate the age of hematoma according to its signal intensities on T1w and T2w MR images. Using these classifications, we systematically investigated the value of cardiovascular magnetic resonance (CMR) in determining the age of vessel wall hematoma (VWH) in patients with spontaneous cervical artery dissection (sCAD). Methods 35 consecutive patients (mean age 43.6 ± 9.8 years) with sCAD received a cervical multi-sequence 3T CMR with fat-saturated black-blood T1w-, T2w- and TOF images. Age of sCAD was defined as time between onset of symptoms (stroke, TIA or Horner's syndrome) and the CMR scan. VWH were categorized into hyperacute, acute, early subacute, late subacute and chronic based on their signal intensities on T1w- and T2w images. Results The mean age of sCAD was 2.0, 5.8, 15.7 and 58.7 days in patients with acute, early subacute, late subacute and chronic VWH as classified by CMR (p < 0.001 for trend). Agreement was moderate between VWH types in our study and the previously proposed time scheme of signal evolution for cerebral hemorrhage, Cohen's kappa 0.43 (p < 0.001). There was a strong agreement of CMR VWH classification compared to the time scheme which was proposed for carotid intraplaque hematomas with Cohen's kappa of 0.74 (p < 0.001). Conclusions Signal intensities of VWH in sCAD vary over time and multi-sequence CMR can help to determine the age of an arterial dissection. Furthermore, findings of this study suggest that the time course of carotid hematomas differs from that of cerebral hematomas. PMID:22122756
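
    Cohen's kappa, used in this abstract to compare the CMR-based hematoma classes with the time-based schemes, corrects observed agreement for the agreement expected by chance. A minimal sketch follows; the 2x2 table is hypothetical and is not the study's data, which used more categories.

```python
# Cohen's kappa from a square agreement table (rows: rater A, columns: rater B).

def cohens_kappa(table):
    n = sum(sum(row) for row in table)
    size = len(table)
    # Observed agreement: proportion of cases on the diagonal.
    observed = sum(table[i][i] for i in range(size)) / n
    # Chance agreement: product of the marginal proportions, summed over categories.
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(size)) for j in range(size)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical counts for two categories.
example_table = [[20, 5],
                 [10, 15]]
kappa = cohens_kappa(example_table)
```

    For this made-up table the observed agreement is 0.7 and the chance agreement is 0.5, giving a kappa of 0.4.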

  7. Deep sequencing-based analysis of the anaerobic stimulon in Neisseria gonorrhoeae

    PubMed Central

    2011-01-01

    Background Maintenance of an anaerobic denitrification system in the obligate human pathogen, Neisseria gonorrhoeae, suggests that an anaerobic lifestyle may be important during the course of infection. Furthermore, mounting evidence suggests that reduction of host-produced nitric oxide has several immunomodulatory effects on the host. However, at this point there have been no studies analyzing the complete gonococcal transcriptome response to anaerobiosis. Here we performed deep sequencing to compare the gonococcal transcriptomes of aerobically and anaerobically grown cells. Using the information derived from this sequencing, we discuss the implications of the robust transcriptional response to anaerobic growth. Results We determined that 198 chromosomal genes were differentially expressed (~10% of the genome) in response to anaerobic conditions. We also observed a large induction of genes encoded within the cryptic plasmid, pJD1. Validation of RNA-seq data using translational-lacZ fusions or RT-PCR demonstrated the RNA-seq results to be very reproducible. Surprisingly, many genes of prophage origin were induced anaerobically, as well as several transcriptional regulators previously unknown to be involved in anaerobic growth. We also confirmed expression and regulation of a small RNA, likely a functional equivalent of fnrS in the Enterobacteriaceae family. We also determined that many genes found to be responsive to anaerobiosis have also been shown to be responsive to iron and/or oxidative stress. Conclusions Gonococci will be subject to many forms of environmental stress, including oxygen-limitation, during the course of infection. Here we determined that the anaerobic stimulon in gonococci was larger than previous studies would suggest. Many new targets for future research have been uncovered, and the results derived from this study may have helped to elucidate factors or mechanisms of virulence that may have otherwise been overlooked. PMID:21251255

  8. Protein-Protein Förster Resonance Energy Transfer Analysis of Nucleosome Core Particles Containing H2A and H2A.Z

    PubMed Central

    Hoch, Duane A.; Stratton, Jessica J.; Gloss, Lisa M.

    2007-01-01

    A protein-protein Förster resonance energy transfer (FRET) system, employing probes at multiple positions, was designed to specifically monitor the dissociation of the H2A-H2B dimer from the nucleosome core particle (NCP). Tryptophan donors and Cys-AEDANS acceptors were chosen because, in comparison to fluorophores used in previous NCP FRET studies, they: 1) are smaller and less hydrophobic which should minimize perturbations of histone and NCP structure; and 2) have an R0 of 20 Å, which is much less than the dimensions of the NCP (~50 Å width and ~100 Å diameter). CD and FL equilibrium protein unfolding titrations indicate that the donor and acceptor moieties have minimal effects on the stability of the H2A-H2B dimer and (H3-H4)2 tetramer. NCPs containing the various FRET pairs were reconstituted with the 601 artificial positioning DNA sequence. Equilibrium NaCl-induced dissociation of the modified NCPs showed that the 601 sequence stabilized the NCP to dimer dissociation as compared to previous studies using weaker positioning sequences. This finding implies a significant role for the H2A-H2B dimers in determining the DNA sequence dependence of NCP stability. The free energy of dissociation determined from reversible and well-defined sigmoidal transitions revealed two distinct phases reflecting the dissociation of each H2A-H2B dimer, confirming cooperativity in dimer dissociation. While cooperativity in the association/dissociation of the H2A-H2B dimers has been suggested previously, these data allow its quantitative description. The protein-protein FRET system was then used to study the effects of the histone variant H2A.Z on NCP stability; previous studies have reported both destabilizing and stabilizing effects. Comparison of the H2A and H2A.Z FRET NCP dissociation transitions suggests a slight increase in stability but a significant increase in cooperativity for dimer dissociation from H2A.Z NCPs. Thus, the utility of this protein-protein FRET system to monitor the effects of histone variants on NCP dynamics has been demonstrated, and the system appears equally well-suited for dissection of the kinetic processes of dimer association and dissociation from the NCP. PMID:17597150

  9. The role of molecular structure of sugar-phosphate backbone and nucleic acid bases in the formation of single-stranded and double-stranded DNA structures.

    PubMed

    Poltev, Valeri; Anisimov, Victor M; Danilov, Victor I; Garcia, Dolores; Sanchez, Carolina; Deriabina, Alexandra; Gonzalez, Eduardo; Rivas, Francisco; Polteva, Nina

    2014-06-01

    Our previous DFT computations of deoxydinucleoside monophosphate complexes with Na+ ions (dDMPs) have demonstrated that the main characteristics of Watson-Crick (WC) right-handed duplex families are predefined in the local energy minima of dDMPs. In this work, we study the mechanisms of contribution of chemically monotonous sugar-phosphate backbone and the bases into the double helix irregularity. Geometry optimization of sugar-phosphate backbone produces energy minima matching the WC DNA conformations. Studying the conformational variability of dDMPs in response to sequence permutation, we found that simple replacement of bases in the previously fully optimized dDMPs, e.g. by constructing Pyr-Pur from Pur-Pyr, and Pur-Pyr from Pyr-Pur sequences, while retaining the backbone geometry, automatically produces the mutual base position characteristic of the target sequence. Based on that, we infer that the directionality and the preferable regions of the sugar-phosphate torsions, combined with the difference of purines from pyrimidines in ring shape, determine the sequence dependence of the structure of WC DNA. No such sequence dependence exists in dDMPs corresponding to other DNA conformations (e.g., Z-family and Hoogsteen duplexes). Unlike other duplexes, WC helix is unique in its ability to match the local energy minima of the free single strand to the preferable conformations of the duplex. Copyright © 2013 Wiley Periodicals, Inc.

  10. A programmable method for massively parallel targeted sequencing

    PubMed Central

    Hopmans, Erik S.; Natsoulis, Georges; Bell, John M.; Grimes, Susan M.; Sieh, Weiva; Ji, Hanlee P.

    2014-01-01

    We have developed a targeted resequencing approach referred to as Oligonucleotide-Selective Sequencing. In this study, we report a series of significant improvements and novel applications of this method whereby the surface of a sequencing flow cell is modified in situ to capture specific genomic regions of interest from a sample and then sequenced. These improvements include a fully automated targeted sequencing platform through the use of a standard Illumina cBot fluidics station. Targeting optimization increased the yield of total on-target sequencing data 2-fold compared to the previous iteration, while simultaneously increasing the percentage of reads that could be mapped to the human genome. The described assays cover up to 1421 genes with a total coverage of 5.5 Megabases (Mb). We demonstrate a 10-fold abundance uniformity of greater than 90% in 1 log distance from the median and a targeting rate of up to 95%. We also sequenced continuous genomic loci up to 1.5 Mb while simultaneously genotyping SNPs and genes. Variants with low minor allele fraction were sensitively detected at levels of 5%. Finally, we determined the exact breakpoint sequence of cancer rearrangements. Overall, this approach has high performance for selective sequencing of genome targets, configuration flexibility and variant calling accuracy. PMID:24782526

  11. Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

    PubMed

    Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

    2017-06-01

    Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
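
    The idea of assembling overlapping peptide tags into a consensus sequence can be sketched with a simple greedy overlap merge. This is not the "Peptide Tag Assembler" algorithm described in the abstract, whose details are not given here; the function names, the minimum-overlap threshold, and the example tags are all hypothetical, and the sketch ignores sequencing errors and the leucine/isoleucine ambiguity.

```python
# Greedy overlap assembly of peptide tags (illustrative stand-in, not the published method).

def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b (0 if below min_len)."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0

def assemble(tags, min_len=3):
    """Repeatedly merge the pair of tags with the longest suffix/prefix overlap."""
    unique = set(tags)
    # Drop tags fully contained in another tag, then sort for deterministic behaviour.
    contigs = sorted(t for t in unique if not any(t != u and t in u for u in unique))
    while len(contigs) > 1:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_len:
                        best_len, best_i, best_j = k, i, j
        if best_len == 0:
            break  # no remaining overlaps; leave the contigs separate
        merged = contigs[best_i] + contigs[best_j][best_len:]
        contigs = [c for idx, c in enumerate(contigs) if idx not in (best_i, best_j)]
        contigs.append(merged)
    return contigs

# Made-up tags covering a made-up 16-residue sequence.
example_tags = ["YIAKQRQ", "MKTAYIAK", "QRQISFVK", "TAYI"]
assembled = assemble(example_tags)
```

    A gap in tag coverage would leave more than one contig in the result rather than a single consensus.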

  12. Horse cDNA clones encoding two MHC class I genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barbis, D.P.; Maher, J.K.; Stanek, J.

    1994-12-31

    Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding the human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.

  13. Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease.

    PubMed

    Dilliott, Allison A; Farhan, Sali M K; Ghani, Mahdi; Sato, Christine; Liang, Eric; Zhang, Ming; McIntyre, Adam D; Cao, Henian; Racacho, Lemuel; Robinson, John F; Strong, Michael J; Masellis, Mario; Bulman, Dennis E; Rogaeva, Ekaterina; Lang, Anthony; Tartaglia, Carmela; Finger, Elizabeth; Zinman, Lorne; Turnbull, John; Freedman, Morris; Swartz, Rick; Black, Sandra E; Hegele, Robert A

    2018-04-04

    Next-generation sequencing (NGS) is quickly revolutionizing how research into the genetic determinants of constitutional disease is performed. The technique is highly efficient with millions of sequencing reads being produced in a short time span and at relatively low cost. Specifically, targeted NGS is able to focus investigations to genomic regions of particular interest based on the disease of study. Not only does this further reduce costs and increase the speed of the process, but it lessens the computational burden that often accompanies NGS. Although targeted NGS is restricted to certain regions of the genome, preventing identification of potential novel loci of interest, it can be an excellent technique when faced with a phenotypically and genetically heterogeneous disease, for which there are previously known genetic associations. Because of the complex nature of the sequencing technique, it is important to closely adhere to protocols and methodologies in order to achieve sequencing reads of high coverage and quality. Further, once sequencing reads are obtained, a sophisticated bioinformatics workflow is utilized to accurately map reads to a reference genome, to call variants, and to ensure the variants pass quality metrics. Variants must also be annotated and curated based on their clinical significance, which can be standardized by applying the American College of Medical Genetics and Genomics Pathogenicity Guidelines. The methods presented herein will display the steps involved in generating and analyzing NGS data from a targeted sequencing panel, using the ONDRISeq neurodegenerative disease panel as a model, to identify variants that may be of clinical significance.

  14. Characterization of genomic sequence showing strong association with polyembryony among diverse Citrus species and cultivars, and its synteny with Vitis and Populus.

    PubMed

    Nakano, Michiharu; Shimada, Takehiko; Endo, Tomoko; Fujii, Hiroshi; Nesumi, Hirohisa; Kita, Masayuki; Ebina, Masumi; Shimizu, Tokurou; Omura, Mitsuo

    2012-02-01

    Polyembryony, in which multiple somatic nucellar cell-derived embryos develop in addition to the zygotic embryo in a seed, is common in the genus Citrus. Previous genetic studies indicated polyembryony is mainly determined by a single locus, but the underlying molecular mechanism is still unclear. As a step towards identification and characterization of the gene or genes responsible for nucellar embryogenesis in Citrus, haplotype-specific physical maps around the polyembryony locus were constructed. By sequencing three BAC clones aligned on the polyembryony haplotype, a single contiguous draft sequence consisting of 380 kb containing 70 predicted open reading frames (ORFs) was reconstructed. Single nucleotide polymorphism genotypes detected in the sequenced genomic region showed strong association with embryo type in Citrus, indicating a common polyembryony locus is shared among widely diverse Citrus cultivars and species. The arrangement of the predicted ORFs in the characterized genomic region showed high collinearity to the genomic sequence of chromosome 4 of Vitis vinifera and linkage group VI of Populus trichocarpa, suggesting that the syntenic relationship among these species is conserved even though V. vinifera and P. trichocarpa are non-apomictic species. This is the first study to characterize in detail the genomic structure of an apomixis locus determining adventitious embryony. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  15. Subset of Kappa and Lambda Germline Sequences Result in Light Chains with a Higher Molecular Mass Phenotype.

    PubMed

    Barnidge, David R; Lundström, Susanna L; Zhang, Bo; Dasari, Surendra; Murray, David L; Zubarev, Roman A

    2015-12-04

    In our previous work, we showed that electrospray ionization of intact polyclonal kappa and lambda light chains isolated from normal serum generates two distinct, Gaussian-shaped, molecular mass distributions representing the light-chain repertoire. During the analysis of a large (>100) patient sample set, we noticed a low-intensity molecular mass distribution with a mean of approximately 24 250 Da, roughly 800 Da higher than the typical kappa molecular-mass distribution mean of 23 450 Da. We also observed distinct clones in this region that did not appear to contain any typical post-translational modifications that would account for such a large mass shift. To determine the origin of the high molecular mass clones, we performed de novo bottom-up mass spectrometry on a purified IgM monoclonal light chain that had a calculated molecular mass of 24 275.03 Da. The entire sequence of the monoclonal light chain was determined using multienzyme digestion and de novo sequence-alignment software and was found to belong to the germline allele IGKV2-30. The alignment of kappa germline sequences revealed ten IGKV2 and one IGKV4 sequences that contained additional amino acids in their CDR1 region, creating the high-molecular-mass phenotype. We also performed an alignment of lambda germline sequences, which showed additional amino acids in the CDR2 region, and the FR3 region of functional germline sequences that result in a high-molecular-mass phenotype. The work presented here illustrates the ability of mass spectrometry to provide information on the diversity of light-chain molecular mass phenotypes in circulation, which reflects the germline sequences selected by the immunoglobulin-secreting B-cell population.

  16. Sequence variation of functional HTLV-II tax alleles among isolates from an endemic population: lack of evidence for oncogenic determinant in tax.

    PubMed

    Hjelle, B; Chaney, R

    1992-02-01

    Human T-cell leukemia-lymphoma virus type II (HTLV-II) has been isolated from patients with hairy cell leukemia (HCL). We previously described a population with longstanding endemic HTLV-II infection, and showed that there is no increased risk for HCL in the affected groups. We thus have direct evidence that the endemic form(s) of HTLV-II cause HCL infrequently, if at all. By comparison, there is reason to suspect that the viruses isolated from patients with HCL had an etiologic role in the disease in those patients. One way to reconcile these conflicting observations is to consider that isolates of HTLV-II might differ in oncogenic potential. To determine whether the structure of the putative oncogenic determinant of HTLV-II, tax2, might differ in the new isolates compared to the tax of the prototype HCL isolate, MO, four new functional tax cDNAs were cloned from new isolates. Sequence analysis showed only minor (0.9-2.0%) amino acid variation compared to the published sequence of MO tax2. Some codons were consistently different from published sequences of the MO virus, but in most cases, such variations were also found in each of two tax2 clones we isolated from the MO T-cell line. These variations rendered the new clones more similar to the tax1 of the pathogenic virus HTLV-I. Thus we find no evidence that pathologic determinants of HTLV-II can be assigned to the tax gene.

  17. Loss and persistence of implicit memory for sound: evidence from auditory stream segregation context effects.

    PubMed

    Snyder, Joel S; Weintraub, David M

    2013-07-01

    An important question is the extent to which declines in memory over time are due to passive loss or active interference from other stimuli. The purpose of the present study was to determine the extent to which implicit memory effects in the perceptual organization of sound sequences are subject to loss and interference. Toward this aim, we took advantage of two recently discovered context effects in the perceptual judgments of sound patterns, one that depends on stimulus features of previous sounds and one that depends on the previous perceptual organization of these sounds. The experiments measured how listeners' perceptual organization of a tone sequence (test) was influenced by the frequency separation, or the perceptual organization, of the two preceding sequences (context1 and context2). The results demonstrated clear evidence for loss of context effects over time but little evidence for interference. However, they also revealed that context effects can be surprisingly persistent. The robust effects of loss, followed by persistence, were similar for the two types of context effects. We discuss whether the same auditory memories might contain information about basic stimulus features of sounds (i.e., frequency separation), as well as the perceptual organization of these sounds.

  18. Molecular Mapping of Restriction-Site Associated DNA Markers In Allotetraploid Upland Cotton.

    PubMed

    Wang, Yangkun; Ning, Zhiyuan; Hu, Yan; Chen, Jiedan; Zhao, Rui; Chen, Hong; Ai, Nijiang; Guo, Wangzhen; Zhang, Tianzhen

    2015-01-01

    Upland cotton (Gossypium hirsutum L., 2n = 52, AADD) is an allotetraploid; therefore, the discovery of single nucleotide polymorphism (SNP) markers is difficult. The recent emergence of genome complexity reduction technologies based on the next-generation sequencing (NGS) platform has greatly expedited SNP discovery in crops with highly repetitive and complex genomes. Here we applied restriction-site associated DNA (RAD) sequencing technology for de novo SNP discovery in allotetraploid cotton. We identified 21,109 SNPs between the two parents and used these for genotyping of 161 recombinant inbred lines (RILs). Finally, a high-density linkage map comprising 4,153 loci over 3500 cM was developed based on the previous result. Using this map, quantitative trait loci (QTLs) conferring fiber strength and Verticillium Wilt (VW) resistance were mapped to a more accurate region in comparison to the 1576-cM interval determined using the simple sequence repeat (SSR) genetic map. This suggests that the newly constructed map has more power and resolution than the previous SSR map. It will pave the way for rapid marker-assisted selection in cotton breeding and for cloning of QTLs for traits of interest.

  19. The RelA/SpoT Homolog (RSH) Superfamily: Distribution and Functional Evolution of ppGpp Synthetases and Hydrolases across the Tree of Life

    PubMed Central

    Atkinson, Gemma C.; Tenson, Tanel; Hauryliuk, Vasili

    2011-01-01

    RelA/SpoT Homologue (RSH) proteins, named for their sequence similarity to the RelA and SpoT enzymes of Escherichia coli, comprise a superfamily of enzymes that synthesize and/or hydrolyze the alarmone ppGpp, activator of the “stringent” response and regulator of cellular metabolism. The classical “long” RSHs Rel, RelA and SpoT with the ppGpp hydrolase, synthetase, TGS and ACT domain architecture have been found across diverse bacteria and plant chloroplasts, while dedicated single domain ppGpp-synthesizing and -hydrolyzing RSHs have also been discovered in disparate bacteria and animals respectively. However, there is considerable confusion in terms of nomenclature and no comprehensive phylogenetic and sequence analyses have previously been carried out to classify RSHs on a genomic scale. We have performed high-throughput sensitive sequence searching of over 1000 genomes from across the tree of life, in combination with phylogenetic analyses to consolidate previous ad hoc identification of diverse RSHs in different organisms and provide a much-needed unifying terminology for the field. We classify RSHs into 30 subgroups comprising three groups: long RSHs, small alarmone synthetases (SASs), and small alarmone hydrolases (SAHs). Members of nineteen previously unidentified RSH subgroups can now be studied experimentally, including previously unknown RSHs in archaea, expanding the “stringent response” to this domain of life. We have analyzed possible combinations of RSH proteins and their domains in bacterial genomes and compared RSH content with available RSH knock-out data for various organisms to determine the rules of combining RSHs. Through comparative sequence analysis of long and small RSHs, we find exposed sites limited in conservation to the long RSHs that we propose are involved in transmitting regulatory signals. 
    Such signals may be transmitted via NTD to CTD intra-molecular interactions, or inter-molecular interactions either among individual RSH molecules or among long RSHs and other binding partners such as the ribosome. PMID:21858139

  20. Anchoring genome sequence to chromosomes of the central bearded dragon (Pogona vitticeps) enables reconstruction of ancestral squamate macrochromosomes and identifies sequence content of the Z chromosome.

    PubMed

    Deakin, Janine E; Edwards, Melanie J; Patel, Hardip; O'Meally, Denis; Lian, Jinmin; Stenhouse, Rachael; Ryan, Sam; Livernois, Alexandra M; Azad, Bhumika; Holleley, Clare E; Li, Qiye; Georges, Arthur

    2016-06-10

    Squamates (lizards and snakes) are a speciose lineage of reptiles displaying considerable karyotypic diversity, particularly among lizards. Understanding the evolution of this diversity requires comparison of genome organisation between species. Although the genomes of several squamate species have now been sequenced, only the green anole lizard has any sequence anchored to chromosomes. There is only limited gene mapping data available for five other squamates. This makes it difficult to reconstruct the events that have led to extant squamate karyotypic diversity. The purpose of this study was to anchor the recently sequenced central bearded dragon (Pogona vitticeps) genome to chromosomes to trace the evolution of squamate chromosomes. Assigning sequence to sex chromosomes was of particular interest for identifying candidate sex determining genes. By using two different approaches to map conserved blocks of genes, we were able to anchor approximately 42 % of the dragon genome sequence to chromosomes. We constructed detailed comparative maps between dragon, anole and chicken genomes, and where possible, made broader comparisons across Squamata using cytogenetic mapping information for five other species. We show that squamate macrochromosomes are relatively well conserved between species, supporting findings from previous molecular cytogenetic studies. Macrochromosome diversity between members of the Toxicofera clade has been generated by intrachromosomal, and a small number of interchromosomal, rearrangements. We reconstructed the ancestral squamate macrochromosomes by drawing upon comparative cytogenetic mapping data from seven squamate species and propose the events leading to the arrangements observed in representative species. In addition, we assigned over 8 Mbp of sequence containing 219 genes to the Z chromosome, providing a list of genes to begin testing as candidate sex determining genes. 
    Anchoring of the dragon genome has provided substantial insight into the evolution of squamate genomes, enabling us to reconstruct ancestral macrochromosome arrangements at key positions in the squamate phylogeny, demonstrating that fusions between macrochromosomes or fusions of macrochromosomes and microchromosomes, have played an important role during the evolution of squamate genomes. Assigning sequence to the sex chromosomes has identified NR5A1 as a promising candidate sex determining gene in the dragon.

  1. Molecular Variability Among Isolates of Prunus Necrotic Ringspot Virus from Different Prunus spp.

    PubMed

    Aparicio, F; Myrta, A; Di Terlizzi, B; Pallás, V

    1999-11-01

    Viral sequences amplified by polymerase chain reaction from 25 isolates of Prunus necrotic ringspot virus (PNRSV), varying in the symptomatology they cause in six different Prunus spp., were analyzed for restriction fragment polymorphisms. Most of the isolates could be discriminated by using a combination of three different restriction enzymes. The nucleotide sequences of the RNA 4 of 15 of these isolates were determined. Sequence comparisons and phylogenetic analyses of the RNA 4 and coat proteins (CPs) revealed that all of the isolates clustered into three different groups, represented by three previously sequenced PNRSV isolates: PV32, PE5, and PV96. The PE5-type group was characterized by a 5' untranslated region that was clearly different from that of the other two groups. The PV32-type group was characterized by an extra hexanucleotide consisting of a duplication of the six immediately preceding nucleotides. Although most of the variability was observed in the first third of the CP, the amino acid residues in this region, which were previously thought to be functionally important in the replication cycle of the virus, were strictly conserved. No clear correlation with the type of symptom or host specificity could be observed. The validity of this grouping was confirmed when other isolates recently characterized by other authors were included in these analyses.

  2. Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

    NASA Technical Reports Server (NTRS)

    Kretsinger, R. H.; Nakayama, S.

    1993-01-01

    In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.

  3. Occurrence of the root-rot pathogen, Fusarium commune, in forest nurseries of the midwestern and western United States

    Treesearch

    Mee-Sook Kim; Jane E. Stewart; R. Kasten Dumroese; Ned B. Klopfenstein

    2012-01-01

    Fusarium commune can cause damping-off and root rot of conifer seedlings in forest nurseries, and this pathogen has been previously reported from Oregon, Idaho, and Washington, USA. We collected Fusarium isolates from additional nurseries in the midwestern and western USA to more fully determine occurrence of this pathogen. We used DNA sequences of the mitochondrial...

  4. PRP5: a helicase-like protein required for mRNA splicing in yeast.

    PubMed Central

    Dalbadie-McFarland, G; Abelson, J

    1990-01-01

    A 96-kDa protein predicted by the DNA sequence of the Saccharomyces cerevisiae PRP5 gene contains a domain that bears a striking resemblance to a family of RNA helicases characterized by the conserved amino acid sequence Asp-Glu-Ala-Asp (D-E-A-D). Previous work indicated that the product of the PRP5 gene is required for splicing and that spliceosome assembly does not occur in its absence. However, its precise role in splicing and the nature of its biochemical activity remained unknown. To examine the role of PRP5 in splicing, we cloned the gene by complementation of a temperature-sensitive mutation and determined its DNA sequence. We discuss here the possible roles for an RNA helicase in splicing and for the activity of the PRP5 protein. PMID:2349233

  5. The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

    PubMed

    Wang, Xin-Cun; Shao, Junjie; Liu, Chang

    2016-07-01

    We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, and the large and small ribosomal RNAs; all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, the group containing many species with high medicinal values.
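
    The GC content quoted above is the percentage of G and C bases among all bases in the molecule. The following is a minimal sketch of that calculation; the function name and the short test sequence are hypothetical, and ambiguous bases such as N are simply ignored, which is one common convention but an assumption here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the unambiguous bases A, C, G, T."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * gc / total


# Toy sequence, invented for illustration: 2 of 8 bases are G or C.
print(gc_content("ATGCATAT"))  # 25.0
```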

  6. DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

    PubMed Central

    Palzkill, T G; Oliver, S G; Newlon, C S

    1986-01-01

    Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for the 3' conserved sequence in promoting ARS activity. PMID:3529036
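
    Finding sites that match "at least 10 out of 11 bases" of a consensus amounts to sliding an 11-base window along the sequence and counting agreeing positions. The sketch below shows one way to do this. The consensus string WTTTATRTTTW used in the example is an assumption (a commonly cited form of the ARS core consensus, not taken from this abstract), the test sequence is invented, and only the given strand is scanned.

```python
# IUPAC nucleotide codes used in the example consensus.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "AT", "R": "AG", "Y": "CT", "N": "ACGT",
}


def consensus_hits(seq: str, consensus: str, min_matches: int):
    """Return (start, window, matches) for each window with >= min_matches agreements."""
    seq = seq.upper()
    n = len(consensus)
    hits = []
    for start in range(len(seq) - n + 1):
        window = seq[start:start + n]
        matches = sum(base in IUPAC[sym] for base, sym in zip(window, consensus))
        if matches >= min_matches:
            hits.append((start, window, matches))
    return hits


# Invented sequence with one perfect site and one site carrying a single mismatch.
example = "GGC" + "ATTTATGTTTA" + "CCG" + "ATTTACGTTTA" + "GG"
for hit in consensus_hits(example, "WTTTATRTTTW", min_matches=10):
    print(hit)
# (3, 'ATTTATGTTTA', 11)
# (17, 'ATTTACGTTTA', 10)
```

    Scanning the reverse complement as well would be needed to find sites on the opposite strand.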

  7. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.

    PubMed

    Favero, F; Joshi, T; Marquard, A M; Birkbak, N J; Krzystanek, M; Li, Q; Szallasi, Z; Eklund, A C

    2015-01-01

    Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manually inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. © The Author 2014.
    Published by Oxford University Press on behalf of the European Society for Medical Oncology.
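
    The agreement statistic quoted above, Pearson's r, measures how closely two sets of paired estimates follow a straight line. The following is a minimal sketch of the calculation; the function name is hypothetical and the paired values are invented for illustration, not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


# Invented paired estimates from two hypothetical methods.
method_a = [1.0, 2.0, 3.0, 4.0]
method_b = [1.0, 3.0, 2.0, 4.0]
print(round(pearson_r(method_a, method_b), 3))  # 0.8
```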

  8. Novel division level bacterial diversity in a Yellowstone hot spring.

    PubMed

    Hugenholtz, P; Pitulle, C; Hershberger, K L; Pace, N R

    1998-01-01

    A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Pace, Proc. Natl. Acad. Sci. USA 91:1609-1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among >300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (≥98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing delta-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial.
    These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.
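
    Grouping sequences into types or clusters at a 98% identity threshold can be sketched with a simple greedy procedure. The code below is an illustration only and is not the method used in the study: it assumes the sequences are already aligned to equal length, compares each sequence with the first member of each existing cluster, and uses invented 100-base sequences. The function names are hypothetical.

```python
def identity(a: str, b: str) -> float:
    """Fraction of identical columns between two pre-aligned, equal-length sequences.

    Columns where both sequences have a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not pairs:
        raise ValueError("no comparable columns")
    return sum(x == y for x, y in pairs) / len(pairs)


def cluster(seqs, threshold=0.98):
    """Greedy clustering: join the first cluster whose representative matches
    at >= threshold, otherwise start a new cluster. Returns lists of indices."""
    clusters = []  # the first index in each list is the cluster representative
    for i, s in enumerate(seqs):
        for members in clusters:
            if identity(seqs[members[0]], s) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


# Invented 100-base sequences differing from the first at 1, 2 and 3 positions.
base = "ACGT" * 25
seqs = [base, "C" + base[1:], "CA" + base[2:], "CAT" + base[3:]]
print(cluster(seqs))  # [[0, 1, 2], [3]]
```

    Greedy clustering depends on input order; it is shown here only to make the identity threshold concrete.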

  9. Comparative chloroplast genomics and phylogenetics of Fagopyrum esculentum ssp. ancestrale – A wild ancestor of cultivated buckwheat

    PubMed Central

    Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A

    2008-01-01

    Background: Chloroplast genome sequences are extremely informative about species interrelationships owing to their non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results: We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in the buckwheat chloroplast genome are similar to those of Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the Bayesian phylogeny inference based on the first two codon positions Amborella united with Nymphaeales. Conclusion: Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences.
    Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids. PMID:18492277

  10. Novel sequence variants in the TMIE gene in families with autosomal recessive nonsyndromic hearing impairment

    PubMed Central

    Santos, Regie Lyn P.; El-Shanti, Hatem; Sikandar, Shaheen; Lee, Kwanghyuk; Bhatti, Attya; Yan, Kai; Chahrour, Maria H.; McArthur, Nathan; Pham, Thanh L.; Mahasneh, Amjad Abdullah; Ahmad, Wasim

    2010-01-01

    To date, 37 genes have been identified for nonsyndromic hearing impairment (NSHI). Identifying the functional sequence variants within these genes and knowing their population-specific frequencies is of public health value, in particular for genetic screening for NSHI. To determine putatively functional sequence variants in the transmembrane inner ear (TMIE) gene in Pakistani and Jordanian families with autosomal recessive (AR) NSHI, four Jordanian and 168 Pakistani families with ARNSHI that is not due to GJB2 (CX26) were submitted to a genome scan. Two-point and multipoint parametric linkage analyses were performed, and families with logarithmic odds (LOD) scores of 1.0 or greater within the TMIE region underwent further DNA sequencing. The evolutionary conservation and location in predicted protein domains of amino acid residues where sequence variants occurred were studied to elucidate the possible effects of these sequence variants on function. Of seven families that were screened for TMIE, putatively functional sequence variants were found to segregate with hearing impairment in four families but were not seen in at least 110 ethnically matched control chromosomes. The previously reported c.241C>T (p.R81C) variant was observed in two Pakistani families. Two novel variants, c.92A>G (p.E31G) and the splice site mutation c.212–2A>C, were identified in one Pakistani and one Jordanian family, respectively. The c.92A>G (p.E31G) variant occurred at a residue that is conserved in the mouse and is predicted to be extracellular. Conservation and potential functionality of previously published mutations were also examined. The prevalence of functional TMIE variants in Pakistani families is 1.7% [95% confidence interval (CI) 0.3–4.8]. Further studies on the spectrum, prevalence rates, and functional effect of sequence variants in the TMIE gene in other populations should demonstrate the true importance of this gene as a cause of hearing impairment. PMID:16389551

  11. The evolutionary history of the DMRT3 'Gait keeper' haplotype.

    PubMed

    Staiger, E A; Almén, M S; Promerová, M; Brooks, S; Cothran, E G; Imsland, F; Jäderkvist Fegraeus, K; Lindgren, G; Mehrabani Yeganeh, H; Mikko, S; Vega-Pla, J L; Tozaki, T; Rubin, C J; Andersson, L

    2017-10-01

    A previous study revealed a strong association between the DMRT3:Ser301STOP mutation in horses and alternate gaits as well as performance in harness racing. Several follow-up studies have confirmed a high frequency of the mutation in gaited horse breeds and an effect on gait quality. The aim of this study was to determine when and where the mutation arose, to identify additional potential causal mutations and to determine the coalescence time for contemporary haplotypes carrying the stop mutation. We utilized sequences from 89 horses representing 26 breeds to identify 102 SNPs encompassing the DMRT3 gene that are in strong linkage disequilibrium with the stop mutation. These 102 SNPs were genotyped in an additional 382 horses representing 72 breeds, and we identified 14 unique haplotypes. The results provided conclusive evidence that DMRT3:Ser301STOP is causal, as no other sequence polymorphisms showed an equally strong association to locomotion traits. The low sequence diversity among mutant chromosomes demonstrated that they must have diverged from a common ancestral sequence within the last 10 000 years. Thus, the mutation occurred either just before domestication or more likely some time after domestication and then spread across the world as a result of selection on locomotion traits. © 2017 Stichting International Foundation for Animal Genetics.

  12. Genome sequencing of idiopathic pulmonary fibrosis in conjunction with a medical school human anatomy course.

    PubMed

    Kumar, Akash; Dougherty, Max; Findlay, Gregory M; Geisheker, Madeleine; Klein, Jason; Lazar, John; Machkovech, Heather; Resnick, Jesse; Resnick, Rebecca; Salter, Alexander I; Talebi-Liasi, Faezeh; Arakawa, Christopher; Baudin, Jacob; Bogaard, Andrew; Salesky, Rebecca; Zhou, Qian; Smith, Kelly; Clark, John I; Shendure, Jay; Horwitz, Marshall S

    2014-01-01

    Even in cases where there is no obvious family history of disease, genome sequencing may contribute to clinical diagnosis and management. Clinical application of the genome has not yet become routine, however, in part because physicians are still learning how best to utilize such information. As an educational research exercise performed in conjunction with our medical school human anatomy course, we explored the potential utility of determining the whole genome sequence of a patient who had died following a clinical diagnosis of idiopathic pulmonary fibrosis (IPF). Medical students performed dissection and whole genome sequencing of the cadaver. Gross and microscopic findings were more consistent with the fibrosing variant of nonspecific interstitial pneumonia (NSIP), as opposed to IPF per se. Variants in genes causing Mendelian disorders predisposing to IPF were not detected. However, whole genome sequencing identified several common variants associated with IPF, including a single nucleotide polymorphism (SNP), rs35705950, located in the promoter region of the gene encoding mucin glycoprotein MUC5B. The MUC5B promoter polymorphism was recently found to markedly elevate risk for IPF, though a particular association with NSIP has not been previously reported, nor has its contribution to disease risk previously been evaluated in the genome-wide context of all genetic variants. We did not identify additional predicted functional variants in a region of linkage disequilibrium (LD) adjacent to MUC5B, nor did we discover other likely risk-contributing variants elsewhere in the genome. Whole genome sequencing thus corroborates the association of rs35705950 with MUC5B dysregulation and interstitial lung disease. This novel exercise additionally served a unique mission in bridging clinical and basic science education.

  13. Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

    PubMed Central

    Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

    1988-01-01

    In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. PMID:3181125

  14. Transitional probabilities count more than frequency, but might not be used for memorization.

    PubMed

    Endress, Ansgar D; Langus, Alan

    2017-02-01

    Learners often need to extract recurring items from continuous sequences, in both vision and audition. The best-known example is probably found in word-learning, where listeners have to determine where words start and end in fluent speech. This could be achieved through universal and experience-independent statistical mechanisms, for example by relying on Transitional Probabilities (TPs). Further, these mechanisms might allow learners to store items in memory. However, previous investigations have yielded conflicting evidence as to whether a sensitivity to TPs is diagnostic of the memorization of recurring items. Here, we address this issue in the visual modality. Participants were familiarized with a continuous sequence of visual items (i.e., arbitrary or everyday symbols), and then had to choose between (i) high-TP items that appeared in the sequence, (ii) high-TP items that did not appear in the sequence, and (iii) low-TP items that appeared in the sequence. Items matched in TPs but differing in (chunk) frequency were much harder to discriminate than items differing in TPs (with no significant sensitivity to chunk frequency), and learners preferred unattested high-TP items over attested low-TP items. Contrary to previous claims, these results cannot be explained on the basis of the similarity of the test items. Learners thus weigh within-item TPs higher than the frequency of the chunks, even when the TP differences are relatively subtle. We argue that these results are problematic for distributional clustering mechanisms that analyze continuous sequences, and provide supporting computational results. We suggest that the role of TPs might not be to memorize items per se, but rather to prepare learners to memorize recurring items once they are presented in subsequent learning situations with richer cues. Copyright © 2016 Elsevier Inc. All rights reserved.
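
The transitional probabilities (TPs) discussed in this abstract can be illustrated with a minimal sketch. It assumes the common forward definition TP(x→y) = count(xy) / count(x); the abstract does not spell out the exact formula or stimuli used, so the function name `transitional_probabilities` and the toy stream below are hypothetical.

```python
from collections import Counter


def transitional_probabilities(sequence):
    """Estimate forward TPs, P(next = y | current = x), from adjacent pairs."""
    # Count every adjacent pair (x, y) in the stream.
    pair_counts = Counter(zip(sequence, sequence[1:]))
    # Count x only where it has a successor, so the TPs out of x sum to 1.
    first_counts = Counter(sequence[:-1])
    return {(x, y): n / first_counts[x] for (x, y), n in pair_counts.items()}


# Hypothetical toy stream of symbols (not the study's stimuli).
stream = list("ABCABDABC")
tps = transitional_probabilities(stream)

for (x, y), p in sorted(tps.items()):
    print(f"TP({x}->{y}) = {p:.2f}")
```

In this toy stream, A is always followed by B, so that transition has a TP of 1, whereas B is followed by C or D and the probability mass is split between them.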

  15. Polynucleobacter meluiroseus sp. nov., a bacterium isolated from a lake located in the mountains of the Mediterranean island of Corsica.

    PubMed

    Pitt, Alexandra; Schmidt, Johanna; Lang, Elke; Whitman, William B; Woyke, Tanja; Hahn, Martin W

    2018-06-01

    Strain AP-Melu-1000-B4 was isolated from a lake located in the mountains of the Mediterranean island of Corsica (France). Phenotypic, chemotaxonomic and genomic traits were investigated. Phylogenetic analyses based on 16S rRNA gene sequencing referred the strain to the cryptic species complex PnecC within the genus Polynucleobacter. The strain encoded genes for biosynthesis of proteorhodopsin and retinal. When pelleted by centrifugation the strain showed an intense rose colouring. Major fatty acids were C16 : 1ω7c, C16 : 0, C18 : 1ω7c and summed feature 2 (C16 : 1 isoI and C14 : 0-3OH). The sequence of the 16S rRNA gene contained an indel which was not present in any previously described Polynucleobacter species. Genome sequencing revealed a genome size of 1.89 Mbp and a G+C content of 46.6 mol%. In order to resolve the phylogenetic position of the new strain within subcluster PnecC, its phylogeny was reconstructed from sequences of 319 shared genes. To represent all currently described Polynucleobacter species by whole genome sequences, three type strains were additionally sequenced. Our phylogenetic analysis revealed that strain AP-Melu-1000-B4 occupied a basal position compared with previously described PnecC strains. Pairwise determined whole genome average nucleotide identity (gANI) values suggested that strain AP-Melu-1000-B4 represents a new species, for which we propose the name Polynucleobacter meluiroseus sp. nov. with the type strain AP-Melu-1000-B4 T (=DSM 103591 T =CIP 111329 T ).

  16. Purification and properties of a third form of anthranilate-5-phosphoribosylpyrophosphate phosphoribosyltransferase from the enterobacteriaceae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Largen, M.; Mills, S.E.; Rowe, J.

    1978-01-25

    Anthranilate-5-phosphoribosylpyrophosphate phosphoribosyltransferase was purified from the bacterium Erwinia carotovora, a member of the Enterobacteriaceae. The enzyme was homogeneous according to the criteria of gel electrophoresis and NH2-terminal amino acid sequence analysis. The molecular weight of the enzyme as determined on a calibrated Sephadex G-200 column was 67,000 ± 2,000. Sodium dodecyl sulfate-polyacrylamide gels gave a subunit molecular weight of 40,000 ± 1,000, suggesting that the enzyme was a dimer. A comparison of the NH2-terminal sequence of the enzyme with the (previously determined) homologue from Serratia marcescens, a monomer with a molecular weight of 45,000, showed that the larger Serratia subunit came into register with amino acid 14 of the Erwinia subunit. The register for the length of the known overlap, 26 amino acids, was highly conserved.

  17. A multi-omic future for microbiome studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jansson, Janet K.; Baker, Erin S.

    2016-04-26

    Microbes constitute about a third of the Earth’s biomass and play critical roles in sustaining life. While results from multiple sequence-based studies have illustrated the importance of microbial communities for human health and the environment, additional technological developments are still needed to gain more insight into their functions [1]. To date, the majority of sequencing studies have focused on the 16S rRNA gene as a phylogenetic marker. This approach has enabled exploration of microbial compositions in a range of sample types, while bypassing the need for cultivation. 16S rRNA gene sequencing has also enabled a vast majority of microorganisms never previously isolated in culture to be identified and placed into a phylogenetic context [2]. These technologies have been utilized to map the locations of microbes inhabiting various locations of the body [3]. Similarly, sequencing has been used to determine the identities and distributions of microorganisms inhabiting different ecosystems [4, 5], and efforts in single cell sequencing of the microbiome have helped fill in missing branches of the phylogenetic tree [6].

  18. Phylogenetic origins of the plant mitochondrion based on a comparative analysis of 5S ribosomal RNA sequences

    NASA Technical Reports Server (NTRS)

    Villanueva, E.; Delihas, N.; Luehrsen, K. R.; Fox, G. E.; Gibson, J.

    1985-01-01

    The complete nucleotide sequences of 5S ribosomal RNAs from Rhodocyclus gelatinosa, Rhodobacter sphaeroides, and Pseudomonas cepacia were determined. Comparisons of these 5S RNA sequences show that rather than being phylogenetically related to one another, the two photosynthetic bacterial 5S RNAs share more sequence and signature homology with the RNAs of two nonphotosynthetic strains. Rhodobacter sphaeroides is specifically related to Paracoccus denitrificans and Rc. gelatinosa is related to Ps. cepacia. These results support earlier 16S ribosomal RNA studies and add two important groups to the 5S RNA data base. Unique 5S RNA structural features previously found in P. denitrificans are present also in the 5S RNA of Rb. sphaeroides; these provide the basis for subdivisional signatures. The immediate consequence of obtaining these new sequences is that it is possible to clarify the phylogenetic origins of the plant mitochondrion. In particular, a close phylogenetic relationship is found between the plant mitochondria and members of the alpha subdivision of the purple photosynthetic bacteria, namely, Rb. sphaeroides, P. denitrificans, and Rhodospirillum rubrum.

  19. Spectra library assisted de novo peptide sequencing for HCD and ETD spectra pairs.

    PubMed

    Yan, Yan; Zhang, Kaizhong

    2016-12-23

    De novo peptide sequencing via tandem mass spectrometry (MS/MS) has developed rapidly in recent years. With the use of spectra pairs from the same peptide under different fragmentation modes, the performance of de novo sequencing is greatly improved. Currently, with large numbers of spectra sequenced every day, spectra libraries containing tens of thousands of annotated experimental MS/MS spectra have become available. These libraries provide information on the properties of the spectra and thus have the potential to be used with de novo sequencing to improve its performance. In this study, an improved de novo sequencing method assisted by spectra libraries is proposed. It uses spectra libraries as training datasets and introduces significant scores of the features used in our previous de novo sequencing method for HCD and ETD spectra pairs. Two pairs of HCD and ETD spectral datasets were used to test the performance of the proposed method and our previous method. The results show that the proposed method achieves better sequencing accuracy, with higher-ranked correct sequences and less computational time. This paper proposes an advanced de novo sequencing method for HCD and ETD spectra pairs that uses information from spectra libraries and significantly improves on previous similar methods.

  20. Evolution and Spread of Ebola Virus in Liberia, 2014-2015.

    PubMed

    Ladner, Jason T; Wiley, Michael R; Mate, Suzanne; Dudas, Gytis; Prieto, Karla; Lovett, Sean; Nagle, Elyse R; Beitzel, Brett; Gilbert, Merle L; Fakoli, Lawrence; Diclaro, Joseph W; Schoepp, Randal J; Fair, Joseph; Kuhn, Jens H; Hensley, Lisa E; Park, Daniel J; Sabeti, Pardis C; Rambaut, Andrew; Sanchez-Lockhart, Mariano; Bolay, Fatorma K; Kugelman, Jeffrey R; Palacios, Gustavo

    2015-12-09

    The 2013-present Western African Ebola virus disease (EVD) outbreak is the largest ever recorded with >28,000 reported cases. Ebola virus (EBOV) genome sequencing has played an important role throughout this outbreak; however, relatively few sequences have been determined from patients in Liberia, the second worst-affected country. Here, we report 140 EBOV genome sequences from the second wave of the Liberian outbreak and analyze them in combination with 782 previously published sequences from throughout the Western African outbreak. While multiple early introductions of EBOV to Liberia are evident, the majority of Liberian EVD cases are consistent with a single introduction, followed by spread and diversification within the country. Movement of the virus within Liberia was widespread, and reintroductions from Liberia served as an important source for the continuation of the already ongoing EVD outbreak in Guinea. Overall, little evidence was found for incremental adaptation of EBOV to the human host. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Action-perception coupling in violinists.

    PubMed

    Kajihara, Takafumi; Verdonschot, Rinus G; Sparks, Joseph; Stewart, Lauren

    2013-01-01

    The current study investigates auditory-motor coupling in musically trained participants using a Stroop-type task that required the execution of simple finger sequences according to aurally presented number sequences (e.g., "2," "4," "5," "3," "1"). Digital remastering was used to manipulate the pitch contour of the number sequences such that they were either congruent or incongruent with respect to the resulting action sequence. Conservatoire-level violinists showed a strong effect of congruency manipulation (increased response time for incongruent vs. congruent trials), in comparison to a control group of non-musicians. In Experiment 2, this paradigm was used to determine whether pedagogical background would influence this effect in a group of young violinists. Suzuki trained violinists differed significantly from those with no musical background, while traditionally trained violinists did not. The findings extend previous research in this area by demonstrating that obligatory audio-motor coupling is directly related to a musician's expertise on their instrument of study and is influenced by pedagogy.

  2. Whole Genome Complete Resequencing of Bacillus subtilis Natto by Combining Long Reads with High-Quality Short Reads

    PubMed Central

    Kamada, Mayumi; Hase, Sumitaka; Sato, Kengo; Toyoda, Atsushi; Fujiyama, Asao; Sakakibara, Yasubumi

    2014-01-01

    De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS) platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food “natto.” The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer, and we successfully obtained a complete genome sequence from one scaffold without any gaps, and we also applied Illumina MiSeq short reads to enhance quality. Compared with the previous BEST195 draft genome and Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributed to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome. PMID:25329997

  3. Detection and analysis of recombination in GII.4 norovirus strains causing gastroenteritis outbreaks in Alberta.

    PubMed

    Hasing, Maria E; Hazes, Bart; Lee, Bonita E; Preiksaitis, Jutta K; Pang, Xiaoli L

    2014-10-01

    Recombination is an important mechanism generating genetic diversity in norovirus (NoV) that occurs commonly at the NoV polymerase-capsid (ORF1/2) junction. The genotyping method based on partial ORF2 sequences currently used to characterize circulating NoV strains in gastroenteritis outbreaks in Alberta cannot detect such recombination events and provides only limited information on NoV genetic evolution. The objective of this study was to determine whether any NoV GII.4 strains causing outbreaks in Alberta are recombinants. Twenty stool samples collected during outbreaks occurring between July 2004 and January 2012 were selected to include the GII.4 variants Farmington Hills 2002, Hunter 2004, Yerseke 2006a, Den Haag 2006b, Apeldoorn 2007, New Orleans 2009, and Sydney 2012 based on previous NoV ORF2-genotyping results. Near full-length NoV genome sequences were obtained, aligned with reference sequences from GenBank and analyzed with RDPv4.13. Two sequences corresponding to Apeldoorn 2007, and Sydney 2012 were identified as recombinants with breakpoints near the ORF1/2 junction and putative parental strains as previously reported. We also identified, for the first time, a non-recombinant sequence resembling the ORF2-3 parent of the recombinant cluster Sydney 2012 responsible for the most recent pandemic. Our results confirmed the presence of recombinant NoV GII.4 strains in Alberta, and highlight the importance of including additional genomic regions in surveillance studies to trace the evolution of pandemic NoV GII.4 strains. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. A Sequence-Independent Strategy for Detection and Cloning of Circular DNA Virus Genomes by Using Multiply Primed Rolling-Circle Amplification

    PubMed Central

    Rector, Annabel; Tachezy, Ruth; Van Ranst, Marc

    2004-01-01

    The discovery of novel viruses has often been accomplished by using hybridization-based methods that necessitate the availability of a previously characterized virus genome probe or knowledge of the viral nucleotide sequence to construct consensus or degenerate PCR primers. In their natural replication cycle, certain viruses employ a rolling-circle mechanism to propagate their circular genomes, and multiply primed rolling-circle amplification (RCA) with φ29 DNA polymerase has recently been applied in the amplification of circular plasmid vectors used in cloning. We employed an isothermal RCA protocol that uses random hexamer primers to amplify the complete genomes of papillomaviruses without the need for prior knowledge of their DNA sequences. We optimized this RCA technique with extracted human papillomavirus type 16 (HPV-16) DNA from W12 cells, using a real-time quantitative PCR assay to determine amplification efficiency, and obtained a 2.4 × 10^4-fold increase in HPV-16 DNA concentration. We were able to clone the complete HPV-16 genome from this multiply primed RCA product. The optimized protocol was subsequently applied to a bovine fibropapillomatous wart tissue sample. Whereas no papillomavirus DNA could be detected by restriction enzyme digestion of the original sample, multiply primed RCA enabled us to obtain a sufficient amount of papillomavirus DNA for restriction enzyme analysis, cloning, and subsequent sequencing of a novel variant of bovine papillomavirus type 1. The multiply primed RCA method allows the discovery of previously unknown papillomaviruses, and possibly also other circular DNA viruses, without a priori sequence information. PMID:15113879

  5. The speEspeD operon of Escherichia coli. Formation and processing of a proenzyme form of S-adenosylmethionine decarboxylase.

    PubMed

    Tabor, C W; Tabor, H

    1987-11-25

    We have previously shown that the gene (speD) for S-adenosylmethionine decarboxylase is part of an operon that also contains the gene (speE) for spermidine synthase (Tabor, C. W., Tabor, H., and Xie, Q.-W. (1986) Proc. Natl. Acad. Sci. U. S. A. 83, 6040-6044). We have now determined the nucleotide sequence of this operon and have found that speD codes for a polypeptide of Mr = 30,400, which is considerably greater than the subunit size of the purified enzyme. Our studies show that S-adenosylmethionine decarboxylase is first formed as a Mr = 30,400 polypeptide and that this proenzyme is then cleaved at the Lys111-Ser112 peptide bond to form a Mr = 12,400 subunit and a Mr = 18,000 subunit. The latter subunit contains the pyruvoyl moiety that we previously showed is required for enzymatic activity. Both subunits are present in the purified enzyme. These conclusions are based on (i) pulse-chase experiments with a strain containing a speD+ plasmid which showed a precursor-product relationship between the proenzyme and the enzyme subunits, (ii) the amino acid sequence of the proenzyme form of S-adenosylmethionine decarboxylase (derived from the nucleotide sequence of the speD gene), and (iii) comparison of this sequence of the proenzyme with the N-terminal amino acid sequences of the two subunits of the purified enzyme reported by Anton and Kutny (Anton, D. L., and Kutny, R. (1987) J. Biol. Chem. 262, 2817-2822).

  6. A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants.

    PubMed

    Cui, Peng; Liu, Huitao; Lin, Qiang; Ding, Feng; Zhuo, Guoyin; Hu, Songnian; Liu, Dongcheng; Yang, Wenlong; Zhan, Kehui; Zhang, Aimin; Yu, Jun

    2009-12-01

    Plant mitochondrial genomes, encoding necessary proteins involved in the system of energy production, play an important role in the development and reproduction of the plant. They occupy a specific evolutionary pattern relative to their nuclear counterparts. Here, we determined the winter wheat (Triticum aestivum cv. Chinese Yumai) mitochondrial genome in a length of 452,526 bp by shotgun sequencing its BAC library. It contains 202 genes, including 35 known protein-coding genes, three rRNA and 17 tRNA genes, as well as 149 open reading frames (ORFs; greater than 300 bp in length). The sequence is almost identical to the previously reported sequence of the spring wheat (T. aestivum cv. Chinese Spring); we only identified seven SNPs (three transitions and four transversions) and 10 indels (insertions and deletions) between the two independently acquired sequences, and all variations were found in non-coding regions. This result confirmed the accuracy of the previously reported mitochondrial sequence of the Chinese Spring wheat. The nucleotide frequency and codon usage of wheat are common among the lineage of higher plants with a high AT-content of 58%. Molecular evolutionary analysis demonstrated that plant mitochondrial genomes evolved at different rates, which may correlate with substantial variations in metabolic rate and generation time among plant lineages. In addition, through the estimation of the ratio of non-synonymous to synonymous substitution rates between orthologous mitochondrion-encoded genes of higher plants, we found an accelerated evolutionary rate that seems to be the result of relaxed selection.
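
The transition/transversion split of SNPs mentioned in this abstract follows a standard rule: a substitution within the purines (A, G) or within the pyrimidines (C, T) is a transition, and any purine-pyrimidine exchange is a transversion. The sketch below only illustrates that rule; the function name `classify_snp` and the example SNPs are hypothetical and are not the variants reported for the wheat genomes.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref, alt):
    """Classify a single-nucleotide substitution as a transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or not ({ref, alt} <= (PURINES | PYRIMIDINES)):
        raise ValueError("expected two different bases from A, C, G, T")
    # Both bases in the same chemical class means a transition.
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


# Hypothetical SNPs for illustration only.
example_snps = [("A", "G"), ("C", "T"), ("A", "C"), ("G", "T")]
labels = [classify_snp(ref, alt) for ref, alt in example_snps]
print(labels)
```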

  7. A computer aided thermodynamic approach for predicting the formation of Z-DNA in naturally occurring sequences

    NASA Technical Reports Server (NTRS)

    Ho, P. S.; Ellison, M. J.; Quigley, G. J.; Rich, A.

    1986-01-01

    The ease with which a particular DNA segment adopts the left-handed Z-conformation depends largely on the sequence and on the degree of negative supercoiling to which it is subjected. We describe a computer program (Z-hunt) that is designed to search long sequences of naturally occurring DNA and retrieve those nucleotide combinations of up to 24 bp in length which show a strong propensity for Z-DNA formation. Incorporated into Z-hunt is a statistical mechanical model based on empirically determined energetic parameters for the B to Z transition accumulated to date. The Z-forming potential of a sequence is assessed by ranking its behavior as a function of negative superhelicity relative to the behavior of similar sized randomly generated nucleotide sequences assembled from over 80,000 combinations. The program makes it possible to compare directly the Z-forming potential of sequences with different base compositions and different sequence lengths. Using Z-hunt, we have analyzed the DNA sequences of the bacteriophage phi X174, plasmid pBR322, the animal virus SV40 and the replicative form of the eukaryotic adenovirus-2. The results are compared with those previously obtained by others from experiments designed to locate Z-DNA forming regions in these sequences using probes which show specificity for the left-handed DNA conformation.

  8. Phylogenetic Relationship of Necoclí Virus to Other South American Hantaviruses (Bunyaviridae: Hantavirus).

    PubMed

    Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F

    2015-07-01

    The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
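
The pairwise nonidentity values quoted in this abstract are percentages of differing positions between two aligned amino acid sequences. A minimal sketch of that calculation follows; it assumes the sequences are already aligned to equal length and that columns where both sequences have a gap are skipped, which is a choice made here for illustration and is not stated in the abstract. The function name and the two short peptides are hypothetical.

```python
def percent_nonidentity(seq_a, seq_b, gap="-"):
    """Percentage of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue  # assumption: ignore columns that are gaps in both sequences
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


# Hypothetical aligned peptides (not hantavirus sequences).
nonidentity = percent_nonidentity("MSTLQELQEN", "MSNLKELQEN")
print(f"{nonidentity:.1f}% nonidentity")
```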

  9. Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.

    PubMed

    Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T

    1996-10-31

    Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1.), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.

  10. Species diversity and phylogeographical affinities of the Branchiopoda (Crustacea) of Churchill, Manitoba, Canada.

    PubMed

    Jeffery, Nicholas W; Elías-Gutiérrez, Manuel; Adamowicz, Sarah J

    2011-01-01

    The region of Churchill, Manitoba, contains a wide variety of habitats representative of both the boreal forest and arctic tundra and has been used as a model site for biodiversity studies for nearly seven decades within Canada. Much previous work has been done in Churchill to study the Daphnia pulex species complex in particular, but no study has completed a wide-scale survey on the crustacean species that inhabit Churchill's aquatic ecosystems using molecular markers. We have employed DNA barcoding to study the diversity of the Branchiopoda (Crustacea) in a wide variety of freshwater habitats and to determine the likely origins of the Churchill fauna following the last glaciation. The standard animal barcode marker (COI) was sequenced for 327 specimens, and a 3% divergence threshold was used to delineate potential species. We found 42 provisional and valid branchiopod species from this survey alone, including several cryptic lineages, in comparison with the 25 recorded in previous ecological works. Using published sequence data, we explored the phylogeographic affinities of Churchill's branchiopods, finding that the Churchill fauna apparently originated from all directions from multiple glacial refugia (including southern, Beringian, and high arctic regions). Overall, these microcrustaceans are very diverse in Churchill and contain multiple species complexes. The present study introduces some of the first sequences for some understudied genera, for which further work is required to delineate species boundaries and develop a more complete understanding of branchiopod diversity over a larger spatial scale.
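
The idea of delineating potential species with a 3% divergence threshold can be sketched as a simple clustering step. The abstract does not name the distance model or the clustering algorithm, so the sketch below makes its own assumptions: uncorrected p-distance on equal-length aligned sequences, and single-linkage grouping of any pair at or below the threshold. All names (`p_distance`, `cluster_by_threshold`, `mutate`) and the toy barcodes are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Uncorrected proportion of differing sites between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def cluster_by_threshold(seqs, threshold=0.03):
    """Single-linkage clusters: link any two sequences with distance <= threshold."""
    names = list(seqs)
    parent = {n: n for n in names}

    def find(n):
        # Union-find lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        if p_distance(seqs[a], seqs[b]) <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(clusters.values())


def mutate(seq, positions):
    """Substitute the base at each given position (toy data helper)."""
    swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
    chars = list(seq)
    for i in positions:
        chars[i] = swap[chars[i]]
    return "".join(chars)


# Toy 100-bp "barcodes": spec2 differs from spec1 at 2 sites, spec3 at 10 sites.
base = "ACGT" * 25
barcodes = {
    "spec1": base,
    "spec2": mutate(base, [0, 50]),
    "spec3": mutate(base, range(10, 20)),
}
groups = cluster_by_threshold(barcodes)
print(groups)
```

Single linkage is only one possible choice; it can chain together sequences whose end members differ by more than the threshold, so a published analysis may well use a different procedure.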

  11. Species Diversity and Phylogeographical Affinities of the Branchiopoda (Crustacea) of Churchill, Manitoba, Canada

    PubMed Central

    Jeffery, Nicholas W.; Elías-Gutiérrez, Manuel; Adamowicz, Sarah J.

    2011-01-01

    The region of Churchill, Manitoba, contains a wide variety of habitats representative of both the boreal forest and arctic tundra and has been used as a model site for biodiversity studies for nearly seven decades within Canada. Much previous work has been done in Churchill to study the Daphnia pulex species complex in particular, but no study has completed a wide-scale survey on the crustacean species that inhabit Churchill's aquatic ecosystems using molecular markers. We have employed DNA barcoding to study the diversity of the Branchiopoda (Crustacea) in a wide variety of freshwater habitats and to determine the likely origins of the Churchill fauna following the last glaciation. The standard animal barcode marker (COI) was sequenced for 327 specimens, and a 3% divergence threshold was used to delineate potential species. We found 42 provisional and valid branchiopod species from this survey alone, including several cryptic lineages, in comparison with the 25 previously recorded from previous ecological works. Using published sequence data, we explored the phylogeographic affinities of Churchill's branchiopods, finding that the Churchill fauna apparently originated from all directions from multiple glacial refugia (including southern, Beringian, and high arctic regions). Overall, these microcrustaceans are very diverse in Churchill and contain multiple species complexes. The present study introduces among the first sequences for some understudied genera, for which further work is required to delineate species boundaries and develop a more complete understanding of branchiopod diversity over a larger spatial scale. PMID:21610864

  12. Bacterivory by a Summer Assemblage of Nanoplankton in the Ross Sea, Antarctica: Mixotrophic Versus Heterotrophic Protists

    NASA Astrophysics Data System (ADS)

    Sanders, R. W.; Gast, R. J.

    2016-02-01

    Many protists traditionally described as phototrophic have recently been shown to have retained the primitive trait of phagotrophy, and thus function as mixotrophs. Mixotrophic nanoflagellates were identified in every sample examined from a summer cruise in the Ross Sea, Antarctica, where they often were more abundant than heterotrophic nanoflagellates that have previously been considered the major bacterivores in marine waters. Mixotrophs, identified by uptake of fluorescent tracers, comprised similar proportions (9-75%) of the total bacterivorous flagellates in summer as were previously determined for an earlier spring cruise in the Ross Sea. Protist diversity also was linked to functional bacterivores using a culture-independent method in which BrdU-labeled DNA of bacterial prey was incorporated into the DNA of eukaryotic grazers. Immunoprecipitation of the BrdU-labeled DNA was followed by high-throughput sequencing to identify a diverse group of bacterivores, including numerous uncultured eukaryotes. However, its utility for identification of mixotrophs was limited by the availability of sequences from known mixotrophs.

  13. Testing the Use of Implicit Solvent in the Molecular Dynamics Modelling of DNA Flexibility

    NASA Astrophysics Data System (ADS)

    Mitchell, J.; Harris, S.

    DNA flexibility controls packaging, looping and in some cases sequence specific protein binding. Molecular dynamics simulations carried out with a computationally efficient implicit solvent model are potentially a powerful tool for studying larger DNA molecules than can be currently simulated when water and counterions are represented explicitly. In this work we compare DNA flexibility at the base pair step level modelled using an implicit solvent model to that previously determined from explicit solvent simulations and database analysis. Although much of the sequence dependent behaviour is preserved in implicit solvent, the DNA is considerably more flexible when the approximate model is used. In addition we test the ability of the implicit solvent to model stress induced DNA disruptions by simulating a series of DNA minicircle topoisomers which vary in size and superhelical density. When compared with previously run explicit solvent simulations, we find that while the levels of DNA denaturation are similar using both computational methodologies, the specific structural form of the disruptions is different.

  14. Evolutionary characterization of the West Nile Virus complete genome.

    PubMed

    Gray, R R; Veras, N M C; Santos, L A; Salemi, M

    2010-07-01

    The spatial dynamics of the West Nile Virus epidemic in North America are largely unknown. Previous studies that investigated the evolutionary history of the virus used sequence data from the structural genes (prM and E); however, these regions may lack phylogenetic information and obscure true evolutionary relationships. This study systematically evaluated the evolutionary patterns in the eleven genes of the WNV genome in order to determine which region(s) were most phylogenetically informative. We found that while the E region lacks resolution and can potentially result in misleading conclusions, the full NS3 or NS5 regions have strong phylogenetic signal. Furthermore, we show that geographic structure of WNV infection within the US is more pronounced than previously reported in studies that used the structural genes. We conclude that future evolutionary studies should focus on NS3 and NS5 in order to maximize the available sequences while retaining maximal interpretative power to infer temporal and geographic trends among WNV strains. Copyright 2010 Elsevier Inc. All rights reserved.

  15. Size and Content of the Sex-Determining Region of the Y Chromosome in Dioecious Mercurialis annua, a Plant with Homomorphic Sex Chromosomes.

    PubMed

    Veltsos, Paris; Cossard, Guillaume; Beaudoing, Emmanuel; Beydon, Genséric; Savova Bianchi, Dessislava; Roux, Camille; C González-Martínez, Santiago; R Pannell, John

    2018-05-29

    Dioecious plants vary in whether their sex chromosomes are heteromorphic or homomorphic, but even homomorphic sex chromosomes may show divergence between homologues in the non-recombining, sex-determining region (SDR). Very little is known about the SDR of these species, which might represent particularly early stages of sex-chromosome evolution. Here, we assess the size and content of the SDR of the diploid dioecious herb Mercurialis annua, a species with homomorphic sex chromosomes and mild Y-chromosome degeneration. We used RNA sequencing (RNAseq) to identify new Y-linked markers for M. annua. Twelve of 24 transcripts showing male-specific expression in a previous experiment could be amplified by polymerase chain reaction (PCR) only from males, and are thus likely to be Y-linked. Analysis of genome-capture data from multiple populations of M. annua pointed to an additional six male-limited (and thus Y-linked) sequences. We used these markers to identify and sequence 17 sex-linked bacterial artificial chromosomes (BACs), which form 11 groups of non-overlapping sequences, covering a total sequence length of about 1.5 Mb. Content analysis of this region suggests that it is enriched for repeats, has low gene density, and contains few candidate sex-determining genes. The BACs map to a subset of the sex-linked region of the genetic map, which we estimate to be at least 14.5 Mb. This is substantially larger than estimates for other dioecious plants with homomorphic sex chromosomes, both in absolute terms and relative to their genome sizes. Our data provide a rare, high-resolution view of the homomorphic Y chromosome of a dioecious plant.

  16. Identification of mitochondrial carriers in Saccharomyces cerevisiae by transport assay of reconstituted recombinant proteins.

    PubMed

    Palmieri, Ferdinando; Agrimi, Gennaro; Blanco, Emanuela; Castegna, Alessandra; Di Noia, Maria A; Iacobazzi, Vito; Lasorsa, Francesco M; Marobbio, Carlo M T; Palmieri, Luigi; Scarcia, Pasquale; Todisco, Simona; Vozza, Angelo; Walker, John

    2006-01-01

    The inner membranes of mitochondria contain a family of carrier proteins that are responsible for the transport in and out of the mitochondrial matrix of substrates, products, co-factors and biosynthetic precursors that are essential for the function and activities of the organelle. This family of proteins is characterized by containing three tandem homologous sequence repeats of approximately 100 amino acids, each folded into two transmembrane alpha-helices linked by an extensive polar loop. Each repeat contains a characteristic conserved sequence. These features have been used to determine the extent of the family in genome sequences. The genome of Saccharomyces cerevisiae contains 34 members of the family. The identity of five of them was known before the determination of the genome sequence, but the functions of the remaining family members were not. This review describes how the functions of 15 of these previously unknown transport proteins have been determined by a strategy that consists of expressing the genes in Escherichia coli or Saccharomyces cerevisiae, reconstituting the gene products into liposomes and establishing their functions by transport assay. Genetic and biochemical evidence as well as phylogenetic considerations have guided the choice of substrates that were tested in the transport assays. The physiological roles of these carriers have been verified by genetic experiments. Various pieces of evidence point to the functions of six additional members of the family, but these proposals await confirmation by transport assay. The sequences of many of the newly identified yeast carriers have been used to characterize orthologs in other species, and in man five diseases are presently known to be caused by defects in specific mitochondrial carrier genes. The roles of eight yeast mitochondrial carriers remain to be established.

  17. Principle component analysis to separate deformation signals from multiple sources during a 2015 intrusive sequence at Kīlauea Volcano

    NASA Astrophysics Data System (ADS)

    Johanson, I. A.; Miklius, A.; Poland, M. P.

    2016-12-01

    A sequence of magmatic events in April-May 2015 at Kīlauea Volcano produced a complex deformation pattern that can be described by multiple deforming sources, active simultaneously. The 2015 intrusive sequence began with inflation in the volcano's summit caldera near Halema`uma`u (HMM) Crater, which continued over a few weeks, followed by rapid deflation of the HMM source and inflation of a source in the south caldera region during the next few days. In Kīlauea Volcano's summit area, multiple deformation centers are active at varying times, and all contribute to the overall pattern observed with GPS, tiltmeters, and InSAR. Isolating the contribution of different signals related to each source is a challenge and complicates the determination of optimal source geometry for the underlying magma bodies. We used principal component analysis of continuous GPS time series from the 2015 intrusion sequence to determine three basis vectors which together account for 83% of the variance in the data set. The three basis vectors are non-orthogonal and not strictly the principal components of the data set. In addition to separating deformation sources in the continuous GPS data, the basis vectors provide a means to scale the contribution of each source in a given interferogram. This provides an additional constraint in a joint model of GPS and InSAR data (COSMO-SkyMed and Sentinel-1A) to determine source geometry. The first basis vector corresponds with inflation in the south caldera region, an area long recognized as the location of a long-term storage reservoir. The second vector represents deformation of the HMM source, which is in the same location as a previously modeled shallow reservoir; however, InSAR data suggest a more complicated source. Preliminary modeling of the deformation attributed to the third basis vector shows that it is consistent with inflation of a steeply dipping ellipsoid centered below Keanakāko`i crater, southeast of HMM. Keanakāko`i crater is the locus of a known, intermittently active deformation source, which was not previously recognized to have been active during the 2015 event.

  18. Characterization of occult hepatitis B virus infection among HIV positive patients in Cameroon.

    PubMed

    Gachara, George; Magoro, Tshifhiwa; Mavhandu, Lufuno; Lum, Emmaculate; Kimbi, Helen K; Ndip, Roland N; Bessong, Pascal O

    2017-03-08

    Occult hepatitis B infection (OBI) among HIV positive patients varies widely in different geographic regions. We undertook a study to determine the prevalence of occult hepatitis B infection among HIV infected individuals visiting a health facility in South West Cameroon and characterized occult HBV strains based on sequence analyses. Plasma samples (n = 337), which previously tested negative for hepatitis B surface antigen (HBsAg), were screened for antibodies against hepatitis B core (anti-HBc) and surface (anti-HBs) antigens followed by DNA extraction. A 366 bp region covering the overlapping surface/polymerase gene of HBV was then amplified in a nested PCR and the amplicons sequenced using Sanger sequencing. The resulting sequences were then analyzed for genotypes and for escape and drug resistance mutations. Twenty samples were HBV DNA positive and were classified as OBI giving a prevalence of 5.9%. Out of these, 9 (45%) were anti-HBs positive, while 10 (52.6%) were anti-HBc positive. Additionally, 2 had dual anti-HBs and anti-HBc reactivity, while 6 had no detectable HBV antibodies. Out of the ten samples that were successfully sequenced, nine were classified as genotype E and one as genotype A. Three sequences possessed mutations associated with lamivudine resistance. We detected a number of mutations within the major hydrophilic region of the surface gene where most immune escape mutations occur. Findings from this study show the presence of hepatitis B in patients without any of the HBV serological markers. Further prospective studies are required to determine the risk factors and markers of OBI.
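The headline prevalence in this abstract follows directly from the counts it reports (20 HBV DNA-positive samples out of 337 screened). The sketch below reproduces that arithmetic and, as an addition not found in the abstract, attaches an approximate 95% Wilson score interval; the choice of interval method and the function name `wilson_interval` are assumptions made for illustration.

```python
import math


def wilson_interval(k: int, n: int, z: float = 1.96) -> tuple:
    """Wilson score interval for a binomial proportion k/n (z = 1.96 for ~95%)."""
    p = k / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half


screened = 337       # HBsAg-negative plasma samples, from the abstract
dna_positive = 20    # HBV DNA-positive samples classified as OBI

prevalence = dna_positive / screened
low, high = wilson_interval(dna_positive, screened)

print(f"OBI prevalence: {prevalence:.1%}")
print(f"approx. 95% interval: {low:.1%} to {high:.1%}")
```

The first line printed matches the 5.9% given in the abstract. The interval is only a rough indication of sampling uncertainty for a single-site sample of this size.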

  19. Identification and characterization of Burkholderia multivorans CCA53.

    PubMed

    Akita, Hironaga; Kimura, Zen-Ichiro; Yusoff, Mohd Zulkhairi Mohd; Nakashima, Nobutaka; Hoshino, Tamotsu

    2017-07-06

    A lignin-degrading bacterium, Burkholderia sp. CCA53, was previously isolated from leaf soil. The purpose of this study was to determine phenotypic and biochemical features of Burkholderia sp. CCA53. Multilocus sequence typing (MLST) analysis based on fragments of the atpD, gltD, gyrB, lepA, recA and trpB gene sequences was performed to identify Burkholderia sp. CCA53. The MLST analysis revealed that Burkholderia sp. CCA53 was tightly clustered with B. multivorans ATCC BAA-247T. The quinone and cellular fatty acid profiles, carbon source utilization, growth temperature and pH were consistent with the characteristics of B. multivorans species. Burkholderia sp. CCA53 was therefore identified as B. multivorans CCA53.

  20. High bacterial diversity in epilithic biofilms of oligotrophic mountain lakes.

    PubMed

    Bartrons, Mireia; Catalan, Jordi; Casamayor, Emilio O

    2012-11-01

    Benthic microbial biofilms attached to rocks (epilithic) are major sites of carbon cycling and can dominate ecosystem primary production in oligotrophic lakes. We studied the bacterial community composition of littoral epilithic biofilms in five connected oligotrophic high mountain lakes located at different altitudes by genetic fingerprinting and clone libraries of the 16S rRNA gene. Different intra-lake samples were analyzed, and consistent changes in community structure (chlorophyll a and organic matter contents, and bacterial community composition) were observed along the altitudinal gradient, particularly related with the location of the lake above or below the treeline. Epilithic biofilm genetic fingerprints were both more diverse among lakes than within lakes and significantly different between montane (below the tree line) and alpine lakes (above the tree line). The genetic richness in the epilithic biofilm was much higher than in the plankton of the same lacustrine area studied in previous works, with significantly idiosyncratic phylogenetic composition (specifically distinct from lake plankton or mountain soils). Data suggest the coexistence of aerobic, anaerobic, phototrophic, and chemotrophic microorganisms in the biofilm, Bacteroidetes and Cyanobacteria being the most important bacterial taxa, followed by Alpha-, Beta-, Gamma-, and Deltaproteobacteria, Chlorobi, Planctomycetes, and Verrucomicrobia. The degree of novelty was especially high for epilithic Bacteroidetes, and up to 50 % of the sequences formed monophyletic clusters distantly related to any previously reported sequence. More than 35 % of the total sequences matched at <95 % identity to any previously reported 16S rRNA gene, indicating that alpine epilithic biofilms are unexplored habitats that contain a substantial degree of novelty within a short geographical distance. Further research is needed to determine whether these communities are involved in more biogeochemical pathways than previously thought.

  1. Speech Motor Sequence Learning: Effect of Parkinson Disease and Normal Aging on Dual-Task Performance.

    PubMed

    Whitfield, Jason A; Goberman, Alexander M

    2017-06-22

    Everyday communication is carried out concurrently with other tasks. Therefore, determining how dual tasks interfere with newly learned speech motor skills can offer insight into the cognitive mechanisms underlying speech motor learning in Parkinson disease (PD). The current investigation examines a recently learned speech motor sequence under dual-task conditions. A previously learned sequence of 6 monosyllabic nonwords was examined using a dual-task paradigm. Participants repeated the sequence while concurrently performing a visuomotor task, and performance on both tasks was measured in single- and dual-task conditions. The younger adult group exhibited little to no dual-task interference on the accuracy and duration of the sequence. The older adult group exhibited variability in dual-task costs, with the group as a whole exhibiting an intermediate, though significant, amount of dual-task interference. The PD group exhibited the largest degree of bidirectional dual-task interference among all the groups. These data suggest that PD affects the later stages of speech motor learning, as the dual-task condition interfered with production of the recently learned sequence beyond the effect of normal aging. Because the basal ganglia is critical for the later stages of motor sequence learning, the observed deficits may result from the underlying neural dysfunction associated with PD.

  2. Musical Scales in Tone Sequences Improve Temporal Accuracy.

    PubMed

    Li, Min S; Di Luca, Massimiliano

    2018-01-01

    Predicting the time of stimulus onset is a key component in perception. Previous investigations of perceived timing have focused on the effect of stimulus properties such as rhythm and temporal irregularity, but the influence of non-temporal properties and their role in predicting stimulus timing has not been exhaustively considered. The present study aims to understand how a non-temporal pattern in a sequence of regularly timed stimuli could improve or bias the detection of temporal deviations. We presented interspersed sequences of 3, 4, 5, and 6 auditory tones where only the timing of the last stimulus could slightly deviate from isochrony. Participants reported whether the last tone was 'earlier' or 'later' relative to the expected regular timing. In two conditions, the tones composing the sequence were either organized into musical scales or they were random tones. In one experiment, all sequences ended with the same tone; in the other experiment, each sequence ended with a different tone. Results indicate higher discriminability of anisochrony with musical scales and with longer sequences, irrespective of the knowledge of the final tone. Such an outcome suggests that the predictability of non-temporal properties, as enabled by the musical scale pattern, can be a factor in determining the sensitivity of time judgments.

  3. Singular over-representation of an octameric palindrome, HIP1, in DNA from many cyanobacteria.

    PubMed

    Robinson, N J; Robinson, P J; Gupta, A; Bleasby, A J; Whitton, B A; Morby, A P

    1995-03-11

    An octameric palindrome (5'-GCGATCGC-3') is abundant in cyanobacterial sequences within databases (GenBank/EMBL) and was designated HIP1 (highly iterated palindrome). The frequency of occurrence of all 256 octameric palindromes has now been determined in sub-databases revealing large and unique over-representation of HIP1 in cyanobacterial entries. DNA sequences from other bacteria were searched for any over-represented octameric palindromes analogous to HIP1. Only two sequences were identified, in the genomes of a thermophile and halophilic archaebacteria, although these were less abundant than HIP1 in cyanobacteria and relate to codon usage. To test the proposed widespread distribution of HIP1 in DNA from the cyanobacterium Synechococcus PCC 6301, randomly selected genomic clones were partly sequenced. HIP1 constituted 2.5% of the novel sequences, equivalent to a site on average once every 320 nucleotides. An oligonucleotide including HIP1 was also tested in PCR. Multiple products were obtained using template DNA from cyanobacterial strains in which HIP1 is abundant in known sequences, and some strains generated characteristic HIP-PCR banding patterns. However, analysis of DNA from one strain (not previously represented in databases) by random sequencing, HIP-PCR and PvuI digestion, confirms that not all cyanobacterial genomes are rich in HIP1.
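Two figures in this abstract can be checked with a few lines of code: the count of 256 octameric palindromes, and the equivalence between HIP1 making up 2.5% of the sequence and one site roughly every 320 nucleotides. The sketch below is illustrative only; the function names and the toy sequence are assumptions, and it says nothing about how the authors actually searched the databases.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def octameric_palindromes() -> list:
    """All 8-nt sequences equal to their own reverse complement.

    The first four bases determine the last four, giving 4**4 sequences.
    """
    return ["".join(half) + reverse_complement("".join(half))
            for half in product("ACGT", repeat=4)]


def count_overlapping(seq: str, motif: str) -> int:
    """Count occurrences of motif in seq, allowing overlaps."""
    width = len(motif)
    return sum(1 for i in range(len(seq) - width + 1) if seq[i:i + width] == motif)


HIP1 = "GCGATCGC"
palindromes = octameric_palindromes()
print(len(palindromes), HIP1 in palindromes)

# An 8-nt motif occupying 2.5% of the sequence implies one site per 8 / 0.025 nt.
spacing = round(len(HIP1) / 0.025)
print(spacing)

# Toy sequence (not real cyanobacterial DNA) containing two HIP1 sites.
toy = "TT" + HIP1 + "AAAA" + HIP1
print(count_overlapping(toy, HIP1))
```

Tallying `count_overlapping` for every entry in `palindromes` across a set of sequences gives the kind of frequency table the abstract refers to, against which over-representation of one palindrome can be judged.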

  4. Complete Genomic Sequence and Comparative Analysis of the Genome Segments of Sweet Potato Chlorotic Stunt Virus in China

    PubMed Central

    Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling

    2014-01-01

    Background: Sweet potato chlorotic stunt virus (family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected in China in 2011, and only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings: The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance: We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked the p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926

  5. Deciphering evolutionary strata on plant sex chromosomes and fungal mating-type chromosomes through compositional segmentation.

    PubMed

    Pandey, Ravi S; Azad, Rajeev K

    2016-03-01

    Sex chromosomes have evolved from a pair of homologous autosomes which differentiated into sex determination systems, such as XY or ZW system, as a consequence of successive recombination suppression between the gametologous chromosomes. Identifying the regions of recombination suppression, namely, the "evolutionary strata", is central to understanding the history and dynamics of sex chromosome evolution. Evolution of sex chromosomes as a consequence of serial recombination suppressions is well-studied for mammals and birds, but not for plants, although 48 dioecious plants have already been reported. Only two plants Silene latifolia and papaya have been studied until now for the presence of evolutionary strata on their X chromosomes, made possible by the sequencing of sex-linked genes on both the X and Y chromosomes, which is a requirement of all current methods that determine stratum structure based on the comparison of gametologous sex chromosomes. To circumvent this limitation and detect strata even if only the sequence of sex chromosome in the homogametic sex (i.e. X or Z chromosome) is available, we have developed an integrated segmentation and clustering method. In application to gene sequences on the papaya X chromosome and protein-coding sequences on the S. latifolia X chromosome, our method could decipher all known evolutionary strata, as reported by previous studies. Our method, after validating on known strata on the papaya and S. latifolia X chromosome, was applied to the chromosome 19 of Populus trichocarpa, an incipient sex chromosome, deciphering two, yet unknown, evolutionary strata. In addition, we applied this approach to the recently sequenced sex chromosome V of the brown alga Ectocarpus sp. that has a haploid sex determination system (UV system) recovering the sex determining and pseudoautosomal regions, and then to the mating-type chromosomes of an anther-smut fungus Microbotryum lychnidis-dioicae predicting five strata in the non-recombining region of both the chromosomes.

  6. A TATA binding protein mutant with increased affinity for DNA directs transcription from a reversed TATA sequence in vivo.

    PubMed

    Spencer, J Vaughn; Arndt, Karen M

    2002-12-01

    The TATA-binding protein (TBP) nucleates the assembly and determines the position of the preinitiation complex at RNA polymerase II-transcribed genes. We investigated the importance of two conserved residues on the DNA binding surface of Saccharomyces cerevisiae TBP to DNA binding and sequence discrimination. Because they define a significant break in the twofold symmetry of the TBP-TATA interface, Ala100 and Pro191 have been proposed to be key determinants of TBP binding orientation and transcription directionality. In contrast to previous predictions, we found that substitution of an alanine for Pro191 did not allow recognition of a reversed TATA box in vivo; however, the reciprocal change, Ala100 to proline, resulted in efficient utilization of this and other variant TATA sequences. In vitro assays demonstrated that TBP mutants with the A100P and P191A substitutions have increased and decreased affinity for DNA, respectively. The TATA binding defect of TBP with the P191A mutation could be intragenically suppressed by the A100P substitution. Our results suggest that Ala100 and Pro191 are important for DNA binding and sequence recognition by TBP, that the naturally occurring asymmetry of Ala100 and Pro191 is not essential for function, and that a single amino acid change in TBP can lead to elevated DNA binding affinity and recognition of a reversed TATA sequence.

  7. Codon-Anticodon Recognition in the Bacillus subtilis glyQS T Box Riboswitch

    PubMed Central

    Caserta, Enrico; Liu, Liang-Chun; Grundy, Frank J.; Henkin, Tina M.

    2015-01-01

    Many amino acid-related genes in Gram-positive bacteria are regulated by the T box riboswitch. The leader RNA of genes in the T box family controls the expression of downstream genes by monitoring the aminoacylation status of the cognate tRNA. Previous studies identified a three-nucleotide codon, termed the “Specifier Sequence,” in the riboswitch that corresponds to the amino acid identity of the downstream genes. Pairing of the Specifier Sequence with the anticodon of the cognate tRNA is the primary determinant of specific tRNA recognition. This interaction mimics codon-anticodon pairing in translation but occurs in the absence of the ribosome. The goal of the current study was to determine the effect of a full range of mismatches for comparison with codon recognition in translation. Mutations were individually introduced into the Specifier Sequence of the glyQS leader RNA and tRNAGly anticodon to test the effect of all possible pairing combinations on tRNA binding affinity and antitermination efficiency. The functional role of the conserved purine 3′ of the Specifier Sequence was also verified in this study. We found that substitutions at the Specifier Sequence resulted in reduced binding, the magnitude of which correlates well with the predicted stability of the RNA-RNA pairing. However, the tolerance for specific mismatches in antitermination was generally different from that during decoding, which reveals a unique tRNA recognition pattern in the T box antitermination system. PMID:26229106
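The antiparallel pairing between a three-nucleotide Specifier Sequence and a tRNA anticodon, and the enumeration of mismatched variants, can be sketched as below. This is a simplified illustration under stated assumptions: the abstract does not give the sequences, so the glycine codon GGC and the complementary anticodon GCC are used purely as example values; only Watson-Crick pairs are scored (G-U wobble and measured binding affinities are ignored); and all function names are hypothetical.

```python
from collections import Counter
from itertools import product

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def paired_positions(specifier: str, anticodon: str) -> list:
    """Pair a 5'->3' Specifier Sequence with a 5'->3' anticodon, antiparallel.

    Position 1 of the specifier faces position 3 of the anticodon, and so on.
    """
    if len(specifier) != 3 or len(anticodon) != 3:
        raise ValueError("both sequences must be three nucleotides long")
    return [(s, a) in WATSON_CRICK for s, a in zip(specifier, reversed(anticodon))]


def mismatch_count(specifier: str, anticodon: str) -> int:
    """Number of positions that do not form a Watson-Crick pair."""
    return sum(not ok for ok in paired_positions(specifier, anticodon))


# Example values only (assumed, not taken from the abstract).
SPECIFIER = "GGC"   # a glycine codon
ANTICODON = "GCC"   # its Watson-Crick complement, written 5'->3'

print(paired_positions(SPECIFIER, ANTICODON))

# Tally mismatches for every possible three-nucleotide specifier variant.
tally = Counter(mismatch_count("".join(v), ANTICODON)
                for v in product("ACGU", repeat=3))
print(dict(sorted(tally.items())))
```

Of the 64 possible specifier variants, exactly one pairs fully with the example anticodon and nine differ at a single position. The experiments in the abstract measure how much each such change actually reduces binding and antitermination, which a pair/no-pair tally like this cannot capture.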

  8. Data Prediction for Public Events in Professional Domains Based on Improved RNN-LSTM

    NASA Astrophysics Data System (ADS)

    Song, Bonan; Fan, Chunxiao; Wu, Yuexin; Sun, Juanjuan

    2018-02-01

    Traditional prediction services for emergency or non-periodic events usually fail to produce satisfactory results or to meet their prediction goals. However, these events are influenced by external causes, which means that some a priori information about them can generally be collected through the Internet. This paper studies these problems and proposes an improved model, an LSTM (Long Short-Term Memory) dynamic prediction and a priori information sequence generation model, that combines RNN-LSTM with a priori information about public events. In prediction tasks, the model is able to determine trends, and its accuracy has been validated. The model yields better performance and prediction results than the previous one. Using a priori information can increase prediction accuracy; LSTM adapts better to changes in time sequences; and LSTM can be widely applied to prediction tasks of the same type and to other prediction tasks involving time sequences.

  9. The genome sequence of the emerging common midwife toad virus identifies an evolutionary intermediate within ranaviruses.

    PubMed

    Mavian, Carla; López-Bueno, Alberto; Balseiro, Ana; Casais, Rosa; Alcamí, Antonio; Alejo, Alí

    2012-04-01

    Worldwide amphibian population declines have been ascribed to global warming, increasing pollution levels, and other factors directly related to human activities. These factors may additionally be favoring the emergence of novel pathogens. In this report, we have determined the complete genome sequence of the emerging common midwife toad ranavirus (CMTV), which has caused fatal disease in several amphibian species across Europe. Phylogenetic and gene content analyses of the first complete genomic sequence from a ranavirus isolated in Europe show that CMTV is an amphibian-like ranavirus (ALRV). However, the CMTV genome structure is novel and represents an intermediate evolutionary stage between the two previously described ALRV groups. We find that CMTV clusters with several other ranaviruses isolated from different hosts and locations which might also be included in this novel ranavirus group. This work sheds light on the phylogenetic relationships within this complex group of emerging, disease-causing viruses.

  10. Complete Genome Sequence of a Novel Newcastle Disease Virus Strain Isolated from a Chicken in West Africa

    PubMed Central

    Kim, Shin-Hee; Nayak, Subhashree; Paldurai, Anandan; Nayak, Baibaswata; Samuel, Arthur; Aplogan, Gilbert L.; Awoume, Kodzo A.; Webby, Richard J.; Ducatez, Mariette F.; Collins, Peter L.

    2012-01-01

    The complete genome sequence of an African Newcastle disease virus (NDV) strain isolated from a chicken in Togo in 2009 was determined. The genome is 15,198 nucleotides (nt) in length and is classified in genotype VII in the class II cluster. Compared to common vaccine strains, the African strain contains a previously described 6-nt insert in the downstream untranslated region of the N gene and a novel 6-nt insert in the HN-L intergenic region. Genome length differences are a marker of the natural history of NDV. This is the first description of a class II NDV strain with a genome of 15,198 nt and a 6-nt insert in the HN-L intergenic region. Sequence divergence relative to vaccine strains was substantial, likely contributes to outbreaks, and illustrates the continued evolution of new NDV strains in West Africa. PMID:22997417

  11. Discovery of T Cell Receptor β Motifs Specific to HLA-B27-Positive Ankylosing Spondylitis by Deep Repertoire Sequence Analysis.

    PubMed

    Faham, Malek; Carlton, Victoria; Moorhead, Martin; Zheng, Jianbiao; Klinger, Mark; Pepin, Francois; Asbury, Thomas; Vignali, Marissa; Emerson, Ryan O; Robins, Harlan S; Ireland, James; Baechler-Gillespie, Emily; Inman, Robert D

    2017-04-01

    Ankylosing spondylitis (AS), a chronic inflammatory disorder, has a notable association with HLA-B27. One hypothesis suggests that a common antigen that binds to HLA-B27 is important for AS disease pathogenesis. This study was undertaken to determine sequences and motifs that are shared among HLA-B27-positive AS patients, using T cell repertoire next-generation sequencing. To identify motifs enriched among B27-positive AS patients, we performed T cell receptor β (TCRβ) repertoire sequencing on samples from 191 B27-positive AS patients, 43 B27-negative AS patients, and 227 controls, and we obtained >77 million TCRβ clonotype sequences. First, we assessed whether any of 50 previously published sequences were enriched in B27-positive AS patients. We then used training and test cohorts to identify discovered motifs that were enriched in B27-positive AS patients versus controls. Six previously published and 11 discovered motifs were enriched in the B27-positive AS samples as compared to controls. After combining motifs related by sequence, we identified a total of 15 independent motifs. Both the full set of 15 motifs and a set of 6 published motifs were enriched in the B27-positive AS patients as compared to B27-positive healthy individuals (P = 0.049 and P = 0.001, respectively). Using an independent cohort, we validated that at least some of these motifs were associated with AS, and not simply with B27-positive status. We identified TCRβ motifs that are enriched in B27-positive AS patients as compared to B27-positive healthy controls. This suggests that a common antigen, presented by HLA-B27 and detected by CD8+ T cells, may be associated with AS disease pathogenesis. © 2016, American College of Rheumatology.

  12. Next-generation sequencing sheds light on the natural history of hepatitis C infection in patients who fail treatment.

    PubMed

    Abdelrahman, Tamer; Hughes, Joseph; Main, Janice; McLauchlan, John; Thursz, Mark; Thomson, Emma

    2015-01-01

    High rates of sexually transmitted infection and reinfection with hepatitis C virus (HCV) have recently been reported in human immunodeficiency virus (HIV)-infected men who have sex with men and reinfection has also been described in monoinfected injecting drug users. The diagnosis of reinfection has traditionally been based on direct Sanger sequencing of samples pre- and posttreatment, but not on more sensitive deep sequencing techniques. We studied viral quasispecies dynamics in patients who failed standard of care therapy in a high-risk HIV-infected cohort of patients with early HCV infection to determine whether treatment failure was associated with reinfection or recrudescence of preexisting infection. Paired sequences (pre- and posttreatment) were analyzed. The HCV E2 hypervariable region-1 was amplified using nested reverse-transcription polymerase chain reaction (RT-PCR) with indexed genotype-specific primers and the same products were sequenced using both Sanger and 454 pyrosequencing approaches. Of 99 HIV-infected patients with acute HCV treated with 24-48 weeks of pegylated interferon alpha and ribavirin, 15 failed to achieve a sustained virological response (six relapsed, six had a null response, and three had a partial response). Using direct sequencing, 10/15 patients (66%) had evidence of a previously undetected strain posttreatment; in many studies, this is interpreted as reinfection. However, pyrosequencing revealed that 15/15 (100%) of patients had evidence of persisting infection; 6/15 (40%) patients had evidence of a previously undetected variant present in the posttreatment sample in addition to a variant that was detected at baseline. This could represent superinfection or a limitation of the sensitivity of pyrosequencing. In this high-risk group, the emergence of new viral strains following treatment failure is most commonly associated with emerging dominance of preexisting minority variants rather than reinfection. Superinfection may occur in this cohort but reinfection is overestimated by Sanger sequencing. © 2014 The Authors. Hepatology published by Wiley on behalf of the American Association for the Study of Liver Diseases.

  13. Sequence analysis of the L protein of the Ebola 2014 outbreak: Insight into conserved regions and mutations.

    PubMed

    Ayub, Gohar; Waheed, Yasir

    2016-06-01

    The 2014 Ebola outbreak was one of the largest that have occurred; it started in Guinea and spread to Nigeria, Liberia and Sierra Leone. Phylogenetic analysis of the current virus species indicated that this outbreak is the result of a divergent lineage of the Zaire ebolavirus. The L protein of Ebola virus (EBOV) is the catalytic subunit of the RNA‑dependent RNA polymerase complex, which, with VP35, is key for the replication and transcription of viral RNA. Earlier sequence analysis demonstrated that the L protein of all non‑segmented negative‑sense (NNS) RNA viruses consists of six domains containing conserved functional motifs. The aim of the present study was to analyze the presence of these motifs in 2014 EBOV isolates, and to highlight their function and how they may contribute to the overall pathogenicity of the isolates. For this purpose, 81 2014 EBOV L protein sequences were aligned with 475 other NNS RNA viruses, including Paramyxoviridae and Rhabdoviridae viruses. Phylogenetic analysis of all EBOV outbreak L protein sequences was also performed. Analysis of the amino acid substitutions in the 2014 EBOV outbreak was conducted using sequence analysis. The alignment demonstrated the presence of previously conserved motifs in the 2014 EBOV isolates and novel residues. Notably, all the mutations identified in the 2014 EBOV isolates were tolerant; they were pathogenic, with certain examples occurring within previously determined functional conserved motifs, possibly altering viral pathogenicity, replication and virulence. The phylogenetic analysis demonstrated that all sequences with the exception of the 2014 EBOV sequences were clustered together. The 2014 EBOV outbreak has acquired a great number of mutations, which may explain the reasons behind this unprecedented outbreak. Certain residues critical to the function of the polymerase remain conserved and may be targets for the development of antiviral therapeutic agents.

  14. The nonamer UUAUUUAUU is the key AU-rich sequence motif that mediates mRNA degradation.

    PubMed Central

    Zubiaga, A M; Belasco, J G; Greenberg, M E

    1995-01-01

    Labile mRNAs that encode cytokine and immediate-early gene products often contain AU-rich sequences within their 3' untranslated region (UTR). These AU-rich sequences appear to be key determinants of the short half-lives of these mRNAs, although the sequence features of these elements and the mechanism by which they target mRNAs for rapid decay have not been fully defined. We have examined the features of AU-rich elements (AREs) that are crucial for their function as determinants of mRNA instability in mammalian cells by testing the ability of various mutant c-fos AREs and synthetic AREs to direct rapid mRNA deadenylation and decay when inserted within the 3' UTR of the normally stable beta-globin mRNA. Evidence is presented that the pentamer AUUUA, which previously was suggested to be the minimal determinant of instability present in mammalian AREs, cannot direct rapid mRNA deadenylation and decay. Instead, the nonamer UUAUUUAUU is the elemental AU-rich sequence motif that destabilizes mRNA. Removal of one uridine residue from either end of the nonamer (UUAUUUAU or UAUUUAUU) results in a decrease of potency of the element, while removal of a uridine residue from both ends of the nonamer (UAUUUAU) eliminates detectable destabilizing activity. The inclusion of an additional uridine residue at both ends of the nonamer (UUUAUUUAUUU) does not further increase the efficacy of the element. Taken together, these findings suggest that the nonamer UUAUUUAUU is the minimal AU-rich motif that effectively destabilizes mRNA. Additional ARE potency is achieved by combining multiple copies of this nonamer in a single mRNA 3' UTR. Furthermore, analysis of poly(A) shortening rates for ARE-containing mRNAs reveals that the UUAUUUAUU sequence also accelerates mRNA deadenylation and suggests that the UUAUUUAUU motif targets mRNA for rapid deadenylation as an early step in the mRNA decay process. PMID:7891716
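
    The motif comparison in this abstract can be illustrated with a short scan for the nonamer and the shorter pentamer in a 3' UTR string. This is only a sketch: the function name and the example UTR are hypothetical and invented for illustration, and finding a motif in a sequence says nothing by itself about measured mRNA stability.

```python
import re

NONAMER = "UUAUUUAUU"   # minimal destabilizing motif named in the abstract
PENTAMER = "AUUUA"      # shorter motif the abstract reports as insufficient


def find_motif(utr, motif):
    """Return 0-based start positions of motif in utr, overlaps included."""
    rna = utr.upper().replace("T", "U")  # accept DNA or RNA spelling
    # The lookahead is zero-width, so overlapping occurrences are all reported.
    return [m.start() for m in re.finditer("(?=" + re.escape(motif) + ")", rna)]


# Hypothetical 3' UTR fragment, made up for this example only.
example_utr = "GCAUUUAGGUUAUUUAUUUAUUCC"

nonamer_hits = find_motif(example_utr, NONAMER)    # two overlapping copies
pentamer_hits = find_motif(example_utr, PENTAMER)  # three pentamers
print(nonamer_hits, pentamer_hits)
```

    Overlapping matches are counted on purpose, since tandem copies of the nonamer share uridines, as in UUAUUUAUUUAUU.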

  15. Genetic diversity and epidemiology of infectious hematopoietic necrosis virus in Alaska

    USGS Publications Warehouse

    Emmenegger, E.G; Meyers, T.R.; Burton, T.O.; Kurath, G.

    2000-01-01

    Forty-two infectious hematopoietic necrosis virus (IHNV) isolates from Alaska were analyzed using the ribonuclease protection assay (RPA) and nucleotide sequencing. RPA analyses, utilizing 4 probes, N5, N3 (N gene), GF (G gene), and NV (NV gene), determined that the haplotypes of all 3 genes demonstrated a consistent spatial pattern. Virus isolates belonging to the most common haplotype groups were distributed throughout Alaska, whereas isolates in small haplotype groups were obtained from only 1 site (hatchery, lake, etc.). The temporal pattern of the GF haplotypes suggested a 'genetic acclimation' of the G gene, possibly due to positive selection on the glycoprotein. A pairwise comparison of the sequence data determined that the maximum nucleotide diversity of the isolates was 2.75% (10 mismatches) for the NV gene, and 1.99% (6 mismatches) for a 301 base pair region of the G gene, indicating that the genetic diversity of IHNV within Alaska is notably lower than in the more southern portions of the IHNV North American range. Phylogenetic analysis of representative Alaskan sequences and sequences of 12 previously characterized IHNV strains from Washington, Oregon, Idaho, California (USA) and British Columbia (Canada) distinguished the isolates into clusters that correlated with geographic origin and indicated that the Alaskan and British Columbia isolates may have a common viral ancestral lineage. Comparisons of multiple isolates from the same site provided epidemiological insights into viral transmission patterns and indicated that viral evolution, viral introduction, and genetic stasis were the mechanisms involved with IHN virus population dynamics in Alaska. The examples of genetic stasis and the overall low sequence heterogeneity of the Alaskan isolates suggested that they are evolutionarily constrained. This study establishes a baseline of genetic fingerprint patterns and sequence groups representing the genetic diversity of Alaskan IHNV isolates. This information could be used to determine the source of an IHN outbreak and to facilitate decisions in fisheries management of Alaskan salmonid stocks.
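
    The pairwise figures quoted above (for example, 6 mismatches over a 301 base pair region giving 1.99%) follow from a simple mismatch percentage. The sketch below assumes pre-aligned, equal-length sequences with no gaps; the function names and the toy sequences are hypothetical and are not the study's data.

```python
from itertools import combinations


def percent_difference(seq_a, seq_b):
    """Return (mismatch count, percent mismatch) for two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    mismatches = sum(a != b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return mismatches, 100.0 * mismatches / len(seq_a)


def max_pairwise_difference(seqs):
    """Largest percent mismatch over all pairs, as in a pairwise comparison."""
    return max(percent_difference(a, b)[1] for a, b in combinations(seqs, 2))


# Arithmetic check against the abstract: 6 mismatches in 301 bp.
g_gene_percent = round(100.0 * 6 / 301, 2)
print(g_gene_percent)

# Toy aligned sequences, invented for illustration.
toy = ["ACGTACGTAC", "ACGTTCGTAC", "ACGAACGTAT"]
print(max_pairwise_difference(toy))
```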

  16. kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets

    PubMed Central

    Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.

    2013-01-01

    Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147
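
    As a rough illustration of what "kmer sequence features" means, the sketch below turns a DNA string into a fixed-length vector of k-mer counts that could be handed to any classifier. It is an assumption-laden toy, not the kmer-SVM implementation: the function names are hypothetical, reverse complements are not merged, and no SVM is trained here.

```python
from collections import Counter
from itertools import product


def kmer_counts(seq, k):
    """Count every overlapping k-mer in seq."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_vector(seq, k, alphabet="ACGT"):
    """Return counts in a fixed order over all len(alphabet)**k possible k-mers."""
    counts = kmer_counts(seq, k)
    vocabulary = ["".join(p) for p in product(alphabet, repeat=k)]
    return [counts.get(kmer, 0) for kmer in vocabulary]


# Toy sequence, invented for illustration.
print(kmer_counts("ACGTACG", 3))
print(len(kmer_vector("ACGTACG", 3)))  # 4**3 features
```

    A fixed vocabulary order matters because every sequence must map to the same feature positions before a classifier can compare them.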

  17. kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets.

    PubMed

    Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S; Beer, Michael A

    2013-07-01

    Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167-80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org.

  18. Diversity of 16S rRNA genes of new Ehrlichia strains isolated from horses with clinical signs of Potomac horse fever.

    PubMed

    Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W

    1995-04-01

    Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.

  19. hsp65 PCR-restriction analysis (PRA) with capillary electrophoresis in comparison to three other methods for identification of Mycobacterium species.

    PubMed

    Sajduda, Anna; Martin, Anandi; Portaels, Françoise; Palomino, Juan Carlos

    2010-02-01

    We developed a scheme for rapid identification of Mycobacterium species using an automated fluorescence capillary electrophoresis instrument. A 441-bp region of the hsp65 gene was examined using PCR-restriction analysis (PRA). The assay was initially evaluated on 38 reference strains. The observed sizes of restriction fragments were consistently smaller than the real sizes for each of the species as deduced from the sequence analysis (mean variance=7bp). Nevertheless, the obtained PRA patterns were highly reproducible and resulted in correct species identifications. A blind test was then successfully performed on 64 test isolates previously characterized by conventional biochemical methods, a commercial INNO-LiPA Mycobacteria assay and/or sequence determination of the 5' end of 16S rRNA gene. A total of 14 of 64 isolates were erroneously identified by conventional methods (78% accuracy). In contrast, PRA performed very well in comparison with the LiPA (89% concordance) and especially with DNA sequencing (93.3% of concordant results). Also, PRA identified seven isolates representing five previously unreported hsp65 alleles. We conclude that hsp65 PRA based on automated capillary electrophoresis is a rapid, simple and reliable method for identification of mycobacteria. Copyright 2010 Elsevier B.V. All rights reserved.

  20. Recent sequence variation in probe binding site affected detection of respiratory syncytial virus group B by real-time RT-PCR.

    PubMed

    Kamau, Everlyn; Agoti, Charles N; Lewa, Clement S; Oketch, John; Owor, Betty E; Otieno, Grieven P; Bett, Anne; Cane, Patricia A; Nokes, D James

    2017-03-01

    Direct immuno-fluorescence test (IFAT) and multiplex real-time RT-PCR have been central to RSV diagnosis in Kilifi, Kenya. Recently, these two methods showed discrepancies with an increasing number of PCR undetectable RSV-B viruses. This study aimed to establish whether mismatches in the primer and probe binding sites could have reduced real-time RT-PCR sensitivity. Nucleoprotein (N) and glycoprotein (G) genes were sequenced for real-time RT-PCR positive and negative samples. Primer and probe binding regions in the N gene were checked for mismatches, and phylogenetic analyses were done to determine the molecular epidemiology of these viruses. New primers and probe were designed and tested on the previously real-time RT-PCR negative samples. N gene sequences revealed 3 different mismatches in the probe target site of PCR negative, IFAT positive viruses. The primer target sites had no mismatches. Phylogenetic analysis of N and G genes showed that real-time RT-PCR positive and negative samples fell into distinct clades. The newly designed primer-probe pair improved detection and recovered previously PCR-undetectable viruses. An emerging RSV-B variant is undetectable by a quite widely used real-time RT-PCR assay due to polymorphisms that influence probe hybridization, affecting PCR accuracy. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
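
    Checking a probe binding region for mismatches, as described above, amounts to comparing the probe with its target site base by base. The following is a minimal sketch under stated assumptions: probe and template are given on the same strand and orientation, there are no degenerate bases or gaps, and the function names and toy sequences are hypothetical rather than the assay's real oligonucleotides.

```python
def mismatch_positions(probe, site):
    """Return 0-based probe positions that differ from the target site."""
    if len(probe) != len(site):
        raise ValueError("probe and site must have the same length")
    pairs = zip(probe.upper(), site.upper())
    return [i for i, (p, s) in enumerate(pairs) if p != s]


def best_site(probe, template):
    """Slide the probe along the template; return (start, mismatch positions)
    for the window with the fewest mismatches (first one on ties)."""
    if len(probe) > len(template):
        raise ValueError("template is shorter than the probe")
    windows = (
        (start, mismatch_positions(probe, template[start:start + len(probe)]))
        for start in range(len(template) - len(probe) + 1)
    )
    return min(windows, key=lambda w: len(w[1]))


# Toy probe and template, invented for illustration.
start, mismatches = best_site("ACGTTGCA", "GGACGATGCATT")
print(start, mismatches)
```

    Reporting the positions, not just the count, is useful because a mismatch's location within the probe can matter for hybridization.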

  1. Cryptococcus neoformans var. grubii: Separate Varietal Status for Cryptococcus neoformans Serotype A Isolates

    PubMed Central

    Franzot, Sarah P.; Salkin, Ira F.; Casadevall, Arturo

    1999-01-01

    Cryptococcus neoformans var. neoformans presently includes isolates which have been determined by the immunologic reactivity of their capsular polysaccharides to be serotype A and those which have been determined to be serotype D. However, recent analyses of the URA5 sequences and DNA fingerprinting patterns suggest significant genetic differences between the two serotypes. Therefore, we propose to recognize these genotypic distinctions, as well as previously reported phenotypic differences, by restricting C. neoformans var. neoformans to isolates which are serotype D and describing a new variety, C. neoformans var. grubii, for serotype A isolates. PMID:9986871

  2. A Comprehensive Overview of the Duvernay Induced Seismicity near Fox Creek, Alberta

    NASA Astrophysics Data System (ADS)

    Schultz, R.; Wang, R.; Gu, Y. J.; Haug, K.; Atkinson, G. M.

    2016-12-01

    In this work we summarize the current state of understanding regarding the induced seismicity related to Duvernay hydraulic fracturing operations in central Alberta, near the town of Fox Creek. Earthquakes in this region cluster into distinct sequences in time, space, and focal mechanism. To corroborate this point, we use cross-correlation detection methods to delineate transient temporal relationships, double-difference relocations to confirm spatial clustering, and moment tensor determinations to show fault motion consistency. The spatiotemporal clustering of sequences is strongly related to nearby hydraulic fracturing operations. In addition, we identify a strong preference for subvertical strike-slip motion with a roughly 45° P-axis orientation, consistent with ambient stress field considerations. The hypocentral geometry in two red traffic light protocol cases that are robustly constrained by local array data provides compelling evidence for planar features starting at Duvernay Formation depths and extending into the shallow Precambrian basement. We interpret these features as faults orientated approximately north-south and subvertically, consistent with moment tensor determinations. Finally, we conclude that the primary sequences are best explained as induced events in response to effective stress changes as a result of pore-pressure increase along previously existing faults due to hydraulic fracturing stimulations.

  3. The right half of the Escherichia coli replication origin is not essential for viability, but facilitates multi-forked replication

    PubMed Central

    Stepankiw, Nicholas; Kaidow, Akihiro; Boye, Erik; Bates, David

    2010-01-01

    Replication initiation is a key event in the cell cycle of all organisms and oriC, the replication origin in Escherichia coli, serves as the prototypical model for this process. The minimal sequence required for oriC function was originally determined entirely from plasmid studies using cloned origin fragments, which have previously been shown to differ dramatically in sequence requirement from the chromosome. Using an in vivo recombineering strategy to exchange wt oriCs for mutated ones regardless of whether they are functional origins or not, we have determined the minimal origin sequence that will support chromosome replication. Nearly the entire right half of oriC could be deleted without loss of origin function, demanding a reassessment of existing models for initiation. Cells carrying the new DnaA box-depleted 163 bp minimal oriC exhibited little or no loss of fitness under slow-growth conditions, but were sensitive to rich medium, suggesting that the dense packing of initiator binding sites that is a hallmark of prokaryotic origins has likely evolved to support the increased demands of multi-forked replication. PMID:19737351

  4. Novel Virus Discovery and Genome Reconstruction from Field RNA Samples Reveals Highly Divergent Viruses in Dipteran Hosts

    PubMed Central

    Bass, David; Moureau, Gregory; Tang, Shuoya; McAlister, Erica; Culverwell, C. Lorna; Glücksman, Edvard; Wang, Hui; Brown, T. David K.; Gould, Ernest A.; Harbach, Ralph E.; de Lamballerie, Xavier; Firth, Andrew E.

    2013-01-01

    We investigated whether small RNA (sRNA) sequenced from field-collected mosquitoes and chironomids (Diptera) can be used as a proxy signature of viral prevalence within a range of species and viral groups, using sRNAs sequenced from wild-caught specimens, to inform total RNA deep sequencing of samples of particular interest. Using this strategy, we sequenced from adult Anopheles maculipennis s.l. mosquitoes the apparently nearly complete genome of one previously undescribed virus related to chronic bee paralysis virus, and, from a pool of Ochlerotatus caspius and Oc. detritus mosquitoes, a nearly complete entomobirnavirus genome. We also reconstructed long sequences (1503-6557 nt) related to at least nine other viruses. Crucially, several of the sequences detected were reconstructed from host organisms highly divergent from those in which related viruses have been previously isolated or discovered. It is clear that viral transmission and maintenance cycles in nature are likely to be significantly more complex and taxonomically diverse than previously expected. PMID:24260463

  5. A single determinant dominates the rate of yeast protein evolution.

    PubMed

    Drummond, D Allan; Raval, Alpan; Wilke, Claus O

    2006-02-01

    A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.

  6. Determination of the fundamental properties of an M31 globular cluster from main-sequence photometry

    NASA Astrophysics Data System (ADS)

    Ma, Jun

    2013-02-01

    We determined the age of the M31 globular cluster B379 using isochrones of the Padova stellar evolutionary models. At the same time, the cluster's metal abundance, its distance modulus, and reddening value were also obtained. The results obtained in this paper are consistent with previous determinations, including the age. Brown et al. constrained the age of B379 by comparing its color-magnitude diagram with isochrones of the 2006 VandenBerg models. Therefore, this paper confirms the consistency of the age scale of B379 between the Padova isochrones and the 2006 VandenBerg isochrones. The results of B379 obtained in this paper are: metallicity [M/H] = log(Z/Z⊙) = -0.325 dex, age τ = 11.0 +/- 1.5 Gyr, reddening E(B - V) = 0.08 mag, and distance modulus (m - M)_0 = 24.44 +/- 0.10 mag. Using the metallicity, the reddening value and the distance modulus obtained in this paper, we constrained the age of B379 by comparing its multicolor photometry with theoretical stellar population synthesis models. The age of B379 obtained is 10.6 (+0.92/-0.76) Gyr, which is in very good agreement with the determination from main-sequence photometry.
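
    For readers who want the quoted quantities in more familiar units, the standard relation d = 10^((m - M)_0 / 5 + 1) pc converts the reddening-corrected distance modulus to a distance, and Z/Z⊙ = 10^[M/H] converts the metallicity to a linear ratio. The sketch below applies only these textbook conversions to the abstract's numbers; the function names are hypothetical, and the derived values (a distance of roughly 770 kpc and a metal fraction a little under half solar) are not stated in the abstract itself.

```python
def distance_parsecs(distance_modulus):
    """Distance in parsecs from a reddening-corrected distance modulus."""
    return 10 ** (distance_modulus / 5 + 1)


def metal_fraction_solar(m_over_h):
    """Linear metal abundance relative to solar from [M/H] in dex."""
    return 10 ** m_over_h


mu, mu_err = 24.44, 0.10  # (m - M)_0 and its uncertainty from the abstract
d_kpc = distance_parsecs(mu) / 1e3
d_low_kpc = distance_parsecs(mu - mu_err) / 1e3
d_high_kpc = distance_parsecs(mu + mu_err) / 1e3
print(f"distance: {d_kpc:.0f} kpc (range {d_low_kpc:.0f} to {d_high_kpc:.0f} kpc)")

z_ratio = metal_fraction_solar(-0.325)  # [M/H] from the abstract
print(f"Z / Z_sun: {z_ratio:.3f}")
```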

  7. Phylogenetic diversity of hpnP, the hopanoid methylase, and its implications for 2-methylhopanoids as biomarkers

    NASA Astrophysics Data System (ADS)

    Ricci, J. N.; Coleman, M. L.; Osburn, M. R.; Sessions, A. L.; Spear, J. R.; Newman, D. K.

    2011-12-01

    Hopanoids are a class of sterols produced by bacteria. Their hydrocarbon skeletons are resistant to degradation, making their diagenetic products, hopanes, attractive biomarkers. Particular attention has been paid to 2-methylhopanes, which have been found at discrete times and locations in Earth history as far back as 2,500 Myr. Previously, they were inferred to be markers of oxygenic photosynthesis in cyanobacteria, but the discovery of an anoxygenic phototroph, Rhodopseudomonas palustris TIE-1, capable of producing significant quantities of 2-methylbacteriohopanetetrol, the parent molecule of the fossil 2-methylhopane, challenged this interpretation. In this study, we sought to determine the diversity and origin of the enzyme responsible for methylating hopanoids, HpnP. To accomplish this task, we surveyed a diversity of Yellowstone hot springs using degenerate PCR primers and searched publicly available metagenomic databases for hpnP-like sequences. The Yellowstone hot spring samples were dominated by cyanobacterial-like hpnP sequences, while the metagenomic data contained many hpnP-like sequences from a diversity of environments that grouped with all known hpnP-containing phyla. With these additional hpnP sequences, we will report updated phylogenetic trees that attempt to determine the origin of hpnP. Understanding the distribution of 2-methylhopanoid production throughout the tree of life and its origin is important to be able to use 2-methylhopanes as biomarkers for any particular taxonomic group.

  8. Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.

    PubMed

    Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Moult, John

    2017-09-01

    The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area. © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc.

  9. Analysis of sequence variation among smeDEF multi drug efflux pump genes and flanking DNA from defined 16S rRNA subgroups of clinical Stenotrophomonas maltophilia isolates.

    PubMed

    Gould, Virginia C; Okazaki, Aki; Howe, Robin A; Avison, Matthew B

    2004-08-01

    To determine the level of variation in the smeDEF efflux pump and smeT transcriptional regulator genes among three defined 16S rRNA sequence subgroups of clinical Stenotrophomonas maltophilia isolates. smeDEF sequencing used a PCR genome walking approach. Determination of the sequence surrounding smeDEF used a flanking primer PCR method and specific primers anchored in smeD or smeF together with random primers. smeDEF is chromosomal and located in the same position in the chromosome in all three subgroups of isolates. Flanking smeD is a gene, smeT, encoding a putative transcriptional repressor for smeDEF. Variation at these loci among the isolates is considerably lower (up to 10%) than at intrinsic beta-lactamase loci (up to 30%) in the same isolates, implying greater functional constraint. The smeD-smeT intergenic region contains a highly conserved section, which maps with previously predicted promoter/operator regions, and a hypervariable untranslated region, which can be used to subgroup clinical isolates. These data provide further evidence that it is possible to group clinical isolates of the inherently variable species, S. maltophilia, based on genotypic properties. Isolate D457, in which most work concerning smeDEF expression has been performed, does not fall into S. maltophilia subgroup A, which is the most typical.

  10. Effects of a Transposable Element Insertion on Alcohol Dehydrogenase Expression in Drosophila Melanogaster

    PubMed Central

    Dunn, R. C.; Laurie, C. C.

    1995-01-01

    Variation in the DNA sequence and level of alcohol dehydrogenase (Adh) gene expression in Drosophila melanogaster have been studied to determine what types of DNA polymorphisms contribute to phenotypic variation in natural populations. The Adh gene, like many others, shows a high level of variability in both DNA sequence and quantitative level of expression. A number of transposable element insertions occur in the Adh region and one of these, a copia insertion in the 5' flanking region, is associated with unusually low Adh expression. To determine whether this insertion (called RI42) causes the low expression level, the insertion was excised from the cloned RI42 Adh gene and the effect was assessed by P-element transformation. Removal of this insertion causes a threefold increase in the level of ADH, clearly showing that it contributes to the naturally occurring variation in expression at this locus. Removal of all but one LTR also causes a threefold increase, indicating that the mechanism is not a simple sequence disruption. Furthermore, this copia insertion, which is located between the two Adh promoters and their upstream enhancer sequences, has differential effects on the levels of proximal and distal transcripts. Finally, a test for the possible modifying effects of two suppressor loci, su(w(a)) and su(f), on this insertional mutation was negative, in contrast to a previous report in the literature. PMID:7498745

  11. RNAi screen for rapid therapeutic target identification in leukemia patients

    PubMed Central

    Tyner, Jeffrey W.; Deininger, Michael W.; Loriaux, Marc M.; Chang, Bill H.; Gotlib, Jason R.; Willis, Stephanie G.; Erickson, Heidi; Kovacsovics, Tibor; O'Hare, Thomas; Heinrich, Michael C.; Druker, Brian J.

    2009-01-01

    Targeted therapy has vastly improved outcomes in certain types of cancer. Extension of this paradigm across a broad spectrum of malignancies will require an efficient method to determine the molecular vulnerabilities of cancerous cells. Improvements in sequencing technology will soon enable high-throughput sequencing of entire genomes of cancer patients; however, determining the relevance of identified sequence variants will require complementary functional analyses. Here, we report an RNAi-assisted protein target identification (RAPID) technology that individually assesses targeting of each member of the tyrosine kinase gene family. We demonstrate that RAPID screening of primary leukemia cells from 30 patients identifies targets that are critical to survival of the malignant cells from 10 of these individuals. We identify known, activating mutations in JAK2 and K-RAS, as well as patient-specific sensitivity to down-regulation of FLT1, CSF1R, PDGFR, ROR1, EPHA4/5, JAK1/3, LMTK3, LYN, FYN, PTK2B, and N-RAS. We also describe a previously undescribed, somatic, activating mutation in the thrombopoietin receptor that is sensitive to down-stream pharmacologic inhibition. Hence, the RAPID technique can quickly identify molecular vulnerabilities in malignant cells. Combination of this technique with whole-genome sequencing will represent an ideal tool for oncogenic target identification such that specific therapies can be matched with individual patients. PMID:19433805

  12. Towards the Rational Design of a Candidate Vaccine against Pregnancy Associated Malaria: Conserved Sequences of the DBL6ε Domain of VAR2CSA

    PubMed Central

    Badaut, Cyril; Bertin, Gwladys; Rustico, Tatiana; Fievet, Nadine; Massougbodji, Achille; Gaye, Alioune; Deloron, Philippe

    2010-01-01

    Background Placental malaria is a disease linked to the sequestration of Plasmodium falciparum infected red blood cells (IRBC) in the placenta, leading to reduced materno-fetal exchanges and to local inflammation. One of the virulence factors of P. falciparum involved in cytoadherence to chondroitin sulfate A, its placental receptor, is the adhesive protein VAR2CSA. Its localisation on the surface of IRBC makes it accessible to the immune system. VAR2CSA contains six DBL domains. The DBL6ε domain is the most variable. High variability constitutes a means for the parasite to evade the host immune response. The DBL6ε domain could constitute a very attractive basis for a vaccine candidate but its reported variability necessitates, for antigenic characterisations, identifying and classifying commonalities across isolates. Methodology/Principal Findings Local alignment analysis of the DBL6ε domain revealed that it is not as variable as previously described. Variability is concentrated in seven regions present on the surface of the DBL6ε domain. The main goal of our work is to classify and group variable sequences that will simplify further research to determine dominant epitopes. Firstly, variable sequences were grouped following their average percent pairwise identity (APPI). Groups comprising many variable sequences sharing low variability were found. Secondly, ELISA experiments following the IgG recognition of a recombinant DBL6ε domain, and of peptides mimicking its seven variable blocks, allowed us to determine an APPI cut-off and to isolate groups represented by a single consensus sequence. Conclusions/Significance A new sequence approach is used to compare variable regions in sequences that have extensive segmental gene relationship. Using this approach, the VAR2CSA DBL6 domain is composed of 7 variable blocks with limited polymorphism. Each variable block is composed of a limited number of consensus types. Based on peptide-based ELISA, variable blocks with 85% or greater sequence identity are expected to be recognized equally well by antibody and can be considered the same consensus type. Therefore, the analysis of the antibody response against the classified small number of sequences should be helpful to determine epitopes. PMID:20585655
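
    The grouping idea in the abstract above (average percent pairwise identity with an 85% cut-off) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the toy peptide block, and the gap-handling rule (columns where both sequences carry a gap are skipped) are all assumptions made for illustration, and the sequences are assumed to be pre-aligned to equal length.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent of compared alignment columns in which a and b are identical.

    Assumes a and b are already aligned to the same length. Columns where
    both sequences have a gap ('-') are skipped (an illustrative choice).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def average_pairwise_identity(seqs) -> float:
    """Mean percent identity over all unordered pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        return 100.0  # a single sequence is trivially identical to itself
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


# Hypothetical aligned variable block (toy data, not from the study).
block = ["KDGTLNEKCD", "KDGTLNEKCE", "KDGSLNEKCD"]
appi = average_pairwise_identity(block)

# The 85% threshold is the cut-off reported in the abstract.
same_consensus_type = appi >= 85.0
```

    In this toy block the three pairwise identities are 90%, 90%, and 80%, so the average sits just above the 85% cut-off and the block would be treated as one consensus type under the stated assumptions.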

  13. Complete genomic sequence of an infectious pancreatic necrosis virus isolated from rainbow trout (Oncorhynchus mykiss) in China.

    PubMed

    Ji, Feng; Zhao, Jing-Zhuang; Liu, Miao; Lu, Tong-Yan; Liu, Hong-Bai; Yin, Jiasheng; Xu, Li-Ming

    2017-04-01

    Infectious pancreatic necrosis (IPN) is a significant disease of farmed salmonids resulting in direct economic losses due to high mortality in China. However, no gene sequence of any Chinese infectious pancreatic necrosis virus (IPNV) isolates was available. In this study, moribund rainbow trout fry samples were collected during an outbreak of IPN in Yunnan province of southwest China in 2013. An IPNV was isolated and tentatively named ChRtm213. We determined the full genome sequence of the IPNV ChRtm213 and compared it with previously identified IPNV sequences worldwide. The sequences of different structural and non-structural protein genes were compared to those of other aquatic birnaviruses sequenced to date. The results indicated that the complete genome sequence of ChRtm213 strain contains a segment A (3099 nucleotides) encoding a polyprotein VP2-VP4-VP3, and a segment B (2789 nucleotides) encoding an RNA-dependent RNA polymerase VP1. The phylogenetic analyses showed that ChRtm213 strain fell within genogroup 1, serotype A9 (Jasper), having similarities of 96.3% (segment A) and 97.3% (segment B) with the IPNV strain AM98 from Japan. The results suggest that the Chinese IPNV isolate has a relatively closer relationship with Japanese IPNV strains. The sequence of ChRtm213 was the first gene sequence of IPNV isolates in China. This study provided a robust reference for diagnosis and/or control of IPNV prevalent in China.

  14. Role of Double-Strand Break End-Tethering during Gene Conversion in Saccharomyces cerevisiae

    PubMed Central

    Haber, James E.

    2016-01-01

    Correct repair of DNA double-strand breaks (DSBs) is critical for maintaining genome stability. Whereas gene conversion (GC)-mediated repair is mostly error-free, repair by break-induced replication (BIR) is associated with non-reciprocal translocations and loss of heterozygosity. We have previously shown that a Recombination Execution Checkpoint (REC) mediates this competition by preventing the BIR pathway from acting on DSBs that can be repaired by GC. Here, we asked if the REC can also determine whether the ends that are engaged in a GC-compatible configuration belong to the same break, since repair involving ends from different breaks will produce potentially deleterious translocations. We report that the kinetics of repair are markedly delayed when the two DSB ends that participate in GC belong to different DSBs (termed Trans) compared to the case when both DSB ends come from the same break (Cis). However, repair in Trans still occurs by GC rather than BIR, and the overall efficiency of repair is comparable. Hence, the REC is not sensitive to the “origin” of the DSB ends. When the homologous ends for GC are in Trans, the delay in repair appears to reflect their tethering to sequences on the other side of the DSB that themselves recombine with other genomic locations with which they share sequence homology. These data support previous observations that the two ends of a DSB are usually tethered to each other and that this tethering facilitates both ends encountering the same donor sequence. We also found that the presence of homeologous/repetitive sequences in the vicinity of a DSB can distract the DSB end from finding its bona fide homologous donor, and that inhibition of GC by such homeologous sequences is markedly increased upon deleting Sgs1 but not Msh6. PMID:27074148

  15. Sequential effects in judgements of attractiveness: the influences of face race and sex.

    PubMed

    Kramer, Robin S S; Jones, Alex L; Sharma, Dinkar

    2013-01-01

    In perceptual decision-making, a person's response on a given trial is influenced by their response on the immediately preceding trial. This sequential effect was initially demonstrated in psychophysical tasks, but has now been found in more complex, real-world judgements. The similarity of the current and previous stimuli determines the nature of the effect, with more similar items producing assimilation in judgements, while less similarity can cause a contrast effect. Previous research found assimilation in ratings of facial attractiveness, and here, we investigated whether this effect is influenced by the social categories of the faces presented. Over three experiments, participants rated the attractiveness of own- (White) and other-race (Chinese) faces of both sexes that appeared successively. Through blocking trials by race (Experiment 1), sex (Experiment 2), or both dimensions (Experiment 3), we could examine how sequential judgements were altered by the salience of different social categories in face sequences. For sequences that varied in sex alone, own-race faces showed significantly less opposite-sex assimilation (male and female faces perceived as dissimilar), while other-race faces showed equal assimilation for opposite- and same-sex sequences (male and female faces were not differentiated). For sequences that varied in race alone, categorisation by race resulted in no opposite-race assimilation for either sex of face (White and Chinese faces perceived as dissimilar). For sequences that varied in both race and sex, same-category assimilation was significantly greater than opposite-category. Our results suggest that the race of a face represents a superordinate category relative to sex. These findings demonstrate the importance of social categories when considering sequential judgements of faces, and also highlight a novel approach for investigating how multiple social dimensions interact during decision-making.

  16. Mutation analysis in 129 genes associated with other forms of retinal dystrophy in 157 families with retinitis pigmentosa based on exome sequencing.

    PubMed

    Xu, Yan; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong

    2015-01-01

    Mutations in 60 known genes were previously identified by exome sequencing in 79 of 157 families with retinitis pigmentosa (RP). This study analyzed variants in 129 genes associated with other forms of hereditary retinal dystrophy in the same cohort. Apart from the 73 genes previously analyzed, a further 129 genes responsible for other forms of hereditary retinal dystrophy were selected based on RetNet. Variants in the 129 genes determined by whole exome sequencing were selected and filtered by bioinformatics analysis. Candidate variants were confirmed by Sanger sequencing and validated by analysis of available family members and controls. A total of 90 candidate variants were present in the 129 genes. Sanger sequencing confirmed 83 of the 90 variants. Analysis of family members and controls excluded 76 of these 83 variants. The remaining seven variants were considered to be potential pathogenic mutations; these were c.899A>G, c.1814C>G, and c.2107C>T in BBS2; c.1073C>T and c.1669C>T in INPP5E; and c.3582C>G and c.5704-5C>G in CACNA1F. Six of these seven mutations were novel. The mutations were detected in five unrelated patients without a family history, including three patients with homozygous or compound heterozygous mutations in BBS2 and INPP5E, and two patients with hemizygous mutations in CACNA1F. None of the patients had mutations in the genes associated with autosomal dominant retinal dystrophy. Only a small portion of patients with RP, about 3% (5/157), had causative mutations in the 129 genes associated with other forms of hereditary retinal dystrophy.

  17. Sequential Effects in Judgements of Attractiveness: The Influences of Face Race and Sex

    PubMed Central

    Kramer, Robin S. S.; Jones, Alex L.; Sharma, Dinkar

    2013-01-01

    In perceptual decision-making, a person’s response on a given trial is influenced by their response on the immediately preceding trial. This sequential effect was initially demonstrated in psychophysical tasks, but has now been found in more complex, real-world judgements. The similarity of the current and previous stimuli determines the nature of the effect, with more similar items producing assimilation in judgements, while less similarity can cause a contrast effect. Previous research found assimilation in ratings of facial attractiveness, and here, we investigated whether this effect is influenced by the social categories of the faces presented. Over three experiments, participants rated the attractiveness of own- (White) and other-race (Chinese) faces of both sexes that appeared successively. Through blocking trials by race (Experiment 1), sex (Experiment 2), or both dimensions (Experiment 3), we could examine how sequential judgements were altered by the salience of different social categories in face sequences. For sequences that varied in sex alone, own-race faces showed significantly less opposite-sex assimilation (male and female faces perceived as dissimilar), while other-race faces showed equal assimilation for opposite- and same-sex sequences (male and female faces were not differentiated). For sequences that varied in race alone, categorisation by race resulted in no opposite-race assimilation for either sex of face (White and Chinese faces perceived as dissimilar). For sequences that varied in both race and sex, same-category assimilation was significantly greater than opposite-category. Our results suggest that the race of a face represents a superordinate category relative to sex. These findings demonstrate the importance of social categories when considering sequential judgements of faces, and also highlight a novel approach for investigating how multiple social dimensions interact during decision-making. PMID:24349226

  18. Characterization of the Bacillus stearothermophilus manganese superoxide dismutase gene and its ability to complement copper/zinc superoxide dismutase deficiency in Saccharomyces cerevisiae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowler, C.; Inze, D.; Van Camp, W.

    1990-03-01

    Recombinant clones containing the manganese superoxide dismutase (MnSOD) gene of Bacillus stearothermophilus were isolated with an oligonucleotide probe designed to match a part of the previously determined amino acid sequence. Complementation analyses, performed by introducing each plasmid into a superoxide dismutase-deficient mutant of Escherichia coli, allowed us to define the region of DNA which encodes the MnSOD structural gene and to identify a promoter region immediately upstream from the gene. These data were subsequently confirmed by DNA sequencing. Since MnSOD is normally restricted to the mitochondria in eucaryotes, we were interested (i) in determining whether B. stearothermophilus MnSOD could function in eucaryotic cytosol and (ii) in determining whether MnSOD could replace the structurally unrelated copper/zinc superoxide dismutase (Cu/ZnSOD) which is normally found there. To test this, the sequence encoding bacterial MnSOD was cloned into a yeast expression vector and subsequently introduced into a Cu/ZnSOD-deficient mutant of the yeast Saccharomyces cerevisiae. Functional expression of the protein was demonstrated, and complementation tests revealed that the protein was able to provide tolerance at wild-type levels to conditions which are normally restrictive for this mutant. Thus, in spite of the evolutionary unrelatedness of these two enzymes, Cu/ZnSOD can be functionally replaced by MnSOD in yeast cytosol.

  19. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    PubMed

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing of messenger RNA. Finally, there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5' noncoding portions of these glycolytic genes.

  20. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

    PubMed

    Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

    2017-01-01

    The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.

  1. Spatialization in working memory is related to literacy and reading direction: Culture "literarily" directs our thoughts.

    PubMed

    Guida, Alessandro; Megreya, Ahmed M; Lavielle-Guida, Magali; Noël, Yvonnick; Mathy, Fabien; van Dijck, Jean-Philippe; Abrahamse, Elger

    2018-06-01

    The ability to maintain arbitrary sequences of items in the mind contributes to major cognitive faculties, such as language, reasoning, and episodic memory. Previous research suggests that serial order working memory is grounded in the brain's spatial attention system. In the present study, we show that the spatially defined mental organization of novel item sequences is related to literacy and varies as a function of reading/writing direction. Specifically, three groups (left-to-right Western readers, right-to-left Arabic readers, and Arabic-speaking illiterates) were asked to memorize random (and non-spatial) sequences of color patches and determine whether a subsequent probe was part of the memorized sequence (e.g., press left key) or not (e.g., press right key). The results showed that Western readers mentally organized the sequences from left to right, Arabic readers spontaneously used the opposite direction, and Arabic-speaking illiterates showed no systematic spatial organization. This finding suggests that cultural conventions shape one of the most "fluid" aspects of human cognition, namely, the spontaneous mental organization of novel non-spatial information. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. Plastome Sequences of Lygodium japonicum and Marsilea crenata Reveal the Genome Organization Transformation from Basal Ferns to Core Leptosporangiates

    PubMed Central

    Gao, Lei; Wang, Bo; Wang, Zhi-Wei; Zhou, Yuan; Su, Ying-Juan; Wang, Ting

    2013-01-01

    Previous studies have shown that core leptosporangiates, the most species-rich group of extant ferns (monilophytes), have a distinct plastid genome (plastome) organization pattern from basal fern lineages. However, the details of genome structure transformation from ancestral ferns to core leptosporangiates remain unclear because of limited plastome data available. Here, we have determined the complete chloroplast genome sequences of Lygodium japonicum (Lygodiaceae), a member of schizaeoid ferns (Schizaeales), and Marsilea crenata (Marsileaceae), a representative of heterosporous ferns (Salviniales). The two species represent the sister and the basal lineages of core leptosporangiates, respectively, for which the plastome sequences are currently unavailable. Comparative genomic analysis of all sequenced fern plastomes reveals that the gene order of L. japonicum plastome occupies an intermediate position between that of basal ferns and core leptosporangiates. The two exons of the fern ndhB gene have a unique pattern of intragenic copy number variances. Specifically, the substitution rate heterogeneity between the two exons is congruent with their copy number changes, confirming the constraint role that inverted repeats may play on the substitution rate of chloroplast gene sequences. PMID:23821521

  3. Characterization and mapping of the human rhodopsin kinase gene and screening of the gene for mutations in patients with retinitis pigmentosa

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Khani, S.C.; Lin, D.; Magovcevic, I.

    1994-09-01

    Rhodopsin kinase (RK) is a cytosolic enzyme in rod photoreceptors that initiates the deactivation of the phototransduction cascade by phosphorylating photoactivated rhodopsin. Although the cDNA sequence of bovine RK has been determined previously, no human cDNA or genomic sequence has thus far been available for genetic studies. In order to investigate the possible role of this candidate gene in retinitis pigmentosa (RP) and allied diseases, we have isolated and characterized human cDNA and genomic clones derived from the RK locus. The coding sequence of the human gene is 1692 nucleotides in length and is split into seven exons. The human and the bovine sequence show 84% identity at the nucleotide level and 92% identity at the amino acid level. Thus far, the intronic sequences flanking each exon except for one have been determined. We have also mapped the human RK gene to chromosome 13q34 using fluorescence in situ hybridization. To our knowledge, no RP gene has as yet been linked to this region. However, since the substrate for RK (rhodopsin) and other members of the phototransduction cascade have been implicated in the pathogenesis of RP, it is conceivable that defects in RK can also cause some forms of this disease. We are evaluating this possibility by screening DNA from 173 patients with autosomal recessive RP and 190 patients with autosomal dominant RP. So far, we have found 11 patients with variant bands. In one patient with autosomal dominant RP we discovered the missense change Ser536Leu. Cosegregation studies and further sequencing of the variant bands are currently underway.

  4. Risk of breast cancer with CXCR4-using HIV defined by V3 loop sequencing.

    PubMed

    Goedert, James J; Swenson, Luke C; Napolitano, Laura A; Haddad, Mojgan; Anastos, Kathryn; Minkoff, Howard; Young, Mary; Levine, Alexandra; Adeyemi, Oluwatoyin; Seaberg, Eric C; Aouizerat, Bradley; Rabkin, Charles S; Harrigan, P Richard; Hessol, Nancy A

    2015-01-01

    Evaluate the risk of female breast cancer associated with HIV-CXCR4 (X4) tropism as determined by various genotypic measures. A breast cancer case-control study, with pairwise comparisons of tropism determination methods, was conducted. From the Women's Interagency HIV Study repository, one stored plasma specimen was selected from 25 HIV-infected cases near the breast cancer diagnosis date and 75 HIV-infected control women matched for age and calendar date. HIV-gp120 V3 sequences were derived by Sanger population sequencing (PS) and 454-pyro deep sequencing (DS). Sequencing-based HIV-X4 tropism was defined using the geno2pheno algorithm, with both high-stringency DS [false-positive rate (3.5) and 2% X4 cutoff], and lower stringency DS (false-positive rate, 5.75 and 15% X4 cutoff). Concordance of tropism results by PS, DS, and previously performed phenotyping was assessed with kappa (κ) statistics. Case-control comparisons used exact P values and conditional logistic regression. In 74 women (19 cases, 55 controls) with complete results, prevalence of HIV-X4 by PS was 5% in cases vs 29% in controls (P = 0.06; odds ratio, 0.14; confidence interval: 0.003 to 1.03). Smaller case-control prevalence differences were found with high-stringency DS (21% vs 36%, P = 0.32), lower stringency DS (16% vs 35%, P = 0.18), and phenotyping (11% vs 31%, P = 0.10). HIV-X4 tropism concordance was best between PS and lower stringency DS (93%, κ = 0.83). Other pairwise concordances were 82%-92% (κ = 0.56-0.81). Concordance was similar among cases and controls. HIV-X4 defined by population sequencing (PS) had good agreement with lower stringency DS and was significantly associated with lower odds of breast cancer.
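
    The concordance figures in the abstract above pair a raw percent agreement with a kappa (κ) statistic. A minimal sketch of Cohen's kappa for two sets of categorical calls follows; the function name and the toy X4/R5 calls are hypothetical and are not data from the study. Kappa corrects the observed agreement for the agreement expected by chance from each method's label frequencies.

```python
def cohen_kappa(calls_a, calls_b) -> float:
    """Cohen's kappa for two equal-length lists of categorical calls."""
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("need two non-empty lists of equal length")
    n = len(calls_a)
    # Observed agreement: fraction of samples given the same label by both methods.
    observed = sum(a == b for a, b in zip(calls_a, calls_b)) / n
    # Chance agreement: product of the two marginal frequencies, summed over labels.
    labels = set(calls_a) | set(calls_b)
    expected = sum(
        (calls_a.count(label) / n) * (calls_b.count(label) / n) for label in labels
    )
    if expected == 1.0:
        return 1.0  # both methods always give the same single label
    return (observed - expected) / (1.0 - expected)


# Toy tropism calls for ten specimens from two hypothetical methods.
method_1 = ["X4", "X4", "X4", "R5", "R5", "R5", "R5", "R5", "R5", "R5"]
method_2 = ["X4", "X4", "R5", "R5", "R5", "R5", "R5", "R5", "R5", "R5"]

agreement = sum(a == b for a, b in zip(method_1, method_2)) / len(method_1)
kappa = cohen_kappa(method_1, method_2)
```

    With these toy calls the two methods agree on 9 of 10 specimens, while the chance-corrected kappa is noticeably lower than the raw agreement, which is why the abstract reports both numbers.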

  5. Arginine kinase from the Tardigrade, Macrobiotus occidentalis: molecular cloning, phylogenetic analysis and enzymatic properties.

    PubMed

    Uda, Kouji; Ishida, Mikako; Matsui, Tohru; Suzuki, Tomohiko

    2010-10-01

    Arginine kinase (AK), which catalyzes the reversible transfer of phosphate from ATP to arginine to yield phosphoarginine and ADP, is widely distributed throughout the invertebrates. We determined the cDNA sequence of AK from the tardigrade (water bear) Macrobiotus occidentalis, cloned the sequence into pET30b plasmid, and expressed it in Escherichia coli as a 6x His-tag-fused protein. The cDNA is 1377 bp, has an open reading frame of 1080 bp, and has 5′- and 3′-untranslated regions of 116 and 297 bp, respectively. The open reading frame encodes a 359-amino acid protein containing the 12 residues considered necessary for substrate binding in Limulus AK. This is the first AK sequence from a tardigrade. From fragmented and non-annotated sequences available from DNA databases, we assembled 46 complete AK sequences: 26 from arthropods (including 19 from Insecta), 11 from nematodes, 4 from mollusks, 2 from cnidarians and 2 from onychophorans. No onychophoran sequences have been reported previously. The phylogenetic trees of 104 AKs indicated clearly that Macrobiotus AK (from the phylum Tardigrada) shows close affinity with Epiperipatus and Euperipatoides AKs (from the phylum Onychophora), and therefore forms a sister group with the arthropod AKs. Recombinant 6x His-tagged Macrobiotus AK was successfully expressed as a soluble protein, and the kinetic constants (K(m), K(d), V(max) and k(cat)) were determined for the forward reaction. Comparison of these kinetic constants with those of AKs from other sources (arthropods, mollusks and nematodes) indicated that Macrobiotus AK is unique in that it has the highest values for k(cat) and K(d)/K(m) (indicative of synergistic substrate binding) of all characterized AKs.

  6. Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo

    PubMed Central

    Siepel, Adam; Lis, John T.

    2012-01-01

    DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity. PMID:22479205

  7. Mutations Affecting Expression of the rosy Locus in Drosophila melanogaster

    PubMed Central

    Lee, Chong Sung; Curtis, Daniel; McCarron, Margaret; Love, Carol; Gray, Mark; Bender, Welcome; Chovnick, Arthur

    1987-01-01

    The rosy locus in Drosophila melanogaster codes for the enzyme xanthine dehydrogenase (XDH). Previous studies defined a "control element" near the 5' end of the gene, where variant sites affected the amount of rosy mRNA and protein produced. We have determined the DNA sequence of this region from both genomic and cDNA clones, and from the ry+10 underproducer strain. This variant strain had many sequence differences, so that the site of the regulatory change could not be fixed. A mutagenesis was also undertaken to isolate new regulatory mutations. We induced 376 new mutations with 1-ethyl-1-nitrosourea (ENU) and screened them to isolate those that reduced the amount of XDH protein produced, but did not change the properties of the enzyme. Genetic mapping was used to find mutations located near the 5' end of the gene. DNA from each of seven mutants was cloned and sequenced through the 5' region. Mutant base changes were identified in all seven; they appear to affect splicing and translation of the rosy mRNA. In a related study (T. P. Keith et al. 1987), the genomic and cDNA sequences are extended through the 3' end of the gene; the combined sequences define the processing pattern of the rosy transcript and predict the amino acid sequence of XDH. PMID:3036645

  8. Application of sorting and next generation sequencing to study 5΄-UTR influence on translation efficiency in Escherichia coli

    PubMed Central

    Evfratov, Sergey A.; Osterman, Ilya A.; Komarova, Ekaterina S.; Pogorelskaya, Alexandra M.; Rubtsova, Maria P.; Zatsepin, Timofei S.; Semashko, Tatiana A.; Kostryukova, Elena S.; Mironov, Andrey A.; Burnaev, Evgeny; Krymova, Ekaterina; Gelfand, Mikhail S.; Govorun, Vadim M.; Bogdanov, Alexey A.; Dontsova, Olga A.

    2017-01-01

    Yield of protein per translated mRNA may vary by four orders of magnitude. Many studies analyzed the influence of mRNA features on the translation yield. However, a detailed understanding of how mRNA sequence determines its propensity to be translated is still missing. Here, we constructed a set of reporter plasmid libraries encoding CER fluorescent protein preceded by randomized 5΄ untranslated regions (5΄-UTR) and Red fluorescent protein (RFP) used as an internal control. Each library was transformed into Escherichia coli cells, separated by efficiency of CER mRNA translation by a cell sorter and subjected to next generation sequencing. We tested efficiency of translation of the CER gene preceded by each of 48 natural 5΄-UTR sequences and introduced random and designed mutations into natural and artificially selected 5΄-UTRs. Several distinct properties could be ascribed to a group of 5΄-UTRs most efficient in translation. In addition to known ones, several previously unrecognized features that contribute to the translation enhancement were found, such as low proportion of cytidine residues, multiple SD sequences and AG repeats. The latter could be identified as a translation enhancer, albeit less efficient than the SD sequence, in several natural 5΄-UTRs. PMID:27899632

  9. On the apparent positions of T Tauri stars in the H-R diagram

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kenyon, S.J.; Hartmann, L.W.

    1990-01-01

    The spread in apparent luminosities of T Tauri stars caused by occultation and emission from protostellar disks is investigated. A random distribution of disk inclination angles, coupled with a plausible range of accretion rates, introduces a significant scatter in apparent luminosities for intrinsically identical stars. The observed dispersion of luminosities for K7-M1 Hayashi track stars thought to have disks in Taurus-Auriga is similar to predictions of the simple accretion disk model, which suggests that age determinations for many pre-main-sequence stars are uncertain. The results also suggest that Stahler's birthline for convective track pre-main-sequence stars may be located at slightly lower luminosities than previously thought. 38 refs.

  10. Direct memory access transfer completion notification

    DOEpatents

    Chen, Dong; Giampapa, Mark E.; Heidelberger, Philip; Kumar, Sameer; Parker, Jeffrey J.; Steinmacher-Burow, Burkhard D.; Vranas, Pavlos

    2010-07-27

    Methods, compute nodes, and computer program products are provided for direct memory access (`DMA`) transfer completion notification. Embodiments include determining, by an origin DMA engine on an origin compute node, whether a data descriptor for an application message to be sent to a target compute node is currently in an injection first-in-first-out (`FIFO`) buffer in dependence upon a sequence number previously associated with the data descriptor, the total number of descriptors currently in the injection FIFO buffer, and the current sequence number for the newest data descriptor stored in the injection FIFO buffer; and notifying a processor core on the origin DMA engine that the message has been sent if the data descriptor for the message is not currently in the injection FIFO buffer.
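
    The membership test described above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes that sequence numbers increase by one per injected descriptor, never wrap around, and that the FIFO always holds a contiguous run of descriptors ending at the newest one. The function names are hypothetical.

```python
def descriptor_in_fifo(descriptor_seq: int, fifo_count: int, newest_seq: int) -> bool:
    """Return True if the descriptor is still queued in the injection FIFO.

    Assumption (not from the source): sequence numbers are consecutive and
    do not wrap, so the FIFO holds the range
    [newest_seq - fifo_count + 1, newest_seq].
    """
    if fifo_count <= 0:
        return False  # empty FIFO holds nothing
    oldest_seq = newest_seq - fifo_count + 1  # oldest descriptor still queued
    return oldest_seq <= descriptor_seq <= newest_seq


def message_sent(descriptor_seq: int, fifo_count: int, newest_seq: int) -> bool:
    """Treat the message as sent once its descriptor has left the FIFO.

    A sequence number above newest_seq has not been injected yet, so it is
    reported as not sent.
    """
    already_injected = descriptor_seq <= newest_seq
    return already_injected and not descriptor_in_fifo(
        descriptor_seq, fifo_count, newest_seq
    )
```

    With three descriptors queued and the newest numbered 10, descriptors 8 to 10 are still in the FIFO, so a message whose descriptor is numbered 7 would be reported as sent. Real hardware counters wrap around, which this sketch deliberately ignores.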

  11. Pseudomonas aeruginosa Genotype Prevalence in Dutch Cystic Fibrosis Patients and Age Dependency of Colonization by Various P. aeruginosa Sequence Types

    PubMed Central

    van Mansfeld, Rosa; Willems, Rob; Brimicombe, Roland; Heijerman, Harry; van Berkhout, Ferdinand Teding; Wolfs, Tom; van der Ent, Cornelis; Bonten, Marc

    2009-01-01

    The patient-to-patient transmission of highly prevalent Pseudomonas aeruginosa clones which are associated with enhanced disease progression has led to strict segregation policies for cystic fibrosis (CF) patients in many countries. However, little is known about the population structure of P. aeruginosa among CF patients. The aim of the present cross-sectional study was to determine the prevalence and genetic relatedness of P. aeruginosa isolates from CF patients who visited two major CF centers in The Netherlands in 2007 and 2008. These patients represented 45% of the Dutch CF population. P. aeruginosa carriage in the respiratory tract was determined by standard microbiological culture techniques, and all phenotypically different isolates in the first specimens recovered in 2007 and 2008 were genotyped by multilocus sequence typing. A total of 313 (57%) of 551 patients whose samples were cultured carried P. aeruginosa. Two sequence types (STs), ST406 and ST497, were found in 15% and 5% of the patients, respectively, and 60% of the patients harbored a strain that was also found in at least two other patients. The risk ratios for carrying ST406 and ST497 were 17.8 (95% confidence interval [CI], 7.2 to 43.6) for those aged between 15 and 24 years and 6 (95% CI, 1.4 to 26.1) for those aged >25 years. ST406 and ST497 were not genetically linked to previously described epidemic clones, which were also not found in this CF population. The population structure of P. aeruginosa in Dutch CF patients is characterized by the presence of two prevalent STs that are associated with certain age groups and that are not genetically linked to previously described epidemic clones. PMID:19828746

  12. Restriction site polymorphism-based candidate gene mapping for seedling drought tolerance in cowpea [Vigna unguiculata (L.) Walp.].

    PubMed

    Muchero, Wellington; Ehlers, Jeffrey D; Roberts, Philip A

    2010-02-01

    Quantitative trait loci (QTL) studies provide insight into the complexity of drought tolerance mechanisms. Molecular markers used in these studies also allow for marker-assisted selection (MAS) in breeding programs, enabling transfer of genetic factors between breeding lines without complete knowledge of their exact nature. However, potential for recombination between markers and target genes limits the utility of MAS-based strategies. Candidate gene mapping offers an alternative solution to identify trait determinants underlying QTL of interest. Here, we used restriction site polymorphisms to investigate co-location of candidate genes with QTL for seedling drought stress-induced premature senescence identified previously in cowpea. Genomic DNA isolated from 113 F(2:8) RILs of drought-tolerant IT93K503-1 and drought-susceptible CB46 genotypes was digested with combinations of EcoRI and HpaII, MseI, or MspI restriction enzymes and amplified with primers designed from 13 drought-responsive cDNAs. JoinMap 3.0 and MapQTL 4.0 software were used to incorporate polymorphic markers onto the AFLP map and to analyze their association with the drought response QTL. Seven markers co-located with peaks of previously identified QTL. Isolation, sequencing, and BLAST analysis of these markers confirmed their significant homology with drought or other abiotic stress-induced expressed sequence tags (EST) from cowpea and other plant systems. Further, homology with coding sequences for a multidrug resistance protein 3 and a photosystem I assembly protein ycf3 was revealed in two of these candidates. These results provide a platform for the identification and characterization of genetic trait determinants underlying seedling drought tolerance in cowpea.

  13. Intact long-type dupA as a marker for gastroduodenal diseases in Okinawan subpopulation, Japan.

    PubMed

    Takahashi, Ayaka; Shiota, Seiji; Matsunari, Osamu; Watada, Masahide; Suzuki, Rumiko; Nakachi, Saori; Kinjo, Nagisa; Kinjo, Fukunori; Yamaoka, Yoshio

    2013-02-01

    Helicobacter pylori dupA can be divided into two types according to the presence or absence of the mutation. In addition, full-sequenced data revealed that dupA has two types with different lengths depending on the presence of approximately 600 bp in the putative 5' region (presence, long-type; absence, short-type), which has not been taken into account in previous studies. A total of 319 strains isolated from Okinawa, the southern islands of Japan, were included. The status of dupA and cagA was determined by polymerase chain reaction. The presence of mutations in long-type dupA was determined by DNA sequencing. The prevalence of long-type dupA was 26.3% (84/319). Sequence analysis showed that there were only six cases (7.1%) with point mutations leading to a stop codon among the 84 long-type dupA strains studied. Interestingly, intact long-type dupA without frameshift mutation, but not short-type dupA, was significantly associated with gastric ulcer and gastric cancer compared with gastritis (p = .001 and p = .019, respectively). After adjustment for age, gender, and cagA, the presence of intact long-type dupA was significantly associated with gastric ulcer and gastric cancer compared with gastritis (odds ratio [OR] = 3.35, 95% confidence interval [CI] = 1.55-7.24 and OR = 4.14, 95% CI = 1.23-13.94, respectively). Intact long-type dupA is a real virulence marker for severe outcomes in Okinawa, Japan. The previous information gained from PCR-based methods without taking long-type dupA into account must be interpreted with caution. © 2012 Blackwell Publishing Ltd.

  14. Intact long-type dupA as a marker for gastroduodenal diseases in Okinawan subpopulation, Japan

    PubMed Central

    Takahashi, Ayaka; Shiota, Seiji; Matsunari, Osamu; Watada, Masahide; Suzuki, Rumiko; Nakachi, Saori; Kinjo, Nagisa; Kinjo, Fukunori; Yamaoka, Yoshio

    2012-01-01

    Background: Helicobacter pylori dupA can be divided into two types according to the presence or absence of the mutation. In addition, full-sequenced data revealed that dupA has two types with different lengths depending on the presence of approximately 600 bp in the putative 5' region (presence, long-type; absence, short-type), which has not been taken into account in previous studies. Methods: A total of 319 strains isolated from Okinawa, the southern islands of Japan, were included. The status of dupA and cagA was determined by polymerase chain reaction. The presence of mutations in long-type dupA was determined by DNA sequencing. Results: The prevalence of long-type dupA was 26.3% (84/319). Sequence analysis showed that there were only 6 cases (7.1%) with point mutations leading to a stop codon among the 84 long-type dupA strains studied. Interestingly, intact long-type dupA without frameshift mutation, but not short-type dupA, was significantly associated with gastric ulcer and gastric cancer compared with gastritis (P = 0.001 and P = 0.019, respectively). After adjustment for age, gender and cagA, the presence of intact long-type dupA was significantly associated with gastric ulcer and gastric cancer compared with gastritis (odds ratio [OR] = 3.35, 95% confidence interval [CI] = 1.55–7.24 and OR = 4.14, 95% CI = 1.23–13.94, respectively). Conclusions: Intact long-type dupA is a real virulence marker for severe outcomes in Okinawa, Japan. The previous information gained from PCR-based methods without taking long-type dupA into account must be interpreted with caution. PMID:23067336

  15. Evolution of the P-type II ATPase gene family in the fungi and presence of structural genomic changes among isolates of Glomus intraradices.

    PubMed

    Corradi, Nicolas; Sanders, Ian R

    2006-03-10

    The P-type II ATPase gene family encodes proteins with an important role in adaptation of the cell to variation in external K+, Ca2+ and Na+ concentrations. The presence of P-type II gene subfamilies that are specific for certain kingdoms has been reported but was sometimes contradicted by discovery of previously unknown homologous sequences in newly sequenced genomes. Members of this gene family have been sampled in all of the fungal phyla except the arbuscular mycorrhizal fungi (AMF; phylum Glomeromycota), which are known to play a key role in terrestrial ecosystems and to be genetically highly variable within populations. Here we used highly degenerate primers on AMF genomic DNA to increase the sampling of fungal P-type II ATPases and to test previous predictions about their evolution. In parallel, homologous sequences of the P-type II ATPases have been used to determine the nature and amount of polymorphism that is present at these loci among isolates of Glomus intraradices harvested from the same field. In this study, four P-type II ATPase sub-families have been isolated from three AMF species. We show that, contrary to previous predictions, P-type IIC ATPases are present in all basal fungal taxa. Additionally, P-type IIE ATPases should no longer be considered as exclusive to the Ascomycota and the Basidiomycota, since we also demonstrate their presence in the Zygomycota. Finally, a comparison of homologous sequences encoding P-type IID ATPases showed unexpectedly that indel mutations among coding regions, as well as specific gene duplications, occur among AMF individuals within the same field. On the basis of these results we suggest that the diversification of P-type IIC and E ATPases followed the diversification of the extant fungal phyla with independent events of gene gains and losses. Consistent with recent findings on the human genome, but at a much smaller geographic scale, we provided evidence that structural genomic changes, such as exonic indel mutations and gene duplications, are less rare than previously thought and that these also occur within fungal populations.

  16. Evolutionary relationships of a plant-pathogenic mycoplasmalike organism and Acholeplasma laidlawii deduced from two ribosomal protein gene sequences.

    PubMed Central

    Lim, P O; Sears, B B

    1992-01-01

    The families within the class Mollicutes are distinguished by their morphologies, nutritional requirements, and abilities to metabolize certain compounds. Biosystematic classification of the plant-pathogenic mycoplasmalike organisms (MLOs) has been difficult because these organisms have not been cultured in vitro, and hence their nutritional requirements have not been determined nor have physiological characterizations been possible. To investigate the evolutionary relationship of the MLOs to other members of the class Mollicutes, a segment of a ribosomal protein operon was cloned and sequenced from an aster yellows-type MLO which is pathogenic for members of the genus Oenothera and from Acholeplasma laidlawii. The deduced amino acid sequence data from the rpl22 and rps3 genes indicate that the MLOs are more closely related to A. laidlawii than to animal mycoplasmas, confirming previous results from 16S rRNA sequence comparisons. This conclusion is also supported by the finding that the UGA codon is not read as a tryptophan codon in the MLO and A. laidlawii, in contrast to its usage in Mycoplasma capricolum. PMID:1556079

  17. Reference genotype and exome data from an Australian Aboriginal population for health-based research

    PubMed Central

    Tang, Dave; Anderson, Denise; Francis, Richard W.; Syn, Genevieve; Jamieson, Sarra E.; Lassmann, Timo; Blackwell, Jenefer M.

    2016-01-01

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians. PMID:27070114

  18. Reference genotype and exome data from an Australian Aboriginal population for health-based research.

    PubMed

    Tang, Dave; Anderson, Denise; Francis, Richard W; Syn, Genevieve; Jamieson, Sarra E; Lassmann, Timo; Blackwell, Jenefer M

    2016-04-12

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.

  19. Evolution of long centromeres in fire ants.

    PubMed

    Huang, Yu-Ching; Lee, Chih-Chi; Kao, Chia-Yi; Chang, Ni-Chen; Lin, Chung-Chi; Shoemaker, DeWayne; Wang, John

    2016-09-15

    Centromeres are essential for accurate chromosome segregation, yet sequence conservation is low even among closely related species. Centromere drive predicts rapid turnover because some centromeric sequences may compete better than others during female meiosis. In addition to sequence composition, longer centromeres may have a transmission advantage. We report the first observations of extremely long centromeres, covering on average 34% of the chromosomes, in the red imported fire ant Solenopsis invicta. By comparison, cytological examination of Solenopsis geminata revealed typical small centromeric constrictions. Bioinformatics and molecular analyses identified CenSol, the major centromeric satellite DNA repeat. We found that CenSol sequences are very similar between the two species but the CenSol copy number in S. invicta is much greater than that in S. geminata. In addition, centromere expansion in S. invicta is not correlated with the duplication of CenH3. Comparative analyses revealed that several closely related fire ant species also possess long centromeres. Our results are consistent with a model of simple runaway centromere expansion due to centromere drive. We suggest expanded centromeres may be more prevalent in hymenopteran insects, which use haplodiploid sex determination, than previously considered.

  20. Molecular and phylogenetic characterizations of an Eimeria krijgsmanni Yakimoff & Gouseff, 1938 (Apicomplexa: Eimeriidae) mouse intestinal protozoan parasite by partial 18S ribosomal RNA gene sequence analysis.

    PubMed

    Takeo, Toshinori; Tanaka, Tetsuya; Matsubayashi, Makoto; Maeda, Hiroki; Kusakisako, Kodai; Matsui, Toshihiro; Mochizuki, Masami; Matsuo, Tomohide

    2014-08-01

    Previously, we characterized an undocumented strain of Eimeria krijgsmanni by morphological and biological features. Here, we present a detailed molecular phylogenetic analysis of this organism. Namely, 18S ribosomal RNA gene (rDNA) sequences of E. krijgsmanni were analyzed to incorporate this species into a comprehensive Eimeria phylogeny. As a result, a partial 18S rDNA sequence from E. krijgsmanni was successfully determined, and two different types, Type A and Type B, that differed by 1 base pair were identified. E. krijgsmanni was originally isolated from a single oocyst, and thus the results show that the two types might have allelic sequence heterogeneity in the 18S rDNA. Based on phylogenetic analyses, the two types of E. krijgsmanni 18S rDNA formed one of two clades among murine Eimeria spp.; these Eimeria clades reflected morphological similarity among the Eimeria spp. This is the third molecular phylogenetic characterization of a murine Eimeria spp. in addition to E. falciformis and E. papillata. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  1. Measurements of relative chemical shift tensor orientations in solid-state NMR: new slow magic angle spinning dipolar recoupling experiments.

    PubMed

    Jurd, Andrew P S; Titman, Jeremy J

    2009-08-28

    Solid-state NMR experiments can be used to determine conformational parameters, such as interatomic distances and torsion angles. The latter can be obtained from measurements of the relative orientation of two chemical shift tensors, if the orientation of these with respect to the surrounding bonds is known. In this paper, a new rotor-synchronized magic angle spinning (MAS) dipolar correlation experiment is described which can be used in this way. Because the experiment requires slow MAS rates, a novel recoupling sequence, designed using symmetry principles, is incorporated into the mixing period. This recoupling sequence is based in turn on a new composite cyclic pulse referred to as COAST (for combined offset and anisotropy stabilization). The new COAST-C7(2)(1) sequence is shown to give good theoretical and experimental recoupling efficiency, even when the CSA far exceeds the MAS rate. In this regime, previous recoupling sequences, such as POST-C7(2)(1), exhibit poor recoupling performance. The effectiveness of the new method has been explored by a study of the dipeptide L-phenylalanyl-L-phenylalanine.

  2. Resurgence of Integrated Behavioral Units

    PubMed Central

    Bachá-Méndez, Gustavo; Reid, Alliston K; Mendoza-Soylovna, Adela

    2007-01-01

    Two experiments with rats examined the dynamics of well-learned response sequences when reinforcement contingencies were changed. Both experiments contained four phases, each of which reinforced a 2-response sequence of lever presses until responding was stable. The contingencies then were shifted to a new reinforced sequence until responding was again stable. Extinction-induced resurgence of previously reinforced, and then extinguished, heterogeneous response sequences was observed in all subjects in both experiments. These sequences were demonstrated to be integrated behavioral units, controlled by processes acting at the level of the entire sequence. Response-level processes were also simultaneously operative. Errors in sequence production were strongly influenced by the terminal, not the initial, response in the currently reinforced sequence, but not by the previously reinforced sequence. These studies demonstrate that sequence-level and response-level processes can operate simultaneously in integrated behavioral units. Resurgence and the development of integrated behavioral units may be dissociated; thus the observation of one does not necessarily imply the other. PMID:17345948

  3. Amino- and carboxyl-terminal amino acid sequences of proteins coded by gag gene of murine leukemia virus

    PubMed Central

    Oroszlan, Stephen; Henderson, Louis E.; Stephenson, John R.; Copeland, Terry D.; Long, Cedric W.; Ihle, James N.; Gilden, Raymond V.

    1978-01-01

    The amino- and carboxyl-terminal amino acid sequences of proteins (p10, p12, p15, and p30) coded by the gag gene of Rauscher and AKR murine leukemia viruses were determined. Among these proteins, p15 from both viruses appears to have a blocked amino end. Proline was found to be the common NH2 terminus of both p30s and both p12s, and alanine of both p10s. The amino-terminal sequences of p30s are identical, as are those of p10s, while the p12 sequences are clearly distinctive but also show substantial homology. The carboxyl-terminal amino acids of both viral p30s and p12s are leucine and phenylalanine, respectively. Rauscher leukemia virus p15 has tyrosine as the carboxyl terminus while AKR virus p15 has phenylalanine in this position. The compositional and sequence data provide definite chemical criteria for the identification of analogous gag gene products and for the comparison of viral proteins isolated in different laboratories. On the basis of amino acid sequences and the previously proposed H-p15-p12-p30-p10-COOH peptide sequence in the precursor polyprotein, a model for cleavage sites involved in the post-translational processing of the precursor coded for by the gag gene is proposed. PMID:206897

  4. Sequence, molecular properties, and chromosomal mapping of mouse lumican

    NASA Technical Reports Server (NTRS)

    Funderburgh, J. L.; Funderburgh, M. L.; Hevelone, N. D.; Stech, M. E.; Justice, M. J.; Liu, C. Y.; Kao, W. W.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

    1995-01-01

    PURPOSE. Lumican is a major proteoglycan of vertebrate cornea. This study characterizes mouse lumican, its molecular form, cDNA sequence, and chromosomal localization. METHODS. Lumican sequence was determined from cDNA clones selected from a mouse corneal cDNA expression library using a bovine lumican cDNA probe. Tissue expression and size of lumican mRNA were determined using Northern hybridization. Glycosidase digestion followed by Western blot analysis provided characterization of molecular properties of purified mouse corneal lumican. Chromosomal mapping of the lumican gene (Lcn) used Southern hybridization of a panel of genomic DNAs from an interspecific murine backcross. RESULTS. Mouse lumican is a 338-amino acid protein with high-sequence identity to bovine and chicken lumican proteins. The N-terminus of the lumican protein contains consensus sequences for tyrosine sulfation. A 1.9-kb lumican mRNA is present in cornea and several other tissues. Antibody against bovine lumican reacted with recombinant mouse lumican expressed in Escherichia coli and also detected high molecular weight proteoglycans in extracts of mouse cornea. Keratanase digestion of corneal proteoglycans released lumican protein, demonstrating the presence of sulfated keratan sulfate chains on mouse corneal lumican in vivo. The lumican gene (Lcn) was mapped to the distal region of mouse chromosome 10. The Lcn map site is in the region of a previously identified developmental mutant, eye blebs, affecting corneal morphology. CONCLUSIONS. This study demonstrates sulfated keratan sulfate proteoglycan in mouse cornea and describes the tools (antibodies and cDNA) necessary to investigate the functional role of this important corneal molecule using naturally occurring and induced mutants of the murine lumican gene.

  5. Phylogeny and strain typing of Escherichia coli, inferred from variation at mononucleotide repeat loci.

    PubMed

    Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel

    2004-04-01

    Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria.

  6. Phylogeny and Strain Typing of Escherichia coli, Inferred from Variation at Mononucleotide Repeat Loci

    PubMed Central

    Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M.; Kashi, Yechezkel

    2004-01-01

    Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845

  7. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Farmer, M. T.

    The overall objective of the current work is to carry out a scoping analysis to determine the impact of accident-tolerant fuel (ATF) on late-phase accident progression; in particular, the molten core-concrete interaction portion of the sequence that occurs after the core debris fails the reactor vessel and relocates into containment. This additional study augments previous work by including kinetic effects that govern chemical reaction rates during core-concrete interaction. The specific ATF considered as part of this study is SiC-clad UO2.

  8. Canine Antithrombin-III: Some Biochemical and Biologic Properties

    DTIC Science & Technology

    1987-06-02

    performing amino acid analyses, amino acid sequence analysis, and differential refractometry. I thank Ms. Kerry Singer for her excellent typing and...previously determined by differential refractometry at 546 nm, with a value of 0.186 ml/g for dn/dc (refractive index increment) (69)...Protein concentration by refractometry = 8.29 mg/ml O.D. value at 280 nm (1:10 dilution

  9. Listeria monocytogenes incidence changes and diversity in some Brazilian dairy industries and retail products.

    PubMed

    Oxaran, Virginie; Lee, Sarah Hwa In; Chaul, Luíza Toubas; Corassin, Carlos Humberto; Barancelli, Giovana Verginia; Alves, Virgínia Farias; de Oliveira, Carlos Augusto Fernandes; Gram, Lone; De Martinis, Elaine Cristina Pereira

    2017-12-01

    Listeria monocytogenes can cause listeriosis, a severe foodborne disease. In Brazil, despite very few reported cases of listeriosis, the pathogen has been repeatedly isolated from dairies. This has led the government to implement specific legislation to reduce the hazard. Here, we determined the incidence of L. monocytogenes in five dairies and retail products in the Southeast and Midwest regions of Brazil over eight months. Of 437 samples, three samples (0.7%) from retail and only one sample (0.2%) from the dairies were positive for L. monocytogenes. Thus, the contamination rate was significantly reduced as compared to previous studies. MultiLocus Sequence Typing (MLST) was used to determine if contamination was caused by new or persistent clones leading to the first MLST profile of L. monocytogenes from the Brazilian dairy industry. The processing environment isolate is of concern being a sequence-type (ST) 2, belonging to the lineage I responsible for the majority of listeriosis outbreaks. Also, ST3 and ST8 found in commercialized cheese have previously been reported in outbreaks. Despite the lower incidence, dairy products still pose a potential health risk and the occurrence of L. monocytogenes in dairies and retail products emphasizes the need for continuous surveillance of this pathogen in the Brazilian dairy industry. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. A functional genomics investigation of allelochemical biosynthesis in Sorghum bicolor root hairs.

    PubMed

    Baerson, Scott R; Dayan, Franck E; Rimando, Agnes M; Nanayakkara, N P Dhammika; Liu, Chang-Jun; Schröder, Joachim; Fishbein, Mark; Pan, Zhiqiang; Kagan, Isabelle A; Pratt, Lee H; Cordonnier-Pratt, Marie-Michèle; Duke, Stephen O

    2008-02-08

    Sorghum is considered to be one of the more allelopathic crop species, producing phytotoxins such as the potent benzoquinone sorgoleone (2-hydroxy-5-methoxy-3-[(Z,Z)-8',11',14'-pentadecatriene]-p-benzoquinone) and its analogs. Sorgoleone likely accounts for much of the allelopathy of Sorghum spp., typically representing the predominant constituent of Sorghum bicolor root exudates. Previous and ongoing studies suggest that the biosynthetic pathway for this plant growth inhibitor occurs in root hair cells, involving a polyketide synthase activity that utilizes an atypical 16:3 fatty acyl-CoA starter unit, resulting in the formation of a pentadecatrienyl resorcinol intermediate. Subsequent modifications of this resorcinolic intermediate are likely to be mediated by S-adenosylmethionine-dependent O-methyltransferases and dihydroxylation by cytochrome P450 monooxygenases, although the precise sequence of reactions has not been determined previously. Analyses performed by gas chromatography-mass spectrometry with sorghum root extracts identified a 3-methyl ether derivative of the likely pentadecatrienyl resorcinol intermediate, indicating that dihydroxylation of the resorcinol ring is preceded by O-methylation at the 3'-position by a novel 5-n-alk(en)ylresorcinol-utilizing O-methyltransferase activity. An expressed sequence tag data set consisting of 5,468 sequences selected at random from an S. bicolor root hair-specific cDNA library was generated to identify candidate sequences potentially encoding enzymes involved in the sorgoleone biosynthetic pathway. Quantitative real time reverse transcription-PCR and recombinant enzyme studies with putative O-methyltransferase sequences obtained from the expressed sequence tag data set have led to the identification of a novel O-methyltransferase highly and predominantly expressed in root hairs (designated SbOMT3), which preferentially utilizes alk(en)ylresorcinols among a panel of benzene-derivative substrates tested. 
    SbOMT3 is therefore proposed to be involved in the biosynthesis of the allelochemical sorgoleone.

  11. A wide extent of inter-strain diversity in virulent and vaccine strains of alphaherpesviruses.

    PubMed

    Szpara, Moriah L; Tafuri, Yolanda R; Parsons, Lance; Shamim, S Rafi; Verstrepen, Kevin J; Legendre, Matthieu; Enquist, L W

    2011-10-01

    Alphaherpesviruses are widespread in the human population, and include herpes simplex virus 1 (HSV-1) and 2, and varicella zoster virus (VZV). These viral pathogens cause epithelial lesions, and then infect the nervous system to cause lifelong latency, reactivation, and spread. A related veterinary herpesvirus, pseudorabies (PRV), causes similar disease in livestock that results in significant economic losses. Vaccines developed for VZV and PRV serve as useful models for the development of an HSV-1 vaccine. We present full genome sequence comparisons of the PRV vaccine strain Bartha, and two virulent PRV isolates, Kaplan and Becker. These genome sequences were determined by high-throughput sequencing and assembly, and present new insights into the attenuation of a mammalian alphaherpesvirus vaccine strain. We find many previously unknown coding differences between PRV Bartha and the virulent strains, including changes to the fusion proteins gH and gB, and over forty other viral proteins. Inter-strain variation in PRV protein sequences is much closer to levels previously observed for HSV-1 than for the highly stable VZV proteome. Almost 20% of the PRV genome contains tandem short sequence repeats (SSRs), a class of nucleic acid motifs whose length-variation has been associated with changes in DNA binding site efficiency, transcriptional regulation, and protein interactions. We find SSRs throughout the herpesvirus family, and provide the first global characterization of SSRs in viruses, both within and between strains. We find SSR length variation between different isolates of PRV and HSV-1, which may provide a new mechanism for phenotypic variation between strains. Finally, we detected a small number of polymorphic bases within each plaque-purified PRV strain, and we characterize the effect of passage and plaque-purification on these polymorphisms. These data add to growing evidence that even plaque-purified stocks of stable DNA viruses exhibit limited sequence heterogeneity, which likely seeds future strain evolution.

  12. Whole genome re-sequencing identifies a mutation in an ABC transporter (mdr2) in a Plasmodium chabaudi clone with altered susceptibility to antifolate drugs

    PubMed Central

    Martinelli, Axel; Henriques, Gisela; Cravo, Pedro; Hunt, Paul

    2011-01-01

    In malaria parasites, mutations in two genes of folate biosynthesis encoding dihydrofolate reductase (dhfr) and dihydropteroate synthase (dhps) modify responses to antifolate therapies which target these enzymes. However, the involvement of other genes which modify the availability of exogenous folate, for example, has been proposed. Here, we used short-read whole-genome re-sequencing to determine the mutations in a clone of the rodent malaria parasite, Plasmodium chabaudi, which has altered susceptibility to both sulphadoxine and pyrimethamine. This clone bears a previously identified S106N mutation in dhfr and no mutation in dhps. Instead, three additional point mutations in genes on chromosomes 2, 13 and 14 were identified. The mutated gene on chromosome 13 (mdr2 K392Q) encodes an ABC transporter. Because Quantitative Trait Locus analysis previously indicated an association of genetic markers on chromosome 13 with responses to individual and combined antifolates, MDR2 is proposed to modulate antifolate responses, possibly mediated by the transport of folate intermediates. PMID:20858498

  13. Phylogenetic relationships in Taxodiaceae and Cupressaceae sensu stricto based on matK gene, chlL gene, trnL-trnF IGS region, and trnL intron sequences.

    PubMed

    Kusumi, J; Tsumura, Y; Yoshimaru, H; Tachida, H

    2000-10-01

    Nucleotide sequences from four chloroplast genes, the matK, chlL, intergenic spacer (IGS) region between trnL and trnF, and an intron of trnL, were determined from all species of Taxodiaceae and five species of Cupressaceae sensu stricto (s.s.). Phylogenetic trees were constructed using the maximum parsimony and the neighbor-joining methods with Cunninghamia as an outgroup. These analyses provided greater resolution of relationships among genera and higher bootstrap supports for clades compared to previous analyses. Results indicate that Taiwania diverged first, and then Athrotaxis diverged from the remaining genera. Metasequoia, Sequoia, and Sequoiadendron form a clade. Taxodium and Glyptostrobus form a clade, which is the sister to Cryptomeria. Cupressaceae s.s. are derived from within Taxodiaceae, being the most closely related to the Cryptomeria/Taxodium/Glyptostrobus clade. These relationships are consistent with previous morphological groupings and the analyses of molecular data. In addition, we found acceleration of evolutionary rates in Cupressaceae s.s. Possible causes for the acceleration are discussed.

  14. Expanding the 2011 Prague, OK Event Catalog: Detections, Relocations, and Stress Drop Estimates

    NASA Astrophysics Data System (ADS)

    Clerc, F.; Cochran, E. S.; Dougherty, S. L.; Keranen, K. M.; Harrington, R. M.

    2016-12-01

    The Mw 5.6 earthquake occurring on 6 Nov. 2011, near Prague, OK, is thought to have been triggered by a Mw 4.8 foreshock, which was likely induced by fluid injection into local wastewater disposal wells [Keranen et al., 2013; Sumy et al., 2014]. Previous stress drop estimates for the sequence have suggested values lower than those for most Central and Eastern U.S. tectonic events of similar magnitudes [Hough, 2014; Sun & Hartzell, 2014; Sumy & Neighbors et al., 2016]. Better stress drop estimates allow more realistic assessment of seismic hazard and more effective regulation of wastewater injection. More reliable estimates of source properties may help to differentiate induced events from natural ones. Using data from local and regional networks, we perform event detections, relocations, and stress drop calculations of the Prague aftershock sequence. We use the Match & Locate method, a variation on the matched-filter method which detects events of lower magnitudes by stacking cross-correlograms from different stations [Zhang & Wen, 2013; 2015], in order to create a more complete catalog from 6 Nov to 31 Dec 2011. We then relocate the detected events using the HypoDD double-difference algorithm. Using our enhanced catalog and relocations, we examine the seismicity distribution for evidence of migration and investigate implications for triggering mechanisms. To account for path and site effects, we calculate stress drops using the Empirical Green's Function (EGF) spectral ratio method, beginning with 2730 previously relocated events. We determine whether there is a correlation between the stress drop magnitudes and the spatial and temporal distribution of events, including depth, position relative to existing faults, and proximity to injection wells. Finally, we consider the range of stress drop values and scaling with respect to event magnitudes within the context of previously published work for the Prague sequence as well as other induced and natural sequences.
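
    The stacking idea behind matched-filter detection can be sketched as follows: a template waveform is slid along each station's continuous trace, a normalized cross-correlation is computed at every lag, and the per-station correlograms are averaged so that a weak event that is coherent across stations stands out. This is only a minimal sketch under stated assumptions. The function names and the toy waveforms are hypothetical, and the sketch leaves out the search over candidate locations and the associated travel-time shifts that the Match & Locate method adds on top of plain matched filtering.

```python
import math

def normalized_xcorr(template, trace):
    """Normalized cross-correlation of a template against every window of a trace."""
    n = len(template)
    t_mean = sum(template) / n
    t = [x - t_mean for x in template]
    t_norm = math.sqrt(sum(x * x for x in t))
    out = []
    for lag in range(len(trace) - n + 1):
        window = trace[lag:lag + n]
        w_mean = sum(window) / n
        w = [x - w_mean for x in window]
        w_norm = math.sqrt(sum(x * x for x in w))
        denom = t_norm * w_norm
        # Flat windows carry no information; report zero correlation.
        out.append(sum(a * b for a, b in zip(t, w)) / denom if denom > 0 else 0.0)
    return out

def stack_correlograms(templates, traces):
    """Average the per-station correlograms lag by lag."""
    ccs = [normalized_xcorr(t, x) for t, x in zip(templates, traces)]
    n = min(len(c) for c in ccs)
    return [sum(c[i] for c in ccs) / len(ccs) for i in range(n)]

# Toy data (hypothetical): the same wavelet recorded at two stations,
# starting at sample 3, with different amplitudes.
template = [0.0, 1.0, 0.0, -1.0]
station_a = [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, -1.0, 0.0, 0.0, 0.0]
station_b = [0.1, 0.0, 0.0, 0.0, 2.0, 0.0, -2.0, 0.0, 0.0, 0.1]
stack = stack_correlograms([template, template], [station_a, station_b])
best_lag = stack.index(max(stack))
print("detection at lag", best_lag)
```

    Because the correlation is normalized, the smaller and larger copies of the wavelet contribute equally to the stack, which is what allows events of lower magnitude than the template to be detected.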

  15. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  16. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

    PubMed

    Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

    2013-01-30

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

  17. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    PubMed Central

    2013-01-01

    Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

  18. Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    PubMed Central

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
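
    To make the two detection stringencies concrete, the sketch below scores a 13-bp candidate site against the palindromic consensus ERE, GGTCAnnnTGACC, and reports whether its 3-bp spacer matches the C(A/T)G pattern. The consensus sequence is not given in the abstract and is supplied here as background. Counting mismatches over the 10 half-site positions only, and the function and variable names, are assumptions for illustration; the authors' exact scoring scheme may differ.

```python
LEFT_HALF = "GGTCA"    # consensus ERE half-site
RIGHT_HALF = "TGACC"   # inverted repeat of the left half-site
SITE_LEN = len(LEFT_HALF) + 3 + len(RIGHT_HALF)

def classify_ere(site):
    """Return (stringency tier, spacer_is_CWG) for a 13-bp candidate site."""
    site = site.upper()
    if len(site) != SITE_LEN:
        raise ValueError("expected a 13-bp site: 5-bp half, 3-bp spacer, 5-bp half")
    left, spacer, right = site[:5], site[5:8], site[8:]
    mismatches = sum(a != b for a, b in zip(left + right, LEFT_HALF + RIGHT_HALF))
    percent = 100 * mismatches // 10   # deviation over the 10 half-site positions
    if percent < 10:
        tier = "<10%"
    elif percent <= 20:
        tier = "10-20%"
    else:
        tier = ">20%"
    spacer_is_cwg = spacer[0] == "C" and spacer[1] in "AT" and spacer[2] == "G"
    return tier, spacer_is_cwg

# Hypothetical candidate sites.
for candidate in ("GGTCACAGTGACC", "GGTCATTTTGACA", "AAACACAGTGACC"):
    print(candidate, classify_ere(candidate))
```

    Under this simplified scoring, a perfect consensus site falls in the stricter tier and sites with one or two half-site mismatches fall in the 10-20% tier.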

  19. Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

    PubMed

    Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

    2010-04-01

    Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10-20% nucleotide deviation from the canonical ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.

  20. Subglacial Lake Vostok (Antarctica) Accretion Ice Contains a Diverse Set of Sequences from Aquatic, Marine and Sediment-Inhabiting Bacteria and Eukarya

    PubMed Central

    Edgar, Robyn; Veerapaneni, Ram S.; D’Elia, Tom; Morris, Paul F.; Rogers, Scott O.

    2013-01-01

    Lake Vostok, the 7th largest (by volume) and 4th deepest lake on Earth, is covered by more than 3,700 m of ice, making it the largest subglacial lake known. The combination of cold, heat (from possible hydrothermal activity), pressure (from the overriding glacier), limited nutrients and complete darkness presents extreme challenges to life. Here, we report metagenomic/metatranscriptomic sequence analyses from four accretion ice sections from the Vostok 5G ice core. Two sections accreted in the vicinity of an embayment on the southwestern end of the lake, and the other two represented part of the southern main basin. We obtained 3,507 unique gene sequences from concentrates of 500 ml of 0.22 µm-filtered accretion ice meltwater. Taxonomic classifications (to genus and/or species) were possible for 1,623 of the sequences. Species determinations in combination with mRNA gene sequence results allowed deduction of the metabolic pathways represented in the accretion ice and, by extension, in the lake. Approximately 94% of the sequences were from Bacteria and 6% were from Eukarya. Only two sequences were from Archaea. In general, the taxa were similar to organisms previously described from lakes, brackish water, marine environments, soil, glaciers, ice, lake sediments, deep-sea sediments, deep-sea thermal vents, animals and plants. Sequences from aerobic, anaerobic, psychrophilic, thermophilic, halophilic, alkaliphilic, acidophilic, desiccation-resistant, autotrophic and heterotrophic organisms were present, including a number from multicellular eukaryotes. PMID:23843994

  1. Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster

    PubMed Central

    Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.

    1993-01-01

    Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)n (8 Mb), (AAGAG)n (7 Mb) and (AATAT)n (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654

  2. Subglacial Lake Vostok (Antarctica) accretion ice contains a diverse set of sequences from aquatic, marine and sediment-inhabiting bacteria and eukarya.

    PubMed

    Shtarkman, Yury M; Koçer, Zeynep A; Edgar, Robyn; Veerapaneni, Ram S; D'Elia, Tom; Morris, Paul F; Rogers, Scott O

    2013-01-01

    Lake Vostok, the 7th largest (by volume) and 4th deepest lake on Earth, is covered by more than 3,700 m of ice, making it the largest subglacial lake known. The combination of cold, heat (from possible hydrothermal activity), pressure (from the overriding glacier), limited nutrients and complete darkness presents extreme challenges to life. Here, we report metagenomic/metatranscriptomic sequence analyses from four accretion ice sections from the Vostok 5G ice core. Two sections accreted in the vicinity of an embayment on the southwestern end of the lake, and the other two represented part of the southern main basin. We obtained 3,507 unique gene sequences from concentrates of 500 ml of 0.22 µm-filtered accretion ice meltwater. Taxonomic classifications (to genus and/or species) were possible for 1,623 of the sequences. Species determinations in combination with mRNA gene sequence results allowed deduction of the metabolic pathways represented in the accretion ice and, by extension, in the lake. Approximately 94% of the sequences were from Bacteria and 6% were from Eukarya. Only two sequences were from Archaea. In general, the taxa were similar to organisms previously described from lakes, brackish water, marine environments, soil, glaciers, ice, lake sediments, deep-sea sediments, deep-sea thermal vents, animals and plants. Sequences from aerobic, anaerobic, psychrophilic, thermophilic, halophilic, alkaliphilic, acidophilic, desiccation-resistant, autotrophic and heterotrophic organisms were present, including a number from multicellular eukaryotes.

  3. Archaeal β diversity patterns under the seafloor along geochemical gradients

    NASA Astrophysics Data System (ADS)

    Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya

    2014-09-01

    Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.

  4. Assessing the determinants of evolutionary rates in the presence of noise.

    PubMed

    Plotkin, Joshua B; Fraser, Hunter B

    2007-05-01

    Although protein sequences are known to evolve at vastly different rates, little is known about what determines their rate of evolution. However, a recent study using principal component regression (PCR) has concluded that evolutionary rates in yeast are primarily governed by a single determinant related to translation frequency. Here, we demonstrate that noise in biological data can confound PCRs, leading to spurious conclusions. When equalizing noise levels across 7 predictor variables used in previous studies, we find no evidence that protein evolution is dominated by a single determinant. Our results indicate that a variety of factors--including expression level, gene dispensability, and protein-protein interactions--may independently affect evolutionary rates in yeast. More accurate measurements or more sophisticated statistical techniques will be required to determine which one, if any, of these factors dominates protein evolution.

  5. Sequence learning in Parkinson's disease: Focusing on action dynamics and the role of dopaminergic medication.

    PubMed

    Ruitenberg, Marit F L; Duthoo, Wout; Santens, Patrick; Seidler, Rachael D; Notebaert, Wim; Abrahamse, Elger L

    2016-12-01

    Previous studies on movement sequence learning in Parkinson's disease (PD) have produced mixed results. A possible explanation for the inconsistent findings is that some studies have taken dopaminergic medication into account while others have not. Additionally, in previous studies the response modalities did not allow for an investigation of the action dynamics of sequential movements as they unfold over time. In the current study we investigated sequence learning in PD by specifically considering the role of medication status in a sequence learning task where mouse movements were performed. The focus on mouse movements allowed us to examine the action dynamics of sequential movement in terms of initiation time, movement time, movement accuracy, and velocity. PD patients performed the sequence learning task once on their regular medication, and once after overnight withdrawal from their medication. Results showed that sequence learning as reflected in initiation times was impaired when PD patients performed the task ON medication compared to OFF medication. In contrast, sequence learning as reflected in the accuracy of movement trajectories was enhanced when performing the task ON compared to OFF medication. Our findings suggest that while medication enhances execution processes of movement sequence learning, it may at the same time impair planning processes that precede actual execution. Overall, the current study extends earlier findings on movement sequence learning in PD by differentiating between various components of performance, and further refines previous dopamine overdose effects in sequence learning. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. THE SMALL ACID SOLUBLE PROTEINS (SASP α and SASP β) OF BACILLUS WEIHENSTEPHANENSIS AND B. MYCOIDES GROUP 2 ARE THE MOST DISTINCT AMONG THE B. CEREUS GROUP

    PubMed Central

    Callahan, Courtney; Fox, Karen; Fox, Alvin

    2009-01-01

    The Bacillus cereus group includes Bacillus anthracis, Bacillus cereus, Bacillus thuringiensis, Bacillus mycoides and Bacillus weihenstephanensis. The small acid-soluble spore protein (SASP) β has been previously demonstrated to be among the biomarkers differentiating B. anthracis and B. cereus; SASP β of B. cereus most commonly exhibits one or two amino acid substitutions when compared to B. anthracis. SASP α is conserved in sequence among these two species. Neither SASP α nor β for B. thuringiensis, B. mycoides and B. weihenstephanensis has been previously characterized as a taxonomic discriminator. In the current work, molecular weight (MW) variation of these SASPs was determined by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF MS) for representative strains of the 5 species within the B. cereus group. The measured MWs also correlate with calculated MWs of translated amino acid sequences generated from whole genome sequencing projects. SASP α and β demonstrated consistent MW among B. cereus, B. thuringiensis, and B. mycoides strains (group 1). However, B. mycoides (group 2) and B. weihenstephanensis SASP α and β were quite distinct, making them unique among the B. cereus group. Limited sequence changes were observed in SASP α (at most 3 substitutions and 2 deletions), indicating it is a more conserved protein than SASP β (up to 6 substitutions and a deletion). Another even more conserved SASP, SASP α-β type, was described here for the first time. PMID:19616612

  7. Response disengagement on a spatial self-ordered sequencing task: effects of regionally selective excitotoxic lesions and serotonin depletion within the prefrontal cortex.

    PubMed

    Walker, Susannah C; Robbins, Trevor W; Roberts, Angela C

    2009-05-06

    Prefrontal cortex (PFC) is critical for self-ordered response sequencing. Patients with frontal lobe damage are impaired on response sequencing tasks, and increased blood flow has been reported in ventrolateral and dorsolateral PFC in subjects performing such tasks. Previously, we have shown that large excitotoxic lesions of the lateral PFC (LPFC) and orbitofrontal cortex (OFC), but not global prefrontal dopamine depletion, markedly impaired marmoset performance on a spatial self-ordered sequencing task (SSOST). To determine whether LPFC or OFC was responsible for the previously observed impairments and whether the underlying neural mechanism was modulated by serotonin, the present study compared the effects of selective LPFC and OFC excitotoxic lesions and 5,7-DHT-induced PFC serotonin depletions in marmosets on SSOST performance. Severe and long-lasting impairments in SSOST performance, including robust perseverative responding, followed LPFC but not OFC lesions. The deficit was ameliorated by task manipulations that precluded perseveration. Depletions of serotonin within LPFC and OFC had no effect, despite impairing performance on a visual discrimination reversal task, thus providing further evidence for differential monoaminergic regulation of prefrontal function. In the light of the proposed attentional control functions of ventrolateral PFC and the failure of LPFC-lesioned animals to disengage from the immediately preceding response, it is proposed that this deficit may be due to a failure to attend to and register that a response has been made and thus should not be repeated. However, 5-HT does not appear to be implicated in this response inhibitory capacity.

  8. First description of Grapevine leafroll-associated virus 5 in Argentina and partial genome sequence.

    PubMed

    Gómez Talquenca, Sebastián; Muñoz, Claudio; Grau, Oscar; Gracia, Olga

    2009-02-01

    An accession of Vitis vinifera cv. Red Globe from Argentina was found to be infected with Grapevine leafroll-associated virus-5 by ELISA. It was partially sequenced, and three ORFs, corresponding to HSP70h, HSP90h, and CP, were found. This isolate shares a high amino acid identity with the previously reported sequence of the virus, and identities between 80% and 90% with previously reported GLRaV-9 and GLRaV-4 isolates. The analysis of the sequence supports the clustering together with GLRaV-4 and GLRaV-9 inside the Ampelovirus genus.

  9. Recruitment of the proneural gene scute to the Drosophila sex-determination pathway.

    PubMed Central

    Wrischnik, Lisa A; Timmer, John R; Megna, Lisa A; Cline, Thomas W

    2003-01-01

    In flies, scute (sc) works with its paralogs in the achaete-scute-complex (ASC) to direct neuronal development. However, in the family Drosophilidae, sc also acquired a role in the primary event of sex determination, X chromosome counting, by becoming an X chromosome signal element (XSE), an evolutionary step shown here to have occurred after sc diverged from its closest paralog, achaete (ac). Two temperature-sensitive alleles, sc(sisB2) and sc(sisB3), which disrupt only sex determination, were recovered in a powerful F1 genetic selection and used to investigate how sc was recruited to the sex-determination pathway. sc(sisB2) revealed 3' nontranscribed regulatory sequences likely to be involved. The sc(sisB2) lesion abolished XSE activity when combined with mutations engineered in a sequence upstream of all XSEs. In contrast, changes in Sc protein sequence seem not to have been important for recruitment. The observation that the other new allele, sc(sisB3), eliminates the C-terminal half of Sc without affecting neurogenesis and that sc(sisB1), the most XSE-specific allele previously available, is a nonsense mutant, would seem to suggest the opposite, but we show that housefly Sc can substitute for fruit fly Sc in sex determination, despite lacking Drosophilidae-specific conserved residues in its C-terminal half. Lack of synergistic lethality among mutations in sc, twist, and dorsal argues against a proposed role for sc in mesoderm formation that had seemed potentially relevant to sex-pathway recruitment. The screen that yielded new sc alleles also generated autosomal duplications that argue against the textbook view that fruit fly sex signal evolution recruited a set of autosomal signal elements comparable to the XSEs. PMID:14704182

  10. A novel begomovirus isolated from sida contains putative cis- and trans-acting replication specificity determinants that have evolved independently in several geographical lineages.

    PubMed

    Mauricio-Castillo, J A; Torres-Herrera, S I; Cárdenas-Conejo, Y; Pastor-Palacios, G; Méndez-Lozano, J; Argüello-Astorga, G R

    2014-09-01

    A novel begomovirus isolated from a Sida rhombifolia plant collected in Sinaloa, Mexico, was characterized. The genomic components of sida mosaic Sinaloa virus (SiMSinV) shared highest sequence identity with DNA-A and DNA-B components of chino del tomate virus (CdTV), suggesting a vertical evolutionary relationship between these viruses. However, recombination analysis indicated that a short segment of SiMSinV DNA-A encompassing the plus-strand replication origin and the 5´-proximal 43 codons of the Rep gene was derived from tomato mottle Taino virus (ToMoTV). Accordingly, the putative cis- and trans-acting replication specificity determinants of SiMSinV were identical to those of ToMoTV but differed from those of CdTV. Modeling of the SiMSinV and CdTV Rep proteins revealed significant differences in the region comprising the small β1/β5 sheet element, where five putative DNA-binding specificity determinants (SPDs) of Rep (i.e., amino acid residues 5, 8, 10, 69 and 71) were previously identified. Computer-assisted searches of public databases led to identification of 33 begomoviruses from three continents encoding proteins with SPDs identical to those of the Rep encoded by SiMSinV. Sequence analysis of the replication origins demonstrated that all 33 begomoviruses harbor potential Rep-binding sites identical to those of SiMSinV. These data support the hypothesis that the Rep β1/β5 sheet region determines specificity of this protein for DNA replication origin sequences.

  11. Detection of Bartonella Species in the Blood of Veterinarians and Veterinary Technicians: A Newly Recognized Occupational Hazard?

    PubMed Central

    Maggi, Ricardo G.; Ferguson, Brandy; Varkey, Jay; Park, Lawrence P.; Breitschwerdt, Edward B.

    2014-01-01

    Abstract Background: Bartonella species are important emerging pathogens in human and veterinary medicine. In the context of their daily activities, veterinary professionals have frequent animal contact and arthropod exposures. Detection of Bartonella spp. using traditional culture methods has been limited by poor sensitivity, making it difficult to determine the prevalence of infection in this population. We have developed a detection method combining enrichment culture and molecular amplification, which increases testing sensitivity. Methods: We performed a cross-sectional study to determine the prevalence of detectable Bartonella spp. in the blood of veterinary personnel and nonveterinary control subjects. Bartonella was detected by enrichment blood culture with conventional PCR followed by DNA sequencing. Results were correlated with epidemiological variables and symptoms. Results: We detected DNA from at least one Bartonella species in 32 (28%) of the 114 veterinary subjects. After DNA sequencing, the Bartonella species could be determined for 27 of the 32 infected subjects, including B. henselae in 15 (56%), B. vinsonii subsp. berkhoffii in seven (26%), B. koehlerae in six (22%), and a B. volans–like sequence in one (4%). Seventy percent of Bartonella-positive subjects described headache compared with 40% of uninfected veterinarians (p=0.009). Irritability was also reported more commonly by infected subjects (68% vs. 43%, p=0.04). Conclusions: Our study supports an emerging body of evidence that cryptic Bartonella bloodstream infection may be more frequent in humans than previously recognized and may induce symptoms. Longitudinal studies are needed to determine the natural course and clinical features of Bartonella infection. PMID:25072986

  12. Determination of Trichuris skrjabini by sequencing of the ITS1-5.8S-ITS2 segment of the ribosomal DNA: comparative molecular study of different species of trichurids.

    PubMed

    Cutillas, C; Oliveros, R; de Rojas, M; Guevara, D C

    2004-06-01

    Adults of Trichuris skrjabini have been isolated from the cecum of caprine hosts (Capra hircus), Trichuris ovis and Trichuris globulosa from Ovis aries (sheep) and C. hircus (goats), and Trichuris leporis from Lepus europaeus (rabbits) in Spain. Genomic DNA was isolated and the ITS1-5.8S-ITS2 segment from the ribosomal DNA (rDNA) was amplified and sequenced by polymerase chain reaction (PCR) techniques. The ITS1 of T. skrjabini, T. ovis, T. globulosa, and T. leporis was 495, 757, 757, and 536 nucleotides in length, respectively, and had G + C contents of 59.6, 58.7, 58.7, and 60.8%, respectively. Intraindividual variation was detected in the ITS1 sequences of the 4 species. Furthermore, the 5.8S sequences of T. skrjabini, T. ovis, T. globulosa, and T. leporis were compared. The 5.8S sequences of these 4 species were 157, 152, 153, and 157 nucleotides in length, respectively. There were no sequence differences of ITS1 and 5.8S products between T. ovis and T. globulosa. Nevertheless, clear differences were detected between the ITS1 sequences of T. skrjabini, T. ovis, T. leporis, Trichuris muris, and T. arvicolae. The ITS2 fragment from the rDNA of T. skrjabini was sequenced. A comparative study of the ITS2 sequence of T. skrjabini with the previously published ITS2 sequence data of T. ovis, T. leporis, T. muris, and T. arvicolae suggested that the combined use of sequence data from both spacers would be useful in the molecular characterization of trichurid parasites.
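
    The length and G + C figures quoted in this record are simple per-sequence statistics. A minimal sketch of how such a value could be computed is given below; the function name and the toy fragment are illustrative assumptions, and the fragment is not a real Trichuris ITS1 sequence.

```python
def gc_content(seq: str) -> float:
    """Return the G + C percentage of a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Made-up 20-nt fragment used only to exercise the function.
    toy_fragment = "GCGCATGCCGTAGGCCATAT"
    print(len(toy_fragment), round(gc_content(toy_fragment), 1))
```

    Ambiguity codes such as N are counted in the length but not as G or C here; a real analysis would need to decide how to treat them.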

  13. Antimicrobial resistance and molecular epidemiology using whole-genome sequencing of Neisseria gonorrhoeae in Ireland, 2014-2016: focus on extended-spectrum cephalosporins and azithromycin.

    PubMed

    Ryan, L; Golparian, D; Fennelly, N; Rose, L; Walsh, P; Lawlor, B; Mac Aogáin, M; Unemo, M; Crowley, B

    2018-06-07

    High-level resistance and treatment failures with ceftriaxone and azithromycin, the first-line agents for gonorrhoea treatment, are reported, and antimicrobial-resistant Neisseria gonorrhoeae is an urgent public health threat. Our aims were to determine antimicrobial resistance rates, resistance determinants and phylogeny of N. gonorrhoeae in Ireland, 2014-2016. Overall, 609 isolates from four University Hospitals were tested for susceptibility to extended-spectrum cephalosporins (ESCs) and azithromycin by the MIC Test Strips. Forty-three isolates were whole-genome sequenced based on elevated MICs. The resistance rate to ceftriaxone, cefixime, cefotaxime and azithromycin was 0, 1, 2.1 and 19%, respectively. Seven high-level azithromycin-resistant (HLAzi-R) isolates were identified, all susceptible to ceftriaxone. Mosaic penA alleles XXXIV, X and non-mosaic XIII, and G120K plus A121N/D/G (PorB1b), H105Y (MtrR) and A deletion (mtrR promoter) mutations, were associated with elevated ESC MICs. A2059G and C2611T mutations in 23S rRNA were associated with HLAzi-R and azithromycin MICs of 4-32 mg/L, respectively. The 43 whole-genome sequenced isolates belonged to 31 NG-MAST STs. All HLAzi-R isolates belonged to MLST ST1580 and some clonal clustering was observed; however, the isolates differed significantly from the published HLAzi-R isolates from the ongoing UK outbreak. There is good correlation between previously described genetic antimicrobial resistance determinants and phenotypic susceptibility categories for ESCs and azithromycin in N. gonorrhoeae. This work highlights the advantages and potential of whole-genome sequencing to be applied at scale in the surveillance of antibiotic resistant strains of N. gonorrhoeae, both locally and internationally.

  14. PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.

    PubMed

    Wang, Huilin; Wang, Mingjun; Tan, Hao; Li, Yuan; Zhang, Ziding; Song, Jiangning

    2014-01-01

    X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed 'PredPPCrys' using the support vector machine (SVM). Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I). Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II), which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. 
    In addition, the predicted crystallization targets of currently non-crystallizable proteins were provided as compendium data, which are anticipated to facilitate target selection and design for the worldwide structural genomics consortium. PredPPCrys is freely available at http://www.structbioinfor.org/PredPPCrys.

  15. Reconstruction of DNA sequences using genetic algorithms and cellular automata: towards mutation prediction?

    PubMed

    Mizas, Ch; Sirakoulis, G Ch; Mardiris, V; Karafyllidis, I; Glykos, N; Sandaltzopoulos, R

    2008-04-01

    Change of DNA sequence that fuels evolution is, to a certain extent, a deterministic process because mutagenesis does not occur in an absolutely random manner. So far, it has not been possible to decipher the rules that govern DNA sequence evolution due to the extreme complexity of the entire process. In our attempt to approach this issue we focus solely on the mechanisms of mutagenesis and deliberately disregard the role of natural selection. Hence, in this analysis, evolution refers to the accumulation of genetic alterations that originate from mutations and are transmitted through generations without being subjected to natural selection. We have developed a software tool that allows modelling of a DNA sequence as a one-dimensional cellular automaton (CA) with four states per cell which correspond to the four DNA bases, i.e. A, C, T and G. The four states are represented by numbers of the quaternary number system. Moreover, we have developed genetic algorithms (GAs) in order to determine the rules of CA evolution that simulate the DNA evolution process. Linear evolution rules were considered and square matrices were used to represent them. If DNA sequences of different evolution steps are available, our approach allows the determination of the underlying evolution rule(s). Conversely, once the evolution rules are deciphered, our tool may reconstruct the DNA sequence in any previous evolution step for which the exact sequence information was unknown. The developed tool may be used to test various parameters that could influence evolution. We describe a paradigm relying on the assumption that mutagenesis is governed by a near-neighbour-dependent mechanism. Based on the satisfactory performance of our system in the deliberately simplified example, we propose that our approach could offer a starting point for future attempts to understand the mechanisms that govern evolution. The developed software is open-source and has a user-friendly graphical input interface.
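
    The abstract describes a four-state, one-dimensional cellular automaton whose linear evolution rules are stored as square matrices. The sketch below shows one way such a step could look. It is not the authors' tool: the base-to-digit encoding (A=0, C=1, T=2, G=3), the periodic boundary, the nearest-neighbour weights, and all function names are assumptions made for illustration.

```python
BASES = "ACTG"  # assumed encoding: A=0, C=1, T=2, G=3 (quaternary digits)


def encode(seq):
    """Map a DNA string to a list of quaternary cell states."""
    return [BASES.index(base) for base in seq.upper()]


def decode(states):
    """Map quaternary cell states back to a DNA string."""
    return "".join(BASES[s] for s in states)


def neighbour_rule_matrix(n, left, centre, right):
    """Build an n x n rule matrix in which each cell depends only on itself
    and its two nearest neighbours (periodic boundary, an assumption)."""
    matrix = [[0] * n for _ in range(n)]
    for i in range(n):
        matrix[i][(i - 1) % n] += left
        matrix[i][i] += centre
        matrix[i][(i + 1) % n] += right
    return matrix


def step(matrix, states):
    """Apply one linear evolution step: new_state = matrix * state (mod 4)."""
    return [sum(w * s for w, s in zip(row, states)) % 4 for row in matrix]


if __name__ == "__main__":
    rule = neighbour_rule_matrix(4, left=1, centre=1, right=0)
    print(decode(step(rule, encode("ACTG"))))
```

    Running the automaton backwards to an earlier step, as the abstract mentions, would additionally require the rule matrix to be invertible modulo 4, which this sketch does not check.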

  16. Risk of Breast Cancer with CXCR4-using HIV Defined by V3-Loop Sequencing

    PubMed Central

    Goedert, James J.; Swenson, Luke C.; Napolitano, Laura A.; Haddad, Mojgan; Anastos, Kathryn; Minkoff, Howard; Young, Mary; Levine, Alexandra; Adeyemi, Oluwatoyin; Seaberg, Eric C.; Aouizerat, Bradley; Rabkin, Charles S.; Harrigan, P. Richard; Hessol, Nancy A.

    2014-01-01

    Objective: Evaluate the risk of female breast cancer associated with HIV-CXCR4 (X4) tropism as determined by various genotypic measures. Methods: A breast cancer case-control study, with pairwise comparisons of tropism determination methods, was conducted. From the Women's Interagency HIV Study repository, one stored plasma specimen was selected from 25 HIV-infected cases near the breast cancer diagnosis date and 75 HIV-infected control women matched for age and calendar date. HIVgp120-V3 sequences were derived by Sanger population sequencing (PS) and 454-pyro deep sequencing (DS). Sequencing-based HIV-X4 tropism was defined using the geno2pheno algorithm, with both high-stringency DS [False-Positive-Rate (FPR 3.5) and 2% X4 cutoff], and lower stringency DS (FPR 5.75, 15% X4 cutoff). Concordance of tropism results by PS, DS, and previously performed phenotyping was assessed with kappa (κ) statistics. Case-control comparisons used exact P-values and conditional logistic regression. Results: In 74 women (19 cases, 55 controls) with complete results, prevalence of HIV-X4 by PS was 5% in cases vs 29% in controls (P=0.06, odds ratio 0.14, confidence interval 0.003-1.03). Smaller case-control prevalence differences were found with high-stringency DS (21% vs 36%, P=0.32), lower-stringency DS (16% vs 35%, P=0.18), and phenotyping (11% vs 31%, P=0.10). HIV-X4-tropism concordance was best between PS and lower-stringency DS (93%, κ=0.83). Other pairwise concordances were 82%-92% (κ=0.56-0.81). Concordance was similar among cases and controls. Conclusions: HIV-X4 defined by population sequencing (PS) had good agreement with lower stringency deep sequencing and was significantly associated with lower odds of breast cancer. PMID:25321183
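
    The concordance figures in this record pair a raw percent agreement with a kappa statistic. As a rough illustration of how the two relate for a pair of binary tropism calls, the sketch below computes both from a 2x2 agreement table. The function name and the counts in the example are invented for illustration and are not the study's data.

```python
def agreement_and_kappa(both_pos, only_first, only_second, both_neg):
    """Return (observed agreement, Cohen's kappa) for two binary classifiers.

    both_pos    : samples called X4 by both methods
    only_first  : samples called X4 by the first method only
    only_second : samples called X4 by the second method only
    both_neg    : samples called non-X4 by both methods
    """
    n = both_pos + only_first + only_second + both_neg
    if n == 0:
        raise ValueError("empty agreement table")
    observed = (both_pos + both_neg) / n
    # Chance agreement from the marginal positive and negative rates.
    first_pos = both_pos + only_first
    second_pos = both_pos + only_second
    expected = (first_pos * second_pos + (n - first_pos) * (n - second_pos)) / n**2
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return observed, (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to make the arithmetic easy to follow.
    observed, kappa = agreement_and_kappa(20, 5, 10, 65)
    print(f"agreement={observed:.2f} kappa={kappa:.3f}")
```

    Kappa discounts the agreement expected by chance, which is why a high percent agreement can coincide with a noticeably lower kappa when one category dominates.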

  17. Program Synthesizes UML Sequence Diagrams

    NASA Technical Reports Server (NTRS)

    Barry, Matthew R.; Osborne, Richard N.

    2006-01-01

    A computer program called "Rational Sequence" generates Unified Modeling Language (UML) sequence diagrams of a target Java program running on a Java virtual machine (JVM). Rational Sequence thereby performs a reverse engineering function that aids in the design documentation of the target Java program. Whereas the construction of sequence diagrams was previously a tedious manual process, Rational Sequence generates UML sequence diagrams automatically from the running Java code.

  18. Dominant Sequences of Human Major Histocompatibility Complex Conserved Extended Haplotypes from HLA-DQA2 to DAXX

    PubMed Central

    Larsen, Charles E.; Alford, Dennis R.; Trautwein, Michael R.; Jalloh, Yanoh K.; Tarnacki, Jennifer L.; Kunnenkeri, Sushruta K.; Fici, Dolores A.; Yunis, Edmond J.; Awdeh, Zuheir L.; Alper, Chester A.

    2014-01-01

    We resequenced and phased 27 kb of DNA within 580 kb of the MHC class II region in 158 population chromosomes, most of which were conserved extended haplotypes (CEHs) of European descent or contained their centromeric fragments. We determined the single nucleotide polymorphism and deletion-insertion polymorphism alleles of the dominant sequences from HLA-DQA2 to DAXX for these CEHs. Nine of 13 CEHs remained sufficiently intact to possess a dominant sequence extending at least to DAXX, 230 kb centromeric to HLA-DPB1. We identified the regions centromeric to HLA-DQB1 within which single instances of eight “common” European MHC haplotypes previously sequenced by the MHC Haplotype Project (MHP) were representative of those dominant CEH sequences. Only two MHP haplotypes had a dominant CEH sequence throughout the centromeric and extended class II region and one MHP haplotype did not represent a known European CEH anywhere in the region. We identified the centromeric recombination transition points of other MHP sequences from CEH representation to non-representation. Several CEH pairs or groups shared sequence identity in small blocks but had significantly different (although still conserved for each separate CEH) sequences in surrounding regions. These patterns partly explain strong calculated linkage disequilibrium over only short (tens to hundreds of kilobases) distances in the context of a finite number of observed megabase-length CEHs comprising half a population's haplotypes. Our results provide a clearer picture of European CEH class II allelic structure and population haplotype architecture, improved regional CEH markers, and raise questions concerning regional recombination hotspots. PMID:25299700

  19. Identifying functional thermodynamics in autonomous Maxwellian ratchets

    NASA Astrophysics Data System (ADS)

    Boyd, Alexander B.; Mandal, Dibyendu; Crutchfield, James P.

    2016-02-01

    We introduce a family of Maxwellian Demons for which correlations among information bearing degrees of freedom can be calculated exactly and in compact analytical form. This allows one to precisely determine Demon functional thermodynamic operating regimes, when previous methods either misclassify or simply fail due to approximations they invoke. This reveals that these Demons are more functional than previous candidates. They too behave either as engines, lifting a mass against gravity by extracting energy from a single heat reservoir, or as Landauer erasers, consuming external work to remove information from a sequence of binary symbols by decreasing their individual uncertainty. Going beyond these, our Demon exhibits a new functionality that erases bits not by simply decreasing individual-symbol uncertainty, but by increasing inter-bit correlations (that is, by adding temporal order) while increasing single-symbol uncertainty. In all cases, but especially in the new erasure regime, exactly accounting for informational correlations leads to tight bounds on Demon performance, expressed as a refined Second Law of thermodynamics that relies on the Kolmogorov-Sinai entropy for dynamical processes and not on changes purely in system configurational entropy, as previously employed. We rigorously derive the refined Second Law under minimal assumptions and so it applies quite broadly—for Demons with and without memory and input sequences that are correlated or not. We note that general Maxwellian Demons readily violate previously proposed, alternative such bounds, while the current bound still holds. As such, it broadly describes the minimal energetic cost of any computation by a thermodynamic system.

  20. Identification of sex-linked SNP markers using RAD sequencing suggests ZW/ZZ sex determination in Pistacia vera L.

    PubMed

    Kafkas, Salih; Khodaeiaminjan, Mortaza; Güney, Murat; Kafkas, Ebru

    2015-02-18

    Pistachio (Pistacia vera L.) is a dioecious species that has a long juvenility period. Therefore, development of marker-assisted selection (MAS) techniques would greatly facilitate pistachio cultivar-breeding programs. The sex determination mechanism is presently unknown in pistachio. The generation of sex-linked markers is likely to reduce time, labor, and costs associated with breeding programs, and will help to clarify the sex determination system in pistachio. Restriction site-associated DNA (RAD) markers were used to identify sex-linked markers and to elucidate the sex determination system in pistachio. Eight male and eight female F1 progenies from a Pistacia vera L. Siirt × Bağyolu cross, along with the parents, were subjected to RAD sequencing in two lanes of a Hi-Seq 2000 sequencing platform. This generated 449 million reads, comprising approximately 37.7 Gb of sequences. There were 33,757 polymorphic single nucleotide polymorphism (SNP) loci between the parents. Thirty-eight of these, from 28 RAD reads, were detected as putative sex-associated loci in pistachio. Validation was performed by SNaPshot analysis in 42 mature F1 progenies and in 124 cultivars and genotypes in a germplasm collection. Eight loci could distinguish sex with 100% accuracy in pistachio. To ascertain cost-effective application of markers in a breeding program, high-resolution melting (HRM) analysis was performed; four markers were found to perfectly separate sexes in pistachio. Because of the female heterogamety in all candidate SNP loci, we report for the first time that pistachio has a ZZ/ZW sex determination system. As the reported female-to-male segregation ratio is 1:1 in all known segregating populations and there is no previous report of super-female genotypes or female heteromorphic chromosomes in pistachio, it appears that the WW genotype is not viable. 
    Sex-linked SNP markers were identified and validated in a large germplasm and proved their suitability for MAS in pistachio. HRM analysis successfully validated the sex-linked markers for MAS. For the first time in dioecious pistachio, a female heterogamety ZW/ZZ sex determination system is suggested.
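
    The inference of female heterogamety rests on a simple genotype pattern: at a candidate locus, every female is heterozygous and every male is homozygous. A minimal sketch of that screen follows. The data layout, sample names, and function names are hypothetical, and a real RAD-seq pipeline would also have to handle missing calls and genotyping error.

```python
def is_heterozygous(genotype):
    """Return True for a diploid call such as 'A/G', False for 'A/A'."""
    first, second = genotype.split("/")
    return first != second


def female_heterogametic_loci(genotypes, sex):
    """Return loci where all females are heterozygous and all males homozygous.

    genotypes : {locus: {sample: 'A/G'}}
    sex       : {sample: 'F' or 'M'}
    """
    hits = []
    for locus, calls in genotypes.items():
        females = [is_heterozygous(g) for s, g in calls.items() if sex[s] == "F"]
        males = [is_heterozygous(g) for s, g in calls.items() if sex[s] == "M"]
        # Require at least one sample of each sex so the pattern is informative.
        if females and males and all(females) and not any(males):
            hits.append(locus)
    return hits


if __name__ == "__main__":
    # Toy genotypes, invented for illustration only.
    sex = {"f1": "F", "f2": "F", "m1": "M", "m2": "M"}
    genotypes = {
        "snp1": {"f1": "A/G", "f2": "A/G", "m1": "A/A", "m2": "A/A"},
        "snp2": {"f1": "C/T", "f2": "C/C", "m1": "C/C", "m2": "C/T"},
        "snp3": {"f1": "G/G", "f2": "G/G", "m1": "G/T", "m2": "G/T"},
    }
    print(female_heterogametic_loci(genotypes, sex))
```

    The mirror-image pattern (males heterozygous, females homozygous, as in snp3 above) would instead point toward male heterogamety and is deliberately excluded.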

  1. Protein-protein Förster resonance energy transfer analysis of nucleosome core particles containing H2A and H2A.Z.

    PubMed

    Hoch, Duane A; Stratton, Jessica J; Gloss, Lisa M

    2007-08-24

    A protein-protein Förster resonance energy transfer (FRET) system, employing probes at multiple positions, was designed to specifically monitor the dissociation of the H2A-H2B dimer from the nucleosome core particle (NCP). Tryptophan donors and Cys-AEDANS acceptors were chosen because, compared to previous NCP FRET fluorophores, they: (1) are smaller and less hydrophobic, which should minimize perturbations of histone and NCP structure; and (2) have an R0 of 20 Å, which is much less than the dimensions of the NCP (approximately 50 Å width and approximately 100 Å diameter). Equilibrium protein unfolding titrations indicate that the donor and acceptor moieties have minimal effects on the stability of the H2A-H2B dimer and (H3-H4)2 tetramer. NCPs containing the various FRET pairs were reconstituted with the 601 DNA positioning element. Equilibrium NaCl-induced dissociation of the modified NCPs showed that the 601 sequence stabilized the NCP to dimer dissociation relative to weaker positioning sequences. This finding implies a significant role for the H2A-H2B dimers in determining the DNA sequence dependence of NCP stability. The free energy of dissociation determined from reversible and well-defined sigmoidal transitions revealed two distinct phases reflecting the dissociation of individual H2A-H2B dimers, confirming cooperativity as suggested previously; these data allow quantitative description of the cooperativity. The FRET system was then used to study the effects of the histone variant H2A.Z on NCP stability; previous studies have reported both destabilizing and stabilizing effects. H2A.Z FRET NCP dissociation transitions suggest a slight increase in stability but a significant increase in cooperativity of the dimer dissociations. Thus, the utility of this protein-protein FRET system to monitor the effects of histone variants on NCP dynamics has been demonstrated, and the system appears equally well-suited for dissection of the kinetic processes of dimer association and dissociation from the NCP.
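
    Illustrative note (not part of the abstract): the textbook Förster relation, E = 1 / (1 + (r/R0)^6), suggests why an R0 of 20 Å makes the signal sensitive to local dimer dissociation rather than to the overall size of the NCP. The sketch below assumes that standard relation; the function name and the sample distances are hypothetical choices for illustration, not values from the study.

```python
def fret_efficiency(r_angstrom: float, r0_angstrom: float = 20.0) -> float:
    """Forster transfer efficiency E = 1 / (1 + (r / R0)^6).

    r_angstrom  -- donor-acceptor distance in angstroms (assumed input)
    r0_angstrom -- Forster radius; 20 is the R0 quoted in the abstract
    """
    if r_angstrom < 0 or r0_angstrom <= 0:
        raise ValueError("distance must be >= 0 and R0 must be > 0")
    return 1.0 / (1.0 + (r_angstrom / r0_angstrom) ** 6)


# Sample distances (hypothetical): well inside R0, at R0, and at the
# approximate NCP width and diameter mentioned in the abstract.
for r in (10.0, 20.0, 50.0, 100.0):
    print(f"r = {r:5.1f} A  ->  E = {fret_efficiency(r):.5f}")
```

    By definition the efficiency is one half at r = R0, and the sixth-power dependence makes it fall off steeply beyond that, so probes separated by distances comparable to the NCP dimensions would contribute almost no transfer.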

  2. Rapid Fine Conformational Epitope Mapping Using Comprehensive Mutagenesis and Deep Sequencing

    PubMed Central

    Kowalsky, Caitlin A.; Faber, Matthew S.; Nath, Aritro; Dann, Hailey E.; Kelly, Vince W.; Liu, Li; Shanker, Purva; Wagner, Ellen K.; Maynard, Jennifer A.; Chan, Christina; Whitehead, Timothy A.

    2015-01-01

    Knowledge of the fine location of neutralizing and non-neutralizing epitopes on human pathogens affords a better understanding of the structural basis of antibody efficacy, which will expedite rational design of vaccines, prophylactics, and therapeutics. However, full utilization of the wealth of information from single cell techniques and antibody repertoire sequencing awaits the development of a high throughput, inexpensive method to map the conformational epitopes for antibody-antigen interactions. Here we show such an approach that combines comprehensive mutagenesis, cell surface display, and DNA deep sequencing. We develop analytical equations to identify epitope positions and show the method's effectiveness by mapping the fine epitope for different antibodies targeting TNF, pertussis toxin, and the cancer target TROP2. In all three cases, the experimentally determined conformational epitope was consistent with previous experimental datasets, confirming the reliability of the experimental pipeline. Once the comprehensive library is generated, fine conformational epitope maps can be prepared at a rate of four per day. PMID:26296891

  3. Identification and validation of sex-linked SCAR markers in dioecious Hippophae rhamnoides L. (Elaeagnaceae).

    PubMed

    Korekar, Girish; Sharma, Ram Kumar; Kumar, Rahul; Meenu; Bisht, Naveen C; Srivastava, Ravi B; Ahuja, Paramvir Singh; Stobdan, Tsering

    2012-05-01

    The actinorhizal plant seabuckthorn (Hippophae rhamnoides L., Elaeagnaceae) is a wind pollinated dioecious crop. To distinguish male genotypes from female genotypes early in the vegetative growth phase, we have developed robust PCR-based marker(s). DNA bulk samples from 20 male and 20 female plants each were screened with 60 RAPD primers. Two primers, OPA-04 and OPT-06 consistently amplified female-specific (FS) polymorphic fragments of 1,164 and 868 bp, respectively, that were absent in the male samples. DNA sequence of the two markers did not exhibit significant similarity to previously characterized sequences. A sequence-characterized amplified region marker HrX1 (JQ284019) and HrX2 (JQ284020) designed for the two fragments, continued to amplify the FS allele in 120 female plants but not in 100 male plants tested in the current study. Thus, HrX1 and HrX2 are FS markers that can determine the sex of seabuckthorn plants in an early stage and expedite cultivations for industrial applications.

  4. Plasmid origin of replication of herpesvirus papio: DNA sequence and enhancer function.

    PubMed Central

    Loeb, D D; Sung, N S; Pesano, R L; Sexton, C J; Hutchison, C; Pagano, J S

    1990-01-01

    Herpesvirus papio (HVP) is a lymphotropic virus of baboons which is related to Epstein-Barr virus (EBV) and produces latent infection. The nucleotide sequence of the 5,775-base-pair (bp) EcoRI K fragment of HVP, which has previously been shown to confer the ability to replicate autonomously, has been determined. Within this DNA fragment is a region which bears structural and sequence similarity to the ori-P region of EBV. The HVP ori-P region has a 10- by 26-bp tandem array which is related to the 20- by 30-bp tandem array from the EBV ori-P region. In HVP there is an intervening region of 764 bp followed by five partial copies of the 26-bp monomer. Both the EBV and HVP 3' regions have the potential to form dyad structures which, however, differ in arrangement. We also demonstrate that a transcriptional enhancer which requires transactivation by a virus-encoded factor is present in the HVP ori-P. PMID:2159548

  5. Computational-Model-Based Analysis of Context Effects on Harmonic Expectancy.

    PubMed

    Morimoto, Satoshi; Remijn, Gerard B; Nakajima, Yoshitaka

    2016-01-01

    Expectancy for an upcoming musical chord, harmonic expectancy, is supposedly based on automatic activation of tonal knowledge. Since previous studies implicitly relied on interpretations based on Western music theory, the underlying computational processes involved in harmonic expectancy and how it relates to tonality need further clarification. In particular, short chord sequences which cannot lead to unique keys are difficult to interpret in music theory. In this study, we examined effects of preceding chords on harmonic expectancy from a computational perspective, using stochastic modeling. We conducted a behavioral experiment, in which participants listened to short chord sequences and evaluated the subjective relatedness of the last chord to the preceding ones. Based on these judgments, we built stochastic models of the computational process underlying harmonic expectancy. Following this, we compared the explanatory power of the models. Our results imply that, even when listening to short chord sequences, internally constructed and updated tonal assumptions determine the expectancy of the upcoming chord.

  6. Computational-Model-Based Analysis of Context Effects on Harmonic Expectancy

    PubMed Central

    Morimoto, Satoshi; Remijn, Gerard B.; Nakajima, Yoshitaka

    2016-01-01

    Expectancy for an upcoming musical chord, harmonic expectancy, is supposedly based on automatic activation of tonal knowledge. Since previous studies implicitly relied on interpretations based on Western music theory, the underlying computational processes involved in harmonic expectancy and how it relates to tonality need further clarification. In particular, short chord sequences which cannot lead to unique keys are difficult to interpret in music theory. In this study, we examined effects of preceding chords on harmonic expectancy from a computational perspective, using stochastic modeling. We conducted a behavioral experiment, in which participants listened to short chord sequences and evaluated the subjective relatedness of the last chord to the preceding ones. Based on these judgments, we built stochastic models of the computational process underlying harmonic expectancy. Following this, we compared the explanatory power of the models. Our results imply that, even when listening to short chord sequences, internally constructed and updated tonal assumptions determine the expectancy of the upcoming chord. PMID:27003807

  7. Structure of a Trypanosoma Brucei Alpha/Beta-Hydrolase Fold Protein With Unknown Function

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Merritt, E.A.; Holmes, M.; Buckner, F.S.

    2009-05-26

    The structure of a structural genomics target protein, Tbru020260AAA from Trypanosoma brucei, has been determined to a resolution of 2.2 Å using multiple-wavelength anomalous diffraction at the Se K edge. This protein belongs to Pfam sequence family PF08538 and is only distantly related to previously studied members of the α/β-hydrolase fold family. Structural superposition onto representative α/β-hydrolase fold proteins of known function indicates that a possible catalytic nucleophile, Ser116 in the T. brucei protein, lies at the expected location. However, the present structure and by extension the other trypanosomatid members of this sequence family have neither sequence nor structural similarity at the location of other active-site residues typical for proteins with this fold. Together with the presence of an additional domain between strands β6 and β7 that is conserved in trypanosomatid genomes, this suggests that the function of these homologs has diverged from other members of the fold family.

  8. Mitochondrial genome of the bullet tuna Auxis rochei from Indo-West Pacific collection provides novel genetic information about two subspecies.

    PubMed

    Li, Mingming; Guo, Liang; Zhang, Heng; Yang, Sen; Chen, Xinghan; Lin, Haoran; Meng, Zining

    2016-09-01

    Previous morphological studies supported the division of the bullet tuna into two subspecies, Auxis rochei rochei and A. rochei eudorax. As a cosmopolitan species, A. rochei rochei ranges across the Indo-West Pacific and Atlantic oceans, while A. rochei eudorax inhabits the eastern Pacific region. Here, we used the HiSeq next-generation sequencing technique to determine the mitochondrial genome (mitogenome) of A. rochei from an Indo-West Pacific collection, and then compared our data with mitogenomic sequences from the Atlantic and eastern Pacific retrieved from the NCBI database. Results showed that the mitogenomes of A. rochei from the three geographic collections shared the same genes and gene order, similar to typical teleosts. We also observed a low level of nucleotide diversity among these mitogenomic sequences. Interestingly, intra-subspecies nucleotide diversity (Atlantic versus Indo-West) was higher than inter-subspecies diversity (Atlantic versus eastern Pacific, Indo-West versus eastern Pacific).

  9. First report of human parvovirus 4 detection in Iran.

    PubMed

    Asiyabi, Sanaz; Nejati, Ahmad; Shoja, Zabihollah; Shahmahmoodi, Shohreh; Jalilvand, Somayeh; Farahmand, Mohammad; Gorzin, Ali-Akbar; Najafi, Alireza; Haji Mollahoseini, Mostafa; Marashi, Sayed Mahdi

    2016-08-01

    Parvovirus 4 (PARV4) is an emerging and intriguing virus that has recently received considerable attention. The high prevalence of PARV4 infection in high-risk groups such as HIV infected patients highlights the potential clinical outcomes that this virus might have. Molecular techniques were used to determine both the presence and the genotype of circulating PARV4 in previously collected serum samples from 133 HIV infected patients and 120 healthy blood donors. Nested PCR was applied to assess the presence of the PARV4 DNA genome in both groups. PARV4 DNA was detected in 35.3% of HIV infected patients compared to 16.6% of healthy donors. To genetically characterize the PARV4 genotype in these groups, positive samples were randomly selected and subjected to sequencing and phylogenetic analysis. All PARV4 sequences were found to be genotype 1 and clustered with the reference sequences of PARV4 genotype 1. J. Med. Virol. 88:1314-1318, 2016. © 2016 Wiley Periodicals, Inc.

  10. Clinical features of X linked juvenile retinoschisis associated with new mutations in the XLRS1 gene in Italian families.

    PubMed

    Simonelli, F; Cennamo, G; Ziviello, C; Testa, F; de Crecchio, G; Nesti, A; Manitto, M P; Ciccodicola, A; Banfi, S; Brancato, R; Rinaldi, E

    2003-09-01

    To describe the clinical phenotype of X linked juvenile retinoschisis in eight Italian families with six different mutations in the XLRS1 gene. Complete ophthalmic examinations, electroretinography and A and B-scan standardised echography were performed in 18 affected males. The coding sequences of the XLRS1 gene were amplified by polymerase chain reaction and directly sequenced on an automated sequencer. Six different XLRS1 mutations were identified; two of these mutations, Ile81Asn and Trp122Cys, have not been previously described. The affected males showed an electronegative response to the standard white scotopic stimulus and a prolonged implicit time of the 30 Hz flicker. In the families with Trp112Cys and Trp122Cys mutations we observed a more severe retinoschisis (RS) clinical picture compared with the other genotypes. The severe RS phenotypes associated with the Trp112Cys and Trp122Cys mutations suggest that these mutations determine a notable alteration in the function of the retinoschisin protein.

  11. Isolation, purification and functional characterization of alpha-BnIA from Conus bandanus venom.

    PubMed

    Nguyen, Bao; Le Caer, Jean-Pierre; Aráoz, Romulo; Thai, Robert; Lamthanh, Hung; Benoit, Evelyne; Molgó, Jordi

    2014-12-01

    We report the isolation and characterization by proteomic approach of a native conopeptide, named BnIA, from the crude venom of Conus bandanus, a molluscivorous cone snail species, collected in the South central coast of Vietnam. Its primary sequence was determined by matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometry using collision-induced dissociation and confirmed by Edman's degradation of the pure native fraction. BnIA was present in high amounts in the crude venom and the complete sequence of the 16 amino acid peptide was the following GCCSHPACSVNNPDIC*, with C-terminal amidation deduced from Edman's degradation and theoretical monoisotopic mass calculation. Sequence alignment revealed that its -C1C2X4C3X7C4- pattern belongs to the A-superfamily of conopeptides. The cysteine connectivity of BnIA was 1-3/2-4 as determined by partial-reduction technique, like other α4/7-conotoxins, reported previously on other Conus species. Additionally, we found that native α-BnIA shared the same sequence alignment as Mr1.1, from the closely related molluscivorous Conus marmoreus venom, in specimens collected in the same coastal region of Vietnam. Functional studies revealed that native α-BnIA inhibited acetylcholine-evoked currents reversibly in oocytes expressing the human α7 nicotinic acetylcholine receptors, and blocked nerve-evoked skeletal muscle contractions in isolated mouse neuromuscular preparations, but with ∼200-times less potency. Copyright © 2014 Elsevier Ltd. All rights reserved.
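
    Illustrative note (not part of the abstract): the -C1C2X4C3X7C4- pattern can be read directly off the reported sequence by counting the residues between consecutive cysteines. The sketch below is a minimal check under the assumption that the trailing "*" only marks the C-terminal amidation and is not a residue; the function name is hypothetical.

```python
def cysteine_loops(peptide: str) -> list[int]:
    """Return the number of residues between each pair of consecutive cysteines.

    A trailing '*' (used in the abstract to mark C-terminal amidation)
    is stripped before counting.
    """
    seq = peptide.rstrip("*")
    cys_positions = [i for i, aa in enumerate(seq) if aa == "C"]
    return [b - a - 1 for a, b in zip(cys_positions, cys_positions[1:])]


loops = cysteine_loops("GCCSHPACSVNNPDIC*")
print(loops)  # [0, 4, 7]: adjacent C1C2, then a 4-residue and a 7-residue loop
```

    The loop sizes of 4 and 7 between C2-C3 and C3-C4 match the α4/7 designation used in the abstract.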

  12. Different rates of spontaneous mutation of chloroplastic and nuclear viroids as determined by high-fidelity ultra-deep sequencing.

    PubMed

    López-Carrasco, Amparo; Ballesteros, Cristina; Sentandreu, Vicente; Delgado, Sonia; Gago-Zachert, Selma; Flores, Ricardo; Sanjuán, Rafael

    2017-09-01

    Mutation rates vary by orders of magnitude across biological systems, being higher for simpler genomes. The simplest known genomes correspond to viroids, subviral plant replicons constituted by circular non-coding RNAs of few hundred bases. Previous work has revealed an extremely high mutation rate for chrysanthemum chlorotic mottle viroid, a chloroplast-replicating viroid. However, whether this is a general feature of viroids remains unclear. Here, we have used high-fidelity ultra-deep sequencing to determine the mutation rate in a common host (eggplant) of two viroids, each representative of one family: the chloroplastic eggplant latent viroid (ELVd, Avsunviroidae) and the nuclear potato spindle tuber viroid (PSTVd, Pospiviroidae). This revealed higher mutation frequencies in ELVd than in PSTVd, as well as marked differences in the types of mutations produced. Rates of spontaneous mutation, quantified in vivo using the lethal mutation method, ranged from 1/1000 to 1/800 for ELVd and from 1/7000 to 1/3800 for PSTVd depending on sequencing run. These results suggest that extremely high mutability is a common feature of chloroplastic viroids, whereas the mutation rates of PSTVd and potentially other nuclear viroids appear significantly lower and closer to those of some RNA viruses.

  13. Different rates of spontaneous mutation of chloroplastic and nuclear viroids as determined by high-fidelity ultra-deep sequencing

    PubMed Central

    Ballesteros, Cristina; Sentandreu, Vicente; Gago-Zachert, Selma

    2017-01-01

    Mutation rates vary by orders of magnitude across biological systems, being higher for simpler genomes. The simplest known genomes correspond to viroids, subviral plant replicons constituted by circular non-coding RNAs of few hundred bases. Previous work has revealed an extremely high mutation rate for chrysanthemum chlorotic mottle viroid, a chloroplast-replicating viroid. However, whether this is a general feature of viroids remains unclear. Here, we have used high-fidelity ultra-deep sequencing to determine the mutation rate in a common host (eggplant) of two viroids, each representative of one family: the chloroplastic eggplant latent viroid (ELVd, Avsunviroidae) and the nuclear potato spindle tuber viroid (PSTVd, Pospiviroidae). This revealed higher mutation frequencies in ELVd than in PSTVd, as well as marked differences in the types of mutations produced. Rates of spontaneous mutation, quantified in vivo using the lethal mutation method, ranged from 1/1000 to 1/800 for ELVd and from 1/7000 to 1/3800 for PSTVd depending on sequencing run. These results suggest that extremely high mutability is a common feature of chloroplastic viroids, whereas the mutation rates of PSTVd and potentially other nuclear viroids appear significantly lower and closer to those of some RNA viruses. PMID:28910391

  14. Solar-Type Stars with the Suppression of Convection at an Early Stage of Evolution

    NASA Astrophysics Data System (ADS)

    Oreshina, A. V.; Baturin, V. A.; Ayukov, S. V.; Gorshkov, A. B.

    2017-12-01

    The evolution of a solar-mass star before and on the main sequence is analyzed in light of the diminished efficiency of convection in the first 500 Myr. A numerical simulation has been performed with the CESAM2k code. It is shown that the suppression of convection in the early stages of evolution leads to a somewhat higher lithium content than that predicted by the classical solar model. In addition, the star's effective temperature decreases. Ignoring this phenomenon may lead to errors in age and mass determinations for young stars (before the main sequence) from standard evolutionary tracks in the temperature-luminosity diagram. At a later stage of evolution, after 500 Myr, the efficiency of convection tends to the solar value. At this stage, the star's inner structure becomes classical; it does not depend on the previous history. On the contrary, the photospheric lithium abundance contains information about the star's past. In other words, there may exist main-sequence solar-mass stars of the same age (above 500 Myr), radius, and luminosity, yet with different photospheric lithium contents. The main results of this work add considerably to the popular method for determining the age of solar-type stars from lithium abundances.

  15. Revising the Evolutionary Stage of HD 163899: The Effects of Convective Overshooting and Rotation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ostrowski, Jakub; Daszyńska-Daszkiewicz, Jadwiga; Cugier, Henryk, E-mail: ostrowski@astro.uni.wroc.pl

    We revise the evolutionary status of the B-type supergiant HD 163899 based on the new determinations of the mass–luminosity ratio, effective temperature, and rotational velocity, as well as on the interpretation of the oscillation spectrum of the star. The observed value of the nitrogen-to-carbon abundance fixes the value of the rotation rate of the star. Now, models more massive than those previously considered are strongly preferred, and it is very likely that the star is still in the main-sequence stage. The rotationally induced mixing manifests as the nitrogen overabundance in the atmosphere, which agrees with our analysis of the HARPS spectra. Thus, HD 163899 probably belongs to a group of evolved nitrogen-rich main-sequence stars.

  16. Complete genome sequence of a new bipartite begomovirus infecting fluted pumpkin (Telfairia occidentalis) plants in Cameroon.

    PubMed

    Leke, Walter N; Khatabi, Behnam; Fondong, Vincent N; Brown, Judith K

    2016-08-01

    The complete genome sequence was determined and characterized for a previously unreported bipartite begomovirus from fluted pumpkin (Telfairia occidentalis, family Cucurbitaceae) plants displaying mosaic symptoms in Cameroon. The DNA-A and DNA-B components were ~2.7 kb and ~2.6 kb in size, and the arrangement of viral coding regions on the genomic components was like those characteristic of other known bipartite begomoviruses originating in the Old World. While the DNA-A component was more closely related to that of chayote yellow mosaic virus (ChaYMV), at 78 %, the DNA-B component was more closely related to that of soybean chlorotic blotch virus (SbCBV), at 64 %. This newly discovered bipartite Old World virus is herein named telfairia mosaic virus (TelMV).

  17. Unifying tephrostratigraphic approaches to redefine major Holocene marker tephras, Mt. Taranaki, New Zealand

    NASA Astrophysics Data System (ADS)

    Damaschke, M.; Cronin, S. J.; Torres-Orozco, R.; Wallace, R. C.

    2017-05-01

    In this study, geochemical fingerprinting of glass shards and titanomagnetite phenocrysts was used to match twenty complex pyroclastic deposits from the flanks of Mt. Taranaki to major tephra fall "marker beds" in medial and distal deposition sites. These correlations hinged upon identifying time-bound compositional changes (a chemostratigraphy) in distal Taranaki tephra-fall sequences preserved in lake and peat sediment records around the volcano. The current work shows that previous soil-stratigraphy based studies led to miscorrelations, because they relied upon radiocarbon dates, a "counting back" approach, and an underestimate of the number of eruptions that actually occurred in any time frame. The new tephrostratigraphy proposed at Mt. Taranaki resulted from stratigraphic rearranging of several earlier-defined units. Some tephra units are older than previously determined (e.g., Waipuku, Tariki, and Mangatoki; 6 to 9 cal ka BP), while one of the most prominent Taranaki marker tephra deposits, the Korito, is shown to lie stratigraphically above a widespread rhyolitic marker bed from Taupo volcano, the Stent Tephra (also known as unit Q; 4.3 cal ka BP). Pyroclastic tephra deposits previously dated between 6 and 4 cal ka BP at a key tephra section, c. 40 km NE of Mt. Taranaki's summit, were misidentified and are now shown to comprise new marker tephra deposits, including the Kokowai (4.7 cal ka BP), which is a prominent marker horizon on the eastern flanks of the volcano. A new local proximal stratigraphy for < 5 cal ka BP tephra units can be well correlated to tephra layers within distal lake and peat sequences, but the differences between the two records indicate that an overall larger number of eruptions have occurred at this volcano than previously thought. This study additionally demonstrates the utility of titanomagnetite chemistry for discrimination and correlation of groups or sequences of tephra deposits - even if unique compositions cannot be identified.

  18. Optimal packaging of FIV genomic RNA depends upon a conserved long-range interaction and a palindromic sequence within gag.

    PubMed

    Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L

    2010-10-15

    The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV ψ. Copyright © 2010 Elsevier Ltd. All rights reserved.
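
    Illustrative note (not part of the abstract): in nucleic acids, "palindromic" usually means that a sequence equals its own reverse complement, which is what allows two copies to base-pair with each other. The sketch below assumes that definition with Watson-Crick pairing only; the function name and the test sequences are hypothetical and are not the FIV gag palindrome, which the abstract does not give.

```python
# Watson-Crick complements for RNA (G-U wobble pairs are ignored here).
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def is_rna_palindrome(seq: str) -> bool:
    """True if seq equals its reverse complement (a self-complementary site)."""
    seq = seq.upper()
    if not seq or set(seq) - set("ACGU"):
        raise ValueError("expected a non-empty RNA sequence over A, C, G, U")
    return seq == seq.translate(_RNA_COMPLEMENT)[::-1]


# Hypothetical example sequences, chosen only to exercise the check.
print(is_rna_palindrome("GGAUCC"))  # self-complementary
print(is_rna_palindrome("GGAUCA"))  # one base changed, no longer palindromic
```

    A substituted sequence can pass this check while still differing in base composition from the wild type, which is consistent with the abstract's observation that a different palindrome did not fully restore packaging.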

  19. First ultraviolet observations of the transition regions of X-ray bright solar-type stars in the Pleiades

    NASA Technical Reports Server (NTRS)

    Caillault, J.-P.; Vilhu, O.; Linsky, J. L.

    1990-01-01

    Results are reported from a UV study of the transition regions of two X-ray-bright solar-type stars from the Pleiades, in an attempt to extend the main sequence age baseline for the transition-region activity-age relation over more than two orders of magnitude. However, no emission lines were detected from either star; the upper limits to the fluxes are consistent with previously determined saturation levels, but do not help to further constrain evolutionary models.

  20. Annotation of the Clostridium Acetobutylicum Genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Daly, M. J.

    The genome sequence of the solvent producing bacterium Clostridium acetobutylicum ATCC824, has been determined by the shotgun approach. The genome consists of a 3.94 Mb chromosome and a 192 kb megaplasmid that contains the majority of genes responsible for solvent production. Comparison of C. acetobutylicum to Bacillus subtilis reveals significant local conservation of gene order, which has not been seen in comparisons of other genomes with similar, or, in some cases, closer, phylogenetic proximity. This conservation allows the prediction of many previously undetected operons in both bacteria.

  1. Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

    PubMed Central

    Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C

    2003-01-01

    Background: Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results: 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion: The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626

  2. HIV Sequence Compendium 2015

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Foley, Brian Thomas; Leitner, Thomas Kenneth; Apetrei, Cristian

    This compendium is an annual printed summary of the data contained in the HIV sequence database. We try to present a judicious selection of the data in such a way that it is of maximum utility to HIV researchers. Each of the alignments attempts to display the genetic variability within the different species, groups and subtypes of the virus. This compendium contains sequences published before January 1, 2015. Hence, though it is published in 2015 and called the 2015 Compendium, its contents correspond to the 2014 curated alignments on our website. The number of sequences in the HIV database is still increasing. In total, at the end of 2014, there were 624,121 sequences in the HIV Sequence Database, an increase of 7% since the previous year. This is the first year that the number of new sequences added to the database has decreased compared to the previous year. The number of near complete genomes (>7000 nucleotides) increased to 5834 by end of 2014. However, as in previous years, the compendium alignments contain only a fraction of these. A more complete version of all alignments is available on our website, http://www.hiv.lanl.gov/content/sequence/NEWALIGN/align.html. As always, we are open to complaints and suggestions for improvement. Inquiries and comments regarding the compendium should be addressed to seq-info@lanl.gov.

  3. Application of the major capsid protein as a marker of the phylogenetic diversity of Emiliania huxleyi viruses.

    PubMed

    Rowe, Janet M; Fabre, Marie-Françoise; Gobena, Daniel; Wilson, William H; Wilhelm, Steven W

    2011-05-01

    Studies of the Phycodnaviridae have traditionally relied on the DNA polymerase (pol) gene as a biomarker. However, recent investigations have suggested that the major capsid protein (MCP) gene may be a reliable phylogenetic biomarker. We used MCP gene amplicons gathered across the North Atlantic to assess the diversity of Emiliania huxleyi-infecting Phycodnaviridae. Nucleotide sequences were examined across >6000 km of open ocean, with comparisons between concentrates of the virus-size fraction of seawater and of lysates generated by exposing host strains to these same virus concentrates. Analyses revealed that many sequences were only sampled once, while several were over-represented. Analyses also revealed nucleotide sequences distinct from previous coastal isolates. Examination of lysed cultures revealed a new richness in phylogeny, as MCP sequences previously unrepresented within the existing collection of E. huxleyi viruses (EhV) were associated with viruses lysing cultures. Sequences were compared with previously described EhV MCP sequences from the North Sea and a Norwegian Fjord, as well as from the Gulf of Maine. Principal component analysis indicates that location-specific distinctions exist despite the presence of sequences common across these environments. Overall, this investigation provides new sequence data and an assessment on the use of the MCP gene. © 2011 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved.

  4. Fast single-pass alignment and variant calling using sequencing data

    USDA-ARS?s Scientific Manuscript database

    Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...

  5. Accurate Filtering of Privacy-Sensitive Information in Raw Genomic Data.

    PubMed

    Decouchant, Jérémie; Fernandes, Maria; Völp, Marcus; Couto, Francisco M; Esteves-Veríssimo, Paulo

    2018-04-13

    Sequencing thousands of human genomes has enabled breakthroughs in many areas, among them precision medicine, the study of rare diseases, and forensics. However, mass collection of such sensitive data entails enormous risks if not protected to the highest standards. In this article, we follow the position and argue that post-alignment privacy is not enough and that data should be automatically protected as early as possible in the genomics workflow, ideally immediately after the data is produced. We show that a previous approach for filtering short reads cannot extend to long reads and present a novel filtering approach that classifies raw genomic data (i.e., whose location and content is not yet determined) into privacy-sensitive (i.e., more affected by a successful privacy attack) and non-privacy-sensitive information. Such a classification allows the fine-grained and automated adjustment of protective measures to mitigate the possible consequences of exposure, in particular when relying on public clouds. We present the first filter that can be applied without distinction to reads of any length, making it usable with any recent or future sequencing technology. The filter is accurate, in the sense that it detects all known sensitive nucleotides except those located in highly variable regions (fewer than 10 nucleotides remain undetected per genome instead of 100,000 in previous works). It has far fewer false positives than previously known methods (10% instead of 60%) and can detect sensitive nucleotides despite sequencing errors (86% detected instead of 56% with 2% of mutations). Finally, practical experiments demonstrate high performance, both in terms of throughput and memory consumption. Copyright © 2018. Published by Elsevier Inc.

  6. Diversity of virus-host systems in hypersaline Lake Retba, Senegal.

    PubMed

    Sime-Ngando, Télesphore; Lucas, Soizick; Robin, Agnès; Tucker, Kimberly Pause; Colombet, Jonathan; Bettarel, Yvan; Desmond, Elie; Gribaldo, Simonetta; Forterre, Patrick; Breitbart, Mya; Prangishvili, David

    2011-08-01

    Remarkable morphological diversity of virus-like particles was observed by transmission electron microscopy in a hypersaline water sample from Lake Retba, Senegal. The majority of particles morphologically resembled hyperthermophilic archaeal DNA viruses isolated from extreme geothermal environments. Some hypersaline viral morphotypes have not been previously observed in nature, and less than 1% of observed particles had a head-and-tail morphology, which is typical for bacterial DNA viruses. Culture-independent analysis of the microbial diversity in the sample suggested the dominance of extremely halophilic archaea. Few of the 16S sequences corresponded to known archaeal genera (Haloquadratum, Halorubrum and Natronomonas), whereas the majority represented novel archaeal clades. Three sequences corresponded to a new basal lineage of the haloarchaea. Bacteria belonged to four major phyla, consistent with the known diversity in saline environments. Metagenomic sequencing of DNA from the purified virus-like particles revealed very few similarities to the NCBI non-redundant database at either the nucleotide or amino acid level. Some of the identifiable virus sequences were most similar to previously described haloarchaeal viruses, but no sequence similarities were found to archaeal viruses from extreme geothermal environments. A large proportion of the sequences had similarity to previously sequenced viral metagenomes from solar salterns. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.

  7. A core microbiome associated with the peritoneal tumors of pseudomyxoma peritonei

    PubMed Central

    2013-01-01

    Background Pseudomyxoma peritonei (PMP) is a malignancy characterized by dissemination of mucus-secreting cells throughout the peritoneum. This disease is associated with significant morbidity and mortality and despite effective treatment options for early-stage disease, patients with PMP often relapse. Thus, there is a need for additional treatment options to reduce relapse rate and increase long-term survival. A previous study identified the presence of both typed and non-culturable bacteria associated with PMP tissue and determined that increased bacterial density was associated with more severe disease. These findings highlighted the possible role for bacteria in PMP disease. Methods To more clearly define the bacterial communities associated with PMP disease, we employed a sequence-based analysis to profile the bacterial populations found in PMP tumor and mucin tissue in 11 patients. Sequencing data were confirmed by in situ hybridization at multiple taxonomic depths and by culturing. A pilot clinical study was initiated to determine whether the addition of antibiotic therapy affected PMP patient outcome. Main results We determined that the types of bacteria present are highly conserved in all PMP patients; the dominant phyla are the Proteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. A core set of taxon-specific sequences was found in all 11 patients; many of these sequences were classified into taxonomic groups that also contain known human pathogens. In situ hybridization directly confirmed the presence of bacteria in PMP at multiple taxonomic depths and supported our sequence-based analysis. Furthermore, culturing of PMP tissue samples allowed us to isolate 11 different bacterial strains from eight independent patients, and in vitro analysis of a subset of these isolates suggests that at least some of these strains may interact with the PMP-associated mucin MUC2.
Finally, we provide evidence suggesting that targeting these bacteria with antibiotic treatment may increase the survival of PMP patients. Conclusions Using 16S amplicon-based sequencing, direct in situ hybridization analysis and culturing methods, we have identified numerous bacterial taxa that are consistently present in all PMP patients tested. Combined with data from a pilot clinical study, these data support the hypothesis that adding antimicrobials to the standard PMP treatment could improve PMP patient survival. PMID:23844722

  8. Sequence History Update Tool

    NASA Technical Reports Server (NTRS)

    Khanampompan, Teerapat; Gladden, Roy; Fisher, Forest; DelGuercio, Chris

    2008-01-01

    The Sequence History Update Tool performs Web-based sequence statistics archiving for Mars Reconnaissance Orbiter (MRO). Using a single UNIX command, the software takes advantage of sequencing conventions to automatically extract the needed statistics from multiple files. This information is then used to populate a PHP database, which is then seamlessly formatted into a dynamic Web page. This tool replaces a previous tedious and error-prone process of manually editing HTML code to construct a Web-based table. Because the tool manages all of the statistics gathering and file delivery to and from multiple data sources spread across multiple servers, there are also considerable time and effort savings. With the Sequence History Update Tool, what previously took minutes is now done in less than 30 seconds, and the tool now provides a more accurate archival record of the sequence commanding for MRO.

  9. Molecular delineation of the Agave Red Worm Comadia redtenbacheri (Lepidoptera: Cossidae).

    PubMed

    Cárdenas-Aquino, María Del Rosario; Alarcón-Rodríguez, Norma Marina; Rivas-Medrano, Mario; González-Hernández, Héctor; Vargas-Hernández, Mateo; Sánchez-Arroyo, Hussein; Llanderal-Cázares, Celina

    2018-01-25

    Comadia redtenbacheri (Hammerschmidt) (Agave Red Worm) is the only member of the family Cossidae that has been described as a phytophagous specialist of the plant genus Agave, which is mainly distributed in México. A new extraction protocol adapted from Stewart Via (1993) has been implemented for sequencing the COI gene from samples collected in five states of the North Central (Querétaro and Zacatecas), South Central (Estado de México) and East Central (Hidalgo and Tlaxcala) regions of México with the purpose of contributing to delineation of the species. A Maximum Likelihood (ML) tree based on these COI sequences as well as COI sequences from other Cossinae species was developed to complement the existing morphological and taxonomic approaches to delineation of this species. As expected, our Comadia samples cluster together within a monophyletic clade that includes four C. redtenbacheri sequences previously reported. This group seems to be consistent with our reconstruction, which is supported by a bootstrap value of over 99%. The closely related branches associated with the latter group include organisms known to be the plant and tree borers of the Cossinae subfamily. The COI sequences from our samples were analyzed to determine the percentage of identity among the C. redtenbacheri in a first attempt to detect differences in the sequence that matches a particular region of México.
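
    The percentage of identity mentioned in the abstract above can be illustrated with a minimal Python sketch. The abstract does not say how identity was computed, so everything here is an assumption: the function name percent_identity is hypothetical, the two sequences are taken to be already aligned to equal length, and columns containing a gap ("-") are skipped, which is only one of several common conventions. The example sequences are invented and are not COI data from the study.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: columns where either sequence has a gap ('-')
    are excluded from both the match count and the denominator.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Invented toy sequences: 7 of 8 aligned positions agree.
print(percent_identity("ATGCATGC", "ATGCTTGC"))
```

    Other conventions (for example, counting gaps as mismatches or dividing by the shorter sequence length) give different values, so the convention should be stated whenever identities are compared across studies.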

  10. The Number, Organization, and Size of Polymorphic Membrane Protein Coding Sequences as well as the Most Conserved Pmp Protein Differ within and across Chlamydia Species.

    PubMed

    Van Lent, Sarah; Creasy, Heather Huot; Myers, Garry S A; Vanrompay, Daisy

    2016-01-01

    Variation is a central trait of the polymorphic membrane protein (Pmp) family. The number of pmp coding sequences differs between Chlamydia species, but it is unknown whether the number of pmp coding sequences is constant within a Chlamydia species. The level of conservation of the Pmp proteins has previously only been determined for Chlamydia trachomatis. As different Pmp proteins might be indispensable for the pathogenesis of different Chlamydia species, this study investigated the conservation of Pmp proteins both within and across C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci. The pmp coding sequences were annotated in 16 C. trachomatis, 6 C. pneumoniae, 2 C. abortus, and 16 C. psittaci genomes. The number and organization of polymorphic membrane coding sequences differed within and across the analyzed Chlamydia species. The length of coding sequences of pmpA, pmpB, and pmpH was conserved among all analyzed genomes, while the length of pmpE/F and pmpG, and remarkably also of the subtype pmpD, differed among the analyzed genomes. PmpD, PmpA, PmpH, and PmpA were the most conserved Pmp in C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci, respectively. PmpB was the most conserved Pmp across the 4 analyzed Chlamydia species. © 2016 S. Karger AG, Basel.

  11. Presence and Expression of Microbial Genes Regulating Soil Nitrogen Dynamics Along the Tanana River Successional Sequence

    NASA Astrophysics Data System (ADS)

    Boone, R. D.; Rogers, S. L.

    2004-12-01

    We report on work to assess the functional gene sequences for soil microbiota that control nitrogen cycle pathways along the successional sequence (willow, alder, poplar, white spruce, black spruce) on the Tanana River floodplain, Interior Alaska. Microbial DNA and mRNA were extracted from soils (0-10 cm depth) for amoA (ammonium monooxygenase), nifH (nitrogenase reductase), napA (nitrate reductase), and nirS and nirK (nitrite reductase) genes. Gene presence was determined by amplification of a conserved sequence of each gene employing sequence specific oligonucleotide primers and Polymerase Chain Reaction (PCR). Expression of the genes was measured via nested reverse transcriptase PCR amplification of the extracted mRNA. Amplified PCR products were visualized on agarose electrophoresis gels. All five successional stages show evidence for the presence and expression of microbial genes that regulate N fixation (free-living), nitrification, and nitrate reduction. We detected (1) nifH, napA, and nirK presence and amoA expression (mRNA production) for all five successional stages and (2) nirS and amoA presence and nifH, nirK, and napA expression for early successional stages (willow, alder, poplar). The results highlight that the existing body of previous process-level work has not sufficiently considered the microbial potential for a nitrate economy and free-living N fixation along the complete floodplain successional sequence.

  12. Determination of Fundamental Properties of an M31 Globular Cluster from Main-Sequence Photometry

    NASA Astrophysics Data System (ADS)

    Ma, Jun; Wu, Zhenyu; Wang, Song; Fan, Zhou; Zhou, Xu; Wu, Jianghua; Jiang, Zhaoji; Chen, Jiansheng

    2010-10-01

    M31 globular cluster B379 is the first extragalactic cluster whose age was determined by main-sequence photometry. In the main-sequence photometric method, the age of a cluster is obtained by fitting its color-magnitude diagram (CMD) with stellar evolutionary models. However, different stellar evolutionary models use different parameters of stellar evolution, such as the range of stellar masses, the opacities and equations of state, and other recipes. So, it is interesting to check whether different stellar evolutionary models can give consistent results for the same cluster. Brown et al. constrained the age of B379 by comparing its CMD with isochrones of the 2006 VandenBerg models. Using SSP models of Bruzual & Charlot and its multiphotometry, Ma et al. independently determined the age of B379, which is in good agreement with the determination of Brown et al. The models of Bruzual & Charlot are calculated based on the Padova evolutionary tracks. It is necessary to check whether the age of B379 as determined based on the Padova evolutionary tracks is in agreement with the determination of Brown et al. In this article, we redetermine the age of B379 using isochrones of the Padova stellar evolutionary models. In addition, the metal abundance, the distance modulus, and the reddening value for B379 are reported. The results obtained are consistent with the previous determinations, which include the age obtained by Brown et al. This article thus confirms the consistency of the age scale of B379 between the Padova isochrones and the 2006 VandenBerg isochrones; i.e., the comparison between the results of Brown et al. and Ma et al. is meaningful. The values found for B379 in this article are: metallicity [M/H] = log(Z/Z⊙) = -0.325, age τ = 11.0 ± 1.5 Gyr, reddening E(B - V) = 0.08, and distance modulus (m - M)0 = 24.44 ± 0.10.
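
    As a worked illustration of the distance modulus quoted in the abstract above, the short Python sketch below applies the standard relation d = 10^((m - M)0 / 5 + 1) parsecs. The function name distance_parsecs is hypothetical and the conversion is not part of the cited article; only the value 24.44 ± 0.10 comes from the abstract.

```python
def distance_parsecs(distance_modulus: float) -> float:
    """Convert an extinction-corrected distance modulus (m - M)_0 to parsecs.

    Uses the standard definition (m - M)_0 = 5 * log10(d / 10 pc).
    """
    return 10.0 ** (distance_modulus / 5.0 + 1.0)


mu0 = 24.44      # distance modulus reported in the abstract
mu0_err = 0.10   # quoted uncertainty

d_kpc = distance_parsecs(mu0) / 1000.0
d_low_kpc = distance_parsecs(mu0 - mu0_err) / 1000.0
d_high_kpc = distance_parsecs(mu0 + mu0_err) / 1000.0

print(f"distance: {d_kpc:.0f} kpc (range {d_low_kpc:.0f} to {d_high_kpc:.0f} kpc)")
```

    Under this relation the quoted modulus corresponds to a distance of roughly 770 kpc, and the 0.10 mag uncertainty translates into an asymmetric range of a few tens of kpc on either side.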

  13. Sequence of the structural gene for granule-bound starch synthase of potato (Solanum tuberosum L.) and evidence for a single point deletion in the amf allele.

    PubMed

    van der Leij, F R; Visser, R G; Ponstein, A S; Jacobsen, E; Feenstra, W J

    1991-08-01

    The genomic sequence of the potato gene for starch granule-bound starch synthase (GBSS; "waxy protein") has been determined for the wild-type allele of a monoploid genotype from which an amylose-free (amf) mutant was derived, and for the mutant part of the amf allele. Comparison of the wild-type sequence with a cDNA sequence from the literature and a newly isolated cDNA revealed the presence of 13 introns, the first of which is located in the untranslated leader. The promoter contains a G-box-like sequence. The deduced amino acid sequence of the precursor of GBSS shows a high degree of identity with monocot waxy protein sequences in the region corresponding to the mature form of the enzyme. The transit peptide of 77 amino acids, required for routing of the precursor to the plastids, shows much less identity with the transit peptides of the other waxy preproteins, but resembles the hydropathic distributions of these peptides. Alignment of the amino acid sequences of the four mature starch synthases with the Escherichia coli glgA gene product revealed the presence of at least three conserved boxes; there is no homology with previously proposed starch-binding domains of other enzymes involved in starch metabolism. We report the use of chimeric constructs with wild-type and amf sequences to localize, via complementation experiments, the region of the amf allele in which the mutation resides. Direct sequencing of polymerase chain reaction products confirmed that the amf mutation is a deletion of a single AT basepair in the region coding for the transit peptide.(ABSTRACT TRUNCATED AT 250 WORDS)

  14. Breaking barriers and halting rupture: the 2016 Amatrice-Visso-Castelluccio earthquake sequence, central Italy

    NASA Astrophysics Data System (ADS)

    Gregory, L. C.; Walters, R. J.; Wedmore, L. N. J.; Craig, T. J.; McCaffrey, K. J. W.; Wilkinson, M. W.; Livio, F.; Michetti, A.; Goodall, H.; Li, Z.; Chen, J.; De Martini, P. M.

    2017-12-01

    In 2016 the Central Italian Apennines was struck by a sequence of normal faulting earthquakes that ruptured in three separate events on the 24th August (Mw 6.2), the 26th Oct (Mw 6.1), and the 30th Oct (Mw 6.6). We reveal the complex nature of the individual events and the time-evolution of the sequence using multiple datasets. We will present an overview of the results from field geology, satellite geodesy, GNSS (including low-cost short baseline installations), and terrestrial laser scanning (TLS). Sequences of earthquakes of mid to high magnitude 6 are common in historical and seismological records in Italy and other similar tectonic settings globally. Multi-fault rupture during these sequences can occur in seconds, as in the M 6.9 1980 Irpinia earthquake, or can span days, months, or years (e.g. the 1703 Norcia-L'Aquila sequence). It is critical to determine why the causative faults in the 2016 sequence did not rupture simultaneously, and how this relates to fault segmentation and structural barriers. This is the first sequence of this kind to be observed using modern geodetic techniques, and only with all of the datasets combined can we begin to understand how and why the sequence evolved in time and space. We show that earthquake rupture both broke through structural barriers that were thought to exist, but was also inhibited by a previously unknown structure. We will also discuss the logistical challenges in generating datasets on the time-evolving sequence, and show how rapid response and international collaboration within the Open EMERGEO Working Group was critical for gaining a complete picture of the ongoing activity.

  15. Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II

    PubMed Central

    Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter

    2017-01-01

    The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230

  16. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing

    PubMed Central

    Hykin, Sarah M.; Bi, Ke; McGuire, Jimmy A.

    2015-01-01

    For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles), attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens—particularly for use in phylogenetic analyses—has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp). We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. 
The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens available for genetic analysis. PMID:26505622

  17. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing.

    PubMed

    Hykin, Sarah M; Bi, Ke; McGuire, Jimmy A

    2015-01-01

    For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles), attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens-particularly for use in phylogenetic analyses-has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp). We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. 
The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens available for genetic analysis.

  18. Exome Sequence Analysis of 14 Families With High Myopia.

    PubMed

    Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L

    2017-04-01

    To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.

  19. Evolutionary relationships of lactate dehydrogenases (LDHs) from mammals, birds, an amphibian, fish, barley, and bacteria: LDH cDNA sequences from Xenopus, pig, and rat.

    PubMed Central

    Tsuji, S; Qureshi, M A; Hou, E W; Fitch, W M; Li, S S

    1994-01-01

    The nucleotide sequences of the cDNAs encoding LDH (EC 1.1.1.27) subunits LDH-A (muscle), LDH-B (liver), and LDH-C (oocyte) from Xenopus laevis, LDH-A (muscle) and LDH-B (heart) from pig, and LDH-B (heart) and LDH-C (testis) from rat were determined. These seven newly deduced amino acid sequences and 22 other published LDH sequences, and three unpublished fish LDH-A sequences kindly provided by G. N. Somero and D. A. Powers, were used to construct the most parsimonious phylogenetic tree of these 32 LDH subunits from mammals, birds, an amphibian, fish, barley, and bacteria. There have been at least six LDH gene duplications among the vertebrates. The Xenopus LDH-A, LDH-B, and LDH-C subunits are most closely related to each other and then are more closely related to vertebrate LDH-B than LDH-A. Three fish LDH-As, as well as a single LDH of lamprey, also seem to be more related to vertebrate LDH-B than to land vertebrate LDH-A. The mammalian LDH-C (testis) subunit appears to have diverged very early, prior to the divergence of vertebrate LDH-A and LDH-B subunits, as reported previously. PMID:7937776

  20. Increased complexity of circRNA expression during species evolution.

    PubMed

    Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li

    2017-08-03

    Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.

  1. A serendipitous survey of prediction algorithms for amyloidogenicity

    PubMed Central

    Roland, Bartholomew P.; Kodali, Ravindra; Mishra, Rakesh; Wetzel, Ronald

    2014-01-01

    SUMMARY The 17-amino acid N-terminal segment of the Huntingtin protein, httNT, grows into stable α-helix rich oligomeric aggregates when incubated under physiological conditions. We examined 15 scrambled sequence versions of an httNT peptide for their stabilities against aggregation in aqueous solution at low micromolar concentration and physiological conditions. Surprisingly, given their derivation from a sequence that readily assembles into highly stable α-helical aggregates that fail to convert into β-structure, we found that three of these scrambled peptides rapidly grow into amyloid-like fibrils, while two others also develop amyloid somewhat more slowly. The other 10 scrambled peptides do not detectably form any aggregates after 100 hrs incubation under these conditions. We then analyzed these sequences using four previously described algorithms for predicting the tendencies of peptides to grow into amyloid or other β-aggregates. We found that these algorithms – Zyggregator, Tango, Waltz and Zipper – varied greatly in the number of sequences predicted to be amyloidogenic and in their abilities to correctly identify the amyloid forming members of the scrambled peptide collection. The results are discussed in the context of a review of the sequence and structural factors currently thought to be important in determining amyloid formation kinetics and thermodynamics. PMID:23893755

  2. Antimicrobial activity predictors benchmarking analysis using shuffled and designed synthetic peptides.

    PubMed

    Porto, William F; Pires, Állan S; Franco, Octavio L

    2017-08-07

    Antimicrobial activity prediction tools aim to aid the discovery of novel antimicrobial peptide (AMP) sequences by means of machine learning methods. Such approaches have gained increasing importance in the generation of novel synthetic peptides by means of rational design techniques. This study focused on the predictive ability of such approaches to determine the antimicrobial activities of sequences that were previously characterized at the protein level by in vitro studies. Using four web servers and one standalone software, we evaluated 78 sequences generated by the so-called linguistic model, being 40 designed and 38 shuffled sequences, with ∼60 and ∼25% of identity to AMPs, respectively. The ab initio molecular modelling of such sequences indicated that the structure does not affect the predictions, as both sets present similar structures. Overall, the systems failed to predict shuffled versions of designed peptides, as they are identical in AMP composition, which results in accuracies below 30%. The prediction accuracy is negatively affected by the low specificity of all systems evaluated here, although they reached 100% sensitivity. Our results suggest that complementary approaches with high specificity, not necessarily high accuracy, should be developed to be used together with the current systems, overcoming their limitations. Copyright © 2017 Elsevier Ltd. All rights reserved.
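
    The relationship between sensitivity, specificity, and accuracy described in the abstract above can be made concrete with a small Python sketch. The confusion-matrix counts below are purely illustrative assumptions and are not data from the study; the function name classifier_metrics is likewise hypothetical. The sketch only shows how a predictor can reach 100% sensitivity while its accuracy stays low when specificity is poor.

```python
def classifier_metrics(tp: int, fp: int, tn: int, fn: int):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)            # fraction of true actives recovered
    specificity = tn / (tn + fp)            # fraction of inactives correctly rejected
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return sensitivity, specificity, accuracy


# Illustrative counts only (NOT from the study): every active peptide is
# predicted active, but almost every inactive peptide is predicted active too.
sens, spec, acc = classifier_metrics(tp=10, fp=38, tn=2, fn=0)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} accuracy={acc:.2f}")
```

    With these made-up counts the sensitivity is 1.00 while the accuracy is 0.24, the same qualitative pattern the abstract reports: a tool that labels nearly everything as antimicrobial never misses an active peptide but is of little use for ranking candidates.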

  3. Deep nirS amplicon sequencing of San Francisco Bay sediments enables prediction of geography and environmental conditions from denitrifying community composition.

    PubMed

    Lee, Jessica A; Francis, Christopher A

    2017-12-01

    Denitrification is a dominant nitrogen loss process in the sediments of San Francisco Bay. In this study, we sought to understand the ecology of denitrifying bacteria by using next-generation sequencing (NGS) to survey the diversity of a denitrification functional gene, nirS (encoding cytochrome cd1 nitrite reductase), along the salinity gradient of San Francisco Bay over the course of a year. We compared our dataset to a library of nirS sequences obtained previously from the same samples by standard PCR cloning and Sanger sequencing, and showed that both methods similarly demonstrated geography, salinity and, to a lesser extent, nitrogen, to be strong determinants of community composition. Furthermore, the depth afforded by NGS enabled novel techniques for measuring the association between environment and community composition. We used Random Forests modelling to demonstrate that the site and salinity of a sample could be predicted from its nirS sequences, and to identify indicator taxa associated with those environmental characteristics. This work contributes significantly to our understanding of the distribution and dynamics of denitrifying communities in San Francisco Bay, and provides valuable tools for the further study of this key N-cycling guild in all estuarine systems. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  4. Bacterial Diversity in Human Subgingival Plaque

    PubMed Central

    Paster, Bruce J.; Boches, Susan K.; Galvin, Jamie L.; Ericson, Rebecca E.; Lau, Carol N.; Levanos, Valerie A.; Sahasrabudhe, Ashish; Dewhirst, Floyd E.

    2001-01-01

    The purpose of this study was to determine the bacterial diversity in the human subgingival plaque by using culture-independent molecular methods as part of an ongoing effort to obtain full 16S rRNA sequences for all cultivable and not-yet-cultivated species of human oral bacteria. Subgingival plaque was analyzed from healthy subjects and subjects with refractory periodontitis, adult periodontitis, human immunodeficiency virus periodontitis, and acute necrotizing ulcerative gingivitis. 16S ribosomal DNA (rDNA) bacterial genes from DNA isolated from subgingival plaque samples were PCR amplified with all-bacterial or selective primers and cloned into Escherichia coli. The sequences of cloned 16S rDNA inserts were used to determine species identity or closest relatives by comparison with sequences of known species. A total of 2,522 clones were analyzed. Nearly complete sequences of approximately 1,500 bases were obtained for putative new species. About 60% of the clones fell into 132 known species, 70 of which were identified from multiple subjects. About 40% of the clones were novel phylotypes. Of the 215 novel phylotypes, 75 were identified from multiple subjects. Known putative periodontal pathogens such as Porphyromonas gingivalis, Bacteroides forsythus, and Treponema denticola were identified from multiple subjects, but typically as a minor component of the plaque as seen in cultivable studies. Several phylotypes fell into two recently described phyla previously associated with extreme natural environments, for which there are no cultivable species. A number of species or phylotypes were found only in subjects with disease, and a few were found only in healthy subjects. The organisms identified only from diseased sites deserve further study as potential pathogens. Based on the sequence data in this study, the predominant subgingival microbial community consisted of 347 species or phylotypes that fall into 9 bacterial phyla. 
Based on the 347 species seen in our sample of 2,522 clones, we estimate that there are 68 additional unseen species, for a total estimate of 415 species in the subgingival plaque. When organisms found on other oral surfaces such as the cheek, tongue, and teeth are added to this number, the best estimate of the total species diversity in the oral cavity is approximately 500 species, as previously proposed. PMID:11371542

  5. HIPPI: highly accurate protein family classification with ensembles of HMMs.

    PubMed

    Nguyen, Nam-Phuong; Nute, Michael; Mirarab, Siavash; Warnow, Tandy

    2016-11-11

    Given a new biological sequence, detecting membership in a known family is a basic step in many bioinformatics analyses, with applications to protein structure and function prediction and metagenomic taxon identification and abundance profiling, among others. Yet family identification of sequences that are distantly related to sequences in public databases or that are fragmentary remains one of the more difficult analytical problems in bioinformatics. We present a new technique for family identification called HIPPI (Hierarchical Profile Hidden Markov Models for Protein family Identification). HIPPI uses a novel technique to represent a multiple sequence alignment for a given protein family or superfamily by an ensemble of profile hidden Markov models computed using HMMER. An evaluation of HIPPI on the Pfam database shows that HIPPI has better overall precision and recall than blastp, HMMER, and pipelines based on HHsearch, and maintains good accuracy even for fragmentary query sequences and for protein families with low average pairwise sequence identity, both conditions where other methods degrade in accuracy. HIPPI provides accurate protein family identification and is robust to difficult model conditions. Our results, combined with observations from previous studies, show that ensembles of profile Hidden Markov models can better represent multiple sequence alignments than a single profile Hidden Markov model, and thus can improve downstream analyses for various bioinformatic tasks. Further research is needed to determine the best practices for building the ensemble of profile Hidden Markov models. HIPPI is available on GitHub at https://github.com/smirarab/sepp .

  6. Mitochondrial gene sequences alone or combined with ITS region sequences provide firm molecular criteria for the classification of Lecanicillium species.

    PubMed

    Kouvelis, Vassili N; Sialakouma, Aphrodite; Typas, Milton A

    2008-07-01

    The recent revision of Verticillium sect. Prostrata led to the introduction of the genus Lecanicillium, which comprises the majority of the entomopathogenic strains. Sixty-five strains previously classified as Verticillium lecanii or Verticillium sp. from different geographical regions and hosts were examined and their phylogenetic relationships were determined using sequences from three mitochondrial (mt) genes [the small rRNA subunit (rns), the NADH dehydrogenase subunits 1 (nad1) and 3 (nad3)] and the ITS region. In general, single gene phylogenetic trees differentiated and placed the strains examined in well-supported (by BS analysis) groups of L. lecanii, L. longisporum, L. muscarium, and L. nodulosum, although in some cases a few uncertainties still remained. nad1 was the most informative single gene in phylogenetic analyses and was also found to contain group I introns with putative open reading frames (ORFs) encoding GIY-YIG endonucleases. The combined use of mt gene sequences resolved taxonomic uncertainties arising from ITS analysis and, alone or in combination with ITS sequences, helped in placing uncharacterised Verticillium lecanii and Verticillium sp. firmly into Lecanicillium species. Combined gene data from all the mt genes and from all the mt genes and the ITS region together were very similar. Furthermore, a relaxed correlation with host specificity -- at least for Homoptera -- was indicated for the rns and the combined mt gene sequences. Thus, the usefulness of mt gene sequences as a convenient molecular tool in phylogenetic studies of entomopathogenic fungi was demonstrated.

  7. Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

    PubMed Central

    Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J

    2009-01-01

    Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. 
Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials and eutherians that may have predisposed these genomic regions to large-scale chromosomal instability. PMID:19630942

  8. Phylogeny of the ammonia-producing ruminal bacteria Peptostreptococcus anaerobius, Clostridium sticklandii, and Clostridium aminophilum sp. nov

    NASA Technical Reports Server (NTRS)

    Paster, B. J.; Russell, J. B.; Yang, C. M.; Chow, J. M.; Woese, C. R.; Tanner, R.

    1993-01-01

    In previous studies, gram-positive bacteria which grew rapidly with peptides or an amino acid as the sole energy source were isolated from bovine rumina. Three isolates, strains C, FT (T = type strain), and SR, were considered to be ecologically important since they produced up to 20-fold more ammonia than other ammonia-producing ruminal bacteria. On the basis of phenotypic criteria, the taxonomic position of these new isolates was uncertain. In this study, the 16S rRNA sequences of these isolates and related bacteria were determined to establish the phylogenetic positions of the organisms. The sequences of strains C, FT, and SR and reference strains of Peptostreptococcus anaerobius, Clostridium sticklandii, Clostridium coccoides, Clostridium aminovalericum, Acetomaculum ruminis, Clostridium leptum, Clostridium lituseburense, Clostridium acidiurici, and Clostridium barkeri were determined by using a modified Sanger dideoxy chain termination method. Strain C, a large coccus purported to belong to the genus Peptostreptococcus, was closely related to P. anaerobius, with a level of sequence similarity of 99.6%. Strain SR, a heat-resistant, short, rod-shaped organism, was closely related to C. sticklandii, with a level of sequence similarity of 99.9%. However, strain FT, a heat-resistant, pleomorphic, rod-shaped organism, was only distantly related to some clostridial species and P. anaerobius. On the basis of the sequence data, it was clear that strain FT warranted designation as a separate species. The closest known relative of strain FT was C. coccoides (level of similarity, only 90.6%). Additional strains that are phenotypically similar to strain FT were isolated in this study.(ABSTRACT TRUNCATED AT 250 WORDS).

  9. C-Terminal Region of EBNA-2 Determines the Superior Transforming Ability of Type 1 Epstein-Barr Virus by Enhanced Gene Regulation of LMP-1 and CXCR7

    PubMed Central

    Cancian, Laila; Bosshard, Rachel; Lucchesi, Walter; Karstegl, Claudio Elgueta; Farrell, Paul J.

    2011-01-01

    Type 1 Epstein-Barr virus (EBV) strains immortalize B lymphocytes in vitro much more efficiently than type 2 EBV, a difference previously mapped to the EBNA-2 locus. Here we demonstrate that the greater transforming activity of type 1 EBV correlates with a stronger and more rapid induction of the viral oncogene LMP-1 and the cell gene CXCR7 (which are both required for proliferation of EBV-LCLs) during infection of primary B cells with recombinant viruses. Surprisingly, although the major sequence differences between type 1 and type 2 EBNA-2 lie in N-terminal parts of the protein, the superior ability of type 1 EBNA-2 to induce proliferation of EBV-infected lymphoblasts is mostly determined by the C-terminus of EBNA-2. Substitution of the C-terminus of type 1 EBNA-2 into the type 2 protein is sufficient to confer a type 1 growth phenotype and type 1 expression levels of LMP-1 and CXCR7 in an EREB2.5 cell growth assay. Within this region, the RG, CR7 and TAD domains are the minimum type 1 sequences required. Sequencing the C-terminus of EBNA-2 from additional EBV isolates showed high sequence identity within type 1 isolates or within type 2 isolates, indicating that the functional differences mapped are typical of EBV type sequences. The results indicate that the C-terminus of EBNA-2 accounts for the greater ability of type 1 EBV to promote B cell proliferation, through mechanisms that include higher induction of genes (LMP-1 and CXCR7) required for proliferation and survival of EBV-LCLs. PMID:21857817

  10. ASSET: Analysis of Sequences of Synchronous Events in Massively Parallel Spike Trains

    PubMed Central

    Canova, Carlos; Denker, Michael; Gerstein, George; Helias, Moritz

    2016-01-01

    With the ability to observe the activity from large numbers of neurons simultaneously using modern recording technologies, the chance to identify sub-networks involved in coordinated processing increases. Sequences of synchronous spike events (SSEs) constitute one type of such coordinated spiking that propagates activity in a temporally precise manner. The synfire chain was proposed as one potential model for such network processing. Previous work introduced a method for visualization of SSEs in massively parallel spike trains, based on an intersection matrix that contains in each entry the degree of overlap of active neurons in two corresponding time bins. Repeated SSEs are reflected in the matrix as diagonal structures of high overlap values. The method as such, however, leaves the task of identifying these diagonal structures to visual inspection rather than to a quantitative analysis. Here we present ASSET (Analysis of Sequences of Synchronous EvenTs), an improved, fully automated method which determines diagonal structures in the intersection matrix by a robust mathematical procedure. The method consists of a sequence of steps that i) assess which entries in the matrix potentially belong to a diagonal structure, ii) cluster these entries into individual diagonal structures and iii) determine the neurons composing the associated SSEs. We employ parallel point processes generated by stochastic simulations as test data to demonstrate the performance of the method under a wide range of realistic scenarios, including different types of non-stationarity of the spiking activity and different correlation structures. Finally, the ability of the method to discover SSEs is demonstrated on complex data from large network simulations with embedded synfire chains. Thus, ASSET represents an effective and efficient tool to analyze massively parallel spike data for temporal sequences of synchronous activity. PMID:27420734
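
    The intersection matrix described in this abstract can be sketched as follows. This is a simplified illustration, not the ASSET implementation: the function names, the bin size, and the use of a raw overlap count as the "degree of overlap" are all assumptions, and the spike times are made up so that one sequence of synchronous events repeats.

```python
def bin_spikes(spike_trains, bin_size, t_stop):
    """Return, for each time bin, the set of neuron indices that fired in it."""
    n_bins = int(t_stop / bin_size)
    active = [set() for _ in range(n_bins)]
    for neuron, times in enumerate(spike_trains):
        for t in times:
            b = int(t / bin_size)
            if 0 <= b < n_bins:
                active[b].add(neuron)
    return active


def intersection_matrix(active):
    """Entry (i, j) counts the neurons active in both bin i and bin j."""
    n = len(active)
    return [[len(active[i] & active[j]) for j in range(n)] for i in range(n)]


# Hypothetical data: neurons 0 and 1 fire together, then neurons 2 and 3,
# and the same two-step sequence repeats four time units later.
spike_trains = [[0.5, 4.5], [0.5, 4.5], [1.5, 5.5], [1.5, 5.5]]
imat = intersection_matrix(bin_spikes(spike_trains, bin_size=1.0, t_stop=6.0))
for row in imat:
    print(row)
```

    In the printed matrix, the entries (0, 4) and (1, 5) are both 2 and lie on a line parallel to the main diagonal. This is the kind of diagonal structure that a repeated sequence of synchronous events produces.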

  11. Characterization of Encapsulated and Noncapsulated Haemophilus influenzae and Determination of Phylogenetic Relationships by Multilocus Sequence Typing

    PubMed Central

    Meats, Emma; Feil, Edward J.; Stringer, Suzanna; Cody, Alison J.; Goldstein, Richard; Kroll, J. Simon; Popovic, Tanja; Spratt, Brian G.

    2003-01-01

    A multilocus sequence typing (MLST) scheme has been developed for the unambiguous characterization of encapsulated and noncapsulated Haemophilus influenzae isolates. The sequences of internal fragments of seven housekeeping genes were determined for 131 isolates, comprising a diverse set of 104 serotype a, b, c, d, e, and f isolates and 27 noncapsulated isolates. Many of the encapsulated isolates had previously been characterized by multilocus enzyme electrophoresis (MLEE), and the validity of the MLST scheme was established by the very similar clustering of isolates obtained by these methods. Isolates of serotypes c, d, e, and f formed monophyletic groups on a dendrogram constructed from the differences in the allelic profiles of the isolates, whereas there were highly divergent lineages of both serotype a and b isolates. Noncapsulated isolates were distinct from encapsulated isolates and, with one exception, were within two highly divergent clusters. The relationships between the major lineages of encapsulated H. influenzae inferred from MLEE data could not be discerned on a dendrogram constructed from differences in the allelic profiles, but were apparent on a tree reconstructed from the concatenated nucleotide sequences. Recombination has not therefore completely eliminated phylogenetic signal, and in support of this, for encapsulated isolates, there was significant congruence between many of the trees reconstructed from the sequences of the seven individual loci. Congruence was less apparent for noncapsulated isolates, suggesting that the impact of recombination is greater among noncapsulated than encapsulated isolates. The H. influenzae MLST scheme is available at www.mlst.net, it allows any isolate to be compared with those in the MLST database, and (for encapsulated isolates) it assigns isolates to their phylogenetic lineage, via the Internet. PMID:12682154
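
    Comparing isolates by differences in their allelic profiles, the basis of the dendrogram mentioned in this abstract, can be sketched in a few lines. The isolate names and allele numbers below are hypothetical, and counting mismatched loci is a common convention assumed here rather than a detail confirmed by the abstract.

```python
def allelic_distance(profile_a, profile_b):
    """Number of loci at which two allelic profiles carry different alleles."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci")
    return sum(a != b for a, b in zip(profile_a, profile_b))


def distance_matrix(profiles):
    """Pairwise allelic distances for a dict of isolate name -> allelic profile."""
    names = sorted(profiles)
    return {(p, q): allelic_distance(profiles[p], profiles[q]) for p in names for q in names}


# Hypothetical allele numbers at seven housekeeping loci (not real MLST data).
profiles = {
    "isolate_1": (1, 1, 1, 1, 1, 1, 1),
    "isolate_2": (1, 1, 1, 1, 1, 1, 2),
    "isolate_3": (3, 4, 1, 5, 6, 7, 8),
}
dist = distance_matrix(profiles)
print(dist[("isolate_1", "isolate_2")], dist[("isolate_1", "isolate_3")])
```

    A distance of this kind records only whether alleles differ, not how many nucleotides differ. That may be one reason the deeper relationships between lineages were visible only on the tree built from concatenated nucleotide sequences.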

  12. Application of Metagenomic Sequencing to Food Safety: Detection of Shiga Toxin-Producing Escherichia coli on Fresh Bagged Spinach

    PubMed Central

    Leonard, Susan R.; Mammel, Mark K.; Lacher, David W.

    2015-01-01

    Culture-independent diagnostics reduce the reliance on traditional (and slower) culture-based methodologies. Here we capitalize on advances in next-generation sequencing (NGS) to apply this approach to food pathogen detection utilizing NGS as an analytical tool. In this study, spiking spinach with Shiga toxin-producing Escherichia coli (STEC) following an established FDA culture-based protocol was used in conjunction with shotgun metagenomic sequencing to determine the limits of detection, sensitivity, and specificity levels and to obtain information on the microbiology of the protocol. We show that an expected level of contamination (∼10 CFU/100 g) could be adequately detected (including key virulence determinants and strain-level specificity) within 8 h of enrichment at a sequencing depth of 10,000,000 reads. We also rationalize the relative benefit of static versus shaking culture conditions and the addition of selected antimicrobial agents, thereby validating the long-standing culture-based parameters behind such protocols. Moreover, the shotgun metagenomic approach was informative regarding the dynamics of microbial communities during the enrichment process, including initial surveys of the microbial loads associated with bagged spinach; the microbes found included key genera such as Pseudomonas, Pantoea, and Exiguobacterium. Collectively, our metagenomic study highlights and considers various parameters required for transitioning to such sequencing-based diagnostics for food safety and the potential to develop better enrichment processes in a high-throughput manner not previously possible. Future studies will investigate new species-specific DNA signature target regimens, rational design of medium components in concert with judicious use of additives, such as antibiotics, and alterations in the sample processing protocol to enhance detection. PMID:26386062

  13. High energy PIXE: A tool to characterize multi-layer thick samples

    NASA Astrophysics Data System (ADS)

    Subercaze, A.; Koumeir, C.; Métivier, V.; Servagent, N.; Guertin, A.; Haddad, F.

    2018-02-01

    High energy PIXE is a useful and non-destructive tool to characterize multi-layer thick samples such as cultural heritage objects. In a previous work, we demonstrated the possibility to perform quantitative analysis of simple multi-layer samples using high energy PIXE, without any assumption on their composition. In this work an in-depth study of the parameters involved in the method previously published is proposed. Its extension to more complex samples with a repeated layer is also presented. Experiments have been performed at the ARRONAX cyclotron using 68 MeV protons. The thicknesses and sequences of a multi-layer sample including two different layers of the same element have been determined. Performances and limits of this method are presented and discussed.

  14. Human Papillomavirus Type 6 and 11 Genetic Variants Found in 71 Oral and Anogenital Epithelial Samples from Australia

    PubMed Central

    Danielewski, Jennifer A.; Garland, Suzanne M.; McCloskey, Jenny; Hillman, Richard J.; Tabrizi, Sepehr N.

    2013-01-01

    Genetic variation of 49 human papillomavirus (HPV) 6 and 22 HPV11 isolates from recurrent respiratory papillomatosis (RRP) (n = 17), genital warts (n = 43), anal cancer (n = 6) and cervical neoplasia cells (n = 5), was determined by sequencing the long control region (LCR) and the E6 and E7 genes. Comparative analysis of genetic variability was examined to determine whether different disease states resulting from HPV6 or HPV11 infection cluster into distinct variant groups. Sequence variation analysis of HPV6 revealed that isolates cluster into variants within previously described HPV6 lineages, with the majority (65%) clustering to HPV6 sublineage B1 across the three genomic regions examined. Overall 72 HPV6 and 25 HPV11 single nucleotide variations, insertions and deletions were observed within samples examined. In addition, missense alterations were observed in the E6/E7 genes for 6 HPV6 and 5 HPV11 variants. No nucleotide variations were identified in any isolates at the four E2 binding sites for HPV6 or HPV11, nor were any isolates found to be identical to the HPV6 lineage A or HPV11 sublineage A1 reference genomes. Overall, a high degree of sequence conservation was observed between isolates across each of the regions investigated for both HPV6 and HPV11. Genetic variants identified a slight association with HPV6 and anogenital lesions (p = 0.04). This study provides important information on the genetic diversity of circulating HPV 6 and HPV11 variants within the Australian population and supports the observation that the majority of HPV6 isolates cluster to the HPV6 sublineage B1 with anogenital lesions demonstrating an association with this sublineage (p = 0.02). 
Comparative analysis of Australian isolates for both HPV6 and HPV11 to those from other geographical regions based on the LCR revealed a high degree of sequence similarity throughout the world, confirming previous observations that there are no geographically specific variants for these HPV types. PMID:23691108

  15. Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius)

    PubMed Central

    Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

    2015-01-01

    Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002

  16. Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius).

    PubMed

    Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

    2015-06-01

    Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs.

  17. An improved model for whole genome phylogenetic analysis by Fourier transform.

    PubMed

    Yin, Changchuan; Yau, Stephen S-T

    2015-10-07

    DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements, as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into the frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in the distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. 
The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
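
    The pipeline outlined in this abstract (2D numerical mapping, DFT power spectrum, scaling of spectra to a common length, Euclidean distance) can be sketched as below. This is a rough illustration under stated assumptions, not the authors' method: the nucleotide-to-complex mapping is hypothetical, plain linear interpolation stands in for the paper's "even scaling" algorithm, and a naive O(n²) DFT is used so that the sketch needs only the standard library.

```python
import cmath
import math

# Hypothetical 2D mapping: each nucleotide becomes a point in the complex plane.
MAPPING = {"A": complex(0, -1), "T": complex(0, 1), "C": complex(-1, 0), "G": complex(1, 0)}


def power_spectrum(seq):
    """Power spectrum |X(k)|^2 of the mapped sequence, via a naive DFT."""
    x = [MAPPING[base] for base in seq.upper()]  # raises KeyError on other symbols
    n = len(x)
    return [
        abs(sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))) ** 2
        for k in range(n)
    ]


def stretch(spectrum, m):
    """Linearly interpolate a spectrum of length n onto m points (m >= n)."""
    n = len(spectrum)
    if m == n:
        return list(spectrum)
    out = []
    for k in range(m):
        q = k * (n - 1) / (m - 1)  # fractional position in the shorter spectrum
        lo = int(math.floor(q))
        hi = min(lo + 1, n - 1)
        out.append(spectrum[lo] + (q - lo) * (spectrum[hi] - spectrum[lo]))
    return out


def dft_distance(seq_a, seq_b):
    """Euclidean distance between power spectra scaled to the longer length."""
    pa, pb = power_spectrum(seq_a), power_spectrum(seq_b)
    m = max(len(pa), len(pb))
    pa, pb = stretch(pa, m), stretch(pb, m)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(pa, pb)))


print(dft_distance("ACGTACGT", "ACGT"), dft_distance("AAAA", "ACGT"))
```

    The resulting pairwise distances could then be fed to any hierarchical clustering routine to build a tree, which is the use the abstract describes.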

  18. Architecture of a Species: Phylogenomics of Staphylococcus aureus.

    PubMed

    Planet, Paul J; Narechania, Apurva; Chen, Liang; Mathema, Barun; Boundy, Sam; Archer, Gordon; Kreiswirth, Barry

    2017-02-01

    A deluge of whole-genome sequencing has begun to give insights into the patterns and processes of microbial evolution, but genome sequences have accrued in a haphazard manner, with biased sampling of natural variation that is driven largely by medical and epidemiological priorities. For instance, there is a strong bias for sequencing epidemic lineages of methicillin-resistant Staphylococcus aureus (MRSA) over sensitive isolates (methicillin-sensitive S. aureus: MSSA). As more diverse genomes are sequenced the emerging picture is of a highly subdivided species with a handful of relatively clonal groups (complexes) that, at any given moment, dominate in particular geographical regions. The establishment of hegemony of particular clones appears to be a dynamic process of successive waves of replacement of the previously dominant clone. Here we review the phylogenomic structure of a diverse range of S. aureus, including both MRSA and MSSA. We consider the utility of the concept of the 'core' genome and the impact of recombination and horizontal transfer. We argue that whole-genome surveillance of S. aureus populations could lead to better forecasting of antibiotic resistance and virulence of emerging clones, and a better understanding of the elusive biological factors that determine repeated strain replacement. Copyright © 2016. Published by Elsevier Ltd.

  19. Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

    PubMed

    Bergmann, D J; Arciero, D M; Hooper, A B

    1994-06-01

    The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.

  20. Experience of targeted Usher exome sequencing as a clinical test

    PubMed Central

    Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

    2014-01-01

    We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes, which is laborious to achieve with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered a pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627

  1. The global prevalence of HFE and non-HFE hemochromatosis estimated from analysis of next-generation sequencing data.

    PubMed

    Wallace, Daniel F; Subramaniam, V Nathan

    2016-06-01

    The prevalence of HFE-related hereditary hemochromatosis (HH) among European populations has been well studied. There are no prevalence data for atypical forms of HH caused by mutations in HFE2, HAMP, TFR2, or SLC40A1. The purpose of this study was to estimate the population prevalence of these non-HFE forms of HH. A list of HH pathogenic variants in publicly available next-generation sequence (NGS) databases was compiled and allele frequencies were determined. Of 161 variants previously associated with HH, 43 were represented among the NGS data sets; an additional 40 unreported functional variants also were identified. The predicted prevalence of HFE HH and the p.Cys282Tyr mutation closely matched previous estimates from similar populations. Of the non-HFE forms of iron overload, TFR2-, HFE2-, and HAMP-related forms are predicted to be rare, with pathogenic allele frequencies in the range of 0.00007 to 0.0005. Significantly, SLC40A1 variants that have been previously associated with autosomal-dominant ferroportin disease were identified in several populations (pathogenic allele frequency 0.0004), being most prevalent among Africans. We have, for the first time, estimated the population prevalence of non-HFE HH. This methodology could be applied to estimate the population prevalence of a wide variety of genetic disorders. Genet Med 18(6), 618-626.
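
    As a rough illustration of how an allele frequency translates into an expected genotype prevalence, the sketch below applies Hardy-Weinberg proportions to the pathogenic allele frequencies quoted in the abstract. The function names are hypothetical, and Hardy-Weinberg equilibrium with full penetrance is an assumption made here; the abstract does not state that this is the authors' exact calculation.

```python
def recessive_prevalence(q: float) -> float:
    """Expected frequency of homozygotes (q^2), assuming Hardy-Weinberg equilibrium."""
    return q * q


def dominant_prevalence(q: float) -> float:
    """Expected frequency of individuals carrying at least one copy: 1 - (1 - q)^2."""
    return 1.0 - (1.0 - q) ** 2


# Pathogenic allele frequencies quoted in the abstract.
for q in (0.00007, 0.0005):
    print(f"recessive, q={q}: about 1 in {1 / recessive_prevalence(q):,.0f}")
print(f"dominant, q=0.0004: about 1 in {1 / dominant_prevalence(0.0004):,.0f}")
```

    For rare alleles the dominant case is close to 2q, which is why a dominant disorder can be far more prevalent than a recessive one at a similar allele frequency.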

  2. Discovery and validation of information theory-based transcription factor and cofactor binding site motifs.

    PubMed

    Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K

    2017-03-17

    Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
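
    To make the notion of an information-theory-based weight matrix more concrete, the sketch below computes the information content of a single alignment column as 2 bits minus the Shannon entropy of its base frequencies, a common definition for DNA motifs. It is not the authors' recursive, thresholded entropy-minimization pipeline; the function name and the example counts are hypothetical, and the small-sample correction is omitted for brevity.

```python
import math


def column_information(counts):
    """Information (bits) of one DNA alignment column: 2 - Shannon entropy.

    `counts` maps each base to the number of aligned sites showing it.
    Assumes equiprobable background bases and no small-sample correction.
    """
    total = sum(counts.values())
    entropy = 0.0
    for n in counts.values():
        if n > 0:
            p = n / total
            entropy -= p * math.log2(p)
    return 2.0 - entropy


# A fully conserved position versus a completely uninformative one.
conserved = column_information({"A": 20, "C": 0, "G": 0, "T": 0})
uniform = column_information({"A": 5, "C": 5, "G": 5, "T": 5})
print(conserved, uniform)
```

    Summing this quantity over the columns of a motif gives a total information content that can be compared between candidate motifs.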

  3. The Faintest WISE Debris Disks: Enhanced Methods for Detection and Verification

    NASA Astrophysics Data System (ADS)

    Patel, Rahul I.; Metchev, Stanimir A.; Heinze, Aren; Trollo, Joseph

    2017-02-01

    In an earlier study, we reported nearly 100 previously unknown dusty debris disks around Hipparcos main-sequence stars within 75 pc by selecting stars with excesses in individual WISE colors. Here, we further scrutinize the Hipparcos 75 pc sample to (1) gain sensitivity to previously undetected, fainter mid-IR excesses and (2) remove spurious excesses contaminated by previously unidentified blended sources. We improve on our previous method by adopting a more accurate measure of the confidence threshold for excess detection and by adding an optimally weighted color average that incorporates all shorter-wavelength WISE photometry, rather than using only individual WISE colors. The latter is equivalent to spectral energy distribution fitting, but only over WISE bandpasses. In addition, we leverage the higher-resolution WISE images available through the unWISE.me image service to identify contaminated WISE excesses based on photocenter offsets among the W3- and W4-band images. Altogether, we identify 19 previously unreported candidate debris disks. Combined with the results from our earlier study, we have found a total of 107 new debris disks around 75 pc Hipparcos main-sequence stars using precisely calibrated WISE photometry. This expands the 75 pc debris disk sample by 22% around Hipparcos main-sequence stars and by 20% overall (including non-main-sequence and non-Hipparcos stars).
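
    The abstract does not give the weighting formula, so the following is only a sketch of one standard choice: an inverse-variance weighted mean, which minimizes the variance of the combined estimate when the individual measurements are independent. The function name and the numbers are hypothetical, and independence is an assumption that may not hold for WISE colors that share a band.

```python
def weighted_excess(excesses, sigmas):
    """Inverse-variance weighted mean of several color excesses.

    Returns the combined excess and its 1-sigma uncertainty,
    assuming independent Gaussian errors on each input.
    """
    weights = [1.0 / s ** 2 for s in sigmas]
    weight_sum = sum(weights)
    mean = sum(w * e for w, e in zip(weights, excesses)) / weight_sum
    sigma = weight_sum ** -0.5
    return mean, sigma


# Hypothetical excesses (mag) in three colors and their uncertainties.
mean, sigma = weighted_excess([0.12, 0.10, 0.15], [0.05, 0.04, 0.08])
print(f"combined excess = {mean:.3f} +/- {sigma:.3f} mag, S/N = {mean / sigma:.1f}")
```

    Because the combined uncertainty is smaller than any single input uncertainty, an excess too faint to pass a threshold in one color can become significant in the average.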

  4. The Faintest WISE Debris Disks: Enhanced Methods for Detection and Verification

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patel, Rahul I.; Metchev, Stanimir A.; Trollo, Joseph

    In an earlier study, we reported nearly 100 previously unknown dusty debris disks around Hipparcos main-sequence stars within 75 pc by selecting stars with excesses in individual WISE colors. Here, we further scrutinize the Hipparcos 75 pc sample to (1) gain sensitivity to previously undetected, fainter mid-IR excesses and (2) remove spurious excesses contaminated by previously unidentified blended sources. We improve on our previous method by adopting a more accurate measure of the confidence threshold for excess detection and by adding an optimally weighted color average that incorporates all shorter-wavelength WISE photometry, rather than using only individual WISE colors. The latter is equivalent to spectral energy distribution fitting, but only over WISE bandpasses. In addition, we leverage the higher-resolution WISE images available through the unWISE.me image service to identify contaminated WISE excesses based on photocenter offsets among the W3- and W4-band images. Altogether, we identify 19 previously unreported candidate debris disks. Combined with the results from our earlier study, we have found a total of 107 new debris disks around 75 pc Hipparcos main-sequence stars using precisely calibrated WISE photometry. This expands the 75 pc debris disk sample by 22% around Hipparcos main-sequence stars and by 20% overall (including non-main-sequence and non-Hipparcos stars).

  5. SEROPREVALENCE AND MOLECULAR CHARACTERIZATION OF FERLAVIRUS IN CAPTIVE VIPERS OF COSTA RICA.

    PubMed

    Solis, Cristina; Arguedas, Randall; Baldi, Mario; Piche, Martha; Jimenez, Carlos

    2017-06-01

    Ferlaviruses (FV, previously referred to as ophidian paramyxoviruses, OPMV), are enveloped viruses with a negative-strand RNA genome, affecting snakes in captivity worldwide. Infection is characterized by respiratory and nervous clinical signs and carries high mortality rates, but no specific treatment or vaccine is currently available. Costa Rica has 16 species of vipers, found in captivity in collections essential for antivenom production, reintroduction, and public education. FV circulation in these populations was previously unknown, and the risk of introducing the viruses into naïve collections or free-ranging populations exists if the virus's presence is confirmed. The objective of this study was to determine seroprevalence and FV shedding in 150 samples from captive vipers in nine collections across Costa Rica. A hemagglutination inhibition (HI) assay was performed to determine the antibody titer against two Ferlavirus strains, Bush viper virus (BV) and Neotropical virus (NT), and reverse-transcriptase polymerase chain reaction (RT-PCR) and sequencing to determine virus secretion in cloacal swabs. Ferlavirus strains were replicated in Vero cells, and chicken anti-FV polyclonal antibodies were produced and used as a positive control serum for the HI. Results demonstrate that seroprevalence of anti-FV antibodies in viper serum was 26.6% (n = 40) for the BV strain and 30% (n = 45) for the NT strain in the population tested. Furthermore, molecular characterization of FV group A was possible by sequencing the virus recovered from three cloacal swabs, demonstrating circulation of FV in one collection. This study demonstrates for the first time serological evidence of FV exposure and infection in vipers in captivity in Costa Rica, and suggests cross reactivity between antibodies against both strains. Appropriate biosafety measures could prevent the spread of FV between and within collections of reptiles in the country.

  6. New strategy to address DNA-methyl transferase activity in ovarian cancer cell cultures by monitoring the formation of 5-methylcytosine using HPLC-UV.

    PubMed

    Iglesias González, T; Blanco-González, E; Montes-Bayón, M

    2016-08-15

    Methylation of mammalian genomic DNA is catalyzed by DNA methyltransferases (DNMTs). Aberrant expression and activity of these enzymes has been reported to play an important role in the initiation and progression of tumors and their response to chemotherapy. Therefore, there is great interest in developing strategies to detect human DNMT activity. We propose a simple, antibody-free, label-free and non-radioactive analytical strategy in which methyltransferase activity is measured through the determination of the 5-methylcytosine (5mC) content in DNA by a previously developed chromatographic method (HPLC-UV). For this aim, a correlation between the enzyme activity and the concentration of 5mC obtained by HPLC-UV is first obtained under optimized conditions using both un-methylated and hemi-methylated DNA substrates and the prokaryotic methyltransferase M.SssI as a model enzyme. The evaluation of the methylation yield in un-methylated known sequences (a 623bp PCR-amplicon) turned out to be quantitative (110%) in experiments conducted in vitro. Methylation of hemi-methylated and low-methylated sequences could also be detected with the proposed approach. The application of the methodology to the determination of DNMT activity in nuclear extracts from human ovarian cancer cells has revealed the presence of matrix effects (also confirmed by standard additions) that hampered quantitative enzyme recovery. The obtained results showed the high importance of adequate sample clean-up steps. Copyright © 2016. Published by Elsevier B.V.

  7. Use of a Designed Peptide Array To Infer Dissociation Trends for Nontryptic Peptides in Quadrupole Ion Trap and Quadrupole Time-of-Flight Mass Spectrometry

    DOE PAGES

    Gaucher, Sara P.; Morrow, Jeffrey A.; Faulon, Jean-Loup M.

    2007-09-14

    Observed peptide gas-phase fragmentation patterns are a complex function of many variables. In order to systematically probe this phenomenon, an array of 40 peptides was synthesized for study. The array of sequences was designed to hold certain variables (peptide length) constant and randomize or balance others (peptide amino acid distribution and position). A high-quality tandem mass spectrometry (MS/MS) data set was acquired for each peptide for all observed charge states on multiple MS instruments, quadrupole-time-of-flight and quadrupole ion trap. The data were analyzed as a function of total charge state and number of mobile protons. Previously known dissociation trends were observed, validating our approach. In addition, the general influence of basic amino acids on dissociation could be determined because, in contrast to the more widely studied tryptic peptides, the amino acids H, K, and R were positionally distributed. Interestingly, our results suggest that cleavage at all basic amino acids is suppressed when a mobile proton is available. Cleavage at H becomes favored only under conditions where a partially mobile proton is present, a caveat to the previously reported trend of enhanced cleavage at H. In conclusion, all acquired data were used as a benchmark to determine how well these sequences would have been identified in a database search using a common algorithm, Mascot.

  8. Structure-Function, Stability, and Chemical Modification of the Cyanobacterial Cytochrome b6f Complex from Nostoc sp. PCC 7120*

    PubMed Central

    Baniulis, Danas; Yamashita, Eiki; Whitelegge, Julian P.; Zatsman, Anna I.; Hendrich, Michael P.; Hasan, S. Saif; Ryan, Christopher M.; Cramer, William A.

    2009-01-01

    The crystal structure of the cyanobacterial cytochrome b6f complex has previously been solved to 3.0-Å resolution using the thermophilic Mastigocladus laminosus whose genome has not been sequenced. Several unicellular cyanobacteria, whose genomes have been sequenced and are tractable for mutagenesis, do not yield b6f complex in an intact dimeric state with significant electron transport activity. The genome of Nostoc sp. PCC 7120 has been sequenced and is closer phylogenetically to M. laminosus than are unicellular cyanobacteria. The amino acid sequences of the large core subunits and four small peripheral subunits of Nostoc are 88 and 80% identical to those in the M. laminosus b6f complex. Purified b6f complex from Nostoc has a stable dimeric structure, eight subunits with masses similar to those of M. laminosus, and comparable electron transport activity. The crystal structure of the native b6f complex, determined to a resolution of 3.0Å (PDB id: 2ZT9), is almost identical to that of M. laminosus. Two unique aspects of the Nostoc complex are: (i) a dominant conformation of heme bp that is rotated 180° about the α- and γ-meso carbon axis relative to the orientation in the M. laminosus complex and (ii) acetylation of the Rieske iron-sulfur protein (PetC) at the N terminus, a post-translational modification unprecedented in cyanobacterial membrane and electron transport proteins, and in polypeptides of cytochrome bc complexes from any source. The high spin electronic character of the unique heme cn is similar to that previously found in the b6f complex from other sources. PMID:19189962

  9. Novel chromosomal rearrangements and break points at the t(6;9) in salivary adenoid cystic carcinoma: association with MYB-NFIB chimeric fusion, MYB expression, and clinical outcome.

    PubMed

    Mitani, Yoshitsugu; Rao, Pulivarthi H; Futreal, P Andrew; Roberts, Dianna B; Stephens, Philip J; Zhao, Yi-Jue; Zhang, Li; Mitani, Mutsumi; Weber, Randal S; Lippman, Scott M; Caulin, Carlos; El-Naggar, Adel K

    2011-11-15

    To investigate the molecular genetic heterogeneity associated with the t(6;9) in adenoid cystic carcinoma (ACC) and correlate the findings with patient clinical outcome. Multimolecular and genetic techniques complemented with massive pair-ended sequencing and single-nucleotide polymorphism array analyses were used on tumor specimens from 30 new and 52 previously analyzed fusion transcript-negative ACCs by reverse transcriptase PCR (RT-PCR). MYB mRNA expression level was determined by quantitative RT-PCR. The results of 102 tumors (30 new and 72 previously reported cases) were correlated with the clinicopathologic factors and patients' survival. The FISH analysis showed 34 of 82 (41.5%) fusion-positive tumors and molecular techniques identified fusion transcripts in 21 of the 82 (25.6%) tumors. Detailed FISH analysis of 11 of the 15 tumors with gene fusion without transcript formation showed translocation of NFIB sequences to proximal or distal sites of the MYB gene. Massive pair-end sequencing of a subset of tumors confirmed the proximal translocation to an NFIB sequence and led to the identification of a new fusion gene (NFIB-AIG1) in one of the tumors. Overall, the MYB-NFIB gene fusion rate by FISH was 52.9% whereas the incidence of fusion transcript formation was 38.2%. Significant statistical association between the 5' MYB transcript expression and patient survival was found. We conclude that: (i) t(6;9) results in complex genetic and molecular alterations in ACC, (ii) MYB-NFIB gene fusion may not always be associated with chimeric transcript formation, (iii) noncanonical MYB-NFIB gene fusions occur in a subset of tumors, (iv) high MYB expression correlates with worse patient survival.

  10. Novel Chromosomal Rearrangements and breakpoints at the t(6;9) in Salivary Adenoid Cystic Carcinoma: association with MYB-NFIB chimeric fusion, MYB expression, and clinical outcome

    PubMed Central

    Mitani, Yoshitsugu; Rao, Pulivarthi H.; Futreal, P. Andrew; Roberts, Dianna B.; Stephens, Philip J.; Zhao, Yi-Jue; Zhang, Li; Mitani, Mutsumi; Weber, Randal S.; Lippman, Scott M.; Caulin, Carlos; El-Naggar, Adel K.

    2011-01-01

    Objective To investigate the molecular-genetic heterogeneity associated with the t(6;9) in adenoid cystic carcinoma (ACC) and correlate the findings with patient clinical outcome. Experimental Design Multi-molecular and genetic techniques complemented with massive pair-ended sequencing and SNP array analyses were used on tumor specimens from 30 new and 52 previously RT-PCR analyzed fusion transcript negative ACCs. MYB mRNA expression level was determined by quantitative RT-PCR. The results of 102 tumors (30 new and 72 previously reported cases) were correlated with the clinicopathologic factors and patients’ survival. Results The FISH analysis showed 34/82 (41.5%) fusion positive tumors and molecular techniques identified fusion transcripts in 21 of the 82 (25.6%) tumors. Detailed FISH analysis of 11 of the 15 tumors with gene fusion without transcript formation showed translocation of NFIB sequences to proximal or distal sites of the MYB gene. Massive pair-end sequencing of a subset of tumors confirmed the proximal translocation to an NFIB sequence and led to the identification of a new fusion gene (NFIB-AIG1) in one of the tumors. Overall, the MYB-NFIB gene fusion rate by FISH was 52.9% while the incidence of fusion transcript formation was 38.2%. Significant statistical association between the 5′ MYB transcript expression and patient survival was found. Conclusions We conclude that: 1) t(6;9) results in complex genetic and molecular alterations in ACC, 2) MYB-NFIB gene fusion may not always be associated with chimeric transcript formation, 3) non-canonical MYB-NFIB gene fusions occur in a subset of tumors, 4) high MYB expression correlates with worse patient survival. PMID:21976542

  11. Structure-Function, Stability, and Chemical Modification of the Cyanobacterial Cytochrome b6f Complex from Nostoc sp. PCC 7120

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baniulis, Danas; Yamashita, Eiki; Whitelegge, Julian P.

    2009-06-08

    The crystal structure of the cyanobacterial cytochrome b6f complex has previously been solved to 3.0-Å resolution using the thermophilic Mastigocladus laminosus whose genome has not been sequenced. Several unicellular cyanobacteria, whose genomes have been sequenced and are tractable for mutagenesis, do not yield b6f complex in an intact dimeric state with significant electron transport activity. The genome of Nostoc sp. PCC 7120 has been sequenced and is closer phylogenetically to M. laminosus than are unicellular cyanobacteria. The amino acid sequences of the large core subunits and four small peripheral subunits of Nostoc are 88 and 80% identical to those in the M. laminosus b6f complex. Purified b6f complex from Nostoc has a stable dimeric structure, eight subunits with masses similar to those of M. laminosus, and comparable electron transport activity. The crystal structure of the native b6f complex, determined to a resolution of 3.0Å (PDB id: 2ZT9), is almost identical to that of M. laminosus. Two unique aspects of the Nostoc complex are: (i) a dominant conformation of heme bp that is rotated 180° about the α- and γ-meso carbon axis relative to the orientation in the M. laminosus complex and (ii) acetylation of the Rieske iron-sulfur protein (PetC) at the N terminus, a post-translational modification unprecedented in cyanobacterial membrane and electron transport proteins, and in polypeptides of cytochrome bc complexes from any source. The high spin electronic character of the unique heme cn is similar to that previously found in the b6f complex from other sources.

  12. Information and redundancy in the burial folding code of globular proteins within a wide range of shapes and sizes.

    PubMed

    Ferreira, Diogo C; van der Linden, Marx G; de Oliveira, Leandro C; Onuchic, José N; de Araújo, Antônio F Pereira

    2016-04-01

    Recent ab initio folding simulations for a limited number of small proteins have corroborated a previous suggestion that atomic burial information obtainable from sequence could be sufficient for tertiary structure determination when combined with sequence-independent geometrical constraints. Here, we use simulations parameterized by native burials to investigate the required amount of information in a diverse set of globular proteins comprising different structural classes and a wide size range. Burial information is provided by a potential term pushing each atom towards one among a small number L of equiprobable concentric layers. An upper bound for the required information is provided by the minimal number of layers L(min) still compatible with correct folding behavior. We obtain L(min) between 3 and 5 for seven small to medium proteins with 50 ≤ Nr ≤ 110 residues while for a larger protein with Nr = 141 we find that L ≥ 6 is required to maintain native stability. We additionally estimate the usable redundancy for a given L ≥ L(min) from the burial entropy associated with the largest folding-compatible fraction of "superfluous" atoms, for which the burial term can be turned off or target layers can be chosen randomly. The estimated redundancy for small proteins with L = 4 is close to 0.8. Our results are consistent with the above-average quality of burial predictions used in previous simulations and indicate that the fraction of approachable proteins could increase significantly with even a mild, plausible improvement in sequence-dependent burial prediction or in sequence-independent constraints that augment the detectable redundancy during simulations. © 2016 Wiley Periodicals, Inc.

  13. Using regional moment tensors to constrain the kinematics and stress evolution of the 2010–2013 Canterbury earthquake sequence, South Island, New Zealand

    USGS Publications Warehouse

    Herman, Matthew W.; Herrmann, Robert B.; Benz, Harley M.; Furlong, Kevin P.

    2014-01-01

    On September 3, 2010, a MW 7.0 (U.S. Geological Survey moment magnitude) earthquake ruptured across the Canterbury Plains in South Island, New Zealand. Since then, New Zealand GNS Science has recorded over 10,000 aftershocks ML 2.0 and larger, including three destructive ~ MW 6.0 earthquakes near Christchurch. We treat the Canterbury earthquake sequence as an intraplate earthquake sequence, and compare its kinematics to an Andersonian model for fault slip in a uniform stress field. We determined moment magnitudes and double couple solutions for 150 earthquakes having MW 3.7 and larger through the use of a waveform inversion technique using data from broadband seismic stations on South Island, New Zealand. The majority (126) of these double couple solutions have strike-slip focal mechanisms, with right-lateral slip on ENE fault planes or equivalently left-lateral slip on SSE fault planes. The remaining focal mechanisms indicate reverse faulting, except for two normal faulting events. The strike-slip segments have compatible orientations for slip in a stress field with a horizontal σ1 oriented ~ N115°E, and horizontal σ3. The preference for right lateral strike-slip earthquakes suggests that these structures are inherited from previous stages of deformation. Reverse slip is interpreted to have occurred on previously existing structures in regions with an absence of existing structures optimally oriented for strike-slip deformation. Despite the variations in slip direction and faulting style, most aftershocks had nearly the same P-axis orientation, consistent with the regional σ1. There is no evidence for significant changes in these stress orientations throughout the Canterbury earthquake sequence.

  14. Revisiting Robustness and Evolvability: Evolution in Weighted Genotype Spaces

    PubMed Central

    Partha, Raghavendran; Raman, Karthik

    2014-01-01

    Robustness and evolvability are highly intertwined properties of biological systems. The relationship between these properties determines how biological systems are able to withstand mutations and show variation in response to them. Computational studies have explored the relationship between these two properties using neutral networks of RNA sequences (genotype) and their secondary structures (phenotype) as a model system. However, these studies have assumed every mutation to a sequence to be equally likely; the differences in the likelihood of the occurrence of various mutations, and the consequence of probabilistic nature of the mutations in such a system have previously been ignored. Associating probabilities to mutations essentially results in the weighting of genotype space. We here perform a comparative analysis of weighted and unweighted neutral networks of RNA sequences, and subsequently explore the relationship between robustness and evolvability. We show that assuming an equal likelihood for all mutations (as in an unweighted network) underestimates robustness and overestimates evolvability of a system. In spite of discarding this assumption, we observe that a negative correlation between sequence (genotype) robustness and sequence evolvability persists, and also that structure (phenotype) robustness promotes structure evolvability, as observed in earlier studies using unweighted networks. We also study the effects of base composition bias on robustness and evolvability. Particularly, we explore the association between robustness and evolvability in a sequence space that is AU-rich – sequences with an AU content of 80% or higher, compared to a normal (unbiased) sequence space. We find that evolvability of both sequences and structures in an AU-rich space is lower than in the normal space, and robustness is higher. We also observe that AU-rich populations evolving on neutral networks of phenotypes can access less phenotypic variation compared to normal populations evolving on neutral networks. PMID:25390641

  15. AutoGen Version 5.0

    NASA Technical Reports Server (NTRS)

    Gladden, Roy E.; Khanampornpan, Teerapat; Fisher, Forest W.

    2010-01-01

    Version 5.0 of the AutoGen software has been released. Previous versions, variously denoted Autogen and autogen, were reported in two articles: Automated Sequence Generation Process and Software (NPO-30746), Software Tech Briefs (Special Supplement to NASA Tech Briefs), September 2007, page 30, and Autogen Version 2.0 (NPO-41501), NASA Tech Briefs, Vol. 31, No. 10 (October 2007), page 58. To recapitulate: AutoGen (now signifying automatic sequence generation) automates the generation of sequences of commands in a standard format for uplink to spacecraft. AutoGen requires fewer workers than are needed for older manual sequence-generation processes, and greatly reduces sequence-generation times. The sequences are embodied in spacecraft activity sequence files (SASFs). AutoGen automates generation of SASFs by use of another previously reported program called APGEN. AutoGen encodes knowledge of different mission phases and of how the resultant commands must differ among the phases. AutoGen also provides means for customizing sequences through use of configuration files. The approach followed in developing AutoGen has involved encoding the behaviors of a system into a model and encoding algorithms for context-sensitive customizations of the modeled behaviors. This version of AutoGen addressed the MRO (Mars Reconnaissance Orbiter) primary science phase (PSP) mission phase. On previous Mars missions this phase has more commonly been referred to as mapping phase. This version addressed the unique aspects of sequencing orbital operations and specifically the mission specific adaptation of orbital operations for MRO. This version also includes capabilities for MRO's role in Mars relay support for UHF relay communications with the MER rovers and the Phoenix lander.

  16. The complete CDS of the prion protein (PRNP) gene of African lion (Panthera leo).

    PubMed

    Maj, Andrzej; Spellman, Garth M; Sarver, Shane K

    2008-04-01

    We provide the complete PRNP CDS sequence for the African lion, which is different from the previously published sequence and more similar to other carnivore sequences. The newly obtained prion protein sequence differs from the domestic cat sequence at three amino acid positions and contains only four octapeptide repeats. We recommend that this sequence be used as the reference sequence for future studies of the PRNP gene for this species.

  17. Prevalence and Genetic Basis of Antimicrobial Resistance in Non-aureus Staphylococci Isolated from Canadian Dairy Herds

    PubMed Central

    Nobrega, Diego B.; Naushad, Sohail; Naqvi, S. Ali; Condas, Larissa A. Z.; Saini, Vineet; Kastelic, John P.; Luby, Christopher; De Buck, Jeroen; Barkema, Herman W.

    2018-01-01

    Emergence and spread of antimicrobial resistance is a major concern for the dairy industry worldwide. Objectives were to determine: (1) phenotypic and genotypic prevalence of drug-specific resistance for 25 species of non-aureus staphylococci, and (2) associations between presence of resistance determinants and antimicrobial resistance. Broth micro-dilution was used to determine resistance profiles for 1,702 isolates from 89 dairy herds. Additionally, 405 isolates were sequenced to screen for resistance determinants. Antimicrobial resistance was clearly species-dependent. Resistance to quinupristin/dalfopristin was common in Staphylococcus gallinarum (prevalence of 98%), whereas S. cohnii and S. arlettae were frequently resistant to erythromycin (prevalence of 63 and 100%, respectively). Prevalence of resistance was 10% against β-lactams and tetracyclines. In contrast, resistance to antimicrobials critically important for human medicine, namely vancomycin, fluoroquinolones, linezolid and daptomycin, was uncommon (< 1%). Genes encoding multidrug-resistance efflux pumps and resistance-associated residues in deduced amino acid sequences of the folP gene were the most frequent mechanisms of resistance, regardless of species. The estimated prevalence of the mecA gene was 17% for S. epidermidis. Several genes, including blaZ, mecA, fexA, erm, mphC, msrA, and tet were associated with drug-specific resistance, whereas other elements were not. There were specific residues in gyrB for all isolates of species intrinsically resistant to novobiocin. This study provided consensus protein sequences of key elements previously associated with resistance for 25 species of non-aureus staphylococci from dairy cattle. These results will be important for evaluating effects of interventions in antimicrobial use in Canadian dairy herds. PMID:29503642

  18. Prediction of Ras-effector interactions using position energy matrices.

    PubMed

    Kiel, Christina; Serrano, Luis

    2007-09-01

    One of the more challenging problems in biology is to determine the cellular protein interaction network. Progress has been made to predict protein-protein interactions based on structural information, assuming that structurally similar proteins interact in a similar way. In a previous publication, we have determined a genome-wide Ras-effector interaction network based on homology models, with a high accuracy of predicting binding and non-binding domains. However, for a prediction on a genome-wide scale, homology modelling is a time-consuming process. Therefore, we here successfully developed a faster method using position energy matrices, where based on different Ras-effector X-ray template structures, all amino acids in the effector binding domain are sequentially mutated to all other amino acid residues and the effect on binding energy is calculated. Those pre-calculated matrices can then be used to score for binding any Ras or effector sequences. Based on position energy matrices, the sequences of putative Ras-binding domains can be scanned quickly to calculate an energy sum value. By calibrating energy sum values using quantitative experimental binding data, thresholds can be defined and thus non-binding domains can be excluded quickly. Sequences which have energy sum values above this threshold are considered to be potential binding domains, and could be further analysed using homology modelling. This prediction method could be applied to other protein families sharing conserved interaction types, in order to determine large-scale cellular protein interaction networks quickly. Thus, it could have an important impact on future in silico structural genomics approaches, in particular with regard to increasing structural proteomics efforts, aiming to determine all possible domain folds and interaction types. All matrices are deposited in the ADAN database (http://adan-embl.ibmc.umh.es/). Supplementary data are available at Bioinformatics online.
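
    The scoring step described in this abstract (summing pre-calculated per-position energies and comparing the sum with a calibrated threshold) can be illustrated with a minimal Python sketch. The function names, the toy three-position matrix, and the threshold value are hypothetical and chosen only for illustration; they are not taken from the ADAN database or from the authors' software.

```python
def energy_sum(sequence, matrix):
    """Sum the per-position energies of a sequence.

    matrix[i][aa] is assumed to hold the pre-calculated energy
    contribution of residue `aa` at position `i` (hypothetical layout).
    """
    if len(sequence) != len(matrix):
        raise ValueError("sequence length must match the number of matrix positions")
    return sum(matrix[i][aa] for i, aa in enumerate(sequence))


def is_potential_binder(sequence, matrix, threshold):
    """Follow the abstract's rule: energy sums above the threshold are kept."""
    return energy_sum(sequence, matrix) > threshold


# Toy matrix with three positions and two residues (illustrative values only).
toy_matrix = [
    {"A": 1.0, "K": -0.5},
    {"A": 0.0, "K": 2.0},
    {"A": 0.5, "K": 0.5},
]
toy_threshold = 1.0  # assumed; a real threshold would be calibrated on binding data

print(energy_sum("AKA", toy_matrix))
print(is_potential_binder("KAK", toy_matrix, toy_threshold))
```

    Because each lookup is a table access, scanning a sequence costs time proportional to its length, which is consistent with the abstract's point that matrix scoring is much faster than building a homology model for every candidate.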

  19. Identification of viral and non-viral reverse transcribing elements in pineapple (Ananas comosus), including members of two new badnavirus species.

    PubMed

    Gambley, C F; Geering, A D W; Steele, V; Thomas, J E

    2008-01-01

    A previously published partial sequence of pineapple bacilliform virus was shown to be from a retrotransposon (family Metaviridae) and not from a badnavirus as previously thought. Two newly discovered sequence groups isolated from pineapple were associated with bacilliform virions and were transmitted by mealybugs. Phylogenetic analyses indicated that they were members of new badnavirus species. A third caulimovirid sequence was also amplified from pineapple, but available evidence suggests that this DNA is not encapsidated, but more likely derived from an endogenous virus.

  20. Score distributions of gapped multiple sequence alignments down to the low-probability tail

    NASA Astrophysics Data System (ADS)

    Fieth, Pascal; Hartmann, Alexander K.

    2016-08-01

    Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain results for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10^-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
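
    As a point of reference for the Gumbel distribution mentioned in this abstract, the following Python sketch evaluates the Gumbel tail probability P(S >= s) for a given location and scale. It is not the rare-event sampling algorithm used by the authors; the function name and the parameter values are assumptions made for illustration.

```python
import math


def gumbel_tail(score, location, scale):
    """Return P(S >= score) for a Gumbel distribution.

    The CDF is exp(-exp(-z)) with z = (score - location) / scale, so the
    tail is 1 - exp(-exp(-z)). Using expm1 keeps precision when the tail
    is far below machine epsilon.
    """
    z = (score - location) / scale
    return -math.expm1(-math.exp(-z))


# Illustrative parameters (assumed): location 0, scale 1.
print(gumbel_tail(0.0, 0.0, 1.0))    # tail at the location parameter
print(gumbel_tail(368.4, 0.0, 1.0))  # deep tail, on the order of 10^-160

# The naive form loses the deep tail entirely in double precision.
print(1.0 - math.exp(-math.exp(-368.4)))
```

    The last line shows why the deep tail cannot be read off a naive formula or a simple-sampling histogram: probabilities near 10^-160 need either careful numerics for the analytic form or dedicated rare-event methods for the empirical distribution.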

  1. In situ molecular identification of the Influenza A (H1N1) 2009 Neuraminidase in patients with severe and fatal infections during a pandemic in Mexico City

    PubMed Central

    2013-01-01

    Background In April 2009, public health surveillance detected an increased number of influenza-like illnesses in Mexico City’s hospitals. The etiological agent was subsequently determined to be a spread of a worldwide novel influenza A (H1N1) triple reassortant. The purpose of the present study was to demonstrate that molecular detection of pandemic influenza A (H1N1) 2009 strains is possible in archival material such as paraffin-embedded lung samples. Methods In order to detect A (H1N1) virus sequences in archived biological samples, eight paraffin-embedded lung samples from patients who died of pneumonia and respiratory failure were tested for influenza A (H1N1) Neuraminidase (NA) RNA using in situ RT-PCR. Results We detected NA transcripts in 100% of the previously diagnosed A (H1N1)-positive samples as a cytoplasmic signal. No expression was detected by in situ RT-PCR in two Influenza-like Illness A (H1N1)-negative patients using standard protocols nor in a non-related cervical cell line. In situ relative transcription levels correlated with those obtained when in vitro RT-PCR assays were performed. Partial sequences of the NA gene from A (H1N1)-positive patients were obtained by the in situ RT-PCR-sequencing method. Sequence analysis showed 98% similarity with influenza viruses reported previously in other places. Conclusions We have successfully amplified specific influenza A (H1N1) NA sequences using stored clinical material; results suggest that this strategy could be useful when clinical RNA samples are quantity limited, or when poor quality is obtained. Here, we provide a very sensitive method that specifically detects the neuraminidase viral RNA in lung samples from patients who died from pneumonia caused by Influenza A (H1N1) outbreak in Mexico City. PMID:23327529

  2. A rapid NGS strategy for comprehensive molecular diagnosis of Birt-Hogg-Dubé syndrome in patients with primary spontaneous pneumothorax.

    PubMed

    Zhang, Xinxin; Ma, Dehua; Zou, Wei; Ding, Yibing; Zhu, Chengchu; Min, Haiyan; Zhang, Bin; Wang, Wei; Chen, Baofu; Ye, Minhua; Cai, Minghui; Pan, Yanqing; Cao, Lei; Wan, Yueming; Jin, Yu; Gao, Qian; Yi, Long

    2016-05-27

    Primary spontaneous pneumothorax (PSP) or pulmonary cysts is one of the manifestations of Birt-Hogg-Dubé syndrome (BHDS) that is caused by heterozygous mutations in FLCN gene. Most of the mutations are SNVs and small indels, and approximately 10% of the mutations are large intragenic deletions and duplications. These molecular findings are generally obtained by disparate methods including Sanger sequencing and Multiple Ligation-dependent Probe Amplification in the clinical laboratory. In addition, as a genetically heterogeneous disorder, PSP may be caused by mutations in multiple genes including FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 genes. For differential diagnosis, these genes should also be screened, which makes the diagnostic procedure more time-consuming and labor-intensive. Forty PSP patients were divided into 2 groups. Nineteen patients with different pathogenic mutations of FLCN previously identified by conventional Sanger sequencing and MLPA were included in test group, 21 random PSP patients without any genetic screening were included in blinded sample group. Seven PSP genes including FLCN, FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 were designed and enriched by Haloplex system, sequenced on a Miseq platform and analyzed in the 40 patients to evaluate the performance of the targeted-NGS method. We demonstrated that the full spectrum of genes associated with pneumothorax including FLCN gene mutations can be identified simultaneously in multiplexed sequence data. Notably, by our in-house copy number analysis of the sequence data, we could not only detect intragenic deletions, but also determine approximate deletion junctions simultaneously. NGS based Haloplex target enrichment technology proved to be a rapid and cost-effective screening strategy for the comprehensive molecular diagnosis of BHDS in PSP patients, as it can replace Sanger sequencing and MLPA by simultaneously detecting exonic and intronic SNVs, small indels, large intragenic deletions and determining deletion junctions in PSP-related genes.

  3. Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

    PubMed

    Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

    1991-11-20

    A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.

  4. Discovery of novel plant interaction determinants from the genomes of 163 root nodule bacteria

    DOE PAGES

    Seshadri, Rekha; Reeve, Wayne G.; Ardley, Julie K.; ...

    2015-11-20

    Root nodule bacteria (RNB) or “rhizobia” are a type of plant growth promoting bacteria, typified by their ability to fix nitrogen for their plant host, fixing nearly 65% of the nitrogen currently utilized in sustainable agricultural production of legume crops and pastures. In this study, we sequenced the genomes of 110 RNB from diverse hosts and biogeographical regions, and undertook a global exploration of all available RNB genera with the aim of identifying novel genetic determinants of symbiotic association and plant growth promotion. Specifically, we performed a subtractive comparative analysis with non-RNB genomes, employed relevant transcriptomic data, and leveraged phylogenetic distribution patterns and sequence signatures based on known precepts of symbiotic and host-microbe interactions. A total of 184 protein families were delineated, including known factors for nodulation and nitrogen fixation, and candidates with previously unexplored functions, for which a role in host-interaction, -regulation, biocontrol, and more, could be posited. Lastly, these analyses expand our knowledge of the RNB purview and provide novel targets for strain improvement in the ultimate quest to enhance plant productivity and agricultural sustainability.

  5. Searching for Partners of Cool Senior Citizens

    NASA Astrophysics Data System (ADS)

    Jao, Wei-Chun; Henry, T. J.

    2012-01-01

    Mass is one of the most fundamental parameters in stellar astronomy. In order to measure dynamical masses, one needs to find nearby binary systems that can be resolved and monitored, ideally with orbital periods that completely wrap in a reasonable amount of time. Many surveys have been made of nearby main sequence dwarfs, and their mass-luminosity relation is well established. As part of our Cool Subdwarf Investigations (CSI) program, we are searching for subdwarf binaries of spectral types K and M within 60 parsecs to measure their multiplicity rate and to reveal binaries appropriate for mass determinations. Here we present results of our CSI work using HST's Fine Guidance Sensors. When combined with previous CSI work and results in the literature, we find the multiplicity rate of subdwarfs, 21%, to be surprisingly low compared to that of similar main sequence K and M stars, 37%. This work has several implications, including that the star formation and/or evolution history of subdwarfs is different than for dwarfs, and that ideal systems for subdwarf mass determinations are difficult to find. This work is supported by HST grant GO-11943.

  6. Determination of Membrane-Insertion Free Energies by Molecular Dynamics Simulations

    PubMed Central

    Gumbart, James; Roux, Benoît

    2012-01-01

    The accurate prediction of membrane-insertion probability for arbitrary protein sequences is a critical challenge to identifying membrane proteins and determining their folded structures. Although algorithms based on sequence statistics have had moderate success, a complete understanding of the energetic factors that drive the insertion of membrane proteins is essential to thoroughly meeting this challenge. In the last few years, numerous attempts to define a free-energy scale for amino-acid insertion have been made, yet disagreement between most experimental and theoretical scales persists. However, for a recently resolved water-to-bilayer scale, it is found that molecular dynamics simulations that carefully mimic the conditions of the experiment can reproduce experimental free energies, even when using the same force field as previous computational studies that were cited as evidence of this disagreement. Therefore, it is suggested that experimental and simulation-based scales can both be accurate and that discrepancies stem from disparities in the microscopic processes being considered rather than methodological errors. Furthermore, these disparities make the development of a single universally applicable membrane-insertion free energy scale difficult. PMID:22385850

  7. Protein Chaperones Q8ZP25_SALTY from Salmonella Typhimurium and HYAE_ECOLI from Escherichia coli Exhibit Thioredoxin-like Structures Despite Lack of Canonical Thioredoxin Active Site Sequence Motif

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Parish, D.; Benach, J; Liu, G

    2008-01-01

    The structure of the 142-residue protein Q8ZP25_SALTY encoded in the genome of Salmonella typhimurium LT2 was determined independently by NMR and X-ray crystallography, and the structure of the 140-residue protein HYAE_ECOLI encoded in the genome of Escherichia coli was determined by NMR. The two proteins belong to Pfam (Finn et al. 34:D247-D251, 2006) PF07449, which currently comprises 50 members, and belongs itself to the 'thioredoxin-like clan'. However, protein HYAE_ECOLI and the other proteins of Pfam PF07449 do not contain the canonical Cys-X-X-Cys active site sequence motif of thioredoxin. Protein HYAE_ECOLI was previously classified as a (NiFe) hydrogenase-1 specific chaperone interacting with the twin-arginine translocation (Tat) signal peptide. The structures presented here exhibit the expected thioredoxin-like fold and support the view that members of Pfam family PF07449 specifically interact with Tat signal peptides.

  8. Chromosomal localization and partial genomic structure of the human peroxisome proliferator activated receptor-gamma (hPPAR gamma) gene.

    PubMed

    Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R

    1997-04-28

    We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.

  9. Pressure for Pattern-Specific Intertypic Recombination between Sabin Polioviruses: Evolutionary Implications.

    PubMed

    Korotkova, Ekaterina; Laassri, Majid; Zagorodnyaya, Tatiana; Petrovskaya, Svetlana; Rodionova, Elvira; Cherkasova, Elena; Gmyl, Anatoly; Ivanova, Olga E; Eremeeva, Tatyana P; Lipskaya, Galina Y; Agol, Vadim I; Chumakov, Konstantin

    2017-11-22

    Complete genomic sequences of a non-redundant set of 70 recombinants between three serotypes of attenuated Sabin polioviruses as well as location (based on partial sequencing) of crossover sites of 28 additional recombinants were determined and compared with the previously published data. It is demonstrated that the genomes of Sabin viruses contain distinct strain-specific segments that are eliminated by recombination. The presumed low fitness of these segments could be linked to mutations acquired upon derivation of the vaccine strains and/or may have been present in wild-type parents of Sabin viruses. These "weak" segments contribute to the propensity of these viruses to recombine with each other and with other enteroviruses as well as determine the choice of crossover sites. The knowledge of location of such segments opens additional possibilities for the design of more genetically stable and/or more attenuated variants, i.e., candidates for new oral polio vaccines. The results also suggest that the genome of wild polioviruses, and, by generalization, of other RNA viruses, may harbor hidden low-fitness segments that can be readily eliminated only by recombination.

  10. Correlation of genetic variability with safety of mumps vaccine Urabe AM9 strain.

    PubMed

    Amexis, G; Fineschi, N; Chumakov, K

    2001-08-15

    The Urabe AM9 strain of mumps vaccine live is known for its genetic instability and some vaccines derived from this strain were withdrawn from the market due to an excessive number of vaccine-associated parotitis and meningitis cases. To identify the molecular basis of this instability, we determined complete nucleotide sequences of several stocks of the Urabe strain used for vaccine production by different manufacturers and of two clinical isolates from cases of vaccine-associated meningitis. In contrast to previously published studies relating the Lys335 --> Glu mutation in the viral HN gene with neurovirulence of mumps virus, we could not confirm any association of this mutation with the safety of mumps vaccine. Each of the three vaccine stocks studied had its own characteristic profile of mutations that was identified by cDNA sequencing and quantitated by mutant analysis by PCR and restriction enzyme cleavage. Determination of the mutational profile of mumps vaccine lots could allow vaccine manufacturers to characterize seed viruses and monitor the consistency of vaccine production to prevent emergence of virulent revertants.

  11. Large scale genomic analysis shows no evidence for pathogen adaptation between the blood and cerebrospinal fluid niches during bacterial meningitis

    PubMed Central

    Lees, John A.; Kremer, Philip H. C.; Manso, Ana S.; Croucher, Nicholas J.; Ferwerda, Bart; Serón, Mercedes Valls; Oggioni, Marco R.; Parkhill, Julian; Brouwer, Matthijs C.; van der Ende, Arie; van de Beek, Diederik

    2017-01-01

    Recent studies have provided evidence for rapid pathogen genome diversification, some of which could potentially affect the course of disease. We have previously described such variation seen between isolates infecting the blood and cerebrospinal fluid (CSF) of a single patient during a case of bacterial meningitis. Here, we performed whole-genome sequencing of paired isolates from the blood and CSF of 869 meningitis patients to determine whether such variation frequently occurs between these two niches in cases of bacterial meningitis. Using a combination of reference-free variant calling approaches, we show that no genetic adaptation occurs in either invaded niche during bacterial meningitis for two major pathogen species, Streptococcus pneumoniae and Neisseria meningitidis. This study therefore shows that the bacteria capable of causing meningitis are already able to do this upon entering the blood, and no further sequence change is necessary to cross the blood–brain barrier. Our findings place the focus back on bacterial evolution between nasopharyngeal carriage and invasion, or diversity of the host, as likely mechanisms for determining invasiveness. PMID:28348877

  12. Dolabra nepheliae on rambutan and lychee represents a novel lineage of phytopathogenic Eurotiomycetes

    PubMed Central

    Schoch, Conrad L.; Farr, David F.; Nishijima, Kate; Keith, Lisa; Goenaga, Ricardo

    2010-01-01

    Rambutan (Nephelium lappaceum) and lychee (Litchi chinensis) are tropical trees in the Sapindaceae that produce delicious edible fruits and are increasingly cultivated in tropical regions. These trees are afflicted with a stem canker disease associated with the ascomycete Dolabra nepheliae. Previously known from Asia and Australia, this fungus was recently reported from Hawaii and Puerto Rico. The sexual and asexual states of Dolabra nepheliae are redescribed and illustrated. In addition, the ITS and large subunit of the nuclear ribosomal DNA plus fragments from the genes RPB2, TEF1, and the mitochondrial small ribosomal subunit were sequenced for three isolates of D. nepheliae and compared with other sequences of ascomycetes. It was determined that D. nepheliae represents a new lineage within the Eurotiomycetes allied with Phaeomoniella chlamydospora, the causal agent of Petri grapevine decline. PMID:20802819

  13. Dolabra nepheliae on rambutan and lychee represents a novel lineage of phytopathogenic Eurotiomycetes.

    PubMed

    Rossman, Amy Y; Schoch, Conrad L; Farr, David F; Nishijima, Kate; Keith, Lisa; Goenaga, Ricardo

    2010-07-01

    Rambutan (Nephelium lappaceum) and lychee (Litchi chinensis) are tropical trees in the Sapindaceae that produce delicious edible fruits and are increasingly cultivated in tropical regions. These trees are afflicted with a stem canker disease associated with the ascomycete Dolabra nepheliae. Previously known from Asia and Australia, this fungus was recently reported from Hawaii and Puerto Rico. The sexual and asexual states of Dolabra nepheliae are redescribed and illustrated. In addition, the ITS and large subunit of the nuclear ribosomal DNA plus fragments from the genes RPB2, TEF1, and the mitochondrial small ribosomal subunit were sequenced for three isolates of D. nepheliae and compared with other sequences of ascomycetes. It was determined that D. nepheliae represents a new lineage within the Eurotiomycetes allied with Phaeomoniella chlamydospora, the causal agent of Petri grapevine decline.

  14. Mutations altering the cleavage specificity of a homing endonuclease

    PubMed Central

    Seligman, Lenny M.; Chisholm, Karen M.; Chevalier, Brett S.; Chadsey, Meggen S.; Edwards, Samuel T.; Savage, Jeremiah H.; Veillet, Adeline L.

    2002-01-01

    The homing endonuclease I-CreI recognizes and cleaves a particular 22 bp DNA sequence. The crystal structure of I-CreI bound to homing site DNA has previously been determined, leading to a number of predictions about specific protein–DNA contacts. We test these predictions by analyzing a set of endonuclease mutants and a complementary set of homing site mutants. We find evidence that all structurally predicted I-CreI/DNA contacts contribute to DNA recognition and show that these contacts differ greatly in terms of their relative importance. We also describe the isolation of a collection of altered specificity I-CreI derivatives. The in vitro DNA-binding and cleavage properties of two such endonucleases demonstrate that our genetic approach is effective in identifying homing endonucleases that recognize and cleave novel target sequences. PMID:12202772

  15. An improved procedure, involving mass spectrometry, for N-terminal amino acid sequence determination of proteins which are N alpha-blocked.

    PubMed Central

    Rose, K; Kocher, H P; Blumberg, B M; Kolakofsky, D

    1984-01-01

    A modification to a previously described procedure [Gray & del Valle (1970) Biochemistry 9, 2134-2137; Rose, Simona & Offord (1983) Biochem. J. 215, 261-272] for mass-spectral identification of the N-terminal regions of proteins is shown to be useful in cases where the N-terminus is blocked. Three proteins were studied: vesicular-stomatitis-virus N protein, Sendai-virus NP protein, and a rabbit immunoglobulin lambda-light chain. These proteins, found to be blocked at the N-terminus with either the acetyl group or a pyroglutamic acid residue, had all failed to yield to attempted Edman degradation, in one case even after attempted enzymic removal of the pyroglutamic acid residue. The N-terminal regions of all three proteins were sequenced by using the new procedure. PMID:6421284

  16. Gene organization and alternative splicing of human prohormone convertase PC8.

    PubMed Central

    Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T

    1998-01-01

    The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811

  17. Structural test of the parameterized-backbone method for protein design.

    PubMed

    Plecs, Joseph J; Harbury, Pehr B; Kim, Peter S; Alber, Tom

    2004-09-03

    Designing new protein folds requires a method for simultaneously optimizing the conformation of the backbone and the side-chains. One approach to this problem is the use of a parameterized backbone, which allows the systematic exploration of families of structures. We report the crystal structure of RH3, a right-handed, three-helix coiled coil that was designed using a parameterized backbone and detailed modeling of core packing. This crystal structure was determined using another rationally designed feature, a metal-binding site that permitted experimental phasing of the X-ray data. RH3 adopted the intended fold, which has not been observed previously in biological proteins. Unanticipated structural asymmetry in the trimer was a principal source of variation within the RH3 structure. The sequence of RH3 differs from that of a previously characterized right-handed tetramer, RH4, at only one position in each 11 amino acid sequence repeat. This close similarity indicates that the design method is sensitive to the core packing interactions that specify the protein structure. Comparison of the structures of RH3 and RH4 indicates that both steric overlap and cavity formation provide strong driving forces for oligomer specificity.

  18. Characterization of NIST human mitochondrial DNA SRM-2392 and SRM-2392-I standard reference materials by next generation sequencing.

    PubMed

    Riman, Sarah; Kiesler, Kevin M; Borsuk, Lisa A; Vallone, Peter M

    2017-07-01

    Standard Reference Materials SRM 2392 and 2392-I are intended to provide quality control when amplifying and sequencing human mitochondrial genome sequences. The National Institute of Standards and Technology (NIST) offers these SRMs to laboratories performing DNA-based forensic human identification, molecular diagnosis of mitochondrial diseases, mutation detection, evolutionary anthropology, and genetic genealogy. The entire mtGenome (∼16569bp) of SRM 2392 and 2392-I have previously been characterized at NIST by Sanger sequencing. Herein, we used the sensitivity, specificity, and accuracy offered by next generation sequencing (NGS) to: (1) re-sequence the certified values of the SRM 2392 and 2392-I; (2) confirm Sanger data with a high coverage new sequencing technology; (3) detect lower level heteroplasmies (<20%); and thus (4) support mitochondrial sequencing communities in the adoption of NGS methods. To obtain a consensus sequence for the SRMs as well as identify and control any bias, sequencing was performed using two NGS platforms and data was analyzed using different bioinformatics pipelines. Our results confirm five low level heteroplasmy sites that were not previously observed with Sanger sequencing: three sites in the GM09947A template in SRM 2392 and two sites in the HL-60 template in SRM 2392-I. Copyright © 2017 Elsevier B.V. All rights reserved.
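
    The notion of a low-level heteroplasmy (<20%) in this abstract can be made concrete with a small Python sketch that computes the minor allele fraction at one position from per-base read counts. The function names, the example counts, and the 2% lower reporting bound are hypothetical; the abstract does not state the thresholds or pipelines actually used at NIST.

```python
def minor_allele_fraction(base_counts):
    """Return the fraction of reads supporting the second most common base.

    base_counts maps a base ("A", "C", "G", "T") to its read count at
    a single position (hypothetical input layout).
    """
    total = sum(base_counts.values())
    if total == 0:
        raise ValueError("no reads cover this position")
    ordered = sorted(base_counts.values(), reverse=True)
    second = ordered[1] if len(ordered) > 1 else 0
    return second / total


def is_low_level_heteroplasmy(base_counts, min_fraction=0.02, max_fraction=0.20):
    """Flag positions whose minor allele fraction lies in [min, max).

    The 0.20 upper bound follows the abstract; the 0.02 lower bound is
    an assumed noise floor for illustration only.
    """
    fraction = minor_allele_fraction(base_counts)
    return min_fraction <= fraction < max_fraction


print(minor_allele_fraction({"A": 900, "G": 100}))
print(is_low_level_heteroplasmy({"A": 995, "G": 5}))
```

    High read depth matters here because the minor allele fraction is a ratio of counts: the deeper the coverage, the smaller the fraction that can be distinguished from sequencing error.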

  19. Observation of quantum criticality with ultracold atoms in optical lattices

    NASA Astrophysics Data System (ADS)

    Zhang, Xibo

    As biological problems become more complex and data grow at a rate much faster than that of computer hardware, new and faster algorithms are required. This dissertation investigates computational problems arising in two fields, comparative genomics and epigenomics, and employs a variety of computational techniques to address them. One fundamental question in the study of chromosome evolution is whether rearrangement breakpoints occur at random positions or along certain hotspots. We investigate the breakpoint reuse phenomenon and present analyses that support the more recently proposed fragile breakage model as opposed to the conventional random breakage models for chromosome evolution. The identification of syntenic regions between chromosomes forms the basis for studies of genome architectures, comparative genomics, and evolutionary genomics. Previous synteny block reconstruction algorithms could not be scaled to the large number of mammalian genomes being sequenced; neither did they address the issue of generating non-overlapping synteny blocks suitable for analyzing rearrangements and the evolutionary history of large-scale duplications prevalent in plant genomes. We present a new unified synteny block generation algorithm based on the A-Bruijn graph framework that overcomes these shortcomings. In epigenome sequencing, a sample may contain a mixture of epigenomes, and there is a need to resolve the distinct methylation patterns from the mixture. Many sequencing applications, such as haplotype inference for diploid or polyploid genomes and metagenomic sequencing, share a similar objective: to infer a set of distinct assemblies from reads that are sequenced from a heterogeneous sample and subsequently aligned to a reference genome. We model the problem from both a combinatorial and a statistical angle. First, we describe a theoretical framework. A linear-time algorithm is then given to resolve a minimum number of assemblies that are consistent with all reads, substantially improving on previous algorithms. An efficient algorithm is also described to determine a set of assemblies that is consistent with a maximum subset of the reads, a previously untreated problem. We then prove that allowing nested reads or permitting mismatches between reads and their assemblies renders these problems NP-hard. Second, we describe a mixture model-based approach and apply the model to the detection of allele-specific methylation.

  20. Structural variant of the intergenic internal ribosome entry site elements in dicistroviruses and computational search for their counterparts

    PubMed Central

    HATAKEYAMA, YOSHINORI; SHIBUYA, NORIHIRO; NISHIYAMA, TAKASHI; NAKASHIMA, NOBUHIKO

    2004-01-01

    The intergenic region (IGR) located upstream of the capsid protein gene in dicistroviruses contains an internal ribosome entry site (IRES). Translation initiation mediated by the IRES does not require initiator methionine tRNA. Comparison of the IGRs among dicistroviruses suggested that Taura syndrome virus (TSV) and acute bee paralysis virus have an extra side stem loop in the predicted IRES. We examined whether the side stem is responsible for translation activity mediated by the IGR using constructs with compensatory mutations. In vitro translation analysis showed that TSV has an IGR-IRES that is structurally distinct from those previously described. Because IGR-IRES elements determine the translation initiation site by virtue of their own tertiary structure formation, the discovery of this initiation mechanism suggests the possibility that eukaryotic mRNAs might have more extensive coding regions than previously predicted. To test this hypothesis, we searched full-length cDNA databases and whole genome sequences of eukaryotes using the pattern matching program, Scan For Matches, with parameters that can extract sequences containing secondary structure elements resembling those of IGR-IRES. Our search yielded several sequences, but their predicted secondary structures were suggested to be unstable in comparison to those of dicistroviruses. These results suggest that RNAs structurally similar to dicistroviruses are not common. If some eukaryotic mRNAs are translated independently of an initiator methionine tRNA, their structures are likely to be significantly distinct from those of dicistroviruses. PMID:15100433

  1. Comprehensive profiling of retroviral integration sites using target enrichment methods from historical koala samples without an assembled reference genome

    PubMed Central

    Alquezar-Planas, David E.; Ishida, Yasuko; Courtiol, Alexandre; Timms, Peter; Johnson, Rebecca N.; Lenz, Dorina; Helgen, Kristofer M.; Roca, Alfred L.; Hartman, Stefanie

    2016-01-01

    Background. Retroviral integration into the host germline results in permanent viral colonization of vertebrate genomes. The koala retrovirus (KoRV) is currently invading the germline of the koala (Phascolarctos cinereus) and provides a unique opportunity for studying retroviral endogenization. Previous analyses of KoRV integration patterns in modern koalas demonstrate that they share integration sites primarily if they are related, indicating that the process is currently driven by vertical transmission rather than infection. However, due to methodological challenges, KoRV integrations have not been comprehensively characterized. Results. To overcome these challenges, we applied and compared three target enrichment techniques coupled with next generation sequencing (NGS) and a newly customized sequence-clustering based computational pipeline to determine the integration sites for 10 museum Queensland and New South Wales (NSW) koala samples collected between the 1870s and late 1980s. A secondary aim of this study was to identify common integration sites across modern and historical specimens by comparing our dataset to previously published studies. Several million sequences were processed, and the KoRV integration sites in each koala were characterized. Conclusions. Although the three enrichment methods each exhibited bias in integration site retrieval, a combination of two methods, Primer Extension Capture and hybridization capture, is recommended for future studies on historical samples. Moreover, identification of integration sites shows that the proportion of integration sites shared between any two koalas is quite small. PMID:27069793

  2. Association of Streptomyces community composition determined by PCR-denaturing gradient gel electrophoresis with indoor mold status

    PubMed Central

    Johansson, Elisabet; Reponen, Tiina; Meller, Jarek; Vesper, Stephen; Yadav, Jagjit

    2014-01-01

    Both Streptomyces species and mold species have previously been isolated from moisture-damaged building materials; however, an association between these two groups of microorganisms in indoor environments is not clear. In this study we used a culture-independent method, PCR denaturing gradient gel electrophoresis (PCR-DGGE), to investigate the composition of the Streptomyces community in house dust. Twenty-three dust samples each from two sets of homes categorized as high-mold and low-mold based on mold-specific quantitative PCR analysis were used in the study. Taxonomic identification of prominent bands was performed by cloning and sequencing. Associations between DGGE amplicon band intensities and home mold status were assessed using univariate analyses, as well as multivariate recursive partitioning (decision trees) to test the predictive value of combinations of band intensities. In the final classification tree, a combination of two bands was significantly associated with mold status of the home (p = 0.001). The sequence corresponding to one of the bands in the final decision tree matched a group of Streptomyces species that included S. coelicolor and S. sampsonii, both of which have been isolated from moisture-damaged buildings previously. The closest match for the majority of sequences corresponding to a second band consisted of a group of Streptomyces species that included S. hygroscopicus, an important producer of antibiotics and immunosuppressors. Taken together, the study showed that DGGE can be a useful tool for identifying bacterial species that may be more prevalent in mold-damaged buildings. PMID:25331035

  3. Targeted next-generation sequencing makes new molecular diagnoses and expands genotype-phenotype relationship in Ehlers-Danlos syndrome.

    PubMed

    Weerakkody, Ruwan A; Vandrovcova, Jana; Kanonidou, Christina; Mueller, Michael; Gampawar, Piyush; Ibrahim, Yousef; Norsworthy, Penny; Biggs, Jennifer; Abdullah, Abdulshakur; Ross, David; Black, Holly A; Ferguson, David; Cheshire, Nicholas J; Kazkaz, Hanadi; Grahame, Rodney; Ghali, Neeti; Vandersteen, Anthony; Pope, F Michael; Aitman, Timothy J

    2016-11-01

    Ehlers-Danlos syndrome (EDS) comprises a group of overlapping hereditary disorders of connective tissue with significant morbidity and mortality, including major vascular complications. We sought to identify the diagnostic utility of a next-generation sequencing (NGS) panel in a mixed EDS cohort. We developed and applied PCR-based NGS assays for targeted, unbiased sequencing of 12 collagen and aortopathy genes to a cohort of 177 unrelated EDS patients. Variants were scored blind to previous genetic testing and then compared with results of previous Sanger sequencing. Twenty-eight pathogenic variants in COL5A1/2, COL3A1, FBN1, and COL1A1 and four likely pathogenic variants in COL1A1, TGFBR1/2, and SMAD3 were identified by the NGS assays. These included all previously detected single-nucleotide and other short pathogenic variants in these genes, and seven newly detected pathogenic or likely pathogenic variants leading to clinically significant diagnostic revisions. Twenty-two variants of uncertain significance were identified, seven of which were in aortopathy genes and required clinical follow-up. Unbiased NGS-based sequencing made new molecular diagnoses outside the expected EDS genotype-phenotype relationship and identified previously undetected clinically actionable variants in aortopathy susceptibility genes. These data may be of value in guiding future clinical pathways for genetic diagnosis in EDS. Genet Med 18(11), 1119-1127.

  4. Phylogeny and origin of 82 zygomycetes from all 54 genera of the Mucorales and Mortierellales based on combined analysis of actin and translation elongation factor EF-1alpha genes.

    PubMed

    Voigt, K; Wöstemeyer, J

    2001-05-30

    True fungi (Eumycota) are heterotrophic eukaryotic microorganisms encompassing ascomycetes, basidiomycetes, chytridiomycetes and zygomycetes. The natural systematics of the latter group, Zygomycota, are very poorly understood due to the lack of distinguishing morphological characters. We have determined sequences for the nuclear-encoded genes actin (act) from 82 zygomycetes representing all 54 currently recognized genera from the two zygomycetous orders Mucorales and Mortierellales. We also determined sequences for translation elongation factor EF-1alpha (tef) from 16 zygomycetes (total of 96,837 bp). Phylogenetic analysis in the context of available sequence data (total 2,062 nucleotide positions per species) revealed that current classification schemes for the mucoralean fungi are highly unnatural at the family and, to a large extent, at the genus level. The data clearly indicate a deep, ancient and distinct dichotomy of the orders Mucorales and Mortierellales, which are recognized only in some zygomycete systems. Yet at the same time the data show that two genera - Umbelopsis and Micromucor - previously placed within the Mortierellales on the basis of their weakly developed columella (a morphological structure of the sporangiophore well-developed within all Mucorales) are in fact members of the Mucorales. Phylogenetic analyses of the encoded amino acid sequences in the context of homologues from eukaryotes and archaebacterial outgroups indicate that the Eumycota studied here are a natural group but provide little or no support for the monophyly of either zygomycetes, ascomycetes or basidiomycetes. The data clearly indicate that a complete revision of zygomycete natural systematics is necessary.

  5. Phylogenetic relationships between some members of the genera Neisseria, Acinetobacter, Moraxella, and Kingella based on partial 16S ribosomal DNA sequence analysis.

    PubMed

    Enright, M C; Carter, P E; MacLean, I A; McKenzie, H

    1994-07-01

    We obtained 16S ribosomal DNA (rDNA) sequence data for strains belonging to 11 species of Proteobacteria, including the type strains of Kingella kingae, Neisseria lactamica, Neisseria meningitidis, Moraxella lacunata subsp. lacunata, [Neisseria] ovis, Moraxella catarrhalis, Moraxella osloensis, [Moraxella] phenylpyruvica, and Acinetobacter lwoffii, as well as strains of Neisseria subflava and Acinetobacter calcoaceticus. The data in a distance matrix constructed by comparing the sequences supported the proposal that the genera Acinetobacter and Moraxella and [N.] ovis should be excluded from the family Neisseriaceae. Our results are consistent with hybridization data which suggest that these excluded taxa should be part of a new family, the Moraxellaceae. The strains that we studied can be divided into the following five groups: (i) M. lacunata subsp. lacunata, [N.] ovis, and M. catarrhalis; (ii) M. osloensis; (iii) [M.] phenylpyruvica; (iv) A. calcoaceticus and A. lwoffii; and (v) N. meningitidis, N. subflava, N. lactamica, and K. kingae. We agree with the previous proposal that [N.] ovis should be renamed Moraxella ovis, as this organism is closely related to Moraxella species and not to Neisseria species. The generically misnamed taxon [M.] phenylpyruvica belongs to the proposed family Moraxellaceae, but it is sufficiently different to warrant exclusion from the genus Moraxella. Further work needs to be done to investigate genetically similar species, such as Psychrobacter immobilis, before the true generic position of this organism can be determined. Automated 16S rDNA sequencing with the PCR allows workers to accurately determine phylogenetic relationships between groups of organisms.(ABSTRACT TRUNCATED AT 250 WORDS)

  6. NMR structure determination of a synthetic analogue of bacillomycin Lc reveals the strategic role of L-Asn1 in the natural iturinic antibiotics

    NASA Astrophysics Data System (ADS)

    Volpon, Laurent; Tsan, Pascale; Majer, Zsuzsa; Vass, Elemer; Hollósi, Miklós; Noguéra, Valérie; Lancelin, Jean-Marc; Besson, Françoise

    2007-08-01

    Iturins are a group of antifungal compounds produced by Bacillus subtilis. All are cyclic lipopeptides with seven α-amino acids of configuration LDDLLDL and one β-amino fatty acid. Bacillomycin L is a member of this family, and its NMR structure was previously resolved using the sequence Asp-Tyr-Asn-Ser-Gln-Ser-Thr. In this work, we carefully examined the NMR spectra of this compound and detected an error in the sequence. In fact, Asp1 and Gln5 need to be changed into Asn1 and Glu5, which therefore makes it identical to bacillomycin Lc. As a consequence, it now appears that all iturinic peptides with antibiotic activity share the common β-amino fatty acid 8-L-Asn1-D-Tyr2-D-Asn3 sequence. To better understand the conformational influence of the acidic residue L-Asp1, present, for example, in the inactive iturin C, the NMR structure of the synthetic analogue SCP [cyclo(L-Asp1-D-Tyr2-D-Asn3-L-Ser4-L-Gln5-D-Ser6-L-Thr7-β-Ala8)] was determined and compared with bacillomycin Lc recalculated with the corrected sequence. In both cases, the conformers obtained were separated into two families of similar energy which essentially differ in the number and type of turns. A detailed analysis of both cyclopeptide structures is presented here. In addition, CD and FTIR spectra were recorded and confirmed the conformational differences observed by NMR between both cyclopeptides.

  7. Relationships between residue Voronoi volume and sequence conservation in proteins.

    PubMed

    Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung

    2018-02-01

    Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
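
    The RSV definition quoted in the abstract above is simple enough to write out directly. The following is a minimal sketch, not the authors' code: the function names and the example volumes are hypothetical, and the Pearson helper is a generic textbook implementation included only to show how per-residue RSV values could be compared with conservation scores.

```python
import math


def relative_space_of_voronoi_volume(voronoi_volume, vdw_volume):
    """RSV = (Voronoi volume - van der Waals volume) / Voronoi volume.

    Both volumes must be in the same units. The result lies in [0, 1]
    when 0 <= vdw_volume <= voronoi_volume.
    """
    if voronoi_volume <= 0:
        raise ValueError("Voronoi volume must be positive")
    if not 0 <= vdw_volume <= voronoi_volume:
        raise ValueError("van der Waals volume must lie in [0, Voronoi volume]")
    return (voronoi_volume - vdw_volume) / voronoi_volume


def pearson(xs, ys):
    """Plain Pearson correlation coefficient between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical residue: Voronoi volume 200, van der Waals volume 150
# (same arbitrary units), giving a quarter of the cell as free space.
example_rsv = relative_space_of_voronoi_volume(200.0, 150.0)
```

    In practice the two volumes would come from a structure-analysis tool; the abstract does not say which one was used.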

  8. Whole-genome sequencing to determine Neisseria gonorrhoeae transmission: an observational study

    PubMed Central

    Cole, Kevin; Cole, Michelle J; Cresswell, Fiona; Dean, Gillian; Dave, Jayshree; Thomas, Daniel Rh; Foster, Kirsty; Waldram, Alison; Wilson, Daniel J; Didelot, Xavier; Grad, Yonatan H; Crook, Derrick W; Peto, Tim EA; Walker, A Sarah

    2016-01-01

    Background. New approaches are urgently required to address increasing rates of gonorrhoea and the emergence and global spread of antibiotic-resistant Neisseria gonorrhoeae. Whole genome sequencing (WGS) can be applied to study transmission and track resistance. Methods. We performed WGS on 1659 isolates from Brighton, UK, and 217 additional isolates from other UK locations. We included WGS data (n=196) from the USA. Estimated mutation rates, plus diversity observed within patients across anatomical sites and probable transmission pairs, were used to fit a coalescent model to determine the number of single nucleotide polymorphisms (SNPs) expected between sequences related by direct/indirect transmission, depending on the time between samples. Findings. We detected extensive local transmission. 281/1061 (26%) Brighton cases were indistinguishable (0 SNPs) from ≥1 previous case(s), and 786 (74%) had evidence of a sampled direct or indirect Brighton source. There was evidence of sustained transmission of some lineages. We observed multiple related samples across geographic locations. Of 1273 infections in Brighton, 225 (18%) were linked to another case from elsewhere in the UK, and 115 (9%) to a case from the USA. Four lineages initially identified in Brighton could be linked to 70 USA sequences, including 61 from a lineage carrying the mosaic penA XXXIV associated with reduced cefixime susceptibility. Interpretation. We present a WGS-based tool for genomic contact tracing of N. gonorrhoeae and demonstrate local, national and international transmission. WGS can be applied across geographical boundaries to investigate gonorrhoea transmission and to track antimicrobial resistance. Funding. Oxford NIHR Health Protection Research Unit and Biomedical Research Centre. PMID:27427203
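
    The "indistinguishable (0 SNPs)" comparison in the abstract above can be illustrated with a pairwise SNP count over aligned sequences. This is only a sketch under stated assumptions: the sequences are taken to be pre-aligned and of equal length, ambiguous calls (`N`, `-`) are skipped, and the function names and toy data are hypothetical. The study's actual linkage rule relies on a fitted coalescent model with time-dependent SNP thresholds, which is not reproduced here.

```python
def snp_distance(seq_a, seq_b, ambiguous="N-"):
    """Count positions where two aligned sequences carry different bases.

    Positions where either sequence has an ambiguous character are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a not in ambiguous and b not in ambiguous
    )


def indistinguishable_from_previous(cases):
    """Return IDs of cases that are 0 SNPs from at least one earlier case.

    `cases` is a list of (case_id, aligned_sequence) in sampling order.
    """
    linked = []
    for index, (case_id, sequence) in enumerate(cases):
        earlier = cases[:index]
        if any(snp_distance(sequence, prev_seq) == 0 for _, prev_seq in earlier):
            linked.append(case_id)
    return linked


# Toy data, invented for illustration only.
toy_cases = [("c1", "ACGT"), ("c2", "ACGT"), ("c3", "ACGA"), ("c4", "ANGA")]
toy_linked = indistinguishable_from_previous(toy_cases)
```

    The pairwise loop is quadratic in the number of cases, which is acceptable for a sketch but would need indexing or clustering for thousands of genomes.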

  9. Detection of Mixed Infection from Bacterial Whole Genome Sequence Data Allows Assessment of Its Role in Clostridium difficile Transmission

    PubMed Central

    Eyre, David W.; Cule, Madeleine L.; Griffiths, David; Crook, Derrick W.; Peto, Tim E. A.

    2013-01-01

    Bacterial whole genome sequencing offers the prospect of rapid and high precision investigation of infectious disease outbreaks. Close genetic relationships between microorganisms isolated from different infected cases suggest transmission is a strong possibility, whereas transmission between cases with genetically distinct bacterial isolates can be excluded. However, undetected mixed infections (infection with ≥2 unrelated strains of the same species where only one is sequenced) potentially impair exclusion of transmission with certainty, and may therefore limit the utility of this technique. We investigated the problem by developing a computationally efficient method for detecting mixed infection without the need for resource-intensive independent sequencing of multiple bacterial colonies. Given the relatively low density of single nucleotide polymorphisms within bacterial sequence data, direct reconstruction of mixed infection haplotypes from current short-read sequence data is not consistently possible. We therefore use a two-step maximum likelihood-based approach, assuming each sample contains up to two infecting strains. We jointly estimate the proportion of the infection arising from the dominant and minor strains, and the sequence divergence between these strains. In cases where mixed infection is confirmed, the dominant and minor haplotypes are then matched to a database of previously sequenced local isolates. We demonstrate the performance of our algorithm with in silico and in vitro mixed infection experiments, and apply it to transmission of an important healthcare-associated pathogen, Clostridium difficile. Using hospital ward movement data in a previously described stochastic transmission model, 15 pairs of cases enriched for likely transmission events associated with mixed infection were selected. Our method identified four previously undetected mixed infections, and a previously undetected transmission event, but no direct transmission between the pairs of cases under investigation. These results demonstrate that mixed infections can be detected without additional sequencing effort, and this will be important in assessing the extent of cryptic transmission in our hospitals. PMID:23658511
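
    To make the idea of a maximum likelihood estimate of strain proportions concrete, here is a deliberately simplified sketch. It is not the published two-step method: it assumes the sites at which the two strains differ are already known, treats reads at each site as binomial draws, uses a fixed hypothetical sequencing error rate, and estimates only the minor-strain fraction by grid search (the paper also estimates sequence divergence jointly). All names and numbers are illustrative.

```python
import math


def log_likelihood(minor_fraction, site_counts, error_rate=0.01):
    """Binomial log-likelihood (up to a constant) of the read counts.

    `site_counts` is a list of (minor_allele_reads, depth) pairs, one per
    site at which the two strains are assumed to differ.
    """
    # Probability that a read shows the minor allele, allowing for errors.
    p = minor_fraction * (1.0 - error_rate) + (1.0 - minor_fraction) * error_rate
    total = 0.0
    for minor_reads, depth in site_counts:
        total += minor_reads * math.log(p)
        total += (depth - minor_reads) * math.log(1.0 - p)
    return total


def estimate_minor_fraction(site_counts, error_rate=0.01):
    """Grid-search the minor-strain fraction over 0.00, 0.01, ..., 0.50."""
    grid = [i / 100.0 for i in range(51)]
    return max(grid, key=lambda f: log_likelihood(f, site_counts, error_rate))


# Invented counts: roughly 20% of reads carry the minor allele at each site.
mixed_sample = [(20, 100), (18, 100), (22, 100)]
# Invented counts consistent with sequencing error alone.
pure_sample = [(1, 100), (0, 100), (2, 100)]
```

    A fraction near zero indicates that the minor-allele reads are explained by error alone, whereas a clearly positive estimate is what a mixed infection would produce under these assumptions.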

  10. CpG PatternFinder: a Windows-based utility program for easy and rapid identification of the CpG methylation status of DNA.

    PubMed

    Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C

    2007-09-01

    The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result, CpG PatternFinder was developed for this purpose. The main functions of CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information; (ii) to tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment; (iii) to align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status; and (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to the Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware program available on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
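
    Functions (i) and (iii) described above follow from how bisulfite treatment works: unmethylated cytosines read as T, while methylated cytosines stay C. The sketch below illustrates that logic only and is not CpG PatternFinder's implementation. It assumes the read is already aligned to the reference, gap-free, and on the same strand; the function names and the toy sequences are hypothetical.

```python
def cytosine_positions(reference):
    """Return (cpg_positions, non_cpg_c_positions) as 0-based indices."""
    ref = reference.upper()
    cpg = [i for i in range(len(ref) - 1) if ref[i:i + 2] == "CG"]
    cpg_set = set(cpg)
    non_cpg = [i for i, base in enumerate(ref) if base == "C" and i not in cpg_set]
    return cpg, non_cpg


def analyze_read(reference, read):
    """Score one aligned bisulfite read against its reference.

    Returns (methylation, efficiency): `methylation` maps each CpG position
    to True (read shows C, methylated) or False (read shows T, unmethylated);
    `efficiency` is the fraction of non-CpG cytosines that read as T, or
    None if no non-CpG cytosine is informative.
    """
    if len(reference) != len(read):
        raise ValueError("read must be aligned to the reference length")
    cpg, non_cpg = cytosine_positions(reference)
    read = read.upper()
    # Only C or T calls are informative at a reference cytosine.
    methylation = {i: read[i] == "C" for i in cpg if read[i] in "CT"}
    informative = [i for i in non_cpg if read[i] in "CT"]
    converted = sum(1 for i in informative if read[i] == "T")
    efficiency = converted / len(informative) if informative else None
    return methylation, efficiency


# Toy example: CpGs at positions 1 and 5, non-CpG cytosines at 4 and 9.
toy_reference = "ACGTCCGATC"
toy_read = "ACGTTTGATT"
toy_methylation, toy_efficiency = analyze_read(toy_reference, toy_read)
```

    Conversion efficiency is estimated from non-CpG cytosines because, in the contexts this kind of analysis usually targets, those are expected to be unmethylated and therefore fully converted.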

  11. Characterization of HIV Type 1 Envelope Sequence Among Viral Isolates Circulating in the Northern Region of Colombia, South America

    PubMed Central

    Villarreal, José-Luis; Gutiérrez, Jaime; Palacio, Lucy; Peñuela, Martha; Hernández, Robin; Lemay, Guy

    2012-01-01

    To characterize human immunodeficiency virus (HIV-1) strains circulating in the Northern region of Colombia in South America, sequences of the viral envelope C2V3C3 region were obtained from patients with different high-risk practices. Close to 60% of the sequences were predicted to belong to macrophage-tropic viruses, according to the positions of acidic amino acids and putative N-linked glycosylation sites. This is in agreement with the fact that most of the patients were recently diagnosed individuals. Phylogenetic analysis then allowed assignment of all 35 samples to subtype B viruses. This same subtype was found in previous studies carried out in other Colombian regions. This study thus expands previous analyses with previously missing data from the Northern region of the country. The number and the length of the sequences examined also help to provide a clearer picture of the prevailing situation of the present HIV epidemic in this country. PMID:22482735

  12. The genome of Chelonid herpesvirus 5 harbors atypical genes

    USGS Publications Warehouse

    Ackermann, Mathias; Koriabine, Maxim; Hartmann-Fritsch, Fabienne; de Jong, Pieter J.; Lewis, Teresa D.; Schetle, Nelli; Work, Thierry M.; Dagenais, Julie; Balazs, George H.; Leong, Jo-Ann C.

    2012-01-01

    The Chelonid fibropapilloma-associated herpesvirus (CFPHV; ChHV5) is believed to be the causative agent of fibropapillomatosis (FP), a neoplastic disease of marine turtles. While clinical signs and pathology of FP are well known, research on ChHV5 has been impeded because no cell culture system for its propagation exists. We have cloned a BAC containing ChHV5 in pTARBAC2.1 and determined its nucleotide sequence. Accordingly, ChHV5 has a type D genome and its predominant gene order is typical for the varicellovirus genus within the Alphaherpesvirinae. However, at least four genes that are atypical for an alphaherpesvirus genome were also detected, i.e. two members of the C-type lectin-like domain superfamily (F-lec1, F-lec2), an orthologue to the mouse cytomegalovirus M04 (F-M04) and a viral sialyltransferase (F-sial). Four lines of evidence suggest that these atypical genes are truly part of the ChHV5 genome: (1) The pTARBAC insertion interrupted the UL52 ORF, leaving parts of the gene to either side of the insertion and suggesting that an intact molecule had been cloned. (2) Using FP-associated UL52 (F-UL52) as an anchor and the BAC-derived sequences as a means to generate primers, overlapping PCR was performed with tumor-derived DNA as template, which confirmed the presence of the same stretch of “atypical” DNA in independent FP cases. (3) Pyrosequencing of DNA from independent tumors did not reveal previously undetected viral sequences, suggesting that no apparent loss of viral sequence had happened due to the cloning strategy. (4) The simultaneous presence of previously known ChHV5 sequences and F-sial as well as F-M04 sequences was also confirmed in geographically distinct Australian cases of FP. Finally, transcripts of F-sial and F-M04 but not transcripts of lytic viral genes were detected in tumors from Hawaiian FP cases. Therefore, we suggest that F-sial and F-M04 may play a role in FP pathogenesis.

  13. Identification of sequence motifs significantly associated with antisense activity.

    PubMed

    McQuisten, Kyle A; Peek, Andrew S

    2007-06-07

    Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequence's antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic mediators to speed the process along like the RNA Induced Silencing Complex (RISC) in RNAi. The independence of motif position and antisense activity also allows us to bypass consideration of this feature in the modelling process, promoting model efficiency and reducing the chance of overfitting when predicting antisense activity. The increase in SVR correlation with significant features compared to nearest-neighbour features indicates that thermodynamics alone is likely not the only factor in determining antisense efficiency.
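
    The abstract does not spell out its randomization procedure, so the following is a generic label-permutation test offered only as one plausible reading. It enumerates motifs of 2 to 5 bases, measures the difference in mean activity between oligonucleotides that contain a motif and those that do not, and compares it with the same statistic after shuffling the activity values. All function names, the seed, and the toy data are hypothetical.

```python
import random


def enumerate_motifs(sequence, min_len=2, max_len=5):
    """Return the set of all substrings of length min_len..max_len."""
    seq = sequence.upper()
    return {
        seq[i:i + k]
        for k in range(min_len, max_len + 1)
        for i in range(len(seq) - k + 1)
    }


def motif_effect(oligos, activities, motif):
    """Mean activity of oligos containing the motif minus mean of the rest."""
    with_motif = [a for s, a in zip(oligos, activities) if motif in s]
    without = [a for s, a in zip(oligos, activities) if motif not in s]
    if not with_motif or not without:
        return 0.0
    return sum(with_motif) / len(with_motif) - sum(without) / len(without)


def permutation_p_value(oligos, activities, motif, n_permutations=1000, seed=0):
    """Two-sided p-value from shuffling activities across oligos."""
    rng = random.Random(seed)
    observed = abs(motif_effect(oligos, activities, motif))
    shuffled = list(activities)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(motif_effect(oligos, shuffled, motif)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)


# Invented oligos and suppression activities, for illustration only.
toy_oligos = ["AAGG", "CCTT", "GGAA", "TTCC"]
toy_activities = [0.9, 0.1, 0.8, 0.2]
toy_effect = motif_effect(toy_oligos, toy_activities, "GG")
toy_p = permutation_p_value(toy_oligos, toy_activities, "GG", n_permutations=200)
```

    With hundreds of candidate motifs tested at once, a real analysis would also need a multiple-testing correction; the abstract does not say which, if any, was applied.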

  14. Understanding soil health by capitalizing on long-term field studies

    NASA Astrophysics Data System (ADS)

    Tavakkoli, Ehsan; Wang, Zhe; VanZweieten, Lukas; Rose, Michael

    2017-04-01

    Microbial biodiversity in Australian agricultural soils is of paramount importance as it plays a critical role in regulating soil health, plant productivity, and the cycling of carbon, nitrogen, and other nutrients. Agricultural practices strongly affect soil microbial communities by changing the physical and chemical characteristics of the soil in which microorganisms live, thereby affecting their abundance, diversity, and activity. Despite its importance, the specific responses of various microbial groups to changing environmental conditions (e.g. increased/decreased carbon in response to land management) in agricultural soils are not well understood. This knowledge gap is largely due to previous methodological limitations that, until recently, did not allow microbial diversity and functioning to be meaningfully investigated on large numbers of samples. We sampled soils from a field trial on the effect of strategic tillage in no-till systems to examine the potential impact of tillage and stubble management on soil microbial composition. To determine the relative abundance of bacteria and fungi, we used quantitative PCR (qPCR), and to analyze the composition and diversity of the bacterial and fungal communities, we used bar-coded high-throughput sequencing. Bioinformatic analysis of the sequencing-generated data was performed using a previously scripted and tested pipeline, and involved allocation of the relevant sequences to their samples of origin according to the bar-code. In parallel, changes in soil quality and microbial functionality were determined using a multi-enzyme activity assay and multiple substrate-induced respiration. The extracellular enzyme activities that were measured include β-1,4-glucosidase, β-D-cellobiohydrolase, β-xylosidase, and α-1,4-glucosidase, which are all relevant to the C cycle, and β-1,4-N-acetylglucosaminidase and L-leucine aminopeptidase, which are both relevant to the N cycle and associated with protein catabolism. In this presentation, analyses of soil health and functionality in relation to its response to various agronomic practices and implications for C sequestration and nutrient cycling will be discussed.

  15. BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations

    PubMed Central

    Wang, Junbai; Batmanov, Kirill

    2015-01-01

    Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein–DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein–DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972

  16. Re-sequencing regions of the ovine Y chromosome in domestic and wild sheep reveals novel paternal haplotypes.

    PubMed

    Meadows, J R S; Kijas, J W

    2009-02-01

    The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.

  17. Analysis of complex repeat sequences within the spinal muscular atrophy (SMA) candidate region in 5q13

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Davies, K.E.; Morrison, K.E.; Daniels, R.I.

    1994-09-01

    We previously reported that the 400 kb interval flanked by the polymorphic loci D5S435 and D5S557 contains blocks of a chromosome 5 specific repeat. This interval also defines the SMA candidate region by genetic analysis of recombinant families. A YAC contig of 2-3 Mb encompassing this area has been constructed and a 5.5 kb conserved fragment, isolated from a YAC end clone within the above interval, was used to obtain cDNAs from both fetal and adult brain libraries. We describe the identification of cDNAs with stretches of high DNA sequence homology to exons of β-glucuronidase on human chromosome 7. The cDNAs map both to the candidate region and to an area of 5p using FISH and deletion hybrid analysis. Hybridization to bacteriophage and cosmid clones from the YACs localizes the β-glucuronidase related sequences within the 400 kb region of the YAC contig. The cDNAs show a polymorphic pattern on hybridization to genomic BamH1 fragments in the size range of 10-250 kb. Further analysis using YAC fragmentation vectors is being used to determine how these β-glucuronidase related cDNAs are distributed within 5q13. Dinucleotide repeats within the region are being investigated to determine linkage disequilibrium with the disease locus.

  18. Genetic diversity and potential vectors and reservoirs of Cucurbit aphid-borne yellows virus in southeastern Spain.

    PubMed

    Kassem, Mona A; Juarez, Miguel; Gómez, Pedro; Mengual, Carmen M; Sempere, Raquel N; Plaza, María; Elena, Santiago F; Moreno, Aranzazu; Fereres, Alberto; Aranda, Miguel A

    2013-11-01

    The genetic variability of a Cucurbit aphid-borne yellows virus (CABYV) (genus Polerovirus, family Luteoviridae) population was evaluated by determining the nucleotide sequences of two genomic regions of CABYV isolates collected in open-field melon and squash crops during three consecutive years in Murcia (southeastern Spain). A phylogenetic analysis showed the existence of two major clades. The sequences did not cluster according to host, year, or locality of collection, and nucleotide similarities among isolates were 97 to 100 and 94 to 97% within and between clades, respectively. The ratio of nonsynonymous to synonymous nucleotide substitutions reflected that all open reading frames have been under purifying selection. Estimates of the population's genetic diversity were of the same magnitude as those previously reported for other plant virus populations sampled at larger spatial and temporal scales, suggesting either the presence of CABYV in the surveyed area long before it was first described, multiple introductions, or a particularly rapid diversification. We also determined the full-length sequences of three isolates, identifying the occurrence and location of recombination events along the CABYV genome. Furthermore, our field surveys indicated that Aphis gossypii was the major vector species of CABYV and the most abundant aphid species colonizing melon fields in the Murcia (Spain) region. Our surveys also suggested the importance of the weed species Ecballium elaterium as an alternative host and potential virus reservoir.

  19. The V-band Empirical Mass-luminosity Relation for Main Sequence Stars

    NASA Astrophysics Data System (ADS)

    Xia, Fang; Fu, Yan-Ning

    2010-07-01

    Stellar mass is an indispensable parameter in the studies of stellar physics and stellar dynamics. On the one hand, the most reliable way to determine the stellar dynamical mass is via orbital determinations of binaries. On the other hand, however, most stellar masses have to be estimated by using the mass-luminosity relation (MLR). Therefore, it is important to obtain the empirical MLR through fitting the data of stellar dynamical mass and luminosity. The effect of metallicity can make this relation disperse in the V-band, but studies show that this is mainly limited to the case when the stellar mass is less than 0.6M⊙. Recently, many relevant data have been accumulated for main sequence stars with larger masses, which make it possible to significantly improve the corresponding MLR. Using a fitting method which can reasonably assign weights to the observational data including two quantities with different dimensions, we obtain a V-band MLR based on the dynamical masses and luminosities of 203 main sequence stars. In comparison with the previous work, the improved MLR is statistically significant, and the relative error of mass estimation reaches about 5%. Therefore, our MLR is useful not only in the studies of statistical nature, but also in the studies of concrete stellar systems, such as the long-term dynamical study and the short-term positioning study of a specific multiple star system.

  20. The V Band Empirical Mass-Luminosity Relation for Main Sequence Stars

    NASA Astrophysics Data System (ADS)

    Xia, F.; Fu, Y. N.

    2010-01-01

    Stellar mass is an indispensable parameter in the studies of stellar physics and stellar dynamics. On the one hand, the most reliable way to determine the stellar dynamical mass is via orbital determination of binaries. On the other hand, however, most stellar masses have to be estimated by using the mass-luminosity relation (MLR). Therefore, it is important to obtain the empirical MLR through fitting the data of stellar dynamical mass and luminosity. The effect of metallicity can make this relation disperse in the V-band, but studies show that this is mainly limited to the case when the stellar mass is less than 0.6M⊙. Recently, many relevant data have been accumulated for main sequence stars with larger mass, which make it possible to significantly improve the corresponding MLR. Using a fitting method which can reasonably assign weight to the observational data including two quantities with different dimensions, we obtain a V-band MLR based on the dynamical masses and luminosities of 203 main sequence stars. Compared with the previous work, the improved MLR is statistically significant, and the relative error of mass estimation reaches about 5%. Therefore, our MLR is useful not only in studies of statistical nature, but also in studies of concrete stellar systems, such as the long-term dynamical study and the short-term positioning study of a specific multiple star system.

  1. Niche specialization of terrestrial archaeal ammonia oxidizers.

    PubMed

    Gubry-Rangin, Cécile; Hai, Brigitte; Quince, Christopher; Engel, Marion; Thomson, Bruce C; James, Phillip; Schloter, Michael; Griffiths, Robert I; Prosser, James I; Nicol, Graeme W

    2011-12-27

    Soil pH is a major determinant of microbial ecosystem processes and potentially a major driver of evolution, adaptation, and diversity of ammonia oxidizers, which control soil nitrification. Archaea are major components of soil microbial communities and contribute significantly to ammonia oxidation in some soils. To determine whether pH drives evolutionary adaptation and community structure of soil archaeal ammonia oxidizers, sequences of amoA, a key functional gene of ammonia oxidation, were examined in soils at global, regional, and local scales. Globally distributed database sequences clustered into 18 well-supported phylogenetic lineages that dominated specific soil pH ranges classified as acidic (pH <5), acido-neutral (5 ≤ pH <7), or alkalinophilic (pH ≥ 7). To determine whether patterns were reproduced at regional and local scales, amoA gene fragments were amplified from DNA extracted from 47 soils in the United Kingdom (pH 3.5-8.7), including a pH-gradient formed by seven soils at a single site (pH 4.5-7.5). High-throughput sequencing and analysis of amoA gene fragments identified an additional, previously undiscovered phylogenetic lineage and revealed similar pH-associated distribution patterns at global, regional, and local scales, which were most evident for the five most abundant clusters. Archaeal amoA abundance and diversity increased with soil pH, which was the only physicochemical characteristic measured that significantly influenced community structure. These results suggest evolution based on specific adaptations to soil pH and niche specialization, resulting in a global distribution of archaeal lineages that have important consequences for soil ecosystem function and nitrogen cycling.
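
    The three soil pH classes quoted in the abstract above can be written as a small lookup. This is only an illustrative sketch: the function name `classify_soil_ph` is hypothetical and not taken from the study; the thresholds are the ones stated in the abstract.

```python
def classify_soil_ph(ph: float) -> str:
    """Return the pH class named in the abstract for a soil pH value.

    Thresholds follow the abstract: acidic (pH < 5),
    acido-neutral (5 <= pH < 7), alkalinophilic (pH >= 7).
    The function itself is a hypothetical helper.
    """
    if ph < 5:
        return "acidic"
    if ph < 7:
        return "acido-neutral"
    return "alkalinophilic"


# The single-site gradient in the abstract spans pH 4.5-7.5,
# so it touches all three classes.
gradient_classes = [classify_soil_ph(p) for p in (4.5, 6.0, 7.5)]
```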

  2. Sequence Variation of the tRNALeu Intron as a Marker for Genetic Diversity and Specificity of Symbiotic Cyanobacteria in Some Lichens

    PubMed Central

    Paulsrud, Per; Lindblad, Peter

    1998-01-01

    We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogeneity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083

  3. Structural analysis of the HLA-A/HLA-F subregion: Precise localization of two new multigene families closely associated with the HLA class I sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pichon, L.; Carn, G.; Bouric, P.

    1996-03-01

    Positional cloning strategies for the hemochromatosis gene have previously concentrated on a target area restricted to a maximum genomic expanse of 400 kb around the HLA-A and HLA-F loci. Recently, the candidate region has been extended to 2-3 Mb on the distal side of the MHC. In this study, 10 coding sequences [hemochromatosis candidate genes (HCG) I to X] were isolated by cDNA selection using YACs covering the HLA-A/HLA-F subregion. Two of these (HCG II and HCG IV) belong to multigene families, as well as other sequences already described in this region, i.e., P5, pMC 6.7, and HLA class I. Fingerprinting of the four YACs overlapping the region was performed and allowed partial localization of the different multigene family sequences on each YAC without defining their exact positions. Fingerprinting on cosmids isolated from the ICRF chromosome 6-specific cosmid library allowed more precise localization of the redundant sequences in all of the multigene families and revealed their apparent organization in clusters. Further examination of these intertwined sequences demonstrated that this structural organization resulted from a succession of complex phenomena, including duplications and contractions. This study presents a precise description of the structural organization of the HLA-A/HLA-F region and a determination of the sequences involved in the megabase size polymorphism observed among the A3, A24, and A31 haplotypes. 29 refs., 2 figs., 2 tabs.

  4. Genetic Characteristics of Coronaviruses from Korean Bats in 2016.

    PubMed

    Lee, Saemi; Jo, Seong-Deok; Son, Kidong; An, Injung; Jeong, Jipseol; Wang, Seung-Jun; Kim, Yongkwan; Jheong, Weonhwa; Oem, Jae-Ku

    2018-01-01

    Bats have increasingly been recognized as the natural reservoir of severe acute respiratory syndrome (SARS) coronavirus and other coronaviruses found in mammals. However, little research has been conducted on bat coronaviruses in South Korea. In this study, bat samples (332 oral swabs, 245 fecal samples, 38 urine samples, and 57 bat carcasses) were collected at 33 natural bat habitat sites in South Korea. RT-PCR and sequencing were performed for specific coronavirus genes to identify the bat coronaviruses in different bat samples. Coronaviruses were detected in 2.7% (18/672) of the samples: 13 oral swabs from one species of the family Rhinolophidae, and four fecal samples and one carcass (intestine) from three species of the family Vespertilionidae. To determine the genetic relationships between the 18 sequences obtained in this study and previously known coronaviruses, the nucleotide sequences of a 392-nt region of the RNA-dependent RNA polymerase (RdRp) gene were analyzed phylogenetically. Thirteen sequences belonging to SARS-like betacoronaviruses showed the highest nucleotide identity (97.1-99.7%) with Bat-CoV-JTMC15 reported in China. The other five sequences were most similar to MERS-like betacoronaviruses. Four nucleotide sequences displayed the highest identity (94.1-95.1%) with Bat-CoV-HKU5 from Hong Kong. The one sequence from a carcass showed the highest nucleotide identity (99%) with Bat-CoV-SC2013 from China. These results suggest that careful surveillance of coronaviruses from bats should be continued, because animal and human infections may result from the genetic variants present in bat coronavirus reservoirs.
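
    The detection rate quoted in the abstract above can be checked from the sample counts it reports. The snippet is a simple arithmetic sketch; the variable names are illustrative and not from the study.

```python
# Sample counts as reported in the abstract.
samples = {
    "oral swabs": 332,
    "fecal samples": 245,
    "urine samples": 38,
    "carcasses": 57,
}
# Positive samples as reported: 13 oral swabs, 4 fecal samples, 1 carcass.
positives = {"oral swabs": 13, "fecal samples": 4, "carcasses": 1}

total_samples = sum(samples.values())
total_positives = sum(positives.values())
detection_rate_pct = 100.0 * total_positives / total_samples

print(f"{total_positives}/{total_samples} = {detection_rate_pct:.1f}%")
```

    The totals reproduce the 18/672 figure, which rounds to the 2.7% stated in the abstract.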

  5. Identification and characterization of the reptilian GnRH-II gene in the leopard gecko, Eublepharis macularius, and its evolutionary considerations.

    PubMed

    Ikemoto, Tadahiro; Park, Min Kyun

    2003-10-16

    To elucidate the molecular phylogeny and evolution of a particular peptide, one must analyze not the limited primary amino acid sequences of the low molecular weight mature polypeptide, but rather the sequences of the corresponding precursors from various species. Of all the structural variants of gonadotropin-releasing hormone (GnRH), GnRH-II (chicken GnRH-II, or cGnRH-II) is remarkably conserved without any sequence substitutions among vertebrates, but its precursor sequences vary considerably. We have identified and characterized the full-length complementary DNA (cDNA) encoding the GnRH-II precursor and determined its genomic structure, consisting of four exons and three introns, in a reptilian species, the leopard gecko Eublepharis macularius. This is the first report about the GnRH-II precursor cDNA/gene from reptiles. The deduced leopard gecko prepro-GnRH-II polypeptide had the highest identities with the corresponding polypeptides of amphibians. The GnRH-II precursor mRNA was detected in more than half of the tissues and organs examined. This widespread expression is consistent with the previous findings in several species, though the roles of GnRH outside the hypothalamus-pituitary-gonadal axis remain largely unknown. Molecular phylogenetic analysis combined with sequence comparison showed that the leopard gecko is more similar to fishes and amphibians than to eutherian mammals with respect to the GnRH-II precursor sequence. These results strongly suggest that the divergence of the GnRH-II precursor sequences seen in eutherian mammals may have occurred along with amniote evolution.

  6. The Sequence of Study Changes What Information Is Attended to, Encoded, and Remembered during Category Learning

    ERIC Educational Resources Information Center

    Carvalho, Paulo F.; Goldstone, Robert L.

    2017-01-01

    The sequence of study influences how we learn. Previous research has identified different sequences as potentially beneficial for learning in different contexts and with different materials. Here we investigate the mechanisms involved in inductive category learning that give rise to these sequencing effects. Across 3 experiments we show evidence…

  7. 37 CFR 1.825 - Amendments to or replacement of sequence listing and computer readable copy thereof.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... of sequence listing and computer readable copy thereof. 1.825 Section 1.825 Patents, Trademarks, and... Amino Acid Sequences § 1.825 Amendments to or replacement of sequence listing and computer readable copy... copy of the computer readable form (§ 1.821(e)) including all previously submitted data with the...

  8. 37 CFR 1.825 - Amendments to or replacement of sequence listing and computer readable copy thereof.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... of sequence listing and computer readable copy thereof. 1.825 Section 1.825 Patents, Trademarks, and... Amino Acid Sequences § 1.825 Amendments to or replacement of sequence listing and computer readable copy... copy of the computer readable form (§ 1.821(e)) including all previously submitted data with the...

  9. 37 CFR 1.825 - Amendments to or replacement of sequence listing and computer readable copy thereof.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... of sequence listing and computer readable copy thereof. 1.825 Section 1.825 Patents, Trademarks, and... Amino Acid Sequences § 1.825 Amendments to or replacement of sequence listing and computer readable copy... copy of the computer readable form (§ 1.821(e)) including all previously submitted data with the...

  10. 37 CFR 1.825 - Amendments to or replacement of sequence listing and computer readable copy thereof.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... of sequence listing and computer readable copy thereof. 1.825 Section 1.825 Patents, Trademarks, and... Amino Acid Sequences § 1.825 Amendments to or replacement of sequence listing and computer readable copy... copy of the computer readable form (§ 1.821(e)) including all previously submitted data with the...

  11. 37 CFR 1.825 - Amendments to or replacement of sequence listing and computer readable copy thereof.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... of sequence listing and computer readable copy thereof. 1.825 Section 1.825 Patents, Trademarks, and... Amino Acid Sequences § 1.825 Amendments to or replacement of sequence listing and computer readable copy... copy of the computer readable form (§ 1.821(e)) including all previously submitted data with the...

  12. Genetic identification of the bacteriocins produced by Enterococcus faecium IT62 and evidence that bacteriocin 32 is identical to enterocin IT.

    PubMed

    Izquierdo, Esther; Cai, Yimin; Marchioni, Eric; Ennahar, Saïd

    2009-05-01

    Enterococcus faecium IT62, a strain isolated from ryegrass in Japan, produces three bacteriocins (enterocins L50A, L50B, and IT) that have been previously purified and the primary structures of which have been determined by amino acid sequencing (E. Izquierdo, A. Bednarczyk, C. Schaeffer, Y. Cai, E. Marchioni, A. Van Dorsselaer, and S. Ennahar, Antimicrob. Agents Chemother., 52:1917-1923, 2008). Genetic analysis showed that the bacteriocins of E. faecium IT62 are plasmid encoded, but with the structural genes specifying enterocin L50A and enterocin L50B being carried by a plasmid (pTAB1) that is separate from the one (pTIT1) carrying the structural gene of enterocin IT. Sequencing analysis of a 1,475-bp region from pTAB1 identified two consecutive open reading frames corresponding, with the exception of 2 bp, to the genes entL50A and entL50B, encoding EntL50A and EntL50B, respectively. Both bacteriocins are synthesized without N-terminal leader sequences. Genetic analysis of a sequenced 1,380-bp pTIT1 fragment showed that the genes entIT and entIM, encoding enterocin IT and its immunity protein, respectively, were both found in E. faecium VRE200 for bacteriocin 32. Enterocin IT, a 6,390-Da peptide made up of 54 amino acids, has been previously shown to be identical to the C-terminal part of bacteriocin 32, a 7,998-Da bacteriocin produced by E. faecium VRE200 whose structure was deduced from its structural gene (T. Inoue, H. Tomita, and Y. Ike, Antimicrob. Agents Chemother., 50:1202-1212, 2006). By combining the biochemical and genetic data on enterocin IT, it was concluded that bacteriocin 32 is in fact identical to enterocin IT, both being encoded by the same plasmid-borne gene, and that the N-terminal leader peptide for this bacteriocin is 35 amino acids long and not 19 amino acids long as previously reported.

  13. Nanoscale structure in AgSbTe2 determined by diffuse elastic neutron scattering

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Specht, Eliot D; Ma, Jie; Delaire, Olivier A

    2015-01-01

    Diffuse elastic neutron scattering measurements confirm that AgSbTe2 has a hierarchical structure, with defects on length scales from nanometers to microns. While scattering from mesoscale structure is consistent with previously-proposed structures in which Ag and Sb order on a NaCl lattice, more diffuse scattering from nanoscale structure suggests a structural rearrangement in which hexagonal layers form a combination of (ABC), (ABA), and (AAB) stacking sequences. The AgCrSe2 structure is the best-fitting model for the local atomic arrangements.

  14. The Complete Genome Sequence of the Plant Growth-Promoting Bacterium Pseudomonas sp. UW4

    PubMed Central

    Duan, Jin; Jiang, Wei; Cheng, Zhenyu; Heikkila, John J.; Glick, Bernard R.

    2013-01-01

    The plant growth-promoting bacterium (PGPB) Pseudomonas sp. UW4, previously isolated from the rhizosphere of common reeds growing on the campus of the University of Waterloo, promotes plant growth in the presence of different environmental stresses, such as flooding, high concentrations of salt, cold, heavy metals, drought and phytopathogens. In this work, the genome sequence of UW4 was obtained by pyrosequencing and the gaps between the contigs were closed by directed PCR. The P. sp. UW4 genome contains a single circular chromosome that is 6,183,388 bp with a 60.05% G+C content. The bacterial genome contains 5,423 predicted protein-coding sequences that occupy 87.2% of the genome. Nineteen genomic islands (GIs) were predicted and thirty one complete putative insertion sequences were identified. Genes potentially involved in plant growth promotion such as indole-3-acetic acid (IAA) biosynthesis, trehalose production, siderophore production, acetoin synthesis, and phosphate solubilization were determined. Moreover, genes that contribute to the environmental fitness of UW4 were also observed including genes responsible for heavy metal resistance such as nickel, copper, cadmium, zinc, molybdate, cobalt, arsenate, and chromate. Whole-genome comparison with other completely sequenced Pseudomonas strains and phylogeny of four concatenated “housekeeping” genes (16S rRNA, gyrB, rpoB and rpoD) of 128 Pseudomonas strains revealed that UW4 belongs to the fluorescens group, jessenii subgroup. PMID:23516524
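
    The G+C content reported above is the percentage of bases in a sequence that are G or C. A minimal sketch of that calculation follows; the function name `gc_content` and the toy sequence are assumptions for illustration, not data or tooling from the study.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Hypothetical helper; case-insensitive, and every character in
    the input is counted toward the sequence length.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy example (not from the UW4 genome): 3 of 5 bases are G or C.
toy_gc = gc_content("GGCAT")
```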

  15. A sequence database allowing automated genotyping of Classical swine fever virus isolates.

    PubMed

    Dreier, Sabrina; Zimmermann, Bernd; Moennig, Volker; Greiser-Wilke, Irene

    2007-03-01

    Classical swine fever (CSF) is a highly contagious viral disease of pigs. According to the OIE classification of diseases it is classified as a notifiable (previously List A) disease, thus having the potential for causing severe socio-economic problems and severely affecting the international trade of pigs and pig products. Effective control measures are compulsory, and to expose weaknesses a reliable tracing of the spread of the virus is necessary. Genetic typing has proved to be the method of choice. However, genotyping involves the use of multiple software applications, which is laborious and complex. A sequence database is described that is accessible via the World Wide Web and offers the option to type new CSF virus isolates automatically once their sequence is available. The sequence to be typed is tested for correct orientation and, if necessary, adjusted to the right length. The alignment and the neighbor-joining phylogenetic analysis with a standard set of sequences can then be calculated. The results are displayed as a graph. As an example, the determination of the genetic subgroup of the isolate obtained from the outbreaks registered in Russia in 2005 is shown. After registration (Irene.greiser-wilke@tiho-hannover.de) the database, including the module for genotyping, is accessible at http://viro08.tiho-hannover.de/eg/eurl_virus_db.htm.

  16. Single mutation in Shine-Dalgarno-like sequence present in the amino terminal of lactate dehydrogenase of Plasmodium effects the production of an eukaryotic protein expressed in a prokaryotic system.

    PubMed

    Cicek, Mustafa; Mutlu, Ozal; Erdemir, Aysegul; Ozkan, Ebru; Saricay, Yunus; Turgut-Balik, Dilek

    2013-06-01

    One of the most important steps in structure-based drug design studies is obtaining the protein in active form after cloning the target gene. In one of our previous studies, it was determined that an internal Shine-Dalgarno-like sequence present just before the third methionine at the N-terminus of the wild type lactate dehydrogenase enzyme of Plasmodium falciparum prevents the translation of the full-length protein. Inspection of the same region in P. vivax LDH, which was overproduced as an active enzyme, indicated that the codon preference in the same region was slightly different from the codon preference of wild type PfLDH. In this study, the 5'-GGAGGC-3' sequence of P. vivax that codes for two glycine residues just before the third methionine was exchanged to 5'-GGAGGA-3', mimicking P. falciparum LDH, to test the possible effects of having an internal SD-like sequence when expressing a eukaryotic protein in a prokaryotic system. The exchange was made by site-directed mutagenesis. Results indicated that having two glycine residues with an internal SD-like sequence (GGAGGA) just before the third methionine abolishes the enzyme activity due to the preference of the prokaryotic system used for the expression. This study emphasizes the need for awareness when using a prokaryotic system to overproduce a eukaryotic protein.
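
    The exchange described above changes a single nucleotide while leaving the encoded amino acids unchanged, since GGA and GGC both code for glycine in the standard genetic code. The sketch below illustrates this with a deliberately partial codon table; the helper names are hypothetical and not from the study.

```python
# Partial codon table: the four glycine codons of the standard genetic code.
GLYCINE_CODONS = {"GGA": "G", "GGC": "G", "GGG": "G", "GGT": "G"}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon using the partial table above."""
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return "".join(GLYCINE_CODONS[c] for c in codons)


def mismatch_positions(a: str, b: str) -> list[int]:
    """Return 0-based positions at which two equal-length strings differ."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


p_vivax = "GGAGGC"   # sequence named in the abstract for P. vivax
mutated = "GGAGGA"   # sequence after the exchange, mimicking P. falciparum

same_protein = translate(p_vivax) == translate(mutated)  # both give "GG"
changed = mismatch_positions(p_vivax, mutated)           # only the last base
```

    Both sequences translate to Gly-Gly, so the reported loss of activity cannot be attributed to an amino acid substitution.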

  17. Comparative Sequence and X-Inactivation Analyses of a Domain of Escape in Human Xp11.2 and the Conserved Segment in Mouse

    PubMed Central

    Tsuchiya, Karen D.; Greally, John M.; Yi, Yajun; Noel, Kevin P.; Truong, Jean-Pierre; Disteche, Christine M.

    2004-01-01

    We have performed X-inactivation and sequence analyses on 350 kb of sequence from human Xp11.2, a region shown previously to contain a cluster of genes that escape X inactivation, and we compared this region with the region of conserved synteny in mouse. We identified several new transcripts from this region in human and in mouse, which defined the full extent of the domain escaping X inactivation in both species. In human, escape from X inactivation involves an uninterrupted 235-kb domain of multiple genes. Despite highly conserved gene content and order between the two species, Smcx is the only mouse gene from the conserved segment that escapes inactivation. As repetitive sequences are believed to facilitate spreading of X inactivation along the chromosome, we compared the repetitive sequence composition of this region between the two species. We found that long terminal repeats (LTRs) were decreased in the human domain of escape, but not in the majority of the conserved mouse region adjacent to Smcx in which genes were subject to X inactivation, suggesting that these repeats might be excluded from escape domains to prevent spreading of silencing. Our findings indicate that genomic context, as well as gene-specific regulatory elements, interact to determine expression of a gene from the inactive X-chromosome. PMID:15197169

  18. Whole genomic DNA sequencing and comparative genomic analysis of Arthrospira platensis: high genome plasticity and genetic diversity

    PubMed Central

    Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu

    2016-01-01

    Arthrospira platensis is a multicellular, filamentous, non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. The A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than to Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira species, some of which involve genes previously shown to be laterally transferable among different species, such as genes coding for restriction-modification systems. Moreover, unprecedented extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from the genus Arthrospira consistently contained the highest rate of repetitive sequence compared with the other species of the order Oscillatoriales, suggesting that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provide in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as a valuable resource for functional genomics studies. PMID:27330141

  19. Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

    PubMed Central

    Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. 
Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. 
Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. 
Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. 
Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. 
David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

    2014-01-01

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775

  20. Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.

    PubMed

    Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M

    2010-12-15

    Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.

  1. Limited Genetic Diversity Preceded Extinction of the Tasmanian Tiger

    PubMed Central

    Menzies, Brandon R.; Renfree, Marilyn B.; Heider, Thomas; Mayer, Frieder; Hildebrandt, Thomas B.; Pask, Andrew J.

    2012-01-01

    The Tasmanian tiger or thylacine was the largest carnivorous marsupial when Europeans first reached Australia. Sadly, the last known thylacine died in captivity in 1936. A recent analysis of the genome of the closely related and extant Tasmanian devil demonstrated limited genetic diversity between individuals. While a similar lack of diversity has been reported for the thylacine, this analysis was based on just two individuals. Here we report the sequencing of an additional 12 museum-archived specimens collected between 102 and 159 years ago. We examined a portion of the mitochondrial DNA hyper-variable control region and determined that all sequences were on average 99.5% identical at the nucleotide level. As a measure of accuracy we also sequenced mitochondrial DNA from a mother and two offspring. As expected, these samples were found to be 100% identical, validating our methods. We also used 454 sequencing to reconstruct 2.1 kilobases of the mitochondrial genome, which shared 99.91% identity with the two complete thylacine mitochondrial genomes published previously. Our thylacine genomic data also contained three highly divergent putative nuclear mitochondrial sequences, which grouped phylogenetically with the published thylacine mitochondrial homologs but contained 100-fold more polymorphisms than the conserved fragments. Together, our data suggest that the thylacine population in Tasmania had limited genetic diversity prior to its extinction, possibly as a result of their geographic isolation from mainland Australia approximately 10,000 years ago. PMID:22530022
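
    The "on average 99.5% identical" figure is a mean pairwise identity over aligned sequences. The Python sketch below shows one way to compute such a value. It is an illustration under stated assumptions: the function names are hypothetical, gap positions are skipped, and the toy sequences are invented for the example rather than taken from the thylacine data.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent of matching positions between two aligned sequences.

    Positions where either sequence has a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        raise ValueError("no comparable positions")
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def mean_pairwise_identity(seqs):
    """Average percent identity over all unordered pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


# Invented toy alignment (NOT real data): two identical sequences and
# one that differs at a single position out of ten.
toy_alignment = ["ACGTACGTAC", "ACGTACGTAC", "ACGTACGTAT"]
print(round(mean_pairwise_identity(toy_alignment), 2))
```

    Averaging over all pairs, rather than comparing each sequence with a single reference, keeps the result independent of which specimen is listed first.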

  2. Infectious hematopoietic necrosis virus: monophyletic origin of European isolates from North American genogroup M.

    PubMed

    Enzmann, P J; Kurath, G; Fichtner, D; Bergmann, S M

    2005-09-23

    Infectious hematopoietic necrosis virus (IHNV) was first detected in Europe in 1987 in France and Italy, and later, in 1992, in Germany. The source of the virus and the route of introduction are unknown. The present study investigates the molecular epidemiology of IHNV outbreaks in Germany since its first introduction. The complete nucleotide sequences of the glycoprotein (G) and non-virion (NV) genes from 9 IHNV isolates from Germany have been determined, and this has allowed the identification of characteristic differences between these isolates. Phylogenetic analysis of partial G gene sequences (mid-G, 303 nucleotides) from North American IHNV isolates (Kurath et al. 2003) has revealed 3 major genogroups, designated U, M and L. Using this gene region with 2 different North American IHNV data sets, it was possible to group the European IHNV strains within the M genogroup, but not in any previously defined subgroup. Analysis of the full length G gene sequences indicated that an independent evolution of IHN viruses had occurred in Europe. IHN viruses in Europe seem to be of a monophyletic origin, again most closely related to North American isolates in the M genogroup. Analysis of the NV gene sequences also showed the European isolates to be monophyletic, but resolution of the 3 genogroups was poor with this gene region. As a result of comparative sequence analyses, several different genotypes have been identified circulating in Europe.

  3. Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol.

    PubMed

    Lange, Leslie A; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M; Smith, Joshua D; Turner, Emily H; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-Ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A; Holmen, Oddgeir L; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C; Correa, Adolfo; Griswold, Michael E; Jakobsdottir, Johanna; Smith, Albert V; Schreiner, Pamela J; Feitosa, Mary F; Zhang, Qunyuan; Huffman, Jennifer E; Crosby, Jacy; Wassel, Christina L; Do, Ron; Franceschini, Nora; Martin, Lisa W; Robinson, Jennifer G; Assimes, Themistocles L; Crosslin, David R; Rosenthal, Elisabeth A; Tsai, Michael; Rieder, Mark J; Farlow, Deborah N; Folsom, Aaron R; Lumley, Thomas; Fox, Ervin R; Carlson, Christopher S; Peters, Ulrike; Jackson, Rebecca D; van Duijn, Cornelia M; Uitterlinden, André G; Levy, Daniel; Rotter, Jerome I; Taylor, Herman A; Gudnason, Vilmundur; Siscovick, David S; Fornage, Myriam; Borecki, Ingrid B; Hayward, Caroline; Rudan, Igor; Chen, Y Eugene; Bottinger, Erwin P; Loos, Ruth J F; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M; Gabriel, Stacey B; O'Donnell, Christopher J; Post, Wendy S; North, Kari E; Reiner, Alexander P; Boerwinkle, Eric; Psaty, Bruce M; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P; Cupples, L Adrienne; Kooperberg, Charles; Wilson, James G; Nickerson, Deborah A; Abecasis, Goncalo R; Rich, Stephen S; Tracy, Russell P; Willer, Cristen J

    2014-02-06

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  4. Functional Genomics Analysis of Singapore Grouper Iridovirus: Complete Sequence Determination and Proteomic Analysis

    PubMed Central

    Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong

    2004-01-01

    Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645

  5. Accurate and rapid modeling of iron-bleomycin-induced DNA damage using tethered duplex oligonucleotides and electrospray ionization ion trap mass spectrometric analysis.

    PubMed

    Harsch, A; Marzilli, L A; Bunt, R C; Stubbe, J; Vouros, P

    2000-05-01

    Bleomycin B2 (BLM) in the presence of iron [Fe(II)] and O2 catalyzes single-stranded (ss) and double-stranded (ds) cleavage of DNA. Electrospray ionization ion trap mass spectrometry was used to monitor these cleavage processes. Two duplex oligonucleotides containing an ethylene oxide tether between both strands were used in this investigation, allowing facile monitoring of all ss and ds cleavage events. A sequence for site-specific binding and cleavage by Fe-BLM was incorporated into each analyte. One of these core sequences, GTAC, is a known hot-spot for ds cleavage, while the other sequence, GGCC, is a hot-spot for ss cleavage. Incubation of each oligonucleotide under anaerobic conditions with Fe(II)-BLM allowed detection of the non-covalent ternary Fe-BLM/oligonucleotide complex in the gas phase. Cleavage studies were then performed utilizing O2-activated Fe(II)-BLM. No work-up or separation steps were required and direct MS and MS/MS analyses of the crude reaction mixtures confirmed sequence-specific Fe-BLM-induced cleavage. Comparison of the cleavage patterns for both oligonucleotides revealed sequence-dependent preferences for ss and ds cleavages in accordance with previously established gel electrophoresis analysis of hairpin oligonucleotides. This novel methodology allowed direct, rapid and accurate determination of cleavage profiles of model duplex oligonucleotides after exposure to activated Fe-BLM.

  6. Complete Chloroplast Genome of the Wollemi Pine (Wollemia nobilis): Structure and Evolution.

    PubMed

    Yap, Jia-Yee S; Rohner, Thore; Greenfield, Abigail; Van Der Merwe, Marlien; McPherson, Hannah; Glenn, Wendy; Kornfeld, Geoff; Marendy, Elessa; Pan, Annie Y H; Wilton, Alan; Wilkins, Marc R; Rossetto, Maurizio; Delaney, Sven K

    2015-01-01

    The Wollemi pine (Wollemia nobilis) is a rare Southern conifer with striking morphological similarity to fossil pines. A small population of W. nobilis was discovered in 1994 in a remote canyon system in the Wollemi National Park (near Sydney, Australia). This population contains fewer than 100 individuals and is critically endangered. Previous genetic studies of the Wollemi pine have investigated its evolutionary relationship with other pines in the family Araucariaceae, and have suggested that the Wollemi pine genome contains little or no variation. However, these studies were performed prior to the widespread use of genome sequencing, and their conclusions were based on a limited fraction of the Wollemi pine genome. In this study, we address this problem by determining the entire sequence of the W. nobilis chloroplast genome. A detailed analysis of the structure of the genome is presented, and the evolution of the genome is inferred by comparison with the chloroplast sequences of other members of the Araucariaceae and the related family Podocarpaceae. Pairwise alignments of whole genome sequences, and the presence of unique pseudogenes, gene duplications and insertions in W. nobilis and Araucariaceae, indicate that the W. nobilis chloroplast genome is most similar to that of its sister taxon Agathis. However, the W. nobilis genome contains an unusually high number of repetitive sequences, and these could be used in future studies to investigate and conserve any remnant genetic diversity in the Wollemi pine.

  7. Arabidopsis thaliana type I and II chaperonins.

    PubMed

    Hill, J E; Hemmingsen, S M

    2001-07-01

    An examination of the Arabidopsis thaliana genome sequence led to the identification of 29 predicted genes with the potential to encode members of the chaperonin family of chaperones (CPN60 and CCT), their associated cochaperonins, and the cytoplasmic chaperonin cofactor prefoldin. These comprise the first complete set of plant chaperonin protein sequences and indicate that the CPN family is more diverse than previously described. In addition to surprising sequence diversity within CPN subclasses, the genomic data also suggest the existence of previously undescribed family members, including a 10-kDa chloroplast cochaperonin. Consideration of the sequence data described in this review prompts questions about the complexities of plant CPN systems and the evolutionary relationships and functions of the component proteins, most of which have not been studied experimentally.

  8. A new species of cellular slime mold from southern Portugal based on morphology, ITS and SSU sequences.

    PubMed

    Romeralo, M; Baldauf, S L; Cavender, J C

    2009-01-01

    While sampling soils for dictyostelids in southern Portugal, we found an isolate whose morphology differed from that of any previously described species of the group. We sequenced the internal transcribed spacer (ITS) and small subunit (SSU) genes of the nuclear ribosomal RNA and found that both sequences are distinct from all previously described sequences. Phylogenetic analyses place the new species in dictyostelid Group 3 (Rhizostelids) together with D. potamoides, with which it shares 65.8% identity for ITS and 96.6% for SSU. In this paper we describe a new species of cellular slime mold, Dictyostelium ibericum, based on morphological and molecular characters. It is a small species with polar granules in its spores.

  9. Molecular detection and characterization of Hepatozoon spp. in dogs from the central part of Turkey.

    PubMed

    Aydin, Mehmet Fatih; Sevinc, Ferda; Sevinc, Mutlu

    2015-04-01

    Canine hepatozoonosis is a tick-borne protozoal disease caused by Hepatozoon spp. Two species of Hepatozoon are currently known to infect dogs: Hepatozoon canis and H. americanum. Although H. canis generally causes a chronic infection with relatively mild clinical alterations compared to H. americanum, infection by H. canis can be life-threatening. The disease is widespread in the USA, Africa, Europe, South America, and Asia. To determine the frequency of infection with Hepatozoon spp. in stray dogs from the Central Anatolia Region of Turkey, a total of 221 blood samples collected over a three-year period were evaluated using a genus-specific polymerase chain reaction (PCR) designed to amplify a 666-bp fragment of the 18S rRNA gene of Hepatozoon spp. Eight (3.61%) blood samples were positive for Hepatozoon spp. For species classification, all positive PCR products were purified with a PCR purification kit and sequenced. Sequencing results of the eight representative amplicons indicated that 6 were 98-99% identical to the sequence of H. canis and the other 2 sequences were 95-97% identical to the sequence of Hepatozoon spp.; the latter genotype was therefore named Hepatozoon sp. MF. A phylogenetic tree was constructed from the sequences of the tick-borne agents identified previously and in this study using the neighbor-joining method. The nucleotide sequences were compared to the H. canis sequences reported in Turkey using the nucleotide Basic Local Alignment Search Tool (BLAST) program. The results of this study are significant in terms of the presence of a novel canine Hepatozoon genotype. Copyright © 2015 Elsevier GmbH. All rights reserved.

  10. HeLa Nucleic Acid Contamination in The Cancer Genome Atlas Leads to the Misidentification of Human Papillomavirus 18

    PubMed Central

    Cantalupo, Paul G.; Katz, Joshua P.

    2015-01-01

    ABSTRACT We searched The Cancer Genome Atlas (TCGA) database for viruses by comparing non-human reads present in transcriptome sequencing (RNA-Seq) and whole-exome sequencing (WXS) data to viral sequence databases. Human papillomavirus 18 (HPV18) is an etiologic agent of cervical cancer, and as expected, we found robust expression of HPV18 genes in cervical cancer samples. In agreement with previous studies, we also found HPV18 transcripts in non-cervical cancer samples, including those from the colon, rectum, and normal kidney. However, in each of these cases, HPV18 gene expression was low, and single-nucleotide variants and positions of genomic alignments matched the integrated portion of HPV18 present in HeLa cells. Chimeric reads that match a known virus-cell junction of HPV18 integrated in HeLa cells were also present in some samples. We hypothesize that HPV18 sequences in these non-cervical samples are due to nucleic acid contamination from HeLa cells. This finding highlights the problems that contamination presents in computational virus detection pipelines. IMPORTANCE Viruses associated with cancer can be detected by searching tumor sequence databases. Several studies involving searches of the TCGA database have reported the presence of HPV18, a known cause of cervical cancer, in a small number of additional cancers, including those of the rectum, kidney, and colon. We have determined that the sequences related to HPV18 in non-cervical samples are due to nucleic acid contamination from HeLa cells. To our knowledge, this is the first report of the misidentification of viruses in next-generation sequencing data of tumors due to contamination with a cancer cell line. These results raise awareness of the difficulty of accurately identifying viruses in human sequence databases. PMID:25631090

  11. Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships

    PubMed Central

    Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.

    2015-01-01

    Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456

  12. An improved host-vector system for Candida maltosa using a gene isolated from its genome that complements the his5 mutation of Saccharomyces cerevisiae.

    PubMed

    Hikiji, T; Ohkuma, M; Takagi, M; Yano, K

    1989-10-01

    The host-vector system of an n-alkane-assimilating-yeast, Candida maltosa, which we previously constructed using an autonomously replicating sequence (ARS) region isolated from the genome of this yeast, utilizes C. maltosa J288 (leu2-) as a host. As this host had a serious growth defect on n-alkane, we developed an improved host-vector system using C. maltosa CH1 (his-) as host. The vectors were constructed with the Candida ARS region and a DNA fragment isolated from the genome of C. maltosa. Since this DNA fragment could complement histidine auxotrophy of both C. maltosa CH1 and S. cerevisiae (his5-), we termed the gene contained in this DNA fragment C-HIS5. The vectors were characterized in terms of transformation frequency and stability, and the nucleotide sequence of C-HIS5 was determined. The deduced amino acid sequence (389 residues) shared 51% homology with that of HIS5 of S. cerevisiae (384 residues; Nishiwaki et al. 1987).

  13. Detection of canine cytokine gene expression by reverse transcription-polymerase chain reaction.

    PubMed

    Pinelli, E; van der Kaaij, S Y; Slappendel, R; Fragio, C; Ruitenberg, E J; Bernadina, W; Rutten, V P

    1999-08-02

    Further characterization of the canine immune system will greatly benefit from the availability of tools to detect canine cytokines. Our interest is in the role of cytokines in canine visceral leishmaniasis. For this purpose, we have designed specific primers using previously published sequences for the detection of canine IL-2, IFN-gamma and IL-10 mRNA by reverse transcription-polymerase chain reaction (RT-PCR). For IL-4, we have cloned and sequenced this cytokine gene, and developed canine-specific primers. To control for sample-to-sample variation in the quantity of mRNA and variation in the RT and PCR reactions, the mRNA levels of glyceraldehyde-3-phosphate dehydrogenase (G3PDH), a housekeeping gene, were determined in parallel. Primers to amplify G3PDH were designed from consensus sequences obtained from the Genbank database. The mRNA levels of the cytokines mentioned here were detected from ConA-stimulated peripheral mononuclear cells derived from Leishmania-infected dogs. A different pattern of cytokine production among infected animals was found.

  14. Antarctic ice core samples: culturable bacterial diversity.

    PubMed

    Shivaji, Sisinthy; Begum, Zareena; Shiva Nageswara Rao, Singireesu Soma; Vishnu Vardhan Reddy, Puram V; Manasa, Poorna; Sailaja, Buddi; Prathiba, Mambatta S; Thamban, Meloth; Krishnan, Kottekkatu P; Singh, Shiv M; Srinivas, Tanuku N R

    2013-01-01

    Culturable bacterial abundance at 11 different depths of a 50.26 m ice core from the Tallaksenvarden Nunatak, Antarctica, varied from 0.02 to 5.8 × 10^3 CFU ml^-1 of the melt water. A total of 138 bacterial strains were recovered from the 11 different depths of the ice core. Based on 16S rRNA gene sequence analyses, the 138 isolates could be categorized into 25 phylotypes belonging to phyla Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria. All isolates had 16S rRNA sequences similar to previously determined sequences (97.2-100%). No correlation was observed in the distribution of the isolates at the various depths either at the phylum, genus or species level. The 25 phylotypes varied in growth temperature range, tolerance to NaCl, growth pH range and ability to produce eight different extracellular enzymes at either 4 or 18 °C. Iso-, anteiso-, unsaturated and saturated fatty acids together constituted a significant proportion of the total fatty acid composition. Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  15. Isolation and whole genome sequencing of a Ruminococcus-like bacterium, associated with irritable bowel syndrome.

    PubMed

    Hynönen, Ulla; Rasinkangas, Pia; Satokari, Reetta; Paulin, Lars; de Vos, Willem M; Pietilä, Taija E; Kant, Ravi; Palva, Airi

    2016-06-01

    In our previous studies on the intestinal microbiota in irritable bowel syndrome (IBS), we identified a bacterial phylotype with higher abundance in patients suffering from diarrhea than in healthy controls. In the present work, we have isolated in pure culture strain RT94, belonging to this phylotype, determined its whole genome sequence and performed an extensive genomic analysis and phenotypical testing. This revealed strain RT94 to be a strict anaerobe apparently belonging to a novel species with only 94% similarity in the 16S rRNA gene sequence to the closest relatives Ruminococcus torques and Ruminococcus lactaris. The G + C content of strain RT94 is 45.2 mol% and the major long-chain cellular fatty acids are C16:0, C18:0 and C14:0. The isolate is metabolically versatile but not a mucus or cellulose utilizer. It produces acetate, ethanol, succinate, lactate and formate, but very little butyrate, as end products of glucose metabolism. The mechanisms underlying the association of strain RT94 with diarrhea-type IBS are discussed. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Characterization of culturable anaerobic bacteria from the forestomach of an eastern grey kangaroo, Macropus giganteus.

    PubMed

    Ouwerkerk, D; Klieve, A V; Forster, R J; Templeton, J M; Maguire, A J

    2005-01-01

    To determine the culturable biodiversity of anaerobic bacteria isolated from the forestomach contents of an eastern grey kangaroo, Macropus giganteus, using phenotypic characterization and 16S rDNA sequence analysis. Bacteria from forestomach contents of an eastern grey kangaroo were isolated using anaerobic media containing milled curly Mitchell grass (Astrebla lappacea). DNA was extracted and the 16S rDNA sequenced for phylogenetic analysis. Forty bacterial isolates were obtained and placed in 17 groups based on phenotypic characteristics and restriction enzyme digestion of 16S rDNA PCR products. DNA sequencing revealed that the 17 groups comprised five known species (Clostridium butyricum, Streptococcus bovis, Clostridium sporogenes, Clostridium paraputrificum and Enterococcus avium) and 12 groups apparently representing new species, all within the phylum Firmicutes. Foregut contents from Australian macropod marsupials contain a microbial ecosystem with a novel bacterial biodiversity comprising a high percentage of previously unrecognized species. This study adds to knowledge of Australia's unique biodiversity, which may provide a future bioresource of genetic information and bacterial species of benefit to agriculture.

  17. Quantitation of normal CFTR mRNA in CF patients with splice-site mutations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Z.; Olsen, J.C.; Silverman, L.M.

    Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcripts. The remaining CFTR cDNA sequences, other than the 84 bp insertion or exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species; γ-actin cDNA was amplified as a standard.

  18. Rolling chain amplification based signal-enhanced electrochemical aptasensor for ultrasensitive detection of ochratoxin A.

    PubMed

    Huang, Lin; Wu, Jingjing; Zheng, Lei; Qian, Haisheng; Xue, Feng; Wu, Yucheng; Pan, Daodong; Adeloju, Samuel B; Chen, Wei

    2013-11-19

    A novel electrochemical aptasensor is described for rapid and ultrasensitive detection of ochratoxin A (OTA) based on signal enhancement with rolling circle amplification (RCA). The primer for RCA was designed as a two-part sequence: one part was the aptamer sequence directed against OTA, while the other part was complementary to the capture probe on the electrode surface. In the presence of target OTA, the primer, originally hybridized with the RCA padlock, is displaced to bind OTA. This inhibits RCA and decreases the OTA sensing signal obtained with the electrochemical aptasensor. Under the optimized conditions, ultrasensitive detection of OTA was achieved with a limit of detection (LOD) of 0.065 ppt (pg/mL), which is much lower than previously reported. The electrochemical aptasensor was also successfully applied to the determination of OTA in wine samples. This ultrasensitive electrochemical aptasensor is of great practical importance in food safety and could be widely extended to the detection of other toxins by replacing the sequence of the recognition aptamer.

  19. Female pseudohermaphroditism with multiple caudal anomalies: Absence of Y-specific DNA sequences as pathogenetic factors

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Seaver, L.H.; Grimes, J.; Erickson, R.P.

    1994-05-15

    46,XX female pseudohermaphrodites have been previously described with nearly complete masculinization of the external genitalia and no apparent source of testosterone. Multiple malformations of internal genital, urinary, and gastrointestinal tracts are associated. We have evaluated four such infants with female pseudohermaphroditism and multiple caudal anomalies. Three cases had apparently normal chromosomes (46,XX); one had a 46,XX,del(10)(q25.3→qter) chromosome constitution. The chromosome breakpoint is in the region of PAX2, a developmentally important paired box gene which is expressed in urogenital tissue. Using the polymerase chain reaction, we screened for the presence of multiple Y specific sequences, including SRY (sex determining region, Y chromosome), that could explain masculinization of the external genitalia. All were negative for Y centromeric sequences, ZFY (Zinc finger Y), and SRY. Furthermore, there was no evidence for adrenal or other sources of testosterone. We suggest that the masculinization in these cases is the result of abnormal expression of genes which would normally be regulated by testosterone. 32 refs., 1 fig., 2 tabs.

  20. A molecular epidemiological investigation of avian paramyxovirus type 1 viruses isolated from game birds of the order Galliformes.

    PubMed

    Aldous, E W; Mynn, J K; Irvine, R M; Alexander, D J; Brown, I H

    2010-12-01

    The partial (370 nucleotides) fusion gene sequences of 55 avian paramyxovirus type 1 (APMV-1) isolates were obtained. Included were 41 published sequences, of which 16 were from strains of APMV-1 of previously determined lineages included as markers for the data analysed and 25 were from APMV-1 viruses isolated from game birds of the order Galliformes. In addition, we sequenced a further 14 game bird isolates obtained from the repository at the Veterinary Laboratories Agency. The game bird isolates had been obtained from 17 countries, and spanned four decades. Earlier studies have shown that class II APMV-1 viruses can be divided into at least 15 lineages and sub-lineages. Phylogenetic analysis revealed that the 39 game bird isolates were distributed across 12 of these sub-lineages. We conclude that no single lineage of Newcastle disease viruses appears to be prevalent in game birds, and the isolates obtained from these hosts reflected the prevailing, both geographically and temporally, viruses in poultry, pigeons or wild birds.

  1. Clinical features of X linked juvenile retinoschisis associated with new mutations in the XLRS1 gene in Italian families

    PubMed Central

    Simonelli, F; Cennamo, G; Ziviello, C; Testa, F; de Crecchio, G; Nesti, A; Manitto, M P; Ciccodicola, A; Banfi, S; Brancato, R; Rinaldi, E

    2003-01-01

    Aims: To describe the clinical phenotype of X linked juvenile retinoschisis in eight Italian families with six different mutations in the XLRS1 gene. Methods: Complete ophthalmic examinations, electroretinography and A and B-scan standardised echography were performed in 18 affected males. The coding sequences of the XLRS1 gene were amplified by polymerase chain reaction and directly sequenced on an automated sequencer. Results: Six different XLRS1 mutations were identified; two of these mutations, Ile81Asn and Trp122Cys, have not been previously described. The affected males showed an electronegative response to the standard white scotopic stimulus and a prolonged implicit time of the 30 Hz flicker. In the families with Trp112Cys and Trp122Cys mutations we observed a more severe retinoschisis (RS) clinical picture compared with the other genotypes. Conclusion: The severe RS phenotypes associated with the Trp112Cys and Trp122Cys mutations suggest that these mutations determine a notable alteration in the function of the retinoschisin protein. PMID:12928282

  2. Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data.

    PubMed

    Nishito, Yukari; Osana, Yasunori; Hachiya, Tsuyoshi; Popendorf, Kris; Toyoda, Atsushi; Fujiyama, Asao; Itaya, Mitsuhiro; Sakakibara, Yasubumi

    2010-04-16

    Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for gamma-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. 
Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http://natto-genome.org/.

  3. PredPPCrys: Accurate Prediction of Sequence Cloning, Protein Production, Purification and Crystallization Propensity from Protein Sequences Using Multi-Step Heterogeneous Feature Fusion and Selection

    PubMed Central

    Wang, Huilin; Wang, Mingjun; Tan, Hao; Li, Yuan; Zhang, Ziding; Song, Jiangning

    2014-01-01

    X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed ‘PredPPCrys’ using the support vector machine (SVM). Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I). Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II), which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. 
In addition, the predicted crystallization targets of currently non-crystallizable proteins were provided as compendium data, which are anticipated to facilitate target selection and design for the worldwide structural genomics consortium. PredPPCrys is freely available at http://www.structbioinfor.org/PredPPCrys. PMID:25148528

  4. Distinctive acceptor-end structure and other determinants of Escherichia coli tRNAPro identity.

    PubMed Central

    McClain, W H; Schneider, J; Gabriel, K

    1994-01-01

    The previously uncharacterized determinants of the specificity of tRNAPro for aminoacylation (tRNAPro identity) were defined by a computer comparison of all Escherichia coli tRNA sequences and tested by a functional analysis of amber suppressor tRNAs in vivo. We determined the amino acid specificity of tRNA by sequencing a suppressed protein and the aminoacylation efficiency of tRNA by examining the steady-state level of aminoacyl-tRNA. On substituting nucleotides derived from the acceptor end and variable pocket of tRNAPro for the corresponding nucleotides in a tRNAPhe gene, the identity of the resulting tRNA changed substantially but incompletely to that of tRNAPro. The redesigned tRNAPhe was weakly active and aminoacyl-tRNA was not detected. Ethyl methanesulfonate mutagenesis of the redesigned tRNAPhe gene produced a mutant with a wobble pair in place of a base pair in the end of the acceptor-stem helix of the transcribed tRNA. This mutant exhibited both a tRNAPro identity and substantial aminoacyl-tRNA. The results speak for the importance of a distinctive conformation in the acceptor-stem helix of tRNAPro for aminoacylation by the prolyl-tRNA synthetase. The anticodon also contributes to tRNAPro identity but is not necessary in vivo. PMID:8127693

  5. An N-terminal glycine-rich sequence contributes to retrovirus trimer of hairpins stability

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilson, Kirilee A.; Maerz, Anne L.; Baer, Severine

    2007-08-10

    Retroviral transmembrane proteins (TMs) contain a glycine-rich segment linking the N-terminal fusion peptide and coiled coil core. Previously, we reported that the glycine-rich segment (Met-326-Ser-337) of the human T-cell leukemia virus type 1 (HTLV-1) TM, gp21, is a determinant of membrane fusion function [K.A. Wilson, S. Baer, A.L. Maerz, M. Alizon, P. Poumbourios, The conserved glycine-rich segment linking the N-terminal fusion peptide to the coiled coil of human T-cell leukemia virus type 1 transmembrane glycoprotein gp21 is a determinant of membrane fusion function, J. Virol. 79 (2005) 4533-4539]. Here we show that the reduced fusion activity of an I334A mutant correlated with a decrease in stability of the gp21 trimer of hairpins conformation, in the context of a maltose-binding protein-gp21 chimera. The stabilizing influence of Ile-334 required the C-terminal membrane-proximal sequence Trp-431-Ser-436. Proline substitution of four of five Gly residues altered gp21 trimer of hairpins stability. Our data indicate that flexibility within and hydrophobic interactions mediated by this region are determinants of gp21 stability and membrane fusion function.

  6. PMS2 gene mutation results in DNA mismatch repair system failure in a case of adult granulosa cell tumor.

    PubMed

    Wang, Wen-Chung; Lee, Ya-Ting; Lai, Yen-Chein

    2017-03-27

    Granulosa cell tumors are rare ovarian malignancies. Their characteristics include unpredictable indolent growth with malignant potential and late recurrence. Approximately 95% are of adult type. Recent molecular studies have characterized the FOXL2 402C > G mutation in adult granulosa cell tumor. Our previous case report showed that unique FOXL2 402C > G mutation and defective DNA mismatch repair system are associated with the development of adult granulosa cell tumor. In this study, the DNA sequences of four genes, MSH2, MLH1, MSH6, and PMS2, in the DNA mismatch repair system were determined via direct sequencing to elucidate the exact mechanism for the development of this granulosa cell tumor. The results showed that two missense germline mutations, T485K and N775L, inactivate the PMS2 gene. The results of this case study indicated that although FOXL2 402C > G mutation determines the development of granulosa cell tumor, PMS2 mutation may be the initial driver of carcinogenesis. Immunohistochemistry-based tumor testing for mismatch repair gene expression may be necessary for granulosa cell tumors to determine their malignant potential or if they are part of Lynch syndrome.

  7. Clonal architecture of secondary acute myeloid leukemia defined by single-cell sequencing.

    PubMed

    Hughes, Andrew E O; Magrini, Vincent; Demeter, Ryan; Miller, Christopher A; Fulton, Robert; Fulton, Lucinda L; Eades, William C; Elliott, Kevin; Heath, Sharon; Westervelt, Peter; Ding, Li; Conrad, Donald F; White, Brian S; Shao, Jin; Link, Daniel C; DiPersio, John F; Mardis, Elaine R; Wilson, Richard K; Ley, Timothy J; Walter, Matthew J; Graubert, Timothy A

    2014-07-01

    Next-generation sequencing has been used to infer the clonality of heterogeneous tumor samples. These analyses yield specific predictions (the population frequency of individual clones, their genetic composition, and their evolutionary relationships), which we set out to test by sequencing individual cells from three subjects diagnosed with secondary acute myeloid leukemia, each of whom had been previously characterized by whole genome sequencing of unfractionated tumor samples. Single-cell mutation profiling strongly supported the clonal architecture implied by the analysis of bulk material. In addition, it resolved the clonal assignment of single nucleotide variants that had been initially ambiguous and identified areas of previously unappreciated complexity. Accordingly, we find that many of the key assumptions underlying the analysis of tumor clonality by deep sequencing of unfractionated material are valid. Furthermore, we illustrate a single-cell sequencing strategy for interrogating the clonal relationships among known variants that is cost-effective, scalable, and adaptable to the analysis of both hematopoietic and solid tumors, or any heterogeneous population of cells.

  8. Use of Genome Sequence Information for Meat Quality Trait QTL Mining for Causal Genes and Mutations on Pig Chromosome 17

    PubMed Central

    Hu, Zhi-Liang; Ramos, Antonio M.; Humphray, Sean J.; Rogers, Jane; Reecy, James M.; Rothschild, Max F.

    2011-01-01

    The newly available pig genome sequence has provided new information to fine map quantitative trait loci (QTL) in order to eventually identify causal variants. With targeted genomic sequencing efforts, we were able to obtain high quality BAC sequences that cover a region on pig chromosome 17 where a number of meat quality QTL have been previously discovered. Sequences from 70 BAC clones were assembled to form an 8-Mbp contig. Subsequently, we successfully mapped five previously identified QTL, three for meat color and two for lactate related traits, to the contig. With an additional 25 genetic markers that were identified by sequence comparison, we were able to carry out further linkage disequilibrium analysis to narrow down the genomic locations of these QTL, which allowed identification of the chromosomal regions that likely contain the causative variants. This research has provided one practical approach to combine genetic and molecular information for QTL mining. PMID:22303339

  9. DIALIGN P: fast pair-wise and multiple sequence alignment using parallel processors.

    PubMed

    Schmollinger, Martin; Nieselt, Kay; Kaufmann, Michael; Morgenstern, Burkhard

    2004-09-09

    Parallel computing is frequently used to speed up computationally expensive tasks in Bioinformatics. Herein, a parallel version of the multi-alignment program DIALIGN is introduced. We propose two ways of dividing the program into independent sub-routines that can be run on different processors: (a) pair-wise sequence alignments that are used as a first step to multiple alignment account for most of the CPU time in DIALIGN. Since alignments of different sequence pairs are completely independent of each other, they can be distributed to multiple processors without any effect on the resulting output alignments. (b) For alignments of large genomic sequences, we use a heuristic that splits sequences into sub-sequences based on a previously introduced anchored alignment procedure. For our test sequences, this combined approach reduces the program running time of DIALIGN by up to 97%. By distributing sub-routines to multiple processors, the running time of DIALIGN can be substantially reduced. With these improvements, it is possible to apply the program in large-scale genomics and proteomics projects that were previously beyond its scope.
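
    Strategy (a) above relies on every sequence pair being an independent task. The following is a minimal Python sketch of that idea only, not of DIALIGN itself: the scoring function is a hypothetical placeholder (an ungapped count of identical positions), the sequence names and data are invented for illustration, and a thread pool is used for simplicity where a real CPU-bound alignment would more likely use separate processes or nodes.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations


def toy_pair_score(pair):
    """Placeholder for a pairwise alignment (NOT DIALIGN's algorithm):
    counts identical characters at the same position, without gaps."""
    (name_a, seq_a), (name_b, seq_b) = pair
    score = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
    return (name_a, name_b), score


def score_all_pairs(sequences, workers=1):
    """Score every unordered pair of sequences.

    Each pair is handled as an independent task, so the result does not
    depend on how many workers the tasks are spread across.
    """
    pairs = list(combinations(sorted(sequences.items()), 2))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(toy_pair_score, pairs))


# Hypothetical example data.
sequences = {"s1": "ACGTAC", "s2": "ACGTTC", "s3": "TCGTAC"}
serial = score_all_pairs(sequences, workers=1)
parallel = score_all_pairs(sequences, workers=3)
```

    Because no task reads another task's result, the serial and parallel runs return the same dictionary; this is the property the abstract points to when it says distribution has no effect on the output alignments.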

  10. Antimicrobial susceptibility and genetic characteristics of Neisseria gonorrhoeae isolates from Vietnam, 2011

    PubMed Central

    2013-01-01

    Background: Antimicrobial resistance (AMR) in Neisseria gonorrhoeae is a major public health concern worldwide. In Vietnam, knowledge regarding N. gonorrhoeae prevalence and AMR is limited, and data concerning genetic characteristics of N. gonorrhoeae are entirely lacking. Herein, we investigated the phenotypic AMR (previous, current and possible future treatment options), genetic resistance determinants for extended-spectrum cephalosporins (ESCs), and genotypic distribution of N. gonorrhoeae isolated in 2011 in Hanoi, Vietnam. Methods: N. gonorrhoeae isolates collected in Hanoi, Vietnam, in 2011 (n = 108) were examined using antibiograms (Etest for 10 antimicrobials), Neisseria gonorrhoeae multi-antigen sequence typing (NG-MAST), and sequencing of ESC resistance determinants (penA, mtrR and penB). Results: The levels of in vitro resistance were as follows: ciprofloxacin 98%, tetracycline 82%, penicillin G 48%, azithromycin 11%, ceftriaxone 5%, cefixime 1%, and spectinomycin 0%. The MICs of gentamicin (0.023-6 mg/L), ertapenem (0.002-0.125 mg/L) and solithromycin (<0.016-0.25 mg/L) were relatively low. No penA mosaic alleles were found; however, 78% of the isolates contained an alteration of amino acid A501 (A501V (44%) and A501T (34%)) in the encoded penicillin-binding protein 2. A single nucleotide (A) deletion in the inverted repeat of the promoter region of the mtrR gene and amino acid alterations in MtrR were observed in 91% and 94% of the isolates, respectively. penB resistance determinants were detected in 87% of the isolates. Seventy-five different NG-MAST STs were identified, of which 59 STs have not been previously described. Conclusions: In Vietnam, the highly diversified gonococcal population displayed high in vitro resistance to antimicrobials previously recommended for gonorrhoea treatment (with the exception of spectinomycin), and resistance to the currently recommended ESCs was also found. 
Nevertheless, the MICs of three potential future treatment options were low. It is essential to strengthen the diagnostics, case reporting, and epidemiologic surveillance of gonorrhoea in Vietnam. Furthermore, it is imperative to reinforce the surveillance of gonococcal AMR and gonorrhoea treatment failures. Research regarding novel antimicrobial treatment strategies (e.g., combination therapy) and new antimicrobials is crucial for future treatment of gonorrhoea. PMID:23351067

  11. Human papillomavirus genotyping by Linear Array and Next-Generation Sequencing in cervical samples from Western Mexico.

    PubMed

    Flores-Miramontes, María Guadalupe; Torres-Reyes, Luis Alberto; Alvarado-Ruíz, Liliana; Romero-Martínez, Salvador Angel; Ramírez-Rodríguez, Verenice; Balderas-Peña, Luz María Adriana; Vallejo-Ruíz, Verónica; Piña-Sánchez, Patricia; Cortés-Gutiérrez, Elva Irene; Jave-Suárez, Luis Felipe; Aguilar-Lemarroy, Adriana

    2015-10-06

    The Linear Array® (LA) genotyping test is one of the most used methodologies for Human papillomavirus (HPV) genotyping, in that it is able to detect 37 HPV genotypes and co-infections in the same sample. However, the assay is limited to a restricted number of HPV genotypes, and sequence variations in the detection region of the HPV probes could give false-negative results. Recently, 454 Next-Generation sequencing (NGS) technology has also been used efficiently for HPV genotyping; this methodology is based on massive sequencing of HPV fragments and is expected to be highly specific and sensitive. In this work, we studied HPV prevalence in cervixes of women in Western Mexico by LA and confirmed the genotypes found by NGS. Two hundred thirty-three cervical samples from women without cervical lesions (WCL, n = 48), with cervical intraepithelial neoplasia grade 1 (CIN I, n = 98), or with cervical cancer (CC, n = 87) were collected, DNA was extracted, and HPV positivity was determined by PCR amplification using PGMY09/11 primers. All HPV-positive samples were genotyped individually by LA. Additionally, pools of amplicons from the PGMY-PCR products were sequenced using 454 NGS technology. Results obtained by NGS were compared with those of LA for each group of samples. We identified 35 HPV genotypes, among which 30 were identified by both technologies; in addition, the HPV genotypes 32, 44, 74, 102 and 114 were detected by NGS. These latter genotypes, to our knowledge, have not been previously reported in the Mexican population. Furthermore, we found that LA did not detect, in some diagnosis groups, certain HPV genotypes included in the test, such as 6, 11, 16, 26, 35, 51, 58, 68, 73, and 89, which indicates possible variations at the species level. There are HPV genotypes in the Mexican population that cannot be detected by LA, which is, at present, the most complete commercial genotyping test. 
More studies are necessary to determine the impact of HPV-44, 74, 102 and 114 on the risk of developing CC. A greater number of samples must be analyzed by NGS for the most accurate determination of Mexican HPV variants.

  12. Initial sequence characterization of the rhabdoviruses of squamate reptiles, including a novel rhabdovirus from a caiman lizard (Dracaena guianensis)

    PubMed Central

    Wellehan, James F.X.; Pessier, Allan P.; Archer, Linda L.; Childress, April L.; Jacobson, Elliott R.; Tesh, Robert B.

    2012-01-01

    Rhabdoviruses infect a variety of hosts, including non-avian reptiles. Consensus PCR techniques were used to obtain partial RNA-dependent RNA polymerase gene sequence from five rhabdoviruses of South American lizards: Marco, Chaco, Timbo, Sena Madureira, and a rhabdovirus from a caiman lizard (Dracaena guianensis). The caiman lizard rhabdovirus formed inclusions in erythrocytes, which may be a route for infecting hematophagous insects. This is the first information on the behavior of a rhabdovirus in squamates. We also obtained sequence from two rhabdoviruses of Australian lizards, confirming the previous Charleville virus sequence and finding that, unlike a previous sequence report but in agreement with serologic reports, Almpiwar virus is clearly distinct from Charleville virus. Bayesian and maximum likelihood phylogenetic analysis revealed that most known rhabdoviruses of squamates cluster in the Almpiwar subgroup. The exception is Marco virus, which is found in the Hart Park group. PMID:22397930

  13. Genotyping by Sequencing Using Specific Allelic Capture to Build a High-Density Genetic Map of Durum Wheat

    PubMed Central

    Holtz, Yan; Ardisson, Morgane; Ranwez, Vincent; Besnard, Alban; Leroy, Philippe; Poux, Gérard; Roumet, Pierre; Viader, Véronique; Santoni, Sylvain; David, Jacques

    2016-01-01

    Targeted sequence capture is a promising technology which helps reduce costs for sequencing and genotyping numerous genomic regions in large sets of individuals. Bait sequences are designed to capture specific alleles previously discovered in parents or reference populations. We studied a set of 135 RILs originating from a cross between an emmer cultivar (Dic2) and a recent durum elite cultivar (Silur). Six thousand sequence baits were designed to target Dic2 vs. Silur polymorphisms discovered in a previous RNAseq study. These baits were exposed to genomic DNA of the RIL population. Eighty percent of the targeted SNPs were recovered, 65% of which were of high quality and coverage. The final high density genetic map consisted of more than 3,000 markers, whose genetic and physical mapping were consistent with those obtained with large arrays. PMID:27171472

  14. Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

    PubMed Central

    2011-01-01

    Background: Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results: A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2,302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. Conclusions: The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods, highlighting the need for deep-sequencing expression profiling methods. PMID:21592389

  15. PET Imaging Stability Measurements During Simultaneous Pulsing of Aggressive MR Sequences on the SIGNA PET/MR System.

    PubMed

    Deller, Timothy W; Khalighi, Mohammad Mehdi; Jansen, Floris P; Glover, Gary H

    2018-01-01

    The recent introduction of simultaneous whole-body PET/MR scanners has enabled new research taking advantage of the complementary information obtainable with PET and MRI. One such application is kinetic modeling, which requires high levels of PET quantitative stability. To accomplish the required PET stability levels, the PET subsystem must be sufficiently isolated from the effects of MR activity. Performance measurements have previously been published, demonstrating sufficient PET stability in the presence of MR pulsing for typical clinical use; however, PET stability during radiofrequency (RF)-intensive and gradient-intensive sequences has not previously been evaluated for a clinical whole-body scanner. In this work, PET stability of the GE SIGNA PET/MR was examined during simultaneous scanning of aggressive MR pulse sequences. Methods: PET performance tests were acquired with MR idle and during simultaneous MR pulsing. Recent system improvements mitigating RF interference and gain variation were used. A fast recovery fast spin echo MR sequence was selected for high RF power, and an echo planar imaging sequence was selected for its high heat-inducing gradients. Measurements were performed to determine PET stability under varying MR conditions using the following metrics: sensitivity, scatter fraction, contrast recovery, uniformity, count rate performance, and image quantitation. A final PET quantitative stability assessment for simultaneous PET scanning during functional MRI studies was performed with a spiral in-and-out gradient echo sequence. Results: Quantitation stability of a 68Ge flood phantom was demonstrated within 0.34%. Normalized sensitivity was stable during simultaneous scanning within 0.3%. Scatter fraction measured with a 68Ge line source in the scatter phantom was stable within the range of 40.4%-40.6%. Contrast recovery and uniformity were comparable for PET images acquired simultaneously with multiple MR conditions. Peak noise equivalent count rate was 224 kcps at an effective activity concentration of 18.6 kBq/mL, and the count rate curves and scatter fraction curve were consistent for the alternating MR pulsing states. A final test demonstrated quantitative stability during a spiral functional MRI sequence. Conclusion: PET stability metrics demonstrated that PET quantitation was not affected during simultaneous aggressive MRI. This stability enables demanding applications such as kinetic modeling. © 2018 by the Society of Nuclear Medicine and Molecular Imaging.

  16. Sex chromosome differentiation and the W- and Z-specific loci in Xenopus laevis.

    PubMed

    Mawaribuchi, Shuuji; Takahashi, Shuji; Wada, Mikako; Uno, Yoshinobu; Matsuda, Yoichi; Kondo, Mariko; Fukui, Akimasa; Takamatsu, Nobuhiko; Taira, Masanori; Ito, Michihiko

    2017-06-15

    Genetic sex-determining systems in vertebrates include two basic types of heterogamety: XX (female)/XY (male) and ZZ (male)/ZW (female) types. The African clawed frog Xenopus laevis has a ZZ/ZW-type sex-determining system. In this species, we previously identified a W-specific sex (female)-determining gene dmw, and specified W and Z chromosomes, which could be morphologically indistinguishable (homomorphic). In addition to dmw, we most recently discovered two genes, named scanw and ccdc69w, and one gene, named capn5z, in the W- and Z-specific regions, respectively. In this study, we revealed the detailed structures of the W/Z-specific loci and genes. Sequence analysis indicated that there is almost no sequence similarity between 278kb W-specific and 83kb Z-specific sequences on chromosome 2Lq32-33, where both the transposable elements are abundant. Synteny and phylogenetic analyses indicated that all the W/Z-specific genes might have emerged independently. Expression analysis demonstrated that scanw and ccdc69w or capn5z are expressed in early differentiating ZW gonads or testes, thereby suggesting possible roles in female or male development, respectively. Importantly, the sex-determining gene (SDG) dmw might have been generated after allotetraploidization, thereby indicating the construction of the new sex-determining system by dmw after species hybridization. Furthermore, by direct genotyping, we confirmed that diploid WW embryos developed into normal female frogs, which indicates that the Z-specific region is not essential for female development. Overall, these findings indicate that sex chromosome differentiation has started, although no heteromorphic sex chromosomes are evident yet, in X. laevis. Homologous recombination suppression might have promoted the accumulation of mutations and transposable elements, and enlarged the W/Z-specific regions, thereby resulting in differentiation of the W/Z chromosomes. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. The Genome Sequence of Avibacterium paragallinarum Strain CL Has a Large Repertoire of Insertion Sequence Elements.

    PubMed

    Horta-Valerdi, Guillermo; Sanchez-Alonso, Maria Patricia; Perez-Marquez, Victor M; Negrete-Abascal, Erasmo; Vaca-Pacheco, Sergio; Hernandez-Gonzalez, Ismael; Gomez-Lunar, Zulema; Olmedo-Álvarez, Gabriela; Vázquez-Cruz, Candelario

    2017-04-13

    The draft genome sequence of Avibacterium paragallinarum strain CL serovar C is reported here. The genome comprises 154 contigs corresponding to 2.4 Mb with 41% G+C content and many insertion sequence (IS) elements, a characteristic not previously reported in A. paragallinarum. Copyright © 2017 Horta-Valerdi et al.

  18. Host and viral determinants for MxB restriction of HIV-1 infection.

    PubMed

    Matreyek, Kenneth A; Wang, Weifeng; Serrao, Erik; Singh, Parmit Kumar; Levin, Henry L; Engelman, Alan

    2014-10-25

    Interferon-induced cellular proteins play important roles in the host response against viral infection. The Mx family of dynamin-like GTPases, which includes MxA and MxB, targets a wide variety of viruses. Despite considerable evidence demonstrating the breadth of antiviral activity of MxA, human MxB was only recently discovered to specifically inhibit lentiviruses. Here we assess both host and viral determinants that underlie MxB restriction of HIV-1 infection. Heterologous expression of MxB in human osteosarcoma cells potently inhibited HIV-1 infection (~12-fold), yet had little to no effect on divergent retroviruses. The anti-HIV effect manifested as a partial block in the formation of 2-long terminal repeat circle DNA and hence nuclear import, and we accordingly found evidence for an additional post-nuclear entry block. A large number of previously characterized capsid mutations, as well as mutations that abrogated integrase activity, counteracted MxB restriction. MxB expression suppressed integration into gene-enriched regions of chromosomes, similar to effects observed previously when cells were depleted for nuclear transport factors such as transportin 3. MxB activity did not require predicted GTPase active site residues or a series of unstructured loops within the stalk domain that confer functional oligomerization to related dynamin family proteins. In contrast, we observed an N-terminal stretch of residues in MxB to harbor key determinants. Protein localization conferred by a nuclear localization signal (NLS) within the N-terminal 25 residues, which was critical, was fully rescuable by a heterologous NLS. Consistent with this observation, a heterologous nuclear export sequence (NES) abolished full-length MxB activity. We additionally mapped sub-regions within amino acids 26-90 that contribute to MxB activity, finding sequences present within residues 27-50 particularly important. MxB inhibits HIV-1 by interfering with at least two steps of infection, nuclear entry and post-nuclear trafficking and/or integration, without destabilizing the inherent catalytic activity of viral preintegration complexes. Putative MxB GTPase active site residues and stalk domain Loop 4 -- both previously shown to be necessary for MxA function -- were dispensable for MxB antiviral activity. Instead, we highlight subcellular localization and an as-yet-undetermined function(s) present in the unique MxB N-terminal region to be required for HIV-1 restriction.

  19. Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis.

    PubMed

    Asgari, Samira; McLaren, Paul J; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R; Abarca, Katia; Gelderman, Kyra A; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J

    2016-01-01

    One out of three pediatric sepsis deaths in high income countries occurs in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa (P. aeruginosa) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B. This study demonstrates that exome sequencing allows the identification of rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs.

  20. Exome Sequencing Reveals Primary Immunodeficiencies in Children with Community-Acquired Pseudomonas aeruginosa Sepsis

    PubMed Central

    Asgari, Samira; McLaren, Paul J.; Peake, Jane; Wong, Melanie; Wong, Richard; Bartha, Istvan; Francis, Joshua R.; Abarca, Katia; Gelderman, Kyra A.; Agyeman, Philipp; Aebi, Christoph; Berger, Christoph; Fellay, Jacques; Schlapbach, Luregn J.; Posfay-Barbe, Klara

    2016-01-01

    One out of three pediatric sepsis deaths in high income countries occurs in previously healthy children. Primary immunodeficiencies (PIDs) have been postulated to underlie fulminant sepsis, but this concept remains to be confirmed in clinical practice. Pseudomonas aeruginosa (P. aeruginosa) is a common bacterium mostly associated with health care-related infections in immunocompromised individuals. However, in rare cases, it can cause sepsis in previously healthy children. We used exome sequencing and bioinformatic analysis to systematically search for genetic factors underpinning severe P. aeruginosa infection in the pediatric population. We collected blood samples from 11 previously healthy children, with no family history of immunodeficiency, who presented with severe sepsis due to community-acquired P. aeruginosa bacteremia. Genomic DNA was extracted from blood or tissue samples obtained intravitam or postmortem. We obtained high-coverage exome sequencing data and searched for rare loss-of-function variants. After rigorous filtrations, 12 potentially causal variants were identified. Two out of eight (25%) fatal cases were found to carry novel pathogenic variants in PID genes, including BTK and DNMT3B. This study demonstrates that exome sequencing allows the identification of rare, deleterious human genetic variants responsible for fulminant sepsis in apparently healthy children. Diagnosing PIDs in such patients is of high relevance to survivors and affected families. We propose that unusually severe and fatal sepsis cases in previously healthy children should be considered for exome/genome sequencing to search for underlying PIDs. PMID:27703454
