Sample records for unique sequence characteristics

  1. Methods for determining the genetic affinity of microorganisms and viruses

    NASA Technical Reports Server (NTRS)

    Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

    2012-01-01

    Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.

  2. Unique Trichomonas vaginalis gene sequences identified in multinational regions of Northwest China.

    PubMed

    Liu, Jun; Feng, Meng; Wang, Xiaolan; Fu, Yongfeng; Ma, Cailing; Cheng, Xunjia

    2017-07-24

    Trichomonas vaginalis (T. vaginalis) is a flagellated protozoan parasite that infects humans worldwide. This study determined the sequence of the 18S ribosomal RNA gene of T. vaginalis infecting both females and males in Xinjiang, China. Samples from 73 females and 28 males were collected and confirmed for infection with T. vaginalis, a total of 110 sequences were identified when the T. vaginalis 18S ribosomal RNA gene was sequenced. These sequences were used to prepare a phylogenetic network. The rooted network comprised three large clades and several independent branches. Most of the Xinjiang sequences were in one group. Preliminary results suggest that Xinjiang T. vaginalis isolates might be genetically unique, as indicated by the sequence of their 18S ribosomal RNA gene. Low migration rate of local people in this province may contribute to a genetic conservativeness of T. vaginalis. The unique genetic feature of our isolates may suggest a different clinical presentation of trichomoniasis, including metronidazole susceptibility, T. vaginalis virus or Mycoplasma co-infection characteristics. The transmission and evolution of Xinjiang T. vaginalis is of interest and should be studied further. More attention should be given to T. vaginalis infection in both females and males in Xinjiang.

  3. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses.

    PubMed

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-11-26

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics.

  4. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses

    PubMed Central

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-01-01

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics. PMID:27898037

  5. Production of Supra-regular Spatial Sequences by Macaque Monkeys.

    PubMed

    Jiang, Xinjian; Long, Tenghai; Cao, Weicong; Li, Junru; Dehaene, Stanislas; Wang, Liping

    2018-06-18

    Understanding and producing embedded sequences in language, music, or mathematics, is a central characteristic of our species. These domains are hypothesized to involve a human-specific competence for supra-regular grammars, which can generate embedded sequences that go beyond the regular sequences engendered by finite-state automata. However, is this capacity truly unique to humans? Using a production task, we show that macaque monkeys can be trained to produce time-symmetrical embedded spatial sequences whose formal description requires supra-regular grammars or, equivalently, a push-down stack automaton. Monkeys spontaneously generalized the learned grammar to novel sequences, including longer ones, and could generate hierarchical sequences formed by an embedding of two levels of abstract rules. Compared to monkeys, however, preschool children learned the grammars much faster using a chunking strategy. While supra-regular grammars are accessible to nonhuman primates through extensive training, human uniqueness may lie in the speed and learning strategy with which they are acquired. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. The Nostoc punctiforme Genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9more » Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.« less

  7. Image Encryption Algorithm Based on Hyperchaotic Maps and Nucleotide Sequences Database

    PubMed Central

    2017-01-01

    Image encryption technology is one of the main means to ensure the safety of image information. Using the characteristics of chaos, such as randomness, regularity, ergodicity, and initial value sensitiveness, combined with the unique space conformation of DNA molecules and their unique information storage and processing ability, an efficient method for image encryption based on the chaos theory and a DNA sequence database is proposed. In this paper, digital image encryption employs a process of transforming the image pixel gray value by using chaotic sequence scrambling image pixel location and establishing superchaotic mapping, which maps quaternary sequences and DNA sequences, and by combining with the logic of the transformation between DNA sequences. The bases are replaced under the displaced rules by using DNA coding in a certain number of iterations that are based on the enhanced quaternary hyperchaotic sequence; the sequence is generated by Chen chaos. The cipher feedback mode and chaos iteration are employed in the encryption process to enhance the confusion and diffusion properties of the algorithm. Theoretical analysis and experimental results show that the proposed scheme not only demonstrates excellent encryption but also effectively resists chosen-plaintext attack, statistical attack, and differential attack. PMID:28392799

  8. Proteins without unique 3D structures: biotechnological applications of intrinsically unstable/disordered proteins.

    PubMed

    Uversky, Vladimir N

    2015-03-01

    Intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs) are functional proteins or regions that do not have unique 3D structures under functional conditions. Therefore, from the viewpoint of their lack of stable 3D structure, IDPs/IDPRs are inherently unstable. As much as structure and function of normal ordered globular proteins are determined by their amino acid sequences, the lack of unique 3D structure in IDPs/IDPRs and their disorder-based functionality are also encoded in the amino acid sequences. Because of their specific sequence features and distinctive conformational behavior, these intrinsically unstable proteins or regions have several applications in biotechnology. This review introduces some of the most characteristic features of IDPs/IDPRs (such as peculiarities of amino acid sequences of these proteins and regions, their major structural features, and peculiar responses to changes in their environment) and describes how these features can be used in the biotechnology, for example for the proteome-wide analysis of the abundance of extended IDPs, for recombinant protein isolation and purification, as polypeptide nanoparticles for drug delivery, as solubilization tools, and as thermally sensitive carriers of active peptides and proteins. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Molecular characterization of an ependymin precursor from goldfish brain.

    PubMed

    Königstorfer, A; Sterrer, S; Eckerskorn, C; Lottspeich, F; Schmidt, R; Hoffmann, W

    1989-01-01

    Ependymins are thought to be implicated in fundamental processes involved in plasticity of the goldfish CNS. Gas-phase sequencing of purified ependymins beta and gamma revealed that they share the same N-terminal sequence. Each sequence displays microheterogeneities at several positions. Based on the protein sequences obtained, we constructed synthetic oligonucleotides and used them as hybridization probes for screening cDNA libraries of goldfish brain. In this article we describe the full-length sequence of a mRNA encoding a precursor of ependymins. A cleavable signal sequence characteristic of secretory proteins is located at the N-terminal end, followed directly by the ependymin sequence. Also, two potential N-glycosylation sites were detected. A computer search revealed that ependymins form a novel family of unique proteins.

  10. Plasmodium falciparum Serine/Threonine Phosphoprotein Phosphatases (PPP): From Housekeeper to 'Holy Grail'

    USDA-ARS?s Scientific Manuscript database

    Availability of complete genome sequence for Plasmodium falciparum has been useful in drawing a comprehensive metabolic map of the parasite. Distinct and unique metabolic characteristics of the parasite may be exploited as potential targets for new antimalarial drug discovery research. Reversible ph...

  11. Molecular characterization of subgenotype A1 (subgroup Aa) of hepatitis B virus.

    PubMed

    Kramvis, Anna; Kew, Michael C

    2007-07-01

    Subgenotypes of hepatitis B virus (HBV) were first recognized after a unique segment of genotype A was identified when sequencing the preS2/S region of southern African HBV isolates. Originally named subgroup A', subsequently called subgroup Aa (for Africa) or subgenotype A1, this subgenotype is found in South Africa, Malawi, Uganda, Tanzania, Somalia, Yemen, India, Nepal, the Philippines and Brazil. The relatively higher mean nucleotide divergence of subgenotype A1 suggests that it has been endemic and has a long evolutionary history in the populations where it prevails. Distinctive sequence characteristics could account for the high hepatitis B e-antigen (HBeAg) negativity and low HBV DNA levels in carriers of this subgenotype. Substitutions or mutations can reduce HBeAg expression at three levels: (i) 1762T1764A atthe transcriptional level; (ii) substitutions at nt 1809-1812 at the translational level; and (iii) 1862T at the post-translational level. Co-existence of 1762T1764A and nt 1809-1812 mutations reduces HBeAg expression in an additive manner. In addition, subgenotype A1 has unique sequence alterations in the transcriptional regulatory elements and the polymerase coding region. The distinct sequence characteristics of subgenotype A1 may contribute to the 4.5-fold increased risk of heptocellular carcinoma in HBV carriers infected with genotype A, which is entirely attributable to subgenotype A1.

  12. A unique Mycobacterium species isolated from an epizootic of striped bass (Morone saxatilis).

    PubMed Central

    Rhodes, M. W.; Kator, H.; Kotob, S.; van Berkum, P.; Kaattari, I.; Vogelbein, W.; Floyd, M. M.; Butler, W. R.; Quinn, F. D.; Ottinger, C.; Shotts, E.

    2001-01-01

    We isolated a Mycobacterium sp. resembling Mycobacterium marinum and M. ulcerans from diseased striped bass (Morone saxatilis) during an epizootic of mycobacteriosis in the Chesapeake Bay. This isolate may represent an undescribed Mycobacterium species, based on phenotypic characteristics and comparative 16S rRNA gene sequence. PMID:11747708

  13. Pestoides F, an atypical Yersinia pestis strain from the former Soviet Union.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Garcia, Emilio; Worsham, Patricia; Bearden, S.

    2007-01-01

    Unlike the classical Yersinia pestis strains, members of an atypical group of Y. pestis from Central Asia, denominated Y. pestis subspecies caucasica (also known as one of several pestoides types), are distinguished by a number of characteristics including their ability to ferment rhamnose and melibiose, their lack of the small plasmid encoding the plasminogen activator (pla) and pesticin, and their exceptionally large variants of the virulence plasmid pMT (encoding murine toxin and capsular antigen). We have obtained the entire genome sequence of Y. pestis Pestoides F, an isolate from the former Soviet Union that has enabled us to carryout amore » comprehensive genome-wide comparison of this organism's genomic content against the six published sequences of Y. pestis and their Y. pseudotuberculosis ancestor. Based on classical glycerol fermentation (+ve) and nitrate reduction (+ve) Y. pestis Pestoides F is an isolate that belongs to the biovar antiqua. This strain is unusual in other characteristics such as the fact that it carries a non-consensus V antigen (lcrV) sequence, and that unlike other Pla(-) strains, Pestoides F retains virulence by the parenteral and aerosol routes. The chromosome of Pestoides F is 4,517,345 bp in size comprising some 3,936 predicted coding sequences, while its pCD and pMT plasmids are 71,507 bp and 137,010 bp in size respectively. Comparison of chromosome-associated genes in Pestoides F with those in the other sequenced Y. pestis strains reveals differences ranging from strain-specific rearrangements, insertions, deletions, single nucleotide polymorphisms, and a unique distribution of insertion sequences. There is a single approximately 7 kb unique region in the chromosome not found in any of the completed Y. pestis strains sequenced to date, but which is present in the Y. pseudotuberculosis ancestor. Taken together, these findings are consistent with Pestoides F being derived from the most ancient lineage of Y. pestis yet sequenced.« less

  14. Pestoides F, and Atypical Yersinia pestis Strain from the Former Soviet Union

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Garcia, E; Worsham, P; Bearden, S

    2007-01-05

    Unlike the classical Yersinia pestis strains, members of an atypical group of Y. pestis from Central Asia, denominated Y. pestis subspecies caucasica (also known as one of several pestoides types), are distinguished by a number of characteristics including their ability to ferment rhamnose and melibiose, their lacking the small plasmid encoding the plasminogen activator (pla) and pesticin, and their exceptionally large variants of the virulence plasmid pMT (encoding murine toxin and capsular antigen). We have obtained the entire genome sequence of Y. pestis Pestoides F, an isolate from the former Soviet Union that has enabled us to carryout a comprehensivemore » genome-wide comparison of this organism's genomic content against the six published sequences of Y. pestis and their Y. pseudotuberculosis ancestor. Based on classical glycerol fermentation (+ve) and nitrate reduction (+ve) Y. pestis Pestoides F is an isolate that belongs to the biovar antiqua. This strain is unusual in other characteristics such as the fact that it carries a non-consensus V antigen (lcrV) sequence, and that unlike other Pla{sup -} strains, Pestoides F retains virulence by the parenteral and aerosol routes. The chromosome of Pestoides F is 4,517,345 bp in size comprising some 3,936 predicted coding sequences, while its pCD and pMT plasmids are 71,507 bp and 137,010 bp in size respectively. Comparison of chromosome-associated genes in Pestoides F with those in the other sequenced Y. pestis strains, reveals a series of differences ranging from strain-specific rearrangements, insertions, deletions, single nucleotide polymorphisms, and a unique distribution of insertion sequences. There is a single {approx}7 kb unique region in the chromosome not found in any of the completed Y. pestis strains sequenced to date, but which is present in the Y. pseudotuberculosis ancestor. Taken together, these findings are consistent with Pestoides F being derived from the most ancient lineage of Y. pestis yet sequenced.« less

  15. Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome

    PubMed Central

    Durham, Bryndan P.; Grote, Jana; Whittaker, Kerry A.; Bender, Sara J.; Luo, Haiwei; Grim, Sharon L.; Brown, Julia M.; Casey, John R.; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M.; Mine, Aric H.; Nigro, Olivia D.; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K.; Weber, Thomas S.; Wilson, Jesse M.; Church, Matthew J.; DeLong, Edward F.; Karl, David M.; Steward, Grieg F.; Eppley, John M.; Kyrpides, Nikos C.; Schuster, Stephan; Rappé, Michael S.

    2014-01-01

    Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters. PMID:25197450

  16. Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome.

    PubMed

    Durham, Bryndan P; Grote, Jana; Whittaker, Kerry A; Bender, Sara J; Luo, Haiwei; Grim, Sharon L; Brown, Julia M; Casey, John R; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M; Mine, Aric H; Nigro, Olivia D; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K; Weber, Thomas S; Wilson, Jesse M; Church, Matthew J; DeLong, Edward F; Karl, David M; Steward, Grieg F; Eppley, John M; Kyrpides, Nikos C; Schuster, Stephan; Rappé, Michael S

    2014-06-15

    Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.

  17. Recurrence of 49-base decamers, nonomers, and octamers within mouse C mu gene of Ig heavy chain and its primordial building block.

    PubMed Central

    Yazaki, A; Ohno, S

    1983-01-01

    Within the published 2,168-base-long mouse C mu gene of Ig heavy chain consisting of four coding and four noncoding segments, 2 base decamers, 8 nonomers, and 39 octamers recurred. Recurring base heptamers (about 100) and hexamers (about 350) were simply too numerous to merit individual identification. In spite of extensive overlaps between these recurring base decamers to hexamers, they occupied nearly the entire length of mouse Ig C mu gene. As with other genes of the beta-sheet-forming beta 2-microglobulin family, the Ig C mu gene (flanking and intervening noncoding sequences included) is not a unique sequence but rather it is degenerate repeats of the 45-base-long primordial building-block sequence uniquely its own. This primordial building block must originally have specified the 15-amino-acid-residue-long primordial arm of beta-sheet-forming loops, the characteristics of the beta 2-microglobulin family of polypeptides. PMID:6403948

  18. Identification of a novel astrovirus in domestic sheep in Hungary.

    PubMed

    Reuter, Gábor; Pankovics, Péter; Delwart, Eric; Boros, Ákos

    2012-02-01

    The family Astroviridae consists of two genera, Avastrovirus and Mamastrovirus, whose members are associated with gastroenteritis in avian and mammalian hosts, respectively. We serendipitously identified a novel ovine astrovirus in a fecal specimen from a domestic sheep (Ovis aries) in Hungary by viral metagenomic analysis. Sequencing of the fragment indicated that it was an ORF1b/ORF2/3'UTR sequence, and it has been submitted to the GenBank database as ovine astrovirus type 2 (OAstV-2/Hungary/2009) with accession number JN592482. The unique sequence characteristics and the phylogenetic position of OAstV-2 suggest that genetically divergent lineages of astroviruses exist in sheep.

  19. Comparative genomics of Fructobacillus spp. and Leuconostoc spp. reveals niche-specific evolution of Fructobacillus spp.

    DOE PAGES

    Endo, Akihito; Tanizawa, Yasuhiro; Tanaka, Naoto; ...

    2015-12-29

    In this study, Fructobacillus spp. in fructose-rich niches belong to the family Leuconostocaceae. They were originally classified as Leuconostoc spp., but were later grouped into a novel genus, Fructobacillus , based on their phylogenetic position, morphology and specific biochemical characteristics. The unique characters, so called fructophilic characteristics, had not been reported in the group of lactic acid bacteria, suggesting unique evolution at the genome level. Here we studied four draft genome sequences of Fructobacillus spp. and compared their metabolic properties against those of Leuconostoc spp. As a result, Fructobacillus species possess significantly less protein coding sequences in their small genomes.more » The number of genes was significantly smaller in carbohydrate transport and metabolism. Several other metabolic pathways, including TCA cycle, ubiquinone and other terpenoid-quinone biosynthesis and phosphotransferase systems, were characterized as discriminative pathways between the two genera. The adhE gene for bifunctional acetaldehyde/alcohol dehydrogenase, and genes for subunits of the pyruvate dehydrogenase complex were absent in Fructobacillus spp. The two genera also show different levels of GC contents, which are mainly due to the different GC contents at the third codon position. In conclusion, the present genome characteristics in Fructobacillus spp. suggest reductive evolution that took place to adapt to specific niches.« less

  20. Comparative genomics of Fructobacillus spp. and Leuconostoc spp. reveals niche-specific evolution of Fructobacillus spp.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Endo, Akihito; Tanizawa, Yasuhiro; Tanaka, Naoto

    In this study, Fructobacillus spp. in fructose-rich niches belong to the family Leuconostocaceae. They were originally classified as Leuconostoc spp., but were later grouped into a novel genus, Fructobacillus , based on their phylogenetic position, morphology and specific biochemical characteristics. The unique characters, so called fructophilic characteristics, had not been reported in the group of lactic acid bacteria, suggesting unique evolution at the genome level. Here we studied four draft genome sequences of Fructobacillus spp. and compared their metabolic properties against those of Leuconostoc spp. As a result, Fructobacillus species possess significantly less protein coding sequences in their small genomes.more » The number of genes was significantly smaller in carbohydrate transport and metabolism. Several other metabolic pathways, including TCA cycle, ubiquinone and other terpenoid-quinone biosynthesis and phosphotransferase systems, were characterized as discriminative pathways between the two genera. The adhE gene for bifunctional acetaldehyde/alcohol dehydrogenase, and genes for subunits of the pyruvate dehydrogenase complex were absent in Fructobacillus spp. The two genera also show different levels of GC contents, which are mainly due to the different GC contents at the third codon position. In conclusion, the present genome characteristics in Fructobacillus spp. suggest reductive evolution that took place to adapt to specific niches.« less

  1. Genomics dataset of unidentified disclosed isolates.

    PubMed

    Rekadwad, Bhagwan N

    2016-09-01

    Analysis of DNA sequences is necessary for higher hierarchical classification of the organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset is chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. The quick response codes were generated. AT/GC content of the DNA sequences analysis was carried out. The QR is helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage code and enzyme code studied under the restriction digestion study, which helpful for performing studies using short DNA sequences was reported. The dataset disclosed here is the new revelatory data for exploration of unique DNA sequences for evaluation, identification, comparison and analysis.

  2. International Life Science Institute North America Cronobacter (Formerly Enterobacter sakazakii) isolate set.

    PubMed

    Ivy, Reid A; Farber, Jeffrey M; Pagotto, Franco; Wiedmann, Martin

    2013-01-01

    Foodborne pathogen isolate collections are important for the development of detection methods, for validation of intervention strategies, and to develop an understanding of pathogenesis and virulence. We have assembled a publicly available Cronobacter (formerly Enterobacter sakazakii) isolate set that consists of (i) 25 Cronobacter sakazakii isolates, (ii) two Cronobacter malonaticus isolates, (iii) one Cronobacter muytjensii isolate, which displays some atypical phenotypic characteristics, biochemical profiles, and colony color on selected differential media, and (iv) two nonclinical Enterobacter asburiae isolates, which show some phenotypic characteristics similar to those of Cronobacter spp. The set consists of human (n = 10), food (n = 11), and environmental (n = 9) isolates. Analysis of partial 16S rDNA sequence and seven-gene multilocus sequence typing data allowed for reliable identification of these isolates to species and identification of 14 isolates as sequence type 4, which had previously been shown to be the most common C. sakazakii sequence type associated with neonatal meningitis. Phenotypic characterization was carried out with API 20E and API 32E test strips and streaking on two selective chromogenic agars; isolates were also assessed for sorbitol fermentation and growth at 45°C. Although these strategies typically produced the same classification as sequence-based strategies, based on a panel of four biochemical tests, one C. sakazakii isolate yielded inconclusive data and one was classified as C. malonaticus. EcoRI automated ribotyping and pulsed-field gel electrophoresis (PFGE) with XbaI separated the set into 23 unique ribotypes and 30 unique PFGE types, respectively, indicating subtype diversity within the set. Subtype and source data for the collection are publicly available in the PathogenTracker database (www. pathogentracker. net), which allows for continuous updating of information on the set, including links to publications that include information on isolates from this collection.

  3. Prevalence and genome characteristics of canine astrovirus in southwest China.

    PubMed

    Li, Mingxiang; Yan, Nan; Ji, Conghui; Wang, Min; Zhang, Bin; Yue, Hua; Tang, Cheng

    2018-05-30

    The aim of this study was to investigate canine astrovirus (CaAstV) infection in southwest China. We collected 107 faecal samples from domestic dogs with obvious diarrhoea. Forty-two diarrhoeic samples (39.3 %) were positive for CaAstV by RT-PCR, and 41/42 samples showed co-infection with canine coronavirus (CCoV), canine parvovirus-2 (CPV-2) and canine distemper virus (CDV). Phylogenetic analysis based on 26 CaAstV partial ORF1a and ORF1b sequences revealed that most CaAstV strains showed unique evolutionary features. Interestingly, putative recombination events were observed among four of the five complete ORF2 sequences cloned in this study, and three of the five complete ORF2 sequences formed a single unique group, suggesting that these strains could be a novel genotype. We successfully sequenced the complete genome of one CaAstV strain (designated 2017/44/CHN), which was 6628 nt in length. The features of this genome include putative recombination events in the ORF1a, ORF1b and ORF2 genes, while the ORF2 gene had a continuous insertion of 7 aa in region II compared with the other complete ORF2 sequences available in GenBank. Phylogenetic analysis showed that 2017/44/CHN formed a single group based on genome sequences, suggesting that this strain might be a novel genotype. The results of this study revealed that CaAstV circulates widely in diarrhoeic dogs in southwest China and exhibits unique evolutionary events. To the best of our knowledge, this is the first report of recombination events in CaAstV, and it contributes to further understanding of the genetic evolution of CaAstV.

  4. BHD-associated kidney cancer exhibits unique molecular characteristics and a wide variety of variants in chromatin remodeling genes.

    PubMed

    Hasumi, Hisashi; Furuya, Mitsuko; Tatsuno, Kenji; Yamamoto, Shogo; Baba, Masaya; Hasumi, Yukiko; Isono, Yasuhiro; Suzuki, Kae; Jikuya, Ryosuke; Otake, Shinji; Muraoka, Kentaro; Osaka, Kimito; Hayashi, Narihiko; Makiyama, Kazuhide; Miyoshi, Yasuhide; Kondo, Keiichi; Nakaigawa, Noboru; Kawahara, Takashi; Izumi, Koji; Teranishi, Junichi; Yumura, Yasushi; Uemura, Hiroji; Nagashima, Yoji; Metwalli, Adam R; Schmidt, Laura S; Aburatani, Hiroyuki; Linehan, W Marston; Yao, Masahiro

    2018-05-14

    Birt-Hogg-Dubé (BHD) syndrome is a hereditary kidney cancer syndrome, which predisposes patients to develop kidney cancer, cutaneous fibrofolliculomas and pulmonary cysts. The responsible gene FLCN is a tumor suppressor for kidney cancer which plays an important role in energy homeostasis through the regulation of mitochondrial oxidative metabolism. However, the process by which FLCN-deficiency leads to renal tumorigenesis is unclear. In order to clarify molecular pathogenesis of BHD-associated kidney cancer, we conducted whole-exome sequencing analysis using next-generation sequencing technology as well as metabolite analysis using LC/MS and GC/MS. Whole-exome sequencing analysis of BHD-associated kidney cancer revealed that copy number variations (CNV) of BHD-associated kidney cancer are considerably different from those already reported in sporadic cases. In somatic variant analysis, very few variants were commonly observed in BHD-associated kidney cancer; however, variants in chromatin remodeling genes were frequently observed in BHD-associated kidney cancer (17/29 tumors, 59%). Metabolite analysis of BHD-associated kidney cancer revealed metabolic reprogramming towards upregulated redox regulation which may neutralize reactive oxygen species potentially produced from mitochondria with increased respiratory capacity under FLCN-deficiency. BHD-associated kidney cancer displays unique molecular characteristics which are completely different from sporadic kidney cancer, providing mechanistic insight into tumorigenesis under FLCN-deficiency as well as a foundation for development of novel therapeutics for kidney cancer.

  5. Nosiheptide Biosynthesis Featuring a Unique Indole Side Ring Formation on the Characteristic Thiopeptide Framework

    PubMed Central

    Yu, Yi; Duan, Lian; Zhang, Qi; Liao, Rijing; Ding, Ying; Pan, Haixue; Wendt-Pienkowski, Evelyn; Tang, Gongli; Shen, Ben; Liu, Wen

    2009-01-01

    Nosiheptide (NOS), belonging to the e series of thiopeptide antibiotics that exhibit potent activity against various bacterial pathogens, bears a unique indole side ring system and regiospecific hydroxyl groups on the characteristic macrocyclic core. Here, cloning, sequencing and characterization of the nos gene cluster from Streptomyces actuosus ATCC 25421 as a model for this series of thiopeptides has unveiled new insights into their biosynthesis. Bioinformatics-based sequence analysis and in vivo investigation into the gene functions show that NOS biosynthesis shares a common strategy with recently characterized b or c series thiopeptides for forming the characteristic macrocyclic core, which features a ribosomally synthesized precursor peptide with conserved posttranslational modifications. However, it apparently proceeds via a different route for tailoring the thiopeptide framework, allowing the final product to exhibit the distinct structural characteristics of e series thiopeptides, such as the indole side ring system. Chemical complementation supports the notion that the S-adenosylmethionine (AdoMet)-dependent protein NosL may play a central role in converting Trp to the key 3-methylindole moiety by an unusual carbon side chain rearrangement, most likely via a radical-initiated mechanism. Characterization of the indole side ring-opened analog of NOS from the nosN mutant strain is consistent with the proposed methyltransferase activity of its encoded protein, shedding light into the timing of the individual steps for indole side ring biosynthesis. These results also suggest the feasibility of engineering novel thiopeptides for drug discovery by manipulating the NOS biosynthetic machinery. PMID:19678698

  6. Genomic characterization of Zika virus isolated from Indonesia.

    PubMed

    Yudhaputri, Frilasita A; Trimarsanto, Hidayat; Perkasa, Aditya; Yohan, Benediktus; Haryanto, Sotianingsih; Wiyatno, Ageng; Soebandrio, Amin; Myint, Khin Saw; Ledermann, Jeremy P; Rosenberg, Ronald; Powers, Ann M; Sasmono, R Tedjo

    2017-10-01

    Zika virus (ZIKV) JMB-185 strain was isolated from a febrile patient in Jambi, Indonesia in 2014. To understand its genetic characteristics, we performed whole genome sequencing using the Ion Torrent PGM platform on the supernatant of the first passage. The phylogenetic analysis showed that the isolate was not closely related to the Brazilian ZIKV associated with microcephaly or isolates from the recent Singapore Zika outbreak. Molecular evolution analysis indicated that JMB-185 strain may have been circulating in the Southeast Asia region, including Indonesia since 2000. We observed high nucleotide sequence identity between Indonesia, Thailand, Singapore, and American strains although unique amino acid substitutions were also observed. This report provides information on the genomic characteristics of Indonesian ZIKV which may be used for further studies. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sattison, M.B.; Schroeder, J.A.; Russell, K.D.

    The Idaho National Engineering Laboratory (INEL) over the past year has created 75 plant-specific Accident Sequence Precursor (ASP) models using the SAPHIRE suite of PRA codes. Along with the new models, the INEL has also developed a new module for SAPHIRE which is tailored specifically to the unique needs of ASP evaluations. These models and software will be the next generation of risk tools for the evaluation of accident precursors by both NRR and AEOD. This paper presents an overview of the models and software. Key characteristics include: (1) classification of the plant models according to plant response with amore » unique set of event trees for each plant class, (2) plant-specific fault trees using supercomponents, (3) generation and retention of all system and sequence cutsets, (4) full flexibility in modifying logic, regenerating cutsets, and requantifying results, and (5) user interface for streamlined evaluation of ASP events.« less

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sattison, M.B.; Schroeder, J.A.; Russell, K.D.

    The Idaho National Engineering Laboratory (INEL) over the past year has created 75 plant-specific Accident Sequence Precursor (ASP) models using the SAPHIRE suite of PRA codes. Along with the new models, the INEL has also developed a new module for SAPHIRE which is tailored specifically to the unique needs of conditional core damage probability (CCDP) evaluations. These models and software will be the next generation of risk tools for the evaluation of accident precursors by both NRR and AEOD. This paper presents an overview of the models and software. Key characteristics include: (1) classification of the plant models according tomore » plant response with a unique set of event trees for each plant class, (2) plant-specific fault trees using supercomponents, (3) generation and retention of all system and sequence cutsets, (4) full flexibility in modifying logic, regenerating cutsets, and requantifying results, and (5) user interface for streamlined evaluation of ASP events.« less

  9. Transcriptome profile and unique genetic evolution of positively selected genes in yak lungs.

    PubMed

    Lan, DaoLiang; Xiong, XianRong; Ji, WenHui; Li, Jian; Mipam, Tserang-Donko; Ai, Yi; Chai, ZhiXin

    2018-04-01

    The yak (Bos grunniens), which is a unique bovine breed that is distributed mainly in the Qinghai-Tibetan Plateau, is considered a good model for studying plateau adaptability in mammals. The lungs are important functional organs that enable animals to adapt to their external environment. However, the genetic mechanism underlying the adaptability of yak lungs to harsh plateau environments remains unknown. To explore the unique evolutionary process and genetic mechanism of yak adaptation to plateau environments, we performed transcriptome sequencing of yak and cattle (Bos taurus) lungs using RNA-Seq technology and a subsequent comparison analysis to identify the positively selected genes in the yak. After deep sequencing, a normal transcriptome profile of yak lung that containing a total of 16,815 expressed genes was obtained, and the characteristics of yak lungs transcriptome was described by functional analysis. Furthermore, Ka/Ks comparison statistics result showed that 39 strong positively selected genes are identified from yak lungs. Further GO and KEGG analysis was conducted for the functional annotation of these genes. The results of this study provide valuable data for further explorations of the unique evolutionary process of high-altitude hypoxia adaptation in yaks in the Tibetan Plateau and the genetic mechanism at the molecular level.

  10. Phylogenetic analysis of several Thermus strains from Rehai of Tengchong, Yunnan, China.

    PubMed

    Lin, Lianbing; Zhang, Jie; Wei, Yunlin; Chen, Chaoyin; Peng, Qian

    2005-10-01

    Several Thermus strains were isolated from 10 hot springs of the Rehai geothermal area in Tengchong, Yunnan province. The diversity of Thermus strains was examined by sequencing the 16S rRNA genes and comparing their sequences. Phylogenetic analysis showed that the 16S rDNA sequences from the Rehai geothermal isolates form four branches in the phylogenetic tree and had greater than 95.9% similarity in the phylogroup. Secondary structure comparison also indicated that the 16S rRNA from the Rehai geothermal isolates have unique secondary structure characteristics in helix 6, helix 9, and helix 10 (reference to Escherichia coli). This research is the first attempt to reveal the diversity of Thermus strains that are distributed in the Rehai geothermal area.

  11. Molecular structures of centromeric heterochromatin and karyotypic evolution in the Siamese crocodile (Crocodylus siamensis) (Crocodylidae, Crocodylia).

    PubMed

    Kawagoshi, Taiki; Nishida, Chizuko; Ota, Hidetoshi; Kumazawa, Yoshinori; Endo, Hideki; Matsuda, Yoichi

    2008-01-01

    Crocodilians have several unique karyotypic features, such as small diploid chromosome numbers (30-42) and the absence of dot-shaped microchromosomes. Of the extant crocodilian species, the Siamese crocodile (Crocodylus siamensis) has no more than 2n = 30, comprising mostly bi-armed chromosomes with large centromeric heterochromatin blocks. To investigate the molecular structures of C-heterochromatin and genomic compartmentalization in the karyotype, characterized by the disappearance of tiny microchromosomes and reduced chromosome number, we performed molecular cloning of centromeric repetitive sequences and chromosome mapping of the 18S-28S rDNA and telomeric (TTAGGG)( n ) sequences. The centromeric heterochromatin was composed mainly of two repetitive sequence families whose characteristics were quite different. Two types of GC-rich CSI-HindIII family sequences, the 305 bp CSI-HindIII-S (G+C content, 61.3%) and 424 bp CSI-HindIII-M (63.1%), were localized to the intensely PI-stained centric regions of all chromosomes, except for chromosome 2 with PI-negative heterochromatin. The 94 bp CSI-DraI (G+C content, 48.9%) was tandem-arrayed satellite DNA and localized to chromosome 2 and four pairs of small-sized chromosomes. The chromosomal size-dependent genomic compartmentalization that is supposedly unique to the Archosauromorpha was probably lost in the crocodilian lineage with the disappearance of microchromosomes followed by the homogenization of centromeric repetitive sequences between chromosomes, except for chromosome 2.

  12. Revised systematics of Holospora-like bacteria and characterization of "Candidatus Gortzia infectiva", a novel macronuclear symbiont of Paramecium jenningsi.

    PubMed

    Boscaro, Vittorio; Fokin, Sergei I; Schrallhammer, Martina; Schweikert, Michael; Petroni, Giulio

    2013-01-01

    The genus Holospora (Rickettsiales) includes highly infectious nuclear symbionts of the ciliate Paramecium with unique morphology and life cycle. To date, nine species have been described, but a molecular characterization is lacking for most of them. In this study, we have characterized a novel Holospora-like bacterium (HLB) living in the macronuclei of a Paramecium jenningsi population. This bacterium was morphologically and ultrastructurally investigated in detail, and its life cycle and infection capabilities were described. We also obtained its 16S rRNA gene sequence and developed a specific probe for fluorescence in situ hybridization experiments. A new taxon, "Candidatus Gortzia infectiva", was established for this HLB according to its unique characteristics and the relatively low DNA sequence similarities shared with other bacteria. The phylogeny of the order Rickettsiales based on 16S rRNA gene sequences has been inferred, adding to the available data the sequence of the novel bacterium and those of two Holospora species (Holospora obtusa and Holospora undulata) characterized for the purpose. Our phylogenetic analysis provided molecular support for the monophyly of HLBs and showed a possible pattern of evolution for some of their features. We suggested to classify inside the family Holosporaceae only HLBs, excluding other more distantly related and phenotypically different Paramecium endosymbionts.

  13. Identification of human-to-human transmissibility factors in PB2 proteins of influenza A by large-scale mutual information analysis

    PubMed Central

    Miotto, Olivo; Heiny, AT; Tan, Tin Wee; August, J Thomas; Brusic, Vladimir

    2008-01-01

    Background The identification of mutations that confer unique properties to a pathogen, such as host range, is of fundamental importance in the fight against disease. This paper describes a novel method for identifying amino acid sites that distinguish specific sets of protein sequences, by comparative analysis of matched alignments. The use of mutual information to identify distinctive residues responsible for functional variants makes this approach highly suitable for analyzing large sets of sequences. To support mutual information analysis, we developed the AVANA software, which utilizes sequence annotations to select sets for comparison, according to user-specified criteria. The method presented was applied to an analysis of influenza A PB2 protein sequences, with the objective of identifying the components of adaptation to human-to-human transmission, and reconstructing the mutation history of these components. Results We compared over 3,000 PB2 protein sequences of human-transmissible and avian isolates, to produce a catalogue of sites involved in adaptation to human-to-human transmission. This analysis identified 17 characteristic sites, five of which have been present in human-transmissible strains since the 1918 Spanish flu pandemic. Sixteen of these sites are located in functional domains, suggesting they may play functional roles in host-range specificity. The catalogue of characteristic sites was used to derive sequence signatures from historical isolates. These signatures, arranged in chronological order, reveal an evolutionary timeline for the adaptation of the PB2 protein to human hosts. Conclusion By providing the most complete elucidation to date of the functional components participating in PB2 protein adaptation to humans, this study demonstrates that mutual information is a powerful tool for comparative characterization of sequence sets. In addition to confirming previously reported findings, several novel characteristic sites within PB2 are reported. Sequence signatures generated using the characteristic sites catalogue characterize concisely the adaptation characteristics of individual isolates. Evolutionary timelines derived from signatures of early human influenza isolates suggest that characteristic variants emerged rapidly, and remained remarkably stable through subsequent pandemics. In addition, the signatures of human-infecting H5N1 isolates suggest that this avian subtype has low pandemic potential at present, although it presents more human adaptation components than most avian subtypes. PMID:18315849

  14. Next Generation Sequencing Technologies: The Doorway to the Unexplored Genomics of Non-Model Plants

    PubMed Central

    Unamba, Chibuikem I. N.; Nag, Akshay; Sharma, Ram K.

    2015-01-01

    Non-model plants i.e., the species which have one or all of the characters such as long life cycle, difficulty to grow in the laboratory or poor fecundity, have been schemed out of sequencing projects earlier, due to high running cost of Sanger sequencing. Consequently, the information about their genomics and key biological processes are inadequate. However, the advent of fast and cost effective next generation sequencing (NGS) platforms in the recent past has enabled the unearthing of certain characteristic gene structures unique to these species. It has also aided in gaining insight about mechanisms underlying processes of gene expression and secondary metabolism as well as facilitated development of genomic resources for diversity characterization, evolutionary analysis and marker assisted breeding even without prior availability of genomic sequence information. In this review we explore how different Next Gen Sequencing platforms, as well as recent advances in NGS based high throughput genotyping technologies are rewarding efforts on de-novo whole genome/transcriptome sequencing, development of genome wide sequence based markers resources for improvement of non-model crops that are less costly than phenotyping. PMID:26734016

  15. Sequence of the toxic shock syndrome toxin gene (tstH) borne by strains of Staphylococcus aureus isolated from patients with Kawasaki syndrome.

    PubMed Central

    Deresiewicz, R L; Flaxenburg, J; Leng, K; Kasper, D L

    1996-01-01

    To explore whether a novel staphylococcal clone or structural variant of toxic shock syndrome toxin 1 is associated with Kawasaki syndrome, six toxigenic strains of Staphylococcus aureus from Kawasaki syndrome patients were studied. The strains were divisible into two groups based on phenotypic and genotypic characteristics and are therefore unequivocally not clonal. Portions of the tstH genes of each strain were sequenced. Three were sequenced in their entirety, while the remainder were sequenced from codon 66 to codon 137 of the mature protein only. Two of the former group differed slightly in the sequences of their signal peptides relative to the sequence published for the tstH signal peptide. Those differences did not affect toxin processing or secretion. The sequenced portions of the regions encoding mature toxic shock syndrome toxin 1 were identical in all six strains and corresponded exactly to the published sequence of tstH. No evidence was found for the existence of a structural variant of tstH uniquely associated with Kawasaki syndrome. PMID:8757881

  16. A study of parameter identification

    NASA Technical Reports Server (NTRS)

    Herget, C. J.; Patterson, R. E., III

    1978-01-01

    A set of definitions for deterministic parameter identification ability were proposed. Deterministic parameter identificability properties are presented based on four system characteristics: direct parameter recoverability, properties of the system transfer function, properties of output distinguishability, and uniqueness properties of a quadratic cost functional. Stochastic parameter identifiability was defined in terms of the existence of an estimation sequence for the unknown parameters which is consistent in probability. Stochastic parameter identifiability properties are presented based on the following characteristics: convergence properties of the maximum likelihood estimate, properties of the joint probability density functions of the observations, and properties of the information matrix.

  17. Molecular signatures that are distinctive characteristics of the vertebrates and chordates and supporting a grouping of vertebrates with the tunicates.

    PubMed

    Gupta, Radhey S

    2016-01-01

    Members of the phylum Chordata and the subphylum Vertebrata are presently distinguished solely on the basis of morphological characteristics. The relationship of the vertebrates to the two non-vertebrate chordate subphyla is also a subject of debate. Analyses of protein sequences have identified multiple conserved signature indels (CSIs) that are specific for Chordata or for Vertebrata. Five CSIs in 4 important proteins are specific for the Vertebrata, whereas two other CSIs are uniquely found in all sequenced chordate species including Ciona intestinalis and Oikapleura dioica (Tunicates) as well as Branchiostoma floridae (Cephalochordates). The shared presence of these molecular signatures by all vertebrates/chordate species, but in no other animal taxa, strongly indicates that the genetic changes represented by the identified CSIs diagnose monophyletic groups. Two other discovered CSIs are uniquely shared by different vertebrate species and by either one (Ciona intestinalis) or both tunicate (Ciona and Oikapleura) species, but they are not found in Branchiostoma or other animal species. Specific presence of these CSIs in different vertebrates and either one or both tunicate species provides strong independent evidence that the vertebrate species are more closely related to the urochordates (tunicates) than to the cephalochordates. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Yanfeng; Zheng, Yi; Qin, Ling

    Beta-hydroxyacid dehydrogenase (β-HAD) genes have been identified in all sequenced genomes of eukaryotes and prokaryotes. Their gene products catalyze the NAD+- or NADP+-dependent oxidation of various β-hydroxy acid substrates into their corresponding semialdehyde. In many fungal and bacterial genomes, multiple β-HAD genes are observed leading to the hypothesis that these gene products may have unique, uncharacterized metabolic roles specific to their species. The genomes of Geobacter sulfurreducens and Geobacter metallireducens each contain two potential β-HAD genes. The protein sequences of one pair of these genes, Gs-βHAD (Q74DE4) and Gm-βHAD (Q39R98), have 65% sequence identity and 77% sequence similarity with eachmore » other. Both proteins reduce succinic semialdehyde, a metabolite of the GABA shunt. To further explore the structural and functional characteristics of these two β-HADs with a potentially unique substrate specificity, crystal structures for Gs-βHAD and Gm-βHAD in complex with NADP+ were determined to a resolution of 1.89 Å and 2.07 Å, respectively. The structure of both proteins are similar, composed of 14 α-helices and nine β-strands organized into two domains. Domain One (1-165) adopts a typical Rossmann fold composed of two α/β units: a six-strand parallel β-sheet surrounded by six α-helices (α1 – α6) followed by a mixed three-strand β-sheet surrounded by two α-helices (α7 and α8). Domain Two (166-287) is composed of a bundle of seven α-helices (α9 – α14). Four functional regions conserved in all β-HADs are spatially located near each other at the interdomain cleft in both Gs-βHAD and Gm-βHAD with a buried molecule of NADP+. The structural features of Gs-βHAD and Gm-βHAD are described in relation to the four conserved consensus sequences characteristic of β-HADs and the potential biochemical importance of these enzymes as an alternative pathway for the degradation of succinic semialdehyde.« less

  19. Distinct population structure for co-occurring Anopheles goeldii and Anopheles triannulatus in Amazonian Brazil

    PubMed Central

    McKeon, Sascha Naomi; Moreno, Marta; Sallum, Maria Anise; Povoa, Marinete Marins; Conn, Jan Evelyn

    2013-01-01

    To evaluate whether environmental heterogeneity contributes to the genetic heterogeneity in Anopheles triannulatus, larval habitat characteristics across the Brazilian states of Roraima and Pará and genetic sequences were examined. A comparison with Anopheles goeldii was utilised to determine whether high genetic diversity was unique to An. triannulatus. Student t test and analysis of variance found no differences in habitat characteristics between the species. Analysis of population structure of An. triannulatus and An. goeldii revealed distinct demographic histories in a largely overlapping geographic range. Cytochrome oxidase I sequence parsimony networks found geographic clustering for both species; however nuclear marker networks depicted An. triannulatus with a more complex history of fragmentation, secondary contact and recent divergence. Evidence of Pleistocene expansions suggests both species are more likely to be genetically structured by geographic and ecological barriers than demography. We hypothesise that niche partitioning is a driving force for diversity, particularly in An. triannulatus. PMID:23903977

  20. GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data.

    PubMed

    Chen, Li; Reeve, James; Zhang, Lujun; Huang, Shengbing; Wang, Xuefeng; Chen, Jun

    2018-01-01

    Normalization is the first critical step in microbiome sequencing data analysis used to account for variable library sizes. Current RNA-Seq based normalization methods that have been adapted for microbiome data fail to consider the unique characteristics of microbiome data, which contain a vast number of zeros due to the physical absence or under-sampling of the microbes. Normalization methods that specifically address the zero-inflation remain largely undeveloped. Here we propose geometric mean of pairwise ratios-a simple but effective normalization method-for zero-inflated sequencing data such as microbiome data. Simulation studies and real datasets analyses demonstrate that the proposed method is more robust than competing methods, leading to more powerful detection of differentially abundant taxa and higher reproducibility of the relative abundances of taxa.

  1. [Analysis of structural characteristics of alpha-tubulins in plants with enhanced cold tolerance].

    PubMed

    Nyporko, A Iu; Demchuk, O N; Blium, Ia B

    2003-01-01

    The uniqueness of the point substitutions in the sequences of two alpha-tubulin isotypes from psychrophilic alga Chloromonas that can determine the increased cold tolerance of this alga was analyzed. The comparison of all known amino acid sequences of plant alpha-tubulins enabled to ascertain that only M268-->V replacement is unique and may have a significant influence on spatial structure of plant alpha-tubulins. Modeling of molecular surfaces of alpha-tubulins from Chloromonas, Chalmydomonas reinhardtii and goose grass Eleusine indica showed that insertion of the amino acid replacement M268-->V into the sequence of goose grace tubulin led to the likening of this protein surface to the surface of native alpha-tubulin from Chloromonas. Alteration of local hydrophobic properties of alpha-tubulin molecular surface in interdimeric contact zone as a result of the mentioned replacement was shown that may play important role in increasing the level of cold resistance of microtubules. The crucial role of amino acid residue in 268 position for forming the interdimeric contact surface of alpha-tubulin molecule was revealed. The assumption is made about the importance of replacements at this position for plant tolerance to abiotic factors of different nature (cold, herbicides).

  2. Stable isotope, site-specific mass tagging for protein identification

    DOEpatents

    Chen, Xian

    2006-10-24

    Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with .sup.13C/.sup.15N/.sup.2H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.

  3. Genome Wide Search for Biomarkers to Diagnose Yersinia Infections.

    PubMed

    Kalia, Vipin Chandra; Kumar, Prasun

    2015-12-01

    Bacterial identification on the basis of the highly conserved 16S rRNA (rrs) gene is limited by its presence in multiple copies and a very high level of similarity among them. The need is to look for other genes with unique characteristics to be used as biomarkers. Fifty-one sequenced genomes belonging to 10 different Yersinia species were used for searching genes common to all the genomes. Out of 304 common genes, 34 genes of sizes varying from 0.11 to 4.42 kb, were selected and subjected to in silico digestion with 10 different Restriction endonucleases (RE) (4-6 base cutters). Yersinia species have 6-7 copies of rrs per genome, which are difficult to distinguish by multiple sequence alignments or their RE digestion patterns. However, certain unique combinations of other common gene sequences-carB, fadJ, gluM, gltX, ileS, malE, nusA, ribD, and rlmL and their RE digestion patterns can be used as markers for identifying 21 strains belonging to 10 Yersinia species: Y. aldovae, Y. enterocolitica, Y. frederiksenii, Y. intermedia, Y. kristensenii, Y. pestis, Y. pseudotuberculosis, Y. rohdei, Y. ruckeri, and Y. similis. This approach can be applied for rapid diagnostic applications.

  4. Sequence, Structural Analysis and Metrics to Define the Unique Dynamic Features of the Flap Regions Among Aspartic Proteases.

    PubMed

    McGillewie, Lara; Ramesh, Muthusamy; Soliman, Mahmoud E

    2017-10-01

    Aspartic proteases are a class of hydrolytic enzymes that have been implicated in a number of diseases such as HIV, malaria, cancer and Alzheimer's. The flap region of aspartic proteases is a characteristic unique structural feature of these enzymes; and found to have a profound impact on protein overall structure, function and dynamics. Flap dynamics also plays a crucial role in drug binding and drug resistance. Therefore, understanding the structure and dynamic behavior of this flap regions is crucial in the design of potent and selective inhibitors against aspartic proteases. Defining metrics that can describe the flap motion/dynamics has been a challenging topic in literature. This review is the first attempt to compile comprehensive information on sequence, structure, motion and metrics used to assess the dynamics of the flap region of different aspartic proteases in "one pot". We believe that this review would be of critical importance to the researchers from different scientific domains.

  5. Molecular phylogenetic position of hexactinellid sponges in relation to the Protista and Demospongiae.

    PubMed

    West, L; Powers, D

    1993-01-01

    Although it is generally accepted that the first multicellular organisms arose from unicellular ancestors, the phylogenetic relationships linking these groups remain unclear. Anatomical, physiological, and molecular studies of current multicellular organisms with relatively simple body organization suggest key characteristics of the earliest multicellular lineages. Glass sponges, the Hexactinellida, possess cellular characteristics that resemble some unicellular protistan organisms. These unique sponges were abundant in shallow seas of the early Cambrian, but they are currently restricted to polar habitats or very deep regions of the world oceans. Due in part to their relative inaccessibility, their potential significance to the early phylogeny of the eukaryotic kingdoms has been largely overlooked. We used sequences of the 18s ribosomal RNA gene of Farrea occa, a representative of the deep-water hexactinellid sponges, and Coelocarteria singaporense, a representative of the more common demosponges, and compared them with selected ribosomal RNA gene sequences available within the Protista. Using four computational methods for phylogenetic analysis of ribosomal DNA sequences, we found that the hexactinellid sponge-demosponge cluster is most closely related to Volvox and Acanthamoeba.

  6. Putative and unique gene sequence utilization for the design of species specific probes as modeled by Lactobacillus plantarum

    USDA-ARS?s Scientific Manuscript database

    The concept of utilizing putative and unique gene sequences for the design of species specific probes was tested. The abundance profile of assigned functions within the Lactobacillus plantarum genome was used for the identification of the putative and unique gene sequence, csh. The targeted gene (cs...

  7. Plastid, nuclear and reverse transcriptase sequences in the mitochondrial genome of Oenothera: is genetic information transferred between organelles via RNA?

    PubMed Central

    Schuster, W; Brennicke, A

    1987-01-01

    We describe an open reading frame (ORF) with high homology to reverse transcriptase in the mitochondrial genome of Oenothera. This ORF displays all the characteristics of an active plant mitochondrial gene with a possible ribosome binding site and 39% T in the third codon position. It is located between a sequence fragment from the plastid genome and one of nuclear origin downstream from the gene encoding subunit 5 of the NADH dehydrogenase. The nuclear derived sequence consists of 528 nucleotides from the small ribosomal RNA and contains an expansion segment unique to nuclear rRNAs. The plastid sequence contains part of the ribosomal protein S4 and the complete tRNA(Ser). The observation that only transcribed sequences have been found i more than one subcellular compartment in higher plants suggests that interorganellar transfer of genetic information may occur via RNA and subsequent local reverse transcription and genomic integration. PMID:14650433

  8. Biology of a Novel Mycobacteriophage, SWU1, Isolated from Chinese Soil as Revealed by Genomic Characteristics

    PubMed Central

    Fan, Xiangyu; Teng, Tieshan; Wang, Honghai

    2012-01-01

    Mycobacteriophage SWU1 is a newly isolated phage from a soil sample collected at Gongping village, Pingchang County, Sichuan Province, China, using Mycobacterium smegmatis mc2155 as a host. Plaques of SWU1 appear as a unique bull's-eye on an M. smegmatis lawn. In this paper, we report the complete genome sequence of SWU1 and some major findings from the analysis result. PMID:22923793

  9. Biology of a novel mycobacteriophage, SWU1, isolated from Chinese soil as revealed by genomic characteristics.

    PubMed

    Fan, Xiangyu; Teng, Tieshan; Wang, Honghai; Xie, Jianping

    2012-09-01

    Mycobacteriophage SWU1 is a newly isolated phage from a soil sample collected at Gongping village, Pingchang County, Sichuan Province, China, using Mycobacterium smegmatis mc(2)155 as a host. Plaques of SWU1 appear as a unique bull's-eye on an M. smegmatis lawn. In this paper, we report the complete genome sequence of SWU1 and some major findings from the analysis result.

  10. Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

    PubMed

    Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

    1993-02-01

    A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.

  11. Temporal variation of aftershocks by means of multifractal characterization of their inter-event time and cluster analysis

    NASA Astrophysics Data System (ADS)

    Figueroa-Soto, A.; Zuñiga, R.; Marquez-Ramirez, V.; Monterrubio-Velasco, M.

    2017-12-01

    . The inter-event time characteristics of seismic aftershock sequences can provide important information to discern stages in the aftershock generation process. In order to investigate whether separate dynamic stages can be identified, (1) aftershock series after selected earthquake mainshocks, which took place at similar tectonic regimes were analyzed. To this end we selected two well-defined aftershock sequences from New Zealand and one aftershock sequence for Mexico, we (2) analyzed the fractal behavior of the logarithm of inter-event times (also called waiting times) of aftershocks by means of Holdeŕs exponent, and (3) their magnitude and spatial location based on a methodology proposed by Zaliapin and Ben Zion [2011] which accounts for the clustering properties of the sequence. In general, more than two coherent process stages can be identified following the main rupture, evidencing a type of "cascade" process which precludes implying a single generalized power law even though the temporal rate and average fractal character appear to be unique (as in a single Omorís p value). We found that aftershock processes indeed show multi-fractal characteristics, which may be related to different stages in the process of diffusion, as seen in the temporary-spatial distribution of aftershocks. Our method provides a way of defining the onset of the return to seismic background activity and the end of the main aftershock sequence.

  12. Shark (Scyliorhinus torazame) metallothionein: cDNA cloning, genomic sequence, and expression analysis.

    PubMed

    Cho, Young Sun; Choi, Buyl Nim; Ha, En-Mi; Kim, Ki Hong; Kim, Sung Koo; Kim, Dong Soo; Nam, Yoon Kwon

    2005-01-01

    Novel metallothionein (MT) complementary DNA and genomic sequences were isolated from a cartilaginous shark species, Scyliorhinus torazame. The full-length open reading frame (ORF) of shark MT cDNA encoded 68 amino acids with a high cysteine content (29%). The genomic ORF sequence (932 bp) of shark MT isolated by polymerase chain reaction (PCR) comprised 3 exons with 2 interventing introns. Shark MT sequence shared many conserved features with other vertebrate MTs: overall amino acid identities of shark MT ranged from 47% to 57% with fish MTs, and 41% to 62% with mammalian MTs. However, in addition to these conserved characteristics, shark MT sequence exhibited some unique characteristics. It contained 4 extra amino acids (Lys-Ala-Gly-Arg) at the end of the beta-domain, which have not been reported in any other vertebrate MTs. The last amino acid residue at the C-terminus was Ser, which also has not been reported in fish and mammalian MTs. The MT messenger RNA levels in shark liver and kidney, assessed by semiquantitative reverse transcriptase PCR and RNA blot hybridization, were significantly affected by experimental exposures to heavy metals (cadmium, copper, and zinc). Generally, the transcriptional activation of shark MT gene was dependent on the dose (0-10 mg/kg body weight for injection and 0-20 microM for immersion) and duration (1-10 days); zinc was a more potent inducer than copper and cadmium.

  13. Genomic analyses of Clostridium perfringens isolates from five toxinotypes.

    PubMed

    Hassan, Karl A; Elbourne, Liam D H; Tetu, Sasha G; Melville, Stephen B; Rood, Julian I; Paulsen, Ian T

    2015-05-01

    Clostridium perfringens can be isolated from a range of environments, including soil, marine and fresh water sediments, and the gastrointestinal tracts of animals and humans. Some C. perfringens strains have attractive industrial applications, e.g., in the degradation of waste products or the production of useful chemicals. However, C. perfringens has been most studied as the causative agent of a range of enteric and soft tissue infections of varying severities in humans and animals. Host preference and disease type in C. perfringens are intimately linked to the production of key extracellular toxins and on this basis toxigenic C. perfringens strains have been classified into five toxinotypes (A-E). To date, twelve genome sequences have been generated for a diverse collection of C. perfringens isolates, including strains associated with human and animal infections, a human commensal strain, and a strain with potential industrial utility. Most of the sequenced strains are classified as toxinotype A. However, genome sequences of representative strains from each of the other four toxinotypes have also been determined. Analysis of this collection of sequences has highlighted a lack of features differentiating toxinotype A strains from the other isolates, indicating that the primary defining characteristic of toxinotype A strains is their lack of key plasmid-encoded extracellular toxin genes associated with toxinotype B to E strains. The representative B-E strains sequenced to date each harbour many unique genes. Additional genome sequences are needed to determine if these genes are characteristic of their respective toxinotypes. Copyright © 2014. Published by Elsevier Masson SAS.

  14. Chætognath transcriptome reveals ancestral and unique features among bilaterians

    PubMed Central

    Marlétaz, Ferdinand; Gilles, André; Caubit, Xavier; Perez, Yvan; Dossat, Carole; Samain, Sylvie; Gyapay, Gabor; Wincker, Patrick; Le Parco, Yannick

    2008-01-01

    Background The chætognaths (arrow worms) have puzzled zoologists for years because of their astonishing morphological and developmental characteristics. Despite their deuterostome-like development, phylogenomic studies recently positioned the chætognath phylum in protostomes, most likely in an early branching. This key phylogenetic position and the peculiar characteristics of chætognaths prompted further investigation of their genomic features. Results Transcriptomic and genomic data were collected from the chætognath Spadella cephaloptera through the sequencing of expressed sequence tags and genomic bacterial artificial chromosome clones. Transcript comparisons at various taxonomic scales emphasized the conservation of a core gene set and phylogenomic analysis confirmed the basal position of chætognaths among protostomes. A detailed survey of transcript diversity and individual genotyping revealed a past genome duplication event in the chætognath lineage, which was, surprisingly, followed by a high retention rate of duplicated genes. Moreover, striking genetic heterogeneity was detected within the sampled population at the nuclear and mitochondrial levels but cannot be explained by cryptic speciation. Finally, we found evidence for trans-splicing maturation of transcripts through splice-leader addition in the chætognath phylum and we further report that this processing is associated with operonic transcription. Conclusion These findings reveal both shared ancestral and unique derived characteristics of the chætognath genome, which suggests that this genome is likely the product of a very original evolutionary history. These features promote chætognaths as a pivotal model for comparative genomics, which could provide new clues for the investigation of the evolution of animal genomes. PMID:18533022

  15. Possible ancient giant basin and related water enrichment in the Arabia Terra province, Mars

    USGS Publications Warehouse

    Dohm, J.M.; Barlow, N.G.; Anderson, R.C.; Williams, J.-P.; Miyamoto, H.; Ferris, J.C.; Strom, R.G.; Taylor, G.J.; Fairen, A.G.; Baker, V.R.; Boynton, W.V.; Keller, J.M.; Kerry, K.; Janes, D.; Rodriguez, J.A.P.; Hare, T.M.

    2007-01-01

    A circular albedo feature in the Arabia Terra province was first hypothesized as an ancient impact basin using Viking-era information. To test this unpublished hypothesis, we have analyzed the Viking era-information together with layers of new data derived from the Mars Global Surveyor (MGS) and Mars Odyssey (MO) missions. Our analysis indicates that Arabia Terra is an ancient geologic province of Mars with many distinct characteristics, including predominantly Noachian materials, a unique part of the highland-lowland boundary, a prominent paleotectonic history, the largest region of fretted terrain on the planet, outflow channels with no obvious origins, extensive exposures of eroded layered sedimentary deposits, and notable structural, albedo, thermal inertia, gravity, magnetic, and elemental signatures. The province also is marked by special impact crater morphologies, which suggest a persistent volatile-rich substrate. No one characteristic provides definitive answers to the dominant event(s) that shaped this unique province. Collectively the characteristics reported here support the following hypothesized sequence of events in Arabia Terra: (1) an enormous basin, possibly of impact origin, formed early in martian history when the magnetic dynamo was active and the lithosphere was relatively thin, (2) sediments and other materials were deposited in the basin during high erosion rates while maintaining isostatic equilibrium, (3) sediments became water enriched during the Noachian Period, and (4) basin materials were uplifted in response to the growth of the Tharsis Bulge, resulting in differential erosion exposing ancient stratigraphic sequences. Parts of the ancient basin remain water-enriched to the present day. ?? 2007 Elsevier Inc. All rights reserved.

  16. Transcriptomic analysis of Ruditapes philippinarum hemocytes reveals cytoskeleton disruption after in vitro Vibrio tapetis challenge.

    PubMed

    Brulle, Franck; Jeffroy, Fanny; Madec, Stéphanie; Nicolas, Jean-Louis; Paillard, Christine

    2012-10-01

    The Manila clam, Ruditapes philippinarum, is an economically-important, commercial shellfish; harvests are diminished in some European waters by a pathogenic bacterium, Vibrio tapetis, that causes Brown Ring disease. To identify molecular characteristics associated with susceptibility or resistance to Brown Ring disease, Suppression Subtractive Hybridization (SSH) analyzes were performed to construct cDNA libraries enriched in up- or down-regulated transcripts from clam immune cells, hemocytes, after a 3-h in vitro challenge with cultured V. tapetis. Nine hundred and ninety eight sequences from the two libraries were sequenced, and an in silico analysis identified 235 unique genes. BLAST and "Gene ontology" classification analyzes revealed that 60.4% of the Expressed Sequence Tags (ESTs) have high similarities with genes involved in various physiological functions, such as immunity, apoptosis and cytoskeleton organization; whereas, 39.6% remain unidentified. From the 235 unique genes, we selected 22 candidates based upon physiological function and redundancy in the libraries. Then, Real-Time PCR analysis identified 3 genes related to cytoskeleton organization showing significant variation in expression attributable to V. tapetis exposure. Disruption in regulation of these genes is consistent with the etiologic agent of Brown Ring disease in Manila clams. Copyright © 2012 Elsevier Ltd. All rights reserved.

  17. Prevalence of the F-type lectin domain.

    PubMed

    Bishnoi, Ritika; Khatri, Indu; Subramanian, Srikrishna; Ramya, T N C

    2015-08-01

    F-type lectins are fucolectins with characteristic fucose and calcium-binding sequence motifs and a unique lectin fold (the "F-type" fold). F-type lectins are phylogenetically widespread with selective distribution. Several eukaryotic F-type lectins have been biochemically and structurally characterized, and the F-type lectin domain (FLD) has also been studied in the bacterial proteins, Streptococcus mitis lectinolysin and Streptococcus pneumoniae SP2159. However, there is little knowledge about the extent of occurrence of FLDs and their domain organization, especially, in bacteria. We have now mined the extensive genomic sequence information available in the public databases with sensitive sequence search techniques in order to exhaustively survey prokaryotic and eukaryotic FLDs. We report 437 FLD sequence clusters (clustered at 80% sequence identity) from eukaryotic, eubacterial and viral proteins. Domain architectures are diverse but mostly conserved in closely related organisms, and domain organizations of bacterial FLD-containing proteins are very different from their eukaryotic counterparts, suggesting unique specialization of FLDs to suit different requirements. Several atypical phylogenetic associations hint at lateral transfer. Among eukaryotes, we observe an expansion of FLDs in terms of occurrence and domain organization diversity in the taxa Mollusca, Hemichordata and Branchiostomi, perhaps coinciding with greater emphasis on innate immune strategies in these organisms. The naturally occurring FLDs with diverse domain organizations that we have identified here will be useful for future studies aimed at creating designer molecular platforms for directing desired biological activities to fucosylated glycoconjugates in target niches. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Sequence-Based Discovery Demonstrates That Fixed Light Chain Human Transgenic Rats Produce a Diverse Repertoire of Antigen-Specific Antibodies.

    PubMed

    Harris, Katherine E; Aldred, Shelley Force; Davison, Laura M; Ogana, Heather Anne N; Boudreau, Andrew; Brüggemann, Marianne; Osborn, Michael; Ma, Biao; Buelow, Benjamin; Clarke, Starlynn C; Dang, Kevin H; Iyer, Suhasini; Jorgensen, Brett; Pham, Duy T; Pratap, Payal P; Rangaswamy, Udaya S; Schellenberger, Ute; van Schooten, Wim C; Ugamraj, Harshad S; Vafa, Omid; Buelow, Roland; Trinklein, Nathan D

    2018-01-01

    We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1). This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs) isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.

  19. Phylogenomic analyses and molecular signatures for the class Halobacteria and its two major clades: a proposal for division of the class Halobacteria into an emended order Halobacteriales and two new orders, Haloferacales ord. nov. and Natrialbales ord. nov., containing the novel families Haloferacaceae fam. nov. and Natrialbaceae fam. nov.

    PubMed

    Gupta, Radhey S; Naushad, Sohail; Baker, Sheridan

    2015-03-01

    The Halobacteria constitute one of the largest groups within the Archaea. The hierarchical relationship among members of this large class, which comprises a single order and a single family, has proven difficult to determine based upon 16S rRNA gene trees and morphological and physiological characteristics. This work reports detailed phylogenetic and comparative genomic studies on >100 halobacterial (haloarchaeal) genomes containing representatives from 30 genera to investigate their evolutionary relationships. In phylogenetic trees reconstructed on the basis of 32 conserved proteins, using both neighbour-joining and maximum-likelihood methods, two major clades (clades A and B) encompassing nearly two-thirds of the sequenced haloarchaeal species were strongly supported. Clades grouping the same species/genera were also supported by the 16S rRNA gene trees and trees for several individual highly conserved proteins (RpoC, EF-Tu, UvrD, GyrA, EF-2/EF-G). In parallel, our comparative analyses of protein sequences from haloarchaeal genomes have identified numerous discrete molecular markers in the form of conserved signature indels (CSI) in protein sequences and conserved signature proteins (CSPs) that are found uniquely in specific groups of haloarchaea. Thirteen CSIs in proteins involved in diverse functions and 68 CSPs that are uniquely present in all or most genome-sequenced haloarchaea provide novel molecular means for distinguishing members of the class Halobacteria from all other prokaryotes. The members of clade A are distinguished from all other haloarchaea by the unique shared presence of two CSIs in the ribose operon protein and small GTP-binding protein and eight CSPs that are found specifically in members of this clade. Likewise, four CSIs in different proteins and five other CSPs are present uniquely in members of clade B and distinguish them from all other haloarchaea. Based upon their specific clustering in phylogenetic trees for different gene/protein sequences and the unique shared presence of large numbers of molecular signatures, members of clades A and B are indicated to be distinct from all other haloarchaea because of their uniquely shared evolutionary histories. Based upon these results, it is proposed that clades A and B be recognized as two new orders, Natrialbales ord. nov. and Haloferacales ord. nov., within the class Halobacteria, containing the novel families Natrialbaceae fam. nov. and Haloferacaceae fam. nov. Other members of the class Halobacteria that are not members of these two orders will remain part of the emended order Halobacteriales in an emended family Halobacteriaceae. © 2015 IUMS.

  20. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  1. Gut microbial profile analysis by MiSeq sequencing of pancreatic carcinoma patients in China

    PubMed Central

    Xie, Haiyang; Li, Ang; Lu, Haifeng; Xu, Shaoyan; Zhou, Lin; Zhang, Hua; Cui, Guangying; Chen, Xinhua; Liu, Yuanxing; Wu, Liming; Qin, Nan; Sun, Ranran; Wang, Wei; Li, Lanjuan; Wang, Weilin; Zheng, Shusen

    2017-01-01

    Pancreatic carcinoma (PC) is a lethal cancer. Gut microbiota is associated with some risk factors of PC, e.g. obesity and types II diabetes. However, the specific gut microbial profile in clinical PC in China has never been reported. This prospective study collected 85 PC and 57 matched healthy controls (HC) to analyze microbial characteristics by MiSeq sequencing. The results showed that gut microbial diversity was decreased in PC with an unique microbial profile, which partly attributed to its decrease of alpha diversity. Microbial alterations in PC featured by the increase of certain pathogens and lipopolysaccharides-producing bacteria, and the decrease of probiotics and butyrate-producing bacteria. Microbial community in obstruction cases was separated from the un-obstructed cases. Streptococcus was associated with the bile. Furthermore, 23 microbial functions e.g. Leucine and LPS biosynthesis were enriched, while 13 functions were reduced in PC. Importantly, based on 40 genera associated with PC, microbial markers achieves a high classification power with AUC of 0.842. In conclusion, gut microbial profile was unique in PC, providing a microbial marker for non-invasive PC diagnosis. PMID:29221120

  2. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sattison, M.B.

    The Idaho National Engineering Laboratory (INEL) over the three years has created 75 plant-specific Accident Sequence Precursor (ASP) models using the SAPHIRE suite of PRA codes. Along with the new models, the INEL has also developed a new module for SAPHIRE which is tailored specifically to the unique needs of ASP evaluations. These models and software will be the next generation of risk tools for the evaluation of accident precursors by both the U.S. Nuclear Regulatory Commission`s (NRC`s) Office of Nuclear Reactor Regulation (NRR) and the Office for Analysis and Evaluation of Operational Data (AEOD). This paper presents an overviewmore » of the models and software. Key characteristics include: (1) classification of the plant models according to plant response with a unique set of event trees for each plant class, (2) plant-specific fault trees using supercomponents, (3) generation and retention of all system and sequence cutsets, (4) full flexibility in modifying logic, regenerating cutsets, and requantifying results, and (5) user interface for streamlined evaluation of ASP events. Future plans for the ASP models is also presented.« less

  3. Encapsulins: microbial nanocompartments with applications in biomedicine, nanobiotechnology and materials science.

    PubMed

    Giessen, Tobias W

    2016-10-01

    Compartmentalization is one of the defining features of life. Cells use protein compartments to exert spatial control over their metabolism, store nutrients and create unique microenvironments needed for essential physiological processes. Encapsulins are a recently discovered class of protein nanocompartments found in bacteria and archaea that naturally encapsulate cargo proteins. A short C-terminal targeting sequence directs the highly specific encapsulation process in vivo. Here, I will initially discuss the properties, diversity and putative function of encapsulins. The unique characteristics and potential uses of the self-sorting cargo-packaging process found in encapsulin systems will then be highlighted. Examples for the application of encapsulins as cell-specific optical nanoprobes and targeted therapeutic delivery systems will be discussed with an emphasis on the ability to integrate multiple functionalities within a single nanodevice. By fusing targeting sequences to non-native proteins, encapsulins can also be used as specific nanocontainers and enzymatic nanoreactors in vivo. I will end by briefly discussing future avenues for encapsulin research related to both basic microbial metabolism and applications in biomedicine, catalysis and materials science. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Generation of the novel monoclonal antibody against TLS/EWS-CHOP chimeric oncoproteins that is applicable to one of the most sensitive assays for myxoid and round cell liposarcomas.

    PubMed

    Oikawa, Kosuke; Ishida, Tsuyoshi; Imamura, Tetsuo; Yoshida, Keiichi; Takanashi, Masakatsu; Hattori, Hiroyuki; Ishikawa, Akio; Fujita, Koji; Yamamoto, Kengo; Matsubayashi, Jun; Kuroda, Masahiko; Mukai, Kiyoshi

    2006-03-01

    The fusion oncoproteins, TLS-CHOP and EWS-CHOP, are characteristic markers for myxoid and round cell liposarcomas (MLS/RCLS). Especially, the peptide sequence of 26 amino acids corresponding to the normally untranslated CHOP exon 2 and parts of exon 3 (5'-UTR) is a unique structure for these chimeric proteins. In this report, we have generated monoclonal antibodies against the unique peptide sequence of TLS/EWS-CHOP oncoproteins. These antibodies reacted with TLS-CHOP fusion protein, but not reacted with normal TLS and CHOP proteins by Western blot analysis. In addition, one of the antibodies also recognized the chimeric oncoprotein in archival paraffin-embedded tissue samples of MLS/RCLS. The oncoprotein was detectable by the antibody even in the paraffin-embedded tissue samples whose mRNAs were too degraded to be detected by a nested reverse transcription-polymerase chain reaction-based assay. Thus, the molecular assay using the novel antibody is expected to be one of the most sensitive diagnostic assays for MLS/RCLS.

  5. RECOVIR Software for Identifying Viruses

    NASA Technical Reports Server (NTRS)

    Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui

    2013-01-01

    Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to identify automatically strains of partial or complete capsid sequences of picorna and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.

  6. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

    PubMed Central

    Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

    2007-01-01

    Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. CRISPRdb is accessible at PMID:17521438

  7. Comparative chloroplast genomics and phylogenetics of Fagopyrum esculentum ssp. ancestrale – A wild ancestor of cultivated buckwheat

    PubMed Central

    Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A

    2008-01-01

    Background Chloroplast genome sequences are extremely informative about species-interrelationships owing to its non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in buckwheat chloroplast genome is similar to Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the bayesian phylogeny inference based on first two codon positions Amborella united with Nymphaeales. Conclusion Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids. PMID:18492277

  8. extendFromReads

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Williams, Kelly P.

    2013-10-03

    This package assists in genome assembly. extendFromReads takes as input a set of Illumina (eg, MiSeq) DNA sequencing reads, a query seed sequence and a direction to extend the seed. The algorithm collects all seed-- ]matching reads (flipping reverse-- ]orientation hits), trims off the seed and additional sequence in the other direction, sorts the remaining sequences alphabetically, and prints them aligned without gaps from the point of seed trimming. This produces a visual display distinguishing the flanks of multi- ]copy seeds. A companion script hitMates.pl collects the mates of seed-- ]hi]ng reads, whose alignment reveals longer extensions from the seed.more » The collect/trim/sort strategy was made iterative and scaled up in the script denovo.pl, for de novo contig assembly. An index is pre-- ]built using indexReads.pl that for each unique 21-- ]mer found in all the reads, records its gfate h of extension (whether extendable, blocked by low coverage, or blocked by branching after a duplicated sequence) and other characteristics. Importantly, denovo.pl records all branchings that follow a branching contig endpoint, providing contig- ]extension information« less

  9. Ultraviolet spectral morphology of the O stars. IV - The OB supergiant sequence

    NASA Technical Reports Server (NTRS)

    Walborn, Nolan R.; Nichols-Bohlin, Joy

    1987-01-01

    An atlas of 25 O3-B8 supergiant spectra in the wavelength ranges 1320-1580 A and 1620-1880 A is presented, based on high-resolution data from the IUE archives. The remarkably detailed relationship between the stellar-wind profiles and the optical spectral classifications throughout this sequence is emphasized. For instance, the (Si IV)/(C IV) ratio reverses between O4 and O6.5; and the B0, B0.5, and B0.7 Ia wind characteristics are each qualitatively unique and distinct from one another. The systematic behavior of nine stellar-wind features with ionization potentials ranging from 114 to 19 eV is summarized as a function of advancing spectral type.

  10. The role of molecular structure of sugar-phosphate backbone and nucleic acid bases in the formation of single-stranded and double-stranded DNA structures.

    PubMed

    Poltev, Valeri; Anisimov, Victor M; Danilov, Victor I; Garcia, Dolores; Sanchez, Carolina; Deriabina, Alexandra; Gonzalez, Eduardo; Rivas, Francisco; Polteva, Nina

    2014-06-01

    Our previous DFT computations of deoxydinucleoside monophosphate complexes with Na(+)-ions (dDMPs) have demonstrated that the main characteristics of Watson-Crick (WC) right-handed duplex families are predefined in the local energy minima of dDMPs. In this work, we study the mechanisms of contribution of chemically monotonous sugar-phosphate backbone and the bases into the double helix irregularity. Geometry optimization of sugar-phosphate backbone produces energy minima matching the WC DNA conformations. Studying the conformational variability of dDMPs in response to sequence permutation, we found that simple replacement of bases in the previously fully optimized dDMPs, e.g. by constructing Pyr-Pur from Pur-Pyr, and Pur-Pyr from Pyr-Pur sequences, while retaining the backbone geometry, automatically produces the mutual base position characteristic of the target sequence. Based on that, we infer that the directionality and the preferable regions of the sugar-phosphate torsions, combined with the difference of purines from pyrimidines in ring shape, determines the sequence dependence of the structure of WC DNA. No such sequence dependence exists in dDMPs corresponding to other DNA conformations (e.g., Z-family and Hoogsteen duplexes). Unlike other duplexes, WC helix is unique by its ability to match the local energy minima of the free single strand to the preferable conformations of the duplex. Copyright © 2013 Wiley Periodicals, Inc.

  11. Characteristic motifs for families of allergenic proteins

    PubMed Central

    Ivanciuc, Ovidiu; Garcia, Tzintzuni; Torres, Miguel; Schein, Catherine H.; Braun, Werner

    2008-01-01

    The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver Motif-Mate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins. PMID:18951633

  12. High-Throughput Block Optical DNA Sequence Identification.

    PubMed

    Sagar, Dodderi Manjunatha; Korshoj, Lee Erik; Hanson, Katrina Bethany; Chowdhury, Partha Pratim; Otoupal, Peter Britton; Chatterjee, Anushree; Nagpal, Prashant

    2018-01-01

    Optical techniques for molecular diagnostics or DNA sequencing generally rely on small molecule fluorescent labels, which utilize light with a wavelength of several hundred nanometers for detection. Developing a label-free optical DNA sequencing technique will require nanoscale focusing of light, a high-throughput and multiplexed identification method, and a data compression technique to rapidly identify sequences and analyze genomic heterogeneity for big datasets. Such a method should identify characteristic molecular vibrations using optical spectroscopy, especially in the "fingerprinting region" from ≈400-1400 cm -1 . Here, surface-enhanced Raman spectroscopy is used to demonstrate label-free identification of DNA nucleobases with multiplexed 3D plasmonic nanofocusing. While nanometer-scale mode volumes prevent identification of single nucleobases within a DNA sequence, the block optical technique can identify A, T, G, and C content in DNA k-mers. The content of each nucleotide in a DNA block can be a unique and high-throughput method for identifying sequences, genes, and other biomarkers as an alternative to single-letter sequencing. Additionally, coupling two complementary vibrational spectroscopy techniques (infrared and Raman) can improve block characterization. These results pave the way for developing a novel, high-throughput block optical sequencing method with lossy genomic data compression using k-mer identification from multiplexed optical data acquisition. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. A reassessment of IgM memory subsets in humans

    PubMed Central

    Bagnara, Davide; Squillario, Margherita; Kipling, David; Mora, Thierry; Walczak, Aleksandra M.; Da Silva, Lucie; Weller, Sandra; Dunn-Walters, Deborah K.; Weill, Jean-Claude; Reynaud, Claude-Agnès

    2015-01-01

    From paired blood and spleen samples from three adult donors we performed high-throughput V-h sequencing of human B-cell subsets defined by IgD and CD27 expression: IgD+CD27+ (“MZ”), IgD−CD27+(“memory”, including IgM (“IgM-only”), IgG and IgA) and IgD−CD27− cells (“double-negative”, including IgM, IgG and IgA). 91,294 unique sequences clustered in 42,670 clones, revealing major clonal expansions in each of these subsets. Among these clones, we further analyzed those shared sequences from different subsets or tissues for Vh-gene mutation, H-CDR3-length, and Vh/Jh usage, comparing these different characteristics with all sequences from their subset of origin, for which these parameters constitute a distinct signature. The IgM-only repertoire profile differed notably from that of MZ B cells by a higher mutation frequency, and lower Vh4 and higher Jh6 gene usage. Strikingly, IgM sequences from clones shared between the MZ and the memory IgG/IgA compartments showed a mutation and repertoire profile of IgM-only and not of MZ B cells. Similarly, all IgM clonal relationships (between MZ, IgM-only, and double-negative compartments) involved sequences with the characteristics of IgM-only B cells. Finally, clonal relationships between tissues suggested distinct recirculation characteristics between MZ and switched B cells. The “IgM-only” subset (including cells with its repertoire signature but higher IgD or lower CD27 expression levels) thus appear as the only subset showing precursor-product relationships with CD27+ switched memory B cells, indicating that they represent germinal center-derived IgM memory B cells, and that IgM memory and MZ B cells constitute two distinct entities. PMID:26355154

  14. A Reassessment of IgM Memory Subsets in Humans.

    PubMed

    Bagnara, Davide; Squillario, Margherita; Kipling, David; Mora, Thierry; Walczak, Aleksandra M; Da Silva, Lucie; Weller, Sandra; Dunn-Walters, Deborah K; Weill, Jean-Claude; Reynaud, Claude-Agnès

    2015-10-15

    From paired blood and spleen samples from three adult donors, we performed high-throughput VH sequencing of human B cell subsets defined by IgD and CD27 expression: IgD(+)CD27(+) ("marginal zone [MZ]"), IgD(-)CD27(+) ("memory," including IgM ["IgM-only"], IgG and IgA) and IgD(-)CD27(-) cells ("double-negative," including IgM, IgG, and IgA). A total of 91,294 unique sequences clustered in 42,670 clones, revealing major clonal expansions in each of these subsets. Among these clones, we further analyzed those shared sequences from different subsets or tissues for VH gene mutation, H-CDR3-length, and VH/JH usage, comparing these different characteristics with all sequences from their subset of origin for which these parameters constitute a distinct signature. The IgM-only repertoire profile differed notably from that of MZ B cells by a higher mutation frequency and lower VH4 and higher JH6 gene usage. Strikingly, IgM sequences from clones shared between the MZ and the memory IgG/IgA compartments showed a mutation and repertoire profile of IgM-only and not of MZ B cells. Similarly, all IgM clonal relationships (among MZ, IgM-only, and double-negative compartments) involved sequences with the characteristics of IgM-only B cells. Finally, clonal relationships between tissues suggested distinct recirculation characteristics between MZ and switched B cells. The "IgM-only" subset (including cells with its repertoire signature but higher IgD or lower CD27 expression levels) thus appear as the only subset showing precursor-product relationships with CD27(+) switched memory B cells, indicating that they represent germinal center-derived IgM memory B cells and that IgM memory and MZ B cells constitute two distinct entities. Copyright © 2015 by The American Association of Immunologists, Inc.

  15. Comparison of biological and genomic characteristics between a newly isolated mink enteritis parvovirus MEV-LHV and an attenuated strain MEV-L.

    PubMed

    Mao, Yaping; Wang, Jigui; Hou, Qiang; Xi, Ji; Zhang, Xiaomei; Bian, Dawei; Yu, Yongle; Wang, Xi; Liu, Weiquan

    2016-06-01

    A virus isolated from mink showing clinical signs of enteritis was identified as a high virulent mink enteritis parvovirus (MEV) based on its biological characteristics in vivo and in vitro. Mink, challenged with this strain named MEV-LHV, exhibited severe pathological lesions as compared to those challenged with attenuated strain MEV-L. MEV-LHV also showed higher infection and replication efficiencies in vitro than MEV-L. Sequence of the complete genome of MEV-LHV was determined and analyzed in comparison with those in GenBank, which revealed that MEV-LHV shared high homology with virulent strain MEV SD12/01, whereas MEV-L was closely related to Abashiri and vaccine strain MEVB, and belonged to a different branch of the phylogenetic tree. The genomes of the two strains differed by insertions and deletions in their palindromic termini and specific unique mutations (especially VP2 300) in coding sequences which may be involved in viral replication and pathogenicity. The results of this study provide a better understanding of the biological and genomic characteristics of MEV and identify certain regions and sites that may be involved in viral replication and pathogenicity.

  16. Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

    PubMed Central

    Aslam, Luqman; Beal, Kathryn; Ann Blomberg, Le; Bouffard, Pascal; Burt, David W.; Crasta, Oswald; Crooijmans, Richard P. M. A.; Cooper, Kristal; Coulombe, Roger A.; De, Supriyo; Delany, Mary E.; Dodgson, Jerry B.; Dong, Jennifer J.; Evans, Clive; Frederickson, Karin M.; Flicek, Paul; Florea, Liliana; Folkerts, Otto; Groenen, Martien A. M.; Harkins, Tim T.; Herrero, Javier; Hoffmann, Steve; Megens, Hendrik-Jan; Jiang, Andrew; de Jong, Pieter; Kaiser, Pete; Kim, Heebal; Kim, Kyu-Won; Kim, Sungwon; Langenberger, David; Lee, Mi-Kyung; Lee, Taeheon; Mane, Shrinivasrao; Marcais, Guillaume; Marz, Manja; McElroy, Audrey P.; Modise, Thero; Nefedov, Mikhail; Notredame, Cédric; Paton, Ian R.; Payne, William S.; Pertea, Geo; Prickett, Dennis; Puiu, Daniela; Qioa, Dan; Raineri, Emanuele; Ruffier, Magali; Salzberg, Steven L.; Schatz, Michael C.; Scheuring, Chantel; Schmidt, Carl J.; Schroeder, Steven; Searle, Stephen M. J.; Smith, Edward J.; Smith, Jacqueline; Sonstegard, Tad S.; Stadler, Peter F.; Tafer, Hakim; Tu, Zhijian (Jake); Van Tassell, Curtis P.; Vilella, Albert J.; Williams, Kelly P.; Yorke, James A.; Zhang, Liqing; Zhang, Hong-Bin; Zhang, Xiaojun; Zhang, Yang; Reed, Kent M.

    2010-01-01

    A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest. PMID:20838655

  17. Sleep-dependent learning and motor-skill complexity

    PubMed Central

    Kuriyama, Kenichi; Stickgold, Robert; Walker, Matthew P.

    2004-01-01

    Learning of a procedural motor-skill task is known to progress through a series of unique memory stages. Performance initially improves during training, and continues to improve, without further rehearsal, across subsequent periods of sleep. Here, we investigate how this delayed sleep-dependent learning is affected when the task characteristics are varied across several degrees of difficulty, and whether this improvement differentially enhances individual transitions of the motor-sequence pattern being learned. We report that subjects show similar overnight improvements in speed whether learning a five-element unimanual sequence (17.7% improvement), a nine-element unimanual sequence (20.2%), or a five-element bimanual sequence (17.5%), but show markedly increased overnight improvement (28.9%) with a nine-element bimanual sequence. In addition, individual transitions within the motor-sequence pattern that appeared most difficult at the end of training showed a significant 17.8% increase in speed overnight, whereas those transitions that were performed most rapidly at the end of training showed only a non-significant 1.4% improvement. Together, these findings suggest that the sleep-dependent learning process selectively provides maximum benefit to motor-skill procedures that proved to be most difficult prior to sleep. PMID:15576888

  18. The mitochondrial genome sequences of the round goby and the sand goby reveal patterns of recent evolution in gobiid fish.

    PubMed

    Adrian-Kalchhauser, Irene; Svensson, Ola; Kutschera, Verena E; Alm Rosenblad, Magnus; Pippel, Martin; Winkler, Sylke; Schloissnig, Siegfried; Blomberg, Anders; Burkhardt-Holm, Patricia

    2017-02-16

    Vertebrate mitochondrial genomes are optimized for fast replication and low cost of RNA expression. Accordingly, they are devoid of introns, are transcribed as polycistrons and contain very little intergenic sequences. Usually, vertebrate mitochondrial genomes measure between 16.5 and 17 kilobases (kb). During genome sequencing projects for two novel vertebrate models, the invasive round goby and the sand goby, we found that the sand goby genome is exceptionally small (16.4 kb), while the mitochondrial genome of the round goby is much larger than expected for a vertebrate. It is 19 kb in size and is thus one of the largest fish and even vertebrate mitochondrial genomes known to date. The expansion is attributable to a sequence insertion downstream of the putative transcriptional start site. This insertion carries traces of repeats from the control region, but is mostly novel. To get more information about this phenomenon, we gathered all available mitochondrial genomes of Gobiidae and of nine gobioid species, performed phylogenetic analyses, analysed gene arrangements, and compared gobiid mitochondrial genome sizes, ecological information and other species characteristics with respect to the mitochondrial phylogeny. This allowed us amongst others to identify a unique arrangement of tRNAs among Ponto-Caspian gobies. Our results indicate that the round goby mitochondrial genome may contain novel features. Since mitochondrial genome organisation is tightly linked to energy metabolism, these features may be linked to its invasion success. Also, the unique tRNA arrangement among Ponto-Caspian gobies may be helpful in studying the evolution of this highly adaptive and invasive species group. Finally, we find that the phylogeny of gobiids can be further refined by the use of longer stretches of linked DNA sequence.

  19. Chemical property based sequence characterization of PpcA and its homolog proteins PpcB-E: A mathematical approach

    PubMed Central

    Pal Choudhury, Pabitra

    2017-01-01

    Periplasmic c7 type cytochrome A (PpcA) protein is determined in Geobacter sulfurreducens along with its other four homologs (PpcB-E). From the crystal structure viewpoint the observation emerges that PpcA protein can bind with Deoxycholate (DXCA), while its other homologs do not. But it is yet to be established with certainty the reason behind this from primary protein sequence information. This study is primarily based on primary protein sequence analysis through the chemical basis of embedded amino acids. Firstly, we look for the chemical group specific score of amino acids. Along with this, we have developed a new methodology for the phylogenetic analysis based on chemical group dissimilarities of amino acids. This new methodology is applied to the cytochrome c7 family members and pinpoint how a particular sequence is differing with others. Secondly, we build a graph theoretic model on using amino acid sequences which is also applied to the cytochrome c7 family members and some unique characteristics and their domains are highlighted. Thirdly, we search for unique patterns as subsequences which are common among the group or specific individual member. In all the cases, we are able to show some distinct features of PpcA that emerges PpcA as an outstanding protein compared to its other homologs, resulting towards its binding with deoxycholate. Similarly, some notable features for the structurally dissimilar protein PpcD compared to the other homologs are also brought out. Further, the five members of cytochrome family being homolog proteins, they must have some common significant features which are also enumerated in this study. PMID:28362850

  20. Comprehensive definition of genome features in Spirodela polyrhiza by high-depth physical mapping and short-read DNA sequencing strategies.

    PubMed

    Michael, Todd P; Bryant, Douglas; Gutierrez, Ryan; Borisjuk, Nikolai; Chu, Philomena; Zhang, Hanzhong; Xia, Jing; Zhou, Junfei; Peng, Hai; El Baidouri, Moaine; Ten Hallers, Boudewijn; Hastie, Alex R; Liang, Tiffany; Acosta, Kenneth; Gilbert, Sarah; McEntee, Connor; Jackson, Scott A; Mockler, Todd C; Zhang, Weixiong; Lam, Eric

    2017-02-01

    Spirodela polyrhiza is a fast-growing aquatic monocot with highly reduced morphology, genome size and number of protein-coding genes. Considering these biological features of Spirodela and its basal position in the monocot lineage, understanding its genome architecture could shed light on plant adaptation and genome evolution. Like many draft genomes, however, the 158-Mb Spirodela genome sequence has not been resolved to chromosomes, and important genome characteristics have not been defined. Here we deployed rapid genome-wide physical maps combined with high-coverage short-read sequencing to resolve the 20 chromosomes of Spirodela and to empirically delineate its genome features. Our data revealed a dramatic reduction in the number of the rDNA repeat units in Spirodela to fewer than 100, which is even fewer than that reported for yeast. Consistent with its unique phylogenetic position, small RNA sequencing revealed 29 Spirodela-specific microRNA, with only two being shared with Elaeis guineensis (oil palm) and Musa balbisiana (banana). Combining DNA methylation data and small RNA sequencing enabled the accurate prediction of 20.5% long terminal repeats (LTRs) that doubled the previous estimate, and revealed a high Solo:Intact LTR ratio of 8.2. Interestingly, we found that Spirodela has the lowest global DNA methylation levels (9%) of any plant species tested. Taken together our results reveal a genome that has undergone reduction, likely through eliminating non-essential protein coding genes, rDNA and LTRs. In addition to delineating the genome features of this unique plant, the methodologies described and large-scale genome resources from this work will enable future evolutionary and functional studies of this basal monocot family. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  1. Singular over-representation of an octameric palindrome, HIP1, in DNA from many cyanobacteria.

    PubMed

    Robinson, N J; Robinson, P J; Gupta, A; Bleasby, A J; Whitton, B A; Morby, A P

    1995-03-11

    An octameric palindrome (5'-GCGATCGC-3') is abundant in cyanobacterial sequences within databases (GenBank/EMBL) and was designated HIP1 (highly iterated palindrome). The frequency of occurrence of all 256 octameric palindromes has now been determined in sub-databases revealing large and unique over-representation of HIP1 in cyanobacterial entries. DNA sequences from other bacteria were searched for any over-represented octameric palindromes analogous to HIP1. Only two sequences were identified, in the genomes of a thermophile and halophilic archaebacteria, although these were less abundant than HIP1 in cyanobacteria and relate to codon usage. To test the proposed widespread distribution of HIP1 in DNA from the cyanobacterium Synechococcus PCC 6301, randomly selected genomic clones were partly sequenced. HIP1 constituted 2.5% of the novel sequences, equivalent to a site on average once every 320 nucleotides. An oligonucleotide including HIP1 was also tested in PCR. Multiple products were obtained using template DNA from cyanobacterial strains in which HIP1 is abundant in known sequences, and some strains generated characteristic HIP-PCR banding patterns. However, analysis of DNA from one strain (not previously represented in databases) by random sequencing, HIP-PCR and Pvul digestion, confirms that not all cyanobacterial genomes are rich in HIP1.

  2. Soda pans of the Pannonian steppe harbor unique bacterial communities adapted to multiple extreme conditions.

    PubMed

    Szabó, Attila; Korponai, Kristóf; Kerepesi, Csaba; Somogyi, Boglárka; Vörös, Lajos; Bartha, Dániel; Márialigeti, Károly; Felföldi, Tamás

    2017-05-01

    Soda pans of the Pannonian steppe are unique environments regarding their physical and chemical characteristics: shallowness, high turbidity, intermittent character, alkaline pH, polyhumic organic carbon concentration, hypertrophic condition, moderately high salinity, sodium and carbonate ion dominance. The pans are highly productive environments with picophytoplankton predominance. Little is known about the planktonic bacterial communities inhabiting these aquatic habitats; therefore, amplicon sequencing and shotgun metagenomics were applied to reveal their composition and functional properties. Results showed a taxonomically complex bacterial community which was distinct from other soda lakes regarding its composition, e.g. the dominance of class Alphaproteobacteria was observed within phylum Proteobacteria. The shotgun metagenomic analysis revealed several functional gene components related to the harsh and at the same time hypertrophic environmental conditions, e.g. proteins involved in stress response, transport and hydrolase systems targeting phytoplankton-derived organic matter. This is the first detailed report on the indigenous planktonic bacterial communities coping with the multiple extreme conditions present in the unique soda pans of the Pannonian steppe.

  3. Is Life Unique?

    PubMed Central

    Abel, David L.

    2011-01-01

    Is life physicochemically unique? No. Is life unique? Yes. Life manifests innumerable formalisms that cannot be generated or explained by physicodynamics alone. Life pursues thousands of biofunctional goals, not the least of which is staying alive. Neither physicodynamics, nor evolution, pursue goals. Life is largely directed by linear digital programming and by the Prescriptive Information (PI) instantiated particularly into physicodynamically indeterminate nucleotide sequencing. Epigenomic controls only compound the sophistication of these formalisms. Life employs representationalism through the use of symbol systems. Life manifests autonomy, homeostasis far from equilibrium in the harshest of environments, positive and negative feedback mechanisms, prevention and correction of its own errors, and organization of its components into Sustained Functional Systems (SFS). Chance and necessity—heat agitation and the cause-and-effect determinism of nature’s orderliness—cannot spawn formalisms such as mathematics, language, symbol systems, coding, decoding, logic, organization (not to be confused with mere self-ordering), integration of circuits, computational success, and the pursuit of functionality. All of these characteristics of life are formal, not physical. PMID:25382119

  4. Identification of a Unique Amyloid Sequence in AA Amyloidosis of a Pig Associated With Streptococcus Suis Infection.

    PubMed

    Kamiie, J; Sugahara, G; Yoshimoto, S; Aihara, N; Mineshige, T; Uetsuka, K; Shirota, K

    2017-01-01

    Here we report a pig with amyloid A (AA) amyloidosis associated with Streptococcus suis infection and identification of a unique amyloid sequence in the amyloid deposits in the tissue. Tissues from the 180-day-old underdeveloped pig contained foci of necrosis and suppurative inflammation associated with S. suis infection. Congo red stain, immunohistochemistry, and electron microscopy revealed intense AA deposition in the spleen and renal glomeruli. Mass spectrometric analysis of amyloid material extracted from the spleen showed serum AA 2 (SAA2) peptide as well as a unique peptide sequence previously reported in a pig with AA amyloidosis. The common detection of the unique amyloid sequence in the current and past cases of AA amyloidosis in pigs suggests that this amyloid sequence might play a key role in the development of porcine AA amyloidosis. An in vitro fibrillation assay demonstrated that the unique AA peptide formed typically rigid, long amyloid fibrils (10 nm wide) and the N-terminus peptide of SAA2 formed zigzagged, short fibers (7 nm wide). Moreover, the SAA2 peptide formed long, rigid amyloid fibrils in the presence of sonicated amyloid fibrils formed by the unique AA peptide. These findings indicate that the N-terminus of SAA2 as well as the AA peptide mediate the development of AA amyloidosis in pigs via cross-seeding polymerization.

  5. Geology of the Devonian black shales of the Appalachian basin

    USGS Publications Warehouse

    Roen, J.B.

    1983-01-01

    Black shales of Devonian age in the Appalachian basin are a unique rock sequence. The high content of organic matter, which imparts the characteristic lithology, has for years attracted considerable interest in the shales as a possible source of energy. Concurrent with periodic and varied economic exploitations of the black shales are geologic studies. The recent energy shortage prompted the U.S. Department of Energy through the Eastern Gas Shales Project of the Morgantown Energy Technology Center to underwrite a research program to determine the geologic, geochemical, and structural characteristics of the Devonian black shales in order to enhance the recovery of gas from the shales. Geologic studies produced a regional stratigraphic network that correlates the 15-foot sequence in Tennessee with 3,000 feet of interbedded black and gray shales in central New York. The classic Devonian black-shale sequence in New York has been correlated with the Ohio Shale of Ohio and Kentucky and the Chattanooga Shale of Tennessee and southwestern Virginia. Biostratigraphic and lithostratigraphic markers in conjunction with gamma-ray logs facilitated long range correlations within the Appalachian basin and provided a basis for correlations with the black shales of the Illinois and Michigan basins. Areal distribution of selected shale units along with paleocurrent studies, clay mineralogy, and geochemistry suggests variations in the sediment source and transport directions. Current structures, faunal evidence, lithologic variations, and geochemical studies provide evidence to support interpretation of depositional environments. In addition, organic geochemical data combined with stratigraphic and structural characteristics of the shale within the basin allow an evaluation of the resource potential of natural gas in the Devonian shale sequence.

  6. Whole-Genome Sequence Analysis of Antimicrobial Resistance Genes in Streptococcus uberis and Streptococcus dysgalactiae Isolates from Canadian Dairy Herds

    PubMed Central

    Vélez, Julián Reyes; Cameron, Marguerite; Rodríguez-Lecompte, Juan Carlos; Xia, Fangfang; Heider, Luke C.; Saab, Matthew; McClure, J. Trenton; Sánchez, Javier

    2017-01-01

    The objectives of this study are to determine the occurrence of antimicrobial resistance (AMR) genes using whole-genome sequence (WGS) of Streptococcus uberis (S. uberis) and Streptococcus dysgalactiae (S. dysgalactiae) isolates, recovered from dairy cows in the Canadian Maritime Provinces. A secondary objective included the exploration of the association between phenotypic AMR and the genomic characteristics (genome size, guanine–cytosine content, and occurrence of unique gene sequences). Initially, 91 isolates were sequenced, and of these isolates, 89 were assembled. Furthermore, 16 isolates were excluded due to larger than expected genomic sizes (>2.3 bp × 1,000 bp). In the final analysis, 73 were used with complete WGS and minimum inhibitory concentration records, which were part of the previous phenotypic AMR study, representing 18 dairy herds from the Maritime region of Canada (1). A total of 23 unique AMR gene sequences were found in the bacterial genomes, with a mean number of 8.1 (minimum: 5; maximum: 13) per genome. Overall, there were 10 AMR genes [ANT(6), TEM-127, TEM-163, TEM-89, TEM-95, Linb, Lnub, Ermb, Ermc, and TetS] present only in S. uberis genomes and 2 genes unique (EF-TU and TEM-71) to the S. dysgalactiae genomes; 11 AMR genes [APH(3′), TEM-1, TEM-136, TEM-157, TEM-47, TetM, bl2b, gyrA, parE, phoP, and rpoB] were found in both bacterial species. Two-way tabulations showed association between the phenotypic susceptibility to lincosamides and the presence of linB (P = 0.002) and lnuB (P < 0.001) genes and the between the presence of tetM (P = 0.015) and tetS (P = 0.064) genes and phenotypic resistance to tetracyclines only for the S. uberis isolates. The logistic model showed that the odds of resistance (to any of the phenotypically tested antimicrobials) was 4.35 times higher when there were >11 AMR genes present in the genome, compared with <7 AMR genes (P < 0.001). The odds of resistance was lower for S. dysgalactiae than S. uberis (P = 0.031). When the within-herd somatic cell count was >250,000 cells/mL, a trend toward higher odds of resistance compared with the baseline category of <150,000 cells/mL was observed. When the isolate corresponded to a post-mastitis sample, there were lower odds of resistance when compared with non-clinical isolates (P = 0.01). The results of this study showed the strength of associations between phenotypic AMR resistance of both mastitis pathogens and their genotypic resistome and other epidemiological characteristics. PMID:28589129

  7. A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters.

    PubMed

    Sun, Chia-Tsen; Chiang, Austin W T; Hwang, Ming-Jing

    2017-10-27

    Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clustering analysis on presence/absence data for 6,580 unique PDs in 2,134 species with a sequenced genome, thus covering a complete set of proteins, for the three superkingdoms of life, Bacteria, Archaea, and Eukarya. Our analysis revealed eight distinctive PD clusters, which, following an analysis of enrichment of Gene Ontology functions and CATH classification of protein structures, were shown to exhibit structural and functional properties that are taxa-characteristic. For examples, the largest cluster is ubiquitous in all three superkingdoms, constituting a set of 1,472 persistent domains created early in evolution and retained in living organisms and characterized by basic cellular functions and ancient structural architectures, while an Archaea and Eukarya bi-superkingdom cluster suggests its PDs may have existed in the ancestor of the two superkingdoms, and others are single superkingdom- or taxa (e.g. Fungi)-specific. These results contribute to increase our appreciation of PD diversity and our knowledge of how PDs are used in species, yielding implications on species evolution.

  8. Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates.

    PubMed

    Heunis, Tiaan; Dippenaar, Anzaan; Warren, Robin M; van Helden, Paul D; van der Merwe, Ruben G; Gey van Pittius, Nicolaas C; Pain, Arnab; Sampson, Samantha L; Tabb, David L

    2017-10-06

    Mycobacterium tuberculosis consists of a large number of different strains that display unique virulence characteristics. Whole-genome sequencing has revealed substantial genetic diversity among clinical M. tuberculosis isolates, and elucidating the phenotypic variation encoded by this genetic diversity will be of the utmost importance to fully understand M. tuberculosis biology and pathogenicity. In this study, we integrated whole-genome sequencing and mass spectrometry (GeLC-MS/MS) to reveal strain-specific characteristics in the proteomes of two clinical M. tuberculosis Latin American-Mediterranean isolates. Using this approach, we identified 59 peptides containing single amino acid variants, which covered ∼9% of all coding nonsynonymous single nucleotide variants detected by whole-genome sequencing. Furthermore, we identified 29 distinct peptides that mapped to a hypothetical protein not present in the M. tuberculosis H37Rv reference proteome. Here, we provide evidence for the expression of this protein in the clinical M. tuberculosis SAWC3651 isolate. The strain-specific databases enabled confirmation of genomic differences (i.e., large genomic regions of difference and nonsynonymous single nucleotide variants) in these two clinical M. tuberculosis isolates and allowed strain differentiation at the proteome level. Our results contribute to the growing field of clinical microbial proteogenomics and can improve our understanding of phenotypic variation in clinical M. tuberculosis isolates.

  9. The Genome of the Netherlands: design, and project goals.

    PubMed

    Boomsma, Dorret I; Wijmenga, Cisca; Slagboom, Eline P; Swertz, Morris A; Karssen, Lennart C; Abdellaoui, Abdel; Ye, Kai; Guryev, Victor; Vermaat, Martijn; van Dijk, Freerk; Francioli, Laurent C; Hottenga, Jouke Jan; Laros, Jeroen F J; Li, Qibin; Li, Yingrui; Cao, Hongzhi; Chen, Ruoyan; Du, Yuanping; Li, Ning; Cao, Sujie; van Setten, Jessica; Menelaou, Androniki; Pulit, Sara L; Hehir-Kwa, Jayne Y; Beekman, Marian; Elbers, Clara C; Byelas, Heorhiy; de Craen, Anton J M; Deelen, Patrick; Dijkstra, Martijn; den Dunnen, Johan T; de Knijff, Peter; Houwing-Duistermaat, Jeanine; Koval, Vyacheslav; Estrada, Karol; Hofman, Albert; Kanterakis, Alexandros; Enckevort, David van; Mai, Hailiang; Kattenberg, Mathijs; van Leeuwen, Elisabeth M; Neerincx, Pieter B T; Oostra, Ben; Rivadeneira, Fernanodo; Suchiman, Eka H D; Uitterlinden, Andre G; Willemsen, Gonneke; Wolffenbuttel, Bruce H; Wang, Jun; de Bakker, Paul I W; van Ommen, Gert-Jan; van Duijn, Cornelia M

    2014-02-01

    Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.

  10. Unique Phylogenetic Lineage Found in the Fusarium-like Clade after Re-examining BCCM/IHEM Fungal Culture Collection Material

    PubMed Central

    De Cremer, Koen; Piérard, Denis; Hendrickx, Marijke

    2016-01-01

    Recently, the Fusarium genus has been narrowed based upon phylogenetic analyses and a Fusarium-like clade was adopted. The few species of the Fusarium-like clade were moved to new, re-installed or existing genera or provisionally retained as "Fusarium." Only a limited number of reference strains and DNA marker sequences are available for this clade and not much is known about its actual species diversity. Here, we report six strains, preserved by the Belgian fungal culture collection BCCM/IHEM as a Fusarium species, that belong to the Fusarium-like clade. They showed a slow growth and produced pionnotes, typical morphological characteristics of many Fusarium-like species. Multilocus sequencing with comparative sequence analyses in GenBank and phylogenetic analyses, using reference sequences of type material, confirmed that they were indeed member of the Fusarium-like clade. One strain was identified as "Fusarium" ciliatum whereas another strain was identified as Fusicolla merismoides. The four remaining strains were shown to represent a unique phylogenetic lineage in the Fusarium-like clade and were also found morphologically distinct from other members of the Fusarium-like clade. Based upon phylogenetic considerations, a new genus, Pseudofusicolla gen. nov., and a new species, Pseudofusicolla belgica sp. nov., were installed for this lineage. A formal description is provided in this study. Additional sampling will be required to gather isolates other than the historical strains presented in the present study as well as to further reveal the actual species diversity in the Fusarium-like clade. PMID:27790062

  11. hPDI: a database of experimental human protein-DNA interactions.

    PubMed

    Xie, Zhi; Hu, Shaohui; Blackshaw, Seth; Zhu, Heng; Qian, Jiang

    2010-01-15

    The human protein DNA Interactome (hPDI) database holds experimental protein-DNA interaction data for humans identified by protein microarray assays. The unique characteristics of hPDI are that it contains consensus DNA-binding sequences not only for nearly 500 human transcription factors but also for >500 unconventional DNA-binding proteins, which are completely uncharacterized previously. Users can browse, search and download a subset or the entire data via a web interface. This database is freely accessible for any academic purposes. http://bioinfo.wilmer.jhu.edu/PDI/.

  12. WS2 nanopores for molecule analysis

    NASA Astrophysics Data System (ADS)

    Danda, Gopinath; Masih Das, Paul; Chou, Yung-Chien; Mlack, Jerome; Naylor, Carl; Perea-Lopez, Nestor; Lin, Zhong; Fulton, Laura Beth; Terrones, Mauricio; Johnson, A. T. Charlie; Drndic, Marija

    Atomically thin 2D materials like graphene and transition metal dichalcogenides (TMDs) are interesting as membranes in solid state nanopore sensors for DNA analysis as they may facilitate single base resolution sequencing. These materials also exhibit unique optical and electronic properties which may be exploited to enhance the functionality of nanopore sensors. Here, we report WS2 nanopores, fabricated using a focused TEM beam. We also report their controlled laser-induced expansion in ionic solution. This study demonstrates the possibility of dynamic control of nanopore characteristics optically. NIH Grant R21HG007856, NSF EFRI-1542707.

  13. Single-cell Transcriptome Study as Big Data

    PubMed Central

    Yu, Pingjian; Lin, Wei

    2016-01-01

    The rapid growth of single-cell RNA-seq studies (scRNA-seq) demands efficient data storage, processing, and analysis. Big-data technology provides a framework that facilitates the comprehensive discovery of biological signals from inter-institutional scRNA-seq datasets. The strategies to solve the stochastic and heterogeneous single-cell transcriptome signal are discussed in this article. After extensively reviewing the available big-data applications of next-generation sequencing (NGS)-based studies, we propose a workflow that accounts for the unique characteristics of scRNA-seq data and primary objectives of single-cell studies. PMID:26876720

  14. A comprehensive survey of soil acidobacterial diversity using pyrosequencing and clone library analyses

    PubMed Central

    Jones, Ryan T; Robeson, Michael S; Lauber, Christian L; Hamady, Micah; Knight, Rob; Fierer, Noah

    2010-01-01

    Acidobacteria are ubiquitous and abundant members of soil bacterial communities. However, an ecological understanding of this important phylum has remained elusive because its members have been difficult to culture and few molecular investigations have focused exclusively on this group. We generated an unprecedented number of acidobacterial DNA sequence data using pyrosequencing and clone libraries (39 707 and 1787 sequences, respectively) to characterize the relative abundance, diversity and composition of acidobacterial communities across a range of soil types. To gain insight into the ecological characteristics of acidobacterial taxa, we investigated the large-scale biogeographic patterns exhibited by acidobacterial communities, and related soil and site characteristics to acidobacterial community assemblage patterns. The 87 soils analyzed by pyrosequencing contained more than 8600 unique acidobacterial phylotypes (at the 97% sequence similarity level). One phylotype belonging to Acidobacteria subgroup 1, but not closely related to any cultured representatives, was particularly abundant, accounting for 7.4% of bacterial sequences and 17.6% of acidobacterial sequences, on average, across the soils. The abundance of Acidobacteria relative to other bacterial taxa was highly variable across the soils examined, but correlated strongly with soil pH (R = −0.80, P<0.001). Soil pH was also the best predictor of acidobacterial community composition, regardless of how the communities were characterized, and the relative abundances of the dominant Acidobacteria subgroups were readily predictable. Acidobacterial communities were more phylogenetically clustered as soil pH departed from neutrality, suggesting that pH is an effective habitat filter, restricting community membership to progressively more narrowly defined lineages as pH deviates from neutrality. PMID:19129864

  15. Evolutionary and biophysical relationships among the papillomavirus E2 proteins.

    PubMed

    Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael

    2009-01-01

    Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.

  16. Fingerprint multicast in secure video streaming.

    PubMed

    Zhao, H Vicky; Liu, K J Ray

    2006-01-01

    Digital fingerprinting is an emerging technology to protect multimedia content from illegal redistribution, where each distributed copy is labeled with unique identification information. In video streaming, huge amount of data have to be transmitted to a large number of users under stringent latency constraints, so the bandwidth-efficient distribution of uniquely fingerprinted copies is crucial. This paper investigates the secure multicast of anticollusion fingerprinted video in streaming applications and analyzes their performance. We first propose a general fingerprint multicast scheme that can be used with most spread spectrum embedding-based multimedia fingerprinting systems. To further improve the bandwidth efficiency, we explore the special structure of the fingerprint design and propose a joint fingerprint design and distribution scheme. From our simulations, the two proposed schemes can reduce the bandwidth requirement by 48% to 87%, depending on the number of users, the characteristics of video sequences, and the network and computation constraints. We also show that under the constraint that all colluders have the same probability of detection, the embedded fingerprints in the two schemes have approximately the same collusion resistance. Finally, we propose a fingerprint drift compensation scheme to improve the quality of the reconstructed sequences at the decoder's side without introducing extra communication overhead.

  17. The natural armors of fish: A comparison of the lamination pattern and structure of scales

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Murcia, Sandra; Lavoie, Ellen; Linley, Tim

    Fish scales exhibit a unique balance of flexibility, strength and toughness, which is essential to provide protection without encumbering locomotion. Although the mechanical behavior and structure of this natural armor are of recent interest, a comparison of these qualities from scales of different fish species has not been reported. In this investigation the armor of fish with different locomotion, size and protection needs were analyzed. Scales from the Arapaima gigas, the tarpon (Megalops atlanticus) and the carp (Cyprinus carpio) were compared in terms of the stacking sequence of individual plies and their microstructure. The scales were also compared with respectmore » to anatomical position to distinguish site-specific functional differences. Results show that the lamination sequence of plies for the carp and tarpon exhibit a Bouligand structure with relative rotation of 75° between consecutive plies. The arapaima scales exhibit a cross-ply structure, with 90° rotation between adjacent plies. In addition, results indicate that the volume fraction of reinforcement, the number of plies and the variations in thickness with anatomical position are unique amongst the three fish. These characteristics should be considered in evaluations focused on the mechanical behavior.« less

  18. Top-down Proteomics in Health and Disease: Challenges and Opportunities

    PubMed Central

    Gregorich, Zachery R.; Ge, Ying

    2014-01-01

    Proteomics is essential for deciphering how molecules interact as a system and for understanding the functions of cellular systems in human disease; however, the unique characteristics of the human proteome, which include a high dynamic range of protein expression and extreme complexity due to a plethora of post-translational modifications (PTMs) and sequence variations, make such analyses challenging. An emerging “top-down” mass spectrometry (MS)-based proteomics approach, which provides a “bird’s eye” view of all proteoforms, has unique advantages for the assessment of PTMs and sequence variations. Recently, a number of studies have showcased the potential of top-down proteomics for unraveling of disease mechanisms and discovery of new biomarkers. Nevertheless, the top-down approach still faces significant challenges in terms of protein solubility, separation, and the detection of large intact proteins, as well as the under-developed data analysis tools. Consequently, new technological developments are urgently needed to advance the field of top-down proteomics. Herein, we intend to provide an overview of the recent applications of top-down proteomics in biomedical research. Moreover, we will outline the challenges and opportunities facing top-down proteomics strategies aimed at understanding and diagnosing human diseases. PMID:24723472

  19. Biological and phylogenetic characteristics of yellow fever virus lineages from West Africa.

    PubMed

    Stock, Nina K; Laraway, Hewád; Faye, Ousmane; Diallo, Mawlouth; Niedrig, Matthias; Sall, Amadou A

    2013-03-01

    The yellow fever virus (YFV), the first proven human-pathogenic virus, although isolated in 1927, is still a major public health problem, especially in West Africa where it causes outbreaks every year. Nevertheless, little is known about its genetic diversity and evolutionary dynamics, mainly due to a limited number of genomic sequences from wild virus isolates. In this study, we analyzed the phylogenetic relationships of 24 full-length genomes from YFV strains isolated between 1973 and 2005 in a sylvatic context of West Africa, including 14 isolates that had previously not been sequenced. By this, we confirmed genetic variability within one genotype by the identification of various YF lineages circulating in West Africa. Further analyses of the biological properties of these lineages revealed differential growth behavior in human liver and insect cells, correlating with the source of isolation and suggesting host adaptation. For one lineage, repeatedly isolated in a context of vertical transmission, specific characteristics in the growth behavior and unique mutations of the viral genome were observed and deserve further investigation to gain insight into mechanisms involved in YFV emergence and maintenance in nature.

  20. Biological and Phylogenetic Characteristics of Yellow Fever Virus Lineages from West Africa

    PubMed Central

    Laraway, Hewád; Faye, Ousmane; Diallo, Mawlouth; Niedrig, Matthias

    2013-01-01

    The yellow fever virus (YFV), the first proven human-pathogenic virus, although isolated in 1927, is still a major public health problem, especially in West Africa where it causes outbreaks every year. Nevertheless, little is known about its genetic diversity and evolutionary dynamics, mainly due to a limited number of genomic sequences from wild virus isolates. In this study, we analyzed the phylogenetic relationships of 24 full-length genomes from YFV strains isolated between 1973 and 2005 in a sylvatic context of West Africa, including 14 isolates that had previously not been sequenced. By this, we confirmed genetic variability within one genotype by the identification of various YF lineages circulating in West Africa. Further analyses of the biological properties of these lineages revealed differential growth behavior in human liver and insect cells, correlating with the source of isolation and suggesting host adaptation. For one lineage, repeatedly isolated in a context of vertical transmission, specific characteristics in the growth behavior and unique mutations of the viral genome were observed and deserve further investigation to gain insight into mechanisms involved in YFV emergence and maintenance in nature. PMID:23269797

  1. Ichthyobodo salmonis sp. n. (Ichthyobodonidae, Kinetoplastida), an euryhaline ectoparasite infecting Atlantic salmon (Salmo salar L.)

    PubMed Central

    ISAKSEN, TROND E.; KARLSBAKK, EGIL; WATANABE, KUNINORI; NYLUND, ARE

    2011-01-01

    SUMMARY Phylogenetic analyses of SSU rDNA sequences have previously revealed the existence of 2 Ichthyobodo species able to infect Atlantic salmon (Salmo salar L.). Ichthyobodo necator sensu stricto (s.s.) is assumed to be a freshwater parasite, while a genetically distinct but undescribed species, Ichthyobodo sp. II sensu Todal et al. (2004) have been detected on Atlantic salmon in both fresh- and seawater. In the present study a morphological description of Ichthyobodo sp. II from the gills of salmon reared in fresh-, brackish- and seawater is presented, using both light- and electron microscopy. Comparative morphometry show that Ichthyobodo sp. II from both freshwater and seawater displays a different cell shape, and is significantly smaller than I. necator s.s. Also, ultrastructural characteristics distinguish these two species, notably differences in the attachment region and the presence of spine-like surface projections in Ichthyobodo sp. II. Based on both unique SSU rDNA sequences and morphological characteristics, we conclude that Ichthyobodo sp. II. represents a novel species for which we propose the name Ichthyobodo salmonis sp. n. PMID:21756424

  2. Establishing effective working relations with a potential user community - NASA Lewis Research Center experience

    NASA Technical Reports Server (NTRS)

    Foster, P.

    1977-01-01

    The NASA Lewis Research Center has held a series of six major and unique technology utilization conferences which were major milestones in planned structured efforts to establish effective working relationships with specific technology user communities. These efforts were unique in that the activities undertaken prior to the conference were extensive, and effectively laid the groundwork for productive technology transfer following, and as a direct result of, the conferences. The effort leading to the conference was in each case tailored to the characteristics of the potential user community, however, the common factors comprise a basic framework applicable to similar endeavors. The process is essentially a planned sequence of steps that constitute a technical market survey and a marketing program for the development of beneficial applications of aerospace technology beyond the aerospace field.

  3. Molecular Cloning and Expression of Three Polygalacturonase cDNAs from the Tarnished Plant Bug, Lygus lineolaris

    PubMed Central

    Allen, Margaret L.; Mertens, Jeffrey A.

    2008-01-01

    Three unique cDNAs encoding putative polygalacturonase enzymes were isolated from the tarnished plant bug, Lygus lineolaris (Palisot de Beauvois) (Hemiptera: Miridae). The three nucleotide sequences were dissimilar to one another, but the deduced amino acid sequences were similar to each other and to other polygalacturonases from insects, fungi, plants, and bacteria. Four conserved segments characteristic of polygalacturonases were present, but with some notable semiconservative substitutions. Two of four expected disulfide bridge—forming cysteine pairs were present. All three inferred protein translations included predicted signal sequences of 17 to 20 amino acids. Amplification of genomic DNA identified an intron in one of the genes, Llpg1, in the 5′ untranslated region. Semiquantitative RT-PCR revealed expression in all stages of the insect except the eggs. Expression in adults, male and female, was highly variable, indicating a family of highly inducible and diverse enzymes adapted to the generalist polyphagous nature of this important pest. PMID:20233096

  4. "Plasmo2D": an ancillary proteomic tool to aid identification of proteins from Plasmodium falciparum.

    PubMed

    Khachane, Amit; Kumar, Ranjit; Jain, Sanyam; Jain, Samta; Banumathy, Gowrishankar; Singh, Varsha; Nagpal, Saurabh; Tatu, Utpal

    2005-01-01

    Bioinformatics tools to aid gene and protein sequence analysis have become an integral part of biology in the post-genomic era. Release of the Plasmodium falciparum genome sequence has allowed biologists to define the gene and the predicted protein content as well as their sequences in the parasite. Using pI and molecular weight as characteristics unique to each protein, we have developed a bioinformatics tool to aid identification of proteins from Plasmodium falciparum. The tool makes use of a Virtual 2-DE generated by plotting all of the proteins from the Plasmodium database on a pI versus molecular weight scale. Proteins are identified by comparing the position of migration of desired protein spots from an experimental 2-DE and that on a virtual 2-DE. The procedure has been automated in the form of user-friendly software called "Plasmo2D". The tool can be downloaded from http://144.16.89.25/Plasmo2D.zip.

  5. Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

    PubMed

    Hong, Jungeui; Gresham, David

    2017-11-01

    Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.

  6. Partial bisulfite conversion for unique template sequencing

    PubMed Central

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

    2018-01-01

    Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423

  7. Whole genome sequencing of Gyeongbuk Araucana, a newly developed blue-egg laying chicken breed, reveals its origin and genetic characteristics.

    PubMed

    Jeong, Hyeonsoo; Kim, Kwondo; Caetano-Anollés, Kelsey; Kim, Heebal; Kim, Byung-Ki; Yi, Jun-Koo; Ha, Jae-Jung; Cho, Seoae; Oh, Dong Yep

    2016-05-24

    Chicken, Gallus gallus, is a valuable species both as a food source and as a model organism for scientific research. Here, we sequenced the genome of Gyeongbuk Araucana, a rare chicken breed with unique phenotypic characteristics including flight ability, large body size, and laying blue-shelled eggs, to identify its genomic features. We generated genomes of Gyeongbuk Araucana, Leghorn, and Korean Native Chicken at a total of 33.5, 35.82, and 33.23 coverage depth, respectively. Along with the genomes of 12 Chinese breeds, we identified genomic variants of 16.3 million SNVs and 2.3 million InDels in mapped regions. Additionally, through assembly of unmapped reads and selective sweep, we identified candidate genes that fall into heart, vasculature and muscle development and body growth categories, which provided insight into Gyeongbuk Araucana's phenotypic traits. Finally, genetic variation based on the transposable element insertion pattern was investigated to elucidate the features of transposable elements related to blue egg shell formation. This study presents results of the first genomic study on the Gyeongbuk Araucana breed; it has potential to serve as an invaluable resource for future research on the genomic characteristics of this chicken breed as well as others.

  8. Statistical analysis of the Bacterial Carbohydrate Structure Data Base (BCSDB): Characteristics and diversity of bacterial carbohydrates in comparison with mammalian glycans

    PubMed Central

    Herget, Stephan; Toukach, Philip V; Ranzinger, René; Hull, William E; Knirel, Yuriy A; von der Lieth, Claus-Wilhelm

    2008-01-01

    Background There are considerable differences between bacterial and mammalian glycans. In contrast to most eukaryotic carbohydrates, bacterial glycans are often composed of repeating units with diverse functions ranging from structural reinforcement to adhesion, colonization and camouflage. Since bacterial glycans are typically displayed at the cell surface, they can interact with the environment and, therefore, have significant biomedical importance. Results The sequence characteristics of glycans (monosaccharide composition, modifications, and linkage patterns) for the higher bacterial taxonomic classes have been examined and compared with the data for mammals, with both similarities and unique features becoming evident. Compared to mammalian glycans, the bacterial glycans deposited in the current databases have a more than ten-fold greater diversity at the monosaccharide level, and the disaccharide pattern space is approximately nine times larger. Specific bacterial subclasses exhibit characteristic glycans which can be distinguished on the basis of distinctive structural features or sequence properties. Conclusion For the first time a systematic database analysis of the bacterial glycome has been performed. This study summarizes the current knowledge of bacterial glycan architecture and diversity and reveals putative targets for the rational design and development of therapeutic intervention strategies by comparing bacterial and mammalian glycans. PMID:18694500

  9. A novel, privacy-preserving cryptographic approach for sharing sequencing data

    PubMed Central

    Cassa, Christopher A; Miller, Rachel A; Mandl, Kenneth D

    2013-01-01

    Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual’s genetic sequence. This scheme requires access to a subset of an individual’s genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual’s genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual’s genetic sequence. PMID:23125421

  10. Analysis of common deafness gene mutations in deaf people from unique ethnic groups in Gansu Province, China.

    PubMed

    Xu, Bai-Cheng; Bian, Pan-Pan; Liu, Xiao-Wen; Zhu, Yi-Ming; Yang, Xiao-Long; Ma, Jian-Li; Chen, Xing-Jian; Wang, Yan-Li; Guo, Yu-Fen

    2014-09-01

    The GJB2 gene mutation characteristic of Dongxiang was the interaction result of ethnic background and geographical environment, and Yugur exhibited the typical founder effect. The SLC26A4 gene mutation characteristic of Dongxiang was related to caucasian backgrounds and selection of purpose exons, i.e. ethnic background and the penetrance of ethnic specificity caused the low mtDNA1555A>G mutation frequency in Dongxiang. To determine the prevalence of GJB2 and SLC26A4 genes and mtDNA1555A>G mutations and analyze the ethnic specificity in the non-syndromic sensorineural hearing loss (NSHL) of unique ethnic groups in Gansu Province. Peripheral blood samples were obtained from Dongxiang, Yugur, Bonan, and ethnic Han groups with moderately severe to profound NSHL in Gansu Province. Bidirectional sequencing (or enzyme digestion) was applied to identify the sequence variations. The pathogenic allele frequency of the three gene mutations was different. The frequency of the GJB2 gene among the Dongxiang, Yugur, Bonan, and ethnic Han groups was 9.03%, 12.5%, 5.88%, and 12.17%, respectively. No difference was found between the ethnic groups. The frequencies of the SLC26A4 genes were 3.23%, 8.33%, 0%, and 9.81%, respectively. The mutation frequency of mtDNA1555A>G was 0%, 0%, 0%, and 6.03%, respectively. No difference was found between the ethnic groups, except for the Dongxiang and ethnic Han groups, both in SLC26A4 gene and mtDNA1555A>G.

  11. Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

    PubMed Central

    Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

    2007-01-01

    Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730

  12. The future is now: single-cell genomics of bacteria and archaea

    PubMed Central

    Blainey, Paul C.

    2013-01-01

    Interest in the expanding catalog of uncultivated microorganisms, increasing recognition of heterogeneity among seemingly similar cells, and technological advances in whole-genome amplification and single-cell manipulation are driving considerable progress in single-cell genomics. Here, the spectrum of applications for single-cell genomics, key advances in the development of the field, and emerging methodology for single-cell genome sequencing are reviewed by example with attention to the diversity of approaches and their unique characteristics. Experimental strategies transcending specific methodologies are identified and organized as a road map for future studies in single-cell genomics of environmental microorganisms. Over the next decade, increasingly powerful tools for single-cell genome sequencing and analysis will play key roles in accessing the genomes of uncultivated organisms, determining the basis of microbial community functions, and fundamental aspects of microbial population biology. PMID:23298390

  13. Genome Sequence of a Canadian Vibrio parahaemolyticus Isolate with Unique Mobilizing Capacity.

    PubMed

    Bioteau, Audrey; Huguet, Kévin; Burrus, Vincent; Banerjee, Swapan

    2018-06-14

    Vibrio parahaemolyticus is a clinically significant marine bacterium implicated in gastroenteritis among consumers of raw or undercooked seafood. This report presents the whole-genome sequence of a unique strain of V. parahaemolyticus isolated from oysters harvested in Canada. © Crown copyright 2018.

  14. Characteristics of 263K Scrapie Agent in Multiple Hamster Species

    PubMed Central

    Barbian, Kent D.; Race, Brent; Favara, Cynthia; Gardner, Don; Taubner, Lara; Porcella, Stephen; Race, Richard

    2009-01-01

    Transmissible spongiform encephalopathy (TSE) diseases are known to cross species barriers, but the pathologic and biochemical changes that occur during transmission are not well understood. To better understand these changes, we infected 6 hamster species with 263K hamster scrapie strain and, after each of 3 successive passages in the new species, analyzed abnormal proteinase K (PK)–resistant prion protein (PrPres) glycoform ratios, PrPres PK sensitivity, incubation periods, and lesion profiles. Unique 263K molecular and biochemical profiles evolved in each of the infected hamster species. Characteristics of 263K in the new hamster species seemed to correlate best with host factors rather than agent strain. Furthermore, 2 polymorphic regions of the prion protein amino acid sequence correlated with profile differences in these TSE-infected hamster species. PMID:19193264

  15. [Molecular epidemiological study on HIV/AIDS under the follow-up program in Zhejiang province in 2009].

    PubMed

    Zhang, Jia-feng; Pan, Xiao-hong; Ding, Xiao-bei; Chen, Lin; Guo, Zhi-hong; Xu, Yun; Huang, Jing-jing

    2013-01-01

    To analyze the molecular epidemiological characteristics on HIV infectors/AIDS patients (HIV/AIDS) under a follow-up program in Zhejiang province in 2009. 303 cases were randomly sampled. Information on the cases was collected and followed by genomic DNA extraction. Gag gene fragments were amplified by nested PCR, followed by sequencing and bio-informatic analysis. The rate of success for sequence acquisition was 74.3% (225/303). Distributions of HIV subtypes were as follows: CRF01_AE (58.7%), CRF07_BC (13.8%), CRF08_BC (9.8%), B' (15.1%), C (1.8%), G (0.4%) and unassigned BC (unique recombinant form 0.4%). from the HIV BLAST analysis showed that the sources of strains with the highest homology involved in 10 provinces/municipalities (Liaoning, Guangxi, Yunnan, Henan, etc.) and five other countries (Thailand, Vietnam, India, South Africa and Libya). The CRF01_AE phylogenetic tree was divided into four clusters. The sequences of HIV/AIDS with homosexual transmission showed a gather in cluster 1, and mix with those infected through heterosexual contact. Circulating recombinant forms of HIV seemed to play a dominant role in Zhejiang province. Unique recombinant form and new subtype of HIV were found. People living with HIV under homosexual transmission and heterosexual transmission had a trend of interwoven with each other. Increase of both the diversity and complexity of HIV strains were also noticed in Zhejiang province.

  16. Whole-genome sequence, SNP chips and pedigree structure: building demographic profiles in domestic dog breeds to optimize genetic-trait mapping.

    PubMed

    Dreger, Dayna L; Rimbault, Maud; Davis, Brian W; Bhatnagar, Adrienne; Parker, Heidi G; Ostrander, Elaine A

    2016-12-01

    In the decade following publication of the draft genome sequence of the domestic dog, extraordinary advances with application to several fields have been credited to the canine genetic system. Taking advantage of closed breeding populations and the subsequent selection for aesthetic and behavioral characteristics, researchers have leveraged the dog as an effective natural model for the study of complex traits, such as disease susceptibility, behavior and morphology, generating unique contributions to human health and biology. When designing genetic studies using purebred dogs, it is essential to consider the unique demography of each population, including estimation of effective population size and timing of population bottlenecks. The analytical design approach for genome-wide association studies (GWAS) and analysis of whole-genome sequence (WGS) experiments are inextricable from demographic data. We have performed a comprehensive study of genomic homozygosity, using high-depth WGS data for 90 individuals, and Illumina HD SNP data from 800 individuals representing 80 breeds. These data were coupled with extensive pedigree data analyses for 11 breeds that, together, allowed us to compute breed structure, demography, and molecular measures of genome diversity. Our comparative analyses characterize the extent, formation and implication of breed-specific diversity as it relates to population structure. These data demonstrate the relationship between breed-specific genome dynamics and population architecture, and provide important considerations influencing the technological and cohort design of association and other genomic studies. © 2016. Published by The Company of Biologists Ltd.

  17. Whole-genome sequence, SNP chips and pedigree structure: building demographic profiles in domestic dog breeds to optimize genetic-trait mapping

    PubMed Central

    Dreger, Dayna L.; Rimbault, Maud; Davis, Brian W.; Bhatnagar, Adrienne; Parker, Heidi G.

    2016-01-01

    ABSTRACT In the decade following publication of the draft genome sequence of the domestic dog, extraordinary advances with application to several fields have been credited to the canine genetic system. Taking advantage of closed breeding populations and the subsequent selection for aesthetic and behavioral characteristics, researchers have leveraged the dog as an effective natural model for the study of complex traits, such as disease susceptibility, behavior and morphology, generating unique contributions to human health and biology. When designing genetic studies using purebred dogs, it is essential to consider the unique demography of each population, including estimation of effective population size and timing of population bottlenecks. The analytical design approach for genome-wide association studies (GWAS) and analysis of whole-genome sequence (WGS) experiments are inextricable from demographic data. We have performed a comprehensive study of genomic homozygosity, using high-depth WGS data for 90 individuals, and Illumina HD SNP data from 800 individuals representing 80 breeds. These data were coupled with extensive pedigree data analyses for 11 breeds that, together, allowed us to compute breed structure, demography, and molecular measures of genome diversity. Our comparative analyses characterize the extent, formation and implication of breed-specific diversity as it relates to population structure. These data demonstrate the relationship between breed-specific genome dynamics and population architecture, and provide important considerations influencing the technological and cohort design of association and other genomic studies. PMID:27874836

  18. [Analysis of Conformational Features of Watson-Crick Duplex Fragments by Molecular Mechanics and Quantum Mechanics Methods].

    PubMed

    Poltev, V I; Anisimov, V M; Sanchez, C; Deriabina, A; Gonzalez, E; Garcia, D; Rivas, F; Polteva, N A

    2016-01-01

    It is generally accepted that the important characteristic features of the Watson-Crick duplex originate from the molecular structure of its subunits. However, it still remains to elucidate what properties of each subunit are responsible for the significant characteristic features of the DNA structure. The computations of desoxydinucleoside monophosphates complexes with Na-ions using density functional theory revealed a pivotal role of DNA conformational properties of single-chain minimal fragments in the development of unique features of the Watson-Crick duplex. We found that directionality of the sugar-phosphate backbone and the preferable ranges of its torsion angles, combined with the difference between purines and pyrimidines. in ring bases, define the dependence of three-dimensional structure of the Watson-Crick duplex on nucleotide base sequence. In this work, we extended these density functional theory computations to the minimal' fragments of DNA duplex, complementary desoxydinucleoside monophosphates complexes with Na-ions. Using several computational methods and various functionals, we performed a search for energy minima of BI-conformation for complementary desoxydinucleoside monophosphates complexes with different nucleoside sequences. Two sequences are optimized using ab initio method at the MP2/6-31++G** level of theory. The analysis of torsion angles, sugar ring puckering and mutual base positions of optimized structures demonstrates that the conformational characteristic features of complementary desoxydinucleoside monophosphates complexes with Na-ions remain within BI ranges and become closer to the corresponding characteristic features of the Watson-Crick duplex crystals. Qualitatively, the main characteristic features of each studied complementary desoxydinucleoside monophosphates complex remain invariant when different computational methods are used, although the quantitative values of some conformational parameters could vary lying within the limits typical for the corresponding family. We observe that popular functionals in density functional theory calculations lead to the overestimated distances between base pairs, while MP2 computations and the newer complex functionals produce the structures that have too close atom-atom contacts. A detailed study of some complementary desoxydinucleoside monophosphate complexes with Na-ions highlights the existence of several energy minima corresponding to BI-conformations, in other words, the complexity of the relief pattern of the potential energy surface of complementary desoxydinucleoside monophosphate complexes. This accounts for variability of conformational parameters of duplex fragments with the same base sequence. Popular molecular mechanics force fields AMBER and CHARMM reproduce most of the conformational characteristics of desoxydinucleoside monophosphates and their complementary complexes with Na-ions but fail to reproduce some details of the dependence of the Watson-Crick duplex conformation on the nucleotide sequence.

  19. Transcriptomic sequencing reveals a set of unique genes activated by butyrate-induced histone modification

    USDA-ARS?s Scientific Manuscript database

    Butyrate is a nutritional element with strong epigenetic regulatory activity as an inhibitor of histone deacetylases (HDACs). Based on the analysis of differentially expressed genes induced by butyrate in the bovine epithelial cell using deep RNA-sequencing technology (RNA-seq), a set of unique gen...

  20. Full genome sequences and molecular characterization of tick-borne encephalitis virus strains isolated from human patients.

    PubMed

    Formanová, Petra; Černý, Jiří; Bolfíková, Barbora Černá; Valdés, James J; Kozlova, Irina; Dzhioev, Yuri; Růžek, Daniel

    2015-02-01

    Tick-borne encephalitis virus (TBEV) causes tick-borne encephalitis (TBE), one of the most important human neuroinfections across Eurasia. Up to date, only three full genome sequences of human European TBEV isolates are available, mostly due to difficulties with isolation of the virus from human patients. Here we present full genome characterization of an additional five low-passage TBEV strains isolated from human patients with severe forms of TBE. These strains were isolated in 1953 within Central Bohemia in the former Czechoslovakia, and belong to the historically oldest human TBEV isolates in Europe. We demonstrate here that all analyzed isolates are distantly phylogenetically related, indicating that the emergence of TBE in Central Europe was not caused by one predominant strain, but rather a pool of distantly related TBEV strains. Nucleotide identity between individual sequenced TBEV strains ranged from 97.5% to 99.6% and all strains shared large deletions in the 3' non-coding region, which has been recently suggested to be an important determinant of virulence. The number of unique amino acid substitutions varied from 3 to 9 in individual isolates, but no characteristic amino acid substitution typical exclusively for all human TBEV isolates was identified when compared to the isolates from ticks. We did, however, correlate that the exploration of the TBEV envelope glycoprotein by specific antibodies were in close proximity to these unique amino acid substitutions. Taken together, we report here the largest number of patient-derived European TBEV full genome sequences to date and provide a platform for further studies on evolution of TBEV since the first emergence of human TBE in Europe. Copyright © 2014 Elsevier GmbH. All rights reserved.

  1. The Genome of the Netherlands: design, and project goals

    PubMed Central

    Boomsma, Dorret I; Wijmenga, Cisca; Slagboom, Eline P; Swertz, Morris A; Karssen, Lennart C; Abdellaoui, Abdel; Ye, Kai; Guryev, Victor; Vermaat, Martijn; van Dijk, Freerk; Francioli, Laurent C; Hottenga, Jouke Jan; Laros, Jeroen F J; Li, Qibin; Li, Yingrui; Cao, Hongzhi; Chen, Ruoyan; Du, Yuanping; Li, Ning; Cao, Sujie; van Setten, Jessica; Menelaou, Androniki; Pulit, Sara L; Hehir-Kwa, Jayne Y; Beekman, Marian; Elbers, Clara C; Byelas, Heorhiy; de Craen, Anton J M; Deelen, Patrick; Dijkstra, Martijn; den Dunnen, Johan T; de Knijff, Peter; Houwing-Duistermaat, Jeanine; Koval, Vyacheslav; Estrada, Karol; Hofman, Albert; Kanterakis, Alexandros; Enckevort, David van; Mai, Hailiang; Kattenberg, Mathijs; van Leeuwen, Elisabeth M; Neerincx, Pieter B T; Oostra, Ben; Rivadeneira, Fernanodo; Suchiman, Eka H D; Uitterlinden, Andre G; Willemsen, Gonneke; Wolffenbuttel, Bruce H; Wang, Jun; de Bakker, Paul I W; van Ommen, Gert-Jan; van Duijn, Cornelia M

    2014-01-01

    Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent–offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910–1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14–15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project. PMID:23714750

  2. Phylogenomic analysis of proteins that are distinctive of Archaea and its main subgroups and the origin of methanogenesis

    PubMed Central

    Gao, Beile; Gupta, Radhey S

    2007-01-01

    Background The Archaea are highly diverse in terms of their physiology, metabolism and ecology. Presently, very few molecular characteristics are known that are uniquely shared by either all archaea or the different main groups within archaea. The evolutionary relationships among different groups within the Euryarchaeota branch are also not clearly understood. Results We have carried out comprehensive analyses on each open reading frame (ORFs) in the genomes of 11 archaea (3 Crenarchaeota – Aeropyrum pernix, Pyrobaculum aerophilum and Sulfolobus acidocaldarius; 8 Euryarchaeota – Pyrococcus abyssi, Methanococcus maripaludis, Methanopyrus kandleri, Methanococcoides burtonii, Halobacterium sp. NCR-1, Haloquadratum walsbyi, Thermoplasma acidophilum and Picrophilus torridus) to search for proteins that are unique to either all Archaea or for its main subgroups. These studies have identified 1448 proteins or ORFs that are distinctive characteristics of Archaea and its various subgroups and whose homologues are not found in other organisms. Six of these proteins are unique to all Archaea, 10 others are only missing in Nanoarchaeum equitans and a large number of other proteins are specific for various main groups within the Archaea (e.g. Crenarchaeota, Euryarchaeota, Sulfolobales and Desulfurococcales, Halobacteriales, Thermococci, Thermoplasmata, all methanogenic archaea or particular groups of methanogens). Of particular importance is the observation that 31 proteins are uniquely present in virtually all methanogens (including M. kandleri) and 10 additional proteins are only found in different methanogens as well as A. fulgidus. In contrast, no protein was exclusively shared by various methanogen and any of the Halobacteriales or Thermoplasmatales. These results strongly indicate that all methanogenic archaea form a monophyletic group exclusive of other archaea and that this lineage likely evolved from Archaeoglobus. In addition, 15 proteins that are uniquely shared by M. kandleri and Methanobacteriales suggest a close evolutionary relationship between them. In contrast to the phylogenomics studies, a monophyletic grouping of archaea is not supported by phylogenetic analyses based on protein sequences. Conclusion The identified archaea-specific proteins provide novel molecular markers or signature proteins that are distinctive characteristics of Archaea and all of its major subgroups. The species distributions of these proteins provide novel insights into the evolutionary relationships among different groups within Archaea, particularly regarding the origin of methanogenesis. Most of these proteins are of unknown function and further studies should lead to discovery of novel biochemical and physiological characteristics that are unique to either all archaea or its different subgroups. PMID:17394648

  3. Defining the healthy "core microbiome" of oral microbial communities

    PubMed Central

    2009-01-01

    Background Most studies examining the commensal human oral microbiome are focused on disease or are limited in methodology. In order to diagnose and treat diseases at an early and reversible stage an in-depth definition of health is indispensible. The aim of this study therefore was to define the healthy oral microbiome using recent advances in sequencing technology (454 pyrosequencing). Results We sampled and sequenced microbiomes from several intraoral niches (dental surfaces, cheek, hard palate, tongue and saliva) in three healthy individuals. Within an individual oral cavity, we found over 3600 unique sequences, over 500 different OTUs or "species-level" phylotypes (sequences that clustered at 3% genetic difference) and 88 - 104 higher taxa (genus or more inclusive taxon). The predominant taxa belonged to Firmicutes (genus Streptococcus, family Veillonellaceae, genus Granulicatella), Proteobacteria (genus Neisseria, Haemophilus), Actinobacteria (genus Corynebacterium, Rothia, Actinomyces), Bacteroidetes (genus Prevotella, Capnocytophaga, Porphyromonas) and Fusobacteria (genus Fusobacterium). Each individual sample harboured on average 266 "species-level" phylotypes (SD 67; range 123 - 326) with cheek samples being the least diverse and the dental samples from approximal surfaces showing the highest diversity. Principal component analysis discriminated the profiles of the samples originating from shedding surfaces (mucosa of tongue, cheek and palate) from the samples that were obtained from solid surfaces (teeth). There was a large overlap in the higher taxa, "species-level" phylotypes and unique sequences among the three microbiomes: 84% of the higher taxa, 75% of the OTUs and 65% of the unique sequences were present in at least two of the three microbiomes. The three individuals shared 1660 of 6315 unique sequences. These 1660 sequences (the "core microbiome") contributed 66% of the reads. The overlapping OTUs contributed to 94% of the reads, while nearly all reads (99.8%) belonged to the shared higher taxa. Conclusions We obtained the first insight into the diversity and uniqueness of individual oral microbiomes at a resolution of next-generation sequencing. We showed that a major proportion of bacterial sequences of unrelated healthy individuals is identical, supporting the concept of a core microbiome at health. PMID:20003481

  4. Quantum-Sequencing: Fast electronic single DNA molecule sequencing

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.

  5. Genome sequence analyses of two isolates from the recent Escherichia coli outbreak in Germany reveal the emergence of a new pathotype: Entero-Aggregative-Haemorrhagic Escherichia coli (EAHEC).

    PubMed

    Brzuszkiewicz, Elzbieta; Thürmer, Andrea; Schuldes, Jörg; Leimbach, Andreas; Liesegang, Heiko; Meyer, Frauke-Dorothee; Boelter, Jürgen; Petersen, Heiko; Gottschalk, Gerhard; Daniel, Rolf

    2011-12-01

    The genome sequences of two Escherichia coli O104:H4 strains derived from two different patients of the 2011 German E. coli outbreak were determined. The two analyzed strains were designated E. coli GOS1 and GOS2 (German outbreak strain). Both isolates comprise one chromosome of approximately 5.31 Mbp and two putative plasmids. Comparisons of the 5,217 (GOS1) and 5,224 (GOS2) predicted protein-encoding genes with various E. coli strains, and a multilocus sequence typing analysis revealed that the isolates were most similar to the entero-aggregative E. coli (EAEC) strain 55989. In addition, one of the putative plasmids of the outbreak strain is similar to pAA-type plasmids of EAEC strains, which contain aggregative adhesion fimbrial operons. The second putative plasmid harbors genes for extended-spectrum β-lactamases. This type of plasmid is widely distributed in pathogenic E. coli strains. A significant difference of the E. coli GOS1 and GOS2 genomes to those of EAEC strains is the presence of a prophage encoding the Shiga toxin, which is characteristic for enterohemorrhagic E. coli (EHEC) strains. The unique combination of genomic features of the German outbreak strain, containing characteristics from pathotypes EAEC and EHEC, suggested that it represents a new pathotype Entero-Aggregative-Haemorrhagic E scherichia c oli (EAHEC).

  6. Mycobacterium pseudoshottsii sp. nov., a slowly growing chromogenic species isolated from Chesapeake Bay striped bass (Morone saxatilis)

    USGS Publications Warehouse

    Rhodes, M.W.; Kator, H.; McNabb, A.; Deshayes, C.; Reyrat, J.-M.; Brown-Elliott, B. A.; Wallace, R.; Trott, K.A.; Parker, J.M.; Lifland, B.; Osterhout, G.; Kaattari, I.; Reece, K.; Vogelbein, W.; Ottinger, C.A.

    2005-01-01

    A group of slowly growing photochromogenic mycobacteria was isolated from Chesapeake Bay striped bass (Morone saxatilis) during an epizootic of mycobacteriosis. Growth characteristics, acid-fastness and 16S rRNA gene sequencing results were consistent with those of the genus Mycobacterium. Biochemical reactions, growth characteristics and mycolic acid profiles (HPLC) resembled those of Mycobacterium shottsii, a non-pigmented mycobacterium also isolated during the same epizootic. Sequencing of the 16S rRNA genes, the gene encoding the exported repeated protein (erp) and the gene encoding the 65 kDa heat-shock protein (hsp65) and restriction enzyme analysis of the hsp65 gene demonstrated that this group of isolates is unique. Insertion sequences associated with Mycobacterium ulcerans, IS2404 and IS2606, were detected by PCR. These isolates could be differentiated from other slowly growing pigmented mycobacteria by their inability to grow at 37 ??C, production of niacin and urease, absence of nitrate reductase, negative Tween 80 hydrolysis and resistance to isoniazid (1 ??g ml-1), p-nitrobenzoic acid, thiacetazone and thiophene-2-carboxylic hydrazide. On the basis of this polyphasic study, it is proposed that these isolates represent a novel species, Mycobacterium pseudoshottsii sp. nov. The type strain, L15T, has been deposited in the American Type Culture Collection as ATCC BAA-883T and the National Collection of Type Cultures (UK) as NCTC 13318T. ?? 2005 IUMS.

  7. A nuclear ribosomal DNA pseudogene in triatomines opens a new research field of fundamental and applied implications in Chagas disease.

    PubMed

    Zuriaga, María Angeles; Mas-Coma, Santiago; Bargues, María Dolores

    2015-05-01

    A pseudogene, designated as "ps(5.8S+ITS-2)", paralogous to the 5.8S gene and internal transcribed spacer (ITS)-2 of the nuclear ribosomal DNA (rDNA), has been recently found in many triatomine species distributed throughout North America, Central America and northern South America. Among characteristics used as criteria for pseudogene verification, secondary structures and free energy are highlighted, showing a lower fit between minimum free energy, partition function and centroid structures, although in given cases the fit only appeared to be slightly lower. The unique characteristics of "ps(5.8S+ITS-2)" as a processed or retrotransposed pseudogenic unit of the ghost type are reviewed, with emphasis on its potential functionality compared to the functionality of genes and spacers of the normal rDNA operon. Besides the technical problem of the risk for erroneous sequence results, the usefulness of "ps(5.8S+ITS-2)" for specimen classification, phylogenetic analyses and systematic/taxonomic studies should be highlighted, based on consistence and retention index values, which in pseudogenic sequence trees were higher than in functional sequence trees. Additionally, intraindividual, interpopulational and interspecific differences in pseudogene amount and the fact that it is a pseudogene in the nuclear rDNA suggests a potential relationships with fitness, behaviour and adaptability of triatomine vectors and consequently its potential utility in Chagas disease epidemiology and control.

  8. Sequence stratigraphy of the Kingak Shale (Jurassic-Lower Cretaceous), National Petroleum Reserve in Alaska

    USGS Publications Warehouse

    Houseknecht, D.W.; Bird, K.J.

    2004-01-01

    Beaufortian strata (Jurassic-Lower Cretaceous) in the National Petroleum Reserve in Alaska (NPRA) are a focus of exploration since the 1994 discovery of the nearby Alpine oil field (>400 MMBO). These strata include the Kingak Shale, a succession of depositional sequences influenced by rift opening of the Arctic Ocean Basin. Interpretation of sequence stratigraphy and depositional facies from a regional two-dimensional seismic grid and well data allows the definition of four sequence sets that each displays unique stratal geometries and thickness trends across NPRA. A Lower to Middle Jurassic sequence set includes numerous transgressive-regressive sequences that collectively built a clastic shelf in north-central NPRA. Along the south-facing, lobate shelf margin, condensed shales in transgressive systems tracts downlap and coalesce into a basinal condensed section that is likely an important hydrocarbon source rock. An Oxfordian-Kimmeridgian sequence set, deposited during pulses of uplift on the Barrow arch, includes multiple transgressive-regressive sequences that locally contain well-winnowed, shoreface sandstones at the base of transgressive systems tracts. These shoreface sandstones and overlying shales, deposited during maximum flooding, form stratigraphic traps that are the main objective of exploration in the Alpine play in NPRA. A Valanginian sequence set includes at least two transgressive-regressive sequences that display relatively distal characteristics, suggesting high relative sea level. An important exception is the presence of a basal transgressive systems tract that locally contains shoreface sandstones of reservoir quality. A Hauterivian sequence set includes two transgressive-regressive sequences that constitute a shelf-margin wedge developed as the result of tectonic uplift along the Barrow arch during rift opening of the Arctic Ocean Basin. This sequence set displays stratal geometries suggesting incision and synsedimentary collapse of the shelf margin. ?? 2004. The American Association of Petroleum Geologists. All rights reserved.

  9. Micro- and nanofluidic systems in devices for biological, medical and environmental research

    NASA Astrophysics Data System (ADS)

    Evstrapov, A. A.

    2017-11-01

    The use of micro- and nanofluidic systems in modern analytical instruments allow you to implement a number of unique opportunities and achieve ultra-high measurement sensitivity. The possibility of manipulation of the individual biological objects (cells, bacteria, viruses, proteins, nucleic acids) in a liquid medium caused the development of devices on microchip platform for methods: chromatographic and electrophoretic analyzes; polymerase chain reaction; sequencing of nucleic acids; immunoassay; cytometric studies. Development of micro and nano fabrication technologies, materials science, surface chemistry, analytical chemistry, cell engineering have led to the creation of a unique systems such as “lab-on-a-chip”, “human-on-a-chip” and other. This article discusses common in microfluidics materials and methods of making functional structures. Examples of integration of nanoscale structures in microfluidic devices for the implementation of new features and improve the technical characteristics of devices and systems are shown.

  10. A Review of Living Collections with Special Emphasis on Sustainability and Its Impact on Research Across Multiple Disciplines

    PubMed Central

    2017-01-01

    Formal living collections have unique characteristics that distinguish them from other types of biorepositories. Comprising diverse resources, microbe culture collections, crop and biodiversity plant germplasm collections, and animal germplasm repositories are commonly allied with specific research communities or stakeholder groups. Among living collections, microbial culture collections have very long and unique life histories, with some being older than 100 years. Regulatory, financial, and technical developments have impacted living collections in many ways. International treaty obligations and restrictions on release of genetically modified organisms complicate the activities of living collections. Funding for living collections is a continuing challenge and threatens to create a two-tier system where medically relevant collections are well funded and all other collections are underfunded and hence understaffed. Molecular, genetic, and whole genome sequence analysis of contents of microbes and other living resource collections bring additional value to living collections. PMID:27869477

  11. A Review of Living Collections with Special Emphasis on Sustainability and Its Impact on Research Across Multiple Disciplines.

    PubMed

    McCluskey, Kevin

    2017-02-01

    Formal living collections have unique characteristics that distinguish them from other types of biorepositories. Comprising diverse resources, microbe culture collections, crop and biodiversity plant germplasm collections, and animal germplasm repositories are commonly allied with specific research communities or stakeholder groups. Among living collections, microbial culture collections have very long and unique life histories, with some being older than 100 years. Regulatory, financial, and technical developments have impacted living collections in many ways. International treaty obligations and restrictions on release of genetically modified organisms complicate the activities of living collections. Funding for living collections is a continuing challenge and threatens to create a two-tier system where medically relevant collections are well funded and all other collections are underfunded and hence understaffed. Molecular, genetic, and whole genome sequence analysis of contents of microbes and other living resource collections bring additional value to living collections.

  12. Partial bisulfite conversion for unique template sequencing.

    PubMed

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

    2018-01-25

    We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Structural characteristics of alkaline phosphatase from the moderately halophilic bacterium Halomonas sp. 593

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Arai, Shigeki; Yonezawa, Yasushi; Ishibashi, Matsujiro

    2014-03-01

    In order to clarify the structural basis of the halophilic characteristics of an alkaline phosphatase derived from the moderate halophile Halomonas sp. 593 (HaAP), the tertiary structure of HaAP was determined to 2.1 Å resolution by X-ray crystallography. The structural properties of surface negative charge and core hydrophobicity were shown to be intermediate between those characteristic of halophiles and non-halophiles, and may explain the unique functional adaptation to a wide range of salt concentrations. Alkaline phosphatase (AP) from the moderate halophilic bacterium Halomonas sp. 593 (HaAP) catalyzes the hydrolysis of phosphomonoesters over a wide salt-concentration range (1–4 M NaCl). Inmore » order to clarify the structural basis of its halophilic characteristics and its wide-range adaptation to salt concentration, the tertiary structure of HaAP was determined by X-ray crystallography to 2.1 Å resolution. The unit cell of HaAP contained one dimer unit corresponding to the biological unit. The monomer structure of HaAP contains a domain comprised of an 11-stranded β-sheet core with 19 surrounding α-helices similar to those of APs from other species, and a unique ‘crown’ domain containing an extended ‘arm’ structure that participates in formation of a hydrophobic cluster at the entrance to the substrate-binding site. The HaAP structure also displays a unique distribution of negatively charged residues and hydrophobic residues in comparison to other known AP structures. AP from Vibrio sp. G15-21 (VAP; a slight halophile) has the highest similarity in sequence (70.0% identity) and structure (C{sup α} r.m.s.d. of 0.82 Å for the monomer) to HaAP. The surface of the HaAP dimer is substantially more acidic than that of the VAP dimer (144 exposed Asp/Glu residues versus 114, respectively), and thus may enable the solubility of HaAP under high-salt conditions. Conversely, the monomer unit of HaAP formed a substantially larger hydrophobic interior comprising 329 C atoms from completely buried residues, whereas that of VAP comprised 264 C atoms, which may maintain the stability of HaAP under low-salt conditions. These characteristics of HaAP may be responsible for its unique functional adaptation permitting activity over a wide range of salt concentrations.« less

  14. Neisseria arctica sp. nov. isolated from nonviable eggs of greater white-fronted geese (Anser albifrons) in Arctic Alaska

    USGS Publications Warehouse

    Hansen, Cristina M.; Himschoot, Elizabeth; Hare, Rebekah F.; Meixell, Brandt W.; Van Hemert, Caroline R.; Hueffer, Karsten

    2017-01-01

    During the summers of 2013 and 2014, isolates of a novel Gram-negative coccus in the Neisseria genus were obtained from the contents of nonviable greater white-fronted goose (Anser albifrons) eggs on the Arctic Coastal Plain of Alaska. We used a polyphasic approach to determine whether these isolates represent a novel species. 16S rRNA gene sequences, 23S rRNA gene sequences, and chaperonin 60 gene sequences suggested that these Alaskan isolates are members of a distinct species that is most closely related to Neisseria canis, N. animaloris, and N. shayeganii. Analysis of the rplF gene additionally showed that our isolates are unique and most closely related to N. weaveri. Average nucleotide identity of the whole genome sequence of our type strain was between 71.5% and 74.6% compared to close relatives, further supporting designation as a novel species. Fatty acid methyl ester analysis showed a predominance of C14:0, C16:0, and C16:1ω7c fatty acids. Finally, biochemical characteristics distinguished our isolates from other Neisseria species. The name Neisseria arctica (type strain KH1503T = ATCC TSD-57T = DSM 103136T) is proposed.

  15. Collaborative Filtering Recommendation on Users' Interest Sequences.

    PubMed

    Cheng, Weijie; Yin, Guisheng; Dong, Yuxin; Dong, Hongbin; Zhang, Wansong

    2016-01-01

    As an important factor for improving recommendations, time information has been introduced to model users' dynamic preferences in many papers. However, the sequence of users' behaviour is rarely studied in recommender systems. Due to the users' unique behavior evolution patterns and personalized interest transitions among items, users' similarity in sequential dimension should be introduced to further distinguish users' preferences and interests. In this paper, we propose a new collaborative filtering recommendation method based on users' interest sequences (IS) that rank users' ratings or other online behaviors according to the timestamps when they occurred. This method extracts the semantics hidden in the interest sequences by the length of users' longest common sub-IS (LCSIS) and the count of users' total common sub-IS (ACSIS). Then, these semantics are utilized to obtain users' IS-based similarities and, further, to refine the similarities acquired from traditional collaborative filtering approaches. With these updated similarities, transition characteristics and dynamic evolution patterns of users' preferences are considered. Our new proposed method was compared with state-of-the-art time-aware collaborative filtering algorithms on datasets MovieLens, Flixster and Ciao. The experimental results validate that the proposed recommendation method is effective and outperforms several existing algorithms in the accuracy of rating prediction.

  16. Collaborative Filtering Recommendation on Users’ Interest Sequences

    PubMed Central

    Cheng, Weijie; Yin, Guisheng; Dong, Yuxin; Dong, Hongbin; Zhang, Wansong

    2016-01-01

    As an important factor for improving recommendations, time information has been introduced to model users’ dynamic preferences in many papers. However, the sequence of users’ behaviour is rarely studied in recommender systems. Due to the users’ unique behavior evolution patterns and personalized interest transitions among items, users’ similarity in sequential dimension should be introduced to further distinguish users’ preferences and interests. In this paper, we propose a new collaborative filtering recommendation method based on users’ interest sequences (IS) that rank users’ ratings or other online behaviors according to the timestamps when they occurred. This method extracts the semantics hidden in the interest sequences by the length of users’ longest common sub-IS (LCSIS) and the count of users’ total common sub-IS (ACSIS). Then, these semantics are utilized to obtain users’ IS-based similarities and, further, to refine the similarities acquired from traditional collaborative filtering approaches. With these updated similarities, transition characteristics and dynamic evolution patterns of users’ preferences are considered. Our new proposed method was compared with state-of-the-art time-aware collaborative filtering algorithms on datasets MovieLens, Flixster and Ciao. The experimental results validate that the proposed recommendation method is effective and outperforms several existing algorithms in the accuracy of rating prediction. PMID:27195787

  17. Genome Re-Sequencing of Semi-Wild Soybean Reveals a Complex Soja Population Structure and Deep Introgression

    PubMed Central

    Wu, Sanling; Wang, Ying-Ying; Ye, Chu-Yu; Bai, Xuefei; Li, Zefeng; Yan, Chenghai; Wang, Weidi; Wang, Ziqiang; Shu, Qingyao; Xie, Jiahua; Lee, Suk-Ha; Fan, Longjiang

    2014-01-01

    Semi-wild soybean is a unique type of soybean that retains both wild and domesticated characteristics, which provides an important intermediate type for understanding the evolution of the subgenus Soja population in the Glycine genus. In this study, a semi-wild soybean line (Maliaodou) and a wild line (Lanxi 1) collected from the lower Yangtze regions were deeply sequenced while nine other semi-wild lines were sequenced to a 3-fold genome coverage. Sequence analysis revealed that (1) no independent phylogenetic branch covering all 10 semi-wild lines was observed in the Soja phylogenetic tree; (2) besides two distinct subpopulations of wild and cultivated soybean in the Soja population structure, all semi-wild lines were mixed with some wild lines into a subpopulation rather than an independent one or an intermediate transition type of soybean domestication; (3) high heterozygous rates (0.19–0.49) were observed in several semi-wild lines; and (4) over 100 putative selective regions were identified by selective sweep analysis, including those related to the development of seed size. Our results suggested a hybridization origin for the semi-wild soybean, which makes a complex Soja population structure. PMID:25265539

  18. Evolutionary conservation of sequence and secondary structures inCRISPR repeats

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kunin, Victor; Sorek, Rotem; Hugenholtz, Philip

    Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in {approx}40% of bacterial and all archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CAS), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been proposed that the CRISPR/CAS system samples, maintains a record of, and inactivates invasive DNA that the cell has encountered, and therefore constitutes a prokaryotic analog of an immune system. Here we analyze CRISPR repeatsmore » identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. All individual repeats in any given cluster were inferred to form characteristic RNA secondary structure, ranging from non-existent to pronounced. Stable secondary structures included G:U base pairs and exhibited multiple compensatory base changes in the stem region, indicating evolutionary conservation and functional importance. We also show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification including specific relationships between CRISPR and CAS subtypes.« less

  19. Identification and verification of hybridoma-derived monoclonal antibody variable region sequences using recombinant DNA technology and mass spectrometry

    USDA-ARS?s Scientific Manuscript database

    Antibody engineering requires the identification of antigen binding domains or variable regions (VR) unique to each antibody. It is the VR that define the unique antigen binding properties and proper sequence identification is essential for functional evaluation and performance of recombinant antibo...

  20. Integration of promoters, inverted repeat sequences and proteomic data into a model for high silencing efficiency of coeliac disease related gliadins in bread wheat

    PubMed Central

    2013-01-01

    Background Wheat gluten has unique nutritional and technological characteristics, but is also a major trigger of allergies and intolerances. One of the most severe diseases caused by gluten is coeliac disease. The peptides produced in the digestive tract by the incomplete digestion of gluten proteins trigger the disease. The majority of the epitopes responsible reside in the gliadin fraction of gluten. The location of the multiple gliadin genes in blocks has to date complicated their elimination by classical breeding techniques or by the use of biotechnological tools. As an approach to silence multiple gliadin genes we have produced 38 transgenic lines of bread wheat containing combinations of two endosperm-specific promoters and three different inverted repeat sequences to silence three fractions of gliadins by RNA interference. Results The effects of the RNA interference constructs on the content of the gluten proteins, total protein and starch, thousand seed weights and SDSS quality tests of flour were analyzed in these transgenic lines in two consecutive years. The characteristics of the inverted repeat sequences were the main factor that determined the efficiency of silencing. The promoter used had less influence on silencing, although a synergy in silencing efficiency was observed when the two promoters were used simultaneously. Genotype and the environment also influenced silencing efficiency. Conclusions We conclude that to obtain wheat lines with an optimum reduction of toxic gluten epitopes one needs to take into account the factors of inverted repeat sequences design, promoter choice and also the wheat background used. PMID:24044767

  1. Silencing Effect of Hominoid Highly Conserved Noncoding Sequences on Embryonic Brain Development

    PubMed Central

    Mahmoudi Saber, Morteza

    2017-01-01

    Abstract Superfamily Hominoidea, which consists of Hominidae (humans and great apes) and Hylobatidae (gibbons), is well-known for sharing human-like characteristics, however, the genomic origins of these shared unique phenotypes have mainly remained elusive. To decipher the underlying genomic basis of Hominoidea-restricted phenotypes, we identified and characterized Hominoidea-restricted highly conserved noncoding sequences (HCNSs) that are a class of potential regulatory elements which may be involved in evolution of lineage-specific phenotypes. We discovered 679 such HCNSs from human, chimpanzee, gorilla, orangutan and gibbon genomes. These HCNSs were demonstrated to be under purifying selection but with lineage-restricted characteristics different from old CNSs. A significant proportion of their ancestral sequences had accelerated rates of nucleotide substitutions, insertions and deletions during the evolution of common ancestor of Hominoidea, suggesting the intervention of positive Darwinian selection for creating those HCNSs. In contrary to enhancer elements and similar to silencer sequences, these Hominoidea-restricted HCNSs are located in close proximity of transcription start sites. Their target genes are enriched in the nervous system, development and transcription, and they tend to be remotely located from the nearest coding gene. Chip-seq signals and gene expression patterns suggest that Hominoidea-restricted HCNSs are likely to be functional regulatory elements by imposing silencing effects on their target genes in a tissue-restricted manner during fetal brain development. These HCNSs, emerged through adaptive evolution and conserved through purifying selection, represent a set of promising targets for future functional studies of the evolution of Hominoidea-restricted phenotypes. PMID:28633494

  2. Transcriptome analysis of genes related to resistance against powdery mildew in wheat-Thinopyrum alien addition disomic line germplasm SN6306.

    PubMed

    Li, Quanquan; Niu, Zubiao; Bao, Yinguang; Tian, Qiuju; Wang, Honggang; Kong, Lingrang; Feng, Deshun

    2016-09-15

    Wheat powdery mildew, which is mainly caused by Blumeria graminis f. sp. tritici (Bgt), seriously damages wheat production. The wheat-Thinopyrum intermedium alien addition disomic line germplasm SN6306, being one of the important sources of genes for wheat resistance, is highly resistant to Bgt E09 and to many other powdery mildew physiological races. However, knowledge on the resistance mechanism of SN6306 remains limited. Our study employed high-throughput RNA sequencing based on next-generation sequencing technology (Illumina) to obtain an overview of the transcriptome characteristics of SN6306 and its parent wheat Yannong 15 (YN15) during Bgt infection. The sequencing generated 104,773 unigenes, 9909 of which showed varied expression levels. Among the 9909 unigenes, 1678 unigenes showed 0 reads in YN15. The expression levels in Bgt-inoculated SN6306 and YN15 of exactly 39 unigenes that showed 0 or considerably low reads in YN15 were validated to identify the genes involved in Bgt resistance. Among the 39 unigenes, 12 unigenes were upregulated in SN6306 by 3-45 times. These unigenes mainly encoded kinase, synthase, proteases, and signal transduction proteins, which may play an important role in the resistance against Bgt. To confirm whether the unigenes that showed 0 reads in YN15 are really unique to SN6306, 8 unigenes were cloned and sequenced. Results showed that the selected unigenes are more similar to SN6306 and Th. intermedium than to the wheat cultivar YN15. The sequencing results further confirmed that the unigenes showing 0 reads in YN15 are unique to SN6306 and are most likely derived from Th. intermedium (Host) Nevski. Thus, the genes from Th. intermedium most probably conferred the resistance of SN6306 to Bgt. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. ARResT/AssignSubsets: a novel application for robust subclassification of chronic lymphocytic leukemia based on B cell receptor IG stereotypy.

    PubMed

    Bystry, Vojtech; Agathangelidis, Andreas; Bikos, Vasilis; Sutton, Lesley Ann; Baliakas, Panagiotis; Hadzidimitriou, Anastasia; Stamatopoulos, Kostas; Darzentas, Nikos

    2015-12-01

    An ever-increasing body of evidence supports the importance of B cell receptor immunoglobulin (BcR IG) sequence restriction, alias stereotypy, in chronic lymphocytic leukemia (CLL). This phenomenon accounts for ∼30% of studied cases, one in eight of which belong to major subsets, and extends beyond restricted sequence patterns to shared biologic and clinical characteristics and, generally, outcome. Thus, the robust assignment of new cases to major CLL subsets is a critical, and yet unmet, requirement. We introduce a novel application, ARResT/AssignSubsets, which enables the robust assignment of BcR IG sequences from CLL patients to major stereotyped subsets. ARResT/AssignSubsets uniquely combines expert immunogenetic sequence annotation from IMGT/V-QUEST with curation to safeguard quality, statistical modeling of sequence features from more than 7500 CLL patients, and results from multiple perspectives to allow for both objective and subjective assessment. We validated our approach on the learning set, and evaluated its real-world applicability on a new representative dataset comprising 459 sequences from a single institution. ARResT/AssignSubsets is freely available on the web at http://bat.infspire.org/arrest/assignsubsets/ nikos.darzentas@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. PpRT1: the first complete gypsy-like retrotransposon isolated in Pinus pinaster.

    PubMed

    Rocheta, Margarida; Cordeiro, Jorge; Oliveira, M; Miguel, Célia

    2007-02-01

    We have isolated and characterized a complete retrotransposon sequence, named PpRT1, from the genome of Pinus pinaster. PpRT1 is 5,966 bp long and is closely related to IFG7 gypsy retrotransposon from Pinus radiata. The long terminal repeats (LTRs) have 333 bp each and show a 5.4% sequence divergence between them. In addition to the characteristic polypurine tract (PPT) and the primer binding site (PBS), PpRT1 carries internal regions with homology to retroviral genes gag and pol. The pol region contains sequence motifs related to the enzymes protease, reverse transcriptase, RNAseH and integrase in the same typical order known for Ty3/gypsy-like retrotransposons. PpRT1 was extended from an EST database sequence indicating that its transcription is occurring in pine tissues. Southern blot analyses indicate however, that PpRT1 is present in a unique or a low number of copies in the P. pinaster genome. The differences in nucleotide sequence found between PpRT1 and IFG7 may explain the strikingly different copy number in the two pine species genome. Based on the homologies observed when comparing LTR region among different gypsy elements we propose that the highly conserved LTR regions may be useful to amplify other retrotransposon sequences of the same or close retrotransposon family.

  5. The genome sequence of Leishmania (Leishmania) amazonensis: functional annotation and extended analysis of gene models.

    PubMed

    Real, Fernando; Vidal, Ramon Oliveira; Carazzolle, Marcelo Falsarella; Mondego, Jorge Maurício Costa; Costa, Gustavo Gilson Lacerda; Herai, Roberto Hirochi; Würtele, Martin; de Carvalho, Lucas Miguel; Carmona e Ferreira, Renata; Mortara, Renato Arruda; Barbiéri, Clara Lucia; Mieczkowski, Piotr; da Silveira, José Franco; Briones, Marcelo Ribeiro da Silva; Pereira, Gonçalo Amarante Guimarães; Bahia, Diana

    2013-12-01

    We present the sequencing and annotation of the Leishmania (Leishmania) amazonensis genome, an etiological agent of human cutaneous leishmaniasis in the Amazon region of Brazil. L. (L.) amazonensis shares features with Leishmania (L.) mexicana but also exhibits unique characteristics regarding geographical distribution and clinical manifestations of cutaneous lesions (e.g. borderline disseminated cutaneous leishmaniasis). Predicted genes were scored for orthologous gene families and conserved domains in comparison with other human pathogenic Leishmania spp. Carboxypeptidase, aminotransferase, and 3'-nucleotidase genes and ATPase, thioredoxin, and chaperone-related domains were represented more abundantly in L. (L.) amazonensis and L. (L.) mexicana species. Phylogenetic analysis revealed that these two species share groups of amastin surface proteins unique to the genus that could be related to specific features of disease outcomes and host cell interactions. Additionally, we describe a hypothetical hybrid interactome of potentially secreted L. (L.) amazonensis proteins and host proteins under the assumption that parasite factors mimic their mammalian counterparts. The model predicts an interaction between an L. (L.) amazonensis heat-shock protein and mammalian Toll-like receptor 9, which is implicated in important immune responses such as cytokine and nitric oxide production. The analysis presented here represents valuable information for future studies of leishmaniasis pathogenicity and treatment.

  6. The Genome Sequence of Leishmania (Leishmania) amazonensis: Functional Annotation and Extended Analysis of Gene Models

    PubMed Central

    Real, Fernando; Vidal, Ramon Oliveira; Carazzolle, Marcelo Falsarella; Mondego, Jorge Maurício Costa; Costa, Gustavo Gilson Lacerda; Herai, Roberto Hirochi; Würtele, Martin; de Carvalho, Lucas Miguel; e Ferreira, Renata Carmona; Mortara, Renato Arruda; Barbiéri, Clara Lucia; Mieczkowski, Piotr; da Silveira, José Franco; Briones, Marcelo Ribeiro da Silva; Pereira, Gonçalo Amarante Guimarães; Bahia, Diana

    2013-01-01

    We present the sequencing and annotation of the Leishmania (Leishmania) amazonensis genome, an etiological agent of human cutaneous leishmaniasis in the Amazon region of Brazil. L. (L.) amazonensis shares features with Leishmania (L.) mexicana but also exhibits unique characteristics regarding geographical distribution and clinical manifestations of cutaneous lesions (e.g. borderline disseminated cutaneous leishmaniasis). Predicted genes were scored for orthologous gene families and conserved domains in comparison with other human pathogenic Leishmania spp. Carboxypeptidase, aminotransferase, and 3′-nucleotidase genes and ATPase, thioredoxin, and chaperone-related domains were represented more abundantly in L. (L.) amazonensis and L. (L.) mexicana species. Phylogenetic analysis revealed that these two species share groups of amastin surface proteins unique to the genus that could be related to specific features of disease outcomes and host cell interactions. Additionally, we describe a hypothetical hybrid interactome of potentially secreted L. (L.) amazonensis proteins and host proteins under the assumption that parasite factors mimic their mammalian counterparts. The model predicts an interaction between an L. (L.) amazonensis heat-shock protein and mammalian Toll-like receptor 9, which is implicated in important immune responses such as cytokine and nitric oxide production. The analysis presented here represents valuable information for future studies of leishmaniasis pathogenicity and treatment. PMID:23857904

  7. A cool tool for hot and sour Archaea: proteomics of Sulfolobus solfataricus.

    PubMed

    Kort, Julia Christin; Esser, Dominik; Pham, Trong Khoa; Noirel, Josselin; Wright, Phillip C; Siebers, Bettina

    2013-10-01

    In recent years, much progress has been made in proteomic studies to unravel metabolic pathways and basic cellular processes. This is especially interesting for members of the Archaea, the third domain of life. Archaea exhibit extraordinary features and many of their cultivable representatives are adaptable to extreme environments. Archaea harbor many unique traits besides bacterial attributes, such as size, shape, and DNA structure and eukaryal characteristics like information processing. Sulfolobus solfataricus P2, a thermoacidophilic archaeal representative, is a well-established model organism adapted to low-pH environments (pH 2-3) and high temperatures (80°C). The genome has a size of 3 Mbp and its sequence has been deciphered. Approximately 3033 predicted open reading frames have been identified and the genome is characterized by a great number of diverse insertion sequence elements. In unraveling the organisms' metabolism and lifestyle, proteomic analyses have played a major role. Much effort has been directed at this organism and is reviewed here. With the help of proteomics, unique metabolic pathways were resolved in S. solfataricus, targets for regulatory protein phosphorylation identified, and cellular responses upon virus infection as well as oxidative stress analyzed. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Metagenomic insights into the rumen microbial fibrolytic enzymes in Indian crossbred cattle fed finger millet straw.

    PubMed

    Jose, V Lyju; Appoothy, Thulasi; More, Ravi P; Arun, A Sha

    2017-12-01

    The rumen is a unique natural habitat, exhibiting an unparalleled genetic resource of fibrolytic enzymes of microbial origin that degrade plant polysaccharides. The objectives of this study were to identify the principal plant cell wall-degrading enzymes and the taxonomic profile of rumen microbial communities that are associated with it. The cattle rumen microflora and the carbohydrate-active enzymes were functionally classified through a whole metagenomic sequencing approach. Analysis of the assembled sequences by the Carbohydrate-active enzyme analysis Toolkit identified the candidate genes encoding fibrolytic enzymes belonging to different classes of glycoside hydrolases(11,010 contigs), glycosyltransferases (6366 contigs), carbohydrate esterases (4945 contigs), carbohydrate-binding modules (1975 contigs), polysaccharide lyases (480 contigs), and auxiliary activities (115 contigs). Phylogenetic analysis of CAZyme encoding contigs revealed that a significant proportion of CAZymes were contributed by bacteria belonging to genera Prevotella, Bacteroides, Fibrobacter, Clostridium, and Ruminococcus. The results indicated that the cattle rumen microbiome and the CAZymes are highly complex, structurally similar but compositionally distinct from other ruminants. The unique characteristics of rumen microbiota and the enzymes produced by resident microbes provide opportunities to improve the feed conversion efficiency in ruminants and serve as a reservoir of industrially important enzymes for cellulosic biofuel production.

  9. Hepatitis A virus: host interactions, molecular epidemiology and evolution.

    PubMed

    Vaughan, Gilberto; Goncalves Rossi, Livia Maria; Forbi, Joseph C; de Paula, Vanessa S; Purdy, Michael A; Xia, Guoliang; Khudyakov, Yury E

    2014-01-01

    Infection with hepatitis A virus (HAV) is the commonest viral cause of liver disease and presents an important public health problem worldwide. Several unique HAV properties and molecular mechanisms of its interaction with host were recently discovered and should aid in clarifying the pathogenesis of hepatitis A. Genetic characterization of HAV strains have resulted in the identification of different genotypes and subtypes, which exhibit a characteristic worldwide distribution. Shifts in HAV endemicity occurring in different parts of the world, introduction of genetically diverse strains from geographically distant regions, genotype displacement observed in some countries and population expansion detected in the last decades of the 20th century using phylogenetic analysis are important factors contributing to the complex dynamics of HAV infections worldwide. Strong selection pressures, some of which, like usage of deoptimized codons, are unique to HAV, limit genetic variability of the virus. Analysis of subgenomic regions has been proven useful for outbreak investigations. However, sharing short sequences among epidemiologically unrelated strains indicates that specific identification of HAV strains for molecular surveillance can be achieved only using whole-genome sequences. Here, we present up-to-date information on the HAV molecular epidemiology and evolution, and highlight the most relevant features of the HAV-host interactions. Published by Elsevier B.V.

  10. Comprehensive analysis of the T-cell receptor beta chain gene in rhesus monkey by high throughput sequencing

    PubMed Central

    Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui

    2015-01-01

    Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410

  11. The growing world of expansins

    NASA Technical Reports Server (NTRS)

    Cosgrove, Daniel J.; Li, Lian Chao; Cho, Hyung-Taeg; Hoffmann-Benning, Susanne; Moore, Richard C.; Blecker, Douglas

    2002-01-01

    Expansins are cell wall proteins that induce pH-dependent wall extension and stress relaxation in a characteristic and unique manner. Two families of expansins are known, named alpha- and beta-expansins, and they comprise large multigene families whose members show diverse organ-, tissue- and cell-specific expression patterns. Other genes that bear distant sequence similarity to expansins are also represented in the sequence databases, but their biological and biochemical functions have not yet been uncovered. Expansin appears to weaken glucan-glucan binding, but its detailed mechanism of action is not well established. The biological roles of expansins are diverse, but can be related to the action of expansins to loosen cell walls, for example during cell enlargement, fruit softening, pollen tube and root hair growth, and abscission. Expansin-like proteins have also been identified in bacteria and fungi, where they may aid microbial invasion of the plant body.

  12. Expanded complexity of unstable repeat diseases

    PubMed Central

    Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek

    2015-01-01

    Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding of both the mechanism of repeat instability and molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability, the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeats diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240

  13. Wickerhamomyces mori sp. nov., an anamorphic yeast species found in the guts of wood-boring insect larvae.

    PubMed

    Hui, Feng-Li; Chen, Liang; Chu, Xue-Ying; Niu, Qiu-Hong; Ke, Tao

    2013-03-01

    A novel anamorphic yeast species is described to accommodate three isolates recovered from the guts of three different wood-boring insect larvae collected in Henan, central China. On the basis of sequence analyses of the D1/D2 domains of the large-subunit rRNA gene and the internal transcribed spacer regions, the three strains are assigned to a novel species of the genus Wickerhamomyces, although the formation of ascospores was not observed. These strains also exhibited a number of distinct morphological and physiological characteristics that clearly differentiated them from Wickerhamomyces mucosus, Candida odintsovae and Wickerhamomyces rabaulensis, the most closely related species. In view of the phenotypic differences and unique rRNA gene sequences, we consider that these three isolates represent a novel species of the genus Wickerhamomyces, Wickerhamomyces mori sp. nov. The type strain is NYNU 1216(T) ( = CICC 1983(T)  = CBS 12678(T)).

  14. Reconciling the Structural Attributes of Avian Antibodies*

    PubMed Central

    Conroy, Paul J.; Law, Ruby H. P.; Gilgunn, Sarah; Hearty, Stephen; Caradoc-Davies, Tom T.; Lloyd, Gordon; O'Kennedy, Richard J.; Whisstock, James C.

    2014-01-01

    Antibodies are high value therapeutic, diagnostic, biotechnological, and research tools. Combinatorial approaches to antibody discovery have facilitated access to unique antibodies by surpassing the diversity limitations of the natural repertoire, exploitation of immune repertoires from multiple species, and tailoring selections to isolate antibodies with desirable biophysical attributes. The V-gene repertoire of the chicken does not utilize highly diverse sequence and structures, which is in stark contrast to the mechanism employed by humans, mice, and primates. Recent exploitation of the avian immune system has generated high quality, high affinity antibodies to a wide range of antigens for a number of therapeutic, diagnostic and biotechnological applications. Furthermore, extensive examination of the amino acid characteristics of the chicken repertoire has provided significant insight into mechanisms employed by the avian immune system. A paucity of avian antibody crystal structures has limited our understanding of the structural consequences of these uniquely chicken features. This paper presents the crystal structure of two chicken single chain fragment variable (scFv) antibodies generated from large libraries by phage display against important human antigen targets, which capture two unique CDRL1 canonical classes in the presence and absence of a non-canonical disulfide constrained CDRH3. These structures cast light on the unique structural features of chicken antibodies and contribute further to our collective understanding of the unique mechanisms of diversity and biochemical attributes that render the chicken repertoire of particular value for antibody generation. PMID:24737329

  15. Swallow Event Sequencing: Comparing Healthy Older and Younger Adults.

    PubMed

    Herzberg, Erica G; Lazarus, Cathy L; Steele, Catriona M; Molfenter, Sonja M

    2018-04-23

    Previous research has established that a great deal of variation exists in the temporal sequence of swallowing events for healthy adults. Yet, the impact of aging on swallow event sequence is not well understood. Kendall et al. (Dysphagia 18(2):85-91, 2003) suggested there are 4 obligatory paired-event sequences in swallowing. We directly compared adherence to these sequences, as well as event latencies, and quantified the percentage of unique sequences in two samples of healthy adults: young (< 45) and old (> 65). The 8 swallowing events that contribute to the sequences were reliably identified from videofluoroscopy in a sample of 23 healthy seniors (10 male, mean age 74.7) and 20 healthy young adults (10 male, mean age 31.5) with no evidence of penetration-aspiration or post-swallow residue. Chi-square analyses compared the proportions of obligatory pairs and unique sequences by age group. Compared to the older subjects, younger subjects had significantly lower adherence to two obligatory sequences: Upper Esophageal Sphincter (UES) opening occurs before (or simultaneous with) the bolus arriving at the UES and UES maximum distention occurs before maximum pharyngeal constriction. The associated latencies were significantly different between age groups as well. Further, significantly fewer unique swallow sequences were observed in the older group (61%) compared with the young (82%) (χ 2  = 31.8; p < 0.001). Our findings suggest that paired swallow event sequences may not be robust across the age continuum and that variation in swallow sequences appears to decrease with aging. These findings provide normative references for comparisons to older individuals with dysphagia.

  16. Novel numerical and graphical representation of DNA sequences and proteins.

    PubMed

    Randić, M; Novic, M; Vikić-Topić, D; Plavsić, D

    2006-12-01

    We have introduced novel numerical and graphical representations of DNA, which offer a simple and unique characterization of DNA sequences. The numerical representation of a DNA sequence is given as a sequence of real numbers derived from a unique graphical representation of the standard genetic code. There is no loss of information on the primary structure of a DNA sequence associated with this numerical representation. The novel representations are illustrated with the coding sequences of the first exon of beta-globin gene of half a dozen species in addition to human. The method can be extended to proteins as is exemplified by humanin, a 24-aa peptide that has recently been identified as a specific inhibitor of neuronal cell death induced by familial Alzheimer's disease mutant genes.

  17. High-resolution seismic reflection to delineate shallow gas in Eastern Kansas

    USGS Publications Warehouse

    Miller, R.D.; Watney, W.L.; Begay, D.K.; Xia, J.

    2000-01-01

    Unique amplitude characteristics of shallow gas sands within Pennsylvanian clastic-carbonate dominated sequences are discernible on high-resolution seismic reflection data in eastern Kansas. Upward grading sequences of sand into shale represent a potential gas reservoir with a low-impedence acoustic contrast at the base of the encasing layer. The gas sand and encasing shale, which define the gas reservoir studied here, are part of an erosional incised valley where about 30 m of carbonates and shale have been replaced by sandstone and shale confined to the incised valley. These consolidated geologic settings would normally possess high impedence gas sand reservoirs as defined by abrupt contacts between the gas sand and encasing shale. Based orr core and borehole logs, the gas sand studied here grades from sand into shale in a fashion analogous to that observed in classic low-impedance environments. Amplitude and phase characteristics of high-resolution seismic data across this approximately 400-m wide gas sand are indicative of a low-impedance reservoir. Shot gathers possess classic amplitude with offsett-dependent characteristics which are manifeted on the stacked section as "bright spots." Dominant Frequencies of around 120 Hz allow detection of several reflectors within the 30+ meters of sand/shale that make up this localized gas-rich incised valley fill. The gradational nature of the trapping mechanism observed in this gas reservoir would make detection with conventional seismic reflection methods unlikely.

  18. Real-Time PCR Assay for a Unique Chromosomal Sequence of Bacillus anthracis

    DTIC Science & Technology

    2004-12-01

    13061 Neisseria lactamica .............................................................. 23970 Bacillus coagulans ...NEG Bacillus coagulane 7050 NEG NEG Bacillus cereus 13472 NEG NEG Bacillus licheniforms 12759 NEG NEG Bacillus cereus 13824 NEG NEG Bacillus ...Assay for a Unique Chromosomal Sequence of Bacillus anthracis Elizabeth Bode,1 William Hurtle,2† and David Norwood1* United States Army Medical

  19. Draft Genome Sequence of the Spore-Forming Probiotic Strain Bacillus coagulans Unique IS-2

    PubMed Central

    Upadrasta, Aditya; Pitta, Swetha

    2016-01-01

    Bacillus coagulans Unique IS-2 is a potential spore-forming probiotic that is commercially available on the market. The draft genome sequence presented here provides deep insight into the beneficial features of this strain for its safe use as a probiotic for various human and animal health applications. PMID:27103709

  20. Characteristics of yak platelet derived growth factors-alpha gene and expression in brain tissues.

    PubMed

    Huang, Zhenhua; Pan, Yangyang; Liu, Penggang; Yu, Sijiu; Cui, Yan

    2017-05-29

    Platelet derived growth factors (PDGFs) are key components of autocrine and paracrine signaling, both of which play important roles in mammalian developmental processes. PDGF expression levels also relate to oxygen levels. The characteristics of yak PDGFs, which are indigenous to hypoxic environments, have not been clearly described until the current study. We amplified the open reading frame encoding yak (Bos grunniens) platelet derived growth factor-a (PDGFA) from a yak skin tissue cDNA library by reverse transcriptase polymerase chain reaction (PCR) using specific primers and Sanger dideoxy sequencing. Expression of PDGFA mRNA in different portions of yak brain tissue (cerebrum, cerebellum, hippocampus, and spinal cord) was detected by quantitative real-time PCR (qRT-PCR). PDGFA protein expression levels and its location in different portions of the yak brain were evaluated by western blot and immunohistochemistry. We obtained a yak PDGFA 755 bp cDNA gene fragment containing a 636 bp open reading frame, encoding 211 amino acids (GenBank: KU851801). Phylogenetic analysis shows yak PDGFA to be well conserved, having 98.1% DNA sequence identity to homologous Bubalus bubalus and Bos taurus PDGFA genes. However, eight nucleotides in the yak DNA sequence and four amino acids in the yak protein sequence differ from the other two species. PDGFA is widely expressed in yak brain tissue, and furthermore, PDGFA expression in the cerebrum and cerebellum are higher than in the hippocampus and spinal cord (p > 0.05). PDGFA was observed by immunohistochemistry in glial cells of the cerebrum, cerebellum, and hippocampus, as well as in pyramidal cells of the cerebrum, and Purkinje cell bodies of the hippocampus, but not in glial cells of the spinal cord. The PDGFA gene is well conserved in the animal kingdom; however, the yak PDGFA gene has unique characteristics and brain expression patterns specific to this high elevation species.

  1. Genus-Wide Comparative Genomics of Malassezia Delineates Its Phylogeny, Physiology, and Niche Adaptation on Human Skin

    PubMed Central

    Wu, Guangxi; Zhao, He; Li, Chenhao; Rajapakse, Menaka Priyadarsani; Wong, Wing Cheong; Xu, Jun; Saunders, Charles W.; Reeder, Nancy L.; Reilman, Raymond A.; Scheynius, Annika; Sun, Sheng; Billmyre, Blake Robert; Li, Wenjun; Averette, Anna Floyd; Mieczkowski, Piotr; Heitman, Joseph; Theelen, Bart; Schröder, Markus S.; De Sessions, Paola Florez; Butler, Geraldine; Maurer-Stroh, Sebastian; Boekhout, Teun; Nagarajan, Niranjan; Dawson, Thomas L.

    2015-01-01

    Malassezia is a unique lipophilic genus in class Malasseziomycetes in Ustilaginomycotina, (Basidiomycota, fungi) that otherwise consists almost exclusively of plant pathogens. Malassezia are typically isolated from warm-blooded animals, are dominant members of the human skin mycobiome and are associated with common skin disorders. To characterize the genetic basis of the unique phenotypes of Malassezia spp., we sequenced the genomes of all 14 accepted species and used comparative genomics against a broad panel of fungal genomes to comprehensively identify distinct features that define the Malassezia gene repertoire: gene gain and loss; selection signatures; and lineage-specific gene family expansions. Our analysis revealed key gene gain events (64) with a single gene conserved across all Malassezia but absent in all other sequenced Basidiomycota. These likely horizontally transferred genes provide intriguing gain-of-function events and prime candidates to explain the emergence of Malassezia. A larger set of genes (741) were lost, with enrichment for glycosyl hydrolases and carbohydrate metabolism, concordant with adaptation to skin’s carbohydrate-deficient environment. Gene family analysis revealed extensive turnover and underlined the importance of secretory lipases, phospholipases, aspartyl proteases, and other peptidases. Combining genomic analysis with a re-evaluation of culture characteristics, we establish the likely lipid-dependence of all Malassezia. Our phylogenetic analysis sheds new light on the relationship between Malassezia and other members of Ustilaginomycotina, as well as phylogenetic lineages within the genus. Overall, our study provides a unique genomic resource for understanding Malassezia niche-specificity and potential virulence, as well as their abundance and distribution in the environment and on human skin. PMID:26539826

  2. Genus-Wide Comparative Genomics of Malassezia Delineates Its Phylogeny, Physiology, and Niche Adaptation on Human Skin.

    PubMed

    Wu, Guangxi; Zhao, He; Li, Chenhao; Rajapakse, Menaka Priyadarsani; Wong, Wing Cheong; Xu, Jun; Saunders, Charles W; Reeder, Nancy L; Reilman, Raymond A; Scheynius, Annika; Sun, Sheng; Billmyre, Blake Robert; Li, Wenjun; Averette, Anna Floyd; Mieczkowski, Piotr; Heitman, Joseph; Theelen, Bart; Schröder, Markus S; De Sessions, Paola Florez; Butler, Geraldine; Maurer-Stroh, Sebastian; Boekhout, Teun; Nagarajan, Niranjan; Dawson, Thomas L

    2015-11-01

    Malassezia is a unique lipophilic genus in class Malasseziomycetes in Ustilaginomycotina, (Basidiomycota, fungi) that otherwise consists almost exclusively of plant pathogens. Malassezia are typically isolated from warm-blooded animals, are dominant members of the human skin mycobiome and are associated with common skin disorders. To characterize the genetic basis of the unique phenotypes of Malassezia spp., we sequenced the genomes of all 14 accepted species and used comparative genomics against a broad panel of fungal genomes to comprehensively identify distinct features that define the Malassezia gene repertoire: gene gain and loss; selection signatures; and lineage-specific gene family expansions. Our analysis revealed key gene gain events (64) with a single gene conserved across all Malassezia but absent in all other sequenced Basidiomycota. These likely horizontally transferred genes provide intriguing gain-of-function events and prime candidates to explain the emergence of Malassezia. A larger set of genes (741) were lost, with enrichment for glycosyl hydrolases and carbohydrate metabolism, concordant with adaptation to skin's carbohydrate-deficient environment. Gene family analysis revealed extensive turnover and underlined the importance of secretory lipases, phospholipases, aspartyl proteases, and other peptidases. Combining genomic analysis with a re-evaluation of culture characteristics, we establish the likely lipid-dependence of all Malassezia. Our phylogenetic analysis sheds new light on the relationship between Malassezia and other members of Ustilaginomycotina, as well as phylogenetic lineages within the genus. Overall, our study provides a unique genomic resource for understanding Malassezia niche-specificity and potential virulence, as well as their abundance and distribution in the environment and on human skin.

  3. 20-Hz pulses and other vocalizations of fin whales, Balaenoptera physalus, in the Gulf of California, Mexico.

    PubMed

    Thompson, P O; Findley, L T; Vidal, O

    1992-12-01

    Low-frequency vocalizations were recorded from fin whales, Balaenoptera physalus, in the Gulf of California, Mexico, during three cruises. In March 1985, recorded 20-Hz pulses were in sequences of regular 9-s interpulse intervals. In August 1987, nearly all were in sequences of doublets with alternating 5- and 18-s interpulse intervals. No 20-Hz pulse sequences of any kind were detected in February 1987. The typical pulse modulated from 42 to 20 Hz and its median duration was 0.7 s (1985 data). Most other fin whale sounds were also short tonal pulses averaging 82, 56, and 68 Hz, respectively, for the three cruises; 89% were modulated in frequency, mostly downward. Compared to Atlantic and Pacific Ocean regions, Gulf of California 20-Hz pulses were unique in terms of frequency modulation, interpulse sound levels, and temporal patterns. Fin whales in the Gulf may represent a regional stock revealed by their sound characteristics, a phenomenon previously shown for humpback whales, birds, and fish. Regional differences in fin whale sounds were found in comparisons of Atlantic and Pacific locations.

  4. Characterization of culturable anaerobic bacteria from the forestomach of an eastern grey kangaroo, Macropus giganteus.

    PubMed

    Ouwerkerk, D; Klieve, A V; Forster, R J; Templeton, J M; Maguire, A J

    2005-01-01

    To determine the culturable biodiversity of anaerobic bacteria isolated from the forestomach contents of an eastern grey kangaroo, Macropus giganteus, using phenotypic characterization and 16S rDNA sequence analysis. Bacteria from forestomach contents of an eastern grey kangaroo were isolated using anaerobic media containing milled curly Mitchell grass (Astrebla lappacea). DNA was extracted and the 16S rDNA sequenced for phylogenetic analysis. Forty bacterial isolates were obtained and placed in 17 groups based on phenotypic characteristics and restriction enzyme digestion of 16S rDNA PCR products. DNA sequencing revealed that the 17 groups comprised five known species (Clostridium butyricum, Streptococcus bovis, Clostridium sporogenes, Clostridium paraputrificum and Enterococcus avium) and 12 groups apparently representing new species, all within the phylum Firmicutes. Foregut contents from Australian macropod marsupials contain a microbial ecosystem with a novel bacterial biodiversity comprising a high percentage of previously unrecognized species. This study adds to knowledge of Australia's unique biodiversity, which may provide a future bioresource of genetic information and bacterial species of benefit to agriculture.

  5. Designable and dynamic single-walled stiff nanotubes assembled from sequence-defined peptoids

    DOE PAGES

    Jin, Haibao; Ding, Yan-Huai; Wang, Mingming; ...

    2018-01-18

    Despite recent advances in assembly of organic nanotubes, conferral of sequence-defined engineering and dynamic response characteristics to the tubules remains a challenge. Here we report a new family of highly-designable and dynamic single-walled nanotubes assembled from sequence-defined peptoids through a unique “rolling-up and closure of nanosheet” mechanism. During the assembly process, amorphous spherical particles of amphiphilic peptoid oligomers (APOs) crystallized to form well-defined nanosheets which were then folded to form single-walled peptoid nanotubes (SW-PNTs). These SW-PNTs undergo a pH-triggered, reversible contraction-expansion motion. By varying the number of hydrophobic residues of APOs, we demonstrate the tuning of PNT wall thickness andmore » diameter, and mechanical properties. AFM-based mechanical measurements indicate that PNTs are highly stiff (Young’s Modulus ~13-17 GPa), comparable to the stiffest known biological materials. We further demonstrate that the precise incorporation of functional groups within PNTs and the application of functional PNTs in water decontamination. We believe these SW-PNTs can provide a robust platform for development of biomimetic materials tailored to specific applications.« less

  6. Designable and dynamic single-walled stiff nanotubes assembled from sequence-defined peptoids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jin, Haibao; Ding, Yan-Huai; Wang, Mingming

    Despite recent advances in assembly of organic nanotubes, conferral of sequence-defined engineering and dynamic response characteristics to the tubules remains a challenge. Here we report a new family of highly-designable and dynamic single-walled nanotubes assembled from sequence-defined peptoids through a unique “rolling-up and closure of nanosheet” mechanism. During the assembly process, amorphous spherical particles of amphiphilic peptoid oligomers (APOs) crystallized to form well-defined nanosheets which were then folded to form single-walled peptoid nanotubes (SW-PNTs). These SW-PNTs undergo a pH-triggered, reversible contraction-expansion motion. By varying the number of hydrophobic residues of APOs, we demonstrate the tuning of PNT wall thickness andmore » diameter, and mechanical properties. AFM-based mechanical measurements indicate that PNTs are highly stiff (Young’s Modulus ~13-17 GPa), comparable to the stiffest known biological materials. We further demonstrate that the precise incorporation of functional groups within PNTs and the application of functional PNTs in water decontamination. We believe these SW-PNTs can provide a robust platform for development of biomimetic materials tailored to specific applications.« less

  7. Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

    PubMed Central

    Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

    2012-01-01

    RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676

  8. Sequencing Needs for Viral Diagnostics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gardner, S N; Lam, M; Mulakken, N J

    2004-01-26

    We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to prevent false negatives) and unique relative to other species (for specificity, to avoid false positives). We performed simulations, based on existing sequence data, to assess the number of genome sequences of a target species and of close phylogenetic relatives (''nearmore » neighbors'') that are required to predict diagnostic signature regions that are conserved among strains of the target species and unique relative to other bacterial and viral species. For DNA viruses such as variola (smallpox), three target genomes provide sufficient guidance for selecting species-wide signatures. Three near neighbor genomes are critical for species specificity. In contrast, most RNA viruses require four target genomes and no near neighbor genomes, since lack of conservation among strains is more limiting than uniqueness. SARS and Ebola Zaire are exceptional, as additional target genomes currently do not improve predictions, but near neighbor sequences are urgently needed. Our results also indicate that double stranded DNA viruses are more conserved among strains than are RNA viruses, since in most cases there was at least one conserved signature candidate for the DNA viruses and zero conserved signature candidates for the RNA viruses.« less

  9. The most common Chinese rhesus macaque MHC class I molecule shares peptide binding repertoire with the HLA-B7 supertype

    PubMed Central

    Solomon, Christopher; Southwood, Scott; Hoof, Ilka; Rudersdorf, Richard; Peters, Bjoern; Sidney, John; Pinilla, Clemencia; Marcondes, Maria Cecilia Garibaldi; Ling, Binhua; Marx, Preston; Sette, Alessandro

    2010-01-01

    Of the two rhesus macaque subspecies used for AIDS studies, the Simian immunodeficiency virus-infected Indian rhesus macaque (Macaca mulatta) is the most established model of HIV infection, providing both insight into pathogenesis and a system for testing novel vaccines. Despite the Chinese rhesus macaque potentially being a more relevant model for AIDS outcomes than the Indian rhesus macaque, the Chinese-origin rhesus macaques have not been well-characterized for their major histocompatibility complex (MHC) composition and function, reducing their greater utilization. In this study, we characterized a total of 50 unique Chinese rhesus macaques from several varying origins for their entire MHC class I allele composition and identified a total of 58 unique complete MHC class I sequences. Only nine of the sequences had been associated with Indian rhesus macaques, and 28/58 (48.3%) of the sequences identified were novel. From all MHC alleles detected, we prioritized Mamu-A1*02201 for functional characterization based on its higher frequency of expression. Upon the development of MHC/peptide binding assays and definition of its associated motif, we revealed that this allele shares peptide binding characteristics with the HLA-B7 supertype, the most frequent supertype in human populations. These studies provide the first functional characterization of an MHC class I molecule in the context of Chinese rhesus macaques and the first instance of HLA-B7 analogy for rhesus macaques. Electronic supplementary material The online version of this article (doi:10.1007/s00251-010-0450-3) contains supplementary material, which is available to authorized users. PMID:20480161

  10. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Riley, Robert; Salamov, Asaf; Otillar, Robert

    Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes revealsmore » that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.« less

  11. Sequencing Adventure Activities: A New Perspective.

    ERIC Educational Resources Information Center

    Bisson, Christian

    Sequencing in adventure education involves putting activities in an order appropriate to the needs of the group. Contrary to the common assumption that each adventure sequence is unique, a review of literature concerning five sequencing models reveals a certain universality. These models present sequences that move through four phases: group…

  12. The neXtProt peptide uniqueness checker: a tool for the proteomics community.

    PubMed

    Schaeffer, Mathieu; Gateau, Alain; Teixeira, Daniel; Michel, Pierre-André; Zahn-Zabal, Monique; Lane, Lydie

    2017-11-01

    The neXtProt peptide uniqueness checker allows scientists to define which peptides can be used to validate the existence of human proteins, i.e. map uniquely versus multiply to human protein sequences taking into account isobaric substitutions, alternative splicing and single amino acid variants. The pepx program is available at https://github.com/calipho-sib/pepx and can be launched from the command line or through a cgi web interface. Indexing requires a sequence file in FASTA format. The peptide uniqueness checker tool is freely available on the web at https://www.nextprot.org/tools/peptide-uniqueness-checker and from the neXtProt API at https://api.nextprot.org/. lydie.lane@sib.swiss. © The Author(s) 2017. Published by Oxford University Press.

  13. A Unique Sequence of Financial Accounting Courses Featuring Team Teaching, Linked Courses, Challenging Assignments, and Instruments for Evaluation and Assessment

    ERIC Educational Resources Information Center

    Lundblad, Heidemarie; Wilson, Barbara A.

    2008-01-01

    The Department of Accounting at California State University Northridge (CSUN) has developed a unique sequence of courses designed to ensure that accounting students are trained not only in technical accounting, but also acquire critical thinking, research and communication skills. The courses have proven effective and have embedded assessment…

  14. VISA--Vector Integration Site Analysis server: a web-based server to rapidly identify retroviral integration sites from next-generation sequencing.

    PubMed

    Hocum, Jonah D; Battrell, Logan R; Maynard, Ryan; Adair, Jennifer E; Beard, Brian C; Rawlings, David J; Kiem, Hans-Peter; Miller, Daniel G; Trobridge, Grant D

    2015-07-07

    Analyzing the integration profile of retroviral vectors is a vital step in determining their potential genotoxic effects and developing safer vectors for therapeutic use. Identifying retroviral vector integration sites is also important for retroviral mutagenesis screens. We developed VISA, a vector integration site analysis server, to analyze next-generation sequencing data for retroviral vector integration sites. Sequence reads that contain a provirus are mapped to the human genome, sequence reads that cannot be localized to a unique location in the genome are filtered out, and then unique retroviral vector integration sites are determined based on the alignment scores of the remaining sequence reads. VISA offers a simple web interface to upload sequence files and results are returned in a concise tabular format to allow rapid analysis of retroviral vector integration sites.

  15. Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

    PubMed Central

    Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun

    2013-01-01

    Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species. PMID:24146870

  16. Characterization of a new apscaviroid from American persimmon.

    PubMed

    Ito, Takao; Suzaki, Koichi; Nakano, Masaaki; Sato, Akihiko

    2013-12-01

    A unique circular molecule of 358 nucleotides was detected in American persimmon (Diospyros virginiana L.). The molecule was graft-transmissible and had genetic characteristics of members of the genus Apscaviroid. It had the highest sequence similarity (72-73 %) to citrus viroid VI (CVd-VI) and formed a clade with CVd-VI, citrus dwarfing viroid, and apple dimple fruit viroid in a phylogenetic tree. The molecule was not detected in citrus, unlike CVd-VI, which infects citrus and persimmon, and it was genetically distant from persimmon latent viroid, which infects persimmon only. The genetic and biological features indicated that the molecule may be a member of a new apscaviroid species.

  17. Euchromatic subdomains in rice centromeres are associated with genes and transcription.

    PubMed

    Wu, Yufeng; Kikuchi, Shinji; Yan, Huihuang; Zhang, Wenli; Rosenbaum, Heidi; Iniguez, A Leonardo; Jiang, Jiming

    2011-11-01

    The presence of the centromere-specific histone H3 variant, CENH3, defines centromeric (CEN) chromatin, but poorly understood epigenetic mechanisms determine its establishment and maintenance. CEN chromatin is embedded within pericentromeric heterochromatin in most higher eukaryotes, but, interestingly, it can show euchromatic characteristics; for example, the euchromatic histone modification mark dimethylated H3 Lys 4 (H3K4me2) is uniquely associated with animal centromeres. To examine the histone marks and chromatin properties of plant centromeres, we developed a genomic tiling array for four fully sequenced rice (Oryza sativa) centromeres and used chromatin immunoprecipitation-chip to study the patterns of four euchromatic histone modification marks: H3K4me2, trimethylated H3 Lys 4, trimethylated H3 Lys 36, and acetylated H3 Lys 4, 9. The vast majority of the four histone marks were associated with genes located in the H3 subdomains within the centromere cores. We demonstrate that H3K4me2 is not a ubiquitous component of rice CEN chromatin, and the euchromatic characteristics of rice CEN chromatin are hallmarks of the transcribed sequences embedded in the centromeric H3 subdomains. We propose that the transcribed sequences located in rice centromeres may provide a barrier preventing loading of CENH3 into the H3 subdomains. The separation of CENH3 and H3 subdomains in the centromere core may be favorable for the formation of three-dimensional centromere structure and for rice centromere function.

  18. Bioinformatics analysis identifies several intrinsically disordered human E3 ubiquitin-protein ligases.

    PubMed

    Boomsma, Wouter; Nielsen, Sofie V; Lindorff-Larsen, Kresten; Hartmann-Petersen, Rasmus; Ellgaard, Lars

    2016-01-01

    The ubiquitin-proteasome system targets misfolded proteins for degradation. Since the accumulation of such proteins is potentially harmful for the cell, their prompt removal is important. E3 ubiquitin-protein ligases mediate substrate ubiquitination by bringing together the substrate with an E2 ubiquitin-conjugating enzyme, which transfers ubiquitin to the substrate. For misfolded proteins, substrate recognition is generally delegated to molecular chaperones that subsequently interact with specific E3 ligases. An important exception is San1, a yeast E3 ligase. San1 harbors extensive regions of intrinsic disorder, which provide both conformational flexibility and sites for direct recognition of misfolded targets of vastly different conformations. So far, no mammalian ortholog of San1 is known, nor is it clear whether other E3 ligases utilize disordered regions for substrate recognition. Here, we conduct a bioinformatics analysis to examine >600 human and S. cerevisiae E3 ligases to identify enzymes that are similar to San1 in terms of function and/or mechanism of substrate recognition. An initial sequence-based database search was found to detect candidates primarily based on the homology of their ordered regions, and did not capture the unique disorder patterns that encode the functional mechanism of San1. However, by searching specifically for key features of the San1 sequence, such as long regions of intrinsic disorder embedded with short stretches predicted to be suitable for substrate interaction, we identified several E3 ligases with these characteristics. Our initial analysis revealed that another remarkable trait of San1 is shared with several candidate E3 ligases: long stretches of complete lysine suppression, which in San1 limits auto-ubiquitination. We encode these characteristic features into a San1 similarity-score, and present a set of proteins that are plausible candidates as San1 counterparts in humans. In conclusion, our work indicates that San1 is not a unique case, and that several other yeast and human E3 ligases have sequence properties that may allow them to recognize substrates by a similar mechanism as San1.

  19. Genome sequence of the Japanese oak silk moth, Antheraea yamamai: the first draft genome in the family Saturniidae

    PubMed Central

    Kim, Seong-Ryul; Kwak, Woori; Kim, Hyaekang; Kim, Kee-Young; Kim, Su-Bae; Choi, Kwang-Ho; Kim, Seong-Wan; Hwang, Jae-Sam; Kim, Minjee; Kim, Iksoo; Goo, Tae-Won

    2018-01-01

    Abstract Background Antheraea yamamai, also known as the Japanese oak silk moth, is a wild species of silk moth. Silk produced by A. yamamai, referred to as tensan silk, shows different characteristics such as thickness, compressive elasticity, and chemical resistance compared with common silk produced from the domesticated silkworm, Bombyx mori. Its unique characteristics have led to its use in many research fields including biotechnology and medical science, and the scientific as well as economic importance of the wild silk moth continues to gradually increase. However, no genomic information for the wild silk moth, including A. yamamai, is currently available. Findings In order to construct the A. yamamai genome, a total of 147G base pairs using Illumina and Pacbio sequencing platforms were generated, providing 210-fold coverage based on the 700-Mb estimated genome size of A. yamamai. The assembled genome of A. yamamai was 656 Mb (>2 kb) with 3675 scaffolds, and the N50 length of assembly was 739 Kb with a 34.07% GC ratio. Identified repeat elements covered 37.33% of the total genome, and the completeness of the constructed genome assembly was estimated to be 96.7% by Benchmarking Universal Single-Copy Orthologs v2 analysis. A total of 15 481 genes were identified using Evidence Modeler based on the gene prediction results obtained from 3 different methods (ab initio, RNA-seq-based, known-gene-based) and manual curation. Conclusions Here we present the genome sequence of A. yamamai, the first genome sequence of the wild silk moth. These results provide valuable genomic information, which will help enrich our understanding of the molecular mechanisms relating to not only specific phenotypes such as wild silk itself but also the genomic evolution of Saturniidae. PMID:29186418

  20. Complete genome sequence and comparative analysis of Acetobacter pasteurianus 386B, a strain well-adapted to the cocoa bean fermentation ecosystem.

    PubMed

    Illeghems, Koen; De Vuyst, Luc; Weckx, Stefan

    2013-08-01

    Acetobacter pasteurianus 386B, an acetic acid bacterium originating from a spontaneous cocoa bean heap fermentation, proved to be an ideal functional starter culture for coca bean fermentations. It is able to dominate the fermentation process, thereby resisting high acetic acid concentrations and temperatures. However, the molecular mechanisms underlying its metabolic capabilities and niche adaptations are unknown. In this study, whole-genome sequencing and comparative genome analysis was used to investigate this strain's mechanisms to dominate the cocoa bean fermentation process. The genome sequence of A. pasteurianus 386B is composed of a 2.8-Mb chromosome and seven plasmids. The annotation of 2875 protein-coding sequences revealed important characteristics, including several metabolic pathways, the occurrence of strain-specific genes such as an endopolygalacturonase, and the presence of mechanisms involved in tolerance towards various stress conditions. Furthermore, the low number of transposases in the genome and the absence of complete phage genomes indicate that this strain might be more genetically stable compared with other A. pasteurianus strains, which is an important advantage for the use of this strain as a functional starter culture. Comparative genome analysis with other members of the Acetobacteraceae confirmed the functional properties of A. pasteurianus 386B, such as its thermotolerant nature and unique genetic composition. Genome analysis of A. pasteurianus 386B provided detailed insights into the underlying mechanisms of its metabolic features, niche adaptations, and tolerance towards stress conditions. Combination of these data with previous experimental knowledge enabled an integrated, global overview of the functional characteristics of this strain. This knowledge will enable improved fermentation strategies and selection of appropriate acetic acid bacteria strains as functional starter culture for cocoa bean fermentation processes.

  1. Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.

    PubMed Central

    Grindley, N D; Joyce, C M

    1980-01-01

    The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245

  2. Points of View: A Survey of Survey Courses--Are They Effective? A Unique Approach? Four Semesters of Biology Core Curriculum

    ERIC Educational Resources Information Center

    Batzli, Janet M.

    2005-01-01

    ''Why four semesters? How does this track differ from the two-semester course sequence?'' These are the most common questions students have when they learn about the Biology Core Curriculum (Biocore), a unique four-semester honors biology sequence at University of Wisconsin-Madison (UW-Madison). Biocore was first taught at University of Wisconsin…

  3. Designing deep sequencing experiments: detecting structural variation and estimating transcript abundance.

    PubMed

    Bashir, Ali; Bansal, Vikas; Bafna, Vineet

    2010-06-18

    Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.

  4. The complete chloroplast genome sequence of Epipremnum aureum and its comparative analysis among eight Araceae species

    PubMed Central

    Han, Limin; Chen, Chen; Wang, Zhezhi

    2018-01-01

    Epipremnum aureum is an important foliage plant in the Araceae family. In this study, we have sequenced the complete chloroplast genome of E. aureum by using Illumina Hiseq sequencing platforms. This genome is a double-stranded circular DNA sequence of 164,831 bp that contains 35.8% GC. The two inverted repeats (IRa and IRb; 26,606 bp) are spaced by a small single-copy region (22,868 bp) and a large single-copy region (88,751 bp). The chloroplast genome has 131 (113 unique) functional genes, including 86 (79 unique) protein-coding genes, 37 (30 unique) tRNA genes, and eight (four unique) rRNA genes. Tandem repeats comprise the majority of the 43 long repetitive sequences. In addition, 111 simple sequence repeats are present, with mononucleotides being the most common type and di- and tetranucleotides being infrequent events. Positive selection pressure on rps12 in the E. aureum chloroplast has been demonstrated via synonymous and nonsynonymous substitution rates and selection pressure sites analyses. Ycf15 and infA are pseudogenes in this species. We constructed a Maximum Likelihood phylogenetic tree based on the complete chloroplast genomes of 38 species from 13 families. Those results strongly indicated that E. aureum is positioned as the sister of Colocasia esculenta within the Araceae family. This work may provide information for further study of the molecular phylogenetic relationships within Araceae, as well as molecular markers and breeding novel varieties by chloroplast genetic-transformation of E. aureum in particular. PMID:29529038

  5. A plasma membrane sucrose-binding protein that mediates sucrose uptake shares structural and sequence similarity with seed storage proteins but remains functionally distinct.

    PubMed

    Overvoorde, P J; Chao, W S; Grimes, H D

    1997-06-20

    Photoaffinity labeling of a soybean cotyledon membrane fraction identified a sucrose-binding protein (SBP). Subsequent studies have shown that the SBP is a unique plasma membrane protein that mediates the linear uptake of sucrose in the presence of up to 30 mM external sucrose when ectopically expressed in yeast. Analysis of the SBP-deduced amino acid sequence indicates it lacks sequence similarity with other known transport proteins. Data presented here, however, indicate that the SBP shares significant sequence and structural homology with the vicilin-like seed storage proteins that organize into homotrimers. These similarities include a repeated sequence that forms the basis of the reiterated domain structure characteristic of the vicilin-like protein family. In addition, analytical ultracentrifugation and nonreducing SDS-polyacrylamide gel electrophoresis demonstrate that the SBP appears to be organized into oligomeric complexes with a Mr indicative of the existence of SBP homotrimers and homodimers. The structural similarity shared by the SBP and vicilin-like proteins provides a novel framework to explore the mechanistic basis of SBP-mediated sucrose uptake. Expression of the maize Glb protein (a vicilin-like protein closely related to the SBP) in yeast demonstrates that a closely related vicilin-like protein is unable to mediate sucrose uptake. Thus, despite sequence and structural similarities shared by the SBP and the vicilin-like protein family, the SBP is functionally divergent from other members of this group.

  6. Nanobodies®: new ammunition to battle viruses.

    PubMed

    Vanlandschoot, Peter; Stortelers, Catelijne; Beirnaert, Els; Ibañez, Lorena Itatí; Schepens, Bert; Depla, Erik; Saelens, Xavier

    2011-12-01

    In 1989, a new type of antibody was identified, first in the sera of dromedaries and later also in all other species of the Camelidae family. These antibodies do not contain a light chain and also lack the first constant heavy domain. Today it is still unclear what the evolutionary advantage of such heavy chain-only antibodies could be. In sharp contrast, the broad applicability of the isolated variable antigen-binding domains (VHH) was rapidly recognized, especially for the development of therapeutic proteins, called Nanobodies(®). Here we summarize first some of the unique characteristics and features of VHHs. These will next be described in the context of different experimental therapeutic applications of Nanobodies against different viruses: HIV, Hepatitis B virus, influenza virus, Respiratory Syncytial virus, Rabies virus, FMDV, Poliovirus, Rotavirus, and PERVs. Next, the diagnostic application of VHHs (Vaccinia virus, Marburg virus and plant Tulip virus X), as well as an industrial application (lytic lactococcal 936 phage) will be described. In addition, the described data show that monovalent Nanobodies can possess unique characteristics not observed with conventional antibodies. The straightforward formatting into bivalent, multivalent, and/or multispecific Nanobodies allowed tailoring molecules for potency and cross-reactivity against viral targets with high sequence diversity. Copyright © 2011. Published by Elsevier B.V.

  7. Diversity and distribution of entomopathogenic nematodes (Nematoda: Steinernematidae, Heterorhabditidae) and their bacterial symbionts (gamma-Proteobacteria: Enterobacteriaceae) in Jordan.

    PubMed

    Stock, S Patricia; Al Banna, Luma; Darwish, Rula; Katbeh, Ahmad

    2008-06-01

    Until now, only a few systematic surveys of entomopathogenic nematodes (EPN) have been conducted in Middle Eastern countries. Many of the recovered EPN species in this region have shown to own distinctive qualities that enable their survival in unique environments, such as high temperatures and low moisture levels tolerance. These new species and strains, with unique environmental tolerances, are more suitable for their consideration in pest management programs in xerophytic regions. With this background in mind, we recently conducted a survey of EPN in Jordan. This study records for the first time the diversity and distribution of these nematodes and their bacterial symbionts in this country. Jordan's three geographic regions: (1) the highlands, (2) Jordan valley and (3) the desert region were sampled. Within each region, natural habitats and agricultural regions characteristic to each region were considered for sampling purposes. Four EPN species including three Steinernema and one Heterorhabditis were recovered. Nematodes were identified using a combination of molecular markers and classic morphological diagnostic tools. Bacterial symbionts were identified by analysis of 16S rRNA sequences. Abiotic characteristics such as soil type, soil pH, and elevation were also recorded. We herein report the diversity of EPN species in Jordan and discuss their potential in Biocontrol and IPM programs for this country.

  8. [Identification and phylogenetic application of unique nucleotide sequence of nad7 intron2 in Rhodiola (Crassulaceae) species].

    PubMed

    Deng, Ke-Jun; Yang, Zu-Jun; Liu, Cheng; Zhao, Wei; Liu, Chang; Feng, Juan; Ren, Zheng-Long

    2007-03-01

    Genetic characterization of 9 populations of Rhodiola crenulata, R. fastigiata and R. sachalinensis (Crassulaceae) species from Sichuan and Jilin Provinces of China, was investigated using the conserved primer of nad7 intron 2. All PCR products about 800 bp long were shorter than other Crassulaceae plants, which were used as molecular markers to identify the Rhodiola species. The sequence of the products indicated that total exon of 53 bp and intron of 738 bp exhibit only 9 nucleotide variations. Blasting the nad7 sequences to GenBank and the phylogenetic analysis showed that the sequence of Rhodiola species was clusted independently, and the length was smaller than all the registered sequences of higher plants. The result suggests that the Rhiodola species had a unique sequence in this gene region, which might be related to the special growth condition.

  9. RNA Sequencing Reveals Differential Expression of Mitochondrial and Oxidation Reduction Genes in the Long-Lived Naked Mole-Rat When Compared to Mice

    PubMed Central

    Holmes, Andrew; Szafranski, Karol; Faulkes, Chris G.; Coen, Clive W.; Buffenstein, Rochelle; Platzer, Matthias; de Magalhães, João Pedro; Church, George M.

    2011-01-01

    The naked mole-rat (Heterocephalus glaber) is a long-lived, cancer resistant rodent and there is a great interest in identifying the adaptations responsible for these and other of its unique traits. We employed RNA sequencing to compare liver gene expression profiles between naked mole-rats and wild-derived mice. Our results indicate that genes associated with oxidoreduction and mitochondria were expressed at higher relative levels in naked mole-rats. The largest effect is nearly 300-fold higher expression of epithelial cell adhesion molecule (Epcam), a tumour-associated protein. Also of interest are the protease inhibitor, alpha2-macroglobulin (A2m), and the mitochondrial complex II subunit Sdhc, both ageing-related genes found strongly over-expressed in the naked mole-rat. These results hint at possible candidates for specifying species differences in ageing and cancer, and in particular suggest complex alterations in mitochondrial and oxidation reduction pathways in the naked mole-rat. Our differential gene expression analysis obviated the need for a reference naked mole-rat genome by employing a combination of Illumina/Solexa and 454 platforms for transcriptome sequencing and assembling transcriptome contigs of the non-sequenced species. Overall, our work provides new research foci and methods for studying the naked mole-rat's fascinating characteristics. PMID:22073188

  10. A spectroscopic and photometric study of the unique pre- main sequence system KH 15D

    NASA Astrophysics Data System (ADS)

    Hamilton, Catrina Marie

    2004-09-01

    As a class, T Tauri stars are YSOs, some which are surrounded by circumstellar disks, and are recognized as the final stage of low-mass star formation. They also represent the earliest stage of stellar evolution that is optically visible, and, therefore, can be easily studied in detail. Understanding the processes through which these young stars interact with and eventually disperse their circumstellar disks is critical for understanding how they evolve from the T Tauri phase to the zero age main sequence (ZAMS), and how this affects the formation of planets, as well as their rotational evolution. KH 15D is a unique eclipsing system that could provide invaluable insight into the evolution of circumstellar disk material, as well as clues to the close stellar environment. Discovered in 1997, this star system has been observed to undergo an eclipse every 48 days in which the star's light is diminished by 3.5 magnitudes. What is so unusual about the eclipse is that the length of the eclipse has evolved over time, growing in length from 16 days initially, to ˜25 days in 2002/2003. Evolution of disk material on these timescales has never been observed before, and therefore provides us with a unique opportunity to refine our theories about remnant disks around young stars, how they transition, possibly into planets, and what role they play as the star matures and arrives on the zero age main sequence. Additionally, high resolution spectra obtained at specific phases during the December 2001 eclipse showed that as the obscuring matter cut across the star, dramatic spectral changes in the Hα and Hβ lines were seen. Its unique eclipse produces a “natural coronographic” effect in which the stellar photosphere is occulted, revealing details of its magnetosphere and surroundings during eclipse. There is evidence that the weak-lined T Tauri star (WTTS) central to the system is actively accreting gas, although probably not at the rate of a typical classical T Tauri star, calling into question the common practice of associating WTTS characteristics with the absence of an accretion disk. Here I present an investigation of the photometric and spectroscopic properties of the KH 15D eclipsing system, and discuss the implications that this system holds for the future research of T Tauri stars.

  11. PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.

    PubMed

    Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred

    2018-01-01

    The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.

  12. Equivalent Indels – Ambiguous Functional Classes and Redundancy in Databases

    PubMed Central

    Assmus, Jens; Kleffe, Jürgen; Schmitt, Armin O.; Brockmann, Gudrun A.

    2013-01-01

    There is considerable interest in studying sequenced variations. However, while the positions of substitutions are uniquely identifiable by sequence alignment, the location of insertions and deletions still poses problems. Each insertion and deletion causes a change of sequence. Yet, due to low complexity or repetitive sequence structures, the same indel can sometimes be annotated in different ways. Two indels which differ in allele sequence and position can be one and the same, i.e. the alternative sequence of the whole chromosome is identical in both cases and, therefore, the two deletions are biologically equivalent. In such a case, it is impossible to identify the exact position of an indel merely based on sequence alignment. Thus, variation entries in a mutation database are not necessarily uniquely defined. We prove the existence of a contiguous region around an indel in which all deletions of the same length are biologically identical. Databases often show only one of several possible locations for a given variation. Furthermore, different data base entries can represent equivalent variation events. We identified 1,045,590 such problematic entries of insertions and deletions out of 5,860,408 indel entries in the current human database of Ensembl. Equivalent indels are found in sequence regions of different functions like exons, introns or 5' and 3' UTRs. One and the same variation can be assigned to several different functional classifications of which only one is correct. We implemented an algorithm that determines for each indel database entry its complete set of equivalent indels which is uniquely characterized by the indel itself and a given interval of the reference sequence. PMID:23658777

  13. The pomegranate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology.

    PubMed

    Yuan, Zhaohe; Fang, Yanming; Zhang, Taikui; Fei, Zhangjun; Han, Fengming; Liu, Cuiyu; Liu, Min; Xiao, Wei; Zhang, Wenjing; Wu, Shan; Zhang, Mengwei; Ju, Youhui; Xu, Huili; Dai, He; Liu, Yujun; Chen, Yanhui; Wang, Lili; Zhou, Jianqing; Guan, Dian; Yan, Ming; Xia, Yanhua; Huang, Xianbin; Liu, Dongyuan; Wei, Hongmin; Zheng, Hongkun

    2017-12-22

    Pomegranate (Punica granatum L.) has an ancient cultivation history and has become an emerging profitable fruit crop due to its attractive features such as the bright red appearance and the high abundance of medicinally valuable ellagitannin-based compounds in its peel and aril. However, the limited genomic resources have restricted further elucidation of genetics and evolution of these interesting traits. Here, we report a 274-Mb high-quality draft pomegranate genome sequence, which covers approximately 81.5% of the estimated 336-Mb genome, consists of 2177 scaffolds with an N50 size of 1.7 Mb and contains 30 903 genes. Phylogenomic analysis supported that pomegranate belongs to the Lythraceae family rather than the monogeneric Punicaceae family, and comparative analyses showed that pomegranate and Eucalyptus grandis share the paleotetraploidy event. Integrated genomic and transcriptomic analyses provided insights into the molecular mechanisms underlying the biosynthesis of ellagitannin-based compounds, the colour formation in both peels and arils during pomegranate fruit development, and the unique ovule development processes that are characteristic of pomegranate. This genome sequence provides an important resource to expand our understanding of some unique biological processes and to facilitate both comparative biology studies and crop breeding. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  14. De novo assembly, characterization and functional annotation of pineapple fruit transcriptome through massively parallel sequencing.

    PubMed

    Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah

    2012-01-01

    Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.

  15. De Novo Assembly, Characterization and Functional Annotation of Pineapple Fruit Transcriptome through Massively Parallel Sequencing

    PubMed Central

    Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah

    2012-01-01

    Background Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603

  16. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  17. Heterotrophic Protists in Hypersaline Microbial Mats and Deep Hypersaline Basin Water Columns

    PubMed Central

    Edgcomb, Virginia P.; Bernhard, Joan M.

    2013-01-01

    Although hypersaline environments pose challenges to life because of the low water content (water activity), many such habitats appear to support eukaryotic microbes. This contribution presents brief reviews of our current knowledge on eukaryotes of water-column haloclines and brines from Deep Hypersaline Anoxic Basins (DHABs) of the Eastern Mediterranean, as well as shallow-water hypersaline microbial mats in solar salterns of Guerrero Negro, Mexico and benthic microbialite communities from Hamelin Pool, Shark Bay, Western Australia. New data on eukaryotic diversity from Shark Bay microbialites indicates eukaryotes are more diverse than previously reported. Although this comparison shows that eukaryotic communities in hypersaline habitats with varying physicochemical characteristics are unique, several groups are commonly found, including diverse alveolates, strameonopiles, and fungi, as well as radiolaria. Many eukaryote sequences (SSU) in both regions also have no close homologues in public databases, suggesting that these environments host unique microbial eukaryote assemblages with the potential to enhance our understanding of the capacity of eukaryotes to adapt to hypersaline conditions. PMID:25369746

  18. Molecular profile of the unique species of traditional Chinese medicine, Chinese seahorse (Hippocampus kuda Bleeker).

    PubMed

    Zhang, Ning; Xu, Bin; Mou, Chunyan; Yang, Wenli; Wei, Jianwen; Lu, Liang; Zhu, Junjie; Du, Jingchun; Wu, Xiaokun; Ye, Lanting; Fu, Zhiyan; Lu, Yang; Lin, Jianghai; Sun, Zizi; Su, Jing; Dong, Meiling; Xu, Anlong

    2003-08-28

    A cDNA library of male Chinese seahorse (Hippocampus kuda Bleeker) was constructed to investigate the molecular profile of seahorse as one of the most famous traditional Chinese medicine materials, and to reveal immunological and physiological mechanisms of seahorse as one of the most primitive vertebrates at molecular level. A total of 3372 expressed sequence tags (ESTs) consisting of 1911 unique genes (345 clusters and 1566 singletons) were examined in the present study. Identification of the genes related to immune system, paternal brooding and physiological regulation provides not only valuable insights into the molecular mechanism of immune system in teleost fish but also plausible explanations for pharmacological activities of Chinese seahorse. Furthermore, the occurrence of high prevalent C-type lectins suggested that a lectin-complement pathway might exert a more dominant function in the innate immune system of teleost than mammal. Carbohydrate recognition domain (CRD) without a collagen-like region in the lectins of seahorse was likely an ancient characteristic of lectins similar to invertebrates.

  19. Host-Microbe Interactions in Microgravity: Assessment and Implications

    PubMed Central

    Foster, Jamie S.; Wheeler, Raymond M.; Pamphile, Regine

    2014-01-01

    Spaceflight imposes several unique stresses on biological life that together can have a profound impact on the homeostasis between eukaryotes and their associated microbes. One such stressor, microgravity, has been shown to alter host-microbe interactions at the genetic and physiological levels. Recent sequencing of the microbiomes associated with plants and animals have shown that these interactions are essential for maintaining host health through the regulation of several metabolic and immune responses. Disruptions to various environmental parameters or community characteristics may impact the resiliency of the microbiome, thus potentially driving host-microbe associations towards disease. In this review, we discuss our current understanding of host-microbe interactions in microgravity and assess the impact of this unique environmental stress on the normal physiological and genetic responses of both pathogenic and mutualistic associations. As humans move beyond our biosphere and undergo longer duration space flights, it will be essential to more fully understand microbial fitness in microgravity conditions in order to maintain a healthy homeostasis between humans, plants and their respective microbiomes. PMID:25370197

  20. Host-microbe interactions in microgravity: assessment and implications.

    PubMed

    Foster, Jamie S; Wheeler, Raymond M; Pamphile, Regine

    2014-05-26

    Spaceflight imposes several unique stresses on biological life that together can have a profound impact on the homeostasis between eukaryotes and their associated microbes. One such stressor, microgravity, has been shown to alter host-microbe interactions at the genetic and physiological levels. Recent sequencing of the microbiomes associated with plants and animals have shown that these interactions are essential for maintaining host health through the regulation of several metabolic and immune responses. Disruptions to various environmental parameters or community characteristics may impact the resiliency of the microbiome, thus potentially driving host-microbe associations towards disease. In this review, we discuss our current understanding of host-microbe interactions in microgravity and assess the impact of this unique environmental stress on the normal physiological and genetic responses of both pathogenic and mutualistic associations. As humans move beyond our biosphere and undergo longer duration space flights, it will be essential to more fully understand microbial fitness in microgravity conditions in order to maintain a healthy homeostasis between humans, plants and their respective microbiomes.

  1. Symbolic emblems of the Levantine Aurignacians as a regional entity identifier (Hayonim Cave, Lower Galilee, Israel).

    PubMed

    Tejero, José-Miguel; Belfer-Cohen, Anna; Bar-Yosef, Ofer; Gutkin, Vitaly; Rabinovich, Rivka

    2018-05-15

    The Levantine Aurignacian is a unique phenomenon in the local Upper Paleolithic sequence, showing greater similarity to the West European classic Aurignacian than to the local Levantine archaeological entities preceding and following it. Herewith we highlight another unique characteristic of this entity, namely, the presence of symbolic objects in the form of notched bones (mostly gazelle scapulae) from the Aurignacian levels of Hayonim Cave, Lower Galilee, Israel. Through both macroscopic and microscopic analyses of the items, we suggest that they are not mere cut marks but rather are intentional (decorative?) human-made markings. The significance of this evidence for symbolic behavior is discussed in its chrono-cultural and geographical contexts. Notched bones are among the oldest symbolic expressions of anatomically modern humans. However, unlike other Paleolithic sites where such findings were reported in single numbers, the number of these items recovered at Hayonim Cave is sufficient to assume they possibly served as an emblem of the Levantine Aurignacian.

  2. Mosaic Graphs and Comparative Genomics in Phage Communities

    PubMed Central

    Belcaid, Mahdi; Bergeron, Anne

    2010-01-01

    Abstract Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breath and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications on phage metagenomics assembly, on the horizontal gene transfer paradigm, and more generally on the understanding of the composition and evolutionary dynamics of virus communities. PMID:20874413

  3. Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation.

    PubMed

    Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik

    2012-05-10

    The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.

  4. Isolation and Expression Analysis of CYP9A11 and Cytochrome P450 Reductase Gene in the Beet Armyworm (Lepidoptera: Noctuidae)

    PubMed Central

    Zhao, Chunqing; Feng, Xiaoyun; Tang, Tao; Qiu, Lihong

    2015-01-01

    Cytochrome P450 monooxygenases (CYPs), as an enzyme superfamily, is widely distributed in organisms and plays a vital function in the metabolism of exogenous and endogenous compounds by interacting with its obligatory redox partner, CYP reductase (CPR). A novel CYP gene (CYP9A11) and CPR gene from the agricultural pest insect Spodoptera exigua were cloned and characterized. The complete cDNA sequences of SeCYP9A11 and SeCPR are 1,931 and 3,919 bp in length, respectively, and contain open reading frames of 1,593 and 2,070 nucleotides, respectively. Analysis of the putative protein sequences indicated that SeCYP9A11 contains a heme-binding domain and the unique characteristic sequence (SRFALCE) of the CYP9 family, in addition to a signal peptide and transmembrane segment at the N-terminal. Alignment analysis revealed that SeCYP9A11 shares the highest sequence similarity with CYP9A13 from Mamestra brassicae, which is 66.54%. The putative protein sequence of SeCPR has all of the classical CPR features, such as an N-terminal membrane anchor; three conserved domain flavin adenine dinucleotide (FAD), flavin mononucleotide (FMN), and nicotinamide adenine dinucleotide phosphate (NADPH) domain; and characteristic binding motifs. Phylogenetic analysis revealed that SeCPR shares the highest identity with HaCPR, which is 95.21%. The SeCYP9A11 and SeCPR genes were detected in the midgut, fat body, and cuticle tissues, and throughout all of the developmental stages of S. exigua. The mRNA levels of SeCYP9A11 and SeCPR decreased remarkably after exposure to plant secondary metabolites quercetin and tannin. The results regarding SeCYP9A11 and SeCPR genes in the current study provide foundation for the further study of S. exigua P450 system. PMID:26320261

  5. In vitro resolution of the dimer bridge of the minute virus of mice (MVM) genome supports the modified rolling hairpin model for MVM replication.

    PubMed

    Liu, Q; Yong, C B; Astell, C R

    1994-06-01

    Previous characterization of the terminal sequences of the minute virus of mice (MVM) genome demonstrated that the right hand palindrome contains two sequences, each the inverted complement of the other. However, the left hand palindrome was shown to exist as a unique sequence [Astell et al., J. Virol. 54: 179-185 (1985)]. The modified rolling hairpin (MRH) model for MVM replication provided an explanation of how the right hand palindrome could undergo hairpin transfer to generate two sequences, while the left end palindrome within the dimer bridge could undergo asymmetric resolution and retain the unique left end sequence. This report describes in vitro resolution of the wild-type dimer bridge sequence of MVM using recombinant (baculovirus) expressed NS-1 and a replication extract from LA9 cells. The resolution products are consistent with those predicted by the MRH model, providing support for this replication mechanism. In addition, mutant dimer bridge clones were constructed and used in the resolution assay. The mutant structures included removal of the asymmetry in the hairpin stem, inversion of the sequence at the initiating nick site, and a 2-bp deletion within one stem of the dimer bridge. In all cases, the mutant dimer bridge structures are resolved; however, the resolution pattern observed with the mutant dimer bridge compared with the wild-type dimer bridge is shifted toward symmetrical resolution. These results suggest that sequences within the left hand hairpin (and hence dimer bridge sequence) are responsible for asymmetric resolution and conservation of the unique sequence within the left hand palindrome of the MVM genome.

  6. Biologically important conformational features of DNA as interpreted by quantum mechanics and molecular mechanics computations of its simple fragments.

    PubMed

    Poltev, V; Anisimov, V M; Dominguez, V; Gonzalez, E; Deriabina, A; Garcia, D; Rivas, F; Polteva, N A

    2018-02-01

    Deciphering the mechanism of functioning of DNA as the carrier of genetic information requires identifying inherent factors determining its structure and function. Following this path, our previous DFT studies attributed the origin of unique conformational characteristics of right-handed Watson-Crick duplexes (WCDs) to the conformational profile of deoxydinucleoside monophosphates (dDMPs) serving as the minimal repeating units of DNA strand. According to those findings, the directionality of the sugar-phosphate chain and the characteristic ranges of dihedral angles of energy minima combined with the geometric differences between purines and pyrimidines determine the dependence on base sequence of the three-dimensional (3D) structure of WCDs. This work extends our computational study to complementary deoxydinucleotide-monophosphates (cdDMPs) of non-standard conformation, including those of Z-family, Hoogsteen duplexes, parallel-stranded structures, and duplexes with mispaired bases. For most of these systems, except Z-conformation, computations closely reproduce experimental data within the tolerance of characteristic limits of dihedral parameters for each conformation family. Computation of cdDMPs with Z-conformation reveals that their experimental structures do not correspond to the internal energy minimum. This finding establishes the leading role of external factors in formation of the Z-conformation. Energy minima of cdDMPs of non-Watson-Crick duplexes demonstrate different sequence-dependence features than those known for WCDs. The obtained results provide evidence that the biologically important regularities of 3D structure distinguish WCDs from duplexes having non-Watson-Crick nucleotide pairing.

  7. Unique Variants in OPN1LW Cause Both Syndromic and Nonsyndromic X-Linked High Myopia Mapped to MYP1.

    PubMed

    Li, Jiali; Gao, Bei; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong

    2015-06-01

    MYP1 is a locus for X-linked syndromic and nonsyndromic high myopia. Recently, unique haplotypes in OPN1LW were found to be responsible for X-linked syndromic high myopia mapped to MYP1. The current study is to test if such variants in OPN1LW are also responsible for X-linked nonsyndromic high myopia mapped to MYP1. The proband of the family previously mapped to MYP1 was initially analyzed using whole-exome sequencing and whole-genome sequencing. Additional probands with early-onset high myopia were analyzed using whole-exome sequencing. Variants in OPN1LW were selected and confirmed by Sanger sequencing. Long-range and second PCR were used to determine the haplotype and the first gene of the red-green gene array. Candidate variants were further validated in family members and controls. The unique LVAVA haplotype in OPN1LW was detected in the family with X-linked nonsyndromic high myopia mapped to MYP1. In addition, this haplotype and a novel frameshift mutation (c.617_620dup, p.Phe208Argfs*51) in OPN1LW were detected in two other families with X-linked high myopia. The unique haplotype cosegregated with high myopia in the two families, with a maximum LOD score of 3.34 and 2.31 at θ = 0. OPN1LW with the variants in these families was the first gene in the red-green gene array and was not present in 247 male controls. Reevaluation of the clinical data in both families with the unique haplotype suggested nonsyndromic high myopia. Our study confirms the findings that unique variants in OPN1LW are responsible for both syndromic and nonsyndromic X-linked high myopia mapped to MYP1.

  8. Candida northwykensis sp. nov., a novel yeast isolated from the gut of the click beetle Melanotus villosus.

    PubMed

    Ravella, Sreenivas Rao; Donovan, Neil; James, Stephen A; Shivaji, Sisinthy; Arunasri, Kotakonda; Bond, Christopher J; Roberts, Ian N; Hobbs, Phil J

    2011-08-01

    Two yeast morphotypes, BET 4(T) and BET 7, were isolated from the gut of click beetle Melanotus villosus. Click beetles were collected from the decaying timber within the woodlands of North Wyke Research, South West England, UK (latitude, 50°46'29″N; longitude, 3°55'23″W). Morphotype BET 7 was identified as Debaryomyces hansenii, and the other morphotype, BET 4(T), was found to differ from Priceomyces castillae and Priceomyces haplophilus, its closest phylogenetic neighbours, by 5.0% with respect to the nucleotide sequence of the D1/D2 domain of the large-subunit (LSU) rRNA gene, and by 8.0% with respect to the ribosomal internal-transcribed spacer (ITS) region. BET 4(T) also differ from P. castillae and P. haplophilus in a number of different phenotypic characteristics. Thus, based on the unique nucleotide sequences of its D1/D2 domain and ITS region, its physiological characteristics and an inability to sporulate, strain BET 4(T) is assigned the status of a new species of Candida, for which the name Candida northwykensis sp. nov., is proposed. The type strain is BET 4(T) (NCYC 3525(T) = CBS 11370(T)).

  9. In-depth investigations of adolescents and adults with holoprosencephaly identify unique characteristics.

    PubMed

    Weiss, Karin; Kruszka, Paul; Guillen Sacoto, Maria J; Addissie, Yonit A; Hadley, Donald W; Hadsall, Casey K; Stokes, Bethany; Hu, Ping; Roessler, Erich; Solomon, Beth; Wiggs, Edythe; Thurm, Audrey; Hufnagel, Robert B; Zein, Wadih M; Hahn, Jin S; Stashinko, Elaine; Levey, Eric; Baldwin, Debbie; Clegg, Nancy J; Delgado, Mauricio R; Muenke, Maximilian

    2018-01-01

    PurposeWith improved medical care, some individuals with holoprosencephaly (HPE) are surviving into adulthood. We investigated the clinical manifestations of adolescents and adults with HPE and explored the underlying molecular causes.MethodsParticipants included 20 subjects 15 years of age and older. Clinical assessments included dysmorphology exams, cognitive testing, swallowing studies, ophthalmic examination, and brain magnetic resonance imaging. Genetic testing included chromosomal microarray, Sanger sequencing for SHH, ZIC2, SIX3, and TGIF, and whole-exome sequencing (WES) of 10 trios.ResultsSemilobar HPE was the most common subtype of HPE, seen in 50% of the participants. Neurodevelopmental disabilities were found to correlate with HPE subtype. Factors associated with long-term survival included HPE subtype not alobar, female gender, and nontypical facial features. Four participants had de novo pathogenic variants in ZIC2. WES analysis of 11 participants did not reveal plausible candidate genes, suggesting complex inheritance in these cases. Indeed, in two probands there was a history of uncontrolled maternal type 1 diabetes.ConclusionIndividuals with various HPE subtypes can survive into adulthood and the neurodevelopmental outcomes are variable. Based on the facial characteristics and molecular evaluations, we suggest that classic genetic causes of HPE may play a smaller role in this cohort.

  10. Bacterial analysis of combined periodontal-endodontic lesions by polymerase chain reaction-denaturing gradient gel electrophoresis.

    PubMed

    Xia, Minghui; Qi, Qingguo

    2013-01-01

    We used denaturing gradient gel electrophoresis (DGGE) to compare bacterial profiles in periodontium and root canals of teeth with combined periodontal-endodontic lesions. Samples of dental plaque and necrotic pulp were collected from thirteen extracted teeth with advanced periodontitis. Genomic DNA was extracted for polymerase chain reaction (PCR) analysis using universal bacterial primers. The PCR products were then loaded onto DGGE gels to obtain fractionated bands. Characteristic DGGE bands were excised and DNA was cloned and sequenced. The number of bands, which indicates the number of bacterial species, was compared between dental plaques and necrotic pulp tissues from the same tooth. Although the difference was statistically significant (P < 0.01), there was no positive correlation; similarity (Dice coefficient) was 13.1% to 62.5%. Some bacteria species were present in both the periodontal pockets and root canals of the same tooth; however, periodontal bacteria did not always invade the root canals, and some bacteria in root canals were not present in periodontal pockets of the same tooth. In some teeth, unique bacteria in root canals had not passed from periodontal pockets. A basic local alignment search tool (BLAST) sequence search in Genbank indicated that new bacteria species were present in periodontal pockets and root canals. Their characteristics must thus be further analyzed.

  11. Analysis of the transcriptome of Panax notoginseng root uncovers putative triterpene saponin-biosynthetic genes and genetic markers

    PubMed Central

    2011-01-01

    Background Panax notoginseng (Burk) F.H. Chen is important medicinal plant of the Araliacease family. Triterpene saponins are the bioactive constituents in P. notoginseng. However, available genomic information regarding this plant is limited. Moreover, details of triterpene saponin biosynthesis in the Panax species are largely unknown. Results Using the 454 pyrosequencing technology, a one-quarter GS FLX titanium run resulted in 188,185 reads with an average length of 410 bases for P. notoginseng root. These reads were processed and assembled by 454 GS De Novo Assembler software into 30,852 unique sequences. A total of 70.2% of unique sequences were annotated by Basic Local Alignment Search Tool (BLAST) similarity searches against public sequence databases. The Kyoto Encyclopedia of Genes and Genomes (KEGG) assignment discovered 41 unique sequences representing 11 genes involved in triterpene saponin backbone biosynthesis in the 454-EST dataset. In particular, the transcript encoding dammarenediol synthase (DS), which is the first committed enzyme in the biosynthetic pathway of major triterpene saponins, is highly expressed in the root of four-year-old P. notoginseng. It is worth emphasizing that the candidate cytochrome P450 (Pn02132 and Pn00158) and UDP-glycosyltransferase (Pn00082) gene most likely to be involved in hydroxylation or glycosylation of aglycones for triterpene saponin biosynthesis were discovered from 174 cytochrome P450s and 242 glycosyltransferases by phylogenetic analysis, respectively. Putative transcription factors were detected in 906 unique sequences, including Myb, homeobox, WRKY, basic helix-loop-helix (bHLH), and other family proteins. Additionally, a total of 2,772 simple sequence repeat (SSR) were identified from 2,361 unique sequences, of which, di-nucleotide motifs were the most abundant motif. Conclusion This study is the first to present a large-scale EST dataset for P. notoginseng root acquired by next-generation sequencing (NGS) technology. The candidate genes involved in triterpene saponin biosynthesis, including the putative CYP450s and UGTs, were obtained in this study. Additionally, the identification of SSRs provided plenty of genetic makers for molecular breeding and genetics applications in this species. These data will provide information on gene discovery, transcriptional regulation and marker-assisted selection for P. notoginseng. The dataset establishes an important foundation for the study with the purpose of ensuring adequate drug resources for this species. PMID:22369100

  12. Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina.

    PubMed

    Valles, Steven M; Porter, Sanford D; Calcaterra, Luis A

    2018-01-01

    Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.

  13. Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina

    PubMed Central

    Porter, Sanford D.; Calcaterra, Luis A.

    2018-01-01

    Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations. PMID:29466388

  14. Animal selection for whole genome sequencing by quantifying the unique contribution of homozygous haplotypes sequenced

    USDA-ARS?s Scientific Manuscript database

    Major whole genome sequencing projects promise to identify rare and causal variants within livestock species; however, the efficient selection of animals for sequencing remains a major problem within these surveys. The goal of this project was to develop a library of high accuracy genetic variants f...

  15. Tilted pillar array fabrication by the combination of proton beam writing and soft lithography for microfluidic cell capture Part 2: Image sequence analysis based evaluation and biological application.

    PubMed

    Járvás, Gábor; Varga, Tamás; Szigeti, Márton; Hajba, László; Fürjes, Péter; Rajta, István; Guttman, András

    2018-02-01

    As a continuation of our previously published work, this paper presents a detailed evaluation of a microfabricated cell capture device utilizing a doubly tilted micropillar array. The device was fabricated using a novel hybrid technology based on the combination of proton beam writing and conventional lithography techniques. Tilted pillars offer unique flow characteristics and support enhanced fluidic interaction for improved immunoaffinity based cell capture. The performance of the microdevice was evaluated by an image sequence analysis based in-house developed single-cell tracking system. Individual cell tracking allowed in-depth analysis of the cell-chip surface interaction mechanism from hydrodynamic point of view. Simulation results were validated by using the hybrid device and the optimized surface functionalization procedure. Finally, the cell capture capability of this new generation microdevice was demonstrated by efficiently arresting cells from a HT29 cell-line suspension. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Pseudomonas aeruginosa Type III Secretory Toxin ExoU and Its Predicted Homologs.

    PubMed

    Sawa, Teiji; Hamaoka, Saeko; Kinoshita, Mao; Kainuma, Atsushi; Naito, Yoshifumi; Akiyama, Koichi; Kato, Hideya

    2016-10-26

    Pseudomonas aeruginosa ExoU, a type III secretory toxin and major virulence factor with patatin-like phospholipase activity, is responsible for acute lung injury and sepsis in immunocompromised patients. Through use of a recently updated bacterial genome database, protein sequences predicted to be homologous to Ps. aeruginosa ExoU were identified in 17 other Pseudomonas species ( Ps. fluorescens , Ps. lundensis , Ps. weihenstephanensis , Ps. marginalis, Ps. rhodesiae, Ps. synxantha , Ps. libanensis , Ps. extremaustralis , Ps. veronii , Ps. simiae , Ps. trivialis , Ps. tolaasii , Ps. orientalis , Ps. taetrolens , Ps. syringae , Ps. viridiflava , and Ps. cannabina ) and 8 Gram-negative bacteria from three other genera ( Photorhabdus , Aeromonas , and Paludibacterium ). In the alignment of the predicted primary amino acid sequences used for the phylogenetic analyses, both highly conserved and nonconserved parts of the toxin were discovered among the various species. Further comparative studies of the predicted ExoU homologs should provide us with more detailed information about the unique characteristics of the Ps. aeruginosa ExoU toxin.

  17. Cyclotide isolation and characterization.

    PubMed

    Craik, David J; Henriques, Sonia Troeira; Mylne, Joshua S; Wang, Conan K

    2012-01-01

    Cyclotides are disulfide-rich cyclic peptides produced by plants with the presumed natural function of defense agents against insect pests. They are present in a wide range of plant tissues, being ribosomally synthesized via precursor proteins that are posttranslationally processed to produce mature peptides with a characteristic cyclic backbone and cystine knot motif associated with their six conserved cysteine residues. Their processing is not fully understood but involves asparaginyl endoproteinase activity. In addition to interest in their defense roles and their unique topologies, cyclotides have attracted attention as potential templates in peptide-based drug design applications. This chapter provides protocols for the isolation of cyclotides from plants, their detection and sequencing by mass spectrometry, and their structural analysis by NMR, as well as describing methods for the isolation of nucleic acid sequences that encode their precursor proteins. Assays to assess their membrane-binding interactions are also described. These protocols provide a "starter kit" for researchers entering the cyclotide field. Copyright © 2012 Elsevier Inc. All rights reserved.

  18. Genome Structure of the Legume, Lotus japonicus

    PubMed Central

    Sato, Shusei; Nakamura, Yasukazu; Kaneko, Takakazu; Asamizu, Erika; Kato, Tomohiko; Nakao, Mitsuteru; Sasamoto, Shigemi; Watanabe, Akiko; Ono, Akiko; Kawashima, Kumiko; Fujishiro, Tsunakazu; Katoh, Midori; Kohara, Mitsuyo; Kishida, Yoshie; Minami, Chiharu; Nakayama, Shinobu; Nakazaki, Naomi; Shimizu, Yoshimi; Shinpo, Sayaka; Takahashi, Chika; Wada, Tsuyuko; Yamada, Manabu; Ohmido, Nobuko; Hayashi, Makoto; Fukui, Kiichi; Baba, Tomoya; Nakamichi, Tomoko; Mori, Hirotada; Tabata, Satoshi

    2008-01-01

    The legume Lotus japonicus has been widely used as a model system to investigate the genetic background of legume-specific phenomena such as symbiotic nitrogen fixation. Here, we report structural features of the L. japonicus genome. The 315.1-Mb sequences determined in this and previous studies correspond to 67% of the genome (472 Mb), and are likely to cover 91.3% of the gene space. Linkage mapping anchored 130-Mb sequences onto the six linkage groups. A total of 10 951 complete and 19 848 partial structures of protein-encoding genes were assigned to the genome. Comparative analysis of these genes revealed the expansion of several functional domains and gene families that are characteristic of L. japonicus. Synteny analysis detected traces of whole-genome duplication and the presence of synteny blocks with other plant genomes to various degrees. This study provides the first opportunity to look into the complex and unique genetic system of legumes. PMID:18511435

  19. What helminth genomes have taught us about parasite evolution.

    PubMed

    Zarowiecki, Magdalena; Berriman, Matt

    2015-02-01

    The genomes of more than 20 helminths have now been sequenced. Here we perform a meta-analysis of all sequenced genomes of nematodes and Platyhelminthes, and attempt to address the question of what are the defining characteristics of helminth genomes. We find that parasitic worms lack systems for surface antigenic variation, instead maintaining infections using their surfaces as the first line of defence against the host immune system, with several expanded gene families of genes associated with the surface and tegument. Parasite excretory/secretory products evolve rapidly, and proteases even more so, with each parasite exhibiting unique modifications of its protease repertoire. Endoparasitic flatworms show striking losses of metabolic capabilities, not matched by nematodes. All helminths do however exhibit an overall reduction in auxiliary metabolism (biogenesis of co-factors and vitamins). Overall, the prevailing pattern is that there are few commonalities between the genomes of independently evolved parasitic worms, with each parasite having undergone specific adaptations for their particular niche.

  20. High-throughput T-cell receptor sequencing across chronic liver diseases reveals distinct disease-associated repertoires.

    PubMed

    Liaskou, Evaggelia; Klemsdal Henriksen, Eva Kristine; Holm, Kristian; Kaveh, Fatemeh; Hamm, David; Fear, Janine; Viken, Marte K; Hov, Johannes Roksund; Melum, Espen; Robins, Harlan; Olweus, Johanna; Karlsen, Tom H; Hirschfield, Gideon M

    2016-05-01

    Hepatic T-cell infiltrates and a strong genetic human leukocyte antigen association represent characteristic features of various immune-mediated liver diseases. Conceptually the presence of disease-associated antigens is predicted to be reflected in T-cell receptor (TCR) repertoires. Here, we aimed to determine if disease-associated TCRs could be identified in the nonviral chronic liver diseases primary biliary cirrhosis (PBC), primary sclerosing cholangitis (PSC), and alcoholic liver disease (ALD). We performed high-throughput sequencing of the TCRβ chain complementarity-determining region 3 of liver-infiltrating T cells from PSC (n = 20), PBC (n = 10), and ALD (n = 10) patients, alongside genomic human leukocyte antigen typing. The frequency of TCRβ nucleotide sequences was significantly higher in PSC samples (2.53 ± 0.80, mean ± standard error of the mean) compared to PBC samples (1.13 ± 0.17, P < 0.0001) and ALD samples (0.62 ± 0.10, P < 0.0001). An average clonotype overlap of 0.85% was detected among PSC samples, significantly higher compared to the average overlap of 0.77% seen within the PBC (P = 0.024) and ALD groups (0.40%, P < 0.0001). From eight to 42 clonotypes were uniquely detected in each of the three disease groups (≥30% of the respective patient samples). Multiple, unique sequences using different variable family genes encoded the same amino acid clonotypes, providing additional support for antigen-driven selection. In PSC and PBC, disease-associated clonotypes were detected among patients with human leukocyte antigen susceptibility alleles. We demonstrate liver-infiltrating disease-associated clonotypes in all three diseases evaluated, and evidence for antigen-driven clonal expansions. Our findings indicate that differential TCR signatures, as determined by high-throughput sequencing, may represent an imprint of distinctive antigenic repertoires present in the different chronic liver diseases; this thereby opens up the prospect of studying disease-relevant T cells in order to better understand and treat liver disease. © 2015 by the American Association for the Study of Liver Diseases.

  1. Molecular cloning of pepsinogens A and C from adult newt (Cynops pyrrhogaster) stomach.

    PubMed

    Inokuchi, Tomofumi; Ikuzawa, Masayuki; Yamazaki, Shin; Watanabe, Yukari; Shiota, Koushiro; Katoh, Takuma; Kobayashi, Ken-Ichiro

    2013-08-01

    The full-length cDNAs of three pepsinogens (Pgs) were cloned from the stomach of newt, Cynops pyrrhogaster, and nucleotide sequences of the full-length cDNAs were determined. Molecular phylogenetic analysis showed that two Pgs, named PgC1 and PgC2, belong to the pepsinogen C group, and one Pg, named PgA, belongs to the pepsinogen A group. The sequences contain an open reading frame (ORF) encoding 385 amino acid residues for PgC1, 383 amino acid residues for PgC2 and 377 amino acid residues for PgA. In addition, all of the three amino acid sequences conserve some unique characteristics such as six cysteine residues and putative active site two aspartic acid residues. All of the pepsinogen mRNAs were detected in the stomach by RT-PCR but not in other organs. Although a slight difference at the time of the start of expression was seen among the three pepsinogen genes, all of them were expressed in the larval stage after hatching. This is the first report on cloning of pepsinogens from urodele stomach. Copyright © 2013 Elsevier Inc. All rights reserved.

  2. Detection of nucleotide-specific CRISPR/Cas9 modified alleles using multiplex ligation detection

    PubMed Central

    KC, R.; Srivastava, A.; Wilkowski, J. M.; Richter, C. E.; Shavit, J. A.; Burke, D. T.; Bielas, S. L.

    2016-01-01

    CRISPR/Cas9 genome-editing has emerged as a powerful tool to create mutant alleles in model organisms. However, the precision with which these mutations are created has introduced a new set of complications for genotyping and colony management. Traditional gene-targeting approaches in many experimental organisms incorporated exogenous DNA and/or allele specific sequence that allow for genotyping strategies based on binary readout of PCR product amplification and size selection. In contrast, alleles created by non-homologous end-joining (NHEJ) repair of double-stranded DNA breaks generated by Cas9 are much less amenable to such strategies. Here we describe a novel genotyping strategy that is cost effective, sequence specific and allows for accurate and efficient multiplexing of small insertion-deletions and single-nucleotide variants characteristic of CRISPR/Cas9 edited alleles. We show that ligation detection reaction (LDR) can be used to generate products that are sequence specific and uniquely detected by product size and/or fluorescent tags. The method works independently of the model organism and will be useful for colony management as mutant alleles differing by a few nucleotides become more prevalent in experimental animal colonies. PMID:27557703

  3. Barcodes for genomes and applications

    PubMed Central

    Zhou, Fengfeng; Olman, Victor; Xu, Ying

    2008-01-01

    Background Each genome has a stable distribution of the combined frequency for each k-mer and its reverse complement measured in sequence fragments as short as 1000 bps across the whole genome, for 1

  4. Using the self-select paradigm to delineate the nature of speech motor programming.

    PubMed

    Wright, David L; Robin, Don A; Rhee, Jooyhun; Vaculin, Amber; Jacks, Adam; Guenther, Frank H; Fox, Peter T

    2009-06-01

    The authors examined the involvement of 2 speech motor programming processes identified by S. T. Klapp (1995, 2003) during the articulation of utterances differing in syllable and sequence complexity. According to S. T. Klapp, 1 process, INT, resolves the demands of the programmed unit, whereas a second process, SEQ, oversees the serial order demands of longer sequences. A modified reaction time paradigm was used to assess INT and SEQ demands. Specifically, syllable complexity was dependent on syllable structure, whereas sequence complexity involved either repeated or unique syllabi within an utterance. INT execution was slowed when articulating single syllables in the form CCCV compared to simpler CV syllables. Planning unique syllables within a multisyllabic utterance rather than repetitions of the same syllable slowed INT but not SEQ. The INT speech motor programming process, important for mental syllabary access, is sensitive to changes in both syllable structure and the number of unique syllables in an utterance.

  5. Intrinsic colony conditions affect the provisioning and oviposition process in the stingless bee Melipona scutellaris.

    PubMed

    Pereira, R A; Morais, M M; Nascimento, F S; Bego, L R

    2009-01-01

    The cell provisioning and oviposition process (POP) is a unique characteristic of stingless bees (Meliponini), in which coordinated interactions between workers and queen regulate the filling of brood cells with larval resources and subsequent egg laying. Environmental conditions seem to regulate reproduction in stingless bees; however, little is known about how the amount of food affects quantitative sequences of the process. We examined intrinsic variables by comparing three colonies in distinct conditions (strong, intermediate and weak state). We predicted that some of these variables are correlated with temporal events of POP in Melipona scutellaris colonies. The results demonstrated that the strong colony had shorter periods of POP.

  6. Electrokinetic focusing injection methods on microfluidic devices.

    PubMed

    Fu, Lung-Ming; Yang, Ruey-Jen; Lee, Gwo-Bin

    2003-04-15

    This paper presents an experimental and numerical investigation into electrokinetic focusing injection on microfluidic chips. The valving characteristics on microfluidic devices are controlled through appropriate manipulations of the electric potential strengths during the sample loading and dispensing steps. The present study also addresses the design and testing of various injection systems used to deliver a sample plug. A novel double-cross injection microfluidic chip is fabricated, which employs electrokinetic focusing to deliver sample plugs of variable volume. The proposed design combines several functions of traditional sample plug injection systems on a single microfluidic chip. The injection technique uses an unique sequence of loading steps with different electric potential distributions and magnitudes within the various channels to effectuate a virtual valve.

  7. Complete mitochondrial genome of Ostrea denselamellosa (Bivalvia, Ostreidae).

    PubMed

    Yu, Hong; Kong, Lingfeng; Li, Qi

    2016-01-01

    The complete mitochondrial (mt) genome of the flat oyster, Ostrea denselamellosa, was determined using Long-PCR and genome walking techniques in this study. The total length of the mt genome sequence of O. denselamellosa was 16,227 bp, which is the smallest reported Ostreidae mt genome to date. It contained 12 protein-coding genes (lacking of ATP8), 23 transfer RNA genes, and two ribosomal RNA genes. A bias towards a higher representation of nucleotides A and T (60.7%) was detected in the mt genome of O. denselamellosa. The rrnL was split into two fragments (3' half, 711 bp; 5' half, 509 bp), which seems to be the unique characteristics of Ostreidae mt genomes.

  8. Structural characteristics of alkaline phosphatase from the moderately halophilic bacterium Halomonas sp. 593

    PubMed Central

    Arai, Shigeki; Yonezawa, Yasushi; Ishibashi, Matsujiro; Matsumoto, Fumiko; Adachi, Motoyasu; Tamada, Taro; Tokunaga, Hiroko; Blaber, Michael; Tokunaga, Masao; Kuroki, Ryota

    2014-01-01

    Alkaline phosphatase (AP) from the moderate halophilic bacterium Halomonas sp. 593 (HaAP) catalyzes the hydrolysis of phosphomonoesters over a wide salt-concentration range (1–4 M NaCl). In order to clarify the structural basis of its halophilic characteristics and its wide-range adaptation to salt concentration, the tertiary structure of HaAP was determined by X-ray crystallography to 2.1 Å resolution. The unit cell of HaAP contained one dimer unit corresponding to the biological unit. The monomer structure of HaAP contains a domain comprised of an 11-stranded β-sheet core with 19 surrounding α-helices similar to those of APs from other species, and a unique ‘crown’ domain containing an extended ‘arm’ structure that participates in formation of a hydrophobic cluster at the entrance to the substrate-binding site. The HaAP structure also displays a unique distribution of negatively charged residues and hydrophobic residues in comparison to other known AP structures. AP from Vibrio sp. G15-21 (VAP; a slight halophile) has the highest similarity in sequence (70.0% identity) and structure (Cα r.m.s.d. of 0.82 Å for the monomer) to HaAP. The surface of the HaAP dimer is substantially more acidic than that of the VAP dimer (144 exposed Asp/Glu residues versus 114, respectively), and thus may enable the solubility of HaAP under high-salt conditions. Conversely, the monomer unit of HaAP formed a substantially larger hydrophobic interior comprising 329 C atoms from completely buried residues, whereas that of VAP comprised 264 C atoms, which may maintain the stability of HaAP under low-salt conditions. These characteristics of HaAP may be responsible for its unique functional adaptation permitting activity over a wide range of salt concentrations. PMID:24598750

  9. Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

    PubMed

    Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

    2005-09-01

    We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.

  10. Normal and compound poisson approximations for pattern occurrences in NGS reads.

    PubMed

    Zhai, Zhiyuan; Reinert, Gesine; Song, Kai; Waterman, Michael S; Luan, Yihui; Sun, Fengzhu

    2012-06-01

    Next generation sequencing (NGS) technologies are now widely used in many biological studies. In NGS, sequence reads are randomly sampled from the genome sequence of interest. Most computational approaches for NGS data first map the reads to the genome and then analyze the data based on the mapped reads. Since many organisms have unknown genome sequences and many reads cannot be uniquely mapped to the genomes even if the genome sequences are known, alternative analytical methods are needed for the study of NGS data. Here we suggest using word patterns to analyze NGS data. Word pattern counting (the study of the probabilistic distribution of the number of occurrences of word patterns in one or multiple long sequences) has played an important role in molecular sequence analysis. However, no studies are available on the distribution of the number of occurrences of word patterns in NGS reads. In this article, we build probabilistic models for the background sequence and the sampling process of the sequence reads from the genome. Based on the models, we provide normal and compound Poisson approximations for the number of occurrences of word patterns from the sequence reads, with bounds on the approximation error. The main challenge is to consider the randomness in generating the long background sequence, as well as in the sampling of the reads using NGS. We show the accuracy of these approximations under a variety of conditions for different patterns with various characteristics. Under realistic assumptions, the compound Poisson approximation seems to outperform the normal approximation in most situations. These approximate distributions can be used to evaluate the statistical significance of the occurrence of patterns from NGS data. The theory and the computational algorithm for calculating the approximate distributions are then used to analyze ChIP-Seq data using transcription factor GABP. Software is available online (www-rcf.usc.edu/∼fsun/Programs/NGS_motif_power/NGS_motif_power.html). In addition, Supplementary Material can be found online (www.liebertonline.com/cmb).

  11. Geology of the Devonian black shales of the Appalachian Basin

    USGS Publications Warehouse

    Roen, J.B.

    1984-01-01

    Black shales of Devonian age in the Appalachian Basin are a unique rock sequence. The high content of organic matter, which imparts the characteristic lithology, has for years attracted considerable interest in the shales as a possible source of energy. The recent energy shortage prompted the U.S. Department of Energy through the Eastern Gas Shales Project of the Morgantown Energy Technology Center to underwrite a research program to determine the geologic, geochemical, and structural characteristics of the Devonian black shales in order to enhance the recovery of gas from the shales. Geologic studies by Federal and State agencies and academic institutions produced a regional stratigraphic network that correlates the 15 ft black shale sequence in Tennessee with 3000 ft of interbedded black and gray shales in central New York. These studies correlate the classic Devonian black shale sequence in New York with the Ohio Shale of Ohio and Kentucky and the Chattanooga Shale of Tennessee and southwestern Virginia. Biostratigraphic and lithostratigraphic markers in conjunction with gamma-ray logs facilitated long-range correlations within the Appalachian Basin. Basinwide correlations, including the subsurface rocks, provided a basis for determining the areal distribution and thickness of the important black shale units. The organic carbon content of the dark shales generally increases from east to west across the basin and is sufficient to qualify as a hydrocarbon source rock. Significant structural features that involve the black shale and their hydrocarbon potential are the Rome trough, Kentucky River and Irvine-Paint Creek fault zone, and regional decollements and ramp zones. ?? 1984.

  12. Technical Considerations for Reduced Representation Bisulfite Sequencing with Multiplexed Libraries

    PubMed Central

    Chatterjee, Aniruddha; Rodger, Euan J.; Stockwell, Peter A.; Weeks, Robert J.; Morison, Ian M.

    2012-01-01

    Reduced representation bisulfite sequencing (RRBS), which couples bisulfite conversion and next generation sequencing, is an innovative method that specifically enriches genomic regions with a high density of potential methylation sites and enables investigation of DNA methylation at single-nucleotide resolution. Recent advances in the Illumina DNA sample preparation protocol and sequencing technology have vastly improved sequencing throughput capacity. Although the new Illumina technology is now widely used, the unique challenges associated with multiplexed RRBS libraries on this platform have not been previously described. We have made modifications to the RRBS library preparation protocol to sequence multiplexed libraries on a single flow cell lane of the Illumina HiSeq 2000. Furthermore, our analysis incorporates a bioinformatics pipeline specifically designed to process bisulfite-converted sequencing reads and evaluate the output and quality of the sequencing data generated from the multiplexed libraries. We obtained an average of 42 million paired-end reads per sample for each flow-cell lane, with a high unique mapping efficiency to the reference human genome. Here we provide a roadmap of modifications, strategies, and trouble shooting approaches we implemented to optimize sequencing of multiplexed libraries on an a RRBS background. PMID:23193365

  13. Novel application of the MSSCP method in biodiversity studies.

    PubMed

    Tomczyk-Żak, Karolina; Kaczanowski, Szymon; Górecka, Magdalena; Zielenkiewicz, Urszula

    2012-02-01

    Analysis of 16S rRNA sequence diversity is widely performed for characterizing the biodiversity of microbial samples. The number of determined sequences has a considerable impact on complete results. Although the cost of mass sequencing is decreasing, it is often still too high for individual projects. We applied the multi-temperature single-strand conformational polymorphism (MSSCP) method to decrease the number of analysed sequences. This was a novel application of this method. As a control, the same sample was analysed using random sequencing. In this paper, we adapted the MSSCP technique for screening of unique sequences of the 16S rRNA gene library and bacterial strains isolated from biofilms growing on the walls of an ancient gold mine in Poland and determined whether the results obtained by both methods differed and whether random sequencing could be replaced by MSSCP. Although it was biased towards the detection of rare sequences in the samples, the qualitative results of MSSCP were not different than those of random sequencing. Unambiguous discrimination of unique clones and strains creates an opportunity to effectively estimate the biodiversity of natural communities, especially in populations which are numerous but species poor. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Low-pass sequencing for microbial comparative genomics

    PubMed Central

    Goo, Young Ah; Roach, Jared; Glusman, Gustavo; Baliga, Nitin S; Deutsch, Kerry; Pan, Min; Kennedy, Sean; DasSarma, Shiladitya; Victor Ng, Wailap; Hood, Leroy

    2004-01-01

    Background We studied four extremely halophilic archaea by low-pass shotgun sequencing: (1) the metabolically versatile Haloarcula marismortui; (2) the non-pigmented Natrialba asiatica; (3) the psychrophile Halorubrum lacusprofundi and (4) the Dead Sea isolate Halobaculum gomorrense. Approximately one thousand single pass genomic sequences per genome were obtained. The data were analyzed by comparative genomic analyses using the completed Halobacterium sp. NRC-1 genome as a reference. Low-pass shotgun sequencing is a simple, inexpensive, and rapid approach that can readily be performed on any cultured microbe. Results As expected, the four archaeal halophiles analyzed exhibit both bacterial and eukaryotic characteristics as well as uniquely archaeal traits. All five halophiles exhibit greater than sixty percent GC content and low isoelectric points (pI) for their predicted proteins. Multiple insertion sequence (IS) elements, often involved in genome rearrangements, were identified in H. lacusprofundi and H. marismortui. The core biological functions that govern cellular and genetic mechanisms of H. sp. NRC-1 appear to be conserved in these four other halophiles. Multiple TATA box binding protein (TBP) and transcription factor IIB (TFB) homologs were identified from most of the four shotgunned halophiles. The reconstructed molecular tree of all five halophiles shows a large divergence between these species, but with the closest relationship being between H. sp. NRC-1 and H. lacusprofundi. Conclusion Despite the diverse habitats of these species, all five halophiles share (1) high GC content and (2) low protein isoelectric points, which are characteristics associated with environmental exposure to UV radiation and hypersalinity, respectively. Identification of multiple IS elements in the genome of H. lacusprofundi and H. marismortui suggest that genome structure and dynamic genome reorganization might be similar to that previously observed in the IS-element rich genome of H. sp. NRC-1. Identification of multiple TBP and TFB homologs in these four halophiles are consistent with the hypothesis that different types of complex transcriptional regulation may occur through multiple TBP-TFB combinations in response to rapidly changing environmental conditions. Low-pass shotgun sequence analyses of genomes permit extensive and diverse analyses, and should be generally useful for comparative microbial genomics. PMID:14718067

  15. Genome sequencing and comparative genomics of honey bee microsporidia, Nosema apis reveal novel insights into host-parasite interactions.

    PubMed

    Chen, Yan ping; Pettis, Jeffery S; Zhao, Yan; Liu, Xinyue; Tallon, Luke J; Sadzewicz, Lisa D; Li, Renhua; Zheng, Huoqing; Huang, Shaokang; Zhang, Xuan; Hamilton, Michele C; Pernal, Stephen F; Melathopoulos, Andony P; Yan, Xianghe; Evans, Jay D

    2013-07-05

    The microsporidia parasite Nosema contributes to the steep global decline of honey bees that are critical pollinators of food crops. There are two species of Nosema that have been found to infect honey bees, Nosema apis and N. ceranae. Genome sequencing of N. apis and comparative genome analysis with N. ceranae, a fully sequenced microsporidia species, reveal novel insights into host-parasite interactions underlying the parasite infections. We applied the whole-genome shotgun sequencing approach to sequence and assemble the genome of N. apis which has an estimated size of 8.5 Mbp. We predicted 2,771 protein- coding genes and predicted the function of each putative protein using the Gene Ontology. The comparative genomic analysis led to identification of 1,356 orthologs that are conserved between the two Nosema species and genes that are unique characteristics of the individual species, thereby providing a list of virulence factors and new genetic tools for studying host-parasite interactions. We also identified a highly abundant motif in the upstream promoter regions of N. apis genes. This motif is also conserved in N. ceranae and other microsporidia species and likely plays a role in gene regulation across the microsporidia. The availability of the N. apis genome sequence is a significant addition to the rapidly expanding body of microsprodian genomic data which has been improving our understanding of eukaryotic genome diversity and evolution in a broad sense. The predicted virulent genes and transcriptional regulatory elements are potential targets for innovative therapeutics to break down the life cycle of the parasite.

  16. Girls with Emotional Disturbance and a History of Arrest: Characteristics and School-Based Predictors of Arrest

    ERIC Educational Resources Information Center

    Gage, Nicholas A.; Josephs, Nikki L.; Lunde, Kimberly

    2012-01-01

    Research suggests that girls receiving special education services for Emotional Disturbance (ED) may have unique characteristics and needs. Similarly, juvenile justice research has identified unique characteristics of court-involved girls. This study examined characteristics of girls with ED and a history of arrest. Additionally, classroom-based…

  17. Helicobacter pylori Heat Shock Protein A: Serologic Responses and Genetic Diversity

    PubMed Central

    Ng, Enders K. W.; Thompson, Stuart A.; Pérez-Pérez, Guillermo I.; Kansau, Imad; van der Ende, Arie; Labigne, Agnès; Sung, Joseph J. Y.; Chung, S. C. Sydney; Blaser, Martin J.

    1999-01-01

    Helicobacter pylori synthesizes an unusual GroES homolog, heat shock protein A (HspA). The present study was aimed at an assessment of the serological response to HspA in a group of Chinese patients with defined gastroduodenal pathologies and determination of whether diversity is present in the nucleotide sequences encoding HspA in isolates from these patients. Serum samples collected from 154 patients who had an upper gastrointestinal pathology and the presence of H. pylori defined by biopsy were tested for an immunoglobulin G (IgG) serologic response to H. pylori HspA by an enzyme linked immunosorbant assay. HspA-encoding nucleotide sequences in H. pylori isolates from 14 patients (7 seropositive and 7 seronegative for HspA) were analyzed by PCR and direct sequencing of the PCR products. The sequencing results were compared to those of 48 isolates from other parts of the world. Of the 154 known H. pylori-positive patients, 54 (35.1%) were seropositive for HspA. The A domain (GroES homology) of HspA was highly conserved in the 14 isolates tested. Although the B domain (metal-binding site unique to H. pylori) resembled that in the known major variant, particular amino acid substitutions allowed definition of an HspA variant associated with isolates from East Asia. There were no associations between patient characteristics and HspA seropositivity or amino acid sequences. We confirmed in this study that the clinical outcomes of H. pylori infection are not related to HspA antigenicity or to sequence variation. However, B-domain sequence variation may be a marker for the study of the genetic diversity of H. pylori strains of different geographic origins. PMID:10225839

  18. De novo sequencing analysis of the Rosa roxburghii fruit transcriptome reveals putative ascorbate biosynthetic genes and EST-SSR markers.

    PubMed

    Yan, Xiuqin; Zhang, Xue; Lu, Min; He, Yong; An, Huaming

    2015-04-25

    Rosa roxburghii Tratt. is a well-known ornamental rose species native to China. In addition, the fruits of this species are valued for their nutritional and medicinal characteristics, especially their high ascorbic acid (AsA) levels. Nevertheless, AsA biosynthesis in R. roxburghii fruit has not been explored in detail because of a lack of genomic resources for this species. High-throughput transcriptomic sequencing generating large volumes of transcript sequence data can aid in gene discovery and molecular marker development. In this study, we generated more than 53 million clean reads using Illumina paired-end sequencing technology. De novo assembly yielded 106,590 unigenes, with an average length of 343 bp. On the basis of sequence similarity to known proteins, 9301 and 2393 unigenes were classified into Gene Ontology and Clusters of Orthologous Group categories, respectively. There were 7480 unigenes assigned to 124 pathways in the Kyoto Encyclopedia of Gene and Genome pathway database. BLASTx searches identified 498 unique putative transcripts encoding various transcription factors, some known to regulate fruit development. qRT-PCR validated the expressions of most of the genes encoding the main enzymes involved in ascorbate biosynthesis. In addition, 9131 potential simple sequence repeat (SSR) loci were identified among the unigenes. One hundred and two primer pairs were synthesized and 71 pairs produced an amplification product during initial screening. Among the amplified products, 30 were polymorphic in the 16 R. roxburghii germplasms tested. Our study was the first to produce a large volume of transcriptome data from R. roxburghii. The resulting sequence collection is a valuable resource for gene discovery and marker-assisted selective breeding in this rose species. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Unique LCR variations among lineages of HPV16, 18 and 45 isolates from women with normal cervical cytology in Ghana.

    PubMed

    Awua, Adolf K; Adanu, Richard M K; Wiredu, Edwin K; Afari, Edwin A; Zubuch, Vanessa A; Asmah, Richard H; Severini, Alberto

    2017-04-21

    In addition to being useful for classification, sequence variations of human Papillomavirus (HPV) genotypes have been implicated in differential oncogenic potential and a differential association with the different histological forms of invasive cervical cancer. These associations have also been indicated for HPV genotype lineages and sub-lineages. In order to better understand the potential implications of lineage variation in the occurrence of cervical cancers in Ghana, we studied the lineages of the three most prevalent HPV genotypes among women with normal cytology as baseline to further studies. Of previously collected self- and health personnel-collected cervical specimen, 54, which were positive for HPV16, 18 and 45, were selected and the long control region (LCR) of each HPV genotype was separately amplified by a nested PCR. DNA sequences of 41 isolates obtained with the forward and reverse primers by Sanger sequencing were analysed. Nucleotide sequence variations of the HPV16 genotypes were observed at 30 positions within the LCR (7460 - 7840). Of these, 19 were the known variations for the lineages B and C (African lineages), while the other 11 positions had variations unique to the HPV16 isolates of this study. For the HPV18 isolates, the variations were at 35 positions, 22 of which were known variations of Africa lineages and the other 13 were unique variations observed for the isolates obtained in this study (at positions 7799 and 7813). HPV45 isolates had variations at 35 positions and 2 (positions 7114 and 97) were unique to the isolates of this study. This study provides the first data on the lineages of HPV 16, 18 and 45 isolates from Ghana. Although the study did not obtain full genome sequence data for a comprehensive comparison with known lineages, these genotypes were predominately of the Africa lineages and had some unique sequence variations at positions that suggest potential oncogenic implications. These data will be useful for comparison with lineages of these genotypes from women with cervical lesion and all the forms of invasive cervical cancers.

  20. DSAP: deep-sequencing small RNA analysis pipeline.

    PubMed

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

  1. Clinical germline diagnostic exome sequencing for hereditary cancer: Findings within novel candidate genes are prevalent.

    PubMed

    Powis, Zöe; Espenschied, Carin R; LaDuca, Holly; Hagman, Kelly D; Paudyal, Tripti; Li, Shuwei; Inaba, Hiroto; Mauer, Ann; Nathanson, Katherine L; Knost, James; Chao, Elizabeth C; Tang, Sha

    2018-08-01

    Clinical diagnostic exome sequencing (DES) has been effective in diagnosing individuals with suspected genetic conditions; nevertheless little has been described regarding its clinical utility in individuals with a personal and family history of cancer. This study aimed to assess diagnostic yield and clinical characteristics of pediatric and adult patients undergoing germline DES for hereditary cancer. We retrospectively reviewed 2171 patients referred for DES; cases with a personal and/or family history of cancer were further studied. Of 39 cancer patients, relevant alterations were found in eight individuals (21%), including one (3%) positive pathogenic alteration within a characterized gene, two (5%) uncertain findings in characterized genes, and five (13%) alterations in novel candidate genes. Two of the 5 pediatric patients, undergoing testing, (40%) had findings in novel candidate genes, with the remainder being negative. We include brief case studies to illustrate the variety of challenging issues related to these patients. Our observations demonstrate utility of family-based exome sequencing in patients for suspected hereditary cancer, including familial co-segregation analysis, and comprehensive medical review. DES may be particularly useful when traditional approaches do not result in a diagnosis or in families with unique phenotypes. This work also highlights the importance and complexity of analysis of uncharacterized genes in exome sequencing for hereditary cancer. Copyright © 2018 Elsevier Inc. All rights reserved.

  2. Abamectin, pymetrozine and azadirachtin sequence as a unique solution to control the leafminer Liriomyza trifolii (Burgess) (Diptera: Agromyzidae) infesting garden beans (Phaseolus vulgaris L.) in Egypt.

    PubMed

    Saad, A S A; Massoud, M A; Abdel-Megeed, A A M; Hamid, N A; Mourad, A K K; Barakat, A S T

    2007-01-01

    Field trails were conducted to determine the performance of three different sequences as a unique solution for the control of the leaf miner Liriomyza trifolii (Burgess) (Diptera: Agromyzidae) infesting garden beans (Phaseolus vulgaris L.) during the two successive seasons of 2004 and 2005. Furthermore, during the evaluation period, the side effect against the ectoparasite Diglyphus isaea (Walker) (Hymenoptera: Eulophidae) was put into consideration. Meanwhile, the comparative evaluation of the pesticides alone showed that abamectin and azadirachtin were highly effective against Liriomyza trifolii, while carbosulfan, pymetrozine and thiamethoxam provided to be of a moderate effect. Moreover, carbosulfan showed harmful effect to the larvae of the ectoparasite Diglyphus isaea (Walker), while abamectin and azadirachtin gave a moderate effect. Thiamethoxam and the the detergent (Masrol 410) had slight effect in this respect. The highly effective sequence among the sequences was abamectin, pymetrozine and azadirachtin, against Liriomyza trifolii (Burgess), with slight harmful effect on Diglyphus isaea (Walker). However the sequence of azadirachtin, pymetrozine and abamectin had a moderate effect on Liriomyza trifolii (Burgess) and exhibited a slight toxic effect on Diglyphus isaea (Walker). In contrast, the sequence of carbosulfan, thiamethoxam and pymetrozine was the least effective and represented a slight effect on Diglyphus isaea (Walker). From this study, it was concluded that abamectin, pymetrozine and azadirachtin sequence has proved to be a unique solution for the control of the leaf miner Liriomyza trifolii (Burgess) infesting garden beans (Phaseolus vulgaris L.) in Egypt.

  3. Improving the annotation of the Heterorhabditis bacteriophora genome.

    PubMed

    McLean, Florence; Berger, Duncan; Laetsch, Dominik R; Schwartz, Hillel T; Blaxter, Mark

    2018-04-01

    Genome assembly and annotation remain exacting tasks. As the tools available for these tasks improve, it is useful to return to data produced with earlier techniques to assess their credibility and correctness. The entomopathogenic nematode Heterorhabditis bacteriophora is widely used to control insect pests in horticulture. The genome sequence for this species was reported to encode an unusually high proportion of unique proteins and a paucity of secreted proteins compared to other related nematodes. We revisited the H. bacteriophora genome assembly and gene predictions to determine whether these unusual characteristics were biological or methodological in origin. We mapped an independent resequencing dataset to the genome and used the blobtools pipeline to identify potential contaminants. While present (0.2% of the genome span, 0.4% of predicted proteins), assembly contamination was not significant. Re-prediction of the gene set using BRAKER1 and published transcriptome data generated a predicted proteome that was very different from the published one. The new gene set had a much reduced complement of unique proteins, better completeness values that were in line with other related species' genomes, and an increased number of proteins predicted to be secreted. It is thus likely that methodological issues drove the apparent uniqueness of the initial H. bacteriophora genome annotation and that similar contamination and misannotation issues affect other published genome assemblies.

  4. The Divided Bacterial Genome: Structure, Function, and Evolution.

    PubMed

    diCenzo, George C; Finan, Turlough M

    2017-09-01

    Approximately 10% of bacterial genomes are split between two or more large DNA fragments, a genome architecture referred to as a multipartite genome. This multipartite organization is found in many important organisms, including plant symbionts, such as the nitrogen-fixing rhizobia, and plant, animal, and human pathogens, including the genera Brucella , Vibrio , and Burkholderia . The availability of many complete bacterial genome sequences means that we can now examine on a broad scale the characteristics of the different types of DNA molecules in a genome. Recent work has begun to shed light on the unique properties of each class of replicon, the unique functional role of chromosomal and nonchromosomal DNA molecules, and how the exploitation of novel niches may have driven the evolution of the multipartite genome. The aims of this review are to (i) outline the literature regarding bacterial genomes that are divided into multiple fragments, (ii) provide a meta-analysis of completed bacterial genomes from 1,708 species as a way of reviewing the abundant information present in these genome sequences, and (iii) provide an encompassing model to explain the evolution and function of the multipartite genome structure. This review covers, among other topics, salient genome terminology; mechanisms of multipartite genome formation; the phylogenetic distribution of multipartite genomes; how each part of a genome differs with respect to genomic signatures, genetic variability, and gene functional annotation; how each DNA molecule may interact; as well as the costs and benefits of this genome structure. Copyright © 2017 American Society for Microbiology.

  5. Je, a versatile suite to handle multiplexed NGS libraries with unique molecular identifiers.

    PubMed

    Girardot, Charles; Scholtalbers, Jelle; Sauer, Sajoscha; Su, Shu-Yi; Furlong, Eileen E M

    2016-10-08

    The yield obtained from next generation sequencers has increased almost exponentially in recent years, making sample multiplexing common practice. While barcodes (known sequences of fixed length) primarily encode the sample identity of sequenced DNA fragments, barcodes made of random sequences (Unique Molecular Identifier or UMIs) are often used to distinguish between PCR duplicates and transcript abundance in, for example, single-cell RNA sequencing (scRNA-seq). In paired-end sequencing, different barcodes can be inserted at each fragment end to either increase the number of multiplexed samples in the library or to use one of the barcodes as UMI. Alternatively, UMIs can be combined with the sample barcodes into composite barcodes, or with standard Illumina® indexing. Subsequent analysis must take read duplicates and sample identity into account, by identifying UMIs. Existing tools do not support these complex barcoding configurations and custom code development is frequently required. Here, we present Je, a suite of tools that accommodates complex barcoding strategies, extracts UMIs and filters read duplicates taking UMIs into account. Using Je on publicly available scRNA-seq and iCLIP data containing UMIs, the number of unique reads increased by up to 36 %, compared to when UMIs are ignored. Je is implemented in JAVA and uses the Picard API. Code, executables and documentation are freely available at http://gbcs.embl.de/Je . Je can also be easily installed in Galaxy through the Galaxy toolshed.

  6. Using the Self-Select Paradigm to Delineate the Nature of Speech Motor Programming

    PubMed Central

    Wright, David L.; Robin, Don A.; Rhee, Jooyhun; Vaculin, Amber; Jacks, Adam; Guenther, Frank H.; Fox, Peter T.

    2015-01-01

    Purpose The authors examined the involvement of 2 speech motor programming processes identified by S. T. Klapp (1995, 2003) during the articulation of utterances differing in syllable and sequence complexity. According to S. T. Klapp, 1 process, INT, resolves the demands of the programmed unit, whereas a second process, SEQ, oversees the serial order demands of longer sequences. Method A modified reaction time paradigm was used to assess INT and SEQ demands. Specifically, syllable complexity was dependent on syllable structure, whereas sequence complexity involved either repeated or unique syllabi within an utterance. Results INT execution was slowed when articulating single syllables in the form CCCV compared to simpler CV syllables. Planning unique syllables within a multisyllabic utterance rather than repetitions of the same syllable slowed INT but not SEQ. Conclusions The INT speech motor programming process, important for mental syllabary access, is sensitive to changes in both syllable structure and the number of unique syllables in an utterance. PMID:19474396

  7. RUCS: rapid identification of PCR primers for unique core sequences.

    PubMed

    Thomsen, Martin Christen Frølund; Hasman, Henrik; Westh, Henrik; Kaya, Hülya; Lund, Ole

    2017-12-15

    Designing PCR primers to target a specific selection of whole genome sequenced strains can be a long, arduous and sometimes impractical task. Such tasks would benefit greatly from an automated tool to both identify unique targets, and to validate the vast number of potential primer pairs for the targets in silico. Here we present RUCS, a program that will find PCR primer pairs and probes for the unique core sequences of a positive genome dataset complement to a negative genome dataset. The resulting primer pairs and probes are in addition to simple selection also validated through a complex in silico PCR simulation. We compared our method, which identifies the unique core sequences, against an existing tool called ssGeneFinder, and found that our method was 6.5-20 times more sensitive. We used RUCS to design primer pairs that would target a set of genomes known to contain the mcr-1 colistin resistance gene. Three of the predicted pairs were chosen for experimental validation using PCR and gel electrophoresis. All three pairs successfully produced an amplicon with the target length for the samples containing mcr-1 and no amplification products were produced for the negative samples. The novel methods presented in this manuscript can reduce the time needed to identify target sequences, and provide a quick virtual PCR validation to eliminate time wasted on ambiguously binding primers. Source code is freely available on https://bitbucket.org/genomicepidemiology/rucs. Web service is freely available on https://cge.cbs.dtu.dk/services/RUCS. mcft@cbs.dtu.dk. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.

  8. Giraffe genome sequence reveals clues to its unique morphology and physiology

    PubMed Central

    Agaba, Morris; Ishengoma, Edson; Miller, Webb C.; McGrath, Barbara C.; Hudson, Chelsea N.; Bedoya Reina, Oscar C.; Ratan, Aakrosh; Burhans, Rico; Chikhi, Rayan; Medvedev, Paul; Praul, Craig A.; Wu-Cavener, Lan; Wood, Brendan; Robertson, Heather; Penfold, Linda; Cavener, Douglas R.

    2016-01-01

    The origins of giraffe's imposing stature and associated cardiovascular adaptations are unknown. Okapi, which lacks these unique features, is giraffe's closest relative and provides a useful comparison, to identify genetic variation underlying giraffe's long neck and cardiovascular system. The genomes of giraffe and okapi were sequenced, and through comparative analyses genes and pathways were identified that exhibit unique genetic changes and likely contribute to giraffe's unique features. Some of these genes are in the HOX, NOTCH and FGF signalling pathways, which regulate both skeletal and cardiovascular development, suggesting that giraffe's stature and cardiovascular adaptations evolved in parallel through changes in a small number of genes. Mitochondrial metabolism and volatile fatty acids transport genes are also evolutionarily diverged in giraffe and may be related to its unusual diet that includes toxic plants. Unexpectedly, substantial evolutionary changes have occurred in giraffe and okapi in double-strand break repair and centrosome functions. PMID:27187213

  9. Mid-Permian Phosphoria Sea in Nevada and the Upwelling Model

    USGS Publications Warehouse

    Ketner, Keith B.

    2009-01-01

    The Phosphoria Sea extended at least 500 km westward and at least 700 km southwestward from its core area centered in southeastern Idaho. Throughout that extent it displayed many characteristic features of the core: the same fauna, the same unique sedimentary assemblage including phosphate in mostly pelletal form, chert composed mainly of sponge spicules, and an association with dolomite. Phosphoria-age sediments in Nevada display ample evidence of deposition in shallow water. The chief difference between the sediments in Nevada and those of the core area is the greater admixture of sandstone and conglomerate in Nevada. Evidence of the western margin of the Phosphoria Sea where the water deepened and began to lose its essential characteristics is located in the uppermost part of the Upper Devonian to Permian Havallah sequence, which has been displaced tectonically eastward an unknown distance. The relatively deep water in which the mid-Permian part of the Havallah was deposited was a sea of probably restricted east-west width and was floored by a very thick sequence of mainly terrigenous sedimentary rocks. The phosphate content of mid-Permian strata in western exposures tends to be relatively low as a percentage, but the thickness of those strata tends to be high. The core area in and near southeastern Idaho where the concentration of phosphate is highest was separated from any possible site of upwelling oceanic waters by a great expanse of shallow sea.

  10. DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

    PubMed

    Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

    2016-05-03

    DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.

  11. Theory of the decision/problem state

    NASA Technical Reports Server (NTRS)

    Dieterly, D. L.

    1980-01-01

    A theory of the decision-problem state was introduced and elaborated. Starting with the basic model of a decision-problem condition, an attempt was made to explain how a major decision-problem may consist of subsets of decision-problem conditions composing different condition sequences. In addition, the basic classical decision-tree model was modified to allow for the introduction of a series of characteristics that may be encountered in an analysis of a decision-problem state. The resulting hierarchical model reflects the unique attributes of the decision-problem state. The basic model of a decision-problem condition was used as a base to evolve a more complex model that is more representative of the decision-problem state and may be used to initiate research on decision-problem states.

  12. Complete Genome Sequences of Bacillus Phages Janet and OTooleKemple52

    PubMed Central

    2018-01-01

    ABSTRACT We report here the genome sequences of two novel Bacillus cereus group-infecting bacteriophages, Janet and OTooleKemple52. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples. While their genomes share a high degree of sequence identity with one another, their host preferences are unique. PMID:29748396

  13. Novel Insights into Tree Biology and Genome Evolution as Revealed Through Genomics.

    PubMed

    Neale, David B; Martínez-García, Pedro J; De La Torre, Amanda R; Montanari, Sara; Wei, Xiao-Xin

    2017-04-28

    Reference genome sequences are the key to the discovery of genes and gene families that determine traits of interest. Recent progress in sequencing technologies has enabled a rapid increase in genome sequencing of tree species, allowing the dissection of complex characters of economic importance, such as fruit and wood quality and resistance to biotic and abiotic stresses. Although the number of reference genome sequences for trees lags behind those for other plant species, it is not too early to gain insight into the unique features that distinguish trees from nontree plants. Our review of the published data suggests that, although many gene families are conserved among herbaceous and tree species, some gene families, such as those involved in resistance to biotic and abiotic stresses and in the synthesis and transport of sugars, are often expanded in tree genomes. As the genomes of more tree species are sequenced, comparative genomics will further elucidate the complexity of tree genomes and how this relates to traits unique to trees.

  14. Generation of a total of 6483 expressed sequence tags from 60 day-old bovine whole fetus and fetal placenta.

    PubMed

    Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y

    2004-05-01

    Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.

  15. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy

    PubMed Central

    2017-01-01

    Unique Molecular Identifiers (UMIs) are random oligonucleotide barcodes that are increasingly used in high-throughput sequencing experiments. Through a UMI, identical copies arising from distinct molecules can be distinguished from those arising through PCR amplification of the same molecule. However, bioinformatic methods to leverage the information from UMIs have yet to be formalized. In particular, sequencing errors in the UMI sequence are often ignored or else resolved in an ad hoc manner. We show that errors in the UMI sequence are common and introduce network-based methods to account for these errors when identifying PCR duplicates. Using these methods, we demonstrate improved quantification accuracy both under simulated conditions and real iCLIP and single-cell RNA-seq data sets. Reproducibility between iCLIP replicates and single-cell RNA-seq clustering are both improved using our proposed network-based method, demonstrating the value of properly accounting for errors in UMIs. These methods are implemented in the open source UMI-tools software package. PMID:28100584

  16. Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

    PubMed

    Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

    2017-10-01

    Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Molecular identification and characterization of clustered regularly interspaced short palindromic repeats (CRISPRs) in a urease-positive thermophilic Campylobacter sp. (UPTC).

    PubMed

    Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M

    2012-02-01

    Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.

  18. Structural analysis of a set of proteins resulting from a bacterial genomics project.

    PubMed

    Badger, J; Sauder, J M; Adams, J M; Antonysamy, S; Bain, K; Bergseid, M G; Buchanan, S G; Buchanan, M D; Batiyenko, Y; Christopher, J A; Emtage, S; Eroshkina, A; Feil, I; Furlong, E B; Gajiwala, K S; Gao, X; He, D; Hendle, J; Huber, A; Hoda, K; Kearins, P; Kissinger, C; Laubert, B; Lewis, H A; Lin, J; Loomis, K; Lorimer, D; Louie, G; Maletic, M; Marsh, C D; Miller, I; Molinari, J; Muller-Dieckmann, H J; Newman, J M; Noland, B W; Pagarigan, B; Park, F; Peat, T S; Post, K W; Radojicic, S; Ramos, A; Romero, R; Rutter, M E; Sanderson, W E; Schwinn, K D; Tresser, J; Winhoven, J; Wright, T A; Wu, L; Xu, J; Harris, T J R

    2005-09-01

    The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se-Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X-ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence-structure relationships between the SGX and PDB structures were investigated using PDB-BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Copyright 2005 Wiley-Liss, Inc.

  19. Cloning and polymorphisms of yak lactate dehydrogenase B gene.

    PubMed

    Wang, Guosheng; Zhao, Xingbo; Zhong, Juming; Cao, Meng; He, Qinghua; Liu, Zhengxin; Lin, Yaqiu; Xu, Yaou; Zheng, Yucai

    2013-06-05

    The main objective of this work was to study the unique polymorphisms of the lactate dehydrogenase-1 (LDH1) gene in yak (Bos grunniens). Native polyacrylamide gel electrophoresis revealed three phenotypes of LDH1 (a tetramer of H subunit) in yak heart and longissimus muscle extracts. The corresponding gene, ldhb, encoding H subunits of three LDH1 phenotypes was obtained by RT-PCR. A total of six nucleotide differences were detected in yak ldhb compared with that of cattle, of which five mutations cause amino acid substitutions. Sequence analysis shows that the G896A and C689A, mutations of ldhb gene, result in alterations of differently charged amino acids, and create the three phenotypes (F, M, and S) of yak LDH1. Molecular modeling of the H subunit of LDH indicates that the substituted amino acids are not located within NAD+ or substrate binding sites. PCR-RFLP examination of G896A mutation demonstrated that most LDH1-F samples are actually heterozygote at this site. These results help to elucidate the molecular basis and genetic characteristic of the three unique LDH1 phenotypes in yak.

  20. Cloning and Polymorphisms of Yak Lactate Dehydrogenase b Gene

    PubMed Central

    Wang, Guosheng; Zhao, Xingbo; Zhong, Juming; Cao, Meng; He, Qinghua; Liu, Zhengxin; Lin, Yaqiu; Xu, Yaou; Zheng, Yucai

    2013-01-01

    The main objective of this work was to study the unique polymorphisms of the lactate dehydrogenase-1 (LDH1) gene in yak (Bos grunniens). Native polyacrylamide gel electrophoresis revealed three phenotypes of LDH1 (a tetramer of H subunit) in yak heart and longissimus muscle extracts. The corresponding gene, ldhb, encoding H subunits of three LDH1 phenotypes was obtained by RT-PCR. A total of six nucleotide differences were detected in yak ldhb compared with that of cattle, of which five mutations cause amino acid substitutions. Sequence analysis shows that the G896A and C689A, mutations of ldhb gene, result in alterations of differently charged amino acids, and create the three phenotypes (F, M, and S) of yak LDH1. Molecular modeling of the H subunit of LDH indicates that the substituted amino acids are not located within NAD+ or substrate binding sites. PCR-RFLP examination of G896A mutation demonstrated that most LDH1-F samples are actually heterozygote at this site. These results help to elucidate the molecular basis and genetic characteristic of the three unique LDH1 phenotypes in yak. PMID:23739677

  1. Membrane-bound human orphan cytochrome P450 2U1: Sequence singularities, construction of a full 3D model, and substrate docking.

    PubMed

    Ducassou, Lionel; Dhers, Laura; Jonasson, Gabriella; Pietrancosta, Nicolas; Boucher, Jean-Luc; Mansuy, Daniel; André, François

    2017-09-01

    Human cytochrome P450 2U1 (CYP2U1) is an orphan CYP that exhibits several distinctive characteristics among the 57 human CYPs with a highly conserved sequence in almost all living organisms. We compared its protein sequence with those of the 57 human CYPs and constructed a 3D structure of a full-length CYP2U1 model bound to a POPC membrane. We also performed docking experiments of arachidonic acid (AA) and N-arachidonoylserotonin (AS) in this model. The protein sequence of CYP2U1 displayed two unique characteristics when compared to those of the human CYPs, the presence of a longer N-terminal region upstream of the putative trans-membrane helix (TMH) containing 8 proline residues, and of an insert of about 20 amino acids containing 5 arginine residues between helices A' and A. Its N-terminal part upstream of TMH involved an additional short terminal helix, in a manner similar to what was reported in the crystal structure of Saccharomyces cerevisiae CYP51. Our model also showed a specific interaction between the charged residues of insert AA' and phosphate groups of lipid polar heads, suggesting a possible role of this insert in substrate recruitment. Docking of AA and AS in this model showed these substrates in channel 2ac, with the terminal alkyl chain of AA or the indole ring of AS close to the heme, in agreement with the reported CYP2U1-catalyzed AA and AS hydroxylation regioselectivities. This model should be useful to find new endogenous or exogenous CYP2U1 substrates and to interpret the regioselectivity of their hydroxylation. Copyright © 2017 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  2. Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

    PubMed Central

    Barnes, W M; Bevan, M

    1983-01-01

    A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. Images PMID:6298723

  3. Application of combinatorial biocatalysis for a unique ring expansion of dihydroxymethylzearalenone

    USDA-ARS?s Scientific Manuscript database

    Combinatorial biocatalysis was applied to generate a diverse set of dihydroxymethylzearalenone derivatives with modified ring structure. In one chemoenzymatic reaction sequence, dihydroxymethylzearalenone was first subjected to a unique enzyme-catalyzed oxidative ring opening reaction that creates ...

  4. A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution

    PubMed Central

    Danley, Patrick D; Mullen, Sean P; Liu, Fenglong; Nene, Vishvanath; Quackenbush, John; Shaw, Kerry L

    2007-01-01

    Background As the developmental costs of genomic tools decline, genomic approaches to non-model systems are becoming more feasible. Many of these systems may lack advanced genetic tools but are extremely valuable models in other biological fields. Here we report the development of expressed sequence tags (EST's) in an orthopteroid insect, a model for the study of neurobiology, speciation, and evolution. Results We report the sequencing of 14,502 EST's from clones derived from a nerve cord cDNA library, and the subsequent construction of a Gene Index from these sequences, from the Hawaiian trigonidiine cricket Laupala kohalensis. The Gene Index contains 8607 unique sequences comprised of 2575 tentative consensus (TC) sequences and 6032 singletons. For each of the unique sequences, an attempt was made to assign a provisional annotation and to categorize its function using a Gene Ontology-based classification through a sequence-based comparison to known proteins. In addition, a set of unique 70 base pair oligomers that can be used for DNA microarrays was developed. All Gene Index information is posted at the DFCI Gene Indices web page Conclusion Orthopterans are models used to understand the neurophysiological basis of complex motor patterns such as flight and stridulation. The sequences presented in the cricket Gene Index will provide neurophysiologists with many genetic tools that have been largely absent in this field. The cricket Gene Index is one of only two gene indices to be developed in an evolutionary model system. Species within the genus Laupala have speciated recently, rapidly, and extensively. Therefore, the genes identified in the cricket Gene Index can be used to study the genomics of speciation. Furthermore, this gene index represents a significant EST resources for basal insects. As such, this resource is a valuable comparative tool for the understanding of invertebrate molecular evolution. The sequences presented here will provide much needed genomic resources for three distinct but overlapping fields of inquiry: neurobiology, speciation, and molecular evolution. PMID:17459168

  5. DNA sequence analysis of the composite plasmid pTC conferring virulence and antimicrobial resistance for porcine enterotoxigenic Escherichia coli.

    PubMed

    Fekete, Péter Z; Brzuszkiewicz, Elzbieta; Blum-Oehler, Gabriele; Olasz, Ferenc; Szabó, Mónika; Gottschalk, Gerhard; Hacker, Jörg; Nagy, Béla

    2012-01-01

    In this study the plasmid pTC, a 90 kb self-conjugative virulence plasmid of the porcine enterotoxigenic Escherichia coli (ETEC) strain EC2173 encoding the STa and STb heat-stable enterotoxins and tetracycline resistance, has been sequenced in two steps. As a result we identified five main distinct regions of pTC: (i) the maintenance region responsible for the extreme stability of the plasmid, (ii) the TSL (toxin-specific locus comprising the estA and estB genes) which is unique and characteristic for pTC, (iii) a Tn10 transposon, encoding tetracycline resistance, (iv) the tra (plasmid transfer) region, and (v) the colE1-like origin of replication. It is concluded that pTC is a self-transmissible composite plasmid harbouring antibiotic resistance and virulence genes. pTC belongs to a group of large conjugative E. coli plasmids represented by NR1 with a widespread tra backbone which might have evolved from a common ancestor. This is the first report of a completely sequenced animal ETEC virulence plasmid containing an antimicrobial resistance locus, thereby representing a selection advantage for spread of pathogenicity in the presence of antimicrobials leading to increased disease potential. Copyright © 2011. Published by Elsevier GmbH.

  6. A Molecular Method for the Identification of Honey Bee Subspecies Used by Beekeepers in Russia

    PubMed Central

    Syromyatnikov, Mikhail Y.; Borodachev, Anatoly V.; Kokina, Anastasia V.; Popov, Vasily N.

    2018-01-01

    Apis mellifera L. includes several recognized subspecies that differ in their biological properties and agricultural characteristics. Distinguishing between honey bee subspecies is complicated. We analyzed the Folmer region of the COX1 gene in honey bee subspecies cultivated at bee farms in Russia and identified subspecies-specific SNPs. DNA analysis revealed two clearly distinct haplogroups in A. mellifera mellifera. The first one was characterized by multiple cytosine-thymine (thymine–cytosine) transitions, one adenine-guanine substitution, and one thymine–adenine substitution. The nucleotide sequence of the second haplogroup coincided with sequences from other subspecies, except the unique C/A SNP at position 421 of the 658-bp Folmer region. A. mellifera carnica and A. mellifera carpatica could be distinguished from A. mellifera mellifera and A. mellifera caucasica by the presence of the A/G SNP at position 99 of the 658-bp Folmer region. The G/A SNP at position 448 was typical for A. mellifera carnica. A. mellifera caucasica COX1 sequence lacked all the above-mentioned sites. We developed a procedure for rapid identification of honey bee subspecies by PCR with restriction fragment length polymorphism (RFLP) using mutagenic primers. The developed molecular method for honey bee subspecies identification is fast and inexpensive. PMID:29382048

  7. A novel class of small RNAs bind to MILI protein in mouse testes.

    PubMed

    Aravin, Alexei; Gaidatzis, Dimos; Pfeffer, Sébastien; Lagos-Quintana, Mariana; Landgraf, Pablo; Iovino, Nicola; Morris, Patricia; Brownstein, Michael J; Kuramochi-Miyagawa, Satomi; Nakano, Toru; Chien, Minchen; Russo, James J; Ju, Jingyue; Sheridan, Robert; Sander, Chris; Zavolan, Mihaela; Tuschl, Thomas

    2006-07-13

    Small RNAs bound to Argonaute proteins recognize partially or fully complementary nucleic acid targets in diverse gene-silencing processes. A subgroup of the Argonaute proteins--known as the 'Piwi family'--is required for germ- and stem-cell development in invertebrates, and two Piwi members--MILI and MIWI--are essential for spermatogenesis in mouse. Here we describe a new class of small RNAs that bind to MILI in mouse male germ cells, where they accumulate at the onset of meiosis. The sequences of the over 1,000 identified unique molecules share a strong preference for a 5' uridine, but otherwise cannot be readily classified into sequence families. Genomic mapping of these small RNAs reveals a limited number of clusters, suggesting that these RNAs are processed from long primary transcripts. The small RNAs are 26-31 nucleotides (nt) in length--clearly distinct from the 21-23 nt of microRNAs (miRNAs) or short interfering RNAs (siRNAs)--and we refer to them as 'Piwi-interacting RNAs' or piRNAs. Orthologous human chromosomal regions also give rise to small RNAs with the characteristics of piRNAs, but the cloned sequences are distinct. The identification of this new class of small RNAs provides an important starting point to determine the molecular function of Piwi proteins in mammalian spermatogenesis.

  8. Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.

    PubMed

    Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

    2007-10-05

    Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.

  9. Hydraulic fracturing and the Crooked Lake Sequences: Insights gleaned from regional seismic networks

    NASA Astrophysics Data System (ADS)

    Schultz, Ryan; Stern, Virginia; Novakovic, Mark; Atkinson, Gail; Gu, Yu Jeffrey

    2015-04-01

    Within central Alberta, Canada, a new sequence of earthquakes has been recognized as of 1 December 2013 in a region of previous seismic quiescence near Crooked Lake, ~30 km west of the town of Fox Creek. We utilize a cross-correlation detection algorithm to detect more than 160 events to the end of 2014, which is temporally distinguished into five subsequences. This observation is corroborated by the uniqueness of waveforms clustered by subsequence. The Crooked Lake Sequences have come under scrutiny due to its strong temporal correlation (>99.99%) to the timing of hydraulic fracturing operations in the Duvernay Formation. We assert that individual subsequences are related to fracturing stimulation and, despite adverse initial station geometry, double-difference techniques allow us to spatially relate each cluster back to a unique horizontal well. Overall, we find that seismicity in the Crooked Lake Sequences is consistent with first-order observations of hydraulic fracturing induced seismicity.

  10. Dynamics of actin evolution in dinoflagellates.

    PubMed

    Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

    2011-04-01

    Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.

  11. A population study of the minicircles in Trypanosoma cruzi: predicting guide RNAs in the absence of empirical RNA editing.

    PubMed

    Thomas, Sean; Martinez, L L Isadora Trejo; Westenberger, Scott J; Sturm, Nancy R

    2007-05-24

    The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for Trypanosoma cruzi, the causative agent of Chagas disease, however the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised. Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from T. cruzi whole genome shotgun sequencing data. With these sequences and all published T. cruzi minicircle sequences, 108 unique guide RNAs from all known T. cruzi minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in T. cruzi minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found. The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of T. cruzi guide RNAs. Based on these preferences, a method of predicting T. cruzi guide RNAs using only primary minicircle sequence data was created.

  12. Sequence divergence of the red and green visual pigments in great apes and humans.

    PubMed Central

    Deeb, S S; Jorgensen, A L; Battisti, L; Iwasaki, L; Motulsky, A G

    1994-01-01

    We have determined the coding sequences of red and green visual pigment genes of the chimpanzee, gorilla, and orangutan. The deduced amino acid sequences of these pigments are highly homologous to the equivalent human pigments. None of the amino acid differences occurred at sites that were previously shown to influence pigment absorption characteristics. Therefore, we predict the spectra of red and green pigments of the apes to have wavelengths of maximum absorption that differ by < 2 nm from the equivalent human pigments and that color vision in these nonhuman primates will be very similar, if not identical, to that in humans. A total of 14 within-species polymorphisms (6 involving silent substitutions) were observed in the coding sequences of the red and green pigment genes of the great apes. Remarkably, the polymorphisms at 6 of these sites had been observed in human populations, suggesting that they predated the evolution of higher primates. Alleles at polymorphic sites were often shared between the red and green pigment genes. The average synonymous rate of divergence of red from green sequences was approximately 1/10th that estimated for other proteins of higher primates, indicating the involvement of gene conversion in generating these polymorphisms. The high degree of homology and juxtaposition of these two genes on the X chromosome has promoted unequal recombination and/or gene conversion that led to sequence homogenization. However, natural selection operated to maintain the degree of separation in peak absorbance between the red and green pigments that resulted in optimal chromatic discrimination. This represents a unique case of molecular coevolution between two homologous genes that functionally interact at the behavioral level. PMID:8041777

  13. The human homolog of S. cerevisiae CDC27, CDC27 Hs, is encoded by a highly conserved intronless gene present in multiple copies in the human genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Devor, E.J.; Dill-Devor, R.M.

    1994-09-01

    We have obtained a number of unique sequences via PCR amplification of human genomic DNA using degenerate primers under low stringency (42{degrees}C). One of these, an 853 bp product, has been identified as a partial genomic sequence of the human homolog of the S. cerevisiae CDC27 gene, CDC27Hs (GenBank No. U00001). This gene, reported by Turgendreich et al. is also designated EST00556 from Adams et al. We have undertaken a more detailed examination of our sequence, MCP34N, and have found that: 1. the genomic sequence is nearly identical to CDC27Hs over its entire 853 bp length; 2. an MCP34N-specific PCRmore » assay of several non-human primate species reveals amplification products in chimpanzee and gorilla genomes having greater than 90% sequence identity with CDC27Hs; and 3. an MCP34N-specific PCR assay of the BIOS hybrid cell line panel gives a discordancy pattern suggesting multiple loci. Based upon these data, we present the following initial characterization: 1. the complete MCP34N sequence identity with CDC27Hs indicates that the latter is encoded by an intronless gene; 2. CDC27Hs is highly conserved among higher primates; and 3. CDC27Hs is present in multiple copies in the human genome. These characteristics, taken together with those initially reported for CDC27Hs, suggest that this is an old gene that carries out an important but, as yet, unknown function in the human brain.« less

  14. Early Evolution of Conserved Regulatory Sequences Associated with Development in Vertebrates

    PubMed Central

    McEwen, Gayle K.; Goode, Debbie K.; Parker, Hugo J.; Woolfe, Adam; Callaway, Heather; Elgar, Greg

    2009-01-01

    Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA. PMID:20011110

  15. Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

    PubMed Central

    2010-01-01

    Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644

  16. Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

    PubMed

    Lathe, R

    1985-05-05

    Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.

  17. Changes in the flagellar bundling time account for variations in swimming behavior of flagellated bacteria in viscous media

    NASA Astrophysics Data System (ADS)

    Qu, Zijie; Temel, Fatma; Henderikx, Rene; Breuer, Kenneth

    2017-11-01

    The motility of bacteria E.coli in viscous fluids has been widely studied, although conflicting results on the effect of viscosity on swimming speed abound. The swimming mode of wild-type E.coli is idealized as a run-and-tumble sequence in which periods of straight swimming at a constant speed are randomly interrupted by a tumble, defined as a sudden change of direction with a very low speed. Using a tracking microscope, we follow cells for extended time and find that the swimming behavior of a single cell can exhibit a variety of behaviors including run-and-tumble and ``slow-random-walk'' in which the cells move at relatively low speed without the characteristic run. Although the characteristic swimming speed varies between individuals and in different polymer solutions, we find that the skewness of the speed distribution is solely a function of viscosity, and uniquely determines the ratio of the average speed to the characteristic run speed. Using Resistive Force Theory and the cell-specific measured characteristic run speed, we show that differences in the swimming behavior observed in solutions of different viscosity are due to changes in the flagellar bundling time, which increases as the viscosity rises, due to lower rotation rate of the flagellar motor. National Science Foundation.

  18. Rotary pin-in-maze discriminator

    DOEpatents

    Benavides, Gilbert L.

    1997-01-01

    A discriminator apparatus and method that discriminates between a unique signal and any other (incorrect) signal. The unique signal is a sequence of events; each event can assume one of two possible event states. Given the unique signal, a maze wheel is allowed to rotate fully in one direction. Given an incorrect signal, both the maze wheel and a pin wheel lock in position.

  19. Genome sequence of an aflatoxigenic pathogen of Argentinian peanut, Aspergillus arachidicola

    USDA-ARS?s Scientific Manuscript database

    In this study we sequenced the genome of the A. arachidicola Type strain (CBS 117610) and found its genome size to be 38.9 Mb, and its number of predicted genes to be 12,091, which are values comparable to those in other sequenced Aspergilli. Of its predicted genes, 691 were identified as unique to ...

  20. Complete Genome Sequences of Bacillus Phages Janet and OTooleKemple52.

    PubMed

    Kent, Brenna; Raymond, Thomas; Mosier, Philip D; Johnson, Allison A

    2018-05-10

    We report here the genome sequences of two novel Bacillus cereus group-infecting bacteriophages, Janet and OTooleKemple52. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples. While their genomes share a high degree of sequence identity with one another, their host preferences are unique. Copyright © 2018 Kent et al.

  1. Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D.

    PubMed

    Matsuzaki, Motomichi; Misumi, Osami; Shin-I, Tadasu; Maruyama, Shinichiro; Takahara, Manabu; Miyagishima, Shin-Ya; Mori, Toshiyuki; Nishida, Keiji; Yagisawa, Fumi; Nishida, Keishin; Yoshida, Yamato; Nishimura, Yoshiki; Nakao, Shunsuke; Kobayashi, Tamaki; Momoyama, Yu; Higashiyama, Tetsuya; Minoda, Ayumi; Sano, Masako; Nomoto, Hisayo; Oishi, Kazuko; Hayashi, Hiroko; Ohta, Fumiko; Nishizaka, Satoko; Haga, Shinobu; Miura, Sachiko; Morishita, Tomomi; Kabeya, Yukihiro; Terasawa, Kimihiro; Suzuki, Yutaka; Ishii, Yasuyuki; Asakawa, Shuichi; Takano, Hiroyoshi; Ohta, Niji; Kuroiwa, Haruko; Tanaka, Kan; Shimizu, Nobuyoshi; Sugano, Sumio; Sato, Naoki; Nozaki, Hisayoshi; Ogasawara, Naotake; Kohara, Yuji; Kuroiwa, Tsuneyoshi

    2004-04-08

    Small, compact genomes of ultrasmall unicellular algae provide information on the basic and essential genes that support the lives of photosynthetic eukaryotes, including higher plants. Here we report the 16,520,305-base-pair sequence of the 20 chromosomes of the unicellular red alga Cyanidioschyzon merolae 10D as the first complete algal genome. We identified 5,331 genes in total, of which at least 86.3% were expressed. Unique characteristics of this genomic structure include: a lack of introns in all but 26 genes; only three copies of ribosomal DNA units that maintain the nucleolus; and two dynamin genes that are involved only in the division of mitochondria and plastids. The conserved mosaic origin of Calvin cycle enzymes in this red alga and in green plants supports the hypothesis of the existence of single primary plastid endosymbiosis. The lack of a myosin gene, in addition to the unexpressed actin gene, suggests a simpler system of cytokinesis. These results indicate that the C. merolae genome provides a model system with a simple gene composition for studying the origin, evolution and fundamental mechanisms of eukaryotic cells.

  2. Clustering method for counting passengers getting in a bus with single camera

    NASA Astrophysics Data System (ADS)

    Yang, Tao; Zhang, Yanning; Shao, Dapei; Li, Ying

    2010-03-01

    Automatic counting of passengers is very important for both business and security applications. We present a single-camera-based vision system that is able to count passengers in a highly crowded situation at the entrance of a traffic bus. The unique characteristics of the proposed system include, First, a novel feature-point-tracking- and online clustering-based passenger counting framework, which performs much better than those of background-modeling-and foreground-blob-tracking-based methods. Second, a simple and highly accurate clustering algorithm is developed that projects the high-dimensional feature point trajectories into a 2-D feature space by their appearance and disappearance times and counts the number of people through online clustering. Finally, all test video sequences in the experiment are captured from a real traffic bus in Shanghai, China. The results show that the system can process two 320×240 video sequences at a frame rate of 25 fps simultaneously, and can count passengers reliably in various difficult scenarios with complex interaction and occlusion among people. The method achieves high accuracy rates up to 96.5%.

  3. Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics

    NASA Astrophysics Data System (ADS)

    Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng

    2015-12-01

    Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.

  4. Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics.

    PubMed

    Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng

    2015-12-22

    Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.

  5. Insights into heliobacterial photosynthesis and physiology from the genome of Heliobacterium modesticaldum.

    PubMed

    Sattley, W Matthew; Blankenship, Robert E

    2010-06-01

    The complete annotated genome sequence of Heliobacterium modesticaldum strain Ice1 provides our first glimpse into the genetic potential of the Heliobacteriaceae, a unique family of anoxygenic phototrophic bacteria. H. modesticaldum str. Ice1 is the first completely sequenced phototrophic representative of the Firmicutes, and heliobacteria are the only phototrophic members of this large bacterial phylum. The H. modesticaldum genome consists of a single 3.1-Mb circular chromosome with no plasmids. Of special interest are genomic features that lend insight to the physiology and ecology of heliobacteria, including the genetic inventory of the photosynthesis gene cluster. Genes involved in transport, photosynthesis, and central intermediary metabolism are described and catalogued. The obligately heterotrophic metabolism of heliobacteria is a key feature of the physiology and evolution of these phototrophs. The conspicuous absence of recognizable genes encoding the enzyme ATP-citrate lyase prevents autotrophic growth via the reverse citric acid cycle in heliobacteria, thus being a distinguishing differential characteristic between heliobacteria and green sulfur bacteria. The identities of electron carriers that enable energy conservation by cyclic light-driven electron transfer remain in question.

  6. Methylotrophic Methylobacterium Bacteria Nodulate and Fix Nitrogen in Symbiosis with Legumes

    PubMed Central

    Sy, Abdoulaye; Giraud, Eric; Jourand, Philippe; Garcia, Nelly; Willems, Anne; de Lajudie, Philippe; Prin, Yves; Neyra, Marc; Gillis, Monique; Boivin-Masson, Catherine; Dreyfus, Bernard

    2001-01-01

    Rhizobia described so far belong to three distinct phylogenetic branches within the α-2 subclass of Proteobacteria. Here we report the discovery of a fourth rhizobial branch involving bacteria of the Methylobacterium genus. Rhizobia isolated from Crotalaria legumes were assigned to a new species, “Methylobacterium nodulans,” within the Methylobacterium genus on the basis of 16S ribosomal DNA analyses. We demonstrated that these rhizobia facultatively grow on methanol, which is a characteristic of Methylobacterium spp. but a unique feature among rhizobia. Genes encoding two key enzymes of methylotrophy and nodulation, the mxaF gene, encoding the α subunit of the methanol dehydrogenase, and the nodA gene, encoding an acyltransferase involved in Nod factor biosynthesis, were sequenced for the type strain, ORS2060. Plant tests and nodA amplification assays showed that “M. nodulans” is the only nodulating Methylobacterium sp. identified so far. Phylogenetic sequence analysis showed that “M. nodulans” NodA is closely related to Bradyrhizobium NodA, suggesting that this gene was acquired by horizontal gene transfer. PMID:11114919

  7. Chromosomal structures and repetitive sequences divergence in Cucumis species revealed by comparative cytogenetic mapping.

    PubMed

    Zhang, Yunxia; Cheng, Chunyan; Li, Ji; Yang, Shuqiong; Wang, Yunzhu; Li, Ziang; Chen, Jinfeng; Lou, Qunfeng

    2015-09-25

    Differentiation and copy number of repetitive sequences affect directly chromosome structure which contributes to reproductive isolation and speciation. Comparative cytogenetic mapping has been verified an efficient tool to elucidate the differentiation and distribution of repetitive sequences in genome. In present study, the distinct chromosomal structures of five Cucumis species were revealed through genomic in situ hybridization (GISH) technique and comparative cytogenetic mapping of major satellite repeats. Chromosome structures of five Cucumis species were investigated using GISH and comparative mapping of specific satellites. Southern hybridization was employed to study the proliferation of satellites, whose structural characteristics were helpful for analyzing chromosome evolution. Preferential distribution of repetitive DNAs at the subtelomeric regions was found in C. sativus, C hystrix and C. metuliferus, while majority was positioned at the pericentromeric heterochromatin regions in C. melo and C. anguria. Further, comparative GISH (cGISH) through using genomic DNA of other species as probes revealed high homology of repeats between C. sativus and C. hystrix. Specific satellites including 45S rDNA, Type I/II, Type III, Type IV, CentM and telomeric repeat were then comparatively mapped in these species. Type I/II and Type IV produced bright signals at the subtelomeric regions of C. sativus and C. hystrix simultaneously, which might explain the significance of their amplification in the divergence of Cucumis subgenus from the ancient ancestor. Unique positioning of Type III and CentM only at the centromeric domains of C. sativus and C. melo, respectively, combining with unique southern bands, revealed rapid evolutionary patterns of centromeric DNA in Cucumis. Obvious interstitial telomeric repeats were observed in chromosomes 1 and 2 of C. sativus, which might provide evidence of the fusion hypothesis of chromosome evolution from x = 12 to x = 7 in Cucumis species. Besides, the significant correlation was found between gene density along chromosome and GISH band intensity in C. sativus and C. melo. In summary, comparative cytogenetic mapping of major satellites and GISH revealed the distinct differentiation of chromosome structure during species formation. The evolution of repetitive sequences was the main force for the divergence of Cucumis species from common ancestor.

  8. Microgravity

    NASA Image and Video Library

    1998-12-01

    Type II restriction enzymes, such as Eco R1 endonulease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence -specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how do these proteins discriminate between different DNA sequences especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals

  9. Protein Crystal Eco R1 Endonulease-DNA Complex

    NASA Technical Reports Server (NTRS)

    1998-01-01

    Type II restriction enzymes, such as Eco R1 endonulease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence -specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how do these proteins discriminate between different DNA sequences especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals

  10. The central nervous system transcriptome of the weakly electric brown ghost knifefish (Apteronotus leptorhynchus): de novo assembly, annotation, and proteomics validation.

    PubMed

    Salisbury, Joseph P; Sîrbulescu, Ruxandra F; Moran, Benjamin M; Auclair, Jared R; Zupanc, Günther K H; Agar, Jeffrey N

    2015-03-11

    The brown ghost knifefish (Apteronotus leptorhynchus) is a weakly electric teleost fish of particular interest as a versatile model system for a variety of research areas in neuroscience and biology. The comprehensive information available on the neurophysiology and neuroanatomy of this organism has enabled significant advances in such areas as the study of the neural basis of behavior, the development of adult-born neurons in the central nervous system and their involvement in the regeneration of nervous tissue, as well as brain aging and senescence. Despite substantial scientific interest in this species, no genomic resources are currently available. Here, we report the de novo assembly and annotation of the A. leptorhynchus transcriptome. After evaluating several trimming and transcript reconstruction strategies, de novo assembly using Trinity uncovered 42,459 unique contigs containing at least a partial protein-coding sequence based on alignment to a reference set of known Actinopterygii sequences. As many as 11,847 of these contigs contained full or near-full length protein sequences, providing broad coverage of the proteome. A variety of non-coding RNA sequences were also identified and annotated, including conserved long intergenic non-coding RNA and other long non-coding RNA observed previously to be expressed in adult zebrafish (Danio rerio) brain, as well as a variety of miRNA, snRNA, and snoRNA. Shotgun proteomics confirmed translation of open reading frames from over 2,000 transcripts, including alternative splice variants. Assignment of tandem mass spectra was greatly improved by use of the assembly compared to databases of sequences from closely related organisms. The assembly and raw reads have been deposited at DDBJ/EMBL/GenBank under the accession number GBKR00000000. Tandem mass spectrometry data is available via ProteomeXchange with identifier PXD001285. Presented here is the first release of an annotated de novo transcriptome assembly from Apteronotus leptorhynchus, providing a broad overview of RNA expressed in central nervous system tissue. The assembly, which includes substantial coverage of a wide variety of both protein coding and non-coding transcripts, will allow the development of better tools to understand the mechanisms underlying unique characteristics of the knifefish model system, such as their tremendous regenerative capacity and negligible brain senescence.

  11. Transposon Variants and Their Effects on Gene Expression in Arabidopsis

    PubMed Central

    Wang, Xi; Weigel, Detlef; Smith, Lisa M.

    2013-01-01

    Transposable elements (TEs) make up the majority of many plant genomes. Their transcription and transposition is controlled through siRNAs and epigenetic marks including DNA methylation. To dissect the interplay of siRNA–mediated regulation and TE evolution, and to examine how TE differences affect nearby gene expression, we investigated genome-wide differences in TEs, siRNAs, and gene expression among three Arabidopsis thaliana accessions. Both TE sequence polymorphisms and presence of linked TEs are positively correlated with intraspecific variation in gene expression. The expression of genes within 2 kb of conserved TEs is more stable than that of genes next to variant TEs harboring sequence polymorphisms. Polymorphism levels of TEs and closely linked adjacent genes are positively correlated as well. We also investigated the distribution of 24-nt-long siRNAs, which mediate TE repression. TEs targeted by uniquely mapping siRNAs are on average farther from coding genes, apparently because they more strongly suppress expression of adjacent genes. Furthermore, siRNAs, and especially uniquely mapping siRNAs, are enriched in TE regions missing in other accessions. Thus, targeting by uniquely mapping siRNAs appears to promote sequence deletions in TEs. Overall, our work indicates that siRNA–targeting of TEs may influence removal of sequences from the genome and hence evolution of gene expression in plants. PMID:23408902

  12. Magnetic suspension actuator concepts and applications

    NASA Technical Reports Server (NTRS)

    Kroeger, John

    1993-01-01

    The fundamental aspect which makes magnetic suspension systems possible is the magnetic phenomena by which significant forces can be generated. Each of these force-producing phenomena has unique characteristics and is implementable in a unique fashion, such that each performs the magnetic suspension task differently than the others. A practical overview of the force-producing concepts, their unique characteristics, and their typical methods of application is provided.

  13. Identification of Group G Streptococcal Isolates from Companion Animals in Japan and Their Antimicrobial Resistance Patterns.

    PubMed

    Tsuyuki, Yuzo; Kurita, Goro; Murata, Yoshiteru; Goto, Mieko; Takahashi, Takashi

    2017-07-24

    In this study, we conducted a species-level identification of group G streptococcal (GGS) isolates from companion animals in Japan and analyzed antimicrobial resistance (AMR) patterns. Strains were isolated from sterile and non-sterile specimens collected from 72 animals with clinical signs or symptoms in April-May, 2015. We identified the strain by 16S rRNA sequencing, mass spectrometry (MS), and an automated method based on their biochemical properties. Antimicrobial susceptibility was determined using the broth microdilution method and E-test. AMR determinants (erm(A), erm(B), mef(A), tet(M), tet(O), tet(K), tet(L), and tet(S)) in corresponding resistant isolates were amplified by PCR. The 16S rRNA sequencing identified the GGS species as Streptococcus canis (n = 68), Streptococcus dysgalactiae subsp. equisimilis (n = 3), and S. dysgalactiae subsp. dysgalactiae (n = 1). However, there were discrepancies between the sequencing data and both the MS and automated identification data. MS and the automated biochemical technique identified 18 and 37 of the 68 sequencing-identified S. canis strains, respectively. The AMR rates were 20.8% for tetracycline and 5.6% for clarithromycin, with minimum inhibitory concentrations (MIC) 50 -MIC 90 of 2-64 and ≤ 0.12-0.25μg/mL, respectively. AMR genotyping showed single or combined genotypes: erm(B) or tet(M)-tet(O)-tet(S). Our findings show the unique characteristics of GGS isolates from companion animals in Japan in terms of species-level identification and AMR patterns.

  14. GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

    PubMed Central

    Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

    2011-01-01

    GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647

  15. Longitudinal Analysis of Cerebrospinal Fluid and Plasma HIV-1 Envelope Sequences Isolated From a Single Donor with HIV Asymptomatic Neurocognitive Impairment.

    PubMed

    Vázquez-Santiago, Fabián; García, Yashira; Rivera-Román, Ivelisse; Noel, Richard J; Wojna, Valerie; Meléndez, Loyda M; Rivera-Amill, Vanessa

    Combined antiretroviral treatment (cART) has changed the clinical presentation of HIV-associated neurocognitive disorders (HAND) to that of the milder forms of the disease. Asymptomatic neurocognitive impairment (ANI) is now more prevalent and is associated with increased morbidity and mortality risk in HIV-1-infected people. HIV-1 envelope ( env ) genetic heterogeneity has been detected within the central nervous system (CNS) of individuals with ANI. Changes within env determine co-receptor use, cellular tropism, and neuropathogenesis. We hypothesize that compartmental changes are associated with HIV-1 env C2V4 during ANI and sought to analyze paired HIV-1 env sequences from plasma and cerebrospinal fluid (CSF) of a female subject undergoing long-term cART. Paired plasma and CSF samples were collected at 12-month intervals and HIV-1 env C2V4 was cloned and sequenced. Phylogenetic analysis of paired samples consistently showed genetic variants unique to the CSF. Phenotypic prediction showed CCR5 (R5) variants for all CSF-derived sequences and showed minor X4 variants (or dual-tropic) in the plasma at later time points. Viral compartmentalization was evident throughout the study, suggesting that the occurrence of distinctive env strains may contribute to the neuropathogenesis of HAND. Our study provides new insights about the genetic characteristics within the C2V4 of HIV-1 env that persist after long-term cART and during the course of persistent ANI.

  16. Computational approaches for identification of conserved/unique binding pockets in the A chain of ricin

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ecale Zhou, C L; Zemla, A T; Roe, D

    2005-01-29

    Specific and sensitive ligand-based protein detection assays that employ antibodies or small molecules such as peptides, aptamers, or other small molecules require that the corresponding surface region of the protein be accessible and that there be minimal cross-reactivity with non-target proteins. To reduce the time and cost of laboratory screening efforts for diagnostic reagents, we developed new methods for evaluating and selecting protein surface regions for ligand targeting. We devised combined structure- and sequence-based methods for identifying 3D epitopes and binding pockets on the surface of the A chain of ricin that are conserved with respect to a set ofmore » ricin A chains and unique with respect to other proteins. We (1) used structure alignment software to detect structural deviations and extracted from this analysis the residue-residue correspondence, (2) devised a method to compare corresponding residues across sets of ricin structures and structures of closely related proteins, (3) devised a sequence-based approach to determine residue infrequency in local sequence context, and (4) modified a pocket-finding algorithm to identify surface crevices in close proximity to residues determined to be conserved/unique based on our structure- and sequence-based methods. In applying this combined informatics approach to ricin A we identified a conserved/unique pocket in close proximity (but not overlapping) the active site that is suitable for bi-dentate ligand development. These methods are generally applicable to identification of surface epitopes and binding pockets for development of diagnostic reagents, therapeutics, and vaccines.« less

  17. DNA Barcoding in the Cycadales: Testing the Potential of Proposed Barcoding Markers for Species Identification of Cycads

    PubMed Central

    Sass, Chodon; Little, Damon P.; Stevenson, Dennis Wm.; Specht, Chelsea D.

    2007-01-01

    Barcodes are short segments of DNA that can be used to uniquely identify an unknown specimen to species, particularly when diagnostic morphological features are absent. These sequences could offer a new forensic tool in plant and animal conservation—especially for endangered species such as members of the Cycadales. Ideally, barcodes could be used to positively identify illegally obtained material even in cases where diagnostic features have been purposefully removed or to release confiscated organisms into the proper breeding population. In order to be useful, a DNA barcode sequence must not only easily PCR amplify with universal or near-universal reaction conditions and primers, but also contain enough variation to generate unique identifiers at either the species or population levels. Chloroplast regions suggested by the Plant Working Group of the Consortium for the Barcode of Life (CBoL), and two alternatives, the chloroplast psbA-trnH intergenic spacer and the nuclear ribosomal internal transcribed spacer (nrITS), were tested for their utility in generating unique identifiers for members of the Cycadales. Ease of amplification and sequence generation with universal primers and reaction conditions was determined for each of the seven proposed markers. While none of the proposed markers provided unique identifiers for all species tested, nrITS showed the most promise in terms of variability, although sequencing difficulties remain a drawback. We suggest a workflow for DNA barcoding, including database generation and management, which will ultimately be necessary if we are to succeed in establishing a universal DNA barcode for plants. PMID:17987130

  18. MySSP: Non-stationary evolutionary sequence simulation, including indels

    PubMed Central

    Rosenberg, Michael S.

    2007-01-01

    MySSP is a new program for the simulation of DNA sequence evolution across a phylogenetic tree. Although many programs are available for sequence simulation, MySSP is unique in its inclusion of indels, flexibility in allowing for non-stationary patterns, and output of ancestral sequences. Some of these features can individually be found in existing programs, but have not all have been previously available in a single package. PMID:19325855

  19. Degree sequence in message transfer

    NASA Astrophysics Data System (ADS)

    Yamuna, M.

    2017-11-01

    Message encryption is always an issue in current communication scenario. Methods are being devised using various domains. Graphs satisfy numerous unique properties which can be used for message transfer. In this paper, I propose a message encryption method based on degree sequence of graphs.

  20. Full genome sequence of Rocio virus reveal substantial variations from the prototype Rocio virus SPH 34675 sequence.

    PubMed

    Setoh, Yin Xiang; Amarilla, Alberto A; Peng, Nias Y; Slonchak, Andrii; Periasamy, Parthiban; Figueiredo, Luiz T M; Aquino, Victor H; Khromykh, Alexander A

    2018-01-01

    Rocio virus (ROCV) is an arbovirus belonging to the genus Flavivirus, family Flaviviridae. We present an updated sequence of ROCV strain SPH 34675 (GenBank: AY632542.4), the only available full genome sequence prior to this study. Using next-generation sequencing of the entire genome, we reveal substantial sequence variation from the prototype sequence, with 30 nucleotide differences amounting to 14 amino acid changes, as well as significant changes to predicted 3'UTR RNA structures. Our results present an updated and corrected sequence of a potential emerging human-virulent flavivirus uniquely indigenous to Brazil (GenBank: MF461639).

  1. Rotary pin-in-maze discriminator

    DOEpatents

    Benavides, G.L.

    1997-05-06

    A discriminator apparatus and method that discriminates between a unique signal and any other (incorrect) signal are disclosed. The unique signal is a sequence of events; each event can assume one of two possible event states. Given the unique signal, a maze wheel is allowed to rotate fully in one direction. Given an incorrect signal, both the maze wheel and a pin wheel lock in position. 4 figs.

  2. nuID: a universal naming scheme of oligonucleotides for Illumina, Affymetrix, and other microarrays

    PubMed Central

    Du, Pan; Kibbe, Warren A; Lin, Simon M

    2007-01-01

    Background Oligonucleotide probes that are sequence identical may have different identifiers between manufacturers and even between different versions of the same company's microarray; and sometimes the same identifier is reused and represents a completely different oligonucleotide, resulting in ambiguity and potentially mis-identification of the genes hybridizing to that probe. Results We have devised a unique, non-degenerate encoding scheme that can be used as a universal representation to identify an oligonucleotide across manufacturers. We have named the encoded representation 'nuID', for nucleotide universal identifier. Inspired by the fact that the raw sequence of the oligonucleotide is the true definition of identity for a probe, the encoding algorithm uniquely and non-degenerately transforms the sequence itself into a compact identifier (a lossless compression). In addition, we added a redundancy check (checksum) to validate the integrity of the identifier. These two steps, encoding plus checksum, result in an nuID, which is a unique, non-degenerate, permanent, robust and efficient representation of the probe sequence. For commercial applications that require the sequence identity to be confidential, we have an encryption schema for nuID. We demonstrate the utility of nuIDs for the annotation of Illumina microarrays, and we believe it has universal applicability as a source-independent naming convention for oligomers. Reviewers This article was reviewed by Itai Yanai, Rong Chen (nominated by Mark Gerstein), and Gregory Schuler (nominated by David Lipman). PMID:17540033

  3. Cloning of the anhidrotic ectodermal dysplasia gene: Identification of cDNAs associated with CpG islands mapped near translocation breakpoint in two female patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Srivastava, A.K.; Schlessinger, D.; Kere, J.

    1994-09-01

    The gene for the X chromosomal developmental disorder anhidrotic ectodermal dysplasia (EDA) has been mapped to Xq12-q13 by linkage analysis and is expressed in a few females with chromosomal translocations involving band Xq12-q13. A yeast artificial chromosome (YAC) contig (2.0 Mb) spanning two translocation breakpoints has been assembled by sequence-tagged site (STS)-based chromosomal walking. The two translocation breakpoints (X:autosome translocations from the affected female patients) have been mapped less than 60 kb apart within a YAC contig. Unique probes and intragenic STSs (mapped between the two translocations) have been developed and a somatic cell hybrid carrying the translocated X chromosomemore » from the AK patient has been analyzed by isolating unique probes that span the breakpoint. Several STSs made from intragenic sequences have been found to be conserved in mouse, hamster and monkey, but we have detected no mRNAs in a number of tissues tested. However, a probe and STS developed from the DNA spanning the AK breakpoint is conserved in mouse, hamster and monkey, and we have detected expressed sequences in skin cells and cDNA libraries. In addition, unique sequences have been obtained from two CpG islands in the region that maps proximal to the breakpoints. cDNAs containing these sequences are being studied as candidates for the gene affected in the etiology of EDA.« less

  4. Recombinatorial biases and convergent recombination determine interindividual TCRβ sharing in murine thymocytes.

    PubMed

    Li, Hanjie; Ye, Congting; Ji, Guoli; Wu, Xiaohui; Xiang, Zhe; Li, Yuanyue; Cao, Yonghao; Liu, Xiaolong; Douek, Daniel C; Price, David A; Han, Jiahuai

    2012-09-01

    Overlap of TCR repertoires among individuals provides the molecular basis for public T cell responses. By deep-sequencing the TCRβ repertoires of CD4+CD8+ thymocytes from three individual mice, we observed that a substantial degree of TCRβ overlap, comprising ∼10-15% of all unique amino acid sequences and ∼5-10% of all unique nucleotide sequences across any two individuals, is already present at this early stage of T cell development. The majority of TCRβ sharing between individual thymocyte repertoires could be attributed to the process of convergent recombination, with additional contributions likely arising from recombinatorial biases; the role of selection during intrathymic development was negligible. These results indicate that the process of TCR gene recombination is the major determinant of clonotype sharing between individuals.

  5. The Physics and Mathematics of MRI

    NASA Astrophysics Data System (ADS)

    Ansorge, Richard; Graves, Martin

    2016-10-01

    Magnetic Resonance Imaging is a very important clinical imaging tool. It combines different fields of physics and engineering in a uniquely complex way. MRI is also surprisingly versatile, `pulse sequences' can be designed to yield many different types of contrast. This versatility is unique to MRI. This short book gives both an in depth account of the methods used for the operation and construction of modern MRI systems and also the principles of sequence design and many examples of applications. An important additional feature of this book is the detailed discussion of the mathematical principles used in building optimal MRI systems and for sequence design. The mathematical discussion is very suitable for undergraduates attending medical physics courses. It is also more complete than usually found in alternative books for physical scientists or more clinically orientated works.

  6. Pair-barcode high-throughput sequencing for large-scale multiplexed sample analysis

    PubMed Central

    2012-01-01

    Background The multiplexing becomes the major limitation of the next-generation sequencing (NGS) in application to low complexity samples. Physical space segregation allows limited multiplexing, while the existing barcode approach only permits simultaneously analysis of up to several dozen samples. Results Here we introduce pair-barcode sequencing (PBS), an economic and flexible barcoding technique that permits parallel analysis of large-scale multiplexed samples. In two pilot runs using SOLiD sequencer (Applied Biosystems Inc.), 32 independent pair-barcoded miRNA libraries were simultaneously discovered by the combination of 4 unique forward barcodes and 8 unique reverse barcodes. Over 174,000,000 reads were generated and about 64% of them are assigned to both of the barcodes. After mapping all reads to pre-miRNAs in miRBase, different miRNA expression patterns are captured from the two clinical groups. The strong correlation using different barcode pairs and the high consistency of miRNA expression in two independent runs demonstrates that PBS approach is valid. Conclusions By employing PBS approach in NGS, large-scale multiplexed pooled samples could be practically analyzed in parallel so that high-throughput sequencing economically meets the requirements of samples which are low sequencing throughput demand. PMID:22276739

  7. Pair-barcode high-throughput sequencing for large-scale multiplexed sample analysis.

    PubMed

    Tu, Jing; Ge, Qinyu; Wang, Shengqin; Wang, Lei; Sun, Beili; Yang, Qi; Bai, Yunfei; Lu, Zuhong

    2012-01-25

    The multiplexing becomes the major limitation of the next-generation sequencing (NGS) in application to low complexity samples. Physical space segregation allows limited multiplexing, while the existing barcode approach only permits simultaneously analysis of up to several dozen samples. Here we introduce pair-barcode sequencing (PBS), an economic and flexible barcoding technique that permits parallel analysis of large-scale multiplexed samples. In two pilot runs using SOLiD sequencer (Applied Biosystems Inc.), 32 independent pair-barcoded miRNA libraries were simultaneously discovered by the combination of 4 unique forward barcodes and 8 unique reverse barcodes. Over 174,000,000 reads were generated and about 64% of them are assigned to both of the barcodes. After mapping all reads to pre-miRNAs in miRBase, different miRNA expression patterns are captured from the two clinical groups. The strong correlation using different barcode pairs and the high consistency of miRNA expression in two independent runs demonstrates that PBS approach is valid. By employing PBS approach in NGS, large-scale multiplexed pooled samples could be practically analyzed in parallel so that high-throughput sequencing economically meets the requirements of samples which are low sequencing throughput demand.

  8. Unique nucleotide sequence-guided assembly of repetitive DNA parts for synthetic biology applications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Torella, JP; Lienert, F; Boehm, CR

    2014-08-07

    Recombination-based DNA construction methods, such as Gibson assembly, have made it possible to easily and simultaneously assemble multiple DNA parts, and they hold promise for the development and optimization of metabolic pathways and functional genetic circuits. Over time, however, these pathways and circuits have become more complex, and the increasing need for standardization and insulation of genetic parts has resulted in sequence redundancies-for example, repeated terminator and insulator sequences-that complicate recombination-based assembly. We and others have recently developed DNA assembly methods, which we refer to collectively as unique nucleotide sequence (UNS)-guided assembly, in which individual DNA parts are flanked withmore » UNSs to facilitate the ordered, recombination-based assembly of repetitive sequences. Here we present a detailed protocol for UNS-guided assembly that enables researchers to convert multiple DNA parts into sequenced, correctly assembled constructs, or into high-quality combinatorial libraries in only 2-3 d. If the DNA parts must be generated from scratch, an additional 2-5 d are necessary. This protocol requires no specialized equipment and can easily be implemented by a student with experience in basic cloning techniques.« less

  9. Unique nucleotide sequence (UNS)-guided assembly of repetitive DNA parts for synthetic biology applications

    PubMed Central

    Torella, Joseph P.; Lienert, Florian; Boehm, Christian R.; Chen, Jan-Hung; Way, Jeffrey C.; Silver, Pamela A.

    2016-01-01

    Recombination-based DNA construction methods, such as Gibson assembly, have made it possible to easily and simultaneously assemble multiple DNA parts and hold promise for the development and optimization of metabolic pathways and functional genetic circuits. Over time, however, these pathways and circuits have become more complex, and the increasing need for standardization and insulation of genetic parts has resulted in sequence redundancies — for example repeated terminator and insulator sequences — that complicate recombination-based assembly. We and others have recently developed DNA assembly methods that we refer to collectively as unique nucleotide sequence (UNS)-guided assembly, in which individual DNA parts are flanked with UNSs to facilitate the ordered, recombination-based assembly of repetitive sequences. Here we present a detailed protocol for UNS-guided assembly that enables researchers to convert multiple DNA parts into sequenced, correctly-assembled constructs, or into high-quality combinatorial libraries in only 2–3 days. If the DNA parts must be generated from scratch, an additional 2–5 days are necessary. This protocol requires no specialized equipment and can easily be implemented by a student with experience in basic cloning techniques. PMID:25101822

  10. Seismicity and source spectra analysis in Salton Sea Geothermal Field

    NASA Astrophysics Data System (ADS)

    Cheng, Y.; Chen, X.

    2016-12-01

    The surge of "man-made" earthquakes in recent years has led to considerable concerns about the associated hazards. Improved monitoring of small earthquakes would significantly help understand such phenomena and the underlying physical mechanisms. In the Salton Sea Geothermal field in southern California, open access of a local borehole network provides a unique opportunity to better understand the seismicity characteristics, the related earthquake hazards, and the relationship with the geothermal system, tectonic faulting and other physical conditions. We obtain high-resolution earthquake locations in the Salton Sea Geothermal Field, analyze characteristics of spatiotemporal isolated earthquake clusters, magnitude-frequency distributions and spatial variation of stress drops. The analysis reveals spatial coherent distributions of different types of clustering, b-value distributions, and stress drop distribution. The mixture type clusters (short-duration rapid bursts with high aftershock productivity) are predominately located within active geothermal field that correlate with high b-value, low stress drop microearthquake clouds, while regular aftershock sequences and swarms are distributed throughout the study area. The differences between earthquakes inside and outside of geothermal operation field suggest a possible way to distinguish directly induced seismicity due to energy operation versus typical seismic slip driven sequences. The spatial coherent b-value distribution enables in-situ estimation of probabilities for M≥3 earthquakes, and shows that the high large-magnitude-event (LME) probability zones with high stress drop are likely associated with tectonic faulting. The high stress drop in shallow (1-3 km) depth indicates the existence of active faults, while low stress drops near injection wells likely corresponds to the seismic response to fluid injection. I interpret the spatial variation of seismicity and source characteristics as the result of fluid circulation, the fracture network, and tectonic faulting.

  11. Palaeoclimate characteristics in interior Siberia of MIS 6-2: first insights from the Batagay permafrost mega-thaw slump in the Yana Highlands

    NASA Astrophysics Data System (ADS)

    Ashastina, Kseniia; Schirrmeister, Lutz; Fuchs, Margret; Kienast, Frank

    2017-07-01

    Syngenetic permafrost deposits formed extensively on and around the arising Beringian subcontinent during the Late Pleistocene sea level lowstands. Syngenetic deposition implies that all material, both mineral and organic, freezes parallel to sedimentation and remains frozen until degradation of the permafrost. Permafrost is therefore a unique archive of Late Pleistocene palaeoclimate. Most studied permafrost outcrops are situated in the coastal lowlands of northeastern Siberia; inland sections are, however, scarcely available. Here, we describe the stratigraphical, cryolithological, and geochronological characteristics of a permafrost sequence near Batagay in the Siberian Yana Highlands, the interior of the Sakha Republic (Yakutia), Russia, with focus on the Late Pleistocene Yedoma ice complex (YIC). The recently formed Batagay mega-thaw slump exposes permafrost deposits to a depth of up to 80 m and gives insight into a climate record close to Verkhoyansk, which has the most severe continental climate in the Northern Hemisphere. Geochronological dating (optically stimulated luminescence, OSL, and 14C ages) and stratigraphic implications delivered a temporal frame from the Middle Pleistocene to the Holocene for our sedimentological interpretations and also revealed interruptions in the deposition. The sequence of lithological units indicates a succession of several distinct climate phases: a Middle Pleistocene ice complex indicates cold stage climate. Then, ice wedge growth stopped due to highly increased sedimentation rates and eventually a rise in temperature. Full interglacial climate conditions existed during accumulation of an organic-rich layer - plant macrofossils reflected open forest vegetation existing under dry conditions during Marine Isotope Stage (MIS) 5e. The Late Pleistocene YIC (MIS 4-MIS 2) suggests severe cold-stage climate conditions. No alas deposits, potentially indicating thermokarst processes, were detected at the site. A detailed comparison of the permafrost deposits exposed in the Batagay thaw slump with well-studied permafrost sequences, both coastal and inland, is made to highlight common features and differences in their formation processes and palaeoclimatic histories. Fluvial and lacustrine influence is temporarily common in the majority of permafrost exposures, but has to be excluded for the Batagay sequence. We interpret the characteristics of permafrost deposits at this location as a result of various climatically induced processes that are partly seasonally controlled. Nival deposition might have been dominant during winter time, whereas proluvial and aeolian deposition could have prevailed during the snowmelt period and the dry summer season.

  12. MacoNPV baculovirus midgut-specific gene expression during infection of the bertha armyworm, Mamestra configurata

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Donly, B. Cameron, E-mail: Cam.Donly@agr.gc.ca

    Baculoviruses have two forms, occlusion derived virus (ODV) which is responsible for primary infection in host midgut tissue and budded virus (BV), which infects all other host tissues during secondary infection. This study examined the primary infection by ODV of midgut cells of bertha armyworm Mamestra configurata fourth instar larvae and measured the expression of viral genes over a time course of infection. Both digital PCR and RNA sequencing methods showed the profile of transcription to be different from those produced by AcMNPV BV infection of in vitro cell cultures. This included having unique collections of genes expressed early, asmore » well as much greater late gene expression of p6.9 and much reduced expression of polh and p10. These differences likely reflect characteristics unique to the critical step of in vivo midgut cell infection, and provide insights into the processes that regulate viral gene expression in different host tissues. -- Highlights: •The transcriptome of MacoNPV ODV in larval midgut was measured by RNA-seq and digital PCR. •The earliest genes expressed included fusion protein, hoar, and me53. •p6.9 was highly expressed late but polH and p10 were less so. •These patterns are unique from BV of other baculoviruses in tissue culture cells.« less

  13. Identification of Novel Virulence Determinants in Mycobacterium paratuberculosis by Screening a Library of Insertional Mutants†

    PubMed Central

    Shin, Sung Jae; Wu, Chia-wei; Steinberg, Howard; Talaat, Adel M.

    2006-01-01

    Johne's disease, caused by Mycobacterium paratuberculosis infection, is a worldwide problem for the dairy industry and has a possible involvement in Crohn's disease in humans. To identify virulence determinants of this economically important pathogen, a library of 5,060 transposon mutants was constructed using Tn5367 insertion mutagenesis, followed by large-scale sequencing to identify disrupted genes. In this report, 1,150 mutants were analyzed and 970 unique insertion sites were identified. Sequence analysis of the disrupted genes indicated that the insertion of Tn5367 was more prevalent in genomic regions with G+C content (50.5 to 60.5%) lower than the average G+C content (69.3%) of the rest of the genome. Phenotypic screening of the library identified disruptions of genes involved in iron, tryptophan, or mycolic acid metabolic pathways that displayed unique growth characteristics. Bioinformatic analysis of disrupted genes identified a list of potential virulence determinants for further testing with animals. Mouse infection studies showed a significant decrease in tissue colonization by mutants with a disruption in the gcpE, pstA, kdpC, papA2, impA, umaA1, or fabG2_2 gene. Attenuation phenotypes were tissue specific (e.g., for the umaA1 mutant) as well as time specific (e.g., for the impA mutant), suggesting that those genes may be involved in different virulence mechanisms. The identified potential virulence determinants represent novel functional classes that could be necessary for mycobacterial survival during infection and could provide suitable targets for vaccine and drug development against Johne's and Crohn's diseases. PMID:16790754

  14. Characterization of brain tumours with spin-spin relaxation: pilot case study reveals unique T 2 distribution profiles of glioblastoma, oligodendroglioma and meningioma.

    PubMed

    Laule, Cornelia; Bjarnason, Thorarin A; Vavasour, Irene M; Traboulsee, Anthony L; Wayne Moore, G R; Li, David K B; MacKay, Alex L

    2017-11-01

    Prolonged spin-spin relaxation times in tumour tissue have been observed since some of the earliest nuclear magnetic resonance investigations of the brain. Over the last three decades, numerous studies have sought to characterize tumour morphology and malignancy using quantitative assessment of T 2 relaxation times, although attempts to categorize and differentiate tumours have had limited success. However, previous work must be interpreted with caution as relaxation data were typically acquired using a variety of multiple echo sequences with a range of echoes and T 2 decay curves and were frequently fit with monoexponential analysis. We defined the distribution of T 2 components in three different human brain tumours (glioblastoma, oligodendroglioma, meningioma) using a multi-echo sequence with a greater number of echoes and a longer acquisition window than previously used (48 echoes, data collection out to 1120 ms) with no a priori assumptions about the number of exponential components contributing to the T 2 decay. T 2 relaxation times were increased in tumour tissue and each tumour showed a distinct T 2 distribution profile. Tumours have complex and unique compartmentalization characteristics. Quantitative assessment of T 2 relaxation in brain cancer may be useful in evaluating different grades of brain tumours on the basis of their T 2 distribution profile, and has the potential to be a non-invasive diagnostic tool which may also be useful in monitoring therapy. Further study with a larger sample size and varying grades of tumours is warranted.

  15. Generation and Analysis of Expressed Sequence Tags from Olea europaea L.

    PubMed Central

    Ozdemir Ozgenturk, Nehir; Oruç, Fatma; Sezerman, Ugur; Kuçukural, Alper; Vural Korkut, Senay; Toksoz, Feriha; Un, Cemal

    2010-01-01

    Olive (Olea europaea L.) is an important source of edible oil which was originated in Near-East region. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits for generation of ESTs to discover the novel genes and search the function of unknown genes of olive. The randomly selected 3840 colonies were sequenced for EST collection from both libraries. Readable 2228 sequences for olive leaf and 1506 sequences for olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were designated by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. 635 EST's unique genes sequence have been identified by over 80% homology to known function in other species which were not previously described in Olea family. Only 3.1% of total EST's was shown similarity with olive database existing in NCBI. This generated EST's data and consensus sequences were submitted to NCBI as valuable source for functional genome studies of olive. PMID:21197085

  16. The first genome sequence of a metatherian herpesvirus: Macropodid herpesvirus 1.

    PubMed

    Vaz, Paola K; Mahony, Timothy J; Hartley, Carol A; Fowler, Elizabeth V; Ficorilli, Nino; Lee, Sang W; Gilkerson, James R; Browning, Glenn F; Devlin, Joanne M

    2016-01-22

    While many placental herpesvirus genomes have been fully sequenced, the complete genome of a marsupial herpesvirus has not been described. Here we present the first genome sequence of a metatherian herpesvirus, Macropodid herpesvirus 1 (MaHV-1). The MaHV-1 viral genome was sequenced using an Illumina MiSeq sequencer, de novo assembly was performed and the genome was annotated. The MaHV-1 genome was 140 kbp in length and clustered phylogenetically with the primate simplexviruses, sharing 67% nucleotide sequence identity with Human herpesviruses 1 and 2. The MaHV-1 genome contained 66 predicted open reading frames (ORFs) homologous to those in other herpesvirus genomes, but lacked homologues of UL3, UL4, UL56 and glycoprotein J. This is the first alphaherpesvirus genome that has been found to lack the UL3 and UL4 homologues. We identified six novel ORFs and confirmed their transcription by RT-PCR. This is the first genome sequence of a herpesvirus that infects metatherians, a taxonomically unique mammalian clade. Members of the Simplexvirus genus are remarkably conserved, so the absence of ORFs otherwise retained in eutherian and avian alphaherpesviruses contributes to our understanding of the Alphaherpesvirinae. Further study of metatherian herpesvirus genetics and pathogenesis provides a unique approach to understanding herpesvirus-mammalian interactions.

  17. DNABIT Compress - Genome compression algorithm.

    PubMed

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress" for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that "DNABIT Compress" algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases.

  18. Fabrication of a New Lineage of Artificial Luciferases from Natural Luciferase Pools.

    PubMed

    Kim, Sung Bae; Nishihara, Ryo; Citterio, Daniel; Suzuki, Koji

    2017-09-11

    The fabrication of artificial luciferases (ALucs) with unique optical properties has a fundamental impact on bioassays and molecular imaging. In this study, we developed a new lineage of ALucs with unique substrate preferences by extracting consensus amino acids from the alignment of 25 copepod luciferase sequences available in natural luciferase pools. The primary sequence was first created with a sequence logo generator resulting in a total of 11 sibling sequences. Phylogenetic analysis shows that the newly fabricated ALucs form an independent branch, genetically isolated from the natural luciferases, and from a prior series of ALucs produced by our laboratory using a smaller basis set. The new lineage of ALucs were strongly luminescent in living mammalian cells with specific substrate selectivity to native coelenterazine. A single-residue-level comparison of the C-terminal sequences of new ALucs reveals that some amino acids in the C-terminal ends are greatly influential on the optical intensities but limited in the color variance. The success of this approach guides on how to engineer and functionalize marine luciferases for bioluminescence imaging and assays.

  19. Evidence for Interspecies Gene Transfer in the Evolution of 2,4-Dichlorophenoxyacetic Acid Degraders

    PubMed Central

    McGowan, Catherine; Fulthorpe, Roberta; Wright, Alice; Tiedje, J. M.

    1998-01-01

    Small-subunit ribosomal DNA (SSU rDNA) from 20 phenotypically distinct strains of 2,4-dichlorophenoxyacetic acid (2,4-D)-degrading bacteria was partially sequenced, yielding 18 unique strains belonging to members of the alpha, beta, and gamma subgroups of the class Proteobacteria. To understand the origin of 2,4-D degradation in this diverse collection, the first gene in the 2,4-D pathway, tfdA, was sequenced. The sequences fell into three unique classes found in various members of the beta and gamma subgroups of Proteobacteria. None of the α-Proteobacteria yielded tfdA PCR products. A comparison of the dendrogram of the tfdA genes with that of the SSU rDNA genes demonstrated incongruency in phylogenies, and hence 2,4-D degradation must have originated from gene transfer between species. Only those strains with tfdA sequences highly similar to the tfdA sequence of strain JMP134 (tfdA class I) transferred all the 2,4-D genes and conferred the 2,4-D degradation phenotype to a Burkholderia cepacia recipient. PMID:9758850

  20. Rescaled earthquake recurrence time statistics: application to microrepeaters

    NASA Astrophysics Data System (ADS)

    Goltz, Christian; Turcotte, Donald L.; Abaimov, Sergey G.; Nadeau, Robert M.; Uchida, Naoki; Matsuzawa, Toru

    2009-01-01

    Slip on major faults primarily occurs during `characteristic' earthquakes. The recurrence statistics of characteristic earthquakes play an important role in seismic hazard assessment. A major problem in determining applicable statistics is the short sequences of characteristic earthquakes that are available worldwide. In this paper, we introduce a rescaling technique in which sequences can be superimposed to establish larger numbers of data points. We consider the Weibull and log-normal distributions, in both cases we rescale the data using means and standard deviations. We test our approach utilizing sequences of microrepeaters, micro-earthquakes which recur in the same location on a fault. It seems plausible to regard these earthquakes as a miniature version of the classic characteristic earthquakes. Microrepeaters are much more frequent than major earthquakes, leading to longer sequences for analysis. In this paper, we present results for the analysis of recurrence times for several microrepeater sequences from Parkfield, CA as well as NE Japan. We find that, once the respective sequence can be considered to be of sufficient stationarity, the statistics can be well fitted by either a Weibull or a log-normal distribution. We clearly demonstrate this fact by our technique of rescaled combination. We conclude that the recurrence statistics of the microrepeater sequences we consider are similar to the recurrence statistics of characteristic earthquakes on major faults.

  1. Short-term memory stores organized by information domain.

    PubMed

    Noyce, Abigail L; Cestero, Nishmar; Shinn-Cunningham, Barbara G; Somers, David C

    2016-04-01

    Vision and audition have complementary affinities, with vision excelling in spatial resolution and audition excelling in temporal resolution. Here, we investigated the relationships among the visual and auditory modalities and spatial and temporal short-term memory (STM) using change detection tasks. We created short sequences of visual or auditory items, such that each item within a sequence arose at a unique spatial location at a unique time. On each trial, two successive sequences were presented; subjects attended to either space (the sequence of locations) or time (the sequence of inter item intervals) and reported whether the patterns of locations or intervals were identical. Each subject completed blocks of unimodal trials (both sequences presented in the same modality) and crossmodal trials (Sequence 1 visual, Sequence 2 auditory, or vice versa) for both spatial and temporal tasks. We found a strong interaction between modality and task: Spatial performance was best on unimodal visual trials, whereas temporal performance was best on unimodal auditory trials. The order of modalities on crossmodal trials also mattered, suggesting that perceptual fidelity at encoding is critical to STM. Critically, no cost was attributable to crossmodal comparison: In both tasks, performance on crossmodal trials was as good as or better than on the weaker unimodal trials. STM representations of space and time can guide change detection in either the visual or the auditory modality, suggesting that the temporal or spatial organization of STM may supersede sensory-specific organization.

  2. Development of a PCR-based marker utilizing a deletion mutation in the dihydroflavonol 4-reductase (DFR) gene responsible for the lack of anthocyanin production in yellow onions (Allium cepa).

    PubMed

    Kim, Sunggil; Yoo, Kil Sun; Pike, Leonard M

    2005-02-01

    Bulb color in onions (Allium cepa) is an important trait, but the mechanism of color inheritance is poorly understood at the molecular level. A previous study showed that inactivation of the dihydroflavonol 4-reductase (DFR) gene at the transcriptional level resulted in a lack of anthocyanin production in yellow onions. The objectives of the present study were the identification of the critical mutations in the DFR gene (DFR-A) and the development of a PCR-based marker for allelic selection. We report the isolation of two additional DFR homologs (DFR-B and DFR-C). No unique sequences were identified in either DFR homolog, even in the untranslated region (UTR). Both genes shared more than 95% nucleotide sequence identity with the DFR-A gene. To obtain a unique sequence from each gene, we isolated the promoter regions. Sequences of the DFR-A and DFR-B promoters differed completely from one another, except for an approximately 100-bp sequence adjacent to the 5'UTR. It was possible to specifically amplify only the DFR-A gene using primers designed to anneal to the unique promoter region. The sequences of yellow and red DFR-A alleles were the same except for a single base-pair change in the promoter and an approximately 800-bp deletion within the 3' region of the yellow DFR-A allele. This deletion was used to develop a co-dominant PCR-based marker that segregated perfectly with color phenotypes in the F2 population. These results indicate that a deletion mutation in the yellow DFR-A gene results in the lack of anthocyanin production in yellow onions.

  3. Complete nucleotide sequence and genome structure of a Japanese isolate of hibiscus latent Fort Pierce virus, a unique tobamovirus that contains an internal poly(A) region in its 3' end.

    PubMed

    Yoshida, Tetsuya; Kitazawa, Yugo; Komatsu, Ken; Neriya, Yutaro; Ishikawa, Kazuya; Fujita, Naoko; Hashimoto, Masayoshi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2014-11-01

    In this study, we detected a Japanese isolate of hibiscus latent Fort Pierce virus (HLFPV-J), a member of the genus Tobamovirus, in a hibiscus plant in Japan and determined the complete sequence and organization of its genome. HLFPV-J has four open reading frames (ORFs), each of which shares more than 98 % nucleotide sequence identity with those of other HLFPV isolates. Moreover, HLFPV-J contains a unique internal poly(A) region of variable length, ranging from 44 to 78 nucleotides, in its 3'-untranslated region (UTR), as is the case with hibiscus latent Singapore virus (HLSV), another hibiscus-infecting tobamovirus. The length of the HLFPV-J genome was 6431 nucleotides, including the shortest internal poly(A) region. The sequence identities of ORFs 1, 2, 3 and 4 of HLFPV-J to other tobamoviruses were 46.6-68.7, 49.9-70.8, 31.0-70.8 and 39.4-70.1 %, respectively, at the nucleotide level and 39.8-75.0, 43.6-77.8, 19.2-70.4 and 31.2-74.2 %, respectively, at the amino acid level. The 5'- and 3'-UTRs of HLFPV-J showed 24.3-58.6 and 13.0-79.8 % identity, respectively, to other tobamoviruses. In particular, when compared to other tobamoviruses, each ORF and UTR of HLFPV-J showed the highest sequence identity to those of HLSV. Phylogenetic analysis showed that HLFPV-J, other HLFPV isolates and HLSV constitute a malvaceous-plant-infecting tobamovirus cluster. These results indicate that the genomic structure of HLFPV-J has unique features similar to those of HLSV. To our knowledge, this is the first report of the complete genome sequence of HLFPV.

  4. Targeted Capture and High-Throughput Sequencing Using Molecular Inversion Probes (MIPs).

    PubMed

    Cantsilieris, Stuart; Stessman, Holly A; Shendure, Jay; Eichler, Evan E

    2017-01-01

    Molecular inversion probes (MIPs) in combination with massively parallel DNA sequencing represent a versatile, yet economical tool for targeted sequencing of genomic DNA. Several thousand genomic targets can be selectively captured using long oligonucleotides containing unique targeting arms and universal linkers. The ability to append sequencing adaptors and sample-specific barcodes allows large-scale pooling and subsequent high-throughput sequencing at relatively low cost per sample. Here, we describe a "wet bench" protocol detailing the capture and subsequent sequencing of >2000 genomic targets from 192 samples, representative of a single lane on the Illumina HiSeq 2000 platform.

  5. Comparison and quantitative verification of mapping algorithms for whole genome bisulfite sequencing

    USDA-ARS?s Scientific Manuscript database

    Coupling bisulfite conversion with next-generation sequencing (Bisulfite-seq) enables genome-wide measurement of DNA methylation, but poses unique challenges for mapping. However, despite a proliferation of Bisulfite-seq mapping tools, no systematic comparison of their genomic coverage and quantitat...

  6. The Pizza Problem: A Solution with Sequences

    ERIC Educational Resources Information Center

    Shafer, Kathryn G.; Mast, Caleb J.

    2008-01-01

    This article addresses the issues of coaching and assessing. A preservice middle school teacher's unique solution to the Pizza problem was not what the professor expected. The student's solution strategy, based on sequences and a reinvention of Pascal's triangle, is explained in detail. (Contains 8 figures.)

  7. Complete genome sequence of the acetylene-fermenting Pelobacter sp. strain SFB93

    USGS Publications Warehouse

    Sutton, John M.; Baesman, Shaun; Fierst, Janna L.; Poret-Peterson, Amisha T.; Oremland, Ronald S.; Dunlap, Darren S.; Akob, Denise M.

    2017-01-01

    Acetylene fermentation is a rare metabolism that was previously reported as being unique to Pelobacter acetylenicus. Here, we report the genome sequence of Pelobacter sp. strain SFB93, an acetylene-fermenting bacterium isolated from sediments collected in San Francisco Bay, CA.

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chauhan, Archana; Layton, Alice; Williams, Daniel W

    Pseudomonas fluorescens strain HK44 (DSM 6700) is a genetically engineered lux-based bioluminescent bioreporter. Here we report the draft genome sequence of strain HK44. Annotation of {approx}6.1 Mb sequence indicates that 30% of the traits are unique and distributed over 5 genomic islands, a prophage and two plasmids.

  9. RNA Sequencing Analysis of the Gametophyte Transcriptome from the Liverwort, Marchantia polymorpha

    PubMed Central

    Sharma, Niharika; Jung, Chol-Hee; Bhalla, Prem L.; Singh, Mohan B.

    2014-01-01

    The liverwort Marchantia polymorpha is a member of the most basal lineage of land plants (embryophytes) and likely retains many ancestral morphological, physiological and molecular characteristics. Despite its phylogenetic importance and the availability of previous EST studies, M. polymorpha’s lack of economic importance limits accessible genomic resources for this species. We employed Illumina RNA-Seq technology to sequence the gametophyte transcriptome of M. polymorpha. cDNA libraries from 6 different male and female developmental tissues were sequenced to delineate a global view of the M. polymorpha transcriptome. Approximately 80 million short reads were obtained and assembled into a non-redundant set of 46,533 transcripts (> = 200 bp) from 46,070 loci. The average length and the N50 length of the transcripts were 757 bp and 471 bp, respectively. Sequence comparison of assembled transcripts with non-redundant proteins from embryophytes resulted in the annotation of 43% of the transcripts. The transcripts were also compared with M. polymorpha expressed sequence tags (ESTs), and approximately 69.5% of the transcripts appeared to be novel. Twenty-one percent of the transcripts were assigned GO terms to improve annotation. In addition, 6,112 simple sequence repeats (SSRs) were identified as potential molecular markers, which may be useful in studies of genetic diversity. A comparative genomics approach revealed that a substantial proportion of the genes (35.5%) expressed in M. polymorpha were conserved across phylogenetically related species, such as Selaginella and Physcomitrella, and identified 580 genes that are potentially unique to liverworts. Our study presents an extensive amount of novel sequence information for M. polymorpha. This information will serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the isolation and characterization of functional genes that are involved in sex differentiation and sexual reproduction in this liverwort. PMID:24841988

  10. Re-analysis of human immunodeficiency virus type 1 isolates from Cyprus and Greece, initially designated 'subtype I', reveals a unique complex A/G/H/K/? mosaic pattern.

    PubMed

    Paraskevis, D; Magiorkinis, M; Vandamme, A M; Kostrikis, L G; Hatzakis, A

    2001-03-01

    Human immunodeficiency virus type 1 (HIV-1) has been classified into three main groups and 11 distinct subtypes. Moreover, several circulating recombinant forms (CRFs) of HIV-1 have been recently documented to have spread widely causing extensive HIV-1 epidemics. A subtype, initially designated I (CRF04_cpx), was documented in Cyprus and Greece and was found to comprise regions of sequence derived from subtypes A and G as well as regions of unclassified sequence. Re-analysis of the three full-length CRF04_cpx sequences that were available revealed a mosaic genomic organization of unique complexity comprising regions of sequence from at least five distinct subtypes, A, G, H, K and unclassified regions. These strains account for approximately 2% of the total HIV-1-infected population in Greece, thus providing evidence of the great capability of HIV-1 to recombine and produce highly divergent strains which can be spread successfully through different infection routes.

  11. Digital Biological Converter

    DTIC Science & Technology

    2013-06-28

    of cuts that each fragment should be cut into so the fragments are no greater than a specific length threshold. Additionally, vector sequences and...restriction sites are attached to each fragment while ensuring the restriction sites are unique to each sequence. The vector sequences serve as hooks...for assembly into vector for cloning purposes, and also as primer binding domains for PCR ampl ification. The restriction sites are added to

  12. Transcriptome analysis of Bupleurum chinense focusing on genes involved in the biosynthesis of saikosaponins

    PubMed Central

    2011-01-01

    Abstract Background Bupleurum chinense DC. is a widely used traditional Chinese medicinal plant. Saikosaponins are the major bioactive constituents of B. chinense, but relatively little is known about saikosaponin biosynthesis. The 454 pyrosequencing technology provides a promising opportunity for finding novel genes that participate in plant metabolism. Consequently, this technology may help to identify the candidate genes involved in the saikosaponin biosynthetic pathway. Results One-quarter of the 454 pyrosequencing runs produced a total of 195, 088 high-quality reads, with an average read length of 356 bases (NCBI SRA accession SRA039388). A de novo assembly generated 24, 037 unique sequences (22, 748 contigs and 1, 289 singletons), 12, 649 (52.6%) of which were annotated against three public protein databases using a basic local alignment search tool (E-value ≤1e-10). All unique sequences were compared with NCBI expressed sequence tags (ESTs) (237) and encoding sequences (44) from the Bupleurum genus, and with a Sanger-sequenced EST dataset (3, 111). The 23, 173 (96.4%) unique sequences obtained in the present study represent novel Bupleurum genes. The ESTs of genes related to saikosaponin biosynthesis were found to encode known enzymes that catalyze the formation of the saikosaponin backbone; 246 cytochrome P450 (P450s) and 102 glycosyltransferases (GTs) unique sequences were also found in the 454 dataset. Full length cDNAs of 7 P450s and 7 uridine diphosphate GTs (UGTs) were verified by reverse transcriptase polymerase chain reaction or by cloning using 5' and/or 3' rapid amplification of cDNA ends. Two P450s and three UGTs were identified as the most likely candidates involved in saikosaponin biosynthesis. This finding was based on the coordinate up-regulation of their expression with β-AS in methyl jasmonate-treated adventitious roots and on their similar expression patterns with β-AS in various B. chinense tissues. Conclusions A collection of high-quality ESTs for B. chinense obtained by 454 pyrosequencing is provided here for the first time. These data should aid further research on the functional genomics of B. chinense and other Bupleurum species. The candidate genes for enzymes involved in saikosaponin biosynthesis, especially the P450s and UGTs, that were revealed provide a substantial foundation for follow-up research on the metabolism and regulation of the saikosaponins. PMID:22047182

  13. Identification of Entamoeba polecki with Unique 18S rRNA Gene Sequences from Celebes Crested Macaques and Pigs in Tangkoko Nature Reserve, North Sulawesi, Indonesia.

    PubMed

    Tuda, Josef; Feng, Meng; Imada, Mihoko; Kobayashi, Seiki; Cheng, Xunjia; Tachibana, Hiroshi

    2016-09-01

    Unique species of macaques are distributed across Sulawesi Island, Indonesia, and the details of Entamoeba infections in these macaques are unknown. A total of 77 stool samples from Celebes crested macaques (Macaca nigra) and 14 stool samples from pigs were collected in Tangkoko Nature Reserve, North Sulawesi, and the prevalence of Entamoeba infection was examined by PCR. Entamoeba polecki was detected in 97% of the macaques and all of the pigs, but no other Entamoeba species were found. The nucleotide sequence of the 18S rRNA gene in E. polecki from M. nigra was unique and showed highest similarity with E. polecki subtype (ST) 4. This is the first case of identification of E. polecki ST4 from wild nonhuman primates. The sequence of the 18S rRNA gene in E. polecki from pigs was also unique and showed highest similarity with E. polecki ST1. These results suggest that the diversity of the 18S rRNA gene in E. polecki is associated with differences in host species and geographic localization, and that there has been no transmission of E. polecki between macaques and pigs in the study area. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.

  14. A Unique Autothermal Thermophilic Aerobic Digestion Process Showing a Dynamic Transition of Physicochemical and Bacterial Characteristics from the Mesophilic to the Thermophilic Phase.

    PubMed

    Tashiro, Yukihiro; Kanda, Kosuke; Asakura, Yuya; Kii, Toshihiko; Cheng, Huijun; Poudel, Pramod; Okugawa, Yuki; Tashiro, Kosuke; Sakai, Kenji

    2018-03-15

    A unique autothermal thermophilic aerobic digestion (ATAD) process has been used to convert human excreta to liquid fertilizer in Japan. This study investigated the changes in physicochemical and bacterial community characteristics during the full-scale ATAD process operated for approximately 3 weeks in 2 different years. After initiating simultaneous aeration and mixing using an air-inducing circulator (aerator), the temperature autothermally increased rapidly in the first 1 to 2 days with exhaustive oxygen consumption, leading to a drastic decrease and gradual increase in oxidation-reduction potential in the first 2 days, reached >50°C in the middle 4 to 6 days, and remained steady in the final phase. Volatile fatty acids were rapidly consumed and diminished in the first 2 days, whereas the ammonia nitrogen concentration was relatively stable during the process, despite a gradual pH increase to 9.3. Principal-coordinate analysis of 16S rRNA gene amplicons using next-generation sequencing divided the bacterial community structures into distinct clusters corresponding to three phases, and they were similar in the final phase in both years despite different transitions in the middle phase. The predominant phyla (closest species, dominancy) in the initial, middle, and final phases were Proteobacteria ( Arcobacter trophiarum , 19 to 43%; Acinetobacter towneri , 6.3 to 30%), Bacteroidetes ( Moheibacter sediminis , 43 to 54%), and Firmicutes ( Thermaerobacter composti , 11 to 28%; Heliorestis baculata , 2.1 to 16%), respectively. Two predominant operational taxonomic units (OTUs) in the final phase showed very low similarities to the closest species, indicating that the process is unique compared with previously published ones. This unique process with three distinctive phases would be caused by the aerator with complete aeration. IMPORTANCE Although the autothermal thermophilic aerobic digestion (ATAD) process has several advantages, such as a high degradation capacity, a short treatment period, and inactivation of pathogens, one of the factors limiting its broad application is the high electric power consumption for aerators with a full-scale bioreactor. We elucidated the dynamics of the bacterial community structures, as well as the physicochemical characteristics, in the ATAD process with a full-scale bioreactor from human excreta for 3 weeks. Our results indicated that this unique process can be divided into three distinguishable phases by an aerator with complete aeration and showed a possibility of shortening the digestion period to approximately 10 days. This research not only helps to identify which bacteria play significant roles and how the process can be improved and controlled but also demonstrates an efficient ATAD process with less electric power consumption for worldwide application. Copyright © 2018 American Society for Microbiology.

  15. Comparison of the Genome Sequence of the Poultry Pathogen Bordetella avium with Those of B. bronchiseptica, B. pertussis, and B. parapertussis Reveals Extensive Diversity in Surface Structures Associated with Host Interaction

    PubMed Central

    Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.

    2006-01-01

    Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469

  16. Multilocus sequence analysis of Thermoanaerobacter isolates reveals recombining, but differentiated, populations from geothermal springs of the Uzon Caldera, Kamchatka, Russia

    PubMed Central

    Wagner, Isaac D.; Varghese, Litty B.; Hemme, Christopher L.; Wiegel, Juergen

    2013-01-01

    Thermal environments have island-like characteristics and provide a unique opportunity to study population structure and diversity patterns of microbial taxa inhabiting these sites. Strains having ≥98% 16S rRNA gene sequence similarity to the obligately anaerobic Firmicutes Thermoanaerobacter uzonensis were isolated from seven geothermal springs, separated by up to 1600 m, within the Uzon Caldera (Kamchatka, Russian Far East). The intraspecies variation and spatial patterns of diversity for this taxon were assessed by multilocus sequence analysis (MLSA) of 106 strains. Analysis of eight protein-coding loci (gyrB, lepA, leuS, pyrG, recA, recG, rplB, and rpoB) revealed that all loci were polymorphic and that nucleotide substitutions were mostly synonymous. There were 148 variable nucleotide sites across 8003 bp concatenates of the protein-coding loci. While pairwise FST values indicated a small but significant level of genetic differentiation between most subpopulations, there was a negligible relationship between genetic divergence and spatial separation. Strains with the same allelic profile were only isolated from the same hot spring, occasionally from consecutive years, and single locus variant (SLV) sequence types were usually derived from the same spring. While recombination occurred, there was an “epidemic” population structure in which a particular T. uzonensis sequence type rose in frequency relative to the rest of the population. These results demonstrate spatial diversity patterns for an anaerobic bacterial species in a relative small geographic location and reinforce the view that terrestrial geothermal springs are excellent places to look for biogeographic diversity patterns regardless of the involved distances. PMID:23801987

  17. Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Onda, M.; Kudo, S.; Fukuda, M.

    Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE gene arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification ofmore » this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.« less

  18. Molecular Evolution and Functional Diversification of Replication Protein A1 in Plants

    PubMed Central

    Aklilu, Behailu B.; Culligan, Kevin M.

    2016-01-01

    Replication protein A (RPA) is a heterotrimeric, single-stranded DNA binding complex required for eukaryotic DNA replication, repair, and recombination. RPA is composed of three subunits, RPA1, RPA2, and RPA3. In contrast to single RPA subunit genes generally found in animals and yeast, plants encode multiple paralogs of RPA subunits, suggesting subfunctionalization. Genetic analysis demonstrates that five Arabidopsis thaliana RPA1 paralogs (RPA1A to RPA1E) have unique and overlapping functions in DNA replication, repair, and meiosis. We hypothesize here that RPA1 subfunctionalities will be reflected in major structural and sequence differences among the paralogs. To address this, we analyzed amino acid and nucleotide sequences of RPA1 paralogs from 25 complete genomes representing a wide spectrum of plants and unicellular green algae. We find here that the plant RPA1 gene family is divided into three general groups termed RPA1A, RPA1B, and RPA1C, which likely arose from two progenitor groups in unicellular green algae. In the family Brassicaceae the RPA1B and RPA1C groups have further expanded to include two unique sub-functional paralogs RPA1D and RPA1E, respectively. In addition, RPA1 groups have unique domains, motifs, cis-elements, gene expression profiles, and pattern of conservation that are consistent with proposed functions in monocot and dicot species, including a novel C-terminal zinc-finger domain found only in plant RPA1C-like sequences. These results allow for improved prediction of RPA1 subunit functions in newly sequenced plant genomes, and potentially provide a unique molecular tool to improve classification of Brassicaceae species. PMID:26858742

  19. Recombination and Population Mosaic of a Multifunctional Viral Gene, Adeno-Associated Virus cap

    PubMed Central

    Takeuchi, Yasuhiro; Myers, Richard; Danos, Olivier

    2008-01-01

    Homologous recombination is a dominant force in evolution and results in genetic mosaics. To detect evidence of recombination events and assess the biological significance of genetic mosaics, genome sequences for various viral populations of reasonably large size are now available in the GenBank. We studied a multi-functional viral gene, the adeno-associated virus (AAV) cap gene, which codes for three capsid proteins, VP1, VP2 and VP3. VP1-3 share a common C-terminal domain corresponding to VP3, which forms the viral core structure, while the VP1 unique N-terminal part contains an enzymatic domain with phospholipase A2 activity. Our recombinant detection program (RecI) revealed five novel recombination events, four of which have their cross-over points in the N-terminal, VP1 and VP2 unique region. Comparison of phylogenetic trees for different cap gene regions confirmed discordant phylogenies for the recombinant sequences. Furthermore, differences in the phylogenetic tree structures for the VP1 unique (VP1u) region and the rest of cap highlighted the mosaic nature of cap gene in the AAV population: two dominant forms of VP1u sequences were identified and these forms are linked to diverse sequences in the rest of cap gene. This observation together with the finding of frequent recombination in the VP1 and 2 unique regions suggests that this region is a recombination hot spot. Recombination events in this region preserve protein blocks of distinctive functions and contribute to convergence in VP1u and divergence of the rest of cap. Additionally the possible biological significance of two dominant VP1u forms is inferred. PMID:18286191

  20. A Repeat Look at Repeating Patterns

    ERIC Educational Resources Information Center

    Markworth, Kimberly A.

    2016-01-01

    A "repeating pattern" is a cyclical repetition of an identifiable core. Children in the primary grades usually begin pattern work with fairly simple patterns, such as AB, ABC, or ABB patterns. The unique letters represent unique elements, whereas the sequence of letters represents the core that is repeated. Based on color, shape,…

  1. MRO Sequence Checking Tool

    NASA Technical Reports Server (NTRS)

    Fisher, Forest; Gladden, Roy; Khanampornpan, Teerapat

    2008-01-01

    The MRO Sequence Checking Tool program, mro_check, automates significant portions of the MRO (Mars Reconnaissance Orbiter) sequence checking procedure. Though MRO has similar checks to the ODY s (Mars Odyssey) Mega Check tool, the checks needed for MRO are unique to the MRO spacecraft. The MRO sequence checking tool automates the majority of the sequence validation procedure and check lists that are used to validate the sequences generated by MRO MPST (mission planning and sequencing team). The tool performs more than 50 different checks on the sequence. The automation varies from summarizing data about the sequence needed for visual verification of the sequence, to performing automated checks on the sequence and providing a report for each step. To allow for the addition of new checks as needed, this tool is built in a modular fashion.

  2. Restricted transfer of learning between unimanual and bimanual finger sequences

    PubMed Central

    Bai, Wenjun

    2016-01-01

    When training bimanual skills, such as playing piano, people sometimes practice each hand separately and at a later stage combine the movements of the two hands. This poses the critical question of whether motor skills can be acquired by separately practicing each subcomponent or should be trained as a whole. In the present study, we addressed this question by training human subjects for 4 days in a unimanual or bimanual version of the discrete sequence production task. Both groups were then tested on trained and untrained sequences on both unimanual and bimanual versions of the task. Surprisingly, we found no evidence of transfer from trained unimanual to bimanual or from trained bimanual to unimanual sequences. In half the participants, we also investigated whether cuing the sequences on the left and right hand with unique letters would change transfer. With these cues, untrained sequences that shared some components with the trained sequences were performed more quickly than sequences that did not. However, the amount of this transfer was limited to ∼10% of the overall sequence-specific learning gains. These results suggest that unimanual and bimanual sequences are learned in separate representations. Making participants aware of the interrelationship between sequences can induce some transferrable component, although the main component of the skill remains unique to unimanual or bimanual execution. NEW & NOTEWORTHY Studies in reaching movement demonstrated that approximately half of motor learning can transfer across unimanual and bimanual contexts, suggesting that neural representations for unimanual and bimanual movements are fairly overlapping at the level of elementary movement. In this study, we show that little or no transfer occurred across unimanual and bimanual sequential finger movements. This result suggests that bimanual sequences are represented at a level of the motor hierarchy that integrates movements of both hands. PMID:27974447

  3. Molecular Dynamics Simulations of the 136 Unique Tetranucleotide Sequences of DNA Oligonucleotides. I. Research Design and Results on d(CpG) Steps

    PubMed Central

    Beveridge, David L.; Barreiro, Gabriela; Byun, K. Suzie; Case, David A.; Cheatham, Thomas E.; Dixit, Surjit B.; Giudice, Emmanuel; Lankas, Filip; Lavery, Richard; Maddocks, John H.; Osman, Roman; Seibert, Eleanore; Sklenar, Heinz; Stoll, Gautier; Thayer, Kelly M.; Varnai, Péter; Young, Matthew A.

    2004-01-01

    We describe herein a computationally intensive project aimed at carrying out molecular dynamics (MD) simulations including water and counterions on B-DNA oligomers containing all 136 unique tetranucleotide base sequences. This initiative was undertaken by an international collaborative effort involving nine research groups, the “Ascona B-DNA Consortium” (ABC). Calculations were carried out on the 136 cases imbedded in 39 DNA oligomers with repeating tetranucleotide sequences, capped on both ends by GC pairs and each having a total length of 15 nucleotide pairs. All MD simulations were carried out using a well-defined protocol, the AMBER suite of programs, and the parm94 force field. Phase I of the ABC project involves a total of ∼0.6 μs of simulation for systems containing ∼24,000 atoms. The resulting trajectories involve 600,000 coordinate sets and represent ∼400 gigabytes of data. In this article, the research design, details of the simulation protocol, informatics issues, and the organization of the results into a web-accessible database are described. Preliminary results from 15-ns MD trajectories are presented for the d(CpG) step in its 10 unique sequence contexts, and issues of stability and convergence, the extent of quasiergodic problems, and the possibility of long-lived conformational substates are discussed. PMID:15326025

  4. A low molecular weight artificial RNA of unique size with multiple probe target regions

    NASA Technical Reports Server (NTRS)

    Pitulle, C.; Dsouza, L.; Fox, G. E.

    1997-01-01

    Artificial RNAs (aRNAs) containing novel sequence segments embedded in a deletion mutant of Vibrio proteolyticus 5S rRNA have previously been shown to be expressed from a plasmid borne growth rate regulated promoter in E. coli. These aRNAs accumulate to high levels and their detection is a promising tool for studies in molecular microbial ecology and in environmental monitoring. Herein a new construct is described which illustrates the versatility of detection that is possible with aRNAs. This 3xPen aRNA construct carries a 72 nucleotide insert with three copies of a unique 17 base probe target sequence. This aRNA is 160 nucleotides in length and again accumulates to high levels in the E. coli cytoplasm without incorporating into ribosomes. The 3xPen aRNA illustrates two improvements in detection. First, by appropriate selection of insert size, we obtained an aRNA which provides a unique and hence, easily quantifiable peak, on a high resolution gel profile of low molecular weight RNAs. Second, the existence of multiple probe targets results in a nearly commensurate increase in signal when detection is by hybridization. These aRNAs are naturally amplified and carry sequence segments that are not found in known rRNA sequences. It thus may be possible to detect them directly. An experimental step involving RT-PCR or PCR amplification of the gene could therefore be avoided.

  5. Exploring the Presence of microDNAs in Prostate Cancer Cell Lines, Tissue, and Sera of Prostate Cancer Patients and its Possible Application as Biomarker

    DTIC Science & Technology

    2016-04-01

    Sequence tags were mapped on the human reference genome using the Novoalign software. Only those...ends of the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at... genome (Fig. 3b). Those PE-tags where one tag maps uniquely to an island and the other remains unmapped, but passes the sequence quality filter,

  6. Genomic sequence for the aflatoxigenic filamentous fungus Aspergillus nomius

    USDA-ARS?s Scientific Manuscript database

    The genome of the A. nomius type strain was sequenced using a personal genome machine. Annotation of the genes was undertaken, followed by gene ontology and an investigation into the number of secondary metabolite clusters. Comparative studies with other Aspergillus species involved shared/unique ge...

  7. Evaluation of ribosomal RNA removal protocols for Salmonella RNA-Seq projects

    USDA-ARS?s Scientific Manuscript database

    Next generation sequencing is a powerful technology and its application to sequencing entire RNA populations of food-borne pathogens will provide valuable insights. A problem unique to prokaryotic RNA-Seq is the massive abundance of ribosomal RNA. Unlike eukaryotic messenger RNA (mRNA), bacterial ...

  8. Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire

    USDA-ARS?s Scientific Manuscript database

    Background: Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surve...

  9. IsomiR expression profiles in human lymphoblastoid cell lines exhibit population and gender dependencies

    PubMed Central

    Loher, Phillipe; Londin, Eric R.; Rigoutsos, Isidore

    2014-01-01

    For many years it was believed that each mature microRNA (miRNA) existed as a single entity with fixed endpoints and a ‘static’ and unchangeable primary sequence. However, recent evidence suggests that mature miRNAs are more ‘dynamic’ and that each miRNA precursor arm gives rise to multiple isoforms, the isomiRs. Here we report on our identification of numerous and abundant isomiRs in the lymphoblastoid cell lines (LCLs) of 452 men and women from five different population groups. Unexpectedly, we find that these isomiRs exhibit an expression profile that is population-dependent and gender-dependent. This is important as it indicates that the LCLs of each gender/population combination have their own unique collection of mature miRNA transcripts. Moreover, each identified isomiR has its own characteristic abundance that remains consistent across biological replicates indicating that these are not degradation products. The primary sequences of identified isomiRs differ from the known miRBase miRNA either at their 5´-endpoint (leads to a different ‘seed’ sequence and suggests a different targetome), their 3´-endpoint, or both simultaneously. Our analysis of Argonaute PAR-CLIP data from LCLs supports the association of many of these newly identified isomiRs with the Argonaute silencing complex and thus their functional roles through participation in the RNA interference pathway. PMID:25229428

  10. Molecular evolution of flavonoid dioxygenases in the family Apiaceae.

    PubMed

    Gebhardt, Yvonne; Witte, Simone; Forkmann, Gert; Lukacin, Richard; Matern, Ulrich; Martens, Stefan

    2005-06-01

    Plant species of the family Apiaceae are known to accumulate flavonoids mainly in the form of flavones and flavonols. Three 2-oxoglutarate-dependent dioxygenases, flavone synthase or flavanone 3 beta-hydroxylase and flavonol synthase are involved in the biosynthesis of these secondary metabolites. The corresponding genes were cloned recently from parsley (Petroselinum crispum) leaves. Flavone synthase I appears to be confined to the Apiaceae, and the unique occurrence as well as its high sequence similarity to flavanone 3beta-hydroxylase laid the basis for evolutionary studies. In order to examine the relationship of these two enzymes throughout the Apiaceae, RT-PCR based cloning and functional identification of flavone synthases I or flavanone 3beta-hydroxylases were accomplished from Ammi majus, Anethum graveolens, Apium graveolens, Pimpinella anisum, Conium maculatum and Daucus carota, yielding three additional synthase and three additional hydroxylase cDNAs. Molecular and phylogenetic analyses of these sequences were compatible with the phylogeny based on morphological characteristics and suggested that flavone synthase I most likely resulted from gene duplication of flavanone 3beta-hydroxylase, and functional diversification at some point during the development of the apiaceae subfamilies. Furthermore, the genomic sequences from Petroselinum crispum and Daucus carota revealed two introns in each of the synthases and a lack of introns in the hydroxylases. These results might be explained by intron losses from the hydroxylases occurring at a later stage of evolution.

  11. IsomiR expression profiles in human lymphoblastoid cell lines exhibit population and gender dependencies.

    PubMed

    Loher, Phillipe; Londin, Eric R; Rigoutsos, Isidore

    2014-09-30

    For many years it was believed that each mature microRNA (miRNA) existed as a single entity with fixed endpoints and a 'static' and unchangeable primary sequence. However, recent evidence suggests that mature miRNAs are more 'dynamic' and that each miRNA precursor arm gives rise to multiple isoforms, the isomiRs. Here we report on our identification of numerous and abundant isomiRs in the lymphoblastoid cell lines (LCLs) of 452 men and women from five different population groups. Unexpectedly, we find that these isomiRs exhibit an expression profile that is population-dependent and gender-dependent. This is important as it indicates that the LCLs of each gender/population combination have their own unique collection of mature miRNA transcripts. Moreover, each identified isomiR has its own characteristic abundance that remains consistent across biological replicates indicating that these are not degradation products. The primary sequences of identified isomiRs differ from the known miRBase miRNA either at their 5´-endpoint (leads to a different 'seed' sequence and suggests a different targetome), their 3´-endpoint, or both simultaneously. Our analysis of Argonaute PAR-CLIP data from LCLs supports the association of many of these newly identified isomiRs with the Argonaute silencing complex and thus their functional roles through participation in the RNA interference pathway.

  12. Mycobacterium shottsii sp. nov., a slowly growing species isolated from Chesapeake Bay striped bass (Morone saxatilis)

    USGS Publications Warehouse

    Rhodes, M.W.; Kator, H.; Kotob, S.; van Berkum, P.; Kaattari, I.; Vogelbein, W.; Quinn, F.; Floyd, M.M.; Butler, W.R.; Ottinger, C.A.

    2003-01-01

    Slowly growing, non-pigmented mycobacteria were isolated from striped bass (Morone saxatilis) during an epizootic of mycobacteriosis in the Chesapeake Bay. Growth characteristics, acid-fastness and results of 16S rRNA gene sequencing were consistent with those of the genus Mycobacterium. A unique profile of biochemical reactions was observed among the 21 isolates. A single cluster of eight peaks identified by analysis of mycolic acids (HPLC) resembled those of reference patterns but differed in peak elution times from profiles of reference species of the Mycobacterium tuberculosis complex. One isolate (M175T) was placed within the slowly growing mycobacteria by analysis of aligned 16S rRNA gene sequences and was proximate in phylogeny to Mycobacterium ulcerans and Mycobacterium marinum. However, distinct nucleotide differences were detected in the 16S rRNA gene sequence among M175T, M. ulcerans and M. marinum (99.2% similarity). Isolate M175T could be differentiated from other slowly growing, non-pigmented mycobacteria by its inability to grow at 37??C, production of niacin and urease, absence of nitrate reductase and resistance to isoniazid (1 ??g ml-1), thiacetazone and thiophene-2-carboxylic hydrazide. Based upon these genetic and phenotypic differences, isolate M175T (= ATCC 700981T = NCTC 13215T) is proposed as the type strain of a novel species, Mycobacterium shottsii sp. nov.

  13. The complete mitochondrial genome of the onychophoran Epiperipatus biolleyi reveals a unique transfer RNA set and provides further support for the ecdysozoa hypothesis.

    PubMed

    Podsiadlowski, Lars; Braband, Anke; Mayer, Georg

    2008-01-01

    Onychophora (velvet worms) play a crucial role in current discussions on position of arthropods. The ongoing Articulata/Ecdysozoa debate is in need of additional ground pattern characters for Panarthropoda (Arthropoda, Tardigrada, and Onychophora). Hence, Onychophora is an important outgroup taxon in resolving the relationships among arthropods, irrespective of whether morphological or molecular data are used. To date, there has been a noticeable lack of mitochondrial genome data from onychophorans. Here, we present the first complete mitochondrial genome sequence of an onychophoran, Epiperipatus biolleyi (Peripatidae), which shows several characteristic features. Specifically, the gene order is considerably different from that in other arthropods and other bilaterians. In addition, there is a lack of 9 tRNA genes usually present in bilaterian mitochondrial genomes. All these missing tRNAs have anticodon sequences corresponding to 4-fold degenerate codons, whereas the persisting 13 tRNAs all have anticodons pairing with 2-fold degenerate codons. Sequence-based phylogenetic analysis of the mitochondrial protein-coding genes provides a robust support for a clade consisting of Onychophora, Priapulida, and Arthropoda, which confirms the Ecdysozoa hypothesis. However, resolution of the internal ecdysozoan relationships suffers from a cluster of long-branching taxa (including Nematoda and Platyhelminthes) and a lack of data from Tardigrada and further nemathelminth taxa in addition to nematodes and priapulids.

  14. Genome sequence of Aspergillus luchuensis NBRC 4314

    PubMed Central

    Yamada, Osamu; Machida, Masayuki; Hosoyama, Akira; Goto, Masatoshi; Takahashi, Toru; Futagami, Taiki; Yamagata, Youhei; Takeuchi, Michio; Kobayashi, Tetsuo; Koike, Hideaki; Abe, Keietsu; Asai, Kiyoshi; Arita, Masanori; Fujita, Nobuyuki; Fukuda, Kazuro; Higa, Ken-ichi; Horikawa, Hiroshi; Ishikawa, Takeaki; Jinno, Koji; Kato, Yumiko; Kirimura, Kohtaro; Mizutani, Osamu; Nakasone, Kaoru; Sano, Motoaki; Shiraishi, Yohei; Tsukahara, Masatoshi; Gomi, Katsuya

    2016-01-01

    Awamori is a traditional distilled beverage made from steamed Thai-Indica rice in Okinawa, Japan. For brewing the liquor, two microbes, local kuro (black) koji mold Aspergillus luchuensis and awamori yeast Saccharomyces cerevisiae are involved. In contrast, that yeasts are used for ethanol fermentation throughout the world, a characteristic of Japanese fermentation industries is the use of Aspergillus molds as a source of enzymes for the maceration and saccharification of raw materials. Here we report the draft genome of a kuro (black) koji mold, A. luchuensis NBRC 4314 (RIB 2604). The total length of nonredundant sequences was nearly 34.7 Mb, comprising approximately 2,300 contigs with 16 telomere-like sequences. In total, 11,691 genes were predicted to encode proteins. Most of the housekeeping genes, such as transcription factors and N-and O-glycosylation system, were conserved with respect to Aspergillus niger and Aspergillus oryzae. An alternative oxidase and acid-stable α-amylase regarding citric acid production and fermentation at a low pH as well as a unique glutamic peptidase were also found in the genome. Furthermore, key biosynthetic gene clusters of ochratoxin A and fumonisin B were absent when compared with A. niger genome, showing the safety of A. luchuensis for food and beverage production. This genome information will facilitate not only comparative genomics with industrial kuro-koji molds, but also molecular breeding of the molds in improvements of awamori fermentation. PMID:27651094

  15. Noncoding sequence classification based on wavelet transform analysis: part II

    NASA Astrophysics Data System (ADS)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez-Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. We hypothesize that the characteristic periodicities of the noncoding sequences are related to their function. We describe the procedure to identify these characteristic periodicities using the wavelet analysis. Our results show that three groups of noncoding sequences, each one with different biological function, may be differentiated by their wavelet coefficients within specific frequency range.

  16. Microbial biogeography of wine grapes is conditioned by cultivar, vintage, and climate

    PubMed Central

    Bokulich, Nicholas A.; Thorngate, John H.; Richardson, Paul M.; Mills, David A.

    2014-01-01

    Wine grapes present a unique biogeography model, wherein microbial biodiversity patterns across viticultural zones not only answer questions of dispersal and community maintenance, they are also an inherent component of the quality, consumer acceptance, and economic appreciation of a culturally important food product. On their journey from the vineyard to the wine bottle, grapes are transformed to wine through microbial activity, with indisputable consequences for wine quality parameters. Wine grapes harbor a wide range of microbes originating from the surrounding environment, many of which are recognized for their role in grapevine health and wine quality. However, determinants of regional wine characteristics have not been identified, but are frequently assumed to stem from viticultural or geological factors alone. This study used a high-throughput, short-amplicon sequencing approach to demonstrate that regional, site-specific, and grape-variety factors shape the fungal and bacterial consortia inhabiting wine-grape surfaces. Furthermore, these microbial assemblages are correlated to specific climatic features, suggesting a link between vineyard environmental conditions and microbial inhabitation patterns. Taken together, these factors shape the unique microbial inputs to regional wine fermentations, posing the existence of nonrandom “microbial terroir” as a determining factor in regional variation among wine grapes. PMID:24277822

  17. The microbiomes and metagenomes of forest biochars

    NASA Astrophysics Data System (ADS)

    Noyce, Genevieve L.; Winsborough, Carolyn; Fulthorpe, Roberta; Basiliko, Nathan

    2016-05-01

    Biochar particles have been hypothesized to provide unique microhabitats for a portion of the soil microbial community, but few studies have systematically compared biochar communities to bulk soil communities. Here, we used a combination of sequencing techniques to assess the taxonomic and functional characteristics of microbial communities in four-year-old biochar particles and in adjacent soils across three forest environments. Though effects varied between sites, the microbial community living in and around the biochar particles had significantly lower prokaryotic diversity and higher eukaryotic diversity than the surrounding soil. In particular, the biochar bacterial community had proportionally lower abundance of Acidobacteria, Planctomycetes, and β-Proteobacteria taxa, compared to the soil, while the eukaryotic biochar community had an 11% higher contribution of protists belonging to the Aveolata superphylum. Additionally, we were unable to detect a consistent biochar effect on the genetic functional potential of these microbial communities for the subset of the genetic data for which we were able to assign functions through MG-RAST. Overall, these results show that while biochar particles did select for a unique subset of the biota found in adjacent soils, effects on the microbial genetic functional potential appeared to be specific to contrasting forest soil environments.

  18. Microbial biogeography of wine grapes is conditioned by cultivar, vintage, and climate.

    PubMed

    Bokulich, Nicholas A; Thorngate, John H; Richardson, Paul M; Mills, David A

    2014-01-07

    Wine grapes present a unique biogeography model, wherein microbial biodiversity patterns across viticultural zones not only answer questions of dispersal and community maintenance, they are also an inherent component of the quality, consumer acceptance, and economic appreciation of a culturally important food product. On their journey from the vineyard to the wine bottle, grapes are transformed to wine through microbial activity, with indisputable consequences for wine quality parameters. Wine grapes harbor a wide range of microbes originating from the surrounding environment, many of which are recognized for their role in grapevine health and wine quality. However, determinants of regional wine characteristics have not been identified, but are frequently assumed to stem from viticultural or geological factors alone. This study used a high-throughput, short-amplicon sequencing approach to demonstrate that regional, site-specific, and grape-variety factors shape the fungal and bacterial consortia inhabiting wine-grape surfaces. Furthermore, these microbial assemblages are correlated to specific climatic features, suggesting a link between vineyard environmental conditions and microbial inhabitation patterns. Taken together, these factors shape the unique microbial inputs to regional wine fermentations, posing the existence of nonrandom "microbial terroir" as a determining factor in regional variation among wine grapes.

  19. The microbiomes and metagenomes of forest biochars

    PubMed Central

    Noyce, Genevieve L.; Winsborough, Carolyn; Fulthorpe, Roberta; Basiliko, Nathan

    2016-01-01

    Biochar particles have been hypothesized to provide unique microhabitats for a portion of the soil microbial community, but few studies have systematically compared biochar communities to bulk soil communities. Here, we used a combination of sequencing techniques to assess the taxonomic and functional characteristics of microbial communities in four-year-old biochar particles and in adjacent soils across three forest environments. Though effects varied between sites, the microbial community living in and around the biochar particles had significantly lower prokaryotic diversity and higher eukaryotic diversity than the surrounding soil. In particular, the biochar bacterial community had proportionally lower abundance of Acidobacteria, Planctomycetes, and β-Proteobacteria taxa, compared to the soil, while the eukaryotic biochar community had an 11% higher contribution of protists belonging to the Aveolata superphylum. Additionally, we were unable to detect a consistent biochar effect on the genetic functional potential of these microbial communities for the subset of the genetic data for which we were able to assign functions through MG-RAST. Overall, these results show that while biochar particles did select for a unique subset of the biota found in adjacent soils, effects on the microbial genetic functional potential appeared to be specific to contrasting forest soil environments. PMID:27212657

  20. Expanding the foundation for personalized medicine: implications and challenges for dentistry.

    PubMed

    Garcia, I; Kuska, R; Somerman, M J

    2013-07-01

    Personalized medicine aims to individualize care based on a person's unique genetic, environmental, and clinical profile. Dentists and physicians have long recognized variations between and among patients, and have customized care based on each individual's health history, environment, and behavior. However, the sequencing of the human genome in 2003 and breakthroughs in regenerative medicine, imaging, and computer science redefined "personalized medicine" as clinical care that takes advantage of new molecular tools to facilitate highly precise health care based on an individual's unique genomic and molecular characteristics. Major investments in science bring a new urgency toward realizing the promise of personalized medicine; yet, many challenges stand in the way. In this article, we present an overview of the opportunities and challenges that influence the oral health community's full participation in personalized medicine. We highlight selected research advances that are solidifying the foundation of personalized oral health care, elaborate on their impact on dentistry, and explore obstacles toward their adoption into practice. It is our view that now is the time for oral health professionals, educators, students, researchers, and patients to engage fully in preparations for the arrival of personalized medicine as a means to provide quality, customized, and effective oral health care for all.

  1. 76 FR 23333 - Notice of Proposed Withdrawal Extension and Opportunity for Public Meeting; Wyoming

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-04-26

    ... Land Management, Interior. ACTION: Notice. SUMMARY: The United States Department of Agriculture (USDA... mining laws to protect unique topographic characteristics and recreation values of the Snowy Range Area... withdrawal extension is to continue to protect the unique topographic characteristics of the Snowy Range Area...

  2. A new endonuclease recognizing the deoxynucleotide sequence CCNNGG from the cyanobacterium Synechocystis 6701.

    PubMed

    Calléja, F; Tandeau de Marsac, N; Coursin, T; van Ormondt, H; de Waard, A

    1985-09-25

    A new sequence-specific endonuclease from the cyanobacterium Synechocystis species PCC 6701 has been purified and characterized. This enzyme, SecI, is unique in recognizing the nucleotide sequence: 5' -CCNNGG-3' 3' -GGNNCC-5' and cleaves it at the position indicated by the symbol. Two other restriction endonucleases, SecII and SecIII, found in this organism are isoschizomers of MspI and MstII, respectively.

  3. Transcriptomic analysis of Siberian ginseng (Eleutherococcus senticosus) to discover genes involved in saponin biosynthesis.

    PubMed

    Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui

    2015-03-14

    Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.

  4. Comprehensive Survey of Genetic Diversity in Chloroplast Genomes and 45S nrDNAs within Panax ginseng Species

    PubMed Central

    Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Lee, Hyun Oh; Joh, Ho Jun; Kim, Nam-Hoon; Park, Hyun-Seung; Yang, Tae-Jin

    2015-01-01

    We report complete sequences of chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for cytoplasm and nuclear genome, respectively, based on low coverage NGS sequence of each cultivar. The cp genomes sizes ranged from 156,241 to 156,425 bp and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences as well as those previously reported for three Chinese accessions identified very rare but unique polymorphism in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication. PMID:26061692

  5. Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry.

    PubMed

    Asara, John M; Schweitzer, Mary H; Freimark, Lisa M; Phillips, Matthew; Cantley, Lewis C

    2007-04-13

    Fossilized bones from extinct taxa harbor the potential for obtaining protein or DNA sequences that could reveal evolutionary links to extant species. We used mass spectrometry to obtain protein sequences from bones of a 160,000- to 600,000-year-old extinct mastodon (Mammut americanum) and a 68-million-year-old dinosaur (Tyrannosaurus rex). The presence of T. rex sequences indicates that their peptide bonds were remarkably stable. Mass spectrometry can thus be used to determine unique sequences from ancient organisms from peptide fragmentation patterns, a valuable tool to study the evolution and adaptation of ancient taxa from which genomic sequences are unlikely to be obtained.

  6. Comparison of CNVs in Buffalo with other species

    USDA-ARS?s Scientific Manuscript database

    Using a read-depth (RD) and a hybrid read-pair, split-read (RAPTR-SV) CNV detection method, we identified over 1425 unique CNVs in 14 Water Buffalo individual compared to the cattle genome sequence. Total variable sequence of the CNV regions (CNVR) from the RD method approached 59 megabases (~ 2% of...

  7. A matrix-based approach to solving the inverse Frobenius-Perron problem using sequences of density functions of stochastically perturbed dynamical systems

    NASA Astrophysics Data System (ADS)

    Nie, Xiaokai; Coca, Daniel

    2018-01-01

    The paper introduces a matrix-based approach to estimate the unique one-dimensional discrete-time dynamical system that generated a given sequence of probability density functions whilst subjected to an additive stochastic perturbation with known density.

  8. A matrix-based approach to solving the inverse Frobenius-Perron problem using sequences of density functions of stochastically perturbed dynamical systems.

    PubMed

    Nie, Xiaokai; Coca, Daniel

    2018-01-01

    The paper introduces a matrix-based approach to estimate the unique one-dimensional discrete-time dynamical system that generated a given sequence of probability density functions whilst subjected to an additive stochastic perturbation with known density.

  9. A Unique (3+2) Annulation Reaction between Meldrum's Acid and Nitrones: Mechanistic Insight by ESI-IMS-MS and DFT Studies.

    PubMed

    Lespes, Nicolas; Pair, Etienne; Maganga, Clisy; Bretier, Marie; Tognetti, Vincent; Joubert, Laurent; Levacher, Vincent; Hubert-Roux, Marie; Afonso, Carlos; Loutelier-Bourhis, Corinne; Brière, Jean-François

    2018-03-15

    The fragile intermediates of the domino process leading to an isoxazolidin-5-one, triggered by unique reactivity between Meldrum's acid and an N-benzyl nitrone in the presence of a Brønsted base, were determined thanks to the softness and accuracy of electrospray ionization mass spectrometry coupled to ion mobility spectrometry (ESI-IMS-MS). The combined DFT study shed light on the overall organocatalytic sequence that starts with a stepwise (3+2) annulation reaction that is followed by a decarboxylative protonation sequence encompassing a stereoselective pathway issue. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. On continuous user authentication via typing behavior.

    PubMed

    Roth, Joseph; Liu, Xiaoming; Metaxas, Dimitris

    2014-10-01

    We hypothesize that an individual computer user has a unique and consistent habitual pattern of hand movements, independent of the text, while typing on a keyboard. As a result, this paper proposes a novel biometric modality named typing behavior (TB) for continuous user authentication. Given a webcam pointing toward a keyboard, we develop real-time computer vision algorithms to automatically extract hand movement patterns from the video stream. Unlike the typical continuous biometrics, such as keystroke dynamics (KD), TB provides a reliable authentication with a short delay, while avoiding explicit key-logging. We collect a video database where 63 unique subjects type static text and free text for multiple sessions. For one typing video, the hands are segmented in each frame and a unique descriptor is extracted based on the shape and position of hands, as well as their temporal dynamics in the video sequence. We propose a novel approach, named bag of multi-dimensional phrases, to match the cross-feature and cross-temporal pattern between a gallery sequence and probe sequence. The experimental results demonstrate a superior performance of TB when compared with KD, which, together with our ultrareal-time demo system, warrant further investigation of this novel vision application and biometric modality.

  11. The unique C- and N-terminal sequences of Metallothionein isoform 3 mediate growth inhibition and Vectorial active transport in MCF-7 cells.

    PubMed

    Voels, Brent; Wang, Liping; Sens, Donald A; Garrett, Scott H; Zhang, Ke; Somji, Seema

    2017-05-25

    The 3rd isoform of the metallothionein (MT3) gene family has been shown to be overexpressed in most ductal breast cancers. A previous study has shown that the stable transfection of MCF-7 cells with the MT3 gene inhibits cell growth. The goal of the present study was to determine the role of the unique C-terminal and N-terminal sequences of MT3 on phenotypic properties and gene expression profiles of MCF-7 cells. MCF-7 cells were transfected with various metallothionein gene constructs which contain the insertion or the removal of the unique MT3 C- and N-terminal domains. Global gene expression analysis was performed on the MCF-7 cells containing the various constructs and the expression of the unique C- and N- terminal domains of MT3 was correlated to phenotypic properties of the cells. The results of the present study demonstrate that the C-terminal sequence of MT3, in the absence of the N-terminal sequence, induces dome formation in MCF-7 cells, which in cell cultures is the phenotypic manifestation of a cell's ability to perform vectorial active transport. Global gene expression analysis demonstrated that the increased expression of the GAGE gene family correlated with dome formation. Expression of the C-terminal domain induced GAGE gene expression, whereas the N-terminal domain inhibited GAGE gene expression and that the effect of the N-terminal domain inhibition was dominant over the C-terminal domain of MT3. Transfection with the metallothionein 1E gene increased the expression of GAGE genes. In addition, both the C- and the N-terminal sequences of the MT3 gene had growth inhibitory properties, which correlated to an increased expression of the interferon alpha-inducible protein 6. Our study shows that the C-terminal domain of MT3 confers dome formation in MCF-7 cells and the presence of this domain induces expression of the GAGE family of genes. The differential effects of MT3 and metallothionein 1E on the expression of GAGE genes suggests unique roles of these genes in the development and progression of breast cancer. The finding that interferon alpha-inducible protein 6 expression is associated with the ability of MT3 to inhibit growth needs further investigation.

  12. Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Su, Y.; Zhang, H.; Madrid, R.

    1994-09-01

    Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less

  13. A public HTLV-1 molecular epidemiology database for sequence management and data mining.

    PubMed

    Araujo, Thessika Hialla Almeida; Souza-Brito, Leandro Inacio; Libin, Pieter; Deforche, Koen; Edwards, Dustin; de Albuquerque-Junior, Antonio Eduardo; Vandamme, Anne-Mieke; Galvao-Castro, Bernardo; Alcantara, Luiz Carlos Junior

    2012-01-01

    It is estimated that 15 to 20 million people are infected with the human T-cell lymphotropic virus type 1 (HTLV-1). At present, there are more than 2,000 unique HTLV-1 isolate sequences published. A central database to aggregate sequence information from a range of epidemiological aspects including HTLV-1 infections, pathogenesis, origins, and evolutionary dynamics would be useful to scientists and physicians worldwide. Described here, we have developed a database that collects and annotates sequence data and can be accessed through a user-friendly search interface. The HTLV-1 Molecular Epidemiology Database website is available at http://htlv1db.bahia.fiocruz.br/. All data was obtained from publications available at GenBank or through contact with the authors. The database was developed using Apache Webserver 2.1.6 and SGBD MySQL. The webpage interfaces were developed in HTML and sever-side scripting written in PHP. The HTLV-1 Molecular Epidemiology Database is hosted on the Gonçalo Moniz/FIOCRUZ Research Center server. There are currently 2,457 registered sequences with 2,024 (82.37%) of those sequences representing unique isolates. Of these sequences, 803 (39.67%) contain information about clinical status (TSP/HAM, 17.19%; ATL, 7.41%; asymptomatic, 12.89%; other diseases, 2.17%; and no information, 60.32%). Further, 7.26% of sequences contain information on patient gender while 5.23% of sequences provide the age of the patient. The HTLV-1 Molecular Epidemiology Database retrieves and stores annotated HTLV-1 proviral sequences from clinical, epidemiological, and geographical studies. The collected sequences and related information are now accessible on a publically available and user-friendly website. This open-access database will support clinical research and vaccine development related to viral genotype.

  14. Unique autosomal recessive variant of palmoplantar keratoderma associated with hearing loss not caused by known mutations*

    PubMed Central

    Hegazi, Moustafa Abdelaal; Manou, Sommen; Sakr, Hazem; Camp, Guy Van

    2017-01-01

    Inherited Palmoplantar Keratodermas are rare disorders of genodermatosis that are conventionally regarded as autosomal dominant in inheritance with extensive clinical and genetic heterogeneity. This is the first report of a unique autosomal recessive Inherited Palmoplantar keratoderma - sensorineural hearing loss syndrome which has not been reported before in 3 siblings of a large consanguineous family. The patients presented unique clinical features that were different from other known Inherited Palmoplantar Keratodermas - hearing loss syndromes. Mutations in GJB2 or GJB6 and the mitochondrial A7445G mutation, known to be the major causes of diverse Inherited Palmoplantar Keratodermas -hearing loss syndromes were not detected by Sanger sequencing. Moreover, the pathogenic mutation could not be identified using whole exome sequencing. Other known Inherited Palmoplantar keratoderma syndromes were excluded based on both clinical criteria and genetic analysis. PMID:29267478

  15. Quantitative profiling of immune repertoires for minor lymphocyte counts using unique molecular identifiers.

    PubMed

    Egorov, Evgeny S; Merzlyak, Ekaterina M; Shelenkov, Andrew A; Britanova, Olga V; Sharonov, George V; Staroverov, Dmitriy B; Bolotin, Dmitriy A; Davydov, Alexey N; Barsova, Ekaterina; Lebedev, Yuriy B; Shugay, Mikhail; Chudakov, Dmitriy M

    2015-06-15

    Emerging high-throughput sequencing methods for the analyses of complex structure of TCR and BCR repertoires give a powerful impulse to adaptive immunity studies. However, there are still essential technical obstacles for performing a truly quantitative analysis. Specifically, it remains challenging to obtain comprehensive information on the clonal composition of small lymphocyte populations, such as Ag-specific, functional, or tissue-resident cell subsets isolated by sorting, microdissection, or fine needle aspirates. In this study, we report a robust approach based on unique molecular identifiers that allows profiling Ag receptors for several hundred to thousand lymphocytes while preserving qualitative and quantitative information on clonal composition of the sample. We also describe several general features regarding the data analysis with unique molecular identifiers that are critical for accurate counting of starting molecules in high-throughput sequencing applications. Copyright © 2015 by The American Association of Immunologists, Inc.

  16. Simplifying complex sequence information: a PCP-consensus protein binds antibodies against all four Dengue serotypes.

    PubMed

    Bowen, David M; Lewis, Jessica A; Lu, Wenzhe; Schein, Catherine H

    2012-09-14

    Designing proteins that reflect the natural variability of a pathogen is essential for developing novel vaccines and drugs. Flaviviruses, including Dengue (DENV) and West Nile (WNV), evolve rapidly and can "escape" neutralizing monoclonal antibodies by mutation. Designing antigens that represent many distinct strains is important for DENV, where infection with a strain from one of the four serotypes may lead to severe hemorrhagic disease on subsequent infection with a strain from another serotype. Here, a DENV physicochemical property (PCP)-consensus sequence was derived from 671 unique sequences from the Flavitrack database. PCP-consensus proteins for domain 3 of the envelope protein (EdomIII) were expressed from synthetic genes in Escherichia coli. The ability of the purified consensus proteins to bind polyclonal antibodies generated in response to infection with strains from each of the four DENV serotypes was determined. The initial consensus protein bound antibodies from DENV-1-3 in ELISA and Western blot assays. This sequence was altered in 3 steps to incorporate regions of maximum variability, identified as significant changes in the PCPs, characteristic of DENV-4 strains. The final protein was recognized by antibodies against all four serotypes. Two amino acids essential for efficient binding to all DENV antibodies are part of a discontinuous epitope previously defined for a neutralizing monoclonal antibody. The PCP-consensus method can significantly reduce the number of experiments required to define a multivalent antigen, which is particularly important when dealing with pathogens that must be tested at higher biosafety levels. Copyright © 2012 Elsevier Ltd. All rights reserved.

  17. Longitudinal Analysis of Cerebrospinal Fluid and Plasma HIV-1 Envelope Sequences Isolated From a Single Donor with HIV Asymptomatic Neurocognitive Impairment

    PubMed Central

    Vázquez-Santiago, Fabián; García, Yashira; Rivera-Román, Ivelisse; Noel, Richard J.; Wojna, Valerie; Meléndez, Loyda M.; Rivera-Amill, Vanessa

    2015-01-01

    Objective Combined antiretroviral treatment (cART) has changed the clinical presentation of HIV-associated neurocognitive disorders (HAND) to that of the milder forms of the disease. Asymptomatic neurocognitive impairment (ANI) is now more prevalent and is associated with increased morbidity and mortality risk in HIV-1–infected people. HIV-1 envelope (env) genetic heterogeneity has been detected within the central nervous system (CNS) of individuals with ANI. Changes within env determine co-receptor use, cellular tropism, and neuropathogenesis. We hypothesize that compartmental changes are associated with HIV-1 env C2V4 during ANI and sought to analyze paired HIV-1 env sequences from plasma and cerebrospinal fluid (CSF) of a female subject undergoing long-term cART. Methods Paired plasma and CSF samples were collected at 12-month intervals and HIV-1 env C2V4 was cloned and sequenced. Results Phylogenetic analysis of paired samples consistently showed genetic variants unique to the CSF. Phenotypic prediction showed CCR5 (R5) variants for all CSF-derived sequences and showed minor X4 variants (or dual-tropic) in the plasma at later time points. Viral compartmentalization was evident throughout the study, suggesting that the occurrence of distinctive env strains may contribute to the neuropathogenesis of HAND. Conclusions Our study provides new insights about the genetic characteristics within the C2V4 of HIV-1 env that persist after long-term cART and during the course of persistent ANI. PMID:26167513

  18. Unique microbial community in drilling fluids from Chinese continental scientific drilling

    USGS Publications Warehouse

    Zhang, Gengxin; Dong, Hailiang; Jiang, Hongchen; Xu, Zhiqin; Eberl, Dennis D.

    2006-01-01

    Circulating drilling fluid is often regarded as a contamination source in investigations of subsurface microbiology. However, it also provides an opportunity to sample geological fluids at depth and to study contained microbial communities. During our study of deep subsurface microbiology of the Chinese Continental Scientific Deep drilling project, we collected 6 drilling fluid samples from a borehole from 2290 to 3350 m below the land surface. Microbial communities in these samples were characterized with cultivation-dependent and -independent techniques. Characterization of 16S rRNA genes indicated that the bacterial clone sequences related to Firmicutes became progressively dominant with increasing depth. Most sequences were related to anaerobic, thermophilic, halophilic or alkaliphilic bacteria. These habitats were consistent with the measured geochemical characteristics of the drilling fluids that have incorporated geological fluids and partly reflected the in-situ conditions. Several clone types were closely related to Thermoanaerobacter ethanolicus, Caldicellulosiruptor lactoaceticus, and Anaerobranca gottschalkii, an anaerobic metal-reducer, an extreme thermophile, and an anaerobic chemoorganotroph, respectively, with an optimal growth temperature of 50–68°C. Seven anaerobic, thermophilic Fe(III)-reducing bacterial isolates were obtained and they were capable of reducing iron oxide and clay minerals to produce siderite, vivianite, and illite. The archaeal diversity was low. Most archaeal sequences were not related to any known cultivated species, but rather to environmental clone sequences recovered from subsurface environments. We infer that the detected microbes were derived from geological fluids at depth and their growth habitats reflected the deep subsurface conditions. These findings have important implications for microbial survival and their ecological functions in the deep subsurface.

  19. Deep sequencing and in silico analysis of small RNA library reveals novel miRNA from leaf Persicaria minor transcriptome.

    PubMed

    Samad, Abdul Fatah A; Nazaruddin, Nazaruddin; Murad, Abdul Munir Abdul; Jani, Jaeyres; Zainal, Zamri; Ismail, Ismanizan

    2018-03-01

    In current era, majority of microRNA (miRNA) are being discovered through computational approaches which are more confined towards model plants. Here, for the first time, we have described the identification and characterization of novel miRNA in a non-model plant, Persicaria minor ( P . minor ) using computational approach. Unannotated sequences from deep sequencing were analyzed based on previous well-established parameters. Around 24 putative novel miRNAs were identified from 6,417,780 reads of the unannotated sequence which represented 11 unique putative miRNA sequences. PsRobot target prediction tool was deployed to identify the target transcripts of putative novel miRNAs. Most of the predicted target transcripts (mRNAs) were known to be involved in plant development and stress responses. Gene ontology showed that majority of the putative novel miRNA targets involved in cellular component (69.07%), followed by molecular function (30.08%) and biological process (0.85%). Out of 11 unique putative miRNAs, 7 miRNAs were validated through semi-quantitative PCR. These novel miRNAs discoveries in P . minor may develop and update the current public miRNA database.

  20. DNABIT Compress – Genome compression algorithm

    PubMed Central

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-01

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, “DNABIT Compress” for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that “DNABIT Compress” algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases. PMID:21383923

  1. The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets.

    PubMed

    Droege, Marcus; Hill, Brendon

    2008-08-31

    The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.

  2. Ads' click-through rates predicting based on gated recurrent unit neural networks

    NASA Astrophysics Data System (ADS)

    Chen, Qiaohong; Guo, Zixuan; Dong, Wen; Jin, Lingzi

    2018-05-01

    In order to improve the effect of online advertising and to increase the revenue of advertising, the gated recurrent unit neural networks(GRU) model is used as the ads' click through rates(CTR) predicting. Combined with the characteristics of gated unit structure and the unique of time sequence in data, using BPTT algorithm to train the model. Furthermore, by optimizing the step length algorithm of the gated unit recurrent neural networks, making the model reach optimal point better and faster in less iterative rounds. The experiment results show that the model based on the gated recurrent unit neural networks and its optimization of step length algorithm has the better effect on the ads' CTR predicting, which helps advertisers, media and audience achieve a win-win and mutually beneficial situation in Three-Side Game.

  3. Serological and genetic examination of some nontypical Streptococcus mutans strains.

    PubMed

    Coykendall, A L; Bratthall, D; O'Connor, K; Dvarskas, R A

    1976-09-01

    Thirty-four strains of Streptococcus mutans whose antigenic or genetic positions were unclear or unknown with respect to the serological scheme of Bratthall (1970) and Perch et al. (1974), or the genetic (deoxyribonucleic acid base sequence homology) scheme of Coykendall were analyzed to clarify their relationship to previously well-characterized strains. Strain OMZ175 of the "new" serotype f was genetically homologous with strains of S. mutans subsp. mutans. Strains of the "new" serotype g were homologous with serotype d strains (S. mutans subsp. sobrinus). Strains isolated from wild rats constituted a new genetic group but carried the c antigen. Thus, strains within a "genospecies" (subspecies) of S. mutans may not always carry a unique or characteristic antigen. We suggest that the existence of multiple serotypes within subspecies represents antigenic variation and adaptations to hosts.

  4. Proposals of Sphingomonas paucimobilis gen. nov. and comb. nov., Sphingomonas parapaucimobilis sp. nov., Sphingomonas yanoikuyae sp. nov., Sphingomonas adhaesiva sp. nov., Sphingomonas capsulata comb. nov., and two genospecies of the genus Sphingomonas.

    PubMed

    Yabuuchi, E; Yano, I; Oyaizu, H; Hashimoto, Y; Ezaki, T; Yamamoto, H

    1990-01-01

    Based on the partial nucleotide sequence analysis of 16S ribosomal ribonucleic acid (rRNA), presence of unique sphingoglycolipids in cellular lipid, and the major type of ubiquinone (Q10), we propose Sphingomonas gen. nov. with the type species Sphingomonas paucimobilis (Holmes et al, 1977) comb. nov. From the homology values of deoxyribonucleic acid-deoxyribonucleic acid hybridization and the phenotypic characteristics, three new species, Sphingomonas parapaucimobilis, Sphingomonas yanoikuyae, Sphingomonas adhaesiva, and one new combination, Sphingomonas capsulata, are described. S. parapaucimobilis JCM 7510 (= GIFU 11387), S. yanoikuyae JCM 7371 (= GIFU 9882), and S. adhaesiva JCM 7370 (= GIFU 11458) are designated as the type strains of the three new species. Emended description of the type strain of S. capsulata is presented.

  5. PNA binding to the non-template DNA strand interferes with transcription, suggesting a blockage mechanism mediated by R-loop formation.

    PubMed

    Belotserkovskii, Boris P; Hanawalt, Philip C

    2015-11-01

    Peptide Nucleic Acids (PNAs) are artificial DNA mimics with superior nucleic acid binding capabilities. T7 RNA polymerase (T7 RNAP) transcription upon encountering PNA bound to the non-template DNA strand was studied in vitro. A characteristic pattern of blockage signals was observed, extending downstream from the PNA binding site, similar to that produced by G-rich homopurine-homopyrimidine (hPu-hPy) sequences and likely caused by R-loop formation. Since blocked transcription complexes in association with stable R-loops may interfere with replication and in some cases trigger apoptosis, targeted R-loop formation might be employed to inactivate selected cells, such as those in tumors, based upon their unique complement of expressed genes. © 2014 The Authors. Molecular Carcinogenesis published by Wiley Periodicals, Inc.

  6. Single Cell Multi-Omics Technology: Methodology and Application.

    PubMed

    Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying

    2018-01-01

    In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions.

  7. Single Cell Multi-Omics Technology: Methodology and Application

    PubMed Central

    Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying

    2018-01-01

    In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions. PMID:29732369

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Taneja, Bhupesh; Patel, Asmita; Slesarev, Alexei

    Topoisomerases are involved in controlling and maintaining the topology of DNA and are present in all kingdoms of life. Unlike all other types of topoisomerases, similar type IB enzymes have only been identified in bacteria and eukarya. The only putative type IB topoisomerase in archaea is represented by Methanopyrus kandleri topoisomerase V. Despite several common functional characteristics, topoisomerase V shows no sequence similarity to other members of the same type. The structure of the 61 kDa N-terminal fragment of topoisomerase V reveals no structural similarity to other topoisomerases. Furthermore, the structure of the active site region is different, suggesting nomore » conservation in the cleavage and religation mechanism. Additionally, the active site is buried, indicating the need of a conformational change for activity. The presence of a topoisomerase in archaea with a unique structure suggests the evolution of a separate mechanism to alter DNA.« less

  9. The Comprehensive Antibiotic Resistance Database

    PubMed Central

    McArthur, Andrew G.; Waglechner, Nicholas; Nizam, Fazmin; Yan, Austin; Azad, Marisa A.; Baylay, Alison J.; Bhullar, Kirandeep; Canova, Marc J.; De Pascale, Gianfranco; Ejim, Linda; Kalan, Lindsay; King, Andrew M.; Koteva, Kalinka; Morar, Mariya; Mulvey, Michael R.; O'Brien, Jonathan S.; Pawlowski, Andrew C.; Piddock, Laura J. V.; Spanogiannopoulos, Peter; Sutherland, Arlene D.; Tang, Irene; Taylor, Patricia L.; Thaker, Maulik; Wang, Wenliang; Yan, Marie; Yu, Tennison

    2013-01-01

    The field of antibiotic drug discovery and the monitoring of new antibiotic resistance elements have yet to fully exploit the power of the genome revolution. Despite the fact that the first genomes sequenced of free living organisms were those of bacteria, there have been few specialized bioinformatic tools developed to mine the growing amount of genomic data associated with pathogens. In particular, there are few tools to study the genetics and genomics of antibiotic resistance and how it impacts bacterial populations, ecology, and the clinic. We have initiated development of such tools in the form of the Comprehensive Antibiotic Research Database (CARD; http://arpcard.mcmaster.ca). The CARD integrates disparate molecular and sequence data, provides a unique organizing principle in the form of the Antibiotic Resistance Ontology (ARO), and can quickly identify putative antibiotic resistance genes in new unannotated genome sequences. This unique platform provides an informatic tool that bridges antibiotic resistance concerns in health care, agriculture, and the environment. PMID:23650175

  10. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

    PubMed

    Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

    2016-02-02

    The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.

  11. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

    PubMed Central

    Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

    2016-01-01

    The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814

  12. Capturing Complexities of Relationship-Level Family Planning Trajectories in Malawi.

    PubMed

    Furnas, Hannah E

    2016-09-01

    In a transitioning fertility climate, preferences and decisions surrounding family planning are constantly in flux. Malawi provides an ideal case study of family planning complexities as fertility preferences are flexible, the relationship context is unstable, and childbearing begins early. I use intensive longitudinal data from Tsogolo la Thanzi-a research project in Malawi that follows young adults in romantic partnerships through the course of their relationship. I examine two questions: (1) What are the typical patterns of family planning as young adults transition through a relationship? (2) How are family planning trajectories related to individual and relationship-level characteristics? I use sequence analysis to order family planning across time and to contextualize it within each relationship. I generate and cluster the family planning trajectories and find six distinct groups of young adults who engage in family planning in similar ways. I find that family planning is complex, dynamic, and unique to each relationship. I argue that (a) family planning research should use the relationship as the unit of analysis and (b) family planning behaviors and preferences should be sequenced over time for a better understanding of key concepts, such as unmet need. © 2016 The Population Council, Inc.

  13. Capturing Complexities of Relationship-Level Family Planning Trajectories in Malawi

    PubMed Central

    Furnas, Hannah E.

    2017-01-01

    In a transitioning fertility climate, preferences and decisions surrounding family planning are constantly in flux. Malawi provides an ideal case study of family planning complexities as fertility preferences are flexible, the relationship context is unstable, and childbearing begins early. I use intensive longitudinal data from Tsogolo la Thanzi—a research project in Malawi that follows young adults in romantic partnerships through the course of their relationship and allows me to ask two questions: (1) What are the typical patterns of family planning as young adults transition through a relationship? (2) How are family planning trajectories related to individual and relationship-level characteristics? I use sequence analysis to order family planning across time and to contextualize it within each relationship. I generate and cluster the family planning trajectories and find six distinct groups of young adults who engage in family planning in similar ways. I find that family planning is complex, dynamic, and unique to each relationship. I argue that (a) family planning research should use the relationship as the unit of analysis and (b) family planning behaviors and preferences should be sequenced over time for a better understanding of key concepts, such as unmet need. PMID:27517867

  14. Uronic polysaccharide degrading enzymes.

    PubMed

    Garron, Marie-Line; Cygler, Miroslaw

    2014-10-01

    In the past several years progress has been made in the field of structure and function of polysaccharide lyases (PLs). The number of classified polysaccharide lyase families has increased to 23 and more detailed analysis has allowed the identification of more closely related subfamilies, leading to stronger correlation between each subfamily and a unique substrate. The number of as yet unclassified polysaccharide lyases has also increased and we expect that sequencing projects will allow many of these unclassified sequences to emerge as new families. The progress in structural analysis of PLs has led to having at least one representative structure for each of the families and for two unclassified enzymes. The newly determined structures have folds observed previously in other PL families and their catalytic mechanisms follow either metal-assisted or Tyr/His mechanisms characteristic for other PL enzymes. Comparison of PLs with glycoside hydrolases (GHs) shows several folds common to both classes but only for the β-helix fold is there strong indication of divergent evolution from a common ancestor. Analysis of bacterial genomes identified gene clusters containing multiple polysaccharide cleaving enzymes, the Polysaccharides Utilization Loci (PULs), and their gene complement suggests that they are organized to process completely a specific polysaccharide. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. The Non-Photosynthetic Algae Helicosporidium spp.: Emergence of a Novel Group of Insect Pathogens.

    PubMed

    Tartar, Aurélien

    2013-07-17

    Since the original description of Helicosporidium parasiticum in 1921, members of the genus Helicosporidium have been reported to infect a wide variety of invertebrates, but their characterization has remained dependent on occasional reports of infection. Recently, several new Helicosporidium isolates have been successfully maintained in axenic cultures. The ability to produce large quantity of biological material has led to very significant advances in the understanding of Helicosporidium biology and its interactions with insect hosts. In particular, the unique infectious process has been well documented; the highly characteristic cyst and its included filamentous cell have been shown to play a central role during host infection and have been the focus of detailed morphological and developmental studies. In addition, phylogenetic analyses inferred from a multitude of molecular sequences have demonstrated that Helicosporidium are highly specialized non-photosynthetic algae (Chlorophyta: Trebouxiophyceae), and represent the first described entomopathogenic algae. This review provides an overview of (i) the morphology of Helicosporidium cell types, (ii) the Helicosporidium life cycle, including the entire infectious sequence and its impact on insect hosts, (iii) the phylogenetic analyses that have prompted the taxonomic classification of Helicosporidium as green algae, and (iv) the documented host range for this novel group of entomopathogens.

  16. Functional characteristics of the calcium modulated proteins seen from an evolutionary perspective

    NASA Technical Reports Server (NTRS)

    Kretsinger, R. H.; Nakayama, S.; Moncrief, N. D.

    1991-01-01

    We have constructed dendrograms relating 173 EF-hand proteins of known amino acid sequence. We aligned all of these proteins by their EF-hand domains, omitting interdomain regions. Initial dendrograms were computed by minimum mutation distance methods. Using these as starting points, we determined the best dendrogram by the method of maximum parsimony, scored by minimum mutation distance. We identified 14 distinct subfamilies as well as 6 unique proteins that are perhaps the sole representatives of other subfamilies. This information is given in tabular form. Within subfamilies one can easily align interdomain regions. The resulting dendrograms are very similar to those computed using domains only. Dendrograms constructed using pairs of domains show general congruence. However, there are enough exceptions to caution against an overly simple scheme in which one pair of gene duplications leads from one domain precurser to a four domain prototype from which all other forms evolved. The ability to bind calcium was lost and acquired several times during evolution. The distribution of introns does not conform to the dendrogram based on amino acid sequences. The rates of evolution appear to be much slower within subfamilies, especially within calmodulin, than those prior to the definition of subfamily.

  17. Lactobacillus crustorum sp. nov., isolated from two traditional Belgian wheat sourdoughs.

    PubMed

    Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Huys, Geert; Vandamme, Peter; De Vuyst, Luc; Vancanneyt, Marc

    2007-07-01

    A polyphasic taxonomic study of the lactic acid bacteria (LAB) population in three traditional Belgian sourdoughs, sampled between 2002 and 2004, revealed a group of isolates that could not be assigned to any recognized LAB species. Initially, sourdough isolates were screened by means of (GTG)(5)-PCR fingerprinting. Four isolates displaying unique (GTG)(5)-PCR patterns were further investigated by means of phenylalanyl-tRNA synthase (pheS) gene sequence analysis and represented a bifurcated branch that could not be allocated to any LAB species present in the in-house pheS database. Their phylogenetic affiliation was determined using 16S rRNA gene sequence analysis and showed that the four sourdough isolates belong to the Lactobacillus plantarum group with Lactobacillus mindensis, Lactobacillus farciminis and Lactobacillus nantensis as closest relatives. Further genotypic and phenotypic studies, including whole-cell protein analysis (SDS-PAGE), amplified fragment length polymorphism (AFLP) fingerprinting, DNA-DNA hybridization, DNA G+C content analysis, growth characteristics and biochemical features, demonstrated that the new sourdough isolates represent a novel Lactobacillus species for which the name Lactobacillus crustorum sp. nov. is proposed. The type strain of the new species is LMG 23699(T) (=CCUG 53174(T)).

  18. Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics

    PubMed Central

    Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng

    2015-01-01

    Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains. PMID:26691589

  19. Genetic and morphological diversity of Trisetacus species (Eriophyoidea: Phytoptidae) associated with coniferous trees in Poland: phylogeny, barcoding, host and habitat specialization.

    PubMed

    Lewandowski, Mariusz; Skoracka, Anna; Szydło, Wiktoria; Kozak, Marcin; Druciarek, Tobiasz; Griffiths, Don A

    2014-08-01

    Eriophyoid species belonging to the genus Trisetacus are economically important as pests of conifers. A narrow host specialization to conifers and some unique morphological characteristics have made these mites interesting subjects for scientific inquiry. In this study, we assessed morphological and genetic variation of seven Trisetacus species originating from six coniferous hosts in Poland by morphometric analysis and molecular sequencing of the mitochondrial cytochrome oxidase subunit I gene and the nuclear D2 region of 28S rDNA. The results confirmed the monophyly of the genus Trisetacus as well as the monophyly of five of the seven species studied. Both DNA sequences were effective in discriminating between six of the seven species tested. Host-dependent genetic and morphological variation in T. silvestris and T. relocatus, and habitat-dependent genetic and morphological variation in T. juniperinus were detected, suggesting the existence of races or even distinct species within these Trisetacus taxa. This is the first molecular phylogenetic analysis of the Trisetacus species. The findings presented here will stimulate further investigations on the evolutionary relationships of Trisetacus as well as the entire Phytoptidae family.

  20. Formation and Evolution of Lakshmi Planum (V-7), Venus: Assessment of Models using Observations from Geological Mapping

    NASA Technical Reports Server (NTRS)

    Ivanov, M. A.; Head, James W.

    2008-01-01

    Lakshmi Planum is a high-standing plateau (3.5-4.5 km above MPR) surrounded by the highest mountain ranges on Venus. Lakshmi represents a unique type of elevated region different from dome-shaped and rifted rises and tessera-bearing crustal plateaus. The unique characteristics of Lakshmi suggest that it formed by an unusual combination of processes and played an important role in Venus geologic history. Lakshmi was studied with Venera-15/16 and Magellan data, resulting in two classes of models, divergent and convergent, to explain its unusual topographic and morphologic characteristics. Divergent models explain Lakshmi as a site of mantle upwelling due to rising and subsequent collapse of a mantle diapir; such models explain emplacement of a lava plateau inside Lakshmi and, in some circumstances, formation of the mountain ranges. The convergent models consider Lakshmi as a locus of mantle downwelling, convergence, underthrusting, and possible subduction. Key features in these models are the mountain ranges, high topography of Lakshmi interior, and the large volcanic centers in the plateau center. These divergent and convergent models entail principally different mechanisms of formation and suggest different geodynamic regimes on Venus. Almost all models make either explicit or implicit predictions about the type and sequence of major events during formation and evolution of Lakshmi and thus detailed geological mapping can be used to test them. Here we present the results of such geological mapping (the V-7 quadrangle, 50-75degN, 300-360degE; scale 1:5M) that allows testing the proposed models for Lakshmi.

  1. The Family Environment and Developmental Psychopathology: The Unique and Interactive Effects of Depression, Attention, and Conduct Problems

    ERIC Educational Resources Information Center

    George, Carrie; Herman, Keith C.; Ostrander, Rick

    2006-01-01

    Prior studies have found remarkable similarity in the family characteristics across a wide range of child psychopathologies. This study investigated the unique relationships between symptoms of depression, conduct problems/aggression, and inattention/hyperactivity and characteristics of the family environment. Parents and teachers completed…

  2. Characterization, sequencing and comparative genomic analysis of vB_AbaM-IME-AB2, a novel lytic bacteriophage that infects multidrug-resistant Acinetobacter baumannii clinical isolates.

    PubMed

    Peng, Fan; Mi, Zhiqiang; Huang, Yong; Yuan, Xin; Niu, Wenkai; Wang, Yahui; Hua, Yuhui; Fan, Huahao; Bai, Changqing; Tong, Yigang

    2014-07-05

    With the use of broad-spectrum antibiotics, immunosuppressive drugs, and glucocorticoids, multidrug-resistant Acinetobacter baumannii (MDR-AB) has become a major nosocomial pathogen species. The recent renaissance of bacteriophage therapy may provide new treatment strategies for combatting drug-resistant bacterial infections. In this study, we isolated a lytic bacteriophage vB_AbaM-IME-AB2 has a short latent period and a small burst size, which clear its host's suspension quickly, was selected for characterization and a complete genomic comparative study. The isolated bacteriophage vB_AbaM-IME-AB2 has an icosahedral head and displays morphology resembling Myoviridae family. Gel separation assays showed that the phage particle contains at least nine protein bands with molecular weights ranging 15-100 kDa. vB_AbaM-IME-AB2 could adsorb its host cells in 9 min with an adsorption rate more than 99% and showed a short latent period (20 min) and a small burst size (62 pfu/cell). It could form clear plaques in the double-layer assay and clear its host's suspension in just 4 hours. Whole genome of vB_AbaM-IME-AB2 was sequenced and annotated and the results showed that its genome is a double-stranded DNA molecule consisting of 43,665 nucleotides. The genome has a G + C content of 37.5% and 82 putative coding sequences (CDSs). We compared the characteristics and complete genome sequence of all known Acinetobacter baumannii bacteriophages. There are only three that have been sequenced Acinetobacter baumannii phages AB1, AP22, and phiAC-1, which have a relatively high similarity and own a coverage of 65%, 50%, 8% respectively when compared with our phage vB_AbaM-IME-AB2. A nucleotide alignment of the four Acinetobacter baumannii phages showed that some CDSs are similar, with no significant rearrangements observed. Yet some sections of these strains of phage are nonhomologous. vB_AbaM-IME-AB2 was a novel and unique A. baumannii bacteriophage. These findings suggest a common ancestry and microbial diversity and evolution. A clear understanding of its characteristics and genes is conducive to the treatment of multidrug-resistant A. baumannii in the future.

  3. Disease surveillance of Atlantic herring: molecular characterization of hepatic coccidiosis and a morphological report of a novel intestinal coccidian

    USGS Publications Warehouse

    Friend, Sarah E; Lovey, J; Hershberger, Paul

    2016-01-01

    Surveillance for pathogens of Atlantic herring, including viral hemorrhagic septicemia virus (VHSV),Ichthyophonus hoferi, and hepatic and intestinal coccidians, was conducted from 2012 to 2016 in the NW Atlantic Ocean, New Jersey, USA. Neither VHSV nor I. hoferi was detected in any sample. Goussia clupearum was found in the livers of 40 to 78% of adult herring in varying parasite loads; however, associated pathological changes were negligible. Phylogenetic analysis based on small subunit 18S rRNA gene sequences placed G. clupearum most closely with other extraintestinal liver coccidia from the genus Calyptospora, though the G. clupearum isolates had a unique nucleotide insertion between 604 and 729 bp that did not occur in any other coccidian species. G. clupearum oocysts from Atlantic and Pacific herring were morphologically similar, though differences occurred in oocyst dimensions. Comparison of G. clupearum genetic sequences from Atlantic and Pacific herring revealed 4 nucleotide substitutions and 2 gaps in a 1749 bp region, indicating some divergence in the geographically separate populations. Pacific G. clupearum oocysts were not directly infective, suggesting that a heteroxenous life cycle is likely. Intestinal coccidiosis was described for the first time from juvenile and adult Atlantic herring. A novel intestinal coccidian species was detected based on morphological characteristics of exogenously sporulated oocysts. A unique feature in these oocysts was the presence of 3 long (15.1 ± 5.1 µm, mean ±SD) spiny projections on both ends of the oocyst. The novel morphology of this coccidian led us to tentatively name this parasite G. echinata n. sp.

  4. Unique CD44 intronic SNP is associated with tumor grade in breast cancer: a case control study and in silico analysis.

    PubMed

    Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan

    2018-01-01

    CD44 encoded by a single gene is a cell surface transmembrane glycoprotein. Exon 2 is one of the important exons to bind CD44 protein to hyaluronan. Experimental evidences show that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited in biobank, Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking were analyzed via polymerase chain reaction and gene sequencing techniques. Association between the observed variation with breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified at position 14 nucleotide upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher-grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.

  5. Preservation of cell structures in a medieval infant brain: a paleohistological, paleogenetic, radiological and physico-chemical study.

    PubMed

    Papageorgopoulou, Christina; Rentsch, Katharina; Raghavan, Maanasa; Hofmann, Maria Ines; Colacicco, Giovanni; Gallien, Véronique; Bianucci, Raffaella; Rühli, Frank

    2010-04-15

    Cerebral tissues from archaeological human remains are extremely rare findings. Hereby, we report a multidisciplinary study of a unique case of a left cerebral hemisphere from a 13th century AD child, found in north-western France. The cerebral tissue-reduced by ca. 80% of its original weight-had been fixed in formalin since its discovery. However, it fully retained its gross anatomical characteristics such as sulci, and gyri; the frontal, temporal and occipital lobe as well as grey and white matter could be readily recognised. Neuronal remains near the hippocampus area and Nissl bodies from the motor cortex area were observed (Nissl, Klüver-Barrera staining). Also, computed tomography (CT) and magnetic resonance imaging (T1, proton density, ultra short echo time sequences) were feasible. They produced high quality morpho-diagnostic images. Both histological and radiological examinations could not confirm the pathologist's previously suggested diagnosis of cerebral haemorrhage as the cause of death. Reproducible cloned mtDNA sequences were recovered from the skeleton but not from the brain itself. This was most likely due to the combined effect of formaldehyde driven DNA-DNA and/or DNA-protein cross-linking, plus hydrolytic fragmentation of the DNA. The chemical profile of the brain tissue, from gas-chromatography/mass-spectroscopy analysis, suggested adipocerous formation as the main aetiology of the mummification process. The hereby presented child brain is a unique paleo-case of well-preserved neuronal cellular tissue, which is a conditio sine qua non for any subsequent study addressing wider perspectives in neuroscience research, such as the evolution of brain morphology and pathology. Copyright 2010 Elsevier Inc. All rights reserved.

  6. A Molecular Framework for Understanding DCIS

    DTIC Science & Technology

    2016-10-01

    frozen patient biopsies, these have been annotated by our pathologist and prepared to be taken on for sequencing. The tissue includes DCIS, IDC...stroma adjacent to DCIS/IDC and normal tissue . We have initiated the RNA sequencing from these samples and also the DNA sequencing 15. SUBJECT TERMS DCIS...before they reach 55. Utilizing a unique bank of frozen mammary biopsies, containing samples with DCIS alone, and a combination of DCIS and IDC, we aim

  7. A new endonuclease recognizing the deoxynucleotide sequence CCNNGG from the cyanobacterium Synechocystis 6701.

    PubMed Central

    Calléja, F; Tandeau de Marsac, N; Coursin, T; van Ormondt, H; de Waard, A

    1985-01-01

    A new sequence-specific endonuclease from the cyanobacterium Synechocystis species PCC 6701 has been purified and characterized. This enzyme, SecI, is unique in recognizing the nucleotide sequence: 5' -CCNNGG-3' 3' -GGNNCC-5' and cleaves it at the position indicated by the symbol. Two other restriction endonucleases, SecII and SecIII, found in this organism are isoschizomers of MspI and MstII, respectively. Images PMID:2997722

  8. Ohmic resistance in a multi-anode MxCs

    EPA Pesticide Factsheets

    A-3txf_sequence summary.xksx: Abundance of contigs or unique sequences for each biofilm samples from anodes in the MEC reactorHodon Waterloo final_fasta_working.docx: Raw sequences with their identification numbersRNA S1_MEC.docx: Representative sequences with their ID number and taxonomyThis dataset is associated with the following publication:Santodomingo, J., H. Ryu, B. Dhar, and H. Lee. Ohmic resistance affects microbial community and electrochemical kinetics in a multi-anode microbial electrochemical cell. JOURNAL OF POWER SOURCES. Elsevier Science Ltd, New York, NY, USA, 331: 315-321, (2016).

  9. Repeated sequence sets in mitochondrial DNA molecules of root knot nematodes (Meloidogyne): nucleotide sequences, genome location and potential for host-race identification.

    PubMed Central

    Okimoto, R; Chamberlin, H M; Macfarlane, J L; Wolstenholme, D R

    1991-01-01

    Within a 7 kb segment of the mtDNA molecule of the root knot nematode, Meloidogyne javanica, that lacks standard mitochondrial genes, are three sets of strictly tandemly arranged, direct repeat sequences: approximately 36 copies of a 102 ntp sequence that contains a TaqI site; 11 copies of a 63 ntp sequence, and 5 copies of an 8 ntp sequence. The 7 kb repeat-containing segment is bounded by putative tRNAasp and tRNAf-met genes and the arrangement of sequences within this segment is: the tRNAasp gene; a unique 1,528 ntp segment that contains two highly stable hairpin-forming sequences; the 102 ntp repeat set; the 8 ntp repeat set; a unique 1,068 ntp segment; the 63 ntp repeat set; and the tRNAf-met gene. The nucleotide sequences of the 102 ntp copies and the 63 ntp copies have been conserved among the species examined. Data from Southern hybridization experiments indicate that 102 ntp and 63 ntp repeats occur in the mtDNAs of three, two and two races of M.incognita, M.hapla and M.arenaria, respectively. Nucleotide sequences of the M.incognita Race-3 102 ntp repeat were found to be either identical or highly similar to those of the M.javanica 102 ntp repeat. Differences in migration distance and number of 102 ntp repeat-containing bands seen in Southern hybridization autoradiographs of restriction-digested mtDNAs of M.javanica and the different host races of M.incognita, M.hapla and M.arenaria are sufficient to distinguish the different host races of each species. Images PMID:2027769

  10. Mitochondrial Genome Sequence of the Legume Vicia faba

    PubMed Central

    Negruk, Valentine

    2013-01-01

    The number of plant mitochondrial genomes sequenced exceeds two dozen. However, for a detailed comparative study of different phylogenetic branches more plant mitochondrial genomes should be sequenced. This article presents sequencing data and comparative analysis of mitochondrial DNA (mtDNA) of the legume Vicia faba. The size of the V. faba circular mitochondrial master chromosome of cultivar Broad Windsor was estimated as 588,000 bp with a genome complexity of 387,745 bp and 52 conservative mitochondrial genes; 32 of them encoding proteins, 3 rRNA, and 17 tRNA genes. Six tRNA genes were highly homologous to chloroplast genome sequences. In addition to the 52 conservative genes, 114 unique open reading frames (ORFs) were found, 36 without significant homology to any known proteins and 29 with homology to the Medicago truncatula nuclear genome and to other plant mitochondrial ORFs, 49 ORFs were not homologous to M. truncatula but possessed sequences with significant homology to other plant mitochondrial or nuclear ORFs. In general, the unique ORFs revealed very low homology to known closely related legumes, but several sequence homologies were found between V. faba, Beta vulgaris, Nicotiana tabacum, Vitis vinifera, and even the monocots Oryza sativa and Zea mays. Most likely these ORFs arose independently during angiosperm evolution (Kubo and Mikami, 2007; Kubo and Newton, 2008). Computational analysis revealed in total about 45% of V. faba mtDNA sequence being homologous to the Medicago truncatula nuclear genome (more than to any sequenced plant mitochondrial genome), and 35% of this homology ranging from a few dozen to 12,806 bp are located on chromosome 1. Apparently, mitochondrial rrn5, rrn18, rps10, ATP synthase subunit alpha, cox2, and tRNA sequences are part of transcribed nuclear mosaic ORFs. PMID:23675376

  11. 'Candidatus Phytoplasma solani', a novel taxon associated with stolbur- and bois noir-related diseases of plants.

    PubMed

    Quaglino, Fabio; Zhao, Yan; Casati, Paola; Bulgari, Daniela; Bianco, Piero Attilio; Wei, Wei; Davis, Robert Edward

    2013-08-01

    Phytoplasmas classified in group 16SrXII infect a wide range of plants and are transmitted by polyphagous planthoppers of the family Cixiidae. Based on 16S rRNA gene sequence identity and biological properties, group 16SrXII encompasses several species, including 'Candidatus Phytoplasma australiense', 'Candidatus Phytoplasma japonicum' and 'Candidatus Phytoplasma fragariae'. Other group 16SrXII phytoplasma strains are associated with stolbur disease in wild and cultivated herbaceous and woody plants and with bois noir disease in grapevines (Vitis vinifera L.). Such latter strains have been informally proposed to represent a separate species, 'Candidatus Phytoplasma solani', but a formal description of this taxon has not previously been published. In the present work, stolbur disease strain STOL11 (STOL) was distinguished from reference strains of previously described species of the 'Candidatus Phytoplasma' genus based on 16S rRNA gene sequence similarity and a unique signature sequence in the 16S rRNA gene. Other stolbur- and bois noir-associated ('Ca. Phytoplasma solani') strains shared >99 % 16S rRNA gene sequence similarity with strain STOL11 and contained the signature sequence. 'Ca. Phytoplasma solani' is the only phytoplasma known to be transmitted by Hyalesthes obsoletus. Insect vectorship and molecular characteristics are consistent with the concept that diverse 'Ca. Phytoplasma solani' strains share common properties and represent an ecologically distinct gene pool. Phylogenetic analyses of 16S rRNA, tuf, secY and rplV-rpsC gene sequences supported this view and yielded congruent trees in which 'Ca. Phytoplasma solani' strains formed, within the group 16SrXII clade, a monophyletic subclade that was most closely related to, but distinct from, that of 'Ca. Phytoplasma australiense'-related strains. Based on distinct molecular and biological properties, stolbur- and bois noir-associated strains are proposed to represent a novel species level taxon, 'Ca. Phytoplasma solani'; STOL11 is designated the reference strain.

  12. Epstein-Barr virus latent gene sequences as geographical markers of viral origin: unique EBNA3 gene signatures identify Japanese viruses as distinct members of the Asian virus family.

    PubMed

    Sawada, Akihisa; Croom-Carter, Deborah; Kondo, Osamu; Yasui, Masahiro; Koyama-Sato, Maho; Inoue, Masami; Kawa, Keisei; Rickinson, Alan B; Tierney, Rosemary J

    2011-05-01

    Polymorphisms in Epstein-Barr virus (EBV) latent genes can identify virus strains from different human populations and individual strains within a population. An Asian EBV signature has been defined almost exclusively from Chinese viruses, with little information from other Asian countries. Here we sequenced polymorphic regions of the EBNA1, 2, 3A, 3B, 3C and LMP1 genes of 31 Japanese strains from control donors and EBV-associated T/NK-cell lymphoproliferative disease (T/NK-LPD) patients. Though identical to Chinese strains in their dominant EBNA1 and LMP1 alleles, Japanese viruses were subtly different at other loci. Thus, while Chinese viruses mainly fall into two families with strongly linked 'Wu' or 'Li' alleles at EBNA2 and EBNA3A/B/C, Japanese viruses all have the consensus Wu EBNA2 allele but fall into two families at EBNA3A/B/C. One family has variant Li-like sequences at EBNA3A and 3B and the consensus Li sequence at EBNA3C; the other family has variant Wu-like sequences at EBNA3A, variants of a low frequency Chinese allele 'Sp' at EBNA3B and a consensus Sp sequence at EBNA3C. Thus, EBNA3A/B/C allelotypes clearly distinguish Japanese from Chinese strains. Interestingly, most Japanese viruses also lack those immune-escape mutations in the HLA-A11 epitope-encoding region of EBNA3B that are so characteristic of viruses from the highly A11-positive Chinese population. Control donor-derived and T/NK-LPD-derived strains were similarly distributed across allelotypes and, by using allelic polymorphisms to track virus strains in patients pre- and post-haematopoietic stem-cell transplant, we show that a single strain can induce both T/NK-LPD and B-cell-lymphoproliferative disease in the same patient.

  13. WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation

    PubMed Central

    2013-01-01

    Background SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and for a given protein, its input comprises: the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probabilities to be associated to human diseases. Results The server consists of two main components, including updated versions of the sequence-based SNPs&GO (recently scored as one of the best algorithms for predicting deleterious SAPs) and of the structure-based SNPs&GO3d programs. Sequence and structure based algorithms are extensively tested on a large set of annotated variations extracted from the SwissVar database. Selecting a balanced dataset with more than 38,000 SAPs, the sequence-based approach achieves 81% overall accuracy, 0.61 correlation coefficient and an Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve of 0.88. For the subset of ~6,600 variations mapped on protein structures available at the Protein Data Bank (PDB), the structure-based method scores with 84% overall accuracy, 0.68 correlation coefficient, and 0.91 AUC. When tested on a new blind set of variations, the results of the server are 79% and 83% overall accuracy for the sequence-based and structure-based inputs, respectively. Conclusions WS-SNPs&GO is a valuable tool that includes in a unique framework information derived from protein sequence, structure, evolutionary profile, and protein function. WS-SNPs&GO is freely available at http://snps.biofold.org/snps-and-go. PMID:23819482

  14. Genome sequencing and analysis of a type A Clostridium perfringens isolate from a case of bovine clostridial abomasitis.

    PubMed

    Nowell, Victoria J; Kropinski, Andrew M; Songer, J Glenn; MacInnes, Janet I; Parreira, Valeria R; Prescott, John F

    2012-01-01

    Clostridium perfringens is a common inhabitant of the avian and mammalian gastrointestinal tracts and can behave commensally or pathogenically. Some enteric diseases caused by type A C. perfringens, including bovine clostridial abomasitis, remain poorly understood. To investigate the potential basis of virulence in strains causing this disease, we sequenced the genome of a type A C. perfringens isolate (strain F262) from a case of bovine clostridial abomasitis. The ∼3.34 Mbp chromosome of C. perfringens F262 is predicted to contain 3163 protein-coding genes, 76 tRNA genes, and an integrated plasmid sequence, Cfrag (∼18 kb). In addition, sequences of two complete circular plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), and two incomplete plasmid fragments, pF262A (48.5 kb) and pF262B (50.0 kb), were identified. Comparison of the chromosome sequence of C. perfringens F262 to complete C. perfringens chromosomes, plasmids and phages revealed 261 unique genes. No novel toxin genes related to previously described clostridial toxins were identified: 60% of the 261 unique genes were hypothetical proteins. There was a two base pair deletion in virS, a gene reported to encode the main sensor kinase involved in virulence gene activation. Despite this frameshift mutation, C. perfringens F262 expressed perfringolysin O, alpha-toxin and the beta2-toxin, suggesting that another regulation system might contribute to the pathogenicity of this strain. Two complete plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), unique to this strain of C. perfringens were identified.

  15. Genome Sequencing and Analysis of a Type A Clostridium perfringens Isolate from a Case of Bovine Clostridial Abomasitis

    PubMed Central

    Nowell, Victoria J.; Kropinski, Andrew M.; Songer, J. Glenn; MacInnes, Janet I.; Parreira, Valeria R.; Prescott, John F.

    2012-01-01

    Clostridium perfringens is a common inhabitant of the avian and mammalian gastrointestinal tracts and can behave commensally or pathogenically. Some enteric diseases caused by type A C. perfringens, including bovine clostridial abomasitis, remain poorly understood. To investigate the potential basis of virulence in strains causing this disease, we sequenced the genome of a type A C. perfringens isolate (strain F262) from a case of bovine clostridial abomasitis. The ∼3.34 Mbp chromosome of C. perfringens F262 is predicted to contain 3163 protein-coding genes, 76 tRNA genes, and an integrated plasmid sequence, Cfrag (∼18 kb). In addition, sequences of two complete circular plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), and two incomplete plasmid fragments, pF262A (48.5 kb) and pF262B (50.0 kb), were identified. Comparison of the chromosome sequence of C. perfringens F262 to complete C. perfringens chromosomes, plasmids and phages revealed 261 unique genes. No novel toxin genes related to previously described clostridial toxins were identified: 60% of the 261 unique genes were hypothetical proteins. There was a two base pair deletion in virS, a gene reported to encode the main sensor kinase involved in virulence gene activation. Despite this frameshift mutation, C. perfringens F262 expressed perfringolysin O, alpha-toxin and the beta2-toxin, suggesting that another regulation system might contribute to the pathogenicity of this strain. Two complete plasmids, pF262C (4.8 kb) and pF262D (9.1 kb), unique to this strain of C. perfringens were identified. PMID:22412860

  16. Variations in Nuclear Localization Strategies Among Pol X Family Enzymes.

    PubMed

    Kirby, Thomas W; Pedersen, Lars C; Gabel, Scott A; Gassman, Natalie R; London, Robert E

    2018-06-22

    Despite the essential roles of pol X family enzymes in DNA repair, information about the structural basis of their nuclear import is limited. Recent studies revealed the unexpected presence of a functional NLS in DNA polymerase β, indicating the importance of active nuclear targeting, even for enzymes likely to leak into and out of the nucleus. The current studies further explore the active nuclear transport of these enzymes by identifying and structurally characterizing the functional NLS sequences in the three remaining human pol X enzymes: terminal deoxynucleotidyl transferase (TdT), DNA polymerase μ (pol μ), and DNA polymerase λ (pol λ). NLS identifications are based on Importin α (Impα) binding affinity determined by fluorescence polarization of fluorescein-labeled NLS peptides, X-ray crystallographic analysis of the Impα∆IBB•NLS complexes, and fluorescence-based subcellular localization studies. All three polymerases use NLS sequences located near their N-terminus; TdT and pol μ utilize monopartite NLS sequences, while pol λ utilizes a bipartite sequence, unique among the pol X family members. The pol μ NLS has relatively weak measured affinity for Impα, due in part to its proximity to the N-terminus that limits non-specific interactions of flanking residues preceding the NLS. However, this effect is partially mitigated by an N-terminal sequence unsupportive of Met1 removal by methionine aminopeptidase, leading to a 3-fold increase in affinity when the N-terminal methionine is present. Nuclear targeting is unique to each pol X family enzyme with variations dependent on the structure and unique functional role of each polymerase. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  17. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea)

    PubMed Central

    Parton, Angela; Bayne, Christopher J.; Barnes, David W.

    2010-01-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories “envelope” and “oxidoreductase activity” but the SAE transcripts did not. GO analysis of SAE transcripts identified the category “anatomical structure formation” that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. PMID:20471924

  18. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: Spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea).

    PubMed

    Parton, Angela; Bayne, Christopher J; Barnes, David W

    2010-09-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories "envelope" and "oxidoreductase activity" but the SAE transcripts did not. GO analysis of SAE transcripts identified the category "anatomical structure formation" that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. Copyright 2010 Elsevier Inc. All rights reserved.

  19. Measures of Working Memory, Sequence Learning, and Speech Recognition in the Elderly.

    ERIC Educational Resources Information Center

    Humes, Larry E.; Floyd, Shari S.

    2005-01-01

    This study describes the measurement of 2 cognitive functions, working-memory capacity and sequence learning, in 2 groups of listeners: young adults with normal hearing and elderly adults with impaired hearing. The measurement of these 2 cognitive abilities with a unique, nonverbal technique capable of auditory, visual, and auditory-visual…

  20. An Investigation of the Effects of CRA Instruction and Students with Autism Spectrum Disorder

    ERIC Educational Resources Information Center

    Stroizer, Shaunita; Hinton, Vanessa; Flores, Margaret; Terry, LaTonya

    2015-01-01

    Students with Autism Spectrum Disorders (ASD) have unique educational needs. The concrete representational abstract (CRA) instructional sequence has been shown effective in teaching students with mathematical difficulties. The purpose of this study was to examine the effects of the CRA sequence in teaching students with ASD. A multiple baseline…

  1. Nucleotide cleaving agents and method

    DOEpatents

    Que, Jr., Lawrence; Hanson, Richard S.; Schnaith, Leah M. T.

    2000-01-01

    The present invention provides a unique series of nucleotide cleaving agents and a method for cleaving a nucleotide sequence, whether single-stranded or double-stranded DNA or RNA, using and a cationic metal complex having at least one polydentate ligand to cleave the nucleotide sequence phosphate backbone to yield a hydroxyl end and a phosphate end.

  2. Programming and Reprogramming Sequence Timing Following High and Low Contextual Interference Practice

    ERIC Educational Resources Information Center

    Wright, David L.; Magnuson, Curt E.; Black, Charles B.

    2005-01-01

    Individuals practiced two unique discrete sequence production tasks that differed in their relative time profile in either a blocked or random practice schedule. Each participant was subsequently administered a "precuing" protocol to examine the cost of initially compiling or modifying the plan for an upcoming movement's relative timing. The…

  3. Differentially expressed genes of Coptotermes formosanus (Isoptera: Rhinotermitidae) challenged by chemical insecticides.

    PubMed

    Zhang, Yi; Zhao, Yuanyuan; Qiu, Xuehong; Han, Richou

    2013-08-01

    Coptotermes formosanus Shiraki (Isoptera: Rhinotermitidae) termites are harmful social insects to wood constructions. The current control methods heavily depend on the chemical insecticides with increasing resistance. Analysis of the differentially expressed genes mediated by chemical insecticides will contribute to the understanding of the termite resistance to chemicals and to the establishment of alternative control measures. In the present article, a full-length cDNA library was constructed from the termites induced by a mixture of commonly used insecticides (0.01% sulfluramid and 0.01% triflumuron) for 24 h, by using the RNA ligase-mediated Rapid Amplification cDNA End method. Fifty-eight differentially expressed clones were obtained by polymerase chain reaction and confirmed by dot-blot hybridization. Forty-six known sequences were obtained, which clustered into 33 unique sequences grouped in 6 contigs and 27 singlets. Sixty-seven percent (22) of the sequences had counterpart genes from other organisms, whereas 33% (11) were undescribed. A Gene Ontology analysis classified 33 unique sequences into different functional categories. In general, most of the differential expression genes were involved in binding and catalytic activity.

  4. Predicting protein crystallization propensity from protein sequence

    PubMed Central

    2011-01-01

    The high-throughput structure determination pipelines developed by structural genomics programs offer a unique opportunity for data mining. One important question is how protein properties derived from a primary sequence correlate with the protein’s propensity to yield X-ray quality crystals (crystallizability) and 3D X-ray structures. A set of protein properties were computed for over 1,300 proteins that expressed well but were insoluble, and for ~720 unique proteins that resulted in X-ray structures. The correlation of the protein’s iso-electric point and grand average hydropathy (GRAVY) with crystallizability was analyzed for full length and domain constructs of protein targets. In a second step, several additional properties that can be calculated from the protein sequence were added and evaluated. Using statistical analyses we have identified a set of the attributes correlating with a protein’s propensity to crystallize and implemented a Support Vector Machine (SVM) classifier based on these. We have created applications to analyze and provide optimal boundary information for query sequences and to visualize the data. These tools are available via the web site http://bioinformatics.anl.gov/cgi-bin/tools/pdpredictor. PMID:20177794

  5. Genome Sequence of the Bacterium Streptomyces davawensis JCM 4913 and Heterologous Production of the Unique Antibiotic Roseoflavin

    PubMed Central

    Jankowitsch, Frank; Schwarz, Julia; Rückert, Christian; Gust, Bertolt; Szczepanowski, Rafael; Blom, Jochen; Pelzer, Stefan; Kalinowski, Jörn

    2012-01-01

    Streptomyces davawensis JCM 4913 synthesizes the antibiotic roseoflavin, a structural riboflavin (vitamin B2) analog. Here, we report the 9,466,619-bp linear chromosome of S. davawensis JCM 4913 and a 89,331-bp linear plasmid. The sequence has an average G+C content of 70.58% and contains six rRNA operons (16S-23S-5S) and 69 tRNA genes. The 8,616 predicted protein-coding sequences include 32 clusters coding for secondary metabolites, several of which are unique to S. davawensis. The chromosome contains long terminal inverted repeats of 33,255 bp each and atypical telomeres. Sequence analysis with regard to riboflavin biosynthesis revealed three different patterns of gene organization in Streptomyces species. Heterologous expression of a set of genes present on a subgenomic fragment of S. davawensis resulted in the production of roseoflavin by the host Streptomyces coelicolor M1152. Phylogenetic analysis revealed that S. davawensis is a close relative of Streptomyces cinnabarinus, and much to our surprise, we found that the latter bacterium is a roseoflavin producer as well. PMID:23043000

  6. Onco-Regulon: an integrated database and software suite for site specific targeting of transcription factors of cancer genes

    PubMed Central

    Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.

    2016-01-01

    Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes clubbed with Unique Sequence-Predictor (USP) a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif, in 5′→3′, 3′ →5′ or both directions by adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome by Frequency Counter programme. This step is iterated till the frequency of the extended motif becomes unity in the genome. Thus, for each given motif, we get three possible unique sequences. Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-Protein structural information further makes Onco-Regulon a highly informative repository for gene specific drug development. We believe that Onco-Regulon will help researchers to design drugs which will bind to an exclusive site in the genome with no off-target effects, theoretically. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825

  7. Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences

    PubMed Central

    Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya

    2016-01-01

    Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. PMID:27289096

  8. Identification of characteristic oligonucleotides in the bacterial 16S ribosomal RNA sequence dataset

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; Willson, Richard C.; Fox, George E.

    2002-01-01

    MOTIVATION: The phylogenetic structure of the bacterial world has been intensively studied by comparing sequences of 16S ribosomal RNA (16S rRNA). This database of sequences is now widely used to design probes for the detection of specific bacteria or groups of bacteria one at a time. The success of such methods reflects the fact that there are local sequence segments that are highly characteristic of particular organisms or groups of organisms. It is not clear, however, the extent to which such signature sequences exist in the 16S rRNA dataset. A better understanding of the numbers and distribution of highly informative oligonucleotide sequences may facilitate the design of hybridization arrays that can characterize the phylogenetic position of an unknown organism or serve as the basis for the development of novel approaches for use in bacterial identification. RESULTS: A computer-based algorithm that characterizes the extent to which any individual oligonucleotide sequence in 16S rRNA is characteristic of any particular bacterial grouping was developed. A measure of signature quality, Q(s), was formulated and subsequently calculated for every individual oligonucleotide sequence in the size range of 5-11 nucleotides and for 15mers with reference to each cluster and subcluster in a 929 organism representative phylogenetic tree. Subsequently, the perfect signature sequences were compared to the full set of 7322 sequences to see how common false positives were. The work completed here establishes beyond any doubt that highly characteristic oligonucleotides exist in the bacterial 16S rRNA sequence dataset in large numbers. Over 16,000 15mers were identified that might be useful as signatures. Signature oligonucleotides are available for over 80% of the nodes in the representative tree.

  9. Molecular characterization of Streptococcus agalactiae and Streptococcus uberis isolates from bovine milk.

    PubMed

    Shome, Bibek Ranjan; Bhuvana, Mani; Mitra, Susweta Das; Krithiga, Natesan; Shome, Rajeswari; Velu, Dhanikachalam; Banerjee, Apala; Barbuddhe, Sukhadeo B; Prabhudas, Krishnamshetty; Rahman, Habibar

    2012-12-01

    Streptococci are one among the major mastitis pathogens which have a considerable impact on cow health, milk quality, and productivity. The aim of the present study was to investigate the occurrence and virulence characteristics of streptococci from bovine milk and to assess the molecular epidemiology and population structure of the Indian isolates using multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE). Out of a total of 209 bovine composite milk samples screened from four herds (A-D), 30 Streptococcus spp. were isolated from 29 milk samples. Among the 30 isolates, species-specific PCR and partial 16S rRNA gene sequence analysis identified 17 Streptococcus agalactiae arising from herd A and 13 Streptococcus uberis comprising of 5, 7, and 1 isolates from herds B, C, and D respectively. PCR based screening for virulence genes revealed the presence of the cfb and the pavA genes in 17 and 1 S. agalactiae isolates, respectively. Similarly, in S. uberis isolates, cfu gene was present in six isolates from herd C, the pau A/skc gene in all the isolates from herds B, C, and D, whereas the sua gene was present in four isolates from herd B and the only isolate from herd D. On MLST analysis, all the S. agalactiae isolates were found to be of a novel sequence type (ST), ST-483, reported for the first time and is a single locus variant of the predicted subgroup founder ST-310, while the S. uberis isolates were found to be of three novel sequence types, namely ST-439, ST-474, and ST-475, all reported for the first time. ST-474 was a double locus variant of three different STs of global clonal complex ST-143 considered to be associated with clinical and subclinical mastitis, but ST-439 and ST-475 were singletons. Unique sequence types identified for both S. agalactiae and S. uberis were found to be herd specific. On PFGE analysis, identical or closely related restriction patterns for S. agalactiae ST-483 and S. uberis ST-439 in herds A and B respectively, but an unrelated restriction pattern for S. uberis ST-474 and ST-475 isolates from herds D and C respectively, were obtained. This signifies that the isolates of particular ST may exhibit related PFGE patterns suggesting detection of a faster molecular clock by PFGE than MLST. Since all the isolates of both the species belonged to novel sequence types, their epidemiological significance in global context could not be ascertained, however, evidence suggests that they have uniquely evolved in Indian conditions. Further research would be useful for understanding the role of these pathogens in bovine sub-clinical mastitis and implementing effective control strategies in India.

  10. Program Fair Evaluation--Summative Appraisal of Instructional Sequences with Dissimilar Objectives.

    ERIC Educational Resources Information Center

    Popham, W. James

    A comparative evaluation involving two instructional programs is given, although the approach can easily serve to compare more than two programs. The steps involved in conducting a program fair evaluation of two instructional programs are: (1) Identify objectives (a) common to both programs, (b) unique to one program, and (c) unique to the other…

  11. A porcine G9 rotavirus strain shares neutralization and VP7 phylogenetic sequence lineage 3 characteristics with contemporary human G9 rotavirus strains.

    PubMed

    Hoshino, Yasutaka; Honma, Shinjiro; Jones, Ronald W; Ross, Jerri; Santos, Norma; Gentsch, Jon R; Kapikian, Albert Z; Hesse, Richard A

    2005-02-05

    Of five globally important VP7 (G) serotypes (G1-4 and 9) of group A rotaviruses (the single most important etiologic agents of infantile diarrhea worldwide), G9 continues to attract considerable attention because of its unique natural history. Serotype G9 rotavirus was isolated from a child with diarrhea first in the United States in 1983 and subsequently in Japan in 1985. Curiously, soon after their detection, G9 rotaviruses were not detected for about a decade in both countries and then reemerged in both countries in the mid-1990s. Unexpectedly, however, such reemerged G9 strains were distinct genetically and molecularly from those isolated in the 1980s. Thus, the origin of the reemerged G9 viruses remains an enigma. Sequence analysis has demonstrated that the G9 rotavirus VP7 gene belongs to one of at least three phylogenetic lineages: lineage 1 (strains isolated in the 1980s in the United States and Japan), lineage 2 (strains first isolated in 1986 and exclusively in India thus far), and lineage 3 (strains that emerged/reemerged in the mid-1990s). Currently, lineage 3 G9 viruses are the most frequently detected G9 strains globally. We characterized a porcine rotavirus (A2 strain) isolated in the United States that was known to belong to the P[7] genotype but had not been serotyped by neutralization. The A2 strain was found to bear serotype G9 and P9 specificities as well as NSP4 [B] and subgroup I characteristics. By VP7-specific neutralization, the porcine G9 strain was more closely related to lineage 3 viruses than to lineage 1 or 2 viruses. Furthermore, by sequence analysis, the A2 VP7 was shown to belong to lineage 3 G9. These findings raise intriguing questions regarding possible explanations for the emergence of variations among the G9 strains.

  12. Whole-Genome Phylogenetic Analysis of Influenza B/Phuket/3073/2013-Like Viruses and Unique Reassortants Detected in Malaysia between 2012 and 2014

    PubMed Central

    Tan, Joon Ling; Chan, Kok Gan; Kamarulzaman, Adeeba; Chan, Yoke Fun; Sam, I-Ching; Tee, Kok Keng

    2017-01-01

    Reassortment of genetic segments between and within influenza B lineages (Victoria and Yamagata) has been shown to generate novel reassortants with unique genetic characteristics. Based on hemagglutinin (HA) and neuraminidase (NA) genes, recent surveillance study has identified reassortment properties in B/Phuket/3073/2013-like virus, which is currently used in the WHO-recommended influenza vaccine. To understand the potential reassortment patterns for all gene segments, four B/Phuket/3073/2013-like viruses and two unique reassortants (one each from Yamagata and Victoria) detected in Malaysia from 2012–2014 were subjected to whole-genome sequencing. Each gene was phylogenetically classified into lineages, clades and sub-clades. Three B/Phuket/3073/2013-like viruses from Yamagata lineage were found to be intra-clade reassortants, possessing PA and NA genes derived from Stockholm/12-like sub-clade, while the remaining genes from Wisconsin/01-like sub-clade (both sub-clades were within Yamagata Clade 3/Yam-3). However, the other B/Phuket/3073/2013-like virus had NS gene that derived from Stockholm/12-like sub-clade instead of Wisconsin/01-like sub-clade. One inter-clade reassortant had Yamagata Clade 2/Yam-2-derived HA and NP, and its remaining genes were Yam-3-derived. Within Victoria Clade 1/Vic-1 in Victoria lineage, one virus had intra-clade reassortment properties: HA and PB2 from Vic-1B sub-clade, MP and NS from a unique sub-clade “Vic-1C”, and the remaining genes from Vic-1A sub-clade. Although random reassortment event may generate unique reassortants, detailed phylogenetic classification of gene segments showed possible genetic linkage between PA and NA genes in B/Phuket/3073/2013-like viruses, which requires further investigation. Understanding on reassortment patterns in influenza B evolution may contribute to future vaccine design. PMID:28129386

  13. A disruptive sequencer meets disruptive publishing.

    PubMed

    Loman, Nick; Goodwin, Sarah; Jansen, Hans; Loose, Matt

    2015-01-01

    Nanopore sequencing was recently made available to users in the form of the Oxford Nanopore MinION. Released to users through an early access programme, the MinION is made unique by its tiny form factor and ability to generate very long sequences from single DNA molecules. The platform is undergoing rapid evolution with three distinct nanopore types and five updates to library preparation chemistry in the last 18 months. To keep pace with the rapid evolution of this sequencing platform, and to provide a space where new analysis methods can be openly discussed, we present a new F1000Research channel devoted to updates to and analysis of nanopore sequence data.

  14. Probability of coding of a DNA sequence: an algorithm to predict translated reading frames from their thermodynamic characteristics.

    PubMed Central

    Tramontano, A; Macchiato, M F

    1986-01-01

    An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761

  15. Sequence and structural implications of a bovine corneal keratan sulfate proteoglycan core protein. Protein 37B represents bovine lumican and proteins 37A and 25 are unique

    NASA Technical Reports Server (NTRS)

    Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.

  16. Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

    PubMed

    Yu, Zhongtang; Yu, Marie; Morrison, Mark

    2006-04-01

    Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.

  17. ``Sequence space soup'' of proteins and copolymers

    NASA Astrophysics Data System (ADS)

    Chan, Hue Sun; Dill, Ken A.

    1991-09-01

    To study the protein folding problem, we use exhaustive computer enumeration to explore ``sequence space soup,'' an imaginary solution containing the ``native'' conformations (i.e., of lowest free energy) under folding conditions, of every possible copolymer sequence. The model is of short self-avoiding chains of hydrophobic (H) and polar (P) monomers configured on the two-dimensional square lattice. By exhaustive enumeration, we identify all native structures for every possible sequence. We find that random sequences of H/P copolymers will bear striking resemblance to known proteins: Most sequences under folding conditions will be approximately as compact as known proteins, will have considerable amounts of secondary structure, and it is most probable that an arbitrary sequence will fold to a number of lowest free energy conformations that is of order one. In these respects, this simple model shows that proteinlike behavior should arise simply in copolymers in which one monomer type is highly solvent averse. It suggests that the structures and uniquenesses of native proteins are not consequences of having 20 different monomer types, or of unique properties of amino acid monomers with regard to special packing or interactions, and thus that simple copolymers might be designable to collapse to proteinlike structures and properties. A good strategy for designing a sequence to have a minimum possible number of native states is to strategically insert many P monomers. Thus known proteins may be marginally stable due to a balance: More H residues stabilize the desired native state, but more P residues prevent simultaneous stabilization of undesired native states.

  18. A new buckwheat dihydroflavonol 4-reductase (DFR), with a unique substrate binding structure, has altered substrate specificity.

    PubMed

    Katsu, Kenjiro; Suzuki, Rintaro; Tsuchiya, Wataru; Inagaki, Noritoshi; Yamazaki, Toshimasa; Hisano, Tomomi; Yasui, Yasuo; Komori, Toshiyuki; Koshio, Motoyuki; Kubota, Seiji; Walker, Amanda R; Furukawa, Kiyoshi; Matsui, Katsuhiro

    2017-12-11

    Dihydroflavonol 4-reductase (DFR) is the key enzyme committed to anthocyanin and proanthocyanidin biosynthesis in the flavonoid biosynthetic pathway. DFR proteins can catalyse mainly the three substrates (dihydrokaempferol, dihydroquercetin, and dihydromyricetin), and show different substrate preferences. Although relationships between the substrate preference and amino acids in the region responsible for substrate specificity have been investigated in several plant species, the molecular basis of the substrate preference of DFR is not yet fully understood. By using degenerate primers in a PCR, we isolated two cDNA clones that encoded DFR in buckwheat (Fagopyrum esculentum). Based on sequence similarity, one cDNA clone (FeDFR1a) was identical to the FeDFR in DNA databases (DDBJ/Gen Bank/EMBL). The other cDNA clone, FeDFR2, had a similar sequence to FeDFR1a, but a different exon-intron structure. Linkage analysis in an F 2 segregating population showed that the two loci were linked. Unlike common DFR proteins in other plant species, FeDFR2 contained a valine instead of the typical asparagine at the third position and an extra glycine between sites 6 and 7 in the region that determines substrate specificity, and showed less activity against dihydrokaempferol than did FeDFR1a with an asparagine at the third position. Our 3D model suggested that the third residue and its neighbouring residues contribute to substrate specificity. FeDFR1a was expressed in all organs that we investigated, whereas FeDFR2 was preferentially expressed in roots and seeds. We isolated two buckwheat cDNA clones of DFR genes. FeDFR2 has unique structural and functional features that differ from those of previously reported DFRs in other plants. The 3D model suggested that not only the amino acid at the third position but also its neighbouring residues that are involved in the formation of the substrate-binding pocket play important roles in determining substrate preferences. The unique characteristics of FeDFR2 would provide a useful tool for future studies on the substrate specificity and organ-specific expression of DFRs.

  19. Restricted transfer of learning between unimanual and bimanual finger sequences.

    PubMed

    Yokoi, Atsushi; Bai, Wenjun; Diedrichsen, Jörn

    2017-03-01

    When training bimanual skills, such as playing piano, people sometimes practice each hand separately and at a later stage combine the movements of the two hands. This poses the critical question of whether motor skills can be acquired by separately practicing each subcomponent or should be trained as a whole. In the present study, we addressed this question by training human subjects for 4 days in a unimanual or bimanual version of the discrete sequence production task. Both groups were then tested on trained and untrained sequences on both unimanual and bimanual versions of the task. Surprisingly, we found no evidence of transfer from trained unimanual to bimanual or from trained bimanual to unimanual sequences. In half the participants, we also investigated whether cuing the sequences on the left and right hand with unique letters would change transfer. With these cues, untrained sequences that shared some components with the trained sequences were performed more quickly than sequences that did not. However, the amount of this transfer was limited to ∼10% of the overall sequence-specific learning gains. These results suggest that unimanual and bimanual sequences are learned in separate representations. Making participants aware of the interrelationship between sequences can induce some transferrable component, although the main component of the skill remains unique to unimanual or bimanual execution. NEW & NOTEWORTHY Studies in reaching movement demonstrated that approximately half of motor learning can transfer across unimanual and bimanual contexts, suggesting that neural representations for unimanual and bimanual movements are fairly overlapping at the level of elementary movement. In this study, we show that little or no transfer occurred across unimanual and bimanual sequential finger movements. This result suggests that bimanual sequences are represented at a level of the motor hierarchy that integrates movements of both hands. Copyright © 2017 the American Physiological Society.

  20. Purification and characterization of a pilin specific for Brazilian purpuric fever-associated Haemophilus influenzae biogroup aegyptius (H. aegyptius) strains.

    PubMed Central

    Weyant, R S; Bibb, W F; Stephens, D S; Holloway, B P; Moo-Penn, W F; Birkness, K A; Helsel, L O; Mayer, L W

    1990-01-01

    Brazilian purpuric fever (BPF) is a recently described fatal pediatric disease caused by systemic infection with Haemophilus influenzae biogroup aegyptius. Previous studies have shown that all H. influenzae biogroup aegyptius strains isolated from BPF cases and case contacts share several unique phenotypic and genotypic characteristics that differentiate them from other H. influenzae biogroup aegyptius strains isolated from conjunctivitis cases in Brazil. One key characteristic of this BPF clone is reactivity in a BPF-specific monoclonal antibody enzyme-linked immunosorbent assay. We have purified and partially characterized a pilin, referred to as the 25-kilodalton (kDa) protein. Aggregates of this protein contain a heat-labile epitope which is recognized by a monoclonal antibody used in the BPF-specific enzyme-linked immunosorbent assay. The protein has a molecular weight of approximately 25,000, is insoluble in most detergents, and fractionates with outer membrane vesicles after LiCl extraction. Biochemical analysis of the 25-kDa protein shows it to have an amino acid composition similar but not identical to that of the H. influenzae type b pilin. The sequence of 20 N-terminal amino acids of the 25-kDa protein shows almost complete homology with the N terminus of the H. influenzae type b pilin and the types 1 and P pilins of Escherichia coli. Transmission electron microscopic analysis of the purified protein shows the presence of filamentous structures similar in morphology to those of H. influenzae pili. Reactivity between the 25-kDa protein and the BPF-specific monoclonal antibody is demonstrated by Western blotting (immunoblotting) and colloidal gold-enhanced immunoelectron microscopy. Hemadsorption analysis shows that expression of this protein is associated with increases in piliated cells and enhanced binding of these cells to human erythrocytes. These studies indicate that expression of the 25-kDa protein is a characteristic unique to the BPF clone and suggest that this protein plays a role in the pathogenesis of BPF. Images PMID:1970577

  1. De Novo Transcriptome Sequencing Reveals Important Molecular Networks and Metabolic Pathways of the Plant, Chlorophytum borivilianum

    PubMed Central

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689

  2. Optimization of sequence alignment for simple sequence repeat regions.

    PubMed

    Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

    2011-07-20

    Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.

  3. The gene space in wheat: the complete γ-gliadin gene family from the wheat cultivar Chinese Spring.

    PubMed

    Anderson, Olin D; Huo, Naxin; Gu, Yong Q

    2013-06-01

    The complete set of unique γ-gliadin genes is described for the wheat cultivar Chinese Spring using a combination of expressed sequence tag (EST) and Roche 454 DNA sequences. Assemblies of Chinese Spring ESTs yielded 11 different γ-gliadin gene sequences. Two of the sequences encode identical polypeptides and are assumed to be the result of a recent gene duplication. One gene has a 3' coding mutation that changes the reading frame in the final eight codons. A second assembly of Chinese Spring γ-gliadin sequences was generated using Roche 454 total genomic DNA sequences. The 454 assembly confirmed the same 11 active genes as the EST assembly plus two pseudogenes not represented by ESTs. These 13 γ-gliadin sequences represent the complete unique set of γ-gliadin genes for cv Chinese Spring, although not ruled out are additional genes that are exact duplications of these 13 genes. A comparison with the ESTs of two other hexaploid cultivars (Butte 86 and Recital) finds that the most active genes are present in all three cultivars, with exceptions likely due to too few ESTs for detection in Butte 86 and Recital. A comparison of the numbers of ESTs per gene indicates differential levels of expression within the γ-gliadin gene family. Genome assignments were made for 6 of the 13 Chinese Spring γ-gliadin genes, i.e., one assignment from a match to two γ-gliadin genes found within a tetraploid wheat A genome BAC and four genes that match four distinct γ-gliadin sequences assembled from Roche 454 sequences from Aegilops tauschii, the hexaploid wheat D-genome ancestor.

  4. De Novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum.

    PubMed

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.

  5. Surface display of a massively variable lipoprotein by a Legionella diversity-generating retroelement.

    PubMed

    Arambula, Diego; Wong, Wenge; Medhekar, Bob A; Guo, Huatao; Gingery, Mari; Czornyj, Elizabeth; Liu, Minghsun; Dey, Sanghamitra; Ghosh, Partho; Miller, Jeff F

    2013-05-14

    Diversity-generating retroelements (DGRs) are a unique family of retroelements that confer selective advantages to their hosts by facilitating localized DNA sequence evolution through a specialized error-prone reverse transcription process. We characterized a DGR in Legionella pneumophila, an opportunistic human pathogen that causes Legionnaires disease. The L. pneumophila DGR is found within a horizontally acquired genomic island, and it can theoretically generate 10(26) unique nucleotide sequences in its target gene, legionella determinent target A (ldtA), creating a repertoire of 10(19) distinct proteins. Expression of the L. pneumophila DGR resulted in transfer of DNA sequence information from a template repeat to a variable repeat (VR) accompanied by adenine-specific mutagenesis of progeny VRs at the 3'end of ldtA. ldtA encodes a twin-arginine translocated lipoprotein that is anchored in the outer leaflet of the outer membrane, with its C-terminal variable region surface exposed. Related DGRs were identified in L. pneumophila clinical isolates that encode unique target proteins with homologous VRs, demonstrating the adaptability of DGR components. This work characterizes a DGR that diversifies a bacterial protein and confirms the hypothesis that DGR-mediated mutagenic homing occurs through a conserved mechanism. Comparative bioinformatics predicts that surface display of massively variable proteins is a defining feature of a subset of bacterial DGRs.

  6. Methods for chromosome-specific staining

    DOEpatents

    Gray, Joe W.; Pinkel, Daniel

    1995-01-01

    Methods and compositions for chromosome-specific staining are provided. Compositions comprise heterogenous mixtures of labeled nucleic acid fragments having substantially complementary base sequences to unique sequence regions of the chromosomal DNA for which their associated staining reagent is specific. Methods include methods for making the chromosome-specific staining compositions of the invention, and methods for applying the staining compositions to chromosomes.

  7. Genetic Diversity of Bacterial Communities and Gene Transfer Agents in Northern South China Sea

    PubMed Central

    Sun, Fu-Lin; Wang, You-Shao; Wu, Mei-Lin; Jiang, Zhao-Yu; Sun, Cui-Ci; Cheng, Hao

    2014-01-01

    Pyrosequencing of the 16S ribosomal RNA gene (rDNA) amplicons was performed to investigate the unique distribution of bacterial communities in northern South China Sea (nSCS) and evaluate community structure and spatial differences of bacterial diversity. Cyanobacteria, Proteobacteria, Actinobacteria, and Bacteroidetes constitute the majority of bacteria. The taxonomic description of bacterial communities revealed that more Chroococcales, SAR11 clade, Acidimicrobiales, Rhodobacterales, and Flavobacteriales are present in the nSCS waters than other bacterial groups. Rhodobacterales were less abundant in tropical water (nSCS) than in temperate and cold waters. Furthermore, the diversity of Rhodobacterales based on the gene transfer agent (GTA) major capsid gene (g5) was investigated. Four g5 gene clone libraries were constructed from samples representing different regions and yielded diverse sequences. Fourteen g5 clusters could be identified among 197 nSCS clones. These clusters were also related to known g5 sequences derived from genome-sequenced Rhodobacterales. The composition of g5 sequences in surface water varied with the g5 sequences in the sampling sites; this result indicated that the Rhodobacterales population could be highly diverse in nSCS. Phylogenetic tree analysis result indicated distinguishable diversity patterns among tropical (nSCS), temperate, and cold waters, thereby supporting the niche adaptation of specific Rhodobacterales members in unique environments. PMID:25364820

  8. A novel approach for monitoring genetically engineered microorganisms by using artificial, stable RNAs

    NASA Technical Reports Server (NTRS)

    Pitulle, C.; Hedenstierna, K. O.; Fox, G. E.

    1995-01-01

    Further improvements in technology for efficient monitoring of genetically engineered microorganisms (GEMs) in the environment are needed. Technology for monitoring rRNA is well established but has not generally been applicable to GEMs because of the lack of unique rRNA target sequences. In the work described herein, it is demonstrated that a deletion mutant of a plasmid-borne Vibrio proteolyticus 5S rRNA gene continues to accumulate to high levels in Escherichia coli although it is no longer incorporated into 70S ribosomes. This deletion construct was subsequently modified by mutagenesis to create a unique recognition site for the restriction endonuclease BstEII, into which new sequences could be readily inserted. Finally, a novel 17-nucleotide identifier sequence from Pennisetum purpureum was embedded into the construct to create an RNA identification cassette. The artificial identifier RNA, expressed from this cassette in vivo, accumulated in E. coli to levels comparable to those of wild-type 5S rRNA without being seriously detrimental to cell survival in laboratory experiments and without entering the ribosomes. These results demonstrate that artificial, stable RNAs containing sequence segments remarkably different from those present in any known rRNA can be designed and that neither the deleted sequence segment nor ribosome incorporation is essential for accumulation of an RNA product.

  9. Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level Genomes for All Living Bat Species.

    PubMed

    Teeling, Emma C; Vernes, Sonja C; Dávalos, Liliana M; Ray, David A; Gilbert, M Thomas P; Myers, Eugene

    2018-02-15

    Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n∼1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.

  10. Plasmid Characterization and Chromosome Analysis of Two netF+ Clostridium perfringens Isolates Associated with Foal and Canine Necrotizing Enteritis.

    PubMed

    Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F

    2016-01-01

    The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.

  11. Structural characteristics of ScBx genes controlling the biosynthesis of hydroxamic acids in rye (Secale cereale L.).

    PubMed

    Bakera, Beata; Makowska, Bogna; Groszyk, Jolanta; Niziołek, Michał; Orczyk, Wacław; Bolibok-Brągoszewska, Hanna; Hromada-Judycka, Aneta; Rakoczy-Trojanowska, Monika

    2015-08-01

    Benzoxazinoids (BX) are major secondary metabolites of gramineous plants that play an important role in disease resistance and allelopathy. They also have many other unique properties including anti-bacterial and anti-fungal activity, and the ability to reduce alfa-amylase activity. The biosynthesis and modification of BX are controlled by the genes Bx1 ÷ Bx10, GT and glu, and the majority of these Bx genes have been mapped in maize, wheat and rye. However, the genetic basis of BX biosynthesis remains largely uncharacterized apart from some data from maize and wheat. The aim of this study was to isolate, sequence and characterize five genes (ScBx1, ScBx2, ScBx3, ScBx4 and ScBx5) encoding enzymes involved in the synthesis of DIBOA, an important defense compound of rye. Using a modified 3D procedure of BAC library screening, seven BAC clones containing all of the ScBx genes were isolated and sequenced. Bioinformatic analyses of the resulting contigs were used to examine the structure and other features of these genes, including their promoters, introns and 3'UTRs. Comparative analysis showed that the ScBx genes are similar to those of other Poaceae species, especially to the TaBx genes. The polymorphisms present both in the coding sequences and non-coding regions of ScBx in relation to other Bx genes are predicted to have an impact on the expression, structure and properties of the encoded proteins.

  12. Whole genome analysis of an MDR Beijing/W strain of Mycobacterium tuberculosis with large genomic deletions associated with resistance to isoniazid.

    PubMed

    Zhang, Qiufen; Wan, Baoshan; Zhou, Aiping; Ni, Jinjing; Xu, Zhihong; Li, Shuxian; Tao, Jing; Yao, YuFeng

    2016-05-15

    Mycobacterium tuberculosis (M.tb) is one of the most prevalent bacterial pathogens in the world. With geographical wide spread and hypervirulence, Beijing/W family is the most successful M.tb lineage. China is a country of high tuberculosis (TB) and high multiple drug-resistant TB (MDR-TB) burden, and the Beijing/W family strains take the largest share of MDR strains. To study the genetic basis of Beijing/W family strains' virulence and drug resistance, we performed the whole genome sequencing of M.tb strain W146, a clinical Beijing/W genotype MDR isolated from Wuxi, Jiangsu province, China. Compared with genome sequence of M.tb strain H37Rv, we found that strain W146 lacks three large fragments and the missing of furA-katG operon confers isoniazid resistance. Besides the missing of furA-katG operon, strain W146 harbored almost all known drug resistance-associated mutations. Comparison analysis of single nucleotide polymorphisms (SNPs) and indels between strain W146 and Beijing/W genotype strains and non-Beijing/W genotype strains revealed that strain W146 possessed some unique mutations, which may be related to drug resistance, transmission and pathogenicity. These findings will help to understand the large sequence polymorphisms (LSPs) and the transmission and drug resistance related genetic characteristics of the Beijing/W genotype of M.tb. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. New genetic lineage within the Siberian subtype of tick-borne encephalitis virus found in Western Siberia, Russia.

    PubMed

    Tkachev, Sergey E; Chicherina, Galina S; Golovljova, Irina; Belokopytova, Polina S; Tikunov, Artem Yu; Zadora, Oksana V; Glupov, Victor V; Tikunova, Nina V

    2017-12-01

    Tick-borne encephalitis virus (TBEV), a member of the Flaviviridae family, is a causative agent of a severe neurological disease. There are three main TBEV subtypes: the European (TBEV-Eu), Far Eastern (TBEV-FE), and Siberian (TBEV-Sib). Currently, three lineages within TBEV-Sib have been recorded. In this study, the genetic and biological characteristics of a new original strain, TBEV-2871, isolated in the Novosibirsk province of Western Siberia, Russia were investigated. The strain has low neuroinvasiveness in mice. Phylogenetic analysis demonstrated that TBEV-2871 belongs to TBEV-Sib, but does not cluster with any of the TBEV-Sib lineages. The TBEV-2871 strain has 88-89% nucleotide sequence identity with the other TBEV-Sib strains, 84-86% nucleotide sequence identity with the TBEV-FE and TBEV-Eu subtypes and is genetically close to the subtype division border. The TBEV-2871 polyprotein sequence includes 43 unique amino acid substitutions, 30 of which are recorded at positions that are conserved among all TBEV subtypes. Strain TBEV-2871 and two similar but not identical isolates found in Kemerovo province, Western Siberia are separated into a new lineage tentatively named Obskaya after the name of Ob riber, in the vicinity of which the TBEV-2871 was first found. A molecular evolution investigation demonstrated that within TBEV-Sib, the Obskaya lineage likely separated 1535years ago, which is even earlier than the Baltic lineage. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Where's the Psychology? A Commentary on "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art"

    ERIC Educational Resources Information Center

    Leighton, Jacqueline P.

    2008-01-01

    In this commentary, the author asks the analogous question, "where's the psychology?" Not because the authors of the focus article "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art" have not provided a solid review of the technical aspects of Diagnostic…

  15. Operational Suitability Guide. Volume 2. Templates

    DTIC Science & Technology

    1990-05-01

    Intended mission, and the required technical and operational characteristics. The mission must be adequately defined and key hardware and software ...operational availability. With the use of fault-tolerant computer hardware and software , the system R&M will significantly improve end-to-end...should Include both hardware and software elements, as appropriate. Unique characteristics or unique support concepts should be Identified if they result

  16. Community-level characteristics associated with variation in rates of homelessness among families and single adults.

    PubMed

    Fargo, Jamison D; Munley, Ellen A; Byrne, Thomas H; Montgomery, Ann Elizabeth; Culhane, Dennis P

    2013-12-01

    We modeled rates of family and single-adult homelessness in the United States in metropolitan and nonmetropolitan regions and as a function of community-level demographic, behavioral, health, economic, and safety net characteristics. We entered community-level characteristics and US Department of Housing and Urban Development point-in-time counts for a single night in January 2009 into separate mixed-effects statistical analyses that modeled homelessness rates for 4 subpopulations: families and single adults in metropolitan and nonmetropolitan regions. Community-level factors accounted for 25% to 50% of the variance in homelessness rates across models. In metropolitan regions, alcohol consumption, social support, and several economic indicators were uniquely associated with family homelessness, and drug use and homicide were uniquely associated with single-adult homelessness. In nonmetropolitan regions, life expectancy, religious adherence, unemployment, and rent burden were uniquely associated with family homelessness, and health care access, crime, several economic indicators, and receipt of Supplemental Security Income were uniquely associated with single-adult homelessness. Considering homeless families and single adults separately enabled more precise modeling of associations between homelessness rates and community-level characteristics, indicating targets for interventions to reduce homelessness among these subpopulations.

  17. Community-Level Characteristics Associated With Variation in Rates of Homelessness Among Families and Single Adults

    PubMed Central

    Fargo, Jamison D.; Munley, Ellen A.; Byrne, Thomas H.; Montgomery, Ann Elizabeth; Culhane, Dennis P.

    2013-01-01

    Objectives. We modeled rates of family and single-adult homelessness in the United States in metropolitan and nonmetropolitan regions and as a function of community-level demographic, behavioral, health, economic, and safety net characteristics. Methods. We entered community-level characteristics and US Department of Housing and Urban Development point-in-time counts for a single night in January 2009 into separate mixed-effects statistical analyses that modeled homelessness rates for 4 subpopulations: families and single adults in metropolitan and nonmetropolitan regions. Results. Community-level factors accounted for 25% to 50% of the variance in homelessness rates across models. In metropolitan regions, alcohol consumption, social support, and several economic indicators were uniquely associated with family homelessness, and drug use and homicide were uniquely associated with single-adult homelessness. In nonmetropolitan regions, life expectancy, religious adherence, unemployment, and rent burden were uniquely associated with family homelessness, and health care access, crime, several economic indicators, and receipt of Supplemental Security Income were uniquely associated with single-adult homelessness. Conclusions. Considering homeless families and single adults separately enabled more precise modeling of associations between homelessness rates and community-level characteristics, indicating targets for interventions to reduce homelessness among these subpopulations. PMID:24148057

  18. Cross-Species Analyses Identify the BNIP-2 and Cdc42GAP Homology (BCH) Domain as a Distinct Functional Subclass of the CRAL_TRIO/Sec14 Superfamily

    PubMed Central

    Gupta, Anjali Bansal; Wee, Liang En; Zhou, Yi Ting; Hortsch, Michael; Low, Boon Chuan

    2012-01-01

    The CRAL_TRIO protein domain, which is unique to the Sec14 protein superfamily, binds to a diverse set of small lipophilic ligands. Similar domains are found in a range of different proteins including neurofibromatosis type-1, a Ras GTPase-activating Protein (RasGAP) and Rho guanine nucleotide exchange factors (RhoGEFs). Proteins containing this structural protein domain exhibit a low sequence similarity and ligand specificity while maintaining an overall characteristic three-dimensional structure. We have previously demonstrated that the BNIP-2 and Cdc42GAP Homology (BCH) protein domain, which shares a low sequence homology with the CRAL_TRIO domain, can serve as a regulatory scaffold that binds to Rho, RhoGEFs and RhoGAPs to control various cell signalling processes. In this work, we investigate 175 BCH domain-containing proteins from a wide range of different organisms. A phylogenetic analysis with ∼100 CRAL_TRIO and similar domains from eight representative species indicates a clear distinction of BCH-containing proteins as a novel subclass within the CRAL_TRIO/Sec14 superfamily. BCH-containing proteins contain a hallmark sequence motif R(R/K)h(R/K)(R/K)NL(R/K)xhhhhHPs (‘h’ is large and hydrophobic residue and ‘s’ is small and weekly polar residue) and can be further subdivided into three unique subtypes associated with BNIP-2-N, macro- and RhoGAP-type protein domains. A previously unknown group of genes encoding ‘BCH-only’ domains is also identified in plants and arthropod species. Based on an analysis of their gene-structure and their protein domain context we hypothesize that BCH domain-containing genes evolved through gene duplication, intron insertions and domain swapping events. Furthermore, we explore the point of divergence between BCH and CRAL-TRIO proteins in relation to their ability to bind small GTPases, GAPs and GEFs and lipid ligands. Our study suggests a need for a more extensive analysis of previously uncharacterized BCH, ‘BCH-like’ and CRAL_TRIO-containing proteins and their significance in regulating signaling events involving small GTPases. PMID:22479462

  19. Denitrification and Phosphorus Sequestration in Restored Oyster Beds in the Indian River Lagoon, Florida, USA

    NASA Astrophysics Data System (ADS)

    Gallagher, S. M.; Schmidt, C. A.; Walters, L.

    2016-12-01

    In 2016, an algae bloom in the St. Lucie River in Florida led the governor to declare a state of emergency. The river is part of a connected system of estuaries along the Atlantic coast of Florida called the Indian River Lagoon (IRL). As with many estuaries around the world, nutrient loading in the IRL has led to periodic eutrophication. As a result, much research has been done to address nutrients in these systems. Previous estuary studies have related oyster restoration to denitrification and phosphorus sequestration in their bed sediment. To this point, these studies have been inconclusive, and have only focused on seasonal variation in nutrient cycling. In 2007, yearly oyster bed installation and restoration began in a study area in the IRL. By 2016, beds aged up to eleven years were available for sampling. This unique advantage allowed investigation of bed sediment and nutrient cycling over long periods of time. Sediment from the IRL was measured for organic matter, microbial weight, carbon, nitrogen, and phosphorus. Denitrification was measured using an acetylene block technique. A statistical analysis was used to find differences in sediment characteristics and denitrification between restored beds and control sites over time. In addition, sequencing of 16S rRNA DNA and a variety of denitrifying genes was used to identify bacterial species and their denitrifying capability in the sediment. The ability to sequence denitrification genes in established oyster beds over a period of years was also unique to this study. Significant differences were found in soil properties, denitrification rates, and phosphorus sequestration between control sites and restored oyster beds. Gene sequencing also found differences in bacterial populations between the sites. Oyster bed restoration resulted in a rapid increase in nutrient removal as beds developed over three years, but additional benefits were limited as restoration progressed further. This study adds an investigation of IRL oysters to existing knowledge of nutrient removal by oysters in other estuaries. These results help clarify single year studies focused on seasonal changes by showing a rapid increase in oyster bed nutrient removal over a period of three years.

  20. Characterization of casein and alpha lactalbumin of African elephant (Loxodonta africana) milk.

    PubMed

    Madende, M; Osthoff, G; Patterton, H-G; Patterton, H E; Martin, P; Opperman, D J

    2015-12-01

    The current research reports partial characterization of the caseins and α-lactalbumin (α-LA) of the African elephant with proposed unique structure-function properties. Extensive research has been carried out to understand the structure of the casein micelles. Crystallographic structure elucidation of caseins and casein micelles is not possible. Consequently, several models have been developed in an effort to describe the casein micelle, specifically of cow milk. Here we report the characterization of African elephant milk caseins. The κ-caseins and β-caseins were investigated, and their relative ratio was found to be approximately 1:8.5, whereas α-caseins were not detected. The gene sequence of β-casein in the NCBI database was revisited, and a different sequence in the N-terminal region is proposed. Amino acid sequence alignment and hydropathy plots showed that the κ-casein of African elephant milk is similar to that of other mammals, whereas the β-casein is similar to the human protein, and displayed a section of unique AA composition and additional hydrophilic regions compared with bovine caseins. Elephant milk is destabilized by 62% alcohol, and it is speculated that the β-casein characteristics may allow maintenance of the colloidal nature of the casein micelle, a role that was previously only associated with κ-casein. The oligosaccharide content of milk was reported to be low in dairy animals but high in some other species such as humans and elephants. In the milk of the African elephant, lactose and oligosaccharides both occur at high levels. These levels are typically related to the content of α-LA in the mammary gland and thus point to a specialized carbohydrate synthesis, where the whey protein α-LA plays a role. We report the characterization of African elephant α-LA. Homology modeling of the α-LA showed that it is structurally similar to crystal structures of other mammalian species, which in turn may be an indication that its functional properties, such as lactose synthesis, should not be impaired. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  1. Molecular identification of Armillaria gallica from the Niobrara Valley Preserve in Nebraska

    Treesearch

    Mee-Sook Kim; Ned B. Klopfenstein

    2011-01-01

    Armillaria isolates were collected from a unique forest ecosystem in the Niobrara Valley Preserve in Nebraska, USA, which comprises a glacial and early postglacial refugium in the central plains of North America. The isolates were collected from diverse forest trees representing a unique mixture of forest types. Combined methods of rDNA sequencing and flow cytometric...

  2. Y and W Chromosome Assemblies: Approaches and Discoveries.

    PubMed

    Tomaszkiewicz, Marta; Medvedev, Paul; Makova, Kateryna D

    2017-04-01

    Hundreds of vertebrate genomes have been sequenced and assembled to date. However, most sequencing projects have ignored the sex chromosomes unique to the heterogametic sex - Y and W - that are known as sex-limited chromosomes (SLCs). Indeed, haploid and repetitive Y chromosomes in species with male heterogamety (XY), and W chromosomes in species with female heterogamety (ZW), are difficult to sequence and assemble. Nevertheless, obtaining their sequences is important for understanding the intricacies of vertebrate genome function and evolution. Recent progress has been made towards the adaptation of next-generation sequencing (NGS) techniques to deciphering SLC sequences. We review here currently available methodology and results with regard to SLC sequencing and assembly. We focus on vertebrates, but bring in some examples from other taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. New biometric modalities using internal physical characteristics

    NASA Astrophysics Data System (ADS)

    Mortenson, Juliana (Brooks)

    2010-04-01

    Biometrics is described as the science of identifying people based on physical characteristics such as their fingerprints, facial features, hand geometry, iris patterns, palm prints, or speech recognition. Notably, all of these physical characteristics are visible or detectable from the exterior of the body. These external characteristics can be lifted, photographed, copied or recorded for unauthorized access to a biometric system. Individual humans are unique internally, however, just as they are unique externally. New biometric modalities have been developed which identify people based on their unique internal characteristics. For example, "BoneprintsTM" use acoustic fields to scan the unique bone density pattern of a thumb pressed on a small acoustic sensor. Thanks to advances in piezoelectric materials the acoustic sensor can be placed in virtually any device such as a steering wheel, door handle, or keyboard. Similarly, "Imp-PrintsTM" measure the electrical impedance patterns of a hand to identify or verify a person's identity. Small impedance sensors can be easily embedded in devices such as smart cards, handles, or wall mounts. These internal biometric modalities rely on physical characteristics which are not visible or photographable, providing an added level of security. In addition, both the acoustic and impedance methods can be combined with physiologic measurements such as acoustic Doppler or impedance plethysmography, respectively. Added verification that the biometric pattern came from a living person can be obtained. These new biometric modalities have the potential to allay user concerns over protection of privacy, while providing a higher level of security.*

  4. GENESUS: a two-step sequence design program for DNA nanostructure self-assembly.

    PubMed

    Tsutsumi, Takanobu; Asakawa, Takeshi; Kanegami, Akemi; Okada, Takao; Tahira, Tomoko; Hayashi, Kenshi

    2014-01-01

    DNA has been recognized as an ideal material for bottom-up construction of nanometer scale structures by self-assembly. The generation of sequences optimized for unique self-assembly (GENESUS) program reported here is a straightforward method for generating sets of strand sequences optimized for self-assembly of arbitrarily designed DNA nanostructures by a generate-candidates-and-choose-the-best strategy. A scalable procedure to prepare single-stranded DNA having arbitrary sequences is also presented. Strands for the assembly of various structures were designed and successfully constructed, validating both the program and the procedure.

  5. Genetic architecture of the Delis-Kaplan Executive Function System Trail Making Test: evidence for distinct genetic influences on executive function.

    PubMed

    Vasilopoulos, Terrie; Franz, Carol E; Panizzon, Matthew S; Xian, Hong; Grant, Michael D; Lyons, Michael J; Toomey, Rosemary; Jacobson, Kristen C; Kremen, William S

    2012-03-01

    To examine how genes and environments contribute to relationships among Trail Making Test (TMT) conditions and the extent to which these conditions have unique genetic and environmental influences. Participants included 1,237 middle-aged male twins from the Vietnam Era Twin Study of Aging. The Delis-Kaplan Executive Function System TMT included visual searching, number and letter sequencing, and set-shifting components. Phenotypic correlations among TMT conditions ranged from 0.29 to 0.60, and genes accounted for the majority (58-84%) of each correlation. Overall heritability ranged from 0.34 to 0.62 across conditions. Phenotypic factor analysis suggested a single factor. In contrast, genetic models revealed a single common genetic factor but also unique genetic influences separate from the common factor. Genetic variance (i.e., heritability) of number and letter sequencing was completely explained by the common genetic factor while unique genetic influences separate from the common factor accounted for 57% and 21% of the heritabilities of visual search and set shifting, respectively. After accounting for general cognitive ability, unique genetic influences accounted for 64% and 31% of those heritabilities. A common genetic factor, most likely representing a combination of speed and sequencing, accounted for most of the correlation among TMT 1-4. Distinct genetic factors, however, accounted for a portion of variance in visual scanning and set shifting. Thus, although traditional phenotypic shared variance analysis techniques suggest only one general factor underlying different neuropsychological functions in nonpatient populations, examining the genetic underpinnings of cognitive processes with twin analysis can uncover more complex etiological processes.

  6. Study on recognition algorithm for paper currency numbers based on neural network

    NASA Astrophysics Data System (ADS)

    Li, Xiuyan; Liu, Tiegen; Li, Yuanyao; Zhang, Zhongchuan; Deng, Shichao

    2008-12-01

    Based on the unique characteristic, the paper currency numbers can be put into record and the automatic identification equipment for paper currency numbers is supplied to currency circulation market in order to provide convenience for financial sectors to trace the fiduciary circulation socially and provide effective supervision on paper currency. Simultaneously it is favorable for identifying forged notes, blacklisting the forged notes numbers and solving the major social problems, such as armor cash carrier robbery, money laundering. For the purpose of recognizing the paper currency numbers, a recognition algorithm based on neural network is presented in the paper. Number lines in original paper currency images can be draw out through image processing, such as image de-noising, skew correction, segmentation, and image normalization. According to the different characteristics between digits and letters in serial number, two kinds of classifiers are designed. With the characteristics of associative memory, optimization-compute and rapid convergence, the Discrete Hopfield Neural Network (DHNN) is utilized to recognize the letters; with the characteristics of simple structure, quick learning and global optimum, the Radial-Basis Function Neural Network (RBFNN) is adopted to identify the digits. Then the final recognition results are obtained by combining the two kinds of recognition results in regular sequence. Through the simulation tests, it is confirmed by simulation results that the recognition algorithm of combination of two kinds of recognition methods has such advantages as high recognition rate and faster recognition simultaneously, which is worthy of broad application prospect.

  7. The complete mitochondrial genome sequence of the maned wolf (Chrysocyon brachyurus).

    PubMed

    Zhao, Chao; Yang, Xiufeng; Zhang, Honghai; Zhang, Jin; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2016-01-01

    In this study, the complete mitochondrial genome of the maned wolf (Chrysocyon brachyurus), the unique species in Chrysocyon, was sequenced and reported for the first time using blood samples obtained from a female individual in Shanghai Zoo, China. Sequence analysis showed that the genome structure was in accordance with other Canidae species and it contained 12 S rRNA gene, 16 S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region.

  8. Targeting of Repeated Sequences Unique to a Gene Results in Significant Increases in Antisense Oligonucleotide Potency

    PubMed Central

    Vickers, Timothy A.; Freier, Susan M.; Bui, Huynh-Hoa; Watt, Andrew; Crooke, Stanley T.

    2014-01-01

    A new strategy for identifying potent RNase H-dependent antisense oligonucleotides (ASOs) is presented. Our analysis of the human transcriptome revealed that a significant proportion of genes contain unique repeated sequences of 16 or more nucleotides in length. Activities of ASOs targeting these repeated sites in several representative genes were compared to those of ASOs targeting unique single sites in the same transcript. Antisense activity at repeated sites was also evaluated in a highly controlled minigene system. Targeting both native and minigene repeat sites resulted in significant increases in potency as compared to targeting of non-repeated sites. The increased potency at these sites is a result of increased frequency of ASO/RNA interactions which, in turn, increases the probability of a productive interaction between the ASO/RNA heteroduplex and human RNase H1 in the cell. These results suggest a new, highly efficient strategy for rapid identification of highly potent ASOs. PMID:25334092

  9. Solexa-Sequencing Based Transcriptome Study of Plaice Skin Phenotype in Rex Rabbits (Oryctolagus cuniculus)

    PubMed Central

    Pan, Lei; Liu, Yan; Wei, Qiang; Xiao, Chenwen; Ji, Quanan; Bao, Guolian; Wu, Xinsheng

    2015-01-01

    Background Fur is an important genetically-determined characteristic of domestic rabbits; rabbit furs are of great economic value. We used the Solexa sequencing technology to assess gene expression in skin tissues from full-sib Rex rabbits of different phenotypes in order to explore the molecular mechanisms associated with fur determination. Methodology/Principal Findings Transcriptome analysis included de novo assembly, gene function identification, and gene function classification and enrichment. We obtained 74,032,912 and 71,126,891 short reads of 100 nt, which were assembled into 377,618 unique sequences by Trinity strategy (N50=680 nt). Based on BLAST results with known proteins, 50,228 sequences were identified at a cut-off E-value ≥ 10-5. Using Blast to Gene Ontology (GO), Clusters of Orthologous Groups (KOG) and Kyoto Encyclopedia of Genes and Genomes (KEGG), we obtained several genes with important protein functions. A total of 308 differentially expressed genes were obtained by transcriptome analysis of plaice and un-plaice phenotype animals; 209 additional differentially expressed genes were not found in any database. These genes included 49 that were only expressed in plaice skin rabbits. The novel genes may play important roles during skin growth and development. In addition, 99 known differentially expressed genes were assigned to PI3K-Akt signaling, focal adhesion, and ECM-receptor interactin, among others. Growth factors play a role in skin growth and development by regulating these signaling pathways. We confirmed the altered expression levels of seven target genes by qRT-PCR. And chosen a key gene for SNP to found the differentially between plaice and un-plaice phenotypes rabbit. Conclusions/Significance The rabbit transcriptome profiling data provide new insights in understanding the molecular mechanisms underlying rabbit skin growth and development. PMID:25955442

  10. Carboxy-terminal sequence variation of LMP1 gene in Epstein-Barr-virus-associated mononucleosis and tumors from Serbian patients.

    PubMed

    Banko, Ana; Lazarevic, Ivana; Cupic, Maja; Stevanovic, Goran; Boricic, Ivan; Jovanovic, Tanja

    2012-04-01

    Seven strains of Epstein-Barr virus (EBV) are defined based on C-terminal sequence variations of the latent membrane protein 1 (LMP1). Some strains, especially those with a 30-bp deletion, are thought to be related to tumorigenic activity and geographical localization. The aims of the study were to determine the prevalence of different LMP1 strains and to investigate sequence variation in the C-terminal region of LMP1 in Serbian isolates. This study included 53 EBV-DNA-positive plasma and tissue block samples from patients with mononucleosis syndrome, renal transplantation, and tumors, mostly nasopharyngeal carcinoma. The sequence of the 506-bp fragment of LMP1 C terminus was used for phylogenetic analyses and identification of LMP1 strains, deletions, and mutations. The majority of isolates were non-deleted (66%), and the rest had 30-bp, rare 69-bp, or yet unknown 27-bp deletions, which were not related to malignant or non-malignant isolate origin. However, the majority of 69-bp deletion isolates were derived from patients with nasopharyngeal carcinoma. Less than five 33-bp repeats were found in the majority of non-deleted isolates (68.6%), whereas most 69-bp deletion isolates (75%) had five or six repeats. Serbian isolates were assigned to four LMP1 strains: B95-8 (32.1%), China 1 (24.5%), North Carolina (NC; 18.9%), and Mediterranean (Med; 24.5%). In NC isolates, three new mutations unique for this strain were identified. EBV EBNA2 genotypes 1 and 2 were both found, with dominance of genotype 1 (90.7%). This study demonstrated noticeable geographical-associated characteristics in the LMP1 C terminus of investigated isolates. Copyright © 2012 Wiley Periodicals, Inc.

  11. Complete chloroplast genome sequences of Praxelis (Eupatorium catarium Veldkamp), an important invasive species.

    PubMed

    Zhang, Ying; Li, Lei; Yan, Ting Liang; Liu, Qiang

    2014-10-01

    Praxelis (Eupatorium catarium Veldkamp) is a new hazardous invasive plant species that has caused serious economic losses and environmental damage in the Northern hemisphere tropical and subtropical regions. Although previous studies focused on detecting the biological characteristics of this plant to prevent its expansion, little effort has been made to understand the impact of Praxelis on the ecosystem in an evolutionary process. The genetic information of Praxelis is required for further phylogenetic identification and evolutionary studies. Here, we report the complete Praxelis chloroplast (cp) genome sequence. The Praxelis chloroplast genome is 151,410 bp in length including a small single-copy region (18,547 bp) and a large single-copy region (85,311 bp) separated by a pair of inverted repeats (IRs; 23,776 bp). The genome contains 85 unique and 18 duplicated genes in the IR region. The gene content and organization are similar to other Asteraceae tribe cp genomes. We also analyzed the whole cp genome sequence, repeat structure, codon usage, contraction of the IR and gene structure/organization features between native and invasive Asteraceae plants, in order to understand the evolution of organelle genomes between native and invasive Asteraceae. Comparative analysis identified the 14 markers containing greater than 2% parsimony-informative characters, indicating that they are potential informative markers for barcoding and phylogenetic analysis. Moreover, a sister relationship between Praxelis and seven other species in Asteraceae was found based on phylogenetic analysis of 28 protein-coding sequences. Complete cp genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Single-cell RNA-sequencing reveals a distinct population of proglucagon-expressing cells specific to the mouse upper small intestine.

    PubMed

    Glass, Leslie L; Calero-Nieto, Fernando J; Jawaid, Wajid; Larraufie, Pierre; Kay, Richard G; Göttgens, Berthold; Reimann, Frank; Gribble, Fiona M

    2017-10-01

    To identify sub-populations of intestinal preproglucagon-expressing (PPG) cells producing Glucagon-like Peptide-1, and their associated expression profiles of sensory receptors, thereby enabling the discovery of therapeutic strategies that target these cell populations for the treatment of diabetes and obesity. We performed single cell RNA sequencing of PPG-cells purified by flow cytometry from the upper small intestine of 3 GLU-Venus mice. Cells from 2 mice were sequenced at low depth, and from the third mouse at high depth. High quality sequencing data from 234 PPG-cells were used to identify clusters by tSNE analysis. qPCR was performed to compare the longitudinal and crypt/villus locations of cluster-specific genes. Immunofluorescence and mass spectrometry were used to confirm protein expression. PPG-cells formed 3 major clusters: a group with typical characteristics of classical L-cells, including high expression of Gcg and Pyy (comprising 51% of all PPG-cells); a cell type overlapping with Gip-expressing K-cells (14%); and a unique cluster expressing Tph1 and Pzp that was predominantly located in proximal small intestine villi and co-produced 5-HT (35%). Expression of G-protein coupled receptors differed between clusters, suggesting the cell types are differentially regulated and would be differentially targetable. Our findings support the emerging concept that many enteroendocrine cell populations are highly overlapping, with individual cells producing a range of peptides previously assigned to distinct cell types. Different receptor expression profiles across the clusters highlight potential drug targets to increase gut hormone secretion for the treatment of diabetes and obesity. Copyright © 2017 The Authors. Published by Elsevier GmbH.. All rights reserved.

  13. Multidrug Resistant Pseudomonas aeruginosa Causing Prosthetic Valve Endocarditis: A Genetic-Based Chronicle of Evolving Antibiotic Resistance.

    PubMed

    Domitrovic, T Nicholas; Hujer, Andrea M; Perez, Federico; Marshall, Steven H; Hujer, Kristine M; Woc-Colburn, Laila E; Parta, Mark; Bonomo, Robert A

    2016-10-01

    Background.  Successful treatment of infections caused by multidrug-resistant (MDR) Pseudomonas aeruginosa is thwarted by the emergence of antibiotic resistance and biofilm formation on prosthetic devices. Our aims were to decipher the molecular basis of resistance in a unique case of prosthetic valve endocarditis (PVE) caused by MDR P. aeruginosa . Methods.  Five sequential MDR P. aeruginosa blood isolates collected during a 7-month period were recovered from a patient suffering from PVE previously exposed to β-lactam antibiotics. Minimum inhibitory concentrations (MICs) of several classes of antibiotics were used to indicate clinical resistance characteristics; relatedness of the isolates was determined using multilocus sequence typing and repetitive sequence-based polymerase chain reaction. Amplification and sequencing of regulatory and resistance genes was performed. Results.  All isolates belonged to ST 298, possessed bla PDC-16 , and were resistant to fluoroquinolones and carbapenems. In the course of therapy, we observed a >2-fold increase in cephalosporin resistance (4 µg/mL to >16 µg/mL). Sequencing of the AmpC regulator, amp R, revealed a D135N point mutation in cephalosporin-resistant isolates. Common carbapenemase genes were not identified. All isolates demonstrated a premature stop codon at amino acid 79 of the outer membrane protein OprD and mutations in the quinolone resistance-determining regions of gyr A and par C. Point mutations in nal C, an efflux pump regulator, were also observed. Conclusions.  In this analysis, we chart the molecular evolution of β-lactam resistance in a case of PVE. We show that mutations in regulatory genes controlling efflux and cephalosporinase production contributed to the MDR phenotype.

  14. Isolation, genome sequencing and functional analysis of two T7-like coliphages of avian pathogenic Escherichia coli.

    PubMed

    Chen, Mianmian; Xu, Juntian; Yao, Huochun; Lu, Chengping; Zhang, Wei

    2016-05-10

    Avian pathogenic Escherichia coli (APEC) causes colibacillosis, which results in significant economic losses to the poultry industry worldwide. Due to the drug residues and increased antibiotic resistance caused by antibiotic use, bacteriophages and other alternative therapeutic agents are expected to control APEC infection in poultry. Two APEC phages, named P483 and P694, were isolated from the feces from the farmers market in China. We then studied their biological properties, and carried out high-throughput genome sequencing and homology analyses of these phages. Assembly results of high-throughput sequencing showed that the structures of both P483 and P694 genomes consist of linear and double-stranded DNA. Results of the electron microscopy and homology analysis revealed that both P483 and P694 belong to T7-like virus which is a member of the Podoviridae family of the Caudovirales order. Comparative genomic analysis showed that most of the predicted proteins of these two phages showed strongest sequence similarity to the Enterobacteria phages BA14 and 285P, Erwinia phage FE44, and Kluyvera phage Kvp1; however, some proteins such as gp0.6a, gp1.7 and gp17 showed lower similarity (<85%) with the homologs of other phages in the T7 subgroup. We also found some unique characteristics of P483 and P694, such as the two types of the genes of P694 and no lytic activity of P694 against its host bacteria in liquid medium. Our results serve to further our understanding of phage evolution of T7-like coliphages and provide the potential application of the phages as therapeutic agents for the treatment of diseases. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    PubMed

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.

  16. Genomic insights from whole genome sequencing of four clonal outbreak Campylobacter jejuni assessed within the global C. jejuni population.

    PubMed

    Clark, Clifford G; Berry, Chrystal; Walker, Matthew; Petkau, Aaron; Barker, Dillon O R; Guan, Cai; Reimer, Aleisha; Taboada, Eduardo N

    2016-12-03

    Whole genome sequencing (WGS) is useful for determining clusters of human cases, investigating outbreaks, and defining the population genetics of bacteria. It also provides information about other aspects of bacterial biology, including classical typing results, virulence, and adaptive strategies of the organism. Cell culture invasion and protein expression patterns of four related multilocus sequence type 21 (ST21) C. jejuni isolates from a significant Canadian water-borne outbreak were previously associated with the presence of a CJIE1 prophage. Whole genome sequencing was used to examine the genetic diversity among these isolates and confirm that previous observations could be attributed to differential prophage carriage. Moreover, we sought to determine the presence of genome sequences that could be used as surrogate markers to delineate outbreak-associated isolates. Differential carriage of the CJIE1 prophage was identified as the major genetic difference among the four outbreak isolates. High quality single-nucleotide variant (hqSNV) and core genome multilocus sequence typing (cgMLST) clustered these isolates within expanded datasets consisting of additional C. jejuni strains. The number and location of homopolymeric tract regions was identical in all four outbreak isolates but differed from all other C. jejuni examined. Comparative genomics and PCR amplification enabled the identification of large chromosomal inversions of approximately 93 kb and 388 kb within the outbreak isolates associated with transducer-like proteins containing long nucleotide repeat sequences. The 93-kb inversion was characteristic of the outbreak-associated isolates, and the gene content of this inverted region displayed high synteny with the reference strain. The four outbreak isolates were clonally derived and differed mainly in the presence of the CJIE1 prophage, validating earlier findings linking the prophage to phenotypic differences in virulence assays and protein expression. The identification of large, genetically syntenous chromosomal inversions in the genomes of outbreak-associated isolates provided a unique method for discriminating outbreak isolates from the background population. Transducer-like proteins appear to be associated with the chromosomal inversions. CgMLST and hqSNV analysis also effectively delineated the outbreak isolates within the larger C. jejuni population structure.

  17. RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

    PubMed

    Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

    2012-01-01

    RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.

  18. A genomic comparison of two termites with different social complexity.

    PubMed

    Korb, Judith; Poulsen, Michael; Hu, Haofu; Li, Cai; Boomsma, Jacobus J; Zhang, Guojie; Liebig, Jürgen

    2015-01-01

    The termites evolved eusociality and complex societies before the ants, but have been studied much less. The recent publication of the first two termite genomes provides a unique comparative opportunity, particularly because the sequenced termites represent opposite ends of the social complexity spectrum. Zootermopsis nevadensis has simple colonies with totipotent workers that can develop into all castes (dispersing reproductives, nest-inheriting replacement reproductives, and soldiers). In contrast, the fungus-growing termite Macrotermes natalensis belongs to the higher termites and has very large and complex societies with morphologically distinct castes that are life-time sterile. Here we compare key characteristics of genomic architecture, focusing on genes involved in communication, immune defenses, mating biology and symbiosis that were likely important in termite social evolution. We discuss these in relation to what is known about these genes in the ants and outline hypothesis for further testing.

  19. Dynamic Tunneling Junctions at the Atomic Intersection of Two Twisted Graphene Edges.

    PubMed

    Bellunato, Amedeo; Vrbica, Sasha D; Sabater, Carlos; de Vos, Erik W; Fermin, Remko; Kanneworff, Kirsten N; Galli, Federica; van Ruitenbeek, Jan M; Schneider, Grégory F

    2018-04-11

    The investigation of the transport properties of single molecules by flowing tunneling currents across extremely narrow gaps is relevant for challenges as diverse as the development of molecular electronics and sequencing of DNA. The achievement of well-defined electrode architectures remains a technical challenge, especially due to the necessity of high precision fabrication processes and the chemical instability of most bulk metals. Here, we illustrate a continuously adjustable tunneling junction between the edges of two twisted graphene sheets. The unique property of the graphene electrodes is that the sheets are rigidly supported all the way to the atomic edge. By analyzing the tunneling current characteristics, we also demonstrate that the spacing across the gap junction can be controllably adjusted. Finally, we demonstrate the transition from the tunneling regime to contact and the formation of an atomic-sized junction between the two edges of graphene.

  20. Acid extraction and purification of recombinant spider silk proteins.

    PubMed

    Mello, Charlene M; Soares, Jason W; Arcidiacono, Steven; Butler, Michelle M

    2004-01-01

    A procedure has been developed for the isolation of recombinant spider silk proteins based upon their unique stability and solubilization characteristics. Three recombinant silk proteins, (SpI)7, NcDS, and [(SpI)4/(SpII)1]4, were purified by extraction with organic acids followed by affinity or ion exchange chromatography resulting in 90-95% pure silk solutions. The protein yield of NcDS (15 mg/L culture) and (SpI)7 (35 mg/L) increased 4- and 5-fold, respectively, from previously reported values presumably due to a more complete solubilization of the expressed recombinant protein. [(SpI)4/(SpII)1]4, a hybrid protein based on the repeat sequences of spidroin I and spidroin II, had a yield of 12.4 mg/L. This method is an effective, reproducible technique that has broad applicability for a variety of silk proteins as well as other acid stable biopolymers.

  1. Epigenetics: the language of the cell?

    PubMed

    Huang, Biao; Jiang, Cizhong; Zhang, Rongxin

    2014-02-01

    Epigenetics is one of the most rapidly developing fields of biological research. Breakthroughs in several technologies have enabled the possibility of genome-wide epigenetic research, for example the mapping of human genome-wide DNA methylation. In addition, with the development of various high-throughput and high-resolution sequencing technologies, a large number of functional noncoding RNAs have been identified. Massive studies indicated that these functional ncRNA also play an important role in epigenetics. In this review, we gain inspiration from the recent proposal of the ceRNAs hypothesis. This hypothesis proposes that miRNAs act as a language of communication. Accordingly, we further deduce that all of epigenetics may functionally acquire such a unique language characteristic. In summary, various epigenetic markers may not only participate in regulating cellular processes, but they may also act as the intracellular 'language' of communication and are involved in extensive information exchanges within cell.

  2. Detection of isotype switch rearrangement in bulk culture by PCR.

    PubMed

    Max, E E; Mills, F C; Chu, C

    2001-05-01

    When a B lymphocyte changes from synthesizing IgM to synthesizing IgG, IgA, or IgE, this isotype switch is generally accompanied by a unique DNA rearrangement. The protocols in this unit describe two polymerase chain reaction (PCR)-based strategies for detecting switch rearrangements in bulk culture. The first involves direct PCR across the switch junctions, providing the opportunity for characterizing the recombination products by nucleotide sequence analysis; however, because of characteristics inherent to the PCR methodology this strategy cannot easily be used as a quantitative assay for recombination. A support protocol details the preparation of the 5' Su PCR probe for this protocol. The second basic protocol describes a method known as digestion-circularization PCR (DCPCR) that is more amenable to quantitation but yields no information on structure of the recombination products. Both techniques should be capable of detecting reciprocal deletion circles as well as functional recombination products remaining on the expressed chromosome.

  3. A genomic comparison of two termites with different social complexity

    PubMed Central

    Korb, Judith; Poulsen, Michael; Hu, Haofu; Li, Cai; Boomsma, Jacobus J.; Zhang, Guojie; Liebig, Jürgen

    2015-01-01

    The termites evolved eusociality and complex societies before the ants, but have been studied much less. The recent publication of the first two termite genomes provides a unique comparative opportunity, particularly because the sequenced termites represent opposite ends of the social complexity spectrum. Zootermopsis nevadensis has simple colonies with totipotent workers that can develop into all castes (dispersing reproductives, nest-inheriting replacement reproductives, and soldiers). In contrast, the fungus-growing termite Macrotermes natalensis belongs to the higher termites and has very large and complex societies with morphologically distinct castes that are life-time sterile. Here we compare key characteristics of genomic architecture, focusing on genes involved in communication, immune defenses, mating biology and symbiosis that were likely important in termite social evolution. We discuss these in relation to what is known about these genes in the ants and outline hypothesis for further testing. PMID:25788900

  4. Streptomyces species: Ideal chassis for natural product discovery and overproduction.

    PubMed

    Liu, Ran; Deng, Zixin; Liu, Tiangang

    2018-05-28

    There is considerable interest in mining organisms for new natural products (NPs) and in improving methods to overproduce valuable NPs. Because of the rapid development of tools and strategies for metabolic engineering and the markedly increased knowledge of the biosynthetic pathways and genetics of NP-producing organisms, genome mining and overproduction of NPs can be dramatically accelerated. In particular, Streptomyces species have been proposed as suitable chassis organisms for NP discovery and overproduction because of their many unique characteristics not shared with yeast, Escherichia coli, or other microorganisms. In this review, we summarize the methods for genome sequencing, gene cluster prediction, and gene editing in Streptomyces, as well as metabolic engineering strategies for NP overproduction and approaches for generating new products. Finally, two strategies for utilizing Streptomyces as the chassis for NP discovery and overproduction are emphasized. Copyright © 2018 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.

  5. Exon–intron organization of genes in the slime mold Physarum polycephalum

    PubMed Central

    Trzcinska-Danielewicz, Joanna; Fronk, Jan

    2000-01-01

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858

  6. SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.

    PubMed

    Trzcinska-Danielewicz, J; Fronk, J

    2000-09-15

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.

  7. Morphological and molecular characteristics of a new species of Pasteuria parasitic on Meloidogyne ardenensis.

    PubMed

    Bishop, Alistair H; Gowen, Simon R; Pembroke, Barbara; Trotter, James R

    2007-09-01

    A species of the hyper-parasitic bacterium Pasteuria was isolated from the root-knot nematode Meloidogyne ardenensis infecting the roots of ash (Fraxinus excelsior). It is morphologically different from some other Pasteuria pathogens of nematodes in that the spores lack a basal ring on the ventral side of the spore and have a unique clumping nature. Transmission electron microscopy (TEM) showed that the clumps of spores are not random aggregates but result from the disintegration of the suicide cells of the thalli. Sporulation within each vegetative mycelium was shown to be asynchronous. In addition to the novel morphological features 16S rRNA sequence analysis showed this to be a new species of Pasteuria which we have called P. hartismeri. Spores of P. hartismeri attach to juveniles of root-knot nematodes infecting a wide range of plants such as mint (Meloidogyne hapla), rye grass (unidentified Meloidogyne sp.) and potato (Meloidogyne fallax).

  8. Structural insight into the mechanism of synergistic autoinhibition of SAD kinases

    PubMed Central

    Wu, Jing-Xiang; Cheng, Yun-Sheng; Wang, Jue; Chen, Lei; Ding, Mei; Wu, Jia-Wei

    2015-01-01

    The SAD/BRSK kinases participate in various important life processes, including neural development, cell cycle and energy metabolism. Like other members of the AMPK family, SAD contains an N-terminal kinase domain followed by the characteristic UBA and KA1 domains. Here we identify a unique autoinhibitory sequence (AIS) in SAD kinases, which exerts autoregulation in cooperation with UBA. Structural studies of mouse SAD-A revealed that UBA binds to the kinase domain in a distinct mode and, more importantly, AIS nestles specifically into the KD-UBA junction. The cooperative action of AIS and UBA results in an ‘αC-out' inactive kinase, which is conserved across species and essential for presynaptic vesicle clustering in C. elegans. In addition, the AIS, along with the KA1 domain, is indispensable for phospholipid binding. Taken together, these data suggest a model for synergistic autoinhibition and membrane activation of SAD kinases. PMID:26626945

  9. Structural insight into the mechanism of synergistic autoinhibition of SAD kinases.

    PubMed

    Wu, Jing-Xiang; Cheng, Yun-Sheng; Wang, Jue; Chen, Lei; Ding, Mei; Wu, Jia-Wei

    2015-12-02

    The SAD/BRSK kinases participate in various important life processes, including neural development, cell cycle and energy metabolism. Like other members of the AMPK family, SAD contains an N-terminal kinase domain followed by the characteristic UBA and KA1 domains. Here we identify a unique autoinhibitory sequence (AIS) in SAD kinases, which exerts autoregulation in cooperation with UBA. Structural studies of mouse SAD-A revealed that UBA binds to the kinase domain in a distinct mode and, more importantly, AIS nestles specifically into the KD-UBA junction. The cooperative action of AIS and UBA results in an 'αC-out' inactive kinase, which is conserved across species and essential for presynaptic vesicle clustering in C. elegans. In addition, the AIS, along with the KA1 domain, is indispensable for phospholipid binding. Taken together, these data suggest a model for synergistic autoinhibition and membrane activation of SAD kinases.

  10. Genetic variation in Mycobacterium tuberculosis isolates from a London outbreak associated with isoniazid resistance.

    PubMed

    Satta, Giovanni; Witney, Adam A; Shorten, Robert J; Karlikowska, Magdalena; Lipman, Marc; McHugh, Timothy D

    2016-08-16

    The largest outbreak of isoniazid-resistant (INH-R) Mycobacterium tuberculosis in Western Europe is centred in North London, with over 400 cases diagnosed since 1995. In the current study, we evaluated the genetic variation in a subset of clinical samples from the outbreak with the hypothesis that these isolates have unique biological characteristics that have served to prolong the outbreak. Fitness assays, mutation rate estimation, and whole-genome sequencing were performed to test for selective advantage and compensatory mutations. This detailed analysis of the genetic variation of these INH-R samples suggests that this outbreak consists of successful, closely related, circulating strains with heterogeneous resistance profiles and little or no associated fitness cost or impact on their mutation rate. Specific deletions and SNPs could be a peculiar feature of these INH-R M. tuberculosis isolates, and could potentially explain their persistence over the years.

  11. Dynamic Tunneling Junctions at the Atomic Intersection of Two Twisted Graphene Edges

    PubMed Central

    2018-01-01

    The investigation of the transport properties of single molecules by flowing tunneling currents across extremely narrow gaps is relevant for challenges as diverse as the development of molecular electronics and sequencing of DNA. The achievement of well-defined electrode architectures remains a technical challenge, especially due to the necessity of high precision fabrication processes and the chemical instability of most bulk metals. Here, we illustrate a continuously adjustable tunneling junction between the edges of two twisted graphene sheets. The unique property of the graphene electrodes is that the sheets are rigidly supported all the way to the atomic edge. By analyzing the tunneling current characteristics, we also demonstrate that the spacing across the gap junction can be controllably adjusted. Finally, we demonstrate the transition from the tunneling regime to contact and the formation of an atomic-sized junction between the two edges of graphene. PMID:29513997

  12. Lesion bypass activity of DNA polymerase θ (POLQ) is an intrinsic property of the pol domain and depends on unique sequence inserts.

    PubMed

    Hogg, Matthew; Seki, Mineaki; Wood, Richard D; Doublié, Sylvie; Wallace, Susan S

    2011-01-21

    DNA polymerase θ (POLQ, polθ) is a large, multidomain DNA polymerase encoded in higher eukaryotic genomes. It is important for maintaining genetic stability in cells and helping protect cells from DNA damage caused by ionizing radiation. POLQ contains an N-terminal helicase-like domain, a large central domain of indeterminate function, and a C-terminal polymerase domain with sequence similarity to the A-family of DNA polymerases. The enzyme has several unique properties, including low fidelity and the ability to insert and extend past abasic sites and thymine glycol lesions. It is not known whether the abasic site bypass activity is an intrinsic property of the polymerase domain or whether helicase activity is also required. Three "insertion" sequence elements present in POLQ are not found in any other A-family DNA polymerase, and it has been proposed that they may lend some unique properties to POLQ. Here, we analyzed the activity of the DNA polymerase in the absence of each sequence insertion. We found that the pol domain is capable of highly efficient bypass of abasic sites in the absence of the helicase-like or central domains. Insertion 1 increases the processivity of the polymerase but has little, if any, bearing on the translesion synthesis properties of the enzyme. However, removal of insertions 2 and 3 reduces activity on undamaged DNA and completely abrogates the ability of the enzyme to bypass abasic sites or thymine glycol lesions. Copyright © 2010 Elsevier Ltd. All rights reserved.

  13. Emergence and Evolution of Hominidae-Specific Coding and Noncoding Genomic Sequences.

    PubMed

    Saber, Morteza Mahmoudi; Adeyemi Babarinde, Isaac; Hettiarachchi, Nilmini; Saitou, Naruya

    2016-07-12

    Family Hominidae, which includes humans and great apes, is recognized for unique complex social behavior and intellectual abilities. Despite the increasing genome data, however, the genomic origin of its phenotypic uniqueness has remained elusive. Clade-specific genes and highly conserved noncoding sequences (HCNSs) are among the high-potential evolutionary candidates involved in driving clade-specific characters and phenotypes. On this premise, we analyzed whole genome sequences along with gene orthology data retrieved from major DNA databases to find Hominidae-specific (HS) genes and HCNSs. We discovered that Down syndrome critical region 4 (DSCR4) is the only experimentally verified gene uniquely present in Hominidae. DSCR4 has no structural homology to any known protein and was inferred to have emerged in several steps through LTR/ERV1, LTR/ERVL retrotransposition, and transversion. Using the genomic distance as neutral evolution threshold, we identified 1,658 HS HCNSs. Polymorphism coverage and derived allele frequency analysis of HS HCNSs showed that these HCNSs are under purifying selection, indicating that they may harbor important functions. They are overrepresented in promoters/untranslated regions, in close proximity of genes involved in sensory perception of sound and developmental process, and also showed a significantly lower nucleosome occupancy probability. Interestingly, many ancestral sequences of the HS HCNSs showed very high evolutionary rates. This suggests that new functions emerged through some kind of positive selection, and then purifying selection started to operate to keep these functions. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Microsporidia: emerging pathogenic protists.

    PubMed

    Weiss, L M

    2001-02-23

    Microsporidia are eukaryotic spore forming obligate intracellular protozoan parasites first recognized over 100 years ago. These organisms infect all of the major animal groups and are now recognized as opportunistic pathogens of humans. Microsporidian spores are common in the environment and microsporidia pathogenic to humans have been found in water supplies. The genera Nosema, Vittaforma, Brachiola, Pleistophora, Encephalitozoon, Enterocytozoon, Septata (reclassified to Encephalitozoon) and Trachipleistophora have been found in human infections. These organisms have the smallest known eukaryotic genomes. Microsporidian ribosomal RNA sequences have proven useful as diagnostic tools as well as for phylogenetic analysis. Recent phylogenetic analysis suggests that Microsporidia are related to the fungi. These organisms are defined by the presence of a unique invasion organelle consisting of a single polar tube that coils around the interior of the spore. All microsporidia exhibit the same response to stimuli, that is, the polar tube discharges from the anterior pole of the spore in an explosive reaction. If the polar tube is discharged next to a cell, it can pierce the cell and transfer its sporoplasm into the cell. A technique was developed for the purification of polar tube proteins (PTPs) using differential extraction followed by reverse phase HPLC. This method was used to purify the PTPs from Glugea americanus, Encephalitozoon cuniculi, Enc. hellem and Enc. intestinalis. These PTPs demonstrate conserved characteristics such as solubility, hydrophobicity, mass, proline content and immunologic epitopes. The major PTP gene from Enc. cuniculi and Enc. hellem has been cloned and expressed in vitro. The gene sequences support the importance of ER and in the formation of the polar tube as suggested by morphologic studies. Analysis of the cloned proteins also indicates that secondary structural characteristics are conserved. These characteristics are probably important in the function of this protein during the eversion/assembly of the polar tube and in providing elasticity and resiliency for sporoplasm passage.

  15. Evidence for Interhemispheric Coupling during the Unusual Northern Polar Summer Mesosphere of 2002

    NASA Technical Reports Server (NTRS)

    Goldberg, Richard A.; Feofilov, Artem; Kutepov, Alexandr; Schmidlin, Francis J.; Russell, James M.

    2009-01-01

    Data from the MaCWAVE MIDAS Rocket Program launched during July, 2002, from Andoya Rocket Range (ARR) in Norway have demonstrated that the temperature structure of the summer polar mesosphere during this period was atypical, at least above ARR. The summer polar mesopause region was warmer than normal and of shorter duration than for other years analyzed. Theoretical studies have since been published that imply that the abnormal characteristics of this polar summer were generated by unusual dynamical processes occurring in the southern polar winter hemisphere. We have used data from the SABER instrument aboard the NASA TIMED Satellite to study these characteristics on a global scale and compare them with the features observed in the ensuing seven years. For background, The TIMED Satellite was launched on December 7, 2001 to study the dynamics and energy of the mesosphere and lower thermosphere (MLT). The SABER instrument on TIMED is a limb scanning infrared radiometer designed to measure a large number of minor constituents as well as temperature of the MLT. In this study, we have investigated the temperature characteristics of the polar mesosphere as a function of spatial and temporal considerations. We have used the most recent SABER dataset (1.07) that includes the improved temperature retrievals in Earth polar regions, Weekly averages were used 10 make the comparisons between the winter and summer hemispheres. The unusually short polar summer in the northern hemisphere during 2002 is clearly defined by this analysis and is shown to be unique for the 7 years analyzed. Furthermore, the data analysis agrees with recent theoretical studies showing that this behavior is a result of anomalous heating events in the southern polar stratosphere. The time sequence of the coupling process, as predicted by recent theoretical models, is well defined in a sequence of weekly temperature contour maps measured by SABER.

  16. Unique mutation portraits and frequent COL2A1 gene alteration in chondrosarcoma

    PubMed Central

    Totoki, Yasushi; Yoshida, Akihiko; Hosoda, Fumie; Nakamura, Hiromi; Hama, Natsuko; Ogura, Koichi; Yoshida, Aki; Fujiwara, Tomohiro; Arai, Yasuhito; Toguchida, Junya; Tsuda, Hitoshi; Miyano, Satoru; Kawai, Akira

    2014-01-01

    Chondrosarcoma is the second most frequent malignant bone tumor. However, the etiological background of chondrosarcomagenesis remains largely unknown, along with details on molecular alterations and potential therapeutic targets. Massively parallel paired-end sequencing of whole genomes of 10 primary chondrosarcomas revealed that the process of accumulation of somatic mutations is homogeneous irrespective of the pathological subtype or the presence of IDH1 mutations, is unique among a range of cancer types, and shares significant commonalities with that of prostate cancer. Clusters of structural alterations localized within a single chromosome were observed in four cases. Combined with targeted resequencing of additional cartilaginous tumor cohorts, we identified somatic alterations of the COL2A1 gene, which encodes an essential extracellular matrix protein in chondroskeletal development, in 19.3% of chondrosarcoma and 31.7% of enchondroma cases. Epigenetic regulators (IDH1 and YEATS2) and an activin/BMP signal component (ACVR2A) were recurrently altered. Furthermore, a novel FN1-ACVR2A fusion transcript was observed in both chondrosarcoma and osteochondromatosis cases. With the characteristic accumulative process of somatic changes as a background, molecular defects in chondrogenesis and aberrant epigenetic control are primarily causative of both benign and malignant cartilaginous tumors. PMID:25024164

  17. Improving Science Communication and Engaging the Public in Astronomy and Nature

    NASA Astrophysics Data System (ADS)

    Arion, Douglas N.

    2016-01-01

    A partnershipship between Carthage College and the Appalachian Mountain Club has delivered a successful public education and outreach program that merges natural environment topics and astronomy. Over the four years of activity, over 25,000 people have received programming. The effort has trained nature educators, permanent and seasonal AMC staff, and undergraduate physics and astronomy students to integrate diverse topical material and deliver high quality programming to the lay public. Unique to the program is the holistic nature of the material delivered - an 'atypical' astronomy program. Linking observable characteristics of the natural world with astronomical history and phenomena, and emphasizing the unique sequence of events that have led to human life on Earth, the program has changed attitudes and behaviors among the public participants. Successful interventions have included hands-on observing programs (day and night) that link nature content to the observed objects; table-talk presentations on nature/astronomy topics; dark skies preservation workshops; and hands-on activities developed for younger audiences, including schools, camps, and family groups. An extensive evaluation and assessment effort managed by a leading sociologist has demonstrated the effectiveness of the approach, and contributed to continuous improvement in the program content and methods. This work was supported in part by NSF Grant 1432662.

  18. Transcriptional transitions in Alphonso mango (Mangifera indica L.) during fruit development and ripening explain its distinct aroma and shelf life characteristics.

    PubMed

    Deshpande, Ashish B; Anamika, Krishanpal; Jha, Vineet; Chidley, Hemangi G; Oak, Pranjali S; Kadoo, Narendra Y; Pujari, Keshav H; Giri, Ashok P; Gupta, Vidya S

    2017-08-18

    Alphonso is known as the "King of mangos" due to its unique flavor, attractive color, low fiber pulp and long shelf life. We analyzed the transcriptome of Alphonso mango through Illumina sequencing from seven stages of fruit development and ripening as well as flower. Total transcriptome data from these stages ranged between 65 and 143 Mb. Importantly, 20,755 unique transcripts were annotated and 4,611 were assigned enzyme commission numbers, which encoded 142 biological pathways. These included ethylene and flavor related secondary metabolite biosynthesis pathways, as well as those involved in metabolism of starch, sucrose, amino acids and fatty acids. Differential regulation (p-value ≤ 0.05) of thousands of transcripts was evident in various stages of fruit development and ripening. Novel transcripts for biosynthesis of mono-terpenes, sesqui-terpenes, di-terpenes, lactones and furanones involved in flavor formation were identified. Large number of transcripts encoding cell wall modifying enzymes was found to be steady in their expression, while few were differentially regulated through these stages. Novel 79 transcripts of inhibitors of cell wall modifying enzymes were simultaneously detected throughout Alphonso fruit development and ripening, suggesting controlled activity of these enzymes involved in fruit softening.

  19. Candida glabrata's Genome Plasticity Confers a Unique Pattern of Expressed Cell Wall Proteins.

    PubMed

    López-Fuentes, Eunice; Gutiérrez-Escobedo, Guadalupe; Timmermans, Bea; Van Dijck, Patrick; De Las Peñas, Alejandro; Castaño, Irene

    2018-06-05

    Candida glabrata is the second most common cause of candidemia, and its ability to adhere to different host cell types, to microorganisms, and to medical devices are important virulence factors. Here, we consider three characteristics that confer extraordinary advantages to C. glabrata within the host. (1) C. glabrata has a large number of genes encoding for adhesins most of which are localized at subtelomeric regions. The number and sequence of these genes varies substantially depending on the strain, indicating that C. glabrata can tolerate high genomic plasticity; (2) The largest family of CWPs (cell wall proteins) is the EPA (epithelial adhesin) family of adhesins. Epa1 is the major adhesin and mediates adherence to epithelial, endothelial and immune cells. Several layers of regulation like subtelomeric silencing, cis- acting regulatory regions, activators, nutritional signaling, and stress conditions tightly regulate the expression of many adhesin-encoding genes in C. glabrata , while many others are not expressed. Importantly, there is a connection between acquired resistance to xenobiotics and increased adherence; (3) Other subfamilies of adhesins mediate adherence to Candida albicans , allowing C. glabrata to efficiently invade the oral epithelium and form robust biofilms. It is noteworthy that every C. glabrata strain analyzed presents a unique pattern of CWPs at the cell surface.

  20. Study of lubricant circulation in HVAC systems. Volume 1: Description of technical effort and results; Final technical report, March 1995--April 1996

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Biancardi, F.R.; Michels, H.H.; Sienel, T.H.

    1996-10-01

    The purpose of this program was to conduct experimental and analytical efforts to determine lubricant circulation characteristics of new HFC/POE pairs and HFC/mineral oil pairs in a representative central residential HVAC system and to compare their behavior with the traditional HCFC-22/mineral oil (refrigerant/lubricant) pair. A dynamic test facility was designed and built to conduct the experimental efforts. This facility provided a unique capability to visually and physically measure oil circulation rates, on-line, in operating systems. A unique on-line ultraviolet-based measurement device was used to obtain detailed data on the rate and level of lubricant oil circulated within the operating heatmore » pump system. The experimental and analytical data developed during the program are presented as a function of vapor velocity, refrigerant/lubricant viscosity, system features and equipment. Both visual observations and instrumentation were used to understand ``worst case`` oil circulation situations. This report is presented in two volumes. Volume 1 contains a complete description of the program scope, objective, test results summary, conclusions, description of test facility and recommendations for future effort. Volume 2 contains all of the program test data essentially as taken from the laboratory dynamic test facility during the sequence of runs.« less

Top